Learning Dense Correspondences for Video Objects

Wen Chi Chin, Zih Jian Jhang, Hwann Tzong Chen, Koichi Ito

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We introduce a learning based method for extracting distinctive features on video objects. Based on the extracted features, we are able to derive dense correspondences between the object in the current video frame and the reference template, and then use the correspondences to identify the grasping points on the object. We train a deep-learning model to predict dense feature maps using the training data collected via solving simultaneous localization and mapping (SLAM). Further, a new feature-aggregation technique based on the optical flow of consecutive frames is applied to the integration of multiple feature maps for alleviating uncertainties. We also use the optical flow information to assess the reliability of feature matching. The experimental results show that our approach effectively reduces unreliable correspondences and thus improves the matching accuracy.

Original languageEnglish
Title of host publication2019 IEEE International Conference on Image Processing, ICIP 2019 - Proceedings
PublisherIEEE Computer Society
Pages1297-1301
Number of pages5
ISBN (Electronic)9781538662496
DOIs
Publication statusPublished - 2019 Sep
Event26th IEEE International Conference on Image Processing, ICIP 2019 - Taipei, Taiwan, Province of China
Duration: 2019 Sep 222019 Sep 25

Publication series

NameProceedings - International Conference on Image Processing, ICIP
Volume2019-September
ISSN (Print)1522-4880

Conference

Conference26th IEEE International Conference on Image Processing, ICIP 2019
CountryTaiwan, Province of China
CityTaipei
Period19/9/2219/9/25

Keywords

  • dense correspondence
  • feature map aggregation
  • optical flow
  • visual descriptor
  • visual descriptor

ASJC Scopus subject areas

  • Software
  • Computer Vision and Pattern Recognition
  • Signal Processing

Fingerprint Dive into the research topics of 'Learning Dense Correspondences for Video Objects'. Together they form a unique fingerprint.

  • Cite this

    Chin, W. C., Jhang, Z. J., Chen, H. T., & Ito, K. (2019). Learning Dense Correspondences for Video Objects. In 2019 IEEE International Conference on Image Processing, ICIP 2019 - Proceedings (pp. 1297-1301). [8803399] (Proceedings - International Conference on Image Processing, ICIP; Vol. 2019-September). IEEE Computer Society. https://doi.org/10.1109/ICIP.2019.8803399