Using multi-modal 3D contours and their relations for vision and robotics


BAŞESKİ E., Pugeault N., Kalkan S., BODENHAGEN L., Piater J. H., KRÜGER N.

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, vol.21, pp.850-864, 2010 (Journal Indexed in SCI)

  • Volume: 21, Issue: 8
  • Publication Date: 2010
  • DOI: 10.1016/j.jvcir.2010.06.006
  • Journal Name: JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION
  • Pages: pp.850-864

Abstract

In this work, we make use of 3D contours and relations between them (namely, coplanarity, cocolority, distance and angle) for four different applications in the area of computer vision and vision-based robotics. Our multi-modal contour representation covers both geometric and appearance information. We show the potential of reasoning with global entities in the context of visual scene analysis for driver assistance, depth prediction, robotic grasping and grasp learning. We argue that such 3D global reasoning processes complement widely-used 2D local approaches such as bag-of-features, since 3D relations are invariant under camera transformations and 3D information can be directly linked to actions. We therefore stress the necessity of including both global and local features with different spatial dimensions within a representation. We also discuss the importance of an efficient use of the uncertainty associated with the features, relations, and their applicability in a given context. (c) 2010 Elsevier Inc. All rights reserved.