Using multi-modal 3D contours and their relations for vision and robotics

BAŞESKİ, Emre; Pugeault, Nicolas; Kalkan, SİNAN; BODENHAGEN, Leon; Piater, Justus; KRÜGER, Norbert

doi:10.1016/j.jvcir.2010.06.006

Using multi-modal 3D contours and their relations for vision and robotics

BAŞESKİ E., Pugeault N., Kalkan S., BODENHAGEN L., Piater J. H., KRÜGER N.

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, cilt.21, sa.8, ss.850-864, 2010 (SCI-Expanded, Scopus)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 21 Sayı: 8
Basım Tarihi: 2010
Doi Numarası: 10.1016/j.jvcir.2010.06.006
Dergi Adı: JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Sayfa Sayıları: ss.850-864
Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

In this work, we make use of 3D contours and relations between them (namely, coplanarity, cocolority, distance and angle) for four different applications in the area of computer vision and vision-based robotics. Our multi-modal contour representation covers both geometric and appearance information. We show the potential of reasoning with global entities in the context of visual scene analysis for driver assistance, depth prediction, robotic grasping and grasp learning. We argue that, such 3D global reasoning processes complement widely-used 2D local approaches such as bag-of-features since 3D relations are invariant under camera transformations and 3D information can be directly linked to actions. We therefore stress the necessity of including both global and local features with different spatial dimensions within a representation. We also discuss the importance of an efficient use of the uncertainty associated with the features, relations, and their applicability in a given context. (c) 2010 Elsevier Inc. All rights reserved.