TopoBDA: Towards Bezier deformable attention for road topology understanding


Kalfaoglu M. E., Öztürk H. İ., Kilinc O., Temizel A.

NEUROCOMPUTING, cilt.670, 2026 (SCI-Expanded, Scopus) identifier identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 670
  • Basım Tarihi: 2026
  • Doi Numarası: 10.1016/j.neucom.2025.132360
  • Dergi Adı: NEUROCOMPUTING
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Compendex, EMBASE, INSPEC, zbMATH
  • Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

Understanding road topology is crucial for autonomous driving. This paper introduces TopoBDA (Topology with Bezier Deformable Attention), a novel approach that enhances road topology comprehension by lever aging Bezier Deformable Attention (BDA). TopoBDA processes multi-camera 360-degree imagery to generate Bird's Eye View (BEV) features, which are refined through a transformer decoder employing BDA. BDA utilizes Bezier control points to drive the deformable attention mechanism, improving the detection and representa tion of elongated and thin polyline structures, such as lane centerlines. Additionally, TopoBDA integrates two auxiliary components: an instance mask formulation loss and a one-to-many set prediction loss strategy, to further refine centerline detection and enhance road topology understanding. Experimental evaluations on the OpenLane-V2 dataset demonstrate that TopoBDA outperforms existing methods, achieving state-of-the-art re sults in centerline detection and topology reasoning. TopoBDA also achieves the best results on the OpenLane-V1 dataset in 3D lane detection. Further experiments on integrating multi-modal data-such as LiDAR, radar, and SDMap-show that multimodal inputs can further enhance performance in road topology understanding. Project page: https://artest08.github.io/TopoBDA.github.io/.