Semi-Automatic Annotation For Visual Object Tracking

Ince K. G., Köksal A., Alatan A. A.

2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), Montreal, Canada, 11 - 18 October 2021 identifier identifier identifier

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/iccvw54120.2021.00143
  • City: Montreal
  • Country: Canada
  • Middle East Technical University Affiliated: Yes


We propose a semi-automatic bounding box annotation method for visual object tracking by utilizing temporal information with a tracking-by-detection approach. For detection, we use an off-the-shelf object detector which is trained iteratively with the annotations generated by the proposed method, and we perform object detection on each frame independently. We employ Multiple Hypothesis Tracking (MHT) to exploit temporal information and to reduce the number of false positives which makes it possible to use lower objectness thresholds for detection to increase recall. The tracklets formed by MHT are evaluated by human operators to enlarge the training set. This novel incremental learning approach helps to perform annotation iteratively. The experiments performed on AUTH Multidrone Dataset reveal that the annotation workload can be reduced up to 96% by the proposed approach. Resulting uav_detection_2 annotations and our codes are publicly available at Video-Annotation-OGAM.