Unsupervised Metric Learning for Face Identification in TV Video

Creative Commons License

Cinbiş R. G., Verbeek J., Schmid C.

IEEE International Conference on Computer Vision (ICCV), Barcelona, Spain, 6 - 13 November 2011, pp.1559-1566 identifier identifier

  • Publication Type: Conference Paper / Full Text
  • Volume:
  • Doi Number: 10.1109/iccv.2011.6126415
  • City: Barcelona
  • Country: Spain
  • Page Numbers: pp.1559-1566
  • Middle East Technical University Affiliated: No


The goal of face identification is to decide whether two faces depict the same person or not. This paper addresses the identification problem for face-tracks that are automatically collected from uncontrolled TV video data. Face-track identification is an important component in systems that automatically label characters in TV series or movies based on subtitles and/or scripts: it enables effective transfer of the sparse text-based supervision to otherfaces. We show that, without manually labeling any examples, metric learning can be effectively used to address this problem. This is possible by using pairs of faces within a track as positive examples, while negative training examples can be generated from pairs of face tracks of different people that appear together in a video frame. In this manner we can learn a cast-specific metric, adapted to the people appearing in a particular video, without using any supervision. Identification performance can be further improved using semi-supervised learning where we also include labels for some of the face tracks. We show that our cast-specific metrics not only improve identification, but also recognition and clustring.