Multimodal concept detection in broadcast media: KavTan

SOYSAL, Medeni; Logoglu, ABDULLAH; TEKİN, Mashar; ESEN, Ersin; SARACOĞLU, Ahmet; Acar, Banu; Ozan, Ezgi; Ates, Tugrul; SEVİMLİ, Hakan; SEVİNÇ, Muge; ATIL, Ilkay; Ozkan, Savas; Arabaci, Mehmet; TANKIZ, Seda; KARADENİZ, Talha; ÖNÜR, Duygu; SELÇUK, Sezin; Alatan, A.; Ciloglu, TOLGA

doi:10.1007/s11042-013-1564-z

Multimodal concept detection in broadcast media: KavTan

SOYSAL M., Logoglu K. B., TEKİN M., ESEN E., SARACOĞLU A., Acar B. O., ...Daha Fazla

MULTIMEDIA TOOLS AND APPLICATIONS, cilt.72, sa.3, ss.2787-2832, 2014 (SCI-Expanded, Scopus)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 72 Sayı: 3
Basım Tarihi: 2014
Doi Numarası: 10.1007/s11042-013-1564-z
Dergi Adı: MULTIMEDIA TOOLS AND APPLICATIONS
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Sayfa Sayıları: ss.2787-2832
Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

Concept detection stands as an important problem for efficient indexing and retrieval in large video archives. In this work, the KavTan System, which performs high-level semantic classification in one of the largest TV archives of Turkey, is presented. In this system, concept detection is performed using generalized visual and audio concept detection modules that are supported by video text detection, audio keyword spotting and specialized audio-visual semantic detection components. The performance of the presented framework was assessed objectively over a wide range of semantic concepts (5 high-level, 14 visual, 9 audio, 2 supplementary) by using a significant amount of precisely labeled ground truth data. KavTan System achieves successful high-level concept detection performance in unconstrained TV broadcast by efficiently utilizing multimodal information that is systematically extracted from both spatial and temporal extent of multimedia data.