Multimodal query-level fusion for efficient multimedia information retrieval

Sattari S., YAZICI A.

INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, vol.33, no.10, pp.2019-2037, 2018 (SCI-Expanded) identifier identifier

  • Publication Type: Article / Article
  • Volume: 33 Issue: 10
  • Publication Date: 2018
  • Doi Number: 10.1002/int.21920
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Page Numbers: pp.2019-2037
  • Keywords: cross-modal retrieval, multimedia database, multimodal query, query expansion, query level fusion, ONTOLOGY, ACCESS, VIDEO
  • Middle East Technical University Affiliated: Yes


Managing a large volume of multimedia data containing various modalities such as visual, audio, and text reveals the necessity for efficient methods for modeling, processing, storing, and retrieving complex data. In this paper, we propose a fusion-based approach at the query level to improve query retrieval performance of multimedia data. We discuss various flexible query types including the combination of content as well as concept-based queries that provide users with the ability to efficiently perform multimodal querying. We have carried out a number of experiments on a video database to show the efficiency of our approach for various types of queries. Our experimental results show that our query-level fusion approach presents a notable improvement in retrieval performance especially for the concept-based queries.