Explicit diversification of search results across multiple dimensions for educational search


Yigit-Sert S., ALTINGÖVDE İ. S. , Macdonald C., Ounis I., ULUSOY Ö.

JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, vol.72, no.3, pp.315-330, 2021 (Peer-Reviewed Journal) identifier identifier

  • Publication Type: Article / Article
  • Volume: 72 Issue: 3
  • Publication Date: 2021
  • Doi Number: 10.1002/asi.24403
  • Journal Name: JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY
  • Journal Indexes: Science Citation Index Expanded, Social Sciences Citation Index, Scopus, Academic Search Premier, FRANCIS, Periodicals Index Online, ABI/INFORM, Aerospace Database, Applied Science & Technology Source, Business Source Elite, Business Source Premier, CINAHL, Communication Abstracts, Compendex, Computer & Applied Sciences, EBSCO Education Source, Education Abstracts, Information Science and Technology Abstracts, INSPEC, Library and Information Science Abstracts, Library Literature and Information Science, Library, Information Science & Technology Abstracts (LISTA), Metadex, MLA - Modern Language Association Database, PAIS International, Public Affairs Index, Civil Engineering Abstracts
  • Page Numbers: pp.315-330

Abstract

Making use of search systems to foster learning is an emerging research trend known assearch as learning. Earlier works identified result diversification as a useful technique to support learning-oriented search, since diversification ensures a comprehensive coverage of various aspects of the queried topic in the result list. Inspired by this finding, first we define a new research problem, multidimensional result diversification, in the context of educational search. We argue that in a search engine for the education domain, it is necessary to diversify results across multiple dimensions, that is, not only for the topical aspects covered by the retrieved documents, but also for other dimensions, such as the type of the document (e.g., text, video, etc.) or its intellectual level (say, for beginners/experts). Second, we propose a framework that extends the probabilistic and supervised diversification methods to take into account the coverage of such multiple dimensions. We demonstrate its effectiveness upon a newly developed test collection based on a real-life educational search engine. Thorough experiments based on gathered relevance annotations reveal that the proposed framework outperforms the baseline by up to 2.4%. An alternative evaluation utilizing user clicks also yields improvements of up to 2% w.r.t. various metrics.