Defining Image Memorability Using the Visual Memory Schema

Akagunduz, ERDEM; Bors, Adrian; Evans, Karla

doi:10.1109/tpami.2019.2914392

Defining Image Memorability Using the Visual Memory Schema

Atıf İçin Kopyala

Akagunduz E., Bors A. G., Evans K. K.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, cilt.42, sa.9, ss.2165-2178, 2020 (SCI-Expanded)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 42 Sayı: 9
Basım Tarihi: 2020
Doi Numarası: 10.1109/tpami.2019.2914392
Dergi Adı: IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, PASCAL, ABI/INFORM, Aerospace Database, Applied Science & Technology Source, Business Source Elite, Business Source Premier, Communication Abstracts, Compendex, Computer & Applied Sciences, EMBASE, INSPEC, MEDLINE, Metadex, zbMATH, Civil Engineering Abstracts
Sayfa Sayıları: ss.2165-2178
Anahtar Kelimeler: Visualization, Observers, Semantics, Psychology, Organizations, Image recognition, Computer vision, Image memorability, visual memory schema, memory experiments, deep features
Orta Doğu Teknik Üniversitesi Adresli: Hayır

Özet

Memorability of an image is a characteristic determined by the human observers' ability to remember images they have seen. Yet recent work on image memorability defines it as an intrinsic property that can be obtained independent of the observer. The current study aims to enhance our understanding and prediction of image memorability, improving upon existing approaches by incorporating the properties of cumulative human annotations. We propose a new concept called the Visual Memory Schema (VMS) referring to an organization of image components human observers share when encoding and recognizing images. The concept of VMS is operationalised by asking human observers to define memorable regions of images they were asked to remember during an episodic memory test. We then statistically assess the consistency of VMSs across observers for either correctly or incorrectly recognised images. The associations of the VMSs with eye fixations and saliency are analysed separately as well. Lastly, we adapt various deep learning architectures for the reconstruction and prediction of memorable regions in images and analyse the results when using transfer learning at the outputs of different convolutional network layers.