A new framework of multi-objective evolutionary algorithms for feature selection and multi-label classification of video data


Karagoz G. N., YAZICI A., Dokeroglu T., Cosar A.

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, cilt.12, sa.1, ss.53-71, 2021 (SCI-Expanded) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 12 Sayı: 1
  • Basım Tarihi: 2021
  • Doi Numarası: 10.1007/s13042-020-01156-w
  • Dergi Adı: INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Compendex, INSPEC
  • Sayfa Sayıları: ss.53-71
  • Anahtar Kelimeler: Multi-label classification, Multi-objective optimization, Evolutionary, Machine learning, Feature selection, GENETIC ALGORITHM, OPTIMIZATION
  • Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

There are few studies in the literature to address the multi-objective multi-label feature selection for the classification of video data using evolutionary algorithms. Selecting the most appropriate subset of features is a significant problem while maintaining/improving the accuracy of the prediction results. This study proposes a framework of parallel multi-objective Non-dominated Sorting Genetic Algorithms (NSGA-II) for exploring a Pareto set of non-dominated solutions. The subsets of non-dominated features are extracted and validated by multi-label classification techniques, Binary Relevance (BR), Classifier Chains (CC), Pruned Sets (PS), and Random k-Labelset (RAkEL). Base classifiers such as Support Vector Machines (SVM), J48-Decision Tree (J48), and Logistic Regression (LR) are performed in the classification phase of the algorithms. Comprehensive experiments are carried out with local feature descriptors extracted from two multi-label data sets, the well-known MIR-Flickr dataset and a Wireless Multimedia Sensor (WMS) dataset that we have generated from our video recordings. The prediction accuracy levels are improved by 6.36% and 25.7% for the MIR-Flickr and WMS datasets respectively while the number of features is significantly reduced. The results verify that the algorithms presented in this new framework outperform the state-of-the-art algorithms.