Hierarchical distance learning by stacking nearest neighbor classifiers

Ozay, Mete; YARMAN VURAL, FATOŞ

doi:10.1016/j.inffus.2015.09.004

Ozay M., YARMAN VURAL F. T.

INFORMATION FUSION, vol.29, pp.14-31, 2016 (SCI-Expanded)

Publication Type: Article / Full Article
Volume: 29
Publication Date: 2016
DOI: 10.1016/j.inffus.2015.09.004
Journal Name: INFORMATION FUSION
Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Pages: pp.14-31
Keywords: Decision fusion, Classification, Nearest neighbor rule, Ensemble learning, Hierarchical distance learning, ENSEMBLES, SELECTION, IMAGE, INFORMATION, COMBINATION, LIBRARY
Affiliated with Middle East Technical University: Yes

Abstract

We propose a two-layer decision fusion technique, called Fuzzy Stacked Generalization (FSG), which establishes a hierarchical distance learning architecture. At the base-layer of an FSG, fuzzy k-NN classifiers receive different feature sets, each of which is extracted from the same dataset, to gain multiple views of the dataset. At the meta-layer, first, a fusion space is constructed by aggregating the decision spaces of all the base-layer classifiers. Then, a fuzzy k-NN classifier is trained in the fusion space by minimizing the difference between the large-sample and N-sample classification errors. In order to measure the degree of collaboration among the base-layer classifiers and the diversity of the feature spaces, a new measure, called shareability, is introduced. Shareability is defined as the number of samples that are correctly classified by at least one of the base-layer classifiers in FSG. In the experiments, we observe that FSG performs better than the popular distance learning and ensemble learning algorithms when the shareability measure is large enough that most of the samples are correctly classified by at least one of the base-layer classifiers. The relationship between the proposed and state-of-the-art diversity measures is experimentally analyzed. Tests performed on a variety of artificial and real-world benchmark datasets show that the classification performance of FSG, relative to that of state-of-the-art ensemble learning and distance learning methods, improves as the number of classes increases. (C) 2015 Elsevier B.V. All rights reserved.
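
The two-layer structure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes crisp training labels, Euclidean distance, inverse-distance neighbor weighting with a fuzzifier `m`, and leave-one-out membership estimates at the base layer; it does not implement the minimization of the difference between large-sample and N-sample errors. All function names (`fuzzy_knn`, `fusion_space`, `shareability`, `fsg_predict`) and the toy data are hypothetical.

```python
import math

def fuzzy_knn(train_X, train_y, x, k, n_classes, m=2.0, skip=None):
    """Return a class-membership vector for x from its k nearest neighbors.

    Neighbors vote with weight 1 / d**(2 / (m - 1)); training labels are crisp.
    `skip` excludes one training index (used for leave-one-out estimates).
    """
    neighbors = sorted(
        (math.dist(x, t), label)
        for i, (t, label) in enumerate(zip(train_X, train_y))
        if i != skip
    )[:k]
    # The small constant avoids division by zero for coincident points.
    weights = [1.0 / (d ** (2.0 / (m - 1.0)) + 1e-12) for d, _ in neighbors]
    total = sum(weights)
    mu = [0.0] * n_classes
    for w, (_, label) in zip(weights, neighbors):
        mu[label] += w / total
    return mu

def argmax(v):
    return max(range(len(v)), key=v.__getitem__)

def fusion_space(views, y, k, n_classes):
    """Concatenate leave-one-out membership vectors from every view.

    views: one feature set per base-layer classifier, each a list of
    per-sample feature vectors. Returns (fused vectors, per-view predictions).
    """
    fused = [[] for _ in y]
    preds = []
    for X in views:
        view_pred = []
        for i in range(len(y)):
            mu = fuzzy_knn(X, y, X[i], k, n_classes, skip=i)
            fused[i].extend(mu)
            view_pred.append(argmax(mu))
        preds.append(view_pred)
    return fused, preds

def shareability(preds, y):
    """Count samples classified correctly by at least one base-layer classifier."""
    return sum(any(p[i] == y[i] for p in preds) for i in range(len(y)))

def fsg_predict(views, y, query, k, n_classes):
    """Map a query (one feature vector per view) into the fusion space,
    then classify it there with a meta-layer fuzzy k-NN."""
    fused_train, _ = fusion_space(views, y, k, n_classes)
    fused_query = []
    for X, q in zip(views, query):
        fused_query.extend(fuzzy_knn(X, y, q, k, n_classes))
    return argmax(fuzzy_knn(fused_train, y, fused_query, k, n_classes))

# Toy data: two 1-D views of the same six samples (illustrative values only).
y = [0, 0, 0, 1, 1, 1]
view_a = [[0.0], [0.1], [0.25], [1.0], [1.1], [0.45]]
view_b = [[0.0], [0.1], [0.75], [1.0], [1.1], [1.2]]

fused, base_preds = fusion_space([view_a, view_b], y, k=2, n_classes=2)
print("shareability:", shareability(base_preds, y), "of", len(y))
print("prediction:", fsg_predict([view_a, view_b], y, [[0.05], [0.05]], k=2, n_classes=2))
```

In this toy data each view misclassifies one sample, but not the same one, so every sample is recovered by at least one base-layer classifier. That is the situation in which the abstract reports FSG performing well.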