A Supervised Biclustering Optimization Model for Feature Selection in Biomedical Dataset Classification


Arikan S. D. O., İyigün C.

1st International Conference on Data Mining and Big Data (DMBD), Balvi, Latvia, 25 - 30 June 2016, vol.9714, pp.196-204

  • Publication Type: Conference Paper / Full-Text Paper
  • Volume: 9714
  • DOI: 10.1007/978-3-319-40973-3_19
  • City of Publication: Balvi
  • Country of Publication: Latvia
  • Pages: pp.196-204
  • Keywords: Biclustering, Feature selection, Classification, Optimization, Microarray, Gene expression, Prediction, Cancer
  • Affiliated with Middle East Technical University: Yes

Abstract

Biclustering groups the samples and the features of a given dataset simultaneously. Obtaining biclusters therefore yields both clusters of samples and the clusters of features that determine how the samples are partitioned into those clusters. We focus on a supervised biclustering problem leading to unsupervised feature selection. We formulate this problem as an optimization model that aims to maximize classification accuracy while selecting a small subset of features. We solve the model with exact and inexact solution methods based on optimization techniques. Microarray cancer datasets are used to evaluate our approach.
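To make the idea of selecting a small feature subset that maximizes classification accuracy more concrete, the following is a minimal sketch in Python. It is not the optimization model from the paper, whose formulation is not given in this record. It assumes a simple centroid rule (each feature is assigned to the class in which its mean value is highest), a classifier that picks the class whose selected features have the highest average in a sample, and a greedy forward search standing in for an inexact solution method. The function names `assign_features`, `predict`, `accuracy`, and `greedy_select`, and the toy data, are all hypothetical.

```python
from statistics import mean


def assign_features(X, y):
    """Assign each feature to the class with the highest mean value (assumed rule)."""
    classes = sorted(set(y))
    feat_class = []
    for j in range(len(X[0])):
        centroids = {
            c: mean(row[j] for row, label in zip(X, y) if label == c)
            for c in classes
        }
        feat_class.append(max(classes, key=lambda c: centroids[c]))
    return feat_class


def predict(x, selected, feat_class, classes):
    """Predict the class whose selected features have the highest average in x."""
    def score(c):
        vals = [x[j] for j in selected if feat_class[j] == c]
        # A class with no selected feature can never win.
        return mean(vals) if vals else float("-inf")
    return max(classes, key=score)


def accuracy(X, y, selected, feat_class):
    """Fraction of training samples classified correctly with the selected features."""
    classes = sorted(set(y))
    hits = sum(
        predict(x, selected, feat_class, classes) == label
        for x, label in zip(X, y)
    )
    return hits / len(y)


def greedy_select(X, y, max_features):
    """Greedy forward selection: add the feature that most improves accuracy."""
    feat_class = assign_features(X, y)
    selected, best = [], 0.0
    remaining = set(range(len(X[0])))
    while remaining and len(selected) < max_features:
        cand, cand_acc = None, best
        for j in sorted(remaining):
            acc = accuracy(X, y, selected + [j], feat_class)
            if acc > cand_acc:  # require a strict improvement
                cand, cand_acc = j, acc
        if cand is None:
            break
        selected.append(cand)
        remaining.remove(cand)
        best = cand_acc
    return selected, best


# Hypothetical toy data: rows are samples, columns are features.
X = [
    [5.0, 4.8, 1.0, 0.9, 3.0],
    [5.2, 5.1, 1.1, 1.2, 3.1],
    [1.0, 0.8, 5.0, 4.9, 3.0],
    [0.9, 1.1, 5.3, 5.1, 2.9],
]
y = ["A", "A", "B", "B"]

feature_classes = assign_features(X, y)
selected, train_accuracy = greedy_select(X, y, max_features=2)
print(feature_classes, selected, train_accuracy)
```

The greedy search only approximates the best subset. An exact method would instead optimize over binary selection variables, for example with an integer programming solver. Accuracy here is measured on the training samples only, so a real evaluation would need held-out data.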