A Supervised Biclustering Optimization Model for Feature Selection in Biomedical Dataset Classification


Arikan S. D. O. , İYİGÜN C.

1st International Conference on Data Mining and Big Data (DMBD), Balvi, Latvia, 25 - 30 June 2016, vol.9714, pp.196-204 identifier identifier

  • Publication Type: Conference Paper / Full Text
  • Volume: 9714
  • Doi Number: 10.1007/978-3-319-40973-3_19
  • City: Balvi
  • Country: Latvia
  • Page Numbers: pp.196-204

Abstract

Biclustering groups samples and features simultaneously in the given set of data. When biclusters are obtained from the data, clusters of samples and clusters of features that determine the partitioning of samples into the underlying clusters are also obtained. We focus on a supervised biclustering problem leading to unsupervised feature selection. We formulate this problem as an optimization model which aims to maximize classification accuracy by selecting a small subset of features. We solve the model with exact and inexact solution methods based on optimization techniques. Microarray cancer datasets are used to experiment our approach.