Bi-k-bi clustering: mining large scale gene expression data using two-level biclustering

Carkacioglu L., Atalay R. C., KONU KARAKAYALI Ö., Atalay V., CAN T.

INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, vol.4, no.6, pp.701-721, 2010 (SCI-Expanded)


Due to the increase in gene expression data sets in recent years, various data mining techniques have been proposed for mining gene expression profiles. However, most of these methods target single gene expression data sets and cannot handle all the available gene expression data in public databases in a reasonable amount of time and space. In this paper, we propose a novel framework, bi-k-bi clustering, for finding association rules of gene pairs that can easily operate on large-scale and multiple heterogeneous data sets. We applied the proposed framework to the available NCBI GEO Homo sapiens data sets. Our results are consistent with and related to the available literature, and also provide novel associations.