Subtree selection in kernels for graph classification

TAN, MEHMET; POLAT, FARUK; Alhajj, Reda

doi:10.1504/ijdmb.2013.056080

Subtree selection in kernels for graph classification

TAN M., POLAT F., Alhajj R.

INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, cilt.8, sa.3, ss.294-310, 2013 (SCI-Expanded, Scopus)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 8 Sayı: 3
Basım Tarihi: 2013
Doi Numarası: 10.1504/ijdmb.2013.056080
Dergi Adı: INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Sayfa Sayıları: ss.294-310
Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

Classification of structured data is essential for a wide range of problems in bioinformatics and cheminformatics. One such problem is in silico prediction of small molecule properties such as toxicity, mutagenicity and activity. In this paper, we propose a new feature selection method for graph kernels that uses the subtrees of graphs as their feature sets. A masking procedure which boils down to feature selection is proposed for this purpose. Experiments conducted on several data sets as well as a comparison of our method with some frequent subgraph based approaches are presented.