ILP-based concept discovery in multi-relational data mining

Kavurucu, Y.; Senkul, PINAR; TOROSLU, İSMAİL

doi:10.1016/j.eswa.2009.02.100

ILP-based concept discovery in multi-relational data mining

Kavurucu Y., Senkul P., TOROSLU İ. H.

EXPERT SYSTEMS WITH APPLICATIONS, cilt.36, sa.9, ss.11418-11428, 2009 (SCI-Expanded, Scopus)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 36 Sayı: 9
Basım Tarihi: 2009
Doi Numarası: 10.1016/j.eswa.2009.02.100
Dergi Adı: EXPERT SYSTEMS WITH APPLICATIONS
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Sayfa Sayıları: ss.11418-11428
Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

Multi-relational data mining has become popular due to the limitations of propositional problem definition in structured domains and the tendency of storing data in relational databases. Several relational knowledge discovery systems have been developed employing various search strategies, heuristics, language pattern limitations and hypothesis evaluation criteria, in order to cope with intractably large search space and to be able to generate high-quality patterns. In this work, an ILP-based concept discovery method, namely Confidence-based Concept Discovery (C(2)D), is described in which strong declarative biases and user-defined specifications are relaxed. Moreover, this new method directly works on relational databases. In addition to this, a new confidence-based pruning is used in this technique. We also describe how to define and use aggregate predicates as background knowledge in the proposed method. In order to use aggregate predicates, we show how to handle numerical attributes by using comparison operators on them. Finally, we analyze the effect of incorporating unrelated facts for generating transitive rules on the proposed method. A set of experiments are conducted on real-world problems to test the performance of the proposed method. (c) 2009 Elsevier Ltd. All rights reserved.