Loop-based conic multivariate adaptive regression splines is a novel method for advanced construction of complex biological networks


Ayyıldız Demirci E., Purutçuoğlu Gazi V., Weber G. W.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, cilt.270, ss.852-861, 2018 (SCI-Expanded) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 270
  • Basım Tarihi: 2018
  • Doi Numarası: 10.1016/j.ejor.2017.12.011
  • Dergi Adı: EUROPEAN JOURNAL OF OPERATIONAL RESEARCH
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Sayfa Sayıları: ss.852-861
  • Anahtar Kelimeler: OR in medicine, Conic multivariate adaptive regression splines, Gaussian graphical model, Biological networks, Accuracy measures, PARAMETER-ESTIMATION, REGULATORY NETWORKS, OPTIMIZATION, INFERENCE, SELECTION, MODELS
  • Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

The Gaussian Graphical Model (GGM) and its Bayesian alternative, called, the Gaussian copula graphical model (GCGM) are two widely used approaches to construct the undirected networks of biological systems. They define the interactions between species by using the conditional dependencies of the multivariate normality assumption. However, when the system's dimension is high, the performance of the model becomes computationally demanding, and, particularly, the accuracy of GGM decreases when the observations are far from normality. Here, we suggest a Conic Multivariate Adaptive Regression Splines (CMARS) as an alternative to GGM and GCGM to ameliorate both problems. CMARS is a modified version of the Multivariate Adaptive Regression Spline, a well-known modeling approaches used in Operational Research (OR) to represent biological, environmental, and economic data. The main benefit of this model is its compatibility with high-dimensional and correlated measurements of serious nonlinearity, which allows for a wide field of application. We adapted CMARS to describe biological systems and called it "LCMARS" due to its loop-based description. We then applied LCMARS to simulated and real datasets, and LCMARS produced more accurate results compared to GGM and GCGM. Hereby, the ability to use LCMARS in the description of biological networks has the potential to open up new avenues in the application of OR to computational biology and bioinformatics, and can thus help us better understanding complex diseases like cancer and hepatitis. (C) 2017 Elsevier B.V. All rights reserved.