Data Centric Molecular Analysis and Evaluation of Hepatocellular Carcinoma Therapeutics Using Machine Intelligence-Based Tools

Cetin-Atalay R., Kahraman D. C., Nalbat E., Rifaioğlu A. S., Atakan A., Dönmez A., ...More

JOURNAL OF GASTROINTESTINAL CANCER, vol.52, no.4, pp.1266-1276, 2021 (ESCI) identifier identifier identifier

  • Publication Type: Article / Review
  • Volume: 52 Issue: 4
  • Publication Date: 2021
  • Doi Number: 10.1007/s12029-021-00768-x
  • Journal Indexes: Emerging Sources Citation Index (ESCI), Scopus, EMBASE, MEDLINE
  • Page Numbers: pp.1266-1276
  • Keywords: Artificial intelligence, Machine learning, Knowledge graphs, Hepatocellular carcinoma, Drug discovery and repurposing, INFORMATION-SYSTEM, SORAFENIB, NETWORKS, CANCER
  • Middle East Technical University Affiliated: Yes


Purpose Computational approaches have been used at different stages of drug development with the purpose of decreasing the time and cost of conventional experimental procedures. Lately, techniques mainly developed and applied in the field of artificial intelligence (AI), have been transferred to different application domains such as biomedicine. Methods In this study, we conducted an investigative analysis via data-driven evaluation of potential hepatocellular carcinoma (HCC) therapeutics in the context of AI-assisted drug discovery/repurposing. First, we discussed basic concepts, computational approaches, databases, modeling approaches, and featurization techniques in drug discovery/repurposing. In the analysis part, we automatically integrated HCC-related biological entities such as genes/proteins, pathways, phenotypes, drugs/compounds, and other diseases with similar implications, and represented these heterogeneous relationships via a knowledge graph using the CROssBAR system. Results Following the system-level evaluation and selection of critical genes/proteins and pathways to target, our deep learning-based drug/compound-target protein interaction predictors DEEPScreen and MDeePred have been employed for predicting new bioactive drugs and compounds for these critical targets. Finally, we embedded ligands of selected HCC-associated proteins which had a significant enrichment with the CROssBAR system into a 2-D space to identify and repurpose small molecule inhibitors as potential drug candidates based on their molecular similarities to known HCC drugs. Conclusions We expect that these series of data-driven analyses can be used as a roadmap to propose early-stage potential inhibitors (from database-scale sets of compounds) to both HCC and other complex diseases, which may subsequently be analyzed with more targeted in silico and experimental approaches.