SLPred: a multi-view subcellular localization prediction tool for multi-location human proteins


ÖZSARI G., Rifaioglu A. S., ATAKAN A., DOĞAN T., Martin M. J., Atalay R. C., ...Daha Fazla

BIOINFORMATICS, cilt.38, sa.17, ss.4226-4229, 2022 (SCI-Expanded) identifier identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 38 Sayı: 17
  • Basım Tarihi: 2022
  • Doi Numarası: 10.1093/bioinformatics/btac458
  • Dergi Adı: BIOINFORMATICS
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Sayfa Sayıları: ss.4226-4229
  • Orta Doğu Teknik Üniversitesi Adresli: Evet

Özet

Accurate prediction of the subcellular locations (SLs) of proteins is a critical topic in protein science. In this study, we present SLPred, an ensemble-based multi-view and multi-label protein subcellular localization prediction tool. For a query protein sequence, SLPred provides predictions for nine main SLs using independent machine-learning models trained for each location. We used UniProtKB/Swiss-Prot human protein entries and their curated SL annotations as our source data. We connected all disjoint terms in the UniProt SL hierarchy based on the corresponding term relationships in the cellular component category of Gene Ontology and constructed a training dataset that is both reliable and large scale using the re-organized hierarchy. We tested SLPred on multiple benchmarking datasets including our-in house sets and compared its performance against six state-of-the-art methods. Results indicated that SLPred outperforms other tools in the majority of cases.