HotSprint: database of computational hot spots in protein interfaces


Guney E., Tuncbag N. , Keskin O., Gursoy A.

NUCLEIC ACIDS RESEARCH, cilt.36, 2008 (SCI İndekslerine Giren Dergi) identifier identifier identifier

  • Cilt numarası: 36
  • Basım Tarihi: 2008
  • Doi Numarası: 10.1093/nar/gkm813
  • Dergi Adı: NUCLEIC ACIDS RESEARCH

Özet

We present a new database of computational hot spots in protein interfaces: HotSprint. Hot spots are residues comprising only a small fraction of interfaces yet accounting for the majority of the binding energy. HotSprint contains data for 35 776 protein interfaces among 49 512 protein interfaces extracted from the multi-chain structures in Protein Data Bank (PDB) as of February 2006. The conserved residues in interfaces with certain buried accessible solvent area (ASA) and complex ASA thresholds are flagged as computational hot spots. The predicted hot spots are observed to correlate with the experimental hot spots with an accuracy of 76. Several machine-learning methods (SVM, Decision Trees and Decision Lists) are also applied to predict hot spots, results reveal that our empirical approach performs better than the others. A web interface for the HotSprint database allows users to browse and query the hot spots in protein interfaces. HotSprint is available at http://prism.ccbb.ku.edu.tr/hotsprint; and it provides information for interface residues that are functionally and structurally important as well as the evolutionary history and solvent accessibility of residues in interfaces.