CTSS: A robust and efficient method for protein structure alignment based on local geometrical and biological features

Can T., Wang Y.

2nd International Computational Systems Bioinformatics Conference, California, United States Of America, 11 - 14 August 2003, pp.169-179

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/csb.2003.1227316
  • City: California
  • Country: United States Of America
  • Page Numbers: pp.169-179
  • Middle East Technical University Affiliated: No


We present a new method for conducting protein structure similarity searches, which improves on the accuracy, robustness, and efficiency of some existing techniques. Our method is grounded in the differential-geometric theory of 3D space curve matching. We generate shape signatures for proteins that are invariant, localized, robust, compact, and biologically meaningful. To improve matching accuracy, we smooth the noisy raw atomic coordinate data with spline fitting. To improve matching efficiency, we adopt a hierarchical coarse-to-fine strategy: an efficient hashing-based technique screens out unlikely candidates, and detailed pairwise alignments are performed only for the small number of candidates that survive the screening. Unlike other hashing-based techniques, our technique uses domain-specific information (not just geometric information) in constructing the hash key, and hence is better tuned to the domain of biology. Furthermore, the invariance, localization, and compactness of the shape signatures allow us to use a well-known local sequence alignment algorithm for aligning two protein structures. One measure of the efficacy of the proposed technique is that we were able to discover new, meaningful motifs that were not reported by other structure alignment methods.
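
As an illustration of what an invariant, localized shape signature for a 3D space curve can look like, the sketch below computes curvature and torsion at each interior sample of a curve. It is a minimal example, not the authors' implementation: the abstract does not state which invariants are used, so the choice of curvature and torsion is an assumption, finite differences stand in for the spline fitting the abstract mentions, and the names `curvature_torsion` and `helix` are hypothetical. Both quantities are unchanged by rotation and translation of the curve, and each depends only on a few neighboring samples.

```python
import math


def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def curvature_torsion(points, h):
    """Return (curvature, torsion) for each interior sample of a 3D curve.

    Assumes `points` are sampled at a uniform parameter step `h`.
    Derivatives are central finite differences (an assumption of this
    sketch; the abstract describes spline fitting instead).
    The first and last two samples are skipped.
    """
    signature = []
    for i in range(2, len(points) - 2):
        pm2, pm1, p0, pp1, pp2 = points[i - 2:i + 3]
        # First, second, and third derivatives of the curve r(t).
        d1 = tuple((b - a) / (2 * h) for a, b in zip(pm1, pp1))
        d2 = tuple((a - 2 * c + b) / (h * h) for a, c, b in zip(pm1, p0, pp1))
        d3 = tuple((b2 - 2 * b1 + 2 * a1 - a2) / (2 * h ** 3)
                   for a2, a1, b1, b2 in zip(pm2, pm1, pp1, pp2))
        c = _cross(d1, d2)
        c_norm = math.sqrt(_dot(c, c))
        speed = math.sqrt(_dot(d1, d1))
        # kappa = |r' x r''| / |r'|^3
        kappa = c_norm / speed ** 3
        # tau = (r' x r'') . r''' / |r' x r''|^2  (set to 0 where the curve is straight)
        tau = _dot(c, d3) / (c_norm * c_norm) if c_norm > 1e-12 else 0.0
        signature.append((kappa, tau))
    return signature


def helix(a, b, n, h):
    """Sample n points of the helix (a cos t, a sin t, b t) at step h."""
    return [(a * math.cos(i * h), a * math.sin(i * h), b * i * h)
            for i in range(n)]
```

For a circular helix with radius `a` and pitch parameter `b`, the analytic values are curvature `a / (a**2 + b**2)` and torsion `b / (a**2 + b**2)`, constant along the curve, which gives a simple check on the sketch. A sequence of such per-residue values is the kind of compact, one-dimensional signature that a local sequence alignment algorithm could then compare.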