Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Statistical significance of hierarchical multi‐body potentials based on Delaunay tessellation and their application in sequence‐structure alignment

Statistical significance of hierarchical multi‐body potentials based on Delaunay tessellation and... Statistical potentials based on pairwise interactions between Cα atoms are commonly used in protein threading/fold‐recognition attempts. Inclusion of higher order interaction is a possible means of improving the specificity of these potentials. Delaunay tessellation of the Cα‐atom representation of protein structure has been suggested as a means of defining multi‐body interactions. A large number of parameters are required to define all four‐body interactions of 20 amino acid types (204 = 160,000). Assuming that residue order within a four‐body contact is irrelevant reduces this to a manageable 8,855 parameters, using a nonredundant dataset of 608 protein structures. Three lines of evidence support the significance and utility of the four‐body potential for sequence‐structure matching. First, compared to the four‐body model, all lower‐order interaction models (three‐body, two‐body, one‐body) are found statistically inadequate to explain the frequency distribution of residue contacts. Second, coherent patterns of interaction are seen in a graphic presentation of the four‐body potential. Many patterns have plausible biophysical explanations and are consistent across sets of residues sharing certain properties (e.g., size, hydrophobicity, or charge). Third, the utility of the multi‐body potential is tested on a test set of 12 same‐length pairs of proteins of known structure for two protocols: Sequence‐recognizes‐structure, where a query sequence is threaded (without gap) through the native and a non‐native structure; and structure‐recognizes‐sequence, where a query structure is threaded by its native and another non‐native sequence. Using cross‐validated training, protein sequences correctly recognized their native structure in all 24 cases. Conversely, structures recognized the native sequence in 23 of 24 cases. Further, the score differences between correct and decoy structures increased significantly using the three‐ or four‐body potential compared to potentials of lower order. http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png Protein Science Wiley

Statistical significance of hierarchical multi‐body potentials based on Delaunay tessellation and their application in sequence‐structure alignment

Protein Science , Volume 6 (7) – Jul 1, 1997

Loading next page...
 
/lp/wiley/statistical-significance-of-hierarchical-multi-body-potentials-based-xanlmPgIxa

References (23)

Publisher
Wiley
Copyright
Copyright © 2008 The Protein Society
ISSN
0961-8368
eISSN
1469-896X
DOI
10.1002/pro.5560060711
pmid
9232648
Publisher site
See Article on Publisher Site

Abstract

Statistical potentials based on pairwise interactions between Cα atoms are commonly used in protein threading/fold‐recognition attempts. Inclusion of higher order interaction is a possible means of improving the specificity of these potentials. Delaunay tessellation of the Cα‐atom representation of protein structure has been suggested as a means of defining multi‐body interactions. A large number of parameters are required to define all four‐body interactions of 20 amino acid types (204 = 160,000). Assuming that residue order within a four‐body contact is irrelevant reduces this to a manageable 8,855 parameters, using a nonredundant dataset of 608 protein structures. Three lines of evidence support the significance and utility of the four‐body potential for sequence‐structure matching. First, compared to the four‐body model, all lower‐order interaction models (three‐body, two‐body, one‐body) are found statistically inadequate to explain the frequency distribution of residue contacts. Second, coherent patterns of interaction are seen in a graphic presentation of the four‐body potential. Many patterns have plausible biophysical explanations and are consistent across sets of residues sharing certain properties (e.g., size, hydrophobicity, or charge). Third, the utility of the multi‐body potential is tested on a test set of 12 same‐length pairs of proteins of known structure for two protocols: Sequence‐recognizes‐structure, where a query sequence is threaded (without gap) through the native and a non‐native structure; and structure‐recognizes‐sequence, where a query structure is threaded by its native and another non‐native sequence. Using cross‐validated training, protein sequences correctly recognized their native structure in all 24 cases. Conversely, structures recognized the native sequence in 23 of 24 cases. Further, the score differences between correct and decoy structures increased significantly using the three‐ or four‐body potential compared to potentials of lower order.

Journal

Protein ScienceWiley

Published: Jul 1, 1997

There are no references for this article.