Mining of protein contact maps for protein fold prediction
Mining of protein contact maps for protein fold prediction
No Thumbnail Available
Date
2011-07-01
Authors
Durga Bhavani, S.
Suvarnavani, K.
Sinha, Somdatta
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
The three-dimensional structure of proteins is useful to carry out the biophysical and biochemical functions in a cell. Approaches to protein structure/fold prediction typically extract amino acid sequence features, and machine learning approaches are then applied to classification problem. Protein contact maps are two-dimensional representations of the contacts among the amino acid residues in the folded protein structure. This paper highlights the need for a systematic study of these contact networks. Mining of contact maps to derive features pertaining to fold information offers a new mechanism for fold discovery from the protein sequence via the contact maps. These ideas are explored in the structural class of all-alpha proteins to identify structural elements. A simple and computationally inexpensive algorithm based on triangle subdivisionmethod is proposed to extract additional features from the contactmap. The method successfully characterizes the off-diagonal interactions in the contact map for predicting specific 'folds'. The decision tree classification results show great promisein developing a new and simple tool for the challenging problem of fold prediction. © 2011 John Wiley & Sons, Inc.
Description
Keywords
Citation
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery. v.1(4)