Application of Neural Networks for Protein Sequence Classification

dc.contributor.author Sharma, Sameer
dc.contributor.author Kumar, Vinod
dc.contributor.author Rani, T. Sobha
dc.contributor.author Bhavani, S. Durga
dc.contributor.author Raju, S. Bapi
dc.date.accessioned 2022-03-27T05:50:53Z
dc.date.available 2022-03-27T05:50:53Z
dc.date.issued 2004-05-04
dc.description.abstract Protein sequence classification is modelled as a binary classification problem where an unlabeled protein sequence is checked to see if it belongs to a known set of protein superfamilies or not. In this paper we used multilayer perceptrons with supervised learning algorithm to learn the binary classification. The training data consists of two sets - a positive set belonging to an identified set of protein superfamily and a negative set comprising sequences from other superfamilies. When applying neural networks the first problem to be addressed is feature extraction. In this paper we used the new feature extraction techniques proposed by Wang et al. [4]. Simulations reveal that the neural network is able to classify with good precision for Myosin and Photochrome superfamilies in the data set that we have chosen as positive . Also the results for Globin superfamily are good, thus validating the methodology of feature extraction and the application of neural networks for protein sequence classification as suggested by Wang et al. But, for Actin and Ribonuclease superfamilies the network showed poor performance. One possible reason for this may be that the choice of sequences in the negative data set is not optimal. We conclude from this work that the classification performance depends upon a proper selection of sequences for positive and negative data sets.
dc.identifier.citation Proceedings of International Conference on Intelligent Sensing and Information Processing, ICISIP 2004
dc.identifier.uri https://dspace.uohyd.ac.in/handle/1/8272
dc.subject Bi-gram features
dc.subject Feature Extraction
dc.subject Neural Networks
dc.title Application of Neural Networks for Protein Sequence Classification
dc.type Conference Proceeding. Conference Paper
dspace.entity.type
Files
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: