SUST Repository

A Data Oriented Approach to Assess the Accuracy of a Protein Secondary Structure Predictor

Show simple item record

dc.contributor.author A. , Saad Subair
dc.date.accessioned 2017-04-25T07:32:33Z
dc.date.available 2017-04-25T07:32:33Z
dc.date.issued 2015
dc.identifier.citation A. , Saad Subair . A Data Oriented Approach to Assess the Accuracy of a Protein Secondary Structure Predictor \ Saad Subair A. .- Journal of Engineering and Computer Sciences (ECS) .- vol 16 , no 2 .- 2015.-article en_US
dc.identifier.issn ISSN 1605-427X
dc.identifier.uri http://repository.sustech.edu/handle/123456789/16617
dc.description article en_US
dc.description.abstract Researchers in the field of protein secondary structure prediction use typical three states of secondary structures, namely: alpha helices (H) beta strands (E), and coils (C). The series of amino acids polymers linked together into adjacent chains are known as proteins. Protein secondary structure prediction is a fundamental step in determining the final structure and functions of a protein. In this work we developed a prediction machine for protein secondary structure. By investigating the amino acids benchmark data sets, it was observed that the data is grouped into two distinct states or groups almost 50% each. In this scheme, researchers classify any state which is not classified as helix or strands as coils. Hence, in this work a new way of looking to the data set is adopted. For this type of data, the Receiver Operating Characteristic (ROC) analysis is considered for analysing and interpreting the results of assessing the protein secondary structure classifier. The results revealed that ROC analysis showed similar results to that obtained using other non ROC classification methods. The ROC curves were able to discriminate the coil states from non-coil states by 72% prediction accuracy with very small standard error. en_US
dc.description.sponsorship Sudan University of Science and Technology en_US
dc.language.iso en_US en_US
dc.publisher Sudan University of Science and Technology en_US
dc.subject Protein Secondary Structure Prediction, Receiver Operating Characteristics (ROC), Area Under Curve (AUC), Binary Classification, Bioinformatics. en_US
dc.title A Data Oriented Approach to Assess the Accuracy of a Protein Secondary Structure Predictor en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Share

Search SUST


Browse

My Account