Improving protein structural class prediction using novel combined sequence information and predicted secondary structural features |
| |
Authors: | Dai Qi; Wu Li; Li Lihua |
| |
Affiliation: | College of Life Sciences, Zhejiang Sci-Tech University, Hangzhou 310018, People's Republic of China. daiailiu2004@yahoo.com.cn |
| |
Abstract: | Predicting protein structural class from sequence alone is a challenging problem in bioinformatics. Numerous efficient methods have been proposed, but challenges remain. We propose a scheme that improves structural class prediction by coupling novel combined sequence information with predicted secondary structural features (PSSF). Given an amino acid sequence, we first transform it into a reduced amino acid sequence and calculate its word frequencies and word position features, which together form the combined sequence information. We then add the PSSF to the combined sequence information to predict the protein structural class. The proposed method was tested on four low-homology benchmark datasets and achieved overall prediction accuracies of 83.1%, 87.0%, 94.5%, and 85.2%, respectively. Comparison with existing methods shows overall improvements ranging from 2.3% to 27.5%, indicating that the proposed method is more effective, especially for low-homology amino acid sequences. |
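The sequence-feature step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the reduced alphabet, the word length, or the exact definition of the word position feature, so everything below is an assumption made for illustration: the four-group alphabet in `REDUCED_GROUPS`, the default word length `k=2`, and a position feature defined as the mean 1-based start position of each word divided by the number of windows. The function names `reduce_sequence` and `word_features` are hypothetical and do not come from the paper.

```python
from collections import Counter
from itertools import product

# Hypothetical reduced alphabet (an assumption, not the paper's grouping):
# h = hydrophobic, p = polar, c = charged, s = special (Gly/Pro).
REDUCED_GROUPS = {
    "h": "AVLIMFWC",
    "p": "STNQY",
    "c": "DEKRH",
    "s": "GP",
}
AA_TO_GROUP = {aa: g for g, members in REDUCED_GROUPS.items() for aa in members}


def reduce_sequence(seq):
    """Map each standard amino acid to its group symbol; skip unknown residues."""
    return "".join(AA_TO_GROUP[aa] for aa in seq.upper() if aa in AA_TO_GROUP)


def word_features(reduced, k=2):
    """Return (words, frequencies, mean normalized positions) for all k-letter words.

    Frequencies are counts divided by the number of sliding windows.
    The position feature (an assumed definition) is the mean 1-based start
    position of a word divided by the number of windows; absent words get 0.0.
    """
    alphabet = sorted(REDUCED_GROUPS)
    words = ["".join(p) for p in product(alphabet, repeat=k)]
    n_windows = max(len(reduced) - k + 1, 0)

    counts = Counter()
    position_sums = Counter()
    for i in range(n_windows):
        word = reduced[i:i + k]
        counts[word] += 1
        position_sums[word] += i + 1  # 1-based start position

    freqs = [counts[w] / n_windows if n_windows else 0.0 for w in words]
    mean_pos = [
        position_sums[w] / (counts[w] * n_windows) if counts[w] else 0.0
        for w in words
    ]
    return words, freqs, mean_pos
```

In a pipeline of the kind the abstract outlines, the frequency and position vectors would be concatenated with the predicted secondary structural features and passed to a classifier; the keywords name a support vector machine, but the details of that step are not given here.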
| |
Keywords: | protein structural class prediction; word frequency information; word position information; predicted secondary structure; support vector machine |
This article is indexed in PubMed and other databases. |
|