Abstract
More and more proteins have been observed to display functions through intrinsic disorder. Such structurally flexible regions are shown to play important roles in biological processes and are estimated to be abundant in eukaryotic proteomes. Previous studies largely use evolutionary information and combinations of physicochemical properties of amino acids to detect disordered regions from primary sequences. In our recent work DisPSSMP, it is demonstrated that the accuracy of protein disorder prediction is greatly improved if the disorder propensity of amino acids is considered when generating the condensed PSSM features. This work aims to investigate how the information of secondary structure can be incorporated in DisPSSMP to enhance the predicting power. We propose a new representation of secondary structure information and compare it with three naïve representations that have been discussed or employed in some related works. The experimental results reveal that the refined information from secondary structure prediction is of benefit to this problem.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Dunker, A.K., Obradovic, Z., Romero, P., Kissinger, C., Villafrance, E.: On the importance of being disordered. PDB Newsletter 81, 3–5 (1997)
Wright, P.E., Dyson, H.J.: Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm. J. Mol. Biol. 293(2), 321–331 (1999)
Ward, J.J., Sodhi, J.S., McGuffin, L.J., Buxton, B.F., Jones, D.T.: Prediction and functional analysis of native disorder in proteins from the three kingdoms of life. J. Mol. Biol. 337, 635–645 (2004)
Fink, A.L.: Natively unfolded proteins. Current Opinion in Structural Biology 15, 35–41 (2005)
Dunker, A.K., Garner, E., Guilliot, S., Romero, P., Albercht, K., Hart, J., Obradovic, Z., Kissinger, C., Villafranca, J.E.: Protein disorder and the evolution of molecular recognition: theory, predictions and observations. Pac. Symp. Biocomput. 3, 473–484 (1998)
Jones, D.T., Ward, J.J.: Prediction of disordered regions in proteins from position specific scoring matrices. Proteins 53, 573–578 (2003)
Romero, P., Obradovic, Z., Kissinger, C., Villafranca, J.E., Dunker, A.K.: Identifying disordered regions in proteins from amino acid sequence. In: Proc. IEEE Int. Conf. Neural Networks, vol. 1, pp. 90–95. IEEE, Los Alamitos (1997)
Romero, P., Obradovic, Z., Kissinger, C., Villafranca, J.E., Garner, E., Guilliiot, S., Dunker, A.K.: Thousands of proteins likely to have long disordered regions. Pac. Symp. Biocomput. 3, 437–448 (1998)
Obradovic, Z., Peng, K., Vucetic, S., Radivojac, P., Brown, C.J., Dunker, A.K.: Predicting intrinsic disorder from amino acid sequence. Proteins 53, 566–572 (2003)
Wotton, J.C., Federhen, S.: Statistics of local complexity in amino acid sequences and sequence databases. Comput. Chem. 17, 149–163 (1993)
Brown, C.J., Takayama, S., Campen, A.M., Vise, P., Marshall, T.W., Oldfield, C.J., Williams, C.J., Dunker, A.K.: Evolutionary rate heterogeneity in proteins with long disordered regions. J. Mol. Evol. 55, 104–110 (2002)
Vihinen, M., Torkkila, E., Riikonen, P.: Accuracy of protein flexibility predictions. Proteins 19, 141–149 (1994)
Su, C.T., Chen, C.Y., Ou, Y.Y.: Protein disorder prediction by condensed PSSM considering propensity for order or disorder. BMC Bioinformatics 7, 319 (2006)
Liu, J., Rost, B.: NORSp: predictions of long regions without regular secondary structure. Nucl. Acids Res. 31(13), 3833–3835 (2003)
Liu, J., Tan, H., Rost, B.: Loopy proteins appear conserved in evolution. J. Mol. Biol. 322, 53–64 (2002)
Linding, R., Russell, R.B., Neduva, V., Gibson, T.J.: GlobPlot: exploring protein sequences for globularity and disorder. Nucl. Acids Res. 31, 3701–3708 (2003)
Cheng, J., Sweredoski, M.J., Baldi, P.: Accurate prediction of protein disordered regions by mining protein structure data. Data Mining and Knowledge Discovery 11, 213–222 (2005)
Obradovic, Z., Peng, K., Vucetic, S., Radivojac, P., Dunker, A.K.: Exploiting heterogeneous sequence properties improves prediction of protein disorder. Proteins Suppl. 7, 176–182 (2005)
Peng, K., Radivojac, P., Vucetic, S., Dunker, A.K., Obradovic, Z.: Length-dependent prediction of protein intrinsic disorder. BMC Bioinformatics 7, 208 (2006)
Yang, Z.R., Thomson, R., McNeil, P., Esnouf, R.M.: RONN: the bio-basis function neural network technique applied to the detection of natively disordered regions in proteins. Bioinformatics Advance Access Published June 9, 2005 (2005)
Uversky, V.N., Gillespie, J.R., Fink, A.L.: Why are “natively unfolded” proteins unstructured under physiologic conditions? Proteins 41, 415–427 (2000)
Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., Bourne, P.E.: The Protein Data Bank. Nucl. Acids Res. 28, 235–242 (2000)
Vucetic, S., Obradovic, Z., Vacic, V., Radivojac, P., Peng, K., Lakoucheva, L.M., Cortese, M.S., Lawson, J.D., Brown, C.J., Sikes, J.G., Newton, C.D., Dunker, A.K.: DisProt: a database of protein disorder. Bioinformatics 21, 137–140 (2005)
Li, W., Jaroszewski, L., Godzik, A.: Tolerating some redundancy significantly speeds up clustering of large proteins databases. Bioinformatics 18, 77–82 (2002)
Cuff, J.A., Barton, G.J.: Application of enhanced multiple sequence alignment profiles to improve protein secondary structure prediction. Proteins 40, 502–511 (2000)
Ou, Y.Y., Chen, C.Y., Oyang, Y.J.: A Novel Radial Basis Function Network Classifier with Centers Set by Hierarchical Clustering. In: IJCNN ’05, Proceedings, vol. 3, pp. 1383–1388 (2005)
Melamud, E., Moult, J.: Evaluation of disorder predictions in CASP5. Proteins 53, 561–565 (2003)
Jin, Y., Dunbrack, R.L.: Assessment of disorder predictions in CASP6. Proteins, Early View (2005)
Ward, J.J., McGuffin, L.J., Bryson, K., Buxton, B.F., Jones, D.T.: The DISOPRED server for the prediction of protein disorder. Bioinformatics 20, 2138–2139 (2004)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Su, CT., Hsu, TM., Chen, CY., Ou, YY., Oyang, YJ. (2007). Enhancing Protein Disorder Detection by Refined Secondary Structure Prediction. In: Hochreiter, S., Wagner, R. (eds) Bioinformatics Research and Development. BIRD 2007. Lecture Notes in Computer Science(), vol 4414. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71233-6_31
Download citation
DOI: https://doi.org/10.1007/978-3-540-71233-6_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71232-9
Online ISBN: 978-3-540-71233-6
eBook Packages: Computer ScienceComputer Science (R0)