Abstract
The secondary structure of RNA is closely related to its role in biological function, and it is difficult to predict the secondary structure of RNA sequence containing pseudoknots due to its complex structure. In this paper, a DT (Decision Tree) based RNA secondary structure prediction model is proposed, and a training algorithm is constructed. By adjusting the size of the window, the secondary structure prediction of RNA sequences containing pseudoknots is realized. The comparison experiments are carried out on the authoritative dataset RNA STRAND with classical classification algorithms such as LR (Logistic Regression), RF (Random forests) and SVM (Support Vector Machine). The experimental results show that compared with SVM, LR and RF, DT has more advantages in classification accuracy and robustness, and its accuracy rate has increased by 8.68%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Rivas, E., Eddy, S.R.: A dynamic programming algorithm for RNA structure prediction including pseudoknots. J. Mol. Biol. 285(5), 1–2068 (1999)
Rodland, A.E.: Pseudoknots in RNA secondary structures: representation, enumeration, and prevalence. J. Comput. Biol. 13(6), 1197–1213 (2006)
Reeder, J., Giegerich, R.: Design, implementation and evaluation of a practical pseudoknot folding algorithm based on thermodynamics. BMC Bioinformatics 5(1), 104 (2004)
Ieong, S., Kao, M.Y., Lam, T.W., et al.: Predicting RNA secondary structures with arbitrary pseudoknots by maximizing the number of stacking Pairs. J. Comput. Biol. 10(6), 981–995 (2003)
Ren, J., Rastegari, B., Hoos, H.H.: HotKnots: heuristic prediction of RNA secondary structures including pseudoknots. Rna-a Publ. Rna Soc. 11(10), 1494–1504 (2005)
Yu, N., Zhao, W., et al.: Evaluation of RNA secondary structure prediction for both base-pairing and topology. Biophysics 4(3), 123–132 (2018). English edition
Liu, Y., Zhao, Q., Zhang, H., et al.: A New method to predict RNA secondary structure based on RNA folding simulation. IEEE/ACM Trans. Comput. Biol. Bioinf. 13(5), 990–995 (2016)
Madera, M., Calmus, R., Thiltgen, G., et al.: Improving protein secondary structure prediction using a simple k-mer model. Bioinformatics 26(5), 596–602 (2010)
Yonemoto, H., Asai, K., Hamada, M.: A semi-supervised learning approach for RNA secondary structure prediction. Comput. Biol. Chem. 57, 72–79 (2015)
Liu, Z., Zhu, D.: New algorithm for predicting RNA secondary structure with pseudoknots. Mater. Sci. Inform. Technol. II 2, 1796–1799 (2012)
Sprinzl, M., Horn, C., Brown, M., et al.: Compilation of tRNA sequences and sequences of tRNA genes. Nucleic Acids Res. 26(1), 148–153 (1998)
Zhang, X., Deng, Z., Song, D.: Neural network approach to predict RNA secondary structures. J. Tsinghua Univ. 46(10), 1793–1796 (2006)
Yang, R., Wu, H., Fu, Q., et al.: Optimizing HP model using reinforcement learning (2018)
Chen, C., Wu, H., Bian, K.: β-barrel transmembrane protein predicting using support vector machine. In: Huang, D.-S., Hussain, A., Han, K., Gromiha, M.M. (eds.) ICIC 2017. LNCS (LNAI), vol. 10363, pp. 360–368. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-63315-2_31
Wu, H., Li, H., Jiang, M., et al.: Identify high-quality protein structural models by enhanced K-Means. Biomed. Res. Int. 2017(18), 1–9 (2017)
Acknowledgements
This paper is supported by the National Natural Science Foundation of China (61772357, 61502329, 61672371, and 61876217), Jiangsu Province 333 Talent Project, Top Talent Project (DZXX-010), Suzhou Foresight Research Project (SYG201704, SNG201610, and SZS201609).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Lu, W., Cao, Y., Wu, H., Huang, H., Ding, Y. (2019). Research on RNA Secondary Structure Prediction Based on Decision Tree. In: Huang, DS., Jo, KH., Huang, ZK. (eds) Intelligent Computing Theories and Application. ICIC 2019. Lecture Notes in Computer Science(), vol 11644. Springer, Cham. https://doi.org/10.1007/978-3-030-26969-2_41
Download citation
DOI: https://doi.org/10.1007/978-3-030-26969-2_41
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-26968-5
Online ISBN: 978-3-030-26969-2
eBook Packages: Computer ScienceComputer Science (R0)