[Submitted on 20 Jun 2016 (v1), last revised 13 Dec 2016 (this version, v2)]
Title: Knowledge-based machine learning methods for macromolecular 3D structure prediction
Abstract: Predicting the 3D structure of a macromolecule, such as a protein or an RNA molecule, ranks among the most difficult and attractive problems in bioinformatics and computational biology. Its importance comes from the relationship between the 3D structure and the function of a given protein or RNA. 3D structures also help identify the ligands of a protein, which are usually small molecules, a key step in drug design. Unfortunately, there is no shortcut to accurately obtaining the 3D structure of a macromolecule. Many physical measurement techniques for macromolecular 3D structures do not scale up, owing to their high labor costs and demanding laboratory requirements.
In recent years, computational methods have made huge progress thanks to advances in computing speed and machine learning. These methods need only the sequence information to predict 3D structures, employing various mathematical models and machine learning methods. Their success depends heavily on a large database of proteins and RNAs with known structures.
However, the performance of computational methods still leaves room for improvement, for two reasons. First, we face, and will continue to face, sparseness of data. Second, the 3D structure space is too large for our computational capability.
These two obstacles can be overcome by knowledge-based methods, which combine knowledge learned from known structures with biologists' knowledge of the folding process of proteins and RNA. In this dissertation, I present my results in building a knowledge-based method that uses machine learning to tackle this problem. My methods include knowledge constraints on intermediate states, which can greatly reduce the solution space of a protein or RNA, in turn increasing the efficiency of the structure folding method and improving its accuracy.
Submission history
From: Zhiyong Wang
[v1] Mon, 20 Jun 2016 13:56:19 UTC (2,604 KB)
[v2] Tue, 13 Dec 2016 16:47:54 UTC (3,609 KB)