Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Solving the secondary structure matching problem in cryo-EM de novo modeling using a constrained K-shortest path graph algorithm

Published: 01 March 2014 Publication History

Abstract

Electron cryomicroscopy is becoming a major experimental technique in solving the structures of large molecular assemblies. More and more three-dimensional images have been obtained at the medium resolutions between 5 and 10Å. At this resolution range, major α-helices can be detected as cylindrical sticks and β-sheets can be detected as plain-like regions. A critical question in de novo modeling from cryo-EM images is to determine the match between the detected secondary structures from the image and those on the protein sequence. We formulate this matching problem into a constrained graph problem and present an O2N22N) algorithm to this NP-Hard problem. The algorithm incorporates the dynamic programming approach into a constrained K-shortest path algorithm. Our method, DP-TOSS, has been tested using α-proteins with maximum 33 helices and α-β proteins up to five helices and 12 β-strands. The correct match was ranked within the top 35 for 19 of the 20 α-proteins and all nine α-β proteins tested. The results demonstrate that DP-TOSS improves accuracy, time and memory space in deriving the topologies of the secondary structure elements for proteins with a large number of secondary structures and a complex skeleton.

References

[1]
S.J. Ludtke, P.R. Baldwin, and W. Chiu, "EMAN: Semi-Automated Software for High Resolution Single Particle Reconstructions," J. Structural Biology, vol. 128, no. 1, pp. 82-97, 1999.
[2]
M.L. Baker, S.S. Abeysinghe, S. Schuh, R.A. Coleman, A. Abrams, M.P. Marsh, C.F. Hryc, T. Ruths, W. Chiu, and T. Ju, "Modeling Protein Structure at Near Atomic Resolutions with Gorgon," J. Structural Biology, vol. 174, no. 2, pp. 360-373, 2011.
[3]
W. Chiu, "Electron Microscopy of Frozen, Hydrated Biological Specimens," Ann. Rev. of Biophysics and Biophysical Chemistry, vol. 15, pp. 237-257, 1986.
[4]
W. Chiu and M.F. Schmid, "Pushing Back the Limits of Electron Cryomicroscopy," Nature Structural Biology, vol. 4, pp. 331-333, 1997.
[5]
Z.H. Zhou, M. Dougherty, J. Jakana, J. He, F.J. Rixon, and W. Chiu, "Seeing the Herpesvirus Capsid at 8.5 A," Science, vol. 288, no. 5467, pp. 877-880, May 2000.
[6]
S.J. Ludtke, D.H. Chen, J.L. Song, D.T. Chuang, and W. Chiu, "Seeing GroEL at 6 A Resolution by Single Particle Electron Cryomicroscopy," Structure, vol. 12, no. 7, pp. 1129-1136, July 2004.
[7]
W. Chiu, M.L. Baker, W. Jiang, and Z.H. Zhou, "Deriving Folds of Macromolecular Complexes through Electron Cryomicroscopy and Bioinformatics Approaches," Current Opinion in Structural Biology, vol. 12, no. 2, pp. 263-269, Apr. 2002.
[8]
Y. Cong, M.L. Baker, J. Jakana, D. Woolford, E.J. Miller, S. Reissmann, R.N. Kumar, A.M. Redding-Johanson, T.S. Batth, A. Mukhopadhyay, S.J. Ludtke, J. Frydman, and W. Chiu, "4.0-Å Resolution Cryo-EM Structure of the Mammalian Chaperonin TRiC/CCT Reveals Its Unique Subunit Arrangement," Proc. Nat'l Academy of Sciences USA, vol. 107, no. 11, pp. 4967-4972, Mar. 2010.
[9]
X. Zhang, L. Jin, Q. Fang, W.H. Hui, and Z.H. Zhou, "3.3 Å Cryo-EM Structure of a Nonenveloped Virus Reveals a Priming Mechanism for Cell Entry," Cell, vol. 141, no. 3, pp. 472-482, Apr. 2010.
[10]
D. Si and J. He, "Beta-Sheet Detection and Representation from Medium Resolution Cryo-EM Density Maps," Proc. ACM Conf. Bioinformatics, Computational Biology and Biomedical Informatics (BCB'13:), Sept. 2013.
[11]
E.F. Pettersen, T.D. Goddard, C.C. Huang, G.S. Couch, D.M. Greenblatt, E.C. Meng, and T.E. Ferrin, "UCSF Chimera--A Visualization System for Exploratory Research and Analysis," J. Computational Chemistry, vol. 25, no. 13, pp. 1605-1612, 2004.
[12]
X. Yu, L. Jin, and Z. H. Zhou, "3.88 A Structure of Cytoplasmic Polyhedrosis Virus by Cryo-Electron Microscopy," Nature, vol. 453, no. 7193, pp. 415-419, May 2008.
[13]
C.L. Lawson, M.L. Baker, C. Best, C. Bi, M. Dougherty, P. Feng, G. van Ginkel, B. Devkota, I. Lagerstedt, S.J. Ludtke, R.H. Newman, T.J. Oldfield, I. Rees, G. Sahni, R. Sala, S. Velankar, J. Warren, J.D. Westbrook, K. Henrick, G.J. Kleywegt, H.M. Berman, and W. Chiu, "EMDataBank.org: Unified Data Resource for CryoEM," Nucleic Acids Research, vol. 39, Suppl 1, pp. D456-464, Jan. 2011.
[14]
W. Jiang, M.L. Baker, S.J. Ludtke, and W. Chiu, "Bridging the Information Gap: Computational Tools for Intermediate Resolution Structure Interpretation," J. Molecular Biology, vol. 308, no. 5, pp. 1033-1044, May 2001.
[15]
A. Del Palu, J. He, E. Pontelli, and Y. Lu, "Identification of Alpha-Helices from Low Resolution Protein Density Maps," Proc. Computational Systems Bioinformatics Conf. (CSB), pp. 89-98, 2006.
[16]
M.L. Baker, T. Ju, and W. Chiu, "Identification of Secondary Structure Elements in Intermediate-Resolution Density Maps," Structure, vol. 15, no. 1, pp. 7-19, Jan. 2007.
[17]
D. Si, S. Ji, K. Al Nasr, and J. He, "A Machine Learning Approach for the Identification of Protein Secondary Structure Elements from CryoEM Density Maps," Biopolymers, vol. 97, pp. 698-708, 2012.
[18]
Y. Kong and J. Ma, "A Structural-Informatics Approach for Mining Beta-Sheets: Locating Sheets in Intermediate-Resolution Density Maps," J. Molecular Biology, vol. 332, no. 2, pp. 399-413, Sept. 2003.
[19]
Y. Kong, X. Zhang, T.S. Baker, and J. Ma, "A Structural-Informatics Approach for Tracing Beta-Sheets: Building Pseudo-C (alpha) Traces for Beta-Strands in Intermediate-Resolution Density Maps," J. Molecular Biology, vol. 339, no. 1, pp. 117-130, May 2004.
[20]
Y. Zeyun and C. Bajaj, "Computational Approaches for Automatic Structural Analysis of Large Biomolecular Complexes," IEEE/ACM Trans. Computational Biology and Bioinformatics, vol. 5, no. 4, pp. 568-582, 2008.
[21]
T. Ju, M.L. Baker, and W. Chiu, "Computing a Family of Skeletons of Volumetric Models for Shape Description," Computer Aided Designs, vol. 39, no. 5, pp. 352-360, May 2007.
[22]
D.T. Jones, "Protein Secondary Structure Prediction Based on Position-Specific Scoring Matrices," J. Molecular Biology, vol. 292, no. 2, pp. 195-202, Sept. 1999.
[23]
J. Cheng, A.Z. Randall, M.J. Sweredoski, and P. Baldi, "SCRATCH: A Protein Structure and Structural Feature Prediction Server," Nucleic Acids Research, vol. 33, no. suppl 2, pp. W72-W76, July 2005.
[24]
G. Pollastri and A. McLysaght, "Porter: A New, Accurate Server for Protein Secondary Structure Prediction," Bioinformatics, vol. 21, no. 8, pp. 1719-1720, Apr. 2005.
[25]
J.J. Ward, L.J. McGuffin, B.F. Buxton, and D.T. Jones, "Secondary Structure Prediction with Support Vector Machines," Bioinformatics, vol. 19, no. 13, pp. 1650-1655, Sept. 2003.
[26]
K. Al Nasr, W. Sun, and J. He, "Structure Prediction for the Helical Skeletons Detected from the Low Resolution Protein Density Map," BMC Bioinformatics, vol. 11, no. 1, article S44, 2010.
[27]
S. Lindert, R. Staritzbichler, N. Wötzel, M. Karakas, P.L. Stewart, and J. Meiler, "EM-Fold: De Novo Folding of Alpha-Helical Proteins Guided by Intermediate-Resolution Electron Microscopy Density Maps," Structure, vol. 17, no. 7, pp. 990-1003, July 2009.
[28]
S. Lindert, N. Alexander, N. Wotzel, M. Karaka, P.L. Stewart, and J. Meiler, "EM-Fold: De Novo Atomic-Detail Protein Structure Determination from Medium-Resolution Density Maps," Structure, vol. 20, pp. 464-478, 2012.
[29]
Y. Lu, J. He, and C.E. Strauss, "Deriving Topology and Sequence Alignment for the Helix Skeleton in Low-Resolution Protein Density Maps," J. Bioinformatics and Computational Biology, vol. 6, no. 1, pp. 183-201, Feb. 2008.
[30]
J. He, Y. Lu, and E. Pontelli, "A Parallel Algorithm for Helix Mapping between 3-D and 1-D Protein Structure Using the Length Constraints," Proc. Second Int'l Conf. Parallel and Distributed Processing and Applications, vol. 3358, pp. 746-756, 2004.
[31]
W. Sun and J. He, "Reduction of the Secondary Structure Topological Space through Direct Estimation of the Contact Energy Formed by the Secondary Structures," BMC Bioinformatics, vol. 10, no. Suppl 1, article S40, 2009.
[32]
Y. Wu, M. Chen, M. Lu, Q. Wang, and J. Ma, "Determining Protein Topology from Skeletons of Secondary Structures," J. Molecular Biology, vol. 350, no. 3, pp. 571-586, July 2005.
[33]
S. Abeysinghe, T. Ju, M.L. Baker, and W. Chiu, "Shape Modeling and Matching in Identifying 3D Protein Structures," Computer-Aided Design, vol. 40, no. 6, pp. 708-720, 2008.
[34]
K. Al Nasr, D. Ranjan, M. Zubair, and J. He, "Ranking Valid Topologies of the Secondary Structure Elements Using a Constraint Graph," J. Bioinformatics and Computational Biology, vol. 9, no. 3, pp. 415-430, June 2011.
[35]
A. McKnight, D. Si, K. Al Nasr, A. Chernikov, N. Chrisochoides, and J. He, "Estimating Loop Length from CryoEM Images at Medium Resolutions," BMC Structural Biology, vol. 13, no. suppl 1, article S5, 2013.
[36]
C. Bron and J. Kerbosch, "Algorithm 457: Finding All Cliques of an Undirected Graph," Comm. ACM, vol. 16, no. 9, pp. 575-577, 1973.
[37]
W. Hoffman and R. Pavley, "A Method for the Solution of the Nth Best Path Problem," J. ACM, vol. 6, no. 4, pp. 506-514, 1959.
[38]
M. Pollack, "The kth Best Route through a Network," Operations Research, vol. 9, no. 4, pp. 578-580, 1961.
[39]
J.Y. Yen, "Finding the K Shortest Loopless Paths in a Network," Management Science, vol. 17, no. 11, pp. 712-716, 1971.
[40]
J. Hershberger, M. Maxel, and S. Suri, "Finding the k Shortest Simple Paths: A New Algorithm and Its Implementation," ACM Trans. Algorithms, vol. 3, no. 4, article 45, 2007.
[41]
S.E. Dreyfus, "An Appraisal of Some Shortest-Path Algorithms," Operations Research, vol. 17, no. 3, pp. 395-412, May/June 1969.
[42]
A.W. Brander and M. Sinclair, "A Comparative Study of k-Shortest Path Algorithms," Proc. 11th UK Performance Eng. Workshop, 1995.
[43]
E.D.Q.V. Martins, M.M.B. Pascoal, and J.L.E. d. Santos, "Deviation Algorithms for Ranking Shortest Paths," Int'l J. Foundation of Computer Science, vol. 10, no. 3, pp. 247-263, 1999.
[44]
K. Al Nasr, "De Novo Structure Modeling from CryoEM Data through a Dynamic Programming Algorithm in the Secondary Structure Topology Graph," PhD thesis Dept. of Computer Science, Old Dominion Univ., 2012.
[45]
S.J. Ludtke, P.R. Baldwin, and W. Chiu, "EMAN: Semi-Automated Software for High Resolution Single Particle Reconstructions," J. Structural Biology, vol. 128, no. 1, pp. 82-97, 1999.
[46]
W. Sun and J. He, "Native Secondary Structure Topology Has Near Minimum Contact Energy Among All Possible Geometrically Constrained Topologies," Proteins: Structure, Function, and Bioinformatics, vol. 77, no. 1, pp. 159-173, Oct. 2009.

Cited By

View all
  • (2021)SSAComputational Biology and Chemistry10.1016/j.compbiolchem.2021.10755294:COnline publication date: 1-Oct-2021
  • (2020)Planning Above the API Clouds Before Flying Above the Clouds: A Real-Time Personalized Air Travel Planning ApproachInternational Journal of Parallel Programming10.1007/s10766-019-00649-848:1(137-156)Online publication date: 1-Feb-2020
  • (2018)Exploratory Studies Detecting Secondary Structures in Medium Resolution 3D Cryo-EM Images Using Deep Convolutional Neural NetworksProceedings of the 2018 ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics10.1145/3233547.3233704(628-632)Online publication date: 15-Aug-2018
  • Show More Cited By

Index Terms

  1. Solving the secondary structure matching problem in cryo-EM de novo modeling using a constrained K-shortest path graph algorithm

            Recommendations

            Comments

            Information & Contributors

            Information

            Published In

            cover image IEEE/ACM Transactions on Computational Biology and Bioinformatics
            IEEE/ACM Transactions on Computational Biology and Bioinformatics  Volume 11, Issue 2
            March/April 2014
            160 pages

            Publisher

            IEEE Computer Society Press

            Washington, DC, United States

            Publication History

            Published: 01 March 2014
            Accepted: 01 January 2014
            Revised: 31 December 2013
            Received: 29 July 2013
            Published in TCBB Volume 11, Issue 2

            Author Tags

            1. algorithm
            2. electron cryomicroscopy
            3. graph
            4. image
            5. protein structure
            6. secondary structure

            Qualifiers

            • Article

            Contributors

            Other Metrics

            Bibliometrics & Citations

            Bibliometrics

            Article Metrics

            • Downloads (Last 12 months)9
            • Downloads (Last 6 weeks)1
            Reflects downloads up to 20 Jan 2025

            Other Metrics

            Citations

            Cited By

            View all
            • (2021)SSAComputational Biology and Chemistry10.1016/j.compbiolchem.2021.10755294:COnline publication date: 1-Oct-2021
            • (2020)Planning Above the API Clouds Before Flying Above the Clouds: A Real-Time Personalized Air Travel Planning ApproachInternational Journal of Parallel Programming10.1007/s10766-019-00649-848:1(137-156)Online publication date: 1-Feb-2020
            • (2018)Exploratory Studies Detecting Secondary Structures in Medium Resolution 3D Cryo-EM Images Using Deep Convolutional Neural NetworksProceedings of the 2018 ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics10.1145/3233547.3233704(628-632)Online publication date: 15-Aug-2018
            • (2017)Analysis of ß-strand Twist from the 3-dimensional Image of a ProteinProceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics10.1145/3107411.3107507(650-654)Online publication date: 20-Aug-2017
            • (2017)Geometry Analysis for Protein Secondary Structures Matching ProblemProceedings of the 8th ACM International Conference on Bioinformatics, Computational Biology,and Health Informatics10.1145/3107411.3107505(716-721)Online publication date: 20-Aug-2017
            • (2017)Efficient top-k shortest path query processing in sparse graph databasesProceedings of the 7th International Conference on Web Intelligence, Mining and Semantics10.1145/3102254.3102264(1-10)Online publication date: 19-Jun-2017
            • (2017)An Effective Computational Method Incorporating Multiple Secondary Structure Predictions in Topology Determination for Cryo-EM ImagesIEEE/ACM Transactions on Computational Biology and Bioinformatics10.1109/TCBB.2016.254372114:3(578-586)Online publication date: 1-May-2017
            • (2014)Construction of protein backbone pieces using segment-based FBCCD and Cryo-EM skeletonProceedings of the 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics10.1145/2649387.2660838(711-716)Online publication date: 20-Sep-2014
            • (2014)Orientations of beta-strand traces and near maximum twistProceedings of the 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics10.1145/2649387.2660835(690-694)Online publication date: 20-Sep-2014

            View Options

            Login options

            Full Access

            View options

            PDF

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader

            Media

            Figures

            Other

            Tables

            Share

            Share

            Share this Publication link

            Share on social media