Abstract
We consider the problem of protein folding in the HP model on the 3D square lattice. This problem is combinatorially equivalent to folding a string of 0’s and 1’s so that the string forms a self-avoiding walk on the lattice and the number of adjacent pairs of 1’s is maximized. The previously best-known approximation algorithm for this problem has a guarantee of \(\frac{3}{8}=.375\) [HI95]. In this paper, we first present a new \(\frac{3}{8}\)-approximation algorithm for the 3D folding problem that improves on the absolute approximation guarantee of the previous algorithm. We then show a connection between the 3D folding problem and a basic combinatorial problem on binary strings, which may be of independent interest. Given a binary string in { a,b }*, we want to find a long subsequence of the string in which every sequence of consecutive a’s is followed by at least as many consecutive b’s. We show a non-trivial lower-bound on the existence of such subsequences. Using this result, we obtain an algorithm with a slightly improved approximation ratio of at least .37501 for the 3D folding problem. All of our algorithms run in linear time.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Berger, B., Leighton, T.: Protein Folding in the Hydrophobic-Hydrophilic (HP) Model is NP-Complete. In: Proceedings of the 2nd Conference on Computational Molecular Biology, RECOMB (1998)
Crescenzi, P., Goldman, D., Papadimitriou, C., Piccolboni, A., Yannakakis, M.: On the Complexity of Protein Folding. In: Proceedings of the 2nd Conference on Computational Molecular Biology, RECOMB (1998)
Dill, K.A.: Theory for the Folding and Stability of Globular Proteins. Biochemistry 24, 1501 (1985)
Dill, K.A.: Dominant Forces in Protein Folding. Biochemistry 29, 7133–7155 (1990)
Hart, W.E., Istrail, S.: Fast Protein Folding in the Hydrophobic-hydrophilic Model within Three-eighths of Optimal. In: Proceedings of the 27th ACM Symposium on the Theory of Computing, STOC (1995)
Newman, A.: A New Algorithm for Protein Folding in the HP Model. In: Proceedings of the 13th ACM-SIAM Symposium on Discrete Algorithms, SODA (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Newman, A., Ruhl, M. (2004). Combinatorial Problems on Strings with Applications to Protein Folding. In: Farach-Colton, M. (eds) LATIN 2004: Theoretical Informatics. LATIN 2004. Lecture Notes in Computer Science, vol 2976. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24698-5_41
Download citation
DOI: https://doi.org/10.1007/978-3-540-24698-5_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21258-4
Online ISBN: 978-3-540-24698-5
eBook Packages: Springer Book Archive