Abstract
In the reverse complement (RC) equivalence model, it is not possible to distinguish between a string and its reverse complement. We show that one can still reconstruct a binary string of length n, up to reverse complement, using a linear number of subsequence queries of bounded length. A simple information theoretic lower bound proves the number of queries to be tight. Our result is also optimal w.r.t. the bound on the query length given in [Erdős et al., Ann. of Comb. 2006].
This research was carried out in part while F.C. and Zs.L. were visiting with the Alfréd Rényi Institute in Budapest, with support from the Hungarian Bioinformatics MTKD-CT-2006-042794, Marie Curie Host Fellowships for Transfer of Knowledge.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Carpi, A., de Luca, A.: Words and special factors. Theor. Comput. Sci. 259(1-2), 145–182 (2001)
de Luca, A.: On the combinatorics of finite words. Theor. Comput. Sci. 218(1), 13–39 (1999)
Dudík, M., Schulman, L.J.: Reconstruction from subsequences. J. Comb. Theory, Ser. A 103(2), 337–348 (2003)
Erdős, P.L., Ligeti, P., Sziklai, P., Torney, D.C.: Subwords in reverse-complement order. Annals of Combinatorics 10, 415–430 (2006)
Fici, G., Mignosi, F., Restivo, A., Sciortino, M.: Word assembly through minimal forbidden words. Theor. Comput. Sci. 359(1-3), 214–230 (2006)
Krasikov, I., Roditty, Y.: On a reconstruction problem for sequences. J. Comb. Theory, Ser. A 77(2), 344–348 (1997)
Levenshtein, V.I.: Efficient reconstruction of sequences. IEEE Transactions on Information Theory 47(1), 2–22 (2001)
Pevzner, P.: l-tuple DNA sequencing: Computer analysis. Journal of Biomolecular Structure and Dynamics 7, 63–73 (1989)
Piña, C., Uzcátegui, C.: Reconstruction of a word from a multiset of its factors. Theor. Comput. Sci. 400(1-3), 70–83 (2008)
Preparata, F.P.: Sequencing-by-hybridization revisited: The analog-spectrum proposal. IEEE/ACM Trans. Comput. Biology Bioinform. 1(1), 46–52 (2004)
Schützenberger, M.-P., Simon, I.: Combinatorics on Words, by M. Lothaire. Subwords. ch. 6. Cambridge University Press, Cambridge (1983)
Simon, I.: Piecewise testable events. In: Brakhage, H. (ed.) Automata Theory and Formal Languages. LNCS, vol. 33, pp. 214–222. Springer, Heidelberg (1975)
Skiena, S., Sundaram, G.: Reconstructing strings from substrings. Journal of Computational Biology 2(2), 333–353 (1995)
Tsur, D.: Tight bounds for string reconstruction using substring queries. In: Chekuri, C., Jansen, K., Rolim, J.D.P., Trevisan, L. (eds.) APPROX 2005 and RANDOM 2005. LNCS, vol. 3624, pp. 448–459. Springer, Heidelberg (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cicalese, F., Erdős, P.L., Lipták, Z. (2011). Efficient Reconstruction of RC-Equivalent Strings. In: Iliopoulos, C.S., Smyth, W.F. (eds) Combinatorial Algorithms. IWOCA 2010. Lecture Notes in Computer Science, vol 6460. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19222-7_35
Download citation
DOI: https://doi.org/10.1007/978-3-642-19222-7_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19221-0
Online ISBN: 978-3-642-19222-7
eBook Packages: Computer ScienceComputer Science (R0)