Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/640075.640081acmconferencesArticle/Chapter ViewAbstractPublication PagesrecombConference Proceedingsconference-collections
Article

Towards optimally multiplexed applications of universal DNA tag systems

Published: 10 April 2003 Publication History

Abstract

We study a design and optimization problem that occurs, for example, when single nucleotide polymorphisms (SNPs) are to be genotyped using a universal DNA tag array. The problem of optimizing the universal array to avoid disruptive cross-hybridization between universal components of the system was addressed in a previous work. However, cross-hybridization can also occur assay-specifically, due to unwanted complementarity involving assay-specific components. Here we examine the problem of identifying the most economic experimental configuration of the assay-specific components that avoids cross-hybridization. Our formalization translates this problem into the problem of covering the vertices of one side of a bipartite graph by a minimum number of balanced subgraphs of maximum degree 1. We show that the general problem is NP-complete. However, in the real biological setting the vertices that need to be covered have degrees bounded by d. We exploit this restriction and develop an O(d)-approximation algorithm for the problem. We also give an O(d)-approximation for a variant of the problem in which the covering subgraphs are required to be vertex-disjoint. In addition, we propose a stochastic model for the input data and use it to prove a lower bound on the cover size. We complement our theoretical analysis by implementing two heuristic approaches and testing their performance on simulated and real SNP data.

References

[1]
N. Alon and J. H. Spencer. The Probabilistic Method. John Wiley and Sons, Inc., 1992.
[2]
A. Ben-Dor, R. M. Karp, B. Schwikowski, and Z. Yakhini. Universal DNA tag systems: A combinatorial design scheme. Journal of Computational Biology, 7(3):503--519, 2000.
[3]
S. Brenner. Methods for sorting polynucleotides using oligonucleotide tags, US Patent 5,604,097, 1997.
[4]
K. Cameron. Induced matchings. Discrete Applied Mathematics, 24:97--102, 1989.
[5]
M. Cargill, D. Altshuler, J. Ireland, P. Sklar, K. Ardlie, et al. Characterization of single-nucleotide polymorphisms in coding regions of human genes. Nature Genetics, 22(3):231--8, 1999.
[6]
T. H. Cormen, C. E. Leiserson, and R. L. Rivest. Introduction to Algorithms. MIT Press, Cambridge, Mass., 1990.
[7]
R. Drmanac, G. Lennon, S. Drmanac, I. Labat, R. Crkvenjakov, and H. Lehrach. Partial sequencing by oligohybridization: Concept and applications in genome analysis. In Proceedings of the first international conference on electrophoresis supercomputing and the human genome, pages 60--75. World Scientific, 1991.
[8]
M.R. Garey and D.S. Johnson. Computers and Intractability: A Guide to the Theory of NP-Completeness. W.H. Freeman and Co., San Francisco, 1979.
[9]
N.P. Gerry, N.E. Witowski, J. Day, R.P. Hammer, G. Barany, et~al. Universal DNA microarray method for multiplex detection of low abundance point mutations. J. Mol. Biol., 292(2):251--62, 1999.
[10]
D.M. Grant and M.S. Phillips. Technologies for the Analysis of Single-Nucleotide Polymorphisms: An Overview. Marcel Dekker, Inc., New York, 2001.
[11]
J.G. Hacia. Resequencing and mutational analysis using oligonucleotide micro arrays. Nature Genetics, 21(1):42--47, January 1999.
[12]
V. N. Kristensen, N. Harada, N. Yoshimura, E. Haraldsen, P. E. Lonning, et al. Genetic variants of cyp19 (aromatase) and breast cancer risk. Oncogene, 19(10):1329--33, March 2000.
[13]
R. W. Davis M. S. Morris, D. D. Shoemaker and M. P. Mittmann. Methods and compositions for selecting tag nucleic acids and probe arrays, European Patent Application 97,302,313, 1997.
[14]
D. Matula and L. Beck. Smallest-last ordering and clustering and graph coloring algorithms. Journal of the ACM, 30:417--427, 1983.
[15]
N. J. Risch. Searching for genetic determinants in the new millennium. Nature, 405(6788):847--56, 2000.
[16]
J. P. Schouten, C. J. McElgunn, R. Waaijer, D. Zwijnenburg, F. Diepvens, and G. Pals. Relative quantification of 40 nucleic acid sequences by multiplex ligation-dependent probe amplification. Nucleic Acids Research, 30(12), June 2002.
[17]
T. Strachen and A.P. Read. Human Molecular Genetics. Bios scientific publishers, 1996.
[18]
A. C. Syvanen. From gels to chips: "minisequencing" primer extension for analysis of point mutations and single nucleotide polymorphisms. Hum. Mutat., 13(1):1--10, 1999.
[19]
S. Venitt. Mechanisms of carcinogenesis and individual susceptibility to cancer. Clin. Chem., 40(7.2):1421--5, July 1994.
[20]
S. Venitt. Mechanisms of spontaneous human cancers. Environ. Health Perspect., 104(3):633--7, May 1996.
[21]
D. G. Wang, J. B. Fan, C. J. Siao, A. Berno, P. P. Young, et~al. Large-scale identification, mapping, and genotyping of single nucleotide polymorphisms in the human genome. Science, 280(5366):1077--82, 1998.
[22]
Y. Watanabe, A. Fujiyama, Y. Ichiba, M. Hattori, T. Yada, Y. Sakaki, and T. Ikemura. Chromosome-wide assessment of replication timing for human chromosomes 11q and 21q: disease-related genes in timing-switch regions. Human Molecular Genetics, 11(1):13--21, January 2002.

Cited By

View all
  • (2007)High-Throughput SNP Genotyping by SBE/SBHIEEE Transactions on Nanobioscience10.1109/TNB.2007.8918986:1(28-35)Online publication date: Mar-2007
  • (2005)Improved tag set design and multiplexing algorithms for universal arraysTransactions on Computational Systems Biology II10.5555/2167737.2167746(124-137)Online publication date: 1-Jan-2005
  • (2005)Multiplexing Schemes for Generic SNP Genotyping AssaysJournal of Computational Biology10.1089/cmb.2005.12.51412:5(514-533)Online publication date: Jun-2005
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
RECOMB '03: Proceedings of the seventh annual international conference on Research in computational molecular biology
April 2003
352 pages
ISBN:1581136358
DOI:10.1145/640075
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 April 2003

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. SNP genotyping
  2. cross-hybridization
  3. minimum primer cover
  4. stochastic model
  5. universal array

Qualifiers

  • Article

Conference

RECOMB03
Sponsor:

Acceptance Rates

RECOMB '03 Paper Acceptance Rate 35 of 175 submissions, 20%;
Overall Acceptance Rate 148 of 538 submissions, 28%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2007)High-Throughput SNP Genotyping by SBE/SBHIEEE Transactions on Nanobioscience10.1109/TNB.2007.8918986:1(28-35)Online publication date: Mar-2007
  • (2005)Improved tag set design and multiplexing algorithms for universal arraysTransactions on Computational Systems Biology II10.5555/2167737.2167746(124-137)Online publication date: 1-Jan-2005
  • (2005)Multiplexing Schemes for Generic SNP Genotyping AssaysJournal of Computational Biology10.1089/cmb.2005.12.51412:5(514-533)Online publication date: Jun-2005
  • (2005)Improved Tag Set Design and Multiplexing Algorithms for Universal ArraysTransactions on Computational Systems Biology II10.1007/11567752_9(124-137)Online publication date: 2005
  • (2005)Exact and approximation algorithms for DNA tag set designProceedings of the 16th annual conference on Combinatorial Pattern Matching10.1007/11496656_33(383-393)Online publication date: 19-Jun-2005
  • (2005)Improved tag set design and multiplexing algorithms for universal arraysProceedings of the 5th international conference on Computational Science - Volume Part II10.1007/11428848_126(994-1002)Online publication date: 22-May-2005

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media