Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/640075.640119acmconferencesArticle/Chapter ViewAbstractPublication PagesrecombConference Proceedingsconference-collections
Article

Dynamic programming algorithms for haplotype block partitioning: applications to human chromosome 21 haplotype data

Published: 10 April 2003 Publication History

Abstract

Recent studies have shown that the human genome has a haplotype block structure such that it can be divided into discrete blocks of limited haplotype diversity. Patil et al. [6] and Zhang et al. [12] developed algorithms to partition haplotypes into blocks with minimum number of tag SNPs for the entire chromosome. However, it is not clear how to partition haplotypes into blocks with restricted number of SNPs when only limited resources are available. In this paper, we first formulated this problem as finding a block partition with a fixed number of tag SNPs that can cover the maximal percentage of a genome. Then we solved it by two dynamic programming algorithms, which are fairly flexible to take into account the knowledge of functional polymorphism. We applied our algorithms to the published SNP data of human chromosome 21 combining with the functional information of these SNPs and demonstrated the effectiveness of them. Statistical investigation of the relationship between the starting points of a block partition and the coding and non-coding regions illuminated that the SNPs at these starting points are not significantly enriched in coding regions. We also developed an efficient algorithm to find all possible long local maximal haplotypes across a subset of samples. After applying this algorithm to the human chromosome 21 haplotype data, we found that samples with long local haplotypes are not necessarily globally similar.

References

[1]
Daly, M.J., Rioux, J.D., Schaffner, S.F., Hudson, T.J., Lander, E.S. High-resolution haplotype structure in the human genome. Nature Genetics, 29:229--232, 2001.
[2]
Dawson, E., Abecasis, G.R., Bumpstead, S., Chen, Y., Hunt, S., Beare, D.M., Pabial, J., Dibling,T., et al. A first-generation linkage disequilibrium map of chromosome 22. Nature, 418: 544--548, 2002.
[3]
Gabriel, S.B., Schaffner, S.F., Nguyen, H., Moore, J.M., Roy, J., Blumenstiel, B., Higgins, J., DeFelice, M., Lochner, A., Faggart, M., Liu-Cordero, S.N., Rotimi, C., Adeyemo, A., Cooper, R., Ward, R., Lander, E.S., Daly, M.J., Altshulter, D. The structure of haplotype blocks in the human genome. Science 296: 2225--2229, 2002.
[4]
Gusfield D., Balasubramanian K., Naor D. Parametric optimization of sequence alignment. Algorithmica 12: 312--326, 1994.
[5]
Johnson, G.C.L., Esposito, L., Barratt, B.J., Smith, A.N., Heward, J., Genova, G.D., Ueda, H., Cordell, H.J., Eaves, I.A., Dubbridge, F., Twells, R.C.J., Huges, W., Stevens, H., Phillipa, C., Tuomilehto-Wolf, E., Tuomilehto, J., Gough S.C.L., Clayton D.G., Todd, J.A. Haplotype tagging for the identification of common disease genes. Nature Genetics 29: 233--237, 2001.
[6]
Patil, N., Berno, A.J., Hinds, D.A., Barrett, W.A., Doshi, J.M., Hacker C.R., Kautzer, C.R., Lee, D.H., Marjoribanks, C., McDonough,D.P., Nguyen, B.T.N., Norris, M.C., Sheehan, J.B., Shen, N., Stern, D., Stokowski, R.P., Thomas, D.J., Trulson, M.O., Vyas, K.R., Frazer, K.A., Fodor, S.P.A., Cox, D.R. Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21. Science 294: 1719--1723, 2001.
[7]
Saitou, N., Nei, M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Molecular Biology and Evolution, 4: 405--425, 1987.
[8]
Wang, D.G., Fan, J.B., Siao, C.J., Berno, A., Young, P., Saplosky, R., Ghandour, G., et al. Large-scale identification, mapping and genotyping of single-nucleotide polymorphisms in the human genome,. Science, 280: 1077--1082, 1998.
[9]
Waterman, M.S., Eggert, M. and Lander E.L. Parametric sequence comparisons. Proceedings of the National Academy of Sciences of the United States of America 89: 6090--6093, 1992.
[10]
Waterman M.S. Introduction to computational biology: maps, sequences and genomes. Chapman & Hall/CRC Press, Boca Raton FL, 1995.
[11]
Zhang, K., Calabrese, P., Nordborg, M., Sun, F. Haplotype block structure and its applications in association studies: power and study design. The American Journal of Human Genetics, 71: 1836--1894, 2002.
[12]
Zhang, K., Deng, M., Chen, T., Waterman, M.S., Sun, F. A dynamic programming algorithm for haplotype block partitioning. Proceedings of the National Academy of Sciences of the United States of America 99: 7335--7339, 2002.

Cited By

View all
  • (2022)A Tagging SNP Set Method Based on Network Community Partition of Linkage Disequilibrium and Node CentralityCurrent Bioinformatics10.2174/157489361766622032415581317:9(825-834)Online publication date: Nov-2022
  • (2020)Dynamic Programming Applications: A Suvrvey2020 2nd Novel Intelligent and Leading Emerging Sciences Conference (NILES)10.1109/NILES50944.2020.9257968(380-385)Online publication date: 24-Oct-2020
  • (2017)Overview of Big Data in HealthcareHandbook of Research on Data Science for Effective Healthcare Practice and Administration10.4018/978-1-5225-2515-8.ch016(360-384)Online publication date: 2017
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
RECOMB '03: Proceedings of the seventh annual international conference on Research in computational molecular biology
April 2003
352 pages
ISBN:1581136358
DOI:10.1145/640075
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 April 2003

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. clustering
  2. dynamic programming
  3. haplotype block
  4. local maximal haplotypes
  5. tag SNPs

Qualifiers

  • Article

Conference

RECOMB03
Sponsor:

Acceptance Rates

RECOMB '03 Paper Acceptance Rate 35 of 175 submissions, 20%;
Overall Acceptance Rate 148 of 538 submissions, 28%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 10 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2022)A Tagging SNP Set Method Based on Network Community Partition of Linkage Disequilibrium and Node CentralityCurrent Bioinformatics10.2174/157489361766622032415581317:9(825-834)Online publication date: Nov-2022
  • (2020)Dynamic Programming Applications: A Suvrvey2020 2nd Novel Intelligent and Leading Emerging Sciences Conference (NILES)10.1109/NILES50944.2020.9257968(380-385)Online publication date: 24-Oct-2020
  • (2017)Overview of Big Data in HealthcareHandbook of Research on Data Science for Effective Healthcare Practice and Administration10.4018/978-1-5225-2515-8.ch016(360-384)Online publication date: 2017
  • (2014)Applications of the MapReduce programming framework to clinical big data analysis: current landscape and future trendsBioData Mining10.1186/1756-0381-7-227:1Online publication date: 29-Oct-2014
  • (2008)Machine Learning Applications in SNP–Disease Association StudyMachine Learning in Bioinformatics10.1002/9780470397428.ch18(389-412)Online publication date: 21-Apr-2008
  • (2005)Choosing SNPs Using Feature SelectionProceedings of the 2005 IEEE Computational Systems Bioinformatics Conference10.1109/CSB.2005.22(301-309)Online publication date: 8-Aug-2005
  • (2004)Algorithms for Association Study Design Using a Generalized Model of Haplotype ConservationProceedings of the 2004 IEEE Computational Systems Bioinformatics Conference10.5555/1018417.1019245(90-97)Online publication date: 16-Aug-2004
  • (2004)Maximum likelihood resolution of multi-block genotypesProceedings of the eighth annual international conference on Research in computational molecular biology10.1145/974614.974616(2-9)Online publication date: 27-Mar-2004
  • (2004)Optimal Haplotype Block-Free Selection of Tagging SNPs for Genome-Wide Association StudiesGenome Research10.1101/gr.257000414:8(1633-1640)Online publication date: 2-Aug-2004
  • (2004)HapBlock: haplotype block partitioning and tag SNP selection software using a set of dynamic programming algorithmsBioinformatics10.1093/bioinformatics/bth48221:1(131-134)Online publication date: 27-Aug-2004
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media