Computational Haplotype Inference from Pooled Samples

Long, Quan

doi:10.1007/978-1-4939-6750-6_15

Part of the book series: Methods in Molecular Biology ((MIMB,volume 1551))

Abstract

Computationally inferring the identities of haplotypes and their relative frequencies from pooled samples that have been genotyped or sequenced (e.g., using next-generation sequencing), either genome-wide or in segments, is useful for population genetics analysis. To carry out such analysis, one needs to understand the basics of how to use high-performance computing (HPC) facilities and the specifics of the corresponding computational tools. Here, we describe the basic knowledge and step-by-step usage of a number of tools for haplotype inference on genotyping or next-generation sequencing data.

References

Schlotterer C, Tobler R, Kofler R, Nolte V (2014) Sequencing pools of individuals - mining genome-wide polymorphism data without big funding. Nat Rev Genet 15:749–763
Zhang H, Yang HC, Yang Y (2008) PoooL: an efficient method for estimating haplotype frequencies from large DNA pools. Bioinformatics 24:1942–1948
Kuk AY, Zhang H, Yang Y (2009) Computationally feasible estimation of haplotype frequencies from pooled DNA with and without Hardy-Weinberg equilibrium. Bioinformatics 25:379–386
Long Q, Jeffares DC, Zhang Q, Ye K, Nizhynska V et al (2011) PoolHap: inferring haplotype frequencies from pooled samples by next generation sequencing. PLoS One 6:e15292
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J et al (2009) The sequence alignment/map format and SAMtools. Bioinformatics 25:2078–2079
Kessner D, Turner TL, Novembre J (2013) Maximum likelihood estimation of frequencies of known haplotypes from pooled sequence data. Mol Biol Evol 30:1145–1158
Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25:1754–1760
Stephens M, Smith NJ, Donnelly P (2001) A new statistical method for haplotype reconstruction from population data. Am J Hum Genet 68:978–989
Scheet P, Stephens M (2006) A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase. Am J Hum Genet 78:629–644
Pirinen M, Kulathinal S, Gasbarra D, Sillanpaa MJ (2008) Estimating population haplotype frequencies from pooled DNA samples using PHASE algorithm. Genet Res (Camb) 90:509–524
Long Q, MacArthur D, Ning Z, Tyler-Smith C (2009) HI: haplotype improver using paired-end short reads. Bioinformatics 25:2436–2437
Sasaki E, Sugino RP, Innan H (2013) The linkage method: a novel approach for SNP detection and haplotype reconstruction from a single diploid individual using next-generation sequence data. Mol Biol Evol 30:2187–2196
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K et al (2010) The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 20:1297–1303

Acknowledgment

We are grateful for communications with Dr. Yaning Yang on PoooL and with Dr. Darren Kessner on HARP. This work was partially supported by a start-up grant from the University of Calgary and by NIH grants (HG008451 and AG046170).

Author information

Authors and Affiliations

Departments of Biochemistry & Molecular Biology and Medical Genetics, Alberta Children’s Hospital Research Institute and O’Brien Institute for Public Health, University of Calgary, Calgary, AB, Canada, T2N 4N1
Quan Long

Corresponding author

Correspondence to Quan Long.

Editor information

Editors and Affiliations

Johannes Kepler University Institute of Biophysics, Linz, Austria
Irene Tiemann-Boege
Vetmeduni Vienna, Institut für Populationsgenetik, Wien, Austria
Andrea Betancourt

About this protocol

Cite this protocol

Long, Q. (2017). Computational Haplotype Inference from Pooled Samples. In: Tiemann-Boege, I., Betancourt, A. (eds) Haplotyping. Methods in Molecular Biology, vol 1551. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-6750-6_15

DOI: https://doi.org/10.1007/978-1-4939-6750-6_15
Published: 31 January 2017
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-6748-3
Online ISBN: 978-1-4939-6750-6
eBook Packages: Springer Protocols
