Abstract
A new method for building a gene tree from Single Nucleotide Polymorphism (SNP) data was developed by Chen and Lindsay (Biometrika 93(4):843–860, 2006). Called the mixture tree, it was based on an ancestral mixture model. The sieve parameter in the model plays the role of time in the evolutionary tree of the sequences. By varying the sieve parameter, one can create a hierarchical tree that estimates the population structure at each fixed backward point in time. In this chapter, we will review the model and then present an application to the clustering of the mitochondrial sequences to show that the approach performs well. A simulator that simulates real SNPs sequences with unknown ancestral history will be introduced. Using the simulator we will compare the mixture trees with true trees to evaluate how well the mixture tree method performs. Comparison with some existing methods including neighbor-joining method and maximum parsimony method will also be presented in this chapter.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Chen, S. C., & Lindsay, B. (2006). Building mixture trees from binary sequence data. Biometrika, 93(4), 843–860.
Czelsniak, J., Goodman, M., Moncrief, N. D., & Kehoe, S. M. (1990). Maximum parsimony approach to construction of evolutionary trees from aligned homologous sequences. Methods in Enzymology, 183, 601–615.
Edwards, A. W. F., & Cavalli-Sforza, L. L. (1963). The reconstruction of evolution. Annals of Human Genetics, 27, 105–106. (also published in Heredity 18:553)
Edwards, A. W. F., & Cavalli-Sforza, L. L. (1964). Reconstruction of evolutionary trees. In V. H. Heywood & J. McNeill (Ed.), Phenetic and phylogenetic classification (Vol. 6, pp. 67–76). London: Systematics Association Publ.
Felsenstein, J. (1981). Evolutionary trees from DNA sequences: A maximum Liklihood Approach. Journal of Molecular Evolution, 17, 368–376.
Fisher, R. A. (1912). On an absolute criterion for fitting frequency curves. Messenger of Mathematics, 41, 155–160.
Fisher, R. A. (1921). On the “probable error” of a coefficient of correlation deduced from a small sample. Metron, 1, 3–32.
Fisher, R. A. (1922). On the mathematical foundations of theoretical statistics. Philosophical transactions of the Royal Society of London A, 222, 309–368.
Hudson, R. R. (2002). Generating samples under a Wright-Fisher neutral model of genetic variation. Bioinformatics, 18, 337–338.
Huelsenbeck, J. P., & Ronquist, F. (2005). Bayesian analysis of molecular evolution using MrBayes. In Nielsen, R. (Ed.), Statistical methods in molecular evolution. New York: Springer.
Lindsay, B., Markatou, M., Ray, S., Kang, K., & Chen, S. C. (2008). Quadratic distances on probabilities: A unified foundation. The Annals of Statistics, 36(2), 983–1006.
Nei, M., & Kumar, S. (2000). Molecular evolution and phylogenetics. New York: Oxford University Press.
Nei, M., Kumar, S., & Takahashi, K. (1998). The optimization principle in phylogenetic analysis tends to give incorrect topologies when the number of nucleotides or amino acids used is small. Proceedings of the National Academy of Sciences of the United States of America, 95, 12390–12397.
Penny, D., & Hendy, M. D. (1985). The use of tree comparison metrics. Systematic Zoology, 34, 75–82.
Robinson, D. F., & Foulds, L. R. (1981). Comparison of phylogenetic trees. Mathematical Biosciences, 53, 131–147.
Rzhestky, A., & Nei, M. (1992). A simple method for estimating and testing minimum-evolution trees. Molecular Biology and Evolution, 9, 945–967.
Saitou, N., & Nei, M. (1987). The neighbor-joining method: A new method for reconstructing phylogenetic trees. Molecular and Biological Evolution, 4, 406–425.
Takahashi, K., & Nei, M. (2000). Efficiencies of fast algorithms of phylogenetic inference under the criteria of maximum parsimony, minimum evolution, and maximum likelihood when a large number of sequences are used. Molecular Biology and Evolution, 17, 1251–1258.
Tamura, K., Dudley, J., Nei, M., & Kumar, S. (2007). MEGA4: Molecular evolutionary genetics analysis (MEGA) software version 4.0. Molecular Biology and Evolution, 24, 1596–1599.
Wen, B., Li, H., Gao, S., et al. (2004). Genetic structure of Hmong-Mien speaking populations in east Asia as revealed by mtDNA lineages. Molecular and Biological Evolution, 22(3), 725–734.
Acknowledgements
We thank National Science Foundation for the funding – Division of Mathematical Sciences 0714949 to Shu-Chuan Chen and Michael Rosenberg, and National Science Foundation – Division of Mathematical Sciences 0714839 to Bruce Lindsay. In addition, we also acknowledge the support of National Center for Theoretical Sciences (South), Taiwan.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Chen, G.S.C., Li, M., Rosenberg, M., Lindsay, B. (2011). Mixture Tree Construction and Its Applications. In: Lu, HS., Schölkopf, B., Zhao, H. (eds) Handbook of Statistical Bioinformatics. Springer Handbooks of Computational Statistics. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16345-6_7
Download citation
DOI: https://doi.org/10.1007/978-3-642-16345-6_7
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16344-9
Online ISBN: 978-3-642-16345-6
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)