PreprintArticleVersion 1Preserved in Portico This version is not peer-reviewed
A Customized Human Mitochondrial DNA Database (hMITO DB v1.0) for Rapid Sequence Analysis, Haplotyping and Geo-Mapping to Advance Translational Mitogenomics
Shen-Gunther, J.; Gunther, R.S.; Cai, H.; Wang, Y. A Customized Human Mitochondrial DNA Database (hMITO DB v1.0) for Rapid Sequence Analysis, Haplotyping and Geo-Mapping. Int. J. Mol. Sci.2023, 24, 13505.
Shen-Gunther, J.; Gunther, R.S.; Cai, H.; Wang, Y. A Customized Human Mitochondrial DNA Database (hMITO DB v1.0) for Rapid Sequence Analysis, Haplotyping and Geo-Mapping. Int. J. Mol. Sci. 2023, 24, 13505.
Shen-Gunther, J.; Gunther, R.S.; Cai, H.; Wang, Y. A Customized Human Mitochondrial DNA Database (hMITO DB v1.0) for Rapid Sequence Analysis, Haplotyping and Geo-Mapping. Int. J. Mol. Sci.2023, 24, 13505.
Shen-Gunther, J.; Gunther, R.S.; Cai, H.; Wang, Y. A Customized Human Mitochondrial DNA Database (hMITO DB v1.0) for Rapid Sequence Analysis, Haplotyping and Geo-Mapping. Int. J. Mol. Sci. 2023, 24, 13505.
Abstract
The field of mitochondrial genomics has advanced rapidly and revolutionized disciplines from molecular anthropology, population genetics, to medical/oncogenetics. However, mtDNA next-generation sequencing (NGS) analysis for matrilineal haplotyping and phylogeographic inference remains hindered by the lack of a consolidated, mitogenome database and efficient bioinformatics pipeline. To address this, we developed a customized human mitogenome database (hMITO DB) embedded in a CLC Genomics workflow for read mapping, variant analysis, haplotyping, and geo-mapping. The database was constructed from 4,286 mitogenomes. The macro-haplogroup (A to Z) distribution and representative phylogenetic tree were found consistent with published literature. The hMITO DB automated workflow was tested using mtDNA-NGS sequences derived from Pap smears and cervical cancer cell lines. The auto-generated read mapping, variants track, and table of haplotypes and geo-origins were completed in 15 min for 47 samples. The mtDNA workflow proved to be a rapid, efficient and accurate means of sequence analysis for translational mitogenomics.
Keywords
bioinformatics 1; hypervariable region 2; mitochondrial DNA 3; mitochondrial genomics 4; mitochondrial haplogroup 5; molecular anthropology 6; next generation sequencing 7; oncogenetics 8; phylogeography 9
Subject
Biology and Life Sciences, Biochemistry and Molecular Biology
Copyright:
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.