Mitochondrial Haplogroup Assignment for High-Throughput Sequencing Data from Single Individual and Mixed DNA Samples

The inference of mitochondrial haplogroups is an important step in forensic analysis of DNA samples collected at a crime scene. In this paper we introduced efficient inference algorithms based on Jaccard similarity between variants called from high-throughput sequencing data of such DNA samples and mutations collected in public databases such as PhyloTree. Experimental results on real and simulated datasets show that our mutation analysis methods have accuracy comparable to that of state-of-the-art methods based on haplogroup frequency estimation for both single-individual samples and two-individual mixtures, with a much lower running time.

[1]  R. L. Ewen,et al.  Thresholds for identifying pathological intracranial pressure in paediatric traumatic brain injury , 2019, Scientific Reports.

[2]  Ion I. Mandoiu,et al.  Estimation of alternative splicing isoform frequencies from RNA-Seq data , 2010, Algorithms for Molecular Biology.

[3]  Shamsudheen Karuthedath Vellarikkal,et al.  mit‐o‐matic: A Comprehensive Computational Pipeline for Clinical Evaluation of Mitochondrial Variations from Next‐Generation Sequencing Datasets , 2015, Human mutation.

[4]  Gastone Castellani,et al.  HAPLOFIND: A New Method for High‐Throughput mtDNA Haplogroup Assignment , 2013, Human mutation.

[5]  N Howell,et al.  Clinical mitochondrial genetics , 1999, Journal of medical genetics.

[6]  Y. Chien,et al.  Biparental Inheritance of Mitochondrial DNA in Humans , 2018, Proceedings of the National Academy of Sciences.

[7]  C. Vullo,et al.  The contributions of anthropology and mitochondrial DNA analysis to the identification of the human skeletal remains of the Australian outlaw Edward 'Ned' Kelly. , 2014, Forensic science international.

[8]  Roberto J. Bayardo,et al.  Scaling up all pairs similarity search , 2007, WWW '07.

[9]  Guido Davidzon,et al.  Mitochondrial DNA and disease , 2005, Annals of medicine.

[10]  D. Johns Seminars in medicine of the Beth Israel Hospital, Boston. Mitochondrial DNA and disease. , 1995, The New England journal of medicine.

[11]  Jordan M. Eizenga,et al.  A phylogenetic approach for haplotype analysis of sequence data from complex mitochondrial mixtures. , 2017, Forensic science international. Genetics.

[12]  D. Wallace,et al.  Mitochondrial DNA genetics and the heteroplasmy conundrum in evolution and disease. , 2013, Cold Spring Harbor perspectives in biology.

[13]  R. Kaas,et al.  Worldwide human mitochondrial haplogroup distribution from urban sewage , 2019, Scientific Reports.

[14]  M. Holland,et al.  Forensic Mitochondrial DNA Analysis: Current Practice and Future Potential. , 2012, Forensic science review.

[15]  Sung-Bae Cho,et al.  mtDNAmanager: a Web-based tool for the management and quality analysis of mitochondrial DNA control-region sequences , 2008, BMC Bioinformatics.

[16]  Yong-Gang Yao,et al.  MitoTool: a web server for the analysis and retrieval of human mitochondrial DNA sequence variations. , 2011, Mitochondrion.

[17]  B. Cong,et al.  Current developments in forensic interpretation of mixed DNA samples (Review) , 2014, Biomedical reports.

[18]  Igor Mandric,et al.  Fast bootstrapping‐based estimation of confidence intervals of expression levels and differential expression from RNA‐Seq data , 2017, Bioinform..

[19]  Günther Specht,et al.  HaploGrep: a fast and reliable algorithm for automatic classification of mitochondrial DNA haplogroups , 2011, Human mutation.

[20]  Günther Specht,et al.  mtDNA-Server: next-generation sequencing data analysis of human mitochondrial DNA in the cloud , 2016, Nucleic Acids Res..

[21]  L. Bachmann,et al.  Reconstructing mitochondrial genomes directly from genomic next-generation sequencing reads—a baiting and iterative mapping approach , 2013, Nucleic acids research.

[22]  Mannis van Oven,et al.  PhyloTree Build 17: Growing the human mitochondrial DNA tree , 2015 .

[23]  Dana C Crawford,et al.  Hi-MC: a novel method for high-throughput mitochondrial haplogroup classification , 2018, PeerJ.

[24]  Manfred Kayser,et al.  Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation , 2009, Human mutation.

[25]  Hans-Jürgen Bandelt,et al.  HaploGrep 2: mitochondrial haplogroup classification in the era of high-throughput sequencing , 2016, Nucleic Acids Res..

[26]  Ernesto Picardi,et al.  MToolBox: a highly automated pipeline for heteroplasmy annotation and prioritization analysis of human mitochondrial variants in high-throughput sequencing , 2014, Bioinform..

[27]  I. Măndoiu,et al.  Towards accurate detection and genotyping of expressed variants from whole transcriptome sequencing data , 2011, BMC Genomics.

[28]  Din J. Wasem,et al.  Mining of Massive Datasets , 2014 .

[29]  António Amorim,et al.  Mitochondrial DNA in human identification: a review , 2019, PeerJ.

[30]  Mark R. Wilson,et al.  Forensics and mitochondrial DNA: applications, debates, and foundations. , 2003, Annual review of genomics and human genetics.

[31]  T. Kivisild,et al.  Maternal ancestry and population history from whole mitochondrial genomes , 2015, Investigative Genetics.

[32]  Koji Ishiya,et al.  MitoSuite: a graphical tool for human mitochondrial genome profiling in massive parallel sequencing , 2017, PeerJ.