Big data in forensic genetics.

The potential and difficulties of the application of genome wide data in forensics are analyzed. We argue that, besides statistical, computational, ethical, economic and technical validation problems, the state of the art of population genetics theory is insufficient to deal with the forensic use of this type of data. In order to keep the current standards of quantifying and reporting genetic evidence, namely in kinship analyses and identification, substantial improvement in the theoretical framework should be reached, since to obtain genome-wide results is to provide the experts with data that they cannot quantify the corresponding evidentiary value. Therefore, while a satisfactory, generalized theoretical and biostatistical modelling is not achieved, it may well be wiser to improve the already established approaches to a limited, pre-defined number of validated genetic markers, amenable to a consensual handling and reporting. Whole genome population analyses will prove extremely useful in selecting the best suited and most efficient of those markers.

[1]  W Parson,et al.  Inter-laboratory evaluation of SNP-based forensic identification by massively parallel sequencing using the Ion PGM™. , 2015, Forensic science international. Genetics.

[2]  Yaran Yang,et al.  Application of Next-generation Sequencing Technology in Forensic Science , 2014, Genom. Proteom. Bioinform..

[4]  T Egeland,et al.  Beyond traditional paternity and identification cases. Selecting the most probable pedigree. , 2000, Forensic science international.

[5]  Xavier Estivill,et al.  Disorders: Filling the Gaps and Exploring Complexity in Genome-Wide Association Studies , 2022 .

[6]  J. V. van Meurs,et al.  A SNP panel for identification of DNA and RNA specimens , 2018, BMC Genomics.

[7]  B. Weir,et al.  A genome-wide study of Hardy–Weinberg equilibrium with next generation sequence data , 2017, Human Genetics.

[8]  A. Scally Global clues to the nature of genomic mutations in humans , 2017, eLife.

[9]  A. Dawid,et al.  Non-fatherhood or mutation? A probabilistic approach to parental exclusion in paternity testing. , 2001, Forensic science international.

[10]  Pedro V. Silva,et al.  General Derivation of the Sets of Pedigrees with the Same Kinship Coefficients , 2010, Human Heredity.

[11]  Lorena Pantano,et al.  InvFEST, a database integrating information of polymorphic inversions in the human genome , 2013, Nucleic Acids Res..

[12]  Jeffrey D. Wall,et al.  Detecting Recombination Hotspots from Patterns of Linkage Disequilibrium , 2016, G3: Genes, Genomes, Genetics.

[13]  Svante Pääbo,et al.  The mosaic that is our genome , 2003, Nature.

[14]  Robert C. Elston,et al.  Will Formal Genetics Become Dispensable? , 2013, Human Heredity.

[15]  S. Cebrat,et al.  Distribution of Recombination Hotspots in the Human Genome – A Comparison of Computer Simulations with Real Data , 2013, PloS one.

[16]  P. Donnelly,et al.  The Fine-Scale Structure of Recombination Rate Variation in the Human Genome , 2004, Science.

[17]  R. Redfield “Why Do We Have to Learn This Stuff?”—A New Genetics for 21st Century Students , 2012, PLoS biology.

[18]  Niels Morling,et al.  ISFG: Recommendations on biostatistics in paternity testing. , 2007, Forensic science international. Genetics.

[19]  J. Koehler,et al.  The Coming Paradigm Shift in Forensic Identification Science , 2005, Science.

[20]  T. Egeland,et al.  DNA Commission of the International Society for Forensic Genetics (ISFG): Guidelines on the use of X-STRs in kinship analysis. , 2017, Forensic science international. Genetics.

[21]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[22]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[23]  A. Dawid,et al.  Response to: DNA identification by pedigree likelihood ratio accommodating population substructure and mutations , 2011, Investigative Genetics.

[24]  N. Pinto,et al.  Mutation and mutation rates at Y chromosome specific Short Tandem Repeat Polymorphisms (STRs): a reappraisal. , 2014, Forensic science international. Genetics.

[25]  Kelly A Meiklejohn,et al.  An automated independent workflow for the analysis of massively parallel sequence data from forensic SNP assays , 2018, Electrophoresis.

[26]  Amanda B. Hepler,et al.  Genetic relatedness analysis: modern data and new challenges , 2006, Nature Reviews Genetics.

[27]  M. Cáceres,et al.  Human inversions and their functional consequences , 2015, Briefings in functional genomics.

[28]  Heterozygosity increases microsatellite mutation rate , 2016, Biology Letters.

[29]  María del Mar González,et al.  Frequency and Pattern of Heteroplasmy in the Complete Human Mitochondrial Genome , 2013, PloS one.

[30]  Martin Krzywinski,et al.  The curse(s) of dimensionality , 2018, Nature Methods.

[31]  Yuan-ming Wu,et al.  Tri-allelic pattern of short tandem repeats identifies the murderer among identical twins and suggests an embryonic mutational origin. , 2015, Forensic science international. Genetics.

[32]  Wentian Li,et al.  Mappability and read length , 2014, Front. Genet..

[33]  Michael J. Dougherty,et al.  Assessing the Genetics Content in the Next Generation Science Standards , 2015, PloS one.

[34]  A. Amorim,et al.  Tri-allelic pattern at the TPOX locus: a familial study. , 2014, Gene.

[35]  Hoi Yan Chow,et al.  Qualitative and quantitative assessment of Illumina’s forensic STR and SNP kits on MiSeq FGx™ , 2017, PloS one.

[36]  B. Payseur,et al.  Effects of Demographic History on the Detection of Recombination Hotspots from Linkage Disequilibrium , 2017, Molecular biology and evolution.

[37]  Bruce Budowle,et al.  Increasing the reach of forensic genetics with massively parallel sequencing , 2017, Forensic Science, Medicine and Pathology.

[38]  Roded Sharan,et al.  To Embed or Not: Network Embedding as a Paradigm in Computational Biology , 2019, Front. Genet..

[39]  L. Chikhi,et al.  Send Orders of Reprints at Reprints@benthamscience.org on the Structural Plasticity of the Human Genome: Chromosomal Inver- Sions Revisited , 2022 .

[40]  W. Rojas,et al.  Evaluating the X Chromosome-Specific Diversity of Colombian Populations Using Insertion/Deletion Polymorphisms , 2014, PloS one.

[41]  Arthur Wuster,et al.  Estimating the human mutation rate from autozygous segments reveals population differences in human mutational processes , 2017, Nature Communications.

[42]  Jacqueline Weber-Lehmann,et al.  Finding the needle in the haystack: differentiating "identical" twins in paternity testing and forensics by ultra-deep next generation sequencing. , 2014, Forensic science international. Genetics.