Computational framework for targeted high-coverage sequencing-based NIPT

Non-invasive prenatal testing (NIPT) enables accurate detection of fetal chromosomal trisomies. Most publicly available computational methods for sequencing-based NIPT analysis rely on low-coverage whole-genome sequencing (WGS) data and are not applicable to targeted high-coverage sequencing data from cell-free DNA samples. Here, we present a novel computational framework for targeted high-coverage sequencing-based NIPT analysis. The framework uses a hidden Markov model (HMM) in conjunction with a supplemental machine learning model, such as a decision tree (DT) or support vector machine (SVM), to detect fetal trisomy and the parental origin of the additional fetal chromosome. These models were developed using simulated datasets covering a wide range of biologically relevant scenarios with various chromosomal quantities, parental origins of extra chromosomes, fetal DNA fractions, and sequencing read depths. The developed models were tested on simulated and experimental targeted sequencing datasets. We thereby determined the functional feasibility and limitations of each proposed approach and demonstrated that the read count-based HMM achieved the best overall classification accuracy, 0.89, for detecting fetal euploidies and trisomies on the simulated dataset. Furthermore, we show that applying the DT and SVM to the HMM classification results increased the final trisomy classification accuracy to 0.98 and 0.99, respectively. We demonstrate that read count and allelic ratio-based models can achieve high accuracy (up to 0.98) for detecting fetal trisomy even when the fetal fraction is as low as 2%. Existing commercial NIPT analysis requires a fetal fraction of at least 4%, which can be a challenge at early gestational age (<10 weeks) or high maternal body mass index (>35 kg/m²). More accurate detection can be achieved at higher sequencing depth using the HMM in conjunction with supplemental models, which significantly improve trisomy detection, especially in borderline scenarios (e.g., very low fetal fraction), and make it possible to perform NIPT earlier than 10 weeks of pregnancy.
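To illustrate why low fetal fraction is a borderline scenario and why higher sequencing depth helps, the following minimal sketch computes the expected read-count excess on a trisomic chromosome and its approximate separation from euploidy. It is not the framework's HMM. It assumes idealized Poisson-distributed read counts with no amplification bias or overdispersion, and the function names `trisomy_read_ratio` and `approx_z_score` are hypothetical.

```python
import math


def trisomy_read_ratio(fetal_fraction: float) -> float:
    """Expected read-count ratio (trisomic / euploid) for the affected chromosome.

    Maternal cfDNA contributes 2 copies; fetal cfDNA contributes 3 copies
    instead of 2, so the ratio simplifies to 1 + fetal_fraction / 2.
    """
    return ((1.0 - fetal_fraction) * 2.0 + fetal_fraction * 3.0) / 2.0


def approx_z_score(fetal_fraction: float, euploid_reads: float) -> float:
    """Approximate trisomy signal in units of Poisson standard deviations.

    Assumption: read counts are Poisson, so the standard deviation of the
    euploid count is sqrt(euploid_reads). Real data are usually noisier.
    """
    excess_reads = euploid_reads * (trisomy_read_ratio(fetal_fraction) - 1.0)
    return excess_reads / math.sqrt(euploid_reads)


if __name__ == "__main__":
    # Compare fetal fractions of 2%, 4%, and 10% at three read depths.
    for ff in (0.02, 0.04, 0.10):
        for reads in (10_000, 100_000, 1_000_000):
            z = approx_z_score(ff, reads)
            print(f"fetal fraction {ff:.2f}, reads {reads:>9,}: z ~ {z:.1f}")
```

Under these assumptions the signal grows linearly with fetal fraction but only with the square root of read depth, which is consistent with the observation that deeper sequencing and additional evidence (such as allelic ratios) matter most at very low fetal fractions.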
