Computational framework for targeted high-coverage sequencing based NIPT

Non-invasive prenatal testing (NIPT) enables accurate detection of fetal chromosomal trisomies. The majority of existing computational methods for sequencing-based NIPT analyses rely on low-coverage whole-genome sequencing (WGS) data and are not applicable for targeted high-coverage sequencing data from cell-free DNA samples. Here, we present a novel computational framework for a targeted high-coverage sequencing based NIPT analysis. The developed methods use a hidden Markov model (HMM)-based approach in conjunction with supplemental machine learning methods, such as decision tree (DT) and support vector machine (SVM), to detect fetal trisomy and parental origin of additional fetal chromosomes. These methods were tested with simulated datasets covering a wide range of biologically relevant scenarios with various chromosomal quantities, parental origins of extra chromosomes, fetal DNA fractions and sequencing read depths. Consequently, we determined the functional feasibility and limitations of each proposed approach and demonstrated that read count-based HMM achieved the best overall classification accuracy of 0.89 for detecting fetal euploidies and trisomies. Furthermore, we show that by using the DT and SVM methods on the HMM state classification results, it was possible to increase the final trisomy classification accuracy to 0.98 and 0.99, respectively. We demonstrated that read count and allelic ratio-based models can achieve a high accuracy (up to 0.98) for detecting fetal trisomy even if the fetal fraction is as low as 2%. Currently existing methods require at least 4% fetal fraction, which can be an issue in the case of early gestational age (<10 weeks) or elevated maternal body mass index (>35 kg/m2). More accurate detection can be achieved at higher sequencing depth using HMM in conjunction with supplemental methods, which significantly improve the trisomy detection especially in borderline scenarios (e.g., very low fetal fraction) and can enable to perform NIPT even earlier than 10 weeks of pregnancy.

[1]  K. Nicolaides,et al.  Fetal fraction in maternal plasma cell‐free DNA at 11–13 weeks' gestation: relation to maternal and fetal characteristics , 2013, Ultrasound in obstetrics & gynecology : the official journal of the International Society of Ultrasound in Obstetrics and Gynecology.

[2]  Matthew Rabinowitz,et al.  Non-invasive prenatal aneuploidy testing at chromosomes 13, 18, 21, X, and Y, using targeted sequencing of polymorphic loci , 2013 .

[3]  Y. Li,et al.  Aneuploidy in Early Miscarriage and its Related Factors , 2015, Chinese medical journal.

[4]  Nathalie Brison,et al.  NIPTmer: rapid k-mer-based software package for detection of fetal aneuploidies , 2018, Scientific Reports.

[5]  Sujoy Ghosh,et al.  Chromosome 21 non-disjunction and Down syndrome birth in an Indian cohort: analysis of incidence and aetiology from family linkage data. , 2010, Genetics research.

[6]  Patricia A. Hunt,et al.  Human aneuploidy: mechanisms and new insights into an age-old problem , 2012, Nature Reviews Genetics.

[7]  S. Antonarakis Parental origin of the extra chromosome in trisomy 21 as indicated by analysis of DNA polymorphisms. Down Syndrome Collaborative Group. , 1991, The New England journal of medicine.

[8]  John Tynan,et al.  Determination of fetal DNA fraction from the plasma of pregnant women using sequence read counts , 2015, Prenatal diagnosis.

[9]  W D Flanders,et al.  Advanced maternal age and the risk of Down syndrome characterized by the meiotic stage of chromosomal error: a population-based study. , 1996, American journal of human genetics.

[10]  TAC-seq: targeted DNA and RNA sequencing for precise biomarker molecule counting , 2018 .

[11]  M. McInnis,et al.  The meiotic stage of nondisjunction in trisomy 21: determination by using DNA polymorphisms. , 1992, American journal of human genetics.

[12]  U. Müller,et al.  Parental origin of the extra chromosome in trisomy 18. , 1989, American journal of human genetics.

[13]  T. Hassold,et al.  Down syndrome: genetic recombination and the origin of the extra chromosome 21 , 2000, Clinical genetics.

[14]  Elizabeth M. Smigielski,et al.  dbSNP: the NCBI database of genetic variation , 2001, Nucleic Acids Res..

[15]  Peiyong Jiang,et al.  Noninvasive Prenatal Diagnosis of Fetal Trisomy 21 by Allelic Ratio Analysis Using Targeted Massively Parallel Sequencing of Maternal Plasma DNA , 2012, PloS one.

[16]  Gary J. W. Liao,et al.  Targeted massively parallel sequencing of maternal plasma DNA permits efficient and unbiased detection of fetal alleles. , 2011, Clinical chemistry.

[17]  Ping Liu,et al.  An Advanced Model to Precisely Estimate the Cell-Free Fetal DNA Concentration in Maternal Plasma , 2016, PloS one.

[18]  Peiyong Jiang,et al.  FetalQuant: deducing fractional fetal DNA concentration from massively parallel sequencing of DNA in maternal plasma , 2012, Bioinform..

[19]  Amir R. Kermany,et al.  TroX: a new method to learn about the genesis of aneuploidy from trisomic products of conception , 2014, Bioinform..

[20]  Christopher A. Miller,et al.  ReadDepth: A Parallel R Package for Detecting Copy Number Alterations from Short Sequencing Reads , 2011, PloS one.

[21]  Lyubov Yevtushok,et al.  Twenty-year trends in the prevalence of Down syndrome and other trisomies in Europe: impact of maternal age and prenatal screening , 2012, European Journal of Human Genetics.

[22]  J. Kere,et al.  TAC-seq: targeted DNA and RNA sequencing for precise biomarker molecule counting , 2018, bioRxiv.