Cell-free DNA fragment-size distribution analysis for non-invasive prenatal CNV prediction

BACKGROUND Non-invasive detection of aneuploidies in a fetal genome through analysis of cell-free DNA circulating in the maternal plasma is becoming a routine clinical test. Such tests, which rely on analyzing the read coverage or the allelic ratios at single-nucleotide polymorphism (SNP) loci, are not sensitive enough for smaller sub-chromosomal abnormalities due to sequencing biases and paucity of SNPs in a genome. RESULTS We have developed an alternative framework for identifying sub-chromosomal copy number variations in a fetal genome. This framework relies on the size distribution of fragments in a sample, as fetal-origin fragments tend to be smaller than those of maternal origin. By analyzing the local distribution of the cell-free DNA fragment sizes in each region, our method allows for the identification of sub-megabase CNVs, even in the absence of SNP positions. To evaluate the accuracy of our method, we used a plasma sample with the fetal fraction of 13%, down-sampled it to samples with coverage of 10X-40X and simulated samples with CNVs based on it. Our method had a perfect accuracy (both specificity and sensitivity) for detecting 5 Mb CNVs, and after reducing the fetal fraction (to 11%, 9% and 7%), it could correctly identify 98.82-100% of the 5 Mb CNVs and had a true-negative rate of 95.29-99.76%. AVAILABILITY AND IMPLEMENTATION Our source code is available on GitHub at https://github.com/compbio-UofT/FSDA CONTACT: : brudno@cs.toronto.edu.

[1]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[2]  A. Hui,et al.  Size distributions of maternal and fetal DNA in maternal plasma. , 2004, Clinical chemistry.

[3]  Stephen R Quake,et al.  Analysis of the size distributions of fetal and maternal cell-free DNA by paired-end sequencing. , 2010, Clinical chemistry.

[4]  Elizabeth M. Smigielski,et al.  dbSNP: the NCBI database of genetic variation , 2001, Nucleic Acids Res..

[5]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[6]  Wade A. Barrett,et al.  Selective analysis of cell-free DNA in maternal blood for evaluation of fetal trisomy , 2012, Prenatal diagnosis.

[7]  C. Cantor,et al.  Noninvasive prenatal diagnosis of fetal chromosomal aneuploidy by massively parallel genomic sequencing of DNA in maternal plasma , 2008, Proceedings of the National Academy of Sciences.

[8]  A. Oliphant,et al.  Gestational age and maternal weight effects on fetal cell‐free DNA in maternal plasma , 2013, Prenatal diagnosis.

[9]  L. Chitty,et al.  Non-invasive prenatal testing for Down syndrome. , 2014, Seminars in fetal & neonatal medicine.

[10]  Michael Brudno,et al.  Probabilistic method for detecting copy number variation in a fetal genome using maternal plasma sequencing , 2014, Bioinform..

[11]  H. C. Fan,et al.  Noninvasive diagnosis of fetal aneuploidy by shotgun sequencing DNA from maternal blood , 2008, Proceedings of the National Academy of Sciences.

[12]  Jay Shendure,et al.  Noninvasive Whole-Genome Sequencing of a Human Fetus , 2012, Science Translational Medicine.

[13]  Matthew W. Snyder,et al.  Cell-free DNA Comprises an In Vivo Nucleosome Footprint that Informs Its Tissues-Of-Origin , 2016, Cell.

[14]  I. Sargent,et al.  Presence of fetal DNA in maternal plasma and serum , 1997, The Lancet.

[15]  C. Alkan,et al.  MoDIL: detecting small indels from clone-end sequencing with mixtures of distributions , 2009, Nature Methods.

[16]  Matthew Rabinowitz,et al.  Non-invasive prenatal aneuploidy testing at chromosomes 13, 18, 21, X, and Y, using targeted sequencing of polymorphic loci , 2013 .

[17]  Susan Gross,et al.  Clinical practice. Prenatal screening for aneuploidy. , 2009, The New England journal of medicine.

[18]  Gary J. W. Liao,et al.  Size-based molecular diagnostics using plasma DNA for noninvasive prenatal testing , 2014, Proceedings of the National Academy of Sciences.