Effectiveness of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data

Background Autism spectrum disorder (ASD) is a heterogenous multifactorial neurodevelopmental condition with a significant genetic susceptibility component. Thus, identifying genetic variations associated with ASD is a complex task. Whole exome sequencing (WES) is an effective approach for detecting extremely rare protein-coding single nucleotide variants (SNVs). However, interpreting the functional and clinical consequences of these variants requires integrating multifaceted genomic information. Methods We compared the effectiveness of three bioinformatics tools in detecting ASD candidate SNVs from WES data of 250 ASD family trios registered in the National Autism Database of Israel. We studied only rare (less than 1% population frequency), proband specific SNVs. The pathogenicity of SNVs, according to the American College of Medical Genetics (ACMG) guidelines, was evaluated by the InterVar and TAPES tools. In addition, likely gene disrupting (LGD) SNVs were detected based on an inhouse bioinformatics tool, designated PsiVariant, that integrates results from seven insilico prediction tools. We compared the effectiveness of these three approaches and their combinations in detecting SNVs in high confidence ASD genes. Results Overall, 605 SNVs in 499 genes distributed in 193 probands were detected by these tools. The overlap between the tools was 64.1%, 17.0%, and 21.6% for InterVar and TAPES, InterVar and PsiVariant, and TAPES and PsiVariant, respectively. The intersection between InterVar and PsiVariant (I and P) was the most effective approach in detecting ASD genes (OR = 5.38, 95% C.I. = 3.25 to 8.53). This combination detected 102 SNVs in 99 genes among 80 probands (approximate 36% diagnostic yield). Conclusions Our results suggest that integration of different variant interpretation approaches in detecting ASD candidate SNVs from WES data is superior to each approach alone. Inclusion of additional criteria could further improve the detection of ASD candidate variants.

[1]  Shalaw R. Sallah,et al.  Assessing the pathogenicity of in-frame CACNA1F indel variants using structural modelling. , 2022, The Journal of molecular diagnostics : JMD.

[2]  Ryan L. Collins,et al.  Rare coding variation provides insight into the genetic architecture and phenotypic context of autism , 2022, Nature Genetics.

[3]  Peiyao A. Zhao,et al.  Rare variants and the oligogenic architecture of autism. , 2022, Trends in genetics : TIG.

[4]  I. Dinstein,et al.  Diagnostic Yield and Economic Implications of Whole-Exome Sequencing for ASD Diagnosis in Israel , 2021, Genes.

[5]  Trygve E Bakken,et al.  Recent ultra-rare inherited variants implicate new autism candidate risk genes , 2021, Nature Genetics.

[6]  B. Greenberg,et al.  Exome sequencing in obsessive–compulsive disorder reveals a burden of rare damaging coding variants , 2021, Nature Neuroscience.

[7]  N. de Leeuw,et al.  Stepwise ABC system for classification of any type of genetic variant , 2021, European Journal of Human Genetics.

[8]  F. Quintero-Rivera,et al.  Pathogenic paternally inherited NLGN4X deletion in a female with autism spectrum disorder: Clinical, cytogenetic, and molecular characterization , 2020, American journal of medical genetics. Part A.

[9]  Xiaoming Liu,et al.  dbNSFP v4: a comprehensive database of transcript-specific functional predictions and annotations for human nonsynonymous and splice-site SNVs , 2020, Genome Medicine.

[10]  Y. Bonneh,et al.  The National Autism Database of Israel: a Resource for Studying Autism Risk Factors, Biomarkers, Outcome Measures, and Treatment Efficacy , 2020, Journal of Molecular Neuroscience.

[11]  D. Campion,et al.  Rare genetic susceptibility variants assessment in autism spectrum disorder: detection rate and practical use , 2020, Translational Psychiatry.

[12]  Rodney J Scott,et al.  TAPES: A tool for assessment and prioritisation in exome studies , 2019, PLoS Comput. Biol..

[13]  Jakob Grove,et al.  Autism spectrum disorder (ASD) and attention-deficit/hyperactivity disorder (ADHD) have a similar burden of rare protein-truncating variants , 2019, Nature Neuroscience.

[14]  B. O’Roak,et al.  Exome sequencing of 457 autism families recruited online provides evidence for autism risk genes , 2019, npj Genomic Medicine.

[15]  N. Zhang,et al.  Nonrandom occurrence of multiple de novo coding variants in a proband indicates the existence of an oligogenic model in autism , 2019, Genetics in Medicine.

[16]  M. Daly,et al.  Recessive gene disruptions in autism spectrum disorder , 2019, Nature Genetics.

[17]  B. Coe,et al.  Genome sequencing identifies multiple deleterious variants in autism patients with more severe phenotypes , 2018, Genetics in Medicine.

[18]  Matthew W. Mosconi,et al.  Large-Scale Exome Sequencing Study Implicates Both Developmental and Functional Changes in the Neurobiology of Autism , 2019, Cell.

[19]  Gregory M. Cooper,et al.  CADD: predicting the deleteriousness of variants throughout the human genome , 2018, Nucleic Acids Res..

[20]  C. K. Lim,et al.  Large-Scale Whole-Genome Sequencing of Three Diverse Asian Populations in Singapore , 2018, Cell.

[21]  B. H. Lo,et al.  Autism Spectrum Disorder , 2018, Journal of paediatrics and child health.

[22]  Len A. Pennacchio,et al.  Genomic Patterns of De Novo Mutation in Simplex Autism , 2017, Cell.

[23]  I. Dinstein,et al.  Brief Report: The Negev Hospital-University-Based (HUB) Autism Database , 2017, Journal of autism and developmental disorders.

[24]  M. Daly,et al.  Regional missense constraint improves variant deleteriousness prediction , 2017, bioRxiv.

[25]  Quan Li,et al.  InterVar: Clinical Interpretation of Genetic Variants by the 2015 ACMG-AMP Guidelines. , 2017, American journal of human genetics.

[26]  Gill Bejerano,et al.  M-CAP eliminates a majority of variants of uncertain significance in clinical exomes at high sensitivity , 2016, Nature Genetics.

[27]  Trevor Hastie,et al.  REVEL: An Ensemble Method for Predicting the Pathogenicity of Rare Missense Variants. , 2016, American journal of human genetics.

[28]  S. Lovell,et al.  The role of small in-frame insertions/deletions in inherited eye disorders and how structural modelling can help estimate their pathogenicity , 2016, Orphanet Journal of Rare Diseases.

[29]  Rita M Cantor,et al.  Rare Inherited and De Novo CNVs Reveal Complex Contributions to ASD Risk in Multiplex Families. , 2016, American journal of human genetics.

[30]  Leif Groop,et al.  LoFtool: a gene intolerance score based on loss‐of‐function variants in 60 706 individuals , 2016, Bioinform..

[31]  F. Cunningham,et al.  The Ensembl Variant Effect Predictor , 2016, bioRxiv.

[32]  Heejeong Yoo,et al.  Genetics of Autism Spectrum Disorder: Current Status and Possible Clinical Applications , 2015, Experimental neurobiology.

[33]  Kali T. Witherspoon,et al.  Excess of rare, inherited truncating mutations in autism , 2015, Nature Genetics.

[34]  Bale,et al.  Standards and Guidelines for the Interpretation of Sequence Variants: A Joint Consensus Recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology , 2015, Genetics in Medicine.

[35]  Christopher S. Poultney,et al.  Synaptic, transcriptional, and chromatin genes disrupted in autism , 2014, Nature.

[36]  Kathryn Roeder,et al.  De novo insertions and deletions of predominantly paternal origin are associated with autism spectrum disorder. , 2014, Cell reports.

[37]  Jean-Baptiste Cazier,et al.  Choice of transcripts and software has a large effect on variant annotation , 2014, Genome Medicine.

[38]  Sharmila Banerjee-Basu,et al.  SFARI Gene 2.0: a community-driven knowledgebase for the autism spectrum disorders (ASDs) , 2013, Molecular Autism.

[39]  I. Adzhubei,et al.  Predicting Functional Effect of Human Missense Mutations Using PolyPhen‐2 , 2013, Current protocols in human genetics.

[40]  Kenny Q. Ye,et al.  De Novo Gene Disruptions in Children on the Autistic Spectrum , 2012, Neuron.

[41]  Evan T. Geller,et al.  Patterns and rates of exonic de novo mutations in autism spectrum disorders , 2012, Nature.

[42]  C. Lajonchere,et al.  Genetic heritability and shared environmental factors among twin pairs with autism. , 2011, Archives of general psychiatry.

[43]  Angelica Ronald,et al.  Autism spectrum disorders and autistic traits: A decade of new twin studies , 2011, American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics.

[44]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[45]  H. Hakonarson,et al.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data , 2010, Nucleic acids research.

[46]  Steven Henikoff,et al.  SIFT: predicting amino acid changes that affect protein function , 2003, Nucleic Acids Res..