KD4v: comprehensible knowledge discovery system for missense variant

A major challenge in the post-genomic era is to better understand how human genetic alterations involved in disease affect gene products. The KD4v (Comprehensible Knowledge Discovery System for Missense Variant) server characterizes and predicts the phenotypic effects (deleterious/neutral) of missense variants. The server provides a set of rules learned by Inductive Logic Programming (ILP) from a set of missense variants described by conservation, physico-chemical, functional and 3D structure predicates. These rules can be interpreted by non-experts and are used to accurately predict the deleterious/neutral status of an unknown mutation. The web server is available at http://decrypthon.igbmc.fr/kd4v.
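To make the idea of rule-based prediction concrete, the following is a minimal sketch of how a set of ILP-style rules, each a conjunction of predicates over a variant, could be applied to classify a mutation. The `Variant` fields, the two rules, and the `predict` function are hypothetical illustrations; they are not the predicates or rules actually learned by KD4v.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Variant:
    """Hypothetical boolean description of a missense variant."""
    conserved_position: bool    # conservation predicate (assumed)
    charge_change: bool         # physico-chemical predicate (assumed)
    in_functional_site: bool    # functional predicate (assumed)
    buried_in_structure: bool   # 3D structure predicate (assumed)


Predicate = Callable[[Variant], bool]

# Each rule is a conjunction of predicates; the rule set is a disjunction.
# These two rules are invented for illustration only.
RULES: List[List[Predicate]] = [
    [lambda v: v.conserved_position, lambda v: v.buried_in_structure],
    [lambda v: v.in_functional_site, lambda v: v.charge_change],
]


def predict(variant: Variant) -> str:
    """Return 'deleterious' if any rule fires, otherwise 'neutral'."""
    for rule in RULES:
        if all(predicate(variant) for predicate in rule):
            return "deleterious"
    return "neutral"
```

Because each rule is a readable conjunction of named properties, a prediction can be explained by pointing to the rule that fired, which is the kind of interpretability the abstract describes.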
