Annotation and curation of human genomic variations: an ELIXIR Implementation Study

Background: ELIXIR is an intergovernmental organization, primarily based around European countries, established to host life science resources, including databases, software tools, training material and cloud storage for the scientific community under a single infrastructure. Methods: In 2018, ELIXIR commissioned an international survey on the usage of databases and tools for annotating and curating human genomic variants with the aim of improving ELIXIR resources. The 27-question survey was made available on-line between September and December 2018 to rank the importance and explore the usage and limitations of a wide range of databases and tools for annotating and curating human genomic variants, including resources specific for next generation sequencing, research into mitochondria and protein structure. Results: Eighteen countries participated in the survey and a total of 92 questionnaires were collected and analysed. Most respondents (89%, n=82) were from academia or a research environment. 51% (n=47) of respondents gave answers on behalf of a small research group ( 10 people). The survey showed that the scientific community considers several resources supported by ELIXIR crucial or very important. Moreover, it showed that the work done by ELIXIR is greatly valued. In particular, most respondents acknowledged the importance of key features and benefits promoted by ELIXIR, such as the verified scientific quality and maintenance of ELIXIR-approved resources. Conclusions ELIXIR is a “one-stop-shop” that helps researchers identify the most suitable, robust and well-maintained bioinformatics resources for delivering their research tasks.

[1]  Vincent B. Chen,et al.  KING (Kinemage, Next Generation): A versatile interactive molecular and scientific visualization program , 2009, Protein science : a publication of the Protein Society.

[2]  Tommaso Mazza,et al.  MitImpact: an Exhaustive Collection of Pre‐computed Pathogenicity Predictions of Human Mitochondrial Non‐synonymous Variants , 2015, Human mutation.

[3]  K. Tomii,et al.  MitoFates: Improved Prediction of Mitochondrial Targeting Sequences and Their Cleavage Sites , 2015, Molecular & Cellular Proteomics.

[4]  Vincent Procaccio,et al.  Predicting the pathogenicity of novel variants in mitochondrial tRNA with MitoTIP , 2017, PLoS Comput. Biol..

[5]  Peng Yue,et al.  SNPs3D: Candidate gene and SNP selection for association studies , 2006, BMC Bioinformatics.

[6]  Karen Eilbeck,et al.  Evolution of the Sequence Ontology terms and relationships , 2009, J. Biomed. Informatics.

[7]  Bale,et al.  Standards and Guidelines for the Interpretation of Sequence Variants: A Joint Consensus Recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology , 2015, Genetics in Medicine.

[8]  Henrik Land,et al.  YASARA: A Tool to Obtain Structural Guidance in Biocatalytic Investigations. , 2018, Methods in molecular biology.

[9]  Carson C Chow,et al.  Second-generation PLINK: rising to the challenge of larger and richer datasets , 2014, GigaScience.

[10]  Bob Löwenberg,et al.  Diagnosis and management of AML in adults: 2017 ELN recommendations from an international expert panel. , 2017, Blood.

[11]  Andrew C. R. Martin,et al.  The SAAP pipeline and database: tools to analyze the impact and predict the pathogenicity of mutations , 2013, BMC Genomics.

[12]  Shamkant B. Navathe,et al.  MITOMAP: a human mitochondrial genome database—2004 update , 2004, Nucleic Acids Res..

[13]  Ernesto Picardi,et al.  MToolBox: a highly automated pipeline for heteroplasmy annotation and prioritization analysis of human mitochondrial variants in high-throughput sequencing , 2014, Bioinform..

[14]  P. Donnelly,et al.  A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies , 2009, PLoS genetics.

[15]  P. Bork,et al.  A method and server for predicting damaging missense mutations , 2010, Nature Methods.

[16]  E. Capriotti,et al.  Functional annotations improve the predictive score of human disease‐related mutations in proteins , 2009, Human mutation.

[17]  Gert Vriend,et al.  Protein structure analysis of mutations causing inheritable diseases. An e-Science approach with life scientist friendly interfaces , 2010, BMC Bioinformatics.

[18]  Karl R. Clauser,et al.  MitoCarta2.0: an updated inventory of mammalian mitochondrial proteins , 2015, Nucleic Acids Res..

[19]  Brian D. Weitzner,et al.  Serverification of Molecular Modeling Applications: The Rosetta Online Server That Includes Everyone (ROSIE) , 2013, PloS one.

[20]  Marcella Attimonelli,et al.  HmtVar: a new resource for human mitochondrial variations and pathogenicity data , 2018, Nucleic Acids Res..

[21]  Alfonso Valencia,et al.  PDBe-KB: a community-driven resource for structural and functional annotations , 2019, Nucleic Acids Res..

[22]  F. Cunningham,et al.  The Ensembl Variant Effect Predictor , 2016, Genome Biology.

[23]  Marcella Attimonelli,et al.  HmtDB 2016: data update, a better performing query system and human mitochondrial DNA haplogroup predictor , 2016, Nucleic Acids Res..

[24]  P E Bourne,et al.  The Protein Data Bank. , 2002, Nucleic acids research.

[25]  Hans-Jürgen Bandelt,et al.  HaploGrep 2: mitochondrial haplogroup classification in the era of high-throughput sequencing , 2016, Nucleic Acids Res..