hgvs: A Python package for manipulating sequence variants using HGVS nomenclature: 2018 Update

The Human Genome Variation Society (HGVS) nomenclature guidelines encourage the accurate and standard description of DNA, RNA, and protein sequence variants in public variant databases and the scientific literature. Inconsistent application of the HGVS guidelines can lead to misinterpretation of variants in clinical settings. Reliable software tools are essential to ensure consistent application of the HGVS guidelines when reporting and interpreting variants. We present the hgvs Python package, a comprehensive tool for manipulating sequence variants according to the HGVS nomenclature guidelines. Distinguishing features of the hgvs package include: (1) parsing, formatting, validating, and normalizing variants on genome, transcript, and protein sequences; (2) projecting variants between aligned sequences, including those with gapped alignments; (3) flexible installation using remote or local data (fully local installations eliminate network dependencies); (4) extensive automated tests; and (5) open source development by a community from eight organizations worldwide. This report summarizes recent and significant updates to the hgvs package since its original release in 2014, and presents results of extensive validation using clinical relevant variants from ClinVar and HGMD.

[1]  Vincent A. Fusaro,et al.  A Python package for parsing, validating, mapping and formatting sequence variants using HGVS nomenclature , 2014, Bioinform..

[2]  Pablo Cingolani,et al.  © 2012 Landes Bioscience. Do not distribute. , 2022 .

[3]  Caroline Clark,et al.  HGVS Nomenclature in Practice: An Example from the United Kingdom National External Quality Assessment Scheme , 2016, Human mutation.

[4]  P. Stenson,et al.  The Human Gene Mutation Database: building a comprehensive mutation repository for clinical and molecular genetics, diagnostic testing and personalized genomic medicine , 2013, Human Genetics.

[5]  H. Hakonarson,et al.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data , 2010, Nucleic acids research.

[6]  Johan T den Dunnen,et al.  Improving sequence variant descriptions in mutation databases and literature using the Mutalyzer sequence variation nomenclature checker , 2008, Human mutation.

[7]  Raymond Dalgleish,et al.  VariantValidator: Accurate validation, mapping, and formatting of sequence variation descriptions , 2017, Human mutation.

[8]  W. Chung,et al.  Recommendations for reporting of secondary findings in clinical exome and genome sequencing, 2016 update (ACMG SF v2.0): a policy statement of the American College of Medical Genetics and Genomics , 2016, Genetics in Medicine.

[9]  Deanna M. Church,et al.  A variant by any name: quantifying annotation discordance across tools and clinical databases , 2016, Genome Medicine.

[10]  Nicola Wolstenholme,et al.  What's in a Name? A Coordinated Approach toward the Correct Use of a Uniform Nomenclature to Improve Patient Reports and Databases , 2016, Human mutation.

[11]  Deanna M. Church,et al.  Assembly: a resource for assembled genomes at NCBI , 2015, Nucleic Acids Res..

[12]  Raymond Dalgleish,et al.  HGVS Recommendations for the Description of Sequence Variants: 2016 Update , 2016, Human mutation.

[13]  Bale,et al.  Standards and Guidelines for the Interpretation of Sequence Variants: A Joint Consensus Recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology , 2015, Genetics in Medicine.

[14]  Chunlei Liu,et al.  ClinVar: improving access to variant interpretations and supporting evidence , 2017, Nucleic Acids Res..

[15]  F. Cunningham,et al.  The Ensembl Variant Effect Predictor , 2016, Genome Biology.

[16]  Jeroen F. J. Laros,et al.  LOVD v.2.0: the next generation in gene variant databases , 2011, Human mutation.

[17]  S E McDonald,et al.  Standards and guidelines. , 1993, SCI nursing : a publication of the American Association of Spinal Cord Injury Nurses.

[18]  S. Antonarakis,et al.  Mutation nomenclature extensions and suggestions to describe complex mutations: A discussion , 2000 .

[19]  S. Antonarakis Recommendations for a nomenclature system for human gene mutations , 1998 .

[20]  Marilyn M. Li,et al.  Standards and Guidelines for the Interpretation and Reporting of Sequence Variants in Cancer: A Joint Consensus Recommendation of the Association for Molecular Pathology, American Society of Clinical Oncology, and College of American Pathologists. , 2017, The Journal of molecular diagnostics : JMD.