VIPER: a visualisation tool for exploring inheritance inconsistencies in genotyped pedigrees

BackgroundPedigree genotype datasets are used for analysing genetic inheritance and to map genetic markers and traits. Such datasets consist of hundreds of related animals genotyped for thousands of genetic markers and invariably contain multiple errors in both the pedigree structure and in the associated individual genotype data. These errors manifest as apparent inheritance inconsistencies in the pedigree, and invalidate analyses of marker inheritance patterns across the dataset. Cleaning raw datasets of bad data points (incorrect pedigree relationships, unreliable marker assays, suspect samples, bad genotype results etc.) requires expert exploration of the patterns of exposed inconsistencies in the context of the inheritance pedigree. In order to assist this process we are developing VIPER (Visual Pedigree Explorer), a software tool that integrates an inheritance-checking algorithm with a novel space-efficient pedigree visualisation, so that reported inheritance inconsistencies are overlaid on an interactive, navigable representation of the pedigree structure.Methods and resultsThis paper describes an evaluation of how VIPER displays the different scales and types of dataset that occur experimentally, with a description of how VIPER's display interface and functionality meet the challenges presented by such data. We examine a range of possible error types found in real and simulated pedigree genotype datasets, demonstrating how these errors are exposed and explored using the VIPER interface and we evaluate the utility and usability of the interface to the domain expert.Evaluation was performed as a two stage process with the assistance of domain experts (geneticists). The initial evaluation drove the iterative implementation of further features in the software prototype, as required by the users, prior to a final functional evaluation of the pedigree display for exploring the various error types, data scales and structures.ConclusionsThe VIPER display was shown to effectively expose the range of errors found in experimental genotyped pedigrees, allowing users to explore the underlying causes of reported inheritance inconsistencies. This interface will provide the basis for a full data cleaning tool that will allow the user to remove isolated bad data points, and reversibly test the effect of removing suspect genotypes and pedigree relationships.

[1]  Trevor Paterson,et al.  Visualising Errors in Animal Pedigree Genotype Data , 2011, Comput. Graph. Forum.

[2]  James T. Enns,et al.  High-speed visual estimation using preattentive processing , 1996, TCHI.

[3]  Victoria Interrante,et al.  User Studies: Why, How, and When? , 2003, IEEE Computer Graphics and Applications.

[4]  Eric A. Wernert,et al.  PViN: a scalable and flexible system for visualizing pedigree databases , 2005, ACM Symposium on Applied Computing.

[5]  Catherine Plaisant,et al.  Visualizing Missing Data: Graph Interpretation User Study , 2005, INTERACT.

[6]  Benjamin B. Bederson,et al.  A review of overview+detail, zooming, and focus+context interfaces , 2009, CSUR.

[7]  Pierre Dragicevic,et al.  GeneaQuilts: A System for Exploring Large Genealogies , 2010, IEEE Transactions on Visualization and Computer Graphics.

[8]  A. Law,et al.  Genotypechecker: an interactive tool for checking the inheritance consistency of genotyped pedigrees. , 2011, Animal genetics.

[9]  Melanie Tory,et al.  Evaluating Visualizations: Do Expert Reviews Work? , 2005, IEEE Computer Graphics and Applications.

[10]  R van Berloo,et al.  Peditree: pedigree database analysis and visualization for breeding and science. , 2005, The Journal of heredity.

[11]  B. Suarez,et al.  Genotyping errors, pedigree errors, and missing data , 2005, Genetic epidemiology.

[12]  R. Bennett,et al.  Recommendations for standardized human pedigree nomenclature. Pedigree Standardization Task Force of the National Society of Genetic Counselors. , 1995, American journal of human genetics.

[13]  Min He,et al.  PediDraw: A web-based tool for drawing a pedigree in genetic counseling , 2007, BMC Medical Genetics.

[14]  Cláudio T. Silva,et al.  PedVis: A Structured, Space-Efficient Technique for Pedigree Visualization , 2010, IEEE Transactions on Visualization and Computer Graphics.

[15]  C. Plaisant,et al.  Visualizing Missing Data : Classification and Empirical Study , 2005 .

[16]  Robin L. Bennett,et al.  Recommendations for standardized human pedigree nomenclature , 1995, Journal of Genetic Counseling.

[17]  D. L. Doyle,et al.  Standardized Human Pedigree Nomenclature: Update and Assessment of the Recommendations of the National Society of Genetic Counselors , 2008, Journal of Genetic Counseling.

[18]  J.C. Roberts,et al.  State of the Art: Coordinated & Multiple Views in Exploratory Visualization , 2007, Fifth International Conference on Coordinated and Multiple Views in Exploratory Visualization (CMV 2007).

[19]  Dennis R. Wixon,et al.  Using the RITE method to improve products; a definition and a case study , 2007 .