An open-source, integrated pedigree data management and visualization tool for genetic epidemiology.

With advances in genetic epidemiology, increasingly large amounts of pedigree-related information are being collected by family studies, including twin studies. To date, biomedical data management systems that cater for family data have usually done so as part of their standard (non-family-centric) data model. Consequently, data managers with computing expertise are needed to extract family datasets and perform family-centric operations. We present a robust approach to handling large family datasets. Our approach is implemented as a new module which extends the capabilities of The Ark, an open-source web-based biomedical data management tool. Using an algorithm designed by the authors, the pedigree module dynamically infers family relationships for any selected subject (not necessarily the proband). A web interface allows researchers to create, update, delete and navigate parental and twin relationships between subjects, and bulk import/export pedigrees. Consanguineous relationships can be captured, and configurable pedigree visualizations generated. A web services interface provides interoperability.
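The abstract does not describe the authors' inference algorithm, so the following is only a minimal sketch of the general idea: store parent links alone and derive every other relationship on demand for whichever subject is selected. The data layout (`PARENTS`), the function names and the relationship labels are all assumptions made for illustration. They are not The Ark's data model or API.

```python
from collections import deque

# Hypothetical parent links: child -> (father, mother). Not The Ark's schema.
PARENTS = {
    "C1": ("F1", "M1"),
    "C2": ("F1", "M1"),
    "F1": ("GF", "GM"),
    "U1": ("GF", "GM"),
    "K1": ("U1", "A1"),
}

# (generations up from the subject, generations up from the other person)
# to the nearest common ancestor -> label for the other person.
LABELS = {
    (0, 0): "self",
    (1, 0): "parent",
    (0, 1): "child",
    (2, 0): "grandparent",
    (0, 2): "grandchild",
    (1, 1): "sibling",          # full and half siblings are not distinguished here
    (2, 1): "aunt/uncle",
    (1, 2): "niece/nephew",
    (2, 2): "first cousin",
}


def ancestors(subject, parents):
    """Map each ancestor of `subject` to its shortest generation distance (subject = 0)."""
    depth = {subject: 0}
    queue = deque([subject])
    while queue:
        person = queue.popleft()
        for parent in parents.get(person, ()):
            # Skip unknown parents and ancestors already reached by a shorter path
            # (the latter can happen in consanguineous pedigrees).
            if parent is not None and parent not in depth:
                depth[parent] = depth[person] + 1
                queue.append(parent)
    return depth


def relationship(subject, other, parents):
    """Describe `other` relative to `subject` via their nearest common ancestor."""
    up_subject = ancestors(subject, parents)
    up_other = ancestors(other, parents)
    common = set(up_subject) & set(up_other)
    if not common:
        return "unrelated"
    nearest = min(common, key=lambda p: (up_subject[p] + up_other[p], p))
    steps = (up_subject[nearest], up_other[nearest])
    return LABELS.get(steps, f"relative ({steps[0]} up, {steps[1]} down)")


def relatives_of(subject, parents):
    """Infer the relationship of every known person to the selected subject."""
    people = set(parents)
    for pair in parents.values():
        people.update(p for p in pair if p is not None)
    found = {}
    for person in sorted(people - {subject}):
        label = relationship(subject, person, parents)
        if label != "unrelated":
            found[person] = label
    return found


if __name__ == "__main__":
    # Any subject can be the starting point, not only the proband.
    for person, label in relatives_of("C1", PARENTS).items():
        print(person, label)
```

Because only parent links are stored, selecting a different subject simply re-runs the same traversal from a new starting point. A production implementation would also need explicit twin links, sex-specific labels and cycle checks on imported data, none of which this sketch attempts.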
