Management of family relationship information for a three-generation cohort study

A system for inputting and storing family information, named “BirThree Enrollment,” was developed to promote a birth and three-generation cohort study (BirThree Cohort Study), and this system was operated successfully. In the study, it was necessary to satisfy many operational demands. Input information is overwritten and changed continuously. Complex kinship information must be quickly and accurately input and corrected, and information on those families not yet recruited must be retrieved. For these purposes, many devices are needed, from an input interface to the internal data structure. In the field of genetic statistics, a simple standard expressive form is used for describing family structure. This form has sufficient information for genetics; however, we developed this form further for our purposes in conducting the BirThree Cohort Study. To provide information about family roles as required in the BirThree Cohort Study, we expanded the data structure, and constructed the system that is able to be used for the daily operation. In our system, family pedigree information is stored along with initial clinical information, and enabled the input of all self-reported information to the data base. Operators are able to input this family information before the day is out. As a result, when recruitment is completed, family information will be completed concurrently. Therefore, it is possible to immediately know a certain person’s family structure. By using our system, data correction was improved dramatically, and the system was operated successfully. This study is the first report of the method for storing three generations of family data.

[1]  Francis S. Collins,et al.  Genes, environment and the value of prospective cohort studies , 2006, Nature Reviews Genetics.

[2]  W James Gauderman,et al.  Sample size requirements for matched case‐control studies of gene–environment interaction , 2002, Statistics in medicine.

[3]  Shinichi 進一 Kuriyama 栗山,et al.  The Tohoku Medical Megabank Project: Design and Mission , 2016, Journal of Epidemiology.

[4]  Hiroshi Tanaka,et al.  The Tohoku Medical Megabank Project: Design and Mission. , 2016, Journal of epidemiology.

[5]  Kengo Kinoshita,et al.  Rare variant discovery by deep whole-genome sequencing of 1,070 Japanese individuals , 2015, Nature Communications.

[6]  Clarice R Weinberg,et al.  Can DAGs clarify effect modification? , 2007, Epidemiology.

[7]  Juan Pablo Lewinger,et al.  Sample size requirements to detect gene‐environment interactions in genome‐wide association studies , 2011, Genetic epidemiology.

[8]  Brian L Browning,et al.  Identity by descent between distant relatives: detection and applications. , 2012, Annual review of genetics.

[9]  Robert E. Hewitt,et al.  Biobanking: the foundation of personalized medicine , 2011, Current opinion in oncology.

[10]  P. Tam The International HapMap Consortium. The International HapMap Project (Co-PI of Hong Kong Centre which responsible for 2.5% of genome) , 2003 .

[11]  Susumu Satomi The Great East Japan Earthquake: Tohoku University Hospital’s efforts and lessons learned , 2011, Surgery Today.

[12]  C. Wijmenga,et al.  Cohort Profile Cohort Profile : LifeLines , a three-generation cohort study and biobank , 2015 .

[13]  P. Visscher,et al.  Reconciling the analysis of IBD and IBS in complex trait studies , 2010, Nature Reviews Genetics.

[14]  F. Dekker,et al.  Graphical presentation of confounding in directed acyclic graphs. , 2015, Nephrology, dialysis, transplantation : official publication of the European Dialysis and Transplant Association - European Renal Association.

[15]  M. Pembrey The Avon Longitudinal Study of Parents and Children (ALSPAC): a resource for genetic epidemiology. , 2004, European journal of endocrinology.

[16]  Pieter B. T. Neerincx,et al.  Supplementary Information Whole-genome sequence variation , population structure and demographic history of the Dutch population , 2022 .

[17]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[18]  Terry M Therneau,et al.  The kinship2 R Package for Pedigree Data , 2014, Human Heredity.