Two new ArrayTrack libraries for personalized biomedical research

BackgroundRecent advances in high-throughput genotyping technology are paving the way for research in personalized medicine and nutrition. However, most of the genetic markers identified from association studies account for a small contribution to the total risk/benefit of the studied phenotypic trait. Testing whether the candidate genes identified by association studies are causal is critically important to the development of personalized medicine and nutrition. An efficient data mining strategy and a set of sophisticated tools are necessary to help better understand and utilize the findings from genetic association studies.DescriptionSNP (single nucleotide polymorphism) and QTL (quantitative trait locus) libraries were constructed and incorporated into ArrayTrack, with user-friendly interfaces and powerful search features. Data from several public repositories were collected in the SNP and QTL libraries and connected to other domain libraries (genes, proteins, metabolites, and pathways) in ArrayTrack. Linking the data sets within ArrayTrack allows searching of SNP and QTL data as well as their relationships to other biological molecules. The SNP library includes approximately 15 million human SNPs and their annotations, while the QTL library contains publically available QTLs identified in mouse, rat, and human. The QTL library was developed for finding the overlap between the map position of a candidate or metabolic gene and QTLs from these species. Two use cases were included to demonstrate the utility of these tools. The SNP and QTL libraries are freely available to the public through ArrayTrack at http://www.fda.gov/ArrayTrack.ConclusionsThese libraries developed in ArrayTrack contain comprehensive information on SNPs and QTLs and are further cross-linked to other libraries. Connecting domain specific knowledge is a cornerstone of systems biology strategies and allows for a better understanding of the genetic and biological context of the findings from genetic association studies.

[1]  Elizabeth M. Smigielski,et al.  dbSNP: the NCBI database of genetic variation , 2001, Nucleic Acids Res..

[2]  David Haussler,et al.  The UCSC Genome Browser database: update 2010 , 2009, Nucleic Acids Res..

[3]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[4]  W. J. Visek,et al.  Diet-disease interactions at the molecular level: an experimental paradigm. , 1994, The Journal of nutrition.

[5]  R. Rodriguez,et al.  Nutritional genomics: the next frontier in the postgenomic era , 2003 .

[6]  M. Stoneking,et al.  Worldwide population differentiation at disease-associated SNPs , 2008, BMC Medical Genomics.

[7]  C. Rotimi,et al.  Genetic Variants Associated with Complex Human Diseases Show Wide Variation across Multiple Populations , 2009, Public Health Genomics.

[8]  M. Olivier A haplotype map of the human genome , 2003, Nature.

[9]  Jim Kaput,et al.  Nutrigenomics research for personalized nutrition and medicine. , 2008, Current opinion in biotechnology.

[10]  M. Olivier A haplotype map of the human genome. , 2003, Nature.

[11]  Zhaohui S. Qin,et al.  A second generation human haplotype map of over 3.1 million SNPs , 2007, Nature.

[12]  W. J. Visek,et al.  Biochemical and Molecular Roles of Nutrients Lipid Level and Type Alter Stearoyl CoA Desaturase mRNA Abundance Differently in Mice with Distinct Susceptibilities to Diet-Influenced Diseases , 1997 .

[13]  L. Tappy,et al.  Metabolic effects of fructose and the worldwide increase in obesity. , 2010, Physiological reviews.

[14]  Momiao Xiong,et al.  Gene and pathway-based second-wave analysis of genome-wide association studies , 2010, European Journal of Human Genetics.

[15]  M. Stoneking,et al.  Identification and Analysis of Genomic Regions with Large Between‐Population Differentiation in Humans , 2007, Annals of human genetics.

[16]  Mary Goldman,et al.  The UCSC Genome Browser database: update 2011 , 2010, Nucleic Acids Res..

[17]  Christian Gieger,et al.  A genome-wide perspective of genetic variation in human metabolism , 2010, Nature Genetics.

[18]  Mary Shimoyama,et al.  The Rat Genome Database, update 2007—Easing the path from disease to data and back again , 2006, Nucleic Acids Res..

[19]  Jim Kaput,et al.  A Strategy for Analyzing Gene—Nutrient Interactions in Type 2 Diabetes , 2009, Journal of diabetes science and technology.

[20]  Weida Tong,et al.  ArrayTrack--supporting toxicogenomic research at the U.S. Food and Drug Administration National Center for Toxicological Research. , 2003, Environmental health perspectives.