Mining beyond the exome

In the late 18th century, Erasmus Darwin, Charles Darwin's grandfather, advocated evolutionary theory as a mean to "unravel the theory of disease". More than 200 years later, although Darwinian medicine is regaining some ground after having been muzzled during the second half of the 20th century, genomics has largely outcompeted evolution and has acquired a dictatorial success as a tool for studying disease etiology. From an evolution-inspired perspective, we have gradually drifted into the habit of focusing primarily on genomic data from sources such as genome-wide association studies (GWAS). As a result, understanding the how and why of human diseases and pathobiology has largely become a matter of crunching DNA sequences. Despite the popularity of GWAS, their reality remains unchanged: most of the susceptibility loci they allow to identify explain only a small fraction of the heritability of complex diseases. A number of reasons for the so-called "missing heritability" have been proposed, and our goal is not to review them all. Here we primarily reiterate that there is more to discover than non-synonymous point mutations and suggest that amid genetic deserts and genetic islands, there is also more to explore than the coding regions of the genome. We then highlight the importance and the necessity of designing efficient methods to mine beyond the exome.

[1]  N. Risch Searching for genetic determinants in the new millennium , 2000, Nature.

[2]  D. Reich,et al.  Functional Enhancers at the Gene-Poor 8q24 Cancer-Linked Locus , 2009, PLoS genetics.

[3]  Joseph H Nadeau,et al.  Systems Genetics , 2011, Science.

[4]  Dan M Roden,et al.  A rare variant in MYH6 is associated with high risk of sick sinus syndrome , 2011, Nature Genetics.

[5]  Michael D. Cole,et al.  Upregulation of c-MYC in cis through a Large Chromatin Loop Linked to a Cancer Risk-Associated Single-Nucleotide Polymorphism in Colorectal Cancer Cells , 2010, Molecular and Cellular Biology.

[6]  E. Birney,et al.  Challenges and standards in integrating surveys of structural variation , 2007, Nature Genetics.

[7]  P. Stankiewicz,et al.  Structural variation in the human genome and its role in disease. , 2010, Annual review of medicine.

[8]  C. Croce,et al.  MicroRNA signatures in human cancers , 2006, Nature Reviews Cancer.

[9]  F. Slack,et al.  Oncomirs — microRNAs with a role in cancer , 2006, Nature Reviews Cancer.

[10]  Steven A McCarroll,et al.  Extending genome-wide association studies to copy-number variation. , 2008, Human molecular genetics.

[11]  Jake K. Byrnes,et al.  Genome-wide association study of copy number variation in 16,000 cases of eight common diseases and 3,000 shared controls , 2010, Nature.

[12]  Vip Viprakasit,et al.  A Regulatory SNP Causes a Human Genetic Disease by Creating a New Transcriptional Promoter , 2006, Science.

[13]  P. Gluckman,et al.  How evolutionary principles improve the understanding of human health and disease , 2011, Evolutionary applications.

[14]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[15]  Ryan Hunt,et al.  Silent (synonymous) SNPs: should we care about them? , 2009, Methods in molecular biology.

[16]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[17]  Jason H. Moore,et al.  Layers of epistasis: genome‐wide regulatory networks and network approaches to genome‐wide association studies , 2011, Wiley interdisciplinary reviews. Systems biology and medicine.

[18]  C. Croce,et al.  Non-codingRNA sequence variations in human chronic lymphocytic leukemia and colorectal cancer. , 2010, Carcinogenesis.