Graph theoretical approach to study eQTL: a case study of Plasmodium falciparum

Motivation: Analysis of expression quantitative trait loci (eQTL) significantly contributes to the determination of gene regulation programs. However, the discovery and analysis of associations of gene expression levels and their underlying sequence polymorphisms continue to pose many challenges. Methods are limited in their ability to illuminate the full structure of the eQTL data. Most rely on an exhaustive, genome scale search that considers all possible locus–gene pairs and tests the linkage between each locus and gene. Result: To analyze eQTLs in a more comprehensive and efficient way, we developed the Graph based eQTL Decomposition method (GeD) that allows us to model genotype and expression data using an eQTL association graph. Through graph-based heuristics, GeD identifies dense subgraphs in the eQTL association graph. By identifying eQTL association cliques that expose the hidden structure of genotype and expression data, GeD effectively filters out most locus–gene pairs that are unlikely to have significant linkage. We apply GeD on eQTL data from Plasmodium falciparum, the human malaria parasite, and show that GeD reveals the structure of the relationship between all loci and all genes on a whole genome level. Furthermore, GeD allows us to uncover additional eQTLs with lower FDR, providing an important complement to traditional eQTL analysis methods. Contact: przytyck@ncbi.nlm.nih.gov

[1]  C. Becquet,et al.  Strong-association-rule mining for large-scale gene-expression data analysis: a case study on human SAGE data , 2002, Genome Biology.

[2]  J. Nap,et al.  Genetical genomics: the added value from segregation. , 2001, Trends in genetics : TIG.

[3]  R. Doerge,et al.  Empirical threshold values for quantitative trait mapping. , 1994, Genetics.

[4]  John D. Storey,et al.  Multiple Locus Linkage Analysis of Genomewide Expression in Yeast , 2005, PLoS biology.

[5]  L. Kruglyak,et al.  Genetics of global gene expression , 2006, Nature Reviews Genetics.

[6]  R. Gwilliam,et al.  The complete nucleotide sequence of chromosome 3 of Plasmodium falciparum , 1999, Nature.

[7]  Andrew I Su,et al.  Uncovering regulatory pathways that affect hematopoietic stem cell function using 'genetical genomics' , 2005, Nature Genetics.

[8]  Manuel Llinás,et al.  Comparative whole genome transcriptome analysis of three Plasmodium falciparum strains , 2006, Nucleic acids research.

[9]  S. Hunt,et al.  Genome-Wide Associations of Gene Expression Variation in Humans , 2005, PLoS genetics.

[10]  David Botstein,et al.  GO: : TermFinder--open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes , 2004, Bioinform..

[11]  G. Churchill,et al.  A statistical framework for quantitative trait mapping. , 2001, Genetics.

[12]  E. Schadt,et al.  Genetic inheritance of gene expression in human cell lines. , 2004, American journal of human genetics.

[13]  Rachel B. Brem,et al.  Trans-acting regulatory variation in Saccharomyces cerevisiae and the role of transcription factors , 2003, Nature Genetics.

[14]  Jonathan E. Allen,et al.  Genome sequence of the human malaria parasite Plasmodium falciparum , 2002, Nature.

[15]  John Quackenbush Microarray data normalization and transformation , 2002, Nature Genetics.

[16]  Robert W. Williams,et al.  Complex trait analysis of gene expression uncovers polygenic and pleiotropic networks that modulate nervous system function , 2005, Nature Genetics.

[17]  John D. Storey,et al.  Statistical significance for genomewide studies , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[18]  J. Wootton,et al.  Genetic mapping in the human malaria parasite Plasmodium falciparum , 2004, Molecular microbiology.

[19]  S. Wuchty,et al.  Regulatory Hotspots in the Malaria Parasite Genome Dictate Transcriptional Variation , 2008, PLoS biology.

[20]  Enrico Petretto,et al.  Heritability and Tissue Specificity of Expression Quantitative Trait Loci , 2006, PLoS genetics.

[21]  Joshua T. Burdick,et al.  Mapping determinants of human gene expression by regional and genome-wide association , 2005, Nature.

[22]  Rachel B. Brem,et al.  The landscape of genetic complexity across 5,700 gene expression traits in yeast. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Martin Farach-Colton,et al.  Lattice based Clustering of Temporal Gene-Expression Matrices , 2007, SDM.

[24]  Yang Huang,et al.  A Linear Delay Algorithm for Building Concept Lattices , 2008, CPM.