ExPlanes: Exploring Planes in Triplet Data

Many methods for the analysis of gene expression-, protein- or metabolite-data focus on the investigation of binary relationships, while the underlying biological processes creating this data may generate relations of higher than bivariate complexity. We give a novel method ExPlanes that helps to explore certain types of ternary relationships in a statistically robust, Bayesian framework. To arrive at an characterization of the data structure contained in triplet data we investigate 2-dimensional planes being the only linear structures that cannot be inferred from projections of the data. The key part of our methodology is the definition of a robust, Bayesian plane posterior under the assumption of an invariant prior and a Gaussian error model. A numerical representation of the plane posterior can be explored interactively. Beyond this purely Bayesian approach we can use the plane posterior to construct a family of posterior-based test statistics that allow testing the data for different plane related hypotheses. To demonstrate practicability we queried triplets of metabolic data from a plant crossing experiment for the presence of plane-, line- and point-structures by using posterior-based test statistics and were able to show their distinctiveness.

[1]  B. Palsson,et al.  Formulating genome-scale kinetic models in the post-genome era , 2008, Molecular systems biology.

[2]  E. Dougherty,et al.  Inferring Connectivity of Genetic Regulatory Networks Using Information-Theoretic Criteria , 2008, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[3]  Joachim Selbig,et al.  Metabolite fingerprinting: detecting biological features by independent component analysis , 2004, Bioinform..

[4]  Carsten O. Daub,et al.  The mutual information: Detecting and evaluating dependencies between variables , 2002, ECCB.

[5]  M. Mann,et al.  Proteomics to study genes and genomes , 2000, Nature.

[6]  O. Fiehn,et al.  Metabolite profiling for plant functional genomics , 2000, Nature Biotechnology.

[7]  R. Heinrich,et al.  The Regulation of Cellular Systems , 1996, Springer US.

[8]  E. Winzeler,et al.  Treasures and traps in genome-wide data sets: case examples from yeast , 2002, Nature Reviews Genetics.

[9]  Robin Sibson,et al.  What is projection pursuit , 1987 .

[10]  E. Jaynes Probability theory : the logic of science , 2003 .

[11]  E. Jaynes The well-posed problem , 1973 .

[12]  Matthias Holschneider,et al.  Robust detection and verification of linear relationships to generate metabolic networks using estimates of technical errors , 2007, BMC Bioinformatics.

[13]  Edwin Thompson Jaynes,et al.  Probability theory , 2003 .

[14]  Willi Freeden,et al.  Constructive Approximation on the Sphere: With Applications to Geomathematics , 1998 .

[15]  S. Schuster,et al.  Metabolic network structure determines key aspects of functionality and regulation , 2002, Nature.

[16]  I N Bronstein,et al.  Taschenbuch der Mathematik , 1966 .

[17]  I S Kohane,et al.  Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.