Local similarity analysis reveals unique associations among marine bacterioplankton species and environmental factors

MOTIVATION Characterizing the diversity of microbial communities and understanding the environmental factors that influence community diversity are central tenets of microbial ecology. The development and application of cultivation independent molecular tools has allowed for rapid surveying of microbial community composition at unprecedented resolutions and frequencies. There is a growing need to discern robust patterns and relationships within these datasets which provide insight into microbial ecology. Pearson correlation coefficient (PCC) analysis is commonly used for identifying the linear relationship between two species, or species and environmental factors. However, this approach may not be able to capture more complex interactions which occur in situ; thus, alternative analyses were explored. RESULTS In this paper we introduced local similarity analysis (LSA), which is a technique that can identify more complex dependence associations among species as well as associations between species and environmental factors without requiring significant data reduction. To illustrate its capability of identifying relationships that may not otherwise be identified by PCC, we first applied LSA to simulated data. We then applied LSA to a marine microbial observatory dataset and identified unique, significant associations that were not detected by PCC analysis. LSA results, combined with results from PCC analysis were used to construct a theoretical ecological network which allows for easy visualization of the most significant associations. Biological implications of the significant associations detected by LSA were discussed. We also identified additional applications where LSA would be beneficial. AVAILABILITY The algorithms are implemented in Splus/R and they are available upon request from the corresponding author.

[1]  E. Triplett,et al.  Automated Approach for Ribosomal Intergenic Spacer Analysis of Microbial Diversity and Its Application to Freshwater Bacterial Communities , 1999, Applied and Environmental Microbiology.

[2]  Jed A Fuhrman,et al.  Coupling 16S-ITS rDNA clone libraries and automated ribosomal intergenic spacer analysis to show marine microbial diversity: development and application to a time series. , 2005, Environmental microbiology.

[3]  I. Hewson,et al.  Annually reoccurring bacterial communities are predictable from ocean conditions , 2006, Proceedings of the National Academy of Sciences.

[4]  Gerard Muyzer,et al.  Bacterial activity and genetic richness along an estuarine gradient (Rhone River plume, France) , 2002 .

[5]  J. Thioulouse,et al.  Characterization of Bacterial and Fungal Soil Communities by Automated Ribosomal Intergenic Spacer Analysis Fingerprints: Biological and Methodological Variability , 2001, Applied and Environmental Microbiology.

[6]  Eyke Hüllermeier,et al.  Clustering of gene expression data using a local shape-based similarity measure , 2005, Bioinform..

[7]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[8]  Ker-Chau Li,et al.  Genome-wide coexpression dynamics: Theory and application , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[9]  A. Uitterlinden,et al.  Profiling of complex microbial populations by denaturing gradient gel electrophoresis analysis of polymerase chain reaction-amplified genes coding for 16S rRNA , 1993, Applied and environmental microbiology.

[10]  C. Brunk,et al.  A molecular technique for identification of bacteria using small subunit ribosomal RNA sequences. , 1994, BioTechniques.

[11]  Fengzhu Sun,et al.  A dynamic programming algorithm for binning microbial community profiles , 2006, Bioinform..

[12]  John D. Storey,et al.  Statistical significance for genomewide studies , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[13]  B. V. Van Mooy,et al.  Relationship between bacterial community structure, light, and carbon cycling in the eastern subarctic North Pacific , 2004 .

[14]  N. Pace,et al.  The Analysis of Natural Microbial Populations by Ribosomal RNA Sequences , 1986 .

[15]  Hans H. Cheng,et al.  Characterization of microbial diversity by determining terminal restriction fragment length polymorphisms of genes encoding 16S rRNA , 1997, Applied and environmental microbiology.

[16]  Susan R. Wilson INTRODUCTION TO COMPUTATIONAL BIOLOGY: MAPS, SEQUENCES AND GENOMES. , 1996 .

[17]  Daniel B. Oerther,et al.  Use of 16S rRNA Gene Terminal Restriction Fragment Analysis To Assess the Impact of Solids Retention Time on the Bacterial Diversity of Activated Sludge , 2005, Applied and Environmental Microbiology.

[18]  Eric W. Triplett,et al.  Within- and between-Lake Variability in the Composition of Bacterioplankton Communities: Investigations Using Multiple Spatial Scales , 2004, Applied and Environmental Microbiology.

[19]  Robert Tibshirani,et al.  Statistical Significance for Genome-Wide Experiments , 2003 .

[20]  Eric W. Triplett,et al.  Geographic and Environmental Sources of Variation in Lake Bacterial Community Composition , 2005, Applied and Environmental Microbiology.

[21]  Jed A. Fuhrman,et al.  Richness and Diversity of Bacterioplankton Species along an Estuarine Gradient in Moreton Bay, Australia , 2004, Applied and Environmental Microbiology.

[22]  M. Gerstein,et al.  Beyond synexpression relationships: local clustering of time-shifted and inverted gene expression profiles identifies new, biologically relevant interactions. , 2001, Journal of molecular biology.

[23]  Mary Ann Moran,et al.  Covariance of bacterioplankton composition and environmental variables in a temperate delta system , 2003 .