A dynamic programming algorithm for binning microbial community profiles

MOTIVATION A number of community profiling approaches have been widely used to study the microbial community composition and its variations in environmental ecology. Automated Ribosomal Intergenic Spacer Analysis (ARISA) is one such technique. ARISA has been used to study microbial communities using 16S-23S rRNA intergenic spacer length heterogeneity at different times and places. Owing to errors in sampling, random mutations in PCR amplification, and probably mostly variations in readings from the equipment used to analyze fragment sizes, the data read directly from the fragment analyzer should not be used for down stream statistical analysis. No optimal data preprocessing methods are available. A commonly used approach is to bin the reading lengths of the 16S-23S intergenic spacer. We have developed a dynamic programming algorithm based binning method for ARISA data analysis which minimizes the overall differences between replicates from the same sampling location and time. RESULTS In a test example from an ocean time series sampling program, data preprocessing identified several outliers which upon re-examination were found to be because of systematic errors. Clustering analysis of the ARISA from different times based on the dynamic programming algorithm binned data revealed important features of the biodiversity of the microbial communities.

[1]  C. Kaplan,et al.  Variation between observed and true Terminal Restriction Fragment length is dependent on true TRF length and purine content. , 2003, Journal of microbiological methods.

[2]  Eric W. Triplett,et al.  Within- and between-Lake Variability in the Composition of Bacterioplankton Communities: Investigations Using Multiple Spatial Scales , 2004, Applied and Environmental Microbiology.

[3]  Eric W. Triplett,et al.  Geographic and Environmental Sources of Variation in Lake Bacterial Community Composition , 2005, Applied and Environmental Microbiology.

[4]  Jed A. Fuhrman,et al.  Richness and Diversity of Bacterioplankton Species along an Estuarine Gradient in Moreton Bay, Australia , 2004, Applied and Environmental Microbiology.

[5]  Gerard Muyzer,et al.  Bacterial activity and genetic richness along an estuarine gradient (Rhone River plume, France) , 2002 .

[6]  S. Giovannoni,et al.  Bias caused by template annealing in the amplification of mixtures of 16S rRNA genes by PCR , 1996, Applied and environmental microbiology.

[7]  Vic Barnett,et al.  Outliers in Statistical Data , 1980 .

[8]  I. Hewson,et al.  Improved Strategy for Comparing Microbial Assemblage Fingerprints , 2006, Microbial Ecology.

[9]  E. Triplett,et al.  Automated Approach for Ribosomal Intergenic Spacer Analysis of Microbial Diversity and Its Application to Freshwater Bacterial Communities , 1999, Applied and Environmental Microbiology.

[10]  Jed A Fuhrman,et al.  Coupling 16S-ITS rDNA clone libraries and automated ribosomal intergenic spacer analysis to show marine microbial diversity: development and application to a time series. , 2005, Environmental microbiology.

[11]  A. Uitterlinden,et al.  Profiling of complex microbial populations by denaturing gradient gel electrophoresis analysis of polymerase chain reaction-amplified genes coding for 16S rRNA , 1993, Applied and environmental microbiology.

[12]  Hans H. Cheng,et al.  Characterization of microbial diversity by determining terminal restriction fragment length polymorphisms of genes encoding 16S rRNA , 1997, Applied and environmental microbiology.

[13]  Christian Winter,et al.  Horizontal and vertical complexity of attached and free‐living bacteria of the eastern Mediterranean Sea, determined by 16S rDNA and 16S rRNA fingerprints , 2001 .

[14]  Mary Ann Moran,et al.  Covariance of bacterioplankton composition and environmental variables in a temperate delta system , 2003 .

[15]  J. Borneman,et al.  Molecular microbial diversity in soils from eastern Amazonia: evidence for unusual microorganisms and microbial population shifts associated with deforestation , 1997, Applied and environmental microbiology.

[16]  C. Brunk,et al.  A molecular technique for identification of bacteria using small subunit ribosomal RNA sequences. , 1994, BioTechniques.