Gene-targeted-metagenomics reveals extensive diversity of aromatic dioxygenase genes in the environment

Understanding the relationship between gene diversity and function for important environmental processes is a major ecological research goal. We applied gene-targeted metagenomics and pyrosequencing to aromatic dioxygenase genes to obtain greater sequence depth than possible by other methods. A polymerase chain reaction (PCR) primer set designed to target a 524-bp region that confers substrate specificity of biphenyl dioxygenases yielded 2000 and 604 sequences from the 5′ and 3′ ends of PCR products, respectively, which passed our validity criteria. Sequence alignment showed three known conserved residues, as well as another seven conserved residues not reported earlier. Of the valid sequences, 95% and 41% were assigned to 22 and 3 novel clusters in that they did not include any earlier reported sequences at 0.6 distance by complete linkage clustering for sequenced regions. The greater diversity revealed by this gene-targeted approach provides deeper insights into genes potentially important in environmental processes to better understand their ecology, functional differences and evolutionary origins. We also provide criteria for primer design for this approach, as well as guidance for data processing of diverse functional genes, as gene databases for most genes of environmental relevance are limited.

[1]  F. Mondello,et al.  Identification and modification of biphenyl dioxygenase sequences that determine the specificity of polychlorinated biphenyl degradation , 1997, Applied and environmental microbiology.

[2]  J. Rothberg,et al.  The development and impact of 454 sequencing , 2008, Nature Biotechnology.

[3]  James R. Knight,et al.  Genome sequencing in microfabricated high-density picolitre reactors , 2005, Nature.

[4]  C. Bagnéris,et al.  Subtle Difference between Benzene and Toluene Dioxygenases of Pseudomonas putida , 2005, Applied and Environmental Microbiology.

[5]  Susan M. Huse,et al.  Microbial diversity in the deep sea and the underexplored “rare biosphere” , 2006, Proceedings of the National Academy of Sciences.

[6]  D. Gibson,et al.  Substrate Specificity of Naphthalene Dioxygenase: Effect of Specific Amino Acids at the Active Site of the Enzyme , 2000, Journal of bacteriology.

[7]  T. Wood,et al.  Protein engineering of toluene-o-xylene monooxygenase from Pseudomonas stutzeri OX1 for enhanced chlorinated ethene degradation and o-xylene oxidation , 2005, Applied Microbiology and Biotechnology.

[8]  D. Gibson,et al.  Aromatic hydrocarbon dioxygenases in environmental biotechnology. , 2000, Current opinion in biotechnology.

[9]  M. Fukuda,et al.  Crystal structure of the terminal oxygenase component of biphenyl dioxygenase derived from Rhodococcus sp. strain RHA1. , 2004, Journal of molecular biology.

[10]  L. Holm,et al.  The Pfam protein families database , 2005, Nucleic Acids Res..

[11]  D. Barriault,et al.  Diversity of the C-Terminal Portion of the Biphenyl Dioxygenase Large Subunit , 2008, Journal of Molecular Microbiology and Biotechnology.

[12]  Y.-M. Cheng,et al.  Estimating residue evolutionary conservation by introducing von Neumann entropy and a novel gap-treating approach , 2007, Amino Acids.

[13]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[14]  David P. Nagle,et al.  Polychlorinated Biphenyl (PCB)-Degrading Bacteria Associated with Trees in a PCB-Contaminated Site , 2006, Applied and Environmental Microbiology.

[15]  Ngadiman,et al.  Alteration of Regiospecificity in Biphenyl Dioxygenase by Active-Site Engineering , 2002, Journal of bacteriology.

[16]  J. Handelsman,et al.  Introducing DOTUR, a Computer Program for Defining Operational Taxonomic Units and Estimating Species Richness , 2005, Applied and Environmental Microbiology.

[17]  Sean R. Eddy,et al.  Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids , 1998 .

[18]  Jizhong Zhou,et al.  Biphenyl-utilizing bacteria and their functional genes in a pine root zone contaminated with polychlorinated biphenyls (PCBs) , 2007, The ISME Journal.

[19]  E. Fukuda,et al.  Crystal Structure of the Terminal Oxygenase Component of Cumene Dioxygenase from Pseudomonas fluorescens IP01 , 2005, Journal of bacteriology.

[20]  Susan M. Huse,et al.  Microbial Population Structures in the Deep Marine Biosphere , 2007, Science.

[21]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[22]  Robert D. Finn,et al.  The Pfam protein families database , 2004, Nucleic Acids Res..

[23]  H. Eklund,et al.  Crystal Structure of Naphthalene Dioxygenase: Side-on Binding of Dioxygen to Iron , 2003, Science.