Advancement in bioinformatics and microarray-based technologies for genome sequence analysis and its application in bioremediation of soil and water pollutants

Abstract The microarray is an excellent platform for the understanding of complex regulatory as well as functional mechanisms operating in various cellular processes and, hence, microarray significantly contributing to human welfare. However, voluminous data generated following to microarray, requiring a series of consecutive-analyses to extract useful information to draw any conclusion. Moreover, bioinformatics is an emerging tool advocated for the generation of valuable information’s from large datasets, such as microarray, RNA sequencing, next-generation sequencing, proteomics, transcriptomics. Therefore, to make these experiments interpretable, a series of algorithms must be applied. Thus, in recent decades, microarray in combination with bioinformatics has transformed the field of biology by enhancing our fundamental understanding of how genome shapes the organism. In this chapter, we discuss key concepts, tools, software, and databases used in bioinformatics for the interpretation of large datasets. Furthermore, some fundamental issues associated with sequence analyses, transcriptomic studies, computational proteomics and metabolomics, bioontologies, and biological databases covered to gain our fundamental understanding of the general approaches and algorithms currently employed in the microarray. Today’s world is suffering from various environmental pollutants that impose a negative impact on our lifestyles. Therefore an economical and eco-friendly way is urgently calling the attention of scientists. Recently, bioremediation emerges as a hopeful tool, which utilizes microbes to clean our environment. The microarray and bioinformatics approach could be applied for the identification and characterization of novel microbes, which will serve as an inoculum in wastes bioremediation. However, this field is remaining elusive because of the complex biological activities of these microbes, and a better knowledge of their interactions could only be possible through microarray and bioinformatics. Several molecular and bioinformatic tools such as genomics, transcriptomics, translatomics, proteomics, evogenimics, metagenomics, metabolomics are used to gather information on such interaction. In this chapter, we discuss the role of microarray-based technologies and bioinformatics in the bioremediation of wastes.

[1]  C. Nusbaum,et al.  Large-scale identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome. , 1998, Science.

[2]  G. Church,et al.  Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitation , 1998, Nature Biotechnology.

[3]  Jian-long Wang,et al.  Characteristics of Zn2+ biosorption by Saccharomyces cerevisiae. , 2007, Biomedical and environmental sciences : BES.

[4]  D. Eisenberg,et al.  Assigning protein functions by comparative genome analysis: protein phylogenetic profiles. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[5]  J. Berg Genome sequence of the nematode C. elegans: a platform for investigating biology. , 1998, Science.

[6]  P. Bruheim,et al.  The potential of metabolomics tools in bioremediation studies. , 2007, Omics : a journal of integrative biology.

[7]  R. Chiodini,et al.  The impact of next-generation sequencing on genomics. , 2011, Journal of genetics and genomics = Yi chuan xue bao.

[8]  K. Miura,et al.  Quantitative assessment of DNA microarrays--comparison with Northern blot analyses. , 2001, Genomics.

[9]  Ronald W. Davis,et al.  Functional characterization of the S. cerevisiae genome by gene deletion and parallel analysis. , 1999, Science.

[10]  Ronald W. Davis,et al.  Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray , 1995, Science.

[11]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[12]  Eoin L. Brodie,et al.  Field Evidence for Co-Metabolism of Trichloroethene Stimulated by Addition of Electron Donor to Groundwater , 2010 .

[14]  Ronald W. Davis,et al.  A genome-wide transcriptional analysis of the mitotic cell cycle. , 1998, Molecular cell.

[15]  G. Parmigiani,et al.  The Consensus Coding Sequences of Human Breast and Colorectal Cancers , 2006, Science.

[16]  Etienne Yergeau,et al.  Metagenomic Analysis of the Bioremediation of Diesel-Contaminated Canadian High Arctic Soils , 2012, PloS one.

[17]  Timothy B. Stockwell,et al.  The Sequence of the Human Genome , 2001, Science.

[18]  Jaehoon Yu,et al.  High-throughput profiling of peptide-RNA interactions using peptide microarrays. , 2012, Journal of the American Chemical Society.

[19]  Victor E. Velculescu,et al.  High-throughput gene expression analysis using SAGE , 1998 .

[20]  Ye Deng,et al.  GeoChip-based analysis of functional microbial communities during the reoxidation of a bioreduced uranium-contaminated aquifer. , 2009, Environmental microbiology.

[21]  Semyon Kruglyak,et al.  Multistage Sequencing by Hybridization , 1998, J. Comput. Biol..

[22]  R. Bruskiewich,et al.  Gene expression microarrays and their application in drought stress research. , 2006 .

[23]  Gertraud Burger,et al.  AutoFACT: An Automatic Functional Annotation and Classification Tool , 2005, BMC Bioinformatics.

[24]  K. Kinzler,et al.  Serial Analysis of Gene Expression , 1995, Science.

[25]  H. Melhus,et al.  A microarray minisequencing system for pharmacogenetic profiling of antihypertensive drug response. , 2003, Pharmacogenetics.

[26]  J. Waring,et al.  Identifying toxic mechanisms using DNA microarrays: evidence that an experimental inhibitor of cell adhesion molecule expression signals through the aryl hydrocarbon nuclear receptor. , 2002, Toxicology.

[27]  Joshua M. Korn,et al.  Association between microdeletion and microduplication at 16p11.2 and autism. , 2008, The New England journal of medicine.

[28]  J. T. Madison,et al.  Structure of a Ribonucleic Acid , 1965, Science.

[29]  F. Sanger,et al.  Nucleotide sequence of bacteriophage φX174 DNA , 1977, Nature.

[30]  R. Stoughton Applications of DNA microarrays in biology. , 2005, Annual review of biochemistry.

[31]  M. Fulekar,et al.  Bioremediation of Persistent Pesticides in Rice field Soil Environment Using Surface Soil Treatment Reactor , 2015 .

[32]  Gary L. Andersen,et al.  High-Density Universal 16S rRNA Microarray Analysis Reveals Broader Diversity than Typical Clone Library When Sampling the Environment , 2007, Microbial Ecology.

[33]  É. Yergeau,et al.  Identification of Nitrogen-Incorporating Bacteria in Petroleum-Contaminated Arctic Soils by Using [15N]DNA-Based Stable Isotope Probing and Pyrosequencing , 2011, Applied and Environmental Microbiology.

[34]  A. Aharoni,et al.  DNA microarrays for functional plant genomics , 2004, Plant Molecular Biology.

[35]  E. Carrilho,et al.  DNA sequencing by capillary array electrophoresis and microfabricated array systems , 2000, Electrophoresis.

[36]  G. Schuler,et al.  Entrez: molecular biology database and retrieval system. , 1996, Methods in enzymology.

[37]  Xifeng Chen,et al.  ChIP-Seq: A Powerful Tool for Studying Protein-DNA Interactions in Plants. , 2018, Current issues in molecular biology.

[38]  S. P. Fodor,et al.  Light-generated oligonucleotide arrays for rapid DNA sequence analysis. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[39]  S. Drăghici,et al.  Analysis of microarray experiments of gene expression profiling. , 2006, American journal of obstetrics and gynecology.

[40]  P. Alvarez,et al.  Hexahydro-1,3,5-trinitro-1,3,5-triazine (RDX) degradation by Acetobacterium paludosum , 2005, Biodegradation.

[41]  Bethann S. Hromatka,et al.  Transcriptional response of Candida albicans to nitric oxide and the role of the YHB1 gene in nitrosative stress and virulence. , 2005, Molecular biology of the cell.

[42]  Melanie E. Goward,et al.  The DNA sequence of human chromosome 22 , 1999, Nature.

[43]  A. Ryo,et al.  Use of serial analysis of gene expression (SAGE) technology. , 2001, Journal of immunological methods.

[44]  M. Bittner,et al.  Expression profiling using cDNA microarrays , 1999, Nature Genetics.

[45]  Jizhong Zhou Microarrays for bacterial detection and microbial community analysis. , 2003, Current opinion in microbiology.

[46]  Takuro Tamura,et al.  Formal design and implementation of an improved DDBJ DNA database with a new schema and object-oriented library , 1998, Bioinform..

[47]  F. Crick,et al.  Molecular Structure of Nucleic Acids: A Structure for Deoxyribose Nucleic Acid , 1953, Nature.

[48]  W. Aehle,et al.  Construction of stabilized proteins by combinatorial consensus mutagenesis. , 2004, Protein engineering, design & selection : PEDS.

[49]  P. Brown,et al.  Parallel human genome analysis: microarray-based expression monitoring of 1000 genes. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[50]  M. Uttamchandani,et al.  Developing a Strategy for Activity‐Based Detection of Enzymes in a Protein Microarray , 2003, ChemBioChem.

[51]  R. Boopathy,et al.  Biotransformation of 2,4,6-trinitrotoluene (TNT) by a Methanococcus sp. (strain B) isolated from a lake sediment. , 1994, Canadian journal of microbiology.

[52]  L. Samson,et al.  Global response of Saccharomyces cerevisiae to an alkylating agent. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[53]  Paolo Fortina,et al.  Parallel molecular genetic analysis , 1998, European Journal of Human Genetics.

[54]  Kenny Q. Ye,et al.  Large-Scale Copy Number Polymorphism in the Human Genome , 2004, Science.

[55]  Harry R Beller,et al.  A real-time polymerase chain reaction method for monitoring anaerobic, hydrocarbon-degrading bacteria based on a catabolic gene. , 2002, Environmental science & technology.

[56]  Stephen M. Mount,et al.  The genome sequence of Drosophila melanogaster. , 2000, Science.

[57]  R. Chakraborty,et al.  Systems biology approach to bioremediation. , 2012, Current opinion in biotechnology.

[58]  R. Knight,et al.  Fast UniFrac: Facilitating high-throughput phylogenetic analyses of microbial communities including analysis of pyrosequencing and PhyloChip data , 2009, The ISME Journal.

[59]  M. Mehrasbi,et al.  Highly cadmium tolerant fungi: their tolerance and removal potential , 2015, Journal of Environmental Health Science and Engineering.

[60]  M. Levitt The birth of computational structural biology , 2001, Nature Structural Biology.

[61]  J. Trevors,et al.  Microbial gene expression in soil: methods, applications and challenges. , 2005, Journal of microbiological methods.

[62]  Rick L. Stevens,et al.  The RAST Server: Rapid Annotations using Subsystems Technology , 2008, BMC Genomics.

[63]  M. D. Aitken,et al.  Characteristics of phenanthrene-degrading bacteria isolated from soils contaminated with polycyclic aromatic hydrocarbons. , 1998, Canadian journal of microbiology.

[64]  Kenny Q. Ye,et al.  Strong Association of De Novo Copy Number Mutations with Autism , 2007, Science.

[65]  Mahesh Uttamchandani,et al.  Applications of microarrays in pathogen detection and biodefence , 2008, Trends in Biotechnology.

[66]  Complete Genome Sequence of Arthrobacter sp. Strain LS16, Isolated from Agricultural Soils with Potential for Applications in Bioremediation and Bioproducts , 2016, Genome Announcements.

[67]  T. Kudo,et al.  A Gram-positive Polychlorinated Biphenyl-degrading Bacterium, Rhodococcus erythropolis Strain TA421, Isolated from a Termite Ecosystem , 1994 .

[68]  E. Southern Detection of specific sequences among DNA fragments separated by gel electrophoresis. , 1975, Journal of molecular biology.

[69]  S Brunak,et al.  A DNA structural atlas for Escherichia coli. , 2000, Journal of molecular biology.

[70]  I. Dubus,et al.  Pesticides in rainfall in Europe. , 2000, Environmental pollution.

[71]  D. Relman,et al.  Using DNA microarrays to study host-microbe interactions. , 2000, Emerging infectious diseases.

[72]  Evelyn Camon,et al.  The EMBL Nucleotide Sequence Database , 2004, Nucleic acids research.

[73]  B. Singh,et al.  Biodegradation of Chlorpyrifos by Enterobacter Strain B-14 and Its Use in Bioremediation of Contaminated Soils , 2004, Applied and Environmental Microbiology.

[74]  N. Stralis-Pavese,et al.  Optimization of diagnostic microarray for application in analysing landfill methanotroph communities under different plant covers. , 2004, Environmental microbiology.

[75]  Jürgen Sühnel,et al.  GenColors: accelerated comparative analysis and annotation of prokaryotic genomes at various stages of completeness , 2005, Bioinform..

[76]  S. Steigrad Epidemiology of gestational trophoblastic diseases. , 2003, Best practice & research. Clinical obstetrics & gynaecology.

[77]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[78]  Yongyan Wang,et al.  Microarray analysis of gene expression on herbal glycoside recipes improving deficient ability of spatial learning memory in ischemic mice , 2004, Journal of neurochemistry.

[79]  L. Setti,et al.  Biodegradation of dibenzothiophene by a nodulating isolate of Rhizobium meliloti. , 1998 .

[80]  R. Drmanac,et al.  Accurate sequencing by hybridization for DNA diagnostics and individual genomics , 1998, Nature Biotechnology.

[81]  Mark Schena,et al.  Trends in microarray analysis , 2003, Nature Medicine.

[82]  Ivan G. Costa,et al.  Detecting differential peaks in ChIP-seq signals with ODIN , 2014, Bioinform..

[83]  E. Diamandis,et al.  Rapid sequencing of the p53 gene with a new automated DNA sequencer. , 1998, Clinical chemistry.

[84]  R. Varshney,et al.  Genomics-assisted breeding for crop improvement. , 2005, Trends in plant science.

[85]  J. Hughes,et al.  Biodegradation pathways of hexahydro-1,3,5-trinitro-1,3,5-triazine (RDX) by Clostridium acetobutylicum cell-free extract. , 2003, Chemosphere.

[86]  A. Pardee,et al.  Differential display of eukaryotic messenger RNA by means of the polymerase chain reaction. , 1992, Science.

[87]  Milan Mrksich,et al.  Peptide Arrays Identify Isoform-Selective Substrates for Profiling Endogenous Lysine Deacetylase Activity , 2010, ACS chemical biology.

[88]  G. Pertea,et al.  Cross-referencing eukaryotic genomes: TIGR Orthologous Gene Alignments (TOGA). , 2002, Genome research.

[89]  Narmada Thanki,et al.  CDD: a Conserved Domain Database for the functional annotation of proteins , 2010, Nucleic Acids Res..

[90]  Yuki Moriya,et al.  KAAS: an automatic genome annotation and pathway reconstruction server , 2007, Nucleic Acids Res..

[91]  M. Bayer,et al.  Chromatin state analysis of the barley epigenome reveals a higher‐order structure defined by H3K27me1 and H3K27me3 abundance , 2015, The Plant journal : for cell and molecular biology.

[92]  G. Andersen,et al.  Bacterial Diversity Analysis of Huanglongbing Pathogen-Infected Citrus, Using PhyloChip Arrays and 16S rRNA Gene Clone Library Sequencing , 2009, Applied and Environmental Microbiology.

[93]  Raghavendra P. Singh,et al.  Comparative Study on Bioremediation for Oil Spills Using Microbes , 2019 .

[94]  J. CherlysInfante Removal of lead, mercury and nickel using the yeast Saccharomyces cerevisiae , 2014 .

[95]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[96]  Christopher W. Schadt,et al.  Microarray-Based Analysis of Subnanogram Quantities of Microbial Community DNAs by Using Whole-Community Genome Amplification , 2006, Applied and Environmental Microbiology.

[97]  Raja Jothi,et al.  Genome-wide identification of in vivo protein–DNA binding sites from ChIP-Seq data , 2008, Nucleic acids research.

[98]  J. Banfield,et al.  Direct Microbial Reduction and Subsequent Preservation of Uranium in Natural Near-Surface Sediment , 2005, Applied and Environmental Microbiology.

[99]  P. Brown,et al.  DNA arrays for analysis of gene expression. , 1999, Methods in enzymology.

[100]  K. Vrana,et al.  Cocaine-responsive gene expression changes in rat hippocampus , 2001, Neuroscience.

[101]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[102]  Christine Hackel,et al.  A transcript finishing initiative for closing gaps in the human transcriptome. , 2004, Genome research.

[103]  Heribert Insam,et al.  Design and application of an oligonucleotide microarray for the investigation of compost microbial communities. , 2005, Journal of microbiological methods.

[104]  Y. Suneetha,et al.  Review Article: Current Knowledge on Microarray Technology - An Overview , 2012 .

[105]  R. Fleischmann,et al.  Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. , 1995, Science.

[106]  Jizhong Zhou,et al.  Microarray Applications in Microbial Ecology Research , 2006, Microbial Ecology.

[107]  Michael J Taussig,et al.  Printing protein arrays from DNA arrays , 2008, Nature Methods.

[108]  Lee Bennett,et al.  Genomic interrogation of mechanism(s) underlying cellular responses to toxicants. , 2002, Toxicology.

[109]  P. Brown,et al.  Exploring the metabolic and genetic control of gene expression on a genomic scale. , 1997, Science.

[110]  K. Shinozaki,et al.  Monitoring Expression Profiles of Rice Genes under Cold, Drought, and High-Salinity Stresses and Abscisic Acid Application Using cDNA Microarray and RNA Gel-Blot Analyses1[w] , 2003, Plant Physiology.

[111]  M. Levitt Detailed Molecular Model for Transfer Ribonucleic Acid , 1969, Nature.

[112]  S. P. Fodor,et al.  High density synthetic oligonucleotide arrays , 1999, Nature Genetics.

[113]  Baohua Gu,et al.  GeoChip: a comprehensive microarray for investigating biogeochemical, ecological and environmental processes , 2007, The ISME Journal.

[114]  Stefan Engelen,et al.  MicroScope: a platform for microbial genome annotation and comparative genomics , 2009, Database J. Biol. Databases Curation.

[115]  L. Farinelli,et al.  Chromatin immunoprecipitation (ChIP) of plant transcription factors followed by sequencing (ChIP-SEQ) or hybridization to whole genome arrays (ChIP-CHIP) , 2010, Nature Protocols.

[116]  P Murray-Rust Bioinformatics and drug discovery. , 1994, Current opinion in biotechnology.

[117]  M. Ananthi,et al.  Polycyclic Aromatic Hydrocarbons (PAHs) Biodegradation by Basidiomycetes Fungi, Pseudomonas Isolate, and Their Cocultures: Comparative In Vivo and In Silico Approach , 2008, Applied biochemistry and biotechnology.

[118]  A. A. Hussein,et al.  Degradation of Oxyfluorfen Herbicide by Soil Microorganisms Biodegradation of Herbicides , 2011 .

[119]  B. Lal,et al.  Degradation of crude oil by Acinetobacter calcoaceticus and Alcaligenes odorans. , 1996, The Journal of applied bacteriology.

[120]  W. Gilbert,et al.  A new method for sequencing DNA. , 1977, Proceedings of the National Academy of Sciences of the United States of America.

[121]  Michael R. Green,et al.  Dissecting the Regulatory Circuitry of a Eukaryotic Genome , 1998, Cell.

[122]  C. Navarro,et al.  Heavy metal resistance strategies of acidophilic bacteria and their acquisition: importance for biomining and bioremediation. , 2013, Biological research.

[123]  U. Ijah Studies on relative capabilities of bacterial and yeast isolates from tropical soil in degrading crude oil , 1998 .

[124]  Colin N. Dewey,et al.  Initial sequencing and comparative analysis of the mouse genome. , 2002 .

[125]  Jizhong Zhou,et al.  Detection of Genes Involved in Biodegradation and Biotransformation in Microbial Communities by Using 50-Mer Oligonucleotide Microarrays , 2004, Applied and Environmental Microbiology.

[126]  William Pennie,et al.  Toxicogenomics in risk assessment: an overview of an HESI collaborative research program. , 2004, Environmental health perspectives.

[127]  Christian A. Rees,et al.  Molecular portraits of human breast tumours , 2000, Nature.

[128]  Alfonso Valencia,et al.  MetaRouter: bioinformatics for bioremediation , 2004, Nucleic Acids Res..

[129]  W. Pennie Custom cDNA microarrays; technologies and applications. , 2002, Toxicology.

[130]  A. Mortazavi,et al.  Genome-Wide Mapping of in Vivo Protein-DNA Interactions , 2007, Science.

[131]  J. Dussán,et al.  Genome sequence and description of the mosquitocidal and heavy metal tolerant strain Lysinibacillus sphaericus CBAM5 , 2015, Standards in genomic sciences.

[132]  G. Sayler,et al.  Use of gene probes to assess the impact and effectiveness of aerobic in situ bioremediation of TCE , 2009, Archives of Microbiology.

[133]  Michael Wagner,et al.  Unravelling Microbial Communities with DNA-Microarrays: Challenges and Future Directions , 2007, Microbial Ecology.

[134]  Fernando A. Villanea,et al.  Diet and the evolution of human amylase gene copy number variation , 2007, Nature Genetics.

[135]  R. B. Merrifield Solid Phase Synthesis (Nobel Lecture) , 1985 .

[136]  Dorothea K. Thompson,et al.  Challenges in applying microarrays to environmental studies. , 2002, Current opinion in biotechnology.

[137]  C. Vázquez,et al.  Tolerance and uptake of heavy metals by Trichoderma atroviride isolated from sludge. , 2003, Chemosphere.

[138]  D. Heller Molar Pregnancies , 2021, Benirschke's Pathology of the Human Placenta.

[139]  S. Friend,et al.  How DNA microarrays and expression profiling will affect clinical practice , 1999, BMJ.

[140]  Ravi Naidu,et al.  The use of molecular techniques to characterize the microbial communities in contaminated soil and water. , 2008, Environment international.

[141]  J. Lupski,et al.  Genomic rearrangements and sporadic disease , 2007, Nature Genetics.

[142]  R. Naidu,et al.  Bioremediation approaches for organic pollutants: a critical perspective. , 2011, Environment international.

[143]  John J. Wyrick,et al.  Ctk Complex-Mediated Regulation of Histone Methylation by COMPASS , 2006, Molecular and Cellular Biology.