Whole Genome Shotgun Sequencing Based Taxonomic Profiling Methods for Comparative Study of Microbial Communities

[1]  Burton H. Bloom,et al.  Space/time trade-offs in hash coding with allowable errors , 1970, CACM.

[2]  N. Worathumrong,et al.  The Effect of o‐Salicylate upon Pentose Phosphate Pathway Activity in Normal and G6PD‐Deficient Red Cells , 1975, British journal of haematology.

[3]  Robert S. Boyer,et al.  A fast string searching algorithm , 1977, CACM.

[4]  Donald E. Knuth,et al.  Fast Pattern Matching in Strings , 1977, SIAM J. Comput..

[5]  D. Savage Microbial ecology of the gastrointestinal tract. , 1977, Annual review of microbiology.

[6]  W. H. King Isotope shift and configuration interaction in U I , 1979 .

[7]  A. Klinkhamer,et al.  [Digital subtraction angiography]. , 1981, Nederlands tijdschrift voor geneeskunde.

[8]  Eugene W. Myers,et al.  Suffix arrays: a new method for on-line string searches , 1990, SODA '90.

[9]  O. Kandler,et al.  Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Dan Gusfield Algorithms on Strings, Trees, and Sequences - Computer Science and Computational Biology , 1997 .

[11]  R. Amann,et al.  Microbial Community Composition of Wadden Sea Sediments as Revealed by Fluorescence In Situ Hybridization , 1998, Applied and Environmental Microbiology.

[12]  Giovanni Manzini,et al.  Opportunistic data structures with applications , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.

[13]  Thomas P. Curtis,et al.  Estimating prokaryotic diversity and its limits , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Yves Van de Peer,et al.  The European database on small subunit ribosomal RNA , 2002, Nucleic Acids Res..

[15]  Jillian F Banfield,et al.  Microbial communities in acid mine drainage. , 2003, FEMS microbiology ecology.

[16]  Mauro Leoncini,et al.  Approximation algorithms for a hierarchically structured bin packing problem , 2004, Inf. Process. Lett..

[17]  Wolfgang R Streit,et al.  Metagenomics--the key to the uncultured microbes. , 2004, Current opinion in microbiology.

[18]  O. White,et al.  Environmental Genome Shotgun Sequencing of the Sargasso Sea , 2004, Science.

[19]  Jillian F. Banfield,et al.  Community genomics in microbial ecology and evolution , 2005, Nature Reviews Microbiology.

[20]  Naryttza N. Diaz,et al.  The Subsystems Approach to Genome Annotation and its Use in the Project to Annotate 1000 Genomes , 2005, Nucleic acids research.

[21]  R. J. Sengwa,et al.  Study of dielectric relaxation and dipole moment of some hydrogen bonded solvent binary mixtures in 1,4-dioxane , 2006 .

[22]  Eoin L. Brodie,et al.  Greengenes, a Chimera-Checked 16S rRNA Gene Database and Workbench Compatible with ARB , 2006, Applied and Environmental Microbiology.

[23]  Adam Godzik,et al.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences , 2006, Bioinform..

[24]  James R. Cole,et al.  The ribosomal database project (RDP-II): introducing myRDP space and quality controlled public data , 2006, Nucleic Acids Res..

[25]  Knut Reinert,et al.  SeqAn An efficient, generic C++ library for sequence analysis , 2008, BMC Bioinformatics.

[26]  Alexander F. Auch,et al.  MEGAN analysis of metagenomic data. , 2007, Genome research.

[27]  D. Alland,et al.  A detailed analysis of 16S ribosomal RNA gene segments for the diagnosis of pathogenic bacteria. , 2007, Journal of microbiological methods.

[28]  R. Amils,et al.  Life in extreme environments , 2007 .

[29]  E. Birney,et al.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs. , 2008, Genome research.

[30]  Andreas Wilke,et al.  The metagenomics RAST server – a public resource for the automatic phylogenetic and functional analysis of metagenomes , 2008, BMC Bioinformatics.

[31]  R. Knight,et al.  Evolution of Mammals and Their Gut Microbes , 2008, Science.

[32]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[33]  Steven J. M. Jones,et al.  ABySS: a parallel assembler for short read sequence data , 2009, Genome research.

[34]  N. Warthmann,et al.  Simultaneous alignment of short reads against multiple genomes , 2009, Genome Biology.

[35]  Lu Wang,et al.  The NIH Human Microbiome Project. , 2009, Genome research.

[36]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[37]  Jouni Sirén,et al.  Compressed Suffix Arrays for Massive Data , 2009, SPIRE.

[38]  S. Salzberg,et al.  Phymm and PhymmBL: Metagenomic Phylogenetic Classification with Interpolated Markov Models , 2009, Nature Methods.

[39]  Faraz Hach,et al.  mrsFAST: a cache-oblivious algorithm for short-read mapping , 2010, Nature Methods.

[40]  William A. Walters,et al.  QIIME allows analysis of high-throughput community sequencing data , 2010, Nature Methods.

[41]  Richard Durbin,et al.  Fast and accurate long-read alignment with Burrows–Wheeler transform , 2010, Bioinform..

[42]  Knut Reinert,et al.  A novel and well-defined benchmarking method for second generation read mapping , 2011, BMC Bioinformatics.

[43]  Courtney J. Robinson,et al.  From Structure to Function: the Ecology of Host-Associated Microbial Communities , 2010, Microbiology and Molecular Biology Reviews.

[44]  Katherine D. McMahon,et al.  A Guide to the Natural History of Freshwater Lake Bacteria , 2011, Microbiology and Molecular Biology Reviews.

[45]  J. Gilbert,et al.  Metagenomics - a guide from sampling to data analysis , 2012, Microbial Informatics and Experimentation.

[46]  Giovanna Rosone,et al.  Lightweight BWT Construction for Very Large String Collections , 2011, CPM.

[47]  P. Bork,et al.  A Holistic Approach to Marine Eco-Systems Biology , 2011, PLoS biology.

[48]  Konstantinos T. Konstantinidis,et al.  Metagenomic Insights into the Evolution, Function, and Complexity of the Planktonic Microbial Community of Lake Lanier, a Temperate Freshwater Ecosystem , 2011, Applied and Environmental Microbiology.

[49]  Yasubumi Sakakibara,et al.  MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads , 2012, Nucleic acids research.

[50]  Scott Federhen,et al.  The NCBI Taxonomy database , 2011, Nucleic Acids Res..

[51]  Sergey I. Nikolenko,et al.  SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing , 2012, J. Comput. Biol..

[52]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[53]  Roderic Guigó,et al.  The GEM mapper: fast, accurate and versatile alignment by filtration , 2012, Nature Methods.

[54]  Susumu Goto,et al.  KEGG for integration and interpretation of large-scale molecular data sets , 2011, Nucleic Acids Res..

[55]  C. Huttenhower,et al.  Metagenomic microbial community profiling using unique clade-specific marker genes , 2012, Nature Methods.

[56]  Knut Reinert,et al.  RazerS 3: Faster, fully sensitive read mapping , 2012, Bioinform..

[57]  Sara Mitri,et al.  The genotypic view of social interactions in microbial communities. , 2013, Annual review of genetics.

[58]  C. Quince,et al.  Comparative metagenomic and rRNA microbial diversity characterization using archaeal and bacterial synthetic communities. , 2013, Environmental microbiology.

[59]  Maya Gokhale,et al.  Scalable metagenomic taxonomy classification using a reference genome database , 2013, Bioinform..

[60]  Michael Roberts,et al.  The MaSuRCA genome assembler , 2013, Bioinform..

[61]  Xiaohui Xie,et al.  Improving read mapping using additional prefix grams , 2014, BMC Bioinformatics.

[62]  Derrick E. Wood,et al.  Kraken: ultrafast metagenomic sequence classification using exact alignments , 2014, Genome Biology.

[63]  Alexandros Stamatakis,et al.  Metagenomic species profiling using universal phylogenetic marker genes , 2013, Nature Methods.

[64]  Chaochun Wei,et al.  NeSSM: A Next-Generation Sequencing Simulator for Metagenomics , 2013, PloS one.

[65]  P. Baldrian,et al.  The Variability of the 16S rRNA Gene in Bacterial Genomes and Its Consequences for Bacterial Community Analyses , 2013, PloS one.

[66]  Knut Reinert,et al.  Fast and accurate read mapping with approximate seeds and multiple backtracking , 2012, Nucleic acids research.

[67]  Quinn Snell,et al.  Pathoscope: Species identification and strain attribution with unassembled sequencing data , 2013, Genome research.

[68]  Knut Reinert,et al.  Lambda: the local aligner for massive biological data , 2014, Bioinform..

[69]  Claudio Bombardelli,et al.  Collision avoidance maneuver optimization , 2014 .

[70]  J. Zentek,et al.  The Influence of DNA Extraction Procedure and Primer Set on the Bacterial Community Analysis by Pyrosequencing of Barcoded 16S rRNA Gene Amplicons , 2014, Molecular biology international.

[71]  Knut Reinert,et al.  Journaled string tree - a scalable data structure for analyzing thousands of similar genomes on your laptop , 2014, Bioinform..

[72]  James R. Cole,et al.  Ribosomal Database Project: data and tools for high throughput rRNA analysis , 2013, Nucleic Acids Res..

[73]  M. Ferrer,et al.  Microbial diversity and metabolic networks in acid mine drainage habitats , 2015, Front. Microbiol..

[74]  Justin Chu,et al.  DIDA: Distributed Indexing Dispatched Alignment , 2015, PloS one.

[75]  The distribution, diversity, and importance of 16S rRNA gene introns in the order Thermoproteales , 2015, Biology Direct.

[76]  Paul P. Gardner,et al.  An evaluation of the accuracy and speed of metagenome analysis tools , 2015 .

[77]  S. Lonardi,et al.  CLARK: fast and accurate classification of metagenomic and genomic sequences using discriminative k-mers , 2015, BMC Genomics.

[78]  Enrico Siragusa,et al.  Approximate string matching for high-throughput sequencing , 2015 .

[79]  Bernhard Y. Renard,et al.  Metagenomic Profiling of Known and Unknown Microbes with MicrobeGPS , 2015, PloS one.

[80]  Kristy Deiner,et al.  Choice of capture and extraction methods affect detection of freshwater biodiversity from environmental DNA , 2015 .

[81]  Josh D. Neufeld,et al.  Current and future resources for functional metagenomics , 2015, Front. Microbiol..

[82]  W. Shu,et al.  Comparative metagenomic and metatranscriptomic analyses of microbial communities in acid mine drainage , 2014, The ISME Journal.

[83]  Duy Tin Truong,et al.  MetaPhlAn2 for enhanced metagenomic taxonomic profiling , 2015, Nature Methods.

[84]  James R. Cole,et al.  Reconstructing 16S rRNA genes in metagenomic data , 2015, Bioinform..

[85]  Yun Xu,et al.  BitMapper: an efficient all-mapper based on bit-vector computing , 2015, BMC Bioinformatics.

[86]  Ron Milo,et al.  Are We Really Vastly Outnumbered? Revisiting the Ratio of Bacterial to Host Cells in Humans , 2016, Cell.

[87]  Bernhard Y. Renard,et al.  DUDes: a top-down taxonomic profiler for metagenomics , 2016, Bioinform..

[88]  Peer Bork,et al.  Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees , 2016, Nucleic Acids Res..

[89]  W. Shu,et al.  Microbial communities, processes and functions in acid mine drainage ecosystems. , 2016, Current opinion in biotechnology.

[90]  A. Clooney,et al.  16S rRNA gene sequencing of mock microbial populations- impact of DNA extraction method, primer choice and sequencing platform , 2016, BMC Microbiology.

[91]  Lior Pachter,et al.  Near-optimal probabilistic RNA-seq quantification , 2016, Nature Biotechnology.

[92]  Steven L. Salzberg,et al.  Bracken: Estimating species abundance in metagenomics data , 2016 .

[93]  Bernhard Y. Renard,et al.  SLIMM: species level identification of microorganisms from metagenomes , 2017, PeerJ.

[94]  Knut Reinert,et al.  Flexbar 3.0 ‐ SIMD and multicore parallelization , 2017, Bioinform..

[95]  Bobbie Farsides,et al.  The UK’s 100,000 Genomes Project: manifesting policymakers’ expectations , 2017, New genetics and society.

[96]  R. DeSalle,et al.  Large-scale differences in microbial biodiversity discovery between 16S amplicon and shotgun sequencing , 2017, Scientific Reports.

[97]  Knut Reinert,et al.  The SeqAn C++ template library for efficient sequence analysis: A resource for programmers. , 2017, Journal of biotechnology.

[98]  Lior Pachter,et al.  Pseudoalignment for metagenomic read assignment , 2015, Bioinform..

[99]  N. Fierer Embracing the unknown: disentangling the complexities of the soil microbiome , 2017, Nature Reviews Microbiology.

[100]  Benjamin T James,et al.  MeShClust: an intelligent tool for clustering DNA sequences , 2018 .

[101]  H. Ochman,et al.  Unexplored Archaeal Diversity in the Great Ape Gut Microbiome , 2017, mSphere.

[102]  Phelim Bradley,et al.  Real-time search of all bacterial and viral genomic data , 2017, bioRxiv.

[103]  D. Huson,et al.  SILVA, RDP, Greengenes, NCBI and OTT — how do these taxonomies compare? , 2017, BMC Genomics.

[104]  Steven Salzberg,et al.  Short Read Mapping: An Algorithmic Tour , 2017, Proceedings of the IEEE.

[105]  K. Zengler,et al.  The social network of microorganisms — how auxotrophies shape complex communities , 2018, Nature Reviews Microbiology.

[106]  Robert D. Finn,et al.  EBI Metagenomics in 2017: enriching the analysis of microbial communities, from sequence reads to assemblies , 2017, Nucleic Acids Res..

[107]  K. Reinert,et al.  Formula Feeding Predisposes Neonatal Piglets to Clostridium difficile Gut Infection , 2018, The Journal of infectious diseases.

[108]  Rob Knight,et al.  Current understanding of the human microbiome , 2018, Nature Medicine.

[109]  Wen J. Li,et al.  RefSeq: an update on prokaryotic genome annotation and curation , 2017, Nucleic Acids Res..

[110]  Falk Hildebrand,et al.  Structure and function of the global topsoil microbiome , 2018, Nature.

[111]  Daniel N. Baker,et al.  KrakenUniq: confident and fast metagenomics classification using unique k-mer counts , 2018, Genome Biology.

[112]  Sergey Koren,et al.  RefSeq database growth influences the accuracy of k-mer-based lowest common ancestor species identification , 2018, Genome Biology.

[113]  Bernhard Y. Renard,et al.  ganon: continuously up-to-date with database growth for precise short read classification in metagenomics , 2018 .

[114]  M. Watson,et al.  The Madness of Microbiome: Attempting To Find Consensus “Best Practice” for 16S Microbiome Studies , 2018, Applied and Environmental Microbiology.

[115]  Andreas Andrusch,et al.  DREAM-Yara: An exact read mapper for very large databases with short update time , 2018, bioRxiv.

[116]  Julia Oh,et al.  ReprDB and panDB: minimalist databases with maximal microbial representation , 2017, Microbiome.