Pan-genomic analysis provides novel insights into the association of E.coli with human host and its minimal genome

MOTIVATION Bacteria can usually acquire certain advantageous genes that enable the bacteria to adapt to rapidly changing niches, thereby leading to a wide range of intraspecific genome content and genetic redundancy. The minimal genome of Escherichia coli, which is the most important bacterial species, and the association between E.coli and its human host are worthy of further exploration. RESULTS We used gene prediction and phylogenetic analysis to reveal a rich phylogenetic diversity among 491 E.coli strains and to reveal substantial differences between these strains with respect to gene number and genome length. We used pan-genomic analysis to accurately identify 867 core genes, in which only 243 genes are shared by essential genes. This analysis revealed that core genes mainly provide essential functions to the basic lifestyle of E.coli, and accessory genes are likely to confer selective advantages such as niche adaptation or the ability to colonize specific hosts. By association analysis, we found that E.coli strains in non-human hosts may more easily utilize foreign genetic materials to adapt to their surroundings, but the population in human hosts has higher demands for the control of population density, indicating that highly accurate quorum-sensing behavior is very important for harmony between E.coli and its human host. By considering core genes and previous deletions together, we proposed a potential direction for further reduction of the E.coli genome. AVAILABILITY AND IMPLEMENTATION The data, analysis process and detailed information on software tools used in this study are all available in the supplementary material. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

[1]  C. Hutchison,et al.  Minimal Cells-Real and Imagined. , 2017, Cold Spring Harbor perspectives in biology.

[2]  F. Blattner,et al.  Emergent Properties of Reduced-Genome Escherichia coli , 2006, Science.

[3]  C. Stoeckert,et al.  OrthoMCL: identification of ortholog groups for eukaryotic genomes. , 2003, Genome research.

[4]  H. Mizoguchi,et al.  Extensive Genomic Diversity in Pathogenic Escherichia coli and Shigella Strains Revealed by Comparative Genomic Hybridization Microarray , 2004, Journal of bacteriology.

[5]  Tamás Fehér,et al.  Low-mutation-rate, reduced-genome Escherichia coli: an improved host for faithful maintenance of engineered genetic constructs , 2012, Microbial Cell Factories.

[6]  Naotake Ogasawara,et al.  Genetic manipulations restored the growth fitness of reduced-genome Escherichia coli. , 2013, Journal of bioscience and bioengineering.

[7]  Alan McNally,et al.  Why prokaryotes have pangenomes , 2017, Nature Microbiology.

[8]  Andrew J. Page,et al.  Roary: rapid large-scale prokaryote pan genome analysis , 2015, bioRxiv.

[9]  Yu-Wei Wu,et al.  A pan-genome-based machine learning approach for predicting antimicrobial resistance activities of the Escherichia coli strains , 2018, Bioinform..

[10]  S. Gilbert,et al.  Eco-Evo-Devo: developmental symbiosis and developmental plasticity as evolutionary agents , 2015, Nature Reviews Genetics.

[11]  H. Mori,et al.  Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection , 2006, Molecular systems biology.

[12]  D. Relman,et al.  An ecological and evolutionary perspective on human–microbe mutualism and disease , 2007, Nature.

[13]  Guido Grandi,et al.  Large scale validation of an efficient CRISPR/Cas-based multi gene editing protocol in Escherichia coli , 2017, Microbial Cell Factories.

[14]  Torsten Seemann,et al.  Prokka: rapid prokaryotic genome annotation , 2014, Bioinform..

[15]  Stephen P. Diggle,et al.  Progress in and promise of bacterial quorum sensing research , 2017, Nature.

[16]  J. W. Campbell,et al.  Experimental Determination and System Level Analysis of Essential Genes in Escherichia coli MG1655 , 2003, Journal of bacteriology.

[17]  V. Sperandio,et al.  Interactions between the microbiota and pathogenic bacteria in the gut , 2016, Nature.

[18]  Huiyong Zhang,et al.  Pangenome Evidence for Higher Codon Usage Bias and Stronger Translational Selection in Core Genes of Escherichia coli , 2016, Front. Microbiol..

[19]  D. G. Gibson,et al.  Design and synthesis of a minimal bacterial genome , 2016, Science.

[20]  Hirofumi Honda,et al.  Oxidative stress sensitivity of engineered Escherichia coli cells with a reduced genome. , 2011, FEMS microbiology letters.

[21]  N. Loman,et al.  A complete bacterial genome assembled de novo using only nanopore sequencing data , 2015, Nature Methods.

[22]  Hiroshi Mizoguchi,et al.  Cell size and nucleoid organization of engineered Escherichia coli cells with a reduced genome , 2004, Molecular microbiology.

[23]  M. Vences,et al.  The Intestinal Microbiota of Tadpoles Differs from Those of Syntopic Aquatic Invertebrates , 2018, Microbial Ecology.

[24]  S. Koren,et al.  One chromosome, one contig: complete microbial genomes from long-read sequencing and assembly. , 2015, Current opinion in microbiology.

[25]  David W Ussery,et al.  Characterization of probiotic Escherichia coli isolates with a novel pan-genome microarray , 2007, Genome Biology.

[26]  Genetic redundancy is prevalent within the 6.7 Mb Sinorhizobium meliloti genome , 2015, Molecular Genetics and Genomics.