jMorp updates in 2020: large enhancement of multi-omics data resources on the general Japanese population

Abstract In the Tohoku Medical Megabank project, genome and omics analyses of participants in two cohort studies were performed. A part of the data is available at the Japanese Multi Omics Reference Panel (jMorp; https://jmorp.megabank.tohoku.ac.jp) as a web-based database, as reported in our previous manuscript published in Nucleic Acid Research in 2018. At that time, jMorp mainly consisted of metabolome data; however, now genome, methylome, and transcriptome data have been integrated in addition to the enhancement of the number of samples for the metabolome data. For genomic data, jMorp provides a Japanese reference sequence obtained using de novo assembly of sequences from three Japanese individuals and allele frequencies obtained using whole-genome sequencing of 8,380 Japanese individuals. In addition, the omics data include methylome and transcriptome data from ∼300 samples and distribution of concentrations of more than 755 metabolites obtained using high-throughput nuclear magnetic resonance and high-sensitivity mass spectrometry. In summary, jMorp now provides four different kinds of omics data (genome, methylome, transcriptome, and metabolome), with a user-friendly web interface. This will be a useful scientific data resource on the general population for the discovery of disease biomarkers and personalized disease prevention and early diagnosis.

[1]  Josyf Mychaleckyj,et al.  Robust relationship inference in genome-wide association studies , 2010, Bioinform..

[2]  J. Yasuda,et al.  Genome-wide identification of inter-individually variable DNA methylation sites improves the efficacy of epigenetic association studies , 2017, npj Genomic Medicine.

[3]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[4]  K. Kinoshita,et al.  Construction and integration of three de novo Japanese human genome assemblies toward a population-specific reference , 2019, bioRxiv.

[5]  Wen J. Li,et al.  Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation , 2015, Nucleic Acids Res..

[6]  M. Bonder,et al.  Individual variations in cardiovascular-disease-related protein levels are driven by genetics and gut microbiome , 2018, Nature Genetics.

[7]  Helen E. Parkinson,et al.  The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019 , 2018, Nucleic Acids Res..

[8]  Hiroshi Tanaka,et al.  Security controls in an integrated Biobank to protect privacy in data sharing: rationale and study design , 2017, BMC Medical Informatics and Decision Making.

[9]  Hiroshi Tanaka,et al.  The Tohoku Medical Megabank Project: Design and Mission. , 2016, Journal of epidemiology.

[10]  Cole Trapnell,et al.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. , 2010, Nature biotechnology.

[11]  Heng Li Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM , 2013, 1303.3997.

[12]  K. Kinoshita,et al.  Cohort Profile: Tohoku Medical Megabank Project Birth and Three-Generation Cohort Study (TMM BirThree Cohort Study): rationale, progress and perspective , 2019, International journal of epidemiology.

[13]  Reedik Mägi,et al.  GWAMA: software for genome-wide association meta-analysis , 2010, BMC Bioinformatics.

[14]  J. Weber The Iceland map , 2002, Nature Genetics.

[15]  T. Takenawa,et al.  A Novel Serum Metabolomics-Based Diagnostic Approach for Colorectal Cancer , 2012, PloS one.

[16]  Yuh Shiwa,et al.  iMETHYL: an integrative database of human DNA methylation, gene expression, and genomic variation , 2018, Human Genome Variation.

[17]  Akira R. Kinjo,et al.  Molmil: a molecular viewer for the PDB and beyond , 2016, Journal of Cheminformatics.

[18]  R. Xavier,et al.  Gut Microbial Associations to Plasma Metabolites Linked to Cardiovascular Phenotypes and Risk. , 2019, Circulation research.

[19]  Kengo Kinoshita,et al.  Establishment of Protocols for Global Metabolomics by LC-MS for Biomarker Discovery , 2016, PloS one.

[20]  Lior Pachter,et al.  Sequence Analysis , 2020, Definitions.

[21]  P. Elliott,et al.  UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age , 2015, PLoS medicine.

[22]  Toshiko Tanaka,et al.  Blood Metabolite Signatures of Metabolic Syndrome in Two Cross-Cultural Older Adult Cohorts , 2020, International journal of molecular sciences.

[23]  K. Kinoshita,et al.  Omics research project on prospective cohort studies from the Tohoku Medical Megabank Project , 2018, Genes to cells : devoted to molecular & cellular mechanisms.

[24]  Chenghua Shao,et al.  RCSB Protein Data Bank: Enabling biomedical research and drug discovery , 2019, Protein science : a publication of the Protein Society.

[25]  Jingyuan Fu,et al.  Causal relationships among the gut microbiome, short-chain fatty acids and metabolic diseases , 2019, Nature Genetics.

[26]  Suzanna E Lewis,et al.  JBrowse: a dynamic web platform for genome visualization and analysis , 2016, Genome Biology.

[27]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[28]  Yun Li,et al.  METAL: fast and efficient meta-analysis of genomewide association scans , 2010, Bioinform..

[29]  K. Kinoshita,et al.  Genome analyses for the Tohoku Medical Megabank Project towards establishment of personalized healthcare , 2018, Journal of biochemistry.

[30]  P. Donnelly,et al.  The UK Biobank resource with deep phenotyping and genomic data , 2018, Nature.

[31]  David S. Wishart,et al.  HMDB 4.0: the human metabolome database for 2018 , 2017, Nucleic Acids Res..

[32]  Chittibabu Guda,et al.  LMPD: LIPID MAPS proteome database , 2005, Nucleic Acids Res..

[33]  David W. Russell,et al.  LMSD: LIPID MAPS structure database , 2006, Nucleic Acids Res..

[34]  K. Kinoshita,et al.  Study Profile of the Tohoku Medical Megabank Community-Based Cohort Study , 2020, Journal of epidemiology.

[35]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[36]  S. Tsugane,et al.  Investigations in the possibility of early detection of colorectal cancer by gas chromatography/triple-quadrupole mass spectrometry , 2017, Oncotarget.

[37]  A. Zhernakova,et al.  Cohort profile: LifeLines DEEP, a prospective, general population cohort study in the northern Netherlands: study design and baseline characteristics , 2015, BMJ Open.

[38]  Andrew P. Boughton,et al.  Exploring and visualizing large-scale genetic associations by using PheWeb , 2020, Nature Genetics.

[39]  Carol J. Bult,et al.  The Mouseion at the JAXlibrary , 2022 .

[40]  Jerzy Adamski,et al.  Interlaboratory Reproducibility of a Targeted Metabolomics Platform for Analysis of Human Serum and Plasma. , 2017, Analytical chemistry.

[41]  Minoru Kanehisa,et al.  KEGG: new perspectives on genomes, pathways, diseases and drugs , 2016, Nucleic Acids Res..

[42]  Irina M. Armean,et al.  The mutational constraint spectrum quantified from variation in 141,456 humans , 2019, Nature.

[43]  J. Ioannidis,et al.  Meta-analysis methods for genome-wide association studies and beyond , 2013, Nature Reviews Genetics.

[44]  K. Kinoshita,et al.  3.5KJPNv2: an allele frequency panel of 3552 Japanese individuals including the X chromosome , 2019, Human Genome Variation.

[45]  Kengo Kinoshita,et al.  jMorp: Japanese Multi Omics Reference Panel , 2017, Nucleic Acids Res..

[46]  K. Lunetta Methods for Meta‐Analysis of Genetic Data , 2013, Current protocols in human genetics.

[47]  Stephen E. Stein,et al.  Metabolite profiling of a NIST Standard Reference Material for human plasma (SRM 1950): GC-MS, LC-MS, NMR, and clinical laboratory analyses, libraries, and web-based resources. , 2013, Analytical chemistry.

[48]  N. Laird,et al.  Meta-analysis in clinical trials revisited. , 2015, Contemporary clinical trials.