Federated sharing and processing of genomic datasets for tertiary data analysis

[1]  Marco Masseroli,et al.  GenoMetric Query Language: a novel approach to large-scale genomic data management , 2015, Bioinform..

[2]  Pietro Liò,et al.  The BioMart community portal: an innovative alternative to large, centralized data repositories , 2015, Nucleic Acids Res..

[3]  Stefano Ceri,et al.  Framework for Supporting Genomic Operations , 2017, IEEE Transactions on Computers.

[4]  Tarcisio Mendes de Farias,et al.  Enabling semantic queries across federated bioinformatics databases , 2019, Database : the journal of biological databases and curation.

[5]  F. Arnaud,et al.  From core referencing to data re-use: two French national initiatives to reinforce paleodata stewardship (National Cyber Core Repository and LTER France Retro-Observatory) , 2017 .

[6]  Marco Masseroli,et al.  Processing of big heterogeneous genomic datasets for tertiary analysis of Next Generation Sequencing data , 2018, Bioinform..

[7]  Stefan Decker,et al.  TopFed: TCGA tailored federated query processing and linking to LOD , 2014, Journal of Biomedical Semantics.

[8]  Joshua M. Stuart,et al.  The Cancer Genome Atlas Pan-Cancer analysis project , 2013, Nature Genetics.

[9]  Junjun Zhang,et al.  BioMart: a data federation framework for large collaborative projects , 2011, Database J. Biol. Databases Curation.

[10]  Marco Masseroli,et al.  GenoSurf: metadata driven semantic search system for integrated genomic datasets , 2019, Database J. Biol. Databases Curation.

[11]  Stefano Ceri,et al.  Optimal Binning for Genomics , 2019, IEEE Transactions on Computers.

[12]  Marco Masseroli,et al.  Modeling and interoperability of heterogeneous genomic big data for integrative processing and querying. , 2016, Methods.

[13]  T. Mikkelsen,et al.  The NIH Roadmap Epigenomics Mapping Consortium , 2010, Nature Biotechnology.

[14]  Marco Masseroli,et al.  Scalable Genomic Data Management System on the Cloud , 2017, 2017 International Conference on High Performance Computing & Simulation (HPCS).

[15]  Bertil Schmidt,et al.  Next-generation sequencing: big data meets high performance computing. , 2017, Drug discovery today.

[16]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[17]  Gillian L. Currie,et al.  Risk of Bias in Reports of In Vivo Research: A Focus for Improvement , 2015, PLoS biology.

[18]  Dietrich Rebholz-Schuhmann,et al.  BioFed: federated query processing over life sciences linked open data , 2017, J. Biomed. Semant..

[19]  Dariusz Mrozek,et al.  Protein Construction-Based Data Partitioning Scheme for Alignment of Protein Macromolecular Structures Through Distributed Querying in Federated Databases , 2020, IEEE Transactions on NanoBioscience.

[20]  N. Siva UK gears up to decode 100 000 genomes from NHS patients , 2015, The Lancet.

[21]  G. Nolan,et al.  Computational solutions to large-scale data management and analysis , 2010, Nature Reviews Genetics.

[22]  Dariusz Mrozek,et al.  Cloud4Psi: cloud computing for 3D protein structure similarity searching , 2014, Bioinform..

[23]  Jeremy J. Yang,et al.  PIBAS FedSPARQL: a web-based platform for integration and exploration of bioinformatics datasets , 2017, J. Biomed. Semant..

[24]  Ana Roxin,et al.  FOWLA, A Federated Architecture for Ontologies , 2015, RuleML.

[25]  Gunnar Rätsch,et al.  BRCA Challenge: BRCA Exchange as a global resource for variants in BRCA1 and BRCA2 , 2018, PLoS genetics.

[26]  F. Song,et al.  Roles of low-density lipoprotein receptor-related protein 1 in tumors , 2016, Chinese journal of cancer.

[27]  Mete Akgün,et al.  Privacy preserving processing of genomic data: A survey , 2015, J. Biomed. Informatics.

[28]  M. Schatz,et al.  Big Data: Astronomical or Genomical? , 2015, PLoS biology.