OWL reasoning over big biomedical data

Recently, the emerging accumulation of biomedical data on the Web (e.g. vast amounts of protein sequences, genes, gene products, drugs, diseases and chemical compounds, etc.) has shaped a big network of isolated professional knowledge. Embedded with domain knowledge from different disciplines all regarding to human biological systems, the decentralized data repositories are implicitly connected by human expert knowledge. Lots of biomedical data sources are published separately in the form of semantic ontologies represented by Web Ontology Language (OWL) syntax, which is naturally based on linked graphs. When we are faced with such massive, disparate and interlinked data, biomedical data analysis becomes a challenge. In this paper, we present a general OWL reasoning framework for the analysis of big biomedical data and implement a MapReduce-based property chain reasoning prototype system. OWL reasoning method is ideally suitable for problems involved complex semantic associations because it is able to infer logical consequences based on a set of asserted rules or axioms. MapReduce framework is used to solve the problem of scalability. In our experiment, we focus on the discovery of associations between Traditional Chinese Medicine (TCM) and Western Medicine (WM). The results show the system achieves high performance, accuracy and scalability.

[1]  Bernard De Baets,et al.  Reasoning with bio-ontologies: using relational closure rules to enable practical querying , 2011, Bioinform..

[2]  Kei-Hoi Cheung,et al.  Using semantic web rules to reason on an ontology of pseudogenes , 2010, Bioinform..

[3]  M. Metzker Sequencing technologies — the next generation , 2010, Nature Reviews Genetics.

[4]  M. Schatz,et al.  Searching for SNPs with cloud computing , 2009, Genome Biology.

[5]  Michael C. Schatz,et al.  CloudBurst: highly sensitive read mapping with MapReduce , 2009, Bioinform..

[6]  K. Cheung,et al.  Semantic Web for Health Care and Life Sciences: a review of the state of the art , 2009, Briefings Bioinform..

[7]  José A. B. Fortes,et al.  CloudBLAST: Combining MapReduce and Virtualization on Distributed Resources for Bioinformatics Applications , 2008, 2008 IEEE Fourth International Conference on eScience.

[8]  Sanjay Ghemawat,et al.  MapReduce: simplified data processing on large clusters , 2008, CACM.

[9]  Chi‐Huey Wong,et al.  Ganoderma lucidum polysaccharides in human monocytic leukemia cells: from gene expression to network construction , 2007, BMC Genomics.

[10]  Yarden Katz,et al.  Pellet: A practical OWL-DL reasoner , 2007, J. Web Semant..

[11]  Li Ma,et al.  Minerva: A Scalable OWL Ontology Storage and Inference System , 2006, ASWC.

[12]  Hong-Sheng Zhang,et al.  Salvianolic acid B from Salvia miltiorrhiza inhibits tumor necrosis factor-α (TNF-α)-induced MMP-2 upregulation in human aortic smooth muscle cells via suppression of NAD(P)H oxidase-derived reactive oxygen species , 2006 .

[13]  Hong-Sheng Zhang,et al.  Salvianolic acid B from Salvia miltiorrhiza inhibits tumor necrosis factor-alpha (TNF-alpha)-induced MMP-2 upregulation in human aortic smooth muscle cells via suppression of NAD(P)H oxidase-derived reactive oxygen species. , 2006, Journal of molecular and cellular cardiology.

[14]  Christine Golbreich,et al.  The Foundational Model of Anatomy in OWL: Experience and Perspectives , 2006, OWLED.

[15]  F. Lu,et al.  Anti-inflammatory potential of Antrodia Camphorata through inhibition of iNOS, COX-2 and cytokines via the NF-kappaB pathway. , 2005, International immunopharmacology.

[16]  P. Barnes,et al.  Complementary and alternative medicine use among adults: United States, 2002. , 2004, Advance data.

[17]  Tao Gu,et al.  Ontology based context modeling and reasoning using OWL , 2004, IEEE Annual Conference on Pervasive Computing and Communications Workshops, 2004. Proceedings of the Second.

[18]  L. Stein,et al.  OWL Web Ontology Language - Reference , 2004 .

[19]  Dan Brickley,et al.  Rdf vocabulary description language 1.0 : Rdf schema , 2004 .

[20]  Volker Haarslev,et al.  Racer: A Core Inference Engine for the Semantic Web , 2003, EON.

[21]  J. Phillipson,et al.  St John's wort (Hypericum perforatum L.): a review of its chemistry, pharmacology and clinical properties , 2001, The Journal of pharmacy and pharmacology.

[22]  Philip Yam Take Me to the Ocean. , 2001 .

[23]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[24]  C. Ho,et al.  The anti‐tumor effect of Ganoderma Lucidum is mediated by cytokines released from activated macrophages and T lymphocytes , 1997, International journal of cancer.