SOA Based System for Big Genomic Data Analytics and Knowledge Discovery

The volume of stored genomic data has increased significantly in the recent years. Main challenge in their analysis and knowledge discovery is to suggest advanced and efficient tools, methods and technologies for access and processing. SOA based system for adaptive knowledge discovery and decision making based on big genomic data analytics is proposed in this paper. The system architecture is comprised of web services for data integration, preprocessing of large data streams, knowledge discovery based on genomic data analytics, knowledge interpretation and results visualization. The functionality of the developed system is explained. A web service for breast cancer data processing has been developed for the purpose of system testing and validation. The proposed system architecture allows scientists an easy, fast and flexible approach for data processing. They can choose the services they wish to be executed, use the available data sets in databases, or enter their own data to be processed.

[1]  Barbara Martini,et al.  The Data Revolution. Big Data, Open Data, Data Infrastructures and Their Consequences , 2016 .

[2]  Yoav Freund,et al.  Large Margin Classification Using the Perceptron Algorithm , 1998, COLT.

[3]  Amir Mehrgou,et al.  The importance of BRCA1 and BRCA2 genes mutations in breast cancer development , 2016, Medical journal of the Islamic Republic of Iran.

[4]  C. L. Philip Chen,et al.  Data-intensive applications, challenges, techniques and technologies: A survey on Big Data , 2014, Inf. Sci..

[5]  Benjamin W. Wah,et al.  Significance and Challenges of Big Data Research , 2015, Big Data Res..

[6]  M. Roizen,et al.  Hallmarks of Cancer: The Next Generation , 2012 .

[7]  Farahnaz Hamedan,et al.  Artificial intelligence methods for the diagnosis of breast cancer by image processing: a review , 2018, Breast cancer.

[8]  Misha Angrist,et al.  Personalized medicine and human genetic diversity. , 2014, Cold Spring Harbor perspectives in medicine.

[9]  Dimitrios I. Fotiadis,et al.  Machine learning applications in cancer prognosis and prediction , 2014, Computational and structural biotechnology journal.

[10]  Kathryn A Phillips,et al.  Precision Medicine: From Science To Value. , 2018, Health affairs.

[11]  Hristo Stoev,et al.  DNA Sequence Alignment Method Based on Trilateration , 2019, IWBBIO.

[12]  Veska Gancheva SOA Based Multi-Agent Approach for Biological Data Searching and Integration , .

[13]  B. Freidlin,et al.  Statistical and practical considerations for clinical evaluation of predictive biomarkers. , 2013, Journal of the National Cancer Institute.

[14]  Alexandros Labrinidis,et al.  Challenges and Opportunities with Big Data , 2012, Proc. VLDB Endow..

[15]  Mehryar Mohri,et al.  Perceptron Mistake Bounds , 2013, ArXiv.