Securing Bioinformatics Cloud for Big Data: Budding Buzzword or a Glance of the Future

Insight to utilize the Big data of Bioinformatics information generated by a paradigm; Cloud Computing is coming up as a guarantee to deal with big information storage and scrutiny challenges in the Bioinformatics field. Cloud computing is viewed to be a cost effectual technique to process and accumulate this immense quantity of data with parallel processing tools and carried as “Services” through the internet. Due to its fast and efficient performance for data processing on cloud clusters and easy to use environments, The Hadoop parallel programming framework is dominantly used. This document will be bearing in the direction of the productive course for economical Bioinformatics clouds for the Big data and also the challenges that would obstruct Bioinformatics Big data to take a stride towards the cloud. In this document, we state an outline of the applications of Bioinformatics clouds, merits, and limitations of the current research activity methods used for storing Big Data in Bioinformatics. The paper mentions how the existing dilemma can be addressed from the perspective of Cloud computing services in addition to Bioinformatics tools. For ensuring trust, a simulation comparing the trust values for different Cloud providers is being illustrated in Fog server. For Future enhancements, efforts are being made to build up an efficient cloud data storage system employing different Bioinformatics tools ensuring security so that various Healthcare organizations are benefited by this approach.

[1]  Shruti Goyat,et al.  A secure cryptographic cloud communication using DNA cryptographic technique , 2016, 2016 International Conference on Inventive Computation Technologies (ICICT).

[2]  Sandeep Tata,et al.  BlueSNP: R package for highly scalable genome-wide association studies using Hadoop clusters , 2013, Bioinform..

[3]  José A. B. Fortes,et al.  CloudBLAST: Combining MapReduce and Virtualization on Distributed Resources for Bioinformatics Applications , 2008, 2008 IEEE Fourth International Conference on eScience.

[4]  Konstantinos Krampis,et al.  Cloud BioLinux: pre-configured and on-demand bioinformatics computing for the genomics community , 2012, BMC Bioinformatics.

[5]  Lúcia Maria de A. Drummond,et al.  Evaluating Grasp-based cloud dimensioning for comparative genomics: A practical approach , 2014, 2014 IEEE International Conference on Cluster Computing (CLUSTER).

[6]  Mete Akgün,et al.  Privacy preserving processing of genomic data: A survey , 2015, J. Biomed. Informatics.

[7]  Andrian Yang,et al.  Scalability and Validation of Big Data Bioinformatics Software , 2017, Computational and structural biotechnology journal.

[8]  Luca Pireddu,et al.  MapReducing a genomic sequencing workflow , 2011, MapReduce '11.

[9]  Michael C. Schatz,et al.  CloudBurst: highly sensitive read mapping with MapReduce , 2009, Bioinform..

[10]  Pinki Roy,et al.  Time efficient secure DNA based access control model for cloud computing environment , 2017, Future Gener. Comput. Syst..

[11]  X. Guan,et al.  Cancer metastases: challenges and opportunities , 2015, Acta pharmaceutica Sinica. B.

[12]  Brian D. O'Connor,et al.  SeqWare Query Engine: storing and searching sequence data in the cloud , 2010, BMC Bioinformatics.

[13]  Luis Ceze,et al.  Computer Security, Privacy, and DNA Sequencing: Compromising Computers with Synthesized DNA, Privacy Leaks, and More , 2017, USENIX Security Symposium.

[14]  Maria Fazio,et al.  New trends in Biotechnology: The point on NGS Cloud computing solutions , 2016, 2016 IEEE Symposium on Computers and Communication (ISCC).

[15]  Ivan Merelli,et al.  Managing, Analysing, and Integrating Big Data in Medical Bioinformatics: Open Problems and Future Perspectives , 2014, BioMed research international.

[16]  Henning Hermjakob,et al.  Hydra: a scalable proteomic search engine which utilizes the Hadoop distributed computing framework , 2012, BMC Bioinformatics.

[17]  Yaw-Ling Lin,et al.  Cloud computing service framework for bioinformatics tools , 2015, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[18]  Prachi Singh,et al.  Big Genomic Data in Bioinformatics Cloud , 2016 .

[19]  Christopher Williams,et al.  Secure and robust cloud computing for high-throughput forensic microsatellite sequence analysis and databasing. , 2017, Forensic science international. Genetics.

[20]  Alireza Tabatabaei Tabrizi,et al.  Applications of Cloud Computing in Health Systems , 2016 .

[21]  Yixue Li,et al.  Big Biological Data: Challenges and Opportunities , 2014, Genom. Proteom. Bioinform..

[22]  João José Costa Gondim,et al.  Attribute based access control in federated clouds: A case study in bionformatics , 2017, 2017 12th Iberian Conference on Information Systems and Technologies (CISTI).

[23]  Richard O. Sinnott,et al.  TruXy: Trusted Storage Cloud for Scientific Workflows , 2017, IEEE Transactions on Cloud Computing.

[24]  B. Langmead,et al.  Cloud-scale RNA-sequencing differential expression analysis with Myrna , 2010, Genome Biology.

[25]  Karolj Skala,et al.  Building and provisioning bioinformatics environments on public and private Clouds , 2015, 2015 38th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO).

[26]  Yaw-Ling Lin,et al.  Cloud Computing-Based TagSNP Selection Algorithm for Human Genome Data , 2015, International journal of molecular sciences.

[27]  Ying-Chih Lin,et al.  Enabling Large-Scale Biomedical Analysis in the Cloud , 2013, BioMed research international.

[28]  M. Shamim Hossain,et al.  A Security Model for Preserving the Privacy of Medical Big Data in a Healthcare Cloud Using a Fog Computing Facility With Pairing-Based Cryptography , 2017, IEEE Access.

[29]  Borja Sotomayor,et al.  Cloud-based bioinformatics workflow platform for large-scale next-generation sequencing analyses , 2014, J. Biomed. Informatics.

[30]  Heena Kharche,et al.  Big Data in Bioinformatics & the Era of Cloud Computing , 2013 .

[31]  B. B. Zaidan,et al.  A distributed framework for health information exchange using smartphone technologies , 2017, J. Biomed. Informatics.

[32]  Vineet Kumar Cloud computing using bioinformatics MapReduce applications , 2016, 2016 Symposium on Colossal Data Analysis and Networking (CDAN).

[33]  Stéphane Le Crom,et al.  Eoulsan: a cloud computing-based framework facilitating high throughput sequencing analyses , 2012, Bioinform..

[34]  Weisong Shi,et al.  CloudAligner: A fast and full-featured MapReduce based tool for sequence mapping , 2011, BMC Research Notes.

[35]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[36]  David William Galbraith Frontiers in Genomic Assay Technologies: The Grand Challenges in Enabling Data-Intensive Biological Research , 2011, Front. Gene..

[37]  Ming Ouyang,et al.  Biocloud: Cloud Computing for Biological, Genomics, and Drug Design , 2013, BioMed Research International.

[38]  T. Benton,et al.  Food production vs. biodiversity: comparing organic and conventional agriculture , 2013 .

[39]  Finn Drabløs,et al.  The eGenVar data management system—cataloguing and sharing sensitive data and metadata for the life sciences , 2014, Database J. Biol. Databases Curation.

[40]  N. B. Anuar,et al.  The rise of "big data" on cloud computing: Review and open research issues , 2015, Inf. Syst..

[41]  David R. Riley,et al.  CloVR: A virtual machine for automated and portable sequence analysis from the desktop using cloud computing , 2011, BMC Bioinformatics.

[42]  Michael C. Schatz,et al.  Cloud Computing and the DNA Data Race , 2010, Nature Biotechnology.

[43]  Yuri Yamamoto,et al.  A Decentralized System of Genome Secret Search Implemented with Fully Homomorphic Encryption , 2017, 2017 IEEE International Conference on Smart Computing (SMARTCOMP).

[44]  Blair Bethwaite,et al.  Development of a cloud-based Bioinformatics Training Platform , 2017, Briefings Bioinform..

[45]  Tomislav Lipic,et al.  Delivering bioinformatics MapReduce applications in the cloud , 2014, 2014 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO).

[46]  Michael Naehrig,et al.  Manual for Using Homomorphic Encryption for Bioinformatics , 2017, Proceedings of the IEEE.

[47]  Arcady Mushegian,et al.  Grand Challenges in Bioinformatics and Computational Biology , 2011, Front. Gene..

[48]  Eija Korpelainen,et al.  Hadoop-BAM: directly manipulating next generation sequencing data in the cloud , 2012, Bioinform..

[49]  Miguel López-Coronado,et al.  Analysis of the Security and Privacy Requirements of Cloud-Based Electronic Health Records Systems , 2013, Journal of medical Internet research.

[50]  V. Siddaramappa,et al.  Cryptography and bioinformatics techniques for secure information transmission over insecure channels , 2015, 2015 International Conference on Applied and Theoretical Computing and Communication Technology (iCATccT).

[51]  Gos Micklem,et al.  Constructing synthetic biology workflows in the cloud , 2017 .

[52]  M. Schatz,et al.  Searching for SNPs with cloud computing , 2009, Genome Biology.

[53]  María José del Jesús,et al.  Big Data with Cloud Computing: an insight on the computing environment, MapReduce, and programming frameworks , 2014, WIREs Data Mining Knowl. Discov..

[54]  Mario Cannataro,et al.  Cloud Computing in Bioinformatics: current solutions and challenges , 2016 .

[55]  Xiandong Meng,et al.  A case study of tuning MapReduce for efficient Bioinformatics in the cloud , 2017, Parallel Comput..

[56]  Ivan Stojmenovic,et al.  The Fog computing paradigm: Scenarios and security issues , 2014, 2014 Federated Conference on Computer Science and Information Systems.

[57]  Hugh P. Shanahan,et al.  Bioinformatics on the Cloud Computing Platform Azure , 2014, PloS one.

[58]  Jin Soo Lee,et al.  FX: an RNA-Seq analysis tool on the cloud , 2012, Bioinform..

[59]  Roy D. Sleator,et al.  'Big data', Hadoop and cloud computing in genomics , 2013, J. Biomed. Informatics.

[60]  Jake Luo,et al.  Big Data Application in Biomedical Research and Health Care: A Literature Review , 2016, Biomedical informatics insights.

[61]  Maristela Holanda,et al.  ACOsched: A scheduling algorithm in a federated cloud infrastructure for bioinformatics applications , 2013, 2013 IEEE International Conference on Bioinformatics and Biomedicine.

[62]  Qun Li,et al.  Security and Privacy Issues of Fog Computing: A Survey , 2015, WASA.

[63]  V. Marx Biology: The big challenges of big data , 2013, Nature.

[64]  A. Kuo Opportunities and Challenges of Cloud Computing to Improve Health Care Services , 2011, Journal of medical Internet research.

[65]  Geoffrey C. Fox,et al.  Cloud computing paradigms for pleasingly parallel biomedical applications , 2010, HPDC '10.