Towards a big data analytics platform with Hadoop/MapReduce framework using simulated patient data of a hospital system

Database (DAD) metadata profiles including workflow steps carried out on a regular basis by VIHA staff only. Med2020 WinRecs abstraction software is used to abstract data based on dictionaries and data standards, accordingly. Figure 6. The main components of our Healthcare Big Data Analytics (HBDA) platform that were envisioned by stakeholders and derived from research team. There are numerous reasons why the BDA platform did not use real patient data. Firstly, the process of Ethics and Research Capacity at VIHA for approval for the entire patient data of the hospital system is not possible without existing system and total security/privacy testing. Secondly, it is not possible to piece together summarized data specific to health outcomes because this data has already been summarized and no benefit to VIHA, and, therefore, Ethics will not approve. Thirdly, in a real setting the data will have to be moved or migrated off of the production architecture to avoid using and consuming network resources in use in the hospital. Fourthly, real data in the data warehouse at VIHA will require several months to review and develop the solution to use big data technologies, which is not available. Fifthly, the platform Big Data Analytics in Healthcare Generated Patient Data: •Validation •Integration •Replication HBDA Patient and Hospital Applications and Visualization: Interface Systems Actions and Filters: Query Performance, Benchmarking for Healthcare noSQL Database: Indexed qualifiers, Management, Data Integrity and Access

[1]  INPUT SPLIT FREQUENT PATTERN TREE USING MAPREDUCE PARADIGM IN HADOOP , 2016 .

[2]  Erik Brauner,et al.  Informatics and Quantitative Analysis in Biological Imaging , 2003, Science.

[3]  Peter Saffrey,et al.  Rapid Whole-Genome Sequencing for Genetic Disease Diagnosis in Neonatal Intensive Care Units , 2012, Science Translational Medicine.

[4]  Z. H. Li,et al.  Research on the Method of Big Data Analysis , 2013 .

[5]  Robert M. Stephens,et al.  Knowledge and Theme Discovery across Very Large Biological Data Sets Using Distributed Queries: A Prototype Combining Unstructured and Structured Data , 2013, PloS one.

[6]  Brian Lehaney,et al.  Healthcare Knowledge Management Primer , 2009 .

[7]  Edmund Kohlwey,et al.  Leveraging the Cloud for Big Data Biometrics: Meeting the Performance Requirements of the Next Generation Biometric Systems , 2011, 2011 IEEE World Congress on Services.

[8]  Julia Adler-Milstein,et al.  Healthcare's "big data" challenge. , 2013, The American journal of managed care.

[9]  Yannick Dufresne,et al.  The True North Strong and Free Healthcare? Nationalism and Attitudes Towards Private Healthcare Options in Canada , 2014, Canadian Journal of Political Science.

[10]  Ciprian Dobre,et al.  Parallel Programming Paradigms and Frameworks in Big Data Era , 2013, International Journal of Parallel Programming.

[11]  John Castellani,et al.  Data Mining: Qualitative Analysis with Health Informatics Data , 2003, Qualitative health research.

[12]  Laura B. Madsen Data-Driven Healthcare: How Analytics and BI are Transforming the Industry , 2014 .

[13]  Tae-Min Song,et al.  Big Data Analysis Framework for Healthcare and Social Sectors in Korea , 2015, Healthcare informatics research.

[14]  M. Jonas,et al.  Patient Identification, A Review of the Use of Biometrics in the ICU , 2014 .

[15]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[16]  Haoxiang Lin,et al.  An Empirical Study on Quality Issues of Production Big Data Platform , 2015, 2015 IEEE/ACM 37th IEEE International Conference on Software Engineering.

[17]  Yueyang Alice Li,et al.  Medical data mining : improving information accessibility using online patient drug reviews , 2011 .

[18]  Limsoon Wong,et al.  Random forests on Hadoop for genome-wide association studies of multivariate neuroimaging phenotypes , 2013, BMC Bioinformatics.

[19]  Alvaro A. Cárdenas,et al.  Big Data Analytics for Security , 2013, IEEE Security & Privacy.

[20]  Syed Akhter Hossain,et al.  NoSQL Database: New Era of Databases for Big data Analytics - Classification, Characteristics and Comparison , 2013, ArXiv.

[21]  Vijay H. Kothari,et al.  Workarounds to Computer Access in Healthcare Organizations: You Want My Password or a Dead Patient? , 2015, ITCH.

[22]  Keun Ho Ryu,et al.  Design and Partial Implementation of Health Care System for Disease Detection and Behavior Analysis by Using DM Techniques , 2016, 2016 IEEE 14th Intl Conf on Dependable, Autonomic and Secure Computing, 14th Intl Conf on Pervasive Intelligence and Computing, 2nd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech).

[23]  Wilson C. Hsieh,et al.  Bigtable: A Distributed Storage System for Structured Data , 2006, TOCS.

[24]  Peter J. Haas,et al.  Ricardo: integrating R and Hadoop , 2010, SIGMOD Conference.

[25]  Qian Xu,et al.  Compression-aware I/O performance analysis for big data clustering , 2012, BigMine '12.

[26]  L. Lenert,et al.  EHR Big Data Deep Phenotyping , 2014, Yearbook of Medical Informatics.

[27]  Jie Xu,et al.  ZQL: A Unified Middleware Bridging Both Relational and NoSQL Databases , 2016, 2016 IEEE 14th Intl Conf on Dependable, Autonomic and Secure Computing, 14th Intl Conf on Pervasive Intelligence and Computing, 2nd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech).

[28]  Che-Lun Hung,et al.  Novel and efficient tag SNPs selection algorithms. , 2014, Bio-medical materials and engineering.

[29]  G. Nolan,et al.  Computational solutions to large-scale data management and analysis , 2010, Nature Reviews Genetics.

[30]  T. S. Eugene Ng,et al.  Understanding the effects and implications of compute node related failures in hadoop , 2012, HPDC '12.

[31]  Yingjie Wang,et al.  mDHT: a multi-level-indexed DHT algorithm to extra-large-scale data retrieval on HDFS/Hadoop architecture , 2014, Personal and Ubiquitous Computing.

[32]  Geoffrey C. Fox,et al.  MapReduce for Data Intensive Scientific Analyses , 2008, 2008 IEEE Fourth International Conference on eScience.

[33]  George Siemens Connectivism: A Learning Theory for the Digital Age , 2004 .

[34]  Emad A. Mohammed,et al.  Applications of the MapReduce programming framework to clinical big data analysis: current landscape and future trends , 2014, BioData Mining.

[35]  Sameer Kumar,et al.  HIPAA's effects on US healthcare. , 2009, International journal of health care quality assurance.

[36]  Sandra Hempel The Strange Case of the Broad Street Pump: John Snow and the Mystery of Cholera , 2007 .

[37]  G. Sudha Sadasivam,et al.  A novel approach to multiple sequence alignment using hadoop data grids , 2010, MDAC '10.

[38]  James M. Tien,et al.  Big Data: Unleashing information , 2013, 2013 10th International Conference on Service Systems and Service Management.

[39]  Murat Kantarcioglu,et al.  BigSecret: A Secure Data Management Framework for Key-Value Stores , 2013, 2013 IEEE Sixth International Conference on Cloud Computing.

[40]  K. Cios Medical data mining and knowledge discovery. , 2000, IEEE engineering in medicine and biology magazine : the quarterly magazine of the Engineering in Medicine & Biology Society.

[41]  Gang-hoon Kim,et al.  Potentiality of Big Data in the Medical Sector: Focus on How to Reshape the Healthcare System , 2013, Healthcare informatics research.

[42]  John F. Roddick,et al.  Geographic Data Mining and Knowledge Discovery , 2001 .

[43]  Roger Clarke,et al.  Big Data's Big Unintended Consequences , 2013, Computer.

[44]  Eleni Stroulia,et al.  Enhancing Query Support in HBase via an Extended Coprocessors Framework , 2011, ServiceWave.

[45]  George Hripcsak,et al.  Next-generation phenotyping of electronic health records , 2012, J. Am. Medical Informatics Assoc..

[46]  Divyakant Agrawal,et al.  $\mathcal{MD}$-HBase: design and implementation of an elastic data infrastructure for cloud-scale location services , 2012, Distributed and Parallel Databases.

[47]  J. Manyika Big data: The next frontier for innovation, competition, and productivity , 2011 .

[48]  K.J. Cios,et al.  From the guest editor medical data mining and knowledge discovery , 2000, IEEE Engineering in Medicine and Biology Magazine.

[49]  James L. Schwing,et al.  Visual and spatial analysis : advances in data mining, reasoning, and problem solving , 2004 .

[50]  Maziar Goudarzi,et al.  The Memory Challenge in Reduce Phase of MapReduce Applications , 2016, IEEE Transactions on Big Data.

[51]  Peter Langkafel Big Data in Medical Science and Healthcare Management , 2015 .

[52]  Shuying Shen,et al.  2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text , 2011, J. Am. Medical Informatics Assoc..

[53]  Gail-Joon Ahn,et al.  Patient-centric authorization framework for electronic healthcare services , 2011, Comput. Secur..

[54]  Chao-Tung Yang,et al.  Implementation of Data Transform Method into NoSQL Database for Healthcare Data , 2013, 2013 International Conference on Parallel and Distributed Computing, Applications and Technologies.

[55]  Robert Hoyt,et al.  Digital family histories for data mining. , 2013, Perspectives in health information management.

[56]  Xiaoyong Du,et al.  Big data challenge: a data management perspective , 2013, Frontiers of Computer Science.

[57]  Katharine Armstrong,et al.  Big data: a revolution that will transform how we live, work, and think , 2014 .

[58]  William Perrizo,et al.  Big Data Analytics in Bioinformatics and Healthcare , 2014 .

[59]  Adam Jorgensen,et al.  Microsoft Big Data Solutions , 2014 .

[60]  Muhammad Afzal,et al.  Autonomous mapping of HL7 RIM and relational database schema , 2012, Inf. Syst. Frontiers.

[61]  Achim Streit,et al.  NoWog: A Workload Generator for Database Performance Benchmarking , 2016, 2016 IEEE 14th Intl Conf on Dependable, Autonomic and Secure Computing, 14th Intl Conf on Pervasive Intelligence and Computing, 2nd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech).

[62]  Sherif Sakr,et al.  Towards a Comprehensive Data Analytics Framework for Smart Healthcare Services , 2016, Big Data Res..

[63]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[64]  Melnned M. Kantardzic Big Data Analytics , 2013, Lecture Notes in Computer Science.

[65]  David J Hunter,et al.  From Darwin's finches to canaries in the coal mine--mining the genome for new biology. , 2008, The New England journal of medicine.

[66]  Ronald C. Taylor An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics , 2010, BMC Bioinformatics.

[67]  Anand Raghunathan,et al.  ShuffleWatcher: Shuffle-aware Scheduling in Multi-tenant MapReduce Clusters , 2014, USENIX Annual Technical Conference.

[68]  L. Ohno-Machado,et al.  “Big Data” and the Electronic Health Record , 2014, Yearbook of Medical Informatics.

[69]  Régis Beuscart,et al.  Toward a Literature-Driven Definition of Big Data in Healthcare , 2015, BioMed research international.

[70]  Yi Mu,et al.  Personal Health Record Systems and Their Security Protection , 2006, Journal of Medical Systems.

[71]  Jin Chang,et al.  Balanced parallel FP-Growth with MapReduce , 2010, 2010 IEEE Youth Conference on Information, Computing and Telecommunications.

[72]  Yunhao Liu,et al.  Big Data: A Survey , 2014, Mob. Networks Appl..

[73]  Farhad Mehdipour,et al.  FOG-Engine: Towards Big Data Analytics in the Fog , 2016, 2016 IEEE 14th Intl Conf on Dependable, Autonomic and Secure Computing, 14th Intl Conf on Pervasive Intelligence and Computing, 2nd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech).

[74]  Guangchen Ruan,et al.  Exploiting MapReduce and data compression for data-intensive applications , 2013, XSEDE.

[75]  Muhammad Shiraz,et al.  Big Data: Survey, Technologies, Opportunities, and Challenges , 2014, TheScientificWorldJournal.

[76]  Jan-Ming Ho,et al.  De Novo Assembly of High-Throughput Sequencing Data with Cloud Computing and New Operations on String Graphs , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[77]  Jingfa Xiao,et al.  Bioinformatics clouds for big data manipulation , 2012, Biology Direct.

[78]  Alexandros Labrinidis,et al.  Challenges and Opportunities with Big Data , 2012, Proc. VLDB Endow..

[79]  Cong Xu,et al.  Virtual Shuffling for Efficient Data Movement in MapReduce , 2015, IEEE Transactions on Computers.

[80]  Laurie D. Smith,et al.  A 26-hour system of highly sensitive whole genome sequencing for emergency management of genetic diseases , 2015, Genome Medicine.

[81]  Jimmy J. Lin,et al.  Pairwise Document Similarity in Large Collections with MapReduce , 2008, ACL.

[82]  Sanjay P. Ahuja,et al.  State of Big Data Analysis in the Cloud , 2013, Netw. Commun. Technol..

[83]  Kui Zhang,et al.  Dynamic programming algorithms for haplotype block partitioning: applications to human chromosome 21 haplotype data , 2003, RECOMB '03.

[84]  Jesus J. Caban,et al.  Visual analytics in healthcare - opportunities and research challenges , 2015, J. Am. Medical Informatics Assoc..

[85]  Darcy A. Davis,et al.  Bringing Big Data to Personalized Healthcare: A Patient-Centered Framework , 2013, Journal of General Internal Medicine.

[86]  Erik Sundvall,et al.  Comparing the Performance of NoSQL Approaches for Managing Archetype-Based Electronic Health Record Data , 2016, PloS one.

[87]  Sungchul Choi,et al.  Big Data Framework for Analyzing Patents to Support Strategic R&D Planning , 2016, 2016 IEEE 14th Intl Conf on Dependable, Autonomic and Secure Computing, 14th Intl Conf on Pervasive Intelligence and Computing, 2nd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech).

[88]  Yeh-Ching Chung,et al.  JackHare: a framework for SQL to NoSQL translation using MapReduce , 2013, Automated Software Engineering.

[89]  Che-Rung Lee,et al.  Performance Optimization of the SSVD Collaborative Filtering Algorithm on MapReduce Architectures , 2016, 2016 IEEE 14th Intl Conf on Dependable, Autonomic and Secure Computing, 14th Intl Conf on Pervasive Intelligence and Computing, 2nd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech).

[90]  Yanpei Chen,et al.  Energy efficiency for large-scale MapReduce workloads with significant interactive analysis , 2012, EuroSys '12.

[91]  Sanjay Ghemawat,et al.  MapReduce: a flexible data processing tool , 2010, CACM.

[92]  Shusaku Tsumoto,et al.  Temporal data mining in history data of hospital information systems , 2011, 2011 IEEE International Conference on Systems, Man, and Cybernetics.

[93]  P. Schilling,et al.  The Big To Do About “Big Data” , 2014, Clinical orthopaedics and related research.

[94]  Reinhold Haux,et al.  Medical informatics: Past, present, future , 2010, Int. J. Medical Informatics.

[95]  Erik W. Kuiler From Big Data to Knowledge: An Ontological Approach to Big Data Analytics , 2014 .

[96]  D. Skiba The connected age: big data & data visualization. , 2014, Nursing education perspectives.

[97]  Tin Yu Wu,et al.  Towards a framework for large-scale multimedia data storage and processing on Hadoop platform , 2013, The Journal of Supercomputing.

[98]  N Peek,et al.  Technical Challenges for Big Data in Biomedicine and Health: Data Sources, Infrastructure, and Analytics , 2014, Yearbook of Medical Informatics.

[99]  E. Schadt The changing privacy landscape in the era of big data , 2012, Molecular systems biology.

[100]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[101]  Lorrie Faith Cranor,et al.  Engineering Privacy , 2009, IEEE Transactions on Software Engineering.

[102]  Perry L. Miller,et al.  Viewpoint: Opportunities at the Intersection of Bioinformatics and Health Informatics: A Case Study , 2000, J. Am. Medical Informatics Assoc..

[103]  Veda C. Storey,et al.  Business Intelligence and Analytics: From Big Data to Big Impact , 2012, MIS Q..

[104]  Sandeep Tata,et al.  BlueSNP: R package for highly scalable genome-wide association studies using Hadoop clusters , 2013, Bioinform..

[105]  Wei Hu,et al.  Towards a real-time big data analytics platform for health applications , 2017, Int. J. Big Data Intell..

[106]  Brian Hayes,et al.  What Is Cloud Computing? , 2019, Cloud Technologies.

[107]  Yike Guo,et al.  High dimensional biological data retrieval optimization with NoSQL technology , 2014, BMC Genomics.

[108]  M M Hansen,et al.  Big Data in Science and Healthcare: A Review of Recent Literature and Perspectives , 2014, Yearbook of Medical Informatics.

[109]  Elizabeth M. Borycki,et al.  A Comparison of National Health Data Interoperability Approaches in Taiwan, Denmark and Canada , 2011 .

[110]  Frans Coenen DOI: 10.1017/S000000000000000 Printed in the United Kingdom Data Mining: Past, Present and Future , 2022 .

[111]  Naveen Ashish,et al.  The Abzooba Smart Health Informatics Platform (SHIP) TM - From Patient Experiences to Big Data to Insights , 2012, ArXiv.

[112]  Anurag Barthwal,et al.  Big Data Analytics using Hadoop , 2014 .

[113]  A Ziegler,et al.  Data Analysis and Data Mining: Current Issues in Biomedical Informatics , 2011, Methods of Information in Medicine.

[114]  Gabriel Antoniu,et al.  Optimizing intermediate data management in MapReduce computations , 2011, CloudCP '11.

[115]  Yao Sun,et al.  HBase, MapReduce, and Integrated Data Visualization for Processing Clinical Signal Data , 2011, AAAI Spring Symposium: Computational Physiology.

[116]  P. O'Sullivan,et al.  Applying data models to big data architectures , 2014, IBM J. Res. Dev..

[117]  Hans De Sterck,et al.  Supporting multi-row distributed transactions with global snapshot isolation using bare-bones HBase , 2010, 2010 11th IEEE/ACM International Conference on Grid Computing.

[118]  Jyotsna Talreja Wassan Modelling Stack Framework for Accessing Electronic Health Records with Big Data Needs , 2014 .

[119]  Tony R. Sahama,et al.  Health big data analytics: current perspectives, challenges and potential solutions , 2014, Int. J. Big Data Intell..

[120]  Sreekanth Rallapalli,et al.  Impact of Processing and Analyzing Healthcare Big Data on Cloud Computing Environment by Implementing Hadoop Cluster , 2016 .

[121]  J. Saltz,et al.  Hadoop-GIS : A High Performance Spatial Query System for Analytical Medical Imaging with MapReduce , 2012 .

[122]  Jimeng Sun,et al.  Big data analytics for healthcare , 2013, KDD.

[123]  Sherif Sakr,et al.  The family of mapreduce and large-scale data processing systems , 2013, CSUR.

[124]  Karin M. Verspoor,et al.  Big Data in Medicine Is Driving Big Changes , 2014, Yearbook of Medical Informatics.

[125]  Kimberlyn M. McGrail,et al.  Privacy by Design at Population Data BC: a case study describing the technical, administrative, and physical controls for privacy-sensitive secondary use of personal information for research in the public interest , 2013, J. Am. Medical Informatics Assoc..

[126]  Clarence J M Tauro,et al.  Comparative Study of the New Generation, Agile, Scalable, High Performance NOSQL Databases , 2012 .

[127]  Domenico Talia,et al.  P2P-MapReduce: Parallel data processing in dynamic Cloud environments , 2012, J. Comput. Syst. Sci..

[128]  Anjana Gosain,et al.  New Design Principles for Effective Knowledge Discovery from Big Data , 2014 .

[129]  Patrick B. Ryan,et al.  Big data, big results: Knowledge discovery in output from large‐scale analytics , 2014, Stat. Anal. Data Min..

[130]  Nigam H. Shah,et al.  The coming age of data-driven medicine: translational bioinformatics' next frontier , 2012, J. Am. Medical Informatics Assoc..

[131]  M. Grossglauser,et al.  Data-driven healthcare: from patterns to actions , 2014, European journal of preventive cardiology.

[132]  William H. Sanders,et al.  Failure scenario as a service (FSaaS) for Hadoop clusters , 2012, SDMCMM '12.

[133]  Mary Czerwinski,et al.  Interactions with big data analytics , 2012, INTR.

[134]  Yanpei Chen,et al.  Interactive Analytical Processing in Big Data Systems: A Cross-Industry Study of MapReduce Workloads , 2012, Proc. VLDB Endow..

[135]  Nada Lavrac,et al.  Data mining and visualization for decision support and modeling of public health-care resources , 2007, J. Biomed. Informatics.

[136]  Akhil Mittal Trustworthiness of Big Data , 2013 .

[137]  Adam Lith,et al.  Investigating storage solutions for large data - A comparison of well performing and scalable data storage solutions for real time extraction and batch insertion of data , 2010 .

[138]  Wei Hu,et al.  Design and Construction of a Big Data Analytics Framework for Health Applications , 2015, 2015 IEEE International Conference on Smart City/SocialCom/SustainCom (SmartCity).

[139]  Hsinchun Chen,et al.  Knowledge Management, Data Mining, and Text Mining in Medical Informatics , 2005 .

[140]  Byeong-Soo Jeong,et al.  An Efficient Distributed Programming Model for Mining Useful Patterns in Big Datasets , 2013 .

[141]  C E Kuziemsky,et al.  Big Data in Healthcare - Defining the Digital Persona through User Contexts from the Micro to the Macro. Contribution of the IMIA Organizational and Social Issues WG. , 2014, Yearbook of medical informatics.

[142]  Byeong-Soo Jeong,et al.  A MapReduce Framework for Mining Maximal Contiguous Frequent Patterns in Large DNA Sequence Datasets , 2012 .

[143]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[144]  Viju Raghupathi,et al.  Big data analytics in healthcare: promise and potential , 2014, Health Information Science and Systems.

[145]  Anwitaman Datta,et al.  Multiterm Keyword Search in NoSQL Systems , 2012, IEEE Internet Computing.

[146]  Zhongchuan Fu,et al.  Hadoop-Based Healthcare Information System Design and Wireless Security Communication Implementation , 2015, Mob. Inf. Syst..

[147]  Randy H. Katz,et al.  How Hadoop Clusters Break , 2013, IEEE Software.

[148]  Jianling Sun,et al.  Scalable RDF store based on HBase and MapReduce , 2010, 2010 3rd International Conference on Advanced Computer Theory and Engineering(ICACTE).

[149]  Yu Tian,et al.  Design and Development of a Medical Big Data Processing System Based on Hadoop , 2015, Journal of Medical Systems.

[150]  Marcelo Fiszman,et al.  Semantic Interpretation for the Biomedical Research Literature , 2005 .

[151]  Athanasios V. Vasilakos,et al.  Big data: From beginning to future , 2016, Int. J. Inf. Manag..

[152]  M Markus Maier,et al.  Towards a big data reference architecture , 2013 .

[153]  Stefan Debortoli,et al.  Comparing Business Intelligence and Big Data Skills , 2014, Business & Information Systems Engineering.

[154]  Xin Chen,et al.  Failure Analysis of Jobs in Compute Clouds: A Google Cluster Case Study , 2014, 2014 IEEE 25th International Symposium on Software Reliability Engineering.

[155]  Ramón Díaz-Uriarte,et al.  Gene selection and classification of microarray data using random forest , 2006, BMC Bioinformatics.

[156]  M. Eric Johnson,et al.  Usability Failures and Healthcare Data Hemorrhages , 2011, IEEE Security & Privacy.

[157]  Evon M. O. Abu-Taieh,et al.  Comparative Study , 2020, Definitions.

[158]  Kai Wang,et al.  BioPig: a Hadoop-based analytic toolkit for large-scale sequence data , 2013, Bioinform..

[159]  Ruay-Shiung Chang,et al.  Dynamic Deduplication Decision in a Hadoop Distributed File System , 2014, Int. J. Distributed Sens. Networks.

[160]  Brian C Sauer,et al.  Information extraction from narrative data. , 2012, American journal of health-system pharmacy : AJHP : official journal of the American Society of Health-System Pharmacists.

[161]  Richard Cumbley,et al.  Is "Big Data" creepy? , 2013, Comput. Law Secur. Rev..

[162]  Dong Yan,et al.  Using Memory in the Right Way to Accelerate Big Data Processing , 2015, Journal of Computer Science and Technology.

[163]  S de Lusignan,et al.  Big Data Usage Patterns in the Health Care Domain: A Use Case Driven Approach Applied to the Assessment of Vaccination Benefits and Risks , 2014, Yearbook of Medical Informatics.

[164]  Eero Vainikko,et al.  Adapting scientific computing problems to clouds using MapReduce , 2012, Future Gener. Comput. Syst..

[165]  Dillon Chrimes,et al.  Interactive Healthcare Big Data Analytics Platform under Simulated Performance , 2016, 2016 IEEE 14th Intl Conf on Dependable, Autonomic and Secure Computing, 14th Intl Conf on Pervasive Intelligence and Computing, 2nd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech).

[166]  Daniel M. Batista,et al.  A Survey of Large Scale Data Management Approaches in Cloud Environments , 2011, IEEE Communications Surveys & Tutorials.