A distributed, proactive intelligent scheme for securing quality in large scale data processing
