Diversification applications of network fully combined with the people’s daily activities and life. All network activities generate and record the large amount of data that implies the business values of enterprises and organizations. Collecting, analyzing and visualizing the large amount of data, intelligent information may be efficiently extracted. Big data applications can help enterprises enhance market competitiveness advantages, and assist government units improve the people daily life quality. However, big data collected from network and IoT (Internet of Things) environment existed many quality defects and problems to be resolved. Data quality of big data will directly impact the analysis results of big data, and may cause wrong decisions, inaccurate predication, imperfect planning and arrangements. Data preprocessing is an important procedure of big data applications. How to ensure data preprocessing tasks quality has become a concern issue of big data applications. Based on the review activities, this paper proposes the Preprocessing Tasks Quality Measurement (PTQM) model to identify the quality defects of data preprocessing tasks. Applying Data Preprocessing Quality Management (DPQM) procedure timely modifies the preprocessing tasks quality defects to increase the big data applications efficiency and practicality.
[1]
Victoria L. Rubin,et al.
Veracity Roadmap: Is Big Data Objective, Truthful and Credible?
,
2014
.
[2]
C. L. Philip Chen,et al.
Data-intensive applications, challenges, techniques and technologies: A survey on Big Data
,
2014,
Inf. Sci..
[3]
Yangyong Zhu,et al.
The Challenges of Data Quality and Data Quality Assessment in the Big Data Era
,
2015,
Data Sci. J..
[4]
Divesh Srivastava,et al.
Data quality: The other face of Big Data
,
2014,
2014 IEEE 30th International Conference on Data Engineering.
[5]
Paul Zikopoulos,et al.
Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data
,
2011
.
[6]
Rachida Dssouli,et al.
Big Data Pre-processing: A Quality Framework
,
2015,
2015 IEEE International Congress on Big Data.
[7]
Norman E. Fenton,et al.
Software Metrics: A Rigorous Approach
,
1991
.
[8]
Ahmed Elragal,et al.
Big Data Analytics: A Literature Review Paper
,
2014,
ICDM.