The great amount of time series generated by machines has enormous value in intelligent industry. Knowledge can be discovered from high-quality time series, and used for production optimization and anomaly detection in industry. However, the original sensors data always contain many errors. This requires a sophisticated cleaning strategy and a well-designed system for industrial data cleaning. Motivated by this, we introduce Cleanits, a system for industrial time series cleaning. It implements an integrated cleaning strategy for detecting and repairing three kinds of errors in industrial time series. We develop reliable data cleaning algorithms, considering features of both industrial time series and domain knowledge. We demonstrate Cleanits with two real datasets from power plants. The system detects and repairs multiple dirty data precisely, and improves the quality of industrial time series effectively. Cleanits has a friendly interface for users, and result visualization along with logs are available during each cleaning process. PVLDB Reference Format: Xiaoou Ding, Hongzhi Wang, Jiaxuan Su, Zijue Li, Jianzhong Li, and Hong Gao. Cleanits: A Data Cleaning System for Industrial Time Series. PVLDB, 12(12): 1786-1789, 2019. DOI: https://doi.org/10.14778/3352063.3352066
[1]
Christopher Ré,et al.
The HoloClean Framework Dataset to be cleaned Denial Constraints External Information t 1 t 4 t 2 t 3 Johnnyo ’ s
,
2017
.
[2]
Philip S. Yu,et al.
Time Series Data Cleaning: From Anomaly Detection to Anomaly Repairing
,
2017,
Proc. VLDB Endow..
[3]
J. Munkres.
ALGORITHMS FOR THE ASSIGNMENT AND TRANSIORTATION tROBLEMS*
,
1957
.
[4]
Bar Koer,et al.
Bollinger bands approach on boosting ABC algorithm and its variants
,
2016
.
[5]
Hongzhi Wang,et al.
Time Series Cleaning Under Variance Constraints
,
2018,
DASFAA Workshops.
[6]
Ira Cohen,et al.
Real-time anomaly detection system for time series at scale
,
2017,
ADF@KDD.
[7]
Jianmin Wang,et al.
Sequential Data Cleaning: A Statistical Approach
,
2016,
SIGMOD Conference.