Rethinking 1D-CNN for Time Series Classification: A Stronger Baseline

For time series classification with a 1D-CNN, the choice of kernel size is critical: it determines whether the model can capture salient signals at the right scale in a long time series. Most existing 1D-CNN work treats the kernel size as a hyper-parameter and searches for a suitable value by grid search, which is time-consuming and inefficient. This paper theoretically analyses how the kernel size affects the performance of a 1D-CNN. Motivated by this analysis, we propose a novel Omni-Scale 1D-CNN (OS-CNN) architecture that finds suitable kernel sizes automatically during training. We develop a specific kernel-size configuration that assembles only a few kernel-size options yet covers a much larger set of receptive fields. The proposed OS-CNN is evaluated on the 85-dataset UCR archive. The experimental results demonstrate that our method is a stronger baseline on multiple performance indicators, including the critical difference diagram, the number of wins, and average accuracy. The experimental source code is published on GitHub (this https URL).
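To make the architectural idea concrete, below is a minimal, illustrative sketch, assuming a PyTorch-style implementation; the class name MultiScaleConv1dBlock, the kernel sizes, and the channel counts are placeholders of ours, not the authors' exact OS-CNN configuration. It shows the general multi-branch pattern the abstract describes: a block applies several 1D convolutions with a small set of kernel sizes in parallel and concatenates their outputs, so that stacking such blocks lets small kernels compose into a larger range of receptive fields.

import torch
import torch.nn as nn

class MultiScaleConv1dBlock(nn.Module):
    """Illustrative multi-branch 1D convolution block (not the authors' code).

    Each branch applies a different kernel size in parallel; the outputs
    are concatenated along the channel axis so later layers can draw on
    several receptive-field scales at once. The kernel sizes below are
    placeholders chosen for illustration only.
    """

    def __init__(self, in_channels, out_channels_per_branch,
                 kernel_sizes=(1, 2, 3, 5, 7)):
        super().__init__()
        self.branches = nn.ModuleList([
            nn.Sequential(
                # 'same' padding keeps the time length unchanged so the
                # branch outputs can be concatenated channel-wise.
                nn.Conv1d(in_channels, out_channels_per_branch, k,
                          padding="same"),
                nn.BatchNorm1d(out_channels_per_branch),
                nn.ReLU(),
            )
            for k in kernel_sizes
        ])

    def forward(self, x):
        # x has shape (batch, channels, time).
        return torch.cat([branch(x) for branch in self.branches], dim=1)

if __name__ == "__main__":
    block = MultiScaleConv1dBlock(in_channels=1, out_channels_per_branch=16)
    x = torch.randn(8, 1, 128)   # a batch of univariate series
    print(block(x).shape)        # torch.Size([8, 80, 128])

Running the snippet prints torch.Size([8, 80, 128]): five branches of 16 channels each, with the time length preserved by 'same' padding so the concatenation is well defined.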
