Distribution Regression for Sequential Data

Distribution regression refers to the supervised learning problem where labels are only available for groups of inputs instead of individual inputs. In this paper, we develop a rigorous mathematical framework for distribution regression where inputs are complex data streams. Leveraging properties of the expected signature and a recent signature kernel trick for sequential data from stochastic analysis, we introduce two new learning techniques, one feature-based and the other kernel-based. Each is suited to a different data regime in terms of the number of data streams and the dimensionality of the individual streams. We provide theoretical results on the universality of both approaches and demonstrate empirically their robustness to irregularly sampled multivariate time-series, achieving state-of-the-art performance on both synthetic and real-world examples from thermodynamics, mathematical finance and agricultural science.
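
As a rough illustration of the feature-based route mentioned above, the sketch below estimates the truncated expected signature of each bag of streams by averaging truncated path signatures, then fits a standard regressor on those features. The use of the `iisignature` library, the truncation depth, the toy data and the ridge regressor are illustrative assumptions, not the paper's exact pipeline.

```python
import numpy as np
import iisignature
from sklearn.linear_model import Ridge

def expected_signature(bag, depth=3):
    """Estimate the truncated expected signature of a bag of streams
    by averaging the truncated signature of each stream.

    bag: list of arrays, each of shape (length_i, d), one stream per entry.
    Returns a vector of length d + d**2 + ... + d**depth.
    """
    return np.mean([iisignature.sig(stream, depth) for stream in bag], axis=0)

# Toy data: each input is a bag of 2-d streams; each bag carries one scalar label
# (here, the bag-average terminal value of the first coordinate).
rng = np.random.default_rng(0)
bags = [[rng.standard_normal((50, 2)).cumsum(axis=0) for _ in range(20)]
        for _ in range(100)]
labels = np.array([np.mean([s[-1, 0] for s in bag]) for bag in bags])

# Bag-level features via the empirical expected signature, then plain ridge regression.
X = np.stack([expected_signature(bag, depth=3) for bag in bags])
model = Ridge(alpha=1.0).fit(X, labels)
```

Because the features are computed per stream before averaging, streams within a bag may be irregularly sampled or of different lengths, which is the robustness property highlighted in the abstract.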
