We propose a Case-Based Reasoning(CBR) approach for content selection, which is an intermediate step towards generating textual summaries of time series data in the weather prediction domain. Specifically, we handle two significant challenges, the first involving multivariate data that warrants modeling of the interaction of two ’channels’ (wind speed and direction in our context) and the second involving the effective integration of domain-specific knowledge in the form of rules with data from a case library of past instances of content selection. We present an approach that uses domain knowledge to transform a given raw time series instance into a representation that facilitates effective retrieval of relevant cases, which are then used for change point prediction. We empirically demonstrate that our approach combining CBR and domain rules outperforms classical content selection mechanisms that are based on rules or heuristics alone as well as those that are purely data-driven.
[1]
Janet L. Kolodner,et al.
An introduction to case-based reasoning
,
1992,
Artificial Intelligence Review.
[2]
Jim Hunter,et al.
Segmenting Time Series for Weather Forecasting
,
2003
.
[3]
Sutanu Chakraborti,et al.
Learning to Summarize Time Series Data
,
2014,
CICLing.
[4]
Somayajulu Sripada,et al.
SUMTIME-METEO: Parallel Corpus of Naturally Occurring Forecast Texts and Weather Data
,
2008
.
[5]
Anja Belz,et al.
Automatic generation of weather forecast texts using comprehensive probabilistic generation-space models
,
2008,
Natural Language Engineering.
[6]
S. Chiba,et al.
Dynamic programming algorithm optimization for spoken word recognition
,
1978
.
[7]
Ibrahim Adeyanju.
Generating Weather Forecast Texts with Case Based Reasoning
,
2015,
ArXiv.