An effective method to analyze variations of high-dimensional patterns over medical streams

In medical field, patterns over time-varied data streams usually imply high domain value. The variations of patterns can often be very complex and hard to evaluate. Traditional methods usually take each pattern as a whole to analyze data stream variations or only focus on one type of variation; however, few works have achieved a widely applicable resolution. This paper considers the feature of sub parts for data stream patterns and studies their variations and relationships from the perspective of multiple dimensions, to explore a comprehensive understanding for the variation history and effectively support different types of queries to help analyze the variations. This paper first decomposes patterns into different dimensions and then evaluates the variations of each dimension. After that, a data cube called VS-Cube is used to find out the variations of a single dimension as well as the relationships between different dimensions within a certain pattern. At last, a case study on disease MI over medical stream is given to demonstrate the effectiveness and efficiency of our proposed methods.

[1]  Lei Chen,et al.  On The Marriage of Lp-norms and Edit Distance , 2004, VLDB.

[2]  Hongyan Li,et al.  Effective variation management for pseudo periodical streams , 2007, SIGMOD '07.

[3]  Like Gao,et al.  Continually evaluating similarity-based pattern queries on a streaming time series , 2002, SIGMOD '02.

[4]  Jignesh M. Patel,et al.  An efficient and accurate method for evaluating time series similarity , 2007, SIGMOD '07.

[5]  M. Tamer Özsu,et al.  Mining data streams with periodically changing distributions , 2009, CIKM.

[6]  Like Gao,et al.  Continuous similarity-based queries on streaming time series , 2005, IEEE Transactions on Knowledge and Data Engineering.

[7]  Majid Sarrafzadeh,et al.  A Monte Carlo approach to biomedicai time series search , 2012, 2012 IEEE International Conference on Bioinformatics and Biomedicine.

[8]  Yang-Sae Moon,et al.  Similar sequence matching supporting variable-length and variable-tolerance continuous queries on time-series data stream , 2008, Inf. Sci..

[9]  Eamonn J. Keogh,et al.  An online algorithm for segmenting time series , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[10]  S. Muthukrishnan,et al.  One-Pass Wavelet Decompositions of Data Streams , 2003, IEEE Trans. Knowl. Data Eng..

[11]  Eamonn J. Keogh,et al.  A symbolic representation of time series, with implications for streaming algorithms , 2003, DMKD '03.

[12]  Christos Faloutsos,et al.  Adaptive, unsupervised stream mining , 2004, The VLDB Journal.

[13]  Majid Sarrafzadeh,et al.  Salient Segmentation of Medical Time Series Signals , 2011, 2011 IEEE First International Conference on Healthcare Informatics, Imaging and Systems Biology.

[14]  Donghui Zhang,et al.  Online event-driven subsequence matching over financial data streams , 2004, SIGMOD '04.

[15]  Steve B. Jiang,et al.  A finite state model for respiratory motion analysis in image guided radiation therapy. , 2004, Physics in medicine and biology.

[16]  Yixin Chen,et al.  Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams , 2005, Distributed and Parallel Databases.

[17]  Hongyan Li,et al.  Diversity-Based Load Shedding Strategy over Pattern Streams , 2008, 2008 International Conference on Computer Science and Software Engineering.

[18]  Christos Faloutsos,et al.  FTW: fast similarity search under the time warping distance , 2005, PODS.

[19]  Hongyan Li,et al.  VS-Cube: Analyzing Variations of Multi-dimensional Patterns over Data Streams , 2012, ADMA.