On analyzing multiple, physiological sensor databases

Recent technological advances are encouraging a pervasive deployment of various physiological sensors such as motion trackers, accelerometers, EMG (Electromyogram), EKG (Electro-cardiograms), and other sensors that are used for monitoring and managing medical conditions as well as human performance in sports and military operations/training. The physiological data coming from multiple sensors are generally in time series format and forms multiple, multi-dimensional time series framework. Analyzing different physiological data in such integrated environment poses major challenges: different sensors have different characteristics, different people generate different patterns through these sensors, and even for the same person the data can vary widely depending on time and environment. In this dissertation, we assess the role of joint movements and muscular activities by proposing an approach to quantify integrated features of biomechanical kinematics with electrophysiology. We study content-based retrieval and cluster analysis of human motions based on the similarity of joint movements and electromyogram activity. Next, applications with multiple sensor systems require efficient access to large-scale, heterogeneous multi-dimensional data sets. To achieve this objective, we propose an integrated indexing structure based on bi-level spatial grids to efficiently support content-based queries on such multiple, high-dimensional data sets. Time series pattern mining (TSPM) finds correlations or dependencies in same series or in multiple time series. When the numerous instances of multiple time series data are associated with different quantitative attributes, they form a multiple multidimensional framework. Finding frequent patterns and ultimately, high confident association rules in such multidimensional environment is always a challenge. In this study, our emphasis is on discovering frequent patterns in multiple time series through sequential mining across varying time slices, and also mining quantitative attributes of only those time series that are present in the discovered patterns. Analyzing heterogeneous, high-dimensional physiologic and motoric streams to quantify the human performance and at the same time, provide visualization for performances of participants in low-dimensional space for easier interpretation is the another important issue discussed in this dissertation. We proposed an efficient, multidimensional factor analysis technique for analyzing and visualizing body sensor network data across different participants. Finally, in this study we discuss the experiments conducted on real-world data of human body motions and muscular functions that demonstrates the applicability of our approaches in practical applications such as physical medicines, medical rehabilitations, training and sports performances.

[1]  Eamonn J. Keogh,et al.  Mining motifs in massive time series databases , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[2]  Cyrus Shahabi,et al.  A multilevel distance-based index structure for multivariate time series , 2005, 12th International Symposium on Temporal Representation and Reasoning (TIME'05).

[3]  Hans-Peter Kriegel,et al.  The X-tree : An Index Structure for High-Dimensional Data , 2001, VLDB.

[4]  Jian Pei,et al.  Mining sequential patterns with constraints in large databases , 2002, CIKM '02.

[5]  J. Eng,et al.  Biomechanics of reaching: clinical implications for individuals with acquired brain injury , 2002, Disability and rehabilitation.

[6]  Hongjun Lu,et al.  Stock movement prediction and N-dimensional inter-transaction association rules , 1998, SIGMOD 1998.

[7]  Susana Nascimento Fuzzy Clustering Via Proportional Membership Model , 2005 .

[8]  Jitender S. Deogun,et al.  Discovering Sequential Association Rules with Constraints and Time Lags in Multiple Sequences , 2002, ISMIS.

[9]  Jian-Kang Wu Content-Based Indexing of Multimedia Databases , 1997, IEEE Trans. Knowl. Data Eng..

[10]  Euripides G. M. Petrakis,et al.  Similarity Searching in Medical Image Databases , 1997, IEEE Trans. Knowl. Data Eng..

[11]  R J Jagacinski,et al.  Manual performance of a repeated pattern by older and younger adults with supplementary auditory cues. , 1993, Psychology and aging.

[12]  Qiming Chen,et al.  PrefixSpan,: mining sequential patterns efficiently by prefix-projected pattern growth , 2001, Proceedings 17th International Conference on Data Engineering.

[13]  R.N. Scott,et al.  A new strategy for multifunction myoelectric control , 1993, IEEE Transactions on Biomedical Engineering.

[14]  Johannes Peltola,et al.  Activity classification using realistic data from wearable sensors , 2006, IEEE Transactions on Information Technology in Biomedicine.

[15]  B. Prabhakaran,et al.  Analyzing motoric and physiological data in describing upper extremity movement in the aged , 2008, PETRA '08.

[16]  Ambuj K. Singh,et al.  Similarity searching for multi-attribute sequences , 2002, Proceedings 14th International Conference on Scientific and Statistical Database Management.

[17]  P. Derambure,et al.  Effect of age on anticipatory postural adjustments in unilateral arm movement. , 2006, Gait & posture.

[18]  Umeshwar Dayal,et al.  FreeSpan: frequent pattern-projected sequential pattern mining , 2000, KDD '00.

[19]  Kyuseok Shim,et al.  SPIRIT: Sequential Pattern Mining with Regular Expression Constraints , 1999, VLDB.

[20]  David B. Lomet,et al.  The hB-tree: a multiattribute indexing method with good guaranteed performance , 1990, TODS.

[21]  Raghu Ramakrishnan,et al.  Bottom-up computation of sparse and Iceberg CUBE , 1999, SIGMOD '99.

[22]  Jérôme Pagès,et al.  Hierarchical Multiple Factor Analysis: application to the comparison of sensory profiles , 2003 .

[23]  Aidong Zhang,et al.  ClusterTree: Integration of Cluster Representation and Nearest-Neighbor Search for Large Data Sets with High Dimensions , 2003, IEEE Trans. Knowl. Data Eng..

[24]  B. Prabhakaran,et al.  Indexing 3-D Human Motion Repositories for Content-Based Retrieval , 2009, IEEE Transactions on Information Technology in Biomedicine.

[25]  Geoff Hulten,et al.  Mining high-speed data streams , 2000, KDD '00.

[26]  Mohammed J. Zaki,et al.  SPADE: An Efficient Algorithm for Mining Frequent Sequences , 2004, Machine Learning.

[27]  Johannes Gehrke,et al.  Sequential PAttern mining using a bitmap representation , 2002, KDD.

[28]  Wei Wang,et al.  A system for analyzing and indexing human-motion databases , 2005, SIGMOD '05.

[29]  F. Lacquaniti,et al.  The role of preparation in tuning anticipatory and reflex responses during catching , 1989, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[30]  D Graupe,et al.  Multifunctional prosthesis and orthosis control via microcomputer identification of temporal pattern differences in single-site myoelectric signals. , 1982, Journal of biomedical engineering.

[31]  T. Kuo,et al.  A comparative analysis of various EMG pattern recognition methods. , 1996, Medical engineering & physics.

[32]  Jeffrey F. Naughton,et al.  On the Computation of Multidimensional Aggregates , 1996, VLDB.

[33]  Myrtille Vivien,et al.  A generalization of STATIS-ACT strategy: DO-ACT for two multiblocks tables , 2004, Comput. Stat. Data Anal..

[34]  Mihai Nadin,et al.  Mind: Anticipation and Chaos , 1995 .

[35]  Dimitrios Gunopulos,et al.  Indexing Large Human-Motion Databases , 2004, VLDB.

[36]  G E Stelmach,et al.  Aging and rapid aiming arm movement control. , 1998, Experimental aging research.

[37]  Jeffrey D. Ullman,et al.  Implementing data cubes efficiently , 1996, SIGMOD '96.

[38]  Christos Faloutsos,et al.  The TV-tree: An index structure for high-dimensional data , 1994, The VLDB Journal.

[39]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[40]  El Mostafa Qannari,et al.  Comparing Generalized Procrustes Analysis and STATIS , 1998 .

[41]  G Rau,et al.  Movement biomechanics goes upwards: from the leg to the arm. , 2000, Journal of biomechanics.

[42]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[43]  E. Finch Physical rehabilitation outcome measures : a guide to enhanced clinical decision making , 2002 .

[44]  Feng Liu,et al.  3D motion retrieval with motion index tree , 2003, Comput. Vis. Image Underst..

[45]  Donald J. Berndt,et al.  Finding Patterns in Time Series: A Dynamic Programming Approach , 1996, Advances in Knowledge Discovery and Data Mining.

[46]  Robert Sabatier,et al.  The ACT (STATIS method) , 1994 .

[47]  Dimitrios Gunopulos,et al.  Indexing multi-dimensional time-series with support for multiple distance measures , 2003, KDD '03.

[48]  Umeshwar Dayal,et al.  Multi-dimensional sequential pattern mining , 2001, CIKM '01.

[49]  Anthony K. H. Tung,et al.  Breaking the barrier of transactions: mining inter-transaction association rules , 1999, KDD '99.

[50]  Deok-Hwan Kim,et al.  Similarity search for multidimensional data sequences , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[51]  Hans-Peter Kriegel,et al.  The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.

[52]  B. Prabhakaran,et al.  Hierarchical Indexing Structure for 3D Human Motions , 2007, MMM.

[53]  Chih-Yi Chiu,et al.  Motion retrieval and synthesis based on posture features indexing , 2003, Proceedings Fifth International Conference on Computational Intelligence and Multimedia Applications. ICCIMA 2003.

[54]  Daniel A. Keim,et al.  A pivot-based index structure for combination of feature vectors , 2005, SAC '05.

[55]  Bruce C. Wheeler,et al.  EMG feature evaluation for movement control of upper extremity prostheses , 1995 .

[56]  Georgios Evangelidis,et al.  The hB $^\Pi$-tree: a multi-attribute index supporting concurrency, recovery and node consolidation , 1997, The VLDB Journal.

[57]  A. Akhmetova Discovery of Frequent Episodes in Event Sequences , 2006 .

[58]  Hans-Peter Kriegel,et al.  Optimal multi-step k-nearest neighbor search , 1998, SIGMOD '98.

[59]  Alexander Thomasian,et al.  CSVD: Clustering and Singular Value Decomposition for Approximate Similarity Search in High-Dimensional Spaces , 2003, IEEE Trans. Knowl. Data Eng..

[60]  Eamonn J. Keogh,et al.  A Probabilistic Approach to Fast Pattern Matching in Time Series Databases , 1997, KDD.

[61]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[62]  Jaap H van Dieën,et al.  EMG modulation in anticipation of a possible trip during walking in young and older adults. , 2006, Journal of electromyography and kinesiology : official journal of the International Society of Electrophysiological Kinesiology.

[63]  Chih-Yi Chiu,et al.  Content-based retrieval for human motion data , 2004, J. Vis. Commun. Image Represent..

[64]  B. Prabhakaran,et al.  A similarity measure for motion stream segmentation and recognition , 2005, MDM '05.

[65]  Surajit Chaudhuri,et al.  An overview of data warehousing and OLAP technology , 1997, SGMD.

[66]  Richard Shiavi,et al.  Automated Extraction of Activity Features in Linear Envelopes of Locomotor Electromyographic Patterns , 1986, IEEE Transactions on Biomedical Engineering.

[67]  Morris Milner,et al.  An Optimality Criterion for Processing Electromyographic (EMG) Signals Relating to Human Locomotion , 1978, IEEE Transactions on Biomedical Engineering.

[68]  Shin'ichi Satoh,et al.  The SR-tree: an index structure for high-dimensional nearest neighbor queries , 1997, SIGMOD '97.

[69]  Philip S. Yu,et al.  Mining long sequential patterns in a noisy environment , 2002, SIGMOD '02.

[70]  Gene H. Golub,et al.  Matrix computations , 1983 .

[71]  Jérôme Pagès,et al.  Multiple factor analysis (AFMULT package) , 1994 .

[72]  Jiawei Han,et al.  Mining Segment-Wise Periodic Patterns in Time-Related Databases , 1998, KDD.

[73]  Ramesh C. Jain,et al.  Similarity indexing with the SS-tree , 1996, Proceedings of the Twelfth International Conference on Data Engineering.

[74]  Beng Chin Ooi,et al.  Hierarchical Indexing Structure for Efficient Similarity Search in Video Retrieval , 2006, IEEE Transactions on Knowledge and Data Engineering.

[75]  Guido Moerkotte,et al.  Indexing Multiple Sets , 1994, VLDB.

[76]  Alberto O. Mendelzon,et al.  Querying Time Series Data Based on Similarity , 2000, IEEE Trans. Knowl. Data Eng..

[77]  Johannes Gehrke,et al.  BOAT—optimistic decision tree construction , 1999, SIGMOD '99.

[78]  D A Winter,et al.  Pathologic gait diagnosis with computer-averaged electromyographic profiles. , 1984, Archives of physical medicine and rehabilitation.

[79]  F. Mohd-Yasin,et al.  Techniques of EMG signal analysis: detection, processing, classification and applications , 2006, Biological Procedures Online.

[80]  B. Prabhakaran,et al.  Indexing of variable length multi-attribute motion data , 2004, MMDB '04.

[81]  El Mostafa Qannari,et al.  A hierarchy of models for analysing sensory data , 1995 .

[82]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[83]  Sridhar Ramaswamy,et al.  Cyclic association rules , 1998, Proceedings 14th International Conference on Data Engineering.

[84]  George Karypis,et al.  SLPMiner: an algorithm for finding frequent sequential patterns using length-decreasing support constraint , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[85]  R. Merletti,et al.  Spatio-temporal evaluation of neck muscle activation during postural perturbations in healthy subjects. , 2004, Journal of electromyography and kinesiology : official journal of the International Society of Electrophysiological Kinesiology.

[86]  Haiying Wang,et al.  An integrative and interactive framework for improving biomedical pattern discovery and visualization , 2004, IEEE Transactions on Information Technology in Biomedicine.

[87]  Pascal Schlich,et al.  Defining and Validating Assessor Compromises About Product Distances and Attribute Correlations , 1996 .

[88]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[89]  J. Tukey,et al.  Multiple-Factor Analysis , 1947 .

[90]  Tido Röder,et al.  Efficient content-based retrieval of motion capture data , 2005, SIGGRAPH 2005.

[91]  Xiaoming Jin,et al.  Indexing and Mining of the Local Patterns in Sequence Database , 2002, IDEAL.

[92]  Jiawei Han,et al.  Efficient mining of partial periodic patterns in time series database , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[93]  C. J. Jones,et al.  Age and physical activity effects on reaction time and digit symbol substitution performance in cognitively active adults. , 1993, Research quarterly for exercise and sport.

[94]  B. Prabhakaran,et al.  Integration of Motion Capture and EMG data for Classifying the Human Motions , 2007, 2007 IEEE 23rd International Conference on Data Engineering Workshop.

[95]  Ramakrishnan Srikant,et al.  Mining Sequential Patterns: Generalizations and Performance Improvements , 1996, EDBT.

[96]  P. Robert,et al.  A Unifying Tool for Linear Multivariate Statistical Methods: The RV‐Coefficient , 1976 .

[97]  Vasil Simeonov,et al.  STATIS, a three-way method for data analysis. Application to environmental data , 2004 .

[98]  Nicola J. Ferrier,et al.  Automated analysis of repetitive joint motion , 2003, IEEE Transactions on Information Technology in Biomedicine.

[99]  R Gottsdanker,et al.  Age and simple reaction time. , 1982, Journal of gerontology.

[100]  I. Melzer,et al.  Age-Related Changes of Postural Control: Effect of Cognitive Tasks , 2001, Gerontology.

[101]  D. Wade,et al.  Measurement in neurological rehabilitation. , 1992, Current opinion in neurology and neurosurgery.

[102]  Hans-Jörg Schek,et al.  A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces , 1998, VLDB.

[103]  J. T. Robinson,et al.  The K-D-B-tree: a search structure for large multidimensional dynamic indexes , 1981, SIGMOD '81.

[104]  Kyriakos Mouratidis,et al.  Aggregate nearest neighbor queries in spatial databases , 2005, TODS.

[105]  N. P. Reddy,et al.  Fractal analysis of surface EMG signals from the biceps. , 1997, International journal of medical informatics.

[106]  Marios Hadjieleftheriou,et al.  R-Trees - A Dynamic Index Structure for Spatial Searching , 2008, ACM SIGSPATIAL International Workshop on Advances in Geographic Information Systems.

[107]  B. Prabhakaran,et al.  Analysis of human performance using physiological data streams , 2008, BODYNETS.

[108]  Jeffrey F. Naughton,et al.  An array-based algorithm for simultaneous multidimensional aggregates , 1997, SIGMOD '97.

[109]  Ke Wang,et al.  Growing decision trees on support-less association rules , 2000, KDD '00.

[110]  Jeffrey M. Hausdorff,et al.  Multiscale entropy analysis of human gait dynamics. , 2003, Physica A.

[111]  Ruoming Jin,et al.  Efficient decision tree construction on streaming data , 2003, KDD '03.

[112]  L. D. de Souza,et al.  A comparison of the timing of muscle activity during sitting down compared to standing up. , 2000, Physiotherapy research international : the journal for researchers and clinicians in physical therapy.

[113]  Nick Roussopoulos,et al.  Nearest neighbor queries , 1995, SIGMOD '95.

[114]  David B. Lomet,et al.  Access method concurrency with recovery , 1992, SIGMOD '92.

[115]  Jesse S. Jin,et al.  SS+ tree: an improved index structure for similarity searches in a high-dimensional feature space , 1997, Electronic Imaging.

[116]  Jean-Xavier Guinard,et al.  Use of the STATIS method to analyze time-intensity profiling data , 2004 .

[117]  F. K. Lam,et al.  Fuzzy EMG classification for prosthesis control. , 2000, IEEE transactions on rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society.

[118]  Heikki Mannila,et al.  Rule Discovery from Time Series , 1998, KDD.

[119]  Padhraic Smyth,et al.  An Information Theoretic Approach to Rule Induction from Databases , 1992, IEEE Trans. Knowl. Data Eng..

[120]  H. Buchner The Grid File : An Adaptable , Symmetric Multikey File Structure , 2001 .

[121]  Adrian Del Boca Myoelectric signal recognition using artificial neural networks in real time , 1993 .

[122]  Beng Chin Ooi,et al.  Indexing the Distance: An Efficient Method to KNN Processing , 2001, VLDB.

[123]  B. Prabhakaran,et al.  An Integrated Mobile Wireless System for Capturing Physiological Data Streams during a Cognitive-motor Task: Applications for Aging , 2007, 2007 IEEE Dallas Engineering in Medicine and Biology Workshop.

[124]  Jiawei Han,et al.  Metarule-Guided Mining of Multi-Dimensional Association Rules Using Data Cubes , 1997, KDD.

[125]  Hamid Pirahesh,et al.  Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals , 1996, Data Mining and Knowledge Discovery.