Structural model discovery in temporal event data streams

This dissertation presents a unique approach to human behavior analysis based on expert guidance and intervention through the interactive construction and modification of behavior models. We introduce the research area of behavior analysis, the challenges the field faces, and the approaches currently available, and then present a new analysis approach: Interactive Relevance Search and Modeling (IRSM). More intelligent ways of conducting data analysis have been explored in recent years. Machine learning and data mining systems that apply pattern classification and discovery to non-textual data promise new generations of powerful “crawlers” for knowledge discovery, e.g., face detection and crowd surveillance. Such systems can capture many aspects of the data, such as temporal information and extractable visual information (color, contrast, shape, etc.). However, these captured aspects may not uncover all salient information in the data or provide adequate models or patterns of the phenomena of interest. This is a challenging problem for social scientists who are trying to identify high-level, conceptual patterns of human behavior from observational data (e.g., media streams). The presented research addresses how social scientists may derive patterns of human behavior captured in media streams. Currently, media streams are segmented into sequences of events describing the actions captured in the streams, such as interactions among humans. This segmentation produces a data space that is challenging to search, because it consists of nonnumerical, temporal, descriptive data, e.g., Person A walks up to Person B at time T. This dissertation presents an approach that allows one to interactively search, identify, and discover temporal behavior patterns within such a data space. The research thus supports exploration and discovery in behavior analysis through a formalized method of assisted exploration. The model evolution presented supports refining the observer’s behavior models into representations of their understanding. The benefit of the new approach is shown through experiments on its identification accuracy and through work with fellow researchers to verify the approach’s legitimacy in the analysis of their data.

Francis K. H. Quek | Chreston A. Miller
