Deep Learning Based Imbalanced Data Classification and Information Retrieval for Multimedia Big Data

of a dissertation at the University of Miami. Dissertation supervised by Professor Mei-Ling Shyu. No. of pages in text. (153) The development in information science has enabled an explosive growth of data, which attracts more and more researchers to engage in the field of big data analytics. Noticeably, in many real-world applications, large amounts of data are imbalanced data since the events of interests occur infrequently. Classification of imbalanced data is an important research problem as lots of real-world datasets have skewed class distributions in which the majority of instances (examples) belong to one class and far fewer instances belong to the others. A classifier induced from an imbalanced dataset is more likely to be biased towards the majority classes and shows very poor classification accuracy on the minority classes. While in many applications, the minority instances actually represent the concept of interest (e.g., fraud in banking operations, abnormal cell in medical data, etc.), and the detection of these rare events has become more important. Despite extensive research efforts, rare event mining remains one of the most challenging problems in information retrieval, especially for multimedia big data. To tackle this challenge, in this dissertation, we propose an extended deep learning approach to achieve promising performance in classifying largely skewed multimedia dataset. Specifically, we investigate the integration of bootstrapping methods and a state-of-the-art deep learning approach, Convolutional Neural Networks (CNNs), with extensive empirical studies. Considering the fact that deep learning approaches such as CNNs are usually computationally expensive, we propose to feed low-level features to CNNs and prove its feasibility in achieving promising performance while saving a lot of training time. Furthermore, since big training datasets are required to train CNNs, we propose to extract features from pre-trained CNN models and feed those features to another full connected neural network. Implementations in big data environments show promising performance of our model in handling big datasets with respect to feasibility and scalability. In order to further improve the classification results and bridge the semantic gap between high-level concepts and low-level visual features, correlation discovery in semantic concept mining is worth exploring. Though inter-concept correlations have been utilized to address this issue recently, the very small number of instances in the minority classes often lead to the detection of imprecise correlations and unsatisfactory classification results. Meanwhile, correlation discovery is a computationally intensive task in the sense that it requires a deep analysis of very large and growing repositories. This dissertation further proposes a novel concept correlation analysis strategy framework that utilizes the correlations between the retrieval scores and labels. By integrating the correlation information, the proposed framework can help imbalanced data classification and enhance rare class (event or concept) mining even with trivial scores from the minority classes. Not only deep learning but also numerous other classification algorithms have been developed for a variety of data types. However, it is nearly impossible for one classifier to perform the best in all kinds of datasets all the time. Therefore, ensemble learning models which aim to take advantages of different classifiers have received a lot of attentions recently. In this dissertation, a scalable classifier ensemble framework assisted by a set of “judgers” is also proposed to integrate the outputs from multiple classifiers for multimedia big data classification. Specifically, based on the confusion matrices of different classifiers, a set of judgers are organized into a hierarchically structured decision model. A testing instance is first input to different classifiers, and then the classification results are passed to the proposed hierarchical structured decision model to derive the final result. The ensemble system can be run on Spark, which is designed for big data processing. All the proposed components are evaluated on multimedia datasets containing different kinds of data. The experimental results show the effectiveness of our framework in classifying severely imbalanced data with promising performance, and demonstrate that the proposed classifier ensemble framework outperforms several state-of-the-art model fusion approaches. Furthermore, the proposed framework is applied to two realworld applications, i.e., deep learning based text data analysis on an Amazon review dataset and efficient large-scale stance Analysis in Twitter, and achieves promising results in both. In additional, we also design a web-based information retrieval system and identify several future directions that could be explored to further improve the current work. To my dear parents and grandparents

[1]  Shu-Ching Chen,et al.  AAFA: Associative Affinity Factor Analysis for Bot Detection and Stance Classification in Twitter , 2017, 2017 IEEE International Conference on Information Reuse and Integration (IRI).

[2]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[3]  Choochart Haruechaiyasak,et al.  Collaborative Filtering by Mining Association Rules from User Access Sequences , 2005, International Workshop on Challenges in Web Information Retrieval and Integration.

[4]  Mei-Ling Shyu,et al.  Leveraging Concept Association Network for Multimedia Rare Concept Mining and Retrieval , 2012, 2012 IEEE International Conference on Multimedia and Expo.

[5]  E. Kandel An introduction to the work of David Hubel and Torsten Wiesel , 2009, The Journal of physiology.

[6]  Richard Nock,et al.  Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Rukshan Athauda,et al.  Semantic Access: Semantic Interface for Querying Databases , 2000, VLDB.

[8]  Chengcui Zhang,et al.  1 Spatio-Temporal Vehicle Tracking Using Unsupervised Learning-Based Segmentation and Object Tracking , 2015 .

[9]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[10]  Alekh Jindal,et al.  Hadoop++ , 2010 .

[11]  Blaise Agüera y Arcas,et al.  Federated Learning of Deep Networks using Model Averaging , 2016, ArXiv.

[12]  Taghi M. Khoshgoftaar,et al.  Deep learning applications and challenges in big data analytics , 2015, Journal of Big Data.

[13]  Luca Benini,et al.  Neurostream: Scalable and Energy Efficient Deep Learning with Smart Memory Cubes , 2017, IEEE Transactions on Parallel and Distributed Systems.

[14]  Chong-Wah Ngo,et al.  Domain adaptive semantic diffusion for large scale context-based video annotation , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[15]  Zhihua Cai,et al.  Evaluation Measures of the Classification Performance of Imbalanced Data Sets , 2009 .

[16]  Li Deng,et al.  A tutorial survey of architectures, algorithms, and applications for deep learning , 2014, APSIPA Transactions on Signal and Information Processing.

[17]  Shu-Ching Chen,et al.  Effective supervised discretization for classification based on correlation maximization , 2011, 2011 IEEE International Conference on Information Reuse & Integration.

[18]  Akshay Nikam,et al.  SkewBoost: An algorithm for classifying imbalanced datasets , 2011, 2011 2nd International Conference on Computer and Communication Technology (ICCCT-2011).

[19]  F ROSENBLATT,et al.  The perceptron: a probabilistic model for information storage and organization in the brain. , 1958, Psychological review.

[20]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Jiebo Luo,et al.  Recognizing realistic actions from videos “in the wild” , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  Shu-Ching Chen,et al.  An enhanced query model for soccer video retrieval using temporal relationships , 2005, 21st International Conference on Data Engineering (ICDE'05).

[23]  Kedar Tatwawadi,et al.  DeepZip: Lossless Compression using Recurrent Networks , 2017 .

[24]  Rohini K. Srihari,et al.  Feature selection for text categorization on imbalanced data , 2004, SKDD.

[25]  Shu-Ching Chen,et al.  Florida International University - University of Miami TRECVID 2018 , 2013, TRECVID.

[26]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[27]  Rangasami L. Kashyap,et al.  Augmented transition networks as video browsing models for multimedia databases and multimedia information systems , 1999, Proceedings 11th International Conference on Tools with Artificial Intelligence.

[28]  Jiri Matas,et al.  On Combining Classifiers , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  Haojie Li,et al.  TRECVid 2013 Semantic Video Concept Detection by NTT-MD-DUT , 2013, TRECVID.

[30]  Johan Bollen,et al.  Modeling Public Mood and Emotion: Twitter Sentiment and Socio-Economic Phenomena , 2009, ICWSM.

[31]  Demis Hassabis,et al.  Mastering the game of Go without human knowledge , 2017, Nature.

[32]  Xiuqi Li,et al.  An effective content-based visual image retrieval system , 2002, Proceedings 26th Annual International Computer Software and Applications.

[33]  Mei-Ling Shyu,et al.  Correlation-Assisted Imbalance Multimedia Concept Mining and Retrieval , 2017, Int. J. Semantic Comput..

[34]  Chengcui Zhang,et al.  Scene change detection by audio and video clues , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[35]  Ching-Te Chiu,et al.  Boosted multi-class object detection with parallel hardware implementation for real-time applications , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[36]  Subhash C. Bagui,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2005, Technometrics.

[37]  Min Chen,et al.  Florida public hurricane loss model: Research in multi-disciplinary system integration assisting government policy making , 2009, Gov. Inf. Q..

[38]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[39]  Mei-Ling Shyu,et al.  Negative Correlation Discovery for Big Multimedia Data Semantic Concept Mining and Retrieval , 2016, 2016 IEEE Tenth International Conference on Semantic Computing (ICSC).

[40]  Juan Liu,et al.  Will scene information help realistic action recognition? , 2012, Proceedings of the 10th World Congress on Intelligent Control and Automation.

[41]  Mohamed A. Deriche,et al.  A New Technique for Combining Multiple Classifiers using The Dempster-Shafer Theory of Evidence , 2002, J. Artif. Intell. Res..

[42]  Min Chen,et al.  A Multiple Instance Learning Approach for Content Based Image Retrieval Using One-Class Support Vector Machine , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[43]  Barbara Caputo,et al.  Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[44]  Tomas Mikolov,et al.  Bag of Tricks for Efficient Text Classification , 2016, EACL.

[45]  Shu-Ching Chen,et al.  Wavelet Analysis in Current Cancer Genome Research: A Survey , 2013, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[46]  Jake Bouvrie,et al.  Notes on Convolutional Neural Networks , 2006 .

[47]  Shu-Ching Chen,et al.  Multimedia Databases and Data Management: A Survey , 2010, Int. J. Multim. Data Eng. Manag..

[48]  Hayder Radha,et al.  An Information-Theoretic Combining Method for Multi-Classifier Anomaly Detection Systems , 2010, 2010 IEEE International Conference on Communications.

[49]  Chao Chen,et al.  Web media semantic concept retrieval via tag removal and model fusion , 2013, ACM Trans. Intell. Syst. Technol..

[50]  Shu-Ching Chen,et al.  Network intrusion detection through Adaptive Sub-Eigenspace Modeling in multiagent systems , 2007, ACM Trans. Auton. Adapt. Syst..

[51]  Mei-Ling Shyu,et al.  Effective Moving Object Detection and Retrieval via Integrating Spatial-Temporal Multimedia Information , 2012, 2012 IEEE International Symposium on Multimedia.

[52]  Mark Johnston,et al.  Developing New Fitness Functions in Genetic Programming for Classification With Unbalanced Data , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[53]  Shahram Jafari,et al.  Feature Selection in Imbalance data sets , 2012 .

[54]  Shamik Sural,et al.  Segmentation and histogram generation using the HSV color space for image retrieval , 2002, Proceedings. International Conference on Image Processing.

[55]  Zhao Li,et al.  Multimodal Sparse Linear Integration for Content-Based Item Recommendation , 2013, 2013 IEEE International Symposium on Multimedia.

[56]  Shu-Ching Chen,et al.  Function approximation using robust wavelet neural networks , 2002, 14th IEEE International Conference on Tools with Artificial Intelligence, 2002. (ICTAI 2002). Proceedings..

[57]  Paul Over,et al.  Evaluation campaigns and TRECVid , 2006, MIR '06.

[58]  Patrick Paroubek,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2010, LREC.

[59]  Chengcui Zhang,et al.  Multiple object retrieval for image databases using multiple instance learning and relevance feedback , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[60]  Jun-Wei Hsieh,et al.  Sparse representation for recognizing object-to-object actions under occlusions , 2013, ICIMCS '13.

[61]  Liang Tang,et al.  Data Mining Meets the Needs of Disaster Information Management , 2013, IEEE Transactions on Human-Machine Systems.

[62]  Yi Yang,et al.  Fast and Accurate Content-based Semantic Search in 100M Internet Videos , 2015, ACM Multimedia.

[63]  Mei-Ling Shyu,et al.  Utilizing Context Information to Enhance Content-Based Image Classification , 2011, Int. J. Multim. Data Eng. Manag..

[64]  Rangasami L. Kashyap,et al.  Augmented Transition Network as a Semantic Model for Video Data , 2001 .

[65]  Min Chen,et al.  Neural network based framework for goal event detection in soccer videos , 2005, Seventh IEEE International Symposium on Multimedia (ISM'05).

[66]  Hal Daumé,et al.  Incorporating Lexical Priors into Topic Models , 2012, EACL.

[67]  Ivan Laptev,et al.  On Space-Time Interest Points , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[68]  Arnaldo de Albuquerque Araújo,et al.  Combining Orientation Tensors for Human Action Recognition , 2013, 2013 XXVI Conference on Graphics, Patterns and Images.

[69]  Luis Fernandez-Luque,et al.  Identifying Measures Used for Assessing Quality of YouTube Videos with Patient Health Information: A Review of Current Literature , 2013, Interactive journal of medical research.

[70]  Min Chen,et al.  Efficient Imbalanced Multimedia Concept Retrieval by Deep Learning on Spark Clusters , 2017, Int. J. Multim. Data Eng. Manag..

[71]  Chao Chen,et al.  Weighted Subspace Filtering and Ranking Algorithms for Video Concept Retrieval , 2011, IEEE MultiMedia.

[72]  Chengcui Zhang,et al.  Innovative Shot Boundary Detection for Video Indexing , 2005 .

[73]  Marcelo Bernardes Vieira,et al.  A tensor motion descriptor based on histograms of gradients and optical flow , 2014, Pattern Recognit. Lett..

[74]  Lei Zhao,et al.  A Crosstab-based Statistical Method for Effective Fault Localization , 2008, 2008 1st International Conference on Software Testing, Verification, and Validation.

[75]  Scott Shenker,et al.  Shark: SQL and rich analytics at scale , 2012, SIGMOD '13.

[76]  Ji Wan,et al.  Deep Learning for Content-Based Image Retrieval: A Comprehensive Study , 2014, ACM Multimedia.

[77]  Rangasami L. Kashyap,et al.  Generalized Affinity-Based Association Rule Mining for Multimedia Database Queries , 2001, Knowledge and Information Systems.

[78]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[79]  Yang Liu,et al.  Enhancing Multimedia Semantic Concept Mining and Retrieval by Incorporating Negative Correlations , 2014, 2014 IEEE International Conference on Semantic Computing.

[80]  Jim X. Chen,et al.  The Evolution of Computing: AlphaGo , 2016, Comput. Sci. Eng..

[81]  Robert P. W. Duin,et al.  The combining classifier: to train or not to train? , 2002, Object recognition supported by user interaction for service robots.

[82]  Wu Qingfeng,et al.  An empirical study on ensemble selection for class-imbalance data sets , 2010, 2010 5th International Conference on Computer Science & Education.

[83]  Rangasami L. Kashyap,et al.  Semantic Models for Multimedia Database Searching and Browsing , 2000, Advances in Database Systems.

[84]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[85]  Kunihiko Fukushima,et al.  Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[86]  Rangasami L. Kashyap,et al.  Indexing and searching structure for multimedia database systems , 1999, Electronic Imaging.

[87]  Robert P. W. Duin,et al.  Experiments with Classifier Combining Rules , 2000, Multiple Classifier Systems.

[88]  Min Chen,et al.  Hierarchical Temporal Association Mining for Video Event Detection in Video Databases , 2007, 2007 IEEE 23rd International Conference on Data Engineering Workshop.

[89]  Fei-Fei Li,et al.  Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[90]  C. Lee Giles,et al.  Learning on the border: active learning in imbalanced data classification , 2007, CIKM '07.

[91]  Yiannis S. Boutalis,et al.  CEDD: Color and Edge Directivity Descriptor: A Compact Descriptor for Image Indexing and Retrieval , 2008, ICVS.

[92]  Rangasami L. Kashyap,et al.  Temporal And Spatial Semantic Models For Multimedia Presentations , 1997 .

[93]  Emmanuel Dellandréa,et al.  Multimodal recognition of visual concepts using histograms of textual concepts and selective weighted late fusion scheme , 2013, Comput. Vis. Image Underst..

[94]  Everton Alvares Cherman,et al.  Incorporating label dependency into the binary relevance framework for multi-label classification , 2012, Expert Syst. Appl..

[95]  郑肇葆,et al.  基于Naive Bayes Classifiers的航空影像纹理分类 , 2006 .

[96]  Mei-Ling Shyu,et al.  Enhancing Rare Class Mining in Multimedia Big Data by Concept Correlation , 2016, 2016 IEEE International Symposium on Multimedia (ISM).

[97]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[98]  Shu-Ching Chen,et al.  Classifier fusion by judgers on spark clusters for multimedia big data classification , 2016 .

[99]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[100]  Lukasz Kaiser,et al.  One Model To Learn Them All , 2017, ArXiv.

[101]  Ting Liu,et al.  Document Modeling with Gated Recurrent Neural Network for Sentiment Classification , 2015, EMNLP.

[102]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[103]  Mei-Ling Shyu,et al.  Mining Anomalies in Medicare Big Data Using Patient Rule Induction Method , 2017, 2017 IEEE Third International Conference on Multimedia Big Data (BigMM).

[104]  Johanna D. Moore,et al.  Twitter Sentiment Analysis: The Good the Bad and the OMG! , 2011, ICWSM.

[105]  Cees G. M. Snoek,et al.  The MediaMill at TRECVID 2013: : Searching concepts, Objects, Instances and events in video , 2013, TRECVID.

[106]  Min Chen,et al.  DETECTION OF SOCCER GOAL SHOTS USING JOINT MULTIMEDIA FEATURES AND CLASSIFICATION RULES , 2003 .

[107]  Min Chen,et al.  A multimodal data mining framework for soccer goal detection based on decision tree logic , 2006, Int. J. Comput. Appl. Technol..

[108]  I-Chen Wu,et al.  Human vs. Computer Go: Review and Prospect [Discussion Forum] , 2016, IEEE Computational Intelligence Magazine.

[109]  Xiuqi Li,et al.  Image Retrieval By Color , Texture , And Spatial Information , 2002 .

[110]  Phil Howlett,et al.  Matching the grade correlation coefficient using a copula with maximum disorder , 2007 .

[111]  Xue-wen Chen,et al.  Combating the Small Sample Class Imbalance Problem Using Feature Selection , 2010, IEEE Transactions on Knowledge and Data Engineering.

[112]  Fang Chen,et al.  Investigating speech features and automatic measurement of cognitive load , 2008, 2008 IEEE 10th Workshop on Multimedia Signal Processing.

[113]  Jun-Wei Hsieh,et al.  PLSA-Based Sparse Representation for Object Classification , 2014, 2014 22nd International Conference on Pattern Recognition.

[114]  Chengcui Zhang,et al.  Adaptive background learning for vehicle detection and spatio-temporal tracking , 2003, Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint.

[115]  Chengcui Zhang,et al.  PixSO: a system for video shot detection , 2003, Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint.

[116]  Dennis Koelma,et al.  The ImageNet Shuffle: Reorganized Pre-training for Video Event Detection , 2016, ICMR.

[117]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[118]  Shu-Ching Chen,et al.  A Classifier Ensemble Framework for Multimedia Big Data Classification , 2016, 2016 IEEE 17th International Conference on Information Reuse and Integration (IRI).

[119]  Min Chen,et al.  Deep Learning for Imbalanced Multimedia Data Classification , 2015, 2015 IEEE International Symposium on Multimedia (ISM).

[120]  Yongzhao Zhan,et al.  Learning Salient Features for Speech Emotion Recognition Using Convolutional Neural Networks , 2014, IEEE Transactions on Multimedia.

[121]  Jun-Wei Hsieh,et al.  PLSA-based sparse representation for vehicle color classification , 2015, 2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[122]  Guillermo Sapiro,et al.  Online Learning for Matrix Factorization and Sparse Coding , 2009, J. Mach. Learn. Res..

[123]  Mei-Ling Shyu,et al.  Weighted Association Rule Mining for Video Semantic Detection , 2010, Int. J. Multim. Data Eng. Manag..

[124]  Gustavo E. A. P. A. Batista,et al.  A study of the behavior of several methods for balancing machine learning training data , 2004, SKDD.

[125]  Mubarak Shah,et al.  Learning semantic visual vocabularies using diffusion distance , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[126]  Sheng Guan,et al.  Domain Knowledge Assisted Data Processing for Florida Public Hurricane Loss Model (Invited Paper) , 2016, 2016 IEEE 17th International Conference on Information Reuse and Integration (IRI).

[127]  Min Chen,et al.  Affinity relation discovery in image database clustering and content-based retrieval , 2004, MULTIMEDIA '04.

[128]  Fredric C. Gey,et al.  The Relationship between Recall and Precision , 1994, J. Am. Soc. Inf. Sci..

[129]  Xiuqi Li,et al.  Web document classification based on fuzzy association , 2002, Proceedings 26th Annual International Computer Software and Applications.

[130]  Rangasami L. Kashyap,et al.  Identifying Overlapped Objects for Video Indexing and Modeling in Multimedia Database Systems , 2001, Int. J. Artif. Intell. Tools.

[131]  Bharti,et al.  An efficient approach for Color Image Retrieval using Haar wavelet , 2009, 2009 Proceeding of International Conference on Methods and Models in Computer Science (ICM2CS).

[132]  Min Chen,et al.  A latent semantic indexing based method for solving multiple instance learning problem in region-based image retrieval , 2005, Seventh IEEE International Symposium on Multimedia (ISM'05).

[133]  Xuegong Zhang,et al.  Prediction of kinase‐specific phosphorylation sites with sequence features by a log‐odds ratio approach , 2007, Proteins.

[134]  Mark R. Segal,et al.  Machine Learning Benchmarks and Random Forest Regression , 2004 .

[135]  Steve Renals,et al.  Convolutional Neural Networks for Distant Speech Recognition , 2014, IEEE Signal Processing Letters.

[136]  Koichi Shinoda,et al.  A Fast and Accurate Video Semantic-Indexing System Using Fast MAP Adaptation and GMM Supervectors , 2012, IEEE Transactions on Multimedia.

[137]  Mei-Ling Shyu,et al.  Effective Feature Space Reduction with Imbalanced Data for Semantic Concept Detection , 2008, 2008 IEEE International Conference on Sensor Networks, Ubiquitous, and Trustworthy Computing (sutc 2008).

[138]  Mei-Ling Shyu,et al.  Concept-concept association information integration and multi-model collaboration for multimedia semantic concept detection , 2014, Inf. Syst. Frontiers.

[139]  Michael I. Jordan Serial Order: A Parallel Distributed Processing Approach , 1997 .

[140]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[141]  Min Chen,et al.  A unified framework for image database clustering and content-based retrieval , 2004, MMDB '04.

[142]  Shu-Ching Chen,et al.  Correlation-Based Video Semantic Concept Detection Using Multiple Correspondence Analysis , 2008, 2008 Tenth IEEE International Symposium on Multimedia.

[143]  Koen E. A. van de Sande,et al.  All vehicles are cars: subclass preferences in container concepts , 2012, ICMR '12.

[144]  P. Westfall,et al.  Understanding Advanced Statistical Methods , 2013 .

[145]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[146]  Choochart Haruechaiyasak,et al.  Category cluster discovery from distributed WWW directories , 2003, Inf. Sci..

[147]  Adam Krzyżak,et al.  Methods of combining multiple classifiers and their applications to handwriting recognition , 1992, IEEE Trans. Syst. Man Cybern..

[148]  Geoffrey E. Hinton,et al.  Learning distributed representations of concepts. , 1989 .

[149]  Samira Pouyanfar,et al.  Semantic Event Detection Using Ensemble Deep Learning , 2016, 2016 IEEE International Symposium on Multimedia (ISM).

[150]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[151]  Shu-Ching Chen,et al.  Video semantic concept detection via associative classification , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[152]  L. A. Goodman,et al.  Measures of Association for Cross Classifications, IV: Simplification of Asymptotic Variances , 1972 .

[153]  Jun-Wei Hsieh,et al.  Vehicle make and model recognition using sparse representation and symmetrical SURFs , 2013, 16th International IEEE Conference on Intelligent Transportation Systems (ITSC 2013).

[154]  Jonathan G. Fiscus,et al.  TRECVID 2016: Evaluating Video Search, Video Event Detection, Localization, and Hyperlinking , 2016, TRECVID.

[155]  Quoc V. Le,et al.  On optimization methods for deep learning , 2011, ICML.

[156]  Yi Deng,et al.  Towards a business continuity information network for rapid disaster recovery , 2008, DG.O.

[157]  Wallace Koehler,et al.  Information science as "Little Science":The implications of a bibliometric analysis of theJournal of the American Society for Information Science , 2001, Scientometrics.

[158]  Mei-Ling Shyu,et al.  Handling nominal features in anomaly intrusion detection problems , 2005, 15th International Workshop on Research Issues in Data Engineering: Stream Data Mining and Applications (RIDE-SDMA'05).

[159]  Si Wu,et al.  Improving support vector machine classifiers by modifying kernel functions , 1999, Neural Networks.

[160]  Yann LeCun,et al.  Deep belief net learning in a long-range vision system for autonomous off-road driving , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[161]  Choochart Haruechaiyasak,et al.  Mining user access behavior on the WWW , 2001, 2001 IEEE International Conference on Systems, Man and Cybernetics. e-Systems and e-Man for Cybernetics in Cyberspace (Cat.No.01CH37236).

[162]  Gang Hua,et al.  Semantic Model Vectors for Complex Video Event Recognition , 2012, IEEE Transactions on Multimedia.

[163]  Juan Carlos Niebles,et al.  Modeling Temporal Structure of Decomposable Motion Segments for Activity Classification , 2010, ECCV.

[164]  Min Chen,et al.  Video Semantic Event/Concept Detection Using a Subspace-Based Multimedia Data Mining Framework , 2008, IEEE Transactions on Multimedia.

[165]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[166]  Shu-Ching Chen,et al.  Enhancing Multimedia Imbalanced Concept Detection Using VIMP in Random Forests , 2016, 2016 IEEE 17th International Conference on Information Reuse and Integration (IRI).

[167]  Shiming Xiang,et al.  Vehicle Detection in Satellite Images by Hybrid Deep Convolutional Neural Networks , 2014, IEEE Geoscience and Remote Sensing Letters.

[168]  Chengcui Zhang,et al.  Multimedia Data Mining for Traffic Video Sequences , 2001, MDM/KDD.

[169]  Georges Quénot,et al.  TRECVID 2015 - An Overview of the Goals, Tasks, Data, Evaluation Mechanisms and Metrics , 2011, TRECVID.

[170]  Chengcui Zhang,et al.  A Dynamic User Concept Pattern Learning Framework for Content-Based Image Retrieval , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[171]  Gang Hua,et al.  Learning Discriminative Reconstructions for Unsupervised Outlier Removal , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[172]  Bingbing Ni,et al.  YouTubeEvent: On large-scale video event classification , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[173]  Xin Huang,et al.  User Concept Pattern Discovery Using Relevance Feedback And Multiple Instance Learning For Content-Based Image Retrieval , 2002, MDM/KDD.

[174]  Hong Heather Yu,et al.  Overview and Future Trends of Multimedia Research for Content Access and Distribution , 2007, Int. J. Semantic Comput..

[175]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[176]  Mei-Ling Shyu,et al.  Sparse Linear Integration of Content and Context Modalities for Semantic Concept Retrieval , 2015, IEEE Transactions on Emerging Topics in Computing.

[177]  Marcelo Bernardes Vieira,et al.  Combining gradient histograms using orientation tensors for human action recognition , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[178]  T. Lawson,et al.  Spark , 2011 .

[179]  Min Chen,et al.  Spatio-Temporal Analysis for Human Action Detection and Recognition in Uncontrolled Environments , 2015, Int. J. Multim. Data Eng. Manag..

[180]  Liang Tang,et al.  Using data mining techniques to address critical information exchange needs in disaster affected public-private networks , 2010, KDD.

[181]  Owen Rambow,et al.  Sentiment Analysis of Twitter Data , 2011 .

[182]  Shu-Ching Chen,et al.  Feature Selection Using Correlation and Reliability Based Scoring Metric for Video Semantic Detection , 2010, 2010 IEEE Fourth International Conference on Semantic Computing.

[183]  Min Chen,et al.  Image database retrieval utilizing affinity relationships , 2003, MMDB '03.

[184]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[185]  Mei-Ling Shyu,et al.  Supporting Semantic Concept Retrieval with Negative Correlations in a Multimedia Big Data Mining System , 2016, Int. J. Semantic Comput..

[186]  Yan Meng,et al.  Human activity recognition in video using a hierarchical probabilistic latent model , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[187]  Mehrbakhsh Nilashi,et al.  Collaborative filtering recommender systems , 2013 .

[188]  Sheng Guan,et al.  A Scalable and Automatic Validation Process for Florida Public Hurricane Loss Model (Invited Paper) , 2016, 2016 IEEE 17th International Conference on Information Reuse and Integration (IRI).

[189]  Qiang Ji,et al.  A Hierarchical Context Model for Event Recognition in Surveillance Video , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[190]  Min Chen,et al.  Utilizing concept correlations for effective imbalanced data classification , 2014, Proceedings of the 2014 IEEE 15th International Conference on Information Reuse and Integration (IEEE IRI 2014).

[191]  Mei-Ling Shyu,et al.  Multimodal Information Integration and Fusion for Histology Image Classification , 2011, Int. J. Multim. Data Eng. Manag..

[192]  K. Pearson VII. Note on regression and inheritance in the case of two parents , 1895, Proceedings of the Royal Society of London.

[193]  Serge J. Belongie,et al.  Behavior recognition via sparse spatio-temporal features , 2005, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance.

[194]  W S McCulloch,et al.  A logical calculus of the ideas immanent in nervous activity , 1990, The Philosophy of Artificial Intelligence.

[195]  Liang Tang,et al.  Applying data mining techniques to address disaster information management challenges on mobile devices , 2011, KDD.

[196]  Jun-Wei Hsieh,et al.  Modeling and recognizing action contexts in persons using sparse representation , 2015, J. Vis. Commun. Image Represent..

[197]  Stuart Harvey Rubin,et al.  A Human-Centered Multiple Instance Learning Framework for Semantic Video Retrieval , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[198]  Diyi Yang,et al.  Hierarchical Attention Networks for Document Classification , 2016, NAACL.