Machine Learning for Analyzing Human Brain Function

A major opportunity for knowledge discovery and data mining over the coming decade is to accelerate scientific discovery by providing new computer tools to analyze experimental data. Scientific fields from astronomy to cell biology to neuroscience now collect experimental data sets that are huge when compared to the data sets available just a decade ago. New data mining tools are needed to interpret these new data sets. This talk presents our own research in one such scientific subfield: studying the operation of the human brain using functional Magnetic Resonance Imaging (fMRI). A typical fMRI experiment captures three-dimensional images of human brain activity, once per second, at a spatial resolution of a few millimeters, providing a 3D movie of brain activity. We present our recent research exploring the question of how best to analyze fMRI data to study human cognitive processes. We will first describe our recent successes training machine learning classifiers to distinguish cognitive subprocesses based on observed fMRI images. For example, we have been able to train classifiers to discriminate whether a person is reading words about tools, or words about buildings, based on their observed fMRI brain activation. We will then introduce an algorithm for learning a new class of probabilistic time series models called Hidden Process Models, and discuss their use for tracking multiple hidden cognitive processes from observed fMRI brain image data.

[1]  Hongjun Lu,et al.  Locating Motifs in Time-Series Data , 2005, PAKDD.

[2]  Licheng Jiao,et al.  Training Support Vector Machines Using Greedy Stagewise Algorithm , 2005, PAKDD.

[3]  Zhi-Hua Zhou,et al.  Feature Selection for High Dimensional Face Image Using Self-organizing Maps , 2005, PAKDD.

[4]  Eamonn J. Keogh,et al.  A Novel Bit Level Time Series Representation with Implication of Similarity Search and Clustering , 2005, PAKDD.

[5]  Kai Ming Ting,et al.  Maximizing Tree Diversity by Building Complete-Random Decision Trees , 2005, PAKDD.

[6]  Sanjay Chawla,et al.  Conditional Random Fields for Transmembrane Helix Prediction , 2005, PAKDD.

[7]  Kate Smith-Miles,et al.  An Efficient Compression Technique for Frequent Itemset Generation in Association Rule Mining , 2005, PAKDD.

[8]  Min Song,et al.  An Automatic Unsupervised Querying Algorithm for Efficient Information Extraction in Biomedical Domain , 2005, PAKDD.

[9]  Senmiao Yuan,et al.  Learning Bayesian Networks Structures from Incomplete Data: An Efficient Approach Based on Extended Evolutionary Programming , 2005, PAKDD.

[10]  Marek Wojciechowski,et al.  On Multiple Query Optimization in Data Mining , 2005, PAKDD.

[11]  Jong-Tae Lim,et al.  Voting Fuzzy k-NN to Predict Protein Subcellular Localization from Normalized Amino Acid Pair Compositions , 2005, PAKDD.

[12]  Lydia Boudjeloud,et al.  Visual Interactive Evolutionary Algorithm for High Dimensional Data Clustering and Outlier Detection , 2005, PAKDD.