Hierarchical Dirichlet Process Hidden Markov Model for unsupervised bioacoustic analysis

Hidden Markov Models (HMMs) are one of the most popular and successful models in statistics and machine learning for modeling sequential data. However, one main issue in HMMs is the one of choosing the number of hidden states. The Hierarchical Dirichlet Process (HDP)-HMM is a Bayesian non-parametric alternative for standard HMMs that offers a principled way to tackle this challenging problem by relying on a Hierarchical Dirichlet Process (HDP) prior. We investigate the HDP-HMM in a challenging problem of unsupervised learning from bioacoustic data by using Markov-Chain Monte Carlo (MCMC) sampling techniques, namely the Gibbs sampler. We consider a real problem of fully unsupervised humpback whale song decomposition. It consists in simultaneously finding the structure of hidden whale song units, and automatically inferring the unknown number of the hidden units from the Mel Frequency Cepstral Coefficients (MFCC) of bioacoustic signals. The experimental results show the very good performance of the proposed Bayesian non-parametric approach and open new insights for unsupervised analysis of such bioacoustic signals.

[1]  Carl E. Rasmussen,et al.  Factorial Hidden Markov Models , 1997 .

[2]  J. Pitman Exchangeable and partially exchangeable random partitions , 1995 .

[3]  Ellen C. Garland,et al.  Dynamic Horizontal Cultural Transmission of Humpback Whale Song at the Ocean Basin Scale , 2011, Current Biology.

[4]  Emily B. Fox,et al.  Bayesian nonparametric learning of complex dynamical phenomena , 2009 .

[5]  Gérard Govaert,et al.  Assessing a Mixture Model for Clustering with the Integrated Completed Likelihood , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  John K Kruschke,et al.  Bayesian data analysis. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[7]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[8]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[9]  T. Ferguson A Bayesian Analysis of Some Nonparametric Problems , 1973 .

[10]  Hervé Glotin,et al.  Subunit definition and analysis for humpback whale call classification , 2010 .

[11]  C. Robert The Bayesian choice : a decision-theoretic motivation , 1996 .

[12]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[13]  C. Antoniak Mixtures of Dirichlet Processes with Applications to Bayesian Nonparametric Problems , 1974 .

[14]  J. Sethuraman A CONSTRUCTIVE DEFINITION OF DIRICHLET PRIORS , 1991 .

[15]  Jerry Nedelman,et al.  Book review: “Bayesian Data Analysis,” Second Edition by A. Gelman, J.B. Carlin, H.S. Stern, and D.B. Rubin Chapman & Hall/CRC, 2004 , 2005, Comput. Stat..

[16]  Michael J. Black,et al.  A nonparametric Bayesian alternative to spike sorting , 2008, Journal of Neuroscience Methods.

[17]  Michael I. Jordan,et al.  Hierarchical Dirichlet Processes , 2006 .

[18]  Michael I. Jordan,et al.  Bayesian Nonparametrics: Hierarchical Bayesian nonparametric models with applications , 2010 .

[19]  Yee Whye Teh,et al.  Beam sampling for the infinite hidden Markov model , 2008, ICML '08.

[20]  H. Akaike A new look at the statistical model identification , 1974 .

[21]  Michael I. Jordan,et al.  An HDP-HMM for systems with state persistence , 2008, ICML '08.

[22]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[23]  Samuel J. Gershman,et al.  A Tutorial on Bayesian Nonparametric Models , 2011, 1106.2697.

[24]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[25]  Thomas L. Griffiths,et al.  A Non-Parametric Bayesian Method for Inferring Hidden Causes , 2006, UAI.

[26]  Michael I. Jordan,et al.  Hierarchical Bayesian Nonparametric Models with Applications , 2008 .