Compact representation of temporal processes in echosounder time series via matrix decomposition

The recent explosion in the availability of echosounder data from diverse ocean platforms has created unprecedented opportunities to observe the marine ecosystems at broad scales. However, the critical lack of methods capable of automatically discovering and summarizing prominent spatio-temporal echogram structures has limited the effective and wider use of these rich datasets. To address this challenge, a data-driven methodology is developed based on matrix decomposition that builds compact representation of long-term echosounder time series using intrinsic features in the data. In a two-stage approach, noisy outliers are first removed from the data by principal component pursuit, then a temporally smooth nonnegative matrix factorization is employed to automatically discover a small number of distinct daily echogram patterns, whose time-varying linear combination (activation) reconstructs the dominant echogram structures. This low-rank representation provides biological information that is more tractable and interpretable than the original data, and is suitable for visualization and systematic analysis with other ocean variables. Unlike existing methods that rely on fixed, handcrafted rules, this unsupervised machine learning approach is well-suited for extracting information from data collected from unfamiliar or rapidly changing ecosystems. This work forms the basis for constructing robust time series analytics for large-scale, acoustics-based biological observation in the ocean.

[1]  Ivan Oseledets,et al.  Tensor-Train Decomposition , 2011, SIAM J. Sci. Comput..

[2]  Arnt-Børre Salberg,et al.  Acoustic classification in multifrequency echosounder data using deep convolutional neural networks , 2020, ICES Journal of Marine Science.

[3]  Dezhang Chu,et al.  A Wave Glider Approach to Fisheries Acoustics: Transforming How We Monitor the Nation's Commercial Fisheries in the 21st Century , 2014 .

[4]  Kenneth L. Smith,et al.  Detection and characterisation of deep-sea benthopelagic animals from an autonomous underwater vehicle with a multibeam echosounder: A proof of concept and description of data-processing methods , 2018 .

[5]  J. Jech,et al.  A multifrequency method to classify and evaluate fisheries acoustics data , 2006 .

[6]  David Pfau,et al.  Simultaneous Denoising, Deconvolution, and Demixing of Calcium Imaging Data , 2016, Neuron.

[7]  Andri Mirzal,et al.  Nonparametric Tikhonov Regularized NMF and Its Application in Cancer Clustering , 2014, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[8]  William T. Peterson,et al.  Zooplankton distribution and cross-shelf transfer of carbon in an area of complex mesoscale circulation in the northern California Current , 2009 .

[9]  Peter I. Miller,et al.  Assessing the potential of autonomous submarine gliders for ecosystem monitoring across multiple trophic levels (plankton to cetaceans) and pollutants in shallow shelf seas , 2014 .

[10]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[11]  Alex De Robertis,et al.  Development and application of an empirical multifrequency method for backscatter classification , 2010 .

[12]  Antoine Liutkus,et al.  Cauchy nonnegative matrix factorization , 2015, 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[13]  Mark A. Moline,et al.  Integration of Scientific Echo Sounders with an Adaptable Autonomous Vehicle to Extend Our Understanding of Animals from the Surface to the Bathypelagic , 2015 .

[14]  Yi Ma,et al.  Robust principal component analysis? , 2009, JACM.

[15]  V. Kshirsagar,et al.  Face recognition using Eigenfaces , 2011, 2011 3rd International Conference on Computer Research and Development.

[16]  Andrew S. Brierley,et al.  A method for identifying Sound Scattering Layers and extracting key characteristics , 2015 .

[17]  John A. Barth,et al.  Zooplankton Distribution and Transport in the California Current off Oregon , 2014 .

[18]  Mattias Höglund,et al.  Non-Negative Matrix Factorization for the Analysis of Complex Gene Expression Data: Identification of Clinically Relevant Tumor Subtypes , 2008, Cancer informatics.

[19]  Kush R. Varshney,et al.  Dynamic matrix factorization with social influence , 2016, 2016 IEEE 26th International Workshop on Machine Learning for Signal Processing (MLSP).

[20]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[21]  Chris H. Q. Ding,et al.  On the equivalence between Non-negative Matrix Factorization and Probabilistic Latent Semantic Indexing , 2008, Comput. Stat. Data Anal..

[22]  Junjie Wu,et al.  Modeling Emerging, Evolving and Fading Topics Using Dynamic Soft Orthogonal NMF with Sparse Representation , 2015, 2015 IEEE International Conference on Data Mining.

[23]  Pierre Petitgas,et al.  Indices for capturing spatial patterns and their evolution in time, with application to European hake (Merluccius merluccius) in the Bay of Biscay , 2007 .

[24]  Valentina Staneva,et al.  Tensor decomposition of multi-frequency echosounder time series , 2019, OCEANS 2019 MTS/IEEE SEATTLE.

[25]  John K. Horne,et al.  Measuring the vertical distributional variability of pelagic fauna in Monterey Bay , 2012 .

[26]  Vikas Sindhwani,et al.  Learning evolving and emerging topics in social media: a dynamic nmf approach with temporal regularization , 2012, WSDM '12.

[27]  Adam T. Greer,et al.  Acoustic detection of zooplankton diel vertical migration behaviors on the northern Gulf of Mexico shelf , 2019, Limnology and Oceanography.

[28]  J K Horne,et al.  Multifrequency species classification of acoustic-trawl survey data using semi-supervised learning with class discovery. , 2012, The Journal of the Acoustical Society of America.

[29]  Nelly Pustelnik,et al.  Solving NMF with smoothness and sparsity constraints using PALM , 2019, ArXiv.

[30]  Boris G. Mirkin,et al.  Choosing the number of clusters , 2011, Wiley Interdiscip. Rev. Data Min. Knowl. Discov..

[31]  Fillia Makedon,et al.  Learning from Incomplete Ratings Using Non-negative Matrix Factorization , 2006, SDM.

[32]  Erkki Oja,et al.  Kullback-Leibler Divergence for Nonnegative Matrix Factorization , 2011, ICANN.

[33]  Zhiguo Jiang,et al.  A method of automatically estimating the regularization parameter for Non-negative Matrix Factorization , 2010, 2010 Sixth International Conference on Natural Computation.

[34]  S. Georgakarakos,et al.  Artificial neural networks as a tool for species identification of fish schools , 1996 .

[35]  G. Hays A review of the adaptive significance and ecosystem consequences of zooplankton diel vertical migrations , 2003, Hydrobiologia.

[36]  Pablo Tamayo,et al.  Metagenes and molecular pattern discovery using matrix factorization , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[37]  Andrzej Cichocki,et al.  Nonnegative Matrix and Tensor Factorization T , 2007 .

[38]  Truong Q. Nguyen,et al.  A method for choosing the regularization parameter in generalized Tikhonov regularized linear inverse problems , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[39]  El-hadi Zahzah,et al.  Handbook of Robust Low-Rank and Sparse Matrix Decomposition: Applications in Image and Video Processing , 2016 .

[40]  Thierry Bouwmans,et al.  Robust PCA via Principal Component Pursuit: A review for a comparative evaluation in video surveillance , 2014, Comput. Vis. Image Underst..

[41]  D. V. Holliday Fundamentals of acoustical oceanography , 1999 .

[42]  P. Paatero,et al.  Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values† , 1994 .

[43]  Paul G. Fernandes,et al.  Classification of Southern Ocean krill and icefish echoes using random forests , 2016 .

[44]  Rolf J. Korneliussen,et al.  Acoustic identification of marine species using a feature library , 2016 .

[45]  Paris Smaragdis,et al.  A State-Space Approach to Dynamic Nonnegative Matrix Factorization , 2015, IEEE Transactions on Signal Processing.

[46]  R. Sokal,et al.  THE COMPARISON OF DENDROGRAMS BY OBJECTIVE METHODS , 1962 .

[47]  Olav Rune Godø,et al.  Towards an acoustic‐based coupled observation and modelling system for monitoring and predicting ecosystem dynamics of the open ocean , 2013 .

[48]  Marc Teboulle,et al.  Proximal alternating linearized minimization for nonconvex and nonsmooth problems , 2013, Mathematical Programming.

[49]  E. John Simmonds,et al.  Fisheries Acoustics: Theory and Practice , 2005 .

[50]  Lucie N. Hutchins,et al.  Position-dependent motif characterization using non-negative matrix factorization , 2008, Bioinform..

[51]  X. Irigoien,et al.  Large scale patterns in vertical distribution and behaviour of mesopelagic scattering layers , 2016, Scientific Reports.

[52]  Kelly J Benoit-Bird,et al.  Ecological Insights from Pelagic Habitats Acquired Using Active Acoustic Techniques. , 2016, Annual review of marine science.

[53]  David E. Cade,et al.  An automatic and quantitative approach to the detection and tracking of acoustic scattering layers , 2014 .

[54]  Patrick Lehodey,et al.  Bridging the gap from ocean models to population dynamics of large marine predators: A model of mid-trophic functional groups , 2010 .

[55]  Christopher D. Wilson,et al.  Can a bottom-moored echosounder array provide a survey-comparable index of abundance? , 2018 .