Paper information - New methods for anechoic demixing with application to shift invariant feature extraction

New methods for anechoic demixing with application to shift invariant feature extraction

Blind source separation problems arise in many applications where signals can be modeled as superpositions of multiple sources. Many popular applications of blind source separation are based on linear instantaneous mixture models. If specific invariance properties of the sources are known, e.g. translation or rotation invariance, the simple linear model can be extended to include the corresponding transformations. When the sources are invariant under translations (i.e. spatial displacements or time shifts), the resulting model is called the anechoic mixing model. The main focus of this thesis is the development of a new mathematical framework for solving the anechoic mixing problem and the subsequent derivation of concrete algorithms. This framework integrates approaches from many distinct fields of signal processing, such as stochastic time-frequency analysis, convex optimization, projection onto convex sets methods, delay estimation and, naturally, blind source separation. The developed method is tested on a variety of applications, including music recordings, natural two-dimensional images, two-dimensional shapes and optic flow. The main application, however, is the analysis and synthesis of human motion trajectories, motivated by the idea in motor control that complex motor behavior can be explained by a superposition of simple basis components, or spatio-temporal primitives. The new anechoic demixing algorithm makes it possible to approximate high-dimensional movement trajectories accurately using a small number of learned primitives or source signals. It is demonstrated that the new method is significantly more accurate than other common techniques. This allows the modeling of subtle style changes, such as the bodily expression of emotion, and provides sufficient synthesis quality for computer animation with only a few mixture components.
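
As a minimal illustration of the mixing model the abstract describes (the forward model only, not the demixing algorithm developed in the thesis), an anechoic mixture is commonly written as x_i(t) = sum_j a_ij * s_j(t - tau_ij): each observed signal is a weighted sum of sources, each with its own delay. The sketch below assumes integer delays applied circularly; the function name `anechoic_mix` and all example values are hypothetical and chosen only for illustration.

```python
import numpy as np


def anechoic_mix(sources, weights, delays):
    """Forward anechoic mixing with integer, circular delays (an assumption).

    sources: array of shape (n_sources, n_samples)
    weights: array of shape (n_signals, n_sources), the a_ij
    delays:  integer array of shape (n_signals, n_sources), the tau_ij
    """
    n_signals, n_sources = weights.shape
    mixed = np.zeros((n_signals, sources.shape[1]))
    for i in range(n_signals):
        for j in range(n_sources):
            # np.roll shifts circularly; a positive delay moves the source later in time.
            mixed[i] += weights[i, j] * np.roll(sources[j], delays[i, j])
    return mixed


# Hypothetical example: two sources mixed into three observed signals.
t = np.arange(200)
sources = np.vstack([
    np.sin(2 * np.pi * t / 50),
    np.sign(np.sin(2 * np.pi * t / 40)),
])
weights = np.array([[1.0, 0.5], [0.3, 1.0], [0.8, 0.2]])
delays = np.array([[0, 5], [10, 0], [3, 7]])
x = anechoic_mix(sources, weights, delays)
```

With all delays set to zero, this reduces to the linear instantaneous model `weights @ sources`, which is the sense in which the anechoic model extends it.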

Lars Omlor
