Adaptive Constrained Learning in Reproducing Kernel Hilbert Spaces: The Robust Beamforming Case

This paper establishes a new paradigm for convexly constrained adaptive learning in reproducing kernel Hilbert spaces (RKHS). Although the technique is of a general nature, we present it in the context of the beamforming problem. A priori knowledge, like beampattern specifications and constraints concerning robustness against steering vector errors, takes the form of multiple closed convex sets in a high (possibly infinite) dimensional RKHS. Every robustness constraint is shown to be equivalent to a min-max optimization task formed by means of the robust statistics epsiv -insensitive loss function. Such a multiplicity of specifications turns out to obtain a simple expression by using the rich frame of fixed-point sets of certain mappings defined in a Hilbert space. Moreover, the cost function, that the final solution has to optimize, is expressed as an infinite sequence of convex, nondifferentiable loss functions, springing from the sequence of the incoming training data. A novel adaptive beamforming design, of linear complexity with respect to the number of unknown parameters, to such a constrained nonlinear learning problem is derived by employing a very recently developed version of the adaptive projected subgradient method (APSM). The method produces a sequence that, under mild conditions, exhibits properties like the strong convergence to a beamformer that satisfies all of the imposed constraints, and in the meantime asymptotically minimizes the sequence of the loss functions imposed by the training data. The numerical examples demonstrate that the proposed method displays increased resolution in cases where the classical linear beamforming solutions collapse. Moreover, it leads to solutions, which are in agreement with the imposed a priori knowledge, as opposed to unconstrained online kernel regression techniques.

[1]  N. Aronszajn Theory of Reproducing Kernels. , 1950 .

[2]  Thomas M. Cover,et al.  Geometrical and Statistical Properties of Systems of Linear Inequalities with Applications in Pattern Recognition , 1965, IEEE Trans. Electron. Comput..

[3]  Boris Polyak,et al.  The method of projections for finding the common point of convex sets , 1967 .

[4]  J. Nagumo,et al.  A learning method for system identification , 1967, IEEE Transactions on Automatic Control.

[5]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[6]  D. Luenberger Optimization by Vector Space Methods , 1968 .

[7]  D. Meeter Stochastic Approximation and Nonlinear Regression , 1969 .

[8]  G. Wahba,et al.  Some results on Tchebycheffian spline functions , 1971 .

[9]  T. Hinamoto,et al.  Extended theory of learning identification , 1975 .

[10]  I. Ekeland,et al.  Convex analysis and variational problems , 1976 .

[11]  D. Youla,et al.  Image Restoration by the Method of Convex Projections: Part 1ߞTheory , 1982, IEEE Transactions on Medical Imaging.

[12]  Guy Pierra,et al.  Decomposition through formalization in a product space , 1984, Math. Program..

[13]  S. Reich,et al.  Uniform Convexity, Hyperbolic Geometry, and Nonexpansive Mappings , 1984 .

[14]  Kazuhiko Ozeki,et al.  An adaptive filtering algorithm using an orthogonal projection to an affine subspace and its properties , 1984 .

[15]  Henry Cox,et al.  Robust adaptive beamforming , 2005, IEEE Trans. Acoust. Speech Signal Process..

[16]  J. Hiriart-Urruty,et al.  Convex analysis and minimization algorithms , 1993 .

[17]  P. L. Combettes The foundations of set theoretic estimation , 1993 .

[18]  Heinz H. Bauschke,et al.  On Projection Algorithms for Solving Convex Feasibility Problems , 1996, SIAM Rev..

[19]  Heinz H. Bauschke,et al.  Projection algorithms and monotone operators , 1996 .

[20]  Patrick L. Combettes,et al.  Convex set theoretic image recovery by extrapolated iterations of parallel subgradient projections , 1997, IEEE Trans. Image Process..

[21]  Alexander J. Smola,et al.  Learning with kernels , 1998 .

[22]  Yongyi Yang,et al.  Vector Space Projections: A Numerical Approach to Signal and Image Processing, Neural Nets, and Optics , 1998 .

[23]  James A. Bucklew,et al.  Support vector machine techniques for nonlinear equalization , 2000, IEEE Trans. Signal Process..

[24]  Gert Cauwenberghs,et al.  Incremental and Decremental Support Vector Machine Learning , 2000, NIPS.

[25]  Heinz H. Bauschke,et al.  A Weak-to-Strong Convergence Principle for Fejé-Monotone Methods in Hilbert Spaces , 2001, Math. Oper. Res..

[26]  Paulo Sergio Ramirez,et al.  Fundamentals of Adaptive Filtering , 2002 .

[27]  Isao Yamada,et al.  An efficient robust adaptive filtering algorithm based on parallel subgradient projection techniques , 2002, IEEE Trans. Signal Process..

[28]  Harry L. Van Trees,et al.  Optimum Array Processing: Part IV of Detection, Estimation, and Modulation Theory , 2002 .

[29]  Zhi-Quan Luo,et al.  Robust adaptive beamforming using worst-case performance optimization: a solution to the signal mismatch problem , 2003, IEEE Trans. Signal Process..

[30]  Alexander J. Smola,et al.  Online learning with kernels , 2001, IEEE Transactions on Signal Processing.

[31]  Shie Mannor,et al.  The kernel recursive least-squares algorithm , 2004, IEEE Transactions on Signal Processing.

[32]  Petre Stoica,et al.  Robust Adaptive Beamforming: Li/Robust Adaptive Beamforming , 2005 .

[33]  Yih-Fang Huang,et al.  Kernelized set-membership approach to nonlinear adaptive filtering , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[34]  Y. Censor,et al.  The multiple-sets split feasibility problem and its applications for inverse problems , 2005 .

[35]  I. Yamada,et al.  Adaptive Projected Subgradient Method for Asymptotic Minimization of Sequence of Nonnegative Convex Functions , 2005 .

[36]  Sergios Theodoridis,et al.  A geometric approach to Support Vector Machine (SVM) classification , 2006, IEEE Transactions on Neural Networks.

[37]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[38]  Klaus-Robert Müller,et al.  Incremental Support Vector Learning: Analysis, Implementation and Applications , 2006, J. Mach. Learn. Res..

[39]  I. Yamada,et al.  The Adaptive Projected Subgradient Method over the Fixed Point Set of Strongly Attracting Nonexpansive Mappings , 2006 .

[40]  David J. Edwards,et al.  Ultra-wideband : antennas and propagation for communications, radar and imaging , 2006 .

[41]  Isao Yamada,et al.  Robust Wideband Beamforming by the Hybrid Steepest Descent Method , 2007, IEEE Transactions on Signal Processing.

[42]  G. Camps‐Valls,et al.  Kernel Antenna Array Processing , 2007, IEEE Transactions on Antennas and Propagation.

[43]  Weifeng Liu,et al.  Kernel Affine Projection Algorithms , 2008, EURASIP J. Adv. Signal Process..

[44]  Sergios Theodoridis,et al.  Online Kernel-Based Classification Using Adaptive Projection Algorithms , 2008, IEEE Transactions on Signal Processing.

[45]  Sergios Theodoridis,et al.  Sliding Window Generalized Kernel Affine Projection Algorithm Using Projection Mappings , 2008, EURASIP J. Adv. Signal Process..

[46]  Sergios Theodoridis,et al.  Pattern Recognition, Fourth Edition , 2008 .

[47]  I. Yamada,et al.  Signal processing in dual domain by adaptive projected subgradient method , 2009, 2009 16th International Conference on Digital Signal Processing.

[48]  Paul Honeine,et al.  Online Prediction of Time Series Data With Kernels , 2009, IEEE Transactions on Signal Processing.