Wirtinger Calculus Based Gradient Descent and Levenberg-Marquardt Learning Algorithms in Complex-Valued Neural Networks

Complex-valued neural networks (CVNNs) bring in nonholomorphic functions in two ways: (i) through their loss functions and (ii) the widely used activation functions. The derivatives of such functions are defined in Wirtinger calculus. In this paper, we derive two popular algorithms—the gradient descent and the Levenberg-Marquardt (LM) algorithm—for parameter optimization in the feedforward CVNNs using the Wirtinger calculus, which is simpler than the conventional derivation that considers the problem in real domain. While deriving the LM algorithm, we solve and use the result of a least squares problem in the complex domain,$\|\mathbf{b-(Az+Bz^*)}\|_{\underset{\mathbf{z}}{\min}}$, which is more general than the $\|\mathbf{b-Az}\|_{\underset{\mathbf{z}}{\min}}$. Computer simulation results exhibit that as with the real-valued case, the complex-LM algorithm provides much faster learning with higher accuracy than the complex gradient descent algorithm. $|\mathbf{b-(Az+Bz^*)}\|_{\underset{\mathbf{z}}{\min}}$, which is more general than the $\|\mathbf{b-Az}\|_{\underset{\mathbf{z}}{\min}}$. Computer simulation results exhibit that as with the real-valued case, the complex-LM algorithm provides much faster learning with higher accuracy than the complex gradient descent algorithm. $|\mathbf{b-(Az+Bz^*)}\|_{\underset{\mathbf{z}}{\min}}$, which is more general than the $\|\mathbf{b-Az}\|_{\underset{\mathbf{z}}{\min}}$. Computer simulation results exhibit that as with the real-valued case, the complex-LM algorithm provides much faster learning with higher accuracy than the complex gradient descent algorithm. $|\mathbf{b-(Az+Bz^*)}\|_{\underset{\mathbf{z}}{\min}}$, which is more general than the $\|\mathbf{b-Az}\|_{\underset{\mathbf{z}}{\min}}$. Computer simulation results exhibit that as with the real-valued case, the complex-LM algorithm provides much faster learning with higher accuracy than the complex gradient descent algorithm.

[1]  D. Marquardt An Algorithm for Least-Squares Estimation of Nonlinear Parameters , 1963 .

[2]  Harry L. Van Trees,et al.  Optimum Array Processing , 2002 .

[3]  Akira Hirose,et al.  Complex-Valued Neural Networks , 2006, Studies in Computational Intelligence.

[4]  Aggelos K. Katsaggelos,et al.  Joint Video Summarization and Transmission Adaptation for Energy-Efficient Wireless Video Streaming , 2008, EURASIP J. Adv. Signal Process..

[5]  V D Calhoun,et al.  Independent component analysis of fMRI data in the complex domain , 2002, Magnetic resonance in medicine.

[6]  Sergios Theodoridis,et al.  Ieee Transactions on Signal Processing Extension of Wirtinger's Calculus to Reproducing Kernel Hilbert Spaces and the Complex Kernel Lms , 2022 .

[7]  Hualiang Li,et al.  Complex-Valued Adaptive Signal Processing Using Nonlinear Functions , 2008, EURASIP J. Adv. Signal Process..

[8]  W. Wirtinger Zur formalen Theorie der Funktionen von mehr komplexen Veränderlichen , 1927 .

[9]  R. Remmert,et al.  Theory of Complex Functions , 1990 .

[10]  Hai Quang Dam,et al.  Deleted DOI: Postfiltering Using Multichannel Spectral Estimation in Multispeaker Environments , 2008 .

[11]  Mohammad Bagher Menhaj,et al.  Training feedforward networks with the Marquardt algorithm , 1994, IEEE Trans. Neural Networks.

[12]  Gordon L. Stuber,et al.  Principles of Mobile Communication , 1996 .

[13]  Hualiang Li,et al.  Algorithms for Complex ML ICA and Their Stability Analysis Using Wirtinger Calculus , 2010, IEEE Transactions on Signal Processing.

[14]  A. Bos Complex gradient and Hessian , 1994 .

[15]  Carl W. Helstrom,et al.  Elements of signal detection and estimation , 1994 .

[16]  Gordon L. Stuber,et al.  Principles of mobile communication (2nd ed.) , 2001 .

[17]  Akira Hirose,et al.  Complex-Valued Neural Networks (Studies in Computational Intelligence) , 2006 .

[18]  Ken Kreutz-Delgado,et al.  The Complex Gradient Operator and the CR-Calculus ECE275A - Lecture Supplement - Fall 2005 , 2009, 0906.4835.

[19]  Tülay Adali,et al.  Approximation by Fully Complex Multilayer Perceptrons , 2003, Neural Computation.

[20]  Tohru Nitta,et al.  An Extension of the Back-Propagation Algorithm to Complex Numbers , 1997, Neural Networks.

[21]  B. A. D. H. Brandwood A complex gradient operator and its applica-tion in adaptive array theory , 1983 .