Obtaining smooth solutions to large, linear, inverse problems

It is not uncommon now for geophysical inverse problems to be parameterized by 10 4 to 10 5 unknowns associated with upwards of 10 6 to 10 7 data con­ straints. The matrix problem defining the linearization of such a system (e.g., ~m = b) is usually solved with a least-squares criterion (m = (~t~ -) ~ t b). The size of the matrix, however, discourages the direct solution of the system and researchers often tum to iterative techniques such as the method of conjugate gradients to obtain an estimate of the least-squares solution. These iterative methods take advantage of the sparse­ ness of ~, which often has as few as 2-3 percent of its elements nonzero, and do not require the calculation (or storage) of the matrix ~t~. Although there are usually many more data constraints than unknowns, these problems are, in general, underdetermined and therefore require some sort of regularization to obtain a solution. When the regularization is simple damping, the conjugate gradients method tends to converge in relatively few iterations. However, when derivative-type regularization is applied (first derivative constraints to obtain the flattest model that fits the data; second deriv­ ative to obtain the smoothest), the convergence of parts ofthe solution may be drastically inhibited. In a series of I-D examples and a synthetic 2-D crosshole tomography example, we demonstrate this problem and also suggest a method of accelerating the convergence through the preconditioning of the conjugate gradient search direc­ tions. We derive a I-D preconditioning operator for the case of first derivative regularization using a WKBJ approximation. We have found that preconditioning can reduce the number of iterations necessary to obtain satisfactory convergence by up to an order of magnitude. The conclusions we present are also relevant to Baye­ sian inversion, where a smoothness constraint is im­ posed through an a priori covariance of the model.

[1]  A. Tarantola Inverse problem theory : methods for data fitting and model parameter estimation , 1987 .

[2]  Robert L. Parker,et al.  UNDERSTANDING INVERSE THEORY x10066 , 1977 .

[3]  R. Wiggins,et al.  The general linear inverse problem - Implication of surface waves and free oscillations for earth structure. , 1972 .

[4]  T. Moser Shortest path calculation of seismic rays , 1991 .

[5]  Malcolm Sambridge,et al.  Non-linear arrival time inversion: constraining velocity anomalies by seeking smooth models in 3-D , 1990 .

[6]  A. Zielhuis S-wave velocity below Europe from delay-time and waveform inversions , 1992 .

[7]  Gene H. Golub,et al.  Matrix computations , 1983 .

[8]  Guust Nolet,et al.  Solving or resolving inadequate and noisy tomographic systems , 1985 .

[9]  G. Backus,et al.  Numerical Applications of a Formalism for Geophysical Inverse Problems , 1967 .

[10]  C. Stork SINGULAR VALUE DECOMPOSITION OF THE VELOCITY-REFLECTOR DEPTH TRADEOFF, PART 2 : HIGH-RESOLUTION ANALYSIS OF A GENERIC MODEL , 1992 .

[11]  R. Parker,et al.  Occam's inversion; a practical algorithm for generating smooth models from electromagnetic sounding data , 1987 .

[12]  G. Backus,et al.  The Resolving Power of Gross Earth Data , 1968 .

[13]  Guust Nolet,et al.  Tomographic imaging of subducted lithosphere below northwest Pacific island arcs , 1991, Nature.

[14]  G. Backus,et al.  Uniqueness in the inversion of inaccurate gross Earth data , 1970, Philosophical Transactions of the Royal Society of London. Series A, Mathematical and Physical Sciences.

[15]  R. Snieder,et al.  Solving large linear inverse problems by projection , 1990 .

[16]  Filip Neele,et al.  The use of P wave amplitude data in a joint inversion with travel times for upper mantle velocity structure , 1993 .

[17]  M. Hestenes,et al.  Methods of conjugate gradients for solving linear systems , 1952 .

[18]  J. Meijerink,et al.  An iterative solution method for linear systems of which the coefficient matrix is a symmetric -matrix , 1977 .

[19]  G. Nolet,et al.  Seismic wave propagation and seismic tomography , 1987 .

[20]  C. Fombrun,et al.  Matrix , 1979, Encyclopedic Dictionary of Archaeology.

[21]  J. Scales Tomographic inversion via the conjugate gradient method , 1987 .

[22]  Wim Spakman,et al.  Imaging algorithms, accuracy and resolution in delay time tomography , 1988 .

[23]  J. VanDecar Upper-mantle structure of the Cascadia subduction zone from non-linear teleseismic travel-time inversion , 1991 .

[24]  S. Orszag,et al.  Advanced Mathematical Methods For Scientists And Engineers , 1979 .

[25]  C. H. Chapman,et al.  Crosshole seismic tomography , 1989 .

[26]  A. Tarantola,et al.  Generalized Nonlinear Inverse Problems Solved Using the Least Squares Criterion (Paper 1R1855) , 1982 .

[27]  H. A. van der Vorst,et al.  Numerical solution of large, sparse linear algebraic systems arising from tomographic problems , 1987 .

[28]  R. Parker Understanding Inverse Theory , 1977 .

[29]  Michael A. Saunders,et al.  LSQR: An Algorithm for Sparse Linear Equations and Sparse Least Squares , 1982, TOMS.