Signal Compression: Technology Targets and Research Directions

A description of technology targets in signal compression and a nonexhaustive account of research directions that may lead toward these targets are presented. Opportunities for integrating source coding and channel coding technologies are also pointed out. Such integration, which has hitherto been an informal exercise, will become increasingly essential as communication capabilities are stretched with capacity-limited channels such as wireless media. In parallel, as greater sophistication is sought in the integration of speech and data with broadband signals such as CD-audio and high-resolution video, there will be increased interaction of signal compression technology with the field of communication networking. >

[1]  Günther Theile,et al.  Low-Bit Rate Coding of High Quality Audio Signals , 1987 .

[2]  Reinaldo A. Valenzuela,et al.  A new voice-packet reconstruction technique , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[3]  Jae S. Lim,et al.  Multiband excitation vocoder , 1988, IEEE Transactions on Acoustics, Speech, and Signal Processing.

[4]  Sarah A. Rajala,et al.  Segmentation based image coding using fractals and the human visual system , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[5]  P. Wintz Transform picture coding , 1972 .

[6]  Schuyler Quackenbush A 7 kHz bandwidth, 32 kbps speech coder for ISDN , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[7]  Allen Gersho,et al.  Variable block-size image coding , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  W. Voiers,et al.  Diagnostic acceptability measure for speech communication systems , 1977 .

[9]  Jr. Thomas G. Stockham,et al.  Image processing in the context of a visual model , 1972 .

[10]  Yair Shoham Constrained-stochastic excitation coding of speech at 4.8 kb/s , 1990, ICSLP.

[11]  Y. Ninomiya,et al.  HDTV broadcasting systems , 1991, IEEE Communications Magazine.

[12]  James D. Johnston,et al.  A filter family designed for use in quadrature mirror filter banks , 1980, ICASSP.

[13]  Allen Gersho,et al.  Asymptotically optimal block quantization , 1979, IEEE Trans. Inf. Theory.

[14]  Arild Fuldseth,et al.  A real-time implementable 7 khz speech coder at 16 kbit/s , 1991, EUROSPEECH.

[15]  Limin Wang,et al.  Progressive image transmission using vector quantization on images in pyramid form , 1989, IEEE Trans. Commun..

[16]  D. Neuhoff,et al.  An Asymptotic Analysis Of Two-stage Vector Quantization , 1991, Proceedings. 1991 IEEE International Symposium on Information Theory.

[17]  Schuyler Quackenbush,et al.  Hardware implementation of a color image decoder for remote database access , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[18]  James D. Johnston,et al.  Transform coding of audio signals using perceptual noise criteria , 1988, IEEE J. Sel. Areas Commun..

[19]  David L. Neuhoff,et al.  Theory of lattice-based fine-coarse vector quantization , 1991, IEEE Trans. Inf. Theory.

[20]  P. Pirsch,et al.  Advances in picture coding , 1985, Proceedings of the IEEE.

[21]  David J. Goodman Embedded DPCM for Variable Bit Rate Transmission , 1980, IEEE Trans. Commun..

[22]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[23]  Bhaskar Ramamurthi,et al.  Classified Vector Quantization of Images , 1986, IEEE Trans. Commun..

[24]  N. Ahmed,et al.  Discrete Cosine Transform , 1996 .

[25]  Peter H. Westerink,et al.  A High Quality Digital HDTV Codec , 1991 .

[26]  Kiyoharu Aizawa,et al.  Real-time facial action image synthesis system driven by speech and text , 1990, Other Conferences.

[27]  N. Jayant,et al.  Digital Coding of Waveforms: Principles and Applications to Speech and Video , 1990 .

[28]  Allen Gersho,et al.  On the structure of vector quantizers , 1982, IEEE Trans. Inf. Theory.

[29]  Nariman Farvardin,et al.  A study of vector quantization for noisy channels , 1990, IEEE Trans. Inf. Theory.

[30]  D. Huffman A Method for the Construction of Minimum-Redundancy Codes , 1952 .

[31]  Joel Max,et al.  Quantizing for minimum distortion , 1960, IRE Trans. Inf. Theory.

[32]  Lawrence Wai-Choong Wong,et al.  Waveform substitution techniques for recovering missing speech segments in packet voice communications , 1986, IEEE Trans. Acoust. Speech Signal Process..

[33]  P. Schultheiss,et al.  Block Quantization of Correlated Gaussian Random Variables , 1963 .

[34]  Kuldip K. Paliwal,et al.  Efficient vector quantization of LPC parameters at 24 bits/frame , 1990, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[35]  Robert Forchheimer,et al.  Image coding-from waveforms in animation , 1989, IEEE Trans. Acoust. Speech Signal Process..

[36]  N. Kitawaki,et al.  Speech coding technology for ATM networks , 1990, IEEE Communications Magazine.

[37]  Nuggehally Sampath Jayant,et al.  Waveform quantization and coding , 1976 .

[38]  Thomas R. Fischer,et al.  A pyramid vector quantizer , 1986, IEEE Trans. Inf. Theory.

[39]  O. Rioul,et al.  Wavelets and signal processing , 1991, IEEE Signal Processing Magazine.

[40]  Amy R. Reibman,et al.  DCT-based embedded coding for packet video , 1991, Signal Process. Image Commun..

[41]  R. Gray,et al.  Product code vector quantizers for waveform and voice coding , 1984 .

[42]  M. Kunt,et al.  Second-generation image-coding techniques , 1985, Proceedings of the IEEE.

[43]  Nuggehally Sampath Jayant,et al.  Sparse codebooks for the quantization of nondominant sub-bands in image coding , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[44]  Didier Le Gall,et al.  MPEG: a video compression standard for multimedia applications , 1991, CACM.

[45]  J. B. O'Neal,et al.  Predictive quantizing systems (differential pulse code modulation) for the transmission of television signals , 1966 .

[46]  R. Crochiere,et al.  Speech Coding , 1979, IEEE Transactions on Communications.

[47]  C. Cutler,et al.  Delayed Encoding: Stabilizer for Adaptive Coders , 1971 .

[48]  Peter Kroon,et al.  A High-Quality Multirate Real-Time CELP Coder , 1992, IEEE J. Sel. Areas Commun..

[49]  Philip A. Chou,et al.  Entropy-constrained vector quantization , 1989, IEEE Trans. Acoust. Speech Signal Process..

[50]  David O. Beaumont,et al.  Two-layer video coding for ATM networks , 1991, Signal Process. Image Commun..

[51]  Jens-Rainer Ohm,et al.  Predictive tree encoding of still images with vector quantization , 1990 .

[52]  David L. Neuhoff,et al.  On the performance of tree-structured vector quantization , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[53]  Yair Shoham,et al.  Coding of wideband speech , 1991, Speech Commun..

[54]  D. Esteban,et al.  Application of quadrature mirror filters to split band voice coding schemes , 1977 .

[55]  H. G. Musmann The ISO audio coding standard , 1990, [Proceedings] GLOBECOM '90: IEEE Global Telecommunications Conference and Exhibition.

[56]  Gunnar Karlsson,et al.  Packet video and its integration into the network architecture , 1989, IEEE J. Sel. Areas Commun..

[57]  Thomas F. Quatieri,et al.  Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[58]  J. Flanagan Speech Analysis, Synthesis and Perception , 1971 .

[59]  M. Vetterli Multi-dimensional sub-band coding: Some theory and algorithms , 1984 .

[60]  Ian H. Witten,et al.  Arithmetic coding for data compression , 1987, CACM.

[61]  Wayne E. Stark,et al.  Fine-coarse vector quantization , 1991, IEEE Trans. Signal Process..

[62]  Allen Gersho,et al.  Optimal Block-Adaptive Image Coding with Constrained Bit-Rate , 1990, 1990 Conference Record Twenty-Fourth Asilomar Conference on Signals, Systems and Computers, 1990..

[63]  Bishnu S. Atal,et al.  ON IMPROVING THE PERFORMANCE OF PITCH PREDICTORS IN SPEECH CODING SYSTEMS , 1991 .

[64]  N. S. Jayant High-quality coding of telephone speech and wideband audio , 1990 .

[65]  R. Gallager Information Theory and Reliable Communication , 1968 .

[66]  Claude E. Shannon,et al.  A Mathematical Theory of Communications , 1948 .

[67]  A.D. Wyner,et al.  Fundamental limits in information theory , 1981, Proceedings of the IEEE.

[68]  Hideyoshi Tominaga,et al.  A video coding method considering cell losses in ATM-based networks , 1991, Signal Process. Image Commun..

[69]  D. Legall,et al.  MPEG : A video compression standard for multimedia applications , 1991 .

[70]  Allen Gersho,et al.  Real-time vector APC speech coding at 4800 bps with adaptive postfiltering , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[71]  Ming Lei Liou,et al.  Overview of the p×64 kbit/s video coding standard , 1991, CACM.

[72]  D.C. Cox,et al.  Portable digital radio communications-an approach to tetherless access , 1989, IEEE Communications Magazine.

[73]  P. Vaidyanathan Quadrature mirror filter banks, M-band extensions and perfect-reconstruction techniques , 1987, IEEE ASSP Magazine.

[74]  V. Cuperman,et al.  Vector quantization: A pattern-matching technique for speech coding , 1983, IEEE Communications Magazine.

[75]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[76]  A.N. Netravali,et al.  Picture coding: A review , 1980, Proceedings of the IEEE.

[77]  R. Steele The cellular environment of lightweight handheld portables , 1989, IEEE Communications Magazine.

[78]  N. Kitawaki,et al.  Quality assessment of speech coding and speech synthesis systems , 1988, IEEE Communications Magazine.

[79]  N. Jayant Adaptive quantization with a one-word memory , 1973 .

[80]  Peter H. Westerink,et al.  Subband coding of images using vector quantization , 1988, IEEE Trans. Commun..

[81]  J. W. Modestino,et al.  Combined Source-Channel Coding of Images , 1978, IEEE Trans. Commun..

[82]  Jerry D. Gibson,et al.  Sequentially Adaptive Backward Prediction in ADPCM Speech Coders , 1978, IEEE Trans. Commun..

[83]  R. J. Safranek,et al.  A perceptually tuned sub-band image coder with image dependent quantization and post-quantization data compression , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[84]  L. Davisson Rate-distortion theory and application , 1972 .

[85]  T. Murakami,et al.  Vector quantization of color images , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[86]  Allen Gersho,et al.  Phonetically-based vector excitation coding of speech at 3.6 kbps , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[87]  R. Wilson,et al.  Anisotropic Nonstationary Image Estimation and Its Applications: Part II - Predictive Image Coding , 1983, IEEE Transactions on Communications.

[88]  P. Mermelstein G.722: a new CCITT coding standard for digital transmission of wideband audio signals , 1988, IEEE Communications Magazine.

[89]  Bernard M. Smith Instantaneous companding of quantized signals , 1957 .

[90]  Nuggehally Sampath Jayant,et al.  Effects of Packet Losses in Waveform Coded Speech and Improvements Due to an Odd-Even Sample-Interpolation Procedure , 1981, IEEE Trans. Commun..

[91]  Biing-Hwang Juang,et al.  Multiple stage vector quantization for speech coding , 1982, ICASSP.

[92]  B. Atal,et al.  Predictive coding of speech signals and subjective error criteria , 1979 .

[93]  Ira Alan Gerson,et al.  Vector Sum Excited Linear Prediction (VSELP) , 1991 .

[94]  James L. Flanagan,et al.  Autodirective Microphone Systems , 1991 .

[95]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[96]  T. Kim New finite state vector quantizers for images , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[97]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[98]  Lawrence G. Roberts,et al.  Picture coding using pseudo-random noise , 1962, IRE Trans. Inf. Theory.

[99]  James L. Flanagan,et al.  Digital coding of speech in sub-bands , 1976, The Bell System Technical Journal.

[100]  P. Noll,et al.  Adaptive transform coding of speech signals , 1977 .

[101]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[102]  J. Makhoul,et al.  Vector quantization in speech coding , 1985, Proceedings of the IEEE.

[103]  Nobuhiko Kitawaki,et al.  Speech-quality assessment methods for speech-coding systems , 1984, IEEE Communications Magazine.

[104]  Gregory K. Wallace,et al.  The JPEG still picture compression standard , 1991, CACM.

[105]  Kiyoharu Aizawa,et al.  Model-based analysis synthesis image coding (MBASIC) system for a person's face , 1989, Signal Process. Image Commun..

[106]  W. Daumer Subjective Evaluation of Several Efficient Speech Coders , 1982, IEEE Trans. Commun..

[107]  Richard V. Cox,et al.  The design of uniformly and nonuniformly spaced pseudoquadrature mirror filters , 1986, IEEE Trans. Acoust. Speech Signal Process..

[108]  Rémy Prost,et al.  Block-adaptive subband coding of images , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[109]  Man Mohan Sondhi,et al.  Enhancement of ADPCM speech coding with backward-adaptive algorithms for postfiltering and noise feedback , 1988, IEEE J. Sel. Areas Commun..

[110]  William F. Schreiber Psychophysics and the Improvement of Television Image Quality , 1984 .

[111]  Arnaud E. Jacquin,et al.  A novel fractal block-coding technique for digital images , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[112]  Carl-Erik W. Sundberg,et al.  Subband speech coding and matched convolutional channel coding for mobile radio channels , 1991, IEEE Trans. Signal Process..

[113]  W. Bastiaan Kleijn,et al.  Methods for waveform interpolation in speech coding , 1991, Digit. Signal Process..

[114]  Hsueh-Ming Hang,et al.  Predictive Vector Quantization of Images , 1985, IEEE Trans. Commun..

[115]  David L. Neuhoff,et al.  Perceptual coding of images for halftone display , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[116]  John Princen,et al.  Subband/Transform coding using filter bank designs based on time domain aliasing cancellation , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[117]  Karlheinz Brandenburg OCF--A new coding algorithm for high quality sound signals , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[118]  Jr. G. Forney,et al.  The viterbi algorithm , 1973 .

[119]  John W. Woods,et al.  Subband coding of images , 1986, IEEE Trans. Acoust. Speech Signal Process..

[120]  A. Nejat Ince,et al.  Digital Speech Processing , 1992 .

[121]  Robert M. Gray,et al.  Finite-state vector quantization for waveform coding , 1985, IEEE Trans. Inf. Theory.

[122]  J. C. Hardwick,et al.  The application of the IMBE speech coder to mobile communications , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[123]  David J. Goodman,et al.  Subjective Quality of the Same Speech Transmission Conditions in Seven Different Countries , 1982, IEEE Trans. Commun..

[124]  Kiyoharu Aizawa,et al.  Adaptive discrete cosine transform image coding using gain/Shape vector quantizers , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[125]  Allen Gersho,et al.  Constrained-storage quantization of multiple vector sources by codebook sharing , 1991, IEEE Trans. Commun..

[126]  Ed F. Deprettere,et al.  Regular-pulse excitation-A novel approach to effective and efficient multipulse coding of speech , 1986, IEEE Trans. Acoust. Speech Signal Process..

[127]  Nasser M. Nasrabadi,et al.  Image coding using vector quantization: a review , 1988, IEEE Trans. Commun..

[128]  Thomas J. Goblick,et al.  Analog source digitization: A comparison of theory and practice (Corresp.) , 1967, IEEE Trans. Inf. Theory.

[129]  D.L. Neuhoff,et al.  Model-based Halftoning , 1991, Proceedings of the Seventh Workshop on Multidimensional Signal Processing.

[130]  R. Steele,et al.  Delta Modulation Systems , 1975 .

[131]  R. Gray,et al.  Vector quantization , 1984, IEEE ASSP Magazine.

[132]  I. Daubechies Orthonormal bases of compactly supported wavelets , 1988 .

[133]  Willem Verbiest,et al.  A variable bit rate video codec for asynchronous transfer mode networks , 1989, IEEE J. Sel. Areas Commun..

[134]  B. Atal High-quality speech at low bit rates: Multi-pulse and stochastically excited linear predictive coders , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[135]  Thomas E. Tremain,et al.  An elevation of 4800 bps voice coders , 1989, International Conference on Acoustics, Speech, and Signal Processing,.