Four well-known procedures for analog speech privacy have been compared in terms of residual intelligibility, bandwidth expansion, and encoding delay. Intelligibility scores have been determined from a perceptual experiment where about 70 untrained listeners were given the task of recognizing each of 200 spoken digits that occurred in a balanced set of 50 encrypted four-digit utterances, and by averaging resulting probabilities of correct digit recognition. Bandwidth expansion has been expressed in terms of a new segmental measure that is more sensitive to short-time bandwidth manipulations than a conventional, long-time-averaged power spectrum measurement. Encoding delay is a straightforward function of analog scrambler parameters. The scrambling procedures that have been compared are sample permutation ( S ), block permutation ( B ), frequency inversion ( F ), and a combination of methods B and F , denoted by [ BF ]. Sample permutations involved a contiguous set of L S (2 to 128) 8 kHz samples, while block permutations operated on a contiguous set of N B (4 to 128) speech segments each of which was L B (8 to 256) samples long. Frequency inversion is obtained by simply inverting the sign of every other Nyquist (8 kHz) sample. The parameters, L_{s},N_{B} , and L B , determine residual intelligibility as well as transmission properties such as encoding delay and bandwidth. The comparisons in our study provide a quantitative justification for the popular approach [ BF ]. For example, with N_{B} = 8 and L_{B} =128 , although the encoding delay is as much as 128 ms, the bandwidth expansion is only about 100 Hz (using the new segmental measure), and the digit intelligibility I is 20 percent. Note that in the specific problem of recognizing ten digits, purely random (input-independent) listener responses correspond to I = 10 percent.
[1]
Aaron D. Wyner.
An analog scrambling scheme which does not expand bandwidth, Part I: Discrete time
,
1979,
IEEE Trans. Inf. Theory.
[2]
AARON D. WYNER.
An analog scrambling scheme which does not expand bandwidth, Part II: Continuous time
,
1979,
IEEE Trans. Inf. Theory.
[3]
S. Kak,et al.
On speech encryption using waveform scrambling
,
1977,
Bell Labs technical journal.
[4]
B Blesser,et al.
Speech perception under conditions of spectral transformation. I. Phonetic characteristics.
,
1972,
Journal of speech and hearing research.
[5]
M.E. Hellman,et al.
Privacy and authentication: An introduction to cryptography
,
1979,
Proceedings of the IEEE.
[6]
R. Crochiere,et al.
Speech Coding
,
1979,
IEEE Transactions on Communications.
[7]
G. A. Miller,et al.
The intelligibility of speech as a function of the context of the test materials.
,
1951,
Journal of experimental psychology.