A Study of On-Off Characteristics of Conversational Speech

The on-off statistics of conversational speech have been investigated using a large database of 50 min of telephone speech. Based on the measurement results of on-off patterns, probability density functions of silence and talkspurt durations are modeled approximately by two weighted geometric functions. Then, for any value of hangover or fill-in, speech parameters such as speech activity, average silence, and talkspurt durations are calculated using the fitted probability density function of silence durations and compared to those measured. Directly measured values of speech parameters and those calculated agree closely for a practical range of hangover and fill-in time. For a large hangover time greater than 200 ms, silences and talkspurts can be fit by an exponential distribution and a constant-plus-exponential distribution, respectively. On the other hand, for a large fill-in time greater than 200 ms, silences and talkspurts can be modeled by a constant-plus-exponential distribution and an exponential distribution, respectively. With both large hangover and fill-in values, the talkspurt model agrees closely with the measured data, but the silence model does not agree as closely as the talkspurt model.

[1]  Chong Un,et al.  Voiced/Unvoiced/Silence discrimination of speech by delta modulation , 1980 .

[2]  F. Vagliani,et al.  Digital Dynamic Speech Detectors , 1978, IEEE Trans. Commun..

[3]  E. Harrington,et al.  Voice/Data Integration Using Circuit Switched Networks , 1980, IEEE Trans. Commun..

[4]  E. Fariello,et al.  A Novel Digital Speech Detector for Improving Effective Satellite Capacity , 1972, IEEE Trans. Commun..

[5]  Y. Yatsuzuka Highly Sensitive Speech Detector and High-Speed Voiceband Data Discriminator in DSI-ADPCM Systems , 1982, IEEE Trans. Commun..

[6]  Paul T. Brady,et al.  A model for generating on-off speech patterns in two-way conversation , 1969 .

[7]  J. Gruber,et al.  Delay Related Issues in Integrated Voice and Data Networks , 1981, IEEE Trans. Commun..

[8]  G. Monti,et al.  Speech Interpolation in Digital Transmission Systems , 1974, IEEE Trans. Commun..

[9]  M. Fischer,et al.  A Model for Evaluating the Performance of an Integrated Circuit- and Packet-Switched Multiplex Structure , 1976, IEEE Trans. Commun..

[10]  Paul T. Brady,et al.  A technique for investigating on-off patterns of speech , 1965 .

[11]  Jacek Jankowski A new digital voice-activated switch , 1976 .

[12]  R. Maruta,et al.  Design and Performance of a DSI Terminal for Domestic Applications , 1981, IEEE Trans. Commun..

[13]  M. G. Schachtman,et al.  Tasi quality — Effect of speech detectors and interpolation , 1962 .

[14]  J. Gruber,et al.  A Comparison of Measured and Calculated Speech Temporal Parameters Relevant to Speech Activity Detection , 1982, IEEE Trans. Commun..

[15]  T. Bially,et al.  Voice Communication in Integrated Digital Voice and Data Networks , 1980, IEEE Trans. Commun..