EU-US WORKING GROUP ON SPOKEN-WORD AUDIO COLLECTIONS

0.0 EXECUTIVE SUMMARY

Our diverse cultures rely increasingly on audio and video resources. We need to chart a steady course to assure the utility of this record. Such a course calls for a plan to preserve these resources and to determine the most effective ways to access their rich content. For example, although our nations possess enormous collections of spoken-word materials, much of this material will remain inaccessible to the public, whether through decay or for lack of adequate search technologies, unless we act to chart a path for access and preservation. Our aim is to forge agreement on these vital topics so that, as technology changes, we will be able to rely on our collections to understand and preserve these essential components of our cultural heritage. We also need to focus research support on the areas of access and preservation that we believe will yield the greatest benefits across many intersecting disciplines. This document presents an agenda for collaborative research in this field.
