The 2019 Inaugural Fearless Steps Challenge: A Giant Leap for Naturalistic Audio

The 2019 FEARLESS STEPS (FS-1) Challenge is an initial step to motivate a streamlined and collaborative effort from the speech and language community towards addressing massive naturalistic audio, the first of its kind. The Fearless Steps Corpus is a collection of 19,000 hours of multi-channel recordings of spontaneous speech from over 450 speakers under multiple noise conditions. A majority of the Apollo Missions original analog data is unlabeled and has thus far motivated the development of both unsupervised and semi-supervised strategies. This edition of the challenge encourages the development of core speech and language technology systems for data with limited groundtruth/low resource availability and is intended to serve as the “First Step” towards extracting high-level information from such massive unlabeled corpora. In conjunction with the Challenge, 11,000 hours of synchronized 30-channel Apollo-11 audio data has also been released to the public by CRSS-UTDallas. We describe in this paper the Fearless Steps Corpus, Challenge Tasks, their associated baseline systems, and results. In conclusion, we also provide insights gained by the CRSS-UTDallas team during the inaugural Fearless Steps Challenge.

[1]  John H. L. Hansen,et al.  Unsupervised Speech Activity Detection Using Voicing Measures and Perceptual Spectral Flux , 2013, IEEE Signal Processing Letters.

[2]  John H. L. Hansen,et al.  'houston, we have a solution': a case study of the analysis of astronaut speech during NASA apollo 11 for long-term speaker modeling , 2014, INTERSPEECH.

[3]  Florin Curelaru,et al.  Front-End Factor Analysis For Speaker Verification , 2018, 2018 International Conference on Communications (COMM).

[4]  John H. L. Hansen,et al.  Fearless Steps: Apollo-11 Corpus Advancements for Speech Technologies from Earth to the Moon , 2018, INTERSPEECH.

[5]  John H. L. Hansen,et al.  Active Learning Based Constrained Clustering For Speaker Diarization , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[6]  Jon Barker,et al.  The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines , 2018, INTERSPEECH.

[7]  John H. L. Hansen,et al.  Automatic audio sentiment extraction using keyword spotting , 2015, INTERSPEECH.

[8]  Chengzhu Yu,et al.  A study of voice production characteristics of astronuat speech during Apollo 11 for speaker modeling in space. , 2017, The Journal of the Acoustical Society of America.

[9]  John H. L. Hansen,et al.  Speech activity detection for NASA apollo space missions: challenges and solutions , 2014, INTERSPEECH.

[10]  Andreas Stolcke,et al.  SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.

[11]  John H. L. Hansen,et al.  i-Vector/PLDA speaker recognition using support vectors with discriminant analysis , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[12]  Daniel Povey,et al.  The Kaldi Speech Recognition Toolkit , 2011 .

[13]  John H. L. Hansen,et al.  Robust Speaker Clustering using Mixtures of von Mises-Fisher Distributions for Naturalistic Audio Streams , 2018, INTERSPEECH.

[14]  John H. L. Hansen,et al.  Speech Detection and Enhancement Using Single Microphone for Distant Speech Applications in Reverberant Environments , 2017, INTERSPEECH.

[15]  Douglas A. Reynolds,et al.  Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[16]  Lakshmish Kaushik Conversational Speech Understanding in Highly Naturalistic Audio Streams , 2018 .

[17]  John H. L. Hansen,et al.  'houston, we have a solution': using NASA apollo program to advance speech and language processing technology , 2013, INTERSPEECH.

[18]  John H. L. Hansen,et al.  Sentiment extraction from natural audio streams , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[19]  Douglas A. Reynolds,et al.  The NIST 2014 Speaker Recognition i-vector Machine Learning Challenge , 2014, Odyssey.