Ontology for Transcription of ATC Speech Commands of SESAR 2020 Solution PJ.16-04

Nowadays Automatic Speech Recognition (ASR) applications are increasingly successful in the air traffic (ATC) domain. Paramount to achieving this is collecting enough data for speech recognition model training. Thousands of hours of ATC communication are recorded every day. However, the transcription of these data sets is resource intense, i.e. writing down the sequence of spoken words, and more importantly, interpreting the relevant semantics. Many different approaches including CPDLC (Controller Pilot Data Link Communications) currently exist in the ATC community for command transcription, a fact that e.g. complicates exchange of transcriptions. The partners of the SESAR funded solution PJ.16-04 are currently developing on a common ontology for transcription of controller-pilot communications, which will harmonize integration of ASR into controller working positions. The resulting ontology is presented in this paper.

[1]  Shuo Chen,et al.  Speech inputs to surface safety logic systems , 2015, 2015 IEEE/AIAA 34th Digital Avionics Systems Conference (DASC).

[2]  Natalia Rodríguez Uclés Relationship between Workload and Duration of ATC Voice Communications , 2014 .

[3]  Oliver Ohneiser,et al.  Reducing controller workload with automatic speech recognition , 2016, 2016 IEEE/AIAA 35th Digital Avionics Systems Conference (DASC).

[4]  José Manuel Cordero,et al.  Automated Speech Recognition in Controller Communications applied to Workload Measurement , 2013 .

[5]  Van Nhan Nguyen,et al.  N-best list re-ranking using syntactic score: A solution for improving speech recognition accuracy in air traffic control , 2016, 2016 16th International Conference on Control, Automation and Systems (ICCAS).

[6]  Ajay Srinivasamurthy,et al.  Machine Learning of Controller Command Prediction Models from Recorded Radar Data and Controller Speech Utterances , 2017 .

[7]  Heiko Ehr,et al.  Assistant-Based Speech Recognition for ATM Applications , 2015 .

[8]  Hermann Ney,et al.  Investigation on log-linear interpolation of multi-domain neural network language model , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[9]  Keith Baldwin,et al.  AN AUTOMATED SIMULATION PILOT CAPABILITY TO SUPPORT ADVANCED AIR TRAFFIC CONTROLLER TRAINING , 2008 .

[10]  Geoffrey Zweig,et al.  Advances in speech transcription at IBM under the DARPA EARS program , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[11]  José Manuel Cordero Garcia,et al.  Automated speech recognition in ATC environment , 2012, ATACCS.

[12]  O V Prinzo Data-Linked Pilot Reply Time On Controller Workload and Communication in a Simulated Terminal Option , 2001 .

[13]  Hunter Kopald,et al.  Read Back Error Detection using Automatic Speech Recognition , 2017 .

[14]  Cheryl J. Hamel,et al.  Microcomputer System Integration for Air Control Training , 1989 .

[15]  Van Nhan Nguyen,et al.  N-best list re-ranking using semantic relatedness and syntactic score: An approach for improving speech recognition accuracy in air traffic control , 2016, 2016 16th International Conference on Control, Automation and Systems (ICCAS).

[16]  Oliver Ohneiser,et al.  Increasing ATM Efficiency withAssistant Based Speech Recognition , 2017 .

[17]  Geoffrey Zweig,et al.  Achieving Human Parity in Conversational Speech Recognition , 2016, ArXiv.

[18]  Daniel R. Johnson,et al.  Automatic Speech Semantic Recognition and verification in Air Traffic Control , 2013, 2013 IEEE/AIAA 32nd Digital Avionics Systems Conference (DASC).