Paper information - Speech, Voice, Text, and Meaning: A Multidisciplinary Approach to Interview Data through the use of digital tools

Speech, Voice, Text, and Meaning: A Multidisciplinary Approach to Interview Data through the use of digital tools

Interview data is multimodal data: it consists of speech sound, facial expression and gestures, captured in a particular situation, and containing textual information and emotion. This workshop shows how a multidisciplinary approach may exploit the full potential of interview data. The workshop first gives a systematic overview of the research fields working with interview data. It then presents the speech technology currently available to support transcribing and annotating interview data, such as automatic speech recognition, speaker diarization, and emotion detection. Finally, scholars who work with interview data and tools may present their work and discover how to make use of existing technology.

[1] Franciska de Jong, et al. Emotional Expression in Oral History Narratives: Comparing Results of Automated Verbal and Nonverbal Analyses, 2013, CMN.

[2] Louise Corti, et al. A CLARIN Transcription Portal for Interview Data, 2020, LREC.

[3] Franciska de Jong, et al. Croatian Memories: speech, meaning and emotions in a collection of interviews on experiences of war and trauma, 2014, LREC.