Field Study Evaluation of Cepstrum Coefficient Speech Analysis for Fatigue in Aviation Cabin Crew

Impaired neurobehavioral performance induced by fatigue may compromise safety in 24-hour operational environments such as aviation. As such, non-invasive, reliable, and valid methods of objectively detecting compromised performance capacity in operational settings could be valuable as a means of identifying, preventing, and mitigating fatigue-induced safety risks. One approach that has attracted attention in recent years is quantitative speech analysis, but the extent of its operational feasibility, validity of the metrics, and sensitivity to operationally-relevant factors in aviation remains unknown. To this end, the present report offers an initial proof-of-concept evaluation of a speech analysis method based on Cepstrum Coefficient modeling, using voice files from a broad sample of 195 cabin crew personnel collected during the 2009-2010 United States Civil Aerospace Medical Institute-sponsored Flight Attendant Field Study. Using a personal digital assistant device, participants recited five standardized phrases in random order before and after each workday and sleep episode throughout their respective 3-4 week study periods. Operational acceptability of the procedure was high, as indicated by high protocol compliance and, despite the inherent variability of the timing and environments in which the test sessions occurred, the 13,975 files from 2,795 valid sessions were of sufficient quality for formal analysis. Individualized “baseline” speech models were built from the files collected during test sessions coinciding with optimal neurobehavioral performance, as determined by 5-min Psychomotor Vigilance Test (PVT) reaction times(RT), then speech deviation scores relative to individual baseline models were calculated for the test sessions that preceded and concluded each “trip” of multiple consecutive work days. Regarding validity, speech scores correlated significantly with PVT RTs and Lapses (RTs > 500 msec), with a stronger relationship to Lapses, but high variability at the low range of both performance variables suggests the influence of other factors. Regarding sensitivity to operational factors, average Pre-Trip vs. Post-Trip speech scores differed significantly, although scores unexpectedly decreased from Pre to Post, an artifact attributable to the composition of the baseline session pool. Nonetheless, the pattern of speech data echoed performance data from a previous report in which scores were most affected in crew of Regional carriers, with Junior seniority, and in Domestic operations. These initial results reveal promising validity and sensitivity of Cepstrum Coefficient modeling for speech signal analysis of fatigue in dynamic operational environments. Remaining questions underscore the need to further explore the dataset to determine the precise relationship between speech production and neurobehavioral performance capacity, the parameters for constructing individualized models, and standardized quantitative speech-based definitions of fatigue.

[1]  D. Dawson,et al.  The Impact of Short, Irregular Sleep Opportunities at Sea on the Alertness of Marine Pilots Working Extended Hours , 2008, Chronobiology international.

[2]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[3]  Martin Golz,et al.  Acoustic sleepiness detection: Framework and validation of a speech-adapted pattern recognition approach , 2009, Behavior research methods.

[4]  Melissa M. Mallis,et al.  Aircrew Fatigue, Sleep Need and Circadian Rhythmicity , 2010 .

[5]  D. Dinges,et al.  Maximizing sensitivity of the psychomotor vigilance test (PVT) to sleep loss. , 2011, Sleep.

[6]  Thomas E Nesthus,et al.  Flight Attendant Fatigue Recommendation 2: Flight Attendant Work/Rest Patterns, Alertness, and Performance Assessment , 2010 .

[7]  Drew Dawson,et al.  Fatigue assessment in the field: validation of a hand-held electronic psychomotor vigilance task. , 2005, Aviation, space, and environmental medicine.

[8]  Kakuichi Shiomi Voice processing technique for human cerebral activity measurement , 2008, 2008 IEEE International Conference on Systems, Man and Cybernetics.

[9]  Eric Friets,et al.  Fatigue estimation using voice analysis , 2007, Behavior research methods.

[10]  Ann R. Bradlow,et al.  Landmark‐based analysis of sleep‐deprived speech , 2008 .

[11]  Tao Chen,et al.  On the use of Gaussian mixture model for speaker variability analysis , 2002, INTERSPEECH.

[12]  T. Åkerstedt Work hours and sleepiness , 1995, Neurophysiologie Clinique/Clinical Neurophysiology.

[13]  D. Dawson,et al.  Do Short International Layovers Allow Sufficient Opportunity for Pilots to Recover? , 2006, Chronobiology international.

[14]  Gregory Belenky,et al.  The Walter Reed palm-held psychomotor vigilance test , 2005, Behavior research methods.

[15]  D. Dinges An overview of sleepiness and accidents , 1995, Journal of sleep research.

[16]  P. Mermelstein,et al.  Distance measures for speech recognition, psychological and instrumental , 1976 .

[17]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[18]  P. Ladefoged A course in phonetics , 1975 .

[19]  T. Davidson The Great Leap Forward: the anatomic basis for the acquisition of speech and obstructive sleep apnea. , 2003, Sleep medicine.

[20]  Debra J. Skene,et al.  Differences in Sleep, Light, and Circadian Phase in Offshore 18:00–06:00 h and 19:00–07:00 h Shift Workers , 2008, Chronobiology international.

[21]  Philip Lieberman,et al.  Mount Everest: a space analogue for speech monitoring of cognitive deficits and stress. , 2005, Aviation, space, and environmental medicine.