International Large-Scale Vehicle Corpora for Research on Driver Behavior on the Road

This paper considers a comprehensive and collaborative project to collect large amounts of driving data on the road for use in a wide range of areas of vehicle-related research centered on driving behavior. Unlike previous data collection efforts, the corpora collected here contain both human and vehicle sensor data, together with rich and continuous transcriptions. While most efforts on in-vehicle research are generally focused within individual countries, this effort links a collaborative team from three diverse regions (i.e., Asia, American, and Europe). Details relating to the data collection paradigm, such as sensors, driver information, routes, and transcription protocols, are discussed, and a preliminary analysis of the data across the three data collection sites from the U.S. (Dallas), Japan (Nagoya), and Turkey (Istanbul) is provided. The usability of the corpora has been experimentally verified with a Cohen's kappa coefficient of 0.74 for transcription reliability, as well as being successfully exploited for several in-vehicle applications. Most importantly, the corpora are publicly available for research use and represent one of the first multination efforts to share resources and understand driver characteristics. Future work on distributing the corpora to the wider research community is also discussed.

[1]  Gaël Richard,et al.  The speechdat-car multilingual speech databases for in-car applications: some first validation results , 1999, EUROSPEECH.

[2]  Kazuya Takeda,et al.  Driver Modeling Based on Driving Behavior and Its Evaluation in Driver Identification , 2007, Proceedings of the IEEE.

[3]  James R. Sayer,et al.  Road Departure Crash Warning System Field Operational Test: Methodology and Results. Volume 1: Technical Report , 2006 .

[4]  John H. L. Hansen,et al.  Advances for In-Vehicle and Mobile Systems: Challenges for International Standards , 2007 .

[5]  Kenji Araki,et al.  Analysis of Changes in Dialogue Rhythm Due to Dialogue Acts in Task-Oriented Dialogues , 2007, TSD.

[6]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[7]  John H. L. Hansen,et al.  Computer Vision Systems for “Context-Aware” Active Vehicle Safety and Driver Assistance , 2009 .

[8]  John H. L. Hansen,et al.  CU-Move: Advanced In-Vehicle Speech Systems for Route Navigation , 2005 .

[9]  Kazuya Takeda,et al.  On-Going Data Collection of Driving Behavior Signals , 2009 .

[10]  James R. Sayer,et al.  Integrated vehicle-based safety systems field operational test final program report. , 2011 .

[11]  Mathews Jacob,et al.  Design of steerable filters for feature detection using canny-like criteria , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Kazuya Takeda,et al.  Construction and Analysis of a Multi-Layered In-car Spoken Dialogue Corpus , 2005 .

[13]  Ming Liu,et al.  AVICAR: audio-visual speech corpus in a car environment , 2004, INTERSPEECH.

[14]  Marilyn A. Walker,et al.  The AT&t-DARPA communicator mixed-initiative spoken dialog system , 2000, INTERSPEECH.

[15]  Deborah Bruce,et al.  Improved Vehicle Safety and How Technology Will Get Us There, Hopefully , 2009 .

[16]  Thomas A. Dingus,et al.  The Impact of Driver Inattention on Near-Crash/Crash Risk: An Analysis Using the 100-Car Naturalistic Driving Study Data , 2006 .

[17]  A. Sathyanarayana,et al.  Driver behavior analysis and route recognition by Hidden Markov Models , 2008, 2008 IEEE International Conference on Vehicular Electronics and Safety.

[18]  Kazuya Takeda,et al.  In-car Speech Data Collection along with Various Multimodal Signals , 2008, LREC.

[19]  L. Malta,et al.  Multi-modal real-world driving data collection, transcription, and integration using Bayesian Network , 2008, 2008 IEEE Intelligent Vehicles Symposium.

[20]  Kazuya Takeda,et al.  Analysis of Real-World Driver's Frustration , 2011, IEEE Transactions on Intelligent Transportation Systems.

[21]  Gary L. Rupp,et al.  Performance Metrics for Assessing Driver Distraction: The Quest for Improved Road Safety , 2011 .

[22]  James R. Sayer,et al.  Automotive Collision Avoidance System Field Operational Test Report: Methodology and Results Appendices , 2005 .

[23]  John H. L. Hansen,et al.  Towards Multimodal Driver’s Stress Detection , 2012 .

[24]  John H. L. Hansen,et al.  UTDrive: The Smart Vehicle Project , 2009 .

[25]  John H. L. Hansen,et al.  "CU-move" : analysis & corpus development for interactive in-vehicle speech systems , 2001, INTERSPEECH.

[26]  John H. L. Hansen,et al.  In-Vehicle Corpus and Signal Processing for Driver Behavior , 2008 .

[27]  Thomas A. Dingus,et al.  The 100-Car Naturalistic Driving Study Phase II – Results of the 100-Car Field Experiment , 2006 .