Subjective data arrangement using clustering techniques for training expert systems

Abstract The evaluation of subjective data is a very demanding task. The classification of the information gathered from human evaluators and the possible high noise levels introduced are ones of the most difficult issues to deal with. This situation leads to adopt individuals who can be considered as experts in the specific application domain. Thus, the development of Expert Systems (ES) that consider the opinion of these individuals have been appeared to mitigate the problem. In this work an original methodology for the selection of subjective sequential data for the training of ES is presented. The system is based on the arrangement of knowledge acquired from a group of human experts. An original similarity measure between the subjective evaluations is proposed. Homogeneous groups of experts are produced using this similarity through a clustering algorithm. The methodology was applied to a practical case of the Intelligent Transportation Systems (ITS) domain for the training of ES for driving risk prediction. The results confirm the relevance of selecting homogeneous information (grouping similar opinions) when generating a ground truth (a reliable signal) for the training of ES. Further, the results show the need of considering subjective sequential data when working with phenomena where a set of rules could not be easily learned from human experts, such as risk assessment.

[1]  James R. Slagle,et al.  An explanation facility for today's expert systems , 1989, IEEE Expert.

[2]  T. M. Abdel-Moneim,et al.  Optimization and mechanical simulation of a pursuit-evader scenario using genetic algorithm and stewart platform , 2017, 2017 8th International Conference on Mechanical and Aerospace Engineering (ICMAE).

[3]  Yuan Zhang,et al.  Credit risk assessment based on neural network , 2012, 2012 8th International Conference on Natural Computation.

[4]  François de Vieilleville,et al.  Analysis and Comparative Evaluation of Discrete Tangent Estimators , 2005, DGCI.

[5]  J. Leask,et al.  Vaccine Rejecting Parents’ Engagement With Expert Systems That Inform Vaccination Programs , 2017, Journal of Bioethical Inquiry.

[6]  Michael J. Goodman,et al.  NHTSA DRIVER DISTRACTION RESEARCH: PAST, PRESENT, AND FUTURE , 2001 .

[7]  Xiaobu Yuan,et al.  An enhanced approach for classifying emotions using customized decision tree algorithm , 2012, 2012 Proceedings of IEEE Southeastcon.

[8]  Sarah E. Brockwell,et al.  A comparison of statistical methods for meta‐analysis , 2001, Statistics in medicine.

[9]  Brian R. Gaines,et al.  An Overview of Knowledge-Acquisition and Transfer , 1987, Int. J. Man Mach. Stud..

[10]  Harry Zhang,et al.  Naturalistic use of cell phones in driving and context-based user assistance , 2007, Mobile HCI.

[11]  Jiang Hua,et al.  Study on Knowledge Acquisition Techniques , 2008, 2008 Second International Symposium on Intelligent Information Technology Application.

[12]  Jay F. Nunamaker,et al.  Using a group decision support system environment for knowledge acquisition: a field study , 1990, Twenty-Third Annual Hawaii International Conference on System Sciences.

[13]  Wolfgang Fastenmeier,et al.  Driving Task Analysis as a Tool in Traffic Safety Research and Practice , 2007 .

[14]  William P. Wagner Trends in expert system development: A longitudinal content analysis of over thirty years of expert system case studies , 2017, Expert Syst. Appl..

[15]  Hussein Dia,et al.  An agent-based approach to modelling driver route choice behaviour under the influence of real-time information , 2002 .

[16]  Cristina Conde,et al.  Combining experts knowledge for driving risks recognition , 2011, Proceedings of 2011 IEEE International Conference on Vehicular Electronics and Safety.

[17]  D. Prelec A Bayesian Truth Serum for Subjective Data , 2004, Science.

[18]  Michael J. Goodman,et al.  The role of driver inattention in crashes: new statistics from the 1995 crashworthiness data system , 1996 .

[19]  Roger Tourangeau,et al.  Evaluating the Effectiveness of Visual Analog Scales , 2006 .

[20]  Charu C. Aggarwal,et al.  Data Mining: The Textbook , 2015 .

[21]  Cristina Conde,et al.  Analysis of hands activity for automatic driving risk detection , 2013 .

[22]  Hei-Chia Wang,et al.  Combining subjective and objective QoS factors for personalized web service selection , 2007, Expert Syst. Appl..

[23]  Cristina Conde,et al.  Combining traffic safety knowledge for driving risk detection , 2011, 2011 14th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[24]  Kazuya Takeda,et al.  Analysis of Real-World Driver's Frustration , 2011, IEEE Transactions on Intelligent Transportation Systems.

[25]  Cristina Conde,et al.  Subjective Traffic Safety Experts' Knowledge for Driving-Risk Definition , 2014, IEEE Transactions on Intelligent Transportation Systems.

[26]  Javier M. Moguerza,et al.  Methods for the combination of kernel matrices within a support vector framework , 2009, Machine Learning.

[27]  Sara B. Kiesler,et al.  Calling while driving: effects of providing remote traffic context , 2005, CHI.

[28]  Hesham Rakha,et al.  Characterizing Driver Behavior on Signalized Intersection Approaches at the Onset of a Yellow-Phase Trigger , 2007, IEEE Transactions on Intelligent Transportation Systems.

[29]  Ngoc Thanh Nguyen,et al.  Advanced Computational Methods for Knowledge Engineering - Proceedings of the 6th International Conference on Computer Science, Applied Mathematics and Applications, ICCSAMA 2019, Hanoi, Vietnam, December 19-20, 2019 , 2020, ICCSAMA.

[30]  Eamonn J. Keogh,et al.  Segmenting Time Series: A Survey and Novel Approach , 2002 .

[31]  Jennifer Healey,et al.  Recording Affect in the Field: Towards Methods and Metrics for Improving Ground Truth Labels , 2011, ACII.

[32]  Cristina Conde,et al.  Section-Wise Similarities for Clustering and Outlier Detection of Subjective Sequential Data , 2011, SIMBAD.

[33]  De Wu,et al.  A Piecewise Linear Representation Method of Time Series Based on Feature Points , 2007, KES.

[34]  Robert F. Hodson,et al.  Real-time expert systems' computer architecture , 1989 .

[35]  E. Turban,et al.  Managing knowledge acquisition from multiple experts , 1991, [1991] Proceedings of the IEEE/ACM International Conference on Developing and Managing Expert System Programs.

[36]  Cristina Conde,et al.  Accident reproduction system for the identification of human factors involved on traffic accidents , 2012, 2012 IEEE Intelligent Vehicles Symposium.

[37]  Yihong Gong,et al.  Driving Safety Monitoring Using Semisupervised Learning on Time Series Data , 2010, IEEE Transactions on Intelligent Transportation Systems.

[38]  Raúl Quintero,et al.  Drowsiness monitoring based on driver and driving data fusion , 2011, 2011 14th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[39]  Kai Chen,et al.  Collaborative filtering and deep learning based recommendation system for cold start items , 2017, Expert Syst. Appl..

[40]  Cristina Conde,et al.  Automatic driving risk detection based on hands activity , 2011, 2011 14th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[41]  Cristina Conde,et al.  Driving risk classification based on experts evaluation , 2010, 2010 IEEE Intelligent Vehicles Symposium.

[42]  S. Menard Applied Logistic Regression Analysis , 1996 .

[43]  Rubén Posada-Gómez,et al.  Development of a fuzzy expert system for the nephropathy control assessment in patients with type 2 diabetes mellitus , 2017, Expert Syst. Appl..

[44]  Bruce Simons-Morton,et al.  The observed effects of teenage passengers on the risky driving behavior of teenage drivers. , 2005, Accident; analysis and prevention.

[45]  Abdul Sattar,et al.  Developing expert system with soft systems concept , 1994, Proceedings of International Conference on Expert Systems for Development.

[46]  A. Sathyanarayana,et al.  Driver behavior analysis and route recognition by Hidden Markov Models , 2008, 2008 IEEE International Conference on Vehicular Electronics and Safety.

[47]  H. Greenberg An Analysis of Traffic Flow , 1959 .

[48]  Thomas Hofmann,et al.  Support vector machine learning for interdependent and structured output spaces , 2004, ICML.

[49]  R. Cork,et al.  A Comparison Of The Verbal Rating Scale And The Visual Analog Scale For Pain Assessment , 2003 .

[50]  Cristina Conde,et al.  Optimal experts' knowledge selection for intelligent driving risk detection systems , 2012, 2012 IEEE Intelligent Vehicles Symposium.

[51]  Joaquim Ferreira,et al.  Introduction to Intelligent Transportation Systems , 2016 .

[52]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[53]  Mohan M. Trivedi,et al.  Multi-spectral and multi-perspective video arrays for driver body tracking and activity analysis , 2007, Comput. Vis. Image Underst..

[54]  Katja Kircher,et al.  Issues related to the driver distraction detection algorithm AttenD , 2009 .

[55]  Charles C. MacAdam,et al.  Application of an Optimal Preview Control for Simulation of Closed-Loop Automobile Driving , 1981, IEEE Transactions on Systems, Man, and Cybernetics.

[56]  Shivani Goel,et al.  Expert system and it's requirement engineering process , 2014, International Conference on Recent Advances and Innovations in Engineering (ICRAIE-2014).

[57]  Sterling A. Bone,et al.  Identifying the traits of aggressive and distracted drivers: a hierarchical trait model approach , 2006 .

[58]  Liu Yijun,et al.  A machine learning algorithm for expert system based on MYCIN model , 2010, 2010 2nd International Conference on Computer Engineering and Technology.