Using Web Mining to Support Low Cost Historical Vehicle Traffic Analytics

Analyzing historical vehicle traffic data has many applications including urban planning and intelligent in-vehicle route prediction. A common practice to acquire this data is through roadside sensors. This approach is expensive because of infrastructure and planning costs and cannot be easily applied to new routes. In this paper, a low-cost Web mining approach is proposed to address these limitations. Our system gathers information about vehicle commute times, accidents, and weather reports from heterogeneous Web sources. Information from these sources can be combined to support road traffic analytics. We illustrate the utility of our system through a clustering analysis that investigates the traffic patterns of the busiest highway in Calgary along with factors having the most impact on commute time. The analysis shows that most of the accidents are localized around a small section of the highway near the city center and that the commute time in this segment is significantly more than that in other segments. Bad weather increases the typical evening rush hour commute time by 60% for days with moderate accidents and by a factor of 100% for days with large number of accidents. Overall, commute times can vary by a factor of 4 depending on accidents and weather. Keywords-road traffic; clustering; data analysis; Web mining; traffic management

[1]  Virgílio A. F. Almeida,et al.  A methodology for workload characterization of E-commerce sites , 1999, EC '99.

[2]  D.M. Mount,et al.  An Efficient k-Means Clustering Algorithm: Analysis and Implementation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Thomas A. Runkler,et al.  Classification and prediction of road traffic using application-specific fuzzy clustering , 2002, IEEE Trans. Fuzzy Syst..

[4]  Xiao-dong Zhang,et al.  The study on the application of fuzzy clustering analysis in the dynamic identification of road traffic state , 2003, Proceedings of the 2003 IEEE International Conference on Intelligent Transportation Systems.

[5]  Yi Zhang,et al.  Spatial-temporal traffic data analysis based on global data management using MAS , 2004, IEEE Trans. Intell. Transp. Syst..

[6]  Michel Pasquier,et al.  POP-TRAFFIC: a novel fuzzy neural approach to road traffic analysis and prediction , 2006, IEEE Transactions on Intelligent Transportation Systems.

[7]  Xu Lunhui,et al.  The urban road traffic state identification method based on FCM clustering , 2011, Proceedings 2011 International Conference on Transportation, Mechanical, and Electrical Engineering (TMEE).

[8]  Markus Reischl,et al.  Data mining tools , 2011, WIREs Data Mining Knowl. Discov..

[9]  Lakshminarayanan Subramanian,et al.  Road traffic congestion in the developing world , 2012, ACM DEV '12.

[10]  Behrouz Homayoun Far,et al.  Using Neuro-fuzzy Models to Benchmark Road Safety Management Systems , 2013, 2013 IEEE International Conference on Systems, Man, and Cybernetics.