Development of a statewide transportation data warehousing and mining system under the Louisiana transportation information system (LATIS) program : technical summary report.

More jurisdictions including states and metropolitan areas are establishing traffic management centers to assist in reducing congestion. To a lesser extent, these centers are helpful in providing information that assists engineers in making such adjustments as signal synchronization or road improvements. However, the main traffic management center function is real-time decision making for freeways and surface streets. Planning and modification of a traffic network is best pursued in a context that includes large amounts of historical data. Understanding traffic “behavioral” properties is the domain of numerous technologies such as data mining which depends on the presence of large amounts of historical data. To these ends, the Louisiana Transportation Research Center (LTRC) commissioned this study for the design of a data warehousing/data mining system that, while limited to Baton Rouge, will serve as a statewide model. Few traffic-oriented data warehouses exist in the U.S. The methodology employed in designing the Baton Rouge, Louisiana warehouse included visiting many of them and collecting sample data from Baton Rouge sensors. Advisory and stakeholder committees were formed to give advice on the base applications. Base applications are the ones recommended for the initial inclusion in the warehouse. The applications were traced back to the data, resulting in some of them being dropped or modified to suit the data and data quality that was available. From that juncture, the data were tracked forward again to the applications, modeling the transformations necessary. This transformation set constituted the design. Chosen applications, in addition to data mining, included several variations of performance measuring and hydrowatch. The latter is unique among traffic warehouses and is particularly appropriate for the region and State. The data warehouse design consists of a system with three stages – extraction/transformation/loading (ETL) of source data, main storage of the warehouse, and client workstation software. ETL consists of acquiring, cleansing, formatting, merging, and purging of the source data. Much of this stage entails data quality checking and the report addresses this aspect. Main storage is organized around a star schema also called a multidimensional data cube. This separates the static data such as sensor location from dynamic data such as lane occupancy. This design assumes a one-way data flow, input from the sensors and output to client workstations or other media. This approach is codified commercially as online analytical processing (OLAP). Many warehouses stop at the main storage phase (the “dump and run” model). Here, the solution to key client phase issues, interfaces to GIS and linear referencing are given. Infrastructure and a marketing plan are given. The key infrastructure decision is determining the warehouse’s physical location – at LTRC, at a university, or at a private/public concern. All three variations were discovered in site surveys. Marketing consists of addressing various segments beginning with those who are inclined to add value to the system. These are the engineers and planners but at some stage university researchers and the general public must be given access and be convinced of the value that can be obtained.

[1]  Erik van Zwet,et al.  A simple and effective method for predicting travel times on freeways , 2004, IEEE Transactions on Intelligent Transportation Systems.

[2]  Judea Pearl,et al.  The International Journal of Biostatistics C AUSAL I NFERENCE An Introduction to Causal Inference , 2011 .

[3]  James A Bonneson Kinematic Approach to Horizontal Curve Transition Design , 2000 .

[4]  Brian Lee Smith Software Development Cost Estimation for Infrastructure Systems , 2002 .

[5]  W. D. Jones,et al.  Keeping cars from crashing , 2001 .

[6]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[7]  Gary A. Davis,et al.  Bayesian networks and traffic accident reconstruction , 2003, ICAIL.

[8]  I. Chisalita,et al.  Traffic accidents modeling and analysis using temporal reasoning , 2004, Proceedings. The 7th International IEEE Conference on Intelligent Transportation Systems (IEEE Cat. No.04TH8749).

[9]  Yilin Zhao,et al.  Mobile phone location determination and its impact on intelligent transportation systems , 2000, IEEE Trans. Intell. Transp. Syst..

[10]  K Courage,et al.  FEASIBILITY STUDY FOR AN INTEGRATED NETWORK OF DATA SOURCES , 2004 .

[11]  Sebastian Thrun,et al.  Bayesian Network Induction via Local Neighborhoods , 1999, NIPS.

[12]  D. Freedman Graphical Models for Causation, and the Identification Problem , 2004 .

[13]  Fred L. Hall,et al.  IMPROVED DATA SCREENING TECHNIQUES FOR FREEWAY TRAFFIC MANAGEMENT SYSTEMS , 1991 .

[14]  Benjamin Coifman,et al.  Using Dual Loop Speed Traps To Identify Detector Errors , 1999 .

[15]  Will Recker,et al.  Section Travel Time Estimation from Point Detection Data , 2002 .

[16]  Edgar Ergueta,et al.  IMPROVED VEHICLE REIDENTIFICATION AND TRAVEL TIME MEASUREMENT ON CONGESTED FREEWAYS , 2002 .

[17]  Nancy L. Nihan,et al.  Aid to determining freeway metering rates and detecting loop errors , 1997 .

[18]  Rob Mattison Data warehousing - strategies, technologies, and techniques , 1996 .

[19]  Xindong Wu,et al.  A Study of Causal Discovery With Weak Links and Small Samples , 1997, IJCAI.

[20]  Brian L. Smith,et al.  New Procedure for Detector Data Screening in Traffic Management Systems , 2000 .

[21]  Brian Lee Smith,et al.  Developing Complex Integrated Computer Applications and Systems , 1999 .

[22]  D. T. Lee,et al.  Travel-time prediction with support vector regression , 2004, IEEE Transactions on Intelligent Transportation Systems.

[23]  Shashi Shekhar,et al.  CubeView: a system for traffic data visualization , 2002, Proceedings. The IEEE 5th International Conference on Intelligent Transportation Systems.

[24]  David Cordes,et al.  CARE: An Automobile Crash Data Analysis Tool , 2003, Computer.

[25]  Bhagwant Persaud,et al.  Accident Prediction Models With and Without Trend: Application of the Generalized Estimating Equations Procedure , 2000 .

[26]  Sherif Ishak Quantifying Uncertainties of Freeway Detector Observations using Fuzzy-Clustering Approach , 2003 .

[27]  Chris Lee,et al.  Real-Time Crash Prediction Model for Application to Crash Prevention in Freeway Traffic , 2002 .

[28]  Adolf D May,et al.  TRAFFIC DETECTOR ERRORS AND DIAGNOSTICS , 1987 .