Effects of Coverage Area Treatment, Spatial Analysis Unit, and Regression Model on the Results of Station-Level Demand Modeling of Urban Rail Transit

Direct ridership models can predict station-level urban rail transit ridership. Previous research indicates that the direct modeling of urban rail transit ridership uses different coverage overlapping area processing methods (such as naive method or Thiessen polygons), area analysis units (such as census block group and census tract), and various regression models (such as linear regression and negative binomial regression). However, the selection of these methods and models seems arbitrary. The objective of this research is to suggest methods of station-level urban rail transit ridership model selection and evaluate the impact of this selection on ridership model results and prediction accuracy. Urban rail transit ridership data in 2010 were collected from five cities: New York, San Francisco, Chicago, Philadelphia, and Boston. Using the built environment characteristics as the independent variables and station-level ridership as the dependent variable, an analysis was conducted to examine the differences in the model performance in ridership prediction. Our results show that a large overlap of circular coverage areas will greatly affect the accuracy of models. The equal division method increases model accuracy significantly. Most models show that the generalized additive models have lower mean absolute percentage errors (MAPE) and higher adjusted R 2 values. By comparison, the Akaike information criterion (AIC) values of the negative binomial models are lower. The influence of different basic spatial analysis unit on the model results is marginal. Therefore, the selection of basic area unit can use existing data. In terms of model selection, advanced models seem to perform better than the linear regression models.

[1]  Seungil Lee,et al.  Urban structural hierarchy and the relationship between the ridership of the Seoul Metropolitan Subway and the land-use pattern of the station areas , 2013 .

[2]  H. Bozdogan Model selection and Akaike's Information Criterion (AIC): The general theory and its analytical extensions , 1987 .

[3]  Xilei Zhao,et al.  Identifying key factors associated with ridesplitting adoption rate and modeling their nonlinear relationships , 2021, Transportation Research Part A: Policy and Practice.

[4]  Graham Currie,et al.  Understanding ridership drivers for bus rapid transit systems in Australia , 2010 .

[5]  Chuan Ding,et al.  Non-linear associations between zonal built environment attributes and transit commuting mode choice accounting for spatial heterogeneity , 2021 .

[6]  Siqi Zheng,et al.  Does subway proximity discourage automobility? Evidence from Beijing , 2017 .

[7]  Harry Timmermans,et al.  Analysis of Metro Station Ridership Considering Spatial Heterogeneity , 2019, Chinese Geographical Science.

[8]  Renbin Pan,et al.  Exploring the Equity of Traditional and Ride-Hailing Taxi Services during Peak Hours , 2020 .

[9]  Keechoo Choi,et al.  Exploring the impacts of land use by service coverage and station-level accessibility on rail transit ridership , 2014 .

[10]  Hongtai Yang,et al.  Direct modeling of subway ridership at the station level: a study based on mixed geographically weighted regression , 2020, Canadian Journal of Civil Engineering.

[11]  Hongtai Yang,et al.  Exploring spatial variation of bike sharing trip production and attraction: A study based on Chicago’s Divvy system , 2020 .

[12]  Jeffrey R. Brown,et al.  What Really Matters for Increasing Transit Ridership: Understanding the Determinants of Transit Ridership Demand in Broward County, Florida , 2012 .

[13]  Zi-jia Wang,et al.  Spatio-temporal analysis of rail station ridership determinants in the built environment , 2018, Transportation.

[14]  Ahmed M El-Geneidy,et al.  Using GIS to Measure the Effect of Overlapping Service Areas on Passenger Boardings at Bus Stops , 2007 .

[15]  Cheng Shi,et al.  What determines rail transit passenger volume? Implications for Transit oriented development planning , 2017 .

[16]  Claire E McKnight,et al.  Does the built environment make a difference? Additional evidence from the daily activity and travel behavior of homemakers living in New York City and suburbs , 2007 .

[17]  Jian Rong,et al.  Estimation of transit ridership based on spatial analysis and precise land use data , 2016 .

[18]  Tao Feng,et al.  Examining the relationship between built environment and metro ridership at station-to-station level , 2020, Transportation Research Part D: Transport and Environment.

[19]  Robbi Rahim,et al.  Forecasting Error Calculation with Mean Absolute Deviation and Mean Absolute Percentage Error , 2017 .

[20]  Chi Xie,et al.  Promoting carsharing attractiveness and efficiency: An exploratory analysis , 2018, Transportation Research Part D: Transport and Environment.

[21]  Zhigang Gao,et al.  Linkage between passenger demand and surrounding land-use patterns at urban rail transit stations: A canonical correlation analysis method and case study in Chongqing , 2016 .

[22]  Gregory D. Erhardt,et al.  Evaluating the Ability of Transit Direct Ridership Models to Forecast Medium-Term Ridership Changes: Evidence from San Francisco , 2018 .

[23]  Juhwan Oh,et al.  Transit-oriented development in a high-density city: Identifying its association with transit ridership in Seoul, Korea , 2011 .

[24]  Behram Wali,et al.  Developing policy thresholds for objectively measured environmental features to support active travel , 2021 .

[25]  J. Gutiérrez,et al.  Application of geographically weighted regression to the direct forecasting of transit ridership at station-level , 2012 .

[26]  Luis F. Miranda-Moreno,et al.  A station-level ridership model for the metro network in Montreal, Quebec , 2013 .

[27]  Anabela Ribeiro,et al.  Measuring and Evaluating the Impacts of TOD Measures – Searching for Evidence of TOD Characteristics in Azambuja Train Line , 2014 .

[28]  Linchuan Yang,et al.  Equitable? Exploring ridesourcing waiting time and its determinants , 2021 .

[29]  S. Wood mgcv:Mixed GAM Computation Vehicle with GCV/AIC/REML smoothness estimation , 2012 .

[30]  Keechoo Choi,et al.  Land use characteristics of subway catchment areas and their influence on subway ridership in Seoul , 2015 .

[31]  Jing Huo,et al.  How Do Different Treatments of Catchment Area Affect the Station Level Demand Modeling of Urban Rail Transit? , 2021, Journal of Advanced Transportation.

[32]  Chuan Ding,et al.  Non-linear relationships between built environment characteristics and electric-bike ownership in Zhongshan, China , 2019, Transportation Research Part D: Transport and Environment.

[33]  Xiaokun Wang,et al.  Transit ridership estimation with network Kriging: a case study of Second Avenue Subway, NYC , 2014 .

[34]  Simon Choi,et al.  Sustainable Mobility: Longitudinal Analysis of Built Environment on Transit Ridership , 2016 .

[35]  Yan Song,et al.  Analysis of Metro ridership at station level and station-to-station level in Nanjing: an approach based on direct demand models , 2014 .

[36]  Keemin Sohn,et al.  Factors generating boardings at Metro stations in the Seoul metropolitan area , 2010 .

[37]  Catherine L. Ross,et al.  Machine Learning Travel Mode Choices: Comparing the Performance of an Extreme Gradient Boosting Model with a Multinomial Logit Model , 2018 .

[38]  Linchuan Yang,et al.  Influence of the built environment on E-scooter sharing ridership: A tale of five cities , 2021 .

[39]  Sevgi Erdogan,et al.  How to Increase Rail Ridership in Maryland: Direct Ridership Models for Policy Guidance , 2016 .