Short-Term Passenger Flow Forecast of Rail Transit Station Based on MIC Feature Selection and ST-LightGBM considering Transfer Passenger Flow

To solve the problems of current short-term forecasting methods for metro passenger flow, such as unclear influencing factors, low accuracy, and high time-space complexity, a method for metro passenger flow based on ST-LightGBM after considering transfer passenger flow is proposed. Firstly, using historical data as the training set to transform the problem into a data-driven multi-input single-output regression prediction problem, the problem of the short-term prediction of metro passenger flow is formalized and the difficulties of the problem are identified. Secondly, we extract the candidate temporal and spatial features that may affect passenger flow at a metro station from passenger travel data based on the spatial transfer and spatial similarity of passenger flow. Thirdly, we use a maximal information coefficient (MIC) feature selection algorithm to select the significant impact features as the input. Finally, a short-term forecasting model for metro passenger flow based on the light gradient boosting machine (LightGBM) model is established. Taking transfer passenger flow into account, this method has a low space-time cost and high accuracy. The experimental results on the dataset of Lianban metro station in Xiamen city show that the proposed method obtains higher prediction accuracy than SARIMA, SVR, and BP network.

[1]  Pan Shang,et al.  Timetable Synchronization and Optimization Considering Time-Dependent Passenger Demand in an Urban Subway Network , 2018 .

[2]  Cheng Wang,et al.  A fast clustering algorithm based on pruning unnecessary distance computations in DBSCAN for high-dimensional data , 2018, Pattern Recognit..

[3]  Yanyan Xu,et al.  Short-Term Origin-Destination Based Metro Flow Prediction with Probabilistic Model Selection Approach , 2018, Journal of Advanced Transportation.

[4]  Jianhua Guo,et al.  Adaptive Kalman filter approach for stochastic short-term traffic flow rate prediction and uncertainty quantification , 2014 .

[5]  Tie-Yan Liu,et al.  LightGBM: A Highly Efficient Gradient Boosting Decision Tree , 2017, NIPS.

[6]  Lei Ma,et al.  Prediction of 30-Day Readmission: An Improved Gradient Boosting Decision Tree Approach , 2019, J. Medical Imaging Health Informatics.

[7]  Houbing Song,et al.  Imperfect Information Dynamic Stackelberg Game Based Resource Allocation Using Hidden Markov for Cloud Computing , 2018, IEEE Transactions on Services Computing.

[8]  Zhiwen Yu,et al.  KNN-BLOCK DBSCAN: Fast Clustering for Large-Scale Data , 2021, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[9]  Yang Ying,et al.  ST-LSTM: A Deep Learning Approach Combined Spatio-Temporal Features for Short-Term Forecast in Rail Transit , 2019, Journal of Advanced Transportation.

[10]  M. Florian,et al.  A combined trip distribution modal split and trip assignment model , 1978 .

[11]  Mi Gensu Subway station passenger flow forecast based on mixed kernel support vector machine optimized by golden section chaotic particle swarm optimization , 2015 .

[12]  Pan Shang,et al.  Equity-oriented skip-stopping schedule optimization in an oversaturated urban rail transit network , 2018 .

[13]  Bo Zeng,et al.  A multi-pattern deep fusion model for short-term bus passenger flow forecasting , 2017, Appl. Soft Comput..

[14]  Mohamed El Esawey Daily Bicycle Traffic Volume Estimation: Comparison of Historical Average and Count Models , 2018 .

[15]  Bin Ran,et al.  Short-to-medium Term Passenger Flow Forecasting for Metro Stations using a Hybrid Model , 2017, KSCE Journal of Civil Engineering.

[16]  Robertas Damasevicius,et al.  Multi-sink distributed power control algorithm for Cyber-physical-systems in coal mine tunnels , 2019, Comput. Networks.

[17]  Shuai Feng,et al.  Passenger Flow Forecast of Metro Station Based on the ARIMA Model , 2016 .

[18]  D I Robertson,et al.  ELECTRONIC ROUTE GUIDANCE BY AUTOGUIDE: THE RESEARCH BACKGROUND , 1987 .

[19]  Yang Zhao,et al.  Forecasting Short-Term Passenger Flow: An Empirical Study on Shenzhen Metro , 2019, IEEE Transactions on Intelligent Transportation Systems.

[20]  Mecit Cetin,et al.  Short-term traffic flow rate forecasting based on identifying similar traffic patterns , 2016 .

[21]  Gérald Gavin,et al.  A dynamic Bayesian network approach to forecast short-term urban rail passenger flows with incomplete data , 2017 .

[22]  Jianwei Chen,et al.  Passenger Flow Forecast of Rail Station Based on Multi-Source Data and Long Short Term Memory Network , 2020, IEEE Access.

[23]  Jing Zhou,et al.  Default prediction in P2P lending from high-dimensional data based on machine learning , 2019, Physica A: Statistical Mechanics and its Applications.

[24]  Xin Liu,et al.  Fast density peak clustering for large scale data based on kNN , 2020, Knowl. Based Syst..

[25]  Yizhe Wang,et al.  Prediction of Daily Entrance and Exit Passenger Flow of Rail Transit Stations by Deep Learning Method , 2018 .

[26]  Yang Liu,et al.  DeepPF: A deep learning based architecture for metro passenger flow prediction , 2019, Transportation Research Part C: Emerging Technologies.

[27]  Thomas L. Magnanti,et al.  A Combined Trip Generation, Trip Distribution, Modal Split, and Trip Assignment Model , 1988, Transp. Sci..

[28]  Pengpeng Jiao,et al.  Three Revised Kalman Filtering Models for Short-Term Rail Transit Passenger Flow Prediction , 2016 .

[29]  Yang Li,et al.  Feature selection for transient stability assessment based on kernelized fuzzy rough sets and memetic algorithm , 2018, 1808.08790.

[30]  Michael Mitzenmacher,et al.  Detecting Novel Associations in Large Data Sets , 2011, Science.

[31]  Lei Shi,et al.  Study on the Selection Model of Staying Adjustment Bus Lines along Rail Transit , 2020 .

[32]  Lei Deng,et al.  PDRLGB: precise DNA-binding residue prediction using a light gradient boosting machine , 2018, BMC Bioinformatics.

[33]  Jiang-She Zhang,et al.  Big data analytics enabled by feature extraction based on partial independence , 2017, Neurocomputing.

[34]  Biao Leng,et al.  A novel wavelet-SVM short-time passenger flow prediction in Beijing subway system , 2015, Neurocomputing.

[35]  Lelitha Vanajakshi,et al.  Short-term traffic flow prediction using seasonal ARIMA model with limited input data , 2015, European Transport Research Review.

[36]  Lee D. Han,et al.  A Kriging based spatiotemporal approach for traffic volume data imputation , 2018, PloS one.

[37]  Tao Yu,et al.  Short-Term Load Forecasting Based on LS-SVM Optimized by Bacterial Colony Chemotaxis Algorithm , 2009, 2009 International Conference on Information and Multimedia Technology.

[38]  Yan Hui Wang,et al.  Prediction of the Metro Section Passenger Flow Based on Time-Space Characteristic , 2013 .

[39]  Nebojsa J. Bojovic,et al.  SARIMA modelling approach for railway passenger flow forecasting , 2016 .

[40]  Peng Gao,et al.  Empirical mode decomposition based long short-term memory neural network forecasting model for the short-term metro passenger flow , 2019, PloS one.

[41]  Peng Gao,et al.  Predicting Station-Level Short-Term Passenger Flow in a Citywide Metro Network Using Spatiotemporal Graph Convolutional Neural Networks , 2019, ISPRS Int. J. Geo Inf..

[42]  Saeedeh Anvari,et al.  Automated Box–Jenkins forecasting tool with an application for passenger demand in urban rail systems , 2016 .

[43]  Wei Wu,et al.  Short‐term passenger flow forecast of urban rail transit based on GPR and KRR , 2019, IET Intelligent Transport Systems.

[44]  Gary A. Davis,et al.  Nonparametric Regression and Short‐Term Freeway Traffic Forecasting , 1991 .

[45]  Mu-Chen Chen,et al.  Forecasting the short-term metro passenger flow with empirical mode decomposition and neural networks , 2012 .

[46]  Akhil Kadiyala,et al.  Applications of python to evaluate the performance of decision tree‐based boosting algorithms , 2018 .

[47]  Rung-Ching Chen,et al.  A novel passenger flow prediction model using deep learning methods , 2017 .