A Two-Dimensional Clustering Method for High-Speed Railway Trains in China Based on Train Characteristics and Operational Performance

At present, China has the world’s longest high-speed railway (HSR) network, but most trains do not have clear market positioning and hierarchical standard. The current train hierarchies and adjustment is empirically set by HSR organisers. This paper applies a clustering method and cross-over analysis to study the classification of China’s HSR trains and provides scientific operation suggestions. By adopting the timetable data and ticket booking data of Shanghai-Nanjing intercity railway, we establish a clustering index system consists of train characteristics indexes and operational performance indexes, then use t-distributed Stochastic Neighbour Embedding (t-SNE) to do dimensionality reduction for original data. We obtain the optimal clustering number by validity indexes and use $k$ -means to cluster the HSR trains. After clustering, we use a cross-over analysis to illustrate the relationship between train characteristics, passenger demand and operational performance. We find there are two main types of train on Shanghai-Nanjing intercity HSR: trains departing on the hour that have good operational performance, trains with staggered stops and low-capacity based on the strategies of “low capacity, high density” that have better operational performance at peak time. After 19:00, due to the passenger demand decrease, train capacity can be reduced to avoid unnecessary waste. Short-distance trains could easily be replaced by long-distance trains with similar stop schedules and need to maintain a certain operation frequency. The proposed clustering method has universal applicability for Chinese HSR lines.

[1]  Svetla Stoilova,et al.  AN APPLICATION OF AHP METHOD FOR EXAMINING THE TRANSPORT PLAN OF PASSENGER TRAINS IN BULGARIAN RAILWAY NETWORK , 2018 .

[2]  Fancello Gianfranco,et al.  Port Cooperation Policies in the Mediterranean Basin: An Experimental Approach Using Cluster Analysis , 2014 .

[3]  Ching-Hsien Hsu,et al.  Evaluating rail transit timetable using big passengers' data , 2016, J. Comput. Syst. Sci..

[4]  Geert Wets,et al.  Traffic accident segmentation by means of latent class clustering. , 2008, Accident; analysis and prevention.

[5]  Huiling Fu,et al.  A hierarchical line planning approach for a large-scale high speed rail network: The China case , 2015 .

[6]  Chin-Shan Lu,et al.  Effects of service quality and customer satisfaction on customer loyalty in high-speed rail services in Taiwan , 2014 .

[7]  Xuesong Zhou,et al.  Single-Track Train Timetabling with Guaranteed Optimality: Branch-and-Bound Algorithms with Enhanced Lower Bounds , 2007 .

[9]  Alireza Ermagun,et al.  Using Twitter network to detect market segments in the airline industry , 2018, Journal of Air Transport Management.

[10]  Robert Mayer,et al.  Airport classification based on cargo characteristics , 2016 .

[11]  Tal Raviv,et al.  Service-Oriented Line Planning and Timetabling for Passenger Trains , 2013, Transp. Sci..

[12]  Shifeng Wang,et al.  Optimizing train stopping patterns and schedules for high-speed passenger rail corridors , 2016 .

[13]  Feng Liu,et al.  A New Approach for Hierarchical Dividing to Passenger Nodes in Passenger Dedicated Line , 2018, J. Inf. Process. Syst..

[14]  Geoffrey E. Hinton,et al.  Stochastic Neighbor Embedding , 2002, NIPS.

[15]  Eftihia Nathanail,et al.  Measuring the quality of service for passengers on the hellenic railways , 2008 .

[16]  Hyojoo Son,et al.  Longitudinal assessment of high-speed rail service delivery, satisfaction and operations: A study of Taiwan and Korea systems , 2017 .

[17]  Qi Zhang,et al.  High-Speed Railway Passenger Node Classification Method and Train Stops Scheme , 2014 .

[18]  Ferenc Szidarovszky,et al.  A multi-objective train scheduling model and solution , 2004 .

[19]  Iwan von Wartburg,et al.  1 Customer Segmentation Revisited : The Case of the Airline Industry , 2020 .

[20]  Frédéric Dobruszkes,et al.  High-speed rail and air transport competition in Western Europe: A supply-oriented perspective , 2011 .

[21]  Mirko Hornung,et al.  Airline categorisation by applying the business model canvas and clustering algorithms , 2018, Journal of Air Transport Management.

[22]  A. Graham,et al.  Devising airport groupings for financial benchmarking , 2013 .

[23]  Francisco de Sousa Ramos,et al.  Cluster analysis of the competitiveness of container ports in Brazil , 2014 .

[24]  Linyan Sun,et al.  Study and applications of data mining to the structure risk analysis of customs declaration cargo , 2005, IEEE International Conference on e-Business Engineering (ICEBE'05).

[25]  Gi-Tae Yeo,et al.  Evaluating the competitiveness of container ports in Korea and China , 2008 .

[26]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[27]  P. Kreuzer,et al.  Optimal lines for railway systems , 1997 .

[28]  Jian Sun,et al.  The evaluation method of road traffic safety based on the combined Principal Component Cluster analysis , 2011, 2011 International Conference on Business Management and Electronic Information.

[29]  Jingjuan Jiao,et al.  Competition of spatial service hinterlands between high-speed rail and air transport in China: Present and future trends , 2015, Journal of Geographical Sciences.

[30]  Hugo M. Repolho,et al.  Optimal Location of Railway Stations: The Lisbon-Porto High-Speed Rail Line , 2013, Transp. Sci..

[31]  Nico M. van Dijk,et al.  Cost optimal allocation of rail passenger lines , 1998, Eur. J. Oper. Res..

[32]  Yi-Ming Wei,et al.  Exploring the differences in the airport competitiveness formation mechanism: evidence from 45 Chinese airports during 2010–2014 , 2017 .