Application of complex network principles to key station identification in railway network efficiency analysis

Network efficiency analysis becomes important in railways in order to contribute towards improving the safety and capacity of the rail network, making rail travel more attractive for passengers, and improving industry practice and informing policy development. However, a physical railway network structure is a complicated system, and the operation, maintenance, and management of such a network is a difficult task which may be affected by many influential factors. By using efficiency analysis technology for a railway network, combining physical structure with operation functions can help railway industry to optimize the railway network while improving its efficiency and reliability. This paper presents a new methodology based on complex network principles that combines the physical railway structure with railway operation strategy for a railway network efficiency analysis. In this method, two network models of railway physical and train flow networks are developed for the identification of key stations in the railway network based on network efficiency contribution in which the terms of degree, strength, betweenness, clustering coefficient, and a comprehensive factor are taken into consideration. Once the key stations have been identified and analysed, the railway network efficiency is then studied on the basis of selective and random modes of the station failures. A case study is presented in this paper to demonstrate the application of the proposed methodology. The results show that the identified key stations in the railway network play an important role in improving the overall railway network efficiency, which can provide useful information to railway designers, engineers, operators and maintainers to operate and maintain railway network effectively and efficiently.


Introduction
In comparison with road transportation, railways are by far one of the safest means of ground transportation, especially for their passengers and employees. However, there are some issues involved in both maintaining this position in reality and sustaining the public perception of railway safety excellence.
e railway now nds itself in a situation where actual and perceived safeties are real issues, to be dealt with in a new public culture of rapid change, short-term pressures, and instant communications [1][2][3]. However, operation and maintenance of railway networks are becoming di cult, particularly, it is related to railway network e ciency, reliability, and safety. For example, if a key station fails to operate in a railway network, it would a ect the overall railway network transportation e ciency. A station failure to operate can be classi ed into physical failures of the railway physical network such as failures of signal system, power supply system, track system, and vehicle system, and the function failures of the train ow network such as inaccurate train route planning and scheduling [4,5]. A network e ciency is related to its demands, ows, costs, and behaviour on the network, which can be used to measure the importance of network components and their rankings [6,7]. In this study, network e ciency measure is used to assess the importance of railway network stations and their rankings, which is discussed in Section 4 and expressed in Equation (8). Based on the de nition of network e ciency, therefore, the identi cation of key stations in the railway network e ciency analysis is the major task in the railway development and maintenance activities [8,4]. e railway network is a complex system with hundreds of stations and correlative operations. erefore, the network e ciency analysis is becoming more important in aspect of ensuring safe and reliable operation to be applied in the railway network. is paper presents a new methodology to analyse the efficiency of the railway networks using complex network principles for the key station identification, in which railway network efficiency is evaluated based on station failure modes. is will provide useful method and information to the industry in the design, operation, and maintenance of the railway network effectively and efficiently.
In the context of the network theory, a complex network can be defined as a graph that is composed of relatively many mutually related nodes including structural and functional relations, and it could also be defined as a network that has nonobservable topological features that do not arise in simple networks such as random ones but o en occur in graph models of real systems [9]. A railway network can be classified as a complex network and investigated through complex network analysis [4,5]. Complex network anlaysis has been successfully applied to analyse the efficiency of networks, for example, a biology network [10], a research cooperating network [11,12], an electricity supply system [13], a traffic network [4,14], and even an Internet network [15]. Dey et al. [16] successfully applied complex network theory to analyse safety and reliability of topology impact on the propagation of cascading failure in a national power grid. Zio and Sansavini [17] also applied complex network method for modelling interdependent network systems in order to identify cascade-safe operating margins. ese researchers have developed various network analysis models and also studied the structural characteristics of the networks including system indicators such as node degree, length of the path, and clustering coefficient, in which the complex network vulnerability can be analysed in the selective and random node failure modes [18][19][20][21]. In the literature, some of the studies have also been conducted to investigate characteristics of transportation networks based on complex network theory. Xu et al. [22] studied urban bus transport systems, Porta et al. [23] and Wang et al. [7] looked at insight of urban street networks, Bagler [24] studied airline network, and Dall' Asta et al. [25] investigated USA airline network. Guidotti et al. [26] proposed a probabilistic methodology to quantify the network reliability based on current network efficiency and a measure of connectivity, i.e., eccentricity and heterogeneity. is method was applied to analyse a highway transportation network reliability. Qian et al. [27] developed a cascading failure model of the complex network to simulate the road traffic status by using the delay of the time, incident dissipation factor and load capacity. Chen et al. [28] presented a particle swarm optimization (PSO) algorithm to optimize the invulnerability of China railway traffic network by introducing the concept of the edge to the network. e results produced from these researches provide useful information for maintaining and operating of the complex road and airspace transportation networks.
Because a railway network is a complex network that is also suitable to be examined by using complex network principles. Lin et al. [29] and Li et al. [8] studied China's high-speed rail network as a complex system and analysed network safety and reliability based on complex network theory. Ouyang et al. [5] also applied complex network theory to study the performance and vulnerability of railways under various types of attacks and hazards. e complex network theory has been widely applied to the reliability and safety analysis of the complex networks and has also been used in railway network reliability analysis. However, these studies are limited to properties of the physical networks, and the railway operation functions are neglected in their analyses. It is essential to develop new methods and models to take not only the characteristics of physical railway network, but also the operation functions into considerationl in this case, train flow network needs to be integrated into the network efficiency analysis process in order to obtain reliable results.
is paper presents a new methodology to analyse railway network efficiency that combines characteristics of railway physical network with the functions of train flow network. In other words, the proposed method considers not only the physical network topologies such as degree and clustering coefficient, but also the dynamic operation parameters such as train running paths, stop-schedules, and service frequencies. e proposed method can be used to identify the key stations in the network and analyse railway network efficiency based on selective and random failures of stations, which provides a useful method and more reliable and accurate information to railway designers, engineers, operators, and maintainers for operating and maintaining railway network effectively and efficiently.
is paper is organised into the following sections. A er the introduction, Section 2 presents the development of railway physical network (RPN) and train flow network (TFN) and considerations in the RPN and TFN are discussed more detailed. Specific terms in mathematics are defined in Section 3, which will form the basis of the proposed method using complex network principles for key station identification in the railway network. Section 4 describes a new proposed network efficiency analysis method. A case study of a highspeed rail network efficiency analysis is presented in Section 5 to demonstrate the application of the proposed method, and recommendations are given in this section for improvement of railway network at planning, operation and maintenance in order to satisfy the railway network efficiency requirement. Finally, conclusions are given in Section 6.

Development of Railway Physical Network (RPN) and Train Flow Network (TFN)
A new approach for the analysis of network efficiency is proposed, which combines the RPN with the TFN in order to consider network structure properties together with network operation functions in the analysis process. e RPN considers the physical connecting properties in the network and provides constraints to the TFN. e RFN takes train service plan and the operation functions of TFN into consideration. e proposed railway network efficiency analysis process includes three steps, i.e., development of railway network efficiency model, identification of key station indices, station ranking, and network efficiency analysis as shown in Figure 1.
A RPN can be developed based on the existing railway network, and then a TFN of a service plan can be established by integrating RPN into operation strategies such as train running routes, stop-schedules and service frequencies [4,26].
Key station identi cation indices can be obtained by evaluating the importance of the stations in a TFN. Stations in the RPN can be ranked based on the evaluated importance of each station. Once key stations are identi ed, the key station identi cation indices are then used to analyse network e ciency by the choice of failure modes of stations or the random selection of failure modes of stations in the railway network.

Railway Physical Network (RPN).
e stations can be considered as nodes, and the connection between any two stations can be expressed as an edge in the RPN [7,16,22,24]. For example, Figure 2(a) shows a simple RPN in which 8 stations are connected by two rail lines, i.e., A-B-C-D-E (5 stations) and F-G-C-H (4 stations). e station C is a junction station, and the dark nodes show that these stations are terminal stations that can be as original or destination stations of the trains. e RPN can be expressed as = , , where is a set of railway stations in the network, and is a set of rail tracks. e RPN presents the physical connectivity among the stations in the RPN in which takes the track length, section capacity and station capacity into consideration. e RPN can be used to analyse transportation capacity constraints for train service plan.

Train Flow Network (TFN).
As the stations are represented as nodes in the RPN, if a train has been scheduled to be operated between two stations, this will produce one edge between these two stations as shown in Figure 2(b). e numbers of trains scheduled to stop at any two connected stations determine the weight of edge between the two stations [4]. e total number of edges can be calculated by 2 = × ( − 1)/2 and denotes a train that has a total number of stations to stop as scheduled. For example, Figure 2(b) shows 8 stations that are connected by 28 edges, i.e., 2 8 = (8 × 7)/2 = 28 between two stations based on the proposed train service plan as shown in Figure 3. e number on each edge presents frequency of trains, for example, number of 5 on edge A to C presents that 5 trains are scheduled running on this edge and stop at station A and C as shown in Figure 2(b). e TFN can be expressed as = , , where is the set of stations that any train can stop at these stations, and is a set of edges that creates any two stops at any stations in the RPN. erefore, the TFN can be developed based on train service plan in which the train stop schedule creates the edges in the RPN, and frequency of trains determines the weight of the edge. Obviously, if the frequency of trains running on an edge is high, the weight of this edge is high.

RPN and TFN Considerations.
As described earlier in this paper, the RPN can be improved by taking the train service plan into consideration to produce the TFN. For example, Figure 3 shows a proposed train service plan which includes 9 stop-schedules, i.e., T1, T2, …,T9 and frequencies of trains are 2, 1, 1, 3, 2, 1, 1, 1, and 1, respectively. e nodes as shown in Figure 3 for each stop-schedule means that a train stops at these stations, for example, the stop-schedule of T4(3)  each station in the railway network, which are described as below.

Degree Centrality (DC).
Assume denotes the th node in the TFN, the DC of a node is the number of the connections between and other nodes in the RPN, which describes the physical connective in uence of a node by the number of its neighbour nodes. For example, the DC of Node A is 7 (i.e., 5 + 1 + 1 = 7) as shown in Figure 2(b), which means that 7 Nodes of B, C, D, F, E, G, and H are connected to Node A directly. e DC of a node in the TFN can be de ned as where is the DC of a node, is the number of the nodes in the RPN, and , is a variable of 0 or 1, i.e., if there is a connection between nodes and , then , = 1, otherwise, , = 0. If a node in the TFN is connected with more edges, it will have a large value of DC . In other words, the DC of a node describes the reachability of the station.

Strength Centrality (SC).
In the TFN, some of the edges are more important than others that depend on weights of edges. e weight of an edge presents the importance of this edge in the TFN in which it depends on frequencies of train service running on this edge, i.e., the more frequently the edge is used by trains, the more important it is. In this study, the SC is used to describe the weight of each node. For example, the SC of Node A is 25 (5 + 6 + 5 + 6 + 1 + 1 + 1 = 25) as shown in Figure  2(b). Assume SC of a node is the sum of the weights of the edges between and other nodes, and it can be de ned as indicates that 3 trains have a same stop-schedule with di erent departure times, and all of these 3 trains will stop at stations of A, B, C, D, and E. Based on the proposed train service plan, the TFN can be produced as shown in Figure 3 in which train service plan has been taken into consideration.
As can be seen from Figure 2(b), the edge between nodes A and B is created by stop-schedules of T4, T7, and T9 as shown in Figure 3, and the weight of edge is the sum of frequencies of these three stop-schedules, i.e., 3 + 1 + 1 = 5. It should be noted that there is an edge between nodes F and C in TFN as shown in Figure 2(b), although there is not a direct connection between F and C in RPN as shown in Figure 2(a). However, the edge between nodes F and C in TFN is created by stop-schedules of T5, T6, and T7 as shown in Figure 3, and the edge weight is the sum of frequencies of these three stop-schedules, i.e., 2 + 1 + 1 = 4 as shown in Figure 2 Similarly, other edges based on stop-schedules can be produced, and the weights of edges in TFN can be calculated. In this case, the railway network physical topology in the RPN and the operation strategies such as train running routes, original stations and destination stations, stop-schedules, and service frequencies can be considered together to produce the TFN by taking the relations and weights of edges into account.

Factors Used in Key Station Identification Analysis
As described in Section 2, the TFN can be developed based on the RPN by taking train service plan into consideration. Key station identi cation indices can be then calculated based on complex network principles [30], including degree centrality (DC), strength centrality (SC), betweenness centrality (BC), clustering coe cient (CC), and comprehensive factor (CF). ese indices are then used to assess the importance of (2) T2 (1) T3 (1) T4 (3) T5 (2) T6 (1) T7 (1) T8 (1) T9 (1) connected each other or not. e higher the value of the CC of a node is, the more densely connected nodes will be. e CC of a node is de ned as where is CC, is the DC of the th node that has a maximum number of edges that equals to × + 1 /2, and is the number of the edges that actually exist based on train service plan of the th node. In other words, the CC represents the in uence of the stations in the network. For example, as shown in Figure 4(b), node S4 has actual connections with nodes S2, S3, S5, and S6. In this case, = 4 which consists of 6 edges. i.e., S4-S2, S4-S3, S4-S5, S4-S6, S2-S3, and S3-S6. e maximum number of edges of node S4 is 4 × (4 + 1)/2 = 10. However, only 6 connections actually exist in the network and other 4 edges are not actually existing. In other words, these 4 edges indicate that the connections can be created by using other actual connections, for example, node S4 has not an actual link with node S1 but can use the path S2-S1 to create an edge such as S4-S2-S1. erefore, by using Equation (5), the CC of S4 is 6 = 2 × 6/4(4 + 1) = 0.6. e CC represents the in uence of the stations in the network, but it has some problems by using the CC in the ranking importance of stations; for example, in some cases, if some of the nodes have same connections in the network, these nodes will have a same value of the CC. is will be demonstrated in the section of case study.

Comprehensive Factor (CF).
In this study, an important factor of CF is introduced in order to integrate degree centrality where is the SC of a node , and , is the weight of the edge between node and . e weight , of an edge between nodes and in the TFN is the number of trains that do stop at the th and th stations. e SC of a node also describes the service capability of a speci c station, which represents the convenience of the passengers from this station to other stations in the network without any change of the train; in other words, trains can reach more stations from the th station.

Betweenness Centrality (BC).
e BC as de ned in the complex network principles [5] describes the in uence of a node in the network; in this case, it is a station in the RPN. In this study, the BC relates to the shortest paths from one node to the other one, i.e., from one station to other station. For every pair of nodes, i.e., between two stations in a network, at least there is one shortest path either it has the minimum number of the edges or it has the minimum value of the weights of the edges. A path is de ned as from a node (i.e., Station ) to a node (i.e., Station ), which indicates a path passing between two stations in the network based on train service plan [4,9]. For example, there are 15 shortest paths between any two nodes in the network as shown in Figure 4(a). Among all the 15 shortest paths, 9 paths pass through node S2 including S1-S2, S1-S2-S4, S1-S2-S4-S3, S1-S2-S4-S5, S1-S2-S4-S6, S2-S4, S2-S4-S3, S2-S4-S5, and S2-S4-S6. But 6 paths that are S3-S4, S3-S4-S5, S3-S4-S6, S5-S4, S6-S4, and S5-S4-S6 do not include the node S2. erefore, the BC of the node S2 is 9/15 = 0.6.
Assume BC of a node in the network without the weight of the edge, which can be calculated by: where , is the number of shortest paths with the minimum number of the edges from a node to a node , and , ( ) is the number of shortest paths with the minimum number of the edges, which pass through the node from a node to a node .
Similarly, the BC of a node with the weight of the edge is de ned as capacity BC, which can be calculated by where , is the number of shortest paths with the minimum sum value of the weights of the edges from a node to a node , and , ( ) is the number of shortest paths with the minimum sum value of the weights of the edges, in which trains pass through the node from a node to a node . e BC re ects the in uence of the nodes throughout the network. A node that has a high impact on network e ciency is called as an in uential node. For example, the node S2 in Figure 4(a) is such a node because 9 out of 15 short paths go through this node. erefore, on the basis of the BC analysis, in uential nodes can be obtained in di erent perspectives of connectivity and transportation capacity.

Clustering Coe cient (CC).
e CC represents a node that links with a certain node, and whether those nodes have also Journal of Advanced Transportation 6

Background.
is section presents a case study on network e ciency analysis for a high-speed rail network to demonstrate the application of the proposed methodology for railway network e ciency analysis. e data and information have been collected from the railway industry for one-day train operation, which indicates that in a total of 2487 trains were operated in such a high-speed rail network. Figure 5 shows the established RPN with 485 nodes (i.e., stations in the network). Based on train service plan on the day, the TFN has also been established with the same number of nodes as the RPN has, i.e., 485 nodes, and the number of edges is analysed as described in Section 2.2; in this case, there are 68198 edges which make up the TFN is more complex than the RPN. Since a large amount of data and information in the network has to be analysed, the new methodology described above has been converted into computer code.

Degree Centrality (DC) Calculation.
As stated in Section 3.1, the distribution of DC can be calculated by Equation (1), for convenience, it has been converted into exponential distribution by

Strength Centrality (SC) Calculation.
e distribution of SC is shown in Figure 7(a), which can be calculated by Equation (2). e results show that only 12 stations (about 4.53%) of total number of stations in the network have a value more than 1000 of SC, and 377 stations (about 77.73%) of total number of stations in the network have a value less than 400 of SC, which indicates that the distribution of SC of the stations in the TFN is extremely deviated, which represents that only a few stations having high service capacity in the network. In other words, it would be more convenient for the passengers to travel from these high service capacity stations than others. Comparing distributions of DC with SC are shown in Figure 7(b), which has been converted by using power law where can be obtained by Equation (1), and k can be calculated by Equation (2). Equation (11) demonstrates if the connectivity of a station in the current transportation operation strategy is , the ability to serve the passengers is 1.242 . As can be seen in Figure 7(b), the SC is increased faster than the DC, in other words, the transportation capacity of a station is growing faster than the growth of connectivity.

Betweenness Centrality (BC) Calculation.
e distributions of BC and (i.e., with/without the weight of an edge) in the  where α is the number 1, 2, 3, 4, and 5, i.e., 1 denotes DC, 2 is SC, 3 is BC , 4 is BC , and 5 is CC, represents the values of DC, SC, BC ( and ), and CC of a node , min is the minimum value of DC, SC, BC ( and ), and CC of the nodes in the TFN, max is the maximum value of DC, SC, BC ( and ) and CC of the nodes in TFN, is normalized value of the node , is the weight of the , which shows the impact of di erent in the CF. e selection of depends on the evaluation purposes such as the connectivity and transportation capacity. For example, if DC, SC, BC ( and ), and CC of the nodes are taken as equally important, then can be chosen as 1/5. If transportation capacity in the TFN is more important than connectivity in the RFN, then can be 2/5. Expert judgement and engineering judgement such as the Delphi method [30] can be used in the selection of .

A New Methodology for Network Efficiency Analysis
Network reliability can be obtained by the analysis of the characteristics of the network under selective and random station failure modes in the railway network [29]. Selective failure mode will enable network analysts to select stations in the railway network based on the current status of the network and their experience to analyse the network reliability, while random failure mode will enable network analysts to assess network e ciency by selecting stations randomly in the railway network. e network e ciency and relative network e ciency are given below to evaluate the reliability of the TFN, which are derived by where is the total number of nodes in the network a er a number of station failures, and , denotes the number of edges in the shortest path between nodes and , which represents the distance between nodes and (if is not connected with , then , = +∞, and = 0), and is the network e ciency a er the failures of selected stations, and 0 is the initial network e ciency, and denote two di erent nodes. (7) = , = 1, 2, 3, 4, 5,

Clustering Coe cient
Calculation. e CC can be calculated by using Equation (5). Figure 9(a) shows the distribution of the CC in the cast study. e average of CC in the TFN is 0.697, which demonstrates high aggregation characteristics of the TFN, i.e., stations within the railway network are closely connected. e relationship between the CC and DC of each node is shown in Figure 9(b). As can be seen from Figure 9(b) that, obviously, a node with high CC has a low value of DC. In other words, the TFN are given in Table 1 and shown in Figure 8, which can be calculated by Equations (3) and (4). Most of the stations have a small value of probability of and . Value ranges of BC and BC in Table 1 are presented as percentage. Only 5 stations have a large value of , which shows that 1.8% of total stations in the railway network with a value between 0.05730 and 0.06548. In other words, these 5 stations are important and contribute signi cantly to the e ciency of the TFN.  Journal of Advanced Transportation 8 the values of the DC of the stations. In comparison of the rankings based on values of the CF of the stations with the rankings based on values of the DC, SC, BC ( , BC ), CBC and CC of stations, as can be seen that rankings of stations are di erent. e reason is that the evaluations of DC, SC, BC ( and ) and CC have di erent focuses as discussed in Sections 3.1-3.5. However, the CF addresses all of issues that focuses on DC, SC, BC , BC and CC have; in other words, CF not only takes the railway network physical topology, but also the operation strategies into consideration such as train running routes, original stations and destination stations, stop-schedules, and service frequencies. erefore, by using lower value of the DC the station has, the greater value of the CC that the station has.

Comprehensive Factor.
e CF can be calculated by Equations (6) and (7). Table 2 shows the results and station important rankings of top 20 stations based on their values of the CF, and these stations are also presented in Figure 5 denoted by red spots. Table 2 also shows rankings based on values of DC, SC, TBC, CBC, and CC, respectively. For example, in the column of station (DC), the rankings are based on the values of the DC of the stations, which station identity numbers are given that can be found in Figure 5 and edges of the TFN are changed, and application of a single DC or SC or BC , or BC or CC cannot provide reliable results; CF takes all the factors that DC, SC, BC , BC , and CC used in the consideration; therefore, by using CF, more reliable results can be obtained. e results of this case study have been further con rmed by the industry from their observation in the railway network [5].
Additionally, based on the results produced by the evaluation using CF, as can be seen that most of nodes (i.e., stations) with the high values of CF are in the central and eastern regions in the railway network as shown in Figure 5. is is particularly true because this is also con rmed that these areas have a high economic development with high populations and high level of demand of transportation. However, it should be noted that it is not all of these top 20 stations are in the areas with high economic development and high populations. For example, although station Nos. 11, 40, 41, 58, 96, and 174 have higher values of CF, these stations only have one rail line pass through as shown in Figure 5. In order words, these stations (i.e., nodes) have lower physical connectivity in the RPN but have a higher transportation capacity because the frequency of trains using these stations is high. In other words, more trains are scheduled to use these stations. Another interesting nding is that the stations Nos. 259 and 266 in the capital city, are ranked in the places of 17 th and 14 th , and this is because there are four stations in the capital city to decentralize transport pressure. Other issues should be noted as stated in Section 3.4, in some cases, the application of the CC in the ranking importance of stations is not e ective. In this case, as can be seen from Table 2, the top 20 stations have a same value of the CC because these stations have a same number of connections with other stations in the network.

Network E ciency Analysis.
In this case, the average network distance of RPN and TFN can be obtained as 30.84 and 2.89 by analysing the network. In other words, the average network distance 30.84 in the RPN represents that a passenger travels from an original station to a destination CF in the assessment railway network e ciency, more reliable results can be obtained. is can be demonstrated that, for example, in an emergency, a train may be delayed or cancelled; in this case, the RPN is not changed, but the weights and TFN is higher than the RPN, in other words, this represents physical connectivity of the railway network in this case is not very dense, but it has a high service capacity and convenient transportation services. e relative network e ciency can be calculated by Equation (9). Figure 10    which will not only balance the distribution of the key stations in the network and relieves the transportation pressure but also help to improve the network e ciency. e combination of RPN and TFN indicates that some stations are located in the same city, for instance, 4 stations in the capital city. Some hub links between these stations should be allocated in the future, which can not only to improve the physical connectivity of the network and transportation services but also improve the e ciency of the whole network in di erent failure modes.

Railway Transportation Operation.
According to the e ciency analysis of the railway network, once the key stations are failed or lost service capacity, the connectivity and e ciency of the overall network would drop down rapidly. To ensure the railway network to provide service, it is recommended to establish operation and maintenance strategies to protect these key stations in the railway network in case of emergency, for example, accidents and incidents, and extreme weather. Furthermore, service capacity can be improved by optimising operation scheme with the constraints of the existing RPN. As described in Section 3.1, the higher k is in the network, the higher service capacity of the network can provide. erefore, a better operation scheme can also increase e ciency of the railway network.

Conclusions
e paper presented a new method to analyse the e ciency of the railway network by identifying the key stations based models 1, 2, 3, and 4 show e ciency under randomly selected percentage of the station failures in the network. e following assumptions are made to implement the selection of random modes [4,9]: (i) All stations in the network have an equal opportunity to be selected. (ii) Each station may be/may not be selected at every time or not. (iii) Selections of stations start from 10% of station failures to 80% of station failures in every random mode.
As can be seen from Figure 10, when 80% of stations in the network fail to provide service in random modes, the network e ciency is near the zero. erefore, in comparison results of selective mode with random modes, the identi ed top 20 stations as key stations in the network have high impacts on the network e ciency, which require more attention in operation and further development in the railway network. Based on the results produced by the railway network e ciency analysis, recommendations for improvement and optimisation of a railway network can be made, which should address two aspects in the planning and operation.

Railway Network Infrastructure Planning.
e network e ciency should be considered in the future railway infrastructure planning by taking the economic and demographic factors into consideration. In this case, some of stations with a high CF in the network only have one railway line passing through, and these stations should consider a new rail line to be constructed, on the two network models of RPN and TFN. Both physical network topology and dynamic operation strategies are considered in this method. Considering the key stations, railway network efficiency can be analysed under selective and random failure modes. A real case study on a high-speed railway network is presented to demonstrate the application of the proposed method. In this case, the key stations can be identified based on the CF in which connectivity, transportation capacity and local influence are taken into consideration. e results show that the identified key stations in the railway network play an important role in improving the overall railway network efficiency, which provide useful information to railway designers, engineers, operators and maintainers to operate and maintain railway network effectively and efficiently.
As stated above, the proposed new method considers physical network topology and dynamic operation strategies in the railway network efficiency analysis process. As suggestions, the following aspects in future work may need to be considered to obtain information on availability and stability of the railway network. Research should address (1) application of the proposed method by taking railway transportation organization strategy into physical network consideration to establish a vehicle flow network for railway transportation organization availability analysis, (2) methods to maintain key stations within a railway network as normal service in an emergency because of an accident or incident occurred, and (3) methods to assess the existing networks by increasing number of new stations while improving efficiency of the railway network.
Data Availability e readers can find the data used to support the findings of this study are included within the article.

Conflicts of Interest
e authors declare that they have no conflicts of interest.