Route Identification Method for On-Ramp Traffic at Adjacent Intersections of Expressway Entrance

To determine the control strategy at intersections adjacent to the expressway on-ramp, a route identication method based on empirical mode decomposition (EMD) and dynamic time warping (DTW) is established. First, the de-noise function of EMD method is applied to eliminate disturbances and extract features and trends of trac data. en, DTW is used to measure the similarity of trac volume time series between intersection approaches and expressway on-ramp. Next, a three-dimensional feature vector is built for every intersection approach trac ow, including DTW distance, space distance between on-ramp and intersection approach, and intersection trac volume. Fuzzy C-means clustering method is employed to cluster intersection approaches into classications and identify critical routes carrying the most trac to the on-ramp. e trac data are collected by inductive loops at Xujiahui on-ramp of North and South Viaduct Expressway and surrounding intersections in Shanghai, China. e result shows that the proposed method can achieve route classication among intersections for dierent time periods in one day, and the clustering result is signicantly inuenced by three dimensions of trac ow feature vector. As an illustrative example, micro-simulation models are built with dierent control strategies. e simulation shows that the coordinated control of critical routes identied by the proposed method has a better performance than coordinated control of arterial roads. Conclusions demonstrated that the proposed route identication method could provide a theoretical basis for the coordinated control of trac signals among intersections and on-ramp.


Introduction
With increasing congestion problems of urban traffic, many expressway on-ramp are usually congested, especially during the morning and evening peak periods. Congestion is frequently spread to ground intersections or roads through the oversaturated on-ramp. To solve this problem, scholars began to pay attention to coordinated control of on-ramp and adjacent intersections to improve the overall operation efficiency of traffic system [1]. However, most current studies only coordinately control the on-ramp and one connected intersection [2]. To achieve coordinated control of an on-ramp and intersections in an adjacent area, we can only refer to the research methods and findings of the coordinated control among intersections.
Two coordinated control modes of intersections including subarea control and arterial road control are widely used [3]. In subarea control mode, a large area is divided into small areas, and internal function or state within these subareas are similar. is control mode can achieve real-time traffic control and improve the stability of the system. e sub-area division process firstly chooses a key intersection on the basis of traffic volume, then the division will be finished based on principles of distance, saturation, and the correlation of key intersection and intersections [4]. e subarea division is to identify whether control performance can be improved when subarea coordinated mode is implemented to adjacent intersections. If the performance can be improved, the adjacent intersections should be divided into one subarea. erefore, a straightforward method for subarea division is to calculate and compare different control performances of different signal-controlling modes. is method is time-consuming for a large area of intersections. Subarea control method can improve the efficiency of traffic in an intersection area [5]. But for the directional traffic flow from intersections to on-ramp, it involves not only the intersection area, but also the on-ramp connecting expressway and ground system. e sub-area control can not solve this problem effectively. e second coordinated control mode is arterial road control, it can achieve the control of intersections on arterial roads easily with green wave. But this method only simply considers the line control on arterial roads, and it cannot guarantee the best effect in an area when some branch roads with high traffic demand also need to be controlled.
By comparison of the above two control modes, neither of them is suitable for the coordinated control of on-ramp and adjacent intersections. erefore, an effective way to coordinately control the traffic from intersections to on-ramp is necessary. Here, a critical routes identification and control method is proposed for on-ramp traffic at adjacent intersections. For on-ramp, critical routes [6] mean the routes carrying the maximal flow to on-ramp, and they are the key to determine the efficiency of the traffic flow to on-ramp. With route identification method, critical routes are regarded as the control objects with best effects. By controlling these critical routes, the most direct control effect among on-ramp and adjacent intersections can be achieved. We can use different control strategies based on different traffic demands. When the on-ramp is not congested, traffic signals on critical routes can be controlled to facilitate more vehicles flow to on-ramp. When vehicles are queuing on the ramp, the critical routes can be controlled to delay vehicle flow to on-ramp. Critical routes may not consistent with the direction of arterial roads, and they are not fixed. Critical routes may constantly vary with time, for example, they are different during peak hour and off-peak hour. erefore, the fast and accurate identification of critical routes is an effective way to optimize the performance of traffic signals coordinated control.
For coordinated control of traffic signals at intersections, the method of critical routes identification should be dynamically and easily operated for various traffic states. And the method should be less affected by external environment. Commonly used route identification methods include traffic survey statistical analysis [3], real-time traffic surveillance method [7], floating car method [8], traffic count OD estimation method [9], vehicle plate automatic recognition technology [10] and the meta-heuristic search mechanism [11]. Although the above methods can identify critical routes at intersections, they are easily interfered by external environment or artificial factors. So, the above methods cannot meet the requirements of signal coordination control.
In this paper, a novel method is presented to identify the critical routes carrying the maximal traffic flow to on-ramp. And through signal coordinated control, the traffic efficiency among on-ramp and adjacent intersections can be improved. e new method utilizes dynamic time warping (DTW) distance to compare the similarity between on-ramp and adjacent intersection approaches. DTW is a method widely used in other areas such as speech recognition [12] and on-line signature recognition [13]. It is a classical and well-established distance measurement between two time series. Compared with classic Euclidean distance, DTW distance can measure similarity between two discrete time series which may have different time interval. A detailed description about DTW is provided in Section 3. e empirical mode decomposition (EMD) method is used to denoise the original time series. EMD can reconstruct the original traffic data and extract features and trends of them. Previous researchers have proposed some filtering methods, such as linear method which is easy to design and implement but not very effective [14]. To overcome this shortcoming, nonlinear methods including wavelet filter are proposed, but the wavelet has the fixed basic function, so it cannot match all real data time series [15]. Recently, the data-driven EMD method was introduced from nonstationary and nonlinear process. e basic functions of EMD are derived from the data time series itself, and it's the main advantage of EMD. Detailed information about EMD is also introduced in Section 3. e main framework of the paper is organized as follows. In the next section, a literature review on the coordinated control between expressway on-ramp and intersections is given. A er that, three methods including EMD method, DTW distance method and fuzzy C-means clustering method are introduced. en, data source is descripted for case study. e three methods are employed for a case study and the case results analysis is given in Section 5. Finally, the method is validated by traffic simulation in Section 6, and some conclusions are discussed in the last section. e architecture of this paper is shown as Figure 1.

Literature Review
Most of the research about coordinated control of on-ramp and adjacent intersections only focus on the closest intersection connected to the on-ramp [16,17]. e coordinated control includes ramp control [16], coordinated control of ramp and one connected intersection [2], coordinated control of ramp, expressway main line and one connected ground intersection [17]. In fact, when the vehicles queue on the ramp spreads to ground roads, the congestion will affect the traffic state of the one connected and further intersections. Few studies have focused on the coordinated control of on-ramp and intersections in adjacent areas.
If we want to achieve coordinated control of on-ramp and adjacent intersections, the research of coordinated control among intersections can be taken as a reference. One important method to coordinately control intersections is to measure correlation degree between intersections and then decide control priority. Current research mainly includes two methods to measure correlation degree between intersections. e first one is to build correlation degree models based on various influencing factors, including traffic volume, section length [16], section flow and cycle ratio [17]. e correlation degree model is very helpful to quantify the correlation degree between intersections, and more and more factors are needed to take into consideration to achieve a more accurate result. e other one is a data-driven method to measure the correlation degree between intersections by measuring similarity degree of traffic feature time series. Similarity degree is usually described by distance between two time series, and the most commonly used distances include Euclidean distance, DTW distance and editing distance [18]. For the traffic feature time series at different intersection, there will be asynchronism because of the lag between upstream and downstream traffic flow. Considering this, DTW distance is the best choice to analyse the similarity degree of traffic feature. Tin et al. [19] calculates DTW distance of different vehicles' speed time series to detect abnormal running speed of vehicles in advance. Zhang et al. [20] adopts the DTW distance to measure the similarity, and proposes a speed-up version of DTW to improve the accuracy of speed estimation and reduce the runtime. A er comparing two methods, it can be shown that the first correlation degree model method can measure the correlation degree between intersections by considering different influencing factors. is method is suitable for arterial intersections or a few connected intersections. However, for the correlation degree between the on-ramp and a large area of intersections, this method cannot behave well because of too many influencing factors and complex computation. It is hard to consider all the external influencing factors. By contrast, the similarity degree of traffic feature time series method is more appropriate to measure correlation degree between on-ramp and intersections. Even a large-scale area study will not add much complexity and the algorithm will not change with the external influencing factors.
On the basis of similarity degree, routes among intersections can be divided into different classifications. In previous studies, Lu et al. used a new data-driven method to measure road traffic correlation degree considering spatial heterogeneity [21] to achieve the dynamic division method for signal coordination. Yan et al. used spectrum analysis [22] which can determine the relevance and lag between upstream and downstream traffic to determine the critical routes among intersections. In addition, clustering is also a useful method to identify critical routes. Fuzzy-C means clustering method has been used to improve accuracy and robustness of sub-area control [23]. Clustering method is easy to calculate, and the number of clusters can be adjusted based on actual management requirements. So, clustering method can be used to classify different control levels and provide a reference for traffic management and control. erefore, in this paper, the similarity degree measurement based on DTW and fuzzy-C means clustering method can be integrated to achieve the route identification.

Materials and Methods
e principal methods used in this paper are the following: (1) EMD method that can filter and reconstruct traffic volume time series; (2) DTW distance that can measure similarity between traffic volume time series of intersection approaches and on-ramp; (3) fuzzy C-means method that can cluster intersection approaches into three classifications to identify critical routes.

Empirical Mode Decomposition (EMD)
3.1.1. Time Series. A time series is a sequence of real data observed in a period of time. ese values are recorded with intervals between them, such as a time series = 1 , 2 ..., . In this work, a sequence of traffic volumes is considered as a time series.

Introduction of EMD.
Empirical mode decomposition (EMD) [24] is a data analysis method with which any complicated data set can be decomposed into a small number of intrinsic mode functions (IMF) through si ing process. Each IMF has a certain time scale distinct from others [25]. EMD picks out the highest frequency oscillation from the time series as the first IMF and continue to pick out the highest frequency part in the remaining part of data to be the second IMF, and so on. It is apparent that IMFs's frequencies show a decrease trend step by step. is decomposition method is adaptive and highly efficient. A function needs to satisfy two conditions to be an IMF, readers can refer to [24]. EMD process involves five major steps as following.
Step 1. Identify all the local extrema of original time series ( ), then obtain the upper envelope 1 ( ) and the lower envelope 1 ( ).
Step 2. e point-by-point envelope mean 1 ( ) can be calculated using the upper and lower envelopes as follows: Step 3. Using the original time series ( ) and 1 ( ) to get ℎ 1 ( ) as follows: Step 4. If ℎ 1 ( ) satisfies the definition of IMF, ℎ 1 ( ) can be regarded as the first IMF 1 ( ) and then residue 1 ( ) is substituted for the original ( ), where If ℎ 1 ( ) is not an IMF, ℎ 1 ( ) should be seen as the original time series and Steps 1-3 are repeated.
Step 5. Repeat Steps 1-4 and stop when the residue ( ) becomes a monotonic function.
Step 6. e original time series ( ) can be decomposed into modes and a residue as follows: (2) Distance measure has been applied in similarity search, classification, and clustering of time series. ere have been many distance measurements and improved methods to measure the similarity between time series. Among these distance measures, Euclidean distance and DTW distance are widely used because they were proved hard to be beaten [26,27]. For Euclidean distance, the similarity between two time series = 1 , 2 ..., and is defined as follows: Note that two time series and must have the same length of . Euclidean distance is unsuitable for time series distance measure in case time series data sets are of different lengths or they are distorted along the time axis. For traffic flow, there must be time lag between upstream and downstream time series, and two time series are not corresponding to each other point to point on time axis. erefore, we use DTW distance to avoid these problems.

3.2.2.
Introduction of DTW Distance. DTW distance is a measure that allows non-point-to-point alignment between two time series to accommodate sequences that are similar but out of phase [28].
Given two traffic volume time series ( ) and ( ) : ( ) = 1 , 2 ,... ,..., , ( ) = 1 , 2 ,... ,..., . DTW is a dynamic programming technique which attempts to find all possible warping paths between two time series to get a minimum distance. e difference between DTW and Euclidean distance is shown in Figure 4. e shape of two time series are similar but not aligned in time axis, and DTW distance − is more intuitive than Euclidean distance − ὔ , a pessimistic dissimilarity measure (see Figure 4). Now we would like to show how to compute DTW distance between two time series. First of all, we construct a cumulative warping distance matrix with size × . en, every element in matrix is computed as follows: where , means the iterative computation of the minimum cumulative distance of two time series from starting point (1, 1) to point (i, j). And , is the distance between the -th (1, 1) = (1, 1), ( , 1) = ( , 1) + ( − 1, 1), < 1, 1, = 1, + 1, − 1 , us, the original data are decomposed into IMFs with different frequencies ranging from high to low and a residue ( ) obtained with mean trend of original data. e detail flowchart is listed in Figure 2. Take traffic volume time series as an example (see Figure 3), it is decomposed by EMD into eight IMFs and one residue. One can observe that the IMF construction starts from the highest frequency component, and the last extracted function is monotone. [25] introduced an EMD-based method to de-noise and smooth the data set. Being fully data-driven, this method does not need to set any threshold or select wavelet basis function. us, EMD is much more efficient than wavelet filtering method. Given a set of data ( ) containing the deterministic part and white noise, we need to extract the deterministic part from ( ). ( ) can be decomposed into IMFs and a residue ( ) by EMD, and we can get the reconstruction ∼ ( ) with the -th~ -th IMFs and ( ) (Equation (5)).

Filtering Method. Boudraa and Cexus
To confirm the value of , consecutive mean square error (CMSE) [24] is proposed to filter data by measuring the squared Euclidean distance between two consecutive reconstructions of the data set. CMSE is defined as follows:

Time series
Upper envelope u 1 (t) Lower envelope l 1 (t) . In order to calculate the distance between them, a matrix of cumulative distance was constructed (see Table 1) by Equation (10). ( , ) = (8, 8) = 13 is the target warping distance between and , and the calculation path is the target warping path (see Table 1).

Fuzzy C-means (FCM) Clustering.
Clustering is used to group points into clusters so that the within-cluster data are more similar to each other than to those in other clusters [30]. e advantage of clustering is to arrange a huge amount of data into several pieces and help users to identify the choice of interest. Clustering is a method using the criteria predefined to divide the input data into different clusters, and it helps to differentiate similarities and dissimilarities and compute useful inferences from them. Fuzzy clustering is a form of so clustering method, and each data point can belong to more than one cluster. e fuzzy C-means (FCM) clustering attempts to group a finite data set = 1 , 2 ,..., into fuzzy clusters. represents feature point in time series ( ) and the -th point in time series ( ). Note that matrix must be computed as the order of above equations, i.e. we must compute (1, 1) first, then ( , 1), 1, , and compute , at last. e minimum cumulative distance ( , ) is the target warping distance of two time series and the calculation path is the target warping path. e distance between two time series and now can be easily calculated as follows: Because square root of the above equations is not important when comparing distances together, we can use this equation: Moreover, to accelerate computational time and get a more accurate distance measure, some global constraints were proposed to DTW. For more details about DTW technique as well as its constraints, readers can refer to [29].

Data Collection and Description
A downtown area with expressway on-ramp, located in Xuhui District, Shanghai, China (see Figure 5), is chosen as the study area. It consists of 43 intersections and an expressway on-ramp, and the on-ramp is located between intersection #19 and #28 (see Figure 6). e roads with red mark are arterial roads, and the detailed information of road type is shown in Table 2.
e traffic data are collected on 7 th March, 2017 by inductive detection loops of SCATS control system, including the traffic volume time series. Inductive detection loops are located on every approach of intersections and the data was collected per cycle shown in Table 3. In Table 3, the first column is the data acquisition time, the second column is the serial number of inductive detection loop. e third column is the phase of signal light. e fourth and fi h column mean the length and max length of current phase. e last column represents the traffic volume in this phase. In data processing, the original data is divided into every second and then aggregated into every minute.
e SCATS system also provides other information of each intersection. Take one intersection (#39) as an example. It is shown in Figure 7 that the locations of inductive detection loops, the lane function and the phases information of signal are marked. Apart from intersection phasing and timing information and inductive detection loop data, the SCATS system can also provide control strategy of intersections, such as coordinated control among several intersections.

EMD Filtering Method
Result. EMD method is used to filter the original data of approaches at every intersection. Take #28 intersection as an example, the traffic volume time series is decomposed (see Figure 3). In the reconstruction of time series, the calculation based on Equation (7) shows that equals to 5. en the reconstruction of time series equals to the sum of IMF5-IMF8 and residue (see Figure 8). It can be seen that EMD method can smooth time series and maintain the feature of original data.

Calculation Result of DTW Distance between Each
Intersection Approach and On-Ramp. A er data processing, denoised traffic volume time series of le -turn, straight and rightturn approaches of 43 intersections are obtained. e DTW distances between traffic volume time series of each approaches and on-ramp are compared. Take the #5 intersection in Figure  6 as an example: it has le -turn, straight and right-turn lanes at the eastern, western, southern and northern approaches. Assuming drivers do not make a detour to the on-ramp, so the valid routes from #5 intersection to on-ramp only contain the straight and right-turn lanes at the northern approach and the straight and le -turn lanes at the eastern approach. Based on this assumption, all valid routes to on-ramp have been screened out, and we can obtain 124 intersection approaches traffic volume time series. Finally, DTW distances between the traffic volume time series at selected approaches and onramp are calculated. In order to explore how DTW distances vector. Given the data set and cluster number , the algorithm returns cluster centres = 1 , 2 ,... and a matrix of membership = ∈ [0, 1], = 1,..., , = 1, . . . , in which means the membership degrees of to the cluster .
where is the fuzzification coefficient and it determines the fuzziness of the cluster. is a constant greater than 1, and = 2 in this study [31].
For FCM, the centroid of a cluster is the mean of all points, weighted by their membership to the cluster. e FCM aims to minimize the objective function: e fuzzy C-means (FCM) clustering is composed of the following step: Step 1. Given data set = =1 , and cluster number ; Step 2. 0 is randomly initialized; Step 3. At -th step: calculate the centres vectors ( ) = with ( ) , see Equation (14); Step 4. Update ( ) with ( ) by Equation (13) into ( +1) ; Step 5. If | ( +1) − ( ) | < , then STOP; otherwise, return to step 3.  According to calculation results, the top five intersection approaches with highest similarity during 06:00-09:00 are shown in Figure 9. It is obvious that there is a similar trend of traffic volume change at the on-ramp and intersection approaches.
vary in different periods of one day, a day is divided into five periods: 00:00-06:00 (off peak), 06:00-09:00 (morning peak), 09:00-16:00 (off peak), 16:00-19:00 (evening peak), and 19:00-24:00 (off peak).  Journal of Advanced Transportation 8 flow, and three features are DTW value between time series of intersection approaches and on-ramp, space distance between intersection and on-ramp and traffic volume of intersection approaches. en three feature factors should be normalized to [0,1] and the fuzzy C-means clustering algorithm is used to cluster all intersection approaches. Clustering result in five periods on 7 th March, 2017 is displayed in Figure 11. Table 4 describes the results of univariate analysis using F-test based on three clustering groups at different periods. In Table 4, it can be seen, in different time periods, the DTW, traffic volume and distance are all significantly different in three groups (sig. < 0.05). It means this clustering method can effectively classify routes among intersections relating to DTW, traffic volume and distance.
According to the clustering result, intersection approaches in research area are clustered into three classifications in different periods. In the first classification, approaches with the smallest DTW means the highest similarity to on-ramp, and they are the critical routes in this period. ese routes need Figure 10 shows the intersection approaches with 15 maximum DTW values and 15 minimum DTW values. Larger DTW value means lower similarity degree. It can be found that (1) in general, intersections close to on-ramp is highly similar with on-ramp, such as #19, #28 intersection (see Figure 6). But there are exceptions, some intersection approaches far away from on-ramp also have high similarity, such as #25 and #24 intersection. It means there are more traffic flows choosing these routes to enter the ramp; (2) e location of intersection does not play a decisive role. For the similarity degree between intersection approaches and on-ramp, for example, #38 intersection (see Figure 6) is not located on an arterial road, but it shows a high similarity with on-ramp.

Clustering Result of Intersections Approaches.
To achieve critical routes identification, various factors are considered. A er comparing the clustering effects and efficiency, three parameters are selected in this work. A three-dimensional feature vector is built for every intersection approach traffic while only several nontrunk roads need coordinated control. For example, in Figure 12(a), the critical route identification result coincides with arterial roads in South Chongqing Rd., Zhaojiabang Rd. and Xujiahui Rd. Although Luban Rd. is not contained in critical routes, we can see that the critical routes turn from #43, #42, #41 on nontrunk road to #33 on arterial road. us, it can be seen that the coordinated control of on-ramp and intersections can not be limited in the nearest intersection to on-ramp, and the conventional control of arterial roads can not always lead to the best effect of traffic flow from intersections to on-ramp.

Effect Validation
A er the identification of critical routes in different periods of a day with the proposed method, we can test the effectiveness of this method with TransModeler simulation.

Micro-Simulation
Model. e proposed critical routebased control strategy is tested and evaluated using the scalable, high-performance microscopic simulation package of TransModeler so ware. In order to accurately code the intersections in TransModeler, large scale background images of the studied intersections and on-ramp are needed. For each link, several specifications are defined in TransModeler: (1) number of lanes for segment; (2) lane width and distance; (3) signal phases; (4) speed limit; (5) input volume. e network in TransModeler is constructed as Figure 6.
A er the network coding, network calibration and validation are required for a reliable micro-simulation study. e simulation uses the collected traffic demands of five periods on 7 th March, 2017. Five periods are chosen as the simulation periods, and every period was simulated for 3 h and 20 min. e first 20 min is considered to be the warm-up period for coordinated control with on-ramp. e approaches in the second classification are with medium DTW values. When the control of routes in the first classification is achieved, routes in the second classification can also be coordinately controlled if the traffic service level is still low and control condition allows it; the third classification has the lowest similarity with on-ramp, and no special control is needed for this classification. According to the clustering results of 5 periods in one day, the priority level of the routes is obtained (see Figure 12). e clustering results show that different control strategies should be adopted in different periods of one day. It can be seen that the coordinated control results in different periods always contain the most and even the whole arterial roads,  where = modelled volume (vph) is the simulated volumes obtained from the simulation network, and = field volume (vph) is the observed volume of in-field detectors. The simulated traffic volumes are aggregated to 5-min intervals. The GEH statistics are from two selected detectors at the west and east approaches of #28 intersection for a 3 h simulation period (see Table 5). The results from 10 simulation runs show that 94.3% of the observed GEHs are less than 5 for all intersections, which means the network setup in vehicles to fill in the network, because it takes less than 20 min to finish the longest trip in the network in the real world. e simulation model is calibrated for traffic volume at 5-min intervals. e simulation can be calibrated and validated by evaluating the Geoffrey E. Havers (GEH) equation [32]. GEH is a valid volume calibration parameter, and the definition is as follows:  arterial roads or critical routes. en, the phase differences between the intersections on arterial roads or critical routes are calculated by numerical method. us, the timing scheme of coordinated control of the arterial roads or critical routes are obtained. Take AC as an example, a er the calculation, the #28 intersection signal cycle is set to 120 s, and it's the maximum cycle of arterial roads intersections. erefore, the system cycle length is set to 120 s. en green signal ratio is determined by actual traffic conditions of each intersection, and phase difference is determined by average speed and length of the road section. According to these control parameters to achieve simulation.
TransModeler can accurately represent the intersections traffic flow characteristics.
6.2. Control Strategy. Critical route control strategy (CC) proposed in this paper and arterial road control strategy (AC) are explored in simulation. No control strategy (NC) and AC are used as references to the proposed CC. NC means the actual state of intersections in study area. AC means achieving green wave control on arterial roads. CC is a coordinated control strategy with green waves for intersections on critical routes. Green wave control strategy is based on Webster formula. Firstly, calculate and determine the isolated intersection timing and phasing scheme and obtain the common cycle length of  and on-ramp, and it has considered the lag between upstream and downstream traffic flow. A er this, the correlation degree between on-ramp and intersection approaches can be calculated. en the clustering method are used to classify intersection approaches into three groups and to identify the critical routes. Finally, the validation of the critical route control is conducted with simulation test to compare effects of different control strategies. Compared to arterial road control strategy, critical route control strategy is more effective. e similarity of two time series measured with DTW is a novel way in traffic control area, and this paper is just a start. In this paper, only the relationship between on-ramp and intersections is considered, and off-ramp will be considered in the future. For the selection of critical routes, it's an important task to determine the border of "green wave" control, and this is the aim of future work. In addition, only traffic volume was considered as one-dimension traffic feature in this paper. In the future, a multi-dimension index, which contains not only traffic volume but also speed and density, may be better to represent the traffic state before using DTW to explore the traffic characteristics. en it can be more accurate to identify the bottleneck points in the network and give more helpful reference for coordinated control.
Data Availability e funding agency does not allow the data to be open to the public without their permission.

Conflicts of Interest
e authors declare that they have no conflicts of interest.

Simulation Result. Simulation result of 5 periods on 7 th
March 2017 with TransModeler is shown in Table 6. e weighted average delay per vehicle is chosen as an indicator of intersection efficiency, and the weighted average delay is calculated as Equations (17) and (18) based on the flow of each intersection. It can be seen that Table 6 shows at most times, CC outperforms AC. And this is more obvious during peak period (06:00-09:00 and 16:00-19:00), which means CC has better effect with higher traffic demands on the whole.
where means the weighted average delay of the -th intersection.
means the traffic flow measured at -th approach at -th intersection.
means the average delay of the traffic flow measured at -th approach at -th intersection.
where WAD means weighted average delay of the whole area; means traffic flow at the -th intersection. Take the morning peak (06:00-09:00) as an example. e critical route identification results shown in Figure 12(b) is simulated using coordinated control of the NC, AC and CC strategy. e controlled intersections of AC and CC control strategy are shown in Table 7. Result of the weighted average delay at every intersection during the morning peak (06:00-09:00) from 10 simulations is shown in Figure 13. Figure 13(a) shows the weighted average delay at the intersections which CC coordinately controls. It can be seen that CC has a more efficient control at these intersections. Figure 13(b) shows the weighted average delay at the intersections which CC doesn't control. It can be seen that for these intersections which are not controlled with CC, the performance does not necessarily get better or worse. But on the whole research area, CC is 10.36% more efficient than AC (see Table 6). It can be seen from Table 7 that more intersections are controlled, and the control effect is better in CC case than AC case.

Conclusions and Discussion
Traditional coordinated control strategy including subarea and arterial road control cannot achieve the traffic control among intersections and on-ramp. is paper has proposed a route identification method for on-ramp traffic and adjacent intersections. Compared with previous reviews of coordinated control of on-ramp and intersections, the paper is unique in that it provides a new perspective to use DTW distance to measure similarity between intersections and on-ramp and classify routes among intersections based on this.
We use a data-driven EMD method to denoise real-world traffic volume data, due to the widely-used wavelet filtering needs to take efforts to select an appropriate wavelet to achieve the best result [33]. en an appropriate and innovative DTW distance is chosen to measure similarity between intersections