Online Traffic Condition Evaluation Method for Connected Vehicles Based on Multisource Data Fusion

With the development of connected vehicle (CV) and Vehicle-to-X (V2X) communication, more traffic data are being collected from the road network. To predict future traffic conditions from connected vehicles’ data in real time, we present an online traffic condition evaluation model utilizing V2X communication. This model employs the Analytic Hierarchy Process (AHP) and multilevel fuzzy set theory to fuse multiple sources of information for prediction. First, the current vehicle data from the On Board Diagnostic (OBD) interface are fused with the static road data in the Road Side Unit (RSU). Then, the real-time traffic evaluation scores are calculated using the variable membership model. Real data collected by the On Board Unit (OBU) in a field test demonstrate the feasibility of the evaluation model. Compared with traditional evaluation systems, the proposed model can handle more types of data while demanding less data transfer.


Introduction
Nowadays, traffic congestion is a serious issue due to the growing number of vehicles moving on urban road networks. Connected Vehicle (CV) technology enhances the ability of traffic information collection and management through Vehicle-to-X communication (including Vehicle-to-Infrastructure (V2I) and Vehicle-to-Vehicle (V2V) communication), which presents one of the best ways to mitigate urban traffic congestion, improve traffic safety, and reduce fuel consumption [1].
With the development of connected vehicles, multiple sensors and communication modules tend to become standard equipment in vehicles. Through these sensors and communication modules, the required traffic information can be collected and distributed efficiently. Meanwhile, diversified traffic data sources and effective analysis methods provide a more reliable decision-making basis for traffic managers [2]. Furthermore, advanced data fusion technology can be used to deal with massive multisource traffic data to provide more accurate estimation of urban road conditions and improve the evaluation and prediction methods for the urban traffic system [3].
In the urban traffic system, there are various traffic data acquisition methods, such as detectors, video, and radar. With the application of V2X communication, more traffic data can be collected from connected vehicles, infrastructure, and other traffic sensors. The data can then be fed to the traffic condition evaluation, prediction, and decision-making system. If the traffic management department can leverage real-time traffic information from V2X communication to guide traffic flow and reduce unnecessary travel time, the operational efficiency of the transport network can be improved.
Huang proposed a data fusion method to optimize urban traffic flow based on neural networks and fuzzy reasoning, which collected the traffic data from varied detectors on the urban road [4]. Quek et al. introduced a special class of fuzzy neural network known as the pseudo outer-product fuzzy neural network using the truth-value-restriction method (POPFNN-TVR) for short-term traffic flow prediction. The method combined the complementary capabilities of both neural networks and fuzzy logic; thus it constituted a more promising technique for modeling traffic flow [5]. Zhao et al. analyzed the characteristics of multisource data fusion and the support vector machine (SVM). Following the principle of SVM, they collected multisource traffic flow data from the Hanshin Highway [6]. Castillo et al. reviewed the roles of mathematical tools and methods in traffic flow observability, estimation, and prediction problems. The high number of possible combinations of these elements justifies the existence of a wide collection of methods for analyzing static and dynamic situations [7]. Thomas and Dia presented a neural network algorithm based on traffic data fusion and tested it with simulated data. It analyzed various influence factors on data collection, such as positions of detectors, numbers of floating cars, length of the urban road, and severities of traffic accidents. Several classical algorithms were applied in traffic information fusion, including the Kalman filter, artificial neural networks, exponential smoothing, and the recursive estimation algorithm [8]. Yang et al. proposed a novel fusion model which can be used to identify traffic status and analyze traffic conditions, accidents, scope of coverage, and forecasts of future traffic flow [9].
Ren et al. processed observed traffic volume data of the urban road using a fuzzy fusion algorithm. The test results showed that this method can acquire more complete and reliable traffic data for forecasting traffic flow conditions over long periods [10]. Stutz and Runkler used fuzzy clustering to classify and analyze the traffic jams on a German freeway [11]. Jiang et al. used fuzzy clustering to identify road traffic conditions; the feasibility of this model was proved by the simulation results [12]. Rizzi et al. proposed an application of a highly efficient classification system based on low complexity real-time Internet traffic flows, by considering traffic data sets collected in different epochs and places [13]. Guo et al. proved that urban road traffic conditions can be analyzed with coil detector traffic data by an improved fuzzy clustering method [14]. He et al. improved a fusion method with new data collected from mobile phones and microwave sensors, providing enough data for traffic analysis [15].
There have been several achievements in applying V2X communication in ITS. Backfrieder et al. predicted future congestion based on the Bottleneck prediction method and V2X communication. It demonstrated promising performance through dynamic microscopic traffic simulations both in a real-world scenario and in an artificial road network scenario [16]. Schünemann proposed a flexible simulation tool which simulated real-time traffic flow by V2X. This tool can also be used to simulate various scenarios of future intelligent transportation systems [17]. Otsuki and Miwa designed an efficient content-delivery control algorithm using real-time traffic data generated from the traffic situation. The algorithm utilized the route prediction information in order to share traffic data among the vehicles by V2X communication efficiently [18]. Wedel et al. introduced a novel algorithm that can be used by connected vehicles with a navigation system to calculate routes circumnavigating congested roads [19].
We can conclude from the literature above that current research on traffic evaluation mainly focuses on how to process traditional data from detectors. Researchers have extensively studied how to improve the accuracy and reliability of information fusion. However, due to the limitation of data types and inevitable errors from traffic detectors, the advantages of novel traffic evaluation methods are not prominent. To fill this research gap, a real-time traffic evaluation method based on data fusion in the V2X scenario is presented in this paper. The vehicle OBD data is collected and processed by the RSU installed at the intersection. The OBD data is later fused with the static road data in the method. Details are discussed in the following section.

Description of the Traffic Data Fusion Scenario
Unlike a traditional floating car system, which sends real-time data directly to the server via the mobile network and provides little data for traffic control, this paper uses the OBD data as the source of vehicle dynamic data and keeps the data within the RSU at the intersection. There are two types of data stored in the RSU: static road parameters, such as road grade, number of lanes, and road length, and real-time dynamic data generated by the connected vehicles.
The scenario of connected vehicles with V2X communication in this paper is shown in Figure 1. With the RSU installed at the intersection, all kinds of vehicle data generated by the vehicles passing through the road section (Point A → Point D) are collected. By fusing these data with the parameters of the road section, the system evaluates the traffic conditions at each collection interval.
The structure of the CV system is shown in Figure 2. The On Board Unit (OBU) installed on the vehicle is an embedded acquisition system that receives the vehicle data through the Controller Area Network (CAN) protocol from the Electronic Control Unit (ECU). The RSU actively sends the handshake information to establish communication with any OBU that supports V2X communication and determines whether the vehicle enters or leaves the intersection by comparing the location information of the intersection and the GPS data of the vehicle.
The flow chart of traffic data fusion is shown in Figure 3. When the vehicle leaves the last intersection (Point A), it starts to record the running data of the current road section and sends the data generated on the road section to the RSU when leaving the next intersection (Point D). When the vehicle enters the communication range (Point B) of the RSU, the two parties establish a stable V2X communication. The OBU sends its own basic vehicle information and continuously sends the positioning information before the RSU requests the OBD data. When the vehicle enters intersection 2 (Point C), since the communication history with RSU1 is recorded in the OBU, RSU2 can determine that the current vehicle data belongs to the entrance lane (intersection 1 → intersection 2). According to this principle, the data of vehicles on multiple entrance lanes can be processed simultaneously in the RSU. After the data sent by the OBU is verified as valid, data fusion and evaluation are done in the RSU according to the model in Section 3.
The OBD interface is chosen since it provides not only the vehicle sensor information, but also the vehicle internal control information and fault information. This distinguishes the proposed model from most traditional floating car data collection systems, which obtain the vehicle data mainly from the GPS module. Since the OBD interface integrates external detectors, it greatly enhances the evaluation system's versatility.
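The enter/leave decision described above can be illustrated with a minimal sketch. It assumes that the RSU compares the great-circle (haversine) distance between the vehicle's GPS fix and the intersection center against a fixed radius; the function names and the 50 m radius are hypothetical and are not taken from the paper.

```python
import math

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius, a common approximation


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in meters between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def vehicle_state(prev_inside, vehicle_pos, intersection_pos, radius_m=50.0):
    """Return (inside, event); event is 'enter', 'leave', or None.

    radius_m is an assumed threshold, not a value from the paper.
    """
    inside = haversine_m(*vehicle_pos, *intersection_pos) <= radius_m
    if inside and not prev_inside:
        event = "enter"
    elif prev_inside and not inside:
        event = "leave"
    else:
        event = None
    return inside, event
```

In such a sketch the RSU would call `vehicle_state` on every positioning message and keep the returned `inside` flag per OBU between calls.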
With the V2X communication and vehicle OBD data, the scenarios described in this paper have the following advantages over traditional floating car systems:
(1) More traffic information is shared by V2X. The traffic flow data on the macroscale is fused and calculated at the RSU, whereas the running information of each vehicle is collected in V2X communication on the microscale. Meanwhile, the information for driving guidance or alarm can be sent to vehicles as well. In a word, if the multisource data fusion with V2X communication is applied in the urban traffic system, it can greatly improve driving safety and traffic capacity. The accuracy error caused by sensor interference can also be avoided.
(2) Traffic data from the OBD interface is more accurate and computationally friendly. The method reduces the data load on the network and the amount of computation and can effectively avoid errors caused by the collection system. Since the method derives the corresponding dynamic data directly from the OBD data, it is not necessary to always track and calculate the vehicle GPS data. The process of data calculation and transmission is simplified by the method, and errors caused by sensors can also be avoided.

Evaluation Model Based on Real-Time Traffic Information Fusion
The evaluation model established in this paper is based on the multilevel fuzzy synthetic evaluation model. The basic idea is to establish the fuzzy judgment matrix by using the transformation principle to describe the data boundary of the factors in the fuzzy set. Through the multilayer numerical calculation based on the evaluation criteria and weights, we determine the results of the evaluation object [20,21]. The structure of the synthetic evaluation model is shown in Figure 4.
While evaluating a road's real-time status, there are many evaluation indices that can be used. Based on the principle of measurability, the average travel time (ATT), average number of stops (ANS), and average stopped time (AST) are selected as the final evaluation indices in this model, as in (1):

$$U = \{u_1, u_2, u_3\} = \{\mathrm{ATT}, \mathrm{ANS}, \mathrm{AST}\}. \tag{1}$$

All three indices can be calculated from data collected from OBD and V2X communication.
Travel time is defined as the difference between the data exchange time at the current intersection and that at the adjacent downstream intersection. While traveling to the next intersection, the OBD can record the number of stops and total stopped time. In addition, all detected data are checked for communication integrity and data validity. Then, the three selected real-time evaluation indices are calculated by averaging the valid data.
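The averaging of valid records into the three indices can be sketched as follows. This is only an illustration under stated assumptions: the record layout (`SectionRecord` and its field names) and the `valid` flag are hypothetical stand-ins for whatever the OBU actually reports and for the integrity and validation checks.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class SectionRecord:
    """One vehicle's report for a road section (hypothetical field names)."""
    enter_time_s: float    # data exchange time at the first intersection
    exit_time_s: float     # data exchange time at the second intersection
    stops: int             # number of stops recorded from OBD data
    stopped_time_s: float  # total stopped time recorded from OBD data
    valid: bool            # result of the integrity/validation checks


def evaluation_indices(records):
    """Return (ATT, ANS, AST) averaged over valid records, or None if none."""
    valid = [r for r in records if r.valid]
    if not valid:
        return None
    att = mean(r.exit_time_s - r.enter_time_s for r in valid)
    ans = mean(r.stops for r in valid)
    ast = mean(r.stopped_time_s for r in valid)
    return att, ans, ast
```

Invalid records are simply excluded here; how the real system treats them is not specified in the text.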
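Since the membership function is applicable to all evaluation objects, an evaluation matrix can be obtained as a fuzzy relation between the set of objects to be evaluated $X$ and the set of evaluation indices $U$, that is, $R: X \times U \to [0, 1]$, which is defined as

$$R = (r_{ij})_{n \times m} = \begin{pmatrix} r_{11} & \cdots & r_{1m} \\ \vdots & \ddots & \vdots \\ r_{n1} & \cdots & r_{nm} \end{pmatrix}, \tag{2}$$

where $r_{ij} = R(x_i, u_j) \in [0, 1]$ is the membership degree of the $i$th object to be evaluated on the $j$th evaluation index. Define $E = (X, U, R)$ as the primary evaluation space and give a fuzzy vector $A$:

$$A = (a_1, a_2, \ldots, a_m). \tag{3}$$

In (3), element $a_j$ of $A$ represents the weight of each evaluation index with respect to the primary evaluation model:

$$B = A \otimes R. \tag{4}$$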

The Variable Membership Model.
Assuming that the evaluation result is a finite set $V = \{V_1, V_2, V_3, V_4\} = \{\text{excellent}, \text{good}, \text{medium}, \text{bad}\}$, each element in the set corresponds to a distribution interval of the membership function value $\mu$, which is shown in

$$\mu \in [0.25(k - 1),\ 0.25k], \quad v = V_k \quad (k = 1, 2, 3, 4). \tag{5}$$

Considering the negative correlation between the evaluation indices and evaluation results, this paper selects the membership function of the Cauchy type, as shown in (6). It can be seen that, in the coordinate system of index value versus membership degree, each membership function distribution interval of width 0.25 must have a corresponding critical value of the index. According to the set of values under the same membership function, the three coefficients of the membership function can be solved by regression analysis. Taking into account the actual traffic scenario, the critical value is time-varying and changes with the static traffic parameters. The dynamic adjustment strategy for the critical value is given in (7), whose terms are as follows: a typical critical value for the length of the unidirectional road at the specified road grade, calibrated by a large number of tests; a coefficient that represents the part of the evaluation indices generated by signal control, which depends mainly on the green signal ratio and the number of phases; the standard length (500 m) for the urban road; the influence coefficient of the number of lanes on the $j$th evaluation index; and the influence coefficient of the number of branch roads on the $j$th evaluation index.
To obtain the critical values, we substitute the relevant static data (in Table 4) and typical values into (7). Using the nonlinear regression analysis function "nlinfit" in MATLAB, we obtain the parameters of the Cauchy-type function in (6). The three fitted coefficients and the resulting curves are shown in Table 1 and Figure 5.
The value of the membership function $\mu$ must then be converted into the membership degree of the corresponding evaluation result. Based on the trapezoidal membership, each interval boundary $0.25k$ of the evaluation result set $V$ is defined as the point of intermediate membership degree (0.5) between the two adjacent evaluation results. The floating range is from $0.25k - 0.1$ to $0.25k + 0.1$. The final membership relationship is shown in Figure 6.
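As a rough illustration of the interval rule above, the following sketch maps a membership function value to one of the four evaluation results and flags values that fall inside a floating range. It assumes that the $k$th result covers the interval $[0.25(k-1), 0.25k]$, that a value lying exactly on a boundary is assigned to the lower interval, and that the floating range spans ±0.1 around each interior boundary; these tie-breaking and range choices are assumptions, not details confirmed by the paper.

```python
import math

# Order of the evaluation results as listed in the text.
GRADES = {1: "excellent", 2: "good", 3: "medium", 4: "bad"}


def grade_index(mu):
    """Return k such that mu lies in [0.25*(k-1), 0.25*k].

    A value exactly on a boundary is assigned to the lower interval
    (an assumed tie-breaking rule).
    """
    if not 0.0 <= mu <= 1.0:
        raise ValueError("membership value must lie in [0, 1]")
    return max(1, math.ceil(mu / 0.25))


def in_floating_range(mu, half_width=0.1):
    """True if mu is within half_width of an interior boundary (0.25, 0.5, 0.75)."""
    return any(abs(mu - boundary) <= half_width for boundary in (0.25, 0.5, 0.75))
```

For example, `GRADES[grade_index(0.3)]` gives the second result in the list, and `in_floating_range(0.3)` indicates that the value is close enough to the 0.25 boundary to share membership with the neighboring result.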

The Analytic Hierarchy Process.
The Analytic Hierarchy Process (AHP) is used for organizing and analyzing complex decisions based on mathematics and psychology. Rather than prescribing a "correct" decision, the AHP helps decision makers find the solution that best suits their goal and their understanding of the problem. It provides a comprehensive and rational framework for structuring a decision problem, for representing and quantifying its elements, for relating those elements to overall goals, and for evaluating alternative solutions.
In the proposed model, there are many evaluation indices that can be considered. However, the weights of the indices are not predefined. For example, some researchers may regard the average speed as the most important index for evaluating a road state, whereas others may regard the number of stops as more important than the average speed. Both viewpoints are subjective assumptions. The AHP method is therefore used to determine the weights of these evaluation indices systematically. The steps of AHP are as follows.
Step 1. According to the relevant research and practical experience, the importance of the three evaluation indices is compared pairwise, and the judgment matrix table is acquired as shown in Table 2.
Step 2. The data in Table 2 are brought into the following equation to obtain the judgment matrix $P$:

$$P = (p_{ij})_{n \times n}. \tag{9}$$

Step 3. The columns of judgment matrix $P$ are normalized as

$$\bar{p}_{ij} = \frac{p_{ij}}{\sum_{k=1}^{n} p_{kj}}. \tag{10}$$

To apply the AHP method, each element in (9) is recalculated as a proportion of the sum of its own column. By normalizing, the value of each element is transformed into a percentage, which is the value calculated in (10).
Step 4. The sum of the rows of the normalized judgment matrix is calculated as

$$\bar{w}_i = \sum_{j=1}^{n} \bar{p}_{ij}. \tag{11}$$

Step 5. Normalize $\bar{w}_i$ to get $w_i$; we can find the largest eigenvalue $\lambda_{\max}$ and eigenvector $W = (w_1, \ldots, w_n)^{T}$, according to $PW = \lambda_{\max} W$:

$$w_i = \frac{\bar{w}_i}{\sum_{k=1}^{n} \bar{w}_k}, \qquad \lambda_{\max} = \frac{1}{n} \sum_{i=1}^{n} \frac{(PW)_i}{w_i}.$$

Finally, the weight matrix can be calculated: $W = (0.52, 0.22, 0.26)^{T}$.
When the weight set is calculated by the AHP method, we make a consistency check to ensure that the results are reasonable. For example, to avoid logical errors, if the result is that index $u_1$ is more important than index $u_2$, and index $u_2$ is more important than index $u_3$, but index $u_3$ is more important than index $u_1$, then the results are unreasonable.
Step 6. Calculate the consistency index C.I, where $n$ represents the order of the judgment matrix $P$; the corresponding mean random consistency index R.I can then be found:

$$\mathrm{C.I} = \frac{\lambda_{\max} - n}{n - 1}.$$

Step 7. The consistency ratio C.R is calculated as (13), where R.I represents a constant value determined by $n$ (e.g., $n = 3$, R.I = 0.52):

$$\mathrm{C.R} = \frac{\mathrm{C.I}}{\mathrm{R.I}}. \tag{13}$$

Through the calculation in (13), the results of the consistency check (C.R < 0.1) are accepted.
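Steps 2 to 7 can be sketched in a few lines of Python. This is a minimal illustration using the standard column-normalization (sum) method of AHP; the judgment matrix `P_example` is hypothetical, since the values of Table 2 are not reproduced here, and only the R.I value for $n = 3$ quoted in the text is included.

```python
# R.I value quoted in the text for a judgment matrix of order 3.
RANDOM_INDEX = {3: 0.52}


def ahp_weights(P):
    """Return (weights, lambda_max, CI, CR) for a square judgment matrix P."""
    n = len(P)
    # Step 3: normalize each column so that it sums to 1.
    col_sums = [sum(P[i][j] for i in range(n)) for j in range(n)]
    normalized = [[P[i][j] / col_sums[j] for j in range(n)] for i in range(n)]
    # Step 4: sum each row of the normalized matrix.
    row_sums = [sum(row) for row in normalized]
    # Step 5: normalize the row sums to obtain the weight vector.
    total = sum(row_sums)
    w = [s / total for s in row_sums]
    # Approximate the largest eigenvalue from P w = lambda_max w.
    Pw = [sum(P[i][j] * w[j] for j in range(n)) for i in range(n)]
    lambda_max = sum(Pw[i] / w[i] for i in range(n)) / n
    # Steps 6 and 7: consistency index and consistency ratio.
    ci = (lambda_max - n) / (n - 1)
    cr = ci / RANDOM_INDEX[n]
    return w, lambda_max, ci, cr


# Hypothetical pairwise comparisons for (ATT, ANS, AST); not the paper's Table 2.
P_example = [
    [1.0, 3.0, 2.0],
    [1.0 / 3.0, 1.0, 1.0],
    [1.0 / 2.0, 1.0, 1.0],
]
weights, lambda_max, ci, cr = ahp_weights(P_example)
```

With this hypothetical matrix the first index receives the largest weight and the consistency ratio stays below the 0.1 acceptance threshold; a different judgment matrix would of course give different weights.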

The Fuzzy Operator Pair and Secondary Evaluation Model.
The symbol $\otimes$ in (4) represents a fuzzy operator pair. If more operator pairs are introduced at the same time, a new fuzzy subset can be obtained for each evaluation object:

$$B_k = A \otimes_k R, \quad k = 1, 2, \ldots, p. \tag{14}$$

In (14), $p$ represents the number of fuzzy operator pairs. The fuzzy operator pair determines the meaning of the fuzzy vector to a large extent. Besides, the secondary evaluation space composed of multiple operator pairs helps to measure the influence of the evaluation indices ($U$) on the object to be evaluated ($X$) from various aspects. In this paper, we select three operator pairs: $(\wedge, \vee)$, $(\cdot, \vee)$, and $(\wedge, \oplus)$, where "$\vee$" represents Max, "$\wedge$" represents Min, "$\cdot$" represents multiplication, and "$\oplus$" represents addition. These three operator pairs focus on the contribution of individual or multiple evaluation indices, and the resulting evaluation values do not necessarily sum to 1. A new fuzzy relation can be obtained by combining $X$ and the set of operator pairs $\Phi$, that is, $R': X \times \Phi \to [0, 1]$:

$$R' = (b_{ik})_{n \times p}. \tag{15}$$

In (15), $b_{ik}$ represents the primary evaluation value of the $i$th object calculated from (4) when the $k$th operator pair is used. Thus, the secondary evaluation space $E' = (X, \Phi, R')$ is obtained. Each element $x_i \in X$ is given a fuzzy vector $A'$ in the secondary evaluation space:

$$A' = (a'_1, a'_2, \ldots, a'_p). \tag{16}$$

In (16), the element $a'_k$ of $A'$ represents the weight of the $k$th fuzzy operator pair for the secondary evaluation space, and $\sum_{k=1}^{p} a'_k = 1$. The secondary fuzzy vector is obtained according to the AHP, that is, $A' = (0.17, 0.28, 0.55)$. Then the secondary evaluation model is obtained:

$$B' = (b'_1, b'_2, \ldots, b'_n). \tag{17}$$

In (17), $b'_i$ represents the evaluation index of the $i$th evaluation object, where $b'_i = \sum_{k=1}^{p} a'_k b_{ik}$. This mathematical model is thus a two-level fuzzy evaluation model. The final result is a set of evaluation results.
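The two-level calculation can be illustrated with a short sketch. It applies the three operator pairs named in the text to one row of membership degrees and then combines the three primary values with the secondary weights. The membership row used in the usage note is made up for illustration, "⊕" is implemented as plain addition as the text states, and the order in which the secondary weights (0.17, 0.28, 0.55) are matched to the operator pairs is an assumption.

```python
A = (0.52, 0.22, 0.26)        # index weights reported in the text
A_PRIME = (0.17, 0.28, 0.55)  # operator-pair weights reported in the text


def min_max(a, r):
    """Operator pair (min, max): dominated by the single strongest index."""
    return max(min(x, y) for x, y in zip(a, r))


def prod_max(a, r):
    """Operator pair (multiplication, max)."""
    return max(x * y for x, y in zip(a, r))


def min_sum(a, r):
    """Operator pair (min, addition): accumulates all indices."""
    return sum(min(x, y) for x, y in zip(a, r))


# Assumed to follow the order in which the pairs are listed in the text.
OPERATOR_PAIRS = (min_max, prod_max, min_sum)


def two_level_evaluation(r, a=A, a_prime=A_PRIME):
    """Return (primary values per operator pair, secondary weighted value)."""
    primary = [op(a, r) for op in OPERATOR_PAIRS]
    secondary = sum(w * b for w, b in zip(a_prime, primary))
    return primary, secondary
```

Calling `two_level_evaluation((0.8, 0.4, 0.6))` with this made-up membership row yields one primary value per operator pair and a single weighted secondary value for the object.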

The Synthetic Evaluation Results.
Finally, the evaluation results for connected vehicles are obtained based on the synthetic evaluation method. To ensure the feasibility, the original evaluation results are integrated according to the weighted average principle. First, the elements of the evaluation results are normalized; then the final evaluation score $S$ is calculated as in (18), where $S \in [0, 100]$. The road condition is proportional to the value of $S$; that is, a higher score indicates a better road condition.

Experiment and Analysis
Experimental Method.
A data acquisition system modeled on a real V2X communication scenario is established in this article. The system has an embedded data acquisition device based on the Freescale i.MX6 Q chip, which is installed in vehicles (OBU) and at intersections (RSU). The device has a rich set of interfaces as shown in Figure 7, and the data of the experimental vehicle can be passed into the device via the CAN interface. The OBU acquires OBD and GPS data according to the implementation procedure and communicates with the RSU through a high-power ZigBee module. Tests in the actual environment showed that the communication established by the ZigBee module was stable enough to simulate real V2X communication. The test results are shown in Table 3.
A segment of the Pingguoyuan South Road in Shijingshan District of Beijing is picked as the experimental section in this paper. The static data of this road is shown in Table 4. In the experiment, two experimental vehicles equipped with OBU devices travel continuously on both sides of the road. The calculated evaluation indices are transmitted via wireless communication to the RSU device installed at the intersection. The experiment was carried out from 6:00 am to 8:00 pm on March 3, 2017. In order to improve the accuracy of the evaluation as much as possible, the experiment ensured that there were at least 10/16 sets of data in each section of the road during the peak/valley hours of traffic flow. At the same time, the flow data of the day was obtained through manual observation and converted to Passenger Car Units (PCU).
Restricted by the experimental conditions, the final evaluation result is calculated by exporting the vehicle data stored in the RSU to a personal computer. However, this does not affect the evaluation result and preserves more details of the original data to ensure the reliability of the results.

Experimental Data Analysis.
A total of 407 data packets were collected by the experimental device, of which 377 were valid; that is, 92.6% of all data packets received by the device are valid. All valid data are processed at intervals of one hour as shown in Figures 8, 9, and 10, where (a) represents the section from west to east, (b) represents the section from east to west, the curves represent the three evaluation indices, and the gray shaded area represents the distribution of the data. As seen from the figures, the three evaluation indices share a similar trend with respect to time, which supports the reliability of the data from one aspect. The data sheets of the three evaluation indices are brought into a MATLAB program where the final synthetic evaluation score is calculated; see the lower part of Figure 11. The two colors of the data represent the two directions of the road. The upper part of Figure 11 is the flow data for the day. Comparing the upper and lower parts of the figure, we can find that the evaluation score has a negative correlation with the original traffic flow data, which is consistent with the actual traffic situation. As shown in the figure, the peak flow in the day also corresponds to the lowest evaluation result. The difference between the peak hours and the valley hours is evident in the evaluation results.
Since the two directions of the road have similar static road parameters, in order to describe the relationship between the evaluation score and the traffic flow correctly, all the evaluation data are arranged in ascending order of the traffic flow as shown in Figure 12. The red line in the figure represents the actual calculated evaluation score, and the green line represents the reference delay value of the grade road at different traffic flows.
It can be seen from the above analysis that the evaluation results of the multilevel fuzzy synthetic method used in this paper are in good agreement with the actual situation. If the device's communication coverage is further improved, the time interval of the traffic evaluation can be reduced to 5 to 15 minutes, which is sufficient to meet the requirements of the road evaluation system under a complex road network. It provides a feasible solution for the traffic evaluation method under the V2X scenario.

Conclusion
By fusing the real-time connected vehicle data with static road segment information, an online traffic condition estimation model is proposed and tested in this paper. The OBD data and the traffic evaluation method are applied in the system of connected vehicles on urban roads with V2X.
Based on the traditional fuzzy synthetic model, the multioperator synthetic fuzzy and variable membership model is introduced. We determined the model parameters systematically through AHP. In the field experiment, the evaluation results produced by the proposed model are consistent with the actual situation of the road, which demonstrates the fidelity and effectiveness of the method. When the OBD data in the vehicle is collected through V2X, the proposed model has a greater advantage over the traditional floating vehicle data evaluation method. In addition, since connected vehicles are supplied with more detailed traffic information, the traffic capacity and safety can be greatly improved in the foreseeable future.

Figure 1: The scenario of CV system on intersections.

Figure 3: The flow chart of online traffic evaluation based on data fusion with V2X.

Figure 4: The structure of evaluation model.

Figure 5: Curve fitting of the membership function.

Figure 6: Membership relationship of the evaluation model.

Figure 7: The hardware structure of the terminal for CV system.

Figure 8: Results of average travel time.

Figure 11: Comparison of evaluation scores and traffic conditions (traffic flows) for connected vehicles.

Table 1: Parameters of the membership function.

Table 2: Parameters of the judgment matrix.

Table 3: Data transmission performance with respect to communication distance. Note. The test data consists of three packets, totaling 51 KB.

Table 4: Parameters of the road section.