Expressway Traffic Flow Missing Data Repair Method Based on Coupled Matrix-Tensor Factorizations

Traﬃc ﬂow data is the basis of traﬃc management, planning, control, and other forms of implementation. Once missing, it will directly aﬀect the monitoring and prediction of expressway traﬃc status. Regarding this, this paper proposes a repair method for the traﬃc ﬂow missing data of expressway, combined with the idea of coupled matrix-tensor factorizations (CMTF), to couple the auxiliary traﬃc ﬂow data into the main traﬃc ﬂow data and to construct the coupling matrix-tensor expression of traﬃc ﬂow data, and the alternating direction multiplier algorithm is used to realize the repair of missing traﬃc ﬂow data. Combined with the measured data of expressway traﬃc ﬂow, the experimental results show that, under diﬀerent missing data types and missing rates, the proposed method outperforms the methods lacking auxiliary traﬃc ﬂow data and achieves a good repair eﬀect, especially for high miss data rates.


Introduction
e traffic management department requires traffic flow data to be complete to realize accurate assessment and prediction of the traffic status. us, the repair of the missing data is crucial for effective management of the traffic flow.
In the past few decades, researchers have shown much interest in the problem of repairing the missing traffic flow data. e existing research on this area can be categorized into three types of data composition: vector, matrix, and tensor. Below, we elaborate on the existing studies to provide an overview of the state of the art.
First of all, researchers used the traffic flow data in the form of a one-dimensional vector to repair the missing parts based on spatial or temporal correlation. Gold et al. classified the reasons for the missing data based on the characteristics of expressway traffic flow data and repaired the missing data using nuclear regression, polynomial regression, and linear interpolation [1]. Based on the hotspot analysis of the spatial correlation of traffic data, Conklin et al. used the similarity of the adjacent lanes to repair the missing data [2]. Smith et al. utilized the temporal correlation of traffic data and repaired the missing parts by using the average value of the historical data at the corresponding time and the exponential smoothing value of data at adjacent periods from several days ago [3]. Based on the spatial and temporal correlation, Jiang and Gang predicted the value of data in previous periods and adjacent sections to repair the missing data [4]. Jin and Wang proposed a traffic flow parameter correction method based on correlation analysis from the perspective of statistical analysis of traffic flow data [5]. Gheyas and Smith mainly focused on the measured traffic flow data and utilized the methods of historical trend and exponential smoothing to repair the missing parts. By comparison, these two methods were proved to be more effective in dealing with continuous abnormal data [6]. In consideration of the spatial-temporal characteristics of the traffic flow data, Wang et al. proposed a repair method based on the 3D shape function and the spatial-temporal interpolation [7]. Chen et al. proposed a long short-term memory (LSTM) method, which was built on the normal time-series using the calculation error of the Gaussian Bayesian model. Experimental results on three real datasets showed the advantages of this method [8].
en, the data is organized into a matrix to repair the missing data. Kurucz et al. proposed a low-rank matrix completion algorithm using the path norm minimization of the matrix to replace the minimization of the rank function [9]. Qu et al. performed (Bayesian) principal component analysis to reconstruct the traffic flow data into a matrix. e results showed the effectiveness of this method in data repair for low missing rates [10]. Tang et al. made use of fuzzy c-means clustering and genetic algorithm to repair the missing data based on a matrix. e repair effect of data on different time scales was quantified. e results revealed that the fusion model had a significant repair effect [11]. Lu et al. proposed a data repair model structured on an improved multiscale principal component analysis. Based on the spatial-temporal correlation characteristics of the traffic flow data, they calculated the correlation coefficient of the missing data to estimate its true value [12]. Similarly, Jiang proposed a data repair algorithm using the fusion matrix low-rank decomposition [13]. e data composition in the form of multidimensional tensor expands the vector and matrix to higher dimensions. In recent years, it has proliferated in machine learning (clustering and dimensionality reduction) and other fields [14]. is method can repair the missing data by organizing the traffic flow data into tensors. Acar et al. proposed an algorithm based on CP decomposition and gradient optimization and then proved that the algorithm had high flexibility [15]. Silva and Herrmann proposed a tensor repair method based on the hierarchical Tucker decomposition to repair the missing seismic data [16]. Tan et al. proposed a traffic data repair method based on tensor for the first time, which achieved good experimental results, especially in the case of high data missing rates. Subsequently, Tan et al. proposed several other tensorbased repair methods, which proved that the tensor-based method was still useful, especially for the cases with high missed data [17]. Liu et al. proposed a low-rank tensor repair algorithm to solve the problem of image repair and proved that their method was better than the tensor model using CP decomposition [18]. Based on the tensor theory, Xiang used the spatial-temporal characteristics of the traffic data for modelling, which significantly improved the accuracy of data repair [19]. Zhang focused on the microwave data of the urban expressway and repaired the data using a CP-ALS algorithm based on tensor decomposition. e results proved that their method could effectively improve data quality [20]. Zhang et al. proposed an iterative tensor decomposition (ITD) method to repair the lost data in the ANPR system. ey used the multidimensional intrinsic correlation of the traffic flow data for the detection of missed parts and their repair. e results showed that this method could accurately detect and repair the missing data under various missing rates [21].
To summarize, the one-dimensional vector was highly dependent on the historical information, requiring stable traffic flow data within a period or space. us, it could not use the information collected after the data missed. Besides, if the data missed within a long period, the error in the repair results turned to be high. On the other hand, although the two-dimensional matrix made full use of the spatial-temporal correlation characteristics of the traffic flow data, it was limited to the 2D data. Furthermore, it could not simultaneously utilize the multimode correlation of the traffic flow data, which resulted in low repair accuracy. In recent years, the high-dimensional tensor model has attracted significant attention in data repair, and some progress has been achieved, especially for the repair of traffic flow missing data. However, most of the existing efforts have focused on the model construction of spatial and temporal correlation of the traffic flow data.
In summary, it is necessary to introduce a data repair method, which makes full, reasonable, and effective use of multidimensional information between the traffic flow data. To do so, this work for the first time proposes a traffic flow missing data repair model based on the coupled matrix-tensor factorizations. Compared with the existing work, we make two contributions: (1) we aim to obtain better multidimensional information of traffic flow data, which is divided into main traffic flow data and auxiliary traffic flow data, and coupled the auxiliary traffic flow data into the main traffic flow data to construct the coupling matrix-tensor expression of traffic flow data. (2) In order to get better data repair effect, we used alternating direction multiplier (ADMM) algorithm to decompose large global problems into many smaller and easier local subproblems to solve the optimal solution. By comparison with the prior method, we validate the effectiveness and feasibility of the proposed methodology in data repair. e rest of this paper is organized as follows. Firstly, the tensor theory is explained in Section 2. en, the traffic flow data analysis is provided in Section 3. Section 4 presents the missing data repair method based on CMTF, where the results and discussion on the findings are elaborated in Section 5. Finally, Section 6 concludes the paper.

Tensor Theory
As an extension of vector and matrix to a high-dimensional space, tensor is a mathematical representation of a multidimensional data. e dimension of a tensor is also called the modulus or the order [22,23]. Hence, if we consider a vector as a tensor of the first order and a matrix as a tensor of the second order, it refers to an array of three or more dimensions as a tensor of a high order, tensor for short. e third-order tensor χ ∈ R I×J×K with dimensions I, J, and K shown in Figure 1 is an example of this.
Tensors can be expressed in either a fibrous or a slice form [24,25]. Any two dimensions in the third-order tensor are kept unchanged. In this case, the tensor can be expressed by a column fiber line (x : jk ), a fiber (x i: k ), and a pipe (x ij: ), where the colon represents the dimension all of elements. Alternatively, the tensor can be expressed in the form of a horizontal slice (X i∷ ), a side slice (X : j: ), and a front slice (X : : k ) when the other two dimensions are changed.

Tensor Matrixing.
Tensor matrixing is a process, in which the elements of a tensor are rearranged into matrices. To be specific, tensor operations are simplified by expanding tensor χ into the matrix X (n) in the n modulus.
e CP decomposition is one of the earliest and most indepth tensor decomposition methods. It can be interpreted as decomposing a tensor into the form of a sum of rank-1 tensors. For χ ∈ R I×J×K , the CP decomposition can be expressed in the following matrix form: where A � [a 1 , a 2 , . . . , a r ] is the vector a r corresponding to rank-1 tensor. Similarly, B and C are the vectors b r and c r corresponding to rank-1 tensor. us, A, B, and C are the factor matrices decomposed by CP decomposition, where A ∈ R I×R , B ∈ R J×R , and C ∈ R K×R . e slice of a tensor can be represented as follows: where the symbol "⊙" can be specifically described as the output value of 1 when two input variables are the same, otherwise the output value is. In practice, however, the column vectors in the factor matrix are usually normalized: us, the CP decomposition can be expressed as where λ is the regularization parameter.

e Coupled Matrix-Tensor
Factorizations. e joint analysis of data is quite useful in understanding the underlying structure of complex datasets. Here, the coupled matrix-tensor factorizations (CMTF) can capture the potential structural characteristics of heterogeneous data by coupling the matrix data into the high-order tensor model.
To explain this coupling model in detail, we assume that a third-order tensor χ ∈ R I×J×K and a matrix U ∈ R I×M are derived from different datasets, and U is coupled to a dimension of χ. e schematic diagram of the CMTF is shown in Figure 2.
e underlying structure of the dataset can be extracted by the CMTF, where the R-component of a tensor χ and a matrix U is defined as where A, B, C are the factor matrices of the tensor χ extracted by CP decomposition, where A ∈ R I×R , B ∈ R J×R , and C ∈ R K×R . e factor matrices A ∈ R M×R and V ∈ R M×R are extracted from matrix U, which can be represented as where A is the common mode factor matrix of the tensor χ and the matrix U on a certain dimension (modulus).
To ensure that the datasets are codecomposed and share the same potential space, the matrix U should have the same underlying low-rank structure as the tensor χ on at least one dimension.

Traffic Flow Missing Data: Causes and Types.
e complete traffic flow data is key for the traffic management department to assess the traffic status. However, due to the influence/restriction of traffic conditions, meteorological environment, and many other factors, the traffic system has formed a complex operating environment. ere are four main reasons of the missing data: detection equipment failure, power failure, communication network failure, and environmental factors. e missing data types can be summarized as follows: (1) Missing Completely at Random (MCAR). It refers to the type where the missing traffic flow data are completely random and independent of each other; that is, the probability of the missing data is not affected by other factors. e missing data location is represented as a random discrete distribution. (2) Missing at Random (MAR). It refers to the type where the missing traffic flow data is not completely random. e missing data is not correlated with itself Figure 2: e CMTF model.

Mathematical Problems in Engineering
but with other adjacent data points. e missing data location is a random continuous distribution.

(3) Missing at Nonrandom (MANR). It refers to the type
where there is a certain correlation between the missing data and its characteristics, which also depends on other missing data. is loss is generally caused by the system itself.
One of the earlier studies in the field [26,27] assumed that the MANR data was found and then deleted, where subsequent studies mainly used the MCAR and MAR. To comprehensively reflect the real traffic flow under the condition that the complexity of the data is missing, the follow-up work will be according to the three following case studies: (1) e missing data type 100% are missing completely at random (MCAR) (2) Lack of missing at random (MAR) data type 100% (3) Lack of missing completely at random data type and random loss, 50% each, namely, lack of hybrid (MIX-M)

Analysis of the Traffic Flow Data.
In analysis of traffic flow data variation, the traffic flow data is divided into main traffic flow data and auxiliary traffic flow data according to whether it can directly reflect the actual traffic status of the road.

Analysis of the Main Traffic Flow Data.
is paper defines the main traffic flow data as the data that can directly reflect the state change of the system. Specifically, it is described as the index value that can represent the state of the traffic flow through the data, such as flow, speed, and occupancy, which plays a leading role in the repair of the missing data. e phenomena of the main traffic flow data changes with time and space are called the spatial and temporal distribution of traffic flow, which has the characteristics of temporal and spatial correlation. e time correlation analysis of the traffic flow data describes the change characteristics in the time domain. e travellers determine the time regularity of the traffic flow. According to the traffic data of different times of a month, week, and day, the similarity of the different degrees is reflected. Considering the different time organization modes of traffic flow data, the time correlation analysis will be performed according to the correlation analysis of different weeks of a month, days of a week, and periods of a day. Some studies have revealed that the traffic flow data has strong daily and weekly correlations [3-5, 7, 16, 18, 28-31].
e spatial correlation analysis of the traffic flow data includes two main parts: (i) the data correlation analysis of a lane and its transverse adjacent lanes and (ii) the data correlation analysis of a lane and its longitudinal upstream and downstream sections of the same lane. Some studies have proved that the flow data of an urban expressway has a strong correlation with its horizontal and vertical spaces. In theory, the spatial and temporal correlation of the traffic flow data can be used to effectively repair the missing data.

Analysis of the Auxiliary Traffic Flow Data.
e core of data repair is to establish a relationship between the known and missing data. e more information related to missing data is contained in the data repair model, the better the effect of data repair will be. e auxiliary traffic flow data refers to the fact that the traffic flow data itself does not have a direct response state of the quantitative indicators. However, in the process of data repair, it can provide certain information about road conditions and environmental characteristics.
us, the acquisition and use of auxiliary data will greatly help to restore the missing data. e importance of auxiliary traffic flow data in data repair is explained as follows: Firstly, different weather conditions may affect the operational behaviours of vehicles, for example, in rainy weather, due to the slippery road surface, which will directly affect the speed of vehicles, thus leading to similar running state of vehicles in similar weather conditions. Secondly, the analysis of the running speed of vehicles in different lanes of the same section shows that the vehicles using the middle lane will travel at different speeds compared to the ones in the edge lanes [32,33]. e difference in the information added for the traffic flow data repair helps to improve the restoration precision of the traffic flow data.

The Missing Data Repair Method
Based on CMTF e joint analysis of multisource data improves the understanding of the underlying structure of complex datasets. In the existing research, the repair of traffic flow missing data is mostly based on spatiotemporal information, while the importance of auxiliary information in data repair is neglected. Furthermore, due to the heterogeneity of the latter, there are different quantitative standards in practical application. erefore, a method is required to integrate the auxiliary information about traffic flow with the spatial-temporal information to repair the missing data. e CMTF model can fit the matrix data into the tensor model through the coupling and capture the underlying structural characteristics of the heterogeneous data, realizing the joint analysis. us, combining with the CMTF model, this paper proposes a missing data repair method based on multisource information.

4.1.
e CMTF Representation of the Traffic Flow Data. e schematic diagram of the CMTF model is shown in Figure 3. ere are three datasets in the figure: the tensor with missing traffic flow data, the feature matrix of weather feature data, and the lane feature data matrix. e third-order tensor χ ∈ R I×J×K is constructed from the perspective of the spatiotemporal correlation of the main traffic flow data, where I represents the longitudinal time-series data of the same day in I consecutive weeks, J represents the J lanes of the same detection section, and K represents the K transverse time-series data of a day.
According to the auxiliary traffic flow data, also considering that the third-order tensor contains the same underlying low-rank structure with the construction of the third-order tensor, the two following feature matrices are constructed: (1) Weather Feature Matrix. Different weather conditions, for example, rain, will affect the operational behaviours of vehicles. e wet roads and low visibility will directly affect the speed of vehicles, resulting in a similar running state for all vehicles. e weather data matrix Q ∈ R I×M is in the composition of feature dimension, where I represents the longitudinal time-series data of the same day in I consecutive weeks, and M represents the M transverse time-series data of a day. Here, the sunny, cloudy (foggy), and rainy/snowy days are assigned the values of 1, 2, and 3, respectively. us, the element x im in the matrix is an integer varying from 1 to 3.
(2) Lane Feature Matrix. e running state of the vehicles traveling towards the same direction in the different lanes of the same section will have typical differences. For example, near the separator, the operating characteristics of the vehicle lane edge will be different, which will be considered in the traffic flow data repair. For the lane feature matrix D ∈ R J×N , where J represents the J lanes of the same detection section and N represents the lane attribute values. Starting from the median, lane of attribute assignment is, respectively, 1, 2, 2, and 3. us, the elements in matrix y jn are integers varying between 1 and 3.
e tensor containing missing data is constructed for the traffic flow spatiotemporal data, where the lane and weather feature matrices are coupled in lane and date dimensions, respectively. In this way, the data fusion problem is transformed into the CMTF decomposition problem. us, the missing data in the original tensor can be repaired by decomposing the tensor and matrix into multiple datasets and then reconstructing the tensor.

Objective Function Construction.
e factor matrices A 1 , B 1 , and C are obtained by the CP decomposition of tensor χ, where A 1 ∈ R I×R , B 1 ∈ R J×R , and C ∈ R K×R . Decomposition of matrices Q and D, respectively, gives the factor matrices A 2 , V, B 2 , and W, where A 2 ∈ R I×M , V ∈ R I×M , B 2 ∈ R J×N , and W ∈ R J×N . us, the objective function of CMTF decomposition can be expressed as e missing data in tensor χ can be repaired by coupling it with a known matrix.
Next, denote by F x the set of the squared errors between the original tensor and the factor matrices A 1 , B 1 , and C, where the reconstruction tensor is obtained by CP decomposition. e smaller F x is, the better the repair effect of missing data in the original tensor will be. R is the rank of the tensor, and λ is the regularization parameter. ‖A 1 ‖ F , ‖B 1 ‖ F , and ‖C‖ F are the norms corresponding to each factor matrix.
us, the tensor χ is decomposed and expanded according to one module CP as where X (1) is the matrix representation of the tensor on one modulus. e sets F Q and F D represent the squared error between the original matrix and the matrix reconstructed from the factor matrix, which are obtained by matrix decomposition as

Mathematical Problems in Engineering
According to the analysis of the CMTF model, the matrix should have the same underlying low-rank structure as the tensor in at least one dimension. erefore, the matrix and the tensor should have a common modulus factor matrix.
e assumption is that A 1 and A 2 are equal; and B 1 and B 2 are equal. erefore, two global optimal variables A and B are introduced to satisfy the application premise of the CMTF model: us, the optimization problem of the objective function is described as where Ψ represents the factor matrix set: Ψ � A 1 , B 1 , C, A 2 , V, B 2 , W . erefore, the problem of missing data repair is transformed into the problem of solving the optimal value of the objective function F f .

Optimization Solution.
For optimization, the alternating least square (ALS) and gradient descent method are commonly used in the literature [34][35][36][37][38][39][40]. Since both the matrix and the tensor decomposition are a nonconvex problem, the commonly used methods cannot achieve the global optimum. To resolve this issue, the alternating direction multiplier algorithm (ADMM) is suggested [41]. In this method, large global problems are decomposed into many smaller and easier local subproblems, obtaining large global optimal solutions.
us, when F f is finding the optimal solution, it can be interpreted by solving the local F χ , F Q , and F D optimal solution problems. e objective function of this problem can be given as where Θ 1 are the Lagrangian multipliers [31], ρ is the penalty coefficient ( > 0), and tr is the trace of the matrix, that is, the sum of the diagonal elements in the matrix. e iterative calculation formula of (9) is Concerning L ρ , the partial derivative of each factor matrix can be obtained as By setting (zL ρ /zA 1 ) � 0, Similarly, Similarly, it can be inferred further that

Mathematical Problems in Engineering
If set � 1, 2, . . . , I max , and it can be inferred further that Based on the above-given analysis, the main steps of the CMTF-based traffic flow missing data repair method are shown in Table 1.

Data Sources.
e main section of an urban expressway is selected as the area to be analysed. e section includes five detection sections (in the east and west) with four lanes in each section and a total of 40 coil detectors. e data that will be collected by the detectors is shown in Table 2, where "TY" indicates the code of the detection section, "D" refers to the east, and "10 (2)" means Lane 2 of the detection section #10. e data is sampled from August 27, 2018, to September 29, 2018, with a sampling interval of 5 min. ere are 161280 groups of data.

Judgment Criteria for Data Repair.
e existence and reparability of the missing flow data have to be judged before starting a repair process. To determine whether anything is missing, the traffic flow data can be used. For example, if the flow data is displayed as "0," it might be due to no vehicle passing through the detector or might be due to some missed data.
Under normal traffic conditions, the missing traffic flow data can be repaired according to the changing trend in the traffic flow. However, under abnormal conditions, such as traffic accidents conditions, once the data is "0," it is impossible to judge the existence and (if exists) the reparability of the data.
Following the continuous progress in detection technology, the detectors have become more capable of realizing the preliminary discrimination of data anomalies. For example, the location detector can classify the data as valid, invalid, and/or lost. For repair judgment, this paper only considers the invalid data with the value of "0," which is denoted as the missed data that will be repaired.

Experimental Scheme Design.
(1) Data Representation. Let us construct the tensor χ ∈ R 5×4×288 for the target data, where "5" represents "Day-mode," which means the longitudinal time-series data of five consecutive Mondays, "4" represents "Lane-mode," which means the transverse spatial sequence data of four lanes in the same detection section, and "288" represents "Time-mode," which means the 288 transverse time-series data obtained in one day. e weather feature matrix is Q ∈ R 5×288 , where "5" refers to five longitudinal time-series data of Monday, and "288" represents the 288 transverse time-series data obtained in one day. e lane feature matrix is D ∈ R 4×3 , where "4" refers to the data of four lanes of the same detection section, and "3" represents the lane attribute values. (2) e evaluation index of the missing data repair effect. e root mean square error (RMSE) and mean absolute percentage error (MAPE) are used as evaluation indexes [14]. It should be noted that the smaller the difference between the repaired and actual data is, the smaller the RMSE and MAPE will be. ese errors are calculated as follows: Table 1: e main steps of CMTF. Input: the tensor χ ∈ R I×J×K with missing traffic flow data, matrices Q ∈ R I×M and D ∈ R J×N , tensor rank R, initial iteration number iter, maximum iteration number MaxIter Output: completed tensor χ ∈ R I×J×K (1) Initialize ρ, λ 1 , (22), and B using (23) (3) If iter < MaxIter, repeat Step 4 ∼ Step 8 (4) Using (14)-(16), calculate and update A 1 , B 1 , C, respectively (5) Using (17)- (20), calculate and update A 2 , V, B 2 , W, respectively (6) Using (22), calculate and update A. Similarly, calculate and update B using (23) (7) Using (12), calculate and update Θ 1 If the algorithm reaches the convergence standard, the loop is broken. Otherwise, return to Step 3 (9) Return to χ � A 1 ∘ B 1 ∘ C where y i is the actual data, y i is the repaired data, and n is the number of repaired pieces of data. (3) Missing Data Type Design. To illustrate the data repair performance of the CMTF, three types of missing data were manually set up: MCAR, MAR, and MIX-M. en, the data missing rates of each type are set to 10%, 20%, . . ., 90%. In order to avoid the influence of the randomly set data missing position on the repair result, ten settings were made for different data missing types, and the average value of the 10 repair effects (RMSE and MAPE) corresponding to each method was selected as the final repair result.

Analysis of the Experimental Results.
e experiment environment is designed using Matlab-R2014a software, where the data repair based on the CMTF algorithm is realized by the CMTF_Toolbox_v1_1toolbox [15]. e relevant parameters of the model are obtained by cross-validation. e TYD12(2) detector with relatively complete flow data on August 27, 2018, was selected for tests, which were loaded with three missing data types (MCAR, MAR, and MIX-M) and nine missing rates (10% to 90%). ree methods, namely, matrix repair method [13] (Method 1), tensor repair algorithm (Method 2) [20], and the optimization algorithm of CMTF based on alternating least square (Method 3), are compared with the algorithm proposed in this paper (Method 4). e specific analysis of the data repair effect is as follows.

Repair Effect Evaluation of the MACR.
For the MACR, the repair results are shown in Tables 3 and 4. e results show that the RMSE and MAPE of all methods are relatively small and close to each other for low data missing rates. However, with the gradual increase of the missing rate, the RMSE and MAPE for all methods also increase gradually. For high missing rates, the RMSE of Method 3 and Method 4 remains around 15, which is significantly lower than that of the other two methods. is indicates that the multidimensional flow data has a significant effect on the repair of the missing data. e RMSE and MAPE of Method 4 are generally lower than the corresponding values of Method 3, indicating that the local optimization solution based on ADMM is better than the one based on ALS.

Repair Effect Evaluation of the MAR.
For the MAR, the repair results are shown in Tables 5 and 6. e results show that the RMSE and MAPE of all methods are relatively small for low missing rates similar to MACR. However, with the gradual increase of the miss rate, the RMSE and MAPE values also increase gradually, where Method 4 has the slowest growth rate. For high missing rates, the RMSE of Method 3 and Method 4 is still lower than 16.5%, which is significantly lower than the other two methods. is indicates that the multisource flow data has a more significant effect on the repair. e repair effect of Method 4 is lower than that of Method 3, indicating that the local optimization solution based on ADMM is better than the one based on ALS. Compared to the MACR, the RMSE and MAPE of all methods under the MAR are relatively larger. is is because the missing data is continuous, which will affect the repair effect to some extent.  Tables 7 and 8. e overall trend is similar to that of MACR and MAR, but the RMSE and MAPE of all methods are in the middle.

Repair Effect
is is because the missing data in MIX-M includes both the discrete missing data of MACR and the continuous missing data of MAR. us, the existence of continuous missing data will affect the repair process. e MIX-M can reflect the lack of traffic flow data. From Figure 4, for the missing rate of 50%, the MAPE of all methods is lower than 21%. When the missing rate exceeds 60%, the MAPE difference between the methods becomes larger. Here, Method 4 still has the best repair effect for high data missing rates.
Under the three types of data missing, all methods can repair the missing data at certain levels, where the repair effect of the MACR is the best, and the MAR is the worst.
In comparison, the repair effect of Method 2 is better than that of Method 1.
is indicates that the tensor algorithm based on multidimensional data can improve the repair effect. e repair effect of Method 3 and Method 4 is better than the other two, indicating that the addition of auxiliary data is useful in data repair. e repair effect of Method 4 is better than that of Method 3; that is, the optimal solution of ADMM is better than the traditional solution that ALS offers.
Our proposed method has a best repair effect, which has the lowest RMSE and MAPE among all methods. Even for     the MAR with a miss rate of more than 80%, the RMSE and MAPE of our method are still around 16.5 and 20%, respectively. is shows that the accuracy of traffic flow missing data repair can be effectively improved by adding auxiliary data based on sufficient and reasonable use of the spatial and temporal correlation between the traffic flow data.

Conclusions
To improve the repair effect of traffic flow missing data, this paper proposes a CMTF-based repair method. Different from the existing methods, the proposed method can make full and reasonable use of the spatial and temporal data of traffic flow to further increase the auxiliary traffic flow data and use the ADMM algorithm for optimization solution. e results showed that the proposed algorithm outperformed the other three comparison methods; that is, it has a better repair effect under three data deletion modes and nine data deletion rates. It can also achieve the best repair accuracy under high data deletion rates. e acquisition of complete traffic flow data realized by the proposed method will be highly beneficial in traffic management and control.
Since this paper only considered the coupling of a 3D tensor and a 2D matrix, future research will focus on the coupling of multiple matrices based on the multidimensional tensor. While doing that, we will also consider the total time cost of the running model.

Data Availability
e data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest
e authors declare that there are no conflicts of interest regarding the publication of this paper.