Target Centroid Position Estimation of Phase-Path Volume Kalman Filtering

To address the problem that an intelligent robot easily loses its tracking target when obstacles appear, this paper proposes a target tracking algorithm that integrates the Camshift algorithm with a reduced dimension optimal Kalman filtering algorithm based on the phase-path volume integral. First, the Camshift algorithm is compared with the SIFT and Mean Shift algorithms and its defects are analyzed; the Kalman filtering algorithm is then fused with Camshift to compensate for these defects. Because the fusion increases the amount of calculation, the Gaussian integral in the Kalman algorithm is replaced with the phase-path volume integral, which reduces the dimension and the number of sampling points in the filtering process without affecting the precision of the original algorithm. Finally, the target centroid position obtained from the Camshift iteration is used as the observation value of the improved Kalman filtering algorithm to correct the predicted value. This gives an optimal estimate of the target centroid position and maintains the track, so that the robot can understand the environmental scene and react correctly and in time to changes. Experiments show that the proposed algorithm performs well when tracking targets with obstructions and that the dimension reduction lowers the computational complexity of the algorithm.


Introduction
Moving target detection and tracking for robots based on computer vision has become a research hot spot at home and abroad. Its essence is to select an effective matching algorithm based on effective characteristics of the target, estimate the state of the target, obtain target parameters such as position, velocity, and trajectory, and thereby provide reliable data for the analysis and understanding of target behavior [1]. Moving target tracking in complex scenes needs to achieve sustained and stable tracking of the moving object, be robust to illumination variation in the environment and to obstruction of the moving target, and improve the running efficiency and accuracy of the algorithm [2]. Moving target tracking technology has been applied successfully on intelligent robots, realizing robot formation, scene understanding, and positioning, and it also has high economic and social value in real life, for example in intelligent automatic monitoring, human body tracking and identification, red-light violation detection, and vehicle speeding detection [3].
To solve the various problems related to target tracking, many scholars have done innovative research in the field and, in view of the practical problems of different areas, have proposed many different target tracking algorithms, which mainly concentrate on the SIFT algorithm, the Mean Shift algorithm, and the Camshift algorithm. The PCA-SIFT algorithm put forward by Ke and Sukthankar adopts Principal Components Analysis (PCA) to reduce the dimension of the SIFT feature vector from 128 to 36 when generating the feature vector from the gradient direction histogram of the interest point neighborhood, which makes matching about three times faster than the SIFT algorithm at the cost of a small loss in matching performance [4]. Mikolajczyk proposes a feature description method, GLOH (Gradient Location-Orientation Histogram), with better uniqueness and stronger robustness, by extending the SIFT algorithm. GLOH differs from SIFT in the selection of the interest point neighborhood and in the statistical method, and it uses principal component analysis to reduce the dimension of the feature vector to 128. Although GLOH is more robust than SIFT, its time complexity is also higher [5]. Liu et al. propose a PCA-SIFT algorithm that reduces the time complexity of the SIFT algorithm but also reduces the corner detection accuracy [6].
Collins proposes running the Mean Shift iteration in a defined discrete scale space by adding an extra dimension to the kernel, so as to find the best bandwidth [7]. Comaniciu addresses the bandwidth selection problem of the kernel function in the Mean Shift algorithm, but the computation is complex and the adaptability is limited [8]. Li uses a color fusion template and location prediction to improve the performance of the Mean Shift tracking algorithm: on the basis of a simple linear fusion of the RGB components of the image, the most prominent fusion feature among the target features is selected through the similarity function between the target template and the candidates, and three target templates are set up accordingly [9]. Guo et al. propose a multiple feature fusion tracking method that combines color and motion edge features and fuses their probabilities with particle filtering. The combination of the Mean Shift algorithm and particle filtering is robust when tracking in complex environments with changing backgrounds, but because of the high complexity of particle filtering, its real-time performance is not ideal [10].
Wang et al. propose a Camshift target tracking algorithm based on frame difference and motion estimation. To some extent the algorithm can handle target occlusion and overly fast movement, but it cannot track well when the background is complex and other moving targets are present, and it cannot solve the problem of interference from similar targets [11]. Qin combines the spatial histogram with Camshift, but the algorithm has some shortcomings; for example, background information in the operation area may be counted as target information in the histogram, which reduces the precision of the histogram and affects target tracking. In addition, the algorithm works on the H channel of the HSV space, so it is hard to extract the target when the brightness of the target and the background are similar [12]. Huang combines the frame difference method with Camshift to realize automatic tracking, but because the frame difference method extracts only part of the contour edges and leaves holes inside the target, the extracted target tends to be incomplete or to contain background, and tracking quality drops when other colors are mixed into the color features [13]. Li and others improve the extraction of the region of interest in the Camshift algorithm with Canny filtering, which makes the target color histogram resistant to noise, but the method is not effective when the background contour falls within the detection range [14]. Cai improves Camshift tracking performance with the three-frame difference method, which makes the extraction of the target more accurate but still cannot get rid of the inherent defects of the frame difference method [15]. This paper proposes a target tracking algorithm that integrates the Camshift algorithm with a reduced dimension optimal Kalman filtering algorithm based on the phase-path volume integral, in order to improve the efficiency of Camshift-based target tracking.

Performance Analysis of Target Tracking Algorithm
The common moving object tracking algorithms currently in use include the SIFT algorithm, the Mean Shift algorithm, and the Camshift algorithm. This paper first analyzes the performance of these three algorithms.

Defect Analysis of SIFT Algorithm.
The SIFT algorithm extracts key points of the target from the image; these feature points are invariant to image scaling and rotation and, to some degree, to changes in light intensity and camera angle. The first step of key point detection is to find, across different scales, the pixel locations that are insensitive to changes of image scale. The scale space of the image is defined as a function $L(x, y, \sigma)$, obtained by convolving the input image $I(x, y)$ with a Gaussian kernel function $G(x, y, \sigma)$ of variable scale:
$$L(x, y, \sigma) = G(x, y, \sigma) * I(x, y), \tag{1}$$
in which $*$ is the convolution operation over $x$ and $y$, and
$$G(x, y, \sigma) = \frac{1}{2\pi\sigma^2} \exp\left(-\frac{x^2 + y^2}{2\sigma^2}\right). \tag{2}$$
To detect key positions effectively in the image scale space, the difference-of-Gaussian function $D(x, y, \sigma)$ is used, which is obtained by subtracting the image convolution results at two adjacent scales:
$$D(x, y, \sigma) = \bigl(G(x, y, k\sigma) - G(x, y, (k-1)\sigma)\bigr) * I(x, y) = L(x, y, k\sigma) - L(x, y, (k-1)\sigma). \tag{3}$$
The extreme points found are then taken as candidate key points, and the scale space function $D(x, y, \sigma)$ is expanded as a Taylor series:
$$D(X) = D + \frac{\partial D^T}{\partial X} X + \frac{1}{2} X^T \frac{\partial^2 D}{\partial X^2} X, \tag{4}$$
in which $D$ and its derivatives are evaluated at the sampling point and $X = (x, y, \sigma)^T$ is the offset from that point. Setting the derivative of $D(X)$ to 0 gives the extreme point $\hat{X}$, whose position is
$$\hat{X} = -\left(\frac{\partial^2 D}{\partial X^2}\right)^{-1} \frac{\partial D}{\partial X}. \tag{5}$$
In the actual calculation, the differences between adjacent sampling points are used to approximate the derivatives of $D(x, y, \sigma)$. If the offset $\hat{X}$ is greater than 0.5 in any dimension, the current sample point is not the one closest to the extreme point; it is excluded and the same calculation is carried out for the other sample points. Finally, the estimate of the extreme point location is obtained by adding $\hat{X}$ to its corresponding sampling point.
The value $D(\hat{X})$ of the scale space function at the extreme point is used to eliminate unstable extreme points of low contrast. Substituting formula (5) into (4) gives
$$D(\hat{X}) = D + \frac{1}{2} \frac{\partial D^T}{\partial X} \hat{X}. \tag{6}$$
Extreme points whose $|D(\hat{X})|$ is less than a threshold are discarded. A typical threshold is 0.05.
The above analysis shows that the SIFT algorithm relies on extracting features in great quantity: it achieves precise matching through information redundancy, which at the same time increases the amount of calculation of the algorithm itself.
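The key point refinement described above can be illustrated with a minimal sketch. It assumes the gradient and the 3×3 matrix of second derivatives of the difference-of-Gaussian function have already been estimated by finite differences at a sample point. The function names `solve3` and `refine_keypoint` are hypothetical and are not taken from the paper or from any library.

```python
def solve3(a, b):
    """Solve the 3x3 linear system a * x = b by Cramer's rule."""
    def det(m):
        return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
                - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
                + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

    d = det(a)
    if abs(d) < 1e-12:
        raise ValueError("singular second-derivative matrix")
    solution = []
    for col in range(3):
        m = [row[:] for row in a]
        for r in range(3):
            m[r][col] = b[r]
        solution.append(det(m) / d)
    return solution


def refine_keypoint(d0, grad, hess, contrast_threshold=0.05):
    """Return (offset, contrast, keep) for one candidate extreme point.

    d0   : difference-of-Gaussian value at the sample point
    grad : [dD/dx, dD/dy, dD/dsigma] at the sample point
    hess : 3x3 matrix of second derivatives at the sample point
    """
    # Offset of the extreme point: minus the inverse Hessian times the gradient
    offset = [-v for v in solve3(hess, grad)]
    # Interpolated value at the extreme point: D + 0.5 * grad . offset
    contrast = d0 + 0.5 * sum(g * o for g, o in zip(grad, offset))
    # Keep the point only if it stays within half a sample of the sample
    # point and its contrast reaches the threshold
    keep = (all(abs(o) <= 0.5 for o in offset)
            and abs(contrast) >= contrast_threshold)
    return offset, contrast, keep
```

A candidate for which `keep` is false is either re-evaluated at a neighboring sample point (offset too large) or discarded (contrast too low), which is the source of the extra computation noted above.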

Defect Analysis of the Mean Shift Algorithm.
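The Mean Shift algorithm is an estimation technique based on smoothed (kernel) density estimation, and the convergence of the algorithm is at the heart of the precision of target tracking. The convergence of Mean Shift and the increase of the density value follow from the proven proposition: if the kernel function is defined by a monotonically decreasing convex profile $k(x)$, then the sequences $\{y_j\}_{j=1,2,\ldots}$ and $\{\hat{f}_{h,K}(j)\}_{j=1,2,\ldots}$ converge, and $\{\hat{f}_{h,K}(j)\}$ is monotonically increasing.
As the number of samples $n$ is finite, the sequence $\{\hat{f}_{h,K}(j)\}$ is bounded. According to the definition of the sequence $\{\hat{f}_{h,K}(j)\}$,
$$\hat{f}_{h,K}(j+1) - \hat{f}_{h,K}(j) = \frac{c_{k,d}}{n h^d} \sum_{i=1}^{n} \left[ k\left(\left\| \frac{y_{j+1} - x_i}{h} \right\|^2\right) - k\left(\left\| \frac{y_j - x_i}{h} \right\|^2\right) \right], \tag{7}$$
in which $x_i$ ($i = 1, \ldots, n$) are the sample points, $h$ is the window width, $d$ is the dimension, and $c_{k,d}$ is a normalization constant. According to the definition of a convex function, for all $x_1, x_2 \in [0, \infty)$ with $x_1 \neq x_2$,
$$k(x_2) \geq k(x_1) + k'(x_1)(x_2 - x_1). \tag{8}$$
As $g(x) = -k'(x)$, formula (8) can be written as
$$k(x_2) - k(x_1) \geq g(x_1)(x_1 - x_2). \tag{9}$$
Substituting the above formula into (7), we get
$$\hat{f}_{h,K}(j+1) - \hat{f}_{h,K}(j) \geq \frac{c_{k,d}}{n h^{d+2}} \sum_{i=1}^{n} g\left(\left\| \frac{y_j - x_i}{h} \right\|^2\right) \left[ \|y_j - x_i\|^2 - \|y_{j+1} - x_i\|^2 \right]. \tag{10}$$
Because $y_{j+1}$ is the mean of the samples weighted by $g$, the right-hand side of (10) is nonnegative, so the bounded sequence $\{\hat{f}_{h,K}(j)\}$ is nondecreasing and therefore converges.
The analysis assumes a fixed window width $h$. Because the window width remains the same throughout the tracking process, the Mean Shift algorithm fails when the target scale changes, and when the target moves fast the tracking effect is poor.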
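The convergence behavior discussed above can be observed numerically. The following is a minimal sketch, assuming a Gaussian kernel with a fixed window width, that records the kernel density value at every iterate so that its nondecreasing behavior can be checked. The function name `mean_shift` and its parameters are illustrative choices, not the paper's implementation.

```python
import math


def mean_shift(points, start, bandwidth, max_iter=100, tol=1e-6):
    """Run Mean Shift from `start` with a Gaussian kernel of fixed bandwidth.

    Returns the final position and the (unnormalized) kernel density
    value at every iterate.
    """
    def weight(p, y):
        # Gaussian profile exp(-x / 2) evaluated at ||(y - p) / h||^2
        d2 = sum((a - b) ** 2 for a, b in zip(p, y)) / bandwidth ** 2
        return math.exp(-0.5 * d2)

    def density(y):
        return sum(weight(p, y) for p in points) / len(points)

    y = list(start)
    history = [density(y)]
    for _ in range(max_iter):
        w = [weight(p, y) for p in points]
        total = sum(w)
        # Mean Shift update: weighted mean of the samples around y
        new_y = [sum(wi * p[k] for wi, p in zip(w, points)) / total
                 for k in range(len(y))]
        shift = math.sqrt(sum((a - b) ** 2 for a, b in zip(new_y, y)))
        y = new_y
        history.append(density(y))
        if shift < tol:
            break
    return y, history
```

Note that `bandwidth` never changes during the iteration, which is exactly the limitation pointed out above when the target scale varies.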

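Defect Analysis of Camshift Algorithm.
Extending the Mean Shift algorithm to a continuous image sequence (generally a video sequence) yields the Camshift algorithm. The basic idea of the Camshift algorithm is to run the Mean Shift algorithm on every frame of the sequence and to use the center and size of the search window from the previous frame as the initial value of the search window in the next frame; target tracking is achieved through this iteration.
The Camshift algorithm locks onto the target with an ellipse. The size and the deflection angle of the tracked target are shown in Figure 1.
Define the second-order moments as follows:
$$M_{20} = \sum_x \sum_y x^2 I(x, y), \qquad M_{02} = \sum_x \sum_y y^2 I(x, y), \qquad M_{11} = \sum_x \sum_y x y\, I(x, y),$$
in which $I(x, y)$ is the pixel value at $(x, y)$ in the search window. Then the long axis $l$ and the short axis $w$ of the target ellipse are, respectively,
$$l = \sqrt{\frac{(a + c) + \sqrt{b^2 + (a - c)^2}}{2}}, \qquad w = \sqrt{\frac{(a + c) - \sqrt{b^2 + (a - c)^2}}{2}}.$$
In the formula, the values of $a$, $b$, and $c$ are obtained as
$$a = \frac{M_{20}}{M_{00}} - x_c^2, \qquad b = 2\left(\frac{M_{11}}{M_{00}} - x_c y_c\right), \qquad c = \frac{M_{02}}{M_{00}} - y_c^2,$$
in which $M_{00}$ is the zeroth-order moment and $(x_c, y_c)$ is the centroid of the search window. The direction angle $\theta$ of the target is defined as the included angle, less than $180^\circ$, between the long axis and the horizontal direction.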
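As an illustration of how the ellipse parameters follow from the image moments, the sketch below computes the centroid, the two axes, and the orientation for a small weight image. It assumes the moment-based expressions commonly used for Camshift; the orientation formula $\theta = \frac{1}{2}\arctan\bigl(b/(a-c)\bigr)$ is that standard expression and is an assumption here, since the paper's own formula for $\theta$ is not shown. The function name `ellipse_from_moments` is hypothetical.

```python
import math


def ellipse_from_moments(image):
    """Return (xc, yc, length, width, theta) of a weighted pixel distribution.

    `image` is a list of rows; image[y][x] is a nonnegative weight
    (for Camshift, typically a color back-projection probability).
    """
    m00 = m10 = m01 = m20 = m02 = m11 = 0.0
    for y, row in enumerate(image):
        for x, v in enumerate(row):
            m00 += v
            m10 += x * v
            m01 += y * v
            m20 += x * x * v
            m02 += y * y * v
            m11 += x * y * v
    if m00 == 0:
        raise ValueError("search window contains no target weight")
    xc, yc = m10 / m00, m01 / m00
    a = m20 / m00 - xc * xc
    b = 2.0 * (m11 / m00 - xc * yc)
    c = m02 / m00 - yc * yc
    root = math.sqrt(b * b + (a - c) ** 2)
    length = math.sqrt(((a + c) + root) / 2.0)
    # max() guards against a tiny negative value caused by rounding
    width = math.sqrt(max((a + c) - root, 0.0) / 2.0)
    # Assumed standard orientation formula (not shown in the source text)
    theta = 0.5 * math.atan2(b, a - c)
    return xc, yc, length, width, theta
```

For a perfectly horizontal bar of pixels the short axis collapses to zero and the angle is zero, which is a quick sanity check on the formulas.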
The Camshift algorithm uses the result of the previous frame as the initial value of the search window in the next frame and then searches for the optimal target location in the neighborhood of that center. If the target moves so fast that the distance between its centers in the current frame and the previous frame is large, the previous location is still used as the initial position of the search in the current frame, which increases the number of search iterations and can even cause the target to be lost.

Performance Analysis on the Three Algorithms.
The test results of the three algorithms for the image sequences without obstructions are shown in Figure 2, and the results for the other five image sequences with obstructions are shown in Figure 3. The experimental results show that the Camshift algorithm performs better in target tracking, so this paper applies the Camshift algorithm to target tracking and improves its efficiency.

Target Centroid Position Estimation Optimization of Camshift Algorithm

Reduced Dimension Optimization of Kalman Filtering Algorithm.
This paper integrates the Camshift algorithm with the Kalman algorithm to remedy the defects of the Camshift algorithm. However, combining the two raises the complexity of the algorithm and greatly increases the amount of calculation, so the calculation amount of the Kalman algorithm is optimized first.
When the Kalman algorithm deals with a nonlinear system, the number of sampling points grows with the dimension, and so does the filtering calculation. The Gaussian integral has the greatest impact on the calculation of the Kalman algorithm, so the phase-path volume integral is used in place of the Gaussian integral to reduce the dimension. Without loss of filtering precision, this reduces the number of sampling points and the amount of calculation in the filtering process.
The phase-path volume integral approximates an integral by evaluating the nonlinear function on a set of points $S$ and forming a weighted sum. The requirements on the point set $S$ are as follows.
If $u \in S$ and $v$ is obtained by changing the order and the signs of the elements of $u$, then $v \in S$ as well and the weights of the two points are equal; $S$ is then called holohedrally (fully) symmetric.
For an element $u = (u_1, u_2, \ldots, u_r, 0, \ldots, 0) \in S$ of a holohedrally symmetric set $S$ of the phase-path volume integral, if $u_i \geq u_{i+1} > 0$, $u \in \mathbb{R}^n$, then $u$ is called the generator of $S$ in the $n$-dimensional real space, and $[u_1, u_2, \ldots, u_r]$ denotes the set of points generated from $u$.
Set the Gaussian integral as
$$I(f) = \int_{\mathbb{R}^n} f(x) \exp(-x^T x)\, dx. \tag{18}$$
If the constant factor is not considered, the integral can be transformed, with $x = r y$, $y^T y = 1$, and $r \in [0, \infty)$, into
$$I(f) = \int_0^\infty \int_{U_n} f(r y)\, r^{n-1} \exp(-r^2)\, d\sigma(y)\, dr,$$
in which $U_n$ is the unit sphere of dimension $n$ and $\sigma(\cdot)$ is the area element on $U_n$. At this point, the integral of formula (18) is converted into a volume integral and a phase-path integral:
$$S(r) = \int_{U_n} f(r y)\, d\sigma(y), \qquad I(f) = \int_0^\infty S(r)\, r^{n-1} \exp(-r^2)\, dr. \tag{20}$$
The volume integral in formula (20) can be approximated with a volume rule. By the holohedral symmetry of the volume rule, for every monomial $y_1^{d_1} y_2^{d_2} \cdots y_n^{d_n}$ in $f(y)$, in which $d_i$ is the order of the variable $y_i$, the integral is 0 when $\sum_{i=1}^{n} d_i$ is an odd integer. So for the rule to be exact for monomials up to degree three, only the cases $\sum_{i=1}^{n} d_i = 0$ and $\sum_{i=1}^{n} d_i = 2$, that is, $f(y) = 1$ and $f(y) = y_1^2$, need to be considered, and the chosen volume points and the corresponding weights must integrate these two monomials exactly. The formula is as follows:
$$2 n w = \int_{U_n} d\sigma(y) = A_n, \qquad 2 w u_1^2 = \int_{U_n} y_1^2\, d\sigma(y) = \frac{A_n}{n}, \tag{21}$$
in which $A_n = 2\sqrt{\pi^n}/\Gamma(n/2)$ is the surface area of the unit sphere of dimension $n$ and $\Gamma(n) = \int_0^\infty x^{n-1} \exp(-x)\, dx$. Solving formula (21) gives $w = A_n/(2n)$ and $u_1^2 = 1$, so the volume points are the intersections of the unit sphere with each axis, that is, the point set $[1]$:
$$\int_{U_n} f(y)\, d\sigma(y) \approx \frac{A_n}{2n} \sum_{i=1}^{2n} f\bigl([1]_i\bigr). \tag{22}$$
For the phase-path integral, set $r = \sqrt{t}$; through the integral conversion it becomes
$$\int_0^\infty S(r)\, r^{n-1} \exp(-r^2)\, dr = \frac{1}{2} \int_0^\infty S(\sqrt{t})\, t^{n/2 - 1} \exp(-t)\, dt.$$
According to the rules of the phase-path volume integral, the integral is exact when $S(\sqrt{t}) = 1$ or $t$, that is, when $S(r) = 1$ or $r^2$. In this case the selected integral point and weight are $r_1 = \sqrt{n/2}$ and $w_1 = \Gamma(n/2)/2$, respectively, so that
$$\int_0^\infty S(r)\, r^{n-1} \exp(-r^2)\, dr \approx \frac{\Gamma(n/2)}{2}\, S\!\left(\sqrt{\frac{n}{2}}\right). \tag{26}$$
Merging (22) and (26), we get the approximation of the integral of formula (18):
$$I(f) \approx \frac{\sqrt{\pi^n}}{2n} \sum_{i=1}^{2n} f\!\left(\sqrt{\frac{n}{2}}\, [1]_i\right). \tag{27}$$
The phase-path volume integral is then used in place of the Gaussian integral in the Kalman algorithm. In the nonlinear system considered by the Kalman algorithm, formula (28), $a_k$ denotes the first $r$ elements of the $n$-dimensional state $x_k$; that is, $x_k = [a_k^T, b_k^T]^T$. If the statistical properties $(\hat{x}_{k-1}, P_{k-1})$ of $x$ at time $k - 1$ are known, the filtering equations of the Kalman algorithm are formulas (29)∼(32), in which $\hat{x}_{k/k-1}$ and $P_{k/k-1}$ are the Gaussian integrals of formula (33). In the standard Kalman algorithm, the number of sampling points on the state $x$ is $2n$, because $x_{k-1}$ is an $n$-dimensional vector; if the phase-path volume rule is used to approximate formula (33), only $2r$ sampling points of $a_{k-1}$ are needed. The reduced dimension Kalman filtering algorithm is then constituted by combining these approximations with formulas (29)∼(32).
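A short numerical check can make the point rule concrete. The sketch below assumes the standard third-degree rule for integrals of the form $\int f(x)\exp(-x^T x)\,dx$: $2n$ points placed on the coordinate axes at distance $\sqrt{n/2}$ from the origin, each with weight $\sqrt{\pi^n}/(2n)$. The function names `cubature_points` and `cubature_integral` are illustrative, not taken from the paper.

```python
import math


def cubature_points(n):
    """Return the 2n third-degree points and their common weight for
    integrals of f(x) * exp(-x^T x) over n-dimensional real space."""
    radius = math.sqrt(n / 2.0)                # radial point sqrt(n / 2)
    weight = math.pi ** (n / 2.0) / (2 * n)    # sqrt(pi^n) / (2n)
    points = []
    for i in range(n):
        for sign in (1.0, -1.0):
            p = [0.0] * n
            p[i] = sign * radius               # scaled generator [1]
            points.append(p)
    return points, weight


def cubature_integral(f, n):
    """Approximate the Gaussian-weighted integral of f with 2n points."""
    points, weight = cubature_points(n)
    return weight * sum(f(p) for p in points)
```

The rule reproduces the integrals of $1$ and $x_1^2$ exactly and gives zero for odd monomials. The number of function evaluations is twice the dimension passed in, so applying the rule only to the $r$ components that enter the nonlinearity needs $2r$ evaluations instead of $2n$.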

Camshift Algorithm Based on Kalman Prediction Estimation.
The basic idea of the Camshift algorithm based on Kalman prediction estimation is to use the optimal estimate of the target centroid in the current frame $k$ as the initial position of the Camshift search window in the next frame, and then to correct the predicted value by using the centroid obtained from the Camshift iteration as the observation value $Z_k$ of the improved Kalman filtering algorithm, so as to obtain the optimal estimate of the target centroid position in the next frame. Predicting the target centroid position effectively handles fast target motion, ensuring that the Camshift search window keeps up with the movement of the target and improving the tracking effect.
Definition 1. The state vector $X_k$ of the Kalman filtering algorithm is $(x_k, y_k, v_x, v_y)^T$, in which $x_k$ and $y_k$ represent the target centroid position and $v_x$ and $v_y$ represent the velocity of the target in the $x$ and $y$ directions, respectively. The observation vector is defined as $Z_k = (x_{ck}, y_{ck})^T$, in which $x_{ck}$ and $y_{ck}$ represent the target centroid position obtained with the Camshift algorithm. Because the time interval between two frames of the image sequence is short, the motion within a unit time interval is regarded as uniform, and the transition matrix $A_k$ is defined as
$$A_k = \begin{bmatrix} 1 & 0 & \Delta t & 0 \\ 0 & 1 & 0 & \Delta t \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix},$$
in which $\Delta t$ is the time interval between $t_k$ and $t_{k-1}$. From the system state and the observation state, the observation matrix is
$$H = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \end{bmatrix}.$$
The estimation of the centroid position of moving targets with the improved Kalman filtering algorithm in Camshift tracking can be roughly divided into three stages, as follows.
(1) Initialization of the Kalman Filtering Algorithm. Initialize the improved Kalman filtering algorithm: set $X_0$ from the target centroid position and moving speed obtained by the moving target detection algorithm, and record the current time.
(2) Centroid Position Estimation. When processing the current frame, calculate the time interval $\Delta t$ from the previous frame and substitute it into the transition matrix $A_k$. Run the Camshift iteration with the location $(x_{k-1}, y_{k-1})$ of $X_{k-1}$ as the center of the search window, and then take the final location found as the observed value $Z_k = (x_{ck}, y_{ck})^T$ used to correct the predicted value.
(3) Location Update. Substitute the state vector $X_{k-1}$ and the observation vector $Z_k$ into the recursive formulas of the improved Kalman filtering algorithm to obtain the optimal estimate $\hat{X}_k$ of the target centroid position in the current frame, and record the time of the current frame and the target centroid position.
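The three stages above can be sketched with an ordinary linear Kalman filter for the uniform-motion model; this is a simplified stand-in, not the reduced dimension filter of this paper. Several things are assumptions made for illustration: the Camshift centroids are supplied as a precomputed list rather than produced by a real search, a frame in which the search fails (for example during occlusion) is marked `None` and handled by prediction only, and the noise covariances `q` and `r` and the initial covariance are arbitrary values. All function names are hypothetical.

```python
def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def transpose(a):
    return [list(col) for col in zip(*a)]


def add(a, b, sign=1.0):
    return [[x + sign * y for x, y in zip(r1, r2)] for r1, r2 in zip(a, b)]


def inv2(m):
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


# Observation matrix: only the centroid position is observed
H = [[1.0, 0.0, 0.0, 0.0],
     [0.0, 1.0, 0.0, 0.0]]


def transition(dt):
    """Uniform-motion transition matrix for the state (x, y, vx, vy)."""
    return [[1.0, 0.0, dt, 0.0],
            [0.0, 1.0, 0.0, dt],
            [0.0, 0.0, 1.0, 0.0],
            [0.0, 0.0, 0.0, 1.0]]


def predict(x, p, dt, q):
    """Time update: x is a 4x1 column, p a 4x4 covariance."""
    a = transition(dt)
    return matmul(a, x), add(matmul(matmul(a, p), transpose(a)), q)


def update(x_pred, p_pred, z, r):
    """Measurement update with the observed centroid z (2x1 column)."""
    ht = transpose(H)
    s = add(matmul(matmul(H, p_pred), ht), r)        # innovation covariance
    k = matmul(matmul(p_pred, ht), inv2(s))          # Kalman gain (4x2)
    innovation = add(z, matmul(H, x_pred), sign=-1.0)
    x_new = add(x_pred, matmul(k, innovation))
    p_new = add(p_pred, matmul(matmul(k, H), p_pred), sign=-1.0)
    return x_new, p_new


def track(observations, dt=1.0):
    """Filter a list of (x, y) centroids; None marks a missing observation."""
    first = observations[0]
    x = [[first[0]], [first[1]], [0.0], [0.0]]       # stage (1): initialization
    p = [[10.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
    q = [[0.01 if i == j else 0.0 for j in range(4)] for i in range(4)]
    r = [[1.0, 0.0], [0.0, 1.0]]
    estimates = [(x[0][0], x[1][0])]
    for z in observations[1:]:
        x, p = predict(x, p, dt, q)                  # stage (2): prediction
        if z is not None:
            x, p = update(x, p, [[z[0]], [z[1]]], r)  # stage (3): update
        estimates.append((x[0][0], x[1][0]))
    return estimates
```

In a full tracker, the predicted position returned by `predict` would seed the Camshift search window for the current frame, and the centroid found by that search would be passed to `update`.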

Algorithm Performance Test
First, to verify the effectiveness of the improved algorithm proposed in this paper, the improved Kalman filtering algorithm is tested: 100 Monte Carlo simulations are run under the same conditions with the standard Kalman filtering algorithm and the reduced dimension Kalman filtering algorithm (IM-Kalman). The software used is Matlab, and the computer has a Core i7-3610 processor with a main frequency of 2.3 GHz and 4 GB of memory. The mean absolute error curves of the two filtering algorithms for state estimation are shown in Figures 4∼6. For single-frame image processing, the standard particle filter (PF) algorithm, the standard Kalman filter algorithm, and the IM-Kalman filter algorithm are compared at particle numbers of 50, 100, and 150, and the results are shown in Tables 1 and 2.
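For reference, a mean absolute error curve over repeated Monte Carlo runs can be computed as in the sketch below. It assumes the usual definition, the absolute estimation error at each time step averaged over all runs, which may differ in detail from the evaluation script used for the figures. The function name is hypothetical.

```python
def mean_absolute_error_curve(truth, runs):
    """Mean absolute error per time step, averaged over Monte Carlo runs.

    truth : list of true values, one per time step
    runs  : list of runs, each a list of estimates, one per time step
    """
    return [sum(abs(run[k] - truth[k]) for run in runs) / len(runs)
            for k in range(len(truth))]
```

Each state component (for example the two position coordinates) is passed through this function separately to obtain one curve per component.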
It can be seen from Table 1 that when the particle number is 50, the time used by the three algorithms for single-frame image processing does not differ greatly, although the improved Kalman filtering algorithm proposed in this paper uses slightly less time than the other two. When the particle number increases to 100, the processing speed of the PF algorithm and the standard Kalman filtering algorithm becomes significantly slower, while the speed of the IM-Kalman filtering algorithm changes only slightly. When the particle number increases to 150, the processing speed of the IM-Kalman filtering algorithm proposed in this paper is much higher than that of the PF algorithm and the standard Kalman filtering algorithm.
As can be seen from Table 2, when the particle number is 50 there is already a certain gap among the state estimation accuracies of the three algorithms, and the accuracy of the IM-Kalman filtering algorithm is much higher than that of the other two. When the particle number is increased to 100, the precision of the PF algorithm and the standard Kalman filtering algorithm decreases slightly, but there is no significant reduction for the IM-Kalman filter algorithm. When the particle number increases to 150, the estimation accuracy of the PF algorithm and the standard Kalman filtering algorithm falls below 1, while that of the IM-Kalman filter algorithm decreases only a little.
The simulation results show that the IM-Kalman algorithm has better estimation precision and uses less time. Next, the standard Camshift algorithm and the improved Camshift algorithm are used to track targets in four 100-frame video sequences: human motion without obstructions (Test 1), human motion with obstructions (Test 2), vehicle motion without obstructions (Test 3), and vehicle motion with obstructions (Test 4). The results are as follows.
(1) Human Body Target Tracking. The human body target is tracked first, with and without obstructions. Figure 7 shows the tracking result of the standard Camshift algorithm without obstructions, and Figure 8 shows the result of the improved Camshift algorithm in the same case. Figure 9 shows the tracking result of the standard Camshift algorithm with obstructions, and Figure 10 shows the result of the improved Camshift algorithm in the same case.
The above experimental results show that the improved Camshift algorithm put forward in this paper performs better than the standard Camshift algorithm in human body tracking with obstructions.
(2) Vehicle Target Tracking. The vehicle target is then tracked, with and without obstructions. Figure 11 shows the tracking result of the standard Camshift algorithm without obstructions, and Figure 12 shows the result of the improved Camshift algorithm in the same case. Figure 13 shows the tracking result of the standard Camshift algorithm with obstructions, and Figure 14 shows the result of the improved Camshift algorithm in the same case.
The processing times of the two algorithms on the four video sequences are shown in Table 3. The above experimental results show that the improved Camshift algorithm proposed in this paper performs better than the standard Camshift algorithm in vehicle tracking with obstructions, improving the efficiency and accuracy of intelligent robot target tracking in complex environments. The comparison of processing times on the four video sequences shows that the improved Camshift algorithm also uses less time than the standard algorithm.

Conclusion
This paper proposes a target tracking algorithm that integrates the Camshift algorithm with a reduced dimension optimal Kalman filtering algorithm based on the phase-path volume integral. The algorithm reduces the dimension of the Kalman filtering algorithm, corrects the predicted values by using the target centroid position as the observation value of the improved Kalman filtering algorithm, and thus obtains the optimal estimate of the target centroid position. When the target is obstructed, it can detect the target centroid position accurately, handle transient target occlusion, improve the running speed of the intelligent robot system, and achieve higher efficiency and robustness in the tracking process.

Figure 1: The size and the deflection angle of the tracking target region.
Performance test of the three algorithms: ten different image sequences are used to test the target tracking of the above three algorithms, and 100 observation sequences are extracted from each image sequence to check the accuracy of target tracking. The test results for the five image sequences without obstructions are shown in Figure 2.

Figure 2: The performance comparing results of three algorithms without obstructions.

Figure 3: The performance comparing results of three algorithms with obstructions.

Figure 6: Mean absolute error of  axial position.

Table 1: The comparison results of used time in single frame image processing.

Table 2: The comparison results of state estimation accuracy.
slightly.When the particle number increases to 150, the processing speed of IM-Kalman filtering algorithm proposed

Table 3: The comparison results of processing time on four video sequences with two algorithms.