Modeling of Temperature Effect on Modal Frequency of Concrete Beam Based on Field Monitoring Data

Temperature variation has been widely demonstrated to produce significant effect on modal frequencies that even exceed the effect of actual damage. In order to eliminate the temperature effect on modal frequency, an effective method is to construct quantitative models which accurately predict the modal frequency corresponding to temperature variation. In this paper, principal component analysis (PCA) is conducted on the temperatures taken from all embedded thermocouples for extracting input parameters of regression models. Three regression-based numerical models using multiple linear regression (MLR), back-propagation neural network (BPNN), and support vector regression (SVR) techniques are constructed to capture the relationships between modal frequencies and temperature distributions from measurements of a concrete beam during a period of forty days of monitoring. A comparison with respect to the performance of various optimally configured regression models has been performed on measurement data. Results indicate that the SVR exhibits a better reproduction and prediction capability than BPNN and MLR models for predicting the modal frequencies with respect to nonuniformly distributed temperatures. It is succeeded that temperature effects on modal frequencies can be effectively eliminated based on the optimally formulated SVR model.


Introduction
With the development of advanced techniques in sensing, data acquisition, computing, and information management, structural health monitoring (SHM) systems have been widely implemented to diagnose structural condition [1][2][3].Vibration-based damage identification, as an important part of SHM system, has been most widely investigated because of their capability for accurately identifying structural damage [4].The basic concept underlying the use of vibration-based damage identification is that vibration properties (natural frequencies, mode shapes, and damping ratio) are functions of the physical properties (mass, stiffness, and damping) of structure [5,6].The measured changes in dynamic parameters can be used to evaluate corresponding changes in physical properties that indicate the structural damage.However, dynamic parameters are inevitably affected by environmental and operational conditions (temperature, humidity, wind, traffic load, etc.).Changes of dynamic parameters caused by environment may lead to a false damage identification result [7][8][9][10].
Modal frequency is the most widely used dynamic parameter to identify structural damage because it is easy to be measured with high precision [11].Extensive researches have demonstrated that temperature change is the most important source that causes the variation in modal frequencies of structures [12].Researchers from Los Alamos National Laboratory find that the first three frequencies of Alamosa Canyon Bridge vary about 4.7%, 6.6%, and 5.0% during 24 hours, in which the temperature of the bridge deck changes by approximately 22 ∘ C [13,14].Peeters et al. [15,16] report that the first four modal frequencies of Z24 Bridge vary by 14%-18% during 10-month monitoring period, and this variation is more significant than the change of 10% caused by destructive damage.Desjardins et al. [17] make a continuous monitoring for the modal frequencies and average girder temperature of Confederation Bridge.The modal frequencies are reduced by 4% when the temperature varies from −20 ∘ C to 25 ∘ C. Askegaard and Mossing [18] continuously monitor a three-span reinforced concrete (RC) bridge for three years, and the seasonal changes of modal frequencies reach as much as 10%.Yuen and Kuok [19] extract the modal frequencies of a 22-storey RC building for one year using Bayesian spectral density approach.They find that the first three modal frequencies increase with an increase in ambient temperature.Chen et al. [20] explore the correlation between modal frequencies of Guangzhou New TV Tower and air temperature through more than 100 h measurement.Results show that modal frequencies are linearly dependent on air temperature.Saisi et al. [21,22] present the results of continuous dynamic monitoring for Gabbia Tower in Italy during a period of 8 months.Identified natural frequencies are observed to vary by 5-11% when the measured temperatures range from 2 ∘ C to 45 ∘ C. Ubertini et al. [23] monitor the modal frequencies of San Pietro Bell Tower during more than nine-month period.Temperature variation produces significant changes in natural frequencies, up to 16 MHz/ ∘ C, while effects of air humidity were relatively marginal.As indicated by the prior researches, temperature significantly affects the modal frequencies.In addition, no such temperature dependence has been observed for the mode shapes and damping ratio, and hence the temperature effect on them can be generally ignored [12,24].For the reliable performance of vibration-based damage identification, it is of paramount importance to eliminate and discriminate the variations in modal frequencies due to temperature change from those caused by structural damage.
In order to eliminate the temperature effect on modal frequency, quantitative models between them are required to normalize the identified modal frequencies to an identical reference temperature [25][26][27].Xia et al. [24] propose a simple linear regression model to correlate the air temperatures and modal frequencies of a RC slab based on laboratory monitoring data.Peeters and De Roeck [15] derive an autoregressive and moving average (ARMA) model to formulate the relationship between air temperature and modal frequencies for the Z24 Bridge.Moser and Moaveni [28] utilize several models (a static linear model, an ARX model, a bilinear model, and polynomials with various orders) to represent the relationship between the modal frequencies and measured temperatures.Ding and Li [29] propose a polynomial regression model to describe the frequency-temperature seasonal correlations of the Runyang Suspension Bridge.Ni et al. [30,31] apply the support vector machine (SVM) and backpropagation neural network (BPNN) techniques to formulate regression models that quantified the temperature effect on modal frequencies of the cable-stayed Ting Kau Bridge.These studies mentioned above have proposed methods for predicting the modal frequencies of bridges, but none has compared the prediction accuracy of multiple linear regression (MLR), BPNN, and SVM methods.In addition, these regression models mainly focus on the relationships between modal frequencies and air temperature or structural temperatures measured at some surface points.They ignore the nonuniformly distributed temperatures in the cross-section of structure, which may lead to information loss in modeling the temperature effect on modal frequencies [32,33].
In this paper, three regression models of predicting modal frequencies corresponding to nonuniformly distributed temperatures are built on measurements from a concrete beam during 40-day monitoring period.Prediction capabilities are compared in order to select the optimal model for eliminating temperature effect on modal frequency.The rest of the paper is organized as follows.The various regression algorithms that are adopted to predict the modal frequencies are first outlined.A concrete beam is constructed and the details are described.Principal component analysis (PCA) is performed to extract principal components from the measured temperatures.And the quantitative models using MLR, BPNN, and SVR are constructed by use of extracted principal components (PCs) of temperatures.Prediction capabilities of constructed regression models are studied and examined on training and test samples.Selected optimal model is later used to remove the variability of identified modal frequencies due to temperature effect.Lastly, results are summarized with important conclusions.

Theoretical Background
Modal frequency is directly related to the temperature distribution across the structure.This research proposes to employ internal distributed temperature measurements to predict the modal frequency of a concrete beam.The methodology is outlined in the form of a flowchart in Figure 1.Temperature measurements collected from monitoring are first preprocessed by PCA for dimensionality reduction.The PCs of temperatures are then supplied as input to statistical regression techniques to compute regression models.Prediction capabilities of regression models are examined and compared based on statistical indicators.The regression model with the best prediction accuracy is then used to predict modal frequency from collected temperatures, which have been preprocessed by PCA.Temperature effect on modal frequency is successfully eliminated using the regression model with the best prediction accuracy.

Principal Component Analysis (PCA).
PCA is multivariate statistical tool that takes advantage of inherent correlations between variables for dimensionality reduction and feature extraction.It is a linear transformation mapping an original set of variables into a substantially smaller set of uncorrelated variations that represents most of information in original set of variables [3,34].Using PCA, original set of correlated variables  ∈   in an -dimensional space can be transformed into a new set of uncorrelated variables  ∈   in an -dimensional ( < ) orthogonal space by the application of the following equation: where Γ( × ) is a transformation matrix that applies an orthogonal rotation to the original coordinate system.Through the singular value decomposition for the covariance matrix of original variables , we can obtain where  is the orthogonal eigenvector matrix with    =  and Λ is a diagonal matrix composed of singular values as follows: where the singular values rank in descending order  1 ≥  2 ≥ ⋅ ⋅ ⋅ ≥   ≥ 0. They represent the variances of principal components, and the small singular values are not relevant to explain the overall variance of data set.The proportion of original variables explained by the first  principal components is defined as where   is accumulated variance contribution rate and decides the number of selected principal components.Generally, if   × 100% ≥ 85%, the transformation matrix Γ could be obtained by the first  column vectors in orthogonal eigenvector matrix .Once  has been chosen and orthogonal transformation matrix Γ has been determined, variables in principal space can be calculated by (1).

Multiple Linear Regression (MLR).
MLR is an extension of simple linear regression for the purpose of predicting dependent variables by multiple explanatory variables [35].When a dependent variable  is linearly related to  explanatory variables, the general form of MLR model can be expressed as where  is the predicted value of dependent variable,  0 is the intercept and ( 1 ,  2 , . . .,   ) are regression coefficients associated with the explanatory variables ( 1 ,  2 , . . .,   ), and  is random error with mean zero and variation  2 .Based on the data from  measurements, unknown regression coefficients can be determined using least-squares method.In this study,  represents the modal frequency at specific time, and ( 1 ,  2 , . . .,   ) represent PCs extracted from measured temperatures in cross-section at mid-span of concrete beam.

Back-Propagation Neural Network (BPNN).
Artificial neural network (ANN) is a functional abstraction from biologic neural structure, which can process complex nonlinear relationships among several variables through learning [36].
Therefore, it provides a powerful tool for modeling the relationship between modal frequencies and distributed temperatures.As one of the widely used ANN structures, BPNN is established through forward transfer of information and back-propagation of training error.The biases and weights are constantly adjusted to minimize target error through gradient descent algorithm.An evaluator, the sum of square error (SSE) between actual and target outputs, is taken as the objective function of BPNN model, as shown in where   and   are the actual and target output of th node in output layer for th pattern, respectively,  is the number of outputs, and  is the number of patterns.
The typical two-layer BPNN contains an input layer, hidden layer, and output layer.The transfer function for hidden layer is taken as a tan-sigmoid function and that for the output layer is a linear function.In this research, BPNN is simulated using MATLAB's neural network toolbox, and the "traingdx" and "learngd" functions are chosen as training function and learning function, respectively [37].An important parameter in BPNN is the optimal number of nodes in the hidden layer.It is optimally determined through trials and validation errors.In order to avoid the underfitting and overfitting phenomena, an early stopping technique is employed.Training process is stopped when the errors on validation data increases for a specific number of iterations.The parameters (weights and biases) of BPNN model are determined as those associated with the minimum of validation error.

Support Vector Regression (SVR)
. Support vector machine (SVM) is a newly emerging learning technique following the structural risk minimization (SRM) principle rather than the common empirical risk minimization (ERM) principle.It transforms sample data to a higher dimensional feature space and defines the optimal linear hyperplane to minimize the upper bound on the generalization error [3,30,38].SVR refers to the regression model of SVM.It is to transform the nonlinear relationship in original space into linear relationship in a feature space so as to discover relationship more easily.
Consider a set of training data  = {(x 1 ,  1 ), (x 2 ,  2 ), . . ., (x  ,   )} ∈   × , where x  is a -dimensional input vector and   is the corresponding scalar output in original space.For linear regression problem in the feature space, the linear estimation function is described as where w and  are weights and thresholds, respectively,  is the mapping function transforming input vector to feature space, and ⟨ * , * ⟩ denotes the inner product.
The SRM principle is adopted in SVR to avoid overfitting and improve generalization performance.The optimization object is set as Subject to where  is loss function,  is penalty coefficient, and   ,  *  are slack variables.
Solution of ( 8) under constraints of ( 9) is achieved by introducing the Lagrange multipliers and using the duality principle.
where   ,  *  are the Lagrange multipliers, which can be obtained from the above optimization problem.The weight vector w can be solved and written as For nonlinear regression problems, SVR transforms data to a high-dimensional feature space by a kernel function.The linear SVR algorithm conducted in feature space represents the nonlinear SVR operation in original space.Inner product in the feature space calculated using a kernel function is expressed as Radial-basis kernel function (RBF) is a reasonable choice of kernel functions since it equips with more flexible and fewer parameters.It is applied and listed as follows: Therefore, the nonlinear regression function can be calculated and expressed by In the formulation of SVR model, selection of hyperparameters (, , ) is crucial to improve the generalization ability and prediction accuracy.Grid search method is applied to optimize the hyperparameters.For each combination of the hyperparameters, SVR is trained using the training data and their performance is evaluated by a cross-validation scheme.Optimal hyperparameters are determined to construct the SVR model for modeling the relationship between modal frequencies and nonuniformly distributed temperatures.

Prediction Capability Evaluation Index.
The prediction capabilities of formulated models (MLR, BPNN, and SVR) are examined and compared using training and test set.Prediction error (PE) is proposed to reflect the difference between target and prediction values.PE is defined as where   is identified modal frequency and f is predicted modal frequency.
In order to quantify and rank the performances of formulated models, two statistical indicators including root mean squared error (RMSE) and correlation coefficient () are used to quantitatively evaluate the performance of regression models, which are expressed by (16), and  is a numerical value between −1 and 1, which illustrates the relationship between target and predicted values.A high  value close to 1 indicates a strong positive correlation between target and predicted values, which demonstrates good generalization capability of models.RMSE represents the root mean square of differences between actual and predicted values.It can be also used to evaluate forecasting accuracy of models.
where  is the number of sample data, cov(, f) represents the covariance between target value  and predicted value f, and (), ( f) are standard deviations of  and f, respectively.at each end.The RC beam was produced on 8 June 2015 and was installed on 12 September 2015.During this period, the hydration reaction is sufficiently completed, and shrinkage and creep of concrete would produce negligible effect on measurement results.Considering that the nonuniform temperature is primarily distributed in cross-section, a total of 14 type T thermocouples are embedded in the cross-section at the mid-span of the beam to monitor the temperature field.Figure 3 illustrates the deployment of thermocouples at mid-span."V" represents thermocouples along vertical direction, while "H" represents the thermocouples along horizontal direction.In order to fix the thermocouples in accurate position, preformed concrete units containing type T thermocouples are located at predetermined position before pouring concrete.Type T thermocouples are made by copper and constantan, and measurement range of them is −250 to 260 ∘ C. A TP700 multichannel data recorder is employed to sample data collected by type T thermocouples.

Case Study
It features an auto-zero channel, a cold-junction compensator, and automatic voltage-temperature conversions for common thermocouples.Due to the operating temperature ranging from 0 to 50 ∘ C, it is installed in laboratory to ensure the accuracy of measurement.Type T thermocouples are connected to TP700 data recorder using shielded thermocouple compensation lead [39].
As for the modal testing, two DH131E piezoelectric accelerometers are used to acquire the acceleration response under impact excitation from a rubber hammer at 7/10 span length from left end along vertical direction.The 1 # accelerometer is placed at mid-span and 2 # accelerometer at 3/10 span length from left end.DH131E accelerometers feature a sensitivity of 1 mV/g, a frequency range of 1-8000 Hz, small size (10 × 16 mm), and light weight (5.5 g).The operating temperature ranging from −40 to 80 ∘ C makes it ideal for outdoor application.The magnetic bases are fixed on the upper surface of RC beam using metal/concrete epoxy, and accelerometers are mounted on the magnetic bases.A DH5922 type dynamic signal measurement and analysis system is used to measure and analyze acceleration response.It includes sixteen 24-bit Integrated Electronics PiezoElectric (IEPE) input channels and supports sampling rates of up to 51.2 kHz.Antialiasing filters and time-base export for tight synchronization between channels are equipped.The DH5922 system has an operating temperature range of 0 to 40 ∘ C, and it is placed in laboratory.Accelerometers are connected with data acquisition device using L5 coaxial extension cables with lengths of 15 m.DH5922 system samples a 16-second data from the two acceleration channels at a 5120 Hz sampling rate.Data processing is performed by DHDAS-2013 software platform, which is an important part of DH5922 system.Firstly, the recorded sample data is bandpass-filtered between 10 and 1000 Hz using a finite impulse response filter.Secondly, Hamming window with 50% overlap is used to intercept acceleration signal.The number of spectrum lines is set as 6400, and obtained frequency resolution is 0.156 Hz.Finally, modal frequencies are identified and extracted by frequency spectrum analysis using fast Fourier transform (FFT).For example, samples of data collected from accelerometers installed on the RC beam on March 6, 2016, at 8:00 am are shown in Figure 4.Both the time history of acceleration signal and corresponding amplitude spectrum are illustrated.
Vibration tests were carried out with two hour intervals from 8:00 am to 22:00 pm in everyday monitoring, while temperatures are measured at an interval of 5 minutes.The continuously measured temperatures are used to analyze the temperature variation and its nonuniform distribution.In this study, temperatures corresponding to vibration rests are selected and employed to explore the correlation between temperature and modal frequency and construct regression models between them.

Temperature Effect on Modal
Frequencies.Due to the restrictions of measurement equipment, RC beams are discontinuously monitored.Measurement data have been collected during the period beginning on 20 September 2015 and ending on 29 August 2016.In this research, measurement data from forty days of monitoring since 19 January 2016 covering winter (minimum temperature) and summer (maximum temperature) are used.During this period, measurements are carried out under weak wind condition (hourly-average wind speed less than 3 m/s).Therefore, wind speed effect on modal frequency can be ignored.
Under the influence of solar radiation, air temperature, and thermal inertia of concrete, temperature distributions in the RC beam are usually nonuniform and nonlinear.In order to specifically exhibit the nonuniform temperature distributions in beam, temperatures measured on 27 June 2016 at 14:00 pm are shown in Figure 5.One can find that the temperature distributions along vertical direction and horizontal direction are nonuniform and nonlinear.This indicated that it is important to consider the nonuniformly distributed temperatures in modeling the relationship between modal frequencies and temperatures.
A total of 320 sets of modal frequencies and temperature data from January to August in 2016 are obtained.The measured data samples are sorted in measurement time order.Figure 6 presents the identified modal frequencies and measured average temperature during monitoring period.As can be seen, significant negative correlations exist between modal frequencies and temperature.The modal frequencies decrease with average temperature increasing.Statistical analysis for the variation of modal frequencies during monitoring period is summarized in Table 1.Average temperature variation ranging from −22 ∘ C to 37 ∘ C accounts for the change in modal frequencies of 14.29% to 41.70% in the relative sense for the first four modes.And the variation coefficient varies from 4.05% to 11.48%.For the second modal frequency, it is combined action from vertical vibration and torsional vibration interfered.The higher relative variation (41.70%) and variation coefficient (11.48%) could be caused by the couple effect of vertical vibration and torsional vibration.As listed in Table 1, significant changes of first four modal frequencies demonstrate the necessity to eliminate temperature effect.

Performance Comparison between Regression Models
4.1.Formulation of Regression Models.Three regression algorithms presented in previous section are applied to predict modal frequencies of RC beam, and prediction accuracy of models becomes the utmost concern.Table 1 indicates that the ranges of variation of modal frequencies vary with modes.
If models are configured to accommodate all the modes, corresponding reproduction and prediction capabilities will be reduced [40].Therefore, an individual model is developed for each mode separately, which will improve the accuracy of predicting and eliminating temperature effect on modal frequencies.Measurement data from 40 days of monitoring on RC beam are divided into three nonoverlapping and independent data sets: a training set of 50% (160 sets of data from 20 days of monitoring), a validation set of 20% (64 sets of data from 8 days of monitoring), and a test set of 30% (96 sets of data from 12 days of monitoring).Training set is used to construct and train the regression models, validation set to optimize the models, and test set to check their prediction accuracy.The detailed partition is illustrated in Figure 7 taking the first modal frequency, for example.Training set covers a complete temperature range, which is necessary to contain the limitation of prediction.Validation set and test set are uniformly distributed in the range of training set.This partition is helpful to improve the accuracy of prediction models.
In multivariate regression, the highly correlated data could produce unstable regression estimates [3,35].Internal temperatures of RC beam measured by thermocouples embedded in the cross-section at mid-span are highly correlated.In this research, PCA technique is employed to extract the PCs of temperatures taken from all thermocouples over   the monitoring period, and the transformed data is then given as input to the regression models.The 320 sets of temperatures measured by 14 thermocouples are analyzed by PCA.Variance contribution rate of the first four PCs accounts for 99.99% of total variance, and hence the extracted four PCs of temperatures are applied to construct regression models for accurately predicting modal frequencies.
The relationship between modal frequencies and PCs of temperatures are first formulated for each mode by MLR.Training data set is used to build MLR models and calculate regression coefficients using the least square method.For the formulation of BPNN model, an early stopping technique is employed to optimize the BPNN model and avoid overfitting caused by unreasonable performance goal.With intent to determine the optimal number of hidden nodes, BPNN models with different number of hidden nodes are trained by the early stopping technique using training data.The optimal number of hidden nodes is determined so that the validation error reaches the minimum value.Optimally configured BPNN model is selected as the one with 5 hidden nodes for the first vibration mode.Similarly, the optimal numbers of hidden nodes for the other modes are determined as 4, 4, and 5, respectively.SVR modeling is also carried out individually for each mode using the LIBSVM toolbox in MATLAB [41].A grid search method is used to determine the optimal hyperparameters (, , ).The bounds on the hyperparameters  and  are set to vary from 2 −10 to 2 10 , and  is set to vary from 0 to 10 −3 .SVR models are built by the training data set using fivefold cross-validation scheme.

Reproduction Capability.
PEs between identified and reproduced modal frequencies are calculated on training set and are evaluated by the use of histograms.Figure 8 presents the histograms of PEs generating from the three regression models.As can been seen, the PEs generated by BPNN and SVR model are concentrated in narrower range, and the observed probability distribution is in good agreement with a normal distribution with zero mean.It indicates that the reproduction capacities of formulated BPNN and SVR models are excellent compared to MLR model.Additionally, the PEs from SVR model are smaller than BPNN model, and the distributions are concentrated in zero more significantly, which demonstrates the outstanding reproduction capability of SVR model.
RMSE and  values between target and reproduced modal frequencies for the first four modes are listed in Table 2.It is observed that the MLR model generates the highest RMSE among the three models, while the SVR model achieves the lowest.In addition,  values of the SVR model are larger than those of MLR and BPNN models.This reveals that SVR model presents a stronger linear relationship between reproduced and identified modal frequencies.Based on above comparisons, SVR model has higher accuracy in reproducing the training data and the reproduction capability rank with a descending order of SVR, BPNN, and MLR.

Prediction Capability.
The prediction capacities of formulated models are verified using testing data set.Histograms of PEs generating from the three regression models  are illustrated in Figure 9.It presents that the PEs generated by BPNN and SVR model are concentrated in narrower range.And the observed probability distribution is in good agreement with a normal distribution with zero mean.Similar to reproduction capability, BPNN and SVR models possess better prediction accuracy than MLR model because of less prediction error.
RMSE and  values of three regression models for all the modes are listed in Table 3.It can be seen that RMSE values of MLR, BPNN, and SVR models rank with a descending order of MLR, BPNN, and SVR.The SVR model with minimal RMSE value performs higher prediction accuracy than the BPNN and MLR models.Moreover,  values of SVR model are larger than those of other models, which imply that the performance of SVM models is excellent in predicting the modal frequencies under the nonuniformly distributed temperatures.

Eliminating Temperature Effect on Modal Frequency
The main purpose of constructing accurate regression model is to eliminate the temperature effect on the modal frequencies and to normalize all the modal frequencies to a set of reference temperature.Comparing to the MLR and BPNN regression models, the SVR model exists with better capability for predicting modal frequencies of RC beam in this research.The established SVR model above is used to eliminate the temperature effect on modal frequencies.Firstly, the modal frequencies at the reference temperature of 20 ∘ C are identified.Then the normalized modal frequencies after removing environmental effect can be obtained by where  reference is the modal frequencies at reference temperature,  predicted is the modal frequencies predicted by SVR regression model, and  identified represents the modal frequencies identified at different temperature.Figure 10 illustrates the PEs of first four modal frequencies produced by SVR regression model.It can be clearly observed that the variation of PEs is around zero (the small difference is due to the measurement error, prediction error, and other noise).This indicates that the seasonal variation of modal frequencies is successfully eliminated.Histograms of modal frequencies before and after correction by SVR regression model are plotted to demonstrate the reduction of the variation of modal frequencies.Figure 11 presents the histograms   of first four modal frequencies during the period of 40 days of monitoring.It is clear that the modal frequencies are concentrated in narrower range with normal distributions after the application of SVR regression model.Standard deviation and variation coefficient of first four modal frequencies before and after eliminating temperature effect are listed in Table 4.As can be seen, standard deviations of modal frequencies after eliminating temperature effect are only 25% of standard deviations before eliminating temperature effect.Maximum variation coefficient is reduced from 11.48% to 2.05% after eliminating temperature effect.This verifies the effectiveness of SVR regression model in eliminating temperature effect on modal frequencies.

Conclusions
In this paper, three regression models are constructed to predict the modal frequencies of a concrete beam caused by temperature change in seasonal cold region.The prediction capabilities of formulated MLR, BPNN, and SVR models are evaluated and compared.The following conclusions can be obtained.
(1) During the monitoring period, average temperature variation in RC beam ranging from −22 ∘ C to 37 ∘ C accounts for the changes in first four modal frequencies of 14.29% to 41.70% in relative sense.And the variation coefficient ranges from 4.05% to 11.48%.It demonstrates the necessity to eliminate temperature effect on modal frequency.
(2) A series of statistical indexes including PE, RMSE, and  are introduced to evaluate the reproduction and prediction capability of the formulated models.Histograms statistics of PEs demonstrates that the reproduction and prediction capability of SVR model are superior to MLR and BPNN models.Comparison analysis on RMSE and R indicators also prove that SVR model exhibits excellent reproduction and prediction capabilities and evaluates the modal frequency with high accuracy.
(3) Eliminating temperature effect on modal frequencies is achieved by use of the established SVR model.After eliminating temperature effect, seasonal variation of modal frequencies disappeared, and modal frequencies are concentrated in narrower range with normal distributions.Comparison between variabilities of modal frequencies before and after eliminating temperature effect demonstrates the effectiveness of SVR model in eliminating temperature effect.

Figure 7 :
Figure 7: Sample selection for training, validation, and test set.

Table 1 :
Statistics of first four modal frequencies.

Table 2 :
RMSE and  values of reproduced modal frequencies.

Table 3 :
RMSE and  values of predicted modal frequencies.

Table 4 :
Variability of first four modal frequencies before and after eliminating temperature effect.