Assessment for Thermal Conductivity of Frozen Soil Based on Nonlinear Regression and Support Vector Regression Methods

-e comprehensive understanding of the variation law of soil thermal conductivity is the prerequisite of design and construction of engineering applications in permafrost regions. Compared with the unfrozen soil, the specimen preparation and experimental procedures of frozen soil thermal conductivity testing are more complex and challengeable. In this work, considering for essentially multiphase and porous structural characteristic information reflection of unfrozen soil thermal conductivity, prediction models of frozen soil thermal conductivity using nonlinear regression and Support Vector Regression (SVR) methods have been developed. -ermal conductivity of multiple types of soil samples which are sampled from the Qinghai-Tibet Engineering Corridor (QTEC) are tested by the transient plane source (TPS) method. Correlations of thermal conductivity between unfrozen and frozen soil has been analyzed and recognized. Based on the measurement data of unfrozen soil thermal conductivity, the prediction models of frozen soil thermal conductivity for 7 typical soils in the QTEC are proposed. To further facilitate engineering applications, the prediction models of two soil categories (coarse and fine-grained soil) have also been proposed. -e results demonstrate that, compared with nonideal prediction accuracy of using water content and dry density as the fitting parameter, the ternary fitting model has a higher thermal conductivity prediction accuracy for 7 types of frozen soils (more than 98% of the soil specimens’ relative error are within 20%). -e SVR model can further improve the frozen soil thermal conductivity prediction accuracy and more than 98% of the soil specimens’ relative error are within 15%. For coarse and fine-grained soil categories, the above two models still have reliable prediction accuracy and determine coefficient (R) ranges from 0.8 to 0.91, which validates the applicability for small sample soils. -is study provides feasible prediction models for frozen soil thermal conductivity and guidelines of the thermal design and freeze-thaw damage prevention for engineering structures in cold regions.


Introduction
With the increasing frequency of human activities and the trend of global warming, the temperature sensitive permafrost is changing significantly, most of which are in critical equilibrium state or even gradual degradation [1][2][3]. e widely distributed permafrost in the Qinghai-Tibet Plateau exhibits stronger temperature sensitivity and faster temperature rise rate than other permafrost regions with the same latitude [4][5][6]. e dynamic degradation process of frozen soil has a great impact on the stability of buildings in cold regions [7][8][9][10]. In the meanwhile, it will also lead to local environmental degradation, or further affect the global climate carbon cycle and climate change [11][12][13][14].
e thermal conductivity is one of the important thermophysical parameters for reconstructing the past and predicting the temperature status of permafrost under climate change conditions and also determining a parameter for engineering design in cold regions [18,19]. erefore, the comprehensive understanding of variation law of soil thermal conductivity is one of the important tasks in permafrost research. ermal conductivity is an inherent parameter to characterize the heat transfer performance of soil, which can usually be obtained by means of experimental test and prediction models [20][21][22]. e test of thermal conductivity can generally be divided into the steady-state method and transient method [23,24]. Specifically, it includes steadystate comparison method, steady-state heat flow meter method, transient hot wire method, transient heat pulse method, and TPS method [25]. However, the disadvantages of the experimental test are usually strict technical requirements, long time consumption, and high cost, and sometimes limited test results cannot satisfy the needs of practical applications. erefore, as reported [26,27], many soil thermal conductivity prediction models had been proposed. Empirical models are usually based on statistical analysis of thermal conductivity experimental results and use water content, dry density, porosity, and other related soil properties as fitting parameters [28][29][30][31]. Kersten [28] analyzed the test results of 19 soil types and established the empirical formula of thermal conductivity using water content and dry density. Johansen [29] proposed the concept of normalized thermal conductivity and given the interpolation calculation model of soil thermal conductivity based on the relationship between normalized coefficient and soil saturation. Lu et al. [30] performed a series of thermo-TDR tests on twelve natural soils and proposed a linear prediction model across a wide range of soil moisture condition. Yan et al. [31] developed a generalized effective soil thermal conductivity model for soils of various textures from dry to saturation. en, many scholars improved the model for wider applicability and higher prediction accuracy [32][33][34][35]. At the same time, many scholars have established theoretical predictive models of soil thermal conductivity based on specific physical theory and models [36][37][38]. Farouki [39] and Xu et al. [40], respectively, established the weighted calculation model and geometric average calculation formula based on the volume ratio and thermal conductivity of each component of the frozen soil on the basis of predecessors. Considering for the interactions among soil particle, water, and air of a soil unit cell, Haigh [41] derived a theoretical thermal conductivity model for sand soil. In recent years, with the maturity of machine learning and its advantages in dealing with complex nonlinear problems, many researchers had adopted it to the prediction of soil thermal conductivity [42,43]. For instance, Bang et al. [44] investigated the application effect of linear regression and various machine learning methods in the prediction of thermal conductivity of compacted bentonite and verified the feasibility and superiority of the machine learning method in the prediction of the soil thermal conductivity.
Compared to unfrozen soil, the specimen preparation and experimental procedures of frozen soil thermal conductivity testing are more complex and challengeable. And it can be found that the prediction accuracy of the frozen soil thermal conductivity predictive model is lower than that of unfrozen soil [40]. Meanwhile, as a complex multiphase composition, previous research indicated that the thermal conductivity of frozen soil was associated with many factors [26,27,45]. e existing frozen soil thermal conductivity empirical model (usually using dry density, water content, porosity, etc., as the fitting parameters) cannot comprehensively consider the influences of complex multiphase and porous structural characteristics of soil. erefore, in the present work, prediction models of frozen soil thermal conductivity using nonlinear regression and SVR methods have been developed considering for essentially multiphase and porous structural characteristic information reflection of unfrozen soil thermal conductivity. ermal conductivity of various types of soil samples which are sampled from the QTEC are tested by the TPS method. Correlation analysis of thermal conductivity of unfrozen and frozen soil has been adopted. Based on the measurement data of unfrozen soil thermal conductivity, the ternary fitting and SVR prediction models of frozen soil thermal conductivity for the typical soils in the QTEC are proposed. To further facilitate engineering applications, the prediction models of coarse and fine-grained soil categories have been proposed. Furthermore, the prediction models of two soil categories have also been used to predict the frozen soil thermal conductivity of small size soil samples in the QTEC.

Collection of Soil Samples.
e soil samples of thermal conductivity measurement were collected from the drilling specimens of Qinghai-Tibet expressway geological exploration project, which was implemented by the CCCC First Highway Consultants Co. Ltd. and Eco-Environment and Resources Northwest Institute of CAS from September 2017 to June 2018. As shown in Figure 1, the sampling spot mainly locates in the permafrost regions between Xidatan and Tanggula Mountain (the corresponding Qinghai-Tibet Highway mileage is K2870∼K3307). Every drilling spot is sampled for different depths, and the sampling depth varies from 0.5 m to 40 m. Total number of 638 unfrozen soil samples and 860 frozen soil samples are tested.
Soil types of testing specimens are determined by the classification criteria of geotechnical engineering. Figure 2 shows the statistic results of different types of soil samples (only numbers more than 15 are given). It can be seen that silty clay is the most widely distributed frozen soil type in the QTEC, and the following soil types are sandy and gravel soil. Because some soil samples with large water (ice) content are impossible to implement unfrozen soil thermal conductivity testing, a total of 609 specimens tested for both frozen and unfrozen are selected as the research objects in this work, and the statistical distribution of various soil samples is shown in Table 1. It can be seen that the proportions of silty clay, silt, fine sand, gravel sand, boulder, breccia, and all weathered rock soils accounting for 86.2% are defined as the typical analytical soil types in the following predictive model research.

2
Advances in Civil Engineering

Water Content and Dry Density Distribution of Soil
Samples. e dry density (ρ d ) and water content (w) of soil samples are statistically calculated, and the accumulative proportion distribution is shown in Figure 3. It shows that the water content and dry density distributions are significantly correlated with soil properties. e dry density is negatively correlated with soil particle size and basic orders of dry density of the samples are silt, silty clay, weathered rock, fine sand, gravel sand, breccia, and boulder soils. e average dry density of above soil samples are 1 Figure 1: Schematic of the sampling spot along the QTEC [16].

Experimental Methods and Apparatus.
e TPS method is utilized as the thermal conductivity testing measure for all soil samples. e experimental apparatus is the Hot Disk 1500s ermal Conductivity Analyzer (as shown in Figure 4), which has ±3% measurement accuracy. e principle of the TPS method for measuring the thermal properties of materials is based on the transient temperature response of a step-heated disc-shaped heat source in an infinite medium. No. 4922 Kapton film probe with radius of 29.4 mm is used to make specimen as large as possible, and the thermal conductivity (λ) is calculated by line equation of temperature variation values and dimensionless time constant which is measured by the film sensor: where ΔT(τ) is the temperature variation value, P 0 is the total output of power and r is the radius of Kapton film sensor, and D(τ) is the dimensionless time constant. During the test, experimental soil samples are formed into 3 cm height and 8 cm diameters column, and the film probe is  sandwiched between two soil samples and fixed with a stainless steel sample holder. e detailed experimental procedures can be obtained in our previous work [46].

Nonlinear Regression Model.
Previous studies have shown that dry density and water content have significant influence on soil's thermal conductivity. Partial correlation analysis between frozen soil thermal conductivity of dry density, water content, and unfrozen soil thermal conductivity for two soil categories have been performed, and the results are shown in Table 2. e results exhibit that dry density and water content, as well as unfrozen soil thermal conductivity (λ u ), have significant positive correlations with frozen soil thermal conductivity (λ f ). In particular, it should be noted that the correlation between the frozen and unfrozen soil thermal conductivity for both coarse and finegrained soils are very high (0.88 and 0.93, respectively). It can be inferred that, considering for the essential soil properties information reflection of unfrozen soil thermal conductivity, it would be possible to utilize the unfrozen soil thermal conductivity to predict frozen soil thermal conductivity.
Curve estimation of the fitting relationship between the frozen soil thermal conductivity and three factors has been taken, and it is found that the thermal conductivity of the frozen soil is linear with the unfrozen soil thermal conductivity, while its relation with dry density and water content are in the form of logarithmic functions. e ternary fitting formulas of frozen soil thermal conductivity for 7 typical soils in the QTEC are given as follows: where a, b, c, and d are the fitting coefficients of the equation. e detail fitting results of 7 typical soils are listed in Table 3. It can be seen that the determine coefficients (R 2 ) of 7 typical soils range from 0.76∼0.93, while the analog effect of sandy soils are better than other soil types.
In order to analysis the predictive effect of the ternary fitting model, the estimated frozen soil thermal conductivity based on the Kersten model [28], Gangadhara Rao and Singh model [47], and ternary fitting model are plotted against the measured value of testing dataset. e empirical formulas proposed by Kersten and Gangadhara Rao and Singh used dry density and water content as fitting parameters, while the applicable soil types are silt, fine sand and gravel sand soils, etc. e comparison results of the estimation values of three models and the experimental values are shown in Figure 5. It can be seen that the proposed ternary fitting model performs best among all three models as most of the predictive values are within the ±10% relative error line. However, the predictive values of Kersten and Gangadhara Rao and Singh models are generally overestimated for all soil types and their prediction errors reach up to 40%. erefore, it can be concluded that, compared with nonideal prediction accuracy of binary fitting models, the ternary fitting model using the unfrozen soil thermal conductivity as the fitting parameter has a higher thermal conductivity prediction accuracy.

3.1.
eory and Performance Evaluation of SVR Model. SVR method is a type of supervised machine learning method and proposed by Vapnik et al. [48,49], which is based on statistical learning theory. e SVR method uses kernel functions to map low-dimensional nonlinear problems to high-dimensional space to achieve linear separability and then to seek linear regression equations to fit sample data (as shown in Figure 6). e regression function can be expressed as follows: where x is the input vector, K is the weight vector, m is the offset vector, and φ(x) is eigenvector that maps input data to the high-dimensional space. e Gaussian radial basis function is chosen as the kernel function in the calculation. e SVR method is based on the principle of minimizing structural risk, transforming the linear regression problem into the following optimization problem, and then the values of K and m can be determined [48]:

Subject to
where t is the penalty parameter which is greater than 0, ε is the insensitive loss coefficient, and ξ i and ξ * i are slack variables.
e predictive results of SVR models for silt and gravel sand soils are shown in Figure 7. It can be seen that the predictive results of the SVR model are in good agreement with the experimental results (R 2 � 0.86 for silt and R 2 � 0.94 for gravel sand). e SVR model exhibits remarkable prediction accuracy and most of predictive values are within the ±10% error bars, which validates their predictive availability        Advances in Civil Engineering and engineering suitability. Furthermore, it also should be noted that evaluation spots in Figure 7 is distributed uniformly throughout the thermal conductivity interval (λ f ranges 0.6∼3.3 W/(m·K)), which proves the broader application range of the SVR model.

Comparison of SVR Model with Ternary Fitting
Model. e comparison of predictive results of the SVR model and ternary fitting model are shown in Figure 8. It can be seen that the fitting degree of the SVR model is much higher than that of the ternary fitting model. Especially for the silty clay and breccia soils, the predictive improvement effect of the SVR method is more obvious. For improvement at both high and low thermal conductivity intervals, the R 2 increases to 0.84 and 0.85, respectively. It can be considered that, for the complex multiphase and porous structural characteristics of soil, the SVR model has more advantages than the traditional empirical formula model. e fitting results of the ternary fitting method and SVR method are statistically calculated. e distributions of R 2 , sample probability of mean absolute percent error (MAPE) less than 10%, and maximum relative error (δ max ) are shown in Table 4. It can be seen that, compared with ternary fitting method, the minimum R 2 of the SVR method is 0.84 (silty clay), and the R 2 of 4 soil types is above 0.9. e 7 typical soils' average δ max of the SVR model is 18.1% and ternary fitting model is 21.9%. Moreover, the proportion of MAPE 8 Advances in Civil Engineering less than 10% of SVR model are larger than that of ternary fitting model for all 7 typical soils, which further indicate that the SVR model has better performance than the ternary fitting model. e sample probability of the MAPE within 20% of the ternary fitting method is more than 98%, while the sample probability of the MAPE within 15% of the SVR method is more than 98%. erefore, it can be considered that the ternary fitting method has a predictive accuracy of ±20% and the SVR method has a predictive accuracy of ±15%.

Generalized Prediction Model for Coarse and Fine-Grained Soil
Based on the previous analysis, it can be found that some soil types, such as clay, silty sand, and gravel soil, cannot be analyzed by the nonlinear regression or SVR method for their small sample sizes. Furthermore, the overly detailed prediction models of subdivided soil types will also induce a certain degree of inconvenience in engineering application. us, generalized frozen soil thermal conductivity prediction models for coarse and fine-grained soil categories have been developed. e R 2 and fitting coefficient values of the ternary fitting model and SVR model are listed in Table 5, and the comparison of predictive results of the above two models for coarse and fine-grained soil categories are shown in Figure 9. It shows that both the ternary fitting model and SVR model have acceptable fitting effect for two soil categories, and most evaluation spots of two models are within or near the ±10% error. e R 2 of the ternary fitting model for the coarse and fine-grained soil categories is 0.80 and 0.89, respectively, while the SVR model is 0.85 and 0.91, which proves the feasibility of soil categories predictive models. Nevertheless, it also can be found that the SVR   Figure 9: Comparison of predictive results of ternary fitting and SVR models for (a) fine-grained sand and (b) coarse-grained sand.
Advances in Civil Engineering method still has a better performance despite the increase of variety and complexity of soil samples. e MAPE distribution of two prediction models of coarse-and fine-grained soil categories is shown in Figure 10. It can be seen that the prediction accuracy of coarse-grained soil category is generally higher than that of fine-grained soil category. e MAPE of coarsegrained soil category distributes uniformly in each thermal conductivity range and its average value is lower than 10%, which means that the predictive results of two prediction models for coarse-grained soil category are more reliable in the entire frozen soil thermal conductivity distribution interval. Additionally, it should be mentioned that the large error intervals of finegrained soil category are mainly concentrated in 0∼1.2 W/(m·K) and 2.7∼3.6 W/(m·K) ranges and the proportion of the two ranges are 6.67% and 2.6% (total of 9.27%), which is roughly excluded from its main thermal conductivity distribution range. erefore, considering for applicability scope and convenience, the fine-grained soil category prediction models definitely have certain application values in engineering. e prediction models of two soil categories are applied to small sample soils (clay, silty sand, and gravel soil), and the results are shown in Figure 11. It can be seen that both ternary fitting model and SVR model exhibit good predictive accuracy for all three small sample soils, which supports applicability of prediction models. Furthermore, it can be clearly noted that the SVR model has better predictive performance for clay soil, which testify that the machine learning method has broad application prospects for its capability of nonlinear and complex relationship capturing and imitating.

Conclusions
In present work, a large-scale soil thermal conductivity test has been conducted by the TPS method. Correlation analysis of thermal conductivity of unfrozen and frozen soil has been adopted. Based on the measurement data of unfrozen soil thermal conductivity, the ternary fitting and SVR prediction models of frozen soil thermal conductivity for the typical soils in the QTEC are proposed considering for the essential soil properties information reflection of unfrozen soil thermal conductivity. Furthermore, to facilitate engineering applications, the prediction models of coarse and finegrained soil categories have also been proposed and compared. e results show that (1) Compared with nonideal prediction accuracy of using water content and dry density as the fitting parameter, the ternary fitting model has a higher thermal conductivity prediction accuracy for typical soil types in QTEC, and more than 98% of soil specimens' relative error are within 20% (2) With the capability of nonlinear and complex relationship capturing and imitating, the SVR model can further improve the frozen soil thermal conductivity prediction accuracy and more than 98% of the soil specimens' relative error are within 15% (3) For coarse-and fine-grained soil categories, the above ternary fitting and SVR models still have reliable prediction accuracy and their R 2 ranges from 0.8 to 0.91, which validates the applicability for small sample soils

Data Availability
All the data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest
e authors declare that they have no conflicts of interest.