Using EVT for Geological Anomaly Design and Its Application in Identifying Anomalies in Mining Areas

A geological anomaly is the basis ofmineral deposit prediction.Through the study of the knowledge and characteristics of geological anomalies, the category of extreme value theory (EVT) to which a geological anomaly belongs can be determined. Associating the principle of the EVT and ensuring the methods of the shape parameter and scale parameter for the generalized Pareto distribution (GPD), the methods to select the threshold of the GPD can be studied. This paper designs a new algorithm called the EVT model of geological anomaly. These study data on Cu and Au originate from 26 exploration lines of the Jiguanzui Cu-Au mining area in Hubei, China. The proposed EVTmodel of the geological anomaly is applied to identify anomalies in the Jiguanzui Cu-Au mining area. The results show that the model can effectively identify the geological anomaly region of Cu and Au. The anomaly region of Cu and Au is consistent with the range of ore bodies of actual engineering exploration. Therefore, the EVTmodel of the geological anomaly can effectively identify anomalies, and it has a high indicating function with respect to ore prospecting.


Introduction
In mineral deposit prediction, searching for mineral deposits requires identification of a geological anomaly indicating that the economic value is high ("as discussed by Darehshiri et al. [1]").A geological anomaly is a geological body or complex of bodies with obvious different compositions, structures, or orders of genesis as compared with the surrounding circumstances ("as discussed by Lu and Zhao [2]").With the evolution of the earth, the nature, source, and intensity of force will not be the same across different times and space.In addition, the distribution of material of the earth is not uniform in time and space, which results in different events and responses, such as the tension and compression of layers, deposition and erosion of material, subsidence and uplift of the crust, simple and complex structures, and intrusion and ejection of magma; these differences form the geological anomaly ("as discussed by Pengda et al. [3]").If a numerical value or numerical interval is used as a threshold to represent the background field, the field that is above or below the threshold constitutes a geological anomaly ("as discussed by Cheng [4]").The character of the geological anomaly and the size and type of mineral resources are determined by the geological environment, geological age, rock type, and structural background of the formation of the geological anomaly.With the evolution of geology, the geological anomaly has an evolution sequence in the time and space.With respect to time, evolution has the stage; with respect to space, evolution has inheritance and superposition ("as discussed by Freedman and Parsons [5]

").
Not all geological anomalies can form deposits, but the constitution of a geological anomaly is a prerequisite for the formation of deposits ("as discussed by Shen et al. [6]").Determining which geological anomaly can result in a mineral deposit can allow effective identification of the deposit.Based on the time required for ore formation, a geological anomaly can be classified into a front ore-forming anomaly, an ore-forming anomaly, and a tail ore-forming anomaly ("as discussed by Zhao et al. [7]").Different factors and combinations of ore-forming geologies have certain special properties related to ore formation.However, various minerals with different genetic, morphological, mineral, and industrial types are needed to select certain geological factors and combinations.Therefore, it is necessary to find the target anomaly in all of the possible ore-forming geological anomalies; the area of the target anomaly is known as the feasible location for prospecting.According to additional information on ore formation, such as the remote sensing anomaly, geophysical anomaly, and geochemical anomaly, we can find the location of the required ore deposit; these areas are known as the favorable areas for prospecting.With more information on the geological anomaly, the area of the prospecting target will be gradually reduced, making it easy to locate the deposit.Therefore, the geological anomaly is the basis of mineral deposit prediction; it is effective in locating deposits by precisely identifying the geological anomaly.Therefore, it is important to identify more reasonable methods to locate the geological anomaly.To meet this challenge, various methods have been proposed and successfully applied with respect to the geological anomaly, such as the Three-Component Mineral Prediction theory ("as discussed by Zhao et al. [8]"), the quantitative prediction theory of geological anomaly ("as discussed by Pengda et al. [3]"), and singularity theories and methods for mineral deposit prediction ("as discussed by Cheng and Zhao [9]").However, each method pertaining to the geological anomaly needs to meet the conditions of the algorithms when they identify the geological anomaly.In fact, not every algorithm satisfies the entire geological environment.Consequently, more algorithms related to the characteristics of the geological anomaly and those based on the environment are needed, which match the extraction criteria of the geological anomaly.
At the International Statistics Congress held in Seoul, Republic of Korea ("as discussed by Chen et al. [10]"), Pengda Zhao described the geological anomaly as an extreme value based on a mathematical foundation.For the geological background, he thought that an abnormal value was the geological anomaly, which directly infers that knowledge of the mathematical foundation of the geological anomaly is of extreme value.The extreme value analysis pertains to research on the random character in the process of quantification at a very large or small level and an estimate of the probability of an extreme event at the existing observational level, while the observation data of the geological anomaly are located in the tail end of the distribution.Therefore, the geological anomaly belongs to the EVT category.The extreme value theorem is a branch of statistics that studies the limiting distribution of the minimum and maximum value and evaluates the risk of extreme events ("as discussed by Allen et al. [11]").In recent years, the EVT has been widely used in the fields of finance, insurance, floods, earthquakes, rainfall analysis, and so on ("as discussed by Chen and Lv [12]" and "as discussed elsewhere [12][13][14]").Since the geological anomaly belongs to the EVT category, the EVT has been widely used in many fields, so we can learn from the experience of these types of applications and design the extreme value model of the geological anomaly that can effectively identify the geological anomaly.Does the EVT model of the geological anomaly really identify anomalies?This study was performed to verify the use of the model to identify anomalies in the Jiguanzui Cu-Au mining area.The results show that the model can effectively identify the geological anomaly region.
The rest of this paper is organized as follows.Section 2 studies the EVT, the method of selecting the threshold EVT is studied, and some parameters of the EVT are also discussed.Section 3 designs the EVT model of the geological anomaly and provides a new method to increase the accuracy with which the threshold can be selected.The feasibility of using the EVT model to identify the anomaly is discussed.Section 4 will demonstrate the application of the EVT model of the geological anomaly.Some conclusions are presented in Section 5.

The Study of the EVT
2.1.The Knowledge Related to the EVT.In the sample data, if the parent distribution or the sample size is not fully known, the parent distribution can be obtained from the asymptotic distribution of the extreme value of the sample.While the sample data are large, the largest or smallest value from a sample has a degradation problem.However, the extremal type theorem can effectively solve this problem ("as discussed by Vanem [15]").The extremal type theorem is presented as follows.
If  1 ,  2 , . . .,   is a sequence of independent random variables with a common distribution, parent distribution () is unknown.  is the largest value of the sample interval, and () is a nondegenerate distribution function.If there exists a sequence of constants {  } > 0 and {  } ∈ , () indicates a generalized extreme value distribution.Here,   is a scaling constant and   is a location constant.Then, this limiting distribution () after standardization (  −   )/  must be one of the three following types: Here,  is a shape parameter,  is a location parameter, and  is a scale parameter.

The GPD Model.
In the EVT, the block maxima method (BMM) is a traditional model ("as discussed by Rivas et al. [16]").The BMM divides the sample interval into several nonoverlapping cells in accordance with the time, the length, and so on.Then, there is an extreme sequence that is formed by selecting all the maximum values of each small interval; from the extreme sequence, the parent distribution can be obtained by distribution fitting of the extremal type theorem.However, the BMM model has a problem that some maximum values of intervals are greater than those of the other intervals; thus, the validity of the BMM model is not satisfactory.The defects can be solved by generalized Pareto distribution (GPD) ("as discussed by Ashkar and El Adlouni [17]").The GPD is a fitting of the observed data, which is greater than a certain threshold.
Here,  1 ,  2 , . . .,   is a sequence of independent random variables with a common distribution, and  is a sufficiently high threshold.If there is positive number , the excess distribution (  − ,  = 1, 2, . . ., ) can be expressed as where  is a shape parameter and  is a scale parameter.If  ≥ 0,  ≥  and  < 0,  ≤  ≤ −/ + .The sequence (  ,  = 1, 2, . . ., ) obeys the GPD.The general formula of the GPD is given by Here, if  = 1, the expression of the GPD is referred to as the standard form.If  = −0.5, 0.5, and 0, then the image of the standard distribution function and density distribution function of the GPD is as presented in Figure 1.From Figure 1(a), it is seen that the tail of the GPD thickens as the shape parameter increases.Figure 1(b) shows that the density function of the GPD decreases monotonically.
The GPD requires estimates of the parameters and threshold.The shape parameter  and scale parameter  of the GPD can be estimated by the maximum likelihood function ("as discussed by Castillo and Serra [18]").Taking the derivative of (4), we can obtain the density function of the GPD: Taking natural logarithms of both sides of (5), we obtain the log likelihood function: Taking the partial derivative of  and  of ( 6), respectively, the likelihood equation is as follows: Thus, the maximum likelihood estimate value ξ of the shape parameter  and the maximum likelihood estimate value β of the scale parameter  can be obtained from (7).In addition, the shape parameter and scale parameter of the GPD can be estimated by the moment method; the estimation results obtained by the moment method are superior to those obtained using the likelihood function ("as discussed by Ergün and Jun [19]").The moment method is given by ξ where   =   −  ≥ 0,  is the mean value of   , and  is the standard deviation of   .In the GPD model, the threshold selection methods mainly concern the mean excess function (MEF) ("as discussed by Gencay and Selcuk [20]") and Hill plotting ("as discussed by J. H. T. Kim and J. Kim [21]").If random variable  obeys the GPD, the MEF () is given by For the actual sample data, () can be calculated by the following: where  is the total number of sample data and   is the total number of sample data that exceed threshold .If   ≥ , (  − ) + =   −  or   < , (  − ) + = 0.Then, we can plot scatter diagram (, ()).In the scatter diagram, there is sufficiently high threshold ; when  > , () is an approximate linear function.

Design the EVT Model of the Geological Anomaly
For the actual observational data  = { 1 ,  2 , . . .,   } of the geological anomaly, the tail distribution of the geological observational data is called the geological anomaly.Therefore, if the data are higher than sufficiently high threshold  in the sample data, we can model these data using the GPD.
The parameters of the GPD can be estimated by the moment method or the likelihood function.Threshold  can also be calculated using the MEF or Hill plotting.Thus, the designed EVT model of the geological anomaly is as follows.
Step 1 (conditional test).Before using the EVT model of the geological anomaly, the stationary and posttail of the sample data need to be tested.The common method of the conditional test is as follows: probability plot and quantilequantile (Q-Q) plot ("as discussed by Feng et al. [22]"), augmented Dickey-Fuller (ADF) test ("as discussed by Lee and Chang [23]"), and so on.
Step 2 (estimate the parameters of the model).For the sample data, use the moment method or likelihood function to estimate shape parameter  and scale parameter  of the GPD.
Step 3 (determine the threshold).Select different thresholds  from the sample data.Then, we can calculate the MEF of the sample data through scatter diagram (, ()); there is sufficiently high threshold , when  > , and () is an approximate linear function.Here, the value  is called the threshold of the geological anomaly.
Step 4 (determine the distribution of the excess threshold).
After the threshold and parameters are determined, we insert the threshold and parameters into the GPD to obtain the distribution of the excess threshold; this distribution is called the abnormal probability distribution.
Step 5.After the distribution of excess threshold is determined, a diagnostic test can determine whether the threshold selection is rational.The diagnostic test of the model mainly tests the consistency between the theoretical distribution and the actual distribution, especially the fitting degree of the actual data and the model distribution.The methods we usually use are the Q-Q plot and probability plot.If the test effect is not satisfactory, repeat Step 3 and determine the new reasonable threshold.The chart of the EVT model of the geological anomaly is shown in Figure 2.
In this study, the determination of threshold  in the EVT model is critical.If the selection of the threshold value is higher, the number of samples that exceed the threshold value is lower, and the parameters of the GPD are very sensitive to the high values of the observational data, which will cause errors in the parameter estimation.Conversely, the selection of a threshold value that is low will increase the number of observations, increasing the accuracy of the estimation of the parameters, but the excess data   −  do not obey the GPD distribution.At present, there is no clear method to select the accuracy threshold.The MEF can be used to estimate the threshold with some defects, in which the selection of the threshold is usually an interval value and not an accurate constant.Therefore, this paper provides a new method to increase the accuracy with which the threshold can be selected.In the GPD, when the initial threshold value  0 is determined, the excess data   −  0 approximately obey the GPD distribution.Regardless of any threshold  ( >  0 ), the shape parameter  and scale parameter  of the GPD should remain unchanged.Therefore, information can be obtained on the transformation relationship between () and threshold  ( >  0 ) from (3).
Let  * () = ( 0 )+(− 0 );  * () is called the modified scale, and the values of  * () will not change when threshold  changes.Therefore, when the interval threshold value is determined by MEF, the accuracy threshold can be estimated by  * ().Estimating the threshold using  * () is detailed in Section 4.2.
In this paper, the EVT model of the geological anomaly takes full account of the characteristic of the geological anomaly distribution and the practical features of the EVT.Relative to the geological background value, the anomaly and extreme value can be used to describe the geological anomaly.The observations of the geological anomaly are located in the tail of the samples, which are related to the random characters in the process of quantification at the very large or small level and estimate the probability of the extreme event in the existing observation levels-these characteristics are also the contents of the EVT.The EVT is a branch of statistics that studies the limiting distribution of the minimum and maximum value and evaluates the risk of extreme events.Therefore, the mathematical foundation of the geological anomaly is described by the EVT.On the one hand, the EVT describes the characteristic of the geological anomaly distribution from the perspective of mathematics; the results from (3) show the distribution of the sample data, which is proved in (12).On the other hand, the EVT provides a quantitative and digital research method for predicting and evaluating the mineral resources, which is proved in Figure 10.Besides, this paper also discusses the methods of estimate parameters and the threshold of the EVT.Consequently, a mathematical statistical model is established for quantitative geological data and geology information where the data exceed the threshold of the sample in (12).Therefore, the ability of the EVT model to identify the geological anomaly is feasible.

The Model Application in Identifying the Geological Anomaly of the Jiguanzui Cu-Au Mining Area
In order to show the effect of identifying the anomaly using the EVT model with geological anomaly recognition, the model was applied to identify the anomaly in an actual mining area.  1, we can see that the skewness of the sample data is greater than zero, and the sample data are not normally distributed, that is, distributed to the right.By observing the coefficient of variation, it is seen that the coefficient of variation of Au is smaller than that of Cu, and the stability of Au is higher than that of Cu.Besides, the kurtosis of Cu and Au is greater than that of the normal distribution (of which the kurtosis value is 3), which results in a leptokurtic distribution for Cu and Au.Therefore, the distribution of Cu and Au is shown to be skewed to the right with leptokurtic characteristics.Secondly, in order to indicate the difference between the actual distribution and normal distribution, Q-Q plot can be used for the observation test (Figure 4).The distribution of Cu and Au is also shown to be skewed to the right with posttail characteristics.

The Stationary Test of the Sample Data.
The stationary test mainly inspects the self-correlation of the geological data; the common methods of the stationary test are as follows: the augmented Dickey-Fuller (ADF) and sequence correlation analysis.Through the ADF, we can obtain the test results of the sample data (Table 2).The -statistics of Cu and Au are −33.93498and −13.19081, respectively, which are smaller than their own 1% significant level.Therefore, the sequences of Cu and Au do not have unit roots; they are stationary sequences.The results of the sequence correlation analysis are

Ore bodies and numbers
The first metallogenic region The second metallogenic region The third metallogenic region The fourth metallogenic region  shown in Figure 5, where we can see that the autocorrelation coefficients (AC) and the partial autocorrelation coefficient (PAC) are not zero, and the significance of -states is high, so the uncorrelated hypothesis cannot be rejected.Therefore, the sequence of the geological data is a stationary time series.
Together, the results indicate that the data follow a stationary sequence, and the distributions of Cu and Au have posttail characteristics, indicating that they are abnormally distributed.Therefore, we can use the EVT model of the geological anomaly to identify the anomaly.

Mathematical Problems in Engineering
The distribution of the excess threshold is called the abnormal probability distribution.In the mineral deposit prediction, information on the geological data can be described and expressed by the distribution of the excess threshold.selection of the thresholds is reasonable.Fitting the excess threshold of the sample data using the GPD (Figure 9) indicates that the excess threshold of the sample data is in the vicinity of the line; the results show that the theoretical distribution and actual distribution of the sample data are consistent.Therefore, the threshold selection is reasonable.There are currently seven mining drill holes, that is, KZK10, KZK11, KZK23, KZK28, ZK02618, ZK02619, and ZK02620, and the exploitation ore bodies are mostly VII main ore body in the 26 exploration lines of the Cu-Au mining area.GIS technology is used to show the geological anomaly region with the selection of the thresholds (Figure 10).From Figure 10, it is seen that the anomaly region of Cu and Au is consistent with the range of ore bodies of the actual engineering exploration, which has a high indicating function with respect to ore prospecting.The results show Mathematical Problems in Engineering that the EVT model of the geological anomaly is good at mineral deposit prediction, and it has good prospecting significance.

Conclusion
In this study, the proposed EVT model of the geological anomaly was applied to identify geochemical anomalies associated with Cu and Au mineralization.The results of this study led to the following: (1) The characteristics of the geological anomaly and the principle of EVT were studied; knowledge of the distribution of the EVT coincides with the distribution of the geological anomaly data.The designed EVT model of the geological anomaly takes full account of the characteristic of the geological anomaly and the practical features of the EVT.The threshold selection and parameter estimates of the model were determined.
(2) The proposed EVT model of the geological anomaly was successfully applied to identify the geological anomaly region in the Jiguanzui Cu-Au mining area.
The results show that the anomaly threshold of Cu is 816.4006 and that of Au is 75.4736; the shape parameter of Cu is 0.3162 and that of Au is 0.3342; and the scale parameter of Cu is 440.9216 and that of the Au is 31.5699.The abnormal probability distribution was also determined.Testing the results of the model by fitting the excess threshold of the sample data showed that the results of the theoretical distribution and actual distribution of the sample data were consistent.
(3) The geological anomalies of Cu and Au predicted by the EVT model are consistent with the range of ore bodies of the actual engineering exploration.
The EVT model has a high indicating function with respect to ore prospecting, and it is applicable for the exploration of mineral deposits.

Figure 1 :
Figure 1: The standard GPD: (a) is the distribution function of the GPD and (b) is the density function of the GPD.

Figure 2 :
Figure 2: The chart of the algorithm.

Figure 3 :
Figure 3: The pattern of the Jiguanzui Cu-Au mining deposit.

Figure 4 :
Figure 4: The posttail test of the sample data according to Q-Q plot.(a) is Q-Q plot test of Cu and (b) is Q-Q plot test of Au.

Figure 5 :
Figure 5: The sequence correlation analysis of the sample data.(a) is the sequence correlation analysis of Cu and (b) is the sequence correlation analysis of Au.

Figure 6 :Figure 7 :Figure 8 :
Figure 6: The scatter diagram of the MEF.(a) is the MEF of Cu and (b) is the MEF of Au.

Figure 9 :Figure 10 :
Figure 9: The GPD fitting of the sample data.(a) is the distribution fitting of Cu and (b) is the distribution fitting of Au.

Table 1 :
The basic statistics of Cu and Au.
The Jiguanzui Cu-Au mining deposit is the blind deposits at the lower part of the Quaternary overburden layer.Currently, I, II, III, and VII main ore body groups, 14 main ore bodies, and 105 small fragmentary ore bodies have been identified in the mining area.The main ore bodies of the Jiguanzui Cu-Au mining area are distributed in the 013 to 034 lines, which are 950 meters long, the width is 160-800 meters, the elevation ranges from −5 m to −1412 m deep extension, and the level projection area of the ore bodies is 0.58 square kilometers.The overall distribution of the ore bodies is northeast 30 ∘ , the trend of the ore bodies is northeast 15 ∘ -72 ∘ , and the local trend of the ore bodies is northwest.I, II, III, and VII ore bodies are arranged in the form of an echelon, in which the tendency is northwest and the local tendency is south.The main ore bodies occur in the fault basin at the edge of the northwestern rock body of the Tonglushan, near the contact zone of the dolomitic marble, the quartz monzonite diorite porphyry, and quartz diorite in the Lower Triassic Jialingjiang Formation, and near the different lithology and the echelon fracture of dolomitic marble.The pattern of the Jiguanzui Cu-Au mining deposit is shown in Figure3.
The study data with Cu and Au originate from 26 exploration lines of the Jiguanzui Cu-Au mining area in Hubei, China.The count of the sample data with Cu and Au is 14309.4.1.The Condition Test of the Model 4.1.1.The Posttail Test of Sample Data.Firstly, this paper analyses the basic statistics of the Cu and Au elements, and the results are shown in Table 1.From Table

Table 2 :
The ADF test of Cu and Au.
[24]mate the Parameters and Threshold.Through the EVT model of the geological anomaly, we can calculate the MEF of Cu and Au and plot the scatter diagram of the MEF (Figure6).From Figure6, it is seen that, in the interval [784.5406,841.3659]withCuandinterval[72.1178,85.1918]withAu,()forCuandAufollowsanapproximatelylineardistribution.These intervals are selected as the threshold for Cu and Au.As the result of the threshold selection is subjective, modified scale  * () is used to estimate the accuracy threshold.Based on  * (), if initial threshold value  0 is determined, regardless of any threshold  ( >  0 ), the shape parameter  and scale parameter  of GPD will not change.Uniformly selecting 50 threshold values from [784.5406, 841.3659] and [72.1178, 85.1918], respectively, we can obtain the transformation relations between  * (), , and the threshold  ( >  0 ) for Cu and Au using (3) and(11); see Figures7 and 8.In order to ensure the accuracy of the EVT, the threshold selection is as large as possible in the permissible range of threshold estimation where the data show the stationary characteristic ("as discussed by Cao and Zhang[24]").From Figures7 and 8, it is seen that the threshold of Cu is 816.4006 and that of Au is 75.4736.After the thresholds are determined, the parameters of the EVT model of the geological anomaly can be estimated using the moment method, which reveals that the shape parameter of Cu is 0.3162 and that of Au is 0.3342; the scale parameter of Cu is 440.9216 and that of Au is 31.5699.Then, inserting the thresholds and parameters into the GPD, the distribution of the excess threshold of Cu and Au can be obtained by