A Method of Selecting the Block Size of BMM for Estimating Extreme Loads in Engineering Vehicles

Extreme loads have a significant effect on the fatigue damage of components.The blockmaximummethod (BMM) is widely used to estimate extreme values in various fields. Selecting a reasonable block size for BMM is crucial to ensure that proper extreme values are extracted to get extreme sample to estimate extreme values. Aiming at this issue, this study proposed a comprehensive evaluation approach based onmultiple-criteria decisionmaking (MCDM)method to select a proper block size. Awheel loaderwith six sections in one operating cycle was illustrated as an example. First, spading sections of each operating cycle were extracted and connected as extreme loads often occur at that section. Then extreme sample was obtained by BMM for fitting the generalized extreme value (GEV) distribution. Kolmogorov-Smirnov (K-S) test, Pearson’s Chi-Square (χ2) test, and average deviation in Probability Distribution Function (PDF) are selected as the fitting test. The comprehensive weights are calculated by the maximum entropy principle. Finally, the optimal block size corresponding to the minimum comprehensive evaluation indicator is obtained and the result exhibited a good fitting effect. The proposed method can also be flexibly used in various situations to select a block size.


Introduction
Load analysis should be considered during vehicle design [1].First, a measured load can be regarded as a sequence of turning points formed by the local maxima and minima, which determine the amplitude of load cycles.But the measured load represents only a very limited part of the design life and necessarily should be extrapolated in the time domain if our main goal is to perform fatigue tests or FEM fatigue analysis.The main issue in load analysis is how to determine the extreme cycles [2], which contribute to fatigue damage significantly.Second, the components of transmission cannot contain any growing cracks when subjected to millions of cycles in a short period of time.The extreme loads are therefore the most important load characteristics that determine a fatigue limit.Third, the components of engineering vehicles may bear extreme loads caused by misuse and rare events and will be severely damaged, and even sudden failures may occur.As extreme loads appear once or a few times in the design life, they should be estimated from the measured load.Therefore, when the fatigue limit, life prediction, and sudden failures are considered for designing mechanical components in the time domain or amplitude domain, extreme loads should be estimated accurately to achieve the correct balance between safety design and overdesign [3].
Extreme value theory (EVT), based on asymptotic theory, provides a model that can be extrapolated to deduce the probability of certain serious extreme events, which have not happened or have not been recorded yet.EVT can be applied to estimate extreme values in various fields, such as wind engineering, strength of materials, and fatigue strength [4][5][6].To fit the extreme distribution, extreme sample should be extracted accurately from the measured load.The most commonly used methods are BMM, Peaks over Threshold (POT), and the method of independent storms (MIS).POT method [7] is applicable for selecting loads above a threshold, but the crucial problem is how to determine the threshold properly [8][9][10].Although there are many researches about the threshold, a stable and effective method has not been established.MIS [11] selects maximum loads of each independent storm and has a lower sensitivity than POT in threshold 2 Mathematical Problems in Engineering determination, but it requires a longer measured load for a few loads which remained after using MIS.
BMM has been widely used to estimate extreme values, especially in dealing with seasonal data.Selecting a proper block size for BMM is crucial [12], the methods of selecting the block size vary from different fields, such as wind speeds [13], flood [14], rainfall [15], and other fields [16][17][18].In the fields mentioned above, such as wind speeds, we assume that  1 ,  2 , . . .,   are the daily observations.The block size is set as a year and   is the annual maximum because the wind speeds undergo seasonal variations in a year.Given further research on extreme loads in engineering vehicles, how to select a block size properly is still a problem.This study took a wheel loader as an example.Its fixed V-6 operation pattern was used in the experimental measurement of service loadings [19,20], that is, no load forward, spading, full load backward, full load forward, unloading, and load backward, as exemplified in Figure 1(a).The actual measured load time history at a drive flange under the V-6 operating scheme was shown in Figure 1(b); the change among different sections is similar to the seasonal characteristics of wind speeds.In wind speeds, one year is commonly used as a block size, while in engineering vehicles one operating cycle may reasonably be set as a block size.Then the maximum loads of each operating cycle were extracted to fit GEV.However, the fitting effect of GEV shown in the latter part of the paper was not ideal.
To extract more extreme loads and model a reliable extreme distribution by restricting the fitting error of GEV, spading sections of each operating cycle were extracted and connected.Then a comprehensive evaluation approach based on MCDM was proposed to select a proper block size for BMM.The block size is determined by comparing the comprehensive fitting errors with less subjectivity and prejudice.The load analysis in this paper was realized through WAFO and EVIM [21,22], and it can flexibly be used in various situations to select a block size.

GEV for Extreme Loads Analysis
where () is a nondegraded function, then () must be one of the three forms: As for Fréchet and Weibull distributions ( > 0), the probability density functions of them are illustrated in Figure 2.
Only one of the three forms is practically selected to undertake a desirable fitting from the sample load; its distribution parameter is then estimated.However, it is difficult to decide which of the three functions has the best fitting effect.Besides, the selection is performed based on earlier experience.Thus, arbitration plays a high-weighted role during the process, and a mistaken selection of the distribution form leads to disastrous estimated results.Therefore, integrating the three forms of distribution into a comprehensive one, namely, GEV, is preferable: where 1 + ( − )/ > 0, , , , respectively, represent the shape, scale, and position parameters, and all of them satisfy the demand that  ∈ ,  > 0,  ∈ .The tail form of GEV is determined solely by ; a higher value of  indicates a thicker tail; hence,  is defined as extreme value index.The relationship of GEV and the three extreme value distributions depends on the value of .If  = 0, GEV refers to a Gumbel distribution; if  > 0, GEV refers to a Fréchet distribution; and if  < 0, GEV refers to a Weibull distribution.

Extreme Sample Selected by BMM.
The extraction of extreme values is significant when GEV is applied to fit the tail of the data.Suppose that the continuous sampling data  1 ,  2 , . . .,   are divided into  blocks with the interval length .The maximum loads   1 ,   2 , . . .,    of each block are extracted to constitute extreme sample, as depicted in Figure 3.And   1 ,   2 , . . .,    can be viewed as independent identically distributed observations from GEV.However, if a selected block size is unreasonable, a large scale of extreme loads will be wasted.The fitting will be inaccurate because a block size that is too small may lead to biased estimation, whereas a one that is too large may result in few extracted extreme values and subsequently a large variance.Thus, to fully extract extreme values to constitute sample loads for fitting GEV, the block size must be exactly selected.Practically,   is sample data obtained during some process or some period of time, such as average temperature per day or average wind velocity per 10 min;   is the maximum value corresponding to the process or time.The period of time for perspicuously seasonal data is often a year;   represents the annual maximum value.Though under different working conditions, a loader's average working time for each operating cycle is 36 s; the components of a loader work under periodically loading conditions.One or more extreme loads may occur in each operating cycle.In a short period of time, numerous extreme loads will have an impact on the components.Thus, it is imperative to find an efficient method that differs from the traditional method applied in other fields to select block size and estimate extreme loads in engineering vehicles.the most popular method applied in extreme loads estimation.Its superiority lies in asymptotic normality, congruency, invariance, and other attributes when the distribution is known.Hosking [23] provided the arithmetic procedure for parameter estimation with maximum likelihood method.

Parameter Estimate Using MLE. Maximum likelihood estimation (MLE) method is a kind of point estimation and
1 ,  2 , . . .,   are independent and identically distributed, whose log-likelihood function is as follows.
When  ̸ = 0 and satisfies 1 + (( where  is block numbers in the BMM model; , ,  can be solved through the two equations above. After a thorough study on the application of maximum likelihood estimation in GEV, Smith discovered that when  ∈ (−0.5, +∞), the parameter in MLE can be obtained and it is holomorphic; when  ∈ (−1, −0.5), the parameter in MLE can also be obtained but has no generally asymptotic property; when  ∈ (−∞, −1) the parameter in MLE cannot be obtained any more.The second and third conditions seldom occur when applying GEV for parameter estimation.Therefore, the theoretical defect of MLE poses little harm to its practical application.

Test of Goodness-of-Fit.
The test of goodness-of-fit is primarily used to evaluate the selected block size.The graphic and calculation methods are available for the test.The quantile-quantile (Q-Q) plot and probability-probability (P-P) plot are graphic methods for comparing two probability distributions or two data sets.If they are similar and close, the points in the Q-Q and P-P plots will approximately lie on a line.There are many kinds of calculation methods, the most popular methods are the Kolmogorov-Smirnov (K-S) test, Pearson's Chi-Square ( 2 ) test, average deviation in PDF, root mean square deviation, and correlation coefficient test ( 2 ) [24,25].This study selected three tests to describe the fitting effect of GEV; the details are described as follows: The K-S test quantifies the distance between the hypothesized cumulative distribution function (CDF) and PDF.The smaller  max , the better the goodness-of-fit. 2 test measures the distance between the observed data and the normal distribution with mean 0;   is the observed count, and   is the expected count.The hypothesized distribution represents the observation data better when  2 is smaller [26].For the average deviation in PDF, smaller  pdf means that a hypothesized distribution can represent the observation data better.

Comprehensive Evaluation Approach Based on MCDM
The graphic method of the test of goodness-of-fit is too subjective, whereas each of the tests in calculation method can only test the goodness-of-fit in one certain aspect.Given that different researchers adopt different tests, the determined block size may be various.Figure 4 shows that when adopting different block sizes in a sample load for fitting GEV, K-S test,  2 test, and average deviation demonstrate different rules of variation.When test methods are used to determine the block size, respectively, the results vary greatly.Therefore, it is crucial to find a method that comprehensively considers different fitting test results to select the block size properly.
The comprehensive evaluation approach based on MCDM [27][28][29] can integrate different tests into a comprehensive one that considers different results from the various tests and makes the selection of block size more scientific and objective.The key point of the comprehensive evaluation approach is to determine the weight of each test.The entropy value method (EVM) determines the weight of each test based on its respective effect on block size and is a relatively objective weight-determining method.According to the maximum entropy principle and the combination of the three tests, an exact method of determining block size was proposed in this study.Its procedure is shown in Figure 5.

3.1.
Calculating the Weight of the Tests.Entropy was first proposed by German scientist R. Clausius in 1846.Claude Elwood Shannon introduced this concept to the theory of information, where it was used to measure the degree of nondeterminacy [30].Since its proposition, the concept of entropy has been introduced to many fields such as probability theory, bioscience, and engineering.
In the present study, the selected block sizes in each test are different.Aiming at the content,  1 ,  2 , . . .,   are the candidate block sizes and  1 ,  2 , . . .,   are the tests.The error of fitting from applying   to   is   , and the evaluation matrix  is shown as follows: where  1 ,  2 , . . .,   are unknown parameters; those values need be calculated by EVM.For   , a higher varying magnitude of   from fitting with candidate   indicates a larger entropy.This larger entropy implies that   has a larger influence on   .In addition, a lower varying magnitude of   implies that   has a smaller influence on   .If for certain   , its error of fitting on every block size is congruent, then   has nothing to do with the ultimate decision and its weight   is 0.
The evaluation matrix needs to be standardized.The normalization method reserves the degree of effect of the tests on evaluation objectives and is adopted in this study.
where x represents the standardized   ; the column represents the evaluation objectives; thus, ∑  =1 x = 1.The entropy   from test   can be calculated as follows: where 0 <   < 1.

Estimate the extreme load
Test criteria The entropy weight of each test can be calculated as follows:

Entropy value method
The weight vector is  = ( 1 ,  2 , . . .,   ).The comprehensive evaluation indicator that integrates different tests can be calculated as follows: When fitting for GEV is conducted, the comprehensive error calculated by (11) should be small enough.

Case Study
In the field of engineering vehicles, extreme loads to which the components are subjected should be estimated.Extreme loads usually occur once or more during one operating cycle because of the tough working conditions.Traditionally, when the BMM is adopted, the selection of block size is based on the engineering practice of technicians, and the graphic method is applied to subjectively evaluate the fitting effect, which depends heavily on the experience of data processing.A wheel loader was taken as an illustrative example to verify the effectiveness and practicability of the proposed method.

Data Analysis.
A wheel loader's fixed V-6 operation pattern for loading and unloading bulk material was used in the experimental measurement under spading primary soil.The actual measured load history of an axle shaft measured at 75 operating cycles was obtained.After denoising, smoothing, and other pretreatments conducted on the data, selecting one operating cycle as a block size may be reasonable because of the section working characteristics of a wheel loader, which are analogous to the seasonal characteristics of wind speeds.However, the fitting effect of GEV was not ideal, as shown in Figure 6.The extreme information was wasted when the block size is set as one operating cycle because of the increased probability of the appearing extreme loads.Therefore, the loads of six sections in each operating cycle were individually extracted, and then the loads of the same section were combined to form a new load history, as shown in Figure 7.Given that the spading section was more serious than any other sections, the connected spading section was extracted.
Because the data of each spading section enjoy the same load characteristic, the connected data were independent and identically distributed, which adhere to the underlying theoretical basis for the GEV indicating that the parent population is independent and identically distributed.
In this study, a new load time history was obtained by combining the data of the spading section of each operating  cycle, and the block size was selected using the comprehensive evaluation approach to obtain more extreme loads.The working time for every spading section is 10 s.A hundred data points are gathered per second, and the total time is 750 s.Therefore, 75001 data points were gathered, as shown in Figure 8.

Determination of the Comprehensive Evaluation Indicator.
Based on the proposed method in this study, a third of the number of data during a spading section was selected as the initial block size and the number of data during a spading section was selected as the terminal condition.The region of the block size was [300, 1000]; 5 data points were set as step length.The number of data per block was therefore determined and subsequently the number of blocks.The maximum loads were then extracted to constitute extreme sample to fit GEV.The maximum likelihood function was selected for parameter estimation, and K-S test,  2 test, and average deviation in PDF were selected for the test of goodness-of-fit.Thus, based on MCDM, a 141 × 3 evaluation matrix was constructed.The normalization method was adopted in this study to standardize the estimation matrix because of the different dimensions for the different tests.The standardized evaluation matrix was shown in Table 1.
EVM was adopted to resolve the entropies of the tests, and the entropy vector was resolved.The weight coefficient vector was obtained through (10), and the comprehensive evaluation indicator, that is, the comprehensive fitting error, was calculated through (11).

Selecting the Block Size.
From Table 1, it can be concluded that when the block number was 218, which corresponded to a block size of 345, the K-S test reached its nadir.When the block number was 101, which corresponded to a block size of 740,  2 test reached its nadir.When block number was 251, which corresponded to a block size of 300, the average deviation reached its nadir.Figure 9 showed that when the block number was 221 the comprehensive fitting error reached its nadir and that the block size was 340.As the comprehensive test took all the three tests of K-S,  2 , and average deviation, it can conduct a minimum comprehensive fitting error.

Fitting GEV.
Once the block size was selected, the maximum loads of each block were extracted to constitute extreme sample to fit GEV under a confidence region of 95%.The result of fitting GEV was evaluated by adopting the Q-Q plot, graph of distribution function, graph of probability density function, residue quantile plot, and graph of residue probability density function, as shown in Figures 10 and 11.Compared to the GEV fitted by the extreme sample extracted from each operating cycle, it exhibited a better fitting result, which was compatible with the analytical demand from the BMM.Meanwhile,  value of the first condition was 0.86 and that of the second condition was 0.92.Given that a higher value of  value indicated a more likely GEV distribution, the method was proven to be successful in extracting sufficient extreme values from the measured load and displayed significant engineering practicality for estimating extreme loads.

Conclusions
When estimating extreme loads in engineering vehicles, block size are sensitive to parameter estimation and the selection of the extreme loads by BMM; thus, selecting a proper block size is essential.Once the BMM is adopted, performing block partition based on engineering practice has the testability and nondeterminacy.To model a reliable extreme distribution by decreasing the fitting error of GEV and evaluate the selected block size, this study proposed a comprehensive evaluation approach based on MCDM to divide blocks automatically and analyze fitting results quantitatively.
The measured load time history of a wheel loader was used as an example to verify the effectiveness and practicability of the proposed method.The block size selected by the proposed method was tested with the graphic method.The results showed that the fitting effect of GEV was desirable, and   then a satisfactory block size was determined.Compared with the GEV fitted by the extreme sample extracted from each operating cycle, it exhibited a better fitting result.The load analysis in this paper decreases the subjectivity and prejudice and can be flexibly used in various situations to select a block size.

Figure 1 :Figure 2 :Figure 3 :
Figure 1: (a) V-6 operating scheme.(b) Actual load history measured at the drive flange of a wheel loader.

Figure 4 :
Figure 4: Standardized fitting error of test methods.

Figure 5 :
Figure 5: Flow chart for determining block size based on MCDM.

Figure 8 :
Figure 8: Load data of the combined spading sections.

Table 1 :
Candidate block numbers with standardized matrix.