A Novel Approach to Increase the Goodness of Fits with an Application to Real and Simulated Data Sets

In practice, the data sets with extreme values are possible in many fields such as engineering, lifetime analysis, business, and economics. A lot of probability distributions are derived and presented to increase the model flexibility in the presence of such values. The current study also focuses on investigations to derive a new probability model New Flexible Family (NFF) of distributions. The significance of NFF is carried out using the Weibull distribution called New Flexible Weibull distribution or in short NFW. Various mathematical properties of NFW have been discussed including the estimation of parameters and entropy measures. Two real data sets with extreme values and a simulation study have been conducted so as to delineate the importance of NFW. Furthermore, NFW is compared with other existing probability distributions; numerically, it has been observed that the new mechanism of producing the lifetime probability distributions plays a significant role in making predictions about the population than others using the data sets with extreme values.


Introduction and Problem Statement
Due to wide applicability of the probability models in various disciplines, it remains a great interest of the researchers to increase the precision and validity of predictions and forecasting through probability functions in the presence of extreme values. For example, Ijaz et.al. [1] produced a new family of lifetime distributions and discussed its various statistical properties. e probability function of [1] can increase the reliability than others for the lifetime data analysis. Other prominent families of distributions include the Beta-G proposed by Eugene et al. [2], Jones [3], and Zografos and Balakrishnan [4] suggested a Gamma-G (type-1) family, Mc-G proposed by Alexander et al. [5], Amini et al. [6] delineate the Log-Gamma-G type-2 family, Gamma-G (type-2) studied by Ristić and Balakrishnan [7], Torabi and Narges Montazeri [8] discussed Gamma-G (type-3), Weibull-X family of distributions introduced by Alzaatreh et al. [9], and exponentiated generalized class of Cordeiro et al. [10]. For a detailed discussion on the development of such new families of distributions, we refer to see [11][12][13][14][15][16][17][18][19].
e existing probability distributions have some limitations while modeling the lifetime data. First of all, these distributions are unable to model the nonmonotonic hazard function; for example, exponential distribution can only model the constant hazard rate function, while gamma distribution can only model the monotonically increasing failure rate function. Secondly, even if the existing probability functions model the data, they provide a bad fit to the real data. In practice, a lot of data sets follow a shape rather than the constant or monotonically increasing failure function. For instance, the lifetime of accident rate or an electronic device follows the pattern of nonmonotonic hazard rate function [1].
is paper presents another contribution to the existing theory of probability functions that will overcome the limitations of the existing distributions and some others recently developed.

Methodology to Proposed New Family of Distributions
A lot of probability distributions are introduced by defining the new generator called families of probability distributions; for example, [1] introduced a Gull Power Alpha family of distribution, and for other families, we refer to [6,9,15,20,21] and [22][23][24][25][26][27][28][29]. In this paper, a new family (NFF) is presented by introducing a new scale parameter "a" which is given by where F(y) is the CDF of the baseline distribution. e probability density function of F(x) is , a ≠ 1.

Special Form of the NFF
is portion demonstrates the specific form of NFF by utilizing the CDF of the Weibull distribution called new Weibull distribution (NFW). e cumulative distribution function (CDF) of the Weibull distribution [30] is presented by where b in the above equation represents scale parameter and c represents the shape parameter. By replacing (3) in (1), the CDF and PDF of NFW are, respectively, given by Figure 1 presents numerous shapes of the CDF and PDF with distinct set of parameter values.

e Survival and Hazard Rate Function.
e survival and hazard rate function of NFW is defined by , and using (4), we get After simplification, the final result is given as follows: e hazard rate function of NFW is provided by Using (4) and (5), we finally obtained � a e − bx c ln(a)bcx c− 1 e − bx c /(a − 1) Finally, the following result is obtained: Figure 2 demonstrates the nature of the hazard rate function for different values of parameter.

3.2.
e Quantile Function and Median. e quantile function and median of NFW is defined by where q is a standard uniform random variable. Substituting (4), we obtain the result e solution of equation (12) for x will give the following result: For median, consider q � 0.5 in equation (13).

The R th Moments
e r th moments about origin, suppose u r ′ is defined as Recalling (5), we get e simplified form of the above integral form is Let bx c � z, then x � (z/b) 1/c , and dz � bcx c− 1 dx. By using the above substitution in (16), finally, we get

Order Statistics
Let X 1 , X 2 , X 3 ,. . ., X n be ordered random variables from NFW, then the PDF of the HQIC order statistic is given by 2 Mathematical Problems in Engineering Using (4) and (5), the smallest and largest order statistic of NFW is defined by and the largest order statistic is

Parameter Estimation
To increase the efficiency of the probability models, an estimation of the parameters also plays a vital role. Since the maximum likelihood estimation (MLE) method provides the most reliable estimates, therefore, for estimation of the parameters of NFW, we considered the MLE approach. A comprehensive discussion on maximum likelihood estimation was given by [31][32][33]. e likelihood function of (4) is defined by By taking the log of the above function, we get x c log(a) + n log(a) + n log(b) To obtain the estimate of the parameters, partial derivative with respect to various parameters (a, b, c) was taken and equated the result to zero:   Mathematical Problems in Engineering e exact values of the parameters are not possible as equations (23) to (25) are not in closed form. Still, one can use the numerical methods to get the ML estimates.

Asymptotic Confidence Bounds.
e asymptotic confidence bounds for the parameters of NFW(a, b, c) are derived based on asymptotic distribution. To derive the required asymptotic confidence bounds, we are required the second time derivatives of (23)- (25):
By substituting the above result in (29), we obtained Finally, we obtained the result as

Mode
e mathematical form of the NFW for the mode can be illustrated as follows: Using equation (4) After simplifying the above expression, the result may be written as bcy c − c + 1 e by c + ln(a)bcy c � 0.
e above function is an implicit function. It can be solved numerically under some restriction on the parameter values.

Skewness and Kurtosis
e general functions for calculating the skewness (S) and kurtosis (K) of the NFW are given as follows:

Special Cases
Following are the submodels of NFW.
10.1. Case c � 1. By putting c � 1 in (4) and (5), we get exactly the new probability distribution with the baseline CDF of the exponential distribution called new exponential distribution (NFE). e mathematical forms of the CDF and PDF are, respectively, given by

Applications
To where L � L(ψ; x i ) is the maximized likelihood function and x i is the random sample, ψ is the MLE, and p represents the number of parameters involved in the model. Generally, the probability model that has the smallest number among these various criteria is judged as the best fitted model.

Data Set 1: Remission Time of Bladder Cancer Patients.
e following data set represents the remission time of 128 bladder cancer patients taken from Aldeni and Famoye [20].     Figure 3 describes the empirical and theoretical plots of the PDF and CDF of NFW. Figure 4 defines the P-P and Q-Q plot, and Figure 5 demonstrates the TTT plot that demonstrates the data follow a nonmonotonic hazard rate shape. Table 2 indicates MLEs, the log-likelihood, and their standard error values. Table 3 describes the goodness of fit measures; the goodness of fit measures for NFW has the smallest values that indicate that the proposed probability model has better performance than the already developed Rayleigh (R), exponential (E), Weibull (W), Weibull exponential (W.E), and Algoharai inverse flexible Weibull (AIFW).

Data Set 2: Bank Customer Data.
e data set of Aldeni et al. [20] for the waiting time of the 100 bank customers is considered with the following values: 0.8, 0  8.8, 8.8, 8.9, 8.9, 9.5, 9.6, 9.7, 9.8, 10.7, 10.9, 11.0, 11.0, 11.1, 11.2, 11.2, 11.5, 11.9, 12.4, 12.5, 12.9, 13.0, 13 Figure 6 shows the empirical and theoretical CDF and PDF of the bank customer's data. e red line of the graph illustrates that the line is the best fitted to theoretical data as matched to other distributions. Figure 7 reveals the P-P and   Q-Q plot, and Figure 8 presents the TTT plot. Figure 8 clearly demonstrates that the data fit the monotonic hazard rate structure. e MLEs are given in Table 4 including the log-likelihood and standard errors. e values in Table 5 show that the NFW performs better as compared to R, E, W, W. E, and AIFW distributions.

Simulations
To check the performance of the proposed probability model, an expression (13) was applied on the NFW distribution so that to produce artificial data. e simulations are carried out 1000 times for a various samples of size n with

Conclusion and Discussion
e paper focuses on generating a new family of probability distributions called New Flexible Family of distributions or in short NFF. e proposed family of distribution is applied to a Weibull distribution called New Flexible Weibull (NFW) distribution. Various statistical properties including the entropy measures, order statistics, and moment generating function are determined. e parameters are estimated by using the usual method called maximum likelihood estimates. To achieve the main objectives of the paper, the proposed probability model is applied to real data sets with extreme values, where one datum follows a monotonic hazard rate and a second data set capturing the nonmonotonicity of the hazard rate function. Moreover, the significance of parameters is investigated using a simulation study and it has been shown that the parameters lead to a flexible results. Hence, it has been established that NFW produces a superior fit when modeling data sets with extreme values as opposed to the existing distributions to these data sets. A future research study should also be conducted to extend the proposed model flexibility for the extreme value data sets via transmutation techniques and some others.
Data Availability e data sets are taken from the literature.

Conflicts of Interest
e authors have no conflicts of interest.