On Statistical Development of Neutrosophic Gamma Distribution with Applications to Complex Data Analysis

Department of Mathematics and Statistics, Hazara University, Mansehra, Pakistan Department of Mathematics, College of Science and Humanities in Al-Kharj, Prince Sattam Bin Abdulaziz University, Al-Kharj 11942, Saudi Arabia Department of Mathematics, College of Sciences and Arts (Muhyil), King Khalid University, Muhyil, Abha, Asir Region 61421, Saudi Arabia Department of Mathematics and Computer, College of Sciences, Ibb University, Ibb 70270, Yemen Country Business Administration Department, Administrative Science College, 2amar University, 2amar, Yemen


Introduction
e gamma distribution is named after the widely known gamma function (GF), which is explored in many fields of mathematics. e gamma distribution in applied statistical analysis is used extensively for studying stochastic phenomena [1]. Wang and Wu [2] presented valuable information on the gamma distribution and its uses. ree types of gamma distribution functions, namely, one, two, and three parameter gamma densities, are employed as wellsuited models for many real datasets [3][4][5].
e gamma distribution family portrays three different failure rates in survival and reliability analysis [6].
is resulted in the gamma model being reasonably versatile and describing various lifespan data [7]. It has been frequently applied in numerous disciplines, including metagenomics [8], hydrology [9], environmental science [10], climatology [11], and genetics [12]. Some excellent work can be found in the inference part of the gamma distribution in [13,14]. Due to extensive applications of the gamma distribution, many generalized structures have been suggested in the literature [15,16]. e basic model and generalized structures of the gamma distribution are rooted in the assumption that distributional parameters accurately describe a particular set of data, which is unrealistic in many situations. For example, the scale and shape parameters of the gamma density cannot be fixed to exactly single numbers to find a well-suited model for the amount of rainfall data. However, it is more realistic to believe that the gamma model is a best-described distribution for the rainfall data with an interval set of values for shape and scale parameters. e notion of the neutrosophy can model such indeterminacies in the data originated by Smarandache [17]. It is a new philosophical area, an extension for fuzzy and intuitionistic fuzzy logics [18][19][20][21][22]. Smarandache suggested the essential principles of neutrosophic set in [23][24][25][26][27]. Classical statistics deals with determinate data and determinate inference methods only, neutrosophic statistics deals with indeterminate data and indeterminate inference methods, i.e., data that have any kind of indeterminacy (unclear, vague, partially unknown, contradictory, incomplete, etc.), and inference methods that have degrees of indeterminacy as well (for example, instead of crisp arguments and values for the probability distributions, algorithms, functions etc., one may have inexact or ambiguous arguments and values). Neutrosophic statistics was founded by Smarandache in 1998 and developed in 2014 [24]. e neutrosophic statistics is also a generalization of interval statistics because, among others, interval statistics is based on interval analysis. Neutrosophic statistics is based on set analysis (meaning all kinds of sets, not only intervals). Neutrosophic statistics is more elastic than classical statistics. If all data and inference methods are determinate, then neutrosophic statistics coincides with classical statistics. But, since, in our world, we have more indeterminate data than determinate data, more neutrosophic statistical procedures are needed than classical ones. Because the parameters of classical gamma distribution are indeterminate in the neutrosophic logic, it is possible to deal with all the scenarios that one may find while working with real data analysis [28][29][30][31][32][33][34]. Recently, Duan et al. [35] introduced the algebraic structure of the exponential distribution. Sherwani et al. [36] described the neutrosophic version of the normal distribution. More applications of the neutrosophic statistics can be seen in [37][38][39][40].
In the interest of more practical data modelling, this study proposes a new extension of gamma distribution under neutrosophic logic. Some research work exists on neutrosophic distributions, for example, [34], but we expect that the current study may be complete by presenting a comprehensive overview of NGD and providing more insights on it. e purpose of the present study is to introduce a neutrosophic version of the gamma distribution. Such an extension is potentially relevant for practical applications since there are far fewer distributions available for dealing with indeterminate data in the univariate and multivariate cases, especially for situations where data are not precise and reported in fuzzy or interval forms. e rest of the work is described as follows: e NGD is presented in Section 2. Section 3 provides some examples for conceptual understanding of the NGD. e distributional parameters are estimated in Section 4. Finally, a real data analysis using NGD is explained in Section 5. Lastly, Section 6 concludes the findings of the study.

Neutrosophic Gamma Model
e random variable Z is neutrosophic gamma distributed with parameters p n > 0 and λ n > 0 if it has the following probability function: where Γ is well-known GF and for a neutrosophic value, Γp n � ∞ 0 t p n − 1 e − t dt. Z follows the neutrosophic Erlang model for integer values of p n . e distributional parameters p n and (1/λ n ) are frequently recognized as the shape and scale parameters, respectively. Sketches of the NGD for several values of p n and λ n are shown in Figures 1 and 2.
In Figures 1 and 2, parameters settings may be changed to create various neutrosophic gamma curves. In Figure 1, the shape parameter is fixed, i.e., p n � [8,8], while the scale parameter λ n varies to three different values. However, in Figure 2, the scale parameter is fixed to a crisp value [6,6] but shape parameter p n varies to three different values. e area under a particular neutrosophic curve is always equal to one that can easily be shown by using the transformation y � (z/λ n ) in (1). Equation (1) can be used to compute the probability for a neutrosophic gamma random variable. Nevertheless, it is difficult to calculate the integral of the NGD using a mathematical expression, so probabilities are evaluated using computer software. ese may be evaluated via repeated integration by parts, but the details are long and ignored. Some useful distributional properties of NGD can be described in the forms of some theorems as follows.

Theorem 1.
e mean of the NGD is p n λ n .
Proof. Using the definition of expected value, mean of the NGD can be written as Using the neutrosophic calculus, expression (2) can be further simplified as us, (3) provides 2 Complexity Hence proved.
Proof. By definition, variance of NGD is given by Now, From (7), simplification provided us, (7) becomes Substituting (5) and (9) in (6) Proof. By definition, the j th moment of NGD is given by us, (10) becomes where μ jn ′ is the jth moment about origin. However, moments about mean can be established by using the wellknown relation as   e coefficient of skewness of NGD is shown as Proof. By definition, the skewness of the NGD is given by where μ 3N � 2λ n and μ 2N � λ n . Simplifying (7) further provides where is coefficient is in a standardized form and provides information about the spread of the distribution around its mean value. e standardized form is unit free and of much help to compare two variables computed in a different unit of measurements. e NGD curve in Figure 1   Proof.
e kurtosis coefficient is also in the standardized form and provides information about the spread of the distribution. It is more related to extreme points of the distribution. □ Theorem 6.
Proof. e MGF of the NGD is given by Using the transformation y l � z(1/λ l − t) and e expression given in (19) can be used to generate moments of the NGD discussed in eorem 3.
Differentiating (20) with respect to λ n and p n yielded δφ p n , λ n |z δλ n � n n i�1 ln z i n − ln λ n − ω(p) , where ω(p) � (d ln(p n )/dp n ) is known as the digamma function. Now, equating (21) and (22) to zero and simultaneous solution for p n and λ n provide From (23) and (24), maximum values of p n and λ n can be achieved by using an iterative numerical algorithm. R software is more convenient to perform this algorithm. To assess the performance of distributional estimators in terms of neutrosophic average biased (AB n ) and neutrosophic root measure square error (RME n ), a simulation study has also been conducted. Assessment measures AB n and RME n are defined as follows [41]: where ϑ n is the shape or scale parameter and ϑ n is a corresponding estimate of the NGD. Using the R programming language, a Monte Carlo simulation is performed with different sample sizes and fixed values of the neutrosophic parameters λ n � [5,8] and p n � [1,1]. An imprecise dataset is produced with defined parametric values, and simulation analysis is replicated N � 10 5 times with sample sizes of m � 5, 15, 30, and 60, respectively.
e performance metrics of the distributional estimators are then estimated and listed in Table 1.
To evaluate the reliability of maximum likelihood estimators, AB n and RME n are computed over the given sample size. While the metrics used provide a numerical evaluation of estimation accuracy, they are difficult to assess without a baseline value. e parameter setting λ n � [5,8] and p n � [1,1] are used as baseline values for the evaluation of estimation accuracy. e results show that when the sample size n increases, AB n , i.e., biases, decrease. On this account, the study shows that with a higher sample size, the neutrosophic estimators provide reliable estimates.
us, the differences between the baseline values and estimated values are decreased when the sample size increases.

Illustrative Examples
Some illustrative examples are given in this section to understand the theoretical results presented in the previous section.
Example 1. In a biomedical investigation using rats, a medicate-response experiment is performed to evaluate the influence of a toxicant's dosage on their survival time. is toxic substance is one which is ordinarily released into the atmosphere as a byproduct of jet fuel. e study indicates that for a given toxicant dosage, the survival time, in weeks, has an NGD with p n � [4,6] and λ n � 1. What is the probability that rats will survive for no more than 6 weeks? Also, find the mean survival time of randomly selected rats in this experiment.
Solution: let the random variable Z denotes the survival times of rats in weeks.
e expression in (26) is incomplete gamma distribution. us, using the neutrosophic calculus and R software, this can be found as which is the required probability that rats would survive no more than 6 weeks. Moreover, using (5), the average survival time of randomly selected rats can be calculated as Example 2. In a particular metropolis, the daily usage of water (measured in trillion of litres) roughly follows the NGD with neutrosophic parameters with p n � [4,5] and λ n � [1, 1.5]. What is the probability that the water supply would be insufficient on any given day if the city has a daily capacity of 5 trillion litres of water? Solution: water supply would be insufficient if consumption is more than daily capacity.
Following (1) reveals that Simplification of (29) using neutrosophic calculus yielded Example 3. e life of a specific type of transistor used in Phillip brand washing machines follows the NGD with a mean of [4.5, 7] years and a standard deviation of [3.7, 4.9] months. Find the probability that a transistor of this type would survive no more than 1.5 years.
Solution: using the expressions for mean and standard deviation of the NGD, we can write Simultaneously solving (31) and (32) yielded us, the required probability can be calculated as

Real Application
In this section, the applicability of the suggested NGD has been evaluated by using the dataset given by Linhart and Zucchini [42]. e dataset represents the failure times of the cooling system used in an airplane. Several authors have used this dataset in their study, such as [43,44]. e gamma distribution with rate and shape parameters equal to 0.74 and 1, respectively, best describes the failure time data. e appropriateness of gamma distribution with defined parameters can be seen in Figures 3 and 4. e CDF plot in Figure 3 indicates how the best-fitted gamma distribution represents the empirical distribution of the observed data. In Figure 4, empirical probabilities calculated from the observed data do not entirely depart from the straight line of the theoretical gamma distribution. us, both CDF and PP plots indicate the suitability of the gamma distribution for the observed dataset. Failure measurements are initially crisp values; however, for the sake of illustration, we treat data as uncertain sample values for some air condition systems, as indicated in Table 2. Table 3 shows that failure times such as [121, 125], [71, 74], [11,13], and [52, 55] are not precisely documented but are available in intervals. Because of ambiguities or uncertainties in the sample, the existing gamma model is not appropriate. On the contrary, the suggested NGD model may simply be used to analyze the given dataset. e descriptive summary of the failure data using the proposed NGD is provided in Table 3.    It is clearly demonstrated from Table 3 that there are inconsistencies in several of the key numerical statistics of the failure times data due to indeterminacies considered in the observed sample.

Conclusions
In this work, a neutrosophic extension of the gamma distribution and its applications has been discussed. is extension would open the way for dealing with other classical probability models that are defined for the accurately specified dataset. Using the properties of the neutrosophic calculus, all fundamental properties of the classical gamma model such as moments, shape coefficients, and MGF have been derived. e estimation procedure supported by neutrosophic algebra is developed. Point estimates of the distributional parameters of the proposed NGD have been assessed by the Monte Carlo simulation. Results of the simulation study showed that a large sample size provides reliable estimates of the parameters.
Several illustrated examples are discussed to highlight the importance of proposed NGD while handling situations when incomplete or vague information is available about the distributional parameters. A real dataset on failure times of the cooling system with some degree of uncertainty has been analyzed using NGD. e scope of the present study may be broadened to include other generalized versions of the gamma distribution. Moreover, various properties of the NGD, including testing and estimation methods, based on the neutrosophic logic can be explored in future work. Furthermore, multivariate neutrosophic structure of the proposed model may be worked out.

Data Availability
Data are included within the manuscript.

Conflicts of Interest
e authors declare that they have no conflicts of interest.