A Procedure for the Selection of Single Sampling Plans by Variables Based on Pareto Distribution

Product control in statistical quality control is the methodology that deals with procedures for taking decisions about one or more lots of finished products manufactured by production processes. Sampling inspection by variables is one of the major classifications of product control and comprises procedures for deciding about the disposition of a lot of individual units based on sample measurements of units on a quality characteristic under study.These procedures are defined under the assumption that the quality characteristic is measurable on a continuous scale, and the functional form of the probability distribution must be known. Inspection procedures which have been developed under the implicit assumption that the quality characteristic is distributed as normal with the related properties are found in the literature of product control. The assumption of normality may not be realized often in practice, and it becomes indispensable to investigate the properties of sampling plans based on nonnormal distributions. In this paper, a single sampling plan by variables is formulated and evaluated when the quality characteristic is assumed to be distributed according to a Pareto distribution. A procedure is developed for determining the parameters of the plan for specified requirements to ensure protection to both the producer and consumer.


Introduction
Sampling inspection is an activity for taking decisions on one or more lots of finished products which have been submitted for inspection.The decision of either acceptance or rejection of the lots is usually taken by adopting suitable sampling inspection procedures called sampling plans.Sampling plans are generally categorized into two types, namely, lot-by-lot sampling by attributes and lot-by-lot sampling by variables.In lot-by-lot inspection by attributes, one or more samples of items are drawn from a given lot of manufactured items; each item in the sample(s) is classified as conforming or nonconforming, and the decision of acceptance or rejection of the lot is made based on a specific rule.In lot-bylot inspection by variables, one or more samples of items are drawn from a given lot; the measurement of a quality characteristic in each sampled item is recorded, and the decision of acceptance or rejection of the lot is made as a function of such measurements.The theory of inspection by variables is applicable when the quality characteristic of sampled items is measurable on a continuous scale and the functional form of the probability distribution is assumed to be known.A variables sampling is advantageous in the sense that it generates more information from each item inspected, requires small sample, and provides same protection when compared to attributes sampling.(See Bowker and Goode [1] and Montgomery [2]).
On the basis of the implicit assumption that the quality characteristic is distributed according to normal with mean  and standard deviation , the concept of variables sampling inspection has been studied by many researchers.Some of the early works on variables sampling inspection are seen in Lieberman and Resnikoff [3], Schilling [4], Owen [5,6], and Hamaker [7].Studies relating to sampling plans when the assumption of normality of the quality characteristic fails or the functional form of the underlying distribution deviates from normal or the form of the distribution is not known are also found in the literature of acceptance sampling.
When the quality characteristic  follows normal distribution with mean  and standard deviation , the problem of designing single sampling plans by variables is addressed with ease.In the industrial situations, quite often, the assumption of normality may not be valid or the quality characteristic may be distributed according to non-normal distributions.In such cases, the selection of variable sampling plans becomes complicated.However, the literature of acceptance sampling traces several works which deal with the designing of variables plans when the quality characteristic follows a probability distribution other than normal.Owen [11] presented a detailed survey on various works related to variable sampling plans with emphasis on nonnormality.
In this paper a study on single sampling plans by variables is formulated under the assumption that the quality characteristics have a Pareto distribution.A procedure for determining the parameters of the proposed plan for specified requirements in terms of producer's and consumer's protection is also developed.

Single Sampling Inspection Plans by Variables
A single sampling inspection plan by variables is defined under the following assumptions.
(a) The quality characteristic, denoted by , is measurable on a continuous scale and has a known form of probability distribution, represented by   (; , ), which is the distribution function of  with mean  and variance  2 .
(b) Each individual unit in a submitted lot has a onesided specification say, lower specification,  or upper specification .If, for a unit,  >  (or  < ), the unit is classified as a nonconforming unit.
The operating procedure of a variable sampling plan is as follows.
Step 1. Draw a random sample of  units from a lot and observe the measurements  1 ,  2 , . . .,   of the quality characteristic .
Step 2. When  is known, accept the lot if  +  ≤  (or  −  ≥ ); otherwise, reject the lot where  is the sample mean.When  is unknown, accept the lot if  +  ≤  (or  −  ≥ ), where Thus, a single sampling plan by variables is designated by two parameters, namely, the sample size, , and the acceptability constant, .When these parameters are known, the plan could be implemented.The explicit expressions for  and  can be derived by specifying two points on the operating characteristic curve of the plan, namely, ( 1 , 1 − ) and ( 2 , ), where  1 and  2 are termed as acceptable quality level (AQL), and the limiting quality level (LQL) associated with the producer's risk  and the consumer's risk , respectively.A sampling plan by variables is termed as a known  or unknown  plan according to whether  is known or unknown.

Operating Characteristic Function
An important measure of the performance of a variables sampling plan is its operating characteristic function, which is a function of the proportion  of non-conforming units called incoming lot quality, and it provides the probability,   () of acceptance of a lot.The plot of   () against  results in a curve called the operating characteristic (OC) curve.For a given upper specification limit  when  is known,  and   () are defined by AQL and LQL, using (1), are defined by where  1 and  2 are the means of the underlying distribution which results in AQL and LQL, respectively.Assume that  is modeled by the Pareto distribution with parameters  and .Then, its probability density function and the cumulative distribution function are, respectively, given by The mean and the variance of the underlying distribution are given by  = /( − 1) for  > 1 and  2 =  2 /( − 1) 2 ( − 2) for  > 2, respectively.Further, the measures of skewness and kurtosis, which are the measures for identifying the asymmetric nature of the distribution, are expressed as  3 = (2(1 + )/( − 3))√( − 2)/ for  > 3 and  4 = 3( − 2)(3 3 +  + 2)/( − 3)( − 4), respectively.From (1) and (3), the lot quality levels , AQL, and LQL using standard Pareto distribution are, respectively, defined by where  = ( − )/,  *  = ( − )/,  *  1 = ( −  1 )/, and The producer's risk  and the consumer's risk  corresponding to AQL and LQL are, respectively, defined from (2) as When  is unknown, the estimate  is used in the decision criterion and hence in the evaluation of  and .

Designing Single Sampling Plans by Variables
In the industrial practice the unknown standard deviation variables plans are more realistic than the known standard deviation variables plans.If the distribution is non-normal, then the designing of unknown  plans is much complicated.
Takagi [12] attempted to provide solution to such problems and proposed a methodology for determining the parameters of variables sampling plans under the conditions of nonnormal populations by introducing an expansion factor in terms of the measures of skewness and kurtosis.The procedures of designing known and unknown standard deviation sampling plans giving protection to the producer and consumer are discussed in [18][19][20][21][22] under the assumption that the quality characteristics under study follow gamma, Laplace, Rayleigh and logistic distributions when the measures of skewness and/or kurtosis are specified.The methodology proposed in [12] using the expansion factor will, now, be discussed for an unknown sigma plan by variables under the assumption of Pareto distribution for the quality characteristic.
In the case of unknown sigma plan, the determination of  and  is usually based on the sampling distribution of + or −.When the underlying distribution is normal,  and  are independently distributed as normal and hence + and  −  are also normally distributed.By this property, explicit forms of  and  could be derived for determining  and .A linear combination of independently distributed variables tends to be normally distributed even when the individual variables depart from normality (see [15]).Further, when the underlying distribution is non-normal, according to [12], the asymptotic distributions of  +  and  −  are normal with the means, respectively, given by   = + and   = −, and with the common variance expressed by where  3 and  4 represent the measures of skewness and kurtosis of the underlying distribution.
It is known that  *  = ( − )/.Thus, for specified  1 and  2 , The operating characteristic function   () for an unknown standard deviation plan is expressed as Assume that  1 and  2 are specified with the risks  of rejecting the lot and  of accepting the lot.Then, corresponding to  and , the normal deviates   and   are, respectively, defined by On substituting the expressions ( 12) for  1 and  2 , ( 14) would result in the following: The mathematical expressions for  and  of an unknown standard deviation plan for specified requirements in terms of the points ( 1 , 1 − ) and ( 2 , ) on the OC curve such that   ( 1 ) = 1 −  and   ( 2 ) = , when  is specified, are derived from (15) as where   and   are the standardized deviates exceeded with the probabilities  and , respectively, and   is an expansion factor, which is a function of  3 and  4 , expressed as   = 1 + ( 2  /4)( 4 − 1) +    3 .The expansion factor is used to obtain the known standard deviation plans with parameters   and   by using the relations    =   /  and    =   .In a similar way, when  is specified, the expressions for  and  are given by where   is expressed by   = 1+( 2  /4)( 4 − 1) −    3 , using which the known standard deviation plan with parameters   and   could be derived as    =   /  and    =   .

Numerical Example
Suppose that a set of measurements yields  3 = 2.2671 and  4 = 11.3626.It is desired to determine a variables sampling plan giving protection to the producer and the consumer in terms of ( 1 = 0.01,  = 0.05) and ( 2 = 0.06,  = 0.10).For the given requirements, the value of  is calculated from the expression of  3 of Pareto distribution as 25.Corresponding to  1 and  2 , the values of  *  1 and  *  2 are determined, respectively, from ( 7) and ( 8) as 3.6970 and 1.7828.The normal deviates   and   are obtained as 1.64485 and 1.28155 by satisfying ( 9) and (10) for specified values of  and .Substituting these values in ( 16) and ( 17), the parameters of the desired plan are determined as  = 57.8256which when rounded becomes 58 and  = 2.6211.The value of   is obtained as 24.7401.Thus, the parameters of a known standard deviation plan are computed as    =   /  = 2.3373 ≈ 3 and    = 2.6211.

A Note on Deriving the Plans Based on Exact Pareto Distribution
It is known that Pareto distribution is a very heavy-tailed distribution for which the th moment is finite when the shape parameter  is greater than .Thus, large observations can be expected in samples from this distribution.Further, the distribution function and the maximum likelihood estimators of the parameters can be written in closed form.As the distribution is not in general symmetric and the results given in the previous sections are based only on the assumption of a symmetric distribution, it would be appropriate if one could derive variables sampling plans using the exact Pareto distribution.Whereas it would be reasonable to assume that the mean is finite, no assumption about finite moments has to be made if one uses the median in lieu of mean.It is to be noted that the median could be used as a measure of location when a distribution is skewed (say, for instance, a Pareto distribution) or when end values are not known or when one requires reduced importance to be attached to outliers.When sample median is used in the operating procedure of a sampling plan for making a decision about the lot, the explicit expressions for the plan parameters are not plausible to derive.In such a case, the procedure for finding the parameters based on exact Pareto distribution will be based on the sampling distribution of sample median and will be much complicated.

Conclusion
Sampling inspection by variables is the methodology used for deciding about the disposition of a lot based on sample measurements of units on a quality characteristic under study.Such a procedure is defined under the assumption that the quality characteristic is measurable on a continuous scale and the functional form of the probability distribution is known.Though sampling plans by variables under the implicit assumption that the quality characteristic is distributed as normal have been developed with the related properties, it should be realized that the assumption of normality is seldom hold in practice.Hence, it is inevitable to develop sampling plans by variables when the underlying quality characteristics follow non-normal distributions, which may be skewed or symmetric.In this paper a single sampling plan by variables is formulated and evaluated when the quality characteristic is distributed according to a Pareto distribution.The approximation of normal distribution for a linear combination of independent variables even when the individual variables depart from normality is effectively used in developing the procedure for determining the parameters of the plan when requirements for producer's and the consumer's protection are specified.