DDNS Discrete Dynamics in Nature and Society 1607-887X 1026-0226 Hindawi Publishing Corporation 10.1155/2014/701074 701074 Research Article Multiple-Decision Procedures for Testing the Homogeneity of Mean for k Exponential Distributions http://orcid.org/0000-0001-7991-4195 Chen Han-Ching Shan Her Pei http://orcid.org/0000-0002-9818-3137 Wang Nae-Sheng Yin Yunqiang Department of Statistics Feng Chia University No. 100 Wenhua Road Xitun District Taichung City 407 Taiwan fcu.edu.tw 2014 1982014 2014 25 06 2014 04 08 2014 19 8 2014 2014 Copyright © 2014 Han-Ching Chen et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

In multiple-decision procedures, a crucial objective is to determine the association between the probability of a correct decision (CD) and the sample size. A review of some methods is provided, including a subset selection formulation proposed by Huang and Panchapakesan, a multidecision procedure for testing the homogeneity of means by Huang and Lin, and a similar procedure for testing the homogeneity of variances by Lin and Huang. In this paper, we focus on the use of the Lin and Huang method for testing the null hypothesis H0 of homogeneity of means for k exponential distributions. We discuss the decision rule R, evaluation of the critical value C, and the infimum of P(CDR) for k independent random samples from k exponential distributions. In addition, we also observed that a lower bound for the probability of CD relative to the number of the common sample size is determined based on the desired probability of CD when the largest mean is sufficiently larger than the other means. We explain the results by using two examples.

1. Introduction

A multiple-decision problem can be defined as a situation where a person or a group of people must select the number of possible actions from a given finite set. Gupta and Huang  and Lin and Gupta  presented the selection procedures relevant to multiple-decision theory, including indifference zone selection and subset selection. They suggested that preferences among alternatives can be determined by maximizing the expected value of a numerical utility function or equivalently minimizing the expected value of a loss function. They indicated that the subset selection procedures have been studied and applied widely in determining the required sample size, which is the number of replications or batches used for selecting the optimal population among k populations and for selecting a subset.

Huang and Panchapakesan  suggested a modification of the subset selection formulation on the largest mean and the smallest variance. Huang and Lin  presented a multidecision procedure for testing the homogeneity of means when the sample sizes and unknown variance are unequal. Lin and Huang  used a similar procedure for testing the hypothesis H0 regarding the homogeneity of the variances. The purpose of this paper was to use the Lin and Huang method for testing the hypothesis H0 regarding the homogeneity of the means for k exponential distributions. When H0, the hypothesis, is rejected, the main objective was to obtain a nonempty subset E of the k populations that will include the population related to the largest means (called the best population). In this case, a correct decision (CD) is said to occur if the selected subset E contains the best populations.

The paper is organized as follows. In Section 2, we introduce the definitions and notations of decision rule R for k exponential distributions. In Section 3, we discuss the evaluation of the critical value of our test and the infimum of the probability of a correct decision CD. In Section 4, the performance of the method is illustrated with two examples and the behavior of our procedure is analyzed. Finally, concluding remarks are provided in Section 5.

2. Related Concepts of the Decision Rule

In this section, we use the Lin and Huang  method to identify the decision rule R for k exponential distributions.

Let Xi1,Xi2,,Xini, i=1,2,,k, be k independent random samples from k exponential distribution Γ(1,θi), θi>0, i=1,2,,k. We define the τi=θi/(j=1kθj)1/k as the distance between θi and all other θjs.

Then the MLE of τi is (1)τ^i=θ^i(j=1kθ^j)1/k, where (2)θ^i=1nij=1niXij=X-i.i=1,2,,k. For testing H0:θ1=θ2=,θk, the test statistic that arises naturally is max1ikτ^i.

We now present the steps of decision rule R for k exponential distribution as follows.

First, given α, where 0<α<1, we want to find a C such that the condition (3)P{max1ikτ^iCH0}α, where C is the critical value for the decision rule R and α is a given probability of Type I error at level α.

Second, given Δ>0 and P*, where 1/k<P*<1, we want to find a nonempty subset E={1jkτ^jC} of the k populations that contains the best populations and it is necessary that infθ_ΩΔP{CDR}P*, where P(CDR)=PΩΔ(max1ikτ^iC  and  τ^(k)C) and ΩΔ={θ_=(θ1,,θk)τ[k]Δ}, where τττ[k] denote the ordered τi and τ^i=θ^i/(j=1kθ^j)1/k and θ^(k) is associated with the population having the largest θ[k].

3. Assessment of the Critical Value <inline-formula><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" id="M59"><mml:mrow><mml:mi>C</mml:mi></mml:mrow></mml:math></inline-formula> and the Infimum of <inline-formula><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML" id="M60"><mml:mi>P</mml:mi><mml:mo stretchy="false">(</mml:mo><mml:mtext>CD</mml:mtext><mml:mrow><mml:mo>∣</mml:mo></mml:mrow><mml:mi>R</mml:mi><mml:mo stretchy="false">)</mml:mo></mml:math></inline-formula>

In this section, we want to estimate the critical value C and the infimum of P(CDR) for k exponential distribution.

Lemma 1.

Let Xi2,,Xini, i=1,2,,k, be k independent random samples from k exponential distribution Γ(1,θi), i=1,2,,k. The MLE of θi is (4)θ^i=1nij=1niXij=X-i.i=1,2,,k,niθ^i~Γ(ni,θi),θ^iθi~Γ(ni,1ni).

Lemma 2.

The MLE of τi is (5)τ^i=θ^i(j=1kθ^j)1/k,forθ^i=X-i.i=1,2,,k,lnτ^i-lnτi=j=1kcjlnθ^jθj, where (6)cj={1-1kj=i,-1kji. Thus, lnτ^i-lnτi is a linear combination of independent log-gamma random variables with coefficients (1-(1/k)) for ln(θ^i/θi) and -1/k for ln(θ^j/θj), j=1,2,,k.

Lemma 3.

According to the Lin and Huang  appendix, we can get (7) Pr (lnτ^i-lnτi<C)11+exp{-π(C-bi)/3ai}, where (8)ai=(k-1)2k2ni+1k2j=1jik1nj,bi=-12ni+1kj=1k12nj.

Theorem 4.

Under the same assumption of Lemma 1, for testing H0:θ1=θ2=,θk, given the samples sizes n1,,nk and 0<α<1, the critical value C for the decision rule R satisfies the Pr {max1ikτ^iCH0}α which is approximately C=exp((3a[k]/π)ln((k/α)-1)+b[k]), where a[k]=maxiai, b[k]=maxibi and (ai,bi) are given by (8). Further, given Δ>0, one then has infθ_ΩΔP{ CD R}1/(1+exp{π(lnC-Δ-b)/3a})=P*, where b=minibi and a=miniai.

Proof.

Under H0, we have τi=θi/(j=1kθj)1/k=1 for each i=1,2,,k.

Therefore, the lnτi=0 for each i=1,2,,k. And (9)Pr{max1ikτ^iCH0}=Pr{max1iklnτ^ilnC,for  some  i=1,,kH0}=Pr{lnτ^i-lnτilnC-lnτi,for  some  i,iiiiii  i=1,2,,kH0}i=1kPr{lnτ^i-lnτilnC-lnτi}=i=1kPr{lnτ^i-lnτilnC}(since  thelnτi=0)i=1k(1-11+exp{-π(lnC-bi)/3ai})iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii(by  Lemma  3)=i=1k[11+exp{(πlnC-bi)/3ai}]k1+exp{π(lnC-b[k])/3a[k]}α(say). We have (10)1+exp{π(lnC-b[k])3a[k]}=kα,lnC-b[k]=3a[k]πln(kα-1). Therefore, the critical value C is (11)C=exp(3a[k]πln(kα-1)+b[k]). However (12)infθ_ΩΔP{CDR}=infθ_ΩΔ{Pr(τ^(k)C)}=infθ_ΩΔ{Pr(lnτ^(k)lnc)}infθ_Ωdp(lnτ^(k)-lnτ[k]lnC-lnΔ)1-11+exp{-π(lnC-lnΔ-b(k))/3a(k)}=11+exp{π(lnC-lnΔ-b(k))/3a(k)}iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii(by  Lemma  3)11+exp{π(lnC-lnΔ-b)/3a}P*, which is the desired result.

Theorem 5.

Under the same assumption of Lemma 1 and assuming n1=n2==nk=n, given level α, where 0<α<1, and n, the critical value C is (13)C=exp(3π(k-1)knln(kα-1)). Furthermore, given P*, where 1/k<P*<1 and Δ>0, under the decision rule that R satisfies Pr {max1iklnτ^ilnCH0}α and infθ_ΩΔP{ CD R}P*, we have the common sample size n as follows: (14)n=[(k-1)k(ln{(k-α)P*}-ln{α(1-P*)}(3/π)lnΔ)2]+1, where [x] denotes the lowest integer greater than or equal to x.

Proof.

By Theorem 4, ni=n, i=1,2,,k, we have (15)ai=k-1nk,bi=0for  each  i=1,2,,k, and the critical value C is (16)C=exp(3a(k)πln(kα-1)+b(k))=exp(3πk-1nkln(kα-1)+b(k))=exp(3πk-1nkln(kα-1)), which is the desired result.

Given P*, where 1/k<P*<1 and Δ>0, using the propriety of Theorem 4, we have (17)infθ_ΩΔP{CDR}11+exp{π(lnC-lnΔ)/3a}LetP*, and we have π(lnC-lnΔ)/3aln(1/P*-1).

Using (15) and C=exp((3a/π)(k-1)/nkln((k/α)-1)), we then have the minimal sample size n as follows: (18)n[(k-1)k(ln{(k-α)P*}-ln{α(1-P*)}(3/π)lnΔ)2]+1.

Remark 6.

The θ^i defined in this study fulfills Lawless Corollary 4.1.1. (Type II censored test property) . When the observations are Type II censored data, we can take θ^i=2Ti/θi, where T=i=1rt(i)+(n-r)t(r) and t(r) are the first r ordered observation of a random sample of size n from the exponential distribution. In this case, the T=i=1rwi, where wi=nt(i), i=1,2,r, remain unchanged.

4. Examples

In this section, we provide two examples to explain the results of performing Theorems 4 and 5.

Example 1.

This example is from Nelson . In this example, the results of a life test experiment are described in which specimens of electrical insulating fluid were subjected to a constant voltage stress. The length of time until each specimen failed, or “broke down,” was observed. Table 1 gives results for five groups of specimens, tested at voltages ranging from 30 to 38 kilovolts (kV). We use the data on times to breakdown (in minutes) at each of the five voltage levels for our example.

The computed values are given in Table 2 based on the assumption that α=0.01.

We obtained C=2.8511. Because τ^1C and τ^2C, using the decision rule R, we reject H0:τ^1=τ^2=τ^3=τ^4=τ^5 and select the subset containing populations 1 and 2. We identified these two populations as contributing substantially. We claim that the select subset contains the population with the largest mean.

For selected values of Δ,Δ=2(0.2)3.2,infθ_ΩΔP{CDR}=P* is tabulated in Table 3. The probability of a correct decision is at least 0.3201 when Δ is 2. This probability increases to 0.9589 when Δ is 3.2.

Times to breakdown (in minutes) at each of the five voltage levels.

Voltage level (kV) n 1 Breakdown times
30 11 17.05, 22.66, 21.02, 175.88, 139.07, 144.12, 20.46, 43.40, 194.90, 47.30, 7.74
32 15 0.40, 82.85, 9.88, 89.29, 215.10, 2.75, 0.79, 15.93, 3.91, 0.27, 0.69, 100.58, 27.80, 13.95, 53.24
34 19 0.96, 4.15, 0.19, 0.78, 8.01, 31.75, 7.35, 6.50, 8.27, 33.91, 32.52, 3.16, 4.85, 2.78, 4.67, 1.31, 12.06, 36.71, 72.89
36 15 1.97, 0.59, 2.58, 1.69, 2.71, 25.50, 0.35, 0.99, 3.99, 3.67, 2.07, 0.96, 5.35, 2.90, 13.77
38 8 0.47, 0.73, 1.40, 0.74, 0.39, 1.13, 0.09, 2.38

Computed values.

1 2 3 4 5
n i 11 15 19 15 8
a i 0.0706 0.0561 0.0477 0.0561 0.0911
b i −0.0053 0.0069 0.0139 0.0069 −0.0223
θ ^ i 75.981 41.174 14.35895 4.606 0.9162
τ ^ i 6.6858 3.623 1.2635 0.4053 0.0806

Infimum of P{CDR}.

 Δ 2 2.2 2.4 2.6 2.8 3.0 3.2 P * 0.3201 0.5095 0.6815 0.8062 0.8850 0.9318 0.9589
Example 2.

Based on the same assumption as Theorem 5, given the number of populations k, k=3,4,5, and 6, as well as α=0.05 and 0.01 and Δ=1.5(0.1)2.5 and P*=0.6,0.8,0.9, and 0.95, we can determine n by using (14), so that infθ_ΩΔP{CDR}P*. Several selected combinations of n in each case are tabulated in Tables 4, 5, 6, and 7 which show the populations k that have the minimal sample size n required to satisfy the P*.

Sample size n for k=3,4,5,6, α=0.01 (0.05), and P*=0.60.

k Δ
1.5 1.6 1.7 1.8 1.9 2.0 2.1 2.2 2.3 2.4 2.5
3 69 (38) 52 (28) 41 (23) 33 (19) 28 (16) 24 (14) 22 (12) 19 (11) 17 (10) 16 (9) 15 (9)
4 84 (48) 63 (36) 50 (28) 41 (23) 35 (20) 30 (17) 26 (15) 23 (14) 21 (12) 19 (11) 18 (11)
5 96 (56) 72 (42) 57 (33) 47 (27) 39 (23) 34 (20) 30 (18) 27 (16) 24 (14) 22 (13) 20 (12)
6 106 (62) 79 (47) 62 (37) 51 (30) 43 (26) 37 (22) 33 (20) 29 (18) 26 (16) 24 (14) 22 (13)

Note: the numbers in parentheses represent the fitted values at the level; α=0.05.

Sample size n for k=3,4,5,6, α=0.01 (0.05), and P*=0.80.

k Δ
1.5 1.6 1.7 1.8 1.9 2.0 2.1 2.2 2.3 2.4 2.5
3 92 (55) 69 (42) 54 (33) 45 (27) 38 (23) 32 (20) 29 (18) 25 (16) 23 (14) 21 (13) 19 (12)

4 112 (69) 84 (51) 66 (41) 54 (33) 46 (28) 39 (24) 34 (22) 31 (19) 28 (17) 25 (16) 23 (15)
5 126 (79) 94 (59) 74 (47) 61 (38) 51 (32) 44 (28) 39 (25) 35 (22) 31 (20) 28 (18) 26 (17)
6 138 (87) 103 (65) 81 (52) 66 (42) 56 (36) 48 (31) 42 (26) 38 (24) 34 (22) 31 (20) 28 (18)

Note: the numbers in parentheses represent the fitted values at the level; α=0.05.

Sample size n for k=3,4,5,6, α=0.01 (0.05), and P*=0.90.

k Δ
1.5 1.6 1.7 1.8 1.9 2.0 2.1 2.2 2.3 2.4 2.5
3 114 (72) 85 (54) 67 (43) 55 (35) 46 (30) 40 (26) 35 (23) 31 (20) 28 (18) 26 (17) 24 (15)
4 137 (89) 103 (67) 81 (53) 66 (43) 56 (36) 48 (31) 42 (28) 37 (25) 34 (22) 31 (20) 28 (19)
5 154 (101) 115 (76) 91 (60) 74 (49) 63 (41) 54 (36) 47 (31) 42 (28) 38 (25) 34 (23) 31 (21)
6 168 (111) 125 (83) 99 (66) 81 (54) 68 (45) 58 (39) 51 (34) 45 (30) 41 (27) 37 (25) 34 (23)

Note: the numbers in parentheses represent the fitted values at the level; α=0.05.

Sample size n for k=3,4,5,6, α=0.01 (0.05), and P*=0.95.

k Δ
1.5 1.6 1.7 1.8 1.9 2.0 2.1 2.2 2.3 2.4 2.5
3 136 (90) 102 (68) 80 (53) 66 (44) 55 (37) 48 (32) 42 (28) 37 (25) 33 (23) 30 (21) 28 (19)

4 163 (110) 122 (82) 96 (65) 78 (53) 66 (45) 57 (39) 50 (34) 44 (30) 40 (27) 36 (25) 33 (23)
5 183 (124) 136 (93) 107 (73) 88 (60) 74 (51) 64 (44) 56 (38) 49 (34) 44 (31) 40 (28) 37 (26)

6 198 (136) 148 (102) 116 (80) 95 (65) 80 (55) 69 (47) 60 (42) 53 (37) 48 (33) 44 (30) 40 (28)

Note: the numbers in parentheses represent the fitted values at the level; α=0.05.

5. Concluding Remarks

In this study, we considered the methods of the Lin and Huang theorems to propose a framework for analyzing and synthesizing multiple-decision procedures used for testing the homogeneity of means for k exponential distributions . We provided two examples and present the main results to explain Theorems 4 and 5 which can select the subset containing the population with the largest mean and effectively determine common sample size n to satisfy the requirement of P*. This paper presents the use of one technique to both select the optimal system among k systems and construct an optimal rule for selecting a subset of k independent random samples. We suggest employing the methods to facilitate the development of traditional statistical analyses used in the methodologies, techniques, and software applied in performing multiple-decision procedures for testing the homogeneity of means for k exponential distributions problems, such as life testing and reliability engineering.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Gupta S. S. Huang D. Y. A note on optimal subset selection procedures The Annals of Statistics 1980 8 5 1164 1167 10.1214/aos/1176345154 MR585715 Lin X. Gupta S. S. Multiple decision theory: Ranking and selection problems [Ph.D. thesis] 1999 Purdue University Huang D. Y. Panchapakesan S. A modified subset selection formulation with special reference to one-way and two-way layout experiments Communications in Statistics A: Theory and Methods 1976 5 7 621 633 MR0411021 Huang D. Lin C. Panchapakesan S. Balakrishnan N. Multiple decision procedures for testing homogeneity of normal means with unequal unknown variances Advances in Statistical Decision Theory and Applications 1997 Boston, Mass, USA Birkhäuser 253 262 Statistics for Industry and Technology MR1479189 Lin C. Huang D. On some multiple decision procedures for normal variances Communications in Statistics: Simulation and Computation 2007 36 1–3 265 275 10.1080/03610910601158435 MR2370901 2-s2.0-33947179002 Lawless J. F. Statistical Models and Methods for Lifetime Data 2003 2nd Wiley-Interscience New York MR1940115 Nelson W. B. Graphical analysis of accelerated life test data with the inverse power law model IEEE Transactions on Reliability 1972 21 2 11