The Probability That a Measurement Falls within a Range of n Standard Deviations from an Estimate of the Mean

We derive a general equation for the probability that a measurement falls within a range of n standard deviations from an estimate of the mean. This provides a format that is compatible with a confidence interval centered about the mean and is naturally independent of the sample size. The equation is derived by interpolating theoretical results for the extreme sample sizes. The equation is then confirmed at an intermediate sample size with a computational test.


Introduction
A confidence interval is an interval within which a measurement or trial falls with a given probability [1, 2]. In statistics, confidence intervals centered about an estimate of the mean target the mean [1]. Because these confidence intervals rely on the standard error of the estimate, they widen as the sample size decreases and narrow as the sample size increases. Consequently, the confidence interval, in statistics, is often a margin of error for the estimate of the mean [3].
We are interested in finding the probability of a confidence interval that is centered about an estimate of the mean, targets an arbitrary measurement, and is independent of the sample size. In this way, we provide a format that is compatible with a confidence interval centered about the mean that is naturally independent of sample size, but has a width of n standard deviations. We approach this problem by considering the known case of a confidence interval centered about the mean, and we derive the associated probability. We relate this result to the case of infinite sample size. The next step is to calculate the probability associated with the minimum sample size of one. This requires that we average the probability over the different possible sample values. Finally, we propose an equation for the probability that naturally interpolates our results, and we show that the equation is consistent with intermediate probability values by comparing it to estimates of the probability produced by computer simulation.

Expected Probability
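In order to make the necessary confidence interval calculations, we will have to determine an expected, or average, probability. This can be examined by considering the cumulative distribution function of a normal distribution with mean $\mu$ and standard deviation $\sigma$ [4, 5],

$$F(x) = \frac{1}{2}\left[1 + \operatorname{erf}\left(\frac{x - \mu}{\sigma\sqrt{2}}\right)\right],$$

where $\operatorname{erf}(x)$ is the so-called error function [6, 7],

$$\operatorname{erf}(x) = \frac{2}{\sqrt{\pi}}\int_{0}^{x} e^{-t^{2}}\,dt.$$

Now, we know that

$$P(X \le x) = F(x).$$

Rather than specifying a value of $x$, we would like to compute the probability averaged over all possible values of $x$. So we have

$$\langle P(X \le x) \rangle = \int_{-\infty}^{\infty} F(x)\,f(x)\,dx,$$

where $f(x)$ is the probability density of $x$. With the substitution $z = (x - \mu)/(\sigma\sqrt{2})$, this yields

$$\langle P(X \le x) \rangle = \frac{1}{2} + \frac{1}{2\sqrt{\pi}}\int_{-\infty}^{\infty} \operatorname{erf}(z)\,e^{-z^{2}}\,dz.$$

Since $\operatorname{erf}(z)$ is an odd function and $e^{-z^{2}}$ is even, the integral vanishes, and we have

$$\langle P(X \le x) \rangle = \frac{1}{2}.$$

This can be easily confirmed by an alternative calculation. The probability density $f(x)$ is

$$f(x) = \frac{1}{\sigma\sqrt{2\pi}}\exp\left[-\frac{(x - \mu)^{2}}{2\sigma^{2}}\right].$$

Therefore, we can write [8]

$$f(x) = \frac{dF}{dx}.$$

The expected value [9] of the cumulative distribution function is thus

$$\langle F \rangle = \int_{-\infty}^{\infty} F(x)\,f(x)\,dx = \int_{-\infty}^{\infty} F\,\frac{dF}{dx}\,dx. \tag{2.11}$$

Note that when $x \to \infty$, $F \to 1$, and when $x \to -\infty$, $F \to 0$. Therefore, we can write

$$\langle F \rangle = \int_{0}^{1} F\,dF = \frac{1}{2}. \tag{2.12}$$

Consequently, as per (2.4), we can write the expected probability as

$$\langle P(X \le x) \rangle = \frac{1}{2}. \tag{2.13}$$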
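The value of one half for the expected cumulative probability can be checked numerically. The following is a minimal sketch, not part of the original derivation: the function names, the integration range of ten standard deviations, and the trial count are all illustrative assumptions. It averages the normal cumulative distribution function over its own density, once by trapezoidal quadrature and once by Monte Carlo sampling.

```python
import math
import random


def normal_cdf(x, mu=0.0, sigma=1.0):
    """Normal cumulative distribution function written with the error function."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))


def normal_pdf(x, mu=0.0, sigma=1.0):
    """Normal probability density."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def expected_cdf_quadrature(mu=0.0, sigma=1.0, half_width=10.0, steps=20000):
    """Trapezoidal estimate of the integral of F(x) f(x) over mu +/- half_width * sigma.

    half_width and steps are assumed values chosen so that the truncated tails
    are negligible.
    """
    a = mu - half_width * sigma
    h = 2.0 * half_width * sigma / steps
    total = 0.0
    for k in range(steps + 1):
        x = a + k * h
        weight = 0.5 if k in (0, steps) else 1.0
        total += weight * normal_cdf(x, mu, sigma) * normal_pdf(x, mu, sigma)
    return total * h


def expected_cdf_monte_carlo(mu=0.0, sigma=1.0, trials=100_000, seed=0):
    """Average F(x) over normally distributed draws of x."""
    rng = random.Random(seed)
    total = sum(normal_cdf(rng.gauss(mu, sigma), mu, sigma) for _ in range(trials))
    return total / trials


print("quadrature :", expected_cdf_quadrature())
print("monte carlo:", expected_cdf_monte_carlo())
```

Both estimates should lie close to 0.5 for any choice of the mean and standard deviation, since the average does not depend on either parameter.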

The Probability of a Confidence Interval
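We want to determine the probability given as

$$P(\bar{x} - n\sigma \le x \le \bar{x} + n\sigma),$$

where

$$\bar{x} = \frac{1}{N}\sum_{i=1}^{N} x_i$$

is the estimate of the mean, $\mu$. At the maximal value of $N$, we find that

$$\lim_{N \to \infty} \frac{1}{N}\sum_{i=1}^{N} x_i = \mu. \tag{3.3}$$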
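For a normal distribution, the probability that a measurement falls within $n$ standard deviations ($n\sigma$) of the mean $\mu$, i.e., within the interval $[\mu - n\sigma, \mu + n\sigma]$, is given by

$$P(\mu - n\sigma \le x \le \mu + n\sigma) = \operatorname{erf}\left(\frac{n}{\sqrt{2}}\right). \tag{3.7}$$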
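Observe that we can express (3.7) in terms of the cumulative distribution function $F$. Thus, we can write

$$P(\mu - n\sigma \le x \le \mu + n\sigma) = F(\mu + n\sigma) - F(\mu - n\sigma).$$

The minimal value of $N$ is one, so that $\bar{x} = x_1$. In this case, we have

$$P(x_1 - n\sigma \le x \le x_1 + n\sigma) = F(x_1 + n\sigma) - F(x_1 - n\sigma). \tag{3.9}$$

We find that (3.9) reduces into

$$P(x_1 - n\sigma \le x \le x_1 + n\sigma) = \frac{1}{2}\left[\operatorname{erf}\left(\frac{x_1 - \mu + n\sigma}{\sigma\sqrt{2}}\right) - \operatorname{erf}\left(\frac{x_1 - \mu - n\sigma}{\sigma\sqrt{2}}\right)\right]. \tag{3.10}$$

The probability in (3.10) is written in terms of the value $x_1$. Consequently, we can derive an expected probability by averaging over $x_1$ with the normal probability density $f$, that is, by computing

$$\langle P(x_1 - n\sigma \le x \le x_1 + n\sigma) \rangle = \int_{-\infty}^{\infty} P(x_1 - n\sigma \le x \le x_1 + n\sigma)\,f(x_1)\,dx_1 = \operatorname{erf}\left(\frac{n}{2}\right). \tag{3.15}$$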
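The averaging over the single sample value $x_1$ can be reproduced numerically. The sketch below is an illustration under stated assumptions rather than the computation used for the paper's table: the function names, the trapezoidal rule, and the integration range are hypothetical choices. It integrates the difference of cumulative distribution values at $x_1 \pm n\sigma$ against the normal density of $x_1$ and prints the result next to $\operatorname{erf}(n/2)$.

```python
import math


def normal_pdf(x, mu=0.0, sigma=1.0):
    """Normal probability density."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def interval_probability(x1, n, mu=0.0, sigma=1.0):
    """F(x1 + n*sigma) - F(x1 - n*sigma) for the normal CDF F, written with erf."""
    s = sigma * math.sqrt(2.0)
    upper = math.erf((x1 - mu + n * sigma) / s)
    lower = math.erf((x1 - mu - n * sigma) / s)
    return 0.5 * (upper - lower)


def expected_interval_probability(n, mu=0.0, sigma=1.0, half_width=10.0, steps=20000):
    """Trapezoidal average of interval_probability over the density of x1.

    half_width and steps are assumed values; the tails beyond ten standard
    deviations contribute a negligible amount.
    """
    a = mu - half_width * sigma
    h = 2.0 * half_width * sigma / steps
    total = 0.0
    for k in range(steps + 1):
        x1 = a + k * h
        weight = 0.5 if k in (0, steps) else 1.0
        total += weight * interval_probability(x1, n, mu, sigma) * normal_pdf(x1, mu, sigma)
    return total * h


# Columns: n, numerically averaged probability, erf(n/2).
for n in (0.5, 1.0, 2.0, 3.0):
    print(n, expected_interval_probability(n), math.erf(n / 2.0))
```

The two printed columns should agree to within the quadrature error, because the difference between a measurement and an independent single sample is itself normal with variance $2\sigma^2$.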
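A portion of the numerical results is presented in Table 1. Based on (3.7) and (3.15), we can propose the equation

$$P(\bar{x} - n\sigma \le x \le \bar{x} + n\sigma) = \operatorname{erf}\left(\frac{n}{\sqrt{2}\,(\sqrt{2})^{1/N}}\right), \tag{3.16}$$

in which we are by default referring to the expected probability. Equation (3.16) clearly converges for the extreme estimates of the mean, since at $N = 1$ it gives $\operatorname{erf}(n/2)$ and

$$\lim_{N \to \infty} \operatorname{erf}\left(\frac{n}{\sqrt{2}\,(\sqrt{2})^{1/N}}\right) = \operatorname{erf}\left(\frac{n}{\sqrt{2}}\right). \tag{3.17}$$

A plot of (3.16) for four values of $N$ is shown in Figure 1. Observe that the curve for $N = 2$ is intermediate to the other curves. Consequently, we can have confidence in (3.16) if we can show that it is valid for $N = 2$.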
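The interpolating equation is simple to evaluate directly. As a minimal sketch (the function name `proposed_probability` is a hypothetical label, not one used in the paper), the code below computes $\operatorname{erf}\left(n / (\sqrt{2}\,(\sqrt{2})^{1/N})\right)$ and prints it for several sample sizes, so that the behavior at $N = 1$ and at large $N$ can be inspected.

```python
import math


def proposed_probability(n, N):
    """Evaluate erf(n / (sqrt(2) * sqrt(2)**(1/N))) for interval half-width n
    (in standard deviations) and sample size N >= 1."""
    if N < 1:
        raise ValueError("sample size N must be at least 1")
    return math.erf(n / (math.sqrt(2.0) * math.sqrt(2.0) ** (1.0 / N)))


n = 1.0
print("erf(n/2)       :", math.erf(n / 2.0))
for N in (1, 2, 10, 100, 10**6):
    print("N =", N, ":", proposed_probability(n, N))
print("erf(n/sqrt(2)) :", math.erf(n / math.sqrt(2.0)))
```

For a fixed $n$ the printed values increase with $N$, starting at $\operatorname{erf}(n/2)$ and approaching $\operatorname{erf}(n/\sqrt{2})$.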

Computational Test
We can estimate $P(\bar{x} - n\sigma \le x \le \bar{x} + n\sigma)$ computationally. Simulate the normal, independent random variables $X$ and $\{X_i\}$, for which

$$x \in X, \quad x_i \in X_i. \tag{4.1}$$

Let the condition $\beta$ be

$$\bar{x} - n\sigma \le x \le \bar{x} + n\sigma.$$

If $M_\beta$ is the number of trials in which the condition $\beta$ is met and $M$ is the total number of trials, then an estimate of $P$ is given as

$$\hat{P} = \frac{M_\beta}{M}.$$

Figure 2 shows a plot of $\hat{P}$ versus $P$ for $N = 2$ and $M = 2000$.
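The simulation described above can be sketched in a few lines. This is an illustrative version only: the function names, the random seed, the standard normal parameters, and the grid of $n$ values are assumptions, and the original simulation code is not given in the paper. Each trial draws $N$ sample values and one further measurement, forms the sample mean, and counts the trial when the measurement lies within $n\sigma$ of that mean.

```python
import math
import random


def simulate_probability(n, N, M=2000, mu=0.0, sigma=1.0, seed=0):
    """Fraction of M trials in which a fresh measurement x falls within
    n*sigma of the mean of N independent sample values."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(M):
        sample_mean = sum(rng.gauss(mu, sigma) for _ in range(N)) / N
        x = rng.gauss(mu, sigma)
        if sample_mean - n * sigma <= x <= sample_mean + n * sigma:
            hits += 1
    return hits / M


def proposed_probability(n, N):
    """The interpolating equation erf(n / (sqrt(2) * sqrt(2)**(1/N)))."""
    return math.erf(n / (math.sqrt(2.0) * math.sqrt(2.0) ** (1.0 / N)))


# Columns: n, simulated estimate, value of the proposed equation (N = 2, M = 2000).
for n in (0.5, 1.0, 1.5, 2.0, 2.5, 3.0):
    print(n, simulate_probability(n, N=2, M=2000), proposed_probability(n, 2))
```

With $M = 2000$ the standard error of each simulated estimate is at most about 0.011, so the scatter of the first column around the second should be read with that resolution in mind. Increasing `M` tightens the comparison.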

Conclusions
We have derived a general equation for the probability that a measurement falls within a range of n standard deviations from an estimate of the mean. This provides a format that is compatible with a confidence interval centered about the mean and is naturally independent of the sample size. It is consistent with our equation that the probability decreases as the sample size decreases. However, for sample sizes greater than ten, the value of the probability begins to converge. The equation for the probability is derived by considering the minimal and maximal sample sizes and producing an equation that naturally interpolates the results. Computer simulation is used to estimate the probability for the sample size $N = 2$, which produces intermediate results that are in strong agreement with the general equation.

Figure 1: A plot of the equation $P(\bar{x} - n\sigma \le x \le \bar{x} + n\sigma) = \operatorname{erf}\left(n / (\sqrt{2}\,(\sqrt{2})^{1/N})\right)$ for four different values of $N$.

Figure 2: A plot of the simulated estimate $\hat{P}$ versus the probability $P$ for $N = 2$ and $M = 2000$.