Statistical Details for the Distribution Platform

This section contains statistical details for Distribution options and reports.

Statistical Details for Standard Error Bars

Standard errors bars are calculated using the standard error

This section describes how quantiles are computed.

To compute the pth quantile of n non-missing values in a column, arrange the n values in ascending order and call these column values y1, y2, ..., yn. Compute the rank number for the pth quantile as p / 100(n + 1).

•	If the result is an integer, the pth quantile is that rank’s corresponding value.

•	If the result is not an integer, the pth quantile is found by interpolation. The pth quantile, denoted qp, is computed as follows:

where:

‒	n is the number of non-missing values for a variable

‒	y1, y2, ..., yn represents the ordered values of the variable

‒	yn+1 is taken to be yn

‒	i is the integer part and f is the fractional part of (n+1)p.

‒	(n + 1)p = i + f

For example, suppose a data table has 15 rows and you want to find the 75th and 90th quantile values of a continuous column. After the column is arranged in ascending order, the ranks that contain these quantiles are computed as follows:

and

The value y12 is the 75th quantile. The 90th quantile is interpolated by computing a weighted average of the 14th and 15th ranked values as y90 = 0.6y14 + 0.4y15.

Statistical Details for Summary Statistics

This section contains statistical details for specific statistics in the Summary Statistics report.

Mean

The mean is the sum of the non-missing values divided by the number of non-missing values. If you assigned a Weight or Freq variable, the mean is computed by JMP as follows:

1.	Each column value is multiplied by its corresponding weight or frequency.

2.	These values are added and divided by the sum of the weights or frequencies.

Std Dev

The standard deviation measures the spread of a distribution around the mean. It is often denoted as s and is the square root of the sample variance, denoted s2.

Std Err Mean

The standard error means is computed by dividing the sample standard deviation, s, by the square root of N. In the launch window, if you specified a column for Weight or Freq, then the denominator is the square root of the sum of the weights or frequencies.

Skewness

Skewness is based on the third moment about the mean and is computed as follows:

where

and wi is a weight term (= 1 for equally weighted items).

Kurtosis

Kurtosis is based on the fourth moment about the mean and is computed as follows:

where wi is a weight term (= 1 for equally weighted items). Using this formula, the Normal distribution has a kurtosis of 0.

Statistical Details for the Normal Quantile Plot

The empirical cumulative probability for each value is computed as follows:

where ri is the rank of the ith observation, and N is the number of non-missing (and nonexcluded) observations.

The normal quantile values are computed as follows:

where Φ is the cumulative probability distribution function for the normal distribution.

These normal quantile values are Van Der Waerden approximations to the order statistics that are expected for the normal distribution.

Statistical Details for the Wilcoxon Signed Rank Test

The Wilcoxon signed-rank test can be used to test for the median of a single population or to test matched-pairs data for a common median. In the case of matched pairs, the test reduces to testing the single population of paired differences for a median of 0. The test assumes that the underlying population is symmetric.

The Wilcoxon test allows tied values. The test statistic is adjusted for differences of zero using a method suggested by Pratt. See Lehman (2006), Pratt (1959), and Cureton (1967).

Testing for the Median of a Single Population

•	There are N observations:

X1, X2, ..., XN

•	The null hypothesis is:

H0: median = m

•	The differences between observations and the hypothesized value m are calculated as follows:

Dj = Xj - m

Testing for the Equality of Two Population Medians with Matched Pairs Data

A special case of the Wilcoxon signed-rank test is applied to matched-pairs data.

•	There are N pairs of observations from two populations:

X1, X2, ..., XN and Y1, Y2, ..., YN

•	The null hypothesis is:

H0: medianX - Y = 0

•	The differences between pairs of observations are calculated as follows:

Dj = Xj -Yj

Wilcoxon Signed-Rank Test Statistic

The test statistic is based on the sum of the signed ranks. Signed ranks are defined as follows:

•	The absolute values of the differences, , are ranked from smallest to largest.

•	The ranks start with the value 1, even if there are differences of zero.

•	When there are tied absolute differences, they are assigned the average, or midrank, of the ranks of the observations.

Denote the rank or midrank for a difference

by Rj. Define the signed rank for

as follows:

•	If the difference is positive, the signed rank is Rj.

•	If the difference is zero, the signed rank is 0.

•	If the difference is negative, the signed rank is -Rj.

The signed-rank statistic is computed as follows:

Define the following:

is the number of signed ranks that equal zero

R+ is the sum of the positive signed ranks

Then the following holds:

Wilcoxon Signed-Rank Test P-Values

For

, exact p-values are calculated.

For N > 20, a Student’s t approximation to the statistic defined below is used. Note that a correction for ties is applied. See Iman (1974) and Lehmann (1998).

Under the null hypothesis, the mean of W is zero. The variance of W is given by the following:

The last summation in the expression for Var(W) is a correction for ties. The notation di for i > 0 represents the number of values in the ith group of non-zero signed ranks. (If there are no ties for a given signed rank, then di = 1 and the summand is 0.)

The statistic t given by the following has an approximate t distribution with N - 1 degrees of freedom:

Statistical Details for the Standard Deviation Test

Here is the formula for calculating the Test Statistic:

The Test Statistic is distributed as a Chi-square variable with n - 1 degrees of freedom when the population is normal.

The Min PValue is the p-value of the two-tailed test, and is calculated as follows:

2*min(p1,p2)

where p1 is the lower one-tail p-value and p2 is the upper one-tail p-value.

Statistical Details for Normal Quantiles

The normal quantile values are computed as follows:

where:

•	is the cumulative probability distribution function for the normal distribution

•	ri is the rank of the ith observation

•	N is the number of non-missing observations

Statistical Details for Saving Standardized Data

The standardized values are computed using the following formula:

where:

•	X is the original column

•	is the mean of column X

•	is the standard deviation of column X

Statistical Details for Prediction Intervals

The formulas that JMP uses for computing prediction intervals are as follows:

•	For m future observations:

for

•	For the mean of m future observations:

for

•	For the standard deviation of m future observations:

for

where m = number of future observations, and n = number of points in current analysis sample.

•	The one-sided intervals are formed by using 1-α in the quantile functions.

For references, see Hahn and Meeker (1991), pages 61-64.

Statistical Details for Tolerance Intervals

This section contains statistical details for one-sided and two-sided tolerance intervals.

One-Sided Interval

The one-sided interval is computed as follows:

Upper Limit =

Lower Limit =

where

from Table 1 of Odeh and Owen (1980).

t is the quantile from the non-central t-distribution, and

is the standard normal quantile.

Two-Sided Interval

The two-sided interval is computed as follows:

where

s = standard deviation and

is a constant that can be found in Table 4 of Odeh and Owen 1980).

To determine g, consider the fraction of the population captured by the tolerance interval. Tamhane and Dunlop (2000) give this fraction as follows:

where Φ denotes the standard normal c.d.f. (cumulative distribution function). Therefore, g solves the following equation:

where 1-γ is the fraction of all future observations contained in the tolerance interval.

More information is given in Tables A.1a, A.1b, A.11a, and A.11b of Hahn and Meeker (1991).

Statistical Details for Capability Analysis

All capability analyses use the same formulas. Options differ in how sigma (σ) is computed:

•	Long-term uses the overall sigma. This option is used for statistics, and computes sigma as follows:

Note: There is a preference for Distribution called Ppk Capability Labeling that labels the long-term capability output with Ppk labels. This option is found using File > Preferences, then select Platforms > Distribution.

•	Specified Sigma enables you to type a specific, known sigma used for computing capability analyses. Sigma is user-specified, and is therefore not computed.

•	Moving Range enables you to enter a range span, which computes sigma as follows:

where

is the average of the moving ranges

d2(n) is the expected value of the range of n independent normally distributed variables with unit standard deviation.

•	Short Term Sigma, Group by Fixed Subgroup Size if r is the number of subgroups of size nj and each ith subgroup is defined by the order of the data, sigma is computed as follows:

where

•	This formula is commonly referred to as the Root Mean Square Error, or RMSE.

Note: The confidence intervals in the following table are computed using an alpha level of 0.05.

Descriptions of Capability Indices and Computational Formulas

Index

Index Name

Formula

process capability ratio, Cp

(USL - LSL)/6s where:

•	USL is the upper spec limit

•	LSL is the lower spec limit

process capability index, Cpk

process capability index, Cpm

Note: CPM confidence intervals are not reported when the target is not within the Lower and Upper Spec Limits range. CPM intervals are only reported when the target is within this range. JMP writes a message to the log to note why the CPM confidence intervals are missing.

CIs for CPM

Lower CI on CPM

, where γ =

Upper CI on CPM

where γ = same as above.

CPL

process capability ratio of one-sided lower spec

(mean - LSL)/3s

CPU

process capability ratio of one-sided upper spec

(USL - mean)/3s

•	A capability index of 1.33 is considered to be the minimum acceptable. For a normal distribution, this gives an expected number of nonconforming units of about 6 per 100,000.

•	Exact 100(1 - α)% lower and upper confidence limits for CPL are computed using a generalization of the method of Chou et al. (1990), who point out that the 100(1 - α) lower confidence limit for CPL (denoted by CPLLCL) satisfies the following equation:

where Tn-1(δ) has a non-central t-distribution with n - 1 degrees of freedom and noncentrality parameter δ.

•	Exact 100(1 - α)% lower and upper confidence limits for CPU are also computed using a generalization of the method of Chou et al. (1990), who point out that the 100(1 - α) lower confidence limit for CPU (denoted CPULCL) satisfies the following equation:

where Tn-1(δ) has a non-central t-distribution with n - 1 degrees of freedom and noncentrality parameter δ.

Note: Because of a lack of supporting research at the time of this writing, computing confidence intervals for capability indices is not recommended, except for cases when the capability indices are based on the standard deviation.

•	Sigma Quality is defined as the following

For example, if there are 3 defects in n=1,000,000 observations, the formula yields 6.03, or a 6.03 sigma process. The results of the computations of the Sigma Quality Above USL and Sigma Quality Below LSL column values do not sum to the Sigma Quality Total Outside column value because calculating Sigma Quality involves finding normal distribution quantiles, and is therefore not additive.

•	Here are the Benchmark Z formulas:

Z USL = (USL-Xbar)/sigma = 3 * CPU

Z LSL = (Xbar-LSL)/sigma = 3 * CPL

Z Bench = Inverse Cumulative Prob(1 - P(LSL) - P(USL))

where:

P(LSL) = Prob(X < LSL) = 1 - Cum Prob(Z LSL)

P(USL) = Prob(X > USL) = 1 - Cum Prob(Z USL).

Statistical Details for Continuous Fit Distributions

This section contains statistical details for the options in the Continuous Fit menu.

Normal

The Normal fitting option estimates the parameters of the normal distribution. The normal distribution is often used to model measures that are symmetric with most of the values falling in the middle of the curve. Select the Normal fitting for any set of data and test how well a normal distribution fits your data.

The parameters for the normal distribution are as follows:

•	μ (the mean) defines the location of the distribution on the x-axis

•	σ (standard deviation) defines the dispersion or spread of the distribution

The standard normal distribution occurs when

and

. The Parameter Estimates table shows estimates of μ and σ, with upper and lower 95% confidence limits.

pdf:

for

;

; 0 < σ

E(x) = μ

Var(x) = σ2

LogNormal

The LogNormal fitting option estimates the parameters μ (scale) and σ (shape) for the two-parameter lognormal distribution. A variable Y is lognormal if and only if

is normal. The data must be greater than zero.

E(x) =

Var(x) =

Weibull, Weibull with Threshold, and Extreme Value

The Weibull distribution has different shapes depending on the values of α (scale) and β (shape). It often provides a good model for estimating the length of life, especially for mechanical devices and in biology. The Weibull option is the same as the Weibull with threshold option, with a threshold (θ) parameter of zero. For the Weibull with threshold option, JMP estimates the threshold as the minimum value. If you know what the threshold should be, set it by using the Fix Parameters option. See Fit Distribution Options.

The pdf for the Weibull with threshold is as follows:

pdf:

for α,β > 0;

E(x) =

Var(x) =

where

is the Gamma function.

The Extreme Value distribution is a two parameter Weibull (α, β) distribution with the transformed parameters δ = 1 / β and λ = ln(α).

Exponential

The exponential distribution is especially useful for describing events that randomly occur over time, such as survival data. The exponential distribution might also be useful for modeling elapsed time between the occurrence of non-overlapping events, such as the time between a user’s computer query and response of the server, the arrival of customers at a service desk, or calls coming in at a switchboard.

The Exponential distribution is a special case of the two-parameter Weibull when β = 1 and α = σ, and also a special case of the Gamma distribution when α = 1.

pdf:

for 0 < σ;

E(x) = σ

Var(x) = σ2

Devore (1995) notes that an exponential distribution is memoryless. Memoryless means that if you check a component after t hours and it is still working, the distribution of additional lifetime (the conditional probability of additional life given that the component has lived until t) is the same as the original distribution.

Gamma

The Gamma fitting option estimates the gamma distribution parameters, α > 0 and σ > 0. The parameter α, called alpha in the fitted gamma report, describes shape or curvature. The parameter σ, called sigma, is the scale parameter of the distribution. A third parameter, θ, called the Threshold, is the lower endpoint parameter. It is set to zero by default, unless there are negative values. You can also set its value by using the Fix Parameters option. See Fit Distribution Options.

pdf:

for

; 0 < α,σ

E(x) = ασ + θ

Var(x) = ασ2

•	The standard gamma distribution has σ = 1. Sigma is called the scale parameter because values other than 1 stretch or compress the distribution along the x-axis.

•	The Chi-square distribution occurs when σ = 2, α = ν/2, and θ = 0.

•	The exponential distribution is the family of gamma curves that occur when α = 1 and θ = 0.

The standard gamma density function is strictly decreasing when

. When

, the density function begins at zero, increases to a maximum, and then decreases.

Beta

The standard beta distribution is useful for modeling the behavior of random variables that are constrained to fall in the interval 0,1. For example, proportions always fall between 0 and 1. The Beta fitting option estimates two shape parameters, α > 0 and β > 0. There are also θ and σ, which are used to define the lower threshold as θ, and the upper threshold as θ + σ. The beta distribution has values only for the interval defined by

. The θ is estimated as the minimum value, and σ is estimated as the range. The standard beta distribution occurs when θ = 0 and σ = 1.

Set parameters to fixed values by using the Fix Parameters option. The upper threshold must be greater than or equal to the maximum data value, and the lower threshold must be less than or equal to the minimum data value. For details about the Fix Parameters option, see Fit Distribution Options.

pdf:

for

; 0 < σ,α,β

E(x) =

Var(x) =

where

is the Beta function.

Normal Mixtures

The Normal Mixtures option fits a mixture of normal distributions. This flexible distribution is capable of fitting multi-modal data.

Fit a mixture of two or three normal distributions by selecting the Normal 2 Mixture or Normal 3 Mixture options. Alternatively, you can fit a mixture of k normal distributions by selecting the Other option. A separate mean, standard deviation, and proportion of the whole is estimated for each group.

pdf:

E(x) =

Var(x) =

where μi, σi, and πi are the respective mean, standard deviation, and proportion for the ith group, and

is the standard normal pdf.

Smooth Curve

The Smooth Curve option fits a smooth curve using nonparametric density estimation (kernel density estimation). The smooth curve is overlaid on the histogram and a slider appears beneath the plot. Control the amount of smoothing by changing the kernel standard deviation with the slider. The initial Kernel Std estimate is calculated from the standard deviation of the data.

Johnson Su, Johnson Sb, Johnson Sl

The Johnson system of distributions contains three distributions that are all based on a transformed normal distribution. These three distributions are the following:

•	Johnson Su, which is unbounded.

•	Johnson Sb, which has bounds on both tails defined by parameters that can be estimated.

•	Johnson Sl, which is bounded in one tail by a parameter that can be estimated. The Johnson Sl family contains the family of lognormal distributions.

The S refers to system, the subscript of the range. Although we implement a different method, information about selection criteria for a particular Johnson system can be found in Slifker and Shapiro (1980).

Johnson distributions are popular because of their flexibility. In particular, the Johnson distribution system is noted for its data-fitting capabilities because it supports every possible combination of skewness and kurtosis.

If Z is a standard normal variate, then the system is defined as follows:

where, for the Johnson Su:

where, for the Johnson Sb:

and for the Johnson Sl, where

; 0 < θ,δ

for θ < x < θ+σ; 0 < σ

Johnson Sl

pdf:

for θ < x if σ = 1; θ > x if σ = -1

where

is the standard normal pdf.

Note: The parameter confidence intervals are hidden in the default report. Parameter confidence intervals are not very meaningful for Johnson distributions, because they are transformations to normality. To show parameter confidence intervals, right-click in the report and select Columns > Lower 95% and Upper 95%.

Generalized Log (Glog)

This distribution is useful for fitting data that are rarely normally distributed and often have non-constant variance, like biological assay data. The Glog distribution is described with the parameters μ (location), σ (scale), and λ (shape).

pdf:

for

; 0 < σ;

The Glog distribution is a transformation to normality, and comes from the following relationship:

If z =

~ N(0,1), then x ~ Glog(μ,σ,λ).

When λ = 0, the Glog reduces to the LogNormal (μ,σ).

Note: The parameter confidence intervals are hidden in the default report. Parameter confidence intervals are not very meaningful for the GLog distribution, because it is a transformation to normality. To show parameter confidence intervals, right-click in the report and select Columns > Lower 95% and Upper 95%.

All

In the Compare Distributions report, the ShowDistribution list is sorted by AICc in ascending order.

The formula for AICc is as follows:

AICc =

where:

‒	logL is the logLikelihood

‒	n is the sample size

‒	ν is the number of parameters

If the column contains negative values, the Distribution list does not include those distributions that require data with positive values. Only continuous distributions are listed. Distributions with threshold parameters, such as Beta and Johnson Sb, are not included in the list of possible distributions.

Statistical Details for Discrete Fit Distributions

This section contains statistical details for the options in the Discrete Fit menu.

Poisson

The Poisson distribution has a single scale parameter λ > 0.

pmf:

for

; x = 0,1,2,...

E(x) = λ

Var(x) = λ

Since the Poisson distribution is a discrete distribution, the overlaid curve is a step function, with jumps occurring at every integer.

Gamma Poisson

This distribution is useful when the data is a combination of several Poisson(μ) distributions, each with a different μ. One example is the overall number of accidents combined from multiple intersections, when the mean number of accidents (μ) varies between the intersections.

The Gamma Poisson distribution results from assuming that x|μ follows a Poisson distribution and μ follows a Gamma(α,τ). The Gamma Poisson has parameters λ = ατ and σ = τ+1. The parameter σ is a dispersion parameter. If σ > 1, there is over dispersion, meaning there is more variation in x than explained by the Poisson alone. If σ = 1, x reduces to Poisson(λ).

pmf:

for

;

; x = 0,1,2,...

E(x) = λ

Var(x) = λσ

where

is the Gamma function.

Remember that x|μ ~ Poisson(μ), while μ~ Gamma(α,τ). The platform estimates λ = ατ and σ = τ+1. To obtain estimates for α and τ, use the following formulas:

If the estimate of σ is 1, the formulas do not work. In that case, the Gamma Poisson has reduced to the Poisson(λ), and

is the estimate of λ.

If the estimate for α is an integer, the Gamma Poisson is equivalent to a Negative Binomial with the following pmf:

for

with r = α and (1-p)/p = τ.

Run demoGammaPoisson.jsl in the JMP Samples/Scripts folder to compare a Gamma Poisson distribution with parameters λ and σ to a Poisson distribution with parameter λ.

Binomial

The Binomial option accepts data in two formats: a constant sample size, or a column containing sample sizes.

pmf:

for

; x = 0,1,2,...,n

E(x) = np

Var(x) = np(1-p)

where n is the number of independent trials.

Note: The confidence interval for the binomial parameter is a Score interval. See Agresti (1998).

Beta Binomial

This distribution is useful when the data is a combination of several Binomial(p) distributions, each with a different p. One example is the overall number of defects combined from multiple manufacturing lines, when the mean number of defects (p) varies between the lines.

The Beta Binomial distribution results from assuming that x|π follows a Binomial(n,π) distribution and π follows a Beta(α,β). The Beta Binomial has parameters p = α/(α+β) and δ = 1/(α+β+1). The parameter δ is a dispersion parameter. When δ > 0, there is over dispersion, meaning there is more variation in x than explained by the Binomial alone. When δ < 0, there is under dispersion. When δ = 0, x is distributed as Binomial(n,p). The Beta Binomial only exists when

pmf:

for

;

; x = 0,1,2,...,n

E(x) = np

Var(x) = np(1-p)[1+(n-1)δ]

where

is the Gamma function.

Remember that x|π ~ Binomial(n,π), while π ~ Beta(α,β). The parameters p = α/(α+β) and δ = 1/(α+β+1) are estimated by the platform. To obtain estimates of α and β, use the following formulas:

If the estimate of δ is 0, the formulas do not work. In that case, the Beta Binomial has reduced to the Binomial(n,p), and

is the estimate of p.

The confidence intervals for the Beta Binomial parameters are profile likelihood intervals.

Run demoBetaBinomial.jsl in the JMP Samples/Scripts folder to compare a Beta Binomial distribution with dispersion parameter δ to a Binomial distribution with parameters p and n = 20.

Statistical Details for Fitted Quantiles

The fitted quantiles in the Diagnostic Plot and the fitted quantiles saved with the Save Fitted Quantiles command are formed using the following method:

1.	The data are sorted and ranked. Ties are assigned different ranks.

2.	Compute the p[i] = rank[i]/(n+1).

3.	Compute the quantile[i] = Quantiled(p[i]) where Quantiled is the quantile function for the specific fitted distribution, and i = 1,2,...,n.

Statistical Details for Fit Distribution Options

This section describes Goodness of Fit tests for fitting distributions and statistical details for specification limits pertaining to fitted distributions.

Goodness of Fit

Descriptions of JMP Goodness of Fit Tests
Distribution	Parameters	Goodness of Fit Test
Normal 1	μ and σ are unknown	Shapiro-Wilk (for n ≤ 2000) Kolmogorov-Smirnov-Lillefors (for n > 2000)
	μ and σ are both known	Kolmogorov-Smirnov-Lillefors
	either μ or σ is known	(none)
LogNormal	μ and σ are known or unknown	Kolmogorov's D
Weibull	α and β known or unknown	Cramér-von Mises W2
Weibull with threshold	α, β and θ known or unknown	Cramér-von Mises W2
Extreme Value	α and β known or unknown	Cramér-von Mises W2
Exponential	σ is known or unknown	Kolmogorov's D
Gamma	α and σ are known	Cramér-von Mises W2
Gamma	either α or σ is unknown	(none)
Beta	α and β are known	Kolmogorov's D
Beta	either α or β is unknown	(none)
Binomial	ρ is known or unknown and n is known	Kolmogorov's D (for n ≤ 30) Pearson χ2 (for n > 30)
Beta Binomial	ρ and δ known or unknown	Kolmogorov's D (for n ≤ 30) Pearson χ2 (for n > 30)
Poisson	λ known or unknown	Kolmogorov's D (for n ≤ 30) Pearson χ2 (for n > 30)
Gamma Poisson	λ or σ known or unknown	Kolmogorov's D (for n ≤ 30) Pearson χ2 (for n > 30)

For the three Johnson distributions and the Glog distribution, the data are transformed to Normal, then the appropriate test of normality is performed.

Spec Limits

Writing T for the target, LSL, and USL for the lower and upper specification limits, and Pα for the α*100th percentile, the generalized capability indices are as follows:

If the data are normally distributed, these formulas reduce to the formulas for standard capability indices. See Descriptions of Capability Indices and Computational Formulas.

Set Spec Limits for K Sigma

Type a K value and select one-sided or two-sided for your capability analysis. Tail probabilities corresponding to K standard deviations are computed from the Normal distribution. The probabilities are converted to quantiles for the specific distribution that you have fitted. The resulting quantiles are used for specification limits in the capability analysis. This option is similar to the Quantiles option, but you provide K instead of probabilities. K corresponds to the number of standard deviations that the specification limits are away from the mean.

For example, for a Normal distribution, where K=3, the 3 standard deviations below and above the mean correspond to the 0.00135th quantile and 0.99865th quantile, respectively. The lower specification limit is set at the 0.00135th quantile, and the upper specification limit is set at the 0.99865th quantile of the fitted distribution. A capability analysis is returned based on those specification limits.