For continuous variables, the Summary Statistics report in the Distribution platform shows the mean, standard deviation, and other summary statistics. You can control which statistics appear in this report by clicking the red triangle next to the variable name and selecting Customize Summary Statistics.
Tip: You can specify which summary statistics show in the report each time you run a Distribution analysis for a continuous variable. Select File > Preferences > Platforms > Distribution Summary Statistics and then select the ones that you want to appear.
• Description of the Summary Statistics Report describes the statistics that appear by default.
• Additional Summary Statistics describes additional statistics that you can add to the report by using the Customize Summary Statistics option.
Mean
The arithmetic average of the variable. The mean estimates the expected value of the underlying distribution for the response variable. It is the sum of the nonmissing values divided by the number of nonmissing values.
Std Dev
The standard deviation of the variable.
Std Err Mean
The standard error of the mean, which estimates the standard deviation of the distribution of the mean.
Upper and Lower Mean Confidence Limits
The 95% confidence limits for the mean. They define an interval that is likely to contain the true population mean.
N
The total number of nonmissing values.
N Missing
The number of missing observations.
Sum Weight
The sum of a column specified as the Weight column in the launch window.
Sum
The sum of the response values.
Variance
The sample variance, and the square of the sample standard deviation.
Skewness
The sidedness or symmetry measurement.
Kurtosis
The peakedness or heaviness of tails measurement. See Kurtosis for formula details.
CV
The percent coefficient of variation. It is computed as the standard deviation divided by the mean and multiplied by 100. The coefficient of variation can be used to assess relative variation. For example, it can be used when comparing the variation in data measured in different units or with different magnitudes.
N Zero
The number of zero values.
N Unique
The number of unique values.
Uncorrected SS
The uncorrected sum of squares or sum of values squared.
Corrected SS
The corrected sum of squares or sum of squares of deviations from the mean.
Autocorrelation
(Appears only if you have not specified a Frequency variable.) First autocorrelation that tests if the residuals are correlated across the rows. This test helps detect non-randomness in the data.
Minimum
The smallest observation.
Maximum
The largest observation.
Median
The 50th percentile of the observations.
Mode
The value that occurs most often in the data. If there are multiple modes, the smallest mode appears.
Trimmed Mean
The mean calculated after removing the smallest p% and the largest p% of the data. The value of p is entered in the Enter trimmed mean percent text box at the bottom of the window. The Trimmed Mean option is not available if you have specified a Weight variable.
Geometric Mean
The nth root of the product of the data. For example, geometric means are often used to calculate interest rates. The statistic is also helpful when the data contains a large value in a skewed distribution.
Note: Negative values result in missing numbers, and zero values (with no negative values) result in zero.
Range
The difference between the maximum and minimum observations.
Interquartile Range
The difference between the 75th and 25th percentiles.
Median Absolute Deviation
(Not available when a Weight variable is specified in the launch window.) The median of the absolute deviations from the median.
Proportion Zero
The proportion of nonmissing values that are equal to zero.
Proportion Nonzero
The proportion of nonmissing values that are not equal to zero.
3*StdDev
The value of 3 times the standard deviation. You can specify the value of the multiplier by using the K Multiplier platform preference under Preferences > Platforms > Distribution Summary Statistics.
3*StdDev Above Mean
The value of the mean plus 3 times the standard deviation. You can specify the value of the multiplier by using the K Multiplier platform preference under Preferences > Platforms > Distribution Summary Statistics.
3*StdDev Below Mean
The value of the mean minus 3 times the standard deviation. You can specify the value of the multiplier by using the K Multiplier platform preference under Preferences > Platforms > Distribution Summary Statistics.
Robust Mean
The robust mean, calculated in a way that is resistant to outliers, using Huber's M-estimation. See Huber and Ronchetti (2009).
Robust Std Dev
The robust standard deviation, calculated in a way that is resistant to outliers, using Huber's M-estimation. See Huber and Ronchetti (2009).
Enter (1-alpha) for mean confidence interval
The alpha level for the mean confidence interval.
Enter trimmed mean percent
The trimmed mean percentage. The percentage is trimmed off each side of the data.
The red triangle next to Summary Statistics contains the following options:
Customize Summary Statistics
Opens a window that enables you to select summary statistics that you want to appear in your report. You can select or deselect all summary statistics. See Additional Summary Statistics.
Show All Modes
Shows or hides a table of the modes.
For statistical details, see Statistical Details for Summary Statistics.