Using Validation

In JMP, you can perform cross-validation by selecting the K-Fold Crossvalidation option from the Stepwise Fit red triangle menu.

In JMP Pro, you can specify a Validation column in the Fit Model window. A validation column must have a numeric data type and should contain at least two distinct values.

•	If the column contains two values, the smaller value defines the training set and the larger value defines the validation set.

•	If the column contains three values, the values define the training, validation, and test sets in order of increasing size.

•	If the column contains four or more distinct values and the response is continuous, these values define folds for k-fold validation.

Validation Set with Two or Three Values

If you specify a Validation column with two or three values, Stepwise fits models based on the training set. Model fit statistics are reported for the validation and test sets. SeeValidation and Test Set Statistic Definitions for details on how these statistics are defined.

If the response is continuous, the following statistics appear in the Stepwise Regression Control panel:

•	RSquare Validation (also shown in the Step History report)

•	RMSE Validation

•	RSquare Test (if there is a test set)

•	RMSE Test (if there is a test set)

If the response is binary nominal or ordinal, the following statistics appear in the Stepwise Regression Control panel:

•	RSquare Validation (also shown in the Step History report)

•	Avg Log Error Validation

•	RSquare Test (if there is a test set)

•	Avg Log Error Test (if there is a test set)

Max Validation RSquare

If you specify a validation column with two or three values in the Fit Model window, the Stopping Rule defaults to Max Validation RSquare. This rule attempts to find a model that maximizes the RSquare statistic for the validation set. The rule can be applied with the Direction set to Forward or Backward.

Note: Max Validation RSquare considers only the models defined by p-value entry (Forward direction) or removal (Backward direction). It does not consider all possible models.

You can use the Step button to enter terms one-by-one in the Forward direction or to remove them one-by one in the Backward direction. At any point, you can select a model by clicking the button to the right of RSquare Validation in the Step History report. The selection of model terms is updated in the Current Estimates report. This is the model that is used once you click Make Model or Run Model.

Forward Direction

In the Forward direction, Stepwise constructs successive models by adding terms based on the next smallest p-value.

If you click Go rather than Step, the process of entering terms proceeds automatically. Among the fitted models, the model that is considered best is listed last. This model is obtained by overlooking local dips in RSquare Validation. Specifically, it is the model with the largest RSquare Validation that can be followed by as many as ten models with lower RSquare Validation values. This model is designated by the terms Best in the Parameter column and Specific in the Action column. The button to the right of RSquare Validation selects this Best model, though you are free to change this selection.

Backward Direction

In the Backward direction, Stepwise constructs successive models by removing terms based on the next largest p-value.

To use the Backward direction, you must first click Enter All to enter all terms. The Backward direction behaves in a similar fashion to the Forward direction. If you click Go rather than Step, the process of entering terms proceeds automatically. The model designated as Best is the one with the largest RSquare Validation that can be followed by as many as ten models with lower RSquare Validation values.

Validation and Test Set Statistic Definitions

RSquare Validation and RMSE Validation are defined in this section. RSquare Test and RMSE Test are computed for the test set in a completely analogous fashion.

Continuous Response

RSquare Validation

An RSquare measure for the validation set computed as follows:

‒	For each observation in the validation set, compute the prediction error. This is the difference between the actual response and the response predicted by the training set model.

‒	Square and sum the prediction errors to obtain SSEValidation.

‒	Square and sum the differences between the actual responses in the validation set and their mean. This is the SSTValidation.

‒	RSquare Validation is:

Note: It is possible for RSquare Validation to be negative.

RMSE Validation

The square root of the mean squared prediction error for the validation set. This is computed as follows:

‒	For each observation in the validation set, compute the prediction error. This is the difference between the actual response and the response predicted by the training set model.

‒	Square and sum the prediction errors to obtain the SSEValidation.

‒	Denote the number of observations in the validation set by nValidation.

‒	RMSE Validation is:

Note: In the Fit Least Squares Crossvalidation report, RMSE for the validation and test sets is called RASE (Root Average Squared Error).

Binary Nominal or Ordinal Response

RSquare Validation

An Entropy RSquare measure (also known as McFadden’s R2) for the validation set computed as follows:

‒	A model is fit using the training set.

‒	Predicted probabilities are obtained for all observations.

‒	Using the predicted probabilities based on the training set model, the likelihood for the model is computed for observations in the validation set. Call this quantity Likelihood_FullValidation.

‒	Using the data in the validation set, the likelihood of the reduced model (no predictors) is computed. Call this quantity Likelihood_ReducedValidation.

‒	RSquare Validation is:

Note: It is possible for RSquare Validation to be negative.

Avg Log Error Validation

The average log error for the validation set is computed as follows:

‒	For each observation in the validation set, compute the log of its predicted probability as determined by the model based on the training set.

‒	Sum these logs, divide by the number of observations in the validation set, and take the negative of the resulting value.

Tip: Smaller values of Avg Log Error Validation are desirable.

K-Fold Cross Validation

K-fold cross validation randomly divides the data into k subsets. In turn, each of the k sets is used as a validation set while the remaining data are used as a training set to fit the model. In total, k models are fit and k validation statistics are obtained. The model giving the best validation statistic is chosen as the final model. This method is useful for small data sets, because it makes efficient use of limited amounts of data.

Note: K-fold cross validation is only available for continuous responses.

In JMP, select K-Fold Crossvalidation from the red triangle options for Stepwise Fit.

In JMP Pro, you can access k-fold cross validation in two ways:

•	From the red triangle options for Stepwise Fit, select K-Fold Crossvalidation.

•	Specify a validation column with four or more distinct values.

RSquare K-Fold Statistic

If you conduct k-fold cross validation, the RSquare K-Fold statistic appears to the right of the other statistics in the Stepwise Regression Control panel. RSquare K-Fold is the average of the RSquare Validation values for the k folds.

Max K-Fold RSquare

When you use k-fold cross validation, the Stopping Rule defaults to Max K-Fold RSquare. This rule attempts to maximize the RSquare K-Fold statistic.

Note: Max K-Fold RSquare considers only the models defined by p-value entry (Forward direction) or removal (Backward direction). It does not consider all possible models.

The Max K-Fold RSquare stopping rule behaves in a fashion similar to the Max Validation RSquare stopping rule. See Max Validation RSquare. Replace references to RSquare Validation with RSquare K-Fold.