This process sets up a predictive modeling review, which compares how well different models predict one or more dependent variables under the same conditions. The models can be compared using cross validation, test sets, or learning curves. The process works by creating a setting that is loaded and run in Workflow Builder.
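To make the idea of comparing models "under the same conditions" concrete, here is a minimal sketch in plain Python. It is not how the process is implemented. The two toy models, the function names, and the made-up data are all assumptions chosen for illustration. The point it shows is that every model is scored on the same cross-validation folds, so differences in error reflect the models rather than the data split.

```python
import random
from statistics import mean

def k_fold_indices(n, k, seed=0):
    """Shuffle row indices 0..n-1 and split them into k nearly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def fit_mean(xs, ys):
    """Baseline model (hypothetical): always predict the training mean of y."""
    m = mean(ys)
    return lambda x: m

def fit_line(xs, ys):
    """Simple least-squares line y = a + b*x (hypothetical second model)."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx if sxx else 0.0
    a = my - b * mx
    return lambda x: a + b * x

def cv_rmse(fit, xs, ys, folds):
    """Average held-out root mean squared error over the given folds."""
    scores = []
    for held_out in folds:
        held = set(held_out)
        train = [i for i in range(len(xs)) if i not in held]
        model = fit([xs[i] for i in train], [ys[i] for i in train])
        sq_err = [(ys[i] - model(xs[i])) ** 2 for i in held_out]
        scores.append(mean(sq_err) ** 0.5)
    return mean(scores)

# Toy data, made up for illustration: y is roughly 2*x + 1 plus noise.
rng = random.Random(1)
xs = [float(i) for i in range(40)]
ys = [2.0 * x + 1.0 + rng.gauss(0, 1) for x in xs]

# Build the folds once and reuse them for every model being compared.
folds = k_fold_indices(len(xs), k=5)
results = {name: cv_rmse(fit, xs, ys, folds)
           for name, fit in [("mean", fit_mean), ("line", fit_line)]}
```

Here `results` maps each model name to its cross-validated RMSE; the model with the lower value predicted the held-out rows more accurately on this toy data.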
What do I need?
One wide Input Data Set is needed to run this process. This data set contains all of the numeric and other data to be analyzed. Data must be in the wide format. Genetic marker data are likely in this format already, but any genomic data that are in the tall format must be converted to the wide format. The Transpose Rectangular process can be used to convert a tall data set and its accompanying Experimental Design Data Set (EDDS) to the wide format.
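As a rough illustration of what a tall-to-wide conversion involves, the following plain-Python sketch transposes a small tall table and merges in its design information. It is not the Transpose Rectangular process itself. It assumes that the tall data set has one row per feature (for example, a marker) and one column per sample, and that the EDDS has one row per sample. The function name, the key names, and the data are hypothetical.

```python
def transpose_tall_to_wide(tall, feature_key, design, sample_key):
    """Return one row per sample, with design fields plus one column per feature.

    tall:   list of dicts, one per feature, holding one value per sample name.
    design: list of dicts, one per sample (stands in for the EDDS).
    """
    wide = []
    for sample in design:
        name = sample[sample_key]
        row = dict(sample)  # start from the design fields for this sample
        for feature in tall:
            # Each feature becomes a column named after the feature.
            row[feature[feature_key]] = feature[name]
        wide.append(row)
    return wide

# Made-up tall data: three markers measured on two samples.
tall = [
    {"marker": "m1", "s1": 0, "s2": 2},
    {"marker": "m2", "s1": 1, "s2": 1},
    {"marker": "m3", "s1": 2, "s2": 0},
]
# Made-up design table describing the two samples.
design = [
    {"sample": "s1", "group": "A"},
    {"sample": "s2", "group": "B"},
]

wide = transpose_tall_to_wide(tall, "marker", design, "sample")
```

After the call, `wide` holds two rows (one per sample), each carrying its `group` value alongside the columns `m1`, `m2`, and `m3`.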
The adsl_dii.sas7bdat data set, used in the following example, contains 906 rows, one per individual, and 382 columns of data on these individuals. It was generated from the original nicardipine ADSL data set described in Nicardipine and is included with JMP Clinical. This data set is partially shown below.
