PROC GENESELECT Statement Options

Enter SAS PROC GENESELECT options in this field to override default parameters.

You can specify any PROC GENESELECT option using the following syntax:

Option

where:

Option is the PROC GENESELECT option, and
a space is used to delimit individual conditions.

Examples of commonly used PROC GENESELECT options are listed in the following table.

Option

Definition

LEAFFRACTION=p

Specifies the smallest number of observations a new branch can have, expressed as the proportion of the number N of available observations in the DATA= data set specified in the PROC statement. It can be less than the total number of observations in the data set because observations with a missing trait or nonpositive value of the variable specified in the FREQ statement are excluded from N. The LEAFSIZE= option specifies the same quantity as an absolute number. The procedure uses the larger of the two. It can be any number from zero (0) through one (1). The default value equals 0.001.

MINCATSIZE=n

Specifies the minimum number of observations that a given qualitative variable value must have in order to use the value in a split search. The GENESELECT procedure handles qualitative values that appear in fewer than n observations in the same way it handles missing values. The default value of n is 5.

MODELTYPE=type

Specifies the type of model to create:

TREEBOOST generates a boosted series of trees.
TREE generates a decision tree.

To Specify One or More PROC GENESELECT Options:

8 Type specific PROC GENESELECT options in the PROC GENESELECT Statement Options field.

For example, to specify a minimum proportion (0.003) of observations a new branch can have, type LEAFFRACTION=0.003 in the text field, as shown below:

For Additional Information

Refer to the SAS PROC GENESELECT documentation for more information.