Processes | Clinical | Cluster Subjects within Study Sites

Cluster Subjects within Study Sites
This process clusters subjects within study site for the purpose of identifying similar subjects. It constructs a cross domain data set using as much data as possible (subject to user options). Next, it computes a distance matrix and performs hierarchical clustering of subjects within each study center. The goal of this exercise is to identify pairs of subjects with a very small distance. This could be an indication that these subject are slightly modified copies of one another.
What do I need?
This process requires the following variables:
Findings domains require VISITNUM and xxSTRESN. (xxTPTNUM is used if available.)
Domains that fail to meet the aforementioned criteria are not used.
Refer to Localization-Specific Value Specification for more information.
The output generated by this process is summarized in a tabbed report. Refer to the Cluster Subjects within Study Sites output documentation for detailed descriptions and guides to interpreting your results.