Measurement Systems Analysis
What is measurement systems analysis?
Measurement systems analysis (MSA) is the process of identifying, characterizing, and quantifying sources of measurement variability associated with a measurement system. The analysis can include any characteristic of the measurement process that the investigator believes may influence measured values such as instrument type, set-up procedures, operators, time, etc.
MSA response data can be quantitative (numerical) or qualitative (categorical). For qualitative measurements, the two main sources of variability are random and systematic error. Estimation of random error is performed by measuring the same quantity multiple times under the same conditions (repeatability) and for differing conditions (reproducibility). The estimate of systematic error is often performed by taking multiple measurements on a known quantity and comparing the results to the known (or verified) value for that quantity. Bias and linearity are characteristics associated with systematic error.
What are the benefits of using measurement systems analysis?
MSA provides a method to better understand the quality of measurements being made. If I am unsure that my measurement system can accurately and repeatably quantify a given characteristic, why use that system? Insomuch as decisions are made based on measured values, it is foundational to many processes. MSA allows me to quantify the closeness and variability of my measurements.
$ICC = \frac{\hat{\sigma}^2_{\text{process}}}{\left( \hat{\sigma}^2_{\text{process}} + \hat{\sigma}^2_{\text{repeatability}} \right)} = \frac{1}{0.2213 + 1} = 0.8188$
$\frac{P}{T} = k \frac{\hat{\sigma}_{\text{repeatability}}}{\text{USL} - \text{LSL}} = 6 \times \frac{0.4705}{10} = 0.2823$
What is a gauge repeatability and reproducibility study?
A gauge repeatability and reproducibility (GR&R) study can be considered part of an MSA where the focus is on a single instrument. An MSA conducted on qualitative outcomes focuses on metrics such as interrater/intrarater reliability, test-retest reliability, and misclassification probabilities. They are sometimes called attribute gauge studies.
Gauge R&R study example: Estimation of random error components for a single instrument
An automated measurement tool is used to measure parts coming off a manufacturing line. A single part from the center of the process distribution is created for the study. It is assumed that the part does not change over the course of the study. Early on Day 1, the part is measured three times in rapid succession. Later in the day, the part is measured three more times by the same operator. This process is repeated every other day four more times. The data are shown below. Rep indicates measurements made in rapid succession, while Run corresponds to a set of replications.
For the analysis, the variance component (VC) associated with Rep is used to estimate repeatability. The Run and Day VCs, along with the Rep VC, are used to estimate reproducibility.
Day
Run
Rep
Y
Variance components results are shown below.
Component
VC
% of Total
Sqrt(VC)
The specification range (upper specification limit [USL] - lower specification limit [LSL]) for a part is ±5 units. The process variance is 1 unit2. The intraclass correlation (ICC) for the measurement system is
$ICC = \frac{\sigma^2_{\text{process}}}{\left( \sigma^2_{\text{process}} + \sigma^2_{\text{repeatability}} \right)} = \frac{1}{0.2213 + 1} = 0.8188$
ICC measures the proportion of total variance attributable to the process. Given that it is favorable for the measurement system to account for a small proportion of the total variance, higher values are better.
The precision-to-tolerance ratio, based on the AIAG MSA manual guidance, is
$\frac{P}{T} = k \frac{\hat{\sigma}_{\text{repeatability}}}{\text{USL} - \text{LSL}} = 6 \times \frac{0.4705}{10} = 0.2823$
Estimation of bias example
Five parts evenly spanning the process range are created for the study. It is assumed that the parts do not change over the course of the study. Using the same automated measurement tool, a part is selected randomly, and three measurements are taken in quick succession. This is repeated until all parts are selected. After the study, the parts are measured with a second method, which is more accurate but destructive. Results are shown below. Y corresponds to the measurements taken during the bias study; Y2 corresponds to the destructive measurements. Bias is the difference between Y and Y2.
Part
Rep
Y
Y2
Bias
The average bias for each part is shown below. An ANOVA indicates that all measurements, except the those taken on Part 1, are significantly different from their nominal value.
Part
Nominal Value
Average Measurement
Average Bias
p Value
A plot of the bias as a function of the nominal value and a test for non-significance of the slope indicate the tool is exhibiting non-linearity.
What is method validation?
In life sciences, particularly in assay development, the process of quantifying measurement system variability is part of method validation. Analytic method validation found in chemical or biological assay development shares quantitative methodologies with MSA, in particular the estimation of repeatability and reproducibility.
Industry standards and guidelines
Several industry guidance groups and standards organizations – such as the American Society for Testing and Measurement (ASTM), Automotive Industry Action Group (AIAG), International Council for Harmonization of Technical Requirements for Pharmaceutical for Human Use (ICH), International Organization for Standardization (ISO), and Joint Committee for Guides in Metrology (JCGM) – have developed detailed definitions and methodologies around MSA. A partial list of definitions can be found under the Terminology heading below. A list of standards and guidelines can be found in the Standards and Guidelines section. For a more in depth understanding of MSA, see the References page.
Terminology
- Bias: estimate of systematic measurement error.
- Conventional quantity value: quantity value attributed by agreement to a quantity for a given purpose.
- Measurand: quantity intended to be measured.
- Measurement error: measured quantity value minus a reference quantity value.
- Measurement quantity value: quantity value representing a measurement result.
- Random error: component of measurement error that in replicate measurements varies in an unpredictable manner.
- Repeatability: measurement precision under a set of repeatability conditions of measurement.
- Reference quantity value: quantity value used as a basis for comparison with values of quantities of the same kind. A reference quantity value can be a true quantity value of a measurand, in which case it is unknown, or a conventional quantity value, in which case it is known.
- Repeatability condition: condition of measurement, out of a set of conditions that includes the same measurement procedure, same operators, same measuring system, same operating conditions and same location, and replicate measurements on the same or similar objects over a short period of time.
- Reproducibility: measurement precision under a set of reproducibility conditions of measurement.
- Reproducibility condition: condition of measurement, out of a set of conditions that includes different locations, operators, measuring systems, and replicate measurements on the same or similar objects.
- Systematic error: component of measurement error that in replicate measurements remains constant or varies in a predictable manner.
Definitions are based on the International vocabulary of metrology (JCGM 200:2012).
Standards and guidelines
Two seminal documents in the development of terminology and methodologies used in metrology are the Guide to the expression of uncertainty in measurement (GUM) (JCGM 100:2008) and the International vocabulary of basic and general terms in metrology (VIM) (JCGM 200:2012), maintained by the Joint Committee for Guides in Metrology (JCGM). Standards organization and industry guidance groups such as the International Organization for Standards (ISO), the American Society for Testing and Materials (ASTM), and the Automotive Industry Action Group (AIAG), have developed guidelines for conducting MSAs.
- ASTM E2782-17(2022)e1, Standard Guide for Measurement Systems Analysis (MSA). American Society for Testing and Measurement.
- ICH Q2(R2) Validation of analytical procedures – Scientific guidelines (2024). International Council for Harmonization of Technical Requirements for Pharmaceutical for Human Use.
- ISO 5725-1:2023, Precision of test methods – Accuracy (trueness and precision) of measurement methods and results. Part 1: General principles and definitions, International Organization for Standardization (Geneva, Switzerland).
- ISO 5725-2:2019, Precision of test methods – Accuracy (trueness and precision) of measurement methods and results. Part 2: Basic method for the determination of repeatability and reproducibility of a standard measurement method, International Organization for Standardization (Geneva, Switzerland).
- ISO 5725-3:2023, Precision of test methods – Accuracy (trueness and precision) of measurement methods and results. Part 3: Intermediate precision and alternative designs for collaborative studies, International Organization for Standardization (Geneva, Switzerland).
- ISO 5725-4:2020, Precision of test methods – Accuracy (trueness and precision) of measurement methods and results. Part 4: Basic methods for the determination of the trueness of a standard measurement method, International Organization for Standardization (Geneva, Switzerland).
- ISO 5725-5:1998, Precision of test methods – Accuracy (trueness and precision) of measurement methods and results. Part 5: Alternative methods for the determination of the precision of a standard measurement method, International Organization for Standardization (Geneva, Switzerland).
- ISO 5725-6:1994, Precision of test methods – Accuracy (trueness and precision) of measurement methods and results. Part 6: Use in practice of accuracy values, International Organization for Standardization (Geneva, Switzerland).
- JCGM 100:2008(E), Evaluation of measurement data – Guide to the expression of uncertainty in measurement. Joint Committee for Guides in Metrology.
- JCGM 200:2012, International vocabulary of metrology – Basic and general concepts and associated terms (VIM). Joint Committee for Guides in Metrology.
- Measurement Systems Analysis, 4th Edition. Automotive Industry Action Group. 2010
Note: The JCGM is a collaborative effort between eight organizations: Bureau International des Poids et Mesures (BIPM), Innovative Engineering Consortium (IEC), International Federation of Clinical Chemistry and Laboratory Medicine (IFCC), International Laboratory Accreditation Cooperation (ILAC), International Organization for Standardization (ISO), International Union of Pure and Applied Chemistry (IUPAC), International Union of Pure and Applied Physics (IUPAP), and International Organization of Legal Metrology (OIML).
References
Burdick, Richard K.; Borror, Connie M.; Montgomery, Douglas C. (2005). Design and Analysis of Gauge R&R Studies: Making Decisions with Confidence Intervals in Random and Mixed ANOVA Models. SIAM.
Eisenhart, C. (1963) Realistic Evaluation of the Precision and Accuracy of Instrument Calibration Systems, Journal of Research of the National Bureau of Standards – C, Engineering and Instrumentation, 67C(2).
Montgomery, D.C. & Runger G. C. (1993a), “Gauge Capability and Designed Experiments. Part I: Basic Methods”, Quality Engineering, 6(1), 115–135.
Montgomery, D.C. & Runger G. C. (1993b), “Gauge Capability and Designed Experiments. Part II: Experimental Design Models and Variance Component Estimation”, Quality Engineering, 6(2), 289–305.
Sandler, L.C. (2020, July 7). Analytical Method Validation [video]. National Institute of Standards and Technology. Analytical Method Validation
Wheeler, Donald (2006). EMP III: Evaluating the Measurement Process & Using Imperfect Data. SPC Press.
Wheeler, D. J. (2006). EMP III Using Imperfect Data. Knoxville, TN: SPC Press.