6.5.4.3. Hotelling's <em>T</em> squared

6. Process or Product Monitoring and Control
6.5. Tutorials
6.5.4. Elements of Multivariate Analysis

6.5.4.3. Hotelling's T squared

Hotelling's $T^2$ distribution A multivariate method that is the multivariate counterpart of Student's $t$ and which also forms the basis for certain multivariate control charts is based on Hotelling's $T^2$ distribution, which was introduced by Hotelling (1947).

Univariate $t$-test for mean Recall, from Section 1.3.5.2, $$ t = \frac{\bar{x} - \mu}{s/\sqrt{n}} $$ has a $t$ distribution provided that $X$ is normally distributed, and can be used as long as $X$ doesn't differ greatly from a normal distribution. If we wanted to test the hypothesis that $\mu = \mu_0$, we would then have $$ t = \frac{\bar{x} - \mu_0}{s/\sqrt{n}} $$ so that $$ \begin{eqnarray} t^2 & = & \frac{(\bar{x} - \mu_0)^2}{s^2 / n} \\ & & \\ & = & n (\bar{x} - \mu_0)(s^2)^{-1} (\bar{x} - \mu_0) ,円 . \end{eqnarray} $$

Generalize to $p$ variables When $T^2$ is generalized to $p$ variables it becomes $$ T^2 = n (\bar{{\bf x}} - {\bf \mu}_0) {\bf S}^{-1} (\bar{{\bf x}} - {\bf \mu}_0) ,円 , $$ with $$ \bar{{\bf x}} = \left[ \begin{array}{c} \bar{x}_1 \\ \bar{x}_2 \\ \vdots \\ \bar{x}_p \end{array} \right] ,円,円,円,円,円,円,円,円,円,円,円,円 {\bf \mu}_0 = \left[ \begin{array}{c} \mu_1^0 \\ \mu_2^0 \\ \vdots \\ \mu_p^0 \end{array} \right] ,円 . $$ ${\bf S}^{-1}$ is the inverse of the sample variance-covariance matrix, ${\bf S}$, and $n$ is the sample size upon which each $\bar{x}_i, ,円 i=1, ,円 2, ,円 \ldots, ,円 p$, is based. (The diagonal elements of ${\bf S}$ are the variances and the off-diagonal elements are the covariances for the $p$ variables. This is discussed further in Section 6.5.4.3.1.)

Distribution of $T^2$ It is well known that when $\mu = \mu_0$ $$ T^2 \sim \frac{p(n-1)}{n-p} F_{(p, ,円 n-p)} ,円 , $$ with $F_{(p, ,円 n-p)}$ representing the F distribution with $p$ degrees of freedom for the numerator and $n-p$ for the denominator. Thus, if $\mu$ were specified to be $\mu_0$, this could be tested by taking a single $p$-variate sample of size $n$, then computing $T^2$ and comparing it with $$ \frac{p(n-1)}{n-p} F_{\alpha ,円 (p, ,円 n-p)} $$ for a suitably chosen $\alpha$.

Result does not apply directly to multivariate Shewhart-type charts Although this result applies to hypothesis testing, it does not apply directly to multivariate Shewhart-type charts (for which there is no $\mu_0$, although the result might be used as an approximation when a large sample is used and data are in subgroups, with the upper control limit (UCL) of a chart based on the approximation.

Three-sigma limits from univariate control chart When a univariate control chart is used for Phase I (analysis of historical data), and subsequently for Phase II (real-time process monitoring), the general form of the control limits is the same for each phase, although this need not be the case. Specifically, three-sigma limits are used in the univariate case, which skirts the relevant distribution theory for each Phase.

Selection of different control limit forms for each Phase Three-sigma units are generally not used with multivariate charts, however, which makes the selection of different control limit forms for each Phase (based on the relevant distribution theory), a natural choice.