Skip to main content

Statistical Methods - STAT 581 - Problem Set 4

Problem 1

The insulating life of protective fluids at an accelerated load is being studied. The experiment has been performed for four types of fluids, with n=5n = 5 trials per fluid type. Suppose fluid types 1 and 2 are from manufacturer A, and that fluid types 3 and 4 are from manufacturer B. The data is available on Blackboard as an Excel File.

(a)

(i)

\fbox{\begin{minipage}{\textwidth} Perform a test of the global null hypothesis H0:μ1=μ2=μ3=μ4H_0 : \mu_1 = \mu_2 = \mu_3 = \mu_4. Compute the F0F_0 statistic, and the pp-value. \end{minipage}}

This is a one-factor experiment with a=4a=4 levels of the factor (fluid level) and n=6n=6 replicates for a total of N=an=24N = a n = 24 observations.

The test statistic

$$ F_0 = \frac{\mstr}{\mse} $$

under the null hypothesis μ1==μa\mu_1 = \ldots = \mu_a (or τ1==τa=0\tau_1 = \ldots = \tau_a = 0 in the effects model) has a reference distribution

F0F(a1,Na) F_0 \sim F(a-1,N-a)

where N=naN = n a.

We compute the observed test statistic and its pp-value with:

library("readxl")
h4.data = read_excel("handout2data.xlsx")
fluid = as.factor(na.omit(h4.data$fluid))
life = na.omit(h4.data$life)
#head(data.frame(fluid=fluid,life=life))

aov.mod = aov(life ~ fluid)
summary(aov.mod)
##             Df Sum Sq Mean Sq F value Pr(>F)  
## fluid        3  30.16   10.05   3.047 0.0525 .
## Residuals   20  65.99    3.30                 
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

We see that F0=3.047F_0 = 3.047 which has a pp-value .0525.0525.

(ii)

\fbox{\begin{minipage}{\textwidth} Comment on the additional information provided by the pp-value, beyond a determination of statistical significance alone. \end{minipage}}

The pp-value quantifies a measure of evidence beyond a determination of statistical significance.

Earlier, we obtained a pp-value of .0525.0525. If we were to perform a dichotomous hypothesis test which specifies a significance level α=0.05\alpha = 0.05, we would decide H0H_0. However, had we obtained a pp-value of .049.049, while the strength of the evidence is nearly the same, we would decide HAH_A. Presenting the pp-value provides more information.

(b)

Consider the orthogonal contrasts \begin{align*} \Gamma_1 &= \mu_1 - \mu_2\ \Gamma_2 &= \mu_3 - \mu_4\ \Gamma_3 &= (\mu_1 + \mu_2) - (\mu_3 + \mu_4). \end{align*}

Preliminary analysis

A \emph{constrast} Γ\Gamma is a linear combination of parameters

Γ=i=1aciμi \Gamma = \sum_{i=1}^{a} c_i \mu_i

such that c1++ca=0c_1 + \cdots + c_a = 0. Assuming a balanced design,

$$ \var(\sum_{i=1}^{a} c_i \bar{y}_{i\cdot}) = \frac{\sigma^2}{n} \sum_{i=1}^{a} c_i^2 $$

and thus

$$ t_0 = \frac{\sum_{i=1}^{a} c_i \bar{y}_{i\cdot}}{\left(\mse/n \sum_{i=1}^{a} c_i\right)^{1/2}} \sim t_{N-a} $$

under H0H_0. The FF-test may also be used, where $F_0 = t_0^2 = \frac{\msc}{\mse} = \frac{\ssc/1}{\mse}$ where

$$ \ssc = \frac{(\sum_{i=1}^{a} c_i \bar{y}_{i\cdot})^2}{1/n \sum_{i=1}^{a} c_i^2} \sim t_{N-a}. $$

We have a=4a=4 fluid types, 11, 22, 33, and 44. Fluid types 11 and 22 are from manufacturer AA and fluid types 33 and 44 are from manufacturer BB.

Γ1\Gamma_1 compares the average effect (on lifetime) of fluid 11 with the average effect of fluid 22, Γ2\Gamma_2 compares the average effect of fluid 33 with the average effect of fluid 44, and Γ3\Gamma_3 compares the average effect of manufacturer AA with the average effect of manufacturer BB.

(i)

\fbox{\begin{minipage}{\textwidth} Compute SSC for each contrast. Describe a general property for the sums of squares of orthogonal contrasts. Why is this property desirable? \end{minipage}}

Orthogonal contrasts

Orthogonality is desirable since the treatment sum of squares can be decomposed into specific effects.

Additional remarks:

Two contrasts (assuming a balanced design) with coefficients ci{c_i} and di{d_i} are orthogonal if i=1acidi=0\sum_{i=1}^{a} c_i d_i = 0.

For aa factor levels (or treatments), a set of (a1)(a-1) orthogonal contrasts Γ1,,Γa1\Gamma_1,\ldots,\Gamma_{a-1} are independent with df=1\operatorname{df} = 1 and thus tests performed on them are independent.

We see that Γ1\Gamma_1, Γ2\Gamma_2, and Γ3\Gamma_3 are orthogonal contrasts.

Computing $\ssc$