Factorial Design
In a factorial design, there are more than one factors under consideration in the experiment. The test subjects are assigned to treatment levels of every factor combinations at random.
Example
A fast food franchise is test marketing 3 new menu items in both East and West Coasts of continental United States. To find out if they the same popularity, 12 franchisee restaurants from each Coast are randomly chosen for participation in the study. In accordance with the factorial design, within the 12 restaurants from East Coast, 4 are randomly chosen to test market the first new menu item, another 4 for the second menu item, and the remaining 4 for the last menu item. The 12 restaurants from the West Coast are arranged likewise.
Problem
Suppose the following tables represent the sales figures of the 3 new menu items after a week of test marketing. Each row in the upper table represents the sales figures of 3 different East Coast restaurants. The lower half represents West Coast restaurants. At .05 level of significance, test whether the mean sales volume for the new menu items are all equal. Decide also whether the mean sales volume of the two coastal regions differs.
==========
Item1 Item2 Item3
E1 25 39 36
E2 36 42 24
E3 31 39 28
E4 26 35 29
West Coast:
==========
Item1 Item2 Item3
W1 51 43 42
W2 47 39 36
W3 47 53 32
W4 52 46 33
Solution
The solution consists of the following steps:
- Save the sales figure into a file named "fastfood-3.csv" in CSV format as follows.
- Load the data into a data frame named df3 with the read.csv function.
- Concatenate the data rows in df3 into a single vector r .
- Assign new variables for the treatment levels and number of observations.
- Create a vector that corresponds to the 1th treatment level of the response data r in step 3 element-by-element with the gl function.
- Similarly, create a vector that corresponds to the 2nd treatment level of the response data r in step 3.
- Apply the function aov to a formula that describes the response r by the two treatment factors tm1 and tm2 with interaction.
- Print out the ANOVA table with summary function.
Answer
Since the p-value of 0.0015 for the menu items is less than the .05 significance level, we reject the null hypothesis that the mean sales volume of the new menu items are all equal. Moreover, the p-value of 1.2e-05 for the east-west coasts comparison is also less than the .05 significance level. It shows there is a difference in overall sales volume between the coasts. Finally, the last p-value of 0.0113 (< 0.05) indicates that there is a possible interaction between the menu item and coast location factors, i.e., customers from different coastal regions have different tastes.
Exercise
Create the response data in step 3 above along vertical columns instead of horizontal rows. Adjust the factor levels in steps 5 and 6 accordingly.