Chapter 10: Multifactorial analyses using R

Figure 10.1 Output for one-way analysis of variance

Box 10.3 Presenting the results for one-way analysis of variance

Figure 10.1 and Box 10.3 Code

# Tell R where you will be working.  R will look here for your data:
setwd("~/Documents/Books/Presenting/data/rData/")
# See chapter 2 for more details on the setwd() function.
 
# Use the load() function to load the dataset smokingbwt.RData out of the   
# current folder/directory into R's interactive working memory/environment. See  
# chapter 2 for more details. 
load("smokingbwt.RData")
 
# Use the library() function to load the psych package for the describe() 
# function.  See chapter 2 for more details on library().
library(psych)
# The describe() function in the psych package reports descriptive statistics, 
# including the standard deviation.  See chapter 2 for more details.  
describe(smokingbwt$birthwt)  # Look for bad/impossible data.
 
# The table() function builds frequency and cross-tabulation tables.  With a single
# variable it counts the records at each level.
table(smokingbwt$smokegroup)  # Check for empty factor levels.
 
# Make a dataset that keeps only the two variables and drops the record 
# whose factor level is "" (blank).  "" is a legal value, not a missing value (NA),
# so the complete.cases() function will not drop it.  The syntax is: 
#    dataframe[ what records , what columns ]
completeSmokebwt <- smokingbwt[smokingbwt$smokegroup != "", 
                               c("birthwt", "smokegroup") 
                               ]
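
# A minimal sketch on made-up data (toyData and its values are hypothetical, not
# taken from smokingbwt) of why the blank needs explicit handling: complete.cases()
# only drops NA, and subsetting leaves "" behind as an unused factor level until
# droplevels() removes it.
toyData <- data.frame(birthwt    = c(3.1, 3.4, 2.9, 3.6),
                      smokegroup = factor(c("never smoked", "", 
                                            "15+ cigs/day", "never smoked"))
                      )
sum(complete.cases(toyData))    # The blank record still counts as complete.
toyKept <- toyData[toyData$smokegroup != "", ]
levels(toyKept$smokegroup)      # "" is still listed as a level.
toyKept$smokegroup <- droplevels(toyKept$smokegroup)
levels(toyKept$smokegroup)      # Only the levels that occur remain.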
 
library(gmodels)  # Load the gmodels package for the ci() function.
# See chapter 2 for details on the with() function.
with(completeSmokebwt,
     # First, group by smokegroup levels, then calculate confidence limits on birthwt.
     by(birthwt, smokegroup, ci)  
     )
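
# A base R sketch of the same idea on made-up data (toyWt and toyGrp are
# hypothetical).  t.test() returns a t-based 95% confidence interval for a mean,
# and tapply() applies it within each group.  This is an alternative shown for
# illustration; it is not the gmodels code used above.
toyWt  <- c(3.2, 3.5, 3.1, 3.4, 2.8, 3.0, 2.9, 3.1)
toyGrp <- factor(rep(c("never smoked", "15+ cigs/day"), each = 4))
toyCI  <- tapply(toyWt, toyGrp, 
                 function(x) t.test(x)$conf.int   # lower and upper limits
                 )
toyCI   # One pair of confidence limits per group.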
 
# Load the gdata package for the reorder() function.
library(gdata)    
# Use the reorder() function to set the levels of a categorical factor so it prints
# in logical order.
completeSmokebwt$smokegroup <- reorder(completeSmokebwt$smokegroup, 
                                       new.order = c("never smoked",
                                                     "ex pre-pregnancy", 
                                                     "ex in pregnancy", 
                                                     "1-14 cigs/day", 
                                                     "15+ cigs/day")
                                       )
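
# A sketch of the same kind of reordering in base R, on a made-up factor (toyGroup
# is hypothetical).  factor() with an explicit levels argument sets the order.  With
# R's default treatment contrasts, the first level becomes the reference group in lm().
toyGroup <- factor(c("15+ cigs/day", "never smoked", "1-14 cigs/day"))
levels(toyGroup)    # The default order is alphabetical.
toyGroup <- factor(toyGroup, 
                   levels = c("never smoked", "1-14 cigs/day", "15+ cigs/day")
                   )
levels(toyGroup)    # The order now follows the levels argument.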
 
# The lm() function builds linear models.  Here the code makes an 
# lm object, named modelBirthwt, holding the details of the predictive model.
# See chapter 9 for more details on lm().
modelBirthwt <- with(completeSmokebwt,
                     # linear model predicting birth weight from smoking group
                     lm(birthwt ~ smokegroup) 
                     )
 
# The summary() function pays attention to the type of object it is summarizing.
# See chapter 9 for more details.
summary(modelBirthwt)  # Display the model with estimates relative to the "never smoked" group.
 
# The anova() function, when used with an lm object, shows the results of a 
# linear model as an ANOVA table.  Here it is used to show the overall p-value 
# for smoking.
anova(modelBirthwt)  
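
# A sketch on simulated data (toyBwt, toyFit and the group means are hypothetical)
# showing that the overall p-value in the ANOVA table of an lm object is the classical
# one-way ANOVA F-test: oneway.test() with var.equal = TRUE gives the same p-value.
set.seed(1)
toyBwt <- data.frame(birthwt    = rnorm(30, 
                                        mean = rep(c(3.4, 3.3, 3.1), each = 10), 
                                        sd = 0.4),
                     smokegroup = factor(rep(c("never smoked", "1-14 cigs/day", 
                                               "15+ cigs/day"), 
                                             each = 10))
                     )
toyFit      <- lm(birthwt ~ smokegroup, data = toyBwt)
pFromLm     <- anova(toyFit)[["Pr(>F)"]][1]    # overall p-value for smokegroup
pFromOneway <- oneway.test(birthwt ~ smokegroup, data = toyBwt, 
                           var.equal = TRUE)$p.value
all.equal(pFromLm, pFromOneway)    # The two p-values agree.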
 
with(completeSmokebwt,
     # The pairwise.t.test() function computes p-values for every pairwise 
     # comparison between levels of the predictor and adjusts them for 
     # multiple testing. 
     pairwise.t.test(birthwt,                    # outcome
                     smokegroup,                 # predictor
                     p.adjust.method = "holm"    # Adjust with the Holm method.
                     )
     )
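
# A sketch of what the Holm adjustment does, using made-up raw p-values (rawP is
# hypothetical).  With three comparisons, the smallest p-value is multiplied by 3,
# the next by 2 and the largest by 1, and the results are then forced to be
# non-decreasing (and capped at 1).
rawP  <- c(0.01, 0.02, 0.03)
holmP <- p.adjust(rawP, method = "holm")
holmP    # 0.03 0.04 0.04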
 
 
 
library(agricolae)  # Load the agricolae package for the scheffe.test() function. 
# The scheffe.test() function does not give pairwise p-values but shows which groups/levels
# are different from others.
scheffe.test(modelBirthwt,         # the name of the model
             trt = "smokegroup",   # the variable with levels to be compared
             console = TRUE        # Show results.
             )