Stat 362: Sections 4.1–4.4 Worksheet
Name:
UIN:
Show all of your work / code. No credit will be given if there is no work / code.
1. We have recorded measurements about the drying time, in hours, of a certain brand of latex
paint. Assume that the measurements are a simple random sample.
Import the data from the file “LatexPaint.csv”. There is a header in the dataset.
(a) Find the summary statistics for this dataset. Fill in the table below with your answers
(no code required). Hint: You may need to change the data type of the dataset. To do
this, use the command as.matrix(dataset) and save the result as a new object. Use the
converted values for the calculations and graphs below.
Statistic              Value
Sample Size            ________
Minimum                ________
Median                 ________
Mean                   ________
Maximum                ________
Variance               ________
Standard Deviation     ________
95th Percentile        ________
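As a rough illustration of the import-and-convert hint, the steps might look like the sketch below in R. The drying-time values are invented and written to a temporary file so that the code runs on its own; the column name DryingTime and the object names paint, x, and stats are assumptions, not part of the worksheet. Note that quantile() uses R's default interpolation rule, which can differ slightly from a hand calculation of the percentile.

```r
# Hypothetical stand-in for LatexPaint.csv (values invented for illustration only).
tmp <- tempfile(fileext = ".csv")
writeLines(c("DryingTime", "3.4", "2.5", "4.8", "2.9", "3.6", "2.8", "3.3", "5.6"), tmp)

paint <- read.csv(tmp, header = TRUE)  # data frame with one numeric column
x <- as.matrix(paint)                  # numeric matrix, as the hint suggests

# Collect the statistics requested in the table.
stats <- c(
  n        = length(x),
  minimum  = min(x),
  median   = median(x),
  mean     = mean(x),
  maximum  = max(x),
  variance = var(as.vector(x)),        # as.vector() so var() returns a single number
  sd       = sd(x),
  p95      = unname(quantile(x, 0.95)) # default (type 7) interpolation
)
print(stats)
```

With the real file, the temporary-file lines would be replaced by read.csv("LatexPaint.csv", header = TRUE).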
(b) Determine whether the data are approximately normally distributed by making a Q-Q plot
and performing the Shapiro-Wilk test. Report your code and state whether you think the data
are normal.
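A minimal sketch of the two normality checks in R, using simulated placeholder data rather than the paint measurements (the object names x and sw are assumptions). qqnorm() and qqline() draw the Q-Q plot with a reference line, and shapiro.test() reports the test statistic W and a p-value.

```r
set.seed(1)                          # make the placeholder data reproducible
x <- rnorm(30, mean = 3.5, sd = 1)   # hypothetical values, not the real dataset

qqnorm(x)                            # sample quantiles vs. theoretical normal quantiles
qqline(x)                            # reference line through the quartiles

sw <- shapiro.test(x)                # null hypothesis: the data come from a normal distribution
print(sw)
```

Points that fall close to the reference line, together with a p-value above the chosen significance level (0.05 is a common convention), give no evidence against normality; neither check proves that the data are normal.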
(c) Do there appear to be any outliers in this dataset? Hint: Draw a boxplot. Report your
code and state whether you think the data have any outliers.
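The boxplot hint could be sketched as follows. The values are invented, with one deliberately extreme value so that the plot has something to flag; the object names x and bp are assumptions. By default, boxplot() marks points that lie more than 1.5 times the box length beyond the box edges, and it returns those points in the out component.

```r
# Hypothetical drying times; 9.7 is included on purpose as an extreme value.
x <- c(3.4, 2.5, 4.8, 2.9, 3.6, 2.8, 3.3, 9.7)

bp <- boxplot(x, horizontal = TRUE, main = "Drying time (hours)")
print(bp$out)   # values drawn as individual points beyond the whiskers
```

An empty bp$out means the default rule flags nothing; whether a flagged point is a genuine outlier is still a judgment call.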
2. We have obtained data from a study of the emissions produced when gasoline is burned in
internal combustion engines. The file contains emission levels of two major pollutants, CO (carbon monoxide emissions, in grams per mile) and NOX (nitrogen oxide emissions), for a sample
of 46 light-duty engines.
The data are in the file “Pollutants.csv”. There is a header in the dataset.
Find the summary statistics and the sample sizes for carbon monoxide emissions grouped by nitrogen oxide level. Report your code, and fill in the table
below.
CO Values
NOX Level ↓   Minimum   Q1       Median   Mean     Q3       Maximum   Sample Size
Low           ______    ______   ______   ______   ______   ______    ______
Medium        ______    ______   ______   ______   ______   ______    ______
High          ______    ______   ______   ______   ______   ______    ______
Code:
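One possible way to get grouped summaries in R is sketched below. It assumes the file has columns named CO and NOX and that NOX is already recorded as the levels Low, Medium, and High; the data frame poll and its values are invented stand-ins for Pollutants.csv. If NOX is stored as a number in the real file, it would first have to be binned into levels, which this sketch does not attempt.

```r
# Hypothetical stand-in for Pollutants.csv (values invented for illustration only).
poll <- data.frame(
  CO  = c(4.2, 5.1, 6.3, 7.8, 9.0, 10.4, 12.5, 14.1, 15.3),
  NOX = c("Low", "Low", "Low", "Medium", "Medium", "Medium", "High", "High", "High")
)

# Order the levels so the output matches the rows of the table.
poll$NOX <- factor(poll$NOX, levels = c("Low", "Medium", "High"))

by_level <- tapply(poll$CO, poll$NOX, summary)  # Min, Q1, Median, Mean, Q3, Max per level
sizes    <- table(poll$NOX)                     # sample size per level

print(by_level)
print(sizes)
```

tapply() applies summary() to the CO values within each NOX level, and table() counts the observations per level.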