100 CTIAPTER 3 Numerical Descriotive Measures
Problems for Sections 3.1 and 3.2
LEARNING THE BASICS
3.1 The following is a set of data from a sample of n : 5:
7 4 9 8 2
a. Compute the mean, median, and mode.
b. Compute the range, variance, standard deviation, and
coefficient of variation.
c. Compute the Z scores. Are there any outliers?
d. Describe the shape of the data set.
3.2 The following is a set of data from a sample of n : 6:
7 4 9 7 3 1 2
a. Compute the mean, median, and mode.
b. Compute the range, variance, standard deviation, and
coefficient of variation.
c. Compute the Z scores. Are there any outliers?
d. Describe the shape of the data set.
3.3 The following set of data is from a sample of n : J:
1 2 7 4 9 0 7 3
a. Compute the mean, median, and mode.
b. Compute the range, variance, standard deviation, and
coefficient of variation.
c. Compute the Z scores. Are there any outliers?
d. Describe the shape of the data set.
3.4 The following is a set of data from a sample of n : 5:
7 – 5 – 8 7 9
a. Compute the mean, median, and mode.
b. Compute the range, variance, standard deviation, and
coefficient of variation.
c. Compute the Z scores. Are there any outliers?
d. Describe the shape of the data set.
3.5 Suppose that the rate of return for a particular stock
during the past two years was l0% and30o/o. Compute
the
geometric rate of return. (Note: A rate of return of 10% is
recorded as 0.10. and a rate of return of 30% is recorded
as 0.30. )
3.5 Suppose that the rate of return for a particular stock
during the past t\\o vears *’as 20o/o and-30o/o. Compute the
geometric rate of return.
APPLYING THE CONCEPTS
3.7 A business school reported its findings from a study
of recent graduates. A sample of n : l0 finance majors
had a mean starting salary of $45,000, a median starting
salary of $45,000, and a standard deviation of $10,000.
A sample of n : l0 information systems majors had
a mean starting salary of $56,000, a median of $45,000,
and a standard deviation of $37,000. Discuss the central
tendency, variation, and shape of starting salaries for the
two majors.
3.8 The operations manager of a plant that manufactures tires
wants to compare the actual inner diameters of two grades
of tires, each of which is expected to be 575 millimeters.
F, srrrra\s st {rrR \scs st eus\ git(s -lrlu\ \e,\e,\\\q \\( tlNs”
results representing the inner diameters of the tires, ranked
from smallest to lareest” are as follows:
GradeX Grade Y
s68 570 575 578 584 573 574 575 577 578
a. For each of the two grades of tires, compute the mean,
median. and standard deviation.
b. Which grade of tire is providing better quality?
Explain.
c. What would be the effect on your answers in (a) and (b) il
the last value for grade )’were 588 instead of 578? Explain.
3.9 According to the U.S. Census Bureau, in November
2008 the median sales price of new houses was $220,400,
and the mean sales price was $287,500 (extracted from
www.census.gov, January 21, 2009).
a. Interpret the median sales price.
b. Interpret the mean sales price.
c. Discuss the shape of the distribution of the price of new
houses.
3.10 The file EE!ft[@lE contains the prices for
two tickets, with online service charges, large pop-
corn, and two medium soft drinks, at a sample of six theater
chains:
s36.15 $31.00 $3s.0s $40.2s $33.75 $43.00
Source: Data extracted”from K. Kelly, “The Multiplex Under Siege,”
The \\’all Street Journal, December 24-25, 2005, pp. P1, P5.
a. Compute the mean and median.
b. Compute the variance, standard deviation, range, and
coefficient of variation.
c. Are the data skewed? If so, how?
d. Based on the results of (a) through (c), what conclu-
sions can you reach concerning the cost of going to the
movies?
3.11 The file E!!EE contains the overall miies per gallon
(MPG) of 2009 sedans priced under $20,000.
2 7 3 1 3 0 2 8 2 7 2 4 2 9 3 2 ,
32 21 26 26 2s 26 2s 24
Source: Data extractedfrom “Vehicle Ratings,” Consumer Reports,
April 2009, p. 27.
a. Compute the mean, median, and mode.
b. Compute the variance, standard deviation, range, coeffi-
cient ofvariation, and Z scores.
c. Are the data skewed? If so, how?
d. Compare the results of (a) through (c) to those ol
Problem 3.12 (a) through (c) that refer to the miles pel
gallon of SUVs priced under $30,000.
!D
i3
8.,
t
The file f!!l contains the overall miles per gallon
I of 2009 small SUVs priced under $30,000.
5 22 21 22 22 18 t9 19 19 21 21
[ E 1 9 2 t t ] 2 2 1 8 1 8 2 2 t 6 t 6
7 Data extractedfrom “Vehicle Ratings,” Consumer Reporrs,
NfR. pp. 33-34.
the mean, median. and mode.
the variance, standard deviation, range, coeffi-
lh ofvariation, and Z scores.
the data skewed? If so, how?
Cqare the results of (a) through (c) to those of
Dublem 3.11 (a) through (c) that refer to the miles per
fllm of sedans priced under $20,000.
The file fatit!:Ey?f4frt contains the cost (in cents) per
serving for a sample of 13 chocolate chip cookies.
are as follows:
3.2 Yariationand Shape 101
a. For money market accounts and five-year CDs, sepa-
rately compute the variance, standard deviation, range,
and coefficient of variation.
b. Based on the results of (a), do money market accounts or
five-year CDs have more variation in the highest yields
offered? Explain.
3.16 The file EEsfEE contains the starting admission
price (in $) for one-day tickets to l0 theme parks in the
United States:
58 63 4t 42 29 50 62 43 40 40
Source: Data extractedfrom C. Jackson and E. Gamerman,
“Rethinking the Thrill Factor” The Wall Street Journal, April I5-16,
2006, pp. Pl, P4.
a. Compute the mean, median, and mode.
b. Compute the range, variance, and standard deviation.
c. Based on the results of (a) and (b), what conclusions can
you reach concerning the starting admission price for
one-day tickets.
d. Suppose that the first value was 98 instead of 58. Repeat
(a) through (c), using this value. Comment on the differ-
ence in the results.
3.17 A bank branch located in a commercial district of a
city has the business objective of developing an improved
process for serving customers during the noon-to-l:00 p.u.
lunch period. The waiting time, in minutes, is defined as the
time the customer enters the line to when he or she reaches
the teller window. Data are collected from a sample of 15
customers during this hour. The file E!fifl contains the
results, which are listed below:
4 .2 t 5 .55 3 .02 s .13 4 .17 2 .34 3 .s4 3 .20
4.50 6 .10 0 .38 5 . r2 6 .46 6 . r9 3 .79
a. Compute the mean and median.
b. Compute the variance, standard deviation, range, coeffi-
cient ofvariation, and Z scores. Are there any outliers?
Explain.
c. Are the data skewed? If so, how?
d. As a customer u’alks into the branch office during the
lunch hour, she asks the branch manager how long she
can expect to wait. The branch manager replies, ‘Almost
certainly less than five minutes.” On the basis of the
results of (a) through (c), evaluate the accuracy of this
statement.
3.18 Suppose that another bank branch, located in a resi-
dential area, is also concerned with the noon-to-l p.v. lunch .
hour. The waiting time, in minutes, collected from a sample
of 15 customers during this hour, is contained in the file
f![!| and listed below:
9.66 5.90 8.02 5.79 8.73 3.82 8.01 8.35
10.49 6.68 s.64 4.08 6.11 9.91 s.47
a. Compute the mean and median.
b. Compute the variance, standard deviation, range, coeffi-
cient
of variation, and Z scores. Are there any outliers?
Explain.
t’t 25 23 36
futa ertractedfrom
4 3 7 4 3 2 s 4 7 2 4 4 5 4 4
“Chip, Chip, Hooray,” Consumer Reports,
W.p .7 .
the mean. median” and mode.
the variance, standard deviation, range, coeffi-
of variation, and Z scores. Are there any outliers?
fre data skewed? If so, how?
on the results of (a) through (c), what conclusions
you reach concerning the cost of chocolate chip
ies?
The file fjqlllift:iHEl+l contains the cost per ounce
a sample of 14 dark chocolate bars,
0.68 0.72 0.92 t . t4 1.42 0.94 037
0.57 1 .s l 0 .s7 0 .55 0 .86 t .4 t 0 .90
Dtro extractedfrom “Dark Chocolate: Wich Bars Are Best? ”
Reports, September 2007, p. 8.
the mean. median. and mode.
the variance, standard deviation, range, coefficient
ion, and Z scores. Are there any outliers? Explain.
the data skewed? If so, how?
on the results of (a) through (c), what conclusions
1ou reach concerning the cost ofdark chocolate bars?
ls there a difference in the variation of the yields of
qpes of investments? The file fflftlfi contains
inr+’ide highest yields of money market accounts and
CDs as of Mav 17 ” 2009:
trney Market Five-Year CD
2.25
2.20
2.12
2.03
2.02
3.70
3.66
3.65
3.50
3.50
extrac ted from www.Bankrate,com, May t 7. 2 009.
108 cHAPTER3Numerical DescriPtive Measures
The distributions in Panels A and D of Figure 3.5 are symmetric’
In these distributions’
mean and median are equal. In addition, the length of the left tail
is equal to the length of
right tail, and the median line divides the box in half’
The distribution in Panel B of Figure 3.5 is left-skewed’ The
few small values dis
the mean toward the left tail. For this ieft-skewed distribution,
there is a heavy clusterinl
values at tne frigtr end of the scale (i.e.. the right side);75% of
all values are found betu
the left edge of the box (Q1) and the end ofihe right tail
(X1u.r.,,)’
fher^e
is a long-1eft
that contains the smalle it ZS”t, of the Ialues. demonstrating
the lack of symmetry tn
data set.
The distribution in Panel C of Figure 3.5 is righrskewed’ The concentration
of values i
the low end of ihe scale (i.e., the lei side of theloxplot)’ Here, 75oh
of all values are fc
berween the beginning of ine ieft tait and the right edge of the box
(Q3) There is a long righ
that containr,*t. turg”rrz5oh of the r-alues. demonstrating
the lack of symmetry in
data set’
c. Construct a boxplot and describe its shape’
d. Compare vour answer in (c) with that from Pro
3..t(d) on Page 100′ Discuss.
APPLYING THE COI{CEPTs
3.27 The file EIiEIEMffi contains the cost (in centr
l-ounce sen’inffor a sample of 13 chocolate chip cot
The data are as follows:
5-{ ll 25 23 36 43 7 43 25 47 24 45
Source: Data ettracted from “Chip, Chip, Hooray”‘Consumer
Repons. June 2009, P.7.
a. Compute the first quartile (Q,), the third quartile
and the interquartile range.
b. List the five-number summary’
c. Constmct a boxplot and describe its shape’
3.28 The file EflGI|EEIE represent th
(S) per ounce for a sample of 14 dark chocolat’
0.68 0.72 0.92 1.14 l ’42 0.94 0.77 0’s7
0.57 0.5s 0.86 l .4l 0.90
Source: Data ettracted from “Dark Chocolate Which Bars are
Consumer Reports, September 2007, pp’ 1-8′
a. Compute the first quartile (p1), the third quartile
and the interquartile range’
b. List the five-number summary.
c. Construct a boxplot and describe its shape’
3.29 The fil” E[EE!E!@ contains data on the s
admission price ($) for one-day tickets to 10 theme p
the United States:
s8 63 41 42 29 50 62 43 40 40
Source: Data extractedfrom C’ Jackson and E’ Gamerman’
:nnni”n”g the Thrilt itactor” TheWall Street Jownal’ April
2006, pp. P1, P4.
a. Compute the first quartile (Q), h” third quartil
and the interquartile range.
Problems for Section 3.3
I-HARhIING ThIH SI\$ICS
3.23 The following is a set of data from a sample of n
: J
”
1 2 7 4 9 0 1 3
a. Compute the first quartile (Q1), the third quartile (Q3)’
and the interquartile range.
b. List the five-number summary’
c. Construct a boxplot and describe its shape’
d. Compare your answer in (c) with that from Problem 3’3(d)
on page 100. Discuss.
3.24 The following is a set of data from a sample of
n : 6 ‘ .
7 4 9 7 3 1 2
a. Compute the first quartile (Q),the third quartile (p3)’
and the interquartile range.
b. List the five-number summary’
c. Construct a boxplot and describe its shape’
d. Compare your answer in (c) with that from Problem
3.2(d) on Page 100. Discuss.
3.25 The following is a set of data from a sample of
n : 5 :
I 4 9 8 2
a. Compute the first quartile (pt)’ the third quartile (Q3)’
and the interquartile range.
b. List the five-number summary’
c. Construct a boxplot and describe its shape’
d. Compare your answer in (c) rvith that from Problem
3.1(d) onPage 100. Discuss.
3.26 The following is a set of data from a sample of
n — 5′.
7 – 5 – 8 1 9
a. Compute the first quartile (Q),the third quartile (Qt)’
and the interquartile range.
b. List the five-number summary’
ffie flne-number summary.
a boxplot and describe its shape.
llfht {ile f@ contains the overall miles per gallon
di009 small SUVs priced under $30,000:
l :1 : l 21 22 22 18 t9 19 t9 21 2l
Ms amctedfrom “Vehicle Ratings,” Consumer Reports,
n l l – l /
the first quartile (Q),Ihe third quartile (p3),
mterquartile range.
frc fne-number summary.
a boxplot and describe its shape.
!h lile gWf.mm contains the yields for a money
eunL a one-year certificate of deposit (CD), and
CD for 23 banks in the metropolitan New York
of\tay 28,2009. For each type ofaccount:
atmctedfrom www.Bankrate.corn, May 28, 2009.
the first quartile (Q1),the third quartile (p3),
mrerquartile range.
fn-e-number summary.
a boxplot and describe its shape.
A b.nk branch located in a commercial district of a
tftc business objective of developing an improved
fusen’ing customers during the noon-to-1:00 p.u.
*rd- The waiting time, in minutes, is defined as
tlhe customer enters the line to when he or she
frc teller window Data is collected from a sam-
qustomers during this hour. The file flfifl con-
results, which are listed below:
4′ tr 5.55 3.02 5.13 4.77 2.34 3.54 3.20
{50 6 .10 0 .38 5 .12 6 .46 6 . r9 3 .79
branch, located in a residential area. is also con-
fre noon-to-l p.rr,l. lunch hour. The waitine time. in
mllected from a sample of 15 customers during this
in the file fg and listed below:
8.02 5.19 8.73 3.82 8.01 8.3s
6.68 s.64 4.08 6.17 9.91 s.47
five-number summaries of the waiting times at
bank branches.
3.4 Numerical Descriptive Measures for a Populatio” 109
b. Construct boxplots and describe the shapes of the distri-
butions for the tu’o bank branches.
c. What similarities and differences are there in the dis-
tributions of the rvaiting times at the two bank
branches?
3.33 Using the data in [@fi@E.
a. Construct a PivotTable of the mean 2006 return by cate-
gory.and risk.
b. Construct a PivotTable of the standard deviation of the
2006 return by category and risk.
c. What conclusions can you reach concerning differences
between the categories of mutual funds (large cap,
medium cap, and small cap) based on risk factor (low,
average, and high)?
3.34 Using the data in EEEIIEE,
a. Construct a PivotTable of the mean three-year return by
category and risk.
b. Construct a PivotTable of the standard deviation of the
three-year return by category and risk.
c. What conclusions can you reach concerning differences
between the categories of mutual funds (large cap,
medium cap, and small cap) based on risk factor (low,
average, and high)?
3.35 Using the data in EEEIEEEE,
a. Construct a PivotTable of the mean five-year return by
category and risk.
b. Construct a PivotTable of the standard deviation of the
five-year return by category and risk.
c. What conclusions can you reach concerning differences
between the categories of mutual funds (large cap,
medium cap, and small cap) based on risk factor (low,
average, and high)?
3.36 Using the data in EEEEIE@.
a. Construct a PivotTable of the mean 2006 return by cate-
gory, objective. and risk.
b. Construct a Pir-otTable of the standard deviation of the
2006 rerurn by category objective, and risk.
c. What conclusions can you reach concerning differences
bet*-een the categories of mutual funds (large cap, medium
cap. and small cap) based on objective (growth or value)
and risk factor (low, average, and high)?
F{umerical Descriptive Measures for a Population
Sections 3.1 and 3.2 present various statistics that described the properties ofcentral tendency
and variation for a sample. If your data set represents numerical measurements for an entire
population, you need to calculate and interpret parameters and summary measures for a popu-
lation. In this section, you will learn about three population parameters: the population mean,
population variance, and population standard deviation.
To help illustrate these parameters, first review Table 3.6, which contains the one-year
returns for the five largest bond funds (in terms of total assets) as of May 20,2009 (stored in
@)
I
q
!
f,
tl
l-
f,
3-4 Numerical Descriptive Measures for a Populatio” 1 1 3
3 . 1 6 As in Example 3. 15, a population of l2-ounce cans of cola is known to have a mean fill-weight
of 12.06 ounces and a standard deviation of 0.02. Houever. the shape of the population is
. unknown, and you cannot assume that it is bell-shaped. Describe the distribution of fill-
rle weights. Is it very likely that a can will contain less than 12 ounces of cola?
SOLUTION
p + a : 12.06 + 0.02 : (12.M,12.08)
p. ! .2o : 12.06.r 2(0.02) : (12.02,12.10)
1t” * .3o : 12.06 + 3(0.02) : (12.00, 12.12)
Because the distribution may be skewed” you cannot use the empirical rule. Using the
Chebyshev rule, you cannot say anything about the percentage of cans containing betu’een
12.04 and 12.08 ounces. You can state that at least 75o/o of the cans will contain betqeen 12.02
and 12.10 ounces and at least 88.89% will contain between 12.00 and 12.12 ounces. Therefore-
between 0 and 11.11% of the cans will contain less than 12 ounces.
You can use these two rules to understand how data are distributed around the mean when you
have sample data. With each rule, you use the value you calculated for X in place of p and the
value you calculated for S in place of o. The results you compute using the sample statistics are
approximations because you used sample statistics (X S) and not population parameters (p, o).
csablishments in that locale:
10 .3 11 .1 9 .6 9 .0 r4 .5
13.0 6 .7 I1 .0 8 .4 10 .3
13.0 t t .2 7.3 5.3 t2.s
8 .0 11 .8 8 .7 10 .6 9 .5
n . l 10 .2 11 .1 9 .9 9 .8
11.6 1s . l 12 .5 6 .s 7 .s
10.0 r2.9 9.2 10.0 12.8
rz.s 9.3 10.4 12.7 10.5
9.3 1 1.5 10.7 I 1.6 7 .8
10.5 7 .6 10 .1 8 .9 8 .6
for Section 3.4
THE BASICS
f,ollowing is a set of data for a population with
7 s l l 8 3 6 2 1 9 8
the population mean.
the population standard deviation.
f,ollowing is a set of data for a population with
7 5 6 6 6 4 8 6 9 3
tte population mean.
the population standard deviation.
THE CONCEPTS
file E contains the quarterly sales tax receipts
of dollars) submitted to the comptroller of the
ir Lake for the period ending March 2009 by all
a. Compute the mean, variance, and standard deviation for
this population.
b. What percentage of these businesses have quarterly sales
tax receipts within +1, t2, or f 3 standard deviations of
the mean?
c. Compare your findings with what would be expected on
the basis of the empirical rule. Are you surprised at the
results in (b)?
3.40 Consider a population of 1,024 mutual funds that
primarily invest in large companies. You have determined
thal p,, the mean one-year total percentage return
achieved by all the funds, is 8.20 and that o,the standard
deviation, is 2.’15.
a. According to the empirical rule, what percentage of these
funds are expected to be within t I standard deviation of
the mean?
b. According to the empirical rule, what percentage of these
funds are expected to be within t2 standard deviations
of the mean?
c. According to the Chebyshev rule, what percentage of
these funds are expected to be within +1, +.2, or t3 stan-
dard deviations of the mean?
d. According to the Chebyshev rule, at least 93.75Yo of
these funds are expected to have one-year total returns
between what two amounts?
3.41 The file ffiEffi contains the state cigarette tax,
in dollars, for each of the 50 states as ofApril I,2009.
a. Compute the population mean and population standard
deviation for the state cigarette tax.
b. Interpret the parameters in (a).
120 cHAPTER3
Car
Numerical Descriptive Measures
Owner Government
2005 Ford F-150
2005 Chevrolet Silverado
2002 Honda Accord LX
2002 Honda Civic
2004 Honda Civic Hybrid
2002 Ford Explorer
2005 Toyota Camry
2003 Toyota Corolla
2005 Toyota Prius
Source: Data extracted from J. Healey, “Fuel Economy Calculations
to Be Altered,” USA Today, January I I, 2006, p. IB.
a. Compute the covariance.
b. Compute the coefficient of correlation.
c. Which do you think is more valuable in expressing the
relationship between owner-calculated and current gov-
ernment standards mileage-the covariance or the coeffi-
cient of correlation? Explain.
d. Based on (a) and (b), what conclusions can you reach
about the relationship between owner-calculated and cur-
rent government standards mileage?
3.48 College basketball is big business, with coachr
salaries, revenues, and expenses in the millions of dolla
The file !!t|!!!![![!$![ contains the coaches’ salari
and revenue for college basketball at selected schools ir
recent year.
Source: Data extracted from R. Adams, “Pay for Playoffs,” The Wal
StreetJonrnal, March l1-12,2006,pp. P1, P8.
a. Compute the covariance.
b. Compute the coefficient of correlation.
c. Based on (a) and (b), what conclusions can you reach abr
the relationship between a coaches’ salaries and revenue
3.49 College football players trying out for the NFL r
given the Wonderlic standardized intelligence test. The f
EtrEtr!| contains the average Wonderlic score of fo,
ball players trying out for the NFL and the graduation n
for football players at selected schools.
Source: Data ertractedfrom S. Walkef “The NFL\ Smartest kam,’
The Wall Streer Journal, September 30, 2005, pp. Wl, WI0.
a. Compute the covariance.
b. Compute the coefficient of correlation.
c. Based on (a) and (b), what conclusions can you rea
about the relationship between the average Wonder
score and graduation rate?
14.3
15.0
27.8
27.9
48.8
15.8
23.7
32.8
a 4 a
J I . J
16.8
17.8
26.2
34:2
47.6
18.3
28.5
33 .1
56.0
3.6 Descriptive Statistics: Pitfalls and Ethical Issues
This chapter describes how a set of numerical data can be characterized by the statistics d
measure the properties ofcentral tendency, variation, and shape. In business, descriptive stat
tics such as the ones you have learned about are frequently included in summary reports d
are prepared periodically.
The volume of information available on the Internet, in newspapers, and in magazines I
produced much skepticism about the objectivity of data. When you are reading informatl
that contains descriptive statistics, you should keep in mind the quip often attributed to I
famous nineteenth-century British statesman Benjamin Disraeli: ‘oThere are three kinds of li
lies, damned lies, and statistics.”
For example, in examining statistics, you need to compare the mean and the median.,l
they similar or are they very different? Or, is only the mean provided? The answers to th
questions will enable you to determine whether the data are skewed or symmetrical and whed
the median might be a better measure of central tendency than the mean. In addition, you shc
look to see whether the standard deviation has been included in the statistics provided. Withi
the standard deviation. it is difficult to determine the amount of variation that exists in the
Ethical considerations arise when you are deciding what results to include in a report
should document both good and bad results. In addition, when making oral presentations
presenting written reports, you need to give results inafair, objective, and neutral
Unethical behavior occurs when you selectively fail to report pertinent findings that are
mental to the support of a particular position.
126 CHAPTER 3 Numerical
Descriptive Measures
3.72 Thefite EEtlEffilE conlains:h””pj:l”nv
taxes per
capita for the 50 ‘tut”‘-u”d the District
of Columbia’
a. Compute the mean,;diarr, first
quartile’ andthird quartile’
b. Compute the range, interquartile
tT?::lutiunce’ stan-
“‘
Jutdiwiation, and coefficient of
varratron’
c. Construct a to*ptot’ n'” the data
skewed? If so’ how?
d. Based on the results oTia) tnrougfr
(c)’ what conclusions
can you reach concerning property
taxes
9er
caPita’ in thou-
sands of dollars, ;;=;J state
and the District of
Columbia?
3.73 The file GtitllEEEE includes
the
lotal
compensa-
tion (in $) of cEos oT-tn” tu’g”
public companies in 2008′
Source: Data extractedfrom D’ Jones and
B’ Hansen’ “CEO Pay
Dives in a Rough 2008″‘”ttit-‘nsatoday’com’
May 1′ 2009′
a. Compute tt” *”a”, m”diu'” firsiquartile’andthrd
quartile’
b. Compute the range, interquartile
rafB!’variance’ stan-
dard deviation, u”d coefficient of
variation’
c. Construct u Uotpfot’ A'” the data
skewed? If so’ how?
d. Based on the tt’oit’ oi tul trtto”grt
(c)’ what conclusions
can you ‘”u”i’ tot”t”i”g tn”
lotal compensation (in
$millions) of CEOs?
e. Compute the correlation coefficient
between compensa-
tion and the amount of bonus’
f. Compute the correlation coefficient le^tween
compensa-
tion and the change in stock price
in 2008′
g. What
“on”tt”io”3-“;;;”t
reach from the results of (e)
and (0?
Aooendedtoyourreportshouldbeallappropriatetableq
ctrarts, and numerical descriptive
measures’
Source: Data extracted Jrom www’Beer100’com’
June I 5′ 2009′
TEAM PROJECTS
The file f!ff@! contains
information- regarding nin
variables from a sample of 180 mutual
funds:
Fund number-Id””tiii”ution number
for each bond fun
Type-Type of UonJ’ comprising
the bond fund (intet
mediate government or short-term
corporate)
Assets-In millions of dollars
n””r-Sut”t charges (no or Yes)
Expense ratio-Ratio of “*p””t”t
to ne] a::ets inpercentE
R.’r”t” 2008-Twelve-month return
in 2008
Three-year ..*-iA”””alized return’
2006-2008
Five-year return-Annualized return’
2004-2008
Risk-Risk-of-loss factor of the
mutual fund (loq av
age. or high)
3.76Forexp€nseratioinpercentage,three-yearreturn’z
i::Iffifilil” -“un, median’ first quartile’
and th
3.74 You are planning to study for
your statistics examlna-
tion with a group of ciassmates’
one of whgm you particu-
;;;;ffiimpress’ rnit inoiuia”al
has volunteeredto use
Microsoft Excel to g”f ,h” needed
summary information’
tables, and charts f”t;;;; set containing
several numerical
and categori”al uu’iuUiei*assigned
by the instructor for study
purposes. This person tott’ over
to you with the printout
and exclaims,
.ol,ve g”iii^”iittte means,.the medians,
the
standard deviations, ttt” Uo*pfott’tl
pie charts-for all our
variables. The problem-i;’ t”-” of
the output looks weird-
like the boxplots fo’ g”ni”t una fo1ryj91
and the pie charts
for grade point index-u”Jfot height’
Also’ I can’t under-
stand why Professor Krehiiei said-we
can’t get the descrip-
tive stats for some Jthe variables;
I got them for every-
thing! See, tfre -“u” fo’ fttigftt is
68’23’ the mean for grade
point index is2Jr6,’it” *”u’n for gender
is l’50’ the mean
i”t r”O”t is 4.33.” What is your
reply?
REPORT WRITING EXERCISES
3.75 The file EffiElE!!tr contains
the percentage of
alcohol, number orlt-per
12 ounces’ and number of
carbohydrate, ti” gtuittl p”t f,Z .o”111t
for 128 of the best-
,”if1rrg’Oo*tstic beers in the United
States’
Your task l’ to^ *tlt” a report
based on a complete
descriptive evaluation oi “u”t’
oi the numerical variables-
percentage of ut”oftot, number of
calories per 12 ounces’
and number of'”a’UoityOrates
(in grams) pet 12 ounces’
quartile.
b. Compute the range, interquartile
range’ variance’ sl
“‘
J”Ji.”iation, and coefficient of
variation’
c. Construct a boxplot’ Are the data
skewed? If so’ how
d. Based on the results oi 1u; ttttottglt !t):
tlut conclus:
–
;”; you reach concerning these
variables?
3.77 You want to compare bond
funds that have fee
ifror” ttt”t do not have fees’ For each
of these two groups
the variables expense ‘utio i” percentage’
return 2008′ tl
year return, and five-Year return’
a. Compute the mean, median’
first quartile’ and
quartile- . ,:1
b. iompute the range, interquartile
ralg:’varlance’
dard deviation, and coefficient of
variation’
c. Construct u Uo*ptot’ e’e the data
skewed? If so’ ho’
d. Based on the “”‘tt’
of (a) through (c)’ what conch
can you .”u”f’ uuot’t aiii”ten””t
between mutual
th”, il;t” fees and those that do not
have fees?
3.TSYouwanttocompareintermediategovernment
short-term “o’po’*-iond
funds’ For each of thes
n.”to., for the variables expense
ratio in percentage’
!.^ri”*, and five-Year
return’
a. Compute the mean’ median’
first quartile’ all(
quartile. . .:,
b. Compute the range’ interquartile
range’ varlanc€
dard deviation. un-d coefficient of
variation’
c. Construct u Uo*iot-Ate the data
skewed? If so’ h
d. Based on the results of (a) through
(c)’ what conc
can you reach about differences
between interr
g”t.”t-*t and short-term corporate
bond funds
3.79 You want to compare bond
funds based on r
each of these three levels of risk
(below average’ 2
60 CHAPTER 2 Organizrng and Visualizing Data
K E Y E Q U A T I O N S
Analyze 16
bar chart 32
categorical variable 16
cell 22
chartjunk 57
class boundaries 26
class interval 26
class interval width 26
class midpoint 27
class 26
Col lect 16
contingency table 22
continuous variable l7
cumulative percentage distribution 29
cumulative percentage PolYgon
(ogive) 43
data collection 20
DCOVA 16
Determining the Class Interval Width
hishest value – lowest value
Intervalwidth:f f i
Computing the Proportion or Relative Frequency
frequencY in each class
Proportion : relative frequencY : ffi
K E Y T E R M S
(2 .1 )
(2.2)
Define 16
discrete variable 16
drill down 54
frequency distribution 26
histogram 4l
interval scale I 8
multidimerrsional data 52
nominal scale 17
numerical variable 16
ogive (cumulative Percentage
polygon) 43
ordered array 25
ordinal scale 17
Organize l6
Pareto chart 35
Pareto princiPle 35
percentage distribution 28
percentage PolYgon 42
pie chart 34
PivotTable 52
primary data source 20
proportion 28
qualitative variable I 6
quantitative variable 1 6
ratio scale 18
relative frequencY 28
relative frequency distribution 28
scatter plot 48
secondary data source 20
side-by-side bar chart 37
stem-and-leaf disPlaY 40
summary table 2l
time-series plot 50
Visualize l6
C H A P T E R R E V I E W P R O B L E M S
CHECKING YCIUR [..JNDHRSTANSIN$
2.83 How do histograms and polygons differ in their con-
struction and use?
2.g4 Why would you construct a summary table?
2.85 What are the advantages and disadvantages of using a
bar chart, a pie chart, or a Pareto chart?
2.86 Compare and contrast the bar chart for categorical
data with the histogram for numerical data’
2.g7 What is the difference between a time-series plot an
d
a scatter plot?
2.88 Why is it said that the main feature of a Pareto chart
is its ability to separate the “vital few” from the “trivial
matty”?
2.89 What are the three different ways to break down the
percentages in a
contingency table?
2.90 \\’hat is the difference between a PivotTable and
contingency table?
2.91 \\’hat insights can you gain from a three-way tat
that are not available in a two-way table?
AP PLYI ING Th{ H C*hd*f; PTs
2.92 The summary table on the next page presents t
breakdown ofthe price ofa new college textbook:
a. Using the four categories, publisheq bookstore, author, a
freight, consh’uct a bar chart, a pie chafi, and a Pareto chi
b. Using the four subcategories of publisher and three st
categories of bookstore, along with the author and freil
categories, construct a Pareto chart.
c. Based on the results of (a) and (b), what conclusions t
you reach concerning who gets the revenue from the sa
of new college textbooks? Do any of these results s
prise you? ExPlain.
Rer-enue Category Percentage (%o)
>, :rce’. Data extracted Jrom T Lewin, “When Books Break the Bank,”- -,: \ew York Timeq September I 6, 200 3, pp. B 1, B 4.
2.93 The following table represents the estimated green
:,: , ‘.:r sales by renewable energy source in 2008:
:’ource Percentage (7o)
,-re otherrnal
l” dro
-:ndfill mass and biomass
S-.Iar
-‘rreported
“, \ :nd
-.,. -:ce National RenewabLe Energy Laboratory, 2008.
Chapter Review Problems 61
Results of a Yahool Kevword Tool for Searches Related
to “Sneakers”
Search Result Number of Occurrences
Jordan sneaker
Nike sneaker
Puma sneaker
Sneaker
Sneaker pimps*
* Sneaker pimps is a British electropop band
Source: Dala extracted,from K. J. Delaney, “The New Bene/its o.l lle:-
Search Qtteries,” The Wall Street Journal, February 6, 2007, p. 83.
a. For categories of online ad spending, construct a bar
chart, a pie chart, and a Pareto chart.
b. Which graphical method do you think is best for portray-
ing these data?
c. For the results of “sneakers” searches, construct a bar
chart, a pie chart, and a Pareto chart.
d. Which graphical method do you think is best for portray-
ing these data?
e. What conclusions can you reach concerning online ad
spending and the results of”sneakers” searches?
2.95 The owner of a restaurant serving Continental-style
entr6es has the business objective of learning more about
the patterns of patron demand during the Friday-to-Sunday
weekend time period. Data were collected from 630 cus-
tomers on the type of entr6e ordered and organized in the
following table:
Tlpe of Entr6e Number Served
187
103
30
25
122
63
74
26
630
Construct a percentage summary table for the trpes ..,
entr6es ordered.
b. Construct a bar chart, a pie chart, and a Pareto ci::: :- l
the tvoes ofentr6es ordered.
c. Do you prefer using a Pareto chart or a pie char
data? Why?
d. What conclusions can the restaurant o\\ ‘:r ir
cerning demand for different types of enlfeis
2.96 Suppose that the owner of the restaurf,ni rn Problem
2.95 also wanted to study the demand tbr dessen dunng the
same time period. She decided that in addition to srud)-ing
whether a dessert was ordered she ii’ould also srudv the gen-
der of the individual and uhether a beef entrie uas ordered.
f -bl isher
‘.lrnufacturing costs
r-{arketing and promotion
i.,lministrative costs and taxes
f.ier-tax profit
Jrokstore
!:npioyee salaries and benefits
-g,erations
letax profit
l”uthor
i:eight
64.8
1 a a
J Z . J
t5.4
10.0
1 . 1
22.4
I 1 . 3
6.6
4 .5
I 1 . 6
1 . 2
13.240
8 . 1 3 9
6.768
5 R g q i
2.8
1 1 .3
28.1
0.2
2.5
55 . I
iL arrnstruct a bar chafi, a pie chart, and a Pareto chart.
h. r,\ ‘hat conclusions can you reach about the sources of
.:een oower?
2-94 People conduct hundreds ofmiilions ofsearch queries
fl i :\ day. ln response, businesses are estimated to spend
&”:-..rst S20 billion annually on online ad spending. The fol-
, : i:1g represents the categories of online ad spending and
: :esults of a Yahoo! keyword tool for searches related to
:.;:kers”:
Spend ing
Ttpe Spending ($billions)
— -:ssified
r:splay aos
J’rid search
1-.’n medla/vlcleo
-tner
lrral
>: -::e’. Data extracted from K. J. Delaney, “The New BeneJits oJ Web-
k.:’:h Queries,” The Wa1l Street Journal, Februory 6. 2007, p. 83.
Beef
Chicken
Mixed
Duck
Fish
Pasta
Shellfish
Veal
Total
3 .32
3.90
8.29
2 . 1 5
1 . 8 5
1 9 . 5 1
62 CHAPTER 2 organizingandVisualizing Data
Datawereco l lec ted f rom600customersandorgan ized in
the following contingencY tables:
GENDER
DESSERT OROERED Male Female Total
96
z/.+
320
BEEF ENTREE
DESSERT ORDERED Yes Total
a. Construct a pie chart and a Pareto chart for the percent-
age of cor.tnties using the various methods’
U. fVhat conciusions can you reach concerning the type of
votins method used in November 2006?
c. \\’hat differences are there between the methods used in
2000 and 2006?
2.98 In summer 2000, a growing number of warranty
claims on Firestone tires sold on Ford SUVs prompted
Firestone and Ford to issue a major recall’ An analysis of
warrant\ claims data helped identify which models to recall’
A breakdou n of 2,504 warranty claims based on tire size is
giren in the following table:
Yes
No
Total
40
240
280
1 3 6
464
600
No
Tire Size Number of
WarrantY Claims
Yes
No
Total
a. For each of the two contingency tables, construct contln-
gency tables of row percentages’ column percentages’
and total Percentages’
b. Which type of percentage (row, column, or total) do
you
think is most informative for each gender’? For beef entree?
Explain.
c. What conclusious concerning the pattern of dessert
ordering can the restaurant owner reach?
2.97 The following data represent the method for recording
votes in the November 2006 election, broken down by percent-
age of counties in the United States, using each method and
G number of counties using each method in 2000 and 2006′
2,030
137
6 l
8 1
5 B
54
62
Sourc.: Ddk extrocted.from Robert L’ Simisott’ “Ford Steps Up Recall
llirhou: Ftresrotre,” The Wall Street Joutnal,Augttst l4′ 2000′ p A3′
The 1.0-r0 n’arranty claims for the 23575R15 tires can be
categonzed into ATX models and Wilderness models’ The
ti pe oi incident leading to a warranty claim, by model type’
is sunrntarized in the following table:
71 6s
1 16 348
187 413
t36
464
600
2 3 5 ” 5 R 1 5
3 1 1 0 5 0 R 1 5
30950R1 5
2 3 5 – 0 R 1 6
3 3 1 1 5 0 R 1 5
2 5 5 – 0 R 1 6
Others
Method
Percentage of
Counties Using
Method in 2006 (%)
Incident T1’Pe
ATX Model
Warranty Claims
Wilderness
WarrantY Claims
Tread separation
Blou’out
Other unknown 422
Total 1’864
r 165
77
59
4 l
66
166
Electronic
Hand-counted paPer ballots
Lever
Mixed
Optically’ scamed PaPer ballots
Punch card
Source: Data e\n’acIed -fi on1 R ltbtl
“‘
Paper-Trail l/oting Gets Organized
Opposition.” LS.\ Todar. ‘{pr.i1 )1. 2007, p 2A’
\umber of Counties
Method
309 1,742
370 51
434 62
92
l . r ) – l
572 13
Electronic
Hand-counted PaPer ballots
Lever
Mixed
Optically scanned PaPer ballots
Punch card
r49
1,279
Soutce’. Data extractedfrom R. Wolf, “Paper-Trail l/otittg Gets
Organized
Opposition,”IJSAToday,April 24, 2007′ p 2A’
Source: Dnla extracted;fi’om Rrtbert L’ Simison’
“Ford Steps Up Recall
Il’ir itottt Firestone, ” The Wall Street Journal, l ttgust 1 4′ 2000′
p A3 ‘
a. Construct a Pareto chart for the number of warrant'”
clain-rs by tire size. What tire size accounts for most c
the claims?
b. Construct a pie chart to display the percentage of the tot:.
number of warranty claims for the 23575R15 tires th’
come from the ATX model and Wilderness mode’
Interpret the chart.
c. Construct a Pareto chart for the type of incident causlll’
the warranty claim for the AIX model’ Does a certar:
type of incident account for most of the claims?
d. Construct a Pareto chart for the type of incident causir’:
thc warranty claim for the Wilderness model’ Does a ce:-
tain type of incident account for most of the claims?
2.99 One of the major measures of the qu’l i ty of sen’r;=
provideci by an organization is the speed u’th which t::
36.6
1 . 8
2.0
3.0
s6.2
0.4
2000 2006
rt
3a
t
t
i –
organization responds to customer complaints. A large
hmily-held department store selling furniture and flooring,
including carpet, had undergone a major expansion in the
lnst several years. In particular, the flooring department had
upanded ftom2 installation crews to an installation super-
risor, a measurer, and 15 installation crews. A business
djective of the company was to reduce the time between
lhen the complaint is received and when it is resolved.
Iluring a recent year, the company received 50 complaints
oncerning carpet installation. The data from the 50 com-
ints, organized in f@E, represent the number of days
the receipt of the complaint and the resolution of
complaint:
Chapter Review Problems 63
a. Construct a stem-and-leaf display for each of the three
variables.
b. Construct three scatter plots: money market account ver-
sus one-year CD, money market account versus five-year
CD, and one-year CD versus five-year CD.
c. Discuss what you learn from studying the graphs in (a)
and (b).
2.103 The file !!ftlslfftfi includes the total compensa-
tion (in $) of CEOs of large public companies in 2008.
Source: Data extractedfrom D. Jones and B. Hansen, “CEO Pay
Dives in a Rough 2008,” www.usatoday.com, May l, 2009.
a. Construct a frequency distribution and a percentage
distribution.
b. Construct a histogram and a percentage polygon.
c. Construct a cumulative percentage distribution and plot a
cumulative percentage polygon (ogive).
d. Based on (a) through (c), what conclusions can you reach
concerning CEO compensation in 2008?
2.104 Studies conducted by a manufacturer of “Boston”
and “Vermont” asphalt shingles have shown product weight
to be a major factor in customers’ perception of quality.
Moreover, the weight represents the amount of raw materi-
als being used and is therefore very impoftant to the com-
pany from a cost standpoint. The last stage of the assembly
line packages the shingles before the packages are placed on
wooden pallets. The variable of interest is the weight in
pounds of the pallet which for most brands holds 16 squares
of shingles. The company expects pallets of its “Boston”
brand-name shingles to weigh at least 3.050 pounds but less
than 3,260 pounds. For the company’s “Vermont” brand-
name shingles, pallets should weigh at least 3.600 pounds
but less than 3,800. Data are collected from a sample of 368
pallets of “Boston” shingles and 330 pallets of “Vermont”
shingles and stored in EEft!.
a. For the “Boston” shingles. construct a frequency distri-
bution and a percentage distribution having eight class
intervals, using 3,015, 3,050, 3,085, 3,120, 3,155, 3,190,
3,225,3,260, and 3.295 as the class boundaries.
b. For the “Vermont” shingles, construct a frequency distri-
bution and a percentage distribution having seven class
intervals, using 3,550, 3,600, 3,650, 3,700, 3,750, 3,800,
3,850, and 3,900 as the class boundaries.
c. Construct percentage histograms for the o’Boston” shin-
gles and for the “Vermont” shingles.
d. Comrnent on the distribution of pallet weights for the
“Boston” and “Vermont” shingles. Be sure to identify
the percentage of pallets that are underweight and
overweight.
2.105 The file !!!ft@ includes the overall cost
index, the monthly rent for a two-bedroom apartment, the
cost of a cup of coffee with service, the cost of a fast-food
hamburger meal, the cost of dry-cleaning a men’s blazer,the
cost of toothpaste, and the cost of movie tickets in 10 differ-
ent cities.
s r s
i l 1 9
[ 2 4
t3 10
33 68
35 137 31 27 152 2 r23 81 74 27
1 2 6 1 1 0 l l 0 2 9 6 t 3 5 9 4 3 1 2 6 5
165 32 29 28 29 26 25 | 14 13
s 2 7 4 5 2 3 0 2 2 3 6 2 6 2 0 2 3
ESa
d
Construct a frequency distribution and a percentage
distribution.
Construct a histogram and a percentage polygon.
Construct a cumulative percentage distribution and plot a
crrmulative percentage polygon (ogive).
the basis of the results of (a) through (c), if you had
m tell the president of the company how long a customer
stould expect to wait to have a complaint resolved” what
muld you say? Explain.
Data concerning 128 of the best-selling domestic
in the United States are contained in EEEEE!$.
ralues for three variables are included: percentage alco-
nrmber of calories per 12 ounces, and number of carbo-
(in grams) per 12 ounces.
Data extracted fromwww.Beerl00.com, June ‘/5, 2009
a percentage histogram for each of the three
niables.
three scatter plots: percentage alcohol versus
s, percentage alcohol versus carbohydrates, and
versus carbohydrates.
what you learn from studying the graphs in (a)
tb).
The file ![l[[[ffi contains the state cigarette tax,
for each state as of April 1,2009.
an ordered array.
e percentage histogram.
conclusions can you reach about the differences in
$ate cigarette tax between the states?
The file !!!@!f!l!l contains the yields for a money
rcount, a one-year certificate of deposit (CD), and
CD, for 23 banks in the metropolitan New York
of May.28,2009.
extracte d from www.Bankrat e.com, May. 2 8, 2 0 0 9
fg t
ftid
meats, poultry, and fish).
Source: U.S. Department of Agriculture’
a. Construct a percentage histogram
calories.
b. Construct a percentage histogram
cholesterol.
for the number of
for the amount of
64 CHAPTER 2 OrganrzingandVisualizing Data
a. Construct six separate scatter plots’ For each’ use the
overall cost index as the I axis’ Use the monthly rent
for a two-bedroom apartment, the costs of a cup of cof-
fee with service’ a fast-food hamburger meal’ dry-
cleaning a men’s blazer, toothpaste, and movie tickets
as the X axis’
b. What conclusions can you reach about the relationship of
the overall cost index to these six variables?
2.106 The file EEEE contains calorie and cholesterol
information .ot”.rnirrg popular protein foods (fresh red
IBM-Weekly closing stock price for IBM
AAPL-Weekly closing stock price for Apple
Source: D ata extracted from finance’yahoo’com’ January
I 3′ 2 0 09′
a. Construct a time-series plot for the weekly closing values
of the S&P 500Index, General Electric,IBM, andApple’
b. Explain any patterns present in the plots’
c. Write a short summary of your findings’
2.’l ‘ l} (Class Project) Have each student in the class
,.rpond’to the queslion “Which carbonated soft drink do
you most prefer?” so that the teacher can tally the results
into a summarY table.
a. Convert the data to percentages and construct a Pareto chart’
b. Analyze the findings.
2.111 (Class Project) Let each student in the class be cross-
classified on the basis of gender (male, female) and current
employment status (yes, no) so that the teacher can tally the
results.
a.Constructatablewitheitherroworcolumnpercentages’
depending on which you think is more informative’
b. Wirat rvould you conclude from this study?
c.Whatothervariableswouldyouwanttoknowregarding
employment in order to enhance your findings?
REPORT WRITING EXERCISES
2.’112 Referring to the results from Problem 2’104 onpage
63 concerning the weight of “Boston” and “Vermont” shin-
gles. u’rite a ieport that evaluates whether the weight of the
fallets of the two types of shingles are
what the companl-
.”p..rr. Be sure to incotporate tables and charts into the repon’
2.119 Referring to the results from Problem 2’98 on page
62 concerning the warranty claims on Firestone tires’ write
a report thatlvaluates warranty claims on Firestone tires
soldon Ford SUVs. Be sure to incorporate tables and charts
into the report.
TEAM PROJECT
The file f!!!f!fi!t contains information regarding
nine
variables from a sample of 180 mutual funds:
Fund number-Identification number for each bond fund
Type-Bond fund type (intermediate government or
short-term corPorate)
Assets-In millions of dollars
Fees-Sales charges (no or Yes)
Expense ratio-Ratio of expenses to net assets rn
percentage
Reiurn 20O8-Twelve-month return in 2008
Three-year return-Annualized return, 2006-2008
Five-year return-Annualized return, 2004-2 00 8
Risk-Risk-of-loss factor of the mutual fund (bel
average. average. or above average)
2.’114 For this problem, consider the expense ratio’
a. Construct a percentage histogram’
c. What conclusions can you reach from your analyses 1n
(a) and (b)?
2.107 The file G!!fft!! contains the weekly average
pri”. of gu*oline in the United States from January l’ 2007 ‘
io lun ruiy 12,2009. Prices are in dollars per gallon’
Source: U.S. Department of Energy, www’eia’doe’gov’ January 14′
2009.
a. Construct a time-series Plot.
b. What pattern, if any, is present in the data?
2.1Og The file EEI contains data for the amount of soft
drink filled in a sample of 50 consecutive 2liter bottles’ The
results are listed horizontally in the order of being filled:
2.109 2.086 2.066 2.015 2.065 2.05’1 2’052 2’044 2’036 2’038
2.031 2.029 2.025 2.029 2.023 2.020 2’015 2’014 2’013 2’014
2.012 2.012 2.012 2.010 2.005 2.003 1.999 1’996 l ‘991 1″992
i.994 1.986 1.984 1.981 1.9′ ,13 1.915 l .g ‘ , l l l ‘969 l ‘966 l ‘961
t .963 1 .957 1 .951 1 .951 I ‘941 1 .941 1 ‘941 1 ‘938 1 ‘908 l ‘894
a. Construct a time-series plot for the amount of soft drink
on the l’axis and the bottle number (going consecutively
from I to 50) on the Xaxis.
b. What pattern. if any, is present in these data?
c. If you had to maks a prediction about the amount of soft
arint nttea in the next bottle, what would you predict?
d. Based on the results of (a) through (c), explain why it is
important to construct a time-series plot and not just a
histogram, as was done in Problem 2’59 on page 48′
2.109 The S&P 500 lndex tracks the overall movement of
the stock market by considering the stock prices of 500 large
,orpotutionr. The file fs![!fft!!t contains weekly data for
this index as well as the daily closing stock prices for three
companies from January 2,2008, to January 12’2009’The
following variables are included:
WEEK-Week ending on date given
S&P-Weekly closing value for the S&P 500 Index
GE-Weekly closing stock price for General Electric