- Ch. 3, p. 95 – exercise 88
88. Refer to the Baseball 2005 data, which reports information on the 30 major league teams for the 2005 baseball season.
A. Select the variable team salary and find the mean, median, and the standard deviation.
B. Select the variable that refers to the age the stadium was built. (Hint: Subtract the year in which the stadium was built from the current year to find the stadium age and work with that variable.) Find the mean, median, and the standard deviation.
C. Select the variable that refers to the seating capacity of the stadium. Find the mean, median, and the standard deviation.
SEE EXCEL
- Ch. 5, p. 173 – exercise 56
Assume the likelihood that any flight on Northwest Airlines arrives within 15 minutes of the scheduled time is .90. We select four flights from yesterday for study.
A. What is the likelihood all four of the selected flights arrived within 15 minutes of the scheduled time?
B. What is the likelihood that none of the selected flights arrived within 15 minutes of the scheduled time?
C. What is the likelihood at least one of the selected flights did not arrive within 15 minutes of the scheduled time?
Assume the likelihood that any flight on Northwest Airlines arrives within 15 minutes of
the scheduled time is .90. We select four flights from yesterday for study.
a. What is the likelihood all four of the selected flights arrived within 15 minutes of the
scheduled time?
p^n
0.9^4 = 0.6561
b. What is the likelihood that none of the selected flights arrived within 15 minutes of
the scheduled time?
(1-p)^n
(1-0.9)^4
0.1^4
= 0.0001
c. What is the likelihood at least one of the selected flights did not arrive within 15 minutes
of the scheduled time?
at least one = 1 - prob(all arrived)
= 1 - 0.6561
= 0.3439
- Ch. 6, p. 217 – exercise 64
An internal study by the Technology Services department at Lahey Electronics revealed company employees receive an average of two emails per hour. Assume the arrival of these emails is approximated by the Poisson distribution.
A. What is the probability Linda Lahey, company president, received exactly 1 email between 4 P.M. and 5 P.M. yesterday?
B. What is the probability she received 5 or more email during the same period?
C. What is the probability she did not receive any email during the period?
The average is 2 per hour.
I'll use the Poisson calculator here:
a. What is the probability Linda Lahey, company president, received exactly 1 email between 4 P.M. and 5 P.M. yesterday?
0.270670566473225
Using the formula:
f(x) = e-λλx / x!
= e^(-2)*(2)^1 / 1
same result as the above with a calculator
b. What is the probability she received 5 or more email during the same period?
= 1 - 4 or less
= 1 - 0.947346982656289
= 0.052653017343711
Or with the formula:
1 - e^(-2)*(2)^0 / 1 - e^(-2)*(2)^1 / 1 - e^(-2)*(2)^2 / 2 - e^(-2)*(2)^3 / 6 - e^(-2)*(2)^4 / 24
which evaluates to the above
c. What is the probability she did not receive any email during the period?
0.135335283236613
Using the formula:
f(x) = e-λλx / x!
= e^(-2)*(2)^0 / 1
Again, the same result as the above...
- Ch. 7, p. 249 – exercise 50
Fast Service Truck Lines uses the Ford Super Duty F-750 exclusively. Management made a study of the maintenance costs and determined the number of miles traveled during the year followed the normal distribution. The mean of the distribution was 60,000 miles and the standard deviation 2,000 miles.
- What percent of the Ford Super Duty F-750s logged 65,200 miles or more?
- What percent of the trucks logged more than 57,060 but less than 58,280 miles?
- What percent of the Fords traveled 62,000 miles or less during the year?
- Is it reasonable to conclude that any of the trucks were driven more than 70,000 miles? Explain.
I'll be using this z table:
a. What percent of the Ford Super Duty F-750s logged 65,200 miles or more?
z(65200) = (65200-60000)/2000 = 2.6
prob(z > 2.6)
= 0.466%
b. What percent of the trucks logged more than 57,060 but less than 58,280 miles?
z(57060) = (57060-60000)/2000 = -1.47
z(58280) = (58280-60000)/2000 = -0.86
prob(-1.47 < z < -0.86)
= 12.41%
c. What percent of the Fords traveled 62,000 miles or less during the year?
z(62000) = (62000-60000)/2000 = 1
prob(z < 1)
= 84.13%
d. Is it reasonable to conclude that any of the trucks were driven more than 70,000 miles?
Explain.
z(70000) = (70000-60000)/2000 = 5
prob(z > 5)
= about 0% (it's greater than 0, but the z table shows it as 0)
It is not reasonable, since the proportion of a population that's more than 5 standard deviations above the mean is basically 0.
- Ch. 8, p. 288 – exercise 38
The mean amount purchased by a typical customer at Churchill's Grocery Store is $23.50 with a standard deviation of $5.00. Assume the distribution of amounts purchased follows the normal distribution. For a sample of 50 customers, answer the following questions.
A.What is the likelihood the sample mean is at least $25.00?
B.What is the likelihood the sample mean is greater than $22.50 but less than $25.00?
C.Within what limits will 90 percent of the sample means occur?
a. What is the likelihood the sample mean is at least $25.00?
z = (25-23.5)/(5/sqrt(50))
z = 2.12132
prob(z > 2.12132)
= 0.0169
b. What is the likelihood the sample mean is greater than $22.50 but less than $25.00?
z(22.5) = (22.5-23.5)/(5/sqrt(50)) = -1.4142
prob(-1.4142 < z < 2.12132)
= 0.9044
c. Within what limits will 90 percent of the sample means occur?
z = +/-1.6449
The interval goes from:
mean - z*sd/sqrt(N) to mean + z*sd/sqrt(N)
23.5 - 1.6449*5/sqrt(50) to 23.5 + 1.6449*5/sqrt(50)
22.3369 to 24.6631
- Ch. 9, p. 321 – exercise 54
Families USA, a monthly magazine that discusses issues related to health and health costs, surveyed 20 of its subscribers. It found that the annual health insurance premiums for a family with coverage through an employer averaged $10,979. The standard deviation of the sample was $1,000.
- Based on this sample information, develop a 90 percent confidence interval for the population mean yearly premium.
- How large a sample is needed to find the population mean within $250 at 99 percent confidence?
a. Based on this sample information, develop a 90 percent confidence interval for the
population mean yearly premium.
The t value for df = n-1 = 19, with 90% confidence is:
1.7291
The interval goes from:
mean - t*sd/sqrt(N) to mean + t*sd/sqrt(N)
10979 - 1.7291*1000/sqrt(20) to 10979 - 1.7291*1000/sqrt(20)
With a calculator:
10592.36 to 11365.64
b. How large a sample is needed to find the population mean within $250 at 99 percent
confidence?
The z value for 99% confidence is 2.5758
The formula for sample size is:
N = (z*sd/E)^2
N = (2.5758*1000/250)^2
N = 106.16
Round up to:
N = 107
- Ch. 10, p. 362 – exercise 42
During recent seasons, Major League Baseball has been criticized for the length of the games. A report indicated that the average game lasts 3 hours and 30 minutes. A sample of 17 games revealed the following times to completion. (Note that the minutes have been changed to fractions of hours, so that a game that lasted 2 hours and 24 minutes is reported at 2.40 hours.)
2.98 / 2.40 / 2.70 / 2.25 / 3.23 / 3.17 / 2.93 / 3.18 / 2.802.38 / 3.75 / 3.20 / 3.27 / 2.52 / 2.58 / 4.45 / 2.45
Can we conclude that the mean time for a game is less than 3.50 hours? Use the .05 significance level.
H0: game length is >= 3.5 hours
Ha: game length is < 3.5 hours
mean = 2.9553
stdev = 0.5596
Get the t test statistic:
t = (x-mu)/(stdev/sqrt(N))
t = (2.9553-3.5)/(0.5596/sqrt(17))
t = -4.0133
Get the critical value for df = N-1 = 16, one tail, alpha is 0.05:
-1.7459
Since our test statistic is much lower than the critical value, we reject the null hypothesis. There is enough evidence to conclude that games are shorter than 3.50 hours.
- Ch. 11, p. 402 – exercise 58
The amount of income spent on housing is an important component of the cost of living. The total costs of housing for homeowners might include mortgage payments, property taxes, and utility costs (water, heat, electricity). An economist selected a sample of 20 homeowners in New England and then calculated these total housing costs as a percent of monthly income, five years ago and now. The information is reported below. Is it reasonable to conclude the percent is less now than five years ago?
Home OwnerFive years agoNow
117%10%
22039
32937
44327
53612
64341
74524
81926
94928
104926
1135%32%
121632
132321
143312
154440
164442
172822
182919
193935
202212
SEE EXCEL
- Ch. 12, p. 445 – exercise 42
Martin Motors has in stock three cars of the same make and model. The president would like to compare the gas consumption of the three cars (labeled car A, car B, and car C) using four different types of gasoline.
For each trial, a gallon of gasoline was added to an empty tank, and the car was driven until it ran out of gas. The following table shows the number of miles driven in each trial.
of Gasoline / Distance (miles)Car A / Car B / Car C
Regular / 22.4 / 20.8 / 21.5
Super regular / 17.0 / 19.4 / 20.7
Unleaded / 19.2 / 20.2 / 21.2
Premium unleaded / 20.3 / 18.6 / 20.4
Using the .05 level of significance:
A. Is there a difference among types of gasoline?
B. Is there a difference in the cars?
SEE EXCEL
- Ch. 13, pp. 499 – exercise 37
A regional commuter airline selected a random sample of 25 flights and found that the correlation between the number of passengers and the total weight, in pounds, of luggage stored in the luggage compartment is 0.94. Using the .05 significance level, can we conclude that there is a positive association between the two variables?
SEE EXCEL
- Ch. 14, pp. 548-49 – exercise 17
The district manager of Jasons, a large discount electronics chain, is investigating why certain stores in her region are performing better than others. She believes that three factors are related to total sales:the number of competitors in the region, the population in the surrounding area, and the amount spent on advertising. From her district, consisting of several hundred stores, she selects a random sample of 30 stores. For each store she gathered the following information.
Y = / total sales last year (in $ thousands).X1 = / number of competitors in the region.
X2 = / population of the region (in millions).
X3 = / advertising expense (in $ thousands).
The sample data were run on MINITAB, with the following results.
Analysis of VarianceSource / DF / SS / MS
Regression / 3 / 3050.00 / 1016.67
Error / 26 / 2200.000 / 84.62
Total / 29 / 5250.000 / Top of Form
Â
Bottom of Form
Predictor / Coef / StDev / t-ratio
Constant / 14.00 / 7.00 / 2.00
X1 / -1.00 / 0.70 / 2.00
X2 / 30.00 / 5.20 / 5.77
X3 / 0.20 / 0.08 / 2.50
A.What are the estimated sales for the Bryne store, which has four competitors, a regional population of 0.4 (400,000), and advertising expense of 30 ($30,000)?
B.Compute the R2 value.
C.Compute the multiple standard error of estimate.
D.Conduct a global test of hypothesis to determine whether any of the regression coefficients are not equal to zero. Use the .05 level of significance.
E.Conduct tests of hypotheses to determine which of the independent variables have significant regression coefficients. Which variables would you consider eliminating? Use the .05 significance level.
a. What are the estimated sales for the Bryne store, which has four competitors, a
regional population of 0.4 (400,000), and advertising expense of 30 ($30,000)?
Use the regression:
14 - 1*4 + 30*0.4 + 0.2*30
= 28
So the sales would be $28000, since the regression gives the value in thousands
b. Compute the value of r^2.
ssr/sst
= 3050/5250
= 0.58095
c. Compute the multiple standard error of estimate.
sqrt(mse)
= sqrt(84.62)
= 9.1989
d. Conduct a global test of hypothesis to determine whether any of the regression coefficients
are not equal to zero. Use the .05 level of significance.
H0: coeffs are zero
Ha: coeffs are not zero
F = 1016.67/84.62
= 12.01454
The critical value, from an F table, with df = 3, 26 is: 2.975
The test statistic is much higher, so we reject the null.
At least one coefficient is non zero.
e. Conduct tests of hypotheses to determine which of the independent variables have
significant regression coefficien
Each hypothesis looks like this:
H0: coefficient for Xn is 0
Ha: coefficient for Xn is non zero
The critical T value, from a table, is 2.056
The t value for X1 is not outside that range, so we don't reject.
The t value for X2 and X3 are outside, so we reject.
X2 and X3 are significant.
- Ch. 17, p. 664-65 – exercise 22
Banner Mattress and Furniture Company wishes to study the number of credit applications received per day for the last 300 days. The information is reported on the next page.
Number of Credit Applications / Frequency (Number of Days)0 / 50
1 / 77
2 / 81
3 / 48
4 / 31
5 or more / 13
To interpret, there were 50 days on which no credit applications were received, 77 days on which only one application was received, and so on. Would it be reasonable to conclude that the population distribution is Poisson with a mean of 2.0? Use the .05 significance level. Hint: To find the expected frequencies use the Poisson distribution with a mean of 2.0. Find the probability of exactly one success given a Poisson distribution with a mean of 2.0. Multiply this probability by 300 to find the expected frequency for the number of days in which there was exactly one application. Determine the expected frequency for the other days in a similar manner.
SEE EXCEL