MATH 108 Fall 2007 Homework Solutions

These solutions are provided by the publisher of the textbook and are not for redistribution or copying beyond this semester.

HW# / Date Page# Question Answers

HW1 13 Sep p. 11 # 2.3 (a): Number of leaves; discrete; a single plant; 25
(b): Number of seizures; discrete; a single patient; 20

p. 24 # 2.5 (dotplot)

p. 24 # 2.6 (multiple ways to bin the histogram)

p. 30 # 2.20 mean = 3.492 lb, median = 3.36 lb.

p. 30 # 2.22 mean = 3.389 lb, median = 3.335 lb.

p. 30 # 2.28 mean approx 45, median approx 49 (median bigger than mean)

p. 30 # 2.29 approx 35%.

p. 38 # 2.33 (boxplot for men generally lower, although median and max are higher than for women)

p. 38 # 2.36 (b): centered at 25, min near 12, max near 36.

p. 48 # 2.45 mean = -12.44 mmHg, std dev = 17.6 mmHg.

p. 66 # 2.69 (a) mean = 0.19, std dev = 4.22.
(b) median = 1.0.
(c) mean = 1.44, std dev = 2.08, median = 1.25. median is resistant (robust); mean/stddev are not.

p. 66 # 2.78 The low volume distribution is symmetric, centered at 20, with a minimum of 0 and a maximum of 40. The high volume distribution is shifted down from the low volume distribution, with a median of about 18 and a maximum of 30, which is the third quartile for the low volume distribution. Thus, one-fourth of the low volume hospitals have mortality rates greater than the highest mortality rate among high volume hospitals.

HW2 20 Sep p. 77 # 3.1 (point is that human judgment doesn't yield very random samples)

p. 83 # 3.5 (a) 51%. (b) 94%. (c) 46%. (d) 54%

p. 87 # 3.6 (a) 0.3025. (b) 0.7975.

p. 87 # 3.9 (a) test positive: (0.10)(0.98) + (0.90)(0.01) = 10.7%
(b) 5.85%

p. 87 # 3.11 (a) (0.10)(0.92) + (0.90)(0.06) = 14.6%
(b) 0.092/0.146 = 63%
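
The 3.11 calculation combines the law of total probability with Bayes' rule. A minimal Python sketch follows; the numbers come from the solution above, but the labels "condition" and "positive" are assumptions, since the exercise wording is not reproduced here.

```python
# Numbers from the 3.11 solution; the variable names are illustrative assumptions.
p_condition = 0.10        # Pr{condition}
p_pos_given_cond = 0.92   # Pr{positive | condition}
p_pos_given_none = 0.06   # Pr{positive | no condition}

# Law of total probability: Pr{positive}
p_positive = (p_condition * p_pos_given_cond
              + (1 - p_condition) * p_pos_given_none)

# Bayes' rule: Pr{condition | positive}
p_cond_given_pos = p_condition * p_pos_given_cond / p_positive

print(round(p_positive, 3), round(p_cond_given_pos, 2))
```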

p. 92 # 3.13 (a) 1016/6549 = .1551
(b) 2480/6549 = .3787
(c) (1016 + 2480 - 526)/6549 = 2970/6549 = .4535
(d) 526/6549 = .0803

p. 92 # 3.14 Not independent: if smoking status of husband were independent of smoking status of wife, then the probability that in a couple both husband and wife would smoke would be (.30)(.20) = .06, rather than .08. Note that Pr{husband and wife both smoke} = Pr{husband smokes}Pr{wife smokes|husband smokes}. If smoking status of husband were independent of smoking status of wife, then we would have Pr{husband and wife both smoke} = Pr{husband smokes}Pr{wife smokes} = (.30)(.20) = .06. But Pr{husband and wife both smoke} = .08, not .06.

p. 96 # 3.16 (a) Pr{20 < Y < 30} = .41 + .21 = .62
(b) .41 + .21 + .03 = .65
(c) .01 + .34 = .35

p. 96 # 3.17 (a) (.35)(.35) = .1225
(b) (.35)(.24) = .084
(c) (.35)(.24) + (.24)(.35) = .168

p. 102 # 3.21 (a) .189 + .027 = .216
(b) .343 + .441 + .189 = .973

p. 102 # 3.22 (0)(.343) + (1)(.441) + (2)(.189) + (3)(.027) = .9

p. 102 # 3.23 The variance is (0 - .9)²(.343) + (1 - .9)² (.441) + (2 - .9)² (.189) + (3 - .9)² (.027) = .63
Thus, the standard deviation is 0.794.
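
The mean, variance, and standard deviation in 3.22 and 3.23 can be checked with a short Python sketch. The dictionary name `dist` is only an illustrative choice; the probabilities are the ones used in the solutions above.

```python
from math import sqrt

# Probability distribution from 3.21-3.23: value -> probability
dist = {0: 0.343, 1: 0.441, 2: 0.189, 3: 0.027}

# Mean: sum of value * probability
mean = sum(y * p for y, p in dist.items())

# Variance: sum of squared deviation from the mean * probability
variance = sum((y - mean) ** 2 * p for y, p in dist.items())
sd = sqrt(variance)

print(round(mean, 3), round(variance, 3), round(sd, 3))
```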

p. 117 # 3.38 (a) (0.10)² = 0.01
(b) (2)(.1)(.9) = 0.18

p. 117 # 3.46 (a) .66 (b) .21 (c) .38

HW3 27 Sep p. 92 # 3.12 (a) 1213/6549 = .1852
(b) 247/2115 = .1168
(c) No; the probability of a person being a smoker depends on whether or not the person has high income, since the answers to (a) and (b) differ.

p. 131 # 4.2 (a) 1.28 (b) 1.28

p. 131 # 4.3 (a) For y = 1500, z = (y - μ)/σ = (1500 - 1400)/100 = 1.00. From Table 3, the area is .8413 or 84.13%.
(b) For y = 1325, z = (y - μ)/σ = (1325 - 1400)/100 = -.75. From Table 3, the area below 1325 is .2266. From part (a), the area below 1500 is .8413. Thus, the percentage between 1325 and 1500 is .8413 - .2266 = .6147 or 61.47%.
(c) For y = 1325, z = (y - μ)/σ = (1325 - 1400)/100 = -.75. From Table 3, the area below 1325 is .2266. Thus, the percentage with Y ≥ 1325 is 1 - .2266 = .7734 or 77.34%.
(d) For y = 1475, z = 0.75. From Table 3, the area below 1475 is .7734. Thus, the percentage with Y ≥ 1475 is 1 - .7734 = .2266 or 22.66%.
(e) For y = 1600, z = 2.00. From Table 3, the area below 1600 is .9772. In part (d) we found that the area below 1475 is .7734. Thus, the percentage with 1475 ≤ Y ≤ 1600 is .9772 - .7734 = .2038 or 20.38%.
(f) For y = 1200, z = -2.00. From Table 3, the area below 1200 is .0228. In part (c) we found that the area below 1325 is .2266. Thus, the percentage with 1200 ≤ Y ≤ 1325 is .2266 - .0228 = .2038 or 20.38%.
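
The Table 3 lookups in 4.3 can be cross-checked with Python's standard-library `statistics.NormalDist`. This is only a sketch: the name `pop` is an illustrative choice, and the computed areas may differ from the table values in the fourth decimal place because Table 3 rounds z to two decimals.

```python
from statistics import NormalDist

# Population from Exercise 4.3: mean 1400, SD 100
pop = NormalDist(mu=1400, sigma=100)

below_1500 = pop.cdf(1500)                  # part (a)
between = pop.cdf(1500) - pop.cdf(1325)     # part (b)
at_least_1325 = 1 - pop.cdf(1325)           # part (c)
low_band = pop.cdf(1325) - pop.cdf(1200)    # part (f)

print(round(below_1500, 4), round(between, 4),
      round(at_least_1325, 4), round(low_band, 4))
```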

p. 131 # 4.4 (a) For y = 1325, z = -0.75. From Table 3, the area is .2266 or 22.66%.
(b) This is the same as part (e) of Exercise 4.3. For y = 1600, z = 2.00. From Table 3, the area below 1600 is .9772. For y = 1475, z = 0.75. From Table 3, the area below 1475 is .7734. Thus, the percentage with 1475 ≤ Y ≤ 1600 is .9772 - .7734 = .2038 or 20.38%.

p. 131 # 4.12 (a) μ = 5,000,000 which means that σ = (.008)(5,000,000) = 40,000. For y = 4,900,000, z = (4900000 - 5000000)/40000 = -2.5. For y = 5,100,000, z = (5100000 - 5000000)/40000 = 2.5. Thus, Pr{4,900,000 < Y < 5,100,000} = Pr{-2.5 < Z < 2.5} = .9938 - .0062 = .9876.
(b) Pr{.98μ < Y < 1.02μ} = Pr{(.98μ - μ)/(.008μ) < (Y - μ)/σ < (1.02μ - μ)/(.008μ)} = Pr{-2.5 < Z < 2.5} = .9938 - .0062 = .9876.
(c) A specimen reading Y differs from the correct value by 2% or more if it does not satisfy .98μ < Y < 1.02μ. Using the answer from part (b), this probability is 1 - .9876 = .0124 or 1.24%.

p. 141 # 4.21 (a) At the low end of the distribution the normal probability plot is fairly straight, indicating that the data agree with what one would expect from a normal distribution. Thus, the times for the fastest riders are roughly equal to the times one would expect if the data came from a truly normal distribution.
(b) At the high end of the distribution the normal probability plot bends upward, indicating that the times are greater than what one would expect from a normal distribution. Thus, the times for the slowest riders are worse than the times one would expect.

p. 144 # 4.22 (a) Pr{Y ≥ 40} = Pr{Y > 39.5} =~ Pr{Z > (39.5 - 38.5)/2.9 } = Pr{Z > .34} = 1 - .6331 = .3669 or 36.69%.
(b) Pr{Y = 40} = Pr{39.5 < Y < 40.5} =~ Pr{(39.5 - 38.5)/2.9 < Z < (40.5 - 38.5)/2.9 }. This is the probability that Z is between .34 and .69, which is .7549 - .6331 = .1218 or 12.18%.
(c) Pr{35 ≤ Y ≤ 40} = Pr{34.5 < Y < 40.5} =~ Pr{(34.5 - 38.5)/2.9 < Z < (40.5 - 38.5)/2.9 }. This is the probability that Z is between -1.38 and .69, which is .7549 - .0838 = .6711 or 67.11%.
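
The continuity correction in 4.22 can be sketched in Python as follows. The name `approx` is illustrative, and because the code does not round z to two decimals the way Table 3 does, its results agree with the answers above only to about two decimal places.

```python
from statistics import NormalDist

# Normal approximation with the mean and SD used in 4.22
approx = NormalDist(mu=38.5, sigma=2.9)

# Each whole number k is treated as the interval (k - 0.5, k + 0.5)
p_at_least_40 = 1 - approx.cdf(39.5)                  # (a) Pr{Y >= 40}
p_exactly_40 = approx.cdf(40.5) - approx.cdf(39.5)    # (b) Pr{Y = 40}
p_35_to_40 = approx.cdf(40.5) - approx.cdf(34.5)      # (c) Pr{35 <= Y <= 40}

print(round(p_at_least_40, 3), round(p_exactly_40, 3), round(p_35_to_40, 3))
```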

p. 146 # 4.36 By symmetry of the normal curve, μ = (61.2 + 67.4)/2 = 64.3 inches. In Table 3, the area closest to .9 is .8997, which corresponds to z = 1.28. Therefore, we have 1.28 = (67.4 - 64.3)/σ, which yields σ = (67.4 - 64.3)/1.28 = 2.4 inches.

p. 146 # 4.40 μ = 145; σ = 22. The distribution of readings is a normal distribution with mean μ (the true concentration) and standard deviation σ. A reading of 40 or more is considered "unusually high." Suppose that μ = 35 and σ = 4.
For y = 40, z = (40 - 35)/4 = 1.25. From Table 3, the area below 1.25 is .8944, which means that the area above 1.25 is 1 - .8944 = .1056. Thus, Pr{specimen is flagged as "unusually high"} = .1056.

p. 146 # 4.41 (a) 1 - .5948 = .4052
(b) 1 - .8729 = .1271
(c) .7549 - .4168 = .3381

p. 146 # 4.42 0.2546

p. 146 # 4.43 Pr{0 < Y < 15} = .7549 - .2546 = .5003. Thus we expect (400)(.5003), or about 200 observations to fall between 0 and 15.

p. 146 # 4.44 The IQR is 14.74 - (-.14) = 14.88. An outlier on the high end of the distribution is any point greater than 14.74 + (1.5)(14.88) = 37.06.
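
The outlier rule used in 4.44 (a point is an outlier if it lies more than 1.5 IQR beyond a quartile) can be sketched in Python. The quartiles are those quoted above; the lower fence is not asked for in the exercise and is shown only for symmetry.

```python
# Quartiles quoted in the 4.44 solution
q1, q3 = -0.14, 14.74

iqr = q3 - q1
upper_fence = q3 + 1.5 * iqr   # points above this are high outliers
lower_fence = q1 - 1.5 * iqr   # not part of 4.44; included for completeness

print(round(iqr, 2), round(upper_fence, 2), round(lower_fence, 2))
```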

p. 164 # 5.15 (a) In the population, μ = 176 and σ = 30. For y = 186, z = (186 - 176)/30 = .33. From Table 3, the area below .33 is .6293. For y = 166, z = -0.33. From Table 3, the area below -.33 is .3707. Thus, the percentage with 166 ≤ y ≤ 186 is .6293 - .3707 = .2586, or 25.86%.
(b) We are concerned with the sampling distribution of Ÿ for n = 9. From Theorem 5.1, the mean of the sampling distribution of Ÿ is 176, the standard deviation is σ_Ÿ = σ/√n = 30/√9 = 10, and the shape of the distribution is normal because the population distribution is normal (part 3a of Theorem 5.1).
For ÿ = 186, z = (186 - 176)/10 = 1.00. From Table 3, the area below 1.00 is .8413. For ÿ = 166, z = (166 - 176)/10 = -1.00. From Table 3, the area below -1.00 is .1587. Thus, the percentage with 166 ≤ ÿ ≤ 186 is .8413 - .1587 = .6826, or 68.26%.
(c) The probability of an event can be interpreted as the long-run relative frequency of occurrence of the event (Section 3.3). Thus, the question in part (c) is just a rephrasing of the question in part (b). It follows from part (b) that Pr{166 ≤ ÿ ≤ 186} = .6826.

HW4 11 Oct p. 164 # 5.16 (a) μ = 3000; σ = 400.
The event E occurs if Ÿ is between 2900 and 3100. We are concerned with the sampling distribution of Ÿ for n = 15. From Theorem 5.1, the mean of the sampling distribution of Ÿ is μ_Ÿ = μ = 3000, the standard deviation is σ_Ÿ = σ/√n = 400/√15 = 103.3, and the shape of the distribution is normal because the population distribution is normal (part 3a of Theorem 5.1).
For ÿ = 3100, z = (ÿ - μ_Ÿ)/σ_Ÿ = (3100 - 3000)/103.3 = 0.97. From Table 3, the area below 0.97 is 0.8340. For ÿ = 2900, z = -0.97. From Table 3, the area below -0.97 is 0.1660.
Thus, Pr{2900 ≤ Ÿ ≤ 3100} = Pr{E} = 0.8340 - 0.1660 = 0.6680.
(b) n = 60; σ_Ÿ = 400/√60 = 51.64; z = ±100/51.64 = ±1.94; Table 3 gives .9738 and .0262, so Pr{E} = .9738 - .0262 = .9476.
(c) As n increases, Pr{E} increases.
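
The effect of sample size in 5.16 can be illustrated with a small Python sketch. The helper name `prob_mean_within` is hypothetical, and the results differ slightly from the Table 3 answers because z is not rounded to two decimals.

```python
from math import sqrt
from statistics import NormalDist

def prob_mean_within(mu, sigma, n, half_width):
    """Pr{|Ybar - mu| <= half_width} when Ybar is normal with SD sigma/sqrt(n)."""
    ybar = NormalDist(mu, sigma / sqrt(n))
    return ybar.cdf(mu + half_width) - ybar.cdf(mu - half_width)

p15 = prob_mean_within(3000, 400, 15, 100)   # part (a)
p60 = prob_mean_within(3000, 400, 60, 100)   # part (b)

print(round(p15, 3), round(p60, 3))
```

Because the population mean enters only through the distance from it, the same function gives identical answers for 5.17, whatever value of the mean is used.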

p. 164 # 5.17 σ_Ÿ = 400/√15 = 103.3
(a) z = (2900 - 2800)/103.3 = .97. From Table 3, the area below .97 is .8340.
z = (2700 - 2800)/103.3 = -.97. From Table 3, the area below -.97 is .1660.
Thus, Pr{E} = .8340 - .1660 = .6680
(b) z = (2700 - 2600)/103.3 = .97. From Table 3, the area below .97 is .8340.
z = (2500 - 2600)/103.3 = -.97. From Table 3, the area below -.97 is .1660.
Thus, Pr{E} = .8340 - .1660 = .6680
(c) For fixed n and σ, Pr{E} does not depend on μ.

p. 164 # 5.20 (a) In the population, 65.68% of the fish are between 51 and 60 mm long. To find the probability that four randomly chosen fish are all between 51 and 60 mm long,
Pr{all 4 are between 51 and 60} = (0.6568)⁴ =~ 0.1861.
(b) The mean length of four randomly chosen fish is Ÿ. Thus, we are concerned with the sampling distribution of Ÿ for a sample of size n = 4 from a population with μ = 54 and σ = 4.5. From Theorem 5.1, the mean of the sampling distribution of Ÿ is μ_Ÿ = μ = 54, the standard deviation is σ_Ÿ = σ/√n = 4.5/√4 = 2.25, and the shape of the distribution is normal because the population distribution is normal (part 3a of Theorem 5.1).
For ÿ = 60, z = (ÿ - μ_Ÿ)/σ_Ÿ = (60 - 54)/2.25 = 2.67.
From Table 3, the area below 2.67 is .9962.
For ÿ = 51, z = (ÿ - μ_Ÿ)/σ_Ÿ = (51 - 54)/2.25 = -1.33.
From Table 3, the area below -1.33 is .0918.
Thus, Pr{51 ≤ Ÿ ≤ 60} = .9962 - .0918 = .9044.

p. 164 # 5.21 Let E1 be the event that all four fish are between 51 and 60 mm long and let E2 be the event that Ÿ is between 51 and 60 mm long. If E1 occurs, then E2 must also occur -- the mean of four numbers, each of which is between 51 and 60, must be between 51 and 60 -- but E2 can occur without E1 occurring. Thus, in the long run, E2 will happen more often than E1, which shows that Pr{E2} > Pr{E1}.

p. 164 # 5.24 (a) μ_Ÿ = μ = 41.5.
(b) σ_Ÿ = 4.7/√4 = 2.35

p. 164 # 5.25 (a) Because the sample size of 2 is small, we would expect the histogram of the sample means to be skewed to the right, as is the histogram of the data. However, the histogram of the sample means will be somewhat symmetric (more so than the histogram of the data).
(b) Because the sample size of 25 is fairly large, we would expect the histogram to have a bell shape.

p. 164 # 5.26 The sample mean is just an individual observation when n=1. Thus, the histogram of the sample means will be the same as the histogram of the data (and therefore be skewed to the right).

p. 164 # 5.27 No. The histogram shows the distribution of observations in the sample. Such a distribution would look more like the population distribution for n = 400 than for n = 100, and the population distribution is apparently rather skewed. The Central Limit Theorem applies to the sampling distribution of Ÿ, which is not what is shown in the histogram.

p. 177 # 5.50 μ = 1,200; σ = 35.
For Pr{1175 ≤ Y ≤ 1225}, z = (1225 - 1200)/35 = .71; Table 3 gives .7611.
z = (1175 - 1200)/35 = -.71; Table 3 gives .2389.
.7611 - .2389 = .5222.
For Pr{1175 ≤ Ÿ ≤ 1225}, σ_Ÿ = σ/√n = 35/√6 = 14.29.
z = (1225 - 1200)/14.29 = 1.75; Table 3 gives .9599.
z = (1175 - 1200)/14.29 = -1.75; Table 3 gives .0401.
.9599 - .0401 = .9198.
Comparison: .9198 > .5222; this shows that the mean of 6 counts is more precise, in that it is more likely to be near the correct value (1200) than is a single count.

p. 177 # 5.51 μ = 8.3; σ = 1.7.
If the total weight of 10 mice is 90 gm, then their mean weight is 90/10 = 9.0 gm. Thus, we wish to find the percentage of litters for which ÿ ≥ 9.0 gm. We are concerned with the sampling distribution of Ÿ for n = 10. From Theorem 5.1, the mean of the sampling distribution of Ÿ is μ_Ÿ = μ = 8.3, the standard deviation is σ_Ÿ = σ/√n = 1.7/√10 = .538, and the shape of the distribution is normal because the population distribution is normal (part 3a of Theorem 5.1).
For ÿ = 9.0, z = (ÿ - μ_Ÿ)/σ_Ÿ = (9.0 - 8.3)/0.538 = 1.30.
From Table 3, the area below 1.30 is .9032.
Thus, the percentage with ÿ ≥ 9.0 is 1 - .9032 = .0968, or 9.68%.

p. 177 # 5.52 Two possible factors are: (a) environmental and genetic differences between litters; (b) competition between mice in a litter.

p. 185 # 6.4 3.06/√86 = .33 mm

p. 185 # 6.5 (a) We would predict the SD of the new measurements to be about 3 mm because this is our estimate (based on Exercise 6.4) of the population SD.
(b) We would expect the SE of the new measurements to be 3/√500 ≅ .13 mm.

HW5 18 Oct p. 185 # 6.7 (a) the SE; (b) the SD; (c) the SE

p. 194 # 6.10 (a) ÿ = 31.720 mg; s = 8.729 mg; n = 5.
The standard error of the mean is SE_ÿ = s/√n = 8.729/√5 = 3.904 mg.
(b) The degrees of freedom are n - 1 = 5 - 1 = 4. The critical value is t_.05 = 2.132. The 90% confidence interval for μ is ÿ ± t_.05(s/√n) = 31.720 ± 2.132(8.729/√5 ) = (23.4, 40.0) or 23.4 < μ < 40.0 mg.

p. 194 # 6.11 (a) The degrees of freedom are n - 1 = 5 - 1 = 4. The critical value is t_.025 = 2.776.
The 95% confidence interval for μ is ÿ ± t_.025(s/√n) = 31.720 ± 2.776(8.729/√5) = (20.9, 42.6) or 20.9 < μ < 42.6 mg.
(b) We are 95% confident that the mean thymus gland weight in the population of chick embryos is between 20.9 and 42.6 mg.
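
The intervals in 6.10 and 6.11 share the form ÿ ± t(s/√n), which a short Python sketch can reproduce. The function name `t_interval` is hypothetical, and the critical values are typed in from Table 4 rather than computed, since the standard library has no Student's t quantile function.

```python
from math import sqrt

def t_interval(ybar, s, n, t_crit):
    """Confidence interval ybar +/- t_crit * s / sqrt(n)."""
    margin = t_crit * s / sqrt(n)
    return ybar - margin, ybar + margin

# Summary statistics from 6.10; critical values from Table 4 with df = 4
ci90 = t_interval(31.720, 8.729, 5, 2.132)   # t_.05
ci95 = t_interval(31.720, 8.729, 5, 2.776)   # t_.025

print(ci90, ci95)
```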

p. 194 # 6.12 (a) ÿ = 28.7; s = 4.5898; SE = 4.5898/√6 = 1.87 µg/ml.
28.7 ± (2.571)(1.9) = (23.8,33.6) or 23.8 < μ < 33.6 µg/ml.
(b) μ = mean blood serum concentration of Gentamicin (1.5 hours after injection of 10 mg/kg body weight) in healthy three-year-old female Suffolk sheep.
(c) No. The "95%" refers to the percentage (in a meta-experiment) of confidence intervals that would contain μ. Since the width of a confidence interval depends on n, the percentage of observations contained in the confidence interval also depends on n, and would be very small if n were large.

p. 194 # 6.13 (a) This statement is false. The confidence interval allows us to make an inference concerning the mean of the entire population. We know that 59.77 < ÿ < 61.09.
(b) This statement is true. (See part (a).)

p. 194 # 6.14 This statement is false. The confidence interval concerns the mean of the population. It does not tell us where individual data points lie.

p. 194 # 6.20 ÿ = 1.20; s = .14; n = 50.
The degrees of freedom are 50 - 1 = 49. From Table 4 with df = 50 (the df value closest to 49) we find that t_.05 = 1.676. The 90% confidence interval for μ is ÿ ± t_.05 (s/√n) = 1.20 ± 1.676(.14/√50 ) = (1.17,1.23) or 1.17 < μ < 1.23 mm.

p. 194 # 6.21 We are 95% confident that the mean Bayley Index of prematurely born infants who receive intravenous-feeding solutions is between 93.8 and 102.1. (Although the center of the interval is 97.95, which is less than the general population average of 100, the interval extends above 100, so we cannot be sure that μ is less than 100.)

p. 199 # 6.28 We use the inequality (Guessed SD)/√n ≤ (Desired SE).
In this case, the desired SE is 3 mg/dl and the guessed SD is 40 mg/dl. Thus, the inequality is 40/√n ≤ 3 or 40/3 ≤ √n which means that n ≥ 177.8, so a sample of n = 178 men is needed.
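
Solving the inequality in 6.28 for n gives n ≥ (Guessed SD / Desired SE)², rounded up to a whole number. A minimal Python sketch, with the hypothetical helper name `required_n`:

```python
from math import ceil

def required_n(guessed_sd, desired_se):
    """Smallest integer n with guessed_sd / sqrt(n) <= desired_se."""
    return ceil((guessed_sd / desired_se) ** 2)

n = required_n(40, 3)   # guessed SD 40 mg/dl, desired SE 3 mg/dl
print(n)
```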

p. 204 # 6.31 The fact that the mean is less than the SD casts doubt on the condition that the population is normal, for the following reason. In a normal population, about 16% of the observations fall more than one SD below the mean, whereas this sample cannot have any observations that far below the mean because ÿ - s is negative and the observed variable (serum SGOT) cannot be negative.

p. 204 # 6.32 (a) There were 36 cells, but only seven guinea pigs, so there is a hierarchical structure in the data, which suggests that the observations are not independent.
(b) The distribution has two or perhaps three modes, which may reflect the hierarchical structure in the data (that is, different modes may represent different animals or groups of animals.)

p. 204 # 6.33 The outlier (1,060) suggests that the population distribution is not normal but rather is skewed to the right or long-tailed. Because the sample size is small, Student's t method is not appropriate if the population is not normal.

p. 216 # 6.56 (a) 28.86 ± (2.576)(4.24/√1353) = (28.56,29.16) or 28.56 < μ < 29.16 days.
(b) The confidence interval is not consistent with the hypothesis because 29.5 is not in the interval.

p. 216 # 6.57 (a) The mean of all reported cycles is smaller because the women with shorter cycles had more cycles during the fixed time period, and therefore contributed more observations to the data. (b) It would not be valid because the 5412 observations are not independent -- there is a hierarchical structure in the data.

p. 216 # 6.60 The confidence interval should be (6.2,7.4). The confidence interval is an interval estimate of the population mean. The data only take on integer values, but the mean of the population need not be an integer (and probably is not).

p. 216 # 6.61 (a) .42/√84 = .04583
(c) 4.36 ± (1.984)(.04583) = (4.269,4.451) or 4.269 < μ < 4.451 mEq/l.
(d) We are 95% confident that the average serum potassium concentration in the blood of all healthy women is between 4.269 mEq/l and 4.451 mEq/l.

p. 216 # 6.62 No. The confidence interval would be much too narrow; only a minority of healthy women would fall within the confidence interval. Instead, the interval ÿ ± 2SD would be a reasonable choice for reference limits.

p. 216 # 6.63 (a) We would predict the SD of the new measurements to be about .42 mEq/l, because this is our best estimate (based on Exercise 6.61) of the population SD.
(b) .42/√200 = .030 mEq/l.

p. 216 # 6.68 (a) ÿ = 145.3; s = 12.87; SE = 12.87/√1139 = .381.
The confidence interval is 145.3 ± (1.96)(.381) or (144.55,146.05) or 144.55 < μ < 146.05 g/l.
(b) No. The obtained 95% confidence interval is a confidence interval for the population mean hemoglobin level. It does not give limits for individual data points. (c) No. See the answer to part (b).

p. 226 # 7.4 SE₁ = 6.5/√5 = 2.907; SE₂ = 8.4/√7 = 3.175.
√(2.907² + 3.175²) = 4.30.

p. 226 # 7.5 SE₁ = 6.5/√10 = 2.055; SE₂ = 8.4/√14 = 2.245.
√(2.055² + 2.245²) = 3.04

p. 226 # 7.9 √( 5.5² + 8.6² ) = 10.2.

HW6 25 Oct p. 226 # 7.8 SE₁ = .400/√9 = .133; SE₂ = .220/√6 = .090.
√(0.133² + 0.090²) = 0.16

p. 231 # 7.10 Let 1 denote males and let 2 denote females.
ÿ₁ = 45.8; SE₁ = 2.8/√489 = .127.
ÿ₂ = 40.6; SE₂ = 2.9/√469 = .134.
The standard error of the difference is SE_{(y1 - y2)} = √(.127² + .134²) = .185.
The critical value t_.025 is determined from Student's t distribution with df = 950. Using df = 1000 (the nearest value given in Table 4), we find that t(1000)_.025 = 1.962.
The 95% confidence interval is (ÿ₁ - ÿ₂) ± t_.025 SE_{(y1 - y2)} = (45.8 - 40.6) ± (1.962)(.185).
So the confidence interval is (4.84,5.56) or 4.84 < μ₁ - μ₂ < 5.56.
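
The 7.10 interval can be reproduced with a short Python sketch. The function name `diff_interval` is hypothetical, and the critical value 1.962 is typed in from Table 4. Since the code does not round the two standard errors before combining them, its SE of the difference comes out slightly below .185, but the interval agrees to two decimals.

```python
from math import sqrt

def diff_interval(y1, s1, n1, y2, s2, n2, t_crit):
    """SE of (ybar1 - ybar2) and the interval (ybar1 - ybar2) +/- t_crit * SE."""
    se1, se2 = s1 / sqrt(n1), s2 / sqrt(n2)
    se_diff = sqrt(se1 ** 2 + se2 ** 2)
    d = y1 - y2
    return se_diff, (d - t_crit * se_diff, d + t_crit * se_diff)

# Group 1 = males, group 2 = females (summary statistics from 7.10)
se_diff, (lo, hi) = diff_interval(45.8, 2.8, 489, 40.6, 2.9, 469, 1.962)
print(round(se_diff, 4), round(lo, 2), round(hi, 2))
```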

p. 231 # 7.12 (a) Let 1 denote biofeedback and let 2 denote control.
SE_{(y1 - y2)} = √( 1.34² + 1.30² ) = 1.867.
(13.8 - 4.0) ± (1.977)(1.867) (using df = 140)
(6.1,13.5) or 6.1 < μ₁ - μ₂ < 13.5 mm Hg.
(b) We are 95% confident that the population mean reduction in systolic blood pressure for those who receive training for eight weeks (μ₁) is larger than that for others (μ₂) by an amount that might be as small as 6.1 mm Hg or as large as 13.5 mm Hg.

p. 231 # 7.13 No. The confidence interval found in Exercise 7.11 is valid even if the distributions are not normal, because the sample sizes are large.

p. 244 # 7.25 (a) .085 < .10, which means that the P-value is less than α. Thus, we reject H₀.
(b) .065 > .05, which means that the P-value is greater than α. Thus, we do not reject H₀.
(c) Table 4 gives t(19)_.005 = 2.861 and t(19)_.0005 = 3.883, so .001 < P < .01. Since P < α, we reject H₀.
(d) Table 4 gives t(12)_.05 = 1.782 and t(12)_.04 = 1.912, so .08 < P < .10. Since P > α, we do not reject H₀.

p. 244 # 7.29 (a) The null and alternative hypotheses are H₀: μ₁ = μ₂; H_A: μ₁ ≠ μ₂, where 1 denotes heart disease and 2 denotes control. These hypotheses may be stated as

H₀: Mean serotonin concentration is the same in heart patients and in controls
H_A: Mean serotonin concentration is not the same in heart patients and in controls
The test statistic is t_s = (3840 - 5310)/1064 = -1.38.
From Table 4 with df = 14, we find the critical values t(.10) = 1.345 and t(.05) = 1.761. Thus, the P-value is bracketed as .10 < P < .20.
Since the P-value is greater than α (.05), H₀ is not rejected.
(b) There is insufficient evidence (.10 < P < .20) to conclude that serotonin levels are different in heart patients than in controls.
(c) SE_{(y1 - y2)} = √( 850² + 640² ) = 1064.

p. 244 # 7.31 (a) H₀: mean thymus weight is the same at 14 and 15 days (μ₁ = μ₂)
H_A: mean thymus weight is not the same at 14 and 15 days (μ₁ ≠ μ₂)
SE_{(y1 - y2)} = √(8.73²/5 + 7.19²/5 ) = 5.06
t = (31.72 - 29.22)/5.06 = .49. df = n₁ + n₂ - 2 = 8. (Formula (7.1) gives df = 7.7.) Table 4 gives t_.20 = .889; thus P > .40, so we do not reject H₀. There is insufficient evidence (P > .40) to conclude that mean thymus weight is different at 14 and 15 days.
(b) According to the P-value found in part (a), the fact that y₁ is greater than y₂ could easily be attributed to chance.
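
The standard error, test statistic, and degrees of freedom in 7.31 can be sketched in Python. This assumes that Formula (7.1) is the usual Welch-Satterthwaite approximation; the textbook formula is not reproduced here, but that assumption does return the df = 7.7 quoted above. The function name `two_sample_t` is hypothetical.

```python
from math import sqrt

def two_sample_t(y1, s1, n1, y2, s2, n2):
    """Unpooled SE, t statistic, and approximate df for two independent samples."""
    v1, v2 = s1 ** 2 / n1, s2 ** 2 / n2   # squared standard errors
    se = sqrt(v1 + v2)
    t = (y1 - y2) / se
    # Welch-Satterthwaite df (assumed to be what Formula (7.1) denotes)
    df = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))
    return se, t, df

se, t, df = two_sample_t(31.72, 8.73, 5, 29.22, 7.19, 5)
print(round(se, 2), round(t, 2), round(df, 1))
```

Under the same assumption, the 7.51 summary statistics (SDs .621 and .652, n = 8 in each group) give a df of about 13.97, matching the value quoted in that solution.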

p. 244 # 7.33 (a) H₀: Albumin and polygelatin are equally effective as plasma expanders (μ₁ = μ₂)
H_A: Albumin and polygelatin are not equally effective as plasma expanders (μ₁ ≠ μ₂)
SE_{(y1 - y2)} = √(60² + 30² ) = 67.08.
ts = (490 - 240)/67.08 = 3.73. df = n₁ + n₂ - 2 = 37 and t(40)_.0005 = 3.551.
Thus, P < .001, so we reject H₀.
(b) There is sufficient evidence (P < .001) to conclude that albumin is more effective than polygelatin as a plasma expander.

p. 244 # 7.36 (a) True. We would reject H₀ because the P-value is less than α.
(b) False. We do not reject H₀ because the P-value is greater than α.
(c) False. The P-value is the probability, under H₀, of getting a result as extreme as, or more extreme than, the result that was actually observed.

p. 244 # 7.37 (a) H₀: mean number of colonies is the same for control and soap (μ₁ = μ₂)
H_A: mean number of colonies is not the same for control and soap (μ₁ ≠ μ₂)
SE_{(y1 - y2)} = 10.21. ts = (41.8 - 32.4)/10.21 = .92. Using Table 4 with df=10 we have t_.20 = .879 and t_.10 = 1.372. Thus .20 < P < .40, so we do not reject H₀.
(b) There is insufficient evidence (.20 < P < .40) to conclude that the mean number of colonies differs for control and soap.

p. 265 # 7.49 (a) No. With df = 23, Table 4 gives t_.10 = 1.319 and t_.05 = 1.714. Thus, .05 < P < .10. Since P > α, we do not reject H₀.
(b) Yes. With df = 5, Table 4 gives t_.04 = 2.191 and t_.03 = 2.422. Thus, .03 < P< .04. Since P < α, we reject H₀.
(c) No. Because ts > 0, the data do not deviate from H₀ in the direction specified by H_A. Thus, P > .50 and we do not reject H₀.
(d) Yes. With df = 27, Table 4 gives t_.005 = 2.771 and t_.0005 = 3.690. Thus, .0005 < P < .005. Since P < α, we reject H₀.

p. 265 # 7.51 Let 1 denote experimental (to be hypnotized) and 2 denote control.
SE_{(y1 - y2)} = √( .621²/8 + .652²/8 ) = .3183.
t = (6.169 - 5.291)/.3183 = 2.76. With df = n₁ + n₂ - 2 = 14 (Formula (7.1) yields df = 13.97), Table 4 gives t_.01 = 2.624 and t_.005 = 2.977.
(a) H₀: Mean ventilation is the same in the "to be hypnotized" condition and in the "control" condition (μ₁ = μ₂)
H_A: Mean ventilation is different in the "to be hypnotized" condition than in the "control" condition (μ₁ ≠ μ₂)
H₀ is rejected. There is sufficient evidence (.01 < P < .02) to conclude that mean ventilation is higher in the "to be hypnotized" condition than in the "control" condition.
(b) H₀: Mean ventilation is the same in the "to be hypnotized" condition and in the "control" condition (μ₁ = μ₂)
H_A: Mean ventilation is higher in the "to be hypnotized" condition than in the "control" condition (μ₁ > μ₂)
H₀ is rejected. There is sufficient evidence (.005 < P < .01) to conclude that mean ventilation is higher in the "to be hypnotized" condition than in the "control" condition.
(c) The nondirectional alternative (part (a)) is more appropriate. According to the narrative, the researchers formulated the directional alternative in part (b) after they had seen the data. Thus, it would not be legitimate for them (or us) to use a directional alternative.

p. 265 # 7.54 (a) The null and alternative hypotheses are H₀: μ₁ = μ₂; and H_A: μ₁ > μ₂, where 1 denotes drug and 2 denotes placebo. These hypotheses may be stated as

H₀: The drug is not effective
H_A: The drug is effective
To check the directionality of the data, we note that ÿ₁ > ÿ₂. Thus, the data do deviate from H₀ in the direction (μ₁ > μ₂) specified by H_A. We proceed to calculate the test statistic.
The standard error of the difference is SE_{(y1 - y2)} = √( 12.05²/25 + 13.78²/25 ) = 3.66.
The test statistic is t = (31.96 - 25.32)/3.66 = 1.81.
From Table 4 with df = 25 + 25 -2 = 48 (Formula (7.1) yields df = 47.2), we find the critical values t_.04 = 1.787 and t_.03 = 1.924.
Thus, the P-value is bracketed as .03 < P < .04. Since the P-value is less than α (.05), we reject H₀.
There is sufficient evidence (.03 < P < .04) to conclude that the drug is effective at increasing pain relief.
(b) The only change in the calculations from part (a) would be that the one-tailed area would be doubled if the alternative were nondirectional. Thus, the p-value would be between .06 and .08 and at α = .05 we would not reject H₀.

p. 272 # 7.58 The lack of a statistically significant difference in therapeutic responses does not show that the two medications are equally effective. (Such evidence could be obtained from either a confidence interval or an analysis of the power of the test.)

p. 272 # 7.62 The mean difference in serum concentration of uric acid is μ₁ - μ₂, where 1 denotes men and 2 denotes women.
We construct a 95% confidence interval for μ₁ - μ₂.
SE_{(ÿ₁ - ÿ₂)} = √( .058²/530 + .051²/420 ) = .00354.
The critical value t_.025 is found from Student's t distribution with df = n₁ + n₂ - 2 = 948. (Formula (7.1) gives df = 937.8.)
From Table 4, we find t(1000)_.025 = 1.962. The 95% confidence interval is (.354 - .263) ± (1.962)(.00354) = (.0841,.0979) or .0841 < μ₁ - μ₂ < .0979 mmol/l.
All values in the confidence interval are greater than .08 mmol/l. Therefore, according to the confidence interval the data indicate that the difference is "clinically important."

p. 272 # 7.63 SE_{(ÿ₁ - ÿ₂)} = √( .058²/53 + .051²/42 ) = .0112.
Confidence interval is (.354 - .263) ± (1.984)(.0112) (using df = 100), or (.069,.113), or .069 < μ₁ - μ₂ < .113 mmol/l.
The difference could be greater than or less than .08 mmol/l, so the data do not indicate whether the difference is "clinically important."

p. 305 # 7.104 (a) False. The confidence interval includes zero, so we are not confident that μ₁ and μ₂ are different.
(b) True. This is what a confidence interval tells us.
(c) False. We know that the difference of sample means is exactly 6.9.
(d) False. The confidence interval is used to make an inference about the difference between μ₁ and μ₂; it does not tell us about individual data points (such as the length of hospitalization for a nitric oxide infant).
p. 305 # 7.105 False. The 95% confidence interval includes zero, which means that the P-value for a nondirectional test is greater than .05. Thus, we would not reject H0 at the .05 significance level.

p. 305 # 7.109 (a) Let 1 denote andro and let 2 denote control.

H₀: Andro has no effect (μ₁ = μ₂)
H_A: Andro has an effect (μ₁ ≠ μ₂)
SE = 5.94. t = (14.4 - 20.0)/5.94 = -.94. With df=16, Table 4 gives t_0.20 = 0.865 and t_0.10 = 1.337, so .20 < p < .40. (Using a computer, we get p = .359.) Thus, we fail to reject H₀. There is insufficient evidence (.20 < p < .40) to conclude that andro affects lat pulldown strength.
(b)

H₀: Andro has no effect (μ₁ = μ₂)
H_A: Andro has a positive effect (μ₁ > μ₂)
The one-tailed p-value is between .10 and .20; this is still larger than the level of significance. Thus, we still fail to reject H₀. There is insufficient evidence (.10 < p < .20) to conclude that andro increases lat pulldown strength.

HW7 1 Nov p. 296 # 7.79 (a) The null and alternative hypotheses are

H₀: Toluene has no effect on dopamine in rat striatum
H_A: Toluene has some effect on dopamine in rat striatum.
Let 1 denote toluene and let 2 denote control. For the K₁ count, we note that there are four Y2's less than the first Y1; there are five Y2's less than the second Y1; there are five Y2's less than the third Y1; and there are six Y2's less than the fourth, fifth, and sixth Y1.
Thus, K₁ = 4 + 5 + 5 + 6 + 6 + 6 = 32. Similarly, K₂ = 0 + 0 + 0 + 0 + 1 + 3 = 4.
To check the counts, we verify that K₁ + K₂ = 32 + 4 = 36 = (6)(6) = (n1)(n2). In the non-directional case, the Wilcoxon-Mann-Whitney test statistic is the larger of the two counts K₁ and K₂; thus U_s = 32. Looking in Table 6 under n = 6 and n' = 6, we find that for a nondirectional alternative, the .05 entry is 31 and the .02 entry is 33. Thus, the P-value is bracketed as .02 < P < .05. At significance level α = .05, we reject H0, since P < .05. We note that K1 is larger than K2, which indicates a tendency for the Y1's to be larger than the Y2's. Thus, there is sufficient evidence (.02 < P < .05) to conclude that toluene increases dopamine in rat striatum.
(b) When conducting a directional test, we must check directionality. In this case, we note that K1 is larger than K2, which indicates a tendency for the Y1's to be larger than the Y2's, which is what the directional alternative predicts. We proceed as in part (a), except that we use the "directional" tail probabilities. Thus, .01 < P < .025. We reject H0: there is sufficient evidence (.01 < P < .025) to conclude that toluene increases dopamine in rat striatum.
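
The K₁ and K₂ counts for the Wilcoxon-Mann-Whitney test can be sketched in Python. The function name `wmw_counts` and the two samples are hypothetical (they are not the toluene data, which is not reproduced in this key). Ties contribute 0.5 to each count, which is the convention implied by the half-counts in the 7.82(d) solution.

```python
def wmw_counts(y1, y2):
    """Return (K1, K2): K1 counts pairs with y1 > y2, K2 counts pairs with y1 < y2.
    A tied pair adds 0.5 to each count."""
    k1 = k2 = 0.0
    for a in y1:
        for b in y2:
            if a > b:
                k1 += 1
            elif a < b:
                k2 += 1
            else:
                k1 += 0.5
                k2 += 0.5
    return k1, k2

# Hypothetical data for illustration only
sample1 = [3.1, 4.0, 4.0, 5.2]
sample2 = [2.5, 3.1, 3.6]

k1, k2 = wmw_counts(sample1, sample2)
u_s = max(k1, k2)   # nondirectional test statistic: the larger count

# Check used in the solutions: K1 + K2 must equal n1 * n2
assert k1 + k2 == len(sample1) * len(sample2)
print(k1, k2, u_s)
```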

p. 296 # 7.82 (a)
H₀: There is no sex difference in preening behavior
H_A: There is a sex difference in preening behavior
For n = n' = 15, the largest critical value is 189, which is under the .001 heading for a nondirectional alternative. It follows that P < .001, so H₀ is rejected. There is sufficient evidence (P < .001) to conclude that females tend to preen longer than males.
(b)
H₀: There is no sex difference in preening behavior (μ₁ = μ₂)
H_A: There is a sex difference in preening behavior (μ₁ ≠ μ₂)
ts = (2.127 - 4.093)/.7933 = -2.48. With df = n₁ + n₂ - 2 = 28, Table 4 gives t_.01 = 2.467 and t_.005 = 2.763, so that .01 < P < .02. Formula (7.1) yields df = 15.1 and the conservative approach of df = min{n₁ - 1, n₂ - 1} gives df = 14. For either of these df values we get .02 < P < .04. In any case, H₀ is not rejected, since P > .01. There is insufficient evidence to conclude that there is a sex difference in preening behavior.
(c) Both tests require independent, random samples. The condition required for the t test but not for the Wilcoxon-Mann-Whitney test is that the population distributions are normal. The frequency distribution for the females is highly skewed, due to the two large observations of 10.7 and 11.7. This casts doubt on the normality condition.
(d) K₁ = 0 + 0 + 0 + 0 + 0 + .5 + 1 + 1.5 + 1.5 + 2 + 2 + 3.5 + 5 + 8.5 + 10 = 35.5
K₂ = 5.5 + 8 + 3(11.5) + 3(13) + 13.5 + 14 + 5(15) = 189.5
where 1 denotes male and 2 denotes female.
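
As a quick arithmetic check on part (d), the listed terms can be summed directly. This is only a sketch; the variable names are illustrative, and the terms are copied from the two sums above.

```python
# Terms copied from the sums for K1 (males) and K2 (females) in part (d).
k1_terms = [0, 0, 0, 0, 0, 0.5, 1, 1.5, 1.5, 2, 2, 3.5, 5, 8.5, 10]
k2_terms = [5.5, 8] + [11.5] * 3 + [13] * 3 + [13.5, 14] + [15] * 5

n1 = len(k1_terms)  # one term per male observation
n2 = len(k2_terms)  # one term per female observation

K1 = sum(k1_terms)
K2 = sum(k2_terms)
Us = max(K1, K2)

# Consistency check: the two counts must total n1 * n2.
total_ok = (K1 + K2 == n1 * n2)
```

Here K1 + K2 = 35.5 + 189.5 = 225 = (15)(15), so the counts are consistent, and Us = 189.5.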

p. 296 # 7.84 Let 1 denote joggers and let 2 denote fitness program entrants.

H₀: There is no difference in resting blood concentration of HBE between joggers and fitness program entrants
H_A: There is a difference in resting blood concentration of HBE between joggers and fitness program entrants
K1 = 93.5, K2 = 71.5, Us = 93.5. With n = 15 and n' = 11, 108 is under the .20 heading for a nondirectional alternative and is the smallest entry listed. Thus, P > .20 and H₀ is not rejected. There is insufficient evidence (P > .20) to conclude that there is a difference in resting blood concentration of HBE between joggers and fitness program entrants.

p. 300 # 7.86 The null and alternative hypotheses are

H0: Mean platelet calcium is the same in people with high blood pressure as in people with normal blood pressure (mu1 = mu2)
HA: Mean platelet calcium is different in people with high blood pressure than in people with normal blood pressure (mu1 ≠ mu2).
The standard error of the difference is SE = 5.399.
The test statistic is t = (168.2 - 107.9)/5.399 = 11.2.
From Table 4 with df = 45 + 38 - 2 = 81 ≅ 80, we find the critical value t(.0005) = 3.416. The tail area is doubled for the nondirectional test. Thus, the P-value is bracketed as P < .001.
(Formula (7.1) yields df = 67.5, but the P-value is still bracketed as P < .001.)
Since the P-value is less than alpha (.01), we reject H0. There is sufficient evidence (P < .001) to conclude that mean platelet calcium is higher in people with high blood pressure than in people with normal blood pressure.

p. 300 # 7.87 The critical value t(.025) is found from Student's t distribution with df given by Formula (7.1) as df = 67.5 ≅ 70.
Table 4 gives t(70)(.025) = 1.994. The 95% confidence interval is (49.5, 71.1).
Alternatively, we could use df = 45 + 38 - 2 = 81 ≅ 80, in which case the critical value is t(80)(.025) = 1.990. This gives an interval of (49.6, 71.0).
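
The two intervals can be reproduced from the numbers already given in 7.86 and 7.87. The sketch below assumes the usual form (difference in means) ± t × SE; the function name ci_for_difference is illustrative, not from the textbook.

```python
def ci_for_difference(mean1, mean2, se, t_crit):
    """Confidence interval for mean1 - mean2: difference +/- t_crit * SE."""
    diff = mean1 - mean2
    margin = t_crit * se
    return diff - margin, diff + margin


# Sample means and SE are taken from the solution to 7.86.
ci_df70 = ci_for_difference(168.2, 107.9, 5.399, 1.994)  # df = 67.5, rounded to 70
ci_df80 = ci_for_difference(168.2, 107.9, 5.399, 1.990)  # df = 81, rounded to 80

rounded_df70 = tuple(round(x, 1) for x in ci_df70)  # (49.5, 71.1)
rounded_df80 = tuple(round(x, 1) for x in ci_df80)  # (49.6, 71.0)
```

The choice of df changes the endpoints only in the first decimal place, which is why either answer is acceptable.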

p. 300 # 7.88 No; the t test is valid because the sample sizes are rather large.

p. 300 # 7.98 (a) Two of the patients contributed two observations each to the data set. Thus, there is hierarchical structure, so the t test is not appropriate.
(b) No. The Wilcoxon-Mann-Whitney test, like the t test, requires that the observations within a sample be independent of each other, so the Wilcoxon-Mann-Whitney test is not appropriate.

p. 300 # 7.103 (a) Let 1 denote amphetamine and let 2 denote control.

H₀: Amphetamine is not related to water consumption (μ₁ = μ₂)
H_A: Amphetamine is associated with decreased water consumption (μ₁ < μ₂)
SE of the mean difference is 18.82.
t = (129.375 - 156)/18.82 = -1.415.
With df = n1 + n2 - 2 = 6 (Formula (7.1) yields df = 5.9), Table 4 gives t(.20) = 0.906 and t(.10) = 1.440, so .10 < P < .20.
(Using a computer, we get P = .104.) Thus, we fail to reject H0.
There is insufficient evidence (.10 < P < .20) to conclude that amphetamine is associated with decreased water consumption.
(b)
H0: Amphetamine is not related to water consumption
HA: Amphetamine is associated with decreased water consumption
K1 = 4, K2 = 12, Us = 12; the data deviate from H0 in the direction specified by HA. With n = 4, n' = 4, and a directional alternative, the smallest entry is 13, under the .10 heading. Thus, P > .10 and we fail to reject H0. There is insufficient evidence (P > .10) to conclude that amphetamine is associated with decreased water consumption.

p. 300 # 7.106 (a) False. The P-value for a test is the probability of getting data at least as extreme as those obtained, if H₀ is true; it is not the probability that the null hypothesis is true.
(b) True. The P-value for a test is the probability of getting data at least as extreme as those obtained, if H₀ is true, which is what this statement says.
(c) False. The probability that H₀ is rejected depends on the power of the test, which is not known. (If H₀ is true -- and we don't know if it is true or not -- and a new study is done that uses α = .04, then there is a 4% probability that H₀ will be rejected.)

p. 356 # 9.3 Let 1 denote control and let 2 denote progesterone.

H₀: Progesterone has no effect on cAMP (μ₁ = μ₂)
H_A: Progesterone has some effect on cAMP (μ₁ ≠ μ₂)
The standard error is SE_{(ȳ₁ - ȳ₂)} = SE_d̄ = s_d/√n_d = .40/√4 = .20.
The test statistic is ts = (ȳ₁ - ȳ₂) / SE_{(ȳ₁ - ȳ₂)} = d̄ / SE_d̄ = .68/.20 = 3.4.
To bracket the P-value, we consult Table 4 with df = 4 - 1 = 3. Table 4 gives t_.025 = 3.182 and t_.02 = 3.482. Thus, the P-value is bracketed as .04 < P < .05.
At significance level α = .10, we reject H0 if P < .10. Since .04 < P < .05, we reject H₀. There is sufficient evidence (.04 < P < .05) to conclude that progesterone decreases cAMP under these conditions.
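
The paired-t arithmetic above can be checked in a few lines. This is a sketch using only the summary values quoted in the solution (mean difference .68, s_d = .40, n_d = 4) and the two Table 4 entries cited for df = 3; the variable names are illustrative.

```python
import math

d_bar = 0.68  # mean of the paired differences
s_d = 0.40    # standard deviation of the differences
n_d = 4       # number of pairs

se_d = s_d / math.sqrt(n_d)  # standard error of the mean difference
ts = d_bar / se_d            # paired t statistic, df = n_d - 1

# Table 4 entries quoted in the solution for df = 3 (upper-tail areas).
t_025 = 3.182
t_02 = 3.482

# ts lies between the two entries, so the one-tailed area is between
# .02 and .025 and the two-tailed P-value is between .04 and .05.
between_entries = t_025 < ts < t_02
```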

p. 356 # 9.6 The data provide fairly strong evidence (p = .03) that desipramine is more effective than clomipramine in reducing the compulsion to pull one's hair.

p. 356 # 9.9 There is no single correct answer. Any data set with Y1 and Y2 varying, but d not varying, is correct; for example:
Y₁  Y₂  d
5   3   2
6   4   2
3   1   2
4   2   2
5   3   2
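
A short check confirms that the example data set has the required property: Y1 and Y2 each vary, but the differences do not. The sketch uses Python's statistics module; the variable names are illustrative.

```python
from statistics import stdev

# Example data set from the solution.
y1 = [5, 6, 3, 4, 5]
y2 = [3, 4, 1, 2, 3]

# Paired differences d = Y1 - Y2.
d = [a - b for a, b in zip(y1, y2)]

sd_y1 = stdev(y1)  # positive: Y1 varies
sd_y2 = stdev(y2)  # positive: Y2 varies
sd_d = stdev(d)    # zero: every difference equals 2
```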

p. 370 # 9.16 Let p denote the probability that oral conjugated estrogen will decrease PAI-1 level.

H0: Oral conjugated estrogen has no effect on PAI-1 level (p = .5)
HA: Oral conjugated estrogen has an effect on PAI-1 level (p ≠ .5)
N+ = 8, N- = 22, Bs = 22. With nd = 30, 22 falls under the .02 heading (for a nondirectional alternative) in Table 7. Thus, .01 < P < .02 and we reject H0. There is sufficient evidence (.01 < P < .02) to conclude that oral conjugated estrogen tends to decrease PAI-1 level.

p. 370 # 9.19 Let p denote the probability that a patient will have fewer minor seizures with valproate than with placebo.

H0: Valproate is not effective against minor seizures (p = .5)
HA: Valproate is effective against minor seizures (p > .5)
N+ = 14, N- = 5, Bs = 14; the data deviate from H0 in the direction specified by HA. Eliminating the pair with d = 0, we refer to Table 7 with nd = 19. The only entry of 14 falls under the .05 heading (for a directional alternative). Thus, .025 < P < .05 and we reject H0. There is sufficient evidence (.025 < P < .05) to conclude that valproate is effective against minor seizures.
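
The Table 7 brackets in 9.16 and 9.19 can be compared with exact binomial tail probabilities. The sketch below is one way to do this, assuming the sign test statistic Bs follows a binomial distribution with p = .5 under H0; the function name sign_test_p is illustrative, and for a directional test the caller is assumed to have already checked that the data deviate in the direction of HA.

```python
from math import comb


def sign_test_p(n_plus, n_minus, directional=False):
    """Exact sign-test P-value from the binomial(n_d, .5) distribution.

    Pairs with d = 0 are assumed to have been dropped already.
    For directional=True the data must deviate in the direction of HA.
    """
    n_d = n_plus + n_minus
    b_s = max(n_plus, n_minus)
    # Upper-tail probability Pr{B >= b_s} under H0.
    tail = sum(comb(n_d, k) for k in range(b_s, n_d + 1)) / 2 ** n_d
    return tail if directional else min(1.0, 2 * tail)


p_9_16 = sign_test_p(8, 22)                     # nondirectional, n_d = 30
p_9_19 = sign_test_p(14, 5, directional=True)   # directional, n_d = 19
```

Both exact values fall inside the brackets read from Table 7 (.01 < P < .02 and .025 < P < .05, respectively).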

p. 385 # 9.38 (a) SE = 1.2/√15 = 0.3098. df=14. Confidence interval is -1 ± (2.145)(0.3098) = (-1.66, -0.34)
(b) SE = √(1.66²/15 + 2.37²/15) = 0.7471. Confidence interval is -1 ± (2.145)(0.7471) = (-2.60, 0.60). This interval is much wider than the one constructed in (a).
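
The contrast between the paired and unpaired intervals can be reproduced numerically. This sketch uses the summary values from parts (a) and (b) and the same multiplier, 2.145, for both intervals, as the solution does; the variable names are illustrative.

```python
import math

n = 15
mean_diff = -1.0
t_crit = 2.145  # multiplier used in both parts of the solution

# (a) Paired analysis: SE is based on the SD of the differences.
se_paired = 1.2 / math.sqrt(n)
ci_paired = (mean_diff - t_crit * se_paired, mean_diff + t_crit * se_paired)

# (b) Unpaired analysis: SE combines the two sample SDs.
se_unpaired = math.sqrt(1.66 ** 2 / n + 2.37 ** 2 / n)
ci_unpaired = (mean_diff - t_crit * se_unpaired, mean_diff + t_crit * se_unpaired)

# Ratio of interval widths (unpaired to paired).
width_ratio = se_unpaired / se_paired
```

The unpaired interval is about 2.4 times as wide and includes 0, while the paired interval does not.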

p. 385 # 9.39
H0: The before and after means are the same (mu1 = mu2)
HA: The before and after means are different (mu1 ≠ mu2)
SE = 1.2/√15 = .3098. t = -1/.3098 = -3.23. With df = 14, Table 4 gives t(.005) = 2.977 and t(.0005) = 4.140; thus, .001 < P < .01. We reject H0; there is strong evidence (.001 < P < .01) of a before and after difference.
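
Bracketing a two-tailed P-value from a few table entries follows a fixed rule, sketched below. The function name bracket_two_tailed is hypothetical, and the only critical values used are the two Table 4 entries quoted above for df = 14.

```python
import math


def bracket_two_tailed(t_stat, critical_values):
    """Bracket a two-tailed P-value using tabled critical values.

    critical_values maps an upper-tail area to its critical t value.
    Returns (lower, upper); a bound is None if the table cannot supply it.
    """
    t_abs = abs(t_stat)
    # Areas whose critical value is reached: one-tailed P is below these.
    reached = [a for a, c in critical_values.items() if t_abs >= c]
    # Areas whose critical value is not reached: one-tailed P is above these.
    not_reached = [a for a, c in critical_values.items() if t_abs < c]
    upper = 2 * min(reached) if reached else None
    lower = 2 * max(not_reached) if not_reached else None
    return lower, upper


se = 1.2 / math.sqrt(15)
ts = -1 / se  # about -3.23

table_df14 = {0.005: 2.977, 0.0005: 4.140}  # entries quoted in the solution
lower, upper = bracket_two_tailed(ts, table_df14)
```

With these inputs the function returns bounds of .001 and .01, matching the bracket .001 < P < .01.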

p. 385 # 9.40a Let p denote the probability that a before count is higher than the corresponding after count.

H0: p = .5
HA: p ≠ .5
N+ = 2, N- = 10, Bs = 10. Looking under nd = 12 in Table 7, we see that .02 < P < .05. There is sufficient evidence (.02 < P < .05) to conclude that the after count tends to be higher than the before count.

p. 385 # 9.41 The scatterplot shows a positive relationship between before and after counts. The pairing removes the variability between cats from the analysis and is, therefore, effective.

p. 385 # 9.49 The null and alternative hypotheses are

H0: Caffeine has no effect on RER (mu1 = mu2)
HA: Caffeine has some effect on RER (mu1 ≠ mu2)
SE = 5.59/√9 = 1.86. t = 7.33/1.86 = 3.94. To bracket the P-value, we consult Table 4 with df = 9 - 1 = 8. Table 4 gives t(.005) = 3.355 and t(.0005) = 5.041. Thus, the P-value for the nondirectional test is bracketed as .001 < P < .01. At significance level alpha = .05, we reject H0. There is sufficient evidence (.001 < P < .01) to conclude that caffeine tends to decrease RER under these conditions.

p. 385 # 9.51 Let p denote the probability that RER for a subject is higher after taking placebo than after taking caffeine.

H0: RER is not affected by caffeine (p = .5)
HA: RER is affected by caffeine (p ≠ .5)
N+ = 9, N- = 0, Bs = 9. Looking under nd = 9 in Table 7, we see that the rightmost column with a critical value less than or equal to 9 is the column headed .01 (for a nondirectional alternative), and the next column is headed .002. Therefore, .002 < P < .01. There is sufficient evidence (.002 < P < .01) to conclude that caffeine tends to decrease RER under these conditions.

p. 385 # 9.55 (a) By using matched pairs we eliminate the variability that is associated with the variables used to create the pairs (age, sex, etc.). This provides for greater precision and more power in the test.
(b) It may be that the pairing variables (age, sex, etc.) are unrelated to blood pressure. If this is the case, then the pairing accomplishes nothing, but it reduces the number of degrees of freedom, and therefore the power, of the test.

p. 385 # 9.57 A normal probability plot of the data shows that the normality condition is not met. However, a sign test can be conducted. Let p denote the probability that urinary protein excretion will go down after plasmapheresis.

H₀: Plasmapheresis does not affect urinary protein excretion (p = .5)
H_A: Plasmapheresis affects urinary protein excretion (p ≠ .5)
N+ = 6, N- = 0, Bs = 6. From Table 7, .02 < P < .05 (for a two-sided test).
The exact P-value is (2)(.5⁶) = .03125. Thus, there is evidence (P = .03125) to conclude that urinary protein excretion tends to go down after plasmapheresis.
Note: Another approach would be to transform the data and then conduct a t test in the transformed scale. For example, taking the reciprocal of each difference yields a fairly symmetric distribution; a t test then gives t = 5.4 and p = .003.
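
When every nonzero difference has the same sign, as in 9.57 and 9.51, the exact two-sided sign-test P-value reduces to 2(.5)^nd, because only the single most extreme outcome in each tail is at least as extreme as the data. A minimal sketch follows; the function name is illustrative.

```python
def all_same_sign_p(n_d):
    """Two-sided sign-test P-value when all n_d differences share one sign."""
    return 2 * 0.5 ** n_d


p_9_57 = all_same_sign_p(6)  # N+ = 6, N- = 0
p_9_51 = all_same_sign_p(9)  # N+ = 9, N- = 0
```

For n_d = 6 this gives .03125, and for n_d = 9 it gives about .0039, which lies inside the bracket .002 < P < .01 found in 9.51.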

HW8 15 Nov p. --- # 10.---

p. --- # 10.---