Free Essay

Statistic Solution 1

In:

Submitted By chaulenhu
Words 4802
Pages 20
[pic]Chapter 1
Introduction and descriptive Statistics

1-1. 1. quantitative/ratio 2. qualitative/nominal 3. quantitative/ratio 4. qualitative/nominal 5. quantitative/ratio 6. quantitative/interval 7. quantitative/ratio 8. quantitative/ratio 9. quantitative/ratio 10. quantitative/ratio 11. quantitative/ordinal

1-2. Data are based on numeric measurements of some variable, either from a data set comprising an entire population of interest, or else obtained from only a sample (subset) of the full population. Instead of doing the measurements ourselves, we may sometimes obtain data from previous results in published form.

1-3. The weakest is the Nominal Scale, in which categories of data are grouped by qualitative differences and assigned numbers simply as labels, not usable in numeric comparisons. Next in strength is the Ordinal Scale: data are ordered (ranked) according to relative size or quality, but the numbers themselves don't imply specific numeric relationships. Stronger than this is the Interval Scale: the ordered data points have meaningful distances between any two of them, measured in units. Finally is the Ratio Scale, which is like an Interval Scale but where the ratio of any two specific data values is also measured in units and has meaning in comparing values.

1-4. Fund: Qualitative Style: Qualitative US/Foreign: Qualitative 10 yr Return: Quantitative Expense Ratio: Quantitative

1-5. Ordinal.

1-6. A qualitative variable describes different categories or qualities of the members of a data set, which have no numeric relationships to each other, even when the categories happen to be coded as numbers for convenience. A quantitative variable gives numerically meaningful information, in terms of ranking, differences, or ratios between individual values.

1-7. The people from one particular neighborhood constitute a non-random sample (drawn from the larger town population). The group of 100 people would be a random sample.

1-8. A sample is a subset of the full population of interest, from which statistical inferences are drawn about the population, which is usually too large to permit the variables to be measured for all the members.

1-9. A random sample is a sample drawn from a population in a way that is not a priori biased with respect to the kinds of variables being measured. It attempts to give a representative cross-section of the population.

1-10. Nationality: qualitative. Length of intended stay: quantitative.

1-11. Ordinal. The colors are ranked, but no units of difference between any two of them are defined.

1-12. Income: quantitative, ratio Number of dependents: quantitative, ratio Filing singly/jointly: qualitative, nominal Itemized or not: qualitative, nominal Local taxes: quantitative, ratio

1-13. Lower quartile = 25th percentile = data point in position (n + 1)(25/100) = 34(25/100) = position 8.5. (Here n = 33.) Let us order our observations: 109, 110, 114, 116, 118, 119, 120, 121, 121, 123, 123, 125, 125, 127, 128, 128, 128, 128, 129, 129, 130, 131, 132, 132, 133, 134, 134, 134, 134, 136, 136, 136, 136. Lower quartile = 121 Middle quartile is in position: 34(50/100) = 17. Point is 128. Upper quartile is in position: 34(75/100) = 25.5. Point is 133.5 10th percentile is in position: 34(10/100) = 3.4. Point is 114.8. 15th percentile is in position: 34(15/100) = 5.1. Point is 118.1. 65th percentile is in position: 34(65/100) = 22.1. Point is 131.1. IQR = 133.5 - 121 = 12.5.

| | | | | | |
|Percentile and Percentile Rank Calculations | | |
| | |x-th Percentile| | |Percentile |
| | | | | |rank of y |
| |x | | |y | |
| |10 |116.4 | |116.4 |10 |
| |15 |118.8 | |118.8 |15 |
| |65 |130.8 | |130.8 |65 |
| | | | | | |
|Quartiles | | | | | |
| |1st Quartile |121 | | | |
| |Median |128 |IQR |12 | |
| |3rd Quartile |133 | | | |

1-14. First, order the data: -1.2, 3.9, 8.3, 9, 9.5, 10, 11, 11.6, 12.5, 13, 14.8, 15.5, 16.2, 16.7, 18 The median, or 50th percentile, is the point in position 16(50/100) = 8. The point is 11.6. First quartile is in position 16(25/100) = 4. Point is 9. Third quartile is in position 16(75/100) = 12. Point is 15.5. 55th percentile is in position 16(55/100) = 8.8. Point is 12.32. 85th percentile is in position 16(85/100) = 13.6. Point is 16.5.

1-15. Order the data: 38, 41, 44, 45, 45, 52, 54, 56, 60, 64, 69, 71, 76, 77, 78, 79, 80, 81, 87, 88, 90, 98 Median is in position 23(50/100) = 11.5. Point is 70. 20th percentile is in position 23(20/100) = 4.6. Point is 45. 30th percentile is in position 23(30/100) = 6.9. Point is 53.8. 60th percentile is in position 23(60/100) = 13.8. Point is 76.8. 90th percentile is in position 23(90/100) = 20.7. Point is 89.4.

|Percentile and Percentile Rank Calculations | |
| | |x-th Percentile | | |
| |x | | |y |
| |20 |46.4 | |46.4 |
| |30 |54.6 | |54.6 |
| |60 |76.6 | |76.6 |
| | | | | |
|Quartiles | | | | |
| |1st Quartile |52.5 | | |
| |Median |70 |IQR |27.25 |
| |3rd Quartile |79.75 | | |
| | | | | |

1-16. Order the data: 1, 1, 1, 2, 2, 2, 3, 3, 4, 4, 5, 5, 6, 6, 7. Lower quartile is the 25th percentile, in position 16(25/100) = 4. Point is 2. The median is in position 16(50/100) = 8. The point is 3. Upper quartile is in position 16(75/100) = 12. Point is 5. IQR = 5 - 2 = 3. 60th percentile is in position 16(60/100) = 9.6. Point is 4.

|Percentile and Percentile Rank Calculations | |
| | |x-th Percentile | | |
| |x | | |y |
| |60 |4 | |4.0 |
| | |1 | |0 |
| | |1 | |0 |
| | | | | |
|Quartiles | | | | |
| |1st Quartile |2 | | |
| |Median |3 |IQR |3 |
| |3rd Quartile |5 | | |

1-17. The data are already ordered; there are 16 data points. The median is the point in position 17(50/100) = 8.5 It is 51. Lower quartile is in position 17(25/100) = 4.25. It is 30.5. Upper quartile is in position 17(75/100) = 12.75. It is 194.25. IQR = 194.25 - 30.5 = 163.75. 45th percentile is in position 17(45/100) = 7.65. Point is 42.2.
| | | | | |
|Percentile and Percentile Rank Calculations | |
| | |x-th Percentile | | |
| |x | | |y |
| |45 |43 | |43.0 |
| | | | |0 |
| | | | |0 |
| | | | | |
|Quartiles | | | | |
| |1st Quartile |31.5 | | |
| |Median |51 |IQR |131.25 |
| |3rd Quartile |162.75 | | |

1-18. The mean is a central point that summarizes all the information in the data. It is sensitive to extreme observations. The median is a point "in the middle" of the data set and does not contain all the information in the set. It is resistant to extreme observations. The mode is a value that occurs most frequently.

19. Mean, median, mode(s) of the observations in Problem 1-13: [pic][pic] Median = 128 Modes = 128, 134, 136 (all have 4 points)

| | | | | | | | |
| |Measures of Central tendency | | | | |
| | | | | | | | |
| |Mean |126.63636 |Median |128 |Mode |128 | |

1-20. For the data of Problem 1-14: Mean = 11.2533 Median = 11.6 Mode: none

1-21. For the data of Problem 1-15: Mean = 66.955 Median = 70 Mode = 45

| | | | | | | | |
| |Measures of Central tendency | | | | |
| | | | | | | | |
| |Mean |66.954545 |Median |70 |Mode |45 | |

1-22. For the data of Problem 1-16: Mean = 3.466 Median = 3 Mode = 1 and 2

| | | | | | | |
|Measures of Central tendency | | | | |
| | | | | | | |
|Mean |3.4666667 |Median |3 |Mode |1 | |

1-23. For the data of Problem 1-17: Mean = 199.875 Median = 51 Mode: none

|Measures of Central tendency | | | | |
| | | | | | | |
|Mean |199.875 |Median |51 |Mode |#N/A | |

1-24. For the data of Example 1-1: Mean = 163,260 Median = 166,800 Mode: none

1-25. (Using the template: “Basic Statistics.xls”, enter the data in column K.) Basic Statistics from Raw Data

| |Measures of Central tendency | | | | |
| | | | | | | | |
| |Mean |21.75 |Median |13 |Mode |12 | |

1-26. (Using the template: “Basic Statistics.xls”) [pic] Mean = .0514 Median = 0.3 Outliers: none

1-27. Mean = 592.93 Median = 566 Std Dev = 117.03 QL = 546 QU = 618.75 Outliers: 940 Suspected Outlier: 399

1-28. Measures of variability tell us about the spread of our observations.

1-29. The most important measures of variability are the variance and its square root- the standard deviation. Both reflect all the information in the data set.

1-30. For a sample, we divide the sum of squared deviations from the mean by n – 1, rather than by n.

1-31. For the data of Problem 1-13, assumed a sample: Range = 136 – 109 = 27 Variance = 57.74 Standard deviation = 7.5986

| |If the data is of a |
| |Sample |Population |
|Variance |57.7386364 |55.9889807 |
|St. Dev. |7.59859437 |7.48257848 |

1-32. For the data of Problem 1-14: Range = 18 – (–1.2) = 19.2 Variance = 25.90 Standard deviation = 5.0896

1-33. For the data of Problem 1-15: Range = 98 – 38 = 60 Variance = 321.38 Standard deviation = 17.927

| |If the data is of a |
| |Sample |Population |
|Variance |321.378788 |306.770661 |
|St. Dev. |17.9270407 |17.5148697 |

1-34. For the data of Problem 1-16: Range = 7 – 1 = 6 Variance = 3.98 Standard deviation = 1.995

| |If the data is of a |
| |Sample |Population |
|Variance |3.98095238 |3.71555556 |
|St. Dev. |1.99523241 |1.92757764 |

1-35. For the data of Problem 1-17: Range = 1,209 – 23 = 1,186 Variance = 110,287.45 Standard deviation = 332.096

| |If the data is of a |
| |Sample |Population |
|Variance |110287.45 |103394.484 |
|St. Dev. |332.095543 |321.550127 |

1-36. [pic]; this captures 31/33 of the data points, so Chebyshev's theorem holds. The data set is not mound-shaped, so the empirical rule does not apply.

1-37. [pic]; this captures 14/15 of the data points, so Chebyshev's theorem holds. The data set is not mound-shaped, so the empirical rule does not apply

1-38. [pic]; this captures all the data points, so Chebyshev's theorem holds. The data set is not mound-shaped, so the empirical rule does not apply.

1-39. [pic]; this captures all the data points, so Chebyshev's theorem holds. The data set is not mound-shaped, so the empirical rule does not apply.

1-40. [pic]; this captures 15/16 of the data points, so Chebyshev's theorem holds. The data set is not mound-shaped, so the empirical rule does not apply.

1-41.

1-42.

1-43.

1-44. Mean = 0.917 Median = 0.85 Std dev = 0.4569 [pic]

1-45. Mean = $18.53 Median = $15.93 [pic]

1-46.

1-47. Using MINITAB
| |Stem |Leaves |
| |4 |5 |5688 |
| |8 |6 |0123 |
| |14 |6 |677789 |
| |(9) |7 |002223334 |
| |11 |7 |55667889 |
| |3 |8 |224 |

1-48.

There are no outliers. Distribution is skewed to the left.

1-49. A stem-and-leaf display is a quickly drawn type of histogram useful in analyzing data. A box plot is a more advanced display useful in identifying outliers and the shape of the distribution of the data.

|1-50. |Stem |Leaves |
| |1 |0 |5 |
| |1 |1 | |
| |1 |2 | |
| |7 |3 |234578 |
| |(13) |4 |2234567788899 |
| |11 |5 |012235678 |
| |2 |6 |3 |
| |1 |7 |8 |

1-51. The data are narrowly and symmetrically concentrated near the median (IQR and the whisker lengths are small), not counting the two extreme outliers.

1-52. Wider dispersion in data set #2. Not much difference in the lower whiskers or lower hinges of the two data sets. The high value, 24, in data set #2 has a significant impact on the median, upper hinge and upper whisker values for data set #2 with respect to data set #1.

1-53. Mean = 127 Var = 137 sd = 11.705 mode = 127 outliers: TWA, Lufthansa
[pic]

1-54. Stem-and-leaf of C2 N = 45 Leaf Unit = 1.0

| |f |Stem |Leaves |
| |13 |1 |0011111223444 |
| |18 |1 |55689 |
| |(6) |2 |022333 |
| |21 |2 |567789 |
| |15 |3 |0122234 |
| |8 |3 |78 |
| |6 |4 |012 |
| |3 |4 |7 |
| |2 |5 |23 |

1-55. Outliers are detected by looking at the data set, constructing a box plot or stem-and-leaf display. An outlier should be analyzed for information content and not merely eliminated.

1-56. The median is the line inside the box. The hinges are the upper and lower quartiles. The inner fences are the two points at a distance of 1.5 (IQR) from the upper and lower quartiles. Outer fences are similar to the inner fences but at a distance of 3 (IQR). The box itself represents 50% of the data.

|1-57. |Mine A: | | |Mine B: | | |
| |f |Stem |Leaves |f |Stem |Leaves |
| |2 |3 |24 |2 |2 |34 |
| |4 |3 |57 |4 |2 |89 |
| |7 |4 |123 |6 |3 |24 |
| |(5) |4 |55689 |9 |3 |578 |
| |7 |5 |123 |(3) |4 |034 |
| |4 |5 | |7 |4 |789 |
| |4 |6 |0 |4 |5 |012 |
| |3 |7 |36 |1 |5 |9 |
| |1 |8 |5 | | | |
| | | | | | | |

Values for Mine A are smaller than for Mine B, right-skewed, and there are three outliers. Values for Mine B are larger and the distribution is almost symmetric. There is larger variance in B.

1-58. No. One needs to use descriptive statistics and/or statistical inference.

1-59.
|Comparing two data sets using Box Plots | | | | | |
| | | | | | | | | | | |
| | | | |Lower |Lower |Median |Upper Hinge |Upper | | |
| | | | |Whisker |Hinge | | |Whisker | | |
| | | |Shipments |1.3 |1.975 |2.4 |3.4 |4.2 | | |
| | | |Market Share |3.6 |5.3 |6.55 |9.275 |11.4 | | |
| | | | | | | | | | | |
| | | | | | | | | | | |
| | | | | | | | | | | |
| | | | | | | | | | | |
| | | | | | | | | | | |
| | | | | | | | | | | |
| |Shipments | | | | | | | | | |
| | | | | | | | | | | |
| |Market Share | | | | | | | | | |
| | | | | | | | | | | |
| | | | | | | | | | | |
| | | | | | | | | | | |
| | | | | | | | | | | |

1-60. Mean = 5.785 median = 5.782 The mean is impacted by the high rate of fatalities for the very small car classification. [pic]

1-61. Answers will vary.

a. If we add the value “5” to all the data points, then the average, median, mode, first quartile, third quartile and 80th percentile values will change by “5”. There is no change in the variance, standard deviation, skewness, kurtosis, range and interquartile range values. b. Average: if we add “5” to all the data points, then the sum of all the numbers will increase by “5*n”, where n is the number of data points. The sum is divided by n to get the average. So 5*n / n = 5: the average will increase by “5”.

Median: If we add “5” to all the data points, the median value will still be the midway point in the ordered array. Its value will also increase by “5”

Mode: Adding “5” to all the data points changes the number that occurs most frequently by “5”

First Quartile: adding “5” to all the data points does not change the location of the first quartile in the ordered array of numbers, which is: (.25)(n+1) where n is the number of data points. Whether the first quartile falls on a specific data point or between two data points, the resulting value will have been increased by “5”.

Third Quartile: adding “5” to all the data points does not change the location of the third quartile in the ordered array of numbers, which is: (.75)(n+1) where n is the number of data points. Whether the third quartile falls on a specific data point or between two data points, the resulting value will have been increased by “5”.

80th percentile: adding “5” to all the data points has the same effect as in the calculation of the first or third quartile. The value will be increased by “5”

Range: adding “5” to the all the data points will have no effect on the calculation of the range. Since both the highest value and the lowest value have been increase by the same number, the subtraction of the lowest value from the highest value still yields the same value for the range.

Variance: adding “5” to all the data points has no effect on the calculation of the variance. Since each data point is increased by “5” and the average has also been shown to increase by the same factor, the differences between each individual new data point and the new average will not change and will not be affected by squaring the difference, summing the squared differences and dividing by number of data points.

Standard Deviation: since the variance is not affected by adding “5” to each data point, neither is the standard deviation.

Skewness: Since each data point is increased by “5” and the average has also been shown to increase by the same factor, the differences between each individual new data point and the new average will not change. Therefore, the numerator in the formula for skewness is not affected. Since the standard deviation is not affected as well (the denominator), there is no change in the value for skewness.

Kurtosis: Since each data point is increased by “5” and the average has also been shown to increase by the same factor, the differences between each individual new data point and the new average will not change. Therefore, the numerator in the formula for kurtosis is not affected. Since the standard deviation is not affected as well (the denominator), there is no change in the value for kurtosis.

Interquartile Range: given that both the first quartile and the third quartile increased by the same factor, “5”, the difference between the two values remains the same.

c. Multiplying each data point by a factor “3” results in the following changes. The mean, median, mode, first quartile, third quartile and 80th percentile values will be increased by the same factor “3”. In addition, the standard deviation and the range will also increase by the same factor “3”. The variance will increase by the factor squared, and the skewness and kurtosis values will remain unchanged. d. Multiplying all data points by a factor “3” and adding a value “5” to each data point has the following results. The order of operation is first to multiply each data point and then add a value to each data point. Each data point is first multiplied by the factor “3” and then the value “5” is added to each newly multiplied data point. Multiplying each data point by the factor “3” yields the results listed in c). Adding a value 5 to the newly multiplied data points yields the results listed in a).

1-62. [pic] s = 13.944 s2 = 194.43

1-63. ( = 504.688 ( = 94.547

|Measures of Central tendency | | | | |
| | | | | | | |
|Mean |504.6875 |Median |501.5 |Mode |#N/A | |
| | | | | | | |
|Measures of Dispersion | | | | |
| | |If the data is of a | | | |
| | |Sample |Population | | | |
| |Variance |9227.5121 |8939.15234 |Range |346 | |
| |St. Dev. |96.0599401 |94.5470906 |IQR |149.5 | |

1-64.

Step 1: Enter the data from problem 1-63 into cells Y4:Y35 of the template: Histogram.xls from Chapter 1. The template will order the data automatically.

Step 2: We need to select a starting point for the first class, an ending point for the last class, and a class interval width. The starting point of the first class should be a value less than the smallest value in the data set. The smallest value in the data set is 344, so you would want to set the first class to start with a value smaller than 344. Let’s use 320. We also selected 710 as the ending value of the last class, and selected 50 as the interval width. The data input column and the histogram output from the template are presented below. The end-point for each class is included in that class; i.e., the first class of data goes from more than 320 up to and including 370, the second class starts with more than 370 up to and including 420, etc.

[pic]

1-65. Range: 690 – 344 = 346 90th percentile lies in position: 33(90/100) = 29.7 It is 632.7 First quartile lies in position: 33(25/100) = 8.25 It is 419.25 Median lies in position: 33(50/100) = 16.5 It is 501.5 Third quartile lies in position: 33(75/100) = 24.75 It is 585.75

1-66.

|1-67. | |Stem |Leaves |
| |2 |1 |24 |
| |7 |1 |56789 |
| |(3) |2 |023 |
| |6 |2 |55 |
| |4 |3 |24 |
| |2 |3 | |
| |2 |4 |01 |

1-68.

The data is skewed to the right.

|1-69. | |Stem |Leaves |
| |3 |1 |012 |
| |4 |1 |9 |
| |12 |2 |1122334 |
| |(9) |2 |556677889 |
| |6 |3 |024 |
| |3 |3 |57 |
| |1 |4 | |
| |1 |4 | |
| |1 |5 | |
| |1 |5 | |
| |1 |6 |2 |

The data is skewed to the right with one extreme outlier (62) and three suspected outliers (10,11,12)

1-70.

1-71. Mean = 25.857 sd = 9.651 [pic]

1-72. Mean = 18.875 var = 38.65 outliers: none

1-73. Mean = 33.271 sd = 16.945 var = 287.15 QL = 25.41 Med = 26.71 QU = 35 Outliers: Morgan Stanley (91.36%)

1-74. Mean = 3.18 sd = 1.348 var = 1.817 QL = 1.975 Med = 2.95 QU = 3.675 Outliers: 8.70

1-75. a. IQR = 3.5 b. data is right-skewed c. 9.5 is more likely to be the mode, since the data is right-skewed d. Will not affect the plot.

1-76. Bar graph showing changes over time. Both the employee’s out-of-pocket and payroll deduction expenses have increased substantially over the last three years.

1-77. Mean (billions of tons) = 1.439 Mean (per capita tons) = 9.98 The mathematical computation for both averages is the same, however, they do differ in meaning. On average, the countries listed emit 1.439 billion tons of carbon dioxide each. However, the emissions per person is 9.98 tons. Dividing billions of tons by the rate per capita for the US, we get a population estimate of 256 million people, which is close to the actual population for 1997.

1-78. Mean = 2.75 sd = 14.44 var = 208.59 QL = (5.075 Med = 7.9 QU = 13.675 Outliers: –30.2

1-79.
Mean = 10301.05 sd = 16.916 var = 286.155
(Using the template: “Basic Statistics.xls”)
|Measures of Central tendency | | | |
| | | | | | |
|Mean |10301.05 |Median |10300.5 |Mode |10300 |
| | | | | | |
|Measures of Dispersion | | | |
| | |If the data is of a | | |
| | |Sample |Population | | |
| |Variance |286.155263 |271.8475 |Range |54 |
| |St. Dev. |16.9161244 |16.4877985 |IQR |16.25 |

1-80. Mean = 99.039 sd = .4366 var = .1907 Median = 99.155

1-81. Mean = 17.587 sd = .466 var = .2172

|Measures of Central tendency | | | |
| | | | | | |
|Mean |17.5875 |Median |17.5 |Mode |18.3 |
| | | | | | |
|Measures of Dispersion | | | |
| | |If the data is of a | | |
| | |Sample |Population | | |
| |Variance |0.21716667 |0.20359375 |Range |1.4 |
| |St. Dev. |0.46601144 |0.45121364 |IQR |0.75 |

1-82. Mean = 29.018 sd = 4.611 (Using the template: “Basic Statistics.xls”)
|Measures of Central tendency | | | | |
| | | | | | | |
|Mean |29.018 |Median |29.75 |Mode |#N/A | |
| | | | | | | |
|Measures of Dispersion | | | | |
| | |If the data is of a | | | |
| | |Sample |Population | | | |
| |Variance |21.26552 |17.012416 |Range |12.38 | |
| |St. Dev. |4.6114553 |4.12461101 |IQR |2.92 | |

1-83. Mean = 4.8394 sd = .08 Median = 4.86

1-84. Stock Prices for period: April, 2001 through June, 2001 [Answers will vary due to dates used.]

a). Mean and Standard Deviation for Wal-Mart
|Basic Statistics from Raw Data |Stock Prices: Wal-Mart | |
| | | | | | | | |
| | | | | | | | |
| |Measures of Central tendency | | | | |
| | | | | | | | |
| |Mean |51.041478 |Median |51.1266 |Mode |50.158 | |
| | | | | | | | |
| |Measures of Dispersion | | | | |
| | | |If the data is of a | | | |
| | | |Sample |Population | | | |
| | |Variance |2.25711298 |2.22128579 |Range |6.1911 | |
| | |St. Dev. |1.50236912 |1.49039786 |IQR |1.9613 | |
| | | | | | | | |
| |Higher Moments | | | | | |
| | | |If the data is of a | | | |
| | | |Sample |Population | | | |
| | |Skewness |0.07083784 |0.06913994 | | | |
| | |(Relative) Kurtosis |-0.711512 |-0.7500338 | | | |
| | | | | | | | |

b). Mean and Standard Deviation for K-Mart
|Basic Statistics from Raw Data |Stock Prices: K-Mart | |
| | | | | | | | |
| | | | | | | | |
| |Measures of Central tendency | | | | |
| | | | | | | | |
| |Mean |10.450952 |Median |10.66 |Mode |11.8 | |
| | | | | | | | |
| |Measures of Dispersion | | | | |
| | | |If the data is of a | | | |
| | | |Sample |Population | | | |
| | |Variance |0.9852023 |0.96956417 |Range |3.51 | |
| | |St. Dev. |0.99257358 |0.9846645 |IQR |1.955 | |
| | | | | | | | |
| |Higher Moments | | | | | |
| | | |If the data is of a | | | |
| | | |Sample |Population | | | |
| | |Skewness |-0.4070262 |-0.3972703 | | | |
| | |(Relative) Kurtosis |-1.132009 |-1.1378913 | | | |
| | | | | | | | |

c). Coefficient of variation:

CV = std. dev ( mean

For Wal-Mart: for K-Mart: considering the data as a population: CV = 1.49039786 / 51.041478 = 0.0292 CV = 0.9846645 / 10.450952 = 0.0942

considering the data as a sample: CV = 1.50236912 / 51.041478 = 0.02943 CV = 0.99257358 / 10.450952 = 0.09497

d). There is a greater degree of risk in the stock prices for K-Mart than for Wal-Mart over this three month period.

e). For DJIA considering the data as a population: CV = 427.913791 / 10681.11 = 0.04006

considering the data as a sample: CV = 431.350905 / 10681.11 = 0.04038

Wal-Mart stocks provided a less risky return for this time period relative to DJIA and K-Mart.

f). 100 Shares of Wal-Mart stocks purchased April 2, 2001: Price = $50.5674 Cost = $5056.74 Mean of holding 100 shares: $5104.15 Std dev of holding 100 shares: 1.4904 (rounded: if data considered a population) 1.5024 (rounded: if data considered a sample)
1-85.
a). for a process mean = 2004 VARP = Average SSD2004 + offset2 VARP = 3.5 + offset2 where offset = target – process

b). if target = process, then offset = 0 substituting: VARP = 3.5 + offset2 = 3.5 + 02 = 3.5

1-86.
a) & b): CPI and Gas prices for period: June 97 through May 01. (Non-seasonally adjusted series.)

CPI index converted (by ( 100) in order to compare both series on same chart. There is no seasonal pattern present in the CPI index. Steady trend present in CPI; considerable variability in gas prices. Gas prices increased considerably more than the overall CPI for the same time period.

1-87.
a). Pie Chart: AIDS cases by Age groups

|Age Group |No. |% |
|Under 5: |6812 |0.90% |
|Ages 5 to 12: |1992 |0.26% |
|Ages 13 to 19: |3865 |0.51% |
|Ages 20 to 24: |26518 |3.52% |
|Ages 25 to 29: |99587 |13.21% |
|Ages 30 to 34: |168723 |22.38% |
|Ages 35 to 39: |168778 |22.39% |
|Ages 40 to 44: |124398 |16.50% |
|Ages 45 to 49: |72128 |9.57% |
|Ages 50 to 54: |38118 |5.06% |
|Ages 55 to 59: |20971 |2.78% |
|Ages 60 to 64: |11636 |1.54% |
|Ages 65 or older: |10378 |1.38% |

[pic]
b). Pie Chart: AIDS cases by Race
|Race |No. |% |
|White, not Hispanic |324822 |43.09% |
|Black, not Hispanic |282720 |37.50% |
|Hispanic |137575 |18.25% |
|Asian/Pacific Islander |5546 |0.74% |
|American Indian/Alaska Native |2234 |0.30% |
|Race/ethnicity unknown |1010 |0.13% |

[pic]
1-88. (Using the template: “Box Plot 2.xls”)
|Comparing two data sets using Box Plots | |Salaries 2004 | | |
| | | | | | | | | | | |
| | | | |Lower Whisker|Lower Hinge |Median |Upper Hinge |Upper Whisker| | |
| | | |Cubs |300000 |650000 |1550000 |5750000 |9500000 | | |
| | | |White Sox |301000 |340000 |775000 |3875000 |8000000 | | |
| | | | | | | | | | | |
| | | | | | | | | | | |
| | | | | | | | | | | |
| | | | | | | | | | | |
| | | | | | | | | | | |
| | | | | | | | | | | |
| |Cubs | | | | | | | | | |
| | | | | | | | | | | |
| |White Sox | | | | | | | | | |
| | | | | | | | | | | |
| | | | | | | | | | | |
| | | | | | | | | | | |
| | | | | | | | | | | |

Outliers: Cubs: Sosa’s salary of $16M White Sox: Ordonez’s salary of $14M
Furthermore, the median salary of the Cubs is twice the median salary of the White Sox. There are some players on both teams making the league minimum salary.

Somewhat lower salary range for the White Sox relative to the Cubs due to the fact that only seven (7) players on the Cubs were paid $500,000 or less while eleven (11) players earned less than that amount on the White Sox.

1-89 [pic]

[pic]

Correlation Table:

| |Errors |OT |Type |Skill |Stress |
|Errors |1 | | | | |
|OT |0.962672 |1 | | | |
|Type |0.036243 |0.065654 |1 | | |
|Skill |-0.89162 |-0.82627 |-0.00447 |1 | |
|Stress |0.979628 |0.926601 |0.053555 |-0.93428 |1 |

There is high positive correlation between the number of errors and the amount of overtime and stress, but a high negative correlation between the number of errors and skill level. Skill level appears to decrease the number of errors, but overtime and stress add to the number of errors.

Overtime is highly correlated with stress and negatively correlated with skill level. Skill level and stress are negatively correlated. The higher the skill level of the employee the lower the stress level.

-----------------------

[pic]

[pic]

[pic]

[pic]

[pic]

[pic]

Box and Whisker Plot

34 cases

8.5

7.9

7.3

6.7

6.1

5.5

80

60

40

20

0

31 cases

Box and Whisker Plot

C1

C1

C2

42

36

30

24

18

12

Box and Whisker Plot

C1

80

60

40

20

0

Box and Whisker Plot

C1

34

26

18

10

16 cases

Box and Whisker Plot

C1

100

80

60

40

20

15 cases

Box and Whisker Plot

C1

9

7

5

3

1

20 cases

Box and Whisker Plot

C1

20

0

-20

-40

8 cases

Box and Whisker Plot

Similar Documents

Premium Essay

Fin5Eme

...Econometric Methods FIN5EME Semester 1, 2013 Assignment 2 * Cobb-Douglas cost function: TCi = µQiβ2 pi1β3 pi2β4 pi3β5 (1) Where, TCi= Total Cost for firm i Q= Output of firm i pi1= Wage Rate pi2= Rental Price of Capital pi3= Fuel Price * Taking the natural log of equation (1) log(TCi)= β1 + β2 log(Qi) + β3 log(pi1) + β4 log(pi2) + β5 log(pi3) + ei (2) where β1= (logµ) and ei= error term. * Eviews Output of the log-log model is as follows: Dependent Variable: LOG(TC) | | | Method: Least Squares | | | Date: 06/12/13 Time: 12:47 | | | Sample: 1 145 | | | | Included observations: 145 | | | | | | | | | | | | | Variable | Coefficient | Std. Error | t-Statistic | Prob.   | | | | | | | | | | | C | -3.526503 | 1.774367 | -1.987471 | 0.0488 | LOG(Q) | 0.720394 | 0.017466 | 41.24448 | 0.0000 | LOG(WAGE) | 0.436341 | 0.291048 | 1.499209 | 0.1361 | LOG(CAPITAL) | -0.219888 | 0.339429 | -0.647819 | 0.5182 | LOG(FUEL) | 0.426517 | 0.100369 | 4.249483 | 0.0000 | | | | | | | | | | | R-squared | 0.925955 |     Mean dependent var | 1.724663 | Adjusted R-squared | 0.923840 |     S.D. dependent var | 1.421723 | S.E. of regression | 0.392356 |     Akaike info criterion | 1.000578 | Sum squared resid | 21.55201 |     Schwarz criterion | 1.103224 | Log likelihood | -67.54189 |     Hannan-Quinn criter. | 1.042286 | F-statistic | 437.6863 |     Durbin-Watson stat...

Words: 2131 - Pages: 9

Free Essay

The Unbearable Song

...National Statistical Coordination Board (NSCB), the Philippines’ Gross Domestic Product (GDP) growth for 2014 was at 6.1%, making our country one of the fastest growing economies in Asia. However, according to the National Statistics Office (NSO), unemployment rate remains at 7%. As an investigative journalist, you are trying to help the public reconcile that there is economic growth despite our deteriorating labor market. Using the data collected, you will select and analyse three statistics to show the current state of our labor market. Aside from this, you will supplement your quantitative analysis with qualitative information or data from minimum wage workers at SM. Your comprehensive investigation should justify your conclusion about the country’s economic condition. Depending on your conclusion, you will provide possible ways to sustain the current economic condition or solve the problem that our economy is facing. In doing so, you might want to focus on the labor market. You will be assessed on your ability to identify and explain relevant quantitative data related to the issue, and your use of narratives (qualitative) to justify quantitative analysis. You will also be assessed based on the effectivity and feasibility of recommended solutions to the issue. Criteria Qualitative Labor Market Analysis 30% Exceptional A+ More than three (3) highly relevant...

Words: 705 - Pages: 3

Premium Essay

The Effect of Environmental Regulations on Foreign Direct Investment

...Foreign Direct Investment inflow since investing firms experience significant cost efficiencies and comparative advantages. The data set is mainly chosen from the World Data Bank and five explanatory variables are used to investigate their influence on FDI inflow (as percentage of GDP). During the empirical analysis a pivotal factor will be the OECD membership even if several environmental standards are controlled. We expect to see some significant determinants of FDI inflow in order to either agree or reject the pollution haven hypotheses. Contents 1 Introduction 2 The Two Hypotheses 3 Data Set 4 Econometric Model and Results 4.1 Linear Regression Model (OLS) . . . . . . . . . . . . . . . . . 4.2 Assumptions of Gauss-Markov-Theorem . . . . . . . . . . . . 4.3 Chow Test for Structural Break . . . . . . . . . . . . . . . . . 5 Conclusion A Appendix A.1 Program Code EViews . . . . . . . . . . . . . . . . . . . . . . 1 1 1 2 3 4 6 7 9 9 1 Introduction International trade theory is based on the concept of comparative advantages which is consistent with what we could observe in the booming globalization process during the last decades. A multinational firm...

Words: 3184 - Pages: 13

Premium Essay

Understanding Business Research Terms and Concepts: Part 2

...Goudy RES/351 July 6, 2015 Tracy Sipma Descriptive statistics Descriptive statistics suggests a straightforward quantitative outline of a data-set which has been gathered. It helps us comprehend the experimentation or data-set in-detail and tells people concerning the mandatory details that help show the data perceptively. Descriptive statistics, we just convey exactly what the data reveals and tell us. Most of the statistical averages and numbers we estimate are essentially illustrative averages. For instance the Dow Jones Industrial tells us about the typical performance of select firms. The grade-point avg. tells us about the typical performance of a pupil in school. The GDP growth rate tells us about the typical performance of a state. Therefore illustrative statistics attempts to catch a sizable group of observations and offers us some concept concerning the data-set. Descriptive statistics aims to describe data set information with summary graphs and tables (Linda Hollis, n.d.). Inferential Statistics Inferential statistics includes drawing the correct conclusions from your statistical evaluation that's been performed using descriptive data. Ultimately, it really is the inferences that make studies significant and this element is dealt with-in inferential data. Most forecasts of the potential and generalizations of a population by analyzing a smaller sample come under the scope of inferential statistics. Many social sciences experiments offer with analyzing a little...

Words: 904 - Pages: 4

Premium Essay

Statictcs

...AVU-PARTNER INSTITUTION MODULE DEVELOPMENT TEMPLATE PROBABILITY AND STATISTICS Draft By Paul Chege Version 19.0, 23rd March, 2007 C. TEMPLATE STRUCTURE I. INTRODUCTION 1. TITLE OF MODULE Probability and Statistics 2. PREREQUISITE COURSES OR KNOWLEDGE Secondary school statistics and probability. 3. TIME The total time for this module is 120 study hours. 4. MATERIAL Students should have access to the core readings specified later. Also, they will need a computer to gain full access to the core readings. Additionally, students should be able to install the computer software wxMaxima and use it to practice algebraic concepts. 5. MODULE RATIONALE Probability and Statistics, besides being a key area in the secondary schools’ teaching syllabuses, it forms an important background to advanced mathematics at tertiary level. Statistics is a fundamental area of Mathematics that is applied across many academic subjects and is useful in analysis in industrial production. The study of statistics produces statisticians that analyse raw data collected from the field to provide useful insights about a population. The statisticians provide governments and organizations with concrete backgrounds of a situation that helps managers in decision making. For example, rate of spread of diseases, rumours, bush fires, rainfall patterns, and population changes. On the other hand, the study of probability...

Words: 8620 - Pages: 35

Premium Essay

Baxter

... 1 Master of Business Administration Course Instructor: Dr. Swapan Kumar Dhar Definition of Statistics Statistics is the science of collecting, organizing, presenting, analyzing and interpreting data for the purpose of making intelligent statements and drawing appropriate conclusions. So, according to this definition, there are four stages: (1) Collection of data (2) Presentation of data (3) Analysis of data and (4) Interpretation of data. Example of Statistics: Examples include the average starting salary of college graduates, the number of deaths due to road accidents last year, and 20% students of BBA are female. In these examples statistics are a value or a percentage. Other examples include: 95% students of BBA come to the class in time. 25% students of IBA come to the campus by car. The above are all examples of statistics. Data: Data are the facts and figures that are collected, analyzed and summarized for presentation and interpretation. The data collected in a particular study are referred as the data set for the study. For example, the heights (in cm.) of 14 randomly selected persons from a group of 100 persons are as follows: 152, 160, 158, 155, 154, 155, 162, 164, 160, 153, 161, 158, 167, 151. The above information on height of people constitutes a data. A set of five students is selected from a class of the course “Business Statistics’ and measurements...

Words: 6578 - Pages: 27

Premium Essay

Understanding Business Research Terms and Concepts: Part 2

...Business Research Terms and Concepts: Part 2 Descriptive statistics Descriptive statistics suggests a straightforward quantitative outline of a data-set which has been gathered. It helps us comprehend the experimentation or data-set in-detail and tells people concerning the mandatory details that help show the data perceptively. Descriptive statistics, we just convey exactly what the data reveals and tell us. Most of the statistical averages and numbers we estimate are essentially illustrative averages. For instance the Dow Jones Industrial tells us about the typical performance of select firms. The grade-point avg. tells us about the typical performance of a pupil in school. The GDP growth rate tells us about the typical performance of a state. Therefore illustrative statistics attempts to catch a sizable group of observations and offers us some concept concerning the data-set. Descriptive statistics aims to describe data set information with summary graphs and tables (Linda Hollis, n.d.). Inferential Statistics Inferential statistics includes drawing the correct conclusions from your statistical evaluation that's been performed using descriptive data. Ultimately, it really is the inferences that make studies significant and this element is dealt with-in inferential data. Most forecasts of the potential and generalizations of a population by analyzing a smaller sample come under the scope of inferential statistics. Many social sciences experiments offer with analyzing a little...

Words: 915 - Pages: 4

Premium Essay

Statistics

...Describe the role of statistics in business decision making. Provide at least three examples or problem situations in which statistics was used or could be used. Statistics plays a significant part in successful business decisions. Any successful entrepreneur has to be especially sharp and correct when making business decisions. The entrepreneur should have a feeling for the market demand for the company's products and should therefore be able to identify what to produce products or services that will sell. The volume of sales may also be accurately estimated. Statistics will help entrepreneurs to align production according to the market demand. Utilizing business statistics the quality of the products may also be verified in a more scientific manner to save on measuring cost. http://voices.yahoo.com/importance-statistics-business-decisions-7356560.html A major metropolitan newspaper randomly sampled 150 readers from their list of 100,000 subscribers. They asked whether the paper should increase its coverage of local news. Forty percent of the sample wanted more local news. What is the 95% confidence interval for the proportion of readers who would like more coverage of local news? (A) 0.32 to 0.48 (B) 0.33 to 0.47 (C) 0.34 to 0.46 (D) 0.35 to 0.45 (E) 0.36 to 0.44 Solution The correct answer is (A). The approach that we used to solve this problem is valid when the following condition are met. Suppose we want to estimate the average weight of an adult male in Dekalb...

Words: 623 - Pages: 3

Premium Essay

Simulation with Arena

...Simulation with Arena Assignment G2: a multi-echelon inventory policy Christopoulou Evdoxia Kuodzevicius Bernardas 101283 534893 10/12/2012 Preliminaries The given Arena model is a steady-state model, because there is no clear event that could indicate the end of model run and actually we are interested in the long run behavior of the system represented by the given model. Before we start to do the main parts of the assignment, that is design of experiments and optimization, we conduct some preliminary experiments to check if the model could be modified in order to get better results. First we set the number of replications to 30 and we choose the length of each replication be 730 days (two years), which we think should be enough to reach the steady-state. When we run the model with these settings, we get that the mean of the main response variable - average cost - is 568.4 and the half width is equal to 2.37. Although the confidence interval is not extremely wide taking into account the relatively high value of mean, we still perform a check whether it is possible to get more precise results by using common random numbers. To figure out if the model would benefit from the use of CRN we perform a pilot study. In this study we need to have two different scenarios and then we can decide whether it is useful to use CRN by checking the following inequality: { } { } { } and if this inequality holds, then it is worth using CRN in the model. In our case the scenarios differ in three...

Words: 3756 - Pages: 16

Premium Essay

Fina 301 Chapter 13 Solutions

...CHAPTER 13 - WEIGHING NET PRESENT VALUE AND OTHER CAPITAL BUDGETING CRITERIA Questions LG1 1. Is the set of cash flows depicted below normal or non-normal? Explain. |Time |0 |1 |2 |3 |4 |5 | |Cash Flow |-$100 |-$50 |$80 |$0 |$100 |$100 | They’re normal: there is only one change in cash flows from negative to positive. LG1 2. Derive an accept/reject rule for IRR similar to 13-8 that would make the correct decision on cash flows that are non-normal, but which always have one large positive cash flow at time zero followed by a series of negative cash flows: |Time |0 |1 |2 |3 |4 |5 | |Cash Flow Sign |+ |- |- |- |- |- | With one positive at the beginning and all future cash flows negative, this type of project would be worth more if rates were higher, implying that the NPV profile would be upward-sloping. So the appropriate accept/reject decision rule would look like Accept Project if IRR ≤ Cost of Capital Reject Project if IRR > Cost of Capital LG1 3. Is it possible for a company to initiate two products that target the same market and are not mutually exclusive? Sure, as long as the market has room for both products. LG2 4. Suppose that your company used “APV”, or “All-the-Present Value-Except-CF0”, to analyze capital budgeting projects. What would this rule’s benchmark value...

Words: 3605 - Pages: 15

Free Essay

Judeg

...Department of Statistics and Actuarial Science September 1, 1998 Table Of Contents Page Before Using This Manual……………………………………………………………………………….3 Introduction to SPSS……………………………………………………………………………………..4 SPSS Basics……………………………………………………………………………………………... 5 Tutorial 1: SPSS Windows.…………………………………………………………………………5 Tutorial 2: Starting A SPSS Session.……………………………………………………………...6 Tutorial 3: Getting Help on SPSS.………………………………………………………………… 6 Tutorial 4: Ending A SPSS Session.……………………………………………………………… 6 Creating and Manipulating Data in SPSS.……………………………………………………………. 7 Tutorial 1: Creating a New Data Set.……………………………………………………………... 7 Tutorial 2: Creating a New Data Set From Other File Formats.……………………………….10 Tutorial 3: Opening an Existing SPSS Data Set.………………………………………………. 16 Tutorial 4: Printing a Data Set.…………………………………………………………………… 16 Generating Descriptive Statistics in SPSS…………………………………………………………...17 Tutorial 1: Mean, Sum, Standard Deviation, Variance, Minimum Value, Maximum Value, and Range.……………………………………………………….. 17 Tutorial 2: Correlation.…………………………………………………………………………….. 18 Generating Graphical Statistics in SPSS……………………………………………………………..20 Tutorial 1: How to Generate Scatter Plots.………………………………………………………20 Tutorial 2: How to Generate A Histogram.………………….…………………………………... 22 Tutorial 3: How to Generate A Stem and Leaf Plot……………………………………………..23 Tutorial 4: How to Generate A Box Plot………………………………………………………….26 Statistical Models in SPSS……………………………………………………………………………..28 Tutorial 1: Linear Regression...

Words: 5895 - Pages: 24

Premium Essay

Stats

...variance, and (e) standard deviation: 2, 2, 0, 5, 1, 4, 1, 3, 0, 0, 1, 4, 4, 0, 1, 4, 3, 4, 2, 1, 0 A. 2, 2, 0, 5, 1, 4, 1, 3, 0, 0, 1, 4, 4, 0, 1, 4, 3, 4, 2, 1, 0 Solution: (a) Mean = sum/21 = 42/21 = 2 (b) Arrange the numbers in ascending order 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 2, 2, 2, 3, 3, 4, 4, 4, 4, 4, 5 Median = middle number = 2 (c) Sum of squared deviations: |x |x-mean |(x-mean)^2 | |2 |0 |0 | |2 |0 |0 | |0 |-2 |4 | |5 |3 |9 | |1 |-1 |1 | |4 |2 |4 | |1 |-1 |1 | |3 |1 |1 | |0 |-2 |4 | |0 |-2 |4 | |1 |-1 |1 | |4 |2 |4 | |4 |2 |4 | |0 |-2 |4 | |1 |-1 |1 | |4 |2 |4 | |3 |1 |1 | |4 |2 ...

Words: 1866 - Pages: 8

Free Essay

Accounting

...Click here to download the solutions manual / test bank INSTANTLY!! http://testbanksolutionsmanual.blogspot.com/2011/02/accounting-information-systems-romney.html ------------------------------------------------------------------------------------------------------------------------ Accounting Information Systems Romney 11th Edition Solutions Manual Accounting Information Systems Romney 11th Edition Solutions Manual Accounting Information Systems Romney 11th Edition Solutions Manual Accounting Information Systems Romney Steinbart 11th Edition Solutions Manual Accounting Information Systems Romney Steinbart 11th Edition Solutions Manual ------------------------------------------------------------------------------------------------------------------------ ***THIS IS NOT THE ACTUAL BOOK. YOU ARE BUYING the Solution Manual in e-version of the following book*** Name: Accounting Information Systems Author: Romney Steinbart Edition: 11th ISBN-10: 0136015182 Type: Solutions Manual - The file contains solutions and questions to all chapters and all questions. All the files are carefully checked and accuracy is ensured. - The file is either in .doc, .pdf, excel, or zipped in the package and can easily be read on PCs and Macs.  - Delivery is INSTANT. You can download the files IMMEDIATELY once payment is done. If you have any questions, please feel free to contact us. Our response is the fastest. All questions will always be answered in 6...

Words: 18533 - Pages: 75

Premium Essay

Saswork

...© Orangetree Business Solutions Private Limited, 2012 No part of this book should be referenced or copied without the prior permission of the company. A FEW WORDS TO THE STUDENTS Analytics is becoming a popular tool for managerial decision making. It‘s still not so widespread in countries like India, but in the west it has become a standard practice. Previously studying analytics involved an in depth knowledge of statistics and programming languages. But widespread availability of statistical package software has changed the reality to some extent. Now more emphasis is given on the application of the techniques to solve the business problems. So there is a need to understand the meaning of the statistical procedures. This book has been written to cater that need. In this book, all the necessary concepts have been explained keeping the business problem in mind. Also, to remove the apathy for statistics, use of mathematical expressions have been limited. That doesn‘t imply that we don‘t have to study the mathematics part. The intention is to put the substance over matter. As the students get accustomed to these statistical concepts, they can go for further investigations using various mathematical and statistical techniques. A list of suggested books and links have been given in the appendix. This book is directly related to the instructor‘s presentation. So it is highly advised that students should go through this material at the end of each class. As for general ...

Words: 24975 - Pages: 100

Premium Essay

Global Eating Disorder

...Gilbert,p 3). According to disabled world, an eating disorder is a continual disturbance of eating and or eating-related behavior that leads to altered consumption or absorption of food in the body system, in a great way impairing the physical health or psychological and social functioning of the person. Eating disorders are more often than not long-term problems, which can cause great suffering for victims and their families (Eating Disorder Symptoms, Types and Treatment Methods, Para 1). Analysis The writer has chosen this topic because as the statistics across the world have proven obesity has almost turned out to be a national disaster. The thesis of this report is that having an eating disorder puts your body into a high extent of harm. Solutions to the issue will be availed at the conclusion of the report. The writer has chosen this topic because the scope of this enquiry will extend from 2006 to 2011 is discussing the problem, and from 3,000 BC to 2011 AD in the expression of solutions. Source of information will be journals, books and reports by health organizations including the World Health Organization (WHO). There are generally two recognized types of eating disorders: Anorexia Nervosa (AN) and Bulimia Nervosa (BN). In Anorexia Nervosa, This name of this disorder literally means "loss of appetite." But In reality, the person has not in...

Words: 1991 - Pages: 8