Statistics

In this problem, you will calculate mean values for four independent samples and then explain the observed differences (please read the entire problem before beginning). Using the SWC data (SWC tab) in dataset DS2.xls, do the following: Select a 10% random sample of all cases and calculate the Mean for the IMR (infant mortality rate) for this sample. Record the mean. (2 points) Select a second 10% random sample and calculate the Mean for the IMR for this sample. Record the mean. (2 points) Select a third 10% random sample and calculate the Mean for the IMR for this sample. Record the mean. (2 points) Select a fourth 10% random sample and calculate the Mean for the IMR for this sample. Record the mean. (2 points) Create a table that displays the four means calculated above in parts a-d. Calculate the average of the four means (Mean-1, Mean-2, Mean-3, Mean-4) and add this figure to the table. Describe the mean values calculated and offer an explanation for any differences observed. That is, explain why the four means calculated in parts a-d are not exactly the same. Include a comment about the “fifth mean” (the average of the means calculated in parts a-d). (2 points) In the next two problems, you will calculate frequency distributions and express them in different kinds of charts. Using the Hospital Charges data in dataset DS2.xls, do the following: Compute the minimum and maximum age for the cases in the spreadsheet. (1 point) Create a frequency distribution of age using the following categories of age (<50, 50-64, 65-79, 80+). Compute percent and cumulative percent for each age category. (4 points) Label all elements in the frequency table. (2 points) Create a column chart of age (showing the number of cases in each age category). (1 point) Create a line chart of age (showing the number of cases in each age category). (1 point) Create a pie chart of age (showing the percent of cases in each age category). (1 point) Using Late Delivery data in dataset DS2.xls , do the following: Use the pivot table to create a frequency distribution for the reasons for the late delivery of the meal. (4 points) Sort the frequency distribution so that the reason with the most occurrences is first, the second next, and so on, and create a column chart showing the reasons from most to least. (3 points) Compute the cumulative frequency (%) for the data in (a) and construct a Pareto chart of the result. That is, create a line chart for cumulative frequency (%) and add this to the chart created in part 6.b. (3 points)