Statistics Assignment

Statistics Assignment

STA10003 FOUNDATIONS OF STATISTICS ASSIGNMENT

This assignment is worth 20% of your final mark.

Scenario
You are a new graduate at a social science and psychological sciences research institute, and the lead researcher has given you a dataset to analyse. You will be graded based on the rubric attached to these instructions.

The data set is based on Australian victims of crime statistics from the Australian Bureau of Statistics [ABS] collected over the period from 2010 to 2014. The original data file contained over 165,000 observations extracted from various ABS catalogues[eg Cat. 4510 /Cat. 4530.0], and a representative subset of the original data containing 8292 observations [STA10003_Assignment_SP3_2015.sav] is located in the Assignment Information tab on Blackboard.

Data Preparation

Before attempting the Assignment questions, you must use SPSS to draw a random sample of 2000 [from the 8292 cases]. You will conduct your analysis on your sample of 2000 observations. Instructions on how to generate your random sample is attached. Note, however, that some variables contain missing values, so each of your analyses may not contain the entire 2000 cases.

Submission Instructions
• Your submission must be a single Word file or PDF file and should contain the relevant output.
• You must submit your file via the Turnitin link on Blackboard by the specified due date and time. Only the last document you submit will be retained by Turnitin.
• This is an individual assignment. Do not share your work with other students. They will have a different random sample of data, so any copying will be detected by Turnitin.

Video demonstrations
Videos showing how to prepare your data and how to submit via Turnitin are provided for you in the Assignment Information tab [ > Assignment Information folder] on Blackboard.

For your assignment, you are required to complete the following five [5] questions by producing the appropriate analyses and writing the relevant report for each question.
Note: For each question you should include the relevant output with your report.

Question 1

The variable ‘CRIME’ indicates the type of crime reported by Australian victims of crime during the period 2010 – 2014. Produce the relevant graph and table to summarise the ‘CRIME’ variable and write a paragraph explaining the key features of the data observed in the output in the style presented in the course materials.

Question 2

The variable ‘INCIDENTS’ measures the number of times that the respondent – or a member of their household – has been a victim of crime at the time this incident was reported. Produce the relevant graph and tables to summarise the ‘INCIDENTS’ variable and write a paragraph explaining the key features of the data observed in the output in the style presented in the course materials.

Question 3

The variable ‘WEAPONTYPE’ indicates the type of weapon used during the crime.
Previous research has shown that 79% of all crimes are committed without any type of weapon being used. It has been suggested that the percentage of crimes committed without a weapon being used is now lower than this.
Conduct a Binomial test using the ‘WEAPONTYPE’ variable to test this claim. Produce the relevant output and write a Binomial test report based on your output in the style presented in the course materials.

Question 4

The variable ‘AGE’ indicates the age of the respondent at the time the crime was committed.
Previous research has indicated that the average age of victims of crime is 24 years. It is expected that the average age of victims of crime is currently higher than this.
Conduct a One-sample t-test using the ‘AGE’ variable to test this claim. Produce the relevant output and write a One-sample t-test report based on your output in the style presented in the course materials.

Question 5

Researchers predict that the average number of incidents of crime is higher for females than for males. Conduct an Independent samples t-test using the ‘INCIDENTS’ and ‘SEX’ variables to test this claim. Produce the relevant output and write an Independent samples t-test report based on your output in the style presented in the course materials.
How to generate your random sample of 2000 observations:
1. Open the STA10003_Assignment_SP3_2015.sav data file. From the Transform drop-down menu, select Random Number Generators:

2. From the Random Number Generators dialogue box, click the Set Active Generator and Set Starting Point as shown below. Click OK:

3. From the Data drop-down menu, select Select Cases:

4. From the Select Cases dialogue box, choose Random Sample of Cases and then click the Sample button [the Sample button is in grey-scale until you select the Random sample of cases choice]:

5. From the Select Cases: Random Sample dialogue box, click Exactly and type 2000 cases from the first 8292.
[again the information is in grey-scale until you select ‘Exactly’. We want to generate a random sample of 2000 from the entire data set, so enter 2000 cases from the first 8292 cases]:

6. After clicking Continue [this returns you to the Select Cases Dialogue Box] you will see next to the Sample button confirmation that 2000 cases have been selected:

7. We can remove the unselected cases by clicking the Delete unselected cases button under the Output heading:

8. After clicking OK, your data set will now only show the 2000 cases selected.

You should now save the data file with a new name. The data file is ready to use for your Assignment!

Marking Details

Prior to submitting your Assignment via Turnitin, use the following checklist as a guide to ensure that you have provided all of the relevant information.

Q1 – Should include (as appropriate):
A graph and Frequency output appropriate for a categorical variable.
A paragraph that includes mention of largest group, smallest group, spread of percentages/proportions [as appropriate], and any other relevant patterns based on the Model Reports from the learning materials.

Q2 – Should include (as appropriate):
A graph and Explore output appropriate for a metric variable.
A paragraph that includes a description of the centre, spread, outliers if present, and shape of distribution based on the Model Reports from the learning materials.

Q3 to Q5 – Should each include (as appropriate):
Appropriate output and a report that conforms to the style presented in the learning materials

Checklist:
• Correct variable used to produce output
• Correct procedure performed
• Graphs appropriately edited and labelled (eg edited variable names; “Figure 1. The distribution of …”
• Correct test values used
• All figures quoted in report correct according to your own output
• Including 95% confidence interval interpretations where appropriate
• Significance interpreted correctly (ie check that you are not saying the finding is significant when it is not or vice versa)
• Correctly referring to the sample or population where appropriate
• Proof reading of reports for errors

STA10003 – Assignment Marking Rubric [out of 20]
Clear Fail (0) Fail (1) Pass (2) Credit / Distinction (3) High Distinction (4)
Summary of Categorical Variable Inappropriate graphs or tables used Appropriate graphs and tables for categorical data
No summary Appropriate graphs and tables for categorical data
Reasonable attempt to summarise the information shown in the output Appropriate graphs and tables for categorical data
Graphs and/or tables suitably edited for presentation where appropriate (eg variable names, figure title)
Written summary covers all relevant features of output Appropriate graphs and tables for categorical data
Graphs and/or tables suitably edited for presentation where appropriate (eg variable names, figure title)
Written summary covers all relevant features of output and is clearly and concisely written
Summary of Metric Variable Inappropriate graphs or tables used Appropriate graphs and tables for metric data.
No summary Appropriate graphs and tables for metric data.
Reasonable attempt to summarise the information shown in the output Appropriate graphs and tables for metric data.
Graphs and/or tables suitably edited for presentation where appropriate (eg variable names, figure title)
Written summary covers all relevant features of output. Appropriate graphs and tables for metric data.
Graphs and/or tables suitably edited for presentation where appropriate (eg variable names, figure title)
Written summary covers all relevant features of output and is clearly and concisely written
Binomial Test Incorrect output and/or no report Correct output
Major errors in report Correct output
Report presented following format used in course materials and textbook
No major errors in report. More than 1 or 2 minor errors Correct output
Report presented following format used in course materials and textbook
Only 1-2 minor errors in report Correct output.
Report presented following format used in course materials and textbook.
No errors / minor error only in report
One Sample t-test Incorrect output and/or no report Correct output
Major errors in report Correct output
Report presented following format used in course materials and textbook
No major errors in report. More than 1 or 2 minor errors Correct output
Report presented following format used in course materials and textbook
Only 1-2 minor errors in report Correct output
Report presented following format used in course materials and textbook.
No errors / minor error only in report
Independent Samples t-test Incorrect output and/or no report Correct output
Major errors in report Correct output
Report presented following format used in course materials and textbook
No major errors in report. More than 1 or 2 minor errors Correct output
Report presented following format used in course materials and textbook
Only 1-2 minor errors in report Correct output
Report presented following format used in course materials and textbook.
No errors / minor error only in report