Work should be typed but some parts may include some clear legible hand-written parts. Copy & paste selected SPSS output into a Word document. Show working and reasoning. Do not hand in duplicated or unrequested SPSS output. Marks will be deducted for inadequate explanation and poor presentation.

Question 1. This question relates to the following table extracted from an article by Ibrahim et al. published in the Medical Journal of Australia 2017.

GIVE CLEAR EXPLANATIONS TO ANSWERS

(a) Of the total external causes of death, for men and women separately, what proportion had an intentional cause of death reported? Give your answer to three decimal places.

(b) Combining individuals in the two lowest age groups (i.e. <65 and 65-74), what percentage of deaths were male? Give your answer to two decimal places.

(c)Of the other unintentional deaths what percentage of women died by either choking or asphyxia and aspiration? Give your answer to the nearest whole number.

Question 2. Consider the following SPSS stem-and-leaf plot for a variable fup for a sample of 473 adults.

(a) What is the inter-quartile range of fup for this sample of adults? Show working/reasoning.

(b)What percentage of adults have a fup greater than 1?

(c)Is the mean fup likely to be smaller or larger than the median fup in this instance? Explain your reasoning.

Question 3. Consider the following contingency table that displays summary statistics for the variable TUMOUR SIZE for each AGE GROUP and Gender combination.

Gender
F / M
AGE GROUP / Less than 40
40 to 60
Greater than
60 / TUMOUR
SIZE
TUMOUR
SIZE
TUMOUR
SIZE / Count / 33 / 35
Mean / 17.81 / 18.29
Standard Deviation / 8.03 / 7.92
Count / 121 / 136
Mean / 15.93 / 16.25
Standard Deviation / 6.82 / 7.28
Count / 82 / 66
Mean / 15.70 / 16.03
Standard Deviation / 6.59 / 7.83

Using this contingency table:

(a) Obtain the mean tumour size (to two decimal places) for all females combined. Show working/reasoning.

(b)Determine which of the 6 age group and gender combinations has the lowest coefficient of variation.

(c) Assuming the distribution of TUMOUR SIZE is unimodal and symmetric, construct an interval within which you would expect the middle 95% of TUMOUR SIZE values to lie, for people from this population who are males aged 40 years or less.

Question 4. This question relates to the file bsn81.sav. See the doc on “Description of datasets” for information on this dataset. Use SPSS and the file bsn81.sav to produce the requested output and answer the following questions.

(a) Produce histograms for FVC and AGE and comment on the shape of the histograms.

(b)Obtain a frequency distribution for EXERCISE and highlight in your output the percentage with EXERCISE=7 and the percentage with EXERCISE in the range 0-2 inclusive.

(c) Produce a scatter plot of FVC against AGE and use this plot to comment on the relationship between FVC and age for this sample of people.

(d) Produce side-by-side box plots of FVC for for people that do no exercise, low exercise (1-2 days), medium exercise (3-4 days) and high exercise (>4 days), where

EXERCISE=0 is no exercise days

1 ≤ EXERCISE ≤ 2 is low exercise days

3 ≤ EXERCISE ≤ 4 is medium exercise days

EXERCISE > 4 is high exercise days

Comment on the similarities and differences in distribution of FVC for these groups.

(e)Produce a single table that shows count and median FVC for the EXERCISE group created in (d), by ASTHMA. Highlight in your table the median FVC for those who have had ASTHMA and don’t exercise and the median FVC for those who have not had ASTHMA and don’t exercise. Comment on the relationship between the median FVC and EXERCISE group for each ASTHMA status.

(f)Produce a bar chart that shows percent of people who exercise more than 2 days per week by SMOKING group and ASTHMA status.

Make comments that compare the difference in prevalence of people who exercise greater than two days per week across SMOKING groups in people who have and have not had

ASTHMA.

Copy of the SPSS file bsn81.sav is attached

1