# SPSS Homework problems

PM 510 Homework 2Problems not marked with [SPSS] should be done by hand, as problems similar to those could be

on the Midterm and/or Final. (Of course, feel free to check your work using SPSS!)

Problems marked with [SPSS] are intended to be done with SPSS. For these problems, please

attach the output file.

1. A study was conducted investigating the long-term prognosis of children who have suffered

an acute episode of bacterial meningitis, an inflammation of the membranes enclosing the

brain and spinal cord. Listed below are the times to the onset of seizure for 13 children who

took part in the study. In months, the measurements are:

0.10 0.25 0.50 4 12 12 24 24 31 36 42 55 96

Find the following numerical summary measures of the data:

a) Mean

b) Median

c) Mode

d) Range

e) Interquartile Range

f) Standard Deviation

g) How many standard deviations away from the mean is a child whose time to the onset of

seizure was 50 months? (Note: for the purpose of this problem, please assume that the

population standard deviation is the same as the sample standard deviation.)

h) What proportion of children have an onset to seizure time of 50 or more months?

i) What proportion of children have an onset to seizure time between the mean and 50

months?

j) Calculate a 95% confidence interval around the mean assuming that the data are normally

distributed with a known population variance of 20

k) Calculate a 95% confidence interval around the mean assuming that the data are normally

distributed with an unknown population variance.

l) Calculate a 99% confidence interval around the mean assuming that the data are normally

distributed with an unknown population variance.

2. [SPSS] A study was conducted comparing female adolescents who suffer from bulimia to

healthy females with similar body compositions and levels of physical activity. The file

bulimia.sav contains measures of daily caloric intake, recorded in kilocalories per kilogram, for

samples of adolescents from each group.

a) Find the median daily caloric intake for both the bulimic adolescents and the healthy ones.

b) Compute the IQR for each group.

c) Construct box-and-whisker plots for each group.

d) Describe the shape of the observed distribution for each group. Do you think that the

sampled data come from a population with a normal distribution? Why or why not?

e) Describe the qualitative differences between the two groups based on the box-and-whisker

plots. (For example, which average is higher? Which group has more variability? Are

there outlying values in either group?)

3. [SPSS] The declared concentrations of nicotine in milligrams for 35 brands of Canadian

cigarettes are saved under the variable name nicotine in the file cigarett.sav.

a) Find the mean and median concentrations of nicotine.

b) Produce a histogram of the nicotine measurements. Describe the shape of the observed

distribution. Do you think that the sampled data come from a population with a normal

distribution? Why or why not?

c) Which number do you think provides the best measure of central tendency for these

concentrations, the mean or the median? Why?

4. [SPSS] The data set lowbwt.sav contains information for the sample of 100 low birth weight

infants born in Boston, Massachusetts. This data set contains information on the infants,

including systolic blood pressure (SBP), gender, and gestational age of the infant, as well as

APGAR score at 5 minutes, toxemia diagnosis for mother and germinal matrix hemorrhage.

a) Run descriptive statistics in SPSS on all numeric variables, including all possible

dispersion statistics, as well as skewness and kurtosis. Attach the output file.

b) Use SPSS to provide a 95% confidence interval around the mean and show the quartiles

(25th, 50th, 75th percentiles) for each numeric variable. Attach the output file (can be all

one file).

c) Create frequency tables in SPSS of all categorical tables. Attach the output file (can be all

one file).

d) Create a cross-tabulation table of toxemia diagnosis for mother and germinal matrix

hemorrhage, including the expected frequencies and column percentages. Attach the

output file (can be all one file).

5. [SPSS] The dataset pulse_example.sav contains data examining the mean pulse rate of students

taking a midterm for PM 510. Two TAs each measured the pulse rate of 10 students taking the

midterm in the class after 1 hour. Each TA selects 10 students at random. Let 𝜇 represent the

true (population) mean pulse of the students taking the PM 510 midterm.

a) Calculate the 90% confidence interval for 𝜇 based on the data collected by the 1st TA.

b) Calculate the 90% confidence interval for 𝜇 based on the data collected by the 2nd TA.

c) Interpret the confidence intervals.

d) Compare the two confidence intervals. Give some possible reasons why they are different.

6. A library wants to determine the effectiveness of their summer literacy program among lowincome children. Because surveying the large numbers of students in the program would

require too many resources the library staff interviews 30 randomly chosen children among the

low-income program attendees. The 30 sampled children are given a reading test before and

after the program.

(a) Describe the population of this study.

(b) The difference in the reading test scores (after – before) has mean = 10 and SD = 4.

Assuming the score differences are normally distributed, what percent of the children

showed any improvement (difference > 0) in reading ability?

(c) What percent of children improved by more than 15 points?

7. [SPSS] Use SPSS (use a blank dataset) to calculate the following probabilities: Consider the

standard normal distribution with mean μ = 0 and standard deviation 𝜎 = 1. Provide the

answers to each question and attach the output file.

a) What is the probability that an outcome z is < -2.05?
b) What is the probability that an outcome z is > 1.82?

c) What is the probability that an outcome z is > -1.82?

d) What is the probability that an outcome z is between –2.28 and 1.92?

e) What value of z cuts off the upper 30% of the standard normal distribution?

f) What value of z cuts off the lower 8% of the standard normal distribution?

## We've got everything to become your favourite writing service

### Money back guarantee

Your money is safe. Even if we fail to satisfy your expectations, you can always request a refund and get your money back.

### Confidentiality

We don’t share your private information with anyone. What happens on our website stays on our website.

### Our service is legit

We provide you with a sample paper on the topic you need, and this kind of academic assistance is perfectly legitimate.

### Get a plagiarism-free paper

We check every paper with our plagiarism-detection software, so you get a unique paper written for your particular purposes.

### We can help with urgent tasks

Need a paper tomorrow? We can write it even while you’re sleeping. Place an order now and get your paper in 8 hours.

### Pay a fair price

Our prices depend on urgency. If you want a cheap essay, place your order in advance. Our prices start from $11 per page.