SPSS Homework problems
PM 510 Homework 2Problems not marked with [SPSS] should be done by hand, as problems similar to those could be
on the Midterm and/or Final. (Of course, feel free to check your work using SPSS!)
Problems marked with [SPSS] are intended to be done with SPSS. For these problems, please
attach the output file.
1. A study was conducted investigating the long-term prognosis of children who have suffered
an acute episode of bacterial meningitis, an inflammation of the membranes enclosing the
brain and spinal cord. Listed below are the times to the onset of seizure for 13 children who
took part in the study. In months, the measurements are:
0.10 0.25 0.50 4 12 12 24 24 31 36 42 55 96
Find the following numerical summary measures of the data:
a) Mean
b) Median
c) Mode
d) Range
e) Interquartile Range
f) Standard Deviation
g) How many standard deviations away from the mean is a child whose time to the onset of
seizure was 50 months? (Note: for the purpose of this problem, please assume that the
population standard deviation is the same as the sample standard deviation.)
h) What proportion of children have an onset to seizure time of 50 or more months?
i) What proportion of children have an onset to seizure time between the mean and 50
months?
j) Calculate a 95% confidence interval around the mean assuming that the data are normally
distributed with a known population variance of 20
k) Calculate a 95% confidence interval around the mean assuming that the data are normally
distributed with an unknown population variance.
l) Calculate a 99% confidence interval around the mean assuming that the data are normally
distributed with an unknown population variance.
2. [SPSS] A study was conducted comparing female adolescents who suffer from bulimia to
healthy females with similar body compositions and levels of physical activity. The file
bulimia.sav contains measures of daily caloric intake, recorded in kilocalories per kilogram, for
samples of adolescents from each group.
a) Find the median daily caloric intake for both the bulimic adolescents and the healthy ones.
b) Compute the IQR for each group.
c) Construct box-and-whisker plots for each group.
d) Describe the shape of the observed distribution for each group. Do you think that the
sampled data come from a population with a normal distribution? Why or why not?
e) Describe the qualitative differences between the two groups based on the box-and-whisker
plots. (For example, which average is higher? Which group has more variability? Are
there outlying values in either group?)
3. [SPSS] The declared concentrations of nicotine in milligrams for 35 brands of Canadian
cigarettes are saved under the variable name nicotine in the file cigarett.sav.
a) Find the mean and median concentrations of nicotine.
b) Produce a histogram of the nicotine measurements. Describe the shape of the observed
distribution. Do you think that the sampled data come from a population with a normal
distribution? Why or why not?
c) Which number do you think provides the best measure of central tendency for these
concentrations, the mean or the median? Why?
4. [SPSS] The data set lowbwt.sav contains information for the sample of 100 low birth weight
infants born in Boston, Massachusetts. This data set contains information on the infants,
including systolic blood pressure (SBP), gender, and gestational age of the infant, as well as
APGAR score at 5 minutes, toxemia diagnosis for mother and germinal matrix hemorrhage.
a) Run descriptive statistics in SPSS on all numeric variables, including all possible
dispersion statistics, as well as skewness and kurtosis. Attach the output file.
b) Use SPSS to provide a 95% confidence interval around the mean and show the quartiles
(25th, 50th, 75th percentiles) for each numeric variable. Attach the output file (can be all
one file).
c) Create frequency tables in SPSS of all categorical tables. Attach the output file (can be all
one file).
d) Create a cross-tabulation table of toxemia diagnosis for mother and germinal matrix
hemorrhage, including the expected frequencies and column percentages. Attach the
output file (can be all one file).
5. [SPSS] The dataset pulse_example.sav contains data examining the mean pulse rate of students
taking a midterm for PM 510. Two TAs each measured the pulse rate of 10 students taking the
midterm in the class after 1 hour. Each TA selects 10 students at random. Let 𝜇 represent the
true (population) mean pulse of the students taking the PM 510 midterm.
a) Calculate the 90% confidence interval for 𝜇 based on the data collected by the 1st TA.
b) Calculate the 90% confidence interval for 𝜇 based on the data collected by the 2nd TA.
c) Interpret the confidence intervals.
d) Compare the two confidence intervals. Give some possible reasons why they are different.
6. A library wants to determine the effectiveness of their summer literacy program among lowincome children. Because surveying the large numbers of students in the program would
require too many resources the library staff interviews 30 randomly chosen children among the
low-income program attendees. The 30 sampled children are given a reading test before and
after the program.
(a) Describe the population of this study.
(b) The difference in the reading test scores (after – before) has mean = 10 and SD = 4.
Assuming the score differences are normally distributed, what percent of the children
showed any improvement (difference > 0) in reading ability?
(c) What percent of children improved by more than 15 points?
7. [SPSS] Use SPSS (use a blank dataset) to calculate the following probabilities: Consider the
standard normal distribution with mean μ = 0 and standard deviation 𝜎 = 1. Provide the
answers to each question and attach the output file.
a) What is the probability that an outcome z is < -2.05?
b) What is the probability that an outcome z is > 1.82?
c) What is the probability that an outcome z is > -1.82?
d) What is the probability that an outcome z is between –2.28 and 1.92?
e) What value of z cuts off the upper 30% of the standard normal distribution?
f) What value of z cuts off the lower 8% of the standard normal distribution?
We've got everything to become your favourite writing service
Money back guarantee
Your money is safe. Even if we fail to satisfy your expectations, you can always request a refund and get your money back.
Confidentiality
We don’t share your private information with anyone. What happens on our website stays on our website.
Our service is legit
We provide you with a sample paper on the topic you need, and this kind of academic assistance is perfectly legitimate.
Get a plagiarism-free paper
We check every paper with our plagiarism-detection software, so you get a unique paper written for your particular purposes.
We can help with urgent tasks
Need a paper tomorrow? We can write it even while you’re sleeping. Place an order now and get your paper in 8 hours.
Pay a fair price
Our prices depend on urgency. If you want a cheap essay, place your order in advance. Our prices start from $11 per page.