Blog
PART 2: Analyzing and Interpreting Data (7.5%) / 45Submit Part 2 of your Statistics
PART 2: Analyzing and Interpreting Data (7.5%) / 45
Submit Part 2 of your Statistics Project to SLATE by Sunday, July 18th at 11:59PM.
Describing Data – Descriptive Statistics using Microsoft Excel
For questions 1 to 2, open-up your Excel Workbook from Part 1. Your Excel file should already contain
3 sheets. Add two more sheets as follows.
• Sheet 1 – “Raw Data”
• Sheet 2 – “Qualitative Variable”
• Sheet 3 – “Quantitative Variable”
• Sheet 4 – “Summary Statistics”
• Sheet 5 – “Regression and Scatter Diagram”
1) (26 marks – 1 mark for each statistic) On Sheet 4 – “Summary Statistics”, calculate the following
summary statistics for your two quantitative data sets (be sure to include labels to identify which is
which!). For all summary statistics, you must use Excel functions (e.g. =AVERAGE(A1:A10))
or formulae (e.g. =A2/C3*100) – values entered manually will receive a mark of ZERO:
• Mean, Median, Mode
• Sample Variance, Standard Deviation, and Coefficient of Variation
• Quartiles (Q0, Q1, Q2, Q3, Q4), Range, Inter-Quartile Range (IQR)
2) (9 marks) On Sheet 5 – “Regression and Scatter Diagram”, copy and paste your two quantitative
variables to compare in a correlation analysis.
a) (5 marks) Construct a scatter diagram to compare the two data sets. Be sure to add an
appropriate chart title, delete the legend, and label both axes appropriately.
b) (3 marks) Use Excel functions to add a linear trend line, regression equation, and 𝑟𝑟2 value to
your scatter diagram.
c) (1 mark) Compute the correlation coefficient 𝑟𝑟 for the bivariate data using the CORREL() function.
Interpreting Data – Written Report using Microsoft Word
For questions 3 to 7, write your responses using full sentences. Be sure to label your answers.
3) (2 marks) Consider the measures of central tendency for a quantitative variable of your choosing.
Which measure of central tendency best represents your data set? Explain why.
4) (2 marks) For the variable above, explain what the standard deviation tells you about the spread of
the data set.
5) (2 marks) For the scatter diagram, which variables have you identified as the independent and
dependent variables. Explain your reasoning.
6) (2 marks) Interpret the meaning of the regression equation coefficients as well as the 𝑟𝑟2 value for
the regression model you constructed.
7) (2 marks) Interpret the meaning of the correlation coefficient, 𝑟𝑟 in the context of your study.