# PS 5841 Ashford University MSE Bias Variance of Statistical Learning Questions

ACTU PS5841 Data Science in Finance and Insurance - Autumn 2019
Dr. Yubo Wang
Assignment-1
Assigned 9/5/19, Due 9/17/19 (Tue)
Problem 1. Statistical Learning
Suppose the observed data are generated by
$$y = 1 + 2x + \varepsilon, \qquad x \in [-50, 50], \qquad \varepsilon \sim N(\mu = 0, \sigma^2 = 10^2).$$
Using your preferred data analysis tool (a spreadsheet may be useful to many at this stage), demonstrate
numerically that a simple linear regression model $\hat{y} = \hat{\beta}_0 + \hat{\beta}_1 x$ is able to learn.
[a] Specifically, using a test set of size 100 and training sets of sizes 30, 100, 200, and 300,
numerically estimate the corresponding expected test MSE and complete the following table.

| Training set size | Expected test MSE |
| --- | --- |
| 30 | |
| 100 | |
| 200 | |
| 300 | |
[b] Please also provide a plot of the expected test MSE against the training set size.
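
One way to approach parts [a] and [b] outside a spreadsheet is a small Monte Carlo simulation. The sketch below is illustrative only and rests on assumptions the assignment does not state: x is drawn uniformly from [-50, 50], a fresh test set of size 100 is drawn in every replication, and the number of replications (`reps`) and the seed are arbitrary choices. The function names are hypothetical.

```python
import random

def simulate(n, rng):
    """Draw n points from y = 1 + 2x + eps, with eps ~ N(0, 10^2).
    Assumption: x is uniform on [-50, 50]; the assignment only gives the range."""
    xs = [rng.uniform(-50, 50) for _ in range(n)]
    ys = [1 + 2 * x + rng.gauss(0, 10) for x in xs]
    return xs, ys

def fit_line(xs, ys):
    """Least-squares intercept and slope for simple linear regression."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b1 = sxy / sxx
    return my - b1 * mx, b1

def expected_test_mse(n_train, n_test=100, reps=500, seed=0):
    """Average the test MSE over many independent training/test draws."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(reps):
        b0, b1 = fit_line(*simulate(n_train, rng))
        xt, yt = simulate(n_test, rng)
        total += sum((y - (b0 + b1 * x)) ** 2 for x, y in zip(xt, yt)) / n_test
    return total / reps

results = {n: expected_test_mse(n) for n in (30, 100, 200, 300)}
for n, mse in results.items():
    print(f"{n:>4d}  {mse:8.2f}")
```

The estimates should sit a little above the noise variance of 100 and move toward it as the training set grows, which is the sense in which the model "learns". Plotting `results` against the training set size gives the figure requested in [b].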
Problem 2. Bias-Variance Trade-off
Suppose the observed data are generated by
$$y = \frac{x}{\sqrt{1 + x^2}} + \varepsilon, \qquad x \in [-25, 25], \qquad \varepsilon \sim N(\mu = 0, \sigma^2 = 0.5^2).$$
Suppose you use polynomial regressions $\hat{y} = \sum_{k=0}^{n} \hat{\beta}_k x^k$, $n = 1, 2, \ldots, 6$, to learn from data and make
predictions.
Using your preferred data analysis tool (a spreadsheet may be useful to many at this stage), numerically
demonstrate the trade-off between bias and variance.
Specifically, use 300 training sets and test them on the test set associated with $x = -20, -10, 0, 10, 20$.
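[a] Please complete the following table with your estimates to demonstrate that the variance-bias
decomposition roughly holds for each model. Here LHS is the expected test MSE and RHS is the sum of the
squared bias, the variance, and the variance of the error term.

| Degree n | Expected test MSE | Squared bias | Variance | Variance of error term | LHS - RHS |
| --- | --- | --- | --- | --- | --- |
| 1 | | | | | |
| 2 | | | | | |
| 3 | | | | | |
| 4 | | | | | |
| 5 | | | | | |
| 6 | | | | | |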
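
The table can also be filled in by simulation. The sketch below is a rough illustration, not the required method, and it makes several assumptions that are not in the assignment: the target function is read as f(x) = x / sqrt(1 + x^2), x is uniform on [-25, 25], each of the 300 training sets has 100 points (`n_train`), and x is rescaled by 1/25 inside the fit purely for numerical stability. All function names are hypothetical. It checks that expected test MSE is approximately squared bias + variance + error variance at the five test points.

```python
import math
import random

def f(x):
    # Assumed reading of the data-generating function: x / sqrt(1 + x^2).
    return x / math.sqrt(1 + x * x)

def solve(a, b):
    """Solve the linear system a z = b by Gaussian elimination with partial pivoting."""
    m = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(a)]
    for c in range(m):
        p = max(range(c, m), key=lambda r: abs(aug[r][c]))
        aug[c], aug[p] = aug[p], aug[c]
        for r in range(c + 1, m):
            w = aug[r][c] / aug[c][c]
            for k in range(c, m + 1):
                aug[r][k] -= w * aug[c][k]
    z = [0.0] * m
    for r in range(m - 1, -1, -1):
        z[r] = (aug[r][m] - sum(aug[r][k] * z[k] for k in range(r + 1, m))) / aug[r][r]
    return z

def fit_poly(xs, ys, deg):
    """Least-squares polynomial fit via the normal equations (x scaled by 1/25)."""
    rows = [[(x / 25) ** k for k in range(deg + 1)] for x in xs]
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(deg + 1)] for i in range(deg + 1)]
    xty = [sum(r[i] * y for r, y in zip(rows, ys)) for i in range(deg + 1)]
    return solve(xtx, xty)

def predict(coef, x):
    return sum(c * (x / 25) ** k for k, c in enumerate(coef))

def decompose(deg, n_sets=300, n_train=100, sigma=0.5, seed=0):
    """Return (expected test MSE, squared bias, variance, error variance)."""
    rng = random.Random(seed)
    x_test = [-20, -10, 0, 10, 20]
    preds = [[] for _ in x_test]  # predictions at each test point, one per training set
    sq_err = 0.0
    for _ in range(n_sets):
        xs = [rng.uniform(-25, 25) for _ in range(n_train)]
        ys = [f(x) + rng.gauss(0, sigma) for x in xs]
        coef = fit_poly(xs, ys, deg)
        for i, x0 in enumerate(x_test):
            p = predict(coef, x0)
            preds[i].append(p)
            sq_err += (f(x0) + rng.gauss(0, sigma) - p) ** 2  # fresh noisy test response
    mse = sq_err / (n_sets * len(x_test))
    means = [sum(p) / n_sets for p in preds]
    bias2 = sum((m - f(x0)) ** 2 for m, x0 in zip(means, x_test)) / len(x_test)
    var = sum(sum((v - m) ** 2 for v in p) / n_sets for p, m in zip(preds, means)) / len(x_test)
    return mse, bias2, var, sigma ** 2

table = {deg: decompose(deg) for deg in range(1, 7)}
for deg, (mse, bias2, var, noise) in table.items():
    print(deg, round(mse, 4), round(bias2, 4), round(var, 4), noise,
          round(mse - bias2 - var - noise, 4))
```

The last printed column is the LHS - RHS gap. It should be close to zero but not exactly zero, because the test responses carry their own simulated noise.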
Please see notes on linear model and on Excel on the next page.
Notes on linear model
For a linear model $\hat{Y} = \hat{\beta}_0 + X^\top \hat{\boldsymbol{\beta}}$, the coefficients based on least squares estimation are
$$(\hat{\beta}_0, \hat{\boldsymbol{\beta}}^\top)^\top = (\mathbf{X}^\top \mathbf{X})^{-1} \mathbf{X}^\top \mathbf{Y},$$
where $\mathbf{X} = (\mathbf{1}, \mathbf{x}_1, \ldots, \mathbf{x}_p)$ with $\mathbf{x}_j = (x_{1j}, \ldots, x_{nj})^\top$, and $\mathbf{Y} = (y_1, \ldots, y_n)^\top$.
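
As a small illustration of the least squares formula, the sketch below builds the design matrix with a leading column of ones and evaluates the matrix product directly. The data are hypothetical and noiseless, generated from y = 1 + 2*x1 - 3*x2, so the recovered coefficients should match (1, 2, -3) up to rounding. The helper names are made up for this example, and forming an explicit inverse is done here only to mirror the formula; it is not the numerically preferred route for larger problems.

```python
def transpose(a):
    return [list(col) for col in zip(*a)]

def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def inverse(a):
    """Gauss-Jordan inverse with partial pivoting (the role MINVERSE plays in Excel)."""
    n = len(a)
    aug = [row[:] + [float(i == j) for j in range(n)] for i, row in enumerate(a)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(aug[r][c]))
        aug[c], aug[p] = aug[p], aug[c]
        piv = aug[c][c]
        aug[c] = [v / piv for v in aug[c]]
        for r in range(n):
            if r != c:
                w = aug[r][c]
                aug[r] = [v - w * u for v, u in zip(aug[r], aug[c])]
    return [row[n:] for row in aug]

# Hypothetical noiseless data from y = 1 + 2*x1 - 3*x2.
x1 = [0.0, 1.0, 2.0, 3.0, 4.0]
x2 = [1.0, 0.0, 2.0, 1.0, 3.0]
Y = [[1 + 2 * a - 3 * b] for a, b in zip(x1, x2)]   # column vector of responses
X = [[1.0, a, b] for a, b in zip(x1, x2)]           # leading column of ones for the intercept

Xt = transpose(X)
beta_hat = matmul(matmul(inverse(matmul(Xt, X)), Xt), Y)  # (X'X)^{-1} X'Y
print([round(b[0], 6) for b in beta_hat])
```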
Notes on Excel
Transposition: $A^\top$ = TRANSPOSE(A)
Inverse matrix: $A^{-1}$ = MINVERSE(A)
RAND() returns a number randomly sampled from [0, 1).
NORM.INV(probability, mean, stdev) returns the inverse of the normal cumulative distribution for
the specified mean and standard deviation.
Data->What-if analysis->Data Table is a convenient tool for automating repetitive tasks.
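
Feeding a uniform random number into the inverse normal CDF is how the two spreadsheet functions above are typically combined to simulate the error term. A minimal Python analogue, assuming the standard library's `statistics.NormalDist` as a stand-in for NORM.INV and `random.random()` as a stand-in for RAND(); the wrapper names `norm_inv` and `draw_normal` are hypothetical:

```python
import random
from statistics import NormalDist

def norm_inv(probability, mean, stdev):
    """Inverse normal CDF, analogous to a spreadsheet call NORM.INV(probability, mean, stdev)."""
    return NormalDist(mu=mean, sigma=stdev).inv_cdf(probability)

rng = random.Random(0)  # fixed seed so the draws are reproducible

def draw_normal(mean, stdev):
    u = rng.random()        # uniform on [0, 1), like RAND()
    while u == 0.0:         # inv_cdf is undefined at 0, so redraw in that rare case
        u = rng.random()
    return norm_inv(u, mean, stdev)

# Simulated error terms for Problem 1: mean 0, standard deviation 10.
sample = [draw_normal(0, 10) for _ in range(10000)]
print(sum(sample) / len(sample))
```

The sample mean should be near 0 and the sample standard deviation near 10, which is a quick sanity check before using the draws in a larger simulation.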
[Figure: Bias vs Variance. Prediction error for the training sample and the test sample plotted against model complexity (low to high). Low complexity: high bias, low variance. High complexity: low bias, high variance.]
