Ridge regression 

When doing linear regression, if we include a variable that is highly correlated with our other regressors, we inflate the standard errors of the coefficients involved. This happens because, when two variables are correlated, the model cannot tell which of them the effect should be attributed to. Ridge regression allows us to model highly correlated regressors by introducing a bias. Our first instinct in statistics is to avoid biased coefficients at all costs, but they might not be that bad after all: if the coefficients are biased but have a much smaller variance than those of our baseline method, we end up in a better position. Unbiased coefficients with high variance change a lot between different model runs (they are unstable), but they converge in probability to the true values. Biased coefficients with low variance are quite stable across runs, but they do not converge in probability to where they should. So there is a clear trade-off, and in the end the question is whether the reduction in variance outweighs the increase in bias. What ends up happening with ridge regression is that the estimated coefficients are shrunk toward zero, but they have a lower variance than the corresponding ordinary least squares ones.
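To make the trade-off concrete, here is a minimal sketch in Python (using NumPy) that simulates two nearly identical regressors and compares the ordinary least squares estimate with the ridge estimate computed from the usual closed form (X'X + λI)⁻¹X'y. The simulated data and the penalty value λ = 10 are illustrative assumptions, not taken from the text.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data: x2 is almost a copy of x1, so the two regressors
# are highly correlated.
n = 200
x1 = rng.normal(size=n)
x2 = x1 + rng.normal(scale=0.05, size=n)
X = np.column_stack([x1, x2])
y = 1.0 * x1 + 1.0 * x2 + rng.normal(scale=1.0, size=n)

def ols(X, y):
    # Ordinary least squares: beta = (X'X)^{-1} X'y
    return np.linalg.solve(X.T @ X, X.T @ y)

def ridge(X, y, lam):
    # Ridge estimator: beta = (X'X + lam * I)^{-1} X'y
    p = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)

print("OLS coefficients:  ", ols(X, y))
print("Ridge coefficients:", ridge(X, y, lam=10.0))
```

With regressors this strongly correlated, the OLS coefficients tend to swing wildly from one simulated sample to the next, while the ridge coefficients stay close to a smaller common value, which is exactly the shrinkage and variance reduction described above.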
