Linear Regression

Linear Regression Formula

g(x)=αc+βα=ixiyiixiiyiixi2(ixi)2β=1niyiα1nixiLoss=i(g(xi)yi)

Advantages of Linear Regression

  1. Simple
  2. Works pretty well for linearly separable data
  3. Interpretable

Disadvantages of Linear Regression

  1. Lot of feature engineering is needed
  2. Bad at Handling Missing Data
  3. Bad at Handling Outliers

Basic Assumptions of Linear Regression

  1. Linearity: The relation between the independent variables (X) and dependent variables (Y) has to be linear.
  2. No or little Multicollinearity: The independent variables (X) should not be correlated to each other pair wise.
  3. Normality of Residuals: The residuals should be normally distributed, it will help ensure unbiased predictions and more confidence.
  4. Independence: The observations (X, Y) are independent to each other.
  5. Homoscedasticity: The variance of the errors is constant across all levels of the independent variable.


Related Notes