Mean Squared Error (MSE)
- AKA MSE
 - AKA L2-loss
 - MSE is used in Regression problems
 - Sensitive to large errors for the squaring
 - Gradient decreases as loss gets closer to 0
 
 Mean Squared Error Formula
Pros:
- Convex function, so one Global Minima
 - No Local Minima
 - If error is big, penalizes higher by squaring them
 - Good for predicting outliers as give more weight to them
 
Cons:
- Bad at Handling Outliers as it will give more weights to the outliers
- better to use Mean Absolute Error (MAE)
 
 - Same error for one big error and many small errors