XGBoost
- XGBoost stands for eXtreme Gradient Boosting
Steps:
- Get the initial prediction; unlike Gradient Boosting, it is always 0.5
- Start with one node which has all the residuals of the datapoints
- Get the similarity score for that node: Similarity = (sum of residuals)² / (number of residuals + λ)
- Split the node
- Get the similarity score for each leaf
- Calculate the gain for that split: Gain = Left similarity + Right similarity − Root similarity
- Go to step 4 and continue splitting until a predetermined depth is reached (usually 6)
- Prune the tree
- Calculate Gain − γ for the lowest branch; if it is negative, remove the branch
- Continue moving up until a branch has a positive value
- Calculate the output value for each remaining leaf: Output = (sum of residuals) / (number of residuals + λ)
- Get the new predictions: new prediction = previous prediction + learning rate (η) × output value
- Go to step 2, until a predetermined number of estimators is reached
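The steps above can be sketched in plain Python. This is a toy example with made-up data, depth-1 trees (single split per round) for brevity, and assumed hyperparameters λ = 1, γ = 0, η = 0.3:

```python
lam, gamma, eta, n_trees = 1.0, 0.0, 0.3, 10   # assumed toy hyperparameters

x = [1.0, 2.0, 3.0, 4.0, 5.0]                  # made-up training data
y = [1.1, 1.9, 3.2, 4.1, 4.8]

def similarity(residuals):
    # Similarity score = (sum of residuals)^2 / (count + lambda)
    return sum(residuals) ** 2 / (len(residuals) + lam)

pred = [0.5] * len(x)                          # initial prediction is always 0.5
trees = []
for _ in range(n_trees):
    res = [yi - pi for yi, pi in zip(y, pred)]  # residuals of the current fit
    root_sim = similarity(res)
    # try every candidate split point, keep the one with the highest gain
    best = None
    for t in sorted(set(x))[1:]:
        left = [r for xi, r in zip(x, res) if xi < t]
        right = [r for xi, r in zip(x, res) if xi >= t]
        gain = similarity(left) + similarity(right) - root_sim
        if best is None or gain > best[0]:
            best = (gain, t, left, right)
    gain, t, left, right = best
    if gain - gamma < 0:                       # pruning: negative gain - gamma drops the split
        break
    # output value of a leaf = sum of residuals / (count + lambda)
    out_l = sum(left) / (len(left) + lam)
    out_r = sum(right) / (len(right) + lam)
    trees.append((t, out_l, out_r))
    # new prediction = previous prediction + eta * leaf output
    pred = [p + eta * (out_l if xi < t else out_r) for p, xi in zip(pred, x)]

# after boosting, pred has moved from 0.5 toward y
```

Each round fits a stump to the current residuals, so the predictions move toward the targets a fraction (η) at a time; λ shrinks leaf outputs and γ discourages weak splits.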
Pros:
- Good at Handling Missing Data
- Performs well on datasets ranging from small to large and complex
Cons:
- Bad at Handling Outliers
How XGBoost Handles Missing Data
For each missing value, XGBoost pushes it in a default direction at each split of the decision tree, and it learns the best default direction during training.
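A minimal sketch of how that default direction could be learned, using the similarity scores from the steps above (this is a hypothetical helper for illustration, not XGBoost's actual source): for a candidate split, send the rows with missing values left, then right, and keep whichever direction yields the higher combined similarity (the root similarity is the same either way, so it can be ignored when comparing).

```python
import math

lam = 1.0  # assumed regularization parameter

def similarity(residuals):
    # Similarity score = (sum of residuals)^2 / (count + lambda)
    return sum(residuals) ** 2 / (len(residuals) + lam)

def best_default_direction(xs, residuals, threshold):
    """Pick the default direction for missing values at one split."""
    known = [(xi, r) for xi, r in zip(xs, residuals) if not math.isnan(xi)]
    missing = [r for xi, r in zip(xs, residuals) if math.isnan(xi)]
    left = [r for xi, r in known if xi < threshold]
    right = [r for xi, r in known if xi >= threshold]
    # score the split with missing rows sent left vs sent right
    score_left = similarity(left + missing) + similarity(right)
    score_right = similarity(left) + similarity(right + missing)
    return "left" if score_left >= score_right else "right"

xs = [1.0, 2.0, float("nan"), 4.0, 5.0]
res = [-0.9, -0.5, -0.7, 0.8, 1.2]
print(best_default_direction(xs, res, 3.0))  # → "left"
```

Here the missing row's residual (−0.7) resembles the left leaf's residuals, so grouping it left makes that leaf's residual sum larger in magnitude and the similarity score higher; the learned default direction is "left".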