General Remarks
A Note About Gradients in Classification Problems
For the gradient boosting packages, we have to calculate the gradient of the loss function with respect to the marginal probabilities.
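In this case, writing $z$ for the marginal (raw, pre-sigmoid) score, we must calculate

$$\frac{\partial L}{\partial z} = \frac{\partial L}{\partial \hat{y}} \, \frac{\partial \hat{y}}{\partial z}$$

The Hessian is calculated similarly, as the second derivative with respect to the same quantity:

$$\frac{\partial^2 L}{\partial z^2}$$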
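Here $\hat{y}$ is the sigmoid function of the marginal score $z$, unless stated otherwise:

$$\hat{y} = \sigma(z) = \frac{1}{1 + e^{-z}}$$

We will make use of the following property of the sigmoid when calculating the gradients and Hessians:

$$\frac{\partial \hat{y}}{\partial z} = \hat{y} \, (1 - \hat{y})$$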
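To make the remarks above concrete, here is a minimal sketch that computes a gradient and Hessian with respect to the raw score and checks them against finite differences. It assumes binary log loss as the loss function, which is an illustrative choice and not something fixed by the text above. The names `z`, `y`, `sigmoid`, `logloss` and `logloss_grad_hess` are hypothetical helpers, not part of any gradient boosting package.

```python
import numpy as np

def sigmoid(z):
    """Map a raw score z to a probability y_hat."""
    return 1.0 / (1.0 + np.exp(-z))

def logloss(z, y):
    """Binary log loss as a function of the raw score z (assumed loss)."""
    p = sigmoid(z)
    return -(y * np.log(p) + (1.0 - y) * np.log(1.0 - p))

def logloss_grad_hess(z, y):
    """First and second derivative of the log loss with respect to z.

    Both follow from the sigmoid property d(y_hat)/dz = y_hat * (1 - y_hat).
    """
    p = sigmoid(z)
    grad = p - y            # dL/dz
    hess = p * (1.0 - p)    # d^2L/dz^2
    return grad, hess

# Hypothetical raw scores and labels, used only for the check below.
z = np.array([-2.0, 0.0, 1.5])
y = np.array([0.0, 1.0, 1.0])
grad, hess = logloss_grad_hess(z, y)

# Central finite differences as an independent check of the formulas.
eps = 1e-4
num_grad = (logloss(z + eps, y) - logloss(z - eps, y)) / (2 * eps)
num_hess = (logloss(z + eps, y) - 2 * logloss(z, y) + logloss(z - eps, y)) / eps**2
```

The analytic and numerical values should agree to within the finite-difference error. The same check can be reused for other loss functions by swapping out `logloss` and the two derivative expressions.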