As shown in the picture:
Could someone help me understand what exactly delta means in the gradient descent algorithm?
The term is the derivative of the cost function with respect to theta 0.
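A small sketch may help make this concrete. The cost function, the names `cost`, `derivative` and `gradient_descent`, and the learning rate `alpha` are all made up for illustration, they are not taken from the picture. It only assumes the usual update rule where the derivative term, scaled by a learning rate, is subtracted from theta 0 at each step:

```python
def cost(theta0):
    # Hypothetical one-parameter cost: a parabola with its minimum at theta0 = 3.
    return (theta0 - 3.0) ** 2


def derivative(theta0, eps=1e-6):
    # Central finite-difference estimate of dJ/d(theta0).
    return (cost(theta0 + eps) - cost(theta0 - eps)) / (2 * eps)


def gradient_descent(theta0, alpha=0.1, steps=100):
    # Repeatedly move theta0 against the slope of the cost function.
    for _ in range(steps):
        theta0 = theta0 - alpha * derivative(theta0)
    return theta0


result = gradient_descent(theta0=0.0)
```

Left of the minimum the derivative is negative, so subtracting it increases theta 0; right of the minimum it is positive, so subtracting it decreases theta 0. In both cases theta 0 moves toward the minimum of this toy cost.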
Think of theta as a coordinate on the X-axis (let it be A).
The derivative is used to control two aspects of the cost function (J function) minimization: