Why do we subtract the slope * a in Gradient Descent?

Ok, I got it. But still, why is it MINUS?

Because your goal is to MINIMIZE J(θ).