結果 : does gradient descent minimize loss function