結果 : optimization vs reinforcement learning