結果 : bayesian optimization vs reinforcement learning