Results: reinforcement learning code in Python