結果 : reinforcement learning sample code