Results: Q-function and Q-learning algorithm in machine learning