QuBlitz: Optimal Control of Traffic Signals Using Quantum Annealing
"Reinforcement learning and traffic control" - Prof. Edouard Ivanjko
Stabilizing Q-learning with Weighted Bellman Losses
Using Reinforcement Learning to Control Traffic Signals in a Real World Scenario An Approach Based
ET3 N Step TD Backward View
Asynchronous Advantage Actor Critic (A3C) Traffic Signal Controller
The Power of Exploiter: Provable Multi-Agent RL in Large State Spaces
RL Unplugged: Benchmarks for Offline Reinforcement Learning
Approximate planning and learning in partially observed systems
DeepMind x UCL RL Lecture Series - Deep Reinforcement Learning #2 [13/13]
CoastRunners 7
Csaba Szepesvari: "Model misspecification in reinforcement learning"
Reinforcement Learning Fundamentals
Novel First Order Bayesian Optimization with an Application to Reinforcement Learning
Temporal Difference Learning - Reinforcement Learning Chapter 6
Decentralized Reinforcement Learning In-Walk Kicks using Finite Basis Functions
Piotr Januszewski - Planning in Deep Reinforcement Learning - PyCode Conference 2018
Q学習:モデルフリー強化学習と時間差分学習
Reconciling Reinforcement Learning: Optimization, Generalization, and Exploration -- Part 1 of 4
Tutorial 7 - Reinforcement Learning | Deep Learning on Hardware Accelerators