Large Language Models: A Survey
Large Language Models explained briefly
What are RLVR environments for LLMs? | Policy - Rollouts - Rubrics
Agentic Reinforcement Learning (RL) for Large Language Models (LLM).Markov Decision Processes (MDPs)
Reinforcement Learning (RL) for Large Reasoning Models (LRM/ LLM): A Survey.
Large Language Model Agent: A Survey... (Mar 2025)
DSFP Session 19: Reinforcement Learning Part I
Survey: Agentic RL for LLMs Explained
エージェントのための強化学習 - モルガン・スタンレーのML研究者、ウィル・ブラウン
From System 1 to System 2: A Survey of Reasoning Large Language Models (January 2025)
Survey: RL for Large Reasoning Models (LRMs)
[2024 Best AI Paper] Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontier
A Survey of Reinforcement Learning for Large Reasoning Models (Sep 2025)
A Survey of Reinforcement Learning for Large Reasoning Models
大規模言語モデルのセキュリティとプライバシーの課題:調査
A Survey on Post-training of Large Language Models
The SHOCKING Reality of Agentic Reinforcement Learning for LLMs
LLMのためのエージェント強化学習の展望:調査
Arshad presents: The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey