結果 : supervised fine tuning vs reinforcement learning
13:56

強化微調整 (RFT) とは何か - 教師あり vs. RL LLM 再トレーニング

What's AI by Louis-François Bouchard
2,980 回視聴 - 7 か月前
11:29

Reinforcement Learning from Human Feedback (RLHF) Explained

IBM Technology
65,493 回視聴 - 1 年前
11:41

Supervised Fine-Tuning vs. Reinforcement Learning in Foundation Models

AI Papers Podcast Daily
536 回視聴 - 8 か月前
13:10

RAG vs Fine-Tuning vs Prompt Engineering: Optimizing AI Models

IBM Technology
426,908 回視聴 - 6 か月前

-
8:28

Lesson 04/10 – Post-Training: Supervised Fine-Tuning (SFT) & Reinforcement Learning (RL)

Andrei Dumitrescu
492 回視聴 - 6 か月前
59:48

Build Hour: Reinforcement Fine-Tuning

OpenAI
10,852 回視聴 - 1 か月前
8:57

RAG vs. Fine Tuning

IBM Technology
373,612 回視聴 - 1 年前

-
18:02

Reinforcement Learning with Human Feedback (RLHF), Clearly Explained!!!

StatQuest with Josh Starmer
37,666 回視聴 - 5 か月前
52:40

Soc(AI)ety Seminars, Part 8: The Truth of the Matter in the Age of Generative AI

ThinkND
76 回視聴 - 2 日前
22:44

LLM Training & Reinforcement Learning from Google Engineer | SFT + RLHF | PPO vs GRPO vs DPO

Martin Is A Dad
8,675 回視聴 - 7 か月前
4:06

4 分で学ぶ、人間のフィードバックによる強化学習 (RLHF)

Sebastian Raschka
9,695 回視聴 - 8 か月前
19:52

Supervised Fine Tuning on Curated Data is Reinforcement Learning (and can be improved) (Jul 2025)

AI Papers Slop
28 回視聴 - 2 か月前
47:52

Maciej and Bartek - Fine-tuning Reinforcement Learning Models is a Forgetting Mitigation Problem

UCL DARK
808 回視聴 - 1 年前
2:42:28

[Full Workshop] Reinforcement Learning, Kernels, Reasoning, Quantization & Agents — Daniel Han

AI Engineer
90,082 回視聴 - 3 か月前
0:46

🧠 Reinforcement Fine-Tuning vs. Supervised Learning – Which Wins? 🚀

Predibase
180 回視聴 - 7 か月前
19:30

How AI Becomes Human [pre-training, supervised fine-tuning, reinforcement learning, and more]

Jeffrey Codes
1,501 回視聴 - 7 か月前
5:31

SFT vs RL-FT: How Fine-Tuning Shapes LLMs

AI Research Roundup
76 回視聴 - 1 か月前
28:53

Fine-tuning LLMs on Human Feedback (RLHF + DPO)

Shaw Talebi
16,726 回視聴 - 7 か月前
1:18:19

Reinforcement Learning for LLMs in 2025

Trelis Research
14,420 回視聴 - 8 か月前