The Science Review: Visual Instruction Tuning
W2 9 How LLMs follow instructions, Instruction tuning and RLHF
[ECCV2024, Oral] LEGO: Learning Egocentric Action Frame Generation via Visual Instruction Tuning
LLaVA - the first instruction following multi-modal model (paper explained)
Visual Instruction Tuning
Visual Instruction Tuning [Moon Ye-Bin]
Visual Instruction Tuning using LLaVA
Visual Instruction Tuning Wisconsin Microsoft 2023
EP140 - LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning
TextSquare - Scaling up Text-Centric Visual Instruction Tuning
Efficient Vision Language Instruction Tuning: The MMA Approach
[Lab Seminar] Visual Instruction Tuning
[Audio notes] LLaVA - Visual Instruction Tuning
LLava: Visual Instruction Tuning
Osprey: Pixel Understanding with Visual Instruction Tuning
LL3DA Visual Interactive Instruction Tuning for Omni 3D Understanding Reasoning and Planning
[Paper Review] LLaVA: Large Language and Vision Assistant (Visual Instruction Tuning)
[2023 Best AI Paper] ImageBind-LLM: Multi-modality Instruction Tuning
命令チューニング(UTオースティンの自然言語処理)
Effective Instruction Tuning: Data & Methods