The Large Learning Rate Phase of Deep Learning with Yasaman Bahri
Gradient Descent | Cost Function | Learning Rate
MIT 6.S191 (2021): Introduction to Deep Learning
L12.1 Learning Rate Decay
Learning Rate Grafting: Big Step in Deep Learning. ML Techniques.
Kashif Rasul - What's new in Deep Learning?
What are Optimizers in Deep Learning?
Part 11 Deep Learning Optimization Algorithm 1 Parameter Gradient Descent
Machine Learning ROC Curve and AUC Explained | AIM End-to-End Session 97
Deep Learning State of the Art (2019) - MIT
[QA] Stepping on the Edge: Curvature Aware Learning Rate Tuners
Deep Learning for Natural Language Processing - Context and Neural Networks
Top Optimizers for Neural Networks
Introduction to Deep Learning Recitation 4
Neural Networks Audiobook: Chapter 2, Deep Learning Essentials
A practical guide to deep learning - Tess Ferrandez-Norlander
Learning Rate Grafting: Transferability of Optimizer Tuning (Machine Learning Research Paper Review)
[EEML'24] Yee Whye Teh - Bayesian Deep Learning
13L – Optimisation for Deep Learning
Introducing PyTorch Lightning to Simplify Deep Learning Training and Evaluation