Interior Design Application (Prototype) - Multimodal System (Speech Recognition & Gesture)
Multimodal speech understanding - Naomi Harte
Multi-Modal Speech Emotion Recognition Using Speech Embeddings and Audio Features
Multimodal Gesture Recognition
Speech Transformer | Automatic Speech Recognition (ASR)
Towards the explainability of Multimodal Speech Emotion Recognition - (3 minutes introduction)
AudioGPT - a multi-modal AI system for Audio
Multimodal representation and learning - Shah Nawaz
INTERSPEECH 2021 | Towards the Explainability of Multimodal Speech Emotion Recognition
Multimodal Speech
Multimodal pipeline for vision audio, and speech features
Build an AI Voice Assistant App using Multimodal LLM "Llava" and Whisper
Auxiliary Loss Multimodal GRU Model in Audio-visual Speech Recognition
A Novel Multi-Modal Fusion Method for Speaker Recognition
SeamlessM4T: Andrew Ng, OpenAI Multimodal Whisper - AI Paper Explained
IS2020: Multimodal Emotion Recognition using Cross Modal Attention and 1D CNNs
Learning Alignment for Multimodal Emotion Recognition from Speech
Wavoice: A Noise-resistant Multi-modal Speech Recognition System Fusing mmWave and... (Teaser Video)
How Does Speech Recognition Work?🎤
A Multimodal Speech and Graphical Interface