Speech Audio

Skill Topic — Text Reading
📑 Lessons 71 Lessons
1Index 2Sound Audio Basics 3Digital Audio 4Audio Formats 5Signal Processing 6Fourier Spectrograms 7MFCC 8Audio Feature Extraction 9Audio Data Collection 10ASR Fundamentals 11Traditional ASR 12End to End ASR 13Whisper Foundation ASR 14Language Models ASR 15ASR Streaming Realtime 16ASR Evaluation Metrics 17ASR Noise Robustness 18Speaker Diarization 19ASR Finetuning Domain Adaptation 20TTS Fundamentals 21TTS Text Frontend 22TTS Acoustic Models 23TTS Vocoders 24TTS Voice Cloning 25TTS Prosody Emotion 26TTS Multilingual 27TTS Evaluation 28Speaker Recognition Fundamentals 29Speaker Verification 30Speaker Embeddings 31Speaker Identification 32Voice Anti Spoofing 33Voice Activity Detection 34Speaker Recognition Production 35Audio Classification Fundamentals 36Environmental Sound Classification 37Music Information Retrieval 38Audio Emotion Recognition 39Sound Event Detection 40Audio Anomaly Detection 41Audio Segmentation 42Audio Anomaly 43Audio Generation 44Music Generation 45Sound Effects 46Noise Reduction 47Source Separation 48Audio Super Resolution 49Audio Style Transfer 50Audio Inpainting 51Diffusion Audio 52Voice Assistants 53Conversational Voice 54Voice Search 55IVR Systems 56Podcast AI 57Accessibility 58Translation Dubbing 59Voice Privacy 60Pipeline Architecture 61Edge Deployment 62Model Optimization 63Multimodal AV 64Datasets Benchmarks 65Research Frontiers 66Tools Libraries 67Project ASR 68Project TTS 69Project Voice Assistant 70Project Audio Classification 71Capstone