📑 Lessons 71 Lessons
1Index
2Sound Audio Basics
3Digital Audio
4Audio Formats
5Signal Processing
6Fourier Spectrograms
7MFCC
8Audio Feature Extraction
9Audio Data Collection
10ASR Fundamentals
11Traditional ASR
12End to End ASR
13Whisper Foundation ASR
14Language Models ASR
15ASR Streaming Realtime
16ASR Evaluation Metrics
17ASR Noise Robustness
18Speaker Diarization
19ASR Finetuning Domain Adaptation
20TTS Fundamentals
21TTS Text Frontend
22TTS Acoustic Models
23TTS Vocoders
24TTS Voice Cloning
25TTS Prosody Emotion
26TTS Multilingual
27TTS Evaluation
28Speaker Recognition Fundamentals
29Speaker Verification
30Speaker Embeddings
31Speaker Identification
32Voice Anti Spoofing
33Voice Activity Detection
34Speaker Recognition Production
35Audio Classification Fundamentals
36Environmental Sound Classification
37Music Information Retrieval
38Audio Emotion Recognition
39Sound Event Detection
40Audio Anomaly Detection
41Audio Segmentation
42Audio Anomaly
43Audio Generation
44Music Generation
45Sound Effects
46Noise Reduction
47Source Separation
48Audio Super Resolution
49Audio Style Transfer
50Audio Inpainting
51Diffusion Audio
52Voice Assistants
53Conversational Voice
54Voice Search
55IVR Systems
56Podcast AI
57Accessibility
58Translation Dubbing
59Voice Privacy
60Pipeline Architecture
61Edge Deployment
62Model Optimization
63Multimodal AV
64Datasets Benchmarks
65Research Frontiers
66Tools Libraries
67Project ASR
68Project TTS
69Project Voice Assistant
70Project Audio Classification
71Capstone