Month 8: Speech and Audio Processing
Week 1: Audio Signal Processing
- Day 1: Introduction to Speech and Audio Processing
- Day 2: Time Domain Analysis: Amplitude, Energy, Zero Crossing Rate
- Day 3: Frequency Domain Analysis: Fourier Transform, Spectrogram
- Day 4: Feature Extraction: MFCC, Chroma, Spectral Contrast
- Day 5: Practical application and a mini-project
Week 2: Speech Recognition and Text-to-Speech
- Day 1: Introduction to Speech Recognition
- Day 2: Hidden Markov Models (HMMs) and GMMs in Speech Recognition
- Day 3: Deep Learning Approaches for Speech Recognition
- Day 4: Introduction to Text-to-Speech (TTS)
- Day 5: Deep Learning Approaches for TTS: Tacotron, WaveNet
Week 3: Audio Classification and Segmentation
- Day 1: Introduction to Audio Classification
- Day 2: Machine Learning for Audio Classification: SVM, Random Forest
- Day 3: Deep Learning for Audio Classification: CNNs, RNNs
- Day 4: Introduction to Audio Segmentation
- Day 5: Applications and Practical Project
Week 4: Voice Conversion and Speech Enhancement
- Day 1: Introduction to Voice Conversion
- Day 2: Traditional Approaches for Voice Conversion
- Day 3: Deep Learning for Voice Conversion: AutoVC
- Day 4: Introduction to Speech Enhancement
- Day 5: Noise Reduction and Echo Cancellation: Traditional and Deep Learning Approaches