List of authors: Tom Bäckström, Okko Räsänen, Abraham Zewoudie, Pablo Pérez Zarazaga, Liisa Koivusalo
Includes contributions from Sneha Das
NOTE! These pages are deprecated and retained only for archiving purposes. Our new location is https://speechprocessingbook.aalto.fi .
Table of contents
- Introduction
- Basic representations and models
- Waveform
- Windowing
- Signal energy, loudness and decibel
- Spectrogram and the STFT
- Autocorrelation and autocovariance
- Cepstrum and MFCC
- Linear prediction
- Fundamental frequency (F0)
- Zero-crossing rate
- Deltas and Delta-deltas
- PSOLA
- Jitter and shimmer (also Jitter, shimmer, harmonicity etc (external link))
- Crest factor (Wikipedia)
- Pre-processing
- Pre-emphasis
- Noise gate (Wikipedia)
- Dynamic Range Compression (Wikipedia)
- Voice activity detection (VAD)
- Speech enhancement
- Modelling tools in speech processing
- Evaluation of speech processing methods
- Speech analysis
- Fundamental frequency estimation
- Formant estimation and tracking
- Inverse filtering for glottal activity estimation
- Recognition tasks in speech processing
- Natural language processing
- Speech synthesis
- Transmission, storage and telecommunication
- Speech enhancement
- Noise attenuation
- Echo cancellation
- Bandwidth extension (BWE)
- Dereverberation
- Source separation
- Beamforming
- Voice and speech analysis (wikipedia)
- Measurements for medical applications
- Electroglottography (Wikipedia)
- Stroboscopy and videokymography (Wikipedia)
- Highspeed camera
- MRI
- Rothenberg mask
- Glottal inverse filtering
- Forensic analysis
- Measurements for medical applications
- Chatbots / Conversational design (external link)
- Computational models of human language processing
- Security and privacy in speech technology
- References
Recent space activity
Recently Updated | ||||||||
---|---|---|---|---|---|---|---|---|
|
Space contributors
Contributors | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|
|
Licence
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.