This course emphasizes processing of the human speech waveform, primarily using digital techniques. Theory of speech production and speech perception as related to signals in time and frequency-domains is covered, as well as the measurement of model parameters, short-time Fourier spectrum, and linear predictor coefficients. Speech coding, recognition, speech synthesis, and speaker identification are discussed. Application areas include telecommunications telephony, Internet VOIP, and man-machine interfaces. Considerations for embedded realization of the speech processing system will be covered as time permits. Several application-oriented software projects will be required.
Course Prerequisite(s)
EN.525.627 Digital Signal Processing and EN.525.614 Probability and Stochastic Processes for Engineers. Background in linear algebra and MATLAB is helpful.
Course Offerings
Open
Speech Processing
01/27/2025 - 05/05/2025
Mon 4:30 p.m. - 7:10 p.m. |