This course emphasizes processing of the human speech waveform, primarily using digital techniques. Theory of speech production and speech perception as related to signals in time and frequency-domains is covered, as well as the measurement of model parameters, short-time Fourier spectrum, and linear predictor coefficients. Speech coding, recognition, speech synthesis, and speaker identification are discussed. Application areas include telecommunications telephony, Internet VOIP, and man-machine interfaces. Considerations for embedded realization of the speech processing system will be covered as time permits. Several application-oriented software projects will be required.
EN.525.627 Digital Signal Processing and EN.525.614 Probability and Stochastic Processes for Engineers. Background in linear algebra and MATLAB is helpful.
Course instructor(s) :