This course emphasizes processing of the human speech waveform, primarily using digital techniques. Theory of speech production and speech perception as related to signals in time and frequency-domains is covered, as well as the measurement of model parameters, short-time Fourier spectrum, and linear predictor coefficients. Speech coding, recognition, speech synthesis, and speaker identification are discussed. Application areas include telecommunications telephony, Internet VOIP, and man-machine interfaces. Considerations for embedded realization of the speech processing system will be covered as time permits. Several application-oriented software projects will be required.

Course prerequisites: 

525.427 Digital Signal Processing and 525.414 Probability and Stochastic Processes for Engineers. Background in linear algebra and MATLAB is helpful.

Course instructor: 
Carmody

View Course Homepage(s) for this course.