Course Description

This course will bring the student to the forefront of signal processing with many practical results and a fundamental understanding of the basic requirements to develop novel algorithms in speech recognition and processing, where the resulting signals are meant for listening, such as speech coding. Speech processing in three parts:

  1. The theory of acoustics of speech production, introductory acoustic phonetics, including inhomogeneous transmission line theory, reflectance, room acoustics, the short-time Fourier Transform (and its inverse), and signal processing of speech, such as LPC/CELP/VQ.
  2. Psychoacoustics of speech perception, critical bands, masking JNDs, and the physiology of the auditory pathway; cochlear modeling.
  3. Information theory, entropy, channel capacity, the confusion matrix, state models, EM algorithms, and Bayesian networks. Classic papers on speech processing and speech perception assigned and presented by student groups.


The textbook is Speech Analysis Synthesis and Perception (Third Edition) by James L. Flanagan, Jont B. Allen and Mark A. Hasegawa-Johnson, in preparation; chapters are here. Here are a few of the particular chapters:

You will need the DjVu viewer to read/print many of the materials in this course. A description of Djvu can be found at Djvu is useful because it compresses scanned bitmaps (e.g., old papers for which there is no text or pdf version) in a highly compressed format. It automatically detects images and photos and treats them differently.

There are many versions of this reader software, all free, and any of them will work. The open-source versions are multiple, including an open source version djview4. Please let Prof. Hasegawa-Johnson know if you have trouble, and I will help you get started.


There will be two mid-term exams and one final exam. Final semester grades will be calculated as follows: 20% homework, 20% exam 1, 20% exam 2, 40% final exam.


The final is a 20-25 page paper, written in the style and format (but single column) of a journal paper, that discusses everything that you have learned in this course. Writing style, spelling, figures, labels of figures, are all part of the grade.

Here is a final exam template file. You may use this file if you wish.

The final is graded based on a list of all the topics that are covered. If there is a paragraph that discusses each topic on my list, then you get at least 1 point, and if the discussion covers the topic effectively, you can get up to 5 points. There are at least 20 topics on the list. When you get to 100 points, you get an A+ on the exam. I expect that you draw on the homework as a starting point. Don't just dump the homework into the exam without modification, that wont get you points. Don't just dump a large number of unexplained figures (that you got from someone else for example) and expect to get points. I need words around each figure. I am looking for insightful comments that link the material together.

Your comments on the relevance of each of the topics I covered in this course, homework problems, exams, etc., are welcome. No points will be taken off, nor given, for strong opinions on my teaching style, or lack thereof, organization, or lack thereof, etc. Please put all such comments in a discussion section at the end of the paper, isolated from the rest of the material.