11-[468]92 Speech Processing Fall 2019

Time: MW 3:00-4:20 (note, no Friday lecture)
Location: GHC 4102
Office Hours: TBA
Piazza (TBA)

Schedule

Description: Speech Processing offers a practical and theoretical understanding of how human speech can be processed by computers. It covers speech recognition, speech synthesis and spoken dialog systems. The course involves practicals where the student will build working speech recognition systems, build their own synthetic voice and build a complete spoken dialog system. This work will be based on existing toolkits. Details of algorithms, techniques and limitations of state-of-the-art speech systems will also be presented. This course is designed for students wishing to understand how to process real data for real applications, applying statistical and machine learning techniques as well as working with limitations in the technology.

Instructor: Alan W Black

TA: Tanmay Parekh tparekh@andrew.cmu.edu

Prerequisites: 15-211 for SCS undergraduates; exemption from this requirement requires the instructor's permission.
Availability: Open to juniors and seniors in the SCS undergraduate program and the ECE undergraduate program. Open to other students with the consent of an instructor.

Materials: The required text for the course is "Spoken Language Processing" by Xuedong Huang, Alex Acero and Hsiao-wuen Hon, Prentice Hall (ISBN 0-13-022616-5). This book will be used for reading assignments and as background reading for homeworks and exams.

Homework: Homework consists of four programming projects: Speech Recognition, Speech Synthesis, Spoken Dialog Systems, and one other.
Grading: 10% class participation, 60% programming projects, 30% final.
Course policies: Late homework, Cheating

Final exam: TBA. It will be a closed-book exam.


Homework 01:
Homework 02:
Homework 03:
Homework 04: