Skip To Content

Athabasca University

Section 5 : Speech Recognition as an application of HMM

Commentary

Section Goals

  • To introduce techniques related to speech recognition, such as the probabilistic acoustic model, language model, and speech recognizer.

Learning Objectives

Learning Objective 1

  • Describe the different components of speech recognition tasks.
  • Explain the principles of probabilistic models in speech recognition and speech recognizers.
  • Draw a system structure diagram to illustrate the information flow and process components for speech recognition.
  • Explain the following concepts or terms:
    • Speech recognition
    • Speech understanding
    • Acoustic model
    • Language model
    • Bigram and trigram model
    • Pronunciation model
    • Phone model
    • Coarticulation
    • Continuous speech
    • Segmentation
    • Speech recognizer

Objective Readings

Required readings:

Reading topics:

Speech Recognition (see Section 23.5 of AIMA3ed)

Rabiner, L. R. (1989). A tutorial on Hidden Markov Models and selected applications in speech recognition. Proceedings of the IEEE, 77(2), 257-286.

Objective Questions

  • Is continuous speech recognition always harder than isolated word recognition in real applications? Why?

Objective Activities

  • Explore current speech recognition software tools (commercial or free ones) in languages that are familiar to you. Find out what technologies they are based on, and to what degree you think they are acceptable. Explain your reasoning, and suggest ways they might be improved.

Updated November 17 2015 by FST Course Production Staff