Warning: include(): php_network_getaddresses: getaddrinfo failed: Name or service not known in /home1/isip/public_html/isip/projects/speech/software/tutorials/production/fundamentals/v1.0/section_04/s04_04_p01.html on line 5

Warning: include(http://www.isip.isipess.com/templates/speech/header/background.html): failed to open stream: php_network_getaddresses: getaddrinfo failed: Name or service not known in /home1/isip/public_html/isip/projects/speech/software/tutorials/production/fundamentals/v1.0/section_04/s04_04_p01.html on line 5

Warning: include(): Failed opening 'http://www.isip.isipess.com/templates/speech/header/background.html' for inclusion (include_path='.:/opt/php54/lib/php') in /home1/isip/public_html/isip/projects/speech/software/tutorials/production/fundamentals/v1.0/section_04/s04_04_p01.html on line 5
/ Recognition / Fundamentals / Production / Tutorials / Software / Home
Warning: include(/home1/isip/public_html/isip/templates/speech/header/header_with_navigation.html): failed to open stream: No such file or directory in /home1/isip/public_html/isip/projects/speech/software/tutorials/production/fundamentals/v1.0/section_04/s04_04_p01.html on line 22

Warning: include(): Failed opening '/home1/isip/public_html/isip/templates/speech/header/header_with_navigation.html' for inclusion (include_path='.:/opt/php54/lib/php') in /home1/isip/public_html/isip/projects/speech/software/tutorials/production/fundamentals/v1.0/section_04/s04_04_p01.html on line 22
4.4.1 Forced Alignment: Overview
Section 4.4.1: Forced Alignment

As we've seen thus far, a speech recognition system uses a search engine along with an acoustic and language model which contains a set of possible words, phonemes, or some other set of data to match speech data to the correct spoken utterance. The search engine processes the features extracted from the speech data to identify occurences of the words, phonemes, or whatever set of data it is equipped to search for and returns the results.

Section 4.4.1 Forced Alignment

Forced alignment is similar to this process, but it differs in one major respect. Rather than being given a set of possible words to search for, the search engine is given an exact transcription of what is being spoken in the speech data. The system then aligns the transcribed data with the speech data, identifying which time segments in the speech data correspond to particular words in the transcription data.

Section 4.4.1 Forced Alignment

Forced alignment can also be used to align the phonemes of the transcription data to the speech data given, similar to the image below, although with more explicitly defined boundaries on where each phoneme begins and ends.

Section 4.4.1 Forced Alignment
   
Table of Contents   Section Contents   Previous Page Up Next Page
      Glossary / Help / Support / Site Map / Contact Us / ISIP Home