Recursive likelihood evaluation and fast search algorithm for polynomial segment model with application to speech recognition

Chak Fai Li, Man Hung Siu, Jeff Siu Kei Au-Yeung

Research output: Contribution to journalArticlepeer-review

6 Citations (Scopus)

Abstract

Polynomial segment models (PSMs), which are generalization of the hidden Markov models (HMMs), have opened an alternative research direction for speech recognition. However, they have been limited by their computational complexity. Traditionally, any change in PSM segment boundary requires likelihood recomputation of all the frames within the segment. This makes the PSM's segment likelihood evaluation an order of magnitude more expensive than the HMM's. Furthermore, because recognition using segment models needs to search over all possible segment boundaries, the PSM recognition is computationally unfeasible beyond N-best rescoring. By exploiting the properties of the time normalization in PSM, and by decomposing the PSM segment likelihood into a simple function of "sufficient statistics," in this paper, we show that segment likelihood can be evaluated efficiently in an order of computational complexity similar to HMM. In addition, by reformulating the PSM recognition as a search for the optimal path through a graph, this paper introduces a fast PSM search algorithm that intelligently prunes the number of hypothesized segment boundaries, such that PSM recognition can be performed in an order of complexity similar to HMM. We demonstrate the effectiveness of the proposed algorithms with experiments using a PSM-based recognition system on two different recognition tasks: TIDIGIT digit recognition and the Wall Street Journal dictation task. In both tasks, PSM recognition is feasible and out-performed traditional HMM by more than 14%.

Original languageEnglish
Pages (from-to)1704-1718
Number of pages15
JournalIEEE Transactions on Audio, Speech and Language Processing
Volume14
Issue number5
DOIs
Publication statusPublished - Sept 2006
Externally publishedYes

Keywords

  • Fast algorithm
  • Polynomial segment model
  • Search
  • Speech recognition

Fingerprint

Dive into the research topics of 'Recursive likelihood evaluation and fast search algorithm for polynomial segment model with application to speech recognition'. Together they form a unique fingerprint.

Cite this