Large-vocabulary speech recognition system for Taiwanese (Min-nan)

  • Ren yuan Lyu*
  • , Yuang chin Chiang
  • , Wen ping Hsieh
  • , Ren zhou Fang
  • , Chih yu Chen
  • *此作品的通信作者

研究成果: 期刊稿件文章同行評審

3 引文 斯高帕斯(Scopus)

摘要

In this paper an initial study and some preliminary work about Taiwanese (Min-nan, Southern Hokkian) speech recognition has been described, including a set of phonetic transcription symbols, a Taiwanese pronunciation lexicon of more than 50 thousand words, several sets of phonetically balanced words, and a set of speech data. The inter-syllabic right context dependent Initial/Finals or phonemes are shown to be very useful in the acoustic modeling. Furthermore, we adopted not only model clustering based on an acoustic decision tree to improve data sharing, but also a hybrid duration model to improve the accuracy of state duration modeling in CHMM. Promising recognition rates and a satisfactory recognition speed can be achieved. A prototype real-time speech recognition system has been constructed and some preliminary experimental results have also been reported. The recognition task defined here is a large-vocabulary, isolated-word (multi-syllabic), and speaker-dependent system.

指紋

深入研究「Large-vocabulary speech recognition system for Taiwanese (Min-nan)」主題。共同形成了獨特的指紋。

引用此