Large-vocabulary speech recognition system for Taiwanese (Min-nan)

Ren yuan Lyu*, Yuang chin Chiang, Wen ping Hsieh, Ren zhou Fang, Chih yu Chen

*Corresponding author for this work

Research output: Contribution to journalJournal Article peer-review

3 Scopus citations

Abstract

In this paper an initial study and some preliminary work about Taiwanese (Min-nan, Southern Hokkian) speech recognition has been described, including a set of phonetic transcription symbols, a Taiwanese pronunciation lexicon of more than 50 thousand words, several sets of phonetically balanced words, and a set of speech data. The inter-syllabic right context dependent Initial/Finals or phonemes are shown to be very useful in the acoustic modeling. Furthermore, we adopted not only model clustering based on an acoustic decision tree to improve data sharing, but also a hybrid duration model to improve the accuracy of state duration modeling in CHMM. Promising recognition rates and a satisfactory recognition speed can be achieved. A prototype real-time speech recognition system has been constructed and some preliminary experimental results have also been reported. The recognition task defined here is a large-vocabulary, isolated-word (multi-syllabic), and speaker-dependent system.

Original languageEnglish
Pages (from-to)123-136
Number of pages14
JournalJournal of the Chinese Institute of Electrical Engineering, Transactions of the Chinese Institute of Engineers, Series E/Chung KuoTien Chi Kung Chieng Hsueh K'an
Volume7
Issue number2
StatePublished - 2000

Fingerprint

Dive into the research topics of 'Large-vocabulary speech recognition system for Taiwanese (Min-nan)'. Together they form a unique fingerprint.

Cite this