TY - JOUR
T1 - Large-vocabulary speech recognition system for Taiwanese (Min-nan)
AU - Lyu, Ren yuan
AU - Chiang, Yuang chin
AU - Hsieh, Wen ping
AU - Fang, Ren zhou
AU - Chen, Chih yu
PY - 2000
Y1 - 2000
N2 - This paper describes an initial study and preliminary work on Taiwanese (Min-nan, Southern Hokkian) speech recognition, including a set of phonetic transcription symbols, a Taiwanese pronunciation lexicon of more than 50 thousand words, several sets of phonetically balanced words, and a set of speech data. Inter-syllabic right-context-dependent Initials/Finals or phonemes are shown to be very useful in acoustic modeling. Furthermore, we adopted both model clustering based on an acoustic decision tree, to improve data sharing, and a hybrid duration model, to improve the accuracy of state duration modeling in CHMM. Promising recognition rates and a satisfactory recognition speed can be achieved. A prototype real-time speech recognition system has been constructed, and some preliminary experimental results are reported. The recognition task defined here is large-vocabulary, isolated-word (multi-syllabic), and speaker-dependent.
AB - This paper describes an initial study and preliminary work on Taiwanese (Min-nan, Southern Hokkian) speech recognition, including a set of phonetic transcription symbols, a Taiwanese pronunciation lexicon of more than 50 thousand words, several sets of phonetically balanced words, and a set of speech data. Inter-syllabic right-context-dependent Initials/Finals or phonemes are shown to be very useful in acoustic modeling. Furthermore, we adopted both model clustering based on an acoustic decision tree, to improve data sharing, and a hybrid duration model, to improve the accuracy of state duration modeling in CHMM. Promising recognition rates and a satisfactory recognition speed can be achieved. A prototype real-time speech recognition system has been constructed, and some preliminary experimental results are reported. The recognition task defined here is large-vocabulary, isolated-word (multi-syllabic), and speaker-dependent.
UR - http://www.scopus.com/inward/record.url?scp=0033732126&partnerID=8YFLogxK
M3 - Article
AN - SCOPUS:0033732126
SN - 1023-4462
VL - 7
SP - 123
EP - 136
JO - Journal of the Chinese Institute of Electrical Engineering, Transactions of the Chinese Institute of Engineers, Series E/Chung KuoTien Chi Kung Chieng Hsueh K'an
JF - Journal of the Chinese Institute of Electrical Engineering, Transactions of the Chinese Institute of Engineers, Series E/Chung KuoTien Chi Kung Chieng Hsueh K'an
IS - 2
ER -