Abstract
In this paper, we apply context dependent phonetic modeling on the task of large vocabulary (with 20 thousand words) Taiwanese multi-syllabic word recognition. Considering the phonetic characteristics of Taiwanese, the right context dependent (RCD) phones instead of the general tri-phones are used. The RCDs are further clustered at the sub-phone or state level using a decision tree with a set of context-split questions specially designed for Taiwanese speech according to the acoustic/phonetic knowledge. For the speaker dependent case, 7.18% word error rate is achieved. A real-time prototype system implemented on a Pentium-II personal computer running MSWindows95/NT is also shown to validate the approaches proposed here.
| Original language | English |
|---|---|
| State | Published - 1998 |
| Event | 5th International Conference on Spoken Language Processing, ICSLP 1998 - Sydney, Australia Duration: 30 11 1998 → 04 12 1998 |
Conference
| Conference | 5th International Conference on Spoken Language Processing, ICSLP 1998 |
|---|---|
| Country/Territory | Australia |
| City | Sydney |
| Period | 30/11/98 → 04/12/98 |
Bibliographical note
Publisher Copyright:© 1998. 5th International Conference on Spoken Language Processing, ICSLP 1998. All rights reserved.
Fingerprint
Dive into the research topics of 'A LARAGE-VOCABULARY TAIWANESE (MIN-NAN) MULTI-SYLLABIC WORD RECOGNITION SYSTEM BASED UPON RIGHT-CONTEXT-DEPENDENT PHONES WITH STATE CLUSTERING BY ACOUSTIC DECISION TREE'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver