Large vocabulary Taiwanese (Min-nan) speech recognition using tone features and statistical pronunciation modeling

Dau Cheng Lyu, Min Siong Liang, Yuang Chin Chiang, Chun Nan Hsu, Ren Yuan Lyu

Research output: Contribution to conference › Conference Paper › peer-review

14 Scopus citations

Abstract

This paper describes a large-vocabulary Taiwanese (Min-nan) speech recognition system. Because Taiwanese exhibits a severe multiple-pronunciation phenomenon, partly caused by tone sandhi, the system uses a statistical pronunciation modeling technique based on tonal features. The system is speaker-independent. It was trained on a bilingual Mandarin/Taiwanese speech corpus to compensate for the lack of a pure Taiwanese speech corpus. The search network is built from nodes of Chinese characters, so the recognizer outputs a Chinese character string directly. Experiments show that the proposed approaches reduce the character error rate significantly, from 21.50% to 11.97%.

Original language: English
Pages: 1861-1864
Number of pages: 4
State: Published - 2003
Event: 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 - Geneva, Switzerland
Duration: 1 Sep 2003 to 4 Sep 2003
