A unified framework for large vocabulary speech recognition of mutually unintelligible Chinese "regionalects"

Ren Yuan Lyu, Dau Cheng Lyu, Min Siong Liang, Min Hong Wang, Yuang Chin Chiang, Chun Nan Hsu

Research output: Contribution to conference › Conference Paper › peer-review

3 Scopus citations

Abstract

This paper proposes a new approach to recognizing speech in mutually unintelligible spoken Chinese regionalects, based on a unified three-layer framework and a one-stage search strategy. The framework comprises (1) a unified acoustic model for all the regionalects considered; (2) a multiple-pronunciation lexicon constructed by both rule-based and data-driven approaches; and (3) a one-stage search network whose nodes represent Chinese characters with their multiple pronunciations. Unlike traditional approaches, the new approach avoids searching for intermediate, locally optimal syllable sequences or lattices. Instead, by using Chinese characters as the search nodes, it searches directly for the globally optimal character sequence. The paper reports experiments on two Chinese regionalects, Taiwanese and Mandarin. The results show that the unified framework deals efficiently with the multiple pronunciations found in spoken Chinese regionalects. Compared with the traditional two-stage scheme, the new approach achieves a character error rate reduction of 34.1%. Furthermore, the new approach is shown to be more robust when dealing with a database of poorly uttered speech.
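The idea of searching over character nodes rather than over an intermediate syllable sequence can be illustrated with a small sketch. This is not the authors' implementation: it assumes the utterance is already split into one segment per character and that each segment comes with acoustic log-likelihoods per syllable, whereas a real recognizer searches over unsegmented speech with acoustic models. The lexicon entries, romanizations, bigram table, and the names `LEXICON` and `one_stage_search` are all hypothetical and chosen only for illustration.

```python
import math

# Hypothetical toy lexicon: each character lists several pronunciations,
# e.g. one Mandarin-like and one Taiwanese-like reading. The romanizations
# are illustrative and are not taken from the paper.
LEXICON = {
    "我": ["wo3", "gua2"],
    "你": ["ni3", "li2"],
    "好": ["hao3", "ho2"],
}


def one_stage_search(segment_scores, lexicon, bigram_logp,
                     default_lm=math.log(1e-3)):
    """Viterbi search whose nodes are characters, not syllables.

    segment_scores: list of dicts (syllable -> acoustic log-likelihood),
                    one dict per pre-segmented character slot (an assumption).
    bigram_logp:    dict ((prev_char, char) -> log probability); unseen
                    pairs fall back to default_lm.
    Returns (best character string, total log score).
    """
    chars = list(lexicon)

    def acoustic(ch, scores):
        # A character node scores as its best-matching pronunciation.
        return max(scores.get(p, -math.inf) for p in lexicon[ch])

    # best[ch] = (score of best path ending in ch, that path)
    best = {ch: (acoustic(ch, segment_scores[0]), [ch]) for ch in chars}

    for scores in segment_scores[1:]:
        new_best = {}
        for ch in chars:
            # Pick the predecessor that maximizes path score + LM score.
            prev, (prev_score, prev_path) = max(
                best.items(),
                key=lambda kv: kv[1][0]
                + bigram_logp.get((kv[0], ch), default_lm),
            )
            lm = bigram_logp.get((prev, ch), default_lm)
            new_best[ch] = (prev_score + lm + acoustic(ch, scores),
                            prev_path + [ch])
        best = new_best

    score, path = max(best.values(), key=lambda v: v[0])
    return "".join(path), score


# Toy input: the first slot sounds Mandarin-like, the second Taiwanese-like.
segments = [
    {"ni3": -1.0, "wo3": -5.0},
    {"ho2": -0.5, "hao3": -2.0},
]
bigrams = {("你", "好"): math.log(0.5)}
text, score = one_stage_search(segments, LEXICON, bigrams)
print(text, score)
```

Because every pronunciation of a character feeds the same node, the search never has to commit to a single syllable sequence before the character-level language model is applied, which is the contrast with a two-stage scheme that the abstract describes.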

Original language: English
Pages: 1001-1004
Number of pages: 4
State: Published - 2004
Event: 8th International Conference on Spoken Language Processing, ICSLP 2004 - Jeju, Jeju Island, Korea, Republic of
Duration: 4 Oct 2004 to 8 Oct 2004

Conference

Conference: 8th International Conference on Spoken Language Processing, ICSLP 2004
Country/Territory: Korea, Republic of
City: Jeju, Jeju Island
Period: 4 Oct 2004 to 8 Oct 2004
