Using speech recognition technique for constructing a phonetically transcribed Taiwanese (Min-nan) text corpus

Min Siong Liang*, Ren Yuan Lyu, Yuang Chin Chiang

*Corresponding author for this work

Research output: Chapter in book/report; conference contribution; peer-reviewed

3 Citations (Scopus)

Abstract

Collecting a Taiwanese text corpus with phonetic transcription suffers from the problem of multiple pronunciation variants. By augmenting the text with speech, and using automatic speech recognition with a sausage search network constructed from the multiple pronunciations of the text corresponding to each speech utterance, we are able to reduce the effort required for phonetic transcription. Using the multiple-pronunciation lexicon, a transcription error rate of 13.94% was achieved. Further improvement can be achieved by adapting the pronunciation lexicon with pronunciation variation (PV) rules derived from a manually corrected speech corpus. The PV rules fall into two categories: knowledge-based rules and data-driven rules. By incorporating the PV rules, an error rate reduction of 13.63% could be achieved. Although the technique was developed for Taiwanese speech, it could easily be adapted to other similar "minority" Chinese spoken languages.
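The sausage search network mentioned in the abstract can be pictured as a sequence of slots, one per character of the text, where each slot holds the candidate pronunciations for that character. The following is a minimal Python sketch under stated assumptions, not the authors' implementation: the lexicon entries, the function names (`build_sausage`, `count_paths`, `best_path`), and the per-slot scoring function are all hypothetical. In particular, scoring each slot independently stands in for the acoustic decoding that a real recognizer would perform over the whole utterance.

```python
# Hypothetical multi-pronunciation lexicon: each character maps to its
# candidate syllables. Entries are illustrative only, not taken from the paper.
LEXICON = {
    "我": ["gua2"],
    "行": ["kiann5", "hing5", "hang5"],
    "路": ["loo7"],
}


def build_sausage(text, lexicon):
    """Return one slot per character; each slot lists candidate pronunciations."""
    return [list(lexicon[ch]) for ch in text]


def count_paths(sausage):
    """Number of distinct pronunciation sequences the network allows."""
    total = 1
    for slot in sausage:
        total *= len(slot)
    return total


def best_path(sausage, score):
    """Pick the highest-scoring candidate in every slot.

    `score(position, syllable)` is an assumed stand-in for the acoustic
    score a speech recognizer would assign to that syllable at that slot.
    """
    return [
        max(slot, key=lambda syl, i=i: score(i, syl))
        for i, slot in enumerate(sausage)
    ]


# Toy scores (assumed): the "recognizer" prefers "kiann5" in slot 1.
TOY_SCORES = {(1, "kiann5"): -1.0, (1, "hing5"): -4.0, (1, "hang5"): -6.0}


def toy_score(position, syllable):
    # Slots with a single candidate get a default score of 0.0.
    return TOY_SCORES.get((position, syllable), 0.0)


sausage = build_sausage("我行路", LEXICON)
transcription = best_path(sausage, toy_score)
```

Because the network is restricted to pronunciations that the known text permits, the recognizer only has to choose among a few alternatives per slot rather than search the full syllable inventory, which is what makes the approach useful for reducing manual transcription effort.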

Original language: English
Title of host publication: INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP
Publisher: International Speech Communication Association
Pages: 193-196
Number of pages: 4
ISBN (Print): 9781604234497
Publication status: Published - 2006
Event: INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP - Pittsburgh, PA, United States
Duration: 17 Sep 2006 → 21 Sep 2006

Publication series

Name: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
1
ISSN (Electronic): 1990-9772

Conference

Conference: INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP
Country/Territory: United States
City: Pittsburgh, PA
Period: 17/09/06 → 21/09/06
