Development of a taiwanese speech and text corpus

Tzu Yu Liao, Ren Yuan Lyu, Ming Tat Ko, Yuang Chin Chiang, Jyh Shing Roger Jang

研究成果: 圖書/報告稿件的類型會議稿件同行評審

摘要

The main goal of this paper is to develop a large scale Taiwanese corpus. In the mean time, we try to establish a successful model for the computational linguistic research on other minority Taiwanese languages such as Haka. In this paper, we will build a Taiwanese speech corpus. The source of speech corpus is Taiwanese dramas and news from TV stations. The goal of the corpus is 200 hours speech material with annotation.

原文英語
主出版物標題Proceedings of the 24th Conference on Computational Linguistics and Speech Processing, ROCLING 2012
頁面102-111
頁數10
出版狀態已出版 - 2012
事件24th Conference on Computational Linguistics and Speech Processing, ROCLING 2012 - Chung-Li, 台灣
持續時間: 21 09 201222 09 2012

出版系列

名字Proceedings of the 24th Conference on Computational Linguistics and Speech Processing, ROCLING 2012

Conference

Conference24th Conference on Computational Linguistics and Speech Processing, ROCLING 2012
國家/地區台灣
城市Chung-Li
期間21/09/1222/09/12

指紋

深入研究「Development of a taiwanese speech and text corpus」主題。共同形成了獨特的指紋。

引用此