Maximum entropy modeling of acoustic and linguistic features

Chuang Hua Chueh*, Jen Tzung Chien

*此作品的通信作者

研究成果: 圖書/報告稿件的類型會議稿件同行評審

3 引文 斯高帕斯(Scopus)

摘要

Traditionally, speech recognition system is established assuming that acoustic and linguistic information sources are independent, Parameters of hidden Markov model and n-gram are estimated individually and then plugged in a maximum a posteriori classification rule. However, acoustic and linguistic features are correlated in essence. Modeling performance is limited accordingly, This study aims to relax the independence assumption and achieve sophisticated acoustic and linguistic modeling for speech recognition. We propose an integrated approach based on maximum entropy (ME) principle where acoustic and linguistic features are optimally merged in a unified framework. The correlations between acoustic and linguistic features are explored and properly represented in the integrated models. Due to the flexibility of ME model, we can further combine other high-level linguistic features, In the experiments, we carry out the proposed methods for broadcast news transcription using MATBN database. We obtain significant improvement compared to conventional speech recognition system using individual maximum likelihood training.

原文英語
主出版物標題2006 IEEE International Conference on Acoustics, Speech, and Signal Processing - Proceedings
頁面I1061-I1064
出版狀態已出版 - 05 2006
事件2006 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2006 - Toulouse, 法國
持續時間: 14 05 200619 05 2006

出版系列

名字ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
1
ISSN(列印)1520-6149

Conference

Conference2006 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2006
國家/地區法國
城市Toulouse
期間14/05/0619/05/06

指紋

深入研究「Maximum entropy modeling of acoustic and linguistic features」主題。共同形成了獨特的指紋。

引用此