Abstract
This paper describes a singing transcription system that converts the human singing voice into musical notes. Because human singing rarely follows the standard musical scale, implementing such a system is a challenge. To deal with the imprecise pitch of a human singer's voice, the system uses several new methods: spectral standard deviation for note segmentation, Adaptive Round Semitone for melody tracking, and a Tune Map that acts as a musical grammar constraint in melody tracking. In addition, a large-vocabulary speech recognizer is added to perform lyric recognition, which is a new attempt in a singing transcription system.
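The abstract does not spell out how Adaptive Round Semitone works, so the sketch below illustrates only the underlying problem: a singer whose pitch sits consistently off the standard scale is mis-transcribed by plain rounding to the nearest semitone. It swaps in a simple, generic technique (estimating one shared tuning offset as the circular mean of the per-note deviations and removing it before rounding), which is an assumption for illustration and not the paper's method. The function names and the A4 = 440 Hz reference are likewise assumptions.

```python
import math

A4_HZ = 440.0   # assumed concert-pitch reference
A4_MIDI = 69    # MIDI note number of A4


def hz_to_midi(freq_hz: float) -> float:
    """Convert a frequency in Hz to a fractional MIDI note number."""
    return A4_MIDI + 12.0 * math.log2(freq_hz / A4_HZ)


def round_with_offset(freqs_hz):
    """Round sung pitches to semitones after removing a shared tuning offset.

    Hypothetical helper, not the paper's Adaptive Round Semitone.
    Takes one pitch estimate (Hz) per note and returns (notes, offset),
    where offset is the estimated detuning in semitones.
    """
    midi = [hz_to_midi(f) for f in freqs_hz]
    # Deviation of each note from its nearest semitone, in (-0.5, 0.5].
    deviations = [m - round(m) for m in midi]
    # Circular mean: deviations of +0.49 and -0.49 are close to each other,
    # so average them as angles on a circle rather than as plain numbers.
    sin_sum = sum(math.sin(2.0 * math.pi * d) for d in deviations)
    cos_sum = sum(math.cos(2.0 * math.pi * d) for d in deviations)
    offset = math.atan2(sin_sum, cos_sum) / (2.0 * math.pi)
    notes = [round(m - offset) for m in midi]
    return notes, offset
```

For example, a singer aiming at MIDI notes 60, 62 and 64 but landing roughly half a semitone flat (at 59.45, 61.58 and 63.52) would have the first note rounded down to 59 by plain rounding, whereas `round_with_offset` recovers 60, 62 and 64 under the assumption that the detuning is roughly constant across notes.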
Original language | English |
---|---|
Pages | 1197-1200 |
Number of pages | 4 |
Publication status | Published - 2003 |
Event | 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 - Geneva, Switzerland. Duration: 1 Sep 2003 → 4 Sep 2003 |
Conference
Conference | 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 |
---|---|
Country/Territory | Switzerland |
City | Geneva |
Period | 01/09/03 → 04/09/03 |