Abstract
A singing transcription system which transcribes human singing voice to musical notes is described in this paper. The fact that human singing rarely follows standard musical scale makes it a challenge to implement such a system. This system utilizes some new methods to deal with the issue of imprecise musical scale of input voice of a human singer, such as spectral standard deviation used for note segmentation, Adaptive Round Semitone used for melody tracking and Tune Map acting as a musical grammar constraint in melody tracking. Furthermore, a large vocabulary speech recognizer performing the lyric recognition tasks is also added, which is a new trial in a singing transcription system.
Original language | English |
---|---|
Pages | 1197-1200 |
Number of pages | 4 |
State | Published - 2003 |
Event | 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 - Geneva, Switzerland Duration: 01 09 2003 → 04 09 2003 |
Conference
Conference | 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 |
---|---|
Country/Territory | Switzerland |
City | Geneva |
Period | 01/09/03 → 04/09/03 |