An automatic singing transcription system with multilingual singing lyric recognizer and robust melody tracker

Chong Kai Wang, Ren Yuan Lyu, Yuang Chin Chiang

Research output: Contribution to conference, Conference Paper, peer-review

33 Scopus citations

Abstract

This paper describes a singing transcription system that transcribes a human singing voice into musical notes. Such a system is challenging to implement because human singing rarely follows a standard musical scale. To deal with the imprecise musical scale of a human singer's input voice, the system uses several new methods: spectral standard deviation for note segmentation, Adaptive Round Semitone for melody tracking, and a Tune Map that acts as a musical grammar constraint during melody tracking. In addition, a large-vocabulary speech recognizer is added to perform lyric recognition, which is new for a singing transcription system.
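
The abstract names Adaptive Round Semitone but does not define it. The sketch below is therefore not the authors' method. It only illustrates the general problem the abstract raises: rounding sung pitches to semitones when the singer is consistently off from standard tuning. The function names, the A4 = 440 Hz reference, and the offset rule (median deviation from the nearest semitone) are all assumptions made for illustration.

```python
import math
from statistics import median


def hz_to_semitone(f0_hz, ref_hz=440.0):
    """Return a continuous MIDI-style pitch number (69.0 = A4 at ref_hz)."""
    return 69.0 + 12.0 * math.log2(f0_hz / ref_hz)


def round_with_offset(note_pitches_hz):
    """Round per-note pitches to semitones after removing a global offset.

    Assumption (not from the paper): the singer's tuning offset is estimated
    as the median deviation of each note from its nearest semitone.
    """
    semis = [hz_to_semitone(f) for f in note_pitches_hz]
    # Signed distance of each note from the nearest equal-tempered semitone.
    deviations = [s - round(s) for s in semis]
    offset = median(deviations)
    # Shift every note by the estimated offset before rounding.
    notes = [round(s - offset) for s in semis]
    return notes, offset


# Hypothetical input: a singer about 0.3 semitone sharp on A4, B4, C#5,
# then a repeated A4 sung 0.55 semitone sharp.
freqs = [440.0 * 2 ** (s / 12) for s in (0.3, 2.3, 4.3, 0.55)]
notes, offset = round_with_offset(freqs)
```

With this hypothetical input, plain rounding would label the last note A#4 (MIDI 70), while the offset-corrected rounding keeps it at A4 (MIDI 69). The median estimate becomes unreliable when deviations cluster near half a semitone, so a real tracker would need something more robust.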

Original language: English
Pages: 1197-1200
Number of pages: 4
State: Published - 2003
Event: 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 - Geneva, Switzerland
Duration: 1 Sep 2003 to 4 Sep 2003

Conference

Conference: 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003
Country/Territory: Switzerland
City: Geneva
Period: 01/09/03 to 04/09/03
