Pitch marking based on an adaptable filter and a peak-valley estimation method

Jau Hung Chen, Yung An Kao

Research output: Contribution to conference › Conference Paper › peer-review


Abstract

In a text-to-speech (TTS) conversion system based on the time-domain pitch-synchronous overlap-add (TD-PSOLA) method, accurate estimation of pitch periods and pitch marks is necessary for pitch modification and for ensuring the quality of the synthetic speech. Pitch marking involves two major issues: pitch detection and location determination. In this paper, an adaptable filter, which serves as a bandpass filter, is proposed for pitch detection; it transforms the voiced speech into a sine-like wave. Based on the sine-like wave, a peak-valley decision method is investigated to determine which part of the voiced speech (the positive part or the negative part) is appropriate for pitch mark estimation. Within each pitch period, two candidate peaks or valleys are searched for, and dynamic programming is performed to obtain the pitch marks. Experimental results indicate that the proposed method performs very well, provided that the pitch information is estimated correctly.
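
The final step of the abstract, choosing one mark per pitch period from a small set of candidates by dynamic programming, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify a cost function, so the one used here (spacing deviation from an assumed constant pitch period, minus a weighted candidate amplitude) is an assumption, as are the names `select_pitch_marks`, `candidates`, `period`, and `amp_weight`.

```python
import math


def select_pitch_marks(candidates, period, amp_weight=1.0):
    """Pick one candidate per pitch period via dynamic programming.

    candidates: list with one non-empty inner list per pitch period; each
                entry is a (position_in_samples, amplitude) tuple.
    period:     expected pitch period in samples (assumed constant here).
    amp_weight: hypothetical weight trading amplitude against regular spacing.
    """
    n = len(candidates)
    if n == 0:
        return []

    # cost[i][j]: lowest cumulative cost of any path ending at candidate j
    # of period i. Larger amplitudes lower the cost.
    cost = [[-amp_weight * amp for _, amp in candidates[0]]]
    back = [[-1] * len(candidates[0])]

    for i in range(1, n):
        row_cost, row_back = [], []
        for pos, amp in candidates[i]:
            best, best_k = math.inf, -1
            for k, (prev_pos, _) in enumerate(candidates[i - 1]):
                # Transition cost: relative deviation of the spacing between
                # consecutive marks from the expected pitch period.
                trans = abs((pos - prev_pos) - period) / period
                total = cost[i - 1][k] + trans
                if total < best:
                    best, best_k = total, k
            row_cost.append(best - amp_weight * amp)
            row_back.append(best_k)
        cost.append(row_cost)
        back.append(row_back)

    # Backtrack from the cheapest candidate in the last period.
    j = min(range(len(cost[-1])), key=cost[-1].__getitem__)
    marks = []
    for i in range(n - 1, -1, -1):
        marks.append(candidates[i][j][0])
        j = back[i][j]
    return marks[::-1]


# Two candidates per period, as in the abstract; the values are made up.
example = [
    [(10, 0.9), (40, 1.0)],
    [(110, 0.9), (150, 1.0)],
    [(210, 0.9), (235, 1.0)],
]
marks = select_pitch_marks(example, period=100, amp_weight=0.1)
```

With this particular cost, the regularly spaced candidates win even though the alternatives have slightly larger amplitudes. A different weighting would shift that balance, which is why the cost function should be read as a placeholder rather than the paper's criterion.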

Original language: English
State: Published - 2001
Externally published: Yes

Bibliographical note

Publisher Copyright:
© 2001 Proceedings of the 14th Conference on Computational Linguistics and Speech Processing, ROCLING 2001. All rights reserved.
