Language identification by using syllable-based duration classification on code-switching speech

Dau Cheng Lyu*, Ren Yuan Lyu, Yuang Chin Chiang, Chun Nan Hsu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Many approaches to automatic spoken language identification (LID) on monolingual speech are successfully, but LID on the code-switching speech identifying at least 2 languages from one acoustic utterance challenges these approaches. In [6], we have successfully used one-pass approach to recognize the Chinese character on the Mandarin-Taiwanese code-switching speech. In this paper, we introduce a classification method (named syllable-based duration classification) based on three clues: recognized common tonal syllable tonal syllable, the corresponding duration and speech signal to identify specific language from code-switching speech. Experimental results show that the performance of the proposed LID approach on code-switching speech exhibits closely to that of parallel tonal syllable recognition LID system on monolingual speech.

Original languageEnglish
Title of host publicationChinese Spoken Language Processing - 5th International Symposium, ISCSLP 2006, Proceedings
Pages475-484
Number of pages10
DOIs
StatePublished - 2006
Event5th International Symposium on Chinese Spoken Language Processing, ISCSLP 2006 - Singapore, Singapore
Duration: 13 12 200616 12 2006

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4274 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference5th International Symposium on Chinese Spoken Language Processing, ISCSLP 2006
Country/TerritorySingapore
CitySingapore
Period13/12/0616/12/06

Keywords

  • Code-switching speech
  • Language identification

Fingerprint

Dive into the research topics of 'Language identification by using syllable-based duration classification on code-switching speech'. Together they form a unique fingerprint.

Cite this