Is it possible to use chatbot for the Chinese word segmentation?

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

A word is the smallest item in Natural Language Processing. However, there is no obvious boundary for Chinese words. How to segment Chinese words always obstructs Chinese researches and applications. Nowadays, a neural network model, Seq2Seq with LSTM, is well-known for translation or chatbot application. In this paper, we try to transform the Chinese word segmentation problem into a translation problem. And we utilized an open-source chatbot to simulate the translation task. In our experimental results, we can produce similar Chinese word segmentation results when we provide training data which is automatically generated from famous Chinese word segmentation services.

Original languageEnglish
Title of host publicationProceedings of 2019 3rd International Conference on Natural Language Processing and Information Retrieval, NLPIR 2019
PublisherAssociation for Computing Machinery
Pages20-24
Number of pages5
ISBN (Electronic)9781450362795
DOIs
StatePublished - 28 06 2019
Event3rd International Conference on Natural Language Processing and Information Retrieval, NLPIR 2019 - Tokushima, Japan
Duration: 28 06 201930 06 2019

Publication series

NameACM International Conference Proceeding Series

Conference

Conference3rd International Conference on Natural Language Processing and Information Retrieval, NLPIR 2019
Country/TerritoryJapan
CityTokushima
Period28/06/1930/06/19

Bibliographical note

Publisher Copyright:
© 2019 Association for Computing Machinery.

Keywords

  • Chatbot
  • Chinese word segmentation
  • LSTM
  • Seq2Seq

Fingerprint

Dive into the research topics of 'Is it possible to use chatbot for the Chinese word segmentation?'. Together they form a unique fingerprint.

Cite this