Few-shot Text Classification with Saliency-equivalent Concatenation

Ying Jia Lin, Yu Fang Chang, Hung Yu Kao, Hsin Yang Wang, Mu Liu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In few-shot text classification, the lack of significant features limits models from generalizing to data not included in the training set. Data augmentation is a solution to the classification tasks; however, the standard augmentation methods in natural language processing are not feasible in few-shot learning. In this study, we explore data augmentation in few-shot text classification. We propose saliency-equivalent concatenation (SEC)1. The core concept of SEC is to append additional key information to an input sentence to help a model understand the sentence easier. In the proposed method, we first leverage a pre-trained language model to generate several novel sentences for each sample in datasets. Then we leave the most relevant one and concatenate it with the original sentence as additional information for each sample. Our experiments on the two fewshot text classification tasks verified that the proposed method can boost the performance of meta-learning models and outperform the previous unsupervised data augmentation methods.

Original languageEnglish
Title of host publicationProceedings - 2022 IEEE 5th International Conference on Artificial Intelligence and Knowledge Engineering, AIKE 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages74-81
Number of pages8
ISBN (Electronic)9781665471206
DOIs
StatePublished - 2022
Externally publishedYes
Event5th IEEE International Conference on Artificial Intelligence and Knowledge Engineering, AIKE 2022 - Laguna Hills, United States
Duration: 19 09 202221 09 2022

Publication series

NameProceedings - 2022 IEEE 5th International Conference on Artificial Intelligence and Knowledge Engineering, AIKE 2022

Conference

Conference5th IEEE International Conference on Artificial Intelligence and Knowledge Engineering, AIKE 2022
Country/TerritoryUnited States
CityLaguna Hills
Period19/09/2221/09/22

Bibliographical note

Publisher Copyright:
© 2022 IEEE.

Keywords

  • data augmentation
  • few-shot learning
  • knowledge extraction
  • meta-learning
  • natural language processing

Fingerprint

Dive into the research topics of 'Few-shot Text Classification with Saliency-equivalent Concatenation'. Together they form a unique fingerprint.

Cite this