CFEVER: A Chinese Fact Extraction and VERification Dataset

  • Ying Jia Lin
  • , Chun Yi Lin
  • , Chia Jen Yeh
  • , Yi Ting Li
  • , Yun Yu Hu
  • , Chih Hao Hsu
  • , Mei Feng Lee
  • , Hung Yu Kao

研究成果: 期刊稿件會議文章同行評審

3 引文 斯高帕斯(Scopus)

摘要

We present CFEVER, a Chinese dataset designed for Fact Extraction and VERification. CFEVER comprises 30,012 manually created claims based on content in Chinese Wikipedia. Each claim in CFEVER is labeled as “Supports”, “Refutes”, or “Not Enough Info” to depict its degree of factualness. Similar to the FEVER dataset, claims in the “Supports” and “Refutes” categories are also annotated with corresponding evidence sentences sourced from single or multiple pages in Chinese Wikipedia. Our labeled dataset holds a Fleiss’ kappa value of 0.7934 for five-way inter-annotator agreement. In addition, through the experiments with the state-of-the-art approaches developed on the FEVER dataset and a simple baseline for CFEVER, we demonstrate that our dataset is a new rigorous benchmark for factual extraction and verification, which can be further used for developing automated systems to alleviate human fact-checking efforts. CFEVER is available at https://ikmlab.github.io/CFEVER.

原文英語
頁(從 - 到)18626-18634
頁數9
期刊Proceedings of the AAAI Conference on Artificial Intelligence
38
發行號17
DOIs
出版狀態已出版 - 25 03 2024
對外發佈
事件38th AAAI Conference on Artificial Intelligence, AAAI 2024 - Vancouver, 加拿大
持續時間: 20 02 202427 02 2024

文獻附註

Publisher Copyright:
© 2024, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

指紋

深入研究「CFEVER: A Chinese Fact Extraction and VERification Dataset」主題。共同形成了獨特的指紋。

引用此