FewRel

A Large-Scale Few-Shot Relation Extraction Dataset

Github星跟踪图

FewRel Dataset, Toolkits and Baseline Models

FewRel is a large-scale few-shot relation extraction dataset, which contains 70000 natural language sentences expressing 100 different relations. This dataset is presented in the our EMNLP 2018 paper FewRel: A Large-Scale Few-Shot Relation Classification Dataset with State-of-the-Art Evaluation.

More info at https://thunlp.github.io/fewrel.html .

Citing

If you used our data, toolkits or baseline models, please kindly cite our paper:

@inproceedings{han2018fewrel,
               title={FewRel:A Large-Scale Supervised Few-Shot Relation Classification Dataset with State-of-the-Art Evaluation},
               author={Han, Xu and Zhu, Hao and Yu, Pengfei and Wang, Ziyun and Yao, Yuan and Liu, Zhiyuan and Sun, Maosong},
               booktitle={EMNLP},
               year={2018}
}

If you have questions about any part of the paper, submission, leaderboard, codes, data, please e-mail zhuhao15@mails.tsinghua.edu.cn.

Contributions

Hao Zhu first proposed this problem and proposed the way to build the dataset and the baseline system; Ziyuan Wang built and maintained the crowdsourcing website; Yuan Yao helped download the original data and conducted preprocess;
Xu Han, Hao Zhu, Pengfei Yu and Ziyun Wang implemented baselines and wrote the paper together; Zhiyuan Liu provided thoughtful advice and funds through the whole project. The order of the first four authors are determined by dice rolling.

Dataset and Word Embedding

The dataset has already be contained in the github repo. However, due to the large size, glove files (pre-trained word embeddings) are not included. Please download glove.6B.50d.json from Tsinghua Cloud or Google Drive and put it under data/ folder.

Usage

To run our baseline models, use command

python train_demo.py {MODEL_NAME}

replace {MODEL_NAME} with proto, metanet, gnn or snail.

主要指标

概览
名称与所有者thunlp/FewRel
主编程语言Python
编程语言Python (语言数: 2)
平台
许可证MIT License
所有者活动
创建于2018-08-25 22:55:12
推送于2022-05-04 14:41:14
最后一次提交2022-05-04 22:41:14
发布数0
用户参与
星数733
关注者数21
派生数166
提交数111
已启用问题?
问题数82
打开的问题数9
拉请求数2
打开的拉请求数0
关闭的拉请求数0
项目设置
已启用Wiki?
已存档?
是复刻?
已锁定?
是镜像?
是私有?