ParlAI

在各种公开可用的对话数据集上训练和评估人工智能模型的框架。「A framework for training and evaluating AI models on a variety of openly available dialogue datasets.」

Github星跟蹤圖

GitHub license CircleCI PRs Welcome Twitter


ParlAI (pronounced “par-lay”) is a python framework for
sharing, training and testing dialogue models, from open-domain chitchat to
VQA (Visual Question Answering).

Its goal is to provide researchers:

ParlAI is described in the following paper:
“ParlAI: A Dialog Research Software Platform", arXiv:1705.06476.

See the news page for the latest additions & updates, and the website http://parl.ai for further docs.

Installing ParlAI

ParlAI currently requires Python3 and Pytorch 1.1 or
newer. Dependencies of the core modules are listed in requirement.txt. Some
models included (in parlai/agents) have additional requirements.

Run the following commands to clone the repository and install ParlAI:

git clone https://github.com/facebookresearch/ParlAI.git ~/ParlAI
cd ~/ParlAI; python setup.py develop

This will link the cloned directory to your site-packages.

This is the recommended installation procedure, as it provides ready access to the examples and allows you to modify anything you might need. This is especially useful if you if you want to submit another task to the repository.

All needed data will be downloaded to ~/ParlAI/data, and any non-data files if requested will be downloaded to ~/ParlAI/downloads. If you need to clear out the space used by these files, you can safely delete these directories and any files needed will be downloaded again.

Documentation

Examples

A large set of examples can be found in this directory. Here are a few of them.
Note: If any of these examples fail, check the requirements section to see if you have missed something.

Display 10 random examples from the SQuAD task

python examples/display_data.py -t squad

Evaluate an IR baseline model on the validation set of the Personachat task:

python examples/eval_model.py -m ir_baseline -t personachat -dt valid

Train a single layer transformer on personachat (requires pytorch and torchtext).
Detail: embedding size 300, 4 attention heads, 2 epochs using batchsize 64, word vectors are initialized with fasttext and the other elements of the batch are used as negative during training.

python examples/train_model.py -t personachat -m transformer/ranker -mf /tmp/model_tr6 --n-layers 1 --embedding-size 300 --ffn-size 600 --n-heads 4 --num-epochs 2 -veps 0.25 -bs 64 -lr 0.001 --dropout 0.1 --embedding-type fasttext_cc --candidates batch

Code Organization

The code is set up into several main directories:

  • core: contains the primary code for the framework
  • agents: contains agents which can interact with the different tasks (e.g. machine learning models)
  • examples: contains a few basic examples of different loops (building dictionary, train/eval, displaying data)
  • tasks: contains code for the different tasks available from within ParlAI
  • mturk: contains code for setting up Mechanical Turk, as well as sample MTurk tasks
  • messenger: contains code for interfacing with Facebook Messenger
  • zoo: contains code to directly download and use pretrained models from our model zoo

Support

If you have any questions, bug reports or feature requests, please don't hesitate to post on our Github Issues page.

The Team

ParlAI is currently maintained by Emily Dinan, Alexander H. Miller, Stephen Roller, Kurt Shuster, Jack Urbanek and Jason Weston.
A non-exhaustive list of other major contributors includes:
Will Feng, Adam Fisch, Jiasen Lu, Antoine Bordes, Devi Parikh, Dhruv Batra,
Filipe de Avila Belbute Peres and Chao Pan.

Citation

Please cite the arXiv paper if you use ParlAI in your work:

@article{miller2017parlai,
  title={ParlAI: A Dialog Research Software Platform},
  author={{Miller}, A.~H. and {Feng}, W. and {Fisch}, A. and {Lu}, J. and {Batra}, D. and {Bordes}, A. and {Parikh}, D. and {Weston}, J.},
  journal={arXiv preprint arXiv:{1705.06476}},
  year={2017}
}

License

ParlAI is MIT licensed. See the LICENSE file for details.

概覽

名稱與所有者facebookresearch/ParlAI
主編程語言Python
編程語言Python (語言數: 9)
平台Linux, Mac, Windows
許可證MIT License
發布數38
最新版本名稱1.7.2 (發布於 )
第一版名稱personachat (發布於 )
創建於2017-04-24 17:10:44
推送於2023-11-03 14:30:00
最后一次提交
星數10.4k
關注者數284
派生數2.1k
提交數4.4k
已啟用問題?
問題數1544
打開的問題數50
拉請求數3058
打開的拉請求數1
關閉的拉請求數342
已啟用Wiki?
已存檔?
是復刻?
已鎖定?
是鏡像?
是私有?
去到頂部