DiDi AI Lab's submission for DSTC9 Multi-domain Task-oriented Dialog Challenge II

We participated in the Multi-domain Task-oriented Dialog Challenge II of DSTC9. Participants in this shared task build end-to-end dialog systems that can assist human to fulfil single or multiple tasks, such as making a restaurant reservation, booking a hotel, etc.

In the evaluation, human users are recruited and chat with the system to fulfill tasks. At the end of the conversation, human users mark whether the task is fulfilled or not based on system responses. Our best system achieves 74.8% success rate and ties for first place in the challenge. It scores 4.51/5 on language understanding and 4.45/5 on response appropriateness, which are the two additional metrics judged by human users.

We release our code for research purpose. At the moment, only the inference part is fully tested. For training new models, please refer to our internal documentation end2end/soloist (fair amount of hacking and modifications are expected). The code is based on Convlab-2 which is a dialog evaluation toolkit provided by the shared task organizer. Please checkout our paper for more details: A Hybrid Task-Oriented Dialog System with Domain and Task Adaptive Pretraining.

Installation

Python version >= 3.5 (Python virtual environment is strongly recommended)
Install dependencies and download pre-trained models:

bash install.sh

Add the project root directory to PYTHONPATH:

export PYTHONPATH=<project_root_path>

Quick Start

We provide a simple script that can let you interact with the chatbot in a termial.

python end2end/human_eval_example.py

You are provided with a randomly generated task goal. Just follow the goal and chat with the chatbot. Type success or fail to finish the conversation.

Evaluate with a user simulator

Convlab-2 comes with a user simulator that can be used to evaluate chatbots. You can find more details about the user simulator on Convlab-2 github page.

Command to run the user simulator based automatic evaluation:

python end2end/submission1/automatic.py

Cite

Please cite this paper if you find this repository useful:

@article{zhang2021hybrid,
      title={A Hybrid Task-Oriented Dialog System with Domain and Task Adaptive Pretraining}, 
      author={Boliang Zhang and Ying Lyu and Ning Ding and Tianhao Shen and Zhaoyang Jia and Kun Han and Kevin Knight},
      year={2021},
      eprint={2102.04506},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

Contact

Questions? Suggestions? Please use Github "Issues" panel or email boliangzhang@didiglobal.com.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
convlab2		convlab2
data		data
deploy		deploy
docs		docs
end2end		end2end
tests		tests
tutorials		tutorials
.codebeatignore		.codebeatignore
.gitignore		.gitignore
.travis.yml		.travis.yml
Dockerfile		Dockerfile
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
NOTICE		NOTICE
PULL_REQUEST_TEMPLATE.md		PULL_REQUEST_TEMPLATE.md
README.md		README.md
dev.dockerfile		dev.dockerfile
get_started.py		get_started.py
install.sh		install.sh
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DiDi AI Lab's submission for DSTC9 Multi-domain Task-oriented Dialog Challenge II

Installation

Quick Start

Evaluate with a user simulator

Cite

Contact

About

Uh oh!

Releases

Packages

Languages

License

itamaker/dstc9

Folders and files

Latest commit

History

Repository files navigation

DiDi AI Lab's submission for DSTC9 Multi-domain Task-oriented Dialog Challenge II

Installation

Quick Start

Evaluate with a user simulator

Cite

Contact

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages