**EmoUS** is a data-driven, transformer-based user simulator that generates user emotions, semantic actions, and natural language responses based on the user goal, the dialogue history, and the user persona.

In this work we propose EmoUS, a generative transformer-based user simulator. EmoUS consists of an encoder-decoder structure and generates user emotions, semantic actions, and natural language responses based on the user goal, the dialogue history, and the user persona. By analysing what kind of system behaviour elicits what kind of user emotions, we show that EmoUS can be used as a probe to evaluate a variety of dialogue systems, in particular their effect on the user's emotional state. Developing such methods is important in the age of large language model chatbots and rising ethical concerns.
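To make the interface concrete, here is a schematic sketch of a single simulator turn. The field names and serialization below are illustrative assumptions only, not the exact format used in `convlab/policy/emoUS`:

```python
import json

# Schematic input: the simulator conditions on the last system action,
# the user goal, the user persona, and the turn index.
# All field names here are hypothetical placeholders.
model_input = json.dumps({
    "system_action": [["request", "restaurant", "area", "?"]],
    "goal": [["inform", "restaurant", "food", "italian"],
             ["inform", "restaurant", "area", "centre"]],
    "persona": {"user": "polite"},
    "turn": 2,
})

# Schematic output: the decoder generates the user emotion first, then the
# semantic user action, then an utterance realising that action.
model_output = {
    "emotion": "neutral",
    "action": [["inform", "restaurant", "area", "centre"]],
    "utterance": "I would like it to be in the centre, please.",
}
```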
**ConvLab-3** is a flexible dialog system platform based on a **unified data format** for task-oriented dialog (TOD) datasets. The unified format serves as the adapter between TOD datasets and models: datasets are first transformed to the unified format and then loaded by models. In this way, the cost of adapting $M$ models to $N$ datasets is reduced from $M\times N$ to $M+N$. While retaining all features of [ConvLab-2](https://github.com/thu-coai/ConvLab-2), ConvLab-3 greatly enlarges the set of supported datasets and models thanks to the unified format, and enhances the utility of the reinforcement learning (RL) toolkit for the dialog policy module. For typical usage, see our [paper](http://arxiv.org/abs/2211.17148). Datasets and trained models are also available on the [Hugging Face Hub](https://huggingface.co/ConvLab).
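As a rough illustration of the adapter idea, a dialogue in the unified format looks roughly like the following (a simplified sketch; see the unified data format tutorial linked below for the actual schema):

```python
# A simplified sketch of one dialogue in the unified data format
# (field names follow the unified-format docs; the values are invented).
dialogue = {
    "dataset": "multiwoz21",
    "data_split": "train",
    "dialogue_id": "multiwoz21-train-0",
    "domains": ["restaurant"],
    "turns": [
        {
            "speaker": "user",
            "utterance": "I want a cheap restaurant in the centre.",
            "dialogue_acts": {
                "categorical": [
                    {"intent": "inform", "domain": "restaurant",
                     "slot": "price range", "value": "cheap"},
                ],
                "non-categorical": [],
                "binary": [],
            },
        },
    ],
}
```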
This repo is a snapshot of ConvLab-3; the code for EmoUS can be found in `convlab/policy/emoUS`.
- [Installation](#installation)
- [Tutorials](#tutorials)
- [Unified Datasets](#unified-datasets)
- [Models](#models)
- [Contributing](#contributing)
- [Code Structure](#code-structure)
- [Team](#team)
- [Citing](#citing)
- [License](#license)
If you want the latest version, see the [ConvLab-3 official repo](https://github.com/ConvLab/ConvLab-3).
## Updates
- **2023.2.26**: Updated ConvLab on PyPI to 3.0.1 to reflect bug fixes.
- **2022.11.30**: ConvLab-3 release.
## Installation
You can install ConvLab-3 in one of the following ways according to your needs. We use `torch>=1.10.1,<=1.13` and `transformers>=4.17.0,<=4.24.0`. Higher versions of `torch` and `transformers` may also work.
### Git clone and pip install in development mode (Recommended)
For the latest and most configurable version, we recommend installing ConvLab-3 in development mode.
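A typical development install clones the repository and installs it in editable mode; the exact commands may differ slightly from the official README:

```bash
# clone the repository (a shallow clone keeps the download small)
git clone --depth 1 https://github.com/ConvLab/ConvLab-3.git
# install in development (editable) mode
cd ConvLab-3 && pip install -e .
```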
### Pip install
To use ConvLab-3 as an off-the-shelf tool, you can install via:
```bash
pip install convlab
```
Note that the `data` directory will not be included due to the package size limitation.
### Using Docker
We also provide a [Dockerfile](https://github.com/ConvLab/ConvLab-3/blob/master/Dockerfile) for building a docker image. It basically installs the dependencies from `requirements.txt` and then installs ConvLab-3 in development mode.

### Train EmoUS
Running `build_data.py` generates three files, `train.json`, `validation.json`, and `test.json`, under the folder `convlab/policy/emoUS/unify/data/EmoUS_${dataset}_${dial_ids_order}_${split2ratio}`, where

- `dataset` can be `emowoz+dialmage`, `emowoz`, or `dialmage`; `emowoz+dialmage` is the whole EmoWOZ dataset, composed of human-human (`emowoz`) and human-machine (`dialmage`) task-oriented dialogues,
- `dial_ids_order` can be 0, 1, or 2,
- `split2ratio` can be 0.01, 0.1, or 1.

We trained EmoUS on an A100 or an RTX 6000 GPU. After training, you can train your dialogue system with this simulator. You can also change the `mode` to `"language"`, so that the simulator responds in natural language instead of semantic actions (an interface EmoUS inherits from GenTUS).
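A sketch of the data-building step under these settings, assuming `build_data.py` lives in `convlab/policy/emoUS/unify/`; the flag names below are assumptions derived from the variable names above, so check the script's `--help` for the exact interface:

```bash
# hypothetical invocation of the data-building script; flag names are assumed
python convlab/policy/emoUS/unify/build_data.py \
    --dataset emowoz+dialmage --dial-ids-order 0 --split2ratio 1
```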
### Evaluate EmoUS
The evaluation code can be found with the rest of the EmoUS implementation in `convlab/policy/emoUS`.

## Tutorials
- [Getting Started](https://github.com/ConvLab/ConvLab-3/blob/master/tutorials/Getting_Started.ipynb) (have a try on [Colab](https://colab.research.google.com/github/ConvLab/ConvLab-3/blob/master/tutorials/Getting_Started.ipynb)!)
- [Introduction to Unified Data Format](https://github.com/ConvLab/ConvLab-3/tree/master/data/unified_datasets)
- [Utility functions for unified datasets](https://github.com/ConvLab/ConvLab-3/blob/master/convlab/util/unified_datasets_util.py)
## Unified Datasets
Unified datasets are available under the `data/unified_datasets` directory as well as on the [Hugging Face Hub](https://huggingface.co/ConvLab). We will continue adding the datasets listed in [this issue](https://github.com/ConvLab/ConvLab-3/issues/11). If you want to add a listed or custom dataset to ConvLab-3, please create an issue for discussion first and then create a pull request. We will list you as a [contributor](#team) and highly appreciate your contribution!
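Unified datasets can be loaded through the utility functions in `convlab.util`; a minimal sketch, assuming the `load_dataset` helper documented in the unified datasets tutorial linked above:

```python
from convlab.util import load_dataset

# download (or load the cached) multiwoz21 dataset in the unified format
dataset = load_dataset('multiwoz21')

# the dataset is a dict keyed by split; each split is a list of dialogues
print(dataset.keys())  # e.g. dict_keys(['train', 'validation', 'test'])
print(dataset['train'][0]['turns'][0]['utterance'])
```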
## Models
We list below the newly integrated models in ConvLab-3 that support the unified data format and achieve strong performance. You can follow the links for more details about these models. Other models can be used in the same way as in ConvLab-2.
Trained models are available on [Hugging Face Hub](https://huggingface.co/ConvLab).
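Individual modules are typically composed into a full system via the `PipelineAgent` interface in `convlab.dialog_agent`, carried over from ConvLab-2; a rough sketch, with the concrete component classes left as placeholders:

```python
# Sketch of composing NLU/DST/policy/NLG modules into a single agent.
# Substitute real component instances for the placeholders below
# (e.g. a BERTNLU model, RuleDST, a trained dialogue policy, and an NLG
# model); passing None disables the corresponding stage.
from convlab.dialog_agent import PipelineAgent

nlu = dst = policy = nlg = None  # placeholders for this sketch
sys_agent = PipelineAgent(nlu, dst, policy, nlg, name='sys')
```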
## Contributing
We welcome contributions from the community. Please check the issues to see where help is needed.
- If you want to add a new dataset, model, or other feature, please describe the dataset/model/feature in an issue using the corresponding issue template before creating a pull request.
- Small changes, such as bug fixes, can be made directly via a pull request.
## Code Structure
```bash
.
├── convlab                        # Source code, installed in pypi package
│   ├── dialog_agent               # Interface for dialog agent and session
│   ├── base_models
│   │   └── t5                     # T5 models with a unified training script
│   │       ├── goal2dialogue      # T5-Goal2Dialogue
│   │       ├── dst                # T5-DST
│   │       ├── nlu                # T5-NLU
│   │       ├── nlg                # T5-NLG
│   │       └── rg                 # T5-RG
│   │
│   ├── nlu                        # NLU models, interface, and evaluation script
│   │   ├── jointBERT              # BERTNLU
│   │   ├── milu                   # MILU
│   │   └── svm                    # SVMNLU*
│   │
│   ├── laug                       # Language understanding AUGmentation (LAUG) toolkit
│   │
│   ├── dst                        # DST models, interface, and evaluation script
│   │   ├── rule                   # RuleDST
│   │   ├── setsumbt               # SetSUMBT, has uncertainty estimates
│   ...
```
\*: these models do not support the unified data format and only support MultiWOZ.
## Team
**ConvLab-3** is maintained and developed by the [Tsinghua University Conversational AI](http://coai.cs.tsinghua.edu.cn/) group (THU-COAI), the [Dialogue Systems and Machine Learning Group](https://www.cs.hhu.de/en/research-groups/dialog-systems-and-machine-learning.html) at Heinrich Heine University Düsseldorf, Germany, and Microsoft Research (MSR).
We would like to thank all contributors of ConvLab.
## Citing
If you use ConvLab-3 in your research, please cite:
```
@article{zhu2022convlab3,
  title={ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data Format},
  author={Qi Zhu and Christian Geishauser and Hsien-chin Lin and Carel van Niekerk and Baolin Peng and Zheng Zhang and Michael Heck and Nurul Lubis and Dazhen Wan and Xiaochen Zhu and Jianfeng Gao and Milica Gašić and Minlie Huang},
  journal={arXiv preprint arXiv:2211.17148},
  year={2022},
  url={http://arxiv.org/abs/2211.17148}
}
```

If you use EmoUS, please also cite:

Hsien-Chin Lin, Shutong Feng, Christian Geishauser, Nurul Lubis, Carel van Niekerk, Michael Heck, Benjamin Matthias Ruppik, Renato Vukovic, and Milica Gašić. 2023. EmoUS: Simulating User Emotions in Task-Oriented Dialogues. In Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '23), July 23–27, 2023, Taipei, Taiwan. ACM, New York, NY, USA, 6 pages. https://doi.org/10.1145/3539618.3592092