Commit cd692f19 authored by zqwerty's avatar zqwerty Committed by zhuqi

Notice: The results are for commits before bdc9dba7 (inclusive). We will update the results after improving user policy.
parent da88bd41
......@@ -69,6 +69,8 @@ For more details about these models, You can refer to `README.md` under `convla
## End-to-end Performance on MultiWOZ
*Notice*: The results are for commits before [`bdc9dba`](https://github.com/thu-coai/ConvLab-2/commit/bdc9dba72c957d97788e533f9458ed03a4b0137b) (inclusive). We will update the results after improving user policy.
We perform end-to-end evaluation (1000 dialogues) on MultiWOZ using the user simulator below (a full example is in `tests/test_end2end.py`):
```python
......@@ -141,6 +143,8 @@ By running `convlab2/dst/evaluate.py MultiWOZ $model`:
### Policy
*Notice*: The results are for commits before [`bdc9dba`](https://github.com/thu-coai/ConvLab-2/commit/bdc9dba72c957d97788e533f9458ed03a4b0137b) (inclusive). We will update the results after improving user policy.
By running `convlab2/policy/evaluate.py --model_name $model`:
| | Task Success Rate |
......