diff --git a/data/unified_datasets/README_TEMPLATE.md b/data/unified_datasets/README_TEMPLATE.md index f04e6be4c368042fe56611164e63fd37196dcd03..5c326a17234f0902ac1b7c784580a95d37b8cf11 100644 --- a/data/unified_datasets/README_TEMPLATE.md +++ b/data/unified_datasets/README_TEMPLATE.md @@ -2,7 +2,7 @@ - **Repository:** data link - **Paper:** paper link -- **Leaderboard:** leaderboard link if any else empty +- **Leaderboard:** leaderboard link if any else None - **Who transforms the dataset:** Name(email, e.g. xxx at gmail dot com) ### Dataset Summary diff --git a/data/unified_datasets/multiwoz21/README.md b/data/unified_datasets/multiwoz21/README.md index 027cee8a23503bd06cbd711c468e5dc8b92e2a01..7bf47f2c79ba03faa21fe9d03b4bc8a6e0c89ed8 100644 --- a/data/unified_datasets/multiwoz21/README.md +++ b/data/unified_datasets/multiwoz21/README.md @@ -17,6 +17,7 @@ MultiWOZ 2.1 fixed the noise in state annotations and dialogue utterances. It al - Correct some grammar errors in the text, mainly following `tokenization.md` in MultiWOZ_2.1. - Normalize slot name and value. See `normalize_domain_slot_value` function in `preprocess.py`. - Correct some non-categorical slots' values and provide character level span annotation. + - Concatenate multiple values in user goal & state using `|`. - **Annotations:** - user goal, dialogue acts, state.