crosswoz_en

    zqwerty authored
    0532f3dd

    CrossWOZ EN

    Kaili Huang

    This directory contains the CrossWOZ dataset translated from Chinese to English.

    Ontology translation

    • vocab_dict.json: translations of the ontology terms that appear in the data.
    • ontology_translate.py: a translation function that translates original values using vocab_dict.json.
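
The dictionary-based translation described above can be sketched roughly as follows. This is a minimal illustration, not the code in ontology_translate.py: it assumes vocab_dict.json is a flat mapping from original Chinese strings to English strings, which may not match the real file layout. The names load_vocab and translate_value and the toy entries are hypothetical.

```python
import json


def load_vocab(path):
    """Load a JSON file assumed to hold a flat {chinese: english} mapping."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def translate_value(value, vocab):
    """Return the English entry for `value`, or `value` unchanged if absent."""
    return vocab.get(value, value)


# Toy mapping standing in for vocab_dict.json (entries invented for illustration).
toy_vocab = {"故宫": "The Palace Museum", "天安门": "Tiananmen"}

# A value with no entry (here a phone number) passes through untranslated.
translated = [translate_value(v, toy_vocab) for v in ["故宫", "010-12345678"]]
```

Falling back to the original string keeps values that need no translation intact, at the cost of silently passing through values that are missing from the dictionary.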

    Annotation translations are categorized by slot:

    domain human translation machine translation template translation refer to other slots' translation don't translate
    Attraction name, address duration fee, rating nearby attract., nearby rest., nearby hotels, selectedResults, source domain phone
    Restaurant name, dishes, address open cost, rating nearby attract., nearby rest., nearby hotels, selectedResults, source domain phone
    Hotel name, type, Hotel Facilities, address Hotel Facilities - xxx price, rating nearby attract., nearby rest., nearby hotels, selectedResults, source domain phone
    Taxi to, from, selectedResults car type, plate number
    Metro from station, to station to, from, selectedResults

    Machine translation for dialogue data

    The dialogue data was machine-translated using the translated ontology and Google Translate.

    • [train|val|test].json.zip
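
One possible way to read these archives without unpacking them by hand is sketched below. It assumes each archive holds a single UTF-8 JSON file; the member name is not stated here, so the sketch simply takes the first entry. The helper name load_zipped_json is hypothetical.

```python
import json
import zipfile


def load_zipped_json(zip_path):
    """Parse the first member of a zip archive as JSON.

    Assumption: the archive contains exactly one JSON file.
    """
    with zipfile.ZipFile(zip_path) as archive:
        member = archive.namelist()[0]
        with archive.open(member) as f:
            # json.load accepts the binary file object returned by archive.open.
            return json.load(f)
```

For example, load_zipped_json("train.json.zip") would return the parsed training split if the assumption above holds.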

    Human translation for dialogue data

    • human_val.json.zip: we sampled 250 dialogues from the machine-translated validation set and asked professional translators to correct the utterance translations.