diff --git a/README.md b/README.md index 252a121c39c87e00598ae7fbff313c22af7b6f6e..88c2888e95540ea34aec2f7455404018ad5a9990 100755 --- a/README.md +++ b/README.md @@ -105,23 +105,22 @@ Performance (the first row is the default config for each module. Empty entries | NLU | DST | Policy | NLG | Complete rate | Success rate | Book rate | Inform P/R/F1 | Turn(succ/all) | | ----------- | --------- | -------------- | ----------- | ------------- | ------------ | --------- | --------- | -------------- | -| **BERTNLU** | RuleDST | RulePolicy | TemplateNLG | 92.1 | 85.5 | 91.5 | 79.8/92.8/83.8 | 12.7/13.8 | -| **MILU** | RuleDST | RulePolicy | TemplateNLG | 89.9 | 83.1 | 90.9 | 78.3/91.7/82.5 | 12.1/13.9 | -| **SVMNLU** | RuleDST | RulePolicy | TemplateNLG | 84.2 | 70.4 | 76.1 | 79.1/88.8/81.5 | 14.8/17.7 | -| BERTNLU | RuleDST | RulePolicy | **SCLSTM** | 40.1 | 41.0 | 51.5 | 68.5/56.5/59.1 | 11.6/29.2 | -| BERTNLU | RuleDST | **MLEPolicy** | TemplateNLG | 52.6 | 48.4 | 35.5 | 66.3/72.7/66.0 | 12.5/26.3 | -| BERTNLU | RuleDST | **PGPolicy** | TemplateNLG | 42.9 | 43.3 | 31.0 | 61.9/66.8/60.4 | 14.7/29.1 | -| BERTNLU | RuleDST | **PPOPolicy** | TemplateNLG | 69.7 | 56.6 | 56.6 | 64.8/79.0/68.1 | 12.9/22.1 | -| BERTNLU | RuleDST | **GDPLPolicy** | TemplateNLG | 57.9 | 49.5 | 33.5 | 67.0/76.4/68.2 | 11.5/24.3 | -| None | **MDBT** | RulePolicy | TemplateNLG | 27.7 | 21.2 | 45.4 | 52.2/41.0/42.4 | 11.8/32.1 | -| None | **TRADE** | RulePolicy | TemplateNLG | 29.9 | 25.3 | 36.9 | 49.3/48.1/44.4 | 12.7/24.7 | -| None | **SUMBT** | RulePolicy | TemplateNLG | 34.7 | 33.8 | 57.8 | 52.3/50.6/47.3 | 12.1/26.6 | -| BERTNLU | RuleDST | **MDRG** | None | 27.0 | 25.2 | 49.0 | 46.6/43.1/42.0 | 13.6/33.6 | -| BERTNLU | RuleDST | **HDSA** | None | 35.6 | 27.5 | 5.4 | 47.8/57.2/48.8 | 13.0/31.5 | -| BERTNLU | RuleDST | **LaRL** | None | 40.6 | 34.0 | 45.6 | 47.8/54.1/47.6 | 15.0/28.6 | -| None | **SUMBT** | **LaRL** | None | 39.4| 33.1| 39.5 | 48.5/56.0/48.8| 15.5/28.7| -| None | None | **Sequicity*** | None | 21.7 | 14.0 | 4.9 | 36.3/35.1/32.0 | 18.2/35.2 | -| None | None | **DAMD*** | None | 38.5 | 33.6 | 50.9 | 62.1/60.7/57.4 | 10.4/28.2 | +| **BERTNLU** | RuleDST | RulePolicy | TemplateNLG | 90.5 | 81.3 | 91.1 | 79.7/92.6/83.5 | 11.6/12.3 | +| **MILU** | RuleDST | RulePolicy | TemplateNLG | 93.3 | 81.8 | 93.0 | 80.4/94.7/84.8 | 11.3/12.1 | +| **SVMNLU** | RuleDST | RulePolicy | TemplateNLG | 57.5 | 38.5 | 57.1 | 62.9/67.7/62.6 | 11.0/13.4 | +| BERTNLU | RuleDST | RulePolicy | **SCLSTM** | 48.5 | 40.2 | 56.9 | 62.3/62.5/58.7 | 11.9/27.1 | +| BERTNLU | RuleDST | **MLEPolicy** | TemplateNLG | 42.7 | 35.9 | 17.6 | 62.8/69.8/62.9 | 12.1/24.1 | +| BERTNLU | RuleDST | **PGPolicy** | TemplateNLG | 37.4 | 31.7 | 17.4 | 57.4/63.7/56.9 | 11.0/25.3 | +| BERTNLU | RuleDST | **PPOPolicy** | TemplateNLG | 61.1 | 44.0 | 44.6 | 63.9/76.8/67.2 | 12.5/20.8 | +| BERTNLU | RuleDST | **GDPLPolicy** | TemplateNLG | 49.4 | 38.4 | 20.1 | 64.5/73.8/65.6 | 11.5/21.3 | +| None | **MDBT** | RulePolicy | TemplateNLG | 12.4 | 7.8 | 30.4 | 32.6/21.5/23.9 | 7.9/25.1 | +| None | **TRADE** | RulePolicy | TemplateNLG | 32.4 | 20.1 | 34.7 | 46.9/48.5/44.0 | 11.4/23.9 | +| None | **SUMBT** | RulePolicy | TemplateNLG | 34.5 | 29.4 | 62.4 | 54.1/50.3/48.3 | 11.0/28.1 | +| BERTNLU | RuleDST | **MDRG** | None | 21.6 | 17.8 | 31.2 | 39.9/36.3/34.8 | 15.6/30.5| +| BERTNLU | RuleDST | **LaRL** | None | 34.8 | 27.0 | 29.6 | 49.1/53.6/47.8 |13.2/24.4| +| None | **SUMBT** | **LaRL** | None | 32.9 | 23.7 | 25.9 | 48.6/52.0/46.7 | 12.5/24.3| +| None | None | **Sequicity*** | None | 23.1| 9.8 | 4.1 | 33.0/32.7/29.9 | 12.2/32.6 | +| None | None | **DAMD*** | None | 39.5| 34.3 | 51.4 | 60.4/59.8/56.3 | 15.8/29.8 | *: end-to-end models used as sys_agent directly.