Test Report
Model Name: BERTNLU-RuleDST-MLEPolicy-TemplateNLG
Dataset: multiwoz
Time: 2020-04-27 02:37:46
Overall Results
 Success Rate: 48.4 %
 (Precision, Recall, F1)   :   (0.663,  0.727,  0.660) 
 Average Dialog Turn (Succ): 12.455 
 Average Dialog Turn (All): 26.270 
Metric
|  | Total Num | Succ Rate | Precision | Recall | F1 | Dialog Loop Failed Rate | Dialog Turn (Succ) | Dialog Turn (All) | 
|---|
| hotel | 375 | 0.432 | 0.422 | 0.503 | 0.434 | 0.469 | 8.074 | 19.781 | 
| restaurant | 394 | 0.609 | 0.492 | 0.722 | 0.558 | 0.386 | 10.183 | 20.365 | 
| attraction | 289 | 0.875 | 0.770 | 0.916 | 0.812 | 0.107 | 6.427 | 9.723 | 
| taxi | 99 | 0.677 | 0.747 | 0.712 | 0.724 | 0.323 | 7.313 | 13.192 | 
| train | 327 | 0.795 | 0.867 | 0.868 | 0.862 | 0.202 | 7.646 | 13.125 | 
| police | 23 | 1.000 | 0.935 | 1.000 | 0.957 | 0.000 | 2.000 | 2.000 | 
| hospital | 30 | 0.933 | 0.933 | 0.933 | 0.933 | 0.067 | 4.000 | 6.400 | 
Domain hotel
Domain restaurant
Domain attraction
Domain taxi
Domain train
Domain police
Overall Results
 Success Rate: 100.0 %
 (Precision, Recall, F1)   :   (0.935,  1.000,  0.957) 
 Average Dialog Turn (Succ): 2.000 
 Average Dialog Turn (All): 2.000 
 System NLU Failed Dialog Act:- Inform-Police-none-none- Occur Num:   7
- NLU Output
- Inform-Police-Name-Parkside Police Station    Occur Num:    6
- Request-Police-Addr-?    Occur Num:    1
 
 
User NLU Failed Dialog Act:- Inform-Police-Addr-Parkside, Cambridge- Occur Num:   46
- NLU Output
- Inform-Police-Addr-Parkside , Cambridge    Occur Num:    23
- Inform-Police-Phone-01223358966    Occur Num:    23
 
 
- Inform-Police-Name-none- Occur Num:   46
- NLU Output
- Inform-Police-Addr-Parkside , Cambridge    Occur Num:    23
- Inform-Police-Phone-01223358966    Occur Num:    23
 
 
- Inform-Police-Post-none- Occur Num:   46
- NLU Output
- Inform-Police-Addr-Parkside , Cambridge    Occur Num:    23
- Inform-Police-Phone-01223358966    Occur Num:    23
 
 
- Inform-Police-Phone-01223358966- Occur Num:   23
- NLU Output
- Inform-Police-Addr-Parkside , Cambridge    Occur Num:    23
 
 
Dialog LoopNothing
 Bad Inform Dialog ActNothing
 Request But Not Inform Dialog ActNothing
 Inform But Not Request Dialog Act- inform-police-phone    Occur Num:     3
Domain hospital
Overall Results
 Success Rate: 93.3 %
 (Precision, Recall, F1)   :   (0.933,  0.933,  0.933) 
 Average Dialog Turn (Succ): 4.000 
 Average Dialog Turn (All): 6.400 
 System NLU Failed Dialog Act:- Request-Hospital-Phone-?- Occur Num:   4
- NLU Output
- Request-Police-Phone-?    Occur Num:    2
- Request-Restaurant-Phone-?    Occur Num:    2
 
 
- Inform-Hospital-Department-plastic and vascular surgery plastics- Occur Num:   2
- NLU Output
- Inform-Hospital-none-none    Occur Num:    1
- Inform-Hospital-Department-plastic and vascular surgery    Occur Num:    1
 
 
- Inform-Hospital-Department-haematology and haematological oncology- Occur Num:   1
- NLU Output
- Inform-Hospital-Department-neurosurgery    Occur Num:    1
 
 
- Inform-Hospital-Department-oncology- Occur Num:   1
- NLU Output
- Inform-Hospital-none-none    Occur Num:    1
 
 
- Inform-Hospital-Department-surgery- Occur Num:   1
- NLU Output
- Inform-Hospital-none-none    Occur Num:    1
 
 
- Inform-Hospital-Department-intermediate dependancy area- Occur Num:   1
- NLU Output
- Inform-Hospital-Department-intermediate dependancy    Occur Num:    1
 
 
- Inform-Hospital-Department-children's oncology and haematology- Occur Num:   1
- NLU Output
- Inform-Hospital-none-none    Occur Num:    1
 
 
- Inform-Hospital-Department-emergency department- Occur Num:   1
- NLU Output
- Inform-Hospital-Department-emergency department department    Occur Num:    1
 
 
User NLU Failed Dialog Act:- Inform-Hospital-Addr-none- Occur Num:   58
- NLU Output
- Inform-Hospital-Department-neurosciences critical care unit    Occur Num:    28
- Inform-Hospital-Phone-01223216297    Occur Num:    28
- Request-Hospital-Department-?    Occur Num:    2
 
 
- Inform-Hospital-Department-neurosciences critical care unit- Occur Num:   30
- NLU Output
- Inform-Hospital-Phone-01223216297    Occur Num:    28
- Request-Hospital-Department-?    Occur Num:    2
 
 
- Inform-Hospital-Phone-01223216297- Occur Num:   30
- NLU Output
- Inform-Hospital-Department-neurosciences critical care unit    Occur Num:    28
- Request-Hospital-Department-?    Occur Num:    2
 
 
- Inform-Hospital-Post-none- Occur Num:   6
- NLU Output
- Request-Hospital-Department-?    Occur Num:    2
- Inform-Hospital-Department-neurosciences critical care unit    Occur Num:    2
- Inform-Hospital-Phone-01223216297    Occur Num:    2
 
 
- Request-Hospital-Department-?- Occur Num:   4
- NLU Output
- Inform-Hospital-Department-neurosciences critical care unit    Occur Num:    2
- Inform-Hospital-Phone-01223216297    Occur Num:    2
 
 
Dialog Loop- Request-Hospital-Phone-?    Occur Num:     2
Bad Inform Dialog ActNothing
 Request But Not Inform Dialog Act- request-hospital-phone    Occur Num:     2
Inform But Not Request Dialog ActNothing