Skip to content
Snippets Groups Projects
Commit 1e1b62bd authored by Nurul Fithria Lubis's avatar Nurul Fithria Lubis
Browse files

README.md

parent ef5ac22e
No related branches found
No related tags found
No related merge requests found
......@@ -3,7 +3,7 @@
Code for "Dialogue Evaluation with Offline Reinforcement Learning" paper.
<p align="center">
<img width="700" src="all2.pdf">
<img width="700" src="all2.png">
</p>
In this paper, we propose the use of offline reinforcement learning for dialogue evaluation based on static data.Such an evaluator is typically called a critic and utilized for policy optimization. We go one step further and show that offline RL critics can be trained for any dialogue system as external evaluators, allowing dialogue performance comparisons across various types of systems. This approach has the benefit of being corpus- and model-independent, while attaining strong correlation with human judgements, which we confirm via an interactive user trial.
......
all2.png 0 → 100644
all2.png

95.9 KiB

0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please register or to comment