Code for the paper "Dialogue Evaluation with Offline Reinforcement Learning".
<p align="center">
<imgwidth="700"src="all2.pdf">
<imgwidth="700"src="all2.png">
</p>
In this paper, we propose the use of offline reinforcement learning for dialogue evaluation based on static data. Such an evaluator is typically called a critic and is used for policy optimization. We go one step further and show that offline RL critics can be trained as external evaluators for any dialogue system, enabling dialogue performance comparisons across systems of different types. This approach has the benefit of being corpus- and model-independent while attaining strong correlation with human judgements, which we confirm via an interactive user trial.
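As a rough illustration of the idea (not the repository's actual implementation), the sketch below trains a small state-value critic on logged, static dialogue transitions with a TD(0) objective and then uses its value estimate as an automatic quality score. The fixed-size feature encoding of dialogue states, the network architecture, and the random toy data are all illustrative assumptions.

```python
# Minimal sketch of an offline RL critic for dialogue evaluation.
# Assumptions (illustrative, not the paper's code): dialogue turns are
# encoded as fixed-size feature vectors, and the critic is a small value
# network trained with a TD(0) target on logged transitions only --
# no interaction with a live environment.
import torch
import torch.nn as nn

class Critic(nn.Module):
    def __init__(self, state_dim: int, hidden: int = 128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, state: torch.Tensor) -> torch.Tensor:
        return self.net(state).squeeze(-1)

def td_update(critic, optimizer, batch, gamma: float = 0.99):
    """One TD(0) regression step on a batch of logged transitions."""
    state, reward, next_state, done = batch
    with torch.no_grad():
        # Bootstrapped target from the static data; terminal states get
        # no bootstrap term.
        target = reward + gamma * (1.0 - done) * critic(next_state)
    loss = nn.functional.mse_loss(critic(state), target)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

if __name__ == "__main__":
    # Toy usage on random tensors standing in for encoded dialogue logs.
    torch.manual_seed(0)
    critic = Critic(state_dim=32)
    opt = torch.optim.Adam(critic.parameters(), lr=1e-3)
    for _ in range(100):
        s, s2 = torch.randn(64, 32), torch.randn(64, 32)
        r = torch.rand(64)
        d = torch.bernoulli(torch.full((64,), 0.1))
        td_update(critic, opt, (s, r, s2, d))
    # The trained critic's value of a dialogue state can then serve as a
    # system-agnostic score for whichever dialogue system produced it.
    print(critic(torch.randn(32)).item())
```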