-
- Downloads
refactored RL code, setting seed inside sampler function, evaluate can be...
refactored RL code, setting seed inside sampler function, evaluate can be constrained to only output specific goals, rule based simulator samples max_initiative
Showing
- convlab2/policy/evaluate.py 9 additions, 3 deletionsconvlab2/policy/evaluate.py
- convlab2/policy/gdpl/semantic_level_config.json 1 addition, 6 deletionsconvlab2/policy/gdpl/semantic_level_config.json
- convlab2/policy/gdpl/train.py 6 additions, 2 deletionsconvlab2/policy/gdpl/train.py
- convlab2/policy/pg/semantic_level_config.json 1 addition, 6 deletionsconvlab2/policy/pg/semantic_level_config.json
- convlab2/policy/pg/train.py 6 additions, 2 deletionsconvlab2/policy/pg/train.py
- convlab2/policy/ppo/semantic_level_config.json 1 addition, 6 deletionsconvlab2/policy/ppo/semantic_level_config.json
- convlab2/policy/ppo/train.py 6 additions, 2 deletionsconvlab2/policy/ppo/train.py
- convlab2/policy/rlmodule.py 1 addition, 10 deletionsconvlab2/policy/rlmodule.py
- convlab2/policy/rule/multiwoz/policy_agenda_multiwoz.py 2 additions, 2 deletionsconvlab2/policy/rule/multiwoz/policy_agenda_multiwoz.py
- convlab2/util/custom_util.py 15 additions, 0 deletionsconvlab2/util/custom_util.py
Loading
Please register or sign in to comment