-
- Downloads
DDPT model that stably optimizes from scratch or pre-trained with dataset. We...
DDPT model that stably optimizes from scratch or pre-trained with dataset. We can choose to only use some percentage of the data. Also fixed bug that recommend actions were not outputted and the seed could be overwritten incorrectly
Showing
- convlab/policy/evaluate_distributed.py 5 additions, 2 deletionsconvlab/policy/evaluate_distributed.py
- convlab/policy/gdpl/train.py 2 additions, 2 deletionsconvlab/policy/gdpl/train.py
- convlab/policy/pg/train.py 2 additions, 2 deletionsconvlab/policy/pg/train.py
- convlab/policy/ppo/train.py 2 additions, 2 deletionsconvlab/policy/ppo/train.py
- convlab/policy/rlmodule.py 1 addition, 1 deletionconvlab/policy/rlmodule.py
- convlab/policy/vtrace_DPT/config.json 3 additions, 3 deletionsconvlab/policy/vtrace_DPT/config.json
- convlab/policy/vtrace_DPT/supervised/train_supervised.py 1 addition, 1 deletionconvlab/policy/vtrace_DPT/supervised/train_supervised.py
- convlab/policy/vtrace_DPT/train.py 2 additions, 2 deletionsconvlab/policy/vtrace_DPT/train.py
- convlab/util/custom_util.py 12 additions, 7 deletionsconvlab/util/custom_util.py
Please register or sign in to comment