Utilities for training reinforcement learning policies with the Soft Actor-Critic (SAC) algorithm. The framework is built on TensorFlow Agents (TF-Agents) and includes the following features:
- Following this TF-Agents distributed training example, the framework is cleanly divided into fully independent programs:
- Experience collection workers (each with their own environment)
  - Replay buffer implemented with deepmind/reverb (server sketch below)
- SAC policy trainer
- Can seed the replay buffer with experience collected by a random policy, to encourage early exploration (sketched below)
- Fine-grained control over the number of CPUs allocated to each program (sketched below)
- Checkpointing and TensorBoard logging (sketched below)
- "Supervision" of the training using daemontools/supervise automatically resumes the training from the last checkpoint if some program crashes, which is useful when running on a compute cluster
- SLURM compute cluster support
- Configure the environment hyperparameters and their curriculum with JSON (example below)
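
The sketches below illustrate how the pieces above might fit together; all names, ports, paths, and hyperparameters in them are illustrative assumptions, not the project's actual values.

A standalone replay buffer program can be little more than a Reverb server holding a single table; the collection workers and the trainer then connect to it as clients:

```python
# Minimal sketch of the replay buffer program: a Reverb server with one
# uniformly-sampled FIFO table. Table name, size, and port are assumptions.
import reverb

replay_table = reverb.Table(
    name='uniform_table',  # collectors and trainer must use the same name
    sampler=reverb.selectors.Uniform(),
    remover=reverb.selectors.Fifo(),
    max_size=1_000_000,
    rate_limiter=reverb.rate_limiters.MinSize(1),
)

server = reverb.Server(tables=[replay_table], port=8008)
server.wait()  # serve collectors and the trainer until killed
```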
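Seeding the buffer before training can be done with TF-Agents' `RandomPyPolicy` and a `PyDriver` that writes trajectories into the Reverb table; the environment, table name, and step count here are placeholders:

```python
# Sketch: fill the replay buffer with random-policy experience.
import reverb
from tf_agents.drivers import py_driver
from tf_agents.environments import suite_gym
from tf_agents.policies import random_py_policy
from tf_agents.replay_buffers import reverb_utils

env = suite_gym.load('Pendulum-v1')  # placeholder environment
random_policy = random_py_policy.RandomPyPolicy(
    env.time_step_spec(), env.action_spec())

# Observer that pushes collected trajectories into the Reverb table.
rb_observer = reverb_utils.ReverbAddTrajectoryObserver(
    reverb.Client('localhost:8008'),
    table_name='uniform_table',
    sequence_length=2)

py_driver.PyDriver(
    env, random_policy, [rb_observer], max_steps=10_000).run(env.reset())
```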
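One common way to bound the CPUs a TensorFlow program uses is through TensorFlow's threading configuration (the project may additionally pin CPUs at the OS or SLURM level; the thread counts below are placeholders):

```python
# Sketch: cap TensorFlow's thread pools early in each program's main().
import tensorflow as tf

tf.config.threading.set_intra_op_parallelism_threads(2)  # per-op parallelism
tf.config.threading.set_inter_op_parallelism_threads(2)  # concurrent ops
```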
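Checkpointing and logging might look like the following; the directories are placeholders, and in the real trainer the SAC agent's variables would be among the tracked objects:

```python
# Sketch: periodic checkpoints (for crash recovery) and TensorBoard scalars.
import tensorflow as tf
from tf_agents.utils import common

global_step = tf.Variable(0, dtype=tf.int64)
optimizer = tf.keras.optimizers.Adam(3e-4)  # stands in for the agent's state

checkpointer = common.Checkpointer(
    ckpt_dir='/tmp/sac/train',  # placeholder path
    max_to_keep=3,
    optimizer=optimizer,
    global_step=global_step)
checkpointer.initialize_or_restore()  # restores after a crash, no-op otherwise

writer = tf.summary.create_file_writer('/tmp/sac/tensorboard')
with writer.as_default():
    tf.summary.scalar('train/loss', 0.0, step=global_step)  # example scalar

# ...after some training steps...
checkpointer.save(global_step)  # write a new checkpoint
```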
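daemontools' supervise monitors an executable named `run` inside a service directory and restarts it whenever it exits; combined with the restore-on-start checkpointing above, a crash simply resumes training. A minimal `run` script (the trainer entry point and flag are assumptions) could be:

```sh
#!/bin/sh
# Supervised by daemontools: restarted automatically if the trainer crashes;
# the trainer then resumes from its last checkpoint on startup.
exec python trainer.py --root_dir=/tmp/sac
```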
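An environment configuration file might look like this; the keys are purely hypothetical and depend on the environment being trained:

```json
{
  "environment": {"gravity": 9.8, "max_episode_steps": 1000},
  "curriculum": [
    {"until_step": 100000, "target_speed": 0.5},
    {"until_step": 500000, "target_speed": 1.0}
  ]
}
```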