This repository has been archived by the owner on Dec 11, 2022. It is now read-only.
Release 0.10.0
A complete redesign - non-backward compatible. Enabling multi-agent support.
New features -
- PIP package
- Benchmarks
- Hierarchical Reinforcement Learning (demonstrated by Hierarchical Actor-Critic)
- Tutorials
- Shared memory (e.g. Replay Buffer) between workers
- Tests (unit-tests, reward-based tests, trace-based tests)
- Using Coach as a library (see example here)
New Environments -
- Toy Environments (Exploration Chain, BitFlip)
- DeepMind PySC2 support (Starcraft 2)
- DeepMind Control Suite
New Algorithms -
- Hindsight Experience Replay
- Prioritized Experience Replay
- Hierarchical Actor-Critic
- UCB with Q-Ensembles