This repository has been archived by the owner on Dec 11, 2022. It is now read-only.

Release 0.10.0

gal-leibovich released this 26 Aug 12:25

· 391 commits to master since this release

3fd0bf4

A complete redesign - non-backward compatible. Enabling multi-agent support.

New features -

PIP package
Benchmarks
Hierarchical Reinforcement Learning (demonstrated by Hierarchical Actor-Critic)
Tutorials
Shared memory (e.g. Replay Buffer) between workers
Tests (unit-tests, reward-based tests, trace-based tests)
Using Coach as a library (see example here)

New Environments -

Toy Environments (Exploration Chain, BitFlip)
DeepMind PySC2 support (Starcraft 2)
DeepMind Control Suite

New Algorithms -

Hindsight Experience Replay
Prioritized Experience Replay
Hierarchical Actor-Critic
UCB with Q-Ensembles

Assets 2