Skip to content

v0.5.1

Compare
Choose a tag to compare
@PaParaZz1 PaParaZz1 released this 04 Feb 15:55
· 43 commits to main since this release

Env

  1. add MADDPG pettingzoo example (#774)
  2. polish NGU Atari configs (#767)
  3. fix bug in cliffwalking env (#759)
  4. add PettingZoo replay video demo
  5. change default max retry in env manager from 5 to 1

Algorithm

  1. add QGPO diffusion-model related algorithm (#757)
  2. add HAPPO multi-agent algorithm (#717)
  3. add DreamerV3 + MiniGrid adaption (#725)
  4. fix hppo entropy_weight to avoid nan error in log_prob (#761)
  5. fix structured action bug (#760)
  6. polish Decision Transformer entry (#754)
  7. fix EDAC policy/model bug

Fix

  1. fix env typos
  2. fix pynng requirements bug
  3. fix communication module unittest bug

Style

  1. polish policy API doc (#762) (#764) (#768)
  2. add agent API doc (#758)
  3. polish torch_utils/utils API doc (#745) (#747) (#752) (#755) (#763)

News

  1. AAAI 2024: SO2: A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
  2. LMDrive: Closed-Loop End-to-End Driving with Large Language Models

Full Changelog: v0.5.0...v0.5.1

Contributors: @PaParaZz1 @zjowowen @nighood @kxzxvbk @puyuan1996 @Cloud-Pku @AltmanD @HarryXuancy