Skip to content

v0.5.2

Latest
Compare
Choose a tag to compare
@PaParaZz1 PaParaZz1 released this 27 Jun 08:56
· 13 commits to main since this release

Env

  1. add taxi env (#799) (#807)
  2. add ising model env (#782)
  3. add new Flozen Lake env (#781)
  4. optimize ppo continuous config in MuJoCo (#801)
  5. fix masac smac config multi_agent=True bug (#791)
  6. update/speed up pendulum ppo

Algorithm

  1. fix gtrxl compatibility bug (#796)
  2. fix complex obs demo for ppo pipeline (#786)
  3. add naive PWIL demo
  4. fix marl nstep td compatibility bug

Enhancement

  1. add GPU utils (#788)
  2. add deprecated function decorator (#778)

Style

  1. relax flask requirement (#811)
  2. add new badge (hellogithub) in readme (#805)
  3. update discord link and badge in readme (#795)
  4. fix typo in config.py (#776)
  5. polish rl_utils api docs
  6. add constraint about numpy<2
  7. polish macos platform test version to 12
  8. polish ci python version

News

  1. PsyDI: Towards a Multi-Modal and Interactive Chatbot for Psychological Assessments
  2. ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
  3. UniZero: Generalized and Efficient Planning with Scalable Latent World Models

Full Changelog: v0.5.1...v0.5.2

Contributors: @PaParaZz1 @zjowowen @YinminZhang @TuTuHuss @nighood @ruiheng123 @rongkunxue @ooooo-create @eltociear