-
Notifications
You must be signed in to change notification settings - Fork 315
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature Request] Implement TQC for the example algorithms #1623
Comments
Hi @maxweissenbacher |
Hi @vmoens, that's great, thank you! I'll get started right away and come back here in case I need assistance (more likely than not). |
I cannot seem to get the logger to work for me. By running the (unchanged) training script for the SAC example, trying to execute the
Any suggestions? |
Another (very silly) question... this is my first open source contrib - how do I contribute the code to the github? Do I need to fork the repository first, and submit a pull request? |
For the logger it seems you lack write access in the folder where you're executing the code. For the OSS contribution:
You will need to format the code etc for which I invite you to read the CONTRIBUTE.md file in torchrl. Thanks again for your help! |
Thank you for all this info. I have submitted a pull request now, hopefully I did it right! I couldn't get the logging to work, so I implemented a custom logger to use on my local machine instead - however the logging is just as in the other example files in the pull request, so it should hopefully work as expected for you. |
Motivation
I suggest adding an implementation of TQC to the examples. I suggest adding this as a request in the 'call for distributions' stack. I would be happy to take on the implementation.
Solution
Add a performant, clear and minimal implementation of TQC to the examples. I would base the implementation on the already existing implementation of SAC in the examples. This seems reasonable due to the overall similar structure of the algorithms. In the Stable-Baselines3 library implementation of TQC, the implementation is likewise based on their corresponding implementation of SAC.
Checklist
Tasks
The text was updated successfully, but these errors were encountered: