-
Notifications
You must be signed in to change notification settings - Fork 747
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[PyTorch 2.2.2-1.5.11-SNAPSHOT] Training produces poor MNIST model on Windows #1503
Comments
Is it the same with 2.3.0-1.5.11-SNAPSHOT? |
2.3.0 doesn't work on Windows. Failed to load |
That's strange. the missing library has been added to javacpp, not to pytorch presets. |
No idea why pytorch 2.2.2 would find liomp140 and not pytorch 2.3.0. |
So, after many experiments and code investigations, it turns out that, when the github runner was upgraded with a new version of Visual Studio (about 2 months ago, when we merged Pytorch 2.2.2) the Windows build of I included a fix in PR #1510 that consists in removing on windows the Official build uses MKL, which includes openmp. We could do this (linking dynamically instead of statically), but this would require to add a dependency to MKL, even for people using GPU only. And pytorch uses a 2022 version of MKL. Not sure it would work with the 2024 version of the current MKL presets. I also realized that openblas is not detected by pytorch build. I don't know if and how it is supposed to find it during build. |
Thanks for hard working! |
On macOS and Linux, PyTorch 2.2.2 training with MNIST produces good models that has training accuracy > 90%. However, the same code reports very low accuracy (about 11%) on Windows. The same code works fine with 2.2.1 on Windows. You can run your sample code to reproduce the issue. The below code is to calculate the accuracy:
The text was updated successfully, but these errors were encountered: