-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Upgrade Haystack 1.x to NLTK 3.9 #8238
Comments
Our pre-processing pipeline broke since the nltk update because of the following issues:
We had to create a custom preprocessor component to fix the above issues. Happy to contribute to a fix if needed. |
Closing as #8256 has been integrated on 1.26.x branch and a new 1.26.3 release has been released with this fix. |
We upgraded farm-haystack to 1.26.3 today and we confirm that the preprocessor is working fine with nltk 3.9.1 , so we don't need the custom preprocessor anymore. Thanks a lot for your prompt response to this issue! |
Awesome, thanks for reporting back @sagarneeldubey much appreciated 🚀 |
In Haystack 1.26.x we should replace the
nltk.download("punkt")
withnltk.download('punkt_tab')
herehaystack/haystack/nodes/preprocessor/preprocessor.py
Line 123 in 883cd46
so that users can use Haystack 1.26.x with NLTK 3.9. Prior NLTK versions are affected by https://nvd.nist.gov/vuln/detail/CVE-2024-39705. We should therefore also pin NLTK to >=3.9.
While the NLTK release notes list 3.8.2 https://pypi.org/project/nltk/#history with the fix, that release disappeared from pypi. https://pypi.org/project/nltk/#history
There is a comment on GitHub saying that the release was deleted and there will be a 3.9 nltk/nltk#3301 (comment)
The text was updated successfully, but these errors were encountered: