New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: cannot unload pdf #2395
Comments
Thank you for providing the detailed logs. It indeed seems like the NLTK library is unable to download the 'punkt' tokenizer due to some network issue. To better assist you, could you please provide the following additional information:
This information will help me understand the context better and provide a more accurate solution.
|
@dosu-bot. I am running the application in a virtual environment or Docker. Using Python 3.11. The system is Windows 10. try running the command nltk.download('punkt') in a Python shell the out is :""" Python 3.11.8 (tags/v3.11.8:db85d51, Feb 6 2024, 22:03:32) [MSC v.1937 64 bit (AMD64)] on win32
|
It appears the issue stems from the NLTK library not being installed or properly recognized in your environment, which is why you're seeing a pip install nltk Once NLTK is installed, you can proceed to download the 'punkt' tokenizer with the following Python commands: import nltk
nltk.download('punkt') Make sure these commands are executed within the same virtual environment or Docker where your project is running to avoid any discrepancies. Regarding the quivr codebase and its dependencies, it seems the NLTK library is not listed as a dependency in its requirements or setup files. You can confirm this by reviewing the Pipfile in the repository. If you encounter any further issues or have additional questions, feel free to reach out. Sources
|
What happened?
A bug happened!
Relevant log output
Twitter / LinkedIn details
No response
The text was updated successfully, but these errors were encountered: