Following dependencies are missing: pikepdf. Please install them using pip install pikepdf. #2984
Comments
@OLR-Nadia please copy and paste in the exact error message with the full stack trace so we can see where this message is arising.
Also, please mention how you installed
@scanny Hi Steve, the error message appears as printed output only (see the image with the green mark).
All of these commands should be run after activating your virtualenv. First, you'll need to install
> pip install unstructured[pdf]
Then you should be able to see
> pip list
And this command should produce no error messages:
> python -c "import pikepdf"
If that all works, try again and let us know how you go.
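The import check above can also be run as a small script, which reports every missing package at once. This is a minimal sketch, not part of unstructured: the helper name `missing_modules` is hypothetical, and it only confirms that the current interpreter can locate the top-level packages, not that they import without errors.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names the current interpreter cannot locate."""
    # find_spec returns None when a top-level module is not installed
    # in the environment this interpreter is running from.
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Package names taken from the thread; extend the list as needed.
    missing = missing_modules(["pikepdf", "unstructured"])
    if missing:
        print("Not importable from this environment:", ", ".join(missing))
    else:
        print("All listed packages were found.")
```

Run it with the same interpreter that runs the failing script; if it reports pikepdf as missing there but not in the activated virtualenv, the two are using different environments.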
Ok, next thing to try is running this outside of VSCode, like in a terminal/command-line application. I don't completely understand why, but we've had other reports like this where the VSCode terminal was doing something unexpected. It's possible it's related to the VSCode environment setting: https://code.visualstudio.com/docs/python/environments#_creating-environments
Anyway, we'll want to eliminate VSCode subtleties for diagnostic purposes, and running it directly from the command line in the activated virtualenv should do that. Btw are you using
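One way to compare the VSCode terminal with a plain terminal is to print which interpreter is actually running in each. The sketch below is an illustration only, with a hypothetical helper name `interpreter_info`; it assumes a venv-style environment, where `sys.prefix` differs from `sys.base_prefix` when the environment is active.

```python
import sys


def interpreter_info():
    """Describe the interpreter that is executing this script."""
    return {
        # Full path of the python executable in use.
        "executable": sys.executable,
        # Root of the environment packages are installed into.
        "prefix": sys.prefix,
        # For venv-style environments the two prefixes differ when active.
        "in_virtualenv": sys.prefix != sys.base_prefix,
    }


if __name__ == "__main__":
    for key, value in interpreter_info().items():
        print(f"{key}: {value}")
```

If the output differs between the two terminals, the packages were most likely installed into an environment other than the one running the script.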
Hi Steve, I wanted to update you that the issue has been solved. I initially tried installing and uninstalling the library via the command line, outside of VSCode, and encountered a LongPathEnabled issue on my Windows system, which prevented the installation of the unstructured-inference library. After addressing this, I successfully installed the unstructured-inference and pikepdf libraries separately. Thank you for your assistance.
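For anyone hitting the same installation failure, the long-path setting can be inspected from Python. This is a sketch under an assumption: that the setting mentioned above is the Windows registry value `LongPathsEnabled` under `HKLM\SYSTEM\CurrentControlSet\Control\FileSystem`. The helper name `long_paths_enabled` is hypothetical, and it only reads the value; changing it requires administrator rights.

```python
import sys


def long_paths_enabled():
    """Return True or False on Windows; None elsewhere or if the value is unreadable."""
    if sys.platform != "win32":
        return None
    import winreg  # standard library, available on Windows only

    key_path = r"SYSTEM\CurrentControlSet\Control\FileSystem"
    try:
        with winreg.OpenKey(winreg.HKEY_LOCAL_MACHINE, key_path) as key:
            # Assumed value name; 1 means long paths are enabled.
            value, _value_type = winreg.QueryValueEx(key, "LongPathsEnabled")
    except OSError:
        # Key or value missing, or access denied.
        return None
    return value == 1


if __name__ == "__main__":
    print("Long paths enabled:", long_paths_enabled())
```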
@OLR-Nadia glad you got it working :) Can you say any more about the
@OLR-Nadia This is terrific, thanks so much for this Nadia! :)
I am currently developing a RAG application using langchain and unstructured to read PDF files, but I have run into an error that I cannot solve, and it prevents me from moving on to the next part of the code.
from unstructured.partition.pdf import partition_pdf
fpath = "files/"
fname = "darlie-brief.pdf"
Printed error message:
Following dependencies are missing: pikepdf. Please install them using pip install pikepdf.
PDF text extraction failed, skip text extraction...
current version I'm using:
All libraries are installed in a venv.