-
Notifications
You must be signed in to change notification settings - Fork 301
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
No text detected in pdf #620
Comments
Hi @rafaeldepablo, Ignore my comment if you find it irrelevant. I am not in Parsr team, I have some problem with Table detection so I looked around to see if anyone have the same. I tried to run your document, Parsr can detect fine with your document. I think you may missed something when you do the setting when you uploaded document. Here is how I configured |
Thanks I tried again and it crashed, but I retried again and it worked. Regards rafa |
SABADELL_GOBIERNO_CORPORATIVO_2022.pdf
Summary
great software
I'm running into strange behavior on some pdfs, apparently it's not finding any text except on the first sheet.
The pdf files are normal, it is possible to copy the text and search.
Instead if it finds the tables even though the text is blank.
Steps To Reproduce
Load the pdf and try
Expected behavior
The text is processed
Actual behavior
No text is identified
Screenshots
Environment
sudo docker run -p 3001:3001 axarev/parsr:latest
Thanks in advance
The text was updated successfully, but these errors were encountered: