We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The pdf file generated using hocr-pdf has Hebrew text printed in the opposite direction.
hocr-pdf
Steps I followed:
hocr-pdf --savefile output.pdf actual-file.jpg
The pdf file has Hebrew text inserted in it but in the reverse order.
Hebrew is a right to left language so not sure if I have to pass any language or direction parameters to get this right.
The text was updated successfully, but these errors were encountered:
I am afraid that hocr-pdf was never tested with RTL text. Using bidi like in https://github.com/tesseract-ocr/tesstrain/blob/master/generate_wordstr_box.py might fix that.
bidi
Sorry, something went wrong.
Thanks! I will try it out and see if that works for me.
@smijo149 Looks like you solved this. I wonder if the maintainers of hocr-tools would be interested in your PR?
@joewiz Yeah I was able to solve the issue based on @stweil suggestion. I have opened a PR #165 if anyone is interested. Thanks!
No branches or pull requests
The pdf file generated using
hocr-pdf
has Hebrew text printed in the opposite direction.Steps I followed:
hocr-pdf --savefile output.pdf actual-file.jpg
to generate pdf file.The pdf file has Hebrew text inserted in it but in the reverse order.
Actual image:
This is how my hocr file looks:
Text in pdf file: (I have set text visibility mode to 0 so that the inserted text is visible)
Hebrew is a right to left language so not sure if I have to pass any language or direction parameters to get this right.
The text was updated successfully, but these errors were encountered: