Hi there, I am parsing a PDF with tables and I'd like to detect formatting such as italics and bold in the text. Any ideas on whether that's possible (or any hacks anyone has) and how to do it? Edit: I have this code snippet that works for characters:
Which is great, but how do I do this for a given table?
Answered by enrac5, Mar 12, 2024
Replies: 1 comment 11 replies
You've got the right idea about finding bold/italic through the fontname. Re. your final question, however, it depends on what you mean by "do this for a given table". Could you expand on what you mean, and perhaps provide an example PDF?
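For anyone following along, here is a minimal sketch of the fontname check. It assumes the library in question is pdfplumber, where each entry in `page.chars` is a dict with a `fontname` key. The helper name `font_style` and the keywords it looks for are illustrative assumptions; font naming varies between PDFs, so some fonts will not match.

```python
def font_style(fontname):
    """Guess (bold, italic) from a font name such as 'ABCDEF+Arial-BoldMT'."""
    # Embedded fonts often carry a subset prefix like "ABCDEF+"; drop it.
    name = fontname.split("+")[-1].lower()
    bold = "bold" in name
    italic = "italic" in name or "oblique" in name
    return bold, italic


# Stand-in character dicts shaped like pdfplumber's page.chars
# (only the keys used here; real entries carry many more).
chars = [
    {"text": "a", "fontname": "ABCDEF+Arial-BoldMT"},
    {"text": "b", "fontname": "TimesNewRomanPS-ItalicMT"},
    {"text": "c", "fontname": "Helvetica"},
]
styles = [font_style(c["fontname"]) for c in chars]
```

This is a heuristic only: a PDF can also fake bold or italic through rendering tricks that leave no trace in the font name.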
Ah, it looks like I needed to use `find_tables(...)`, not the extraction methods. Hmmm, this is much better than where I was, thank you!
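In case it helps others, here is a rough sketch of how cell bounding boxes can be combined with character-level font names. It assumes pdfplumber, that each cell is an `(x0, top, x1, bottom)` tuple, and that chars carry `x0`, `x1`, `top`, `bottom`, `text` and `fontname` keys. The helper names `chars_in_bbox` and `cell_summary` are made up for illustration, and the data below is a stand-in rather than output from a real PDF.

```python
def chars_in_bbox(chars, bbox):
    """Return the chars whose centre point falls inside bbox = (x0, top, x1, bottom)."""
    x0, top, x1, bottom = bbox
    return [
        c for c in chars
        if x0 <= (c["x0"] + c["x1"]) / 2 <= x1
        and top <= (c["top"] + c["bottom"]) / 2 <= bottom
    ]


def cell_summary(chars, bbox):
    """Summarise the text and font-name-based formatting of one table cell."""
    inside = chars_in_bbox(chars, bbox)
    names = [c["fontname"].lower() for c in inside]
    return {
        "text": "".join(c["text"] for c in inside),
        "bold": any("bold" in n for n in names),
        "italic": any("italic" in n or "oblique" in n for n in names),
    }


# Stand-in data. With pdfplumber, chars would come from page.chars and the
# cell boxes from the tables returned by page.find_tables() (assumption:
# each table exposes its cell bounding boxes, e.g. via table.cells).
chars = [
    {"text": "H", "fontname": "Arial-BoldMT", "x0": 10, "x1": 18, "top": 10, "bottom": 20},
    {"text": "i", "fontname": "Arial-BoldMT", "x0": 18, "x1": 22, "top": 10, "bottom": 20},
    {"text": "x", "fontname": "ArialMT", "x0": 60, "x1": 66, "top": 10, "bottom": 20},
]
cells = [(5, 5, 50, 25), (50, 5, 100, 25)]
summaries = [cell_summary(chars, cell) for cell in cells]
```

Testing the centre point rather than the full box keeps characters that slightly overlap a cell border from being dropped. The text is joined in whatever order the chars arrive, so sort them first if reading order matters.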