-
Notifications
You must be signed in to change notification settings - Fork 664
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support for marked content section IDs #961
Conversation
Codecov Report
@@ Coverage Diff @@
## develop #961 +/- ##
=========================================
Coverage 100.00% 100.00%
=========================================
Files 18 18
Lines 1588 1613 +25
=========================================
+ Hits 1588 1613 +25
|
Note! This page only extracts marked-content identifiers for sequences of objects. There
|
Many thanks for this, @dhdaines! It's a clever solution, and adds what seems like will be a powerful feature for people working with PDFs that have marked content. For now, I'm going to mark |
Thank you! I will submit another PR soon to add the tag attributes, as these are useful for identifying headers and footers. |
As requested, this is the MCID part of #937 split out. Structure tree support (using
pdfminer.six
) will be a separate PR.