You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Might also be an issue with Textract's output itself, rather than this library's parsing. This issue seems isolated to a specific PDF, and other pdfs work fine. Notes: seems to be something related to the metadata / structure of the file itself, multiple runs + changing orientiation + deleting pages does not seem to fix the issue.
Is this an error than anyone else has encountered / figured out a resolution for?
The text was updated successfully, but these errors were encountered:
get_text_from_layout_json
throws'NoneType' object is not subscriptable
for a specific PDF.Unfortunately, I can't share the specific PDF for privacy reasons — but this line seems to be the cause:
amazon-textract-textractor/prettyprinter/textractprettyprinter/t_pretty_print_layout.py
Line 173 in 9fb7d22
Might also be an issue with Textract's output itself, rather than this library's parsing. This issue seems isolated to a specific PDF, and other pdfs work fine. Notes: seems to be something related to the metadata / structure of the file itself, multiple runs + changing orientiation + deleting pages does not seem to fix the issue.
Is this an error than anyone else has encountered / figured out a resolution for?
The text was updated successfully, but these errors were encountered: