harvest halts when it encounters a directory with mode 000
🐛 Describe the bug
harvest halts when it encounters a directory with mode 000. The last line it logs is an [ERROR] containing only the directory path.
📜 To Reproduce
In a bundle with many collections, make half of the collection directories unreadable (mode 000), then run harvest on the bundle, e.g.:
% /usr/local/build11/harvest-3.6.0/bin/harvest -c killme.xml
[SUMMARY] Reading configuration from /data/home/pds4/test/killme.xml
[SUMMARY] Output directory: /tmp/harvest/out
[SUMMARY] Elasticsearch URL: https://search-en-prod-di7dor7quy7qwv3husi2wt5tde.us-west-2.es.amazonaws.com:443, index: registry
[INFO] Connecting to Elasticsearch
[INFO] Loading PDS to ES data type mapping from /usr/local/build11/harvest-3.6.0/elastic/data-dic-types.cfg
[INFO] Processing directory: /data/pds4/1700/killme
[INFO] Processing /data/pds4/1700/killme/resource/resource.maven_archive_information_1.0.xml
[INFO] Processing /data/pds4/1700/killme/resource/resource.phoenix_archive_information_1.0.xml
[INFO] Processing /data/pds4/1700/killme/resource/resource.ladee_archive_information_1.0.xml
[INFO] Processing /data/pds4/1700/killme/resource/Collection_resource_v1.0.xml
[INFO] Wrote 1 collection inventory document(s)
[INFO] Processing /data/pds4/1700/killme/resource/resource.phoenix_archive_information_1.1.xml
[INFO] Processing /data/pds4/1700/killme/miscellaneous/collection_miscellaneous.xml
[INFO] Wrote 1 collection inventory document(s)
[INFO] Processing /data/pds4/1700/killme/miscellaneous/lids_deprecated.xml
[INFO] Wrote 7 product(s)
[ERROR] /data/pds4/1700/killme/investigation
🕵️ Expected behavior
I would like harvest to skip such directories and continue with the rest of the bundle instead of quitting.
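To make the requested behavior concrete, here is a minimal Python sketch of a directory walk that records unreadable directories and keeps going. It is not harvest's code (harvest is a separate tool and its crawler is not shown in this report); the function name `find_labels` and the `.xml` filter are assumptions made only for illustration.

```python
import os


def find_labels(root):
    """Walk root and return (labels, skipped).

    labels  -- paths of *.xml files found in readable directories
    skipped -- directories that could not be listed (e.g. mode 000)
    Hypothetical helper for illustration; not part of harvest.
    """
    labels = []
    skipped = []

    def on_error(err):
        # os.walk passes the OSError raised while listing a directory;
        # recording it here lets the walk continue with the next directory.
        skipped.append(err.filename)

    for dirpath, dirnames, filenames in os.walk(root, onerror=on_error):
        dirnames.sort()  # visit subdirectories in a stable order
        for name in sorted(filenames):
            if name.lower().endswith(".xml"):
                labels.append(os.path.join(dirpath, name))
    return labels, skipped
```

With a layout like the one in the log above, a call such as `find_labels("/data/pds4/1700/killme")` would be expected to return the labels under `resource` and `miscellaneous` and to list `investigation` in `skipped`, provided the process is not running as root (root can read mode 000 directories).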
📚 Version of Software Used
Harvest version: 3.6.0
Build time: 2022-04-13T17:42:54Z
🩺 Test Data / Additional context
🏞Screenshots
🖥 System Info
🦄 Related requirements
⚙️ Engineering Details