-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Integrate new aggregation subsystem based content importer module #1361
Commits on Jun 24, 2022
-
Closes #1298: Refactor the way IIS imports contents by dumping Object…
…Store in favour of newly introduced PDF aggregation service Initial implementation, proof of concept.
Configuration menu - View commit details
-
Copy full SHA for 692128a - Browse repository at this point
Copy the full SHA 692128aView commit details -
Closes #1298: Refactor the way IIS imports contents by dumping Object…
…Store in favour of newly introduced PDF aggregation service Refined implementation. Subworkflow still to be integrated with IIS main workflow.
Configuration menu - View commit details
-
Copy full SHA for 90b86d0 - Browse repository at this point
Copy the full SHA 90b86d0View commit details -
Closes #1298: Refactor the way IIS imports contents by dumping Object…
…Store in favour of newly introduced PDF aggregation service Limiting entries to the ones with actual_url not null in other not to violate the DocumentContentURL schema constraints.
Configuration menu - View commit details
-
Copy full SHA for e94c5bb - Browse repository at this point
Copy the full SHA e94c5bbView commit details -
Closes #1298: Refactor the way IIS imports contents by dumping Object…
…Store in favour of newly introduced PDF aggregation service Renaming `actual_url` to `location` as a column reference pointing to S3 content location.
Configuration menu - View commit details
-
Copy full SHA for 3031c79 - Browse repository at this point
Copy the full SHA 3031c79View commit details -
Closes #1298: Refactor the way IIS imports contents by dumping Object…
…Store in favour of newly introduced PDF aggregation service Enabling the new hive-based content metadata importer with content_url importer uber workflow and metadataextraction cache builder.
Configuration menu - View commit details
-
Copy full SHA for 36606ab - Browse repository at this point
Copy the full SHA 36606abView commit details -
Closes #1298: Refactor the way IIS imports contents by dumping Object…
…Store in favour of newly introduced PDF aggregation service Excluding conflicting jackson libraries.
Configuration menu - View commit details
-
Copy full SHA for 8cf1b38 - Browse repository at this point
Copy the full SHA 8cf1b38View commit details -
Closes #1342: Fix content_url_chain subworkflow actions order for cac…
…he builder workflow
Configuration menu - View commit details
-
Copy full SHA for 2439e41 - Browse repository at this point
Copy the full SHA 2439e41View commit details
Commits on Jun 29, 2022
-
Closes #1298: Refactor the way IIS imports contents by dumping Object…
…Store in favour of newly introduced PDF aggregation service The following new properties are supported: * `import_content_pdfaggregation_table_name` to be specified in at runtime 'dbname.tablename' format to indicate both hive database and table * `import_content_pdfaggregation_hive_metastore_uris` with URIs pointing to the hive metastore utilized by the PDF aggregation subsystem, to be defined statically in IIS environment (default-config.xml file) New hive-based PDF aggregation service support is automatically enabled by providing explicit `import_content_pdfaggregation_table_name` parameter value at runtime. When the parameter is unspecified then content importer module works in legacy mode (objectstore compatible).
Configuration menu - View commit details
-
Copy full SHA for d481c39 - Browse repository at this point
Copy the full SHA d481c39View commit details -
Closes #1298: Refactor the way IIS imports contents by dumping Object…
…Store in favour of newly introduced PDF aggregation service Applying code review fixes.
Configuration menu - View commit details
-
Copy full SHA for 4c1b3a2 - Browse repository at this point
Copy the full SHA 4c1b3a2View commit details