Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add functionality for pandas_to_eland to handle columns with . #424

Open
wants to merge 7 commits into
base: main
Choose a base branch
from

Conversation

Ashton-Sidhu
Copy link
Contributor

Resolves #418

This PR adds the functionality to handle nested Elasticsearch fields and ETL processes that have dataframes that use periods in column names that map to nested fields in the Elastic mapping.

Instead of trying to take each column with a . in the pandas dataframe, create a nested mapping to map the Elasticsearch mapping format and change the verify_mapping_compatability function to be able to compare nested schemas - I took the Elastic mapping schema and then compressed it so it's column_name: type ( ex. {"a.b.c.d": {"type": "long"}} ). This significantly simplified the implementation.

@Ashton-Sidhu Ashton-Sidhu changed the title Verify mapping Add functionality for pandas_to_eland to handle columns with . Jan 5, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Inconsistency between Eland mapping and Elastic Mapping when appending to an index with . in the column names
1 participant