EarthbeamDAG.upload_to_s3 does not check for naming conflicts #70

@jayckaiser

Description

In the (rare) case where multiple input files are uploaded to S3 with the same name, they will overwrite one another. This can occur when processing Parquet files, where each file ends up with a name like part.N.parquet.

We need to update the code to check whether any files are identically named and, when they are, to append additional metadata to their names to prevent collisions.
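
One possible shape for that check, as a rough sketch only: the helper below is hypothetical (`dedupe_s3_keys` and its parameters are not part of `EarthbeamDAG`), and it assumes the disambiguating metadata can be a short hash of each file's full local path. It leaves uniquely named files untouched and only renames files whose basename appears more than once.

```python
import hashlib
from collections import Counter
from pathlib import PurePosixPath


def dedupe_s3_keys(local_paths, s3_prefix):
    """Map each local path to an S3 key, disambiguating repeated basenames.

    Hypothetical helper for illustration; not the existing upload_to_s3 logic.
    """
    # Drop exact duplicate paths while preserving input order.
    paths = list(dict.fromkeys(local_paths))
    # Count how often each basename occurs across all inputs.
    name_counts = Counter(PurePosixPath(p).name for p in paths)

    keys = {}
    for path in paths:
        p = PurePosixPath(path)
        name = p.name
        if name_counts[name] > 1:
            # Colliding basename: insert a short hash of the full local path
            # before the final extension (assumed form of "additional metadata").
            digest = hashlib.sha1(str(p).encode("utf-8")).hexdigest()[:8]
            name = f"{p.stem}_{digest}{p.suffix}"
        keys[path] = f"{s3_prefix.rstrip('/')}/{name}"
    return keys
```

With inputs such as `a/part.0.parquet` and `b/part.0.parquet`, each file would receive a distinct key under the prefix, while a file with a unique name keeps its original name. A hash of the path is only one option; a tenant code, timestamp, or parent directory name might be more readable if that information is available at upload time.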
