Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parquet Encryption #1109

Open
norberttech opened this issue Jul 4, 2024 · 0 comments
Open

Parquet Encryption #1109

norberttech opened this issue Jul 4, 2024 · 0 comments

Comments

@norberttech
Copy link
Member

https://parquet.apache.org/docs/file-format/data-pages/encryption/

Parquet files containing sensitive information can be protected by the modular encryption mechanism that encrypts and authenticates the file data and metadata - while allowing for a regular Parquet functionality (columnar projection, predicate pushdown, encoding and compression).

The results of compression of column pages are encrypted before being written to the output stream. A new Thrift structure, with column crypto metadata, is added to column chunks of the encrypted columns. This metadata provides information about the column encryption keys.

That would make the parquet format even more attractive as it could significantly improve the security of data at rest

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: Todo
Development

No branches or pull requests

1 participant