Add support for encrypted/protected data type in iceberg table

### Feature Request / Improvement

I am working on extending Apache Iceberg supported data types with a new complex type: 'ProtctedType'.
This new data type internally is a StructType including a header and a payload.
The Header to include at minimum:
1. Encryption Provider ID
2. Encryption Key ID
3. Data Type

The payload to include the encrypted data as BinaryType.

The goal is to allow end user transparent interaction with the new type, allowing operations between encrypted data items, and clear text.
Further more, allow extension of puffin files to store aggregate data based on the clear text values, bloom filters, and optionally inverted index for gerex search without a full table scan.

Looking for guidance  on how such data type can be introduced and what are the dependencies I would need to address with the various readers and writers.

[protected_type_merge.txt](https://github.com/user-attachments/files/18562222/protected_type_merge.txt)

 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add support for encrypted/protected data type in iceberg table #1582

Feature Request / Improvement

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Add support for encrypted/protected data type in iceberg table #1582

Description

Feature Request / Improvement

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions