Skip to content

Conversation

@the-other-tim-brown
Copy link
Contributor

@the-other-tim-brown the-other-tim-brown commented Dec 9, 2025

Describe the issue this Pull Request addresses

Addresses #17469
The HoodieFileGroupReader is our main reader abstraction but it is still using the Avro schema instead of the internal HoodieSchema.

Summary and Changelog

  • Updates the HoodieFileGroupReader and FileGroupReaderSchemaHandler to operate solely on HoodieSchema instead of the Avro schema class.
  • Updates callers to pass in HoodieSchema. If the caller requires some schema manipulation or fetching before the call to the HoodieFileGroupReader, this is also updated to use HoodieSchema

Impact

Migrates core reader paths to use new HoodieSchema

Risk Level

Low

Documentation Update

Contributor's checklist

  • Read through contributor's guide
  • Enough context is provided in the sections above
  • Adequate tests were added if applicable

@github-actions github-actions bot added the size:XL PR with lines of changes > 1000 label Dec 9, 2025
@the-other-tim-brown the-other-tim-brown force-pushed the hoodie-schema-file-group-reader branch from 9a2446d to 467eac4 Compare December 10, 2025 02:49
@hudi-bot
Copy link
Collaborator

CI report:

Bot commands @hudi-bot supports the following commands:
  • @hudi-bot run azure re-run the last Azure build

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

size:XL PR with lines of changes > 1000

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants