Releases: G-Research/ParquetSharp.Dataset
ParquetSharp.Dataset 0.4.0
What's Changed
- Allow ignoring files and directories using a prefix by @adamreeve in #29
Full Changelog: 0.3.1...0.4.0
ParquetSharp.Dataset 0.3.1
What's Changed
- Dispose of Parquet RecordBatches when creating a new filtered batch by @adamreeve in #25
Full Changelog: 0.3.0...0.3.1
ParquetSharp.Dataset 0.3.0
What's Changed
- Add support for Arrow LargeString, LargeBinary and LargeList types by @adamreeve in #24
- Bumped minimum supported Apache.Arrow version to 18.0.0
Full Changelog: 0.2.0...0.3.0
ParquetSharp.Dataset 0.2.0
What's Changed
- Add more binary comparison operators for filtering integer typed columns by @adamreeve in #21
- Allow filtering date and timestamp columns using binary comparisons by @adamreeve in #22
Full Changelog: 0.1.0...0.2.0
ParquetSharp.Dataset 0.1.0
What's Changed
- Fix incompatibility with Arrow 17.0.0 by @adamreeve in #19
- Bump ParquetSharp dependency to stable 15.0.2.1 release by @adamreeve in #20
Full Changelog: 0.1.0-beta4...0.1.0
ParquetSharp.Dataset 0.1.0-beta4
Note: This release has a breaking behaviour change. IsInRange methods now treat the range end as exclusive rather than inclusive.
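To make the change concrete, here is a minimal sketch of the two interpretations of a range end. It is written in plain Python and does not use the library's API. The function names are hypothetical, and treating the range start as inclusive is an assumption, since the note above only describes the range end.

```python
# Hypothetical helpers illustrating range-end semantics only;
# these are NOT part of the ParquetSharp.Dataset API.

def in_range_inclusive(value, start, end):
    """Earlier behaviour as described in the note: a value equal to the range end matches."""
    return start <= value <= end


def in_range_exclusive(value, start, end):
    """Behaviour from 0.1.0-beta4: a value equal to the range end does not match."""
    return start <= value < end


values = [1, 5, 10]

# With an inclusive end, the value 10 is part of the range 1..10.
inclusive_matches = [v for v in values if in_range_inclusive(v, 1, 10)]

# With an exclusive end, the value 10 falls outside the range 1..10.
exclusive_matches = [v for v in values if in_range_exclusive(v, 1, 10)]
```

Under these assumptions, only values exactly equal to the range end are affected, so existing filters that relied on the end value matching may need their upper bound adjusted.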
What's Changed
- Set licence expression by @thompson-tomo in #15
- Handle data files with missing columns for nullable fields by @adamreeve in #16
- Add IsInRange filter method for Timestamp columns by @adamreeve in #18
- Make range filters use exclusive upper bounds by @adamreeve in #17
Full Changelog: 0.1.0-beta3...0.1.0-beta4
ParquetSharp.Dataset 0.1.0-beta3
What's Changed
- Fix only the first record batch being read from each file by @adamreeve in #12
- Fix reading all row groups when none are selected by @adamreeve in #13
Full Changelog: 0.1.0-beta2...0.1.0-beta3
ParquetSharp.Dataset 0.1.0-beta2
What's Changed
- Add initial implementation by @adamreeve in #1
- Configure CI and formatting by @adamreeve in #2
- Support filtering data based on data file columns by @adamreeve in #3
- Add tests demonstrating reading encrypted files by @adamreeve in #5
- Skip row groups based on statistics by @adamreeve in #4
- Support filtering using date columns by @adamreeve in #7
- Add benchmark project and improve performance of constant array creation by @adamreeve in #8
- Fix mistake in README by @adamreeve in #6
- Improve performance of computing filter masks by @adamreeve in #10
- Return query results ordered by partition values by @adamreeve in #9
- Allow specifying columns to exclude rather than columns to read by @adamreeve in #11
Full Changelog: https://github.com/G-Research/ParquetSharp.Dataset/commits/0.1.0-beta2
ParquetSharp.Dataset 0.1.0-beta1
What's Changed
- Add initial implementation by @adamreeve in #1
- Configure CI and formatting by @adamreeve in #2
- Support filtering data based on data file columns by @adamreeve in #3
- Add tests demonstrating reading encrypted files by @adamreeve in #5
- Skip row groups based on statistics by @adamreeve in #4
Full Changelog: https://github.com/G-Research/ParquetSharp.Dataset/commits/0.1.0-beta1