All notable changes to this project will be documented in this file.
- Support loading hudi global configs (#118) by @zzhpro
- Add base file records' in-memory size to `FileStats` (#140) by @xushiyan
- Support partition prune api (#119) by @KnightChess
- Add partition filter arg in Python APIs (#153) by @xushiyan
- Add `HudiFileGroupReader` with consolidated APIs to read records (#164) by @xushiyan
- Add `TableBuilder` API for creating `Table` instances (#163) by @kazdy
- Implement datafusion `TableProviderFactory` (#162) by @kazdy
- Register object store with datafusion (#107) by @abyssnlp
- Handle validating table when `DropsPartitionFields` not present (#142) by @xushiyan
- Make partition loading more efficient (#152) by @xushiyan
- Simplify partition filter format by taking tuple of strings (#170)
- Improve api to get file slices splits (#185)
- Handle schema retrieval for datafusion api (#187)
- Extract common test code for creating table (#117) by @gohalo
- Improve APIs for handling options (#161) by @xushiyan
- Improve `TableBuilder` API for taking single option (#171) by @xushiyan
- Minor improvement to fix coverage report status (#173) by @xushiyan
- Update readme logo and example (#65) by @xushiyan
- Update in-code comments (#132) by @KnightChess
- Add hudi core API docs with examples (#113) by @KnightChess
- Add in-code docs to hudi-core APIs (#166) by @xushiyan
- Add python binding docstrings (#169) by @kazdy
- Add step-by-step release guide (#66) by @xushiyan
- Enforce Python code style (#101) by @muyihao
- Use exact versions for arrow and datafusion (#105) by @xushiyan
- Bump up datafusion to version 41, arrow to 52.2 (#120) by @yjshen
- (deps) Update zip-extract requirement from 0.1.3 to 0.2.1 (#130) by @dependabot[bot]
- (deps) Upgrade datafusion, pyarrow, pyo3, python versions (#149) by @kazdy
- (deps) Upgrade arrow dependencies (#168) by @kazdy
- (release) Bump version to 0.2.0-rc.1
- (deps) Upgrade datafusion and object store (#182)
- (deps) Upgrade datafusion to 42.2.0 (#192)
- (release) Bump version to 0.2.0-rc.2
- Improve release scripts (#68) by @xushiyan
- Add `CHANGELOG.md` with git-cliff config (#69) by @xushiyan
- Configure labeler for PRs from forked repos (#83) by @xushiyan
- Fix labeler config (#85) by @xushiyan
- Fix labeler config for dev-x (#87) by @xushiyan
- Merge python code coverage report with rust (#67) by @xushiyan
- Add pull request template (#89) by @xushiyan
- Enable dependabot (#94) by @xushiyan
- Add path ignore files for ci workflow (#93) by @abyssnlp
- Improve workflows for code checking and PR (#110) by @xushiyan
- Disable labeler due to permission and policy (#115) by @xushiyan
- (ci) Fix PR title linting to support change scope (#138) by @kazdy
- Add feature request template for GH issues (#167) by @kazdy
- @KnightChess made their first contribution in #119
- @gohalo made their first contribution in #117
- @zzhpro made their first contribution in #118
- @yjshen made their first contribution in #120
- @abyssnlp made their first contribution in #107
- @muyihao made their first contribution in #101
- Initial rust implementation to integrate with datafusion (#1) by @xushiyan
- Add python binding (#21) by @xushiyan
- Implement `HudiTable` as python API (#23) by @xushiyan
- Use `object_store` for common storage APIs (#25) by @xushiyan
- Implement Rust and Python APIs to read file slices (#28) by @xushiyan
- Add APIs for time-travel read (#33) by @xushiyan
- Implement datafusion API using ParquetExec (#35) by @xushiyan
- Add `HudiConfigs` for parsing and managing named configs (#37) by @xushiyan
- Add config validation when creating table (#49) by @xushiyan
- Add internal config to skip validation (#51) by @xushiyan
- Support time travel with read option (#52) by @xushiyan
- Support taking env vars for cloud storages (#55) by @xushiyan
- Handle replacecommit for loading file slices (#53) by @xushiyan
- Use `anyhow` for generic errors (#26) by @xushiyan
- Use `object_store` API for Timeline (#27) by @xushiyan
- Make APIs async (#31) by @xushiyan
- Improve thread safety and error handling (#32) by @xushiyan
- Improve error handling in storage module (#34) by @xushiyan
- Adjust table APIs to skip passing options (#56) by @xushiyan
- Update readme, contributing guide, and issue template (#57) by @xushiyan
- Update CONTRIBUTING with minor changes (#58) by @codope
- Enforce rust code style (#14) by @xushiyan
- Clean up and trim down dependencies (#54) by @xushiyan
- Add info for rust and python artifacts (#60) by @xushiyan
- Add release workflow (#63) by @xushiyan
- Add tests crate and adopt testing tables (#30) by @xushiyan
- Add test cases for different table setup (#36) by @xushiyan
- Setup ci for license file and headers (#2) by @xushiyan
- Fix failing check and test case (#10) by @xushiyan
- Fix asf notification (#11) by @xushiyan
- Add commit linting (#12) by @xushiyan
- Use cargo tarpaulin to generate code coverage (#15) by @xushiyan
- Remove codecov to keep ci green (#17) by @xushiyan
- Fix codecov setup (#20) by @xushiyan
- Configure codecov (#50) by @xushiyan
- Add scripts to streamline source release (#64) by @xushiyan
- @codope made their first contribution in #58
- @xushiyan made their first contribution in #1