Changelog

All notable changes to this project will be documented in this file.

[0.2.0] - 2024-11-25

🚀 Features

  • Support loading hudi global configs (#118) by @zzhpro
  • Add base file records' in-memory size to FileStats (#140) by @xushiyan
  • Support partition prune API (#119) by @KnightChess
  • Add partition filter arg in Python APIs (#153) by @xushiyan
  • Add HudiFileGroupReader with consolidated APIs to read records (#164) by @xushiyan
  • Add TableBuilder API for creating Table instances (#163) by @kazdy
  • Implement datafusion TableProviderFactory (#162) by @kazdy

🐛 Bug Fixes

  • Register object store with datafusion (#107) by @abyssnlp
  • Handle validating table when DropsPartitionFields not present (#142) by @xushiyan
  • Make partition loading more efficient (#152) by @xushiyan
  • Simplify partition filter format by taking tuple of strings (#170)
  • Improve API to get file slice splits (#185)
  • Handle schema retrieval for datafusion API (#187)

🚜 Refactor

  • Extract common test code for creating table (#117) by @gohalo
  • Improve APIs for handling options (#161) by @xushiyan
  • Improve TableBuilder API for taking single option (#171) by @xushiyan
  • Minor improvement to fix coverage report status (#173) by @xushiyan

📚 Documentation

  • Update readme logo and example (#65) by @xushiyan
  • Update in-code comments (#132) by @KnightChess
  • Add hudi core API docs with examples (#113) by @KnightChess
  • Add in-code docs to hudi-core APIs (#166) by @xushiyan
  • Add python binding docstrings (#169) by @kazdy
  • Add step-by-step release guide (#66) by @xushiyan

🎨 Styling

  • Enforce Python code style (#101) by @muyihao

🛠️ Build

  • Use exact versions for arrow and datafusion (#105) by @xushiyan
  • Bump up datafusion to version 41, arrow to 52.2 (#120) by @yjshen
  • (deps) Update zip-extract requirement from 0.1.3 to 0.2.1 (#130) by @dependabot[bot]
  • (deps) Upgrade datafusion, pyarrow, pyo3, python versions (#149) by @kazdy
  • (deps) Upgrade arrow dependencies (#168) by @kazdy
  • (release) Bump version to 0.2.0-rc.1
  • (deps) Upgrade datafusion and object store (#182)
  • (deps) Upgrade datafusion to 42.2.0 (#192)
  • (release) Bump version to 0.2.0-rc.2

⚙️ Miscellaneous Tasks

  • Improve release scripts (#68) by @xushiyan
  • Add CHANGELOG.md with git-cliff config (#69) by @xushiyan
  • Configure labeler for PRs from forked repos (#83) by @xushiyan
  • Fix labeler config (#85) by @xushiyan
  • Fix labeler config for dev-x (#87) by @xushiyan
  • Merge python code coverage report with rust (#67) by @xushiyan
  • Add pull request template (#89) by @xushiyan
  • Enable dependabot (#94) by @xushiyan
  • Add path ignore files for ci workflow (#93) by @abyssnlp
  • Improve workflows for code checking and PR (#110) by @xushiyan
  • Disable labeler due to permission and policy (#115) by @xushiyan
  • (ci) Fix PR title linting to support change scope (#138) by @kazdy
  • Add feature request template for GH issues (#167) by @kazdy

New Contributors

  • @KnightChess made their first contribution in #119
  • @gohalo made their first contribution in #117
  • @zzhpro made their first contribution in #118
  • @yjshen made their first contribution in #120
  • @abyssnlp made their first contribution in #107
  • @muyihao made their first contribution in #101

[0.1.0] - 2024-07-15

🚀 Features

  • Initial rust implementation to integrate with datafusion (#1) by @xushiyan
  • Add python binding (#21) by @xushiyan
  • Implement HudiTable as python API (#23) by @xushiyan
  • Use object_store for common storage APIs (#25) by @xushiyan
  • Implement Rust and Python APIs to read file slices (#28) by @xushiyan
  • Add APIs for time-travel read (#33) by @xushiyan
  • Implement datafusion API using ParquetExec (#35) by @xushiyan
  • Add HudiConfigs for parsing and managing named configs (#37) by @xushiyan
  • Add config validation when creating table (#49) by @xushiyan
  • Add internal config to skip validation (#51) by @xushiyan
  • Support time travel with read option (#52) by @xushiyan
  • Support taking env vars for cloud storages (#55) by @xushiyan

🐛 Bug Fixes

  • Handle replacecommit for loading file slices (#53) by @xushiyan

🚜 Refactor

  • Use anyhow for generic errors (#26) by @xushiyan
  • Use object_store API for Timeline (#27) by @xushiyan
  • Make APIs async (#31) by @xushiyan
  • Improve thread safety and error handling (#32) by @xushiyan
  • Improve error handling in storage module (#34) by @xushiyan
  • Adjust table APIs to skip passing options (#56) by @xushiyan

📚 Documentation

  • Update readme, contributing guide, and issue template (#57) by @xushiyan
  • Update CONTRIBUTING with minor changes (#58) by @codope

🎨 Styling

  • Enforce rust code style (#14) by @xushiyan

🛠️ Build

  • Clean up and trim down dependencies (#54) by @xushiyan
  • Add info for rust and python artifacts (#60) by @xushiyan
  • Add release workflow (#63) by @xushiyan

🧪 Testing

  • Add tests crate and adopt testing tables (#30) by @xushiyan
  • Add test cases for different table setup (#36) by @xushiyan

⚙️ Miscellaneous Tasks

  • Setup ci for license file and headers (#2) by @xushiyan
  • Fix failing check and test case (#10) by @xushiyan
  • Fix asf notification (#11) by @xushiyan
  • Add commit linting (#12) by @xushiyan
  • Use cargo tarpaulin to generate code coverage (#15) by @xushiyan
  • Remove codecov to keep ci green (#17) by @xushiyan
  • Fix codecov setup (#20) by @xushiyan
  • Configure codecov (#50) by @xushiyan
  • Add scripts to streamline source release (#64) by @xushiyan

New Contributors

  • @codope made their first contribution in #58
  • @xushiyan made their first contribution in #1