1. 182428f fix(spark): catch HoodieSchemaNotFoundException in 3-arg DefaultSource.createRelation (#18977) by Lokesh Jain · 7 hours ago master
  2. 173e76e [HUDI] Add input records caching to HoodieGlobalSimpleIndex (#18921) by Lokesh Jain · 25 hours ago
  3. 91f3d22 perf(clean): Avoid extra getPathInfo RPC per file during clean execution (#18963) by voonhous · 25 hours ago
  4. 84c1f59 docs: RFC-106 - Record Level and Secondary Index Support for Flink Writers (#17610) by Danny Chan · 26 hours ago
  5. 466ae49 chore: clean and fix the rfc readme (#18970) by Danny Chan · 27 hours ago
  6. 7f3676b feat(flink): Backport Flink 2.1 Dremel nested Parquet reader to hudi-… (#18959) by Shuo Cheng · 28 hours ago
  7. f1c6fe4 feat(flink): Backport Flink 2.1 Dremel nested Parquet reader to hudi-flink2.1.x (#18960) by Shuo Cheng · 28 hours ago
  8. 17b396e feat(mdt): Support writing bloom filter for hfile log block (#18936) by Shuo Cheng · 29 hours ago
  9. 21121ac fix(flink): Close write client properly in DefaultCleanHandler (#18940) by Shuo Cheng · 29 hours ago
  10. 8933224 fix(clustering): retain missing partitions in selected/regex incremental scheduling (#18945) by fhan · 2 days ago
  11. 86d1650 feat(flink): Backport Flink 2.1 Dremel nested Parquet reader to hudi-flink1.20 (#18953) by Shuo Cheng · 2 days ago
  12. 1fd2c367 fix(metrics): NPE handling when hudi metrics is disabled (#18947) by Peter Huang · 3 days ago
  13. 348e7f1 feat(flink): Add validation to reject multiple writers for flink RLI writes (#18946) by Shuo Cheng · 3 days ago
  14. 782bf65 fix(common): Close log writer output stream on append failure (#18909) by fhan · 3 days ago
  15. eb863d8 chore: drop support for Flink 1.17 (#18920) by Danny Chan · 3 days ago
  16. f4816ce fix(flink): avoid repeated timeline reload for unchanged lookup table commits (#18930) by fhan · 3 days ago
  17. aac975c feat(flink): Support reading VECTOR columns from Parquet and Avro format (#18723) by Shuo Cheng · 4 days ago
  18. c5e3cd5 chore: remove the CI validation for Flink1.17 (#18928) by Danny Chan · 4 days ago
  19. 9ca1867 chore: [MINOR] Update DOAP with 0.14.2 Release (#18924) by Danny Chan · 6 days ago
  20. 77f5851 feat(spark): add Spark 4.2 support (#18621) by Y Ethan Guo · 6 days ago
  21. 5182bf8 feat(flink): support remote partitioner for simple bucket index (#18897) by fhan · 7 days ago
  22. 091caad feat: add lance format for Flink MOR table (#18911) by Danny Chan · 7 days ago
  23. 6f81c51 fix(flink): Use the execution mode without rocksdb cache by default … (#18894) by Shuo Cheng · 7 days ago
  24. 9cddb23 fix(streamer): Use checkpoint V1 for non-incremental streamer sources (#18896) by Y Ethan Guo · 7 days ago
  25. cb14ca7 refactor(core): Unify record key/index key splitting and extraction (#18842) by Shuo Cheng · 8 days ago
  26. c40f765 chore(docker): silence Dockerfile lint warnings across image set (#18664) by voonhous · 8 days ago
  27. 95be210 docs(examples): pin blob.inline.mode=CONTENT after Lance default flip (#18823) by Rahil C · 9 days ago
  28. 163a15a refactor: Add Lombok Builder annotation to HoodieLogFormat (#17785) by voonhous · 9 days ago
  29. 438bbdf refactor: Add Lombok annotations to hudi-common module (part 3) (#17825) by voonhous · 9 days ago
  30. 508e295 refactor: Add Lombok annotations to hudi-utilities (Part 3) (#17877) by voonhous · 9 days ago
  31. 8ac3092 fix(ci): Flink version corresponds to used in Docker image, resolution for Everit JSON schema (#18905) by Geser Dugarov · 9 days ago
  32. ba8c4c7 fix(spark): align CTAS partition fields by table partition order (#18899) by fhan · 9 days ago
  33. b7adecc feat(schema): Config path implemented for spark record type (#18062) by voonhous · 10 days ago
  34. 7964202 refactor: Add Lombok Builders (#17781) by voonhous · 10 days ago
  35. 095e80a refactor: Add Lombok annotations to hudi-utilities (Part 1) (#17823) by voonhous · 10 days ago
  36. af38b88 feat(flink): add shard assign metrics for bucket assign function (#18790) by Peter Huang · 10 days ago
  37. ed9ea0e fix(flink): Trigger a failover after pending instants recommitted for both global and partitioned RLI (#18793) by Shuo Cheng · 10 days ago
  38. 8259182 fix: Fix NPE due to race condition while handling rocksdb handles (#18834) by Lokesh Jain · 10 days ago
  39. 11f0e7c feat(spark): add input records & bytes metrics (#18882) by fhan · 11 days ago
  40. 8490c96 feat(utilities): migrate HoodieStreamerWriteStatusValidator into pre-commit validator framework (#18765) by Xinli Shang · 11 days ago
  41. 4fdac3d fix(spark): fix MOR bulk insert commit operation error (#18878) by fhan · 11 days ago
  42. 4e5034d fix(flink): fix data loss in stream read from earliest (#18848) by fhan · 11 days ago
  43. 314ffaf feat: add lance format support for Flink COW table (#18862) by Danny Chan · 11 days ago
  44. da18995 fix(flink): set canonical base path in Hive sync config (#18884) by fhan · 11 days ago
  45. 65562af feat: Enable import order style check and error on violations (#18886) by wangyinsheng · 11 days ago
  46. 6e40cff fix(hudi-sync): Fix Hive test temp directory cleanup with JUnit TempDir (#18883) by fhan · 11 days ago
  47. fc85e3e fix(flink): remove pre-seed in metrics in GlobalRecordLevelIndexBackend and RecordLevelIndexBackend (#18875) by Shihuan Liu · 2 weeks ago
  48. 7af8cdf perf(streamer): fold validate() error-table WriteStatus sums into one pass (#18871) by Davis-Zhang-Onehouse · 2 weeks ago
  49. 4290550 chore: Pin third-party GitHub Actions to commit SHAs (#18872) by Arpit Jain · 2 weeks ago
  50. ff72186 fix(cli): Fix the typo in show-inflight CLI command (#18868) by Y Ethan Guo · 2 weeks ago
  51. 588998f refactor(flink): Refactor RowData writer factory to use HoodieSchema (#18873) by Shuo Cheng · 2 weeks ago
  52. 853cbef feat(vector_search): Implement functionality for pre-filters and maxD… (#18797) by Rahil C · 2 weeks ago
  53. d5c6b4d feat(flink): Add metrics for remote RLI lookup time in BucketAssign op (#18813) by Shihuan Liu · 2 weeks ago
  54. 1bf6b44 feat(flink): add lance format for Flink append only table (#18741) by Danny Chan · 2 weeks ago
  55. e5ae111 feat(trino): [RFC-105] Trino Hudi Connector — Shim/Bundle Refactor (#18782) by Y Ethan Guo · 2 weeks ago
  56. ce29423 docs: Update DOAP file to include 0.15.1 (#18838) by voonhous · 2 weeks ago
  57. b8cab71 fix(ci): make required status checks work for PRs (#18598) by Y Ethan Guo · 2 weeks ago
  58. 516d9e2 [MINOR] Handle cancellation error with HoodieMetadataTableValidator (#18371) by Lokesh Jain · 2 weeks ago
  59. 76ddebc chore: Fix Flink CI Maven profile arguments (#18845) by Shuo Cheng · 2 weeks ago
  60. 652d952 fix: RLI bootstrap fails due to NPE with cleaner table service (#18836) by Lokesh Jain · 2 weeks ago
  61. b5c5801 fix(spark): Add options for archive procedure (#18437) by fhan · 2 weeks ago
  62. 62ad88a chore: Add release information for Apache Hudi 1.2.0 (#18831) by Y Ethan Guo · 2 weeks ago
  63. 7e5b535 chore: Fix stale zip file for variant backward compatibility test (#18815) by Shuo Cheng · 2 weeks ago
  64. 06dc09b feat(flink): Backport Flink 2.1 Dremel nested Parquet reader rewrite to hudi-flink1.19.x (FLINK-35702) (#18809) by Shihuan Liu · 2 weeks ago
  65. e2e5598 chore: remove outdated required status checks in asf.yaml (#18832) by Y Ethan Guo · 2 weeks ago
  66. e299b84 chore: fix asf.yaml invalid key (#18808) by Shawn Chang · 3 weeks ago
  67. facb517 feat(spark): Drop Apache Spark 3.3 integration support (#18786) by Y Ethan Guo · 3 weeks ago
  68. b82a5b2 feat: add metrics for bucketassign.minibatch cache hit ratio (#18761) by Yao Li · 3 weeks ago
  69. 50eb95c fix(flink): fix disable table service not effective in hudi-flink (#13875) by fhan · 3 weeks ago
  70. a687786 chore: add spark4.1 and flink2.1 profile entries to RC bundle validation (#18796) by Y Ethan Guo · 3 weeks ago
  71. 9565926 feat(blob): default blob.inline.mode to DESCRIPTOR for Lance (#18744) by voonhous · 3 weeks ago
  72. 12e87b8 fix: Fix dangling lock from storage based lock (#18439) by Lin Liu · 3 weeks ago
  73. 0138f41 perf: Improve global index performance for commit time ordering (#17797) by Y Ethan Guo · 3 weeks ago
  74. d78b702 fix(spark): use HoodieStorageUtils factory in Spark 4.1 legacy parquet read (#18785) by Y Ethan Guo · 3 weeks ago
  75. 3abe066 feat(common): per-partition event-time rollup and decouple watermark tracking from EVENT_TIME_ORDERING (#18778) by Xinli Shang · 3 weeks ago
  76. 990fc29 feat(flink): add metrics for RLI load time for bucket assign functions (#18762) by Peter Huang · 3 weeks ago
  77. f044d3d fix(spark): handle Avro 1.12 logical type values in Spark 4.1 read path (#18773) by Y Ethan Guo · 3 weeks ago
  78. 4ac661a fix: Fix SQL syntax parser for CREATE TABLE on Spark 4.1 (#18779) by Rahil C · 3 weeks ago
  79. 89f49b1 fix: Skip pre-compaction rollback metadata reads in getValidInstantTimestamps (#18544) by Y Ethan Guo · 3 weeks ago
  80. 0b80239 fix: Disable column stats and partition stats indices for Lance base files (#18588) by Y Ethan Guo · 3 weeks ago
  81. 642d88c [MINOR] Reserve RFC-105 for Trino Hudi Connector — Shim/Bundle Refactor (#18781) by Y Ethan Guo · 3 weeks ago
  82. f67dc26 fix: Enhance hudi-azure-bundle (#18472) by Lin Liu · 3 weeks ago
  83. 58957a7 fix(aws): implement writer-version update in Glue sync client (#18707) by deepakpanda93 · 3 weeks ago
  84. a5d8fe1 test(azure): skip ITAzureStorageLockClientAzurite when MCR image pull fails (#18772) by Xinli Shang · 3 weeks ago
  85. 162cac2 fix(lance): fail fast when write schema contains VARIANT columns (#18775) by Rahil C · 3 weeks ago
  86. d94b2e2 feat(spark): add show_inflight_commits and cleanup_stale_inflight_com… (#18709) by Mahsood Ebrahim · 3 weeks ago
  87. e406e5d feat(spark): add restore_to_instant stored procedure (#18696) by Mahsood Ebrahim · 3 weeks ago
  88. 71aa121 fix(flink): enforce Parquet VARIANT annotation in Flink schema conversion for unshredded variant (#18539) by Krishen · 3 weeks ago
  89. 9026c7d test(trino): de-flake TestHudi*FileOperations by polling for span stability (#18766) by Xinli Shang · 3 weeks ago
  90. 0983c0d chore: Harden workflow against command injection in PR title validation (#18771) by oryn · 3 weeks ago
  91. 055aa41 fix(lance): Support Lance file format on Spark 4.1 (#18760) by voonhous · 4 weeks ago
  92. db46081 fix(flink): Fix minibatch lookup for global record level index (#18759) by Shuo Cheng · 4 weeks ago
  93. cd2c8b8 [MINOR] Fix typos in comments and assertion messages (#18763) by Xinli Shang · 4 weeks ago
  94. b934633 feat(utilities): add Spark/HoodieStreamer validators for pre-commit validation - Phase 3 (#18405) by Xinli Shang · 4 weeks ago
  95. 780595b fix(flink): fix the write handle close for append write (#18756) by Danny Chan · 4 weeks ago
  96. 4035f70 chore: Fix license for copy_staging_repo.sh (#18753) by Y Ethan Guo · 4 weeks ago
  97. ae9866a fix: Enable schema merging for incremental and dfs sources (#18385) by Lin Liu · 4 weeks ago
  98. 6e32f36 fix: Follow-ups to JsonKinesisSource: numeric sequence comparison and call-site fixes (#18689) by Lin Liu · 4 weeks ago
  99. dca76ca fix: Support data pruning using nested partition columns (#18126) by Lin Liu · 4 weeks ago
  100. 3d32596 fix(docker): tag base image per Java version to avoid latest collision (#18663) by voonhous · 4 weeks ago