1. 34e9c7c test(schema): Add MOR log-only compaction tests for custom types (#18583) by voonhous · 20 hours ago master
  2. 63f721d fix: Fix reflection ctor signature for AwsGlueCatalogSyncTool in HiveSyncContext (#18697) by Matthew · 21 hours ago
  3. 87019a3 fix(hive): Tolerate pruned ArrayWritable in nested BLOB projection (#18581) by voonhous · 22 hours ago
  4. 4029560 feat(flink): Backport Flink 2.1 nested Parquet column readers and INT64 timestamp dispatch (FLINK-35702) (#18636) by Shihuan Liu · 25 hours ago
  5. c36a5f7 fix(flink): Avoid emitting deletes for Flink source v2 batch reads (#18694) by Shuo Cheng · 31 hours ago
  6. 91f341f fix: filter EXTERNAL property in SparkCatalogMetaStoreClient.toCatalogTable (#18672) by Prashant Wason · 3 days ago
  7. 471bb48 refactor: move checkpoint metadata lookup helper to hudi-common (#18489) by Surya Prasanna · 3 days ago
  8. 127c6ee feat(common): roll over commit metadata to clean (#18590) by Krishen · 3 days ago
  9. 4d0e9cd fix(lance): prevent file splitting for Lance base files to avoid duplicate reads (#18678) by Rahil C · 5 days ago
  10. cde0e39 fix(ci): bump Maven heap to 8g to fix OOM in CI builds (#18618) by Hudi Agent · 5 days ago
  11. 695294c fix: Honor SparkSession overrides for rebase mode and timezone in compaction tasks (#18675) by Y Ethan Guo · 6 days ago
  12. f7508de test(spark): Add date logical type test to TestAvroConversionUtils (#18584) by ashokkumar-allu · 7 days ago
  13. f8d70cb feat: Introduce a Spark procedure to trigger LSM timeline compaction (#18659) by chaoyang · 7 days ago
  14. 39797b4 docs: claim RFC-104 for schema evolution unification (#18660) by voonhous · 7 days ago
  15. 38db5ed chore: Add context7.json with URL and public key (#18662) by vinoth chandar · 8 days ago
  16. 0a28d69 chore(docker): reduce base_java17 and spark_base image size (#18542) by kartikeyaagrawal · 8 days ago
  17. 7443856 fix: make SparkCatalogMetaStoreClient.setMetaConf a no-op (#18652) by Prashant Wason · 8 days ago
  18. 7b662e8 fix: filter spark.sql.* properties in SparkCatalogMetaStoreClient.toCatalogTable (#18654) by Prashant Wason · 8 days ago
  19. 8d348cc feat(examples): Add Hudi Unstructed Demo env (#18643) by Rahil C · 9 days ago
  20. 426cbb8 chore(deps): Pin AWS v1 SDK BOM to short-circuit transitive version-range walk (#18619) by voonhous · 9 days ago
  21. fd63851 feat(flink): extend Flink quickstart example to use source v2 (#18518) by Peter Huang · 9 days ago
  22. 7c2c56e fix(schema): Handle BLOB and VARIANT in Hive-reader rewriteRecordWithNewSchema (#18580) by voonhous · 9 days ago
  23. 642e1d3 chore(release): Moving to 1.3.0-SNAPSHOT on master branch (#18620) by Y Ethan Guo · 10 days ago
  24. eed9aa7 feat: Add variant support description to RFC-99 (#18274) by voonhous · 10 days ago
  25. 1ededfd feat(common): When inferring checkpoint/schema from timeline, check non-ingestion write commits (in case they have metadata rolled-over) (#18576) by Krishen · 10 days ago
  26. 5c73bc0 feat(lance): fix lance writer/reader regarding arrow memory limit issue (#18613) by Rahil C · 10 days ago
  27. 4bdcdf9b feat: Create JsonKinesisSource (#18224) by Lin Liu · 11 days ago
  28. 7ae0fd9 fix(schema): Allow nested projection on BLOB and VARIANT columns in pruneDataSchema (#18566) by voonhous · 11 days ago
  29. 782552a fix: Curator class conflict in ZookeeperBasedLockProvider (#18593) by yuqi · 11 days ago
  30. 20a0105 feat(spark): add Spark 4.1 support (#17674) by Y Ethan Guo · 11 days ago
  31. fdf27db fix(vector): Preserve VECTOR/BLOB metadata on SQL INSERT path (#18540) by voonhous · 11 days ago
  32. a8a6917 feat(flink): Vendor Flink 2.1 Dremel nested-reader support classes (#18567) by Shihuan Liu · 11 days ago
  33. 29f9c40 feat(ci): enable auto-merge and require all GitHub Actions checks on master (#18594) by Y Ethan Guo · 12 days ago
  34. 787953f feat(blob): add support for lance blob inline descriptor reading (#18586) by Rahil C · 12 days ago
  35. 8530644 feat(lance): support simplified path for lance blob inline reading (#18575) by Rahil C · 12 days ago
  36. 4f3e885 fix(ci): bump surefire test heap from 3g to 4g (#18589) by Y Ethan Guo · 13 days ago
  37. c1569db fix(clean): address review comments on empty clean support (#18587) by Y Ethan Guo · 13 days ago
  38. 436bd66 fix(vector): Pass plain FIXED through to VECTOR projection on Hive read (#18582) by voonhous · 13 days ago
  39. 2059c11 feat(clean): Adding empty clean support to hudi (#18337) by Sivabalan Narayanan · 14 days ago
  40. 217e2a7 feat: Adding support to inject custom configs to parquet writer (#18379) by Sivabalan Narayanan · 14 days ago
  41. edaa168 fix: FileGroupReader drops mandatory partition columns from dataSchema (#18570) by tiennguyen-onehouse · 2 weeks ago
  42. 2092890 fix: ProtoConversionUtil$AvroSupport static init under Avro 1.12 (#18571) by tiennguyen-onehouse · 2 weeks ago
  43. 110b9be7 fix(variant): allow VariantType writes through Hudi's V1 DataSource on Spark 4 (#18564) by voonhous · 2 weeks ago
  44. 9d1f817 fix(vector): Register VECTOR HMS column as BINARY on Spark CREATE (#18545) by voonhous · 2 weeks ago
  45. 1e64662 feat(lance): round-trip Hudi VECTOR columns as native Lance fixed-size lists (#18497) by Rahil C · 2 weeks ago
  46. ace2871 feat(flink): Introduces dictionary encoding of payload partition path for RocksDBIndexBackend (#18560) by Shuo Cheng · 2 weeks ago
  47. 7f4dd31 fix(lance): Add Hive InputFormat stubs and fix Spark SQL for Lance file format (#18162) by Rahil C · 2 weeks ago
  48. ddbdbb9 chore(spark): bump spark4.version to 4.0.2 (#18549) by voonhous · 2 weeks ago
  49. 8623898 fix: JDBC connection leak in HiveIncrementalPuller.saveDelta() (#18460) by Venkateswarlu Boggavarapu · 2 weeks ago
  50. 4260914 fix: Parquet small-precision decimals decode ClassCastException (#18552) by Shihuan Liu · 2 weeks ago
  51. cd83cf4 chore(docker): add Hadoop 3.4.0 / Hive 2.3.10 / Spark 4.0.2 compose stack (#18550) by voonhous · 2 weeks ago
  52. 4ef56e4 feat(blob): followup fixes for blob reader (#18538) by Rahil C · 2 weeks ago
  53. e4904ba feat: Add support for exclusive rollbacks with multi writer (#18448) by Lokesh Jain · 2 weeks ago
  54. 0d57435 fix: VARIANT Hive sync error when performing CREATE table DDL (#18511) by voonhous · 2 weeks ago
  55. e303579 fix: prevent parseTypeDescriptor crash for VARIANT (#18510) by voonhous · 2 weeks ago
  56. f9dead0 feat: Adding support to block archival on last known ECTR for v6 tables (#18380) by Sivabalan Narayanan · 2 weeks ago
  57. 59fee58 feat(lance): Bump lance to 4.0.0 and lance-spark to 0.4.0 (#18498) by Rahil C · 2 weeks ago
  58. a834736 feat(vector): Add Spark SQL DDL CREATE TABLE support for VECTOR type (#18488) by voonhous · 2 weeks ago
  59. 76a0a27 perf(common): Avoid double-iterating log files in file-system-view filters (#18531) by voonhous · 2 weeks ago
  60. adf29ac fix: HoodieStorage resource leak in FileSystemBasedLockProvider.close() (#18461) by mailtoboggavarapu-coder · 2 weeks ago
  61. f35b69c feat(blob): Read Blobs in Spark SQL (#18098) by Tim Brown · 2 weeks ago
  62. 0fb4454 feat(utilities): add external HudiHiveSyncJob for on-demand Hive sync (#18204) by Surya Prasanna · 3 weeks ago
  63. 3a387da feat(flink): Implement continuous sorting feature for append write (#18083) by Prashant Wason · 3 weeks ago
  64. 95199f0 perf(common): avoid stream allocation in CollectionUtils.createImmutableList (#18530) by voonhous · 3 weeks ago
  65. 91dba3e chore(common): Consolidate MapUtils into CollectionUtils (#18529) by voonhous · 3 weeks ago
  66. cfb9833 chore(docker): Remove duplicate yarn.nodemanager.bind-host in entrypoint.sh (#18527) by voonhous · 3 weeks ago
  67. 937a64a fix: whitelist Flink _2.12 artifacts in scala-2.13 enforcer rule (#18508) by voonhous · 3 weeks ago
  68. 55bf91a introduce static helper method to remove clones (#18533) by aaaZayne · 3 weeks ago
  69. eaaae8a feat: Add Azure-based storage lock (#17951) by chrevanthreddy · 3 weeks ago
  70. 3d0ab80 chore(docker): bump integ-test docker-compose to Hive 2.3.10 (#18525) by voonhous · 3 weeks ago
  71. 9ddf582 chore: add timing logs for file index partition and file listing (#18417) by Surya Prasanna · 3 weeks ago
  72. 97f9628 feat(docker): add --multi-arch flag for cross-platform image builds (#18522) by Y Ethan Guo · 3 weeks ago
  73. 7bcb8be feat(metadata): Allow users to safely execute compaction plans on metadata table concurrently through a table service platform (rather than only inline during write) (#18295) by Krishen · 3 weeks ago
  74. 41cfc19 chore: Add Java 17 Hadoop base image and Spark 4.0.1 docker compose setup (#18520) by voonhous · 3 weeks ago
  75. 0356488 fix(docker): fix docker image build with Java 11 and Hive 2.3.10 (#18519) by Y Ethan Guo · 3 weeks ago
  76. a369773 chore: cleanup docker-compose files (#17950) by voonhous · 3 weeks ago
  77. c1af4f5 fix: use forward slash literal and remove unused import in DFSPropertiesConfiguration (#18454) by mailtoboggavarapu-coder · 3 weeks ago
  78. 2b33f5d fix: fix BufferedReader resource leak in FileIOUtils.readAsUTFStringLines (#18470) by mailtoboggavarapu-coder · 3 weeks ago
  79. a649188 feat(spark): refresh parquet tools clustering strategy for current master (#18409) by Surya Prasanna · 3 weeks ago
  80. f144abc feat: Include ReverseOrderHoodieRecordPayload (#17928) by Surya Prasanna · 3 weeks ago
  81. 5b68607 fix: fix Scanner file handle leak in HiveIncrementalPuller.executeIncrementalSQL (#18457) by mailtoboggavarapu-coder · 3 weeks ago
  82. cad1530 fix: Scanner resource leak in SqlFileBasedSource.fetchNextBatch (#18467) by mailtoboggavarapu-coder · 3 weeks ago
  83. 5066fcc feat: Adding rolling extra metadata support (#18421) by Sivabalan Narayanan · 3 weeks ago
  84. d3e0201 fix(common): FutureUtils:allOf should always throw root cause exception (#18456) by Krishen · 3 weeks ago
  85. 12b3a06 feat: Support to cap max commits to clean in one round of clean commit (#18322) by Sivabalan Narayanan · 3 weeks ago
  86. 98f90c1 fix(payload): support sentinel no-op updates in DefaultHoodieRecordPayload (#18413) by Surya Prasanna · 3 weeks ago
  87. 4b15e50 feat(sync): Map VARIANT type to struct in Hive, Spark, and BigQuery sync (#18483) by voonhous · 3 weeks ago
  88. 8cd2648 chore: Allow versions to be specified in build_docker_images.sh (#17948) by voonhous · 3 weeks ago
  89. da2667c feat(sync): Map BLOB type to struct in Hive and BigQuery sync (#18482) by voonhous · 3 weeks ago
  90. 0bdab84 feat(flink): Add metrics for RocksDB index backend in bucket assigner (#18484) by Shuo Cheng · 3 weeks ago
  91. 613fc49 feat(common): add log reader scan metrics and logging for log block processing (#18412) by Surya Prasanna · 3 weeks ago
  92. fc7f303 chore(deps): bump org.apache.logging.log4j:log4j-core (#18490) by dependabot[bot] · 4 weeks ago
  93. 00a4066 feat(sync): Map VECTOR type to binary for metastore sync support (#18480) by voonhous · 4 weeks ago
  94. 88c146e chore(ci): Clean up env variable leak in TestSqlConf (#18486) by Geser Dugarov · 4 weeks ago
  95. c83ae87 fix(flink): Handle bootstrap write metadata correctly after job resca… (#18485) by Shuo Cheng · 4 weeks ago
  96. 8f153b2 perf(core): optimize rollback listing calls on metadata table (#18279) by Balajee Nagasubramaniam · 4 weeks ago
  97. 5d61a35 refactor(flink): Refactor Flink compaction/clean pipeline with composite table service handlers (#18477) by Shuo Cheng · 4 weeks ago
  98. eaa9c8b fix: fix BufferedReader resource leak in InputStreamConsumer (#18469) by mailtoboggavarapu-coder · 4 weeks ago
  99. 6310d70 fix: avoid duplicate archived timeline instants from leftover merge files (#18408) by Surya Prasanna · 4 weeks ago
  100. f063aa5 fix(hfile): use Hadoop WritableUtils VarInt encoding in HFile block index writer (#18465) by Asish Kumar · 4 weeks ago