apache / hudi / HEAD
182428f
fix(spark): catch HoodieSchemaNotFoundException in 3-arg DefaultSource.createRelation (#18977)
by Lokesh Jain
· 7 hours ago
master
173e76e
[HUDI] Add input records caching to HoodieGlobalSimpleIndex (#18921)
by Lokesh Jain
· 25 hours ago
91f3d22
perf(clean): Avoid extra getPathInfo RPC per file during clean execution (#18963)
by voonhous
· 25 hours ago
84c1f59
docs: RFC-106 - Record Level and Secondary Index Support for Flink Writers (#17610)
by Danny Chan
· 26 hours ago
466ae49
chore: clean and fix the rfc readme (#18970)
by Danny Chan
· 27 hours ago
7f3676b
feat(flink): Backport Flink 2.1 Dremel nested Parquet reader to hudi-… (#18959)
by Shuo Cheng
· 28 hours ago
f1c6fe4
feat(flink): Backport Flink 2.1 Dremel nested Parquet reader to hudi-flink2.1.x (#18960)
by Shuo Cheng
· 28 hours ago
17b396e
feat(mdt): Support writing bloom filter for hfile log block (#18936)
by Shuo Cheng
· 29 hours ago
21121ac
fix(flink): Close write client properly in DefaultCleanHandler (#18940)
by Shuo Cheng
· 29 hours ago
8933224
fix(clustering): retain missing partitions in selected/regex incremental scheduling (#18945)
by fhan
· 2 days ago
86d1650
feat(flink): Backport Flink 2.1 Dremel nested Parquet reader to hudi-flink1.20 (#18953)
by Shuo Cheng
· 2 days ago
1fd2c367
fix(metrics): NPE handling when hudi metrics is disabled (#18947)
by Peter Huang
· 3 days ago
348e7f1
feat(flink): Add validation to reject multiple writers for flink RLI writes (#18946)
by Shuo Cheng
· 3 days ago
782bf65
fix(common): Close log writer output stream on append failure (#18909)
by fhan
· 3 days ago
eb863d8
chore: drop support for Flink 1.17 (#18920)
by Danny Chan
· 3 days ago
f4816ce
fix(flink): avoid repeated timeline reload for unchanged lookup table commits (#18930)
by fhan
· 3 days ago
aac975c
feat(flink): Support reading VECTOR columns from Parquet and Avro format (#18723)
by Shuo Cheng
· 4 days ago
c5e3cd5
chore: remove the CI validation for Flink1.17 (#18928)
by Danny Chan
· 4 days ago
9ca1867
chore: [MINOR] Update DOAP with 0.14.2 Release (#18924)
by Danny Chan
· 6 days ago
77f5851
feat(spark): add Spark 4.2 support (#18621)
by Y Ethan Guo
· 6 days ago
5182bf8
feat(flink): support remote partitioner for simple bucket index (#18897)
by fhan
· 7 days ago
091caad
feat: add lance format for Flink MOR table (#18911)
by Danny Chan
· 7 days ago
6f81c51
fix(flink): Use the execution mode without rocksdb cache by default … (#18894)
by Shuo Cheng
· 7 days ago
9cddb23
fix(streamer): Use checkpoint V1 for non-incremental streamer sources (#18896)
by Y Ethan Guo
· 7 days ago
cb14ca7
refactor(core): Unify record key/index key splitting and extraction (#18842)
by Shuo Cheng
· 8 days ago
c40f765
chore(docker): silence Dockerfile lint warnings across image set (#18664)
by voonhous
· 8 days ago
95be210
docs(examples): pin blob.inline.mode=CONTENT after Lance default flip (#18823)
by Rahil C
· 9 days ago
163a15a
refactor: Add Lombok Builder annotation to HoodieLogFormat (#17785)
by voonhous
· 9 days ago
438bbdf
refactor: Add Lombok annotations to hudi-common module (part 3) (#17825)
by voonhous
· 9 days ago
508e295
refactor: Add Lombok annotations to hudi-utilities (Part 3) (#17877)
by voonhous
· 9 days ago
8ac3092
fix(ci): Flink version corresponds to used in Docker image, resolution for Everit JSON schema (#18905)
by Geser Dugarov
· 9 days ago
ba8c4c7
fix(spark): align CTAS partition fields by table partition order (#18899)
by fhan
· 9 days ago
b7adecc
feat(schema): Config path implemented for spark record type (#18062)
by voonhous
· 10 days ago
7964202
refactor: Add Lombok Builders (#17781)
by voonhous
· 10 days ago
095e80a
refactor: Add Lombok annotations to hudi-utilities (Part 1) (#17823)
by voonhous
· 10 days ago
af38b88
feat(flink): add shard assign metrics for bucket assign function (#18790)
by Peter Huang
· 10 days ago
ed9ea0e
fix(flink): Trigger a failover after pending instants recommitted for both global and partitioned RLI (#18793)
by Shuo Cheng
· 10 days ago
8259182
fix: Fix NPE due to race condition while handling rocksdb handles (#18834)
by Lokesh Jain
· 10 days ago
11f0e7c
feat(spark): add input records & bytes metrics (#18882)
by fhan
· 11 days ago
8490c96
feat(utilities): migrate HoodieStreamerWriteStatusValidator into pre-commit validator framework (#18765)
by Xinli Shang
· 11 days ago
4fdac3d
fix(spark): fix MOR bulk insert commit operation error (#18878)
by fhan
· 11 days ago
4e5034d
fix(flink): fix data loss in stream read from earliest (#18848)
by fhan
· 11 days ago
314ffaf
feat: add lance format support for Flink COW table (#18862)
by Danny Chan
· 11 days ago
da18995
fix(flink): set canonical base path in Hive sync config (#18884)
by fhan
· 11 days ago
65562af
feat: Enable import order style check and error on violations (#18886)
by wangyinsheng
· 11 days ago
6e40cff
fix(hudi-sync): Fix Hive test temp directory cleanup with JUnit TempDir (#18883)
by fhan
· 11 days ago
fc85e3e
fix(flink): remove pre-seed in metrics in GlobalRecordLevelIndexBackend and RecordLevelIndexBackend (#18875)
by Shihuan Liu
· 2 weeks ago
7af8cdf
perf(streamer): fold validate() error-table WriteStatus sums into one pass (#18871)
by Davis-Zhang-Onehouse
· 2 weeks ago
4290550
chore: Pin third-party GitHub Actions to commit SHAs (#18872)
by Arpit Jain
· 2 weeks ago
ff72186
fix(cli): Fix the typo in show-inflight CLI command (#18868)
by Y Ethan Guo
· 2 weeks ago
588998f
refactor(flink): Refactor RowData writer factory to use HoodieSchema (#18873)
by Shuo Cheng
· 2 weeks ago
853cbef
feat(vector_search): Implement functionality for pre-filters and maxD… (#18797)
by Rahil C
· 2 weeks ago
d5c6b4d
feat(flink): Add metrics for remote RLI lookup time in BucketAssign op (#18813)
by Shihuan Liu
· 2 weeks ago
1bf6b44
feat(flink): add lance format for Flink append only table (#18741)
by Danny Chan
· 2 weeks ago
e5ae111
feat(trino): [RFC-105] Trino Hudi Connector — Shim/Bundle Refactor (#18782)
by Y Ethan Guo
· 2 weeks ago
ce29423
docs: Update DOAP file to include 0.15.1 (#18838)
by voonhous
· 2 weeks ago
b8cab71
fix(ci): make required status checks work for PRs (#18598)
by Y Ethan Guo
· 2 weeks ago
516d9e2
[MINOR] Handle cancellation error with HoodieMetadataTableValidator (#18371)
by Lokesh Jain
· 2 weeks ago
76ddebc
chore: Fix Flink CI Maven profile arguments (#18845)
by Shuo Cheng
· 2 weeks ago
652d952
fix: RLI bootstrap fails due to NPE with cleaner table service (#18836)
by Lokesh Jain
· 2 weeks ago
b5c5801
fix(spark): Add options for archive procedure (#18437)
by fhan
· 2 weeks ago
62ad88a
chore: Add release information for Apache Hudi 1.2.0 (#18831)
by Y Ethan Guo
· 2 weeks ago
7e5b535
chore: Fix stale zip file for variant backward compatibility test (#18815)
by Shuo Cheng
· 2 weeks ago
06dc09b
feat(flink): Backport Flink 2.1 Dremel nested Parquet reader rewrite to hudi-flink1.19.x (FLINK-35702) (#18809)
by Shihuan Liu
· 2 weeks ago
e2e5598
chore: remove outdated required status checks in asf.yaml (#18832)
by Y Ethan Guo
· 2 weeks ago
e299b84
chore: fix asf.yaml invalid key (#18808)
by Shawn Chang
· 3 weeks ago
facb517
feat(spark): Drop Apache Spark 3.3 integration support (#18786)
by Y Ethan Guo
· 3 weeks ago
b82a5b2
feat: add metrics for bucketassign.minibatch cache hit ratio (#18761)
by Yao Li
· 3 weeks ago
50eb95c
fix(flink): fix disable table service not effective in hudi-flink (#13875)
by fhan
· 3 weeks ago
a687786
chore: add spark4.1 and flink2.1 profile entries to RC bundle validation (#18796)
by Y Ethan Guo
· 3 weeks ago
9565926
feat(blob): default blob.inline.mode to DESCRIPTOR for Lance (#18744)
by voonhous
· 3 weeks ago
12e87b8
fix: Fix dangling lock from storage based lock (#18439)
by Lin Liu
· 3 weeks ago
0138f41
perf: Improve global index performance for commit time ordering (#17797)
by Y Ethan Guo
· 3 weeks ago
d78b702
fix(spark): use HoodieStorageUtils factory in Spark 4.1 legacy parquet read (#18785)
by Y Ethan Guo
· 3 weeks ago
3abe066
feat(common): per-partition event-time rollup and decouple watermark tracking from EVENT_TIME_ORDERING (#18778)
by Xinli Shang
· 3 weeks ago
990fc29
feat(flink): add metrics for RLI load time for bucket assign functions (#18762)
by Peter Huang
· 3 weeks ago
f044d3d
fix(spark): handle Avro 1.12 logical type values in Spark 4.1 read path (#18773)
by Y Ethan Guo
· 3 weeks ago
4ac661a
fix: Fix SQL syntax parser for CREATE TABLE on Spark 4.1 (#18779)
by Rahil C
· 3 weeks ago
89f49b1
fix: Skip pre-compaction rollback metadata reads in getValidInstantTimestamps (#18544)
by Y Ethan Guo
· 3 weeks ago
0b80239
fix: Disable column stats and partition stats indices for Lance base files (#18588)
by Y Ethan Guo
· 3 weeks ago
642d88c
[MINOR] Reserve RFC-105 for Trino Hudi Connector — Shim/Bundle Refactor (#18781)
by Y Ethan Guo
· 3 weeks ago
f67dc26
fix: Enhance hudi-azure-bundle (#18472)
by Lin Liu
· 3 weeks ago
58957a7
fix(aws): implement writer-version update in Glue sync client (#18707)
by deepakpanda93
· 3 weeks ago
a5d8fe1
test(azure): skip ITAzureStorageLockClientAzurite when MCR image pull fails (#18772)
by Xinli Shang
· 3 weeks ago
162cac2
fix(lance): fail fast when write schema contains VARIANT columns (#18775)
by Rahil C
· 3 weeks ago
d94b2e2
feat(spark): add show_inflight_commits and cleanup_stale_inflight_com… (#18709)
by Mahsood Ebrahim
· 3 weeks ago
e406e5d
feat(spark): add restore_to_instant stored procedure (#18696)
by Mahsood Ebrahim
· 3 weeks ago
71aa121
fix(flink): enforce Parquet VARIANT annotation in Flink schema conversion for unshredded variant (#18539)
by Krishen
· 3 weeks ago
9026c7d
test(trino): de-flake TestHudi*FileOperations by polling for span stability (#18766)
by Xinli Shang
· 3 weeks ago
0983c0d
chore: Harden workflow against command injection in PR title validation (#18771)
by oryn
· 3 weeks ago
055aa41
fix(lance): Support Lance file format on Spark 4.1 (#18760)
by voonhous
· 4 weeks ago
db46081
fix(flink): Fix minibatch lookup for global record level index (#18759)
by Shuo Cheng
· 4 weeks ago
cd2c8b8
[MINOR] Fix typos in comments and assertion messages (#18763)
by Xinli Shang
· 4 weeks ago
b934633
feat(utilities): add Spark/HoodieStreamer validators for pre-commit validation - Phase 3 (#18405)
by Xinli Shang
· 4 weeks ago
780595b
fix(flink): fix the write handle close for append write (#18756)
by Danny Chan
· 4 weeks ago
4035f70
chore: Fix license for copy_staging_repo.sh (#18753)
by Y Ethan Guo
· 4 weeks ago
ae9866a
fix: Enable schema merging for incremental and dfs sources (#18385)
by Lin Liu
· 4 weeks ago
6e32f36
fix: Follow-ups to JsonKinesisSource: numeric sequence comparison and call-site fixes (#18689)
by Lin Liu
· 4 weeks ago
dca76ca
fix: Support data pruning using nested partition columns (#18126)
by Lin Liu
· 4 weeks ago
3d32596
fix(docker): tag base image per Java version to avoid latest collision (#18663)
by voonhous
· 4 weeks ago