Sign in
apache
/
hudi
/
HEAD
34e9c7c
test(schema): Add MOR log-only compaction tests for custom types (#18583)
by voonhous
· 20 hours ago
master
63f721d
fix: Fix reflection ctor signature for AwsGlueCatalogSyncTool in HiveSyncContext (#18697)
by Matthew
· 21 hours ago
87019a3
fix(hive): Tolerate pruned ArrayWritable in nested BLOB projection (#18581)
by voonhous
· 22 hours ago
4029560
feat(flink): Backport Flink 2.1 nested Parquet column readers and INT64 timestamp dispatch (FLINK-35702) (#18636)
by Shihuan Liu
· 25 hours ago
c36a5f7
fix(flink): Avoid emitting deletes for Flink source v2 batch reads (#18694)
by Shuo Cheng
· 31 hours ago
91f341f
fix: filter EXTERNAL property in SparkCatalogMetaStoreClient.toCatalogTable (#18672)
by Prashant Wason
· 3 days ago
471bb48
refactor: move checkpoint metadata lookup helper to hudi-common (#18489)
by Surya Prasanna
· 3 days ago
127c6ee
feat(common): roll over commit metadata to clean (#18590)
by Krishen
· 3 days ago
4d0e9cd
fix(lance): prevent file splitting for Lance base files to avoid duplicate reads (#18678)
by Rahil C
· 5 days ago
cde0e39
fix(ci): bump Maven heap to 8g to fix OOM in CI builds (#18618)
by Hudi Agent
· 5 days ago
695294c
fix: Honor SparkSession overrides for rebase mode and timezone in compaction tasks (#18675)
by Y Ethan Guo
· 6 days ago
f7508de
test(spark): Add date logical type test to TestAvroConversionUtils (#18584)
by ashokkumar-allu
· 7 days ago
f8d70cb
feat: Introduce a Spark procedure to trigger LSM timeline compaction (#18659)
by chaoyang
· 7 days ago
39797b4
docs: claim RFC-104 for schema evolution unification (#18660)
by voonhous
· 7 days ago
38db5ed
chore: Add context7.json with URL and public key (#18662)
by vinoth chandar
· 8 days ago
0a28d69
chore(docker): reduce base_java17 and spark_base image size (#18542)
by kartikeyaagrawal
· 8 days ago
7443856
fix: make SparkCatalogMetaStoreClient.setMetaConf a no-op (#18652)
by Prashant Wason
· 8 days ago
7b662e8
fix: filter spark.sql.* properties in SparkCatalogMetaStoreClient.toCatalogTable (#18654)
by Prashant Wason
· 8 days ago
8d348cc
feat(examples): Add Hudi Unstructed Demo env (#18643)
by Rahil C
· 9 days ago
426cbb8
chore(deps): Pin AWS v1 SDK BOM to short-circuit transitive version-range walk (#18619)
by voonhous
· 9 days ago
fd63851
feat(flink): extend Flink quickstart example to use source v2 (#18518)
by Peter Huang
· 9 days ago
7c2c56e
fix(schema): Handle BLOB and VARIANT in Hive-reader rewriteRecordWithNewSchema (#18580)
by voonhous
· 9 days ago
642e1d3
chore(release): Moving to 1.3.0-SNAPSHOT on master branch (#18620)
by Y Ethan Guo
· 10 days ago
eed9aa7
feat: Add variant support description to RFC-99 (#18274)
by voonhous
· 10 days ago
1ededfd
feat(common): When inferring checkpoint/schema from timeline, check non-ingestion write commits (in case they have metadata rolled-over) (#18576)
by Krishen
· 10 days ago
5c73bc0
feat(lance): fix lance writer/reader regarding arrow memory limit issue (#18613)
by Rahil C
· 10 days ago
4bdcdf9b
feat: Create JsonKinesisSource (#18224)
by Lin Liu
· 11 days ago
7ae0fd9
fix(schema): Allow nested projection on BLOB and VARIANT columns in pruneDataSchema (#18566)
by voonhous
· 11 days ago
782552a
fix: Curator class conflict in ZookeeperBasedLockProvider (#18593)
by yuqi
· 11 days ago
20a0105
feat(spark): add Spark 4.1 support (#17674)
by Y Ethan Guo
· 11 days ago
fdf27db
fix(vector): Preserve VECTOR/BLOB metadata on SQL INSERT path (#18540)
by voonhous
· 11 days ago
a8a6917
feat(flink): Vendor Flink 2.1 Dremel nested-reader support classes (#18567)
by Shihuan Liu
· 11 days ago
29f9c40
feat(ci): enable auto-merge and require all GitHub Actions checks on master (#18594)
by Y Ethan Guo
· 12 days ago
787953f
feat(blob): add support for lance blob inline descriptor reading (#18586)
by Rahil C
· 12 days ago
8530644
feat(lance): support simplified path for lance blob inline reading (#18575)
by Rahil C
· 12 days ago
4f3e885
fix(ci): bump surefire test heap from 3g to 4g (#18589)
by Y Ethan Guo
· 13 days ago
c1569db
fix(clean): address review comments on empty clean support (#18587)
by Y Ethan Guo
· 13 days ago
436bd66
fix(vector): Pass plain FIXED through to VECTOR projection on Hive read (#18582)
by voonhous
· 13 days ago
2059c11
feat(clean): Adding empty clean support to hudi (#18337)
by Sivabalan Narayanan
· 14 days ago
217e2a7
feat: Adding support to inject custom configs to parquet writer (#18379)
by Sivabalan Narayanan
· 14 days ago
edaa168
fix: FileGroupReader drops mandatory partition columns from dataSchema (#18570)
by tiennguyen-onehouse
· 2 weeks ago
2092890
fix: ProtoConversionUtil$AvroSupport static init under Avro 1.12 (#18571)
by tiennguyen-onehouse
· 2 weeks ago
110b9be7
fix(variant): allow VariantType writes through Hudi's V1 DataSource on Spark 4 (#18564)
by voonhous
· 2 weeks ago
9d1f817
fix(vector): Register VECTOR HMS column as BINARY on Spark CREATE (#18545)
by voonhous
· 2 weeks ago
1e64662
feat(lance): round-trip Hudi VECTOR columns as native Lance fixed-size lists (#18497)
by Rahil C
· 2 weeks ago
ace2871
feat(flink): Introduces dictionary encoding of payload partition path for RocksDBIndexBackend (#18560)
by Shuo Cheng
· 2 weeks ago
7f4dd31
fix(lance): Add Hive InputFormat stubs and fix Spark SQL for Lance file format (#18162)
by Rahil C
· 2 weeks ago
ddbdbb9
chore(spark): bump spark4.version to 4.0.2 (#18549)
by voonhous
· 2 weeks ago
8623898
fix: JDBC connection leak in HiveIncrementalPuller.saveDelta() (#18460)
by Venkateswarlu Boggavarapu
· 2 weeks ago
4260914
fix: Parquet small-precision decimals decode ClassCastException (#18552)
by Shihuan Liu
· 2 weeks ago
cd83cf4
chore(docker): add Hadoop 3.4.0 / Hive 2.3.10 / Spark 4.0.2 compose stack (#18550)
by voonhous
· 2 weeks ago
4ef56e4
feat(blob): followup fixes for blob reader (#18538)
by Rahil C
· 2 weeks ago
e4904ba
feat: Add support for exclusive rollbacks with multi writer (#18448)
by Lokesh Jain
· 2 weeks ago
0d57435
fix: VARIANT Hive sync error when performing CREATE table DDL (#18511)
by voonhous
· 2 weeks ago
e303579
fix: prevent parseTypeDescriptor crash for VARIANT (#18510)
by voonhous
· 2 weeks ago
f9dead0
feat: Adding support to block archival on last known ECTR for v6 tables (#18380)
by Sivabalan Narayanan
· 2 weeks ago
59fee58
feat(lance): Bump lance to 4.0.0 and lance-spark to 0.4.0 (#18498)
by Rahil C
· 2 weeks ago
a834736
feat(vector): Add Spark SQL DDL CREATE TABLE support for VECTOR type (#18488)
by voonhous
· 2 weeks ago
76a0a27
perf(common): Avoid double-iterating log files in file-system-view filters (#18531)
by voonhous
· 2 weeks ago
adf29ac
fix: HoodieStorage resource leak in FileSystemBasedLockProvider.close() (#18461)
by mailtoboggavarapu-coder
· 2 weeks ago
f35b69c
feat(blob): Read Blobs in Spark SQL (#18098)
by Tim Brown
· 2 weeks ago
0fb4454
feat(utilities): add external HudiHiveSyncJob for on-demand Hive sync (#18204)
by Surya Prasanna
· 3 weeks ago
3a387da
feat(flink): Implement continuous sorting feature for append write (#18083)
by Prashant Wason
· 3 weeks ago
95199f0
perf(common): avoid stream allocation in CollectionUtils.createImmutableList (#18530)
by voonhous
· 3 weeks ago
91dba3e
chore(common): Consolidate MapUtils into CollectionUtils (#18529)
by voonhous
· 3 weeks ago
cfb9833
chore(docker): Remove duplicate yarn.nodemanager.bind-host in entrypoint.sh (#18527)
by voonhous
· 3 weeks ago
937a64a
fix: whitelist Flink _2.12 artifacts in scala-2.13 enforcer rule (#18508)
by voonhous
· 3 weeks ago
55bf91a
introduce static helper method to remove clones (#18533)
by aaaZayne
· 3 weeks ago
eaaae8a
feat: Add Azure-based storage lock (#17951)
by chrevanthreddy
· 3 weeks ago
3d0ab80
chore(docker): bump integ-test docker-compose to Hive 2.3.10 (#18525)
by voonhous
· 3 weeks ago
9ddf582
chore: add timing logs for file index partition and file listing (#18417)
by Surya Prasanna
· 3 weeks ago
97f9628
feat(docker): add --multi-arch flag for cross-platform image builds (#18522)
by Y Ethan Guo
· 3 weeks ago
7bcb8be
feat(metadata): Allow users to safely execute compaction plans on metadata table concurrently through a table service platform (rather than only inline during write) (#18295)
by Krishen
· 3 weeks ago
41cfc19
chore: Add Java 17 Hadoop base image and Spark 4.0.1 docker compose setup (#18520)
by voonhous
· 3 weeks ago
0356488
fix(docker): fix docker image build with Java 11 and Hive 2.3.10 (#18519)
by Y Ethan Guo
· 3 weeks ago
a369773
chore: cleanup docker-compose files (#17950)
by voonhous
· 3 weeks ago
c1af4f5
fix: use forward slash literal and remove unused import in DFSPropertiesConfiguration (#18454)
by mailtoboggavarapu-coder
· 3 weeks ago
2b33f5d
fix: fix BufferedReader resource leak in FileIOUtils.readAsUTFStringLines (#18470)
by mailtoboggavarapu-coder
· 3 weeks ago
a649188
feat(spark): refresh parquet tools clustering strategy for current master (#18409)
by Surya Prasanna
· 3 weeks ago
f144abc
feat: Include ReverseOrderHoodieRecordPayload (#17928)
by Surya Prasanna
· 3 weeks ago
5b68607
fix: fix Scanner file handle leak in HiveIncrementalPuller.executeIncrementalSQL (#18457)
by mailtoboggavarapu-coder
· 3 weeks ago
cad1530
fix: Scanner resource leak in SqlFileBasedSource.fetchNextBatch (#18467)
by mailtoboggavarapu-coder
· 3 weeks ago
5066fcc
feat: Adding rolling extra metadata support (#18421)
by Sivabalan Narayanan
· 3 weeks ago
d3e0201
fix(common): FutureUtils:allOf should always throw root cause exception (#18456)
by Krishen
· 3 weeks ago
12b3a06
feat: Support to cap max commits to clean in one round of clean commit (#18322)
by Sivabalan Narayanan
· 3 weeks ago
98f90c1
fix(payload): support sentinel no-op updates in DefaultHoodieRecordPayload (#18413)
by Surya Prasanna
· 3 weeks ago
4b15e50
feat(sync): Map VARIANT type to struct in Hive, Spark, and BigQuery sync (#18483)
by voonhous
· 3 weeks ago
8cd2648
chore: Allow versions to be specified in build_docker_images.sh (#17948)
by voonhous
· 3 weeks ago
da2667c
feat(sync): Map BLOB type to struct in Hive and BigQuery sync (#18482)
by voonhous
· 3 weeks ago
0bdab84
feat(flink): Add metrics for RocksDB index backend in bucket assigner (#18484)
by Shuo Cheng
· 3 weeks ago
613fc49
feat(common): add log reader scan metrics and logging for log block processing (#18412)
by Surya Prasanna
· 3 weeks ago
fc7f303
chore(deps): bump org.apache.logging.log4j:log4j-core (#18490)
by dependabot[bot]
· 4 weeks ago
00a4066
feat(sync): Map VECTOR type to binary for metastore sync support (#18480)
by voonhous
· 4 weeks ago
88c146e
chore(ci): Clean up env variable leak in TestSqlConf (#18486)
by Geser Dugarov
· 4 weeks ago
c83ae87
fix(flink): Handle bootstrap write metadata correctly after job resca… (#18485)
by Shuo Cheng
· 4 weeks ago
8f153b2
perf(core): optimize rollback listing calls on metadata table (#18279)
by Balajee Nagasubramaniam
· 4 weeks ago
5d61a35
refactor(flink): Refactor Flink compaction/clean pipeline with composite table service handlers (#18477)
by Shuo Cheng
· 4 weeks ago
eaa9c8b
fix: fix BufferedReader resource leak in InputStreamConsumer (#18469)
by mailtoboggavarapu-coder
· 4 weeks ago
6310d70
fix: avoid duplicate archived timeline instants from leftover merge files (#18408)
by Surya Prasanna
· 4 weeks ago
f063aa5
fix(hfile): use Hadoop WritableUtils VarInt encoding in HFile block index writer (#18465)
by Asish Kumar
· 4 weeks ago
Next »