apache / paimon / HEAD
766d08b
[core] Limit max parallelism for postpone batch write fixed bucket to avoid too many buckets (#7212)
by yuzelin
· 5 hours ago
master
773dcf6
[spark] Create global index topology update spark parallelism (#7210)
by YeJunHao
· 9 hours ago
cf34dcf
[core] Support pushing down Timestamp filter in parquet format (#7203)
by Kerwin Zhang
· 9 hours ago
8f2a4ad
[python] fix data evolution with_slice out-of-bounds: return empty rows instead of all data (#7207)
by XiaoHongbo
· 10 hours ago
4681f4a
[core] Disable async write for blob writer (#7208)
by YeJunHao
· 10 hours ago
9aec7dd
[python] fix data evolution merge batch size mismatch issue (#7205)
by XiaoHongbo
· 10 hours ago
85c28c7
[python] Extract _write_manifest_file in FileStoreCommit
by JingsongLi
· 13 hours ago
f063108
[spark] Support UPDATE on Paimon append-only table in spark V2 write (#7097)
by Kerwin Zhang
· 15 hours ago
f64b147
[python] Record min and max row id in ManifestFileMeta (#7202)
by Jingsong Lee
· 15 hours ago
4a48d4d
[python] fix bucket num calculation (#7192)
by XiaoHongbo
· 17 hours ago
21a66d1
[python] Data Evolution with_slice should use row id to do slice (#7199)
by Jingsong Lee
· 21 hours ago
88618e5
[core] Fix that get format table in rest catalog might throw permission exception (#7195)
by yuzelin
· 28 hours ago
7f258f3
[python] Row Id Ranges push down should be finished in DataEvolutionSplitRead (#7196)
by Jingsong Lee
· 28 hours ago
a10a448
[spark] Add MergeIntoTable alignment methods for v2 write (#7144)
by Kerwin Zhang
· 29 hours ago
f6bee4e
[python] Python BTree support between predicate (#7193)
by Jingsong Lee
· 31 hours ago
bec9ead
[core] supports multiple partitions in BTreeGlobalIndexBuilder (#7191)
by Faiz
· 35 hours ago
ebe39f0
[core][python] Support null bitmap in Python BTree (#7190)
by Jingsong Lee
· 2 days ago
47e7801
[python] Use pyroaring.BitMap64 for better performance (#7185)
by Jingsong Lee
· 2 days ago
d171ad8
[spark] Fix rescale procedure to only read real buckets for postpone bucket table (#7184)
by Juntao Zhang
· 2 days ago
3e706fe
[spark] Support spark compaction for postpone bucket table (#7169)
by Juntao Zhang
· 2 days ago
6c1415b
[python] Refactor to File Scanner and add _filter_manifest_files_by_row_ranges (#7177)
by Jingsong Lee
· 2 days ago
61d3a57
[python] fix oss path handling for pyarrow 6.x (py36 compatibility) (#7180)
by XiaoHongbo
· 2 days ago
f17ca29
[core] remark row-tracking.enabled as Immutable (#7181)
by LsomeYeah
· 2 days ago
f0d1eb4
[python] fix column nullability info lost issue during schema rebuilding (#7174)
by XiaoHongbo
· 2 days ago
b8d7ac7
[core] Allow chain table to use non-deduplicate merge engines. (#7172)
by Junrui Lee
· 2 days ago
c0c91e2
[python] fix duplicated unused file in btree index module (#7178)
by Faiz
· 2 days ago
111efca
[core] Support pushing down Decimal filter in parquet format (#7175)
by Kerwin Zhang
· 2 days ago
ef263a7
[spark] Simplify codes of BTreeIndexTopoBuilder and CreateGlobalIndexProcedure
by JingsongLi
· 3 days ago
3c0dd80
[spark] Introduce substring transform (#7170)
by xuzifu666
· 3 days ago
1cc351c
[python] Support zstd decompress for btree index (#7167)
by Jingsong Lee
· 3 days ago
eecc022
[python] support drop_partitions for rest catalog (#7164)
by XiaoHongbo
· 3 days ago
6948f85
[python] Python BTree index reader supports INT and BIGINT (#7163)
by Jingsong Lee
· 3 days ago
8a54967
[hotfix] Add Variant TODO in ParquetSchemaConverter
by JingsongLi
· 4 days ago
c2e1532
[core] Do not annotate the Variant logical type for backward compatibility (#7161)
by Zouxxyy
· 4 days ago
327fe5c
[python] Implement BTree Reader in Python (#7160)
by Jingsong Lee
· 4 days ago
d2bbafa
[core] Improve clustering columns parsing with trim support (#7159)
by LsomeYeah
· 5 days ago
aef59c8
[core][spark] Move btree builder from spark to core (#7156)
by Jingsong Lee
· 5 days ago
1d8e93d
[doc] Refactor to 'Update Columns By Shards' in python
by JingsongLi
· 5 days ago
1ae465b
[doc] Add format table in python-api
by JingsongLi
· 5 days ago
4c7d9de
[python] support format table (#7154)
by XiaoHongbo
· 5 days ago
265f908
[python] Enable update data to compute a New Column (scan + rewrite with shards) (#7148)
by YeJunHao
· 6 days ago
f633efa
[hotfix] Rename to _compute_slice_split_file_idx_map in data evolution
by JingsongLi
· 6 days ago
64293b7
[python] correct with_slice index for split with multi files (#7153)
by XiaoHongbo
· 6 days ago
5be974b
[hotfix] Minor optimize performance in authedReader
by JingsongLi
· 6 days ago
8a7839b
[spark] Refactor UPDATE command to compute aligned expressions in analysis phase (#7149)
by Kerwin Zhang
· 6 days ago
2538f69
[core] DataTableRead should add auth columns to row filters
by JingsongLi
· 6 days ago
ba21d6b
[core][flink][spark] Fix column pruning with row filter (#7150)
by Jiajia Li
· 6 days ago
5925d39
[rest] Add pluggable signer architecture for REST API authentication (#7100)
by Dapeng Sun(孙大鹏)
· 6 days ago
58dfc4a
[docs] add description for data_evolution_merge_into procedure (#7147)
by Faiz
· 7 days ago
990b8e7
[core] Support dls file system (#7145)
by Kerwin Zhang
· 7 days ago
2ef7cd1
[build] Use PR title and description for commit message (#7142)
by cxzl25
· 7 days ago
bc6d341
[flink] introduce a simplified MERGE INTO procedure on data-evolution-table for flink (#7128)
by Faiz
· 7 days ago
50a3d8f
[python/hotfix] fix faiss-cpu version define and make faiss-cpu optional (#7138)
by XiaoHongbo
· 7 days ago
7fddbe4
[python/doc] add alterTable in pypaimon doc (#7140)
by XiaoHongbo
· 7 days ago
63d95bc
[rest] Fix auth header generation order in HttpClient (#7137)
by Dapeng Sun(孙大鹏)
· 7 days ago
a56d29c
[clone] support skipping cloning if the target table already exists. (#7139)
by shyjsarah
· 7 days ago
3218285
[iceberg] Support real uuid for iceberg table (#7136)
by jianguotian
· 7 days ago
6f702ad
[core] Add conflict detection for data evolution row id (#7124)
by Jingsong Lee
· 7 days ago
98f4de6
[spark] Make SparkCatalystPartitionPredicate thread safe (#7135)
by Zouxxyy
· 8 days ago
5237031
[core][spark] Introduce lower transform (#7134)
by xuzifu666
· 8 days ago
ccbfae1
[core] Notify null blob to consumer in BlobFormatWriter
by JingsongLi
· 8 days ago
11f0c1d
[format] Introduce null blob (#7125)
by YeJunHao
· 8 days ago
73a172f
[core] Refactor format scan optimize to simplify predicates (#7132)
by YeJunHao
· 8 days ago
442b6df
[spark] Explicitly disable v2-write for row-level operation tests (#7131)
by Kerwin Zhang
· 8 days ago
bb8381b
[parquet] remove assertion when creating Range (#7133)
by LsomeYeah
· 8 days ago
fcc845a
[python] add support for ROW/STRUCT types in PyPaimon (#7129)
by ChengHui Chen
· 8 days ago
b2ef40e
[spark] Refactor DataSourceV2 fallback checks into reusable helper methods (#7130)
by Kerwin Zhang
· 8 days ago
f0b864b
[spark] Fix DescriptorToStringFunction (#7127)
by Zouxxyy
· 9 days ago
1bdd662
[core] Rename VectorSearchGlobalIndexResult to ScoredGlobalIndexResult
by JingsongLi
· 9 days ago
c056231
[core] Format table scan less dir in partition predicate (#7122)
by YeJunHao
· 9 days ago
7cf171c
Bump org.assertj:assertj-core from 3.23.1 to 3.27.7 (#7126)
by dependabot[bot]
· 9 days ago
28edeaf
[core] Judge minSequenceNum to decide whether assign sequenceNum to ManifestEntry (#7121)
by YeJunHao
· 9 days ago
8714be2
[spark] Add test: Data Evolution: concurrent merge and compact (#7120)
by Jingsong Lee
· 9 days ago
10a54d3
[spark] Throw exception when to sort compact data evolution table (#7119)
by Jingsong Lee
· 9 days ago
ceefc5c
[spark] Use affected data splits in DataEvolutionPaimonWriter (#7118)
by Jingsong Lee
· 9 days ago
bdcd3ec
[spark] Avoid multiple scan in MergeIntoPaimonDataEvolutionTable (#7116)
by Jingsong Lee
· 9 days ago
a61f3fb
[spark] Enable blob as descriptor with v2 write (#7115)
by Zouxxyy
· 10 days ago
1258f5f
[spark] Fix the display of V2 delete metrics (#7114)
by Zouxxyy
· 11 days ago
548325a
[doc] Separate PyPaimon documentations
by JingsongLi
· 12 days ago
8706428
[core] Data Evolution: cache fileFields to speed up (#7112)
by YeJunHao
· 12 days ago
7af48df
[doc] Separate blob storage in append table
by JingsongLi
· 12 days ago
939b272
[docs] Move blob page to data-evolution (#7113)
by YeJunHao
· 12 days ago
6b61349
[common] Copy bundle while shredding (#7111)
by YeJunHao
· 13 days ago
527f2ab
[core] Support multiple blob fields definition (#7105)
by Jingsong Lee
· 13 days ago
da30f3e
[spark] Fix DELETE with range condition in v2 write (#7104)
by Kerwin Zhang
· 13 days ago
623d30d
[variant] Annotate Variant columns with Variant logical type annotation (#7110)
by Zouxxyy
· 13 days ago
8e5378a
[python] Python commit snapshot should not contain None fields (#7103)
by Jingsong Lee
· 13 days ago
b271ce6
[parquet] Bump parquet version to 1.16.0 (#7081)
by Zouxxyy
· 13 days ago
4f8a811
[python] fix data-evolution double-counting issue by using mergedRowCount in ray datasource (#7087)
by XiaoHongbo
· 13 days ago
88332de
[python] Avro schema inconsistents between Java and Python (#7077)
by ChengHui Chen
· 13 days ago
b309fe3
[core] Data-evolution mode support compact data-evolution table without manifest-meta:min/maxRowId (#7102)
by YeJunHao
· 13 days ago
a8a0e28
[core] Add option to support reading sequence_number in AuditLogTable and BinlogTable (#6933)
by bosiew.tian
· 14 days ago
f99ee47
[hotfix] Log channelWriterOutputView writeBytes after close (#7101)
by Zouxxyy
· 14 days ago
af23563
[core] Fix Iceberg metadata parsing error by excluding null doc field from JSON serialization (#7099)
by KangJi
· 14 days ago
f323c6a
[hotfix] Correct mergedRowCount in QueryAuthSplit
by JingsongLi
· 14 days ago
b249c84
[core] Do not use path.toUri().toString() when a location string is expected internally (#7098)
by Zouxxyy
· 14 days ago
f3d6caa
[doc] Add 2026 on NOTICE (#7096)
by yangjf2019
· 14 days ago
1f7477b
[rest] Support getTable by id (#7085)
by kevin
· 14 days ago
7f9a014
[core][spark] introduce action to handle updates on global-indexed columns (#7094)
by Faiz
· 2 weeks ago
b0b6375
[core] Introduce IncrementalSplit to simplify DataSplit (#7093)
by Jingsong Lee
· 2 weeks ago