Sign in
apache
/
arrow-rs
/
HEAD
ded985c
fix(arrow-csv): bound RecordDecoder::flush offset accumulation (#9886)
by masumi-ryugo
· 7 hours ago
main
7f6524d
fix(parquet): bound schema num_children before Vec::with_capacity (#9884)
by masumi-ryugo
· 7 hours ago
5bf8912
[arrow-string]: add `like::eq_ascii_ignore_case` kernel (#9871)
by albertlockett
· 9 hours ago
8091f3f
[arrow-array] Use consistent `value_length` name in FixedSizeBinaryArray (#9905)
by Andrew Lamb
· 27 hours ago
1d89737
fix(parquet): Prevent negative list sizes in Thrift compact protocol parser (#9868)
by masumi-ryugo
· 28 hours ago
3790d88
Pre-size dedup HashTable in GenericByteDictionaryBuilder::with_capacity (#9908)
by Sebastian Rabenhorst
· 28 hours ago
e6c9448
Update release schedule on README (#9881)
by Andrew Lamb
· 29 hours ago
21f739c
test(parquet): replace `InMemoryArrayReader` with `PrimitiveArrayReader` in tests (#9847)
by Hippolyte Barraud
· 29 hours ago
99998d6
API to help with the pattern of 'replaces the values of the REE array (#9891)
by RIchard Baah
· 32 hours ago
d88cc48
replace Dictionary::try_new() calls with with_values. (#9894)
by RIchard Baah
· 33 hours ago
f4a0f22
chore[benches]: add REE interleave benchmarks (#9849)
by Alfonso Subiotto Marqués
· 2 days ago
0e478d8
docs: Add guidance for AI assisted submissions to CONTRIBUTING.md (#9892)
by Ed Seidl
· 2 days ago
75f7916
Prevent `FixedSizeBinaryArray` `i32` offset overflows (try 2) (#9872)
by Andrew Lamb
· 2 days ago
70e4069
[arrow-ipc]: dictionary builders for delta - doc fix and integration tests for nested types (#9853)
by albertlockett
· 3 days ago
d3cad6e
[Parquet] Do not panic when trying to skip records in delta encoded files using non-standard block sizes (#9794)
by Ed Seidl
· 5 days ago
7ffcf0d
Add more documentation for FixedSizeBinary arrays (#9866)
by Andrew Lamb
· 5 days ago
1ddb4d9
[PARQUET] Allow `UNKNOWN` logical type annotation on any physical type (#9855)
by Ed Seidl
· 5 days ago
f725bc9
configurable data page v2 compression threshold (#9826)
by Leonardo Yvens
· 5 days ago
7ad2299
Minor: document why FixedSizeBinary offset is always 0 (#9861)
by Andrew Lamb
· 5 days ago
fd86c75
fix(arrow-cast): fix incorrect conversion (#9852)
by bboissin
· 6 days ago
5255de4
docs: Update contributing guidelines with benchmark results (#9782)
by Andrew Lamb
· 7 days ago
91102e7
REE row conversion speed up (#9845)
by RIchard Baah
· 7 days ago
73b05de
Prevent `ArrayData::slice` length overflow (#9813)
by Andrew Lamb
· 7 days ago
b114241
Fix JSON reader panic for non-nullable zero-size FixedSizeList (#9810)
by Liam Bao
· 8 days ago
ba7dada
fix(parquet): avoid panic on ColumnIndex length mismatch (#9833)
by pchintar
· 8 days ago
fe3c0c9
fix(parquet): Avoid panic on malformed thrift bool fields in parquet metadata (#9840)
by BoazC-MSFT
· 8 days ago
fc3f778
Prepare for 58.2.0 release (#9800)
by Andrew Lamb
· 8 days ago
58.2.0
58.2.0-rc1
3c4311c
Prevent buffer builder length overflow in `MutableBuffer::extend_zeros` (#9820)
by Andrew Lamb
· 8 days ago
11f13a5
feat(parquet): batch RLE runs in level encoder via scan-ahead (#9830)
by Hippolyte Barraud
· 8 days ago
a3dbc15
feat: add `has_non_empty_nulls` helper function in `OffsetBuffer` (#9711)
by Raz Luvaton
· 9 days ago
b00b5aa
fix(ipc): correct skip_field handling for V4 Union (#9829)
by pchintar
· 9 days ago
b4c02d0
support length() on Run-end encoding arrays (#9838)
by RIchard Baah
· 9 days ago
710e68e
Prevent ArrayData validation length overflow (#9816)
by Andrew Lamb
· 9 days ago
cb8d4c0
fix: correct accounting in `DictEncoder::estimated_memory_size`, `Interner::estimated_memory_size` (#9720)
by Mikhail Zabaluev
· 9 days ago
4fa8d2f
fix(ipc): replace wildcard in skip_field with explicit DataType handling (#9822)
by pchintar
· 11 days ago
c4b2569
Replace `BooleanBufferBuilder` with `NullBufferBuilder` in arrow-json if applicable (#9811)
by Liam Bao
· 11 days ago
04f207c
[Json] Remove arrow-data dependency from arrow-json (#9812)
by Liam Bao
· 11 days ago
4e4a430
fix(ipc): Avoid panic on malformed compressed buffer prefix (#9802)
by pchintar
· 11 days ago
479ad5b
Prevent Rows row index overflow (#9817)
by Andrew Lamb
· 11 days ago
a8e8261
Prevent repeat slice length overflow (#9819)
by Andrew Lamb
· 11 days ago
5fae47e
Prevent BitChunks length overflow (#9818)
by Andrew Lamb
· 11 days ago
e99815a
fix(ipc): reader misalignment when skipping ListView / LargeListView columns (#9806)
by pchintar
· 12 days ago
c09945f
parquet: fix panic in DeltaByteArrayDecoder on invalid prefix lengths (#9797)
by pchintar
· 12 days ago
54b5a84
refactor(ipc): derive Default for CompressionContext (#9809)
by Matt Butrovich
· 12 days ago
54fa893
feat(parquet): fuse level encoding with counting and histogram updates (#9795)
by Hippolyte Barraud
· 12 days ago
1a721a4
Document Security Policy (#9730)
by Andrew Lamb
· 12 days ago
620e39d
fix: lazy-init zstd compression contexts to avoid unnecessary FFI calls (#9808)
by Matt Butrovich
· 12 days ago
a3592d6
Push `LIMIT` / `OFFSET` into the last `RowFilter` predicate and skip unused row groups (#9766)
by Huaijin
· 12 days ago
2a77e10
feat(ipc): add with_skip_validation to StreamDecoder (#9749)
by pantShrey
· 13 days ago
b93240a
feat: make FFI structs fields `pub` (#9772)
by Eshed Schacham
· 2 weeks ago
9d27619
Fix RecordBatch::normalize() null bitmap bug and add StructArray::flatten() (#9733)
by Han You
· 2 weeks ago
41ccf2d
perf(parquet): Defer fixed length byte array buffer alloc and skip zero-batch init (#9756)
by Lanqing Yang
· 2 weeks ago
2d8cd5a
parquet: O(1) skip for bw=0 miniblocks in DeltaBitPackDecoder (#9786)
by Thaddeus Covert
· 2 weeks ago
e9cbabd
feat(parquet): batch consecutive null/empty rows in `write_list` (#9752)
by Hippolyte Barraud
· 2 weeks ago
73ceb1d
Expose ColumnCloseResult on ArrowColumnChunk (#9773)
by Leonardo Yvens
· 2 weeks ago
9a2b49c
Add benchmark for cast from/to decimals (#9729)
by Congxian Qiu
· 2 weeks ago
8de8900
arrow-ipc: Write 0 offset buffer for length-0 variable-size arrays (#9717)
by Atwam
· 2 weeks ago
63b5470
Remove `len` field from buffer builder (#9750)
by Peter L
· 2 weeks ago
5912469
refactor(arrow-avro): use `Decoder::flush_block` in async reader (#9726)
by Mikhail Zabaluev
· 2 weeks ago
98f8450
Docs: add example of how to read parquet row groups in parallel (#9396)
by Andrew Lamb
· 2 weeks ago
d03571b
feat(ipc): Remove per-message flush in IPC writer hot path (#9763)
by pchintar
· 2 weeks ago
922bdd5
docs(variant): link VariantArray doc to official Parquet Variant extension type (#9779)
by Martin Charrel
· 2 weeks ago
f73922b
[Json] Support `FixedSizeList` in json decoder (#9715)
by Liam Bao
· 2 weeks ago
52ff63c
refactor(parquet): replace magic `8` literals with named constants (#9751)
by Hippolyte Barraud
· 2 weeks ago
de11d9c
perf(parquet): Vectorize dict-index bounds check in RleDecoder::get_batch_with_dict (up to -7.9%) (#9746)
by Daniël Heres
· 2 weeks ago
9d3a4d9
chore: add benchmark for row filters with LIMIT short-circuit (#9767)
by Huaijin
· 2 weeks ago
5150224
feat[arrow-ord]: suppport REE comparisons (#9621)
by Alfonso Subiotto Marqués
· 2 weeks ago
fa1cc58
[Json] Use `partition` and `take` in RunEndEncoded decoder (#9658)
by Liam Bao
· 2 weeks ago
f0dd1c2
chore: Refine the error message for List to non List cast (#9757)
by Oleks V
· 2 weeks ago
51b02f1
[Parquet] perf: preallocate capacity for ArrayReaderBuilder (#9093)
by Lanqing Yang
· 3 weeks ago
89b1497
Improve take performance on List arrays (#9643)
by Adam Gutglick
· 3 weeks ago
84b3454
fix: ParquetError when reading corrupt parquet file with truncated data instead of Panic (#9725)
by xuzifu666
· 3 weeks ago
58fbb17
remove panics in unshred variant (#9741)
by Matthew Kim
· 3 weeks ago
c5fed03
Add benchmark for ListView interleave (#9738)
by Vegard Stikbakke
· 3 weeks ago
d7d9ad3
Refactor `RleEncoder::flush_bit_packed_run` to make flow clearer (#9735)
by Ed Seidl
· 3 weeks ago
4676c06
[Variant] Take top-level nulls into consideration when extracting perfectly shredded children (#9702)
by Adam Gutglick
· 3 weeks ago
370d426
arrow-arith: fix 'occured' -> 'occurred' in arity.rs comments (#9736)
by Sai Asish Y
· 3 weeks ago
182c7a9
Fix clippy warning in fixed_size_binary_array.rs (#9712)
by Adam Gutglick
· 3 weeks ago
9ed4c6f
feat(parquet): precompute `offset_index_disabled` at build-time (#9724)
by Hippolyte Barraud
· 3 weeks ago
ddde66d
Add a test for reading nested REE data in json (#9634)
by Andrew Lamb
· 3 weeks ago
2b8a761
ci: use ubuntu-slim runner for lightweight CI jobs (#9630)
by Yu-Chuan Hung
· 3 weeks ago
72b4576
[Variant] Support Binary/LargeBinary children (#9610)
by Adam Gutglick
· 3 weeks ago
b946165
Fedora license audit (#9704)
by Michel Lind
· 3 weeks ago
7a089ad
Optimize RowNumberReader to be 8x faster (#9680)
by Samyak Sarnayak
· 3 weeks ago
06c3bd0
feat(parquet): add wide-schema writer overhead benchmark (#9723)
by Hippolyte Barraud
· 3 weeks ago
38d78c3
Support `GenericListViewArray::new_unchecked` and refactor `ListView` json decoder (#9648)
by Liam Bao
· 3 weeks ago
471f6c3
chore(deps): bump actions/upload-pages-artifact from 4 to 5 (#9713)
by dependabot[bot]
· 3 weeks ago
bfee844
Add mutable bitwise operations to `BooleanArray` and `NullBuffer::union_many` (#9692)
by Matt Butrovich
· 3 weeks ago
711fac8
feat(parquet): add `push_decoder` benchmark for `PushBuffers` overhead (#9696)
by Hippolyte Barraud
· 3 weeks ago
d69c604
chore(deps): bump pytest from 7.2.0 to 9.0.3 in /parquet/pytest (#9706)
by dependabot[bot]
· 3 weeks ago
8b159ad
Add `finish_preserve_values` to `ArrayBuilder` trait (#9601)
by Adam Reichold
· 3 weeks ago
dad0be4
[Arrow] Add API to check if `Field` has a valid `ExtensionType` (#9677)
by Konstantin Tarasov
· 3 weeks ago
88b7fca
ParquetMetaDataPushDecoder API to clear all buffered ranges (#9673)
by Nathan
· 3 weeks ago
68851ef
chore(deps): update hashbrown requirement from 0.16.0 to 0.17.0 (#9691)
by dependabot[bot]
· 4 weeks ago
4778077
[Variant] Add `VariantArrayBuilder::append_nulls` API (#9685)
by Konstantin Tarasov
· 4 weeks ago
a42c475
Fix union cast incorrectness for duplicate field names (#9666)
by Matthew Kim
· 4 weeks ago
70ae764
minor: Re-enable CDC bench (#9686)
by Ed Seidl
· 4 weeks ago
c180812
chore(deps): bump actions/github-script from 8 to 9 (#9690)
by dependabot[bot]
· 4 weeks ago
b36beac
[Variant] `variant_get` should follow JSONPath semantics for Field path element (#9676)
by Konstantin Tarasov
· 4 weeks ago
adf9308
feat(parquet): add struct-column writer benchmarks (#9679)
by Hippolyte Barraud
· 4 weeks ago
Next »