1. ded985c fix(arrow-csv): bound RecordDecoder::flush offset accumulation (#9886) by masumi-ryugo · 7 hours ago main
  2. 7f6524d fix(parquet): bound schema num_children before Vec::with_capacity (#9884) by masumi-ryugo · 7 hours ago
  3. 5bf8912 [arrow-string]: add `like::eq_ascii_ignore_case` kernel (#9871) by albertlockett · 9 hours ago
  4. 8091f3f [arrow-array] Use consistent `value_length` name in FixedSizeBinaryArray (#9905) by Andrew Lamb · 27 hours ago
  5. 1d89737 fix(parquet): Prevent negative list sizes in Thrift compact protocol parser (#9868) by masumi-ryugo · 28 hours ago
  6. 3790d88 Pre-size dedup HashTable in GenericByteDictionaryBuilder::with_capacity (#9908) by Sebastian Rabenhorst · 28 hours ago
  7. e6c9448 Update release schedule on README (#9881) by Andrew Lamb · 29 hours ago
  8. 21f739c test(parquet): replace `InMemoryArrayReader` with `PrimitiveArrayReader` in tests (#9847) by Hippolyte Barraud · 29 hours ago
  9. 99998d6 API to help with the pattern of 'replaces the values of the REE array (#9891) by RIchard Baah · 32 hours ago
  10. d88cc48 replace Dictionary::try_new() calls with with_values. (#9894) by RIchard Baah · 33 hours ago
  11. f4a0f22 chore[benches]: add REE interleave benchmarks (#9849) by Alfonso Subiotto Marqués · 2 days ago
  12. 0e478d8 docs: Add guidance for AI assisted submissions to CONTRIBUTING.md (#9892) by Ed Seidl · 2 days ago
  13. 75f7916 Prevent `FixedSizeBinaryArray` `i32` offset overflows (try 2) (#9872) by Andrew Lamb · 2 days ago
  14. 70e4069 [arrow-ipc]: dictionary builders for delta - doc fix and integration tests for nested types (#9853) by albertlockett · 3 days ago
  15. d3cad6e [Parquet] Do not panic when trying to skip records in delta encoded files using non-standard block sizes (#9794) by Ed Seidl · 5 days ago
  16. 7ffcf0d Add more documentation for FixedSizeBinary arrays (#9866) by Andrew Lamb · 5 days ago
  17. 1ddb4d9 [PARQUET] Allow `UNKNOWN` logical type annotation on any physical type (#9855) by Ed Seidl · 5 days ago
  18. f725bc9 configurable data page v2 compression threshold (#9826) by Leonardo Yvens · 5 days ago
  19. 7ad2299 Minor: document why FixedSizeBinary offset is always 0 (#9861) by Andrew Lamb · 5 days ago
  20. fd86c75 fix(arrow-cast): fix incorrect conversion (#9852) by bboissin · 6 days ago
  21. 5255de4 docs: Update contributing guidelines with benchmark results (#9782) by Andrew Lamb · 7 days ago
  22. 91102e7 REE row conversion speed up (#9845) by RIchard Baah · 7 days ago
  23. 73b05de Prevent `ArrayData::slice` length overflow (#9813) by Andrew Lamb · 7 days ago
  24. b114241 Fix JSON reader panic for non-nullable zero-size FixedSizeList (#9810) by Liam Bao · 8 days ago
  25. ba7dada fix(parquet): avoid panic on ColumnIndex length mismatch (#9833) by pchintar · 8 days ago
  26. fe3c0c9 fix(parquet): Avoid panic on malformed thrift bool fields in parquet metadata (#9840) by BoazC-MSFT · 8 days ago
  27. fc3f778 Prepare for 58.2.0 release (#9800) by Andrew Lamb · 8 days ago 58.2.0 58.2.0-rc1
  28. 3c4311c Prevent buffer builder length overflow in `MutableBuffer::extend_zeros` (#9820) by Andrew Lamb · 8 days ago
  29. 11f13a5 feat(parquet): batch RLE runs in level encoder via scan-ahead (#9830) by Hippolyte Barraud · 8 days ago
  30. a3dbc15 feat: add `has_non_empty_nulls` helper function in `OffsetBuffer` (#9711) by Raz Luvaton · 9 days ago
  31. b00b5aa fix(ipc): correct skip_field handling for V4 Union (#9829) by pchintar · 9 days ago
  32. b4c02d0 support length() on Run-end encoding arrays (#9838) by RIchard Baah · 9 days ago
  33. 710e68e Prevent ArrayData validation length overflow (#9816) by Andrew Lamb · 9 days ago
  34. cb8d4c0 fix: correct accounting in `DictEncoder::estimated_memory_size`, `Interner::estimated_memory_size` (#9720) by Mikhail Zabaluev · 9 days ago
  35. 4fa8d2f fix(ipc): replace wildcard in skip_field with explicit DataType handling (#9822) by pchintar · 11 days ago
  36. c4b2569 Replace `BooleanBufferBuilder` with `NullBufferBuilder` in arrow-json if applicable (#9811) by Liam Bao · 11 days ago
  37. 04f207c [Json] Remove arrow-data dependency from arrow-json (#9812) by Liam Bao · 11 days ago
  38. 4e4a430 fix(ipc): Avoid panic on malformed compressed buffer prefix (#9802) by pchintar · 11 days ago
  39. 479ad5b Prevent Rows row index overflow (#9817) by Andrew Lamb · 11 days ago
  40. a8e8261 Prevent repeat slice length overflow (#9819) by Andrew Lamb · 11 days ago
  41. 5fae47e Prevent BitChunks length overflow (#9818) by Andrew Lamb · 11 days ago
  42. e99815a fix(ipc): reader misalignment when skipping ListView / LargeListView columns (#9806) by pchintar · 12 days ago
  43. c09945f parquet: fix panic in DeltaByteArrayDecoder on invalid prefix lengths (#9797) by pchintar · 12 days ago
  44. 54b5a84 refactor(ipc): derive Default for CompressionContext (#9809) by Matt Butrovich · 12 days ago
  45. 54fa893 feat(parquet): fuse level encoding with counting and histogram updates (#9795) by Hippolyte Barraud · 12 days ago
  46. 1a721a4 Document Security Policy (#9730) by Andrew Lamb · 12 days ago
  47. 620e39d fix: lazy-init zstd compression contexts to avoid unnecessary FFI calls (#9808) by Matt Butrovich · 12 days ago
  48. a3592d6 Push `LIMIT` / `OFFSET` into the last `RowFilter` predicate and skip unused row groups (#9766) by Huaijin · 12 days ago
  49. 2a77e10 feat(ipc): add with_skip_validation to StreamDecoder (#9749) by pantShrey · 13 days ago
  50. b93240a feat: make FFI structs fields `pub` (#9772) by Eshed Schacham · 2 weeks ago
  51. 9d27619 Fix RecordBatch::normalize() null bitmap bug and add StructArray::flatten() (#9733) by Han You · 2 weeks ago
  52. 41ccf2d perf(parquet): Defer fixed length byte array buffer alloc and skip zero-batch init (#9756) by Lanqing Yang · 2 weeks ago
  53. 2d8cd5a parquet: O(1) skip for bw=0 miniblocks in DeltaBitPackDecoder (#9786) by Thaddeus Covert · 2 weeks ago
  54. e9cbabd feat(parquet): batch consecutive null/empty rows in `write_list` (#9752) by Hippolyte Barraud · 2 weeks ago
  55. 73ceb1d Expose ColumnCloseResult on ArrowColumnChunk (#9773) by Leonardo Yvens · 2 weeks ago
  56. 9a2b49c Add benchmark for cast from/to decimals (#9729) by Congxian Qiu · 2 weeks ago
  57. 8de8900 arrow-ipc: Write 0 offset buffer for length-0 variable-size arrays (#9717) by Atwam · 2 weeks ago
  58. 63b5470 Remove `len` field from buffer builder (#9750) by Peter L · 2 weeks ago
  59. 5912469 refactor(arrow-avro): use `Decoder::flush_block` in async reader (#9726) by Mikhail Zabaluev · 2 weeks ago
  60. 98f8450 Docs: add example of how to read parquet row groups in parallel (#9396) by Andrew Lamb · 2 weeks ago
  61. d03571b feat(ipc): Remove per-message flush in IPC writer hot path (#9763) by pchintar · 2 weeks ago
  62. 922bdd5 docs(variant): link VariantArray doc to official Parquet Variant extension type (#9779) by Martin Charrel · 2 weeks ago
  63. f73922b [Json] Support `FixedSizeList` in json decoder (#9715) by Liam Bao · 2 weeks ago
  64. 52ff63c refactor(parquet): replace magic `8` literals with named constants (#9751) by Hippolyte Barraud · 2 weeks ago
  65. de11d9c perf(parquet): Vectorize dict-index bounds check in RleDecoder::get_batch_with_dict (up to -7.9%) (#9746) by Daniël Heres · 2 weeks ago
  66. 9d3a4d9 chore: add benchmark for row filters with LIMIT short-circuit (#9767) by Huaijin · 2 weeks ago
  67. 5150224 feat[arrow-ord]: suppport REE comparisons (#9621) by Alfonso Subiotto Marqués · 2 weeks ago
  68. fa1cc58 [Json] Use `partition` and `take` in RunEndEncoded decoder (#9658) by Liam Bao · 2 weeks ago
  69. f0dd1c2 chore: Refine the error message for List to non List cast (#9757) by Oleks V · 2 weeks ago
  70. 51b02f1 [Parquet] perf: preallocate capacity for ArrayReaderBuilder (#9093) by Lanqing Yang · 3 weeks ago
  71. 89b1497 Improve take performance on List arrays (#9643) by Adam Gutglick · 3 weeks ago
  72. 84b3454 fix: ParquetError when reading corrupt parquet file with truncated data instead of Panic (#9725) by xuzifu666 · 3 weeks ago
  73. 58fbb17 remove panics in unshred variant (#9741) by Matthew Kim · 3 weeks ago
  74. c5fed03 Add benchmark for ListView interleave (#9738) by Vegard Stikbakke · 3 weeks ago
  75. d7d9ad3 Refactor `RleEncoder::flush_bit_packed_run` to make flow clearer (#9735) by Ed Seidl · 3 weeks ago
  76. 4676c06 [Variant] Take top-level nulls into consideration when extracting perfectly shredded children (#9702) by Adam Gutglick · 3 weeks ago
  77. 370d426 arrow-arith: fix 'occured' -> 'occurred' in arity.rs comments (#9736) by Sai Asish Y · 3 weeks ago
  78. 182c7a9 Fix clippy warning in fixed_size_binary_array.rs (#9712) by Adam Gutglick · 3 weeks ago
  79. 9ed4c6f feat(parquet): precompute `offset_index_disabled` at build-time (#9724) by Hippolyte Barraud · 3 weeks ago
  80. ddde66d Add a test for reading nested REE data in json (#9634) by Andrew Lamb · 3 weeks ago
  81. 2b8a761 ci: use ubuntu-slim runner for lightweight CI jobs (#9630) by Yu-Chuan Hung · 3 weeks ago
  82. 72b4576 [Variant] Support Binary/LargeBinary children (#9610) by Adam Gutglick · 3 weeks ago
  83. b946165 Fedora license audit (#9704) by Michel Lind · 3 weeks ago
  84. 7a089ad Optimize RowNumberReader to be 8x faster (#9680) by Samyak Sarnayak · 3 weeks ago
  85. 06c3bd0 feat(parquet): add wide-schema writer overhead benchmark (#9723) by Hippolyte Barraud · 3 weeks ago
  86. 38d78c3 Support `GenericListViewArray::new_unchecked` and refactor `ListView` json decoder (#9648) by Liam Bao · 3 weeks ago
  87. 471f6c3 chore(deps): bump actions/upload-pages-artifact from 4 to 5 (#9713) by dependabot[bot] · 3 weeks ago
  88. bfee844 Add mutable bitwise operations to `BooleanArray` and `NullBuffer::union_many` (#9692) by Matt Butrovich · 3 weeks ago
  89. 711fac8 feat(parquet): add `push_decoder` benchmark for `PushBuffers` overhead (#9696) by Hippolyte Barraud · 3 weeks ago
  90. d69c604 chore(deps): bump pytest from 7.2.0 to 9.0.3 in /parquet/pytest (#9706) by dependabot[bot] · 3 weeks ago
  91. 8b159ad Add `finish_preserve_values` to `ArrayBuilder` trait (#9601) by Adam Reichold · 3 weeks ago
  92. dad0be4 [Arrow] Add API to check if `Field` has a valid `ExtensionType` (#9677) by Konstantin Tarasov · 3 weeks ago
  93. 88b7fca ParquetMetaDataPushDecoder API to clear all buffered ranges (#9673) by Nathan · 3 weeks ago
  94. 68851ef chore(deps): update hashbrown requirement from 0.16.0 to 0.17.0 (#9691) by dependabot[bot] · 4 weeks ago
  95. 4778077 [Variant] Add `VariantArrayBuilder::append_nulls` API (#9685) by Konstantin Tarasov · 4 weeks ago
  96. a42c475 Fix union cast incorrectness for duplicate field names (#9666) by Matthew Kim · 4 weeks ago
  97. 70ae764 minor: Re-enable CDC bench (#9686) by Ed Seidl · 4 weeks ago
  98. c180812 chore(deps): bump actions/github-script from 8 to 9 (#9690) by dependabot[bot] · 4 weeks ago
  99. b36beac [Variant] `variant_get` should follow JSONPath semantics for Field path element (#9676) by Konstantin Tarasov · 4 weeks ago
  100. adf9308 feat(parquet): add struct-column writer benchmarks (#9679) by Hippolyte Barraud · 4 weeks ago