blob: 7edb8fcdb4bf71a6835001346724057cc5d30bdf [file] [log] [blame]
<!DOCTYPE html>
<html lang="en-US">
<head>
<meta charset="UTF-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1">
<!-- The above meta tags *must* come first in the head; any other head content must come *after* these tags -->
<title>Apache Arrow 15.0.0 Release | Apache Arrow</title>
<!-- Begin Jekyll SEO tag v2.8.0 -->
<meta name="generator" content="Jekyll v4.3.3" />
<meta property="og:title" content="Apache Arrow 15.0.0 Release" />
<meta property="og:locale" content="en_US" />
<meta name="description" content="Apache Arrow 15.0.0 (21 January 2024) This is a major release covering more than 1 months of development. Download Source Artifacts Binary Artifacts For AlmaLinux For Amazon Linux For CentOS For C# For Debian For Python For Ubuntu Git tag Contributors This release includes 536 commits from 101 distinct contributors. $ git shortlog -sn apache-arrow-14.0.2..apache-arrow-15.0.0 67 dependabot[bot] 42 Sutou Kouhei 31 Raúl Cumplido 27 Josh Soref 20 Antoine Pitrou 19 Felipe Oliveira Carvalho 19 mwish 18 Dewey Dunnington 17 James Duong 16 Curt Hagenlocher 16 Dominik Moritz 16 Joris Van den Bossche 15 Jacob Wujciak-Jens 14 Hyunseok Seo 12 Alenka Frim 10 Yue 10 sgilmore10 9 Matt Topol 8 Dane Pitkin 8 Diego Fernández Giraldo 8 Thomas Newton 6 Bryce Mecum 5 Jin Shang 5 Jonathan Keane 5 Kevin Gurney 4 Benjamin Kietzman 4 Gang Wu 3 Ben Harkins 3 Ivan Chesnov 3 Judah Rand 3 Lei Hou 3 Rossi(Ruoxi) Sun 3 Tim Schaub 3 abandy 2 Alfonso Subiotto Marqués 2 David Li 2 Divyansh200102 2 Eero Lihavainen 2 Francis 2 Gavin Murrison 2 JB Onofré 2 Jeremy Aguilon 2 John 2 Junming Chen 2 Laurent Goujon 2 Nic Crane 2 Peter Andreas Entschev 2 Rok Mihevc 2 Stas Stepanov 2 Tsutomu Katsube 2 Vibhatha Lakmal Abeykoon 1 Abram Fleishman 1 Anja Kefala 1 Atheel Massalha 1 Benjamin Schmidt 1 Bob Plotts 1 Bryan Cutler 1 Carl Jackson 1 ChinYikMing 1 Chris Larsen 1 Christian Holm Christensen 1 Dan Homola 1 Dan Stone 1 Danyaal Khan 1 Diogo Teles Sant&#39;Anna 1 Dongjoon Hyun 1 Fatemah Panahi 1 Fernando Mayer 1 Fokko Driesprong 1 Hattonuri 1 Hendrik Makait 1 Jiaxing Liang 1 Jinpeng 1 John Garland 1 Kyle Barron 1 LucasG0 1 Maximilian Muecke 1 Miguel Pragier 1 Miles 1 Neal Richardson 1 Nick Hughes 1 Paul Spangler 1 Phillip LeBlanc 1 Pierre Moulon 1 Quang Hoang 1 Ravjot Brar 1 Tammy DiPrima 1 Will Jones 1 William Ayd 1 Yifeng-Sigma 1 david dali susanibar arce 1 davidhcoe 1 frazar 1 jiawei liang 1 loicalleyne 1 orgadish 1 panbingkun 1 prmoore77 1 shibei 1 zhipeng 1 谢天 Patch Committers The following Apache committers merged contributed patches to the repository. $ git shortlog -sn --group=trailer:signed-off-by apache-arrow-14.0.2..apache-arrow-15.0.0 154 Sutou Kouhei 56 David Li 50 Antoine Pitrou 35 Joris Van den Bossche 30 Jacob Wujciak-Jens 24 Raúl Cumplido 22 Matt Topol 19 Curt Hagenlocher 18 Kevin Gurney 16 Benjamin Kietzman 16 Felipe Oliveira Carvalho 14 mwish 13 Dewey Dunnington 12 AlenkaF 7 Nic Crane 3 Weston Pace 2 Gang Wu 2 Jonathan Keane 1 Josh Soref 1 Rok Mihevc 1 Will Jones 1 dependabot[bot] Changelog Apache Arrow 15.0.0 (2024-01-21 08:00:00) Bug Fixes GH-15192 - [C++] Bring back case_when tests for union types (#39308) GH-32570 - [C++] Fix the issue of ExecBatchBuilder when appending consecutive tail rows with the same id may exceed buffer boundary (#39234) GH-32662 - [C#] Make dictionaries in file and memory implementations work correctly and support integration tests (#39146) GH-33475 - [Java] Add parameter binding for Prepared Statements in JDBC driver (#38404) GH-34532 - [Java][FlightSQL] Change JDBC to handle multi-endpoints (#38521) GH-34610 - [Java] Fix valueCount and field name when loading/transferring NullVector (#38973) GH-34890 - [C++][Python] Add a no-op kernel for dictionary_encode(dictionary) (#38349) GH-35497 - [C++] Use the latest tagged version of flatbuffers (#38192) GH-36588 - [C#] Support blank column names and enable more integration tests. (#39167) GH-36594 - [C++] Don’t use MSVC_VERSION to determin -fms-compatibility-version (#36595) GH-36912 - [Java] JDBC driver stops consuming roots if it sees an empty root (#38590) GH-37055 - [C++] Optimize hash kernels for Dictionary ChunkedArrays (#38394) GH-37657 - [JS] Run bin scripts with ts-node (#38500) GH-37726 - [Swift][FlightSQL] Update behavior to be similar to existing impls (#37764) GH-37751 - [C++][Gandiva] Avoid registering exported functions multiple times in gandiva (#37752) GH-37796 - [C++][Acero] Fix race condition caused by straggling input in the as-of-join node (#37839) GH-37884 - [Swift] allow reading of unaligned FlatBuffers buffers (#38635) GH-37969 - [C++][Parquet] add more closed file checks for ParquetFileWriter (#38390) GH-38096 - [Java] FlightStream with metadata can cause error when closing (#38110) GH-38198 - [Go] Fix AuthenticateBasicToken to be reliable behind proxies (#38199) GH-38210 - [C++][FlightRPC] Add missing app_metadata arguments (#38231) GH-38216 - [R] open_dataset(format = “json”) not documented (#38258) GH-38242 - [Java] Fix incorrect internal struct accounting for DenseUnionVector#getBufferSizeFor (#38305) GH-38254 - [Java] Add reusable buffer getters to char/binary vectors (#38266) GH-38268 - [Java] Disable flaky TestFlightSqlStreams (#38319) GH-38281 - [Go] Ensure CData imported arrays are freed on release (#38314) GH-38297 - [C#] Fix build for .NET 4.7.2 (#38299) GH-38304 - [C++][Parquet] Fix Valgrind memory leak in arrow-dataset-file-parquet-encryption-test (#38306) GH-38307 - [CI] Remove gemfury_clean.rb (#38308) GH-38318 - [Java][FlightRPC] Enable tests that leaked (#38719) GH-38323 - [CI][Python] Use system gdb on test-conda-python (#38324) GH-38363 - [Release][CI] Omit tests for main/maintenance branches on RC branch (#38365) GH-38366 - [Java] Fix Murmur hash on buffers less than 4 bytes (#38368) GH-38378 - [C++][Parquet] Don’t initialize OpenSSL explicitly with OpenSSL 1.1 (#38379) GH-38382 - [R] Explicitly clean up arrow_duck_connection() on exit (#38495) GH-38387 - [Java] Fix JDK8 compilation issue with TestAllTypes (#38388) GH-38395 - [Go] fix rounding errors in decimal256 string functions (#38426) GH-38399 - [Go][Parquet] DeltaBitPack decoder reset usedFirst after SetData (#38413) GH-38401 - [C++] Re-generate flatbuffers C++ for Skyhook (#38405) GH-38436 - [R] Test segfault on reading CSVs with non-UTF-8 encoding GH-38439 - [Java][CI] Use Eclipse Temurin for all Java CI linux jobs (#38440) GH-38447 - [CI][Release] Don’t use “   {exit,continue}” (#38486) GH-38458 - [Go] Add ValueLen to BinaryLike interface (#39242) GH-38470 - [CI][Integration] Install jpype and build JNI c-data to run integration tests (#39502) GH-38477 - [Go] Fixing decimal 128 rounding issue (#38478) GH-38479 - [C++] Avoid passing null pointer to LZ4 frame decompressor (#39125) GH-38503 - [Go][Parquet] Make the arrow column writer internal (#38727) GH-38503 - [Go][Parquet] Style improvement for using ArrowColumnWriter (#38581) GH-38516 - [Go][Parquet] Increment the number of rows written when appending a new row group (#38517) GH-38535 - [Python] Fix S3FileSystem equals None segfault (#39276) GH-38554 - [Release][Website] post-03-website.sh doesn’t quote current.date (#38555) GH-38556 - [C++] Add missing explicit size_t cast for i386 (#38557) GH-38594 - [Docs][C++][Gandiva] Document how to register Gandiva external functions (#38763) GH-38599 - [Docs] Update Headers (#38696) GH-38614 - [Java] Add VarBinary and VarCharWriter helper methods to more writers (#38631) GH-38624 - [C++] Fix: add TestingEqualOptions for gtest functions. (#38642) GH-38630 - [MATLAB] arrow.array.BooleanArray’s toMATLAB method does not take slice offsets into account (#38636) GH-38653 - [Packaging][Java][Python][Ruby] Raise the minimum macOS version to 10.15 catalina to allow using new APIs in C++17 (#38677) GH-38683 - [Python][Docs] Update docstrings for Time32Type and Time64Type (#39059) GH-38684 - [Integration] Try to strengthen C Data Interface testing (#38846) GH-38697 - [C++][Gandiva] Use arrow io util to replace std::filesystem::path in gandiva (#38698) GH-38709 - [C++] Protect against PREALLOCATE preprocessor defined on macOS (#38760) GH-38711 - [CI] Rollback aws-cli for preview documentation (#38723) GH-38725 - [Java] decompression in Lz4CompressionCodec.java does not set writer index (#38840) GH-38728 - [Go] ipc: put lz4 decompression buffers back into sync.Pool (#38729) GH-38737 - [Java] Fix JDBC caching of SqlInfo values (#38739) GH-38738 - [C++] Check variadic buffer counts in bounds (#38740) GH-38745 - [Integration] Fix huge integration test (#38746) GH-38762 - [R] Versions of R and RTools in CI config are no longer current GH-38764 - [Java] Clarify warning about --add-opens=java.base/java.nio=ALL-UNNAMED (#38765) GH-38782 - [C++][FS][Azure] Do nothing for CreateDir(“/container”, true) (#38783) GH-38795 - [Go] Fix race GetToTimeFunc for Timestamp (#38797) GH-38811 - [R] Actually use fetched cmake on macos (#39453) GH-38816 - [C#] Fix IArrowRecord implementation on StructArray (#38827) GH-38823 - Fix TestArrowReaderAdHoc.ReadFloat16Files to use new uncompressed files (#38825) GH-38832 - [Java] Avoid building twice in ci/scripts/java_build.sh (#38829) GH-38844 - [C++] S3FileSystem export s3 sdk config “use_virtual_addressing” to arrow::fs::S3Options (#38858) GH-38851 - Website: Contributing link is not working GH-38879 - [C++][Gandiva] Fix Gandiva to_date function’s validation for supress errors parameter (#38987) GH-38883 - [Docs] Fix struct example to show hiding a child’s entry (#38898) GH-38906 - [R] Improve Windows CI configuration (#38927) GH-38921 - [CI] Fix spelling (#38922) GH-38925 - [CI] Fix spelling (#38926) GH-38928 - [R] Fix spelling (#38929) GH-38930 - [Java] Fix spelling (#38931) GH-38932 - [GO] Fix spelling (#38933) GH-38938 - [FlightRPC] Fix spelling (#38939) GH-38940 - [Ruby] Fix spelling (#38941) GH-38942 - [C#] Fix spelling (#38943) GH-38944 - [Python] Fix spelling (#38945) GH-38946 - [MATLAB] Fix spelling (#38947) GH-38948 - [Swift] Fix spelling (#38949) GH-38950 - [Docs] Fix spelling (#38951) GH-38952 - [Format] Fix spelling (#38953) GH-38954 - [Dev] Fix spelling (#38955) GH-38956 - [Gandiva] Fix spelling (#38957) GH-38960 - [C++] Fix spelling (acero) (#38961) GH-38964 - [C++] Fix spelling (compute) (#38965) GH-38966 - [C++] Fix spelling (util) (#38967) GH-38968 - [C++] Fix spelling (dataset) (#38969) GH-38971 - [C++] Fix spelling (filesystem) (#38972) GH-38975 - [Dev] Fix spelling (#38976) GH-38977 - [C++] Fix spelling (#38978) GH-38979 - [C++] Fix spelling (#38980) GH-38981 - [R][Release] Don’t update version.json on compatible version release (#38982) GH-39014 - [Java] Add default truststore along with KeychainStore when on Mac system (#39235) GH-39031 - [Docs] Remove misspelled rule from contrib css (#39032) GH-39045 - [C++][Acero] union node output batches should be unordered (#39046) GH-39113 - [Integration][Flight][Java] Fix occasional failure starting Java server (#39115) GH-39116 - [Go] Fix CI Staticcheck (#39117) GH-39126 - [C++][CI] Fix Valgrind failures (#39127) GH-39130 - [CI][GLib][Windows] Use old Ruby as workaround for load error (#39168) GH-39136 - [C++] Remove needless system Protobuf dependency with -DARROW_HDFS=ON (#39137) GH-39138 - [R] Fix implicit conversion warnings (#39250) GH-39156 - [C++][Compute] Fix negative duration division (#39158) GH-39163 - [C++] Add missing data copy in StreamDecoder::Consume(data) (#39164) GH-39185 - [C++] Remove compiler warnings with -Wconversion -Wno-sign-conversion in public headers (#39186) GH-39191 - [R] throw error when string_replace is passed vector of values in pattern (#39219) GH-39238 - [Go] PATCH Prevents empty record to be appended to empty resultset (#39239) GH-39288 - [Java][FlightSQL] Update Apache Avatica to version 1.24.0 (#39325) GH-39306 - [C++][Benchmarking] Remove hardcoded min times (#39307) GH-39327 - [Java] define assemble descriptor for new custom maven plugin project (#39331) GH-39333 - [C++] Don’t use “if constexpr” in lambda (#39334) GH-39359 - [CI][C++] Remove MinGW MINGW32 C++ job (#39376) GH-39384 - [C++] Disable -Werror=attributes for Azure SDK’s identity.hpp (#39448) GH-39387 - [C++] Fix compile warning (#39389) GH-39421 - [CI][Ruby] Update to using Ubuntu 22.04 on test-ruby and test-c-glib nightly jobs (#39422) GH-39423 - [CI][JS] TypeScript: Compilation failed on yarn build for several CI jobs GH-39425 - [CI] Fix import to match new substrait repo structure (#39426) GH-39433 - [Ruby] Add support for Table.load(format: json) options (#39464) GH-39437 - [CI][Python] Update pandas tests failing on pandas nightly CI build (#39498) GH-39468 - [Java] Fix site build for docs (#39471) GH-39469 - [CI][JS] Force node 20 on JS build on arm64 to fix build issues (#39499) GH-39488 - [Ruby] Add support for ChunkedArray in Ractor (#39490) GH-39517 - [C++] Disable parallelism for jemalloc external project (#39522) GH-39562 - [C++][Parquet] Fix crash in test_parquet_dataset_lazy_filtering (#39632) GH-39564 - [CI][Java] Set correct version on Java BOM (#39580) GH-39584 - [R] fallback to source gracefully (#39587) GH-39588 - [CI][Go] Add CGO_ENABLED=1 to cdata_integration build to fix macOS build with conda (#39589) GH-39598 - [C#] Fix verification script (#39605) GH-39604 - [JS] Do not use resizable buffers yet (#39607) GH-39628 - [C++] Disable parallelism for all `make`-based externalProjects when CMake &gt;= 3.28 is used New Features and Improvements GH-14936 - [Java] Remove netty dependency from arrow-vector (#38493) GH-28994 - [C++][JSON] Change the max rows to Unlimited(int_32) (#38582) GH-30117 - [C++][Python] Add “Z” to the end of timestamp print string when tz defined (#39272) GH-30717 - [C#] Add ToString() methods to Arrow classes (#36566) GH-31303 - [Python] Remove the legacy ParquetDataset custom python-based implementation (#39112) GH-31579 - [C#] : Remove out-of-support versions of .NET and update C# README (#39165) GH-33500 - [Python] add Table.to/from_struct_array (#38520) GH-33984 - [C++][Python] DLPack implementation for Arrow Arrays (producer) (#38472) GH-34316 - [Python] FixedSizeListArray.from_arrays supports mask parameter (#39396) GH-34569 - [C++] Diffing of Run-End Encoded arrays (#35003) GH-34636 - [C#] Reduce allocations when using ArrayPool (#39166) GH-35260 - [C++][Python][R] Allow users to adjust S3 log level by environment variable (#38267) GH-35331 - [Python] Expose Parquet sorting metadata (#37665) GH-35344 - [C++][Format] Implementation of the LIST_VIEW and LARGE_LIST_VIEW array formats (#35345) GH-35560 - [C++] Use Cast() instead of CastTo() for Scalar in test (#39044) GH-36036 - [C++][Python][Parquet] Implement Float16 logical type (#36073) GH-36044 - [Python][Docs] Added ParquetFileFragment to the API reference docs (#38277) GH-36099 - [C++] Add Utf8View and BinaryView to the c ABI (#38443) GH-36441 - [Python] Make CacheOptions configurable from Python (#36627) GH-36760 - [Go] Add Avro OCF reader (#37115) GH-36815 - [C#] : Enable net472 tests under Windows (#36818) GH-36898 - [CI] Hashpin Sensitive GitHub Actions (#37676) GH-37002 - [C++][Parquet] Add api to get RecordReader from RowGroupReader (#37003) GH-37061 - [Docs][Format] Clarify semantics of GetSchema in FSQL (#38549) GH-37199 - [C++] Expose a span converter for Buffer and ArraySpan (#38027) GH-37242 - [Python][Parquet] Parquet Support write and validate Page CRC (#38360) GH-37312 - [Python][Docs] Update Python docstrings to reflect new parquet encoding option (#38070) GH-37359 - [C#] Add ToList() to Decimal128Array and Decimal256Array (#37383) GH-37378 - [C++] Add A Dictionary Compaction Function For DictionaryArray (#37418) GH-37429 - [C++] Add arrow::ipc::StreamDecoder::Reset() (#37970) GH-37511 - [C++] Implement file reads for Azure filesystem (#38269) GH-37582 - [Go][Parquet] Implement Float16 logical type (#37599) GH-37592 - [MATLAB] Add NumRows property to arrow.tabular.RecordBatch (#38215) GH-37710 - [C++][Integration] Add C++ Utf8View implementation (#37792) GH-37753 - [C++][Gandiva] Add external function registry support (#38116) GH-37812 - [MATLAB] Add arrow.type.ListType MATLAB class (#38189) GH-37815 - [MATLAB] Add arrow.array.ListArray MATLAB class (#38357) GH-37848 - [C++][Gandiva] Migrate LLVM JIT engine from MCJIT to ORC v2/LLJIT (#39098) GH-37857 - [Python][Dataset] Expose file size to python dataset (#37868) GH-37889 - [Java][Doc] Improve JDBC driver documentation (#38469) GH-37895 - [C++] Feature: support concatenate recordbatches. (#37896) GH-37910 - [Java][Integration] Implement C Data Interface integration testing (#38248) GH-37943 - [Java] Add parquet file with all supported types (#38249) GH-37979 - [C++] Add support for specifying custom Array opening and closing delimiters to arrow::PrettyPrintDelimiters (#38187) GH-38022 - [Java][FlightRPC] Expose app_metadata on FlightInfo and FlightEndpoint (#38331) GH-38024 - [Java][FlightRPC] Expose appMetadata through JDBC ResultSet (#38781) GH-38033 - [R] Allow code() to return package name prefix. (#38144) GH-38042 - [C++][Benchmark] Add non-stream Codec Compression/Decompression (#38067) GH-38117 - [C++][Parquet] Change DictEncoder dtor checking to warning log (#38118) GH-38131 - [Swift][CI] Add linting and fix linting errors (#38133) GH-38153 - [C#] expose ArrayDataConcatenator.Concatenate (#38154) GH-38164 - [MATLAB] Rename Length property on arrow.array.Array and arrow.array.ChunkedArray to NumElements (#38190) GH-38166 - [MATLAB] Improve tabular object display (#38482) GH-38246 - [JAVA] added new getTransferPair() function that takes in a Field type for Complex Type Vectors (#38261) GH-38264 - [Java][Packaging] Add BOM file (#38336) GH-38271 - [C++][Parquet] Support reading parquet files with multiple gzip members (#38272) GH-38300 - [Dev][Docs] Update dev/README.md for the current merge_arrow_pr.py (#38301) GH-38310 - [MATLAB] Create the testing guideline document for testing MATLAB interface (#38459) GH-38316 - [C#] Implement interval types (#39043) GH-38326 - [C++][Parquet] check the decompressed page size same as size in page header (#38327) GH-38330 - [C++][Azure] Use properties for input stream metadata (#38524) GH-38333 - [C++][FS][Azure] Implement file writes (#38780) GH-38335 - [C++] Implement GetFileInfo for a single file in Azure filesystem (#38505) GH-38339 - [C++][CMake] Use transitive dependency for system GoogleTest (#38340) GH-38341 - [Python] Remove usage of pandas internals DatetimeTZBlock (#38321) GH-38346 - [C++][Parquet] Use new encrypted files for page index encryption test (#38347) GH-38348 - [C#] Make PrimitiveArray support IReadOnlyList&lt;T?&gt; (#38680) GH-38351 - [C#] Add SqlDecimal support to Decimal128Array (#38481) GH-38354 - [MATLAB] Implement fromMATLAB method for arrow.array.ListArray (#38561) GH-38361 - Add validation logic for offsets and values to arrow.array.ListArray.fromArrays (#38531) GH-38376 - [R] : Add dimnames method to Dataset class (#38377) GH-38381 - [C++][Acero] Create a sorted merge node (#38380) GH-38398 - [MATLAB] Improve array display (#38400) GH-38402 - [CI][Integration] Provide wrapper scripts for integration testing (#38403) GH-38415 - [MATLAB] Add indexing “slice” method to C++ Array Proxy class (#38674) GH-38417 - [MATLAB] Implement a TableTypeValidator class that validates a MATLAB cell array contains only tables that share the same schema (#38551) GH-38418 - [MATLAB] Add method for extracting one row of an arrow.tabular.Table as a string (#38463) GH-38419 - [MATLAB] Implement a ClassTypeValidator class that validates a MATLAB cell array contains only values of the same class type. (#38530) GH-38420 - [MATLAB] Implement a DatetimeValidator class that validates a MATLAB cell array contains only values of zoned or unzoned datetimes (#38533) GH-38424 - [CI][C++] Use Fedora 38 instead of 35 (#38425) GH-38452 - [C++][Benchmark] Adding benchmark for LZ4/Snappy Compression (#38453) GH-38457 - [C++] Support LogicalNullCount for DictionaryArray (#38681) GH-38460 - [Java][FlightRPC] Add mTLS support for Flight SQL JDBC driver (#38461) GH-38462 - [Go][Parquet] Handle Boolean RLE encoding/decoding (#38367) GH-38483 - [C#] Add support for more decimal conversions (#38508) GH-38506 - [Go][Parquet] Add NumRows and RowGroupNumRows to pqarrow.FileWriter (#38507) GH-38511 - [Java] Add getTransferPair(Field, BufferAllocator, CallBack) for StructVector and MapVector (#38512) GH-38528 - [Python][Compute] Describe strptime format semantics (#38665) GH-38537 - [Java] upgrade to netty 4.1.100.Final (#38538) GH-38541 - [MATLAB] Add remaining tests for arrow tabular object display (#38564) GH-38542 - [C++][Parquet] Faster scalar BYTE_STREAM_SPLIT (#38529) GH-38562 - [Packaging] Add support for Ubuntu 23.10 (#38563) GH-38576 - [Java] Change JDBC driver to optionally preserve cookies and auth tokens when getting streams (#38580) GH-38578 - [Java][FlightSQL] Remove joda usage from flight-sql library (#38579) GH-38589 - [C++][Gandiva] Support registering external C functions (#38632) GH-38597 - [C++] Implement GetFileInfo(selector) for Azure filesystem (#39009) GH-38602 - [R] Add missing prod for summarize (#38601) GH-38627 - [Java][FlightRPC] Handle null parameter values (#38628) GH-38648 - [Java] Regenerate Flatbuffers (#38650) GH-38652 - [MATLAB] Add tests about time precision preservation when converting MATLAB duration to arrow.array.Time32Array and arrow.array.Time64Array (#38661) GH-38662 - [Java] Add comparators (#38669) GH-38699 - [C++][FS][Azure] Implement CreateDir() (#38708) GH-38700 - [C++][FS][Azure] Implement DeleteDir() (#38793) GH-38701 - [C++][FS][Azure] Implement DeleteDirContents() (#38888) GH-38702 - [C++] : Implement AzureFileSystem::DeleteRootDirContents (#39151) GH-38705 - [C++][FS][Azure] Implement CopyFile() (#39058) GH-38712 - [Python] Remove dead code in _reconstruct_block (#38714) GH-38718 - [Go][Format][Integration] Add StringView/BinaryView to Go implementation (#35769) GH-38732 - [Java][FlightRPC] Add support for Array parameter binding in JDBC (#38733) GH-38751 - [C++][Go][Parquet] Add tests for reading Float16 files in parquet-testing (#38753) GH-38757 - [C#] Implement common interfaces for structure arrays and record batches (#38759) GH-38758 - [C++][FS][Azure] Rename AzurePath to AzureLocation (#38773) GH-38772 - [C++] Implement directory semantics even when the storage account doesn’t support HNS (#39361) GH-38798 - [Integration] Enable C Data Interface integration testing on Rust (#38799) GH-38814 - [C++][Parquet] Update parquet.thrift to sync with 2.10.0 (#38815) GH-38824 - [Go] Enable GC checks (#38826) GH-38836 - [Go] Add Size() for ArrayData (#38839) GH-38852 - [C++] Replace “#ifdef ARROW_WITH_GZIP” in dataset test to ARROW_WITH_ZLIB (#38853) GH-38857 - [Python] Fix append mode for cython 2 (#39027) GH-38857 - [Python] Add append mode for pyarrow.OsFile (#38820) GH-38860 - [C++][Parquet] Using length to optimize bloom filter read (#38863) GH-38870 - [Documentation] Add List View and Large List View to status.rst (#38871) GH-38874 - [C++][Parquet] Minor: making parquet TypedComparator operation as const method (#38875) GH-38884 - [C++] DatasetWriter release rows_in_flight_throttle when allocate writing failed (#38885) GH-38887 - [C++][Parquet] Move EstimatedBufferedValueBytes from TypedColumnWriter to ColumnWriter (#39055) GH-38907 - [C++] Stop installing internal bpacking_simd* headers (#38908) GH-38909 - [Packaging] Drop support for Ubuntu 23.04 (#38910) GH-38918 - [Go] Avoid schema.Fields allocations in some places (#38919) GH-38920 - [C++][Gandiva] Refactor function holder to return arrow Result (#38873) GH-38958 - [C++][Parquet] Fix spelling (#38959) GH-38990 - [Java] Upgrade to flatc version 23.5.26 (#38991) GH-38996 - [Java] Update dependencies and plugins for JPMS modules (#38994) GH-39006 - [Python] Extract libparquet requirements out of libarrow_python.so to new libarrow_python_parquet_encryption.so (#39316) GH-39013 - [Go][Integration] Support cABI import/export of StringView (#39019) GH-39020 - [CI][Release][JS] Use Node.js 18 instead of 16 (#39021) GH-39028 - [Python][CI] Fix dask integration build by temporarily skipping test_categorize_info (#39029) GH-39037 - [Java] Remove (Contrib/Experimental) mention in Flight SQL (#39040) GH-39049 - [C++] Use Cast() instead of CastTo() for Dictionary Scalar in test (#39362) GH-39050 - [C++] Use Cast() instead of CastTo() for Timestamp Scalar in test (#39060) GH-39051 - [C++] Use Cast() instead of CastTo() for List Scalar in test (#39353) GH-39064 - [C++][Parquet] Support row group filtering for nested paths for struct fields (#39065) GH-39088 - [Dev][Java] Add Dependabot configuration for Java (#39089) GH-39096 - [Python] Release GIL in .nbytes (#39097) GH-39119 - [C++] Refactor the Azure FS tests and filesystem class instantiation (#39207) GH-39122 - [C++][Parquet] Optimize FLBA record reader (#39124) GH-39134 - Create module info compiler plugin (#39135) GH-39159 - [C++] : Try to make Buffer::device_type_ non-optional (#39150) GH-39170 - [Java] Improve error message explaining why TestTls might fail (#39171) GH-39189 - [Java] Bump com.h2database:h2 from 1.4.196 to 2.2.224 in /java (#39188) GH-39196 - [Python][Docs] Document the Arrow PyCapsule protocol in the ‘extending pyarrow’ section of the Python docs (#39199) GH-39208 - [C++][Parquet] Remove deprecated AppendRowGroup(int64_t num_rows) (#39209) GH-39210 - [C++][Parquet] Avoid WriteRecordBatch from produce zero-sized RowGroup (#39211) GH-39217 - [Python] RecordBatchReader.from_stream constructor for objects implementing the Arrow PyCapsule protocol (#39218) GH-39223 - [C#] Support IReadOnlyList&lt;T?&gt; on remaining scalar types (#39224) GH-39225 - [GLib] Use Cast() instaed of CastTo (#39228) GH-39232 - [C++] Support binary to fixed_size_binary cast (#39236) GH-39243 - [R][CI] Remove r-arrow conda nightlies (#39244) GH-39246 - [CI][GLib][Ruby] Use Ubuntu 22.04 not 20.04 (#39247) GH-39262 - [C++][Azure][FS] Add default credential auth configuration (#39263) GH-39265 - [Java] Make it run well with the netty newest version 4.1.104 (#39266) GH-39268 - [C++] Don’t install bundled Azure SDK for C++ with CMake 3.28+ (#39269) GH-39292 - [C++][FS] : Remove the AzureBackend enum and add more flexible connection options (#39293) GH-39297 - [C++][FS] : Inform caller of container not-existing when checking for HNS support (#39298) GH-39299 - [Java] Upgrade to Avro 1.11.3 (#39300) GH-39303 - [Archery][Benchmarking] Allow setting C++ repetition min time (#39324) GH-39318 - [C++][FS][Azure] Add workload identity auth configuration (#39319) GH-39320 - [C++][FS][Azure] Add managed identity auth configuration (#39321) GH-39322 - [C++] Forward arguments to ExceptionToStatus all the way to Status::FromArgs (#39323) GH-39326 - [C++] Flaky DatasetWriterTestFixture.MaxRowsOneWriteBackpresure test (#39379) GH-39328 - [Java] Make default getConsumer public (#39329) GH-39335 - [C#] Support creating FlightClient with Grpc.Core.Channel (#39348) GH-39339 - [C++] Add ForceCachedHierarchicalNamespaceSupport to help with testing (#39340) GH-39341 - [C#] Support Utf8View, BinaryView and ListView (#39342) GH-39343 - [C++][FS][Azure] Add client secret auth configuration (#39346) GH-39355 - [Java] Improve JdbcConsumer exceptions (#39356) GH-39357 - [C++] Reduce function.h includes (#39312) GH-39363 - [C++] Use Cast() instead of CastTo() for Parquet (#39364) GH-39413 - [C++][Parquet] Vectorize decode plain on FLBA (#39414) GH-39419 - [C++][Parquet] Style: Using arrow::Buffer data_as api rather than reinterpret_cast (#39420) GH-39430 - [C++][ORC] Upgrade ORC to 1.9.2 (#39431) GH-39449 - [C++] Use default Azure credentials implicitly and support anonymous credentials explicitly (#39450) GH-39484 - [Java] Support 256 bit decimals in JdbcToArrowUtils (#39485) GH-39500 - [Docs] Pin pydata-sphinx-theme to 0.14 (#39501) GH-39515 - [Python] Pass in type to MapType.from_arrays (#39516) GH-39531 - [Python][CI] Skip failing dask tests: test_describe_empty and test_view (#39534) GH-39533 - [Python] NumPy 2.0 compat: remove usage of np.core (#39535) GH-39537 - [Packaging][Python] Add a numpy&lt;2 pin to the install requirements for the 15.x release branch (#39538) GH-39601 - [R] Don’t download cmake when TEST_OFFLINE_BUILD=true (#39602) GH-39624 - [R][CI] Add CMake to docker file and update envvars (#39625) GH-39626 - [Docs][R] Update NEWS.md for 15.0.0 PARQUET-2411 - [C++][Parquet] Allow reading dictionary without reading data via ByteArrayDictionaryRecordReader (#39153)" />
<meta property="og:description" content="Apache Arrow 15.0.0 (21 January 2024) This is a major release covering more than 1 months of development. Download Source Artifacts Binary Artifacts For AlmaLinux For Amazon Linux For CentOS For C# For Debian For Python For Ubuntu Git tag Contributors This release includes 536 commits from 101 distinct contributors. $ git shortlog -sn apache-arrow-14.0.2..apache-arrow-15.0.0 67 dependabot[bot] 42 Sutou Kouhei 31 Raúl Cumplido 27 Josh Soref 20 Antoine Pitrou 19 Felipe Oliveira Carvalho 19 mwish 18 Dewey Dunnington 17 James Duong 16 Curt Hagenlocher 16 Dominik Moritz 16 Joris Van den Bossche 15 Jacob Wujciak-Jens 14 Hyunseok Seo 12 Alenka Frim 10 Yue 10 sgilmore10 9 Matt Topol 8 Dane Pitkin 8 Diego Fernández Giraldo 8 Thomas Newton 6 Bryce Mecum 5 Jin Shang 5 Jonathan Keane 5 Kevin Gurney 4 Benjamin Kietzman 4 Gang Wu 3 Ben Harkins 3 Ivan Chesnov 3 Judah Rand 3 Lei Hou 3 Rossi(Ruoxi) Sun 3 Tim Schaub 3 abandy 2 Alfonso Subiotto Marqués 2 David Li 2 Divyansh200102 2 Eero Lihavainen 2 Francis 2 Gavin Murrison 2 JB Onofré 2 Jeremy Aguilon 2 John 2 Junming Chen 2 Laurent Goujon 2 Nic Crane 2 Peter Andreas Entschev 2 Rok Mihevc 2 Stas Stepanov 2 Tsutomu Katsube 2 Vibhatha Lakmal Abeykoon 1 Abram Fleishman 1 Anja Kefala 1 Atheel Massalha 1 Benjamin Schmidt 1 Bob Plotts 1 Bryan Cutler 1 Carl Jackson 1 ChinYikMing 1 Chris Larsen 1 Christian Holm Christensen 1 Dan Homola 1 Dan Stone 1 Danyaal Khan 1 Diogo Teles Sant&#39;Anna 1 Dongjoon Hyun 1 Fatemah Panahi 1 Fernando Mayer 1 Fokko Driesprong 1 Hattonuri 1 Hendrik Makait 1 Jiaxing Liang 1 Jinpeng 1 John Garland 1 Kyle Barron 1 LucasG0 1 Maximilian Muecke 1 Miguel Pragier 1 Miles 1 Neal Richardson 1 Nick Hughes 1 Paul Spangler 1 Phillip LeBlanc 1 Pierre Moulon 1 Quang Hoang 1 Ravjot Brar 1 Tammy DiPrima 1 Will Jones 1 William Ayd 1 Yifeng-Sigma 1 david dali susanibar arce 1 davidhcoe 1 frazar 1 jiawei liang 1 loicalleyne 1 orgadish 1 panbingkun 1 prmoore77 1 shibei 1 zhipeng 1 谢天 Patch Committers The following Apache committers merged contributed patches to the repository. $ git shortlog -sn --group=trailer:signed-off-by apache-arrow-14.0.2..apache-arrow-15.0.0 154 Sutou Kouhei 56 David Li 50 Antoine Pitrou 35 Joris Van den Bossche 30 Jacob Wujciak-Jens 24 Raúl Cumplido 22 Matt Topol 19 Curt Hagenlocher 18 Kevin Gurney 16 Benjamin Kietzman 16 Felipe Oliveira Carvalho 14 mwish 13 Dewey Dunnington 12 AlenkaF 7 Nic Crane 3 Weston Pace 2 Gang Wu 2 Jonathan Keane 1 Josh Soref 1 Rok Mihevc 1 Will Jones 1 dependabot[bot] Changelog Apache Arrow 15.0.0 (2024-01-21 08:00:00) Bug Fixes GH-15192 - [C++] Bring back case_when tests for union types (#39308) GH-32570 - [C++] Fix the issue of ExecBatchBuilder when appending consecutive tail rows with the same id may exceed buffer boundary (#39234) GH-32662 - [C#] Make dictionaries in file and memory implementations work correctly and support integration tests (#39146) GH-33475 - [Java] Add parameter binding for Prepared Statements in JDBC driver (#38404) GH-34532 - [Java][FlightSQL] Change JDBC to handle multi-endpoints (#38521) GH-34610 - [Java] Fix valueCount and field name when loading/transferring NullVector (#38973) GH-34890 - [C++][Python] Add a no-op kernel for dictionary_encode(dictionary) (#38349) GH-35497 - [C++] Use the latest tagged version of flatbuffers (#38192) GH-36588 - [C#] Support blank column names and enable more integration tests. (#39167) GH-36594 - [C++] Don’t use MSVC_VERSION to determin -fms-compatibility-version (#36595) GH-36912 - [Java] JDBC driver stops consuming roots if it sees an empty root (#38590) GH-37055 - [C++] Optimize hash kernels for Dictionary ChunkedArrays (#38394) GH-37657 - [JS] Run bin scripts with ts-node (#38500) GH-37726 - [Swift][FlightSQL] Update behavior to be similar to existing impls (#37764) GH-37751 - [C++][Gandiva] Avoid registering exported functions multiple times in gandiva (#37752) GH-37796 - [C++][Acero] Fix race condition caused by straggling input in the as-of-join node (#37839) GH-37884 - [Swift] allow reading of unaligned FlatBuffers buffers (#38635) GH-37969 - [C++][Parquet] add more closed file checks for ParquetFileWriter (#38390) GH-38096 - [Java] FlightStream with metadata can cause error when closing (#38110) GH-38198 - [Go] Fix AuthenticateBasicToken to be reliable behind proxies (#38199) GH-38210 - [C++][FlightRPC] Add missing app_metadata arguments (#38231) GH-38216 - [R] open_dataset(format = “json”) not documented (#38258) GH-38242 - [Java] Fix incorrect internal struct accounting for DenseUnionVector#getBufferSizeFor (#38305) GH-38254 - [Java] Add reusable buffer getters to char/binary vectors (#38266) GH-38268 - [Java] Disable flaky TestFlightSqlStreams (#38319) GH-38281 - [Go] Ensure CData imported arrays are freed on release (#38314) GH-38297 - [C#] Fix build for .NET 4.7.2 (#38299) GH-38304 - [C++][Parquet] Fix Valgrind memory leak in arrow-dataset-file-parquet-encryption-test (#38306) GH-38307 - [CI] Remove gemfury_clean.rb (#38308) GH-38318 - [Java][FlightRPC] Enable tests that leaked (#38719) GH-38323 - [CI][Python] Use system gdb on test-conda-python (#38324) GH-38363 - [Release][CI] Omit tests for main/maintenance branches on RC branch (#38365) GH-38366 - [Java] Fix Murmur hash on buffers less than 4 bytes (#38368) GH-38378 - [C++][Parquet] Don’t initialize OpenSSL explicitly with OpenSSL 1.1 (#38379) GH-38382 - [R] Explicitly clean up arrow_duck_connection() on exit (#38495) GH-38387 - [Java] Fix JDK8 compilation issue with TestAllTypes (#38388) GH-38395 - [Go] fix rounding errors in decimal256 string functions (#38426) GH-38399 - [Go][Parquet] DeltaBitPack decoder reset usedFirst after SetData (#38413) GH-38401 - [C++] Re-generate flatbuffers C++ for Skyhook (#38405) GH-38436 - [R] Test segfault on reading CSVs with non-UTF-8 encoding GH-38439 - [Java][CI] Use Eclipse Temurin for all Java CI linux jobs (#38440) GH-38447 - [CI][Release] Don’t use “   {exit,continue}” (#38486) GH-38458 - [Go] Add ValueLen to BinaryLike interface (#39242) GH-38470 - [CI][Integration] Install jpype and build JNI c-data to run integration tests (#39502) GH-38477 - [Go] Fixing decimal 128 rounding issue (#38478) GH-38479 - [C++] Avoid passing null pointer to LZ4 frame decompressor (#39125) GH-38503 - [Go][Parquet] Make the arrow column writer internal (#38727) GH-38503 - [Go][Parquet] Style improvement for using ArrowColumnWriter (#38581) GH-38516 - [Go][Parquet] Increment the number of rows written when appending a new row group (#38517) GH-38535 - [Python] Fix S3FileSystem equals None segfault (#39276) GH-38554 - [Release][Website] post-03-website.sh doesn’t quote current.date (#38555) GH-38556 - [C++] Add missing explicit size_t cast for i386 (#38557) GH-38594 - [Docs][C++][Gandiva] Document how to register Gandiva external functions (#38763) GH-38599 - [Docs] Update Headers (#38696) GH-38614 - [Java] Add VarBinary and VarCharWriter helper methods to more writers (#38631) GH-38624 - [C++] Fix: add TestingEqualOptions for gtest functions. (#38642) GH-38630 - [MATLAB] arrow.array.BooleanArray’s toMATLAB method does not take slice offsets into account (#38636) GH-38653 - [Packaging][Java][Python][Ruby] Raise the minimum macOS version to 10.15 catalina to allow using new APIs in C++17 (#38677) GH-38683 - [Python][Docs] Update docstrings for Time32Type and Time64Type (#39059) GH-38684 - [Integration] Try to strengthen C Data Interface testing (#38846) GH-38697 - [C++][Gandiva] Use arrow io util to replace std::filesystem::path in gandiva (#38698) GH-38709 - [C++] Protect against PREALLOCATE preprocessor defined on macOS (#38760) GH-38711 - [CI] Rollback aws-cli for preview documentation (#38723) GH-38725 - [Java] decompression in Lz4CompressionCodec.java does not set writer index (#38840) GH-38728 - [Go] ipc: put lz4 decompression buffers back into sync.Pool (#38729) GH-38737 - [Java] Fix JDBC caching of SqlInfo values (#38739) GH-38738 - [C++] Check variadic buffer counts in bounds (#38740) GH-38745 - [Integration] Fix huge integration test (#38746) GH-38762 - [R] Versions of R and RTools in CI config are no longer current GH-38764 - [Java] Clarify warning about --add-opens=java.base/java.nio=ALL-UNNAMED (#38765) GH-38782 - [C++][FS][Azure] Do nothing for CreateDir(“/container”, true) (#38783) GH-38795 - [Go] Fix race GetToTimeFunc for Timestamp (#38797) GH-38811 - [R] Actually use fetched cmake on macos (#39453) GH-38816 - [C#] Fix IArrowRecord implementation on StructArray (#38827) GH-38823 - Fix TestArrowReaderAdHoc.ReadFloat16Files to use new uncompressed files (#38825) GH-38832 - [Java] Avoid building twice in ci/scripts/java_build.sh (#38829) GH-38844 - [C++] S3FileSystem export s3 sdk config “use_virtual_addressing” to arrow::fs::S3Options (#38858) GH-38851 - Website: Contributing link is not working GH-38879 - [C++][Gandiva] Fix Gandiva to_date function’s validation for supress errors parameter (#38987) GH-38883 - [Docs] Fix struct example to show hiding a child’s entry (#38898) GH-38906 - [R] Improve Windows CI configuration (#38927) GH-38921 - [CI] Fix spelling (#38922) GH-38925 - [CI] Fix spelling (#38926) GH-38928 - [R] Fix spelling (#38929) GH-38930 - [Java] Fix spelling (#38931) GH-38932 - [GO] Fix spelling (#38933) GH-38938 - [FlightRPC] Fix spelling (#38939) GH-38940 - [Ruby] Fix spelling (#38941) GH-38942 - [C#] Fix spelling (#38943) GH-38944 - [Python] Fix spelling (#38945) GH-38946 - [MATLAB] Fix spelling (#38947) GH-38948 - [Swift] Fix spelling (#38949) GH-38950 - [Docs] Fix spelling (#38951) GH-38952 - [Format] Fix spelling (#38953) GH-38954 - [Dev] Fix spelling (#38955) GH-38956 - [Gandiva] Fix spelling (#38957) GH-38960 - [C++] Fix spelling (acero) (#38961) GH-38964 - [C++] Fix spelling (compute) (#38965) GH-38966 - [C++] Fix spelling (util) (#38967) GH-38968 - [C++] Fix spelling (dataset) (#38969) GH-38971 - [C++] Fix spelling (filesystem) (#38972) GH-38975 - [Dev] Fix spelling (#38976) GH-38977 - [C++] Fix spelling (#38978) GH-38979 - [C++] Fix spelling (#38980) GH-38981 - [R][Release] Don’t update version.json on compatible version release (#38982) GH-39014 - [Java] Add default truststore along with KeychainStore when on Mac system (#39235) GH-39031 - [Docs] Remove misspelled rule from contrib css (#39032) GH-39045 - [C++][Acero] union node output batches should be unordered (#39046) GH-39113 - [Integration][Flight][Java] Fix occasional failure starting Java server (#39115) GH-39116 - [Go] Fix CI Staticcheck (#39117) GH-39126 - [C++][CI] Fix Valgrind failures (#39127) GH-39130 - [CI][GLib][Windows] Use old Ruby as workaround for load error (#39168) GH-39136 - [C++] Remove needless system Protobuf dependency with -DARROW_HDFS=ON (#39137) GH-39138 - [R] Fix implicit conversion warnings (#39250) GH-39156 - [C++][Compute] Fix negative duration division (#39158) GH-39163 - [C++] Add missing data copy in StreamDecoder::Consume(data) (#39164) GH-39185 - [C++] Remove compiler warnings with -Wconversion -Wno-sign-conversion in public headers (#39186) GH-39191 - [R] throw error when string_replace is passed vector of values in pattern (#39219) GH-39238 - [Go] PATCH Prevents empty record to be appended to empty resultset (#39239) GH-39288 - [Java][FlightSQL] Update Apache Avatica to version 1.24.0 (#39325) GH-39306 - [C++][Benchmarking] Remove hardcoded min times (#39307) GH-39327 - [Java] define assemble descriptor for new custom maven plugin project (#39331) GH-39333 - [C++] Don’t use “if constexpr” in lambda (#39334) GH-39359 - [CI][C++] Remove MinGW MINGW32 C++ job (#39376) GH-39384 - [C++] Disable -Werror=attributes for Azure SDK’s identity.hpp (#39448) GH-39387 - [C++] Fix compile warning (#39389) GH-39421 - [CI][Ruby] Update to using Ubuntu 22.04 on test-ruby and test-c-glib nightly jobs (#39422) GH-39423 - [CI][JS] TypeScript: Compilation failed on yarn build for several CI jobs GH-39425 - [CI] Fix import to match new substrait repo structure (#39426) GH-39433 - [Ruby] Add support for Table.load(format: json) options (#39464) GH-39437 - [CI][Python] Update pandas tests failing on pandas nightly CI build (#39498) GH-39468 - [Java] Fix site build for docs (#39471) GH-39469 - [CI][JS] Force node 20 on JS build on arm64 to fix build issues (#39499) GH-39488 - [Ruby] Add support for ChunkedArray in Ractor (#39490) GH-39517 - [C++] Disable parallelism for jemalloc external project (#39522) GH-39562 - [C++][Parquet] Fix crash in test_parquet_dataset_lazy_filtering (#39632) GH-39564 - [CI][Java] Set correct version on Java BOM (#39580) GH-39584 - [R] fallback to source gracefully (#39587) GH-39588 - [CI][Go] Add CGO_ENABLED=1 to cdata_integration build to fix macOS build with conda (#39589) GH-39598 - [C#] Fix verification script (#39605) GH-39604 - [JS] Do not use resizable buffers yet (#39607) GH-39628 - [C++] Disable parallelism for all `make`-based externalProjects when CMake &gt;= 3.28 is used New Features and Improvements GH-14936 - [Java] Remove netty dependency from arrow-vector (#38493) GH-28994 - [C++][JSON] Change the max rows to Unlimited(int_32) (#38582) GH-30117 - [C++][Python] Add “Z” to the end of timestamp print string when tz defined (#39272) GH-30717 - [C#] Add ToString() methods to Arrow classes (#36566) GH-31303 - [Python] Remove the legacy ParquetDataset custom python-based implementation (#39112) GH-31579 - [C#] : Remove out-of-support versions of .NET and update C# README (#39165) GH-33500 - [Python] add Table.to/from_struct_array (#38520) GH-33984 - [C++][Python] DLPack implementation for Arrow Arrays (producer) (#38472) GH-34316 - [Python] FixedSizeListArray.from_arrays supports mask parameter (#39396) GH-34569 - [C++] Diffing of Run-End Encoded arrays (#35003) GH-34636 - [C#] Reduce allocations when using ArrayPool (#39166) GH-35260 - [C++][Python][R] Allow users to adjust S3 log level by environment variable (#38267) GH-35331 - [Python] Expose Parquet sorting metadata (#37665) GH-35344 - [C++][Format] Implementation of the LIST_VIEW and LARGE_LIST_VIEW array formats (#35345) GH-35560 - [C++] Use Cast() instead of CastTo() for Scalar in test (#39044) GH-36036 - [C++][Python][Parquet] Implement Float16 logical type (#36073) GH-36044 - [Python][Docs] Added ParquetFileFragment to the API reference docs (#38277) GH-36099 - [C++] Add Utf8View and BinaryView to the c ABI (#38443) GH-36441 - [Python] Make CacheOptions configurable from Python (#36627) GH-36760 - [Go] Add Avro OCF reader (#37115) GH-36815 - [C#] : Enable net472 tests under Windows (#36818) GH-36898 - [CI] Hashpin Sensitive GitHub Actions (#37676) GH-37002 - [C++][Parquet] Add api to get RecordReader from RowGroupReader (#37003) GH-37061 - [Docs][Format] Clarify semantics of GetSchema in FSQL (#38549) GH-37199 - [C++] Expose a span converter for Buffer and ArraySpan (#38027) GH-37242 - [Python][Parquet] Parquet Support write and validate Page CRC (#38360) GH-37312 - [Python][Docs] Update Python docstrings to reflect new parquet encoding option (#38070) GH-37359 - [C#] Add ToList() to Decimal128Array and Decimal256Array (#37383) GH-37378 - [C++] Add A Dictionary Compaction Function For DictionaryArray (#37418) GH-37429 - [C++] Add arrow::ipc::StreamDecoder::Reset() (#37970) GH-37511 - [C++] Implement file reads for Azure filesystem (#38269) GH-37582 - [Go][Parquet] Implement Float16 logical type (#37599) GH-37592 - [MATLAB] Add NumRows property to arrow.tabular.RecordBatch (#38215) GH-37710 - [C++][Integration] Add C++ Utf8View implementation (#37792) GH-37753 - [C++][Gandiva] Add external function registry support (#38116) GH-37812 - [MATLAB] Add arrow.type.ListType MATLAB class (#38189) GH-37815 - [MATLAB] Add arrow.array.ListArray MATLAB class (#38357) GH-37848 - [C++][Gandiva] Migrate LLVM JIT engine from MCJIT to ORC v2/LLJIT (#39098) GH-37857 - [Python][Dataset] Expose file size to python dataset (#37868) GH-37889 - [Java][Doc] Improve JDBC driver documentation (#38469) GH-37895 - [C++] Feature: support concatenate recordbatches. (#37896) GH-37910 - [Java][Integration] Implement C Data Interface integration testing (#38248) GH-37943 - [Java] Add parquet file with all supported types (#38249) GH-37979 - [C++] Add support for specifying custom Array opening and closing delimiters to arrow::PrettyPrintDelimiters (#38187) GH-38022 - [Java][FlightRPC] Expose app_metadata on FlightInfo and FlightEndpoint (#38331) GH-38024 - [Java][FlightRPC] Expose appMetadata through JDBC ResultSet (#38781) GH-38033 - [R] Allow code() to return package name prefix. (#38144) GH-38042 - [C++][Benchmark] Add non-stream Codec Compression/Decompression (#38067) GH-38117 - [C++][Parquet] Change DictEncoder dtor checking to warning log (#38118) GH-38131 - [Swift][CI] Add linting and fix linting errors (#38133) GH-38153 - [C#] expose ArrayDataConcatenator.Concatenate (#38154) GH-38164 - [MATLAB] Rename Length property on arrow.array.Array and arrow.array.ChunkedArray to NumElements (#38190) GH-38166 - [MATLAB] Improve tabular object display (#38482) GH-38246 - [JAVA] added new getTransferPair() function that takes in a Field type for Complex Type Vectors (#38261) GH-38264 - [Java][Packaging] Add BOM file (#38336) GH-38271 - [C++][Parquet] Support reading parquet files with multiple gzip members (#38272) GH-38300 - [Dev][Docs] Update dev/README.md for the current merge_arrow_pr.py (#38301) GH-38310 - [MATLAB] Create the testing guideline document for testing MATLAB interface (#38459) GH-38316 - [C#] Implement interval types (#39043) GH-38326 - [C++][Parquet] check the decompressed page size same as size in page header (#38327) GH-38330 - [C++][Azure] Use properties for input stream metadata (#38524) GH-38333 - [C++][FS][Azure] Implement file writes (#38780) GH-38335 - [C++] Implement GetFileInfo for a single file in Azure filesystem (#38505) GH-38339 - [C++][CMake] Use transitive dependency for system GoogleTest (#38340) GH-38341 - [Python] Remove usage of pandas internals DatetimeTZBlock (#38321) GH-38346 - [C++][Parquet] Use new encrypted files for page index encryption test (#38347) GH-38348 - [C#] Make PrimitiveArray support IReadOnlyList&lt;T?&gt; (#38680) GH-38351 - [C#] Add SqlDecimal support to Decimal128Array (#38481) GH-38354 - [MATLAB] Implement fromMATLAB method for arrow.array.ListArray (#38561) GH-38361 - Add validation logic for offsets and values to arrow.array.ListArray.fromArrays (#38531) GH-38376 - [R] : Add dimnames method to Dataset class (#38377) GH-38381 - [C++][Acero] Create a sorted merge node (#38380) GH-38398 - [MATLAB] Improve array display (#38400) GH-38402 - [CI][Integration] Provide wrapper scripts for integration testing (#38403) GH-38415 - [MATLAB] Add indexing “slice” method to C++ Array Proxy class (#38674) GH-38417 - [MATLAB] Implement a TableTypeValidator class that validates a MATLAB cell array contains only tables that share the same schema (#38551) GH-38418 - [MATLAB] Add method for extracting one row of an arrow.tabular.Table as a string (#38463) GH-38419 - [MATLAB] Implement a ClassTypeValidator class that validates a MATLAB cell array contains only values of the same class type. (#38530) GH-38420 - [MATLAB] Implement a DatetimeValidator class that validates a MATLAB cell array contains only values of zoned or unzoned datetimes (#38533) GH-38424 - [CI][C++] Use Fedora 38 instead of 35 (#38425) GH-38452 - [C++][Benchmark] Adding benchmark for LZ4/Snappy Compression (#38453) GH-38457 - [C++] Support LogicalNullCount for DictionaryArray (#38681) GH-38460 - [Java][FlightRPC] Add mTLS support for Flight SQL JDBC driver (#38461) GH-38462 - [Go][Parquet] Handle Boolean RLE encoding/decoding (#38367) GH-38483 - [C#] Add support for more decimal conversions (#38508) GH-38506 - [Go][Parquet] Add NumRows and RowGroupNumRows to pqarrow.FileWriter (#38507) GH-38511 - [Java] Add getTransferPair(Field, BufferAllocator, CallBack) for StructVector and MapVector (#38512) GH-38528 - [Python][Compute] Describe strptime format semantics (#38665) GH-38537 - [Java] upgrade to netty 4.1.100.Final (#38538) GH-38541 - [MATLAB] Add remaining tests for arrow tabular object display (#38564) GH-38542 - [C++][Parquet] Faster scalar BYTE_STREAM_SPLIT (#38529) GH-38562 - [Packaging] Add support for Ubuntu 23.10 (#38563) GH-38576 - [Java] Change JDBC driver to optionally preserve cookies and auth tokens when getting streams (#38580) GH-38578 - [Java][FlightSQL] Remove joda usage from flight-sql library (#38579) GH-38589 - [C++][Gandiva] Support registering external C functions (#38632) GH-38597 - [C++] Implement GetFileInfo(selector) for Azure filesystem (#39009) GH-38602 - [R] Add missing prod for summarize (#38601) GH-38627 - [Java][FlightRPC] Handle null parameter values (#38628) GH-38648 - [Java] Regenerate Flatbuffers (#38650) GH-38652 - [MATLAB] Add tests about time precision preservation when converting MATLAB duration to arrow.array.Time32Array and arrow.array.Time64Array (#38661) GH-38662 - [Java] Add comparators (#38669) GH-38699 - [C++][FS][Azure] Implement CreateDir() (#38708) GH-38700 - [C++][FS][Azure] Implement DeleteDir() (#38793) GH-38701 - [C++][FS][Azure] Implement DeleteDirContents() (#38888) GH-38702 - [C++] : Implement AzureFileSystem::DeleteRootDirContents (#39151) GH-38705 - [C++][FS][Azure] Implement CopyFile() (#39058) GH-38712 - [Python] Remove dead code in _reconstruct_block (#38714) GH-38718 - [Go][Format][Integration] Add StringView/BinaryView to Go implementation (#35769) GH-38732 - [Java][FlightRPC] Add support for Array parameter binding in JDBC (#38733) GH-38751 - [C++][Go][Parquet] Add tests for reading Float16 files in parquet-testing (#38753) GH-38757 - [C#] Implement common interfaces for structure arrays and record batches (#38759) GH-38758 - [C++][FS][Azure] Rename AzurePath to AzureLocation (#38773) GH-38772 - [C++] Implement directory semantics even when the storage account doesn’t support HNS (#39361) GH-38798 - [Integration] Enable C Data Interface integration testing on Rust (#38799) GH-38814 - [C++][Parquet] Update parquet.thrift to sync with 2.10.0 (#38815) GH-38824 - [Go] Enable GC checks (#38826) GH-38836 - [Go] Add Size() for ArrayData (#38839) GH-38852 - [C++] Replace “#ifdef ARROW_WITH_GZIP” in dataset test to ARROW_WITH_ZLIB (#38853) GH-38857 - [Python] Fix append mode for cython 2 (#39027) GH-38857 - [Python] Add append mode for pyarrow.OsFile (#38820) GH-38860 - [C++][Parquet] Using length to optimize bloom filter read (#38863) GH-38870 - [Documentation] Add List View and Large List View to status.rst (#38871) GH-38874 - [C++][Parquet] Minor: making parquet TypedComparator operation as const method (#38875) GH-38884 - [C++] DatasetWriter release rows_in_flight_throttle when allocate writing failed (#38885) GH-38887 - [C++][Parquet] Move EstimatedBufferedValueBytes from TypedColumnWriter to ColumnWriter (#39055) GH-38907 - [C++] Stop installing internal bpacking_simd* headers (#38908) GH-38909 - [Packaging] Drop support for Ubuntu 23.04 (#38910) GH-38918 - [Go] Avoid schema.Fields allocations in some places (#38919) GH-38920 - [C++][Gandiva] Refactor function holder to return arrow Result (#38873) GH-38958 - [C++][Parquet] Fix spelling (#38959) GH-38990 - [Java] Upgrade to flatc version 23.5.26 (#38991) GH-38996 - [Java] Update dependencies and plugins for JPMS modules (#38994) GH-39006 - [Python] Extract libparquet requirements out of libarrow_python.so to new libarrow_python_parquet_encryption.so (#39316) GH-39013 - [Go][Integration] Support cABI import/export of StringView (#39019) GH-39020 - [CI][Release][JS] Use Node.js 18 instead of 16 (#39021) GH-39028 - [Python][CI] Fix dask integration build by temporarily skipping test_categorize_info (#39029) GH-39037 - [Java] Remove (Contrib/Experimental) mention in Flight SQL (#39040) GH-39049 - [C++] Use Cast() instead of CastTo() for Dictionary Scalar in test (#39362) GH-39050 - [C++] Use Cast() instead of CastTo() for Timestamp Scalar in test (#39060) GH-39051 - [C++] Use Cast() instead of CastTo() for List Scalar in test (#39353) GH-39064 - [C++][Parquet] Support row group filtering for nested paths for struct fields (#39065) GH-39088 - [Dev][Java] Add Dependabot configuration for Java (#39089) GH-39096 - [Python] Release GIL in .nbytes (#39097) GH-39119 - [C++] Refactor the Azure FS tests and filesystem class instantiation (#39207) GH-39122 - [C++][Parquet] Optimize FLBA record reader (#39124) GH-39134 - Create module info compiler plugin (#39135) GH-39159 - [C++] : Try to make Buffer::device_type_ non-optional (#39150) GH-39170 - [Java] Improve error message explaining why TestTls might fail (#39171) GH-39189 - [Java] Bump com.h2database:h2 from 1.4.196 to 2.2.224 in /java (#39188) GH-39196 - [Python][Docs] Document the Arrow PyCapsule protocol in the ‘extending pyarrow’ section of the Python docs (#39199) GH-39208 - [C++][Parquet] Remove deprecated AppendRowGroup(int64_t num_rows) (#39209) GH-39210 - [C++][Parquet] Avoid WriteRecordBatch from produce zero-sized RowGroup (#39211) GH-39217 - [Python] RecordBatchReader.from_stream constructor for objects implementing the Arrow PyCapsule protocol (#39218) GH-39223 - [C#] Support IReadOnlyList&lt;T?&gt; on remaining scalar types (#39224) GH-39225 - [GLib] Use Cast() instaed of CastTo (#39228) GH-39232 - [C++] Support binary to fixed_size_binary cast (#39236) GH-39243 - [R][CI] Remove r-arrow conda nightlies (#39244) GH-39246 - [CI][GLib][Ruby] Use Ubuntu 22.04 not 20.04 (#39247) GH-39262 - [C++][Azure][FS] Add default credential auth configuration (#39263) GH-39265 - [Java] Make it run well with the netty newest version 4.1.104 (#39266) GH-39268 - [C++] Don’t install bundled Azure SDK for C++ with CMake 3.28+ (#39269) GH-39292 - [C++][FS] : Remove the AzureBackend enum and add more flexible connection options (#39293) GH-39297 - [C++][FS] : Inform caller of container not-existing when checking for HNS support (#39298) GH-39299 - [Java] Upgrade to Avro 1.11.3 (#39300) GH-39303 - [Archery][Benchmarking] Allow setting C++ repetition min time (#39324) GH-39318 - [C++][FS][Azure] Add workload identity auth configuration (#39319) GH-39320 - [C++][FS][Azure] Add managed identity auth configuration (#39321) GH-39322 - [C++] Forward arguments to ExceptionToStatus all the way to Status::FromArgs (#39323) GH-39326 - [C++] Flaky DatasetWriterTestFixture.MaxRowsOneWriteBackpresure test (#39379) GH-39328 - [Java] Make default getConsumer public (#39329) GH-39335 - [C#] Support creating FlightClient with Grpc.Core.Channel (#39348) GH-39339 - [C++] Add ForceCachedHierarchicalNamespaceSupport to help with testing (#39340) GH-39341 - [C#] Support Utf8View, BinaryView and ListView (#39342) GH-39343 - [C++][FS][Azure] Add client secret auth configuration (#39346) GH-39355 - [Java] Improve JdbcConsumer exceptions (#39356) GH-39357 - [C++] Reduce function.h includes (#39312) GH-39363 - [C++] Use Cast() instead of CastTo() for Parquet (#39364) GH-39413 - [C++][Parquet] Vectorize decode plain on FLBA (#39414) GH-39419 - [C++][Parquet] Style: Using arrow::Buffer data_as api rather than reinterpret_cast (#39420) GH-39430 - [C++][ORC] Upgrade ORC to 1.9.2 (#39431) GH-39449 - [C++] Use default Azure credentials implicitly and support anonymous credentials explicitly (#39450) GH-39484 - [Java] Support 256 bit decimals in JdbcToArrowUtils (#39485) GH-39500 - [Docs] Pin pydata-sphinx-theme to 0.14 (#39501) GH-39515 - [Python] Pass in type to MapType.from_arrays (#39516) GH-39531 - [Python][CI] Skip failing dask tests: test_describe_empty and test_view (#39534) GH-39533 - [Python] NumPy 2.0 compat: remove usage of np.core (#39535) GH-39537 - [Packaging][Python] Add a numpy&lt;2 pin to the install requirements for the 15.x release branch (#39538) GH-39601 - [R] Don’t download cmake when TEST_OFFLINE_BUILD=true (#39602) GH-39624 - [R][CI] Add CMake to docker file and update envvars (#39625) GH-39626 - [Docs][R] Update NEWS.md for 15.0.0 PARQUET-2411 - [C++][Parquet] Allow reading dictionary without reading data via ByteArrayDictionaryRecordReader (#39153)" />
<link rel="canonical" href="https://arrow.apache.org/release/15.0.0.html" />
<meta property="og:url" content="https://arrow.apache.org/release/15.0.0.html" />
<meta property="og:site_name" content="Apache Arrow" />
<meta property="og:image" content="https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png" />
<meta property="og:type" content="article" />
<meta property="article:published_time" content="2024-04-29T17:30:49-04:00" />
<meta name="twitter:card" content="summary_large_image" />
<meta property="twitter:image" content="https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png" />
<meta property="twitter:title" content="Apache Arrow 15.0.0 Release" />
<meta name="twitter:site" content="@ApacheArrow" />
<script type="application/ld+json">
{"@context":"https://schema.org","@type":"BlogPosting","dateModified":"2024-04-29T17:30:49-04:00","datePublished":"2024-04-29T17:30:49-04:00","description":"Apache Arrow 15.0.0 (21 January 2024) This is a major release covering more than 1 months of development. Download Source Artifacts Binary Artifacts For AlmaLinux For Amazon Linux For CentOS For C# For Debian For Python For Ubuntu Git tag Contributors This release includes 536 commits from 101 distinct contributors. $ git shortlog -sn apache-arrow-14.0.2..apache-arrow-15.0.0 67 dependabot[bot] 42 Sutou Kouhei 31 Raúl Cumplido 27 Josh Soref 20 Antoine Pitrou 19 Felipe Oliveira Carvalho 19 mwish 18 Dewey Dunnington 17 James Duong 16 Curt Hagenlocher 16 Dominik Moritz 16 Joris Van den Bossche 15 Jacob Wujciak-Jens 14 Hyunseok Seo 12 Alenka Frim 10 Yue 10 sgilmore10 9 Matt Topol 8 Dane Pitkin 8 Diego Fernández Giraldo 8 Thomas Newton 6 Bryce Mecum 5 Jin Shang 5 Jonathan Keane 5 Kevin Gurney 4 Benjamin Kietzman 4 Gang Wu 3 Ben Harkins 3 Ivan Chesnov 3 Judah Rand 3 Lei Hou 3 Rossi(Ruoxi) Sun 3 Tim Schaub 3 abandy 2 Alfonso Subiotto Marqués 2 David Li 2 Divyansh200102 2 Eero Lihavainen 2 Francis 2 Gavin Murrison 2 JB Onofré 2 Jeremy Aguilon 2 John 2 Junming Chen 2 Laurent Goujon 2 Nic Crane 2 Peter Andreas Entschev 2 Rok Mihevc 2 Stas Stepanov 2 Tsutomu Katsube 2 Vibhatha Lakmal Abeykoon 1 Abram Fleishman 1 Anja Kefala 1 Atheel Massalha 1 Benjamin Schmidt 1 Bob Plotts 1 Bryan Cutler 1 Carl Jackson 1 ChinYikMing 1 Chris Larsen 1 Christian Holm Christensen 1 Dan Homola 1 Dan Stone 1 Danyaal Khan 1 Diogo Teles Sant&#39;Anna 1 Dongjoon Hyun 1 Fatemah Panahi 1 Fernando Mayer 1 Fokko Driesprong 1 Hattonuri 1 Hendrik Makait 1 Jiaxing Liang 1 Jinpeng 1 John Garland 1 Kyle Barron 1 LucasG0 1 Maximilian Muecke 1 Miguel Pragier 1 Miles 1 Neal Richardson 1 Nick Hughes 1 Paul Spangler 1 Phillip LeBlanc 1 Pierre Moulon 1 Quang Hoang 1 Ravjot Brar 1 Tammy DiPrima 1 Will Jones 1 William Ayd 1 Yifeng-Sigma 1 david dali susanibar arce 1 davidhcoe 1 frazar 1 jiawei liang 1 loicalleyne 1 orgadish 1 panbingkun 1 prmoore77 1 shibei 1 zhipeng 1 谢天 Patch Committers The following Apache committers merged contributed patches to the repository. $ git shortlog -sn --group=trailer:signed-off-by apache-arrow-14.0.2..apache-arrow-15.0.0 154 Sutou Kouhei 56 David Li 50 Antoine Pitrou 35 Joris Van den Bossche 30 Jacob Wujciak-Jens 24 Raúl Cumplido 22 Matt Topol 19 Curt Hagenlocher 18 Kevin Gurney 16 Benjamin Kietzman 16 Felipe Oliveira Carvalho 14 mwish 13 Dewey Dunnington 12 AlenkaF 7 Nic Crane 3 Weston Pace 2 Gang Wu 2 Jonathan Keane 1 Josh Soref 1 Rok Mihevc 1 Will Jones 1 dependabot[bot] Changelog Apache Arrow 15.0.0 (2024-01-21 08:00:00) Bug Fixes GH-15192 - [C++] Bring back case_when tests for union types (#39308) GH-32570 - [C++] Fix the issue of ExecBatchBuilder when appending consecutive tail rows with the same id may exceed buffer boundary (#39234) GH-32662 - [C#] Make dictionaries in file and memory implementations work correctly and support integration tests (#39146) GH-33475 - [Java] Add parameter binding for Prepared Statements in JDBC driver (#38404) GH-34532 - [Java][FlightSQL] Change JDBC to handle multi-endpoints (#38521) GH-34610 - [Java] Fix valueCount and field name when loading/transferring NullVector (#38973) GH-34890 - [C++][Python] Add a no-op kernel for dictionary_encode(dictionary) (#38349) GH-35497 - [C++] Use the latest tagged version of flatbuffers (#38192) GH-36588 - [C#] Support blank column names and enable more integration tests. (#39167) GH-36594 - [C++] Don’t use MSVC_VERSION to determin -fms-compatibility-version (#36595) GH-36912 - [Java] JDBC driver stops consuming roots if it sees an empty root (#38590) GH-37055 - [C++] Optimize hash kernels for Dictionary ChunkedArrays (#38394) GH-37657 - [JS] Run bin scripts with ts-node (#38500) GH-37726 - [Swift][FlightSQL] Update behavior to be similar to existing impls (#37764) GH-37751 - [C++][Gandiva] Avoid registering exported functions multiple times in gandiva (#37752) GH-37796 - [C++][Acero] Fix race condition caused by straggling input in the as-of-join node (#37839) GH-37884 - [Swift] allow reading of unaligned FlatBuffers buffers (#38635) GH-37969 - [C++][Parquet] add more closed file checks for ParquetFileWriter (#38390) GH-38096 - [Java] FlightStream with metadata can cause error when closing (#38110) GH-38198 - [Go] Fix AuthenticateBasicToken to be reliable behind proxies (#38199) GH-38210 - [C++][FlightRPC] Add missing app_metadata arguments (#38231) GH-38216 - [R] open_dataset(format = “json”) not documented (#38258) GH-38242 - [Java] Fix incorrect internal struct accounting for DenseUnionVector#getBufferSizeFor (#38305) GH-38254 - [Java] Add reusable buffer getters to char/binary vectors (#38266) GH-38268 - [Java] Disable flaky TestFlightSqlStreams (#38319) GH-38281 - [Go] Ensure CData imported arrays are freed on release (#38314) GH-38297 - [C#] Fix build for .NET 4.7.2 (#38299) GH-38304 - [C++][Parquet] Fix Valgrind memory leak in arrow-dataset-file-parquet-encryption-test (#38306) GH-38307 - [CI] Remove gemfury_clean.rb (#38308) GH-38318 - [Java][FlightRPC] Enable tests that leaked (#38719) GH-38323 - [CI][Python] Use system gdb on test-conda-python (#38324) GH-38363 - [Release][CI] Omit tests for main/maintenance branches on RC branch (#38365) GH-38366 - [Java] Fix Murmur hash on buffers less than 4 bytes (#38368) GH-38378 - [C++][Parquet] Don’t initialize OpenSSL explicitly with OpenSSL 1.1 (#38379) GH-38382 - [R] Explicitly clean up arrow_duck_connection() on exit (#38495) GH-38387 - [Java] Fix JDK8 compilation issue with TestAllTypes (#38388) GH-38395 - [Go] fix rounding errors in decimal256 string functions (#38426) GH-38399 - [Go][Parquet] DeltaBitPack decoder reset usedFirst after SetData (#38413) GH-38401 - [C++] Re-generate flatbuffers C++ for Skyhook (#38405) GH-38436 - [R] Test segfault on reading CSVs with non-UTF-8 encoding GH-38439 - [Java][CI] Use Eclipse Temurin for all Java CI linux jobs (#38440) GH-38447 - [CI][Release] Don’t use “   {exit,continue}” (#38486) GH-38458 - [Go] Add ValueLen to BinaryLike interface (#39242) GH-38470 - [CI][Integration] Install jpype and build JNI c-data to run integration tests (#39502) GH-38477 - [Go] Fixing decimal 128 rounding issue (#38478) GH-38479 - [C++] Avoid passing null pointer to LZ4 frame decompressor (#39125) GH-38503 - [Go][Parquet] Make the arrow column writer internal (#38727) GH-38503 - [Go][Parquet] Style improvement for using ArrowColumnWriter (#38581) GH-38516 - [Go][Parquet] Increment the number of rows written when appending a new row group (#38517) GH-38535 - [Python] Fix S3FileSystem equals None segfault (#39276) GH-38554 - [Release][Website] post-03-website.sh doesn’t quote current.date (#38555) GH-38556 - [C++] Add missing explicit size_t cast for i386 (#38557) GH-38594 - [Docs][C++][Gandiva] Document how to register Gandiva external functions (#38763) GH-38599 - [Docs] Update Headers (#38696) GH-38614 - [Java] Add VarBinary and VarCharWriter helper methods to more writers (#38631) GH-38624 - [C++] Fix: add TestingEqualOptions for gtest functions. (#38642) GH-38630 - [MATLAB] arrow.array.BooleanArray’s toMATLAB method does not take slice offsets into account (#38636) GH-38653 - [Packaging][Java][Python][Ruby] Raise the minimum macOS version to 10.15 catalina to allow using new APIs in C++17 (#38677) GH-38683 - [Python][Docs] Update docstrings for Time32Type and Time64Type (#39059) GH-38684 - [Integration] Try to strengthen C Data Interface testing (#38846) GH-38697 - [C++][Gandiva] Use arrow io util to replace std::filesystem::path in gandiva (#38698) GH-38709 - [C++] Protect against PREALLOCATE preprocessor defined on macOS (#38760) GH-38711 - [CI] Rollback aws-cli for preview documentation (#38723) GH-38725 - [Java] decompression in Lz4CompressionCodec.java does not set writer index (#38840) GH-38728 - [Go] ipc: put lz4 decompression buffers back into sync.Pool (#38729) GH-38737 - [Java] Fix JDBC caching of SqlInfo values (#38739) GH-38738 - [C++] Check variadic buffer counts in bounds (#38740) GH-38745 - [Integration] Fix huge integration test (#38746) GH-38762 - [R] Versions of R and RTools in CI config are no longer current GH-38764 - [Java] Clarify warning about --add-opens=java.base/java.nio=ALL-UNNAMED (#38765) GH-38782 - [C++][FS][Azure] Do nothing for CreateDir(“/container”, true) (#38783) GH-38795 - [Go] Fix race GetToTimeFunc for Timestamp (#38797) GH-38811 - [R] Actually use fetched cmake on macos (#39453) GH-38816 - [C#] Fix IArrowRecord implementation on StructArray (#38827) GH-38823 - Fix TestArrowReaderAdHoc.ReadFloat16Files to use new uncompressed files (#38825) GH-38832 - [Java] Avoid building twice in ci/scripts/java_build.sh (#38829) GH-38844 - [C++] S3FileSystem export s3 sdk config “use_virtual_addressing” to arrow::fs::S3Options (#38858) GH-38851 - Website: Contributing link is not working GH-38879 - [C++][Gandiva] Fix Gandiva to_date function’s validation for supress errors parameter (#38987) GH-38883 - [Docs] Fix struct example to show hiding a child’s entry (#38898) GH-38906 - [R] Improve Windows CI configuration (#38927) GH-38921 - [CI] Fix spelling (#38922) GH-38925 - [CI] Fix spelling (#38926) GH-38928 - [R] Fix spelling (#38929) GH-38930 - [Java] Fix spelling (#38931) GH-38932 - [GO] Fix spelling (#38933) GH-38938 - [FlightRPC] Fix spelling (#38939) GH-38940 - [Ruby] Fix spelling (#38941) GH-38942 - [C#] Fix spelling (#38943) GH-38944 - [Python] Fix spelling (#38945) GH-38946 - [MATLAB] Fix spelling (#38947) GH-38948 - [Swift] Fix spelling (#38949) GH-38950 - [Docs] Fix spelling (#38951) GH-38952 - [Format] Fix spelling (#38953) GH-38954 - [Dev] Fix spelling (#38955) GH-38956 - [Gandiva] Fix spelling (#38957) GH-38960 - [C++] Fix spelling (acero) (#38961) GH-38964 - [C++] Fix spelling (compute) (#38965) GH-38966 - [C++] Fix spelling (util) (#38967) GH-38968 - [C++] Fix spelling (dataset) (#38969) GH-38971 - [C++] Fix spelling (filesystem) (#38972) GH-38975 - [Dev] Fix spelling (#38976) GH-38977 - [C++] Fix spelling (#38978) GH-38979 - [C++] Fix spelling (#38980) GH-38981 - [R][Release] Don’t update version.json on compatible version release (#38982) GH-39014 - [Java] Add default truststore along with KeychainStore when on Mac system (#39235) GH-39031 - [Docs] Remove misspelled rule from contrib css (#39032) GH-39045 - [C++][Acero] union node output batches should be unordered (#39046) GH-39113 - [Integration][Flight][Java] Fix occasional failure starting Java server (#39115) GH-39116 - [Go] Fix CI Staticcheck (#39117) GH-39126 - [C++][CI] Fix Valgrind failures (#39127) GH-39130 - [CI][GLib][Windows] Use old Ruby as workaround for load error (#39168) GH-39136 - [C++] Remove needless system Protobuf dependency with -DARROW_HDFS=ON (#39137) GH-39138 - [R] Fix implicit conversion warnings (#39250) GH-39156 - [C++][Compute] Fix negative duration division (#39158) GH-39163 - [C++] Add missing data copy in StreamDecoder::Consume(data) (#39164) GH-39185 - [C++] Remove compiler warnings with -Wconversion -Wno-sign-conversion in public headers (#39186) GH-39191 - [R] throw error when string_replace is passed vector of values in pattern (#39219) GH-39238 - [Go] PATCH Prevents empty record to be appended to empty resultset (#39239) GH-39288 - [Java][FlightSQL] Update Apache Avatica to version 1.24.0 (#39325) GH-39306 - [C++][Benchmarking] Remove hardcoded min times (#39307) GH-39327 - [Java] define assemble descriptor for new custom maven plugin project (#39331) GH-39333 - [C++] Don’t use “if constexpr” in lambda (#39334) GH-39359 - [CI][C++] Remove MinGW MINGW32 C++ job (#39376) GH-39384 - [C++] Disable -Werror=attributes for Azure SDK’s identity.hpp (#39448) GH-39387 - [C++] Fix compile warning (#39389) GH-39421 - [CI][Ruby] Update to using Ubuntu 22.04 on test-ruby and test-c-glib nightly jobs (#39422) GH-39423 - [CI][JS] TypeScript: Compilation failed on yarn build for several CI jobs GH-39425 - [CI] Fix import to match new substrait repo structure (#39426) GH-39433 - [Ruby] Add support for Table.load(format: json) options (#39464) GH-39437 - [CI][Python] Update pandas tests failing on pandas nightly CI build (#39498) GH-39468 - [Java] Fix site build for docs (#39471) GH-39469 - [CI][JS] Force node 20 on JS build on arm64 to fix build issues (#39499) GH-39488 - [Ruby] Add support for ChunkedArray in Ractor (#39490) GH-39517 - [C++] Disable parallelism for jemalloc external project (#39522) GH-39562 - [C++][Parquet] Fix crash in test_parquet_dataset_lazy_filtering (#39632) GH-39564 - [CI][Java] Set correct version on Java BOM (#39580) GH-39584 - [R] fallback to source gracefully (#39587) GH-39588 - [CI][Go] Add CGO_ENABLED=1 to cdata_integration build to fix macOS build with conda (#39589) GH-39598 - [C#] Fix verification script (#39605) GH-39604 - [JS] Do not use resizable buffers yet (#39607) GH-39628 - [C++] Disable parallelism for all `make`-based externalProjects when CMake &gt;= 3.28 is used New Features and Improvements GH-14936 - [Java] Remove netty dependency from arrow-vector (#38493) GH-28994 - [C++][JSON] Change the max rows to Unlimited(int_32) (#38582) GH-30117 - [C++][Python] Add “Z” to the end of timestamp print string when tz defined (#39272) GH-30717 - [C#] Add ToString() methods to Arrow classes (#36566) GH-31303 - [Python] Remove the legacy ParquetDataset custom python-based implementation (#39112) GH-31579 - [C#] : Remove out-of-support versions of .NET and update C# README (#39165) GH-33500 - [Python] add Table.to/from_struct_array (#38520) GH-33984 - [C++][Python] DLPack implementation for Arrow Arrays (producer) (#38472) GH-34316 - [Python] FixedSizeListArray.from_arrays supports mask parameter (#39396) GH-34569 - [C++] Diffing of Run-End Encoded arrays (#35003) GH-34636 - [C#] Reduce allocations when using ArrayPool (#39166) GH-35260 - [C++][Python][R] Allow users to adjust S3 log level by environment variable (#38267) GH-35331 - [Python] Expose Parquet sorting metadata (#37665) GH-35344 - [C++][Format] Implementation of the LIST_VIEW and LARGE_LIST_VIEW array formats (#35345) GH-35560 - [C++] Use Cast() instead of CastTo() for Scalar in test (#39044) GH-36036 - [C++][Python][Parquet] Implement Float16 logical type (#36073) GH-36044 - [Python][Docs] Added ParquetFileFragment to the API reference docs (#38277) GH-36099 - [C++] Add Utf8View and BinaryView to the c ABI (#38443) GH-36441 - [Python] Make CacheOptions configurable from Python (#36627) GH-36760 - [Go] Add Avro OCF reader (#37115) GH-36815 - [C#] : Enable net472 tests under Windows (#36818) GH-36898 - [CI] Hashpin Sensitive GitHub Actions (#37676) GH-37002 - [C++][Parquet] Add api to get RecordReader from RowGroupReader (#37003) GH-37061 - [Docs][Format] Clarify semantics of GetSchema in FSQL (#38549) GH-37199 - [C++] Expose a span converter for Buffer and ArraySpan (#38027) GH-37242 - [Python][Parquet] Parquet Support write and validate Page CRC (#38360) GH-37312 - [Python][Docs] Update Python docstrings to reflect new parquet encoding option (#38070) GH-37359 - [C#] Add ToList() to Decimal128Array and Decimal256Array (#37383) GH-37378 - [C++] Add A Dictionary Compaction Function For DictionaryArray (#37418) GH-37429 - [C++] Add arrow::ipc::StreamDecoder::Reset() (#37970) GH-37511 - [C++] Implement file reads for Azure filesystem (#38269) GH-37582 - [Go][Parquet] Implement Float16 logical type (#37599) GH-37592 - [MATLAB] Add NumRows property to arrow.tabular.RecordBatch (#38215) GH-37710 - [C++][Integration] Add C++ Utf8View implementation (#37792) GH-37753 - [C++][Gandiva] Add external function registry support (#38116) GH-37812 - [MATLAB] Add arrow.type.ListType MATLAB class (#38189) GH-37815 - [MATLAB] Add arrow.array.ListArray MATLAB class (#38357) GH-37848 - [C++][Gandiva] Migrate LLVM JIT engine from MCJIT to ORC v2/LLJIT (#39098) GH-37857 - [Python][Dataset] Expose file size to python dataset (#37868) GH-37889 - [Java][Doc] Improve JDBC driver documentation (#38469) GH-37895 - [C++] Feature: support concatenate recordbatches. (#37896) GH-37910 - [Java][Integration] Implement C Data Interface integration testing (#38248) GH-37943 - [Java] Add parquet file with all supported types (#38249) GH-37979 - [C++] Add support for specifying custom Array opening and closing delimiters to arrow::PrettyPrintDelimiters (#38187) GH-38022 - [Java][FlightRPC] Expose app_metadata on FlightInfo and FlightEndpoint (#38331) GH-38024 - [Java][FlightRPC] Expose appMetadata through JDBC ResultSet (#38781) GH-38033 - [R] Allow code() to return package name prefix. (#38144) GH-38042 - [C++][Benchmark] Add non-stream Codec Compression/Decompression (#38067) GH-38117 - [C++][Parquet] Change DictEncoder dtor checking to warning log (#38118) GH-38131 - [Swift][CI] Add linting and fix linting errors (#38133) GH-38153 - [C#] expose ArrayDataConcatenator.Concatenate (#38154) GH-38164 - [MATLAB] Rename Length property on arrow.array.Array and arrow.array.ChunkedArray to NumElements (#38190) GH-38166 - [MATLAB] Improve tabular object display (#38482) GH-38246 - [JAVA] added new getTransferPair() function that takes in a Field type for Complex Type Vectors (#38261) GH-38264 - [Java][Packaging] Add BOM file (#38336) GH-38271 - [C++][Parquet] Support reading parquet files with multiple gzip members (#38272) GH-38300 - [Dev][Docs] Update dev/README.md for the current merge_arrow_pr.py (#38301) GH-38310 - [MATLAB] Create the testing guideline document for testing MATLAB interface (#38459) GH-38316 - [C#] Implement interval types (#39043) GH-38326 - [C++][Parquet] check the decompressed page size same as size in page header (#38327) GH-38330 - [C++][Azure] Use properties for input stream metadata (#38524) GH-38333 - [C++][FS][Azure] Implement file writes (#38780) GH-38335 - [C++] Implement GetFileInfo for a single file in Azure filesystem (#38505) GH-38339 - [C++][CMake] Use transitive dependency for system GoogleTest (#38340) GH-38341 - [Python] Remove usage of pandas internals DatetimeTZBlock (#38321) GH-38346 - [C++][Parquet] Use new encrypted files for page index encryption test (#38347) GH-38348 - [C#] Make PrimitiveArray support IReadOnlyList&lt;T?&gt; (#38680) GH-38351 - [C#] Add SqlDecimal support to Decimal128Array (#38481) GH-38354 - [MATLAB] Implement fromMATLAB method for arrow.array.ListArray (#38561) GH-38361 - Add validation logic for offsets and values to arrow.array.ListArray.fromArrays (#38531) GH-38376 - [R] : Add dimnames method to Dataset class (#38377) GH-38381 - [C++][Acero] Create a sorted merge node (#38380) GH-38398 - [MATLAB] Improve array display (#38400) GH-38402 - [CI][Integration] Provide wrapper scripts for integration testing (#38403) GH-38415 - [MATLAB] Add indexing “slice” method to C++ Array Proxy class (#38674) GH-38417 - [MATLAB] Implement a TableTypeValidator class that validates a MATLAB cell array contains only tables that share the same schema (#38551) GH-38418 - [MATLAB] Add method for extracting one row of an arrow.tabular.Table as a string (#38463) GH-38419 - [MATLAB] Implement a ClassTypeValidator class that validates a MATLAB cell array contains only values of the same class type. (#38530) GH-38420 - [MATLAB] Implement a DatetimeValidator class that validates a MATLAB cell array contains only values of zoned or unzoned datetimes (#38533) GH-38424 - [CI][C++] Use Fedora 38 instead of 35 (#38425) GH-38452 - [C++][Benchmark] Adding benchmark for LZ4/Snappy Compression (#38453) GH-38457 - [C++] Support LogicalNullCount for DictionaryArray (#38681) GH-38460 - [Java][FlightRPC] Add mTLS support for Flight SQL JDBC driver (#38461) GH-38462 - [Go][Parquet] Handle Boolean RLE encoding/decoding (#38367) GH-38483 - [C#] Add support for more decimal conversions (#38508) GH-38506 - [Go][Parquet] Add NumRows and RowGroupNumRows to pqarrow.FileWriter (#38507) GH-38511 - [Java] Add getTransferPair(Field, BufferAllocator, CallBack) for StructVector and MapVector (#38512) GH-38528 - [Python][Compute] Describe strptime format semantics (#38665) GH-38537 - [Java] upgrade to netty 4.1.100.Final (#38538) GH-38541 - [MATLAB] Add remaining tests for arrow tabular object display (#38564) GH-38542 - [C++][Parquet] Faster scalar BYTE_STREAM_SPLIT (#38529) GH-38562 - [Packaging] Add support for Ubuntu 23.10 (#38563) GH-38576 - [Java] Change JDBC driver to optionally preserve cookies and auth tokens when getting streams (#38580) GH-38578 - [Java][FlightSQL] Remove joda usage from flight-sql library (#38579) GH-38589 - [C++][Gandiva] Support registering external C functions (#38632) GH-38597 - [C++] Implement GetFileInfo(selector) for Azure filesystem (#39009) GH-38602 - [R] Add missing prod for summarize (#38601) GH-38627 - [Java][FlightRPC] Handle null parameter values (#38628) GH-38648 - [Java] Regenerate Flatbuffers (#38650) GH-38652 - [MATLAB] Add tests about time precision preservation when converting MATLAB duration to arrow.array.Time32Array and arrow.array.Time64Array (#38661) GH-38662 - [Java] Add comparators (#38669) GH-38699 - [C++][FS][Azure] Implement CreateDir() (#38708) GH-38700 - [C++][FS][Azure] Implement DeleteDir() (#38793) GH-38701 - [C++][FS][Azure] Implement DeleteDirContents() (#38888) GH-38702 - [C++] : Implement AzureFileSystem::DeleteRootDirContents (#39151) GH-38705 - [C++][FS][Azure] Implement CopyFile() (#39058) GH-38712 - [Python] Remove dead code in _reconstruct_block (#38714) GH-38718 - [Go][Format][Integration] Add StringView/BinaryView to Go implementation (#35769) GH-38732 - [Java][FlightRPC] Add support for Array parameter binding in JDBC (#38733) GH-38751 - [C++][Go][Parquet] Add tests for reading Float16 files in parquet-testing (#38753) GH-38757 - [C#] Implement common interfaces for structure arrays and record batches (#38759) GH-38758 - [C++][FS][Azure] Rename AzurePath to AzureLocation (#38773) GH-38772 - [C++] Implement directory semantics even when the storage account doesn’t support HNS (#39361) GH-38798 - [Integration] Enable C Data Interface integration testing on Rust (#38799) GH-38814 - [C++][Parquet] Update parquet.thrift to sync with 2.10.0 (#38815) GH-38824 - [Go] Enable GC checks (#38826) GH-38836 - [Go] Add Size() for ArrayData (#38839) GH-38852 - [C++] Replace “#ifdef ARROW_WITH_GZIP” in dataset test to ARROW_WITH_ZLIB (#38853) GH-38857 - [Python] Fix append mode for cython 2 (#39027) GH-38857 - [Python] Add append mode for pyarrow.OsFile (#38820) GH-38860 - [C++][Parquet] Using length to optimize bloom filter read (#38863) GH-38870 - [Documentation] Add List View and Large List View to status.rst (#38871) GH-38874 - [C++][Parquet] Minor: making parquet TypedComparator operation as const method (#38875) GH-38884 - [C++] DatasetWriter release rows_in_flight_throttle when allocate writing failed (#38885) GH-38887 - [C++][Parquet] Move EstimatedBufferedValueBytes from TypedColumnWriter to ColumnWriter (#39055) GH-38907 - [C++] Stop installing internal bpacking_simd* headers (#38908) GH-38909 - [Packaging] Drop support for Ubuntu 23.04 (#38910) GH-38918 - [Go] Avoid schema.Fields allocations in some places (#38919) GH-38920 - [C++][Gandiva] Refactor function holder to return arrow Result (#38873) GH-38958 - [C++][Parquet] Fix spelling (#38959) GH-38990 - [Java] Upgrade to flatc version 23.5.26 (#38991) GH-38996 - [Java] Update dependencies and plugins for JPMS modules (#38994) GH-39006 - [Python] Extract libparquet requirements out of libarrow_python.so to new libarrow_python_parquet_encryption.so (#39316) GH-39013 - [Go][Integration] Support cABI import/export of StringView (#39019) GH-39020 - [CI][Release][JS] Use Node.js 18 instead of 16 (#39021) GH-39028 - [Python][CI] Fix dask integration build by temporarily skipping test_categorize_info (#39029) GH-39037 - [Java] Remove (Contrib/Experimental) mention in Flight SQL (#39040) GH-39049 - [C++] Use Cast() instead of CastTo() for Dictionary Scalar in test (#39362) GH-39050 - [C++] Use Cast() instead of CastTo() for Timestamp Scalar in test (#39060) GH-39051 - [C++] Use Cast() instead of CastTo() for List Scalar in test (#39353) GH-39064 - [C++][Parquet] Support row group filtering for nested paths for struct fields (#39065) GH-39088 - [Dev][Java] Add Dependabot configuration for Java (#39089) GH-39096 - [Python] Release GIL in .nbytes (#39097) GH-39119 - [C++] Refactor the Azure FS tests and filesystem class instantiation (#39207) GH-39122 - [C++][Parquet] Optimize FLBA record reader (#39124) GH-39134 - Create module info compiler plugin (#39135) GH-39159 - [C++] : Try to make Buffer::device_type_ non-optional (#39150) GH-39170 - [Java] Improve error message explaining why TestTls might fail (#39171) GH-39189 - [Java] Bump com.h2database:h2 from 1.4.196 to 2.2.224 in /java (#39188) GH-39196 - [Python][Docs] Document the Arrow PyCapsule protocol in the ‘extending pyarrow’ section of the Python docs (#39199) GH-39208 - [C++][Parquet] Remove deprecated AppendRowGroup(int64_t num_rows) (#39209) GH-39210 - [C++][Parquet] Avoid WriteRecordBatch from produce zero-sized RowGroup (#39211) GH-39217 - [Python] RecordBatchReader.from_stream constructor for objects implementing the Arrow PyCapsule protocol (#39218) GH-39223 - [C#] Support IReadOnlyList&lt;T?&gt; on remaining scalar types (#39224) GH-39225 - [GLib] Use Cast() instaed of CastTo (#39228) GH-39232 - [C++] Support binary to fixed_size_binary cast (#39236) GH-39243 - [R][CI] Remove r-arrow conda nightlies (#39244) GH-39246 - [CI][GLib][Ruby] Use Ubuntu 22.04 not 20.04 (#39247) GH-39262 - [C++][Azure][FS] Add default credential auth configuration (#39263) GH-39265 - [Java] Make it run well with the netty newest version 4.1.104 (#39266) GH-39268 - [C++] Don’t install bundled Azure SDK for C++ with CMake 3.28+ (#39269) GH-39292 - [C++][FS] : Remove the AzureBackend enum and add more flexible connection options (#39293) GH-39297 - [C++][FS] : Inform caller of container not-existing when checking for HNS support (#39298) GH-39299 - [Java] Upgrade to Avro 1.11.3 (#39300) GH-39303 - [Archery][Benchmarking] Allow setting C++ repetition min time (#39324) GH-39318 - [C++][FS][Azure] Add workload identity auth configuration (#39319) GH-39320 - [C++][FS][Azure] Add managed identity auth configuration (#39321) GH-39322 - [C++] Forward arguments to ExceptionToStatus all the way to Status::FromArgs (#39323) GH-39326 - [C++] Flaky DatasetWriterTestFixture.MaxRowsOneWriteBackpresure test (#39379) GH-39328 - [Java] Make default getConsumer public (#39329) GH-39335 - [C#] Support creating FlightClient with Grpc.Core.Channel (#39348) GH-39339 - [C++] Add ForceCachedHierarchicalNamespaceSupport to help with testing (#39340) GH-39341 - [C#] Support Utf8View, BinaryView and ListView (#39342) GH-39343 - [C++][FS][Azure] Add client secret auth configuration (#39346) GH-39355 - [Java] Improve JdbcConsumer exceptions (#39356) GH-39357 - [C++] Reduce function.h includes (#39312) GH-39363 - [C++] Use Cast() instead of CastTo() for Parquet (#39364) GH-39413 - [C++][Parquet] Vectorize decode plain on FLBA (#39414) GH-39419 - [C++][Parquet] Style: Using arrow::Buffer data_as api rather than reinterpret_cast (#39420) GH-39430 - [C++][ORC] Upgrade ORC to 1.9.2 (#39431) GH-39449 - [C++] Use default Azure credentials implicitly and support anonymous credentials explicitly (#39450) GH-39484 - [Java] Support 256 bit decimals in JdbcToArrowUtils (#39485) GH-39500 - [Docs] Pin pydata-sphinx-theme to 0.14 (#39501) GH-39515 - [Python] Pass in type to MapType.from_arrays (#39516) GH-39531 - [Python][CI] Skip failing dask tests: test_describe_empty and test_view (#39534) GH-39533 - [Python] NumPy 2.0 compat: remove usage of np.core (#39535) GH-39537 - [Packaging][Python] Add a numpy&lt;2 pin to the install requirements for the 15.x release branch (#39538) GH-39601 - [R] Don’t download cmake when TEST_OFFLINE_BUILD=true (#39602) GH-39624 - [R][CI] Add CMake to docker file and update envvars (#39625) GH-39626 - [Docs][R] Update NEWS.md for 15.0.0 PARQUET-2411 - [C++][Parquet] Allow reading dictionary without reading data via ByteArrayDictionaryRecordReader (#39153)","headline":"Apache Arrow 15.0.0 Release","image":"https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png","mainEntityOfPage":{"@type":"WebPage","@id":"https://arrow.apache.org/release/15.0.0.html"},"publisher":{"@type":"Organization","logo":{"@type":"ImageObject","url":"https://arrow.apache.org/img/logo.png"}},"url":"https://arrow.apache.org/release/15.0.0.html"}</script>
<!-- End Jekyll SEO tag -->
<!-- favicons -->
<link rel="icon" type="image/png" sizes="16x16" href="/img/favicon-16x16.png" id="light1">
<link rel="icon" type="image/png" sizes="32x32" href="/img/favicon-32x32.png" id="light2">
<link rel="apple-touch-icon" type="image/png" sizes="180x180" href="/img/apple-touch-icon.png" id="light3">
<link rel="apple-touch-icon" type="image/png" sizes="120x120" href="/img/apple-touch-icon-120x120.png" id="light4">
<link rel="apple-touch-icon" type="image/png" sizes="76x76" href="/img/apple-touch-icon-76x76.png" id="light5">
<link rel="apple-touch-icon" type="image/png" sizes="60x60" href="/img/apple-touch-icon-60x60.png" id="light6">
<!-- dark mode favicons -->
<link rel="icon" type="image/png" sizes="16x16" href="/img/favicon-16x16-dark.png" id="dark1">
<link rel="icon" type="image/png" sizes="32x32" href="/img/favicon-32x32-dark.png" id="dark2">
<link rel="apple-touch-icon" type="image/png" sizes="180x180" href="/img/apple-touch-icon-dark.png" id="dark3">
<link rel="apple-touch-icon" type="image/png" sizes="120x120" href="/img/apple-touch-icon-120x120-dark.png" id="dark4">
<link rel="apple-touch-icon" type="image/png" sizes="76x76" href="/img/apple-touch-icon-76x76-dark.png" id="dark5">
<link rel="apple-touch-icon" type="image/png" sizes="60x60" href="/img/apple-touch-icon-60x60-dark.png" id="dark6">
<script>
// Switch to the dark-mode favicons if prefers-color-scheme: dark
function onUpdate() {
light1 = document.querySelector('link#light1');
light2 = document.querySelector('link#light2');
light3 = document.querySelector('link#light3');
light4 = document.querySelector('link#light4');
light5 = document.querySelector('link#light5');
light6 = document.querySelector('link#light6');
dark1 = document.querySelector('link#dark1');
dark2 = document.querySelector('link#dark2');
dark3 = document.querySelector('link#dark3');
dark4 = document.querySelector('link#dark4');
dark5 = document.querySelector('link#dark5');
dark6 = document.querySelector('link#dark6');
if (matcher.matches) {
light1.remove();
light2.remove();
light3.remove();
light4.remove();
light5.remove();
light6.remove();
document.head.append(dark1);
document.head.append(dark2);
document.head.append(dark3);
document.head.append(dark4);
document.head.append(dark5);
document.head.append(dark6);
} else {
dark1.remove();
dark2.remove();
dark3.remove();
dark4.remove();
dark5.remove();
dark6.remove();
document.head.append(light1);
document.head.append(light2);
document.head.append(light3);
document.head.append(light4);
document.head.append(light5);
document.head.append(light6);
}
}
matcher = window.matchMedia('(prefers-color-scheme: dark)');
matcher.addListener(onUpdate);
onUpdate();
</script>
<link rel="stylesheet" href="//fonts.googleapis.com/css?family=Lato:300,300italic,400,400italic,700,700italic,900">
<link href="/css/main.css" rel="stylesheet">
<link href="/css/syntax.css" rel="stylesheet">
<script src="/javascript/main.js"></script>
<!-- Matomo -->
<script>
var _paq = window._paq = window._paq || [];
/* tracker methods like "setCustomDimension" should be called before "trackPageView" */
/* We explicitly disable cookie tracking to avoid privacy issues */
_paq.push(['disableCookies']);
_paq.push(['trackPageView']);
_paq.push(['enableLinkTracking']);
(function() {
var u="https://analytics.apache.org/";
_paq.push(['setTrackerUrl', u+'matomo.php']);
_paq.push(['setSiteId', '20']);
var d=document, g=d.createElement('script'), s=d.getElementsByTagName('script')[0];
g.async=true; g.src=u+'matomo.js'; s.parentNode.insertBefore(g,s);
})();
</script>
<!-- End Matomo Code -->
</head>
<body class="wrap">
<header>
<nav class="navbar navbar-expand-md navbar-dark bg-dark">
<a class="navbar-brand no-padding" href="/"><img src="/img/arrow-inverse-300px.png" height="40px"/></a>
<button class="navbar-toggler ml-auto" type="button" data-toggle="collapse" data-target="#arrow-navbar" aria-controls="arrow-navbar" aria-expanded="false" aria-label="Toggle navigation">
<span class="navbar-toggler-icon"></span>
</button>
<!-- Collect the nav links, forms, and other content for toggling -->
<div class="collapse navbar-collapse justify-content-end" id="arrow-navbar">
<ul class="nav navbar-nav">
<li class="nav-item"><a class="nav-link" href="/overview/" role="button" aria-haspopup="true" aria-expanded="false">Overview</a></li>
<li class="nav-item"><a class="nav-link" href="/faq/" role="button" aria-haspopup="true" aria-expanded="false">FAQ</a></li>
<li class="nav-item"><a class="nav-link" href="/blog" role="button" aria-haspopup="true" aria-expanded="false">Blog</a></li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownGetArrow" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Get Arrow
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownGetArrow">
<a class="dropdown-item" href="/install/">Install</a>
<a class="dropdown-item" href="/release/">Releases</a>
<a class="dropdown-item" href="https://github.com/apache/arrow">Source Code</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownDocumentation" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Documentation
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownDocumentation">
<a class="dropdown-item" href="/docs">Project Docs</a>
<a class="dropdown-item" href="/docs/format/Columnar.html">Format</a>
<hr/>
<a class="dropdown-item" href="/docs/c_glib">C GLib</a>
<a class="dropdown-item" href="/docs/cpp">C++</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/csharp/README.md">C#</a>
<a class="dropdown-item" href="https://godoc.org/github.com/apache/arrow/go/arrow">Go</a>
<a class="dropdown-item" href="/docs/java">Java</a>
<a class="dropdown-item" href="/docs/js">JavaScript</a>
<a class="dropdown-item" href="/julia/">Julia</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/matlab/README.md">MATLAB</a>
<a class="dropdown-item" href="/docs/python">Python</a>
<a class="dropdown-item" href="/docs/r">R</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/ruby/README.md">Ruby</a>
<a class="dropdown-item" href="https://docs.rs/arrow/latest">Rust</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownSubprojects" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Subprojects
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownSubprojects">
<a class="dropdown-item" href="/adbc">ADBC</a>
<a class="dropdown-item" href="/docs/format/Flight.html">Arrow Flight</a>
<a class="dropdown-item" href="/docs/format/FlightSql.html">Arrow Flight SQL</a>
<a class="dropdown-item" href="https://datafusion.apache.org">DataFusion</a>
<a class="dropdown-item" href="/nanoarrow">nanoarrow</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownCommunity" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Community
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownCommunity">
<a class="dropdown-item" href="/community/">Communication</a>
<a class="dropdown-item" href="/docs/developers/index.html">Contributing</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/issues">Issue Tracker</a>
<a class="dropdown-item" href="/committers/">Governance</a>
<a class="dropdown-item" href="/use_cases/">Use Cases</a>
<a class="dropdown-item" href="/powered_by/">Powered By</a>
<a class="dropdown-item" href="/visual_identity/">Visual Identity</a>
<a class="dropdown-item" href="/security/">Security</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/policies/conduct.html">Code of Conduct</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownASF" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
ASF Links
</a>
<div class="dropdown-menu dropdown-menu-right" aria-labelledby="navbarDropdownASF">
<a class="dropdown-item" href="https://www.apache.org/">ASF Website</a>
<a class="dropdown-item" href="https://www.apache.org/licenses/">License</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/sponsorship.html">Donate</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/thanks.html">Thanks</a>
<a class="dropdown-item" href="https://www.apache.org/security/">Security</a>
</div>
</li>
</ul>
</div><!-- /.navbar-collapse -->
</nav>
</header>
<div class="container p-4 pt-5">
<main role="main" class="pb-5">
<!--
-->
<h1 id="apache-arrow-1500-21-january-2024">Apache Arrow 15.0.0 (21 January 2024)</h1>
<p>This is a major release covering more than 1 months of development.</p>
<h2 id="download">Download</h2>
<ul>
<li><a href="https://www.apache.org/dyn/closer.lua/arrow/arrow-15.0.0/"><strong>Source Artifacts</strong></a></li>
<li><strong>Binary Artifacts</strong>
<ul>
<li><a href="https://apache.jfrog.io/artifactory/arrow/almalinux/">For AlmaLinux</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/amazon-linux/">For Amazon Linux</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/centos/">For CentOS</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/nuget/">For C#</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/debian/">For Debian</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/python/15.0.0/">For Python</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/ubuntu/">For Ubuntu</a></li>
</ul>
</li>
<li><a href="https://github.com/apache/arrow/releases/tag/apache-arrow-15.0.0">Git tag</a></li>
</ul>
<h2 id="contributors">Contributors</h2>
<p>This release includes 536 commits from 101 distinct contributors.</p>
<div class="language-console highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="gp">$</span><span class="w"> </span>git shortlog <span class="nt">-sn</span> apache-arrow-14.0.2..apache-arrow-15.0.0
<span class="go"> 67 dependabot[bot]
42 Sutou Kouhei
31 Raúl Cumplido
27 Josh Soref
20 Antoine Pitrou
19 Felipe Oliveira Carvalho
19 mwish
18 Dewey Dunnington
17 James Duong
16 Curt Hagenlocher
16 Dominik Moritz
16 Joris Van den Bossche
15 Jacob Wujciak-Jens
14 Hyunseok Seo
12 Alenka Frim
10 Yue
10 sgilmore10
9 Matt Topol
8 Dane Pitkin
8 Diego Fernández Giraldo
8 Thomas Newton
6 Bryce Mecum
5 Jin Shang
5 Jonathan Keane
5 Kevin Gurney
4 Benjamin Kietzman
4 Gang Wu
3 Ben Harkins
3 Ivan Chesnov
3 Judah Rand
3 Lei Hou
3 Rossi(Ruoxi) Sun
3 Tim Schaub
3 abandy
2 Alfonso Subiotto Marqués
2 David Li
2 Divyansh200102
2 Eero Lihavainen
2 Francis
2 Gavin Murrison
2 JB Onofré
2 Jeremy Aguilon
2 John
2 Junming Chen
2 Laurent Goujon
2 Nic Crane
2 Peter Andreas Entschev
2 Rok Mihevc
2 Stas Stepanov
2 Tsutomu Katsube
2 Vibhatha Lakmal Abeykoon
1 Abram Fleishman
1 Anja Kefala
1 Atheel Massalha
1 Benjamin Schmidt
1 Bob Plotts
1 Bryan Cutler
1 Carl Jackson
1 ChinYikMing
1 Chris Larsen
1 Christian Holm Christensen
1 Dan Homola
1 Dan Stone
1 Danyaal Khan
1 Diogo Teles Sant'Anna
1 Dongjoon Hyun
1 Fatemah Panahi
1 Fernando Mayer
1 Fokko Driesprong
1 Hattonuri
1 Hendrik Makait
1 Jiaxing Liang
1 Jinpeng
1 John Garland
1 Kyle Barron
1 LucasG0
1 Maximilian Muecke
1 Miguel Pragier
1 Miles
1 Neal Richardson
1 Nick Hughes
1 Paul Spangler
1 Phillip LeBlanc
1 Pierre Moulon
1 Quang Hoang
1 Ravjot Brar
1 Tammy DiPrima
1 Will Jones
1 William Ayd
1 Yifeng-Sigma
1 david dali susanibar arce
1 davidhcoe
1 frazar
1 jiawei liang
1 loicalleyne
1 orgadish
1 panbingkun
1 prmoore77
1 shibei
1 zhipeng
1 谢天
</span></code></pre></div></div>
<h2 id="patch-committers">Patch Committers</h2>
<p>The following Apache committers merged contributed patches to the repository.</p>
<div class="language-console highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="gp">$</span><span class="w"> </span>git shortlog <span class="nt">-sn</span> <span class="nt">--group</span><span class="o">=</span>trailer:signed-off-by apache-arrow-14.0.2..apache-arrow-15.0.0
<span class="go"> 154 Sutou Kouhei
56 David Li
50 Antoine Pitrou
35 Joris Van den Bossche
30 Jacob Wujciak-Jens
24 Raúl Cumplido
22 Matt Topol
19 Curt Hagenlocher
18 Kevin Gurney
16 Benjamin Kietzman
16 Felipe Oliveira Carvalho
14 mwish
13 Dewey Dunnington
12 AlenkaF
7 Nic Crane
3 Weston Pace
2 Gang Wu
2 Jonathan Keane
1 Josh Soref
1 Rok Mihevc
1 Will Jones
1 dependabot[bot]
</span></code></pre></div></div>
<h2 id="changelog">Changelog</h2>
<h2 id="apache-arrow-1500-2024-01-21-080000">Apache Arrow 15.0.0 (2024-01-21 08:00:00)</h2>
<h3 id="bug-fixes">Bug Fixes</h3>
<ul>
<li><a href="https://github.com/apache/arrow/issues/15192">GH-15192</a> - [C++] Bring back <code class="language-plaintext highlighter-rouge">case_when</code> tests for union types (#39308)</li>
<li><a href="https://github.com/apache/arrow/issues/32570">GH-32570</a> - [C++] Fix the issue of <code class="language-plaintext highlighter-rouge">ExecBatchBuilder</code> when appending consecutive tail rows with the same id may exceed buffer boundary (#39234)</li>
<li><a href="https://github.com/apache/arrow/issues/32662">GH-32662</a> - [C#] Make dictionaries in file and memory implementations work correctly and support integration tests (#39146)</li>
<li><a href="https://github.com/apache/arrow/issues/33475">GH-33475</a> - [Java] Add parameter binding for Prepared Statements in JDBC driver (#38404)</li>
<li><a href="https://github.com/apache/arrow/issues/34532">GH-34532</a> - [Java][FlightSQL] Change JDBC to handle multi-endpoints (#38521)</li>
<li><a href="https://github.com/apache/arrow/issues/34610">GH-34610</a> - [Java] Fix valueCount and field name when loading/transferring NullVector (#38973)</li>
<li><a href="https://github.com/apache/arrow/issues/34890">GH-34890</a> - [C++][Python] Add a no-op kernel for dictionary_encode(dictionary) (#38349)</li>
<li><a href="https://github.com/apache/arrow/issues/35497">GH-35497</a> - [C++] Use the latest tagged version of flatbuffers (#38192)</li>
<li><a href="https://github.com/apache/arrow/issues/36588">GH-36588</a> - [C#] Support blank column names and enable more integration tests. (#39167)</li>
<li><a href="https://github.com/apache/arrow/issues/36594">GH-36594</a> - [C++] Don’t use MSVC_VERSION to determin -fms-compatibility-version (#36595)</li>
<li><a href="https://github.com/apache/arrow/issues/36912">GH-36912</a> - [Java] JDBC driver stops consuming roots if it sees an empty root (#38590)</li>
<li><a href="https://github.com/apache/arrow/issues/37055">GH-37055</a> - [C++] Optimize hash kernels for Dictionary ChunkedArrays (#38394)</li>
<li><a href="https://github.com/apache/arrow/issues/37657">GH-37657</a> - [JS] Run bin scripts with ts-node (#38500)</li>
<li><a href="https://github.com/apache/arrow/issues/37726">GH-37726</a> - [Swift][FlightSQL] Update behavior to be similar to existing impls (#37764)</li>
<li><a href="https://github.com/apache/arrow/issues/37751">GH-37751</a> - [C++][Gandiva] Avoid registering exported functions multiple times in gandiva (#37752)</li>
<li><a href="https://github.com/apache/arrow/issues/37796">GH-37796</a> - [C++][Acero] Fix race condition caused by straggling input in the as-of-join node (#37839)</li>
<li><a href="https://github.com/apache/arrow/issues/37884">GH-37884</a> - [Swift] allow reading of unaligned FlatBuffers buffers (#38635)</li>
<li><a href="https://github.com/apache/arrow/issues/37969">GH-37969</a> - [C++][Parquet] add more closed file checks for ParquetFileWriter (#38390)</li>
<li><a href="https://github.com/apache/arrow/issues/38096">GH-38096</a> - [Java] FlightStream with metadata can cause error when closing (#38110)</li>
<li><a href="https://github.com/apache/arrow/issues/38198">GH-38198</a> - [Go] Fix AuthenticateBasicToken to be reliable behind proxies (#38199)</li>
<li><a href="https://github.com/apache/arrow/issues/38210">GH-38210</a> - [C++][FlightRPC] Add missing app_metadata arguments (#38231)</li>
<li><a href="https://github.com/apache/arrow/issues/38216">GH-38216</a> - [R] open_dataset(format = “json”) not documented (#38258)</li>
<li><a href="https://github.com/apache/arrow/issues/38242">GH-38242</a> - [Java] Fix incorrect internal struct accounting for DenseUnionVector#getBufferSizeFor (#38305)</li>
<li><a href="https://github.com/apache/arrow/issues/38254">GH-38254</a> - [Java] Add reusable buffer getters to char/binary vectors (#38266)</li>
<li><a href="https://github.com/apache/arrow/issues/38268">GH-38268</a> - [Java] Disable flaky TestFlightSqlStreams (#38319)</li>
<li><a href="https://github.com/apache/arrow/issues/38281">GH-38281</a> - [Go] Ensure CData imported arrays are freed on release (#38314)</li>
<li><a href="https://github.com/apache/arrow/issues/38297">GH-38297</a> - [C#] Fix build for .NET 4.7.2 (#38299)</li>
<li><a href="https://github.com/apache/arrow/issues/38304">GH-38304</a> - [C++][Parquet] Fix Valgrind memory leak in arrow-dataset-file-parquet-encryption-test (#38306)</li>
<li><a href="https://github.com/apache/arrow/issues/38307">GH-38307</a> - [CI] Remove gemfury_clean.rb (#38308)</li>
<li><a href="https://github.com/apache/arrow/issues/38318">GH-38318</a> - [Java][FlightRPC] Enable tests that leaked (#38719)</li>
<li><a href="https://github.com/apache/arrow/issues/38323">GH-38323</a> - [CI][Python] Use system gdb on test-conda-python (#38324)</li>
<li><a href="https://github.com/apache/arrow/issues/38363">GH-38363</a> - [Release][CI] Omit tests for main/maintenance branches on RC branch (#38365)</li>
<li><a href="https://github.com/apache/arrow/issues/38366">GH-38366</a> - [Java] Fix Murmur hash on buffers less than 4 bytes (#38368)</li>
<li><a href="https://github.com/apache/arrow/issues/38378">GH-38378</a> - [C++][Parquet] Don’t initialize OpenSSL explicitly with OpenSSL 1.1 (#38379)</li>
<li><a href="https://github.com/apache/arrow/issues/38382">GH-38382</a> - [R] Explicitly clean up <code class="language-plaintext highlighter-rouge">arrow_duck_connection()</code> on exit (#38495)</li>
<li><a href="https://github.com/apache/arrow/issues/38387">GH-38387</a> - [Java] Fix JDK8 compilation issue with TestAllTypes (#38388)</li>
<li><a href="https://github.com/apache/arrow/issues/38395">GH-38395</a> - [Go] fix rounding errors in decimal256 string functions (#38426)</li>
<li><a href="https://github.com/apache/arrow/issues/38399">GH-38399</a> - [Go][Parquet] DeltaBitPack decoder reset usedFirst after SetData (#38413)</li>
<li><a href="https://github.com/apache/arrow/issues/38401">GH-38401</a> - [C++] Re-generate flatbuffers C++ for Skyhook (#38405)</li>
<li><a href="https://github.com/apache/arrow/issues/38436">GH-38436</a> - [R] Test segfault on reading CSVs with non-UTF-8 encoding</li>
<li><a href="https://github.com/apache/arrow/issues/38439">GH-38439</a> - [Java][CI] Use Eclipse Temurin for all Java CI linux jobs (#38440)</li>
<li>
<table>
<tbody>
<tr>
<td><a href="https://github.com/apache/arrow/issues/38447">GH-38447</a> - [CI][Release] Don’t use “</td>
<td> </td>
<td>{exit,continue}” (#38486)</td>
</tr>
</tbody>
</table>
</li>
<li><a href="https://github.com/apache/arrow/issues/38458">GH-38458</a> - [Go] Add ValueLen to BinaryLike interface (#39242)</li>
<li><a href="https://github.com/apache/arrow/issues/38470">GH-38470</a> - [CI][Integration] Install jpype and build JNI c-data to run integration tests (#39502)</li>
<li><a href="https://github.com/apache/arrow/issues/38477">GH-38477</a> - [Go] Fixing decimal 128 rounding issue (#38478)</li>
<li><a href="https://github.com/apache/arrow/issues/38479">GH-38479</a> - [C++] Avoid passing null pointer to LZ4 frame decompressor (#39125)</li>
<li><a href="https://github.com/apache/arrow/issues/38503">GH-38503</a> - [Go][Parquet] Make the arrow column writer internal (#38727)</li>
<li><a href="https://github.com/apache/arrow/issues/38503">GH-38503</a> - [Go][Parquet] Style improvement for using ArrowColumnWriter (#38581)</li>
<li><a href="https://github.com/apache/arrow/issues/38516">GH-38516</a> - [Go][Parquet] Increment the number of rows written when appending a new row group (#38517)</li>
<li><a href="https://github.com/apache/arrow/issues/38535">GH-38535</a> - [Python] Fix S3FileSystem equals None segfault (#39276)</li>
<li><a href="https://github.com/apache/arrow/issues/38554">GH-38554</a> - [Release][Website] post-03-website.sh doesn’t quote current.date (#38555)</li>
<li><a href="https://github.com/apache/arrow/issues/38556">GH-38556</a> - [C++] Add missing explicit size_t cast for i386 (#38557)</li>
<li><a href="https://github.com/apache/arrow/issues/38594">GH-38594</a> - [Docs][C++][Gandiva] Document how to register Gandiva external functions (#38763)</li>
<li><a href="https://github.com/apache/arrow/issues/38599">GH-38599</a> - [Docs] Update Headers (#38696)</li>
<li><a href="https://github.com/apache/arrow/issues/38614">GH-38614</a> - [Java] Add VarBinary and VarCharWriter helper methods to more writers (#38631)</li>
<li><a href="https://github.com/apache/arrow/issues/38624">GH-38624</a> - [C++] Fix: add TestingEqualOptions for gtest functions. (#38642)</li>
<li><a href="https://github.com/apache/arrow/issues/38630">GH-38630</a> - [MATLAB] <code class="language-plaintext highlighter-rouge">arrow.array.BooleanArray</code>’s <code class="language-plaintext highlighter-rouge">toMATLAB</code> method does not take slice offsets into account (#38636)</li>
<li><a href="https://github.com/apache/arrow/issues/38653">GH-38653</a> - [Packaging][Java][Python][Ruby] Raise the minimum macOS version to 10.15 catalina to allow using new APIs in C++17 (#38677)</li>
<li><a href="https://github.com/apache/arrow/issues/38683">GH-38683</a> - [Python][Docs] Update docstrings for Time32Type and Time64Type (#39059)</li>
<li><a href="https://github.com/apache/arrow/issues/38684">GH-38684</a> - [Integration] Try to strengthen C Data Interface testing (#38846)</li>
<li><a href="https://github.com/apache/arrow/issues/38697">GH-38697</a> - [C++][Gandiva] Use arrow io util to replace std::filesystem::path in gandiva (#38698)</li>
<li><a href="https://github.com/apache/arrow/issues/38709">GH-38709</a> - [C++] Protect against PREALLOCATE preprocessor defined on macOS (#38760)</li>
<li><a href="https://github.com/apache/arrow/issues/38711">GH-38711</a> - [CI] Rollback aws-cli for preview documentation (#38723)</li>
<li><a href="https://github.com/apache/arrow/issues/38725">GH-38725</a> - [Java] decompression in Lz4CompressionCodec.java does not set writer index (#38840)</li>
<li><a href="https://github.com/apache/arrow/issues/38728">GH-38728</a> - [Go] ipc: put lz4 decompression buffers back into sync.Pool (#38729)</li>
<li><a href="https://github.com/apache/arrow/issues/38737">GH-38737</a> - [Java] Fix JDBC caching of SqlInfo values (#38739)</li>
<li><a href="https://github.com/apache/arrow/issues/38738">GH-38738</a> - [C++] Check variadic buffer counts in bounds (#38740)</li>
<li><a href="https://github.com/apache/arrow/issues/38745">GH-38745</a> - [Integration] Fix huge integration test (#38746)</li>
<li><a href="https://github.com/apache/arrow/issues/38762">GH-38762</a> - [R] Versions of R and RTools in CI config are no longer current</li>
<li><a href="https://github.com/apache/arrow/issues/38764">GH-38764</a> - [Java] Clarify warning about <code class="language-plaintext highlighter-rouge">--add-opens=java.base/java.nio=ALL-UNNAMED</code> (#38765)</li>
<li><a href="https://github.com/apache/arrow/issues/38782">GH-38782</a> - [C++][FS][Azure] Do nothing for CreateDir(“/container”, true) (#38783)</li>
<li><a href="https://github.com/apache/arrow/issues/38795">GH-38795</a> - [Go] Fix race GetToTimeFunc for Timestamp (#38797)</li>
<li><a href="https://github.com/apache/arrow/issues/38811">GH-38811</a> - [R] Actually use fetched cmake on macos (#39453)</li>
<li><a href="https://github.com/apache/arrow/issues/38816">GH-38816</a> - [C#] Fix IArrowRecord implementation on StructArray (#38827)</li>
<li><a href="https://github.com/apache/arrow/issues/38823">GH-38823</a> - Fix TestArrowReaderAdHoc.ReadFloat16Files to use new uncompressed files (#38825)</li>
<li><a href="https://github.com/apache/arrow/issues/38832">GH-38832</a> - [Java] Avoid building twice in <code class="language-plaintext highlighter-rouge">ci/scripts/java_build.sh</code> (#38829)</li>
<li><a href="https://github.com/apache/arrow/issues/38844">GH-38844</a> - [C++] S3FileSystem export s3 sdk config “use_virtual_addressing” to arrow::fs::S3Options (#38858)</li>
<li><a href="https://github.com/apache/arrow/issues/38851">GH-38851</a> - Website: Contributing link is not working</li>
<li><a href="https://github.com/apache/arrow/issues/38879">GH-38879</a> - [C++][Gandiva] Fix Gandiva to_date function’s validation for supress errors parameter (#38987)</li>
<li><a href="https://github.com/apache/arrow/issues/38883">GH-38883</a> - [Docs] Fix struct example to show hiding a child’s entry (#38898)</li>
<li><a href="https://github.com/apache/arrow/issues/38906">GH-38906</a> - [R] Improve Windows CI configuration (#38927)</li>
<li><a href="https://github.com/apache/arrow/issues/38921">GH-38921</a> - [CI] Fix spelling (#38922)</li>
<li><a href="https://github.com/apache/arrow/issues/38925">GH-38925</a> - [CI] Fix spelling (#38926)</li>
<li><a href="https://github.com/apache/arrow/issues/38928">GH-38928</a> - [R] Fix spelling (#38929)</li>
<li><a href="https://github.com/apache/arrow/issues/38930">GH-38930</a> - [Java] Fix spelling (#38931)</li>
<li><a href="https://github.com/apache/arrow/issues/38932">GH-38932</a> - [GO] Fix spelling (#38933)</li>
<li><a href="https://github.com/apache/arrow/issues/38938">GH-38938</a> - [FlightRPC] Fix spelling (#38939)</li>
<li><a href="https://github.com/apache/arrow/issues/38940">GH-38940</a> - [Ruby] Fix spelling (#38941)</li>
<li><a href="https://github.com/apache/arrow/issues/38942">GH-38942</a> - [C#] Fix spelling (#38943)</li>
<li><a href="https://github.com/apache/arrow/issues/38944">GH-38944</a> - [Python] Fix spelling (#38945)</li>
<li><a href="https://github.com/apache/arrow/issues/38946">GH-38946</a> - [MATLAB] Fix spelling (#38947)</li>
<li><a href="https://github.com/apache/arrow/issues/38948">GH-38948</a> - [Swift] Fix spelling (#38949)</li>
<li><a href="https://github.com/apache/arrow/issues/38950">GH-38950</a> - [Docs] Fix spelling (#38951)</li>
<li><a href="https://github.com/apache/arrow/issues/38952">GH-38952</a> - [Format] Fix spelling (#38953)</li>
<li><a href="https://github.com/apache/arrow/issues/38954">GH-38954</a> - [Dev] Fix spelling (#38955)</li>
<li><a href="https://github.com/apache/arrow/issues/38956">GH-38956</a> - [Gandiva] Fix spelling (#38957)</li>
<li><a href="https://github.com/apache/arrow/issues/38960">GH-38960</a> - [C++] Fix spelling (acero) (#38961)</li>
<li><a href="https://github.com/apache/arrow/issues/38964">GH-38964</a> - [C++] Fix spelling (compute) (#38965)</li>
<li><a href="https://github.com/apache/arrow/issues/38966">GH-38966</a> - [C++] Fix spelling (util) (#38967)</li>
<li><a href="https://github.com/apache/arrow/issues/38968">GH-38968</a> - [C++] Fix spelling (dataset) (#38969)</li>
<li><a href="https://github.com/apache/arrow/issues/38971">GH-38971</a> - [C++] Fix spelling (filesystem) (#38972)</li>
<li><a href="https://github.com/apache/arrow/issues/38975">GH-38975</a> - [Dev] Fix spelling (#38976)</li>
<li><a href="https://github.com/apache/arrow/issues/38977">GH-38977</a> - [C++] Fix spelling (#38978)</li>
<li><a href="https://github.com/apache/arrow/issues/38979">GH-38979</a> - [C++] Fix spelling (#38980)</li>
<li><a href="https://github.com/apache/arrow/issues/38981">GH-38981</a> - [R][Release] Don’t update version.json on compatible version release (#38982)</li>
<li><a href="https://github.com/apache/arrow/issues/39014">GH-39014</a> - [Java] Add default truststore along with KeychainStore when on Mac system (#39235)</li>
<li><a href="https://github.com/apache/arrow/issues/39031">GH-39031</a> - [Docs] Remove misspelled rule from contrib css (#39032)</li>
<li><a href="https://github.com/apache/arrow/issues/39045">GH-39045</a> - [C++][Acero] union node output batches should be unordered (#39046)</li>
<li><a href="https://github.com/apache/arrow/issues/39113">GH-39113</a> - [Integration][Flight][Java] Fix occasional failure starting Java server (#39115)</li>
<li><a href="https://github.com/apache/arrow/issues/39116">GH-39116</a> - [Go] Fix CI Staticcheck (#39117)</li>
<li><a href="https://github.com/apache/arrow/issues/39126">GH-39126</a> - [C++][CI] Fix Valgrind failures (#39127)</li>
<li><a href="https://github.com/apache/arrow/issues/39130">GH-39130</a> - [CI][GLib][Windows] Use old Ruby as workaround for load error (#39168)</li>
<li><a href="https://github.com/apache/arrow/issues/39136">GH-39136</a> - [C++] Remove needless system Protobuf dependency with -DARROW_HDFS=ON (#39137)</li>
<li><a href="https://github.com/apache/arrow/issues/39138">GH-39138</a> - [R] Fix implicit conversion warnings (#39250)</li>
<li><a href="https://github.com/apache/arrow/issues/39156">GH-39156</a> - [C++][Compute] Fix negative duration division (#39158)</li>
<li><a href="https://github.com/apache/arrow/issues/39163">GH-39163</a> - [C++] Add missing data copy in StreamDecoder::Consume(data) (#39164)</li>
<li><a href="https://github.com/apache/arrow/issues/39185">GH-39185</a> - [C++] Remove compiler warnings with <code class="language-plaintext highlighter-rouge">-Wconversion -Wno-sign-conversion</code> in public headers (#39186)</li>
<li><a href="https://github.com/apache/arrow/issues/39191">GH-39191</a> - [R] throw error when <code class="language-plaintext highlighter-rouge">string_replace</code> is passed vector of values in <code class="language-plaintext highlighter-rouge">pattern</code> (#39219)</li>
<li><a href="https://github.com/apache/arrow/issues/39238">GH-39238</a> - [Go] PATCH Prevents empty record to be appended to empty resultset (#39239)</li>
<li><a href="https://github.com/apache/arrow/issues/39288">GH-39288</a> - [Java][FlightSQL] Update Apache Avatica to version 1.24.0 (#39325)</li>
<li><a href="https://github.com/apache/arrow/issues/39306">GH-39306</a> - [C++][Benchmarking] Remove hardcoded min times (#39307)</li>
<li><a href="https://github.com/apache/arrow/issues/39327">GH-39327</a> - [Java] define assemble descriptor for new custom maven plugin project (#39331)</li>
<li><a href="https://github.com/apache/arrow/issues/39333">GH-39333</a> - [C++] Don’t use “if constexpr” in lambda (#39334)</li>
<li><a href="https://github.com/apache/arrow/issues/39359">GH-39359</a> - [CI][C++] Remove MinGW MINGW32 C++ job (#39376)</li>
<li><a href="https://github.com/apache/arrow/issues/39384">GH-39384</a> - [C++] Disable -Werror=attributes for Azure SDK’s identity.hpp (#39448)</li>
<li><a href="https://github.com/apache/arrow/issues/39387">GH-39387</a> - [C++] Fix compile warning (#39389)</li>
<li><a href="https://github.com/apache/arrow/issues/39421">GH-39421</a> - [CI][Ruby] Update to using Ubuntu 22.04 on test-ruby and test-c-glib nightly jobs (#39422)</li>
<li><a href="https://github.com/apache/arrow/issues/39423">GH-39423</a> - [CI][JS] TypeScript: Compilation failed on yarn build for several CI jobs</li>
<li><a href="https://github.com/apache/arrow/issues/39425">GH-39425</a> - [CI] Fix import to match new substrait repo structure (#39426)</li>
<li><a href="https://github.com/apache/arrow/issues/39433">GH-39433</a> - [Ruby] Add support for Table.load(format: json) options (#39464)</li>
<li><a href="https://github.com/apache/arrow/issues/39437">GH-39437</a> - [CI][Python] Update pandas tests failing on pandas nightly CI build (#39498)</li>
<li><a href="https://github.com/apache/arrow/issues/39468">GH-39468</a> - [Java] Fix site build for docs (#39471)</li>
<li><a href="https://github.com/apache/arrow/issues/39469">GH-39469</a> - [CI][JS] Force node 20 on JS build on arm64 to fix build issues (#39499)</li>
<li><a href="https://github.com/apache/arrow/issues/39488">GH-39488</a> - [Ruby] Add support for ChunkedArray in Ractor (#39490)</li>
<li><a href="https://github.com/apache/arrow/issues/39517">GH-39517</a> - [C++] Disable parallelism for jemalloc external project (#39522)</li>
<li><a href="https://github.com/apache/arrow/issues/39562">GH-39562</a> - [C++][Parquet] Fix crash in test_parquet_dataset_lazy_filtering (#39632)</li>
<li><a href="https://github.com/apache/arrow/issues/39564">GH-39564</a> - [CI][Java] Set correct version on Java BOM (#39580)</li>
<li><a href="https://github.com/apache/arrow/issues/39584">GH-39584</a> - [R] fallback to source gracefully (#39587)</li>
<li><a href="https://github.com/apache/arrow/issues/39588">GH-39588</a> - [CI][Go] Add CGO_ENABLED=1 to cdata_integration build to fix macOS build with conda (#39589)</li>
<li><a href="https://github.com/apache/arrow/issues/39598">GH-39598</a> - [C#] Fix verification script (#39605)</li>
<li><a href="https://github.com/apache/arrow/issues/39604">GH-39604</a> - [JS] Do not use resizable buffers yet (#39607)</li>
<li><a href="https://github.com/apache/arrow/issues/39628">GH-39628</a> - [C++] Disable parallelism for all `make`-based externalProjects when CMake &gt;= 3.28 is used</li>
</ul>
<h3 id="new-features-and-improvements">New Features and Improvements</h3>
<ul>
<li><a href="https://github.com/apache/arrow/issues/14936">GH-14936</a> - [Java] Remove netty dependency from arrow-vector (#38493)</li>
<li><a href="https://github.com/apache/arrow/issues/28994">GH-28994</a> - [C++][JSON] Change the max rows to Unlimited(int_32) (#38582)</li>
<li><a href="https://github.com/apache/arrow/issues/30117">GH-30117</a> - [C++][Python] Add “Z” to the end of timestamp print string when tz defined (#39272)</li>
<li><a href="https://github.com/apache/arrow/issues/30717">GH-30717</a> - [C#] Add ToString() methods to Arrow classes (#36566)</li>
<li><a href="https://github.com/apache/arrow/issues/31303">GH-31303</a> - [Python] Remove the legacy ParquetDataset custom python-based implementation (#39112)</li>
<li><a href="https://github.com/apache/arrow/issues/31579">GH-31579</a> - [C#] : Remove out-of-support versions of .NET and update C# README (#39165)</li>
<li><a href="https://github.com/apache/arrow/issues/33500">GH-33500</a> - [Python] add <code class="language-plaintext highlighter-rouge">Table.to/from_struct_array</code> (#38520)</li>
<li><a href="https://github.com/apache/arrow/issues/33984">GH-33984</a> - [C++][Python] DLPack implementation for Arrow Arrays (producer) (#38472)</li>
<li><a href="https://github.com/apache/arrow/issues/34316">GH-34316</a> - [Python] FixedSizeListArray.from_arrays supports mask parameter (#39396)</li>
<li><a href="https://github.com/apache/arrow/issues/34569">GH-34569</a> - [C++] Diffing of Run-End Encoded arrays (#35003)</li>
<li><a href="https://github.com/apache/arrow/issues/34636">GH-34636</a> - [C#] Reduce allocations when using ArrayPool (#39166)</li>
<li><a href="https://github.com/apache/arrow/issues/35260">GH-35260</a> - [C++][Python][R] Allow users to adjust S3 log level by environment variable (#38267)</li>
<li><a href="https://github.com/apache/arrow/issues/35331">GH-35331</a> - [Python] Expose Parquet sorting metadata (#37665)</li>
<li><a href="https://github.com/apache/arrow/issues/35344">GH-35344</a> - [C++][Format] Implementation of the LIST_VIEW and LARGE_LIST_VIEW array formats (#35345)</li>
<li><a href="https://github.com/apache/arrow/issues/35560">GH-35560</a> - [C++] Use Cast() instead of CastTo() for Scalar in test (#39044)</li>
<li><a href="https://github.com/apache/arrow/issues/36036">GH-36036</a> - [C++][Python][Parquet] Implement Float16 logical type (#36073)</li>
<li><a href="https://github.com/apache/arrow/issues/36044">GH-36044</a> - [Python][Docs] Added ParquetFileFragment to the API reference docs (#38277)</li>
<li><a href="https://github.com/apache/arrow/issues/36099">GH-36099</a> - [C++] Add Utf8View and BinaryView to the c ABI (#38443)</li>
<li><a href="https://github.com/apache/arrow/issues/36441">GH-36441</a> - [Python] Make <code class="language-plaintext highlighter-rouge">CacheOptions</code> configurable from Python (#36627)</li>
<li><a href="https://github.com/apache/arrow/issues/36760">GH-36760</a> - [Go] Add Avro OCF reader (#37115)</li>
<li><a href="https://github.com/apache/arrow/issues/36815">GH-36815</a> - [C#] : Enable net472 tests under Windows (#36818)</li>
<li><a href="https://github.com/apache/arrow/issues/36898">GH-36898</a> - [CI] Hashpin Sensitive GitHub Actions (#37676)</li>
<li><a href="https://github.com/apache/arrow/issues/37002">GH-37002</a> - [C++][Parquet] Add api to get RecordReader from RowGroupReader (#37003)</li>
<li><a href="https://github.com/apache/arrow/issues/37061">GH-37061</a> - [Docs][Format] Clarify semantics of GetSchema in FSQL (#38549)</li>
<li><a href="https://github.com/apache/arrow/issues/37199">GH-37199</a> - [C++] Expose a span converter for Buffer and ArraySpan (#38027)</li>
<li><a href="https://github.com/apache/arrow/issues/37242">GH-37242</a> - [Python][Parquet] Parquet Support write and validate Page CRC (#38360)</li>
<li><a href="https://github.com/apache/arrow/issues/37312">GH-37312</a> - [Python][Docs] Update Python docstrings to reflect new parquet encoding option (#38070)</li>
<li><a href="https://github.com/apache/arrow/issues/37359">GH-37359</a> - [C#] Add ToList() to Decimal128Array and Decimal256Array (#37383)</li>
<li><a href="https://github.com/apache/arrow/issues/37378">GH-37378</a> - [C++] Add A Dictionary Compaction Function For DictionaryArray (#37418)</li>
<li><a href="https://github.com/apache/arrow/issues/37429">GH-37429</a> - [C++] Add arrow::ipc::StreamDecoder::Reset() (#37970)</li>
<li><a href="https://github.com/apache/arrow/issues/37511">GH-37511</a> - [C++] Implement file reads for Azure filesystem (#38269)</li>
<li><a href="https://github.com/apache/arrow/issues/37582">GH-37582</a> - [Go][Parquet] Implement Float16 logical type (#37599)</li>
<li><a href="https://github.com/apache/arrow/issues/37592">GH-37592</a> - [MATLAB] Add <code class="language-plaintext highlighter-rouge">NumRows</code> property to <code class="language-plaintext highlighter-rouge">arrow.tabular.RecordBatch</code> (#38215)</li>
<li><a href="https://github.com/apache/arrow/issues/37710">GH-37710</a> - [C++][Integration] Add C++ Utf8View implementation (#37792)</li>
<li><a href="https://github.com/apache/arrow/issues/37753">GH-37753</a> - [C++][Gandiva] Add external function registry support (#38116)</li>
<li><a href="https://github.com/apache/arrow/issues/37812">GH-37812</a> - [MATLAB] Add <code class="language-plaintext highlighter-rouge">arrow.type.ListType</code> MATLAB class (#38189)</li>
<li><a href="https://github.com/apache/arrow/issues/37815">GH-37815</a> - [MATLAB] Add <code class="language-plaintext highlighter-rouge">arrow.array.ListArray</code> MATLAB class (#38357)</li>
<li><a href="https://github.com/apache/arrow/issues/37848">GH-37848</a> - [C++][Gandiva] Migrate LLVM JIT engine from MCJIT to ORC v2/LLJIT (#39098)</li>
<li><a href="https://github.com/apache/arrow/issues/37857">GH-37857</a> - [Python][Dataset] Expose file size to python dataset (#37868)</li>
<li><a href="https://github.com/apache/arrow/issues/37889">GH-37889</a> - [Java][Doc] Improve JDBC driver documentation (#38469)</li>
<li><a href="https://github.com/apache/arrow/issues/37895">GH-37895</a> - [C++] Feature: support concatenate recordbatches. (#37896)</li>
<li><a href="https://github.com/apache/arrow/issues/37910">GH-37910</a> - [Java][Integration] Implement C Data Interface integration testing (#38248)</li>
<li><a href="https://github.com/apache/arrow/issues/37943">GH-37943</a> - [Java] Add parquet file with all supported types (#38249)</li>
<li><a href="https://github.com/apache/arrow/issues/37979">GH-37979</a> - [C++] Add support for specifying custom Array opening and closing delimiters to <code class="language-plaintext highlighter-rouge">arrow::PrettyPrintDelimiters</code> (#38187)</li>
<li><a href="https://github.com/apache/arrow/issues/38022">GH-38022</a> - [Java][FlightRPC] Expose app_metadata on FlightInfo and FlightEndpoint (#38331)</li>
<li><a href="https://github.com/apache/arrow/issues/38024">GH-38024</a> - [Java][FlightRPC] Expose appMetadata through JDBC ResultSet (#38781)</li>
<li><a href="https://github.com/apache/arrow/issues/38033">GH-38033</a> - [R] Allow <code class="language-plaintext highlighter-rouge">code()</code> to return package name prefix. (#38144)</li>
<li><a href="https://github.com/apache/arrow/issues/38042">GH-38042</a> - [C++][Benchmark] Add non-stream Codec Compression/Decompression (#38067)</li>
<li><a href="https://github.com/apache/arrow/issues/38117">GH-38117</a> - [C++][Parquet] Change DictEncoder dtor checking to warning log (#38118)</li>
<li><a href="https://github.com/apache/arrow/issues/38131">GH-38131</a> - [Swift][CI] Add linting and fix linting errors (#38133)</li>
<li><a href="https://github.com/apache/arrow/issues/38153">GH-38153</a> - [C#] expose ArrayDataConcatenator.Concatenate (#38154)</li>
<li><a href="https://github.com/apache/arrow/issues/38164">GH-38164</a> - [MATLAB] Rename <code class="language-plaintext highlighter-rouge">Length</code> property on <code class="language-plaintext highlighter-rouge">arrow.array.Array</code> and <code class="language-plaintext highlighter-rouge">arrow.array.ChunkedArray</code> to <code class="language-plaintext highlighter-rouge">NumElements</code> (#38190)</li>
<li><a href="https://github.com/apache/arrow/issues/38166">GH-38166</a> - [MATLAB] Improve tabular object display (#38482)</li>
<li><a href="https://github.com/apache/arrow/issues/38246">GH-38246</a> - [JAVA] added new getTransferPair() function that takes in a Field type for Complex Type Vectors (#38261)</li>
<li><a href="https://github.com/apache/arrow/issues/38264">GH-38264</a> - [Java][Packaging] Add BOM file (#38336)</li>
<li><a href="https://github.com/apache/arrow/issues/38271">GH-38271</a> - [C++][Parquet] Support reading parquet files with multiple gzip members (#38272)</li>
<li><a href="https://github.com/apache/arrow/issues/38300">GH-38300</a> - [Dev][Docs] Update dev/README.md for the current merge_arrow_pr.py (#38301)</li>
<li><a href="https://github.com/apache/arrow/issues/38310">GH-38310</a> - [MATLAB] Create the testing guideline document for testing MATLAB interface (#38459)</li>
<li><a href="https://github.com/apache/arrow/issues/38316">GH-38316</a> - [C#] Implement interval types (#39043)</li>
<li><a href="https://github.com/apache/arrow/issues/38326">GH-38326</a> - [C++][Parquet] check the decompressed page size same as size in page header (#38327)</li>
<li><a href="https://github.com/apache/arrow/issues/38330">GH-38330</a> - [C++][Azure] Use properties for input stream metadata (#38524)</li>
<li><a href="https://github.com/apache/arrow/issues/38333">GH-38333</a> - [C++][FS][Azure] Implement file writes (#38780)</li>
<li><a href="https://github.com/apache/arrow/issues/38335">GH-38335</a> - [C++] Implement <code class="language-plaintext highlighter-rouge">GetFileInfo</code> for a single file in Azure filesystem (#38505)</li>
<li><a href="https://github.com/apache/arrow/issues/38339">GH-38339</a> - [C++][CMake] Use transitive dependency for system GoogleTest (#38340)</li>
<li><a href="https://github.com/apache/arrow/issues/38341">GH-38341</a> - [Python] Remove usage of pandas internals DatetimeTZBlock (#38321)</li>
<li><a href="https://github.com/apache/arrow/issues/38346">GH-38346</a> - [C++][Parquet] Use new encrypted files for page index encryption test (#38347)</li>
<li><a href="https://github.com/apache/arrow/issues/38348">GH-38348</a> - [C#] Make PrimitiveArray<T> support IReadOnlyList&lt;T?&gt; (#38680)</T></li>
<li><a href="https://github.com/apache/arrow/issues/38351">GH-38351</a> - [C#] Add SqlDecimal support to Decimal128Array (#38481)</li>
<li><a href="https://github.com/apache/arrow/issues/38354">GH-38354</a> - [MATLAB] Implement <code class="language-plaintext highlighter-rouge">fromMATLAB</code> method for <code class="language-plaintext highlighter-rouge">arrow.array.ListArray</code> (#38561)</li>
<li><a href="https://github.com/apache/arrow/issues/38361">GH-38361</a> - Add validation logic for <code class="language-plaintext highlighter-rouge">offsets</code> and <code class="language-plaintext highlighter-rouge">values</code> to <code class="language-plaintext highlighter-rouge">arrow.array.ListArray.fromArrays</code> (#38531)</li>
<li><a href="https://github.com/apache/arrow/issues/38376">GH-38376</a> - [R] : Add <code class="language-plaintext highlighter-rouge">dimnames</code> method to <code class="language-plaintext highlighter-rouge">Dataset</code> class (#38377)</li>
<li><a href="https://github.com/apache/arrow/issues/38381">GH-38381</a> - [C++][Acero] Create a sorted merge node (#38380)</li>
<li><a href="https://github.com/apache/arrow/issues/38398">GH-38398</a> - [MATLAB] Improve array display (#38400)</li>
<li><a href="https://github.com/apache/arrow/issues/38402">GH-38402</a> - [CI][Integration] Provide wrapper scripts for integration testing (#38403)</li>
<li><a href="https://github.com/apache/arrow/issues/38415">GH-38415</a> - [MATLAB] Add indexing “slice” method to C++ Array Proxy class (#38674)</li>
<li><a href="https://github.com/apache/arrow/issues/38417">GH-38417</a> - [MATLAB] Implement a <code class="language-plaintext highlighter-rouge">TableTypeValidator</code> class that validates a MATLAB <code class="language-plaintext highlighter-rouge">cell</code> array contains only <code class="language-plaintext highlighter-rouge">table</code>s that share the same schema (#38551)</li>
<li><a href="https://github.com/apache/arrow/issues/38418">GH-38418</a> - [MATLAB] Add method for extracting one row of an <code class="language-plaintext highlighter-rouge">arrow.tabular.Table</code> as a string (#38463)</li>
<li><a href="https://github.com/apache/arrow/issues/38419">GH-38419</a> - [MATLAB] Implement a <code class="language-plaintext highlighter-rouge">ClassTypeValidator</code> class that validates a MATLAB <code class="language-plaintext highlighter-rouge">cell</code> array contains only values of the same class type. (#38530)</li>
<li><a href="https://github.com/apache/arrow/issues/38420">GH-38420</a> - [MATLAB] Implement a <code class="language-plaintext highlighter-rouge">DatetimeValidator</code> class that validates a MATLAB <code class="language-plaintext highlighter-rouge">cell</code> array contains only values of zoned or unzoned <code class="language-plaintext highlighter-rouge">datetime</code>s (#38533)</li>
<li><a href="https://github.com/apache/arrow/issues/38424">GH-38424</a> - [CI][C++] Use Fedora 38 instead of 35 (#38425)</li>
<li><a href="https://github.com/apache/arrow/issues/38452">GH-38452</a> - [C++][Benchmark] Adding benchmark for LZ4/Snappy Compression (#38453)</li>
<li><a href="https://github.com/apache/arrow/issues/38457">GH-38457</a> - [C++] Support LogicalNullCount for DictionaryArray (#38681)</li>
<li><a href="https://github.com/apache/arrow/issues/38460">GH-38460</a> - [Java][FlightRPC] Add mTLS support for Flight SQL JDBC driver (#38461)</li>
<li><a href="https://github.com/apache/arrow/issues/38462">GH-38462</a> - [Go][Parquet] Handle Boolean RLE encoding/decoding (#38367)</li>
<li><a href="https://github.com/apache/arrow/issues/38483">GH-38483</a> - [C#] Add support for more decimal conversions (#38508)</li>
<li><a href="https://github.com/apache/arrow/issues/38506">GH-38506</a> - [Go][Parquet] Add NumRows and RowGroupNumRows to pqarrow.FileWriter (#38507)</li>
<li><a href="https://github.com/apache/arrow/issues/38511">GH-38511</a> - [Java] Add getTransferPair(Field, BufferAllocator, CallBack) for StructVector and MapVector (#38512)</li>
<li><a href="https://github.com/apache/arrow/issues/38528">GH-38528</a> - [Python][Compute] Describe strptime format semantics (#38665)</li>
<li><a href="https://github.com/apache/arrow/issues/38537">GH-38537</a> - [Java] upgrade to netty 4.1.100.Final (#38538)</li>
<li><a href="https://github.com/apache/arrow/issues/38541">GH-38541</a> - [MATLAB] Add remaining tests for arrow tabular object display (#38564)</li>
<li><a href="https://github.com/apache/arrow/issues/38542">GH-38542</a> - [C++][Parquet] Faster scalar BYTE_STREAM_SPLIT (#38529)</li>
<li><a href="https://github.com/apache/arrow/issues/38562">GH-38562</a> - [Packaging] Add support for Ubuntu 23.10 (#38563)</li>
<li><a href="https://github.com/apache/arrow/issues/38576">GH-38576</a> - [Java] Change JDBC driver to optionally preserve cookies and auth tokens when getting streams (#38580)</li>
<li><a href="https://github.com/apache/arrow/issues/38578">GH-38578</a> - [Java][FlightSQL] Remove joda usage from flight-sql library (#38579)</li>
<li><a href="https://github.com/apache/arrow/issues/38589">GH-38589</a> - [C++][Gandiva] Support registering external C functions (#38632)</li>
<li><a href="https://github.com/apache/arrow/issues/38597">GH-38597</a> - [C++] Implement GetFileInfo(selector) for Azure filesystem (#39009)</li>
<li><a href="https://github.com/apache/arrow/issues/38602">GH-38602</a> - [R] Add missing <code class="language-plaintext highlighter-rouge">prod</code> for summarize (#38601)</li>
<li><a href="https://github.com/apache/arrow/issues/38627">GH-38627</a> - [Java][FlightRPC] Handle null parameter values (#38628)</li>
<li><a href="https://github.com/apache/arrow/issues/38648">GH-38648</a> - [Java] Regenerate Flatbuffers (#38650)</li>
<li><a href="https://github.com/apache/arrow/issues/38652">GH-38652</a> - [MATLAB] Add tests about time precision preservation when converting MATLAB duration to <code class="language-plaintext highlighter-rouge">arrow.array.Time32Array</code> and <code class="language-plaintext highlighter-rouge">arrow.array.Time64Array</code> (#38661)</li>
<li><a href="https://github.com/apache/arrow/issues/38662">GH-38662</a> - [Java] Add comparators (#38669)</li>
<li><a href="https://github.com/apache/arrow/issues/38699">GH-38699</a> - [C++][FS][Azure] Implement <code class="language-plaintext highlighter-rouge">CreateDir()</code> (#38708)</li>
<li><a href="https://github.com/apache/arrow/issues/38700">GH-38700</a> - [C++][FS][Azure] Implement <code class="language-plaintext highlighter-rouge">DeleteDir()</code> (#38793)</li>
<li><a href="https://github.com/apache/arrow/issues/38701">GH-38701</a> - [C++][FS][Azure] Implement <code class="language-plaintext highlighter-rouge">DeleteDirContents()</code> (#38888)</li>
<li><a href="https://github.com/apache/arrow/issues/38702">GH-38702</a> - [C++] : Implement AzureFileSystem::DeleteRootDirContents (#39151)</li>
<li><a href="https://github.com/apache/arrow/issues/38705">GH-38705</a> - [C++][FS][Azure] Implement CopyFile() (#39058)</li>
<li><a href="https://github.com/apache/arrow/issues/38712">GH-38712</a> - [Python] Remove dead code in _reconstruct_block (#38714)</li>
<li><a href="https://github.com/apache/arrow/issues/38718">GH-38718</a> - [Go][Format][Integration] Add StringView/BinaryView to Go implementation (#35769)</li>
<li><a href="https://github.com/apache/arrow/issues/38732">GH-38732</a> - [Java][FlightRPC] Add support for Array parameter binding in JDBC (#38733)</li>
<li><a href="https://github.com/apache/arrow/issues/38751">GH-38751</a> - [C++][Go][Parquet] Add tests for reading Float16 files in parquet-testing (#38753)</li>
<li><a href="https://github.com/apache/arrow/issues/38757">GH-38757</a> - [C#] Implement common interfaces for structure arrays and record batches (#38759)</li>
<li><a href="https://github.com/apache/arrow/issues/38758">GH-38758</a> - [C++][FS][Azure] Rename AzurePath to AzureLocation (#38773)</li>
<li><a href="https://github.com/apache/arrow/issues/38772">GH-38772</a> - [C++] Implement directory semantics even when the storage account doesn’t support HNS (#39361)</li>
<li><a href="https://github.com/apache/arrow/issues/38798">GH-38798</a> - [Integration] Enable C Data Interface integration testing on Rust (#38799)</li>
<li><a href="https://github.com/apache/arrow/issues/38814">GH-38814</a> - [C++][Parquet] Update parquet.thrift to sync with 2.10.0 (#38815)</li>
<li><a href="https://github.com/apache/arrow/issues/38824">GH-38824</a> - [Go] Enable GC checks (#38826)</li>
<li><a href="https://github.com/apache/arrow/issues/38836">GH-38836</a> - [Go] Add Size() for ArrayData (#38839)</li>
<li><a href="https://github.com/apache/arrow/issues/38852">GH-38852</a> - [C++] Replace “#ifdef ARROW_WITH_GZIP” in dataset test to ARROW_WITH_ZLIB (#38853)</li>
<li><a href="https://github.com/apache/arrow/issues/38857">GH-38857</a> - [Python] Fix append mode for cython 2 (#39027)</li>
<li><a href="https://github.com/apache/arrow/issues/38857">GH-38857</a> - [Python] Add append mode for pyarrow.OsFile (#38820)</li>
<li><a href="https://github.com/apache/arrow/issues/38860">GH-38860</a> - [C++][Parquet] Using length to optimize bloom filter read (#38863)</li>
<li><a href="https://github.com/apache/arrow/issues/38870">GH-38870</a> - [Documentation] Add List View and Large List View to status.rst (#38871)</li>
<li><a href="https://github.com/apache/arrow/issues/38874">GH-38874</a> - [C++][Parquet] Minor: making parquet TypedComparator operation as const method (#38875)</li>
<li><a href="https://github.com/apache/arrow/issues/38884">GH-38884</a> - [C++] DatasetWriter release rows_in_flight_throttle when allocate writing failed (#38885)</li>
<li><a href="https://github.com/apache/arrow/issues/38887">GH-38887</a> - [C++][Parquet] Move EstimatedBufferedValueBytes from TypedColumnWriter to ColumnWriter (#39055)</li>
<li><a href="https://github.com/apache/arrow/issues/38907">GH-38907</a> - [C++] Stop installing internal bpacking_simd* headers (#38908)</li>
<li><a href="https://github.com/apache/arrow/issues/38909">GH-38909</a> - [Packaging] Drop support for Ubuntu 23.04 (#38910)</li>
<li><a href="https://github.com/apache/arrow/issues/38918">GH-38918</a> - [Go] Avoid schema.Fields allocations in some places (#38919)</li>
<li><a href="https://github.com/apache/arrow/issues/38920">GH-38920</a> - [C++][Gandiva] Refactor function holder to return arrow Result (#38873)</li>
<li><a href="https://github.com/apache/arrow/issues/38958">GH-38958</a> - [C++][Parquet] Fix spelling (#38959)</li>
<li><a href="https://github.com/apache/arrow/issues/38990">GH-38990</a> - [Java] Upgrade to flatc version 23.5.26 (#38991)</li>
<li><a href="https://github.com/apache/arrow/issues/38996">GH-38996</a> - [Java] Update dependencies and plugins for JPMS modules (#38994)</li>
<li><a href="https://github.com/apache/arrow/issues/39006">GH-39006</a> - [Python] Extract libparquet requirements out of libarrow_python.so to new libarrow_python_parquet_encryption.so (#39316)</li>
<li><a href="https://github.com/apache/arrow/issues/39013">GH-39013</a> - [Go][Integration] Support cABI import/export of StringView (#39019)</li>
<li><a href="https://github.com/apache/arrow/issues/39020">GH-39020</a> - [CI][Release][JS] Use Node.js 18 instead of 16 (#39021)</li>
<li><a href="https://github.com/apache/arrow/issues/39028">GH-39028</a> - [Python][CI] Fix dask integration build by temporarily skipping test_categorize_info (#39029)</li>
<li><a href="https://github.com/apache/arrow/issues/39037">GH-39037</a> - [Java] Remove (Contrib/Experimental) mention in Flight SQL (#39040)</li>
<li><a href="https://github.com/apache/arrow/issues/39049">GH-39049</a> - [C++] Use Cast() instead of CastTo() for Dictionary Scalar in test (#39362)</li>
<li><a href="https://github.com/apache/arrow/issues/39050">GH-39050</a> - [C++] Use Cast() instead of CastTo() for Timestamp Scalar in test (#39060)</li>
<li><a href="https://github.com/apache/arrow/issues/39051">GH-39051</a> - [C++] Use Cast() instead of CastTo() for List Scalar in test (#39353)</li>
<li><a href="https://github.com/apache/arrow/issues/39064">GH-39064</a> - [C++][Parquet] Support row group filtering for nested paths for struct fields (#39065)</li>
<li><a href="https://github.com/apache/arrow/issues/39088">GH-39088</a> - [Dev][Java] Add Dependabot configuration for Java (#39089)</li>
<li><a href="https://github.com/apache/arrow/issues/39096">GH-39096</a> - [Python] Release GIL in <code class="language-plaintext highlighter-rouge">.nbytes</code> (#39097)</li>
<li><a href="https://github.com/apache/arrow/issues/39119">GH-39119</a> - [C++] Refactor the Azure FS tests and filesystem class instantiation (#39207)</li>
<li><a href="https://github.com/apache/arrow/issues/39122">GH-39122</a> - [C++][Parquet] Optimize FLBA record reader (#39124)</li>
<li><a href="https://github.com/apache/arrow/issues/39134">GH-39134</a> - Create module info compiler plugin (#39135)</li>
<li><a href="https://github.com/apache/arrow/issues/39159">GH-39159</a> - [C++] : Try to make Buffer::device_type_ non-optional (#39150)</li>
<li><a href="https://github.com/apache/arrow/issues/39170">GH-39170</a> - [Java] Improve error message explaining why TestTls might fail (#39171)</li>
<li><a href="https://github.com/apache/arrow/issues/39189">GH-39189</a> - [Java] Bump com.h2database:h2 from 1.4.196 to 2.2.224 in /java (#39188)</li>
<li><a href="https://github.com/apache/arrow/issues/39196">GH-39196</a> - [Python][Docs] Document the Arrow PyCapsule protocol in the ‘extending pyarrow’ section of the Python docs (#39199)</li>
<li><a href="https://github.com/apache/arrow/issues/39208">GH-39208</a> - [C++][Parquet] Remove deprecated AppendRowGroup(int64_t num_rows) (#39209)</li>
<li><a href="https://github.com/apache/arrow/issues/39210">GH-39210</a> - [C++][Parquet] Avoid WriteRecordBatch from produce zero-sized RowGroup (#39211)</li>
<li><a href="https://github.com/apache/arrow/issues/39217">GH-39217</a> - [Python] RecordBatchReader.from_stream constructor for objects implementing the Arrow PyCapsule protocol (#39218)</li>
<li><a href="https://github.com/apache/arrow/issues/39223">GH-39223</a> - [C#] Support IReadOnlyList&lt;T?&gt; on remaining scalar types (#39224)</li>
<li><a href="https://github.com/apache/arrow/issues/39225">GH-39225</a> - [GLib] Use Cast() instaed of CastTo (#39228)</li>
<li><a href="https://github.com/apache/arrow/issues/39232">GH-39232</a> - [C++] Support binary to fixed_size_binary cast (#39236)</li>
<li><a href="https://github.com/apache/arrow/issues/39243">GH-39243</a> - [R][CI] Remove r-arrow conda nightlies (#39244)</li>
<li><a href="https://github.com/apache/arrow/issues/39246">GH-39246</a> - [CI][GLib][Ruby] Use Ubuntu 22.04 not 20.04 (#39247)</li>
<li><a href="https://github.com/apache/arrow/issues/39262">GH-39262</a> - [C++][Azure][FS] Add default credential auth configuration (#39263)</li>
<li><a href="https://github.com/apache/arrow/issues/39265">GH-39265</a> - [Java] Make it run well with the netty newest version 4.1.104 (#39266)</li>
<li><a href="https://github.com/apache/arrow/issues/39268">GH-39268</a> - [C++] Don’t install bundled Azure SDK for C++ with CMake 3.28+ (#39269)</li>
<li><a href="https://github.com/apache/arrow/issues/39292">GH-39292</a> - [C++][FS] : Remove the AzureBackend enum and add more flexible connection options (#39293)</li>
<li><a href="https://github.com/apache/arrow/issues/39297">GH-39297</a> - [C++][FS] : Inform caller of container not-existing when checking for HNS support (#39298)</li>
<li><a href="https://github.com/apache/arrow/issues/39299">GH-39299</a> - [Java] Upgrade to Avro 1.11.3 (#39300)</li>
<li><a href="https://github.com/apache/arrow/issues/39303">GH-39303</a> - [Archery][Benchmarking] Allow setting C++ repetition min time (#39324)</li>
<li><a href="https://github.com/apache/arrow/issues/39318">GH-39318</a> - [C++][FS][Azure] Add workload identity auth configuration (#39319)</li>
<li><a href="https://github.com/apache/arrow/issues/39320">GH-39320</a> - [C++][FS][Azure] Add managed identity auth configuration (#39321)</li>
<li><a href="https://github.com/apache/arrow/issues/39322">GH-39322</a> - [C++] Forward arguments to ExceptionToStatus all the way to Status::FromArgs (#39323)</li>
<li><a href="https://github.com/apache/arrow/issues/39326">GH-39326</a> - [C++] Flaky DatasetWriterTestFixture.MaxRowsOneWriteBackpresure test (#39379)</li>
<li><a href="https://github.com/apache/arrow/issues/39328">GH-39328</a> - [Java] Make default getConsumer public (#39329)</li>
<li><a href="https://github.com/apache/arrow/issues/39335">GH-39335</a> - [C#] Support creating FlightClient with Grpc.Core.Channel (#39348)</li>
<li><a href="https://github.com/apache/arrow/issues/39339">GH-39339</a> - [C++] Add ForceCachedHierarchicalNamespaceSupport to help with testing (#39340)</li>
<li><a href="https://github.com/apache/arrow/issues/39341">GH-39341</a> - [C#] Support Utf8View, BinaryView and ListView (#39342)</li>
<li><a href="https://github.com/apache/arrow/issues/39343">GH-39343</a> - [C++][FS][Azure] Add client secret auth configuration (#39346)</li>
<li><a href="https://github.com/apache/arrow/issues/39355">GH-39355</a> - [Java] Improve JdbcConsumer exceptions (#39356)</li>
<li><a href="https://github.com/apache/arrow/issues/39357">GH-39357</a> - [C++] Reduce function.h includes (#39312)</li>
<li><a href="https://github.com/apache/arrow/issues/39363">GH-39363</a> - [C++] Use Cast() instead of CastTo() for Parquet (#39364)</li>
<li><a href="https://github.com/apache/arrow/issues/39413">GH-39413</a> - [C++][Parquet] Vectorize decode plain on FLBA (#39414)</li>
<li><a href="https://github.com/apache/arrow/issues/39419">GH-39419</a> - [C++][Parquet] Style: Using arrow::Buffer data_as api rather than reinterpret_cast (#39420)</li>
<li><a href="https://github.com/apache/arrow/issues/39430">GH-39430</a> - [C++][ORC] Upgrade ORC to 1.9.2 (#39431)</li>
<li><a href="https://github.com/apache/arrow/issues/39449">GH-39449</a> - [C++] Use default Azure credentials implicitly and support anonymous credentials explicitly (#39450)</li>
<li><a href="https://github.com/apache/arrow/issues/39484">GH-39484</a> - [Java] Support 256 bit decimals in JdbcToArrowUtils (#39485)</li>
<li><a href="https://github.com/apache/arrow/issues/39500">GH-39500</a> - [Docs] Pin pydata-sphinx-theme to 0.14 (#39501)</li>
<li><a href="https://github.com/apache/arrow/issues/39515">GH-39515</a> - [Python] Pass in type to <code class="language-plaintext highlighter-rouge">MapType.from_arrays</code> (#39516)</li>
<li><a href="https://github.com/apache/arrow/issues/39531">GH-39531</a> - [Python][CI] Skip failing dask tests: test_describe_empty and test_view (#39534)</li>
<li><a href="https://github.com/apache/arrow/issues/39533">GH-39533</a> - [Python] NumPy 2.0 compat: remove usage of np.core (#39535)</li>
<li><a href="https://github.com/apache/arrow/issues/39537">GH-39537</a> - [Packaging][Python] Add a numpy&lt;2 pin to the install requirements for the 15.x release branch (#39538)</li>
<li><a href="https://github.com/apache/arrow/issues/39601">GH-39601</a> - [R] Don’t download cmake when TEST_OFFLINE_BUILD=true (#39602)</li>
<li><a href="https://github.com/apache/arrow/issues/39624">GH-39624</a> - [R][CI] Add CMake to docker file and update envvars (#39625)</li>
<li><a href="https://github.com/apache/arrow/issues/39626">GH-39626</a> - [Docs][R] Update NEWS.md for 15.0.0</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-2411">PARQUET-2411</a> - [C++][Parquet] Allow reading dictionary without reading data via ByteArrayDictionaryRecordReader (#39153)</li>
</ul>
</main>
<hr/>
<footer class="footer">
<div class="row">
<div class="col-md-9">
<p>Apache Arrow, Arrow, Apache, the Apache feather logo, and the Apache Arrow project logo are either registered trademarks or trademarks of The Apache Software Foundation in the United States and other countries.</p>
<p>&copy; 2016-2024 The Apache Software Foundation</p>
</div>
<div class="col-md-3">
<a class="d-sm-none d-md-inline pr-2" href="https://www.apache.org/events/current-event.html">
<img src="https://www.apache.org/events/current-event-234x60.png"/>
</a>
</div>
</div>
</footer>
</div>
</body>
</html>