| <!DOCTYPE html> |
| <html lang="en-US"> |
| <head> |
| <meta charset="UTF-8"> |
| <meta http-equiv="X-UA-Compatible" content="IE=edge"> |
| <meta name="viewport" content="width=device-width, initial-scale=1"> |
| <!-- The above meta tags *must* come first in the head; any other head content must come *after* these tags --> |
| |
| <title>Apache Arrow 21.0.0 Release | Apache Arrow</title> |
| |
| |
| <!-- Begin Jekyll SEO tag v2.8.0 --> |
| <meta name="generator" content="Jekyll v4.4.1" /> |
| <meta property="og:title" content="Apache Arrow 21.0.0 Release" /> |
| <meta property="og:locale" content="en_US" /> |
| <meta name="description" content="Apache Arrow 21.0.0 (17 July 2025) This is a major release covering more than 2 months of development. Download Source Artifacts Binary Artifacts For AlmaLinux For Amazon Linux For CentOS For C# For Debian For Python For Ubuntu Git tag Contributors This release includes 400 commits from 82 distinct contributors. $ git shortlog -sn apache-arrow-20.0.0..apache-arrow-21.0.0 78 Sutou Kouhei 37 Raúl Cumplido 33 Hiroyuki Sato 30 William Ayd 22 Antoine Pitrou 19 Bryce Mecum 18 Nic Crane 11 Alenka Frim 10 Dewey Dunnington 9 Jacob Wujciak-Jens 9 dependabot[bot] 8 mwish 7 Jonathan Keane 7 Rossi Sun 6 Sarah Gilmore 5 Arash Andishgar 4 Dongjoon Hyun 4 takuya kodama 3 David Li 3 Eddie Chang 3 Enrico Minack 3 Ian Cook 3 Lester Fan 3 Ziy 3 abandy 2 Abhinav 2 David Sherrier 2 Krisztián Szűcs 2 Rok Mihevc 2 gitmodimo 1 Adam Reeve 1 Akum Kang 1 Alina (Xi) Li 1 Anatolii Tsyplenkov 1 Antoine Prouvost 1 Benjamin Kietzman 1 Brian Hulette 1 Bruno 1 Carsten Haubold 1 ChiLin Chiu 1 Colin 1 DenisTarasyuk 1 Eric Dinse 1 Etienne Bacher 1 Even Rouault 1 Gang Wu 1 Guilherme Martins Crocetti 1 Hadrian Reppas 1 HyunWoo Oh 1 Igor Antropov 1 JB Onofré 1 Joshua 1 Junwang Zhao 1 Kevin Gurney 1 Kevin Wilson 1 Kirill Tsyganov 1 Konstantin Malanchev 1 Kyle Hemker 1 Lukas 1 Mateusz Rzeszutek 1 Matt Topol 1 Michael 1 Michael Chirico 1 NazilaAk 1 Patrick Walsh 1 Ranjit Ranjan 1 Roman Karlstetter 1 Saurabh Singh 1 Thomas Newton 1 Tommy Hughes IV 1 Xingyu Long 1 Zihan Qi 1 bw513 1 dawg 1 koenvo 1 leopardracer 1 lriggs 1 neilechao 1 omahs 1 shu-kitamura 1 yuri@FreeBSD 1 yyossy Patch Committers The following Apache committers merged contributed patches to the repository. $ git shortlog -sn --group=trailer:signed-off-by apache-arrow-20.0.0..apache-arrow-21.0.0 179 Sutou Kouhei 51 Antoine Pitrou 47 Raúl Cumplido 15 Nic Crane 13 AlenkaF 13 Bryce Mecum 12 Jacob Wujciak-Jens 7 David Li 7 Dewey Dunnington 7 mwish 6 Rossi Sun 5 Curt Hagenlocher 5 Jonathan Keane 5 Sarah Gilmore 4 Rok Mihevc 3 Gang Wu 3 Will Ayd 2 Kevin Gurney 2 Krisztian Szucs 1 Benjamin Kietzman Changelog Bug Fixes GH-32276 - [C++][FlightRPC] Add option to align RecordBatch buffers given to IPC reader (#44279) GH-35166 - [C++][Compute] Increase precision of decimals in sum aggregates (#44184) GH-39811 - [R] better documentation for col_types argument in open_delim_dataset (#45719) GH-40756 - [C++] Remove dead Boost urls (#46452) GH-43132 - [CI] Fix pre-commit Rat check (#46541) GH-44366 - [Python][Acero] RecordBatch.filter on expression raises error if result set is empty (#46057) GH-44502 - [R] Negative fractional dates must be converted to integers by floor, not trunc (#46873) GH-44910 - [Swift] Fix IPC stream reader and writer impl (#45029) GH-45292 - [Python] test_dtypes hypotesis test fails sporadically (#46029) GH-45532 - [C++] RunEndEncodedBuilder should clear dimensions after a Finish() call (#45533) GH-45534 - [C++] Test: RunEndEncodeTableColumns should update REE columns' schema types (#45535) GH-45608 - [C++][Flight] Fix compilation for clang (#46264) GH-45716 - [R][CI] Refactor skip_on_python_older_than to not initialize reticulate (#46079) GH-45735 - [C++] Broken tests for extract_regex compute funcion (#45900) GH-45853 - [C++][Dev] Fix Meson compilation issues in Docker builds (#45858) GH-46011 - [C++] Hide DCHECK family from public headers (#46015) GH-46025 - [C++] Use ARROW_CUDA_EXPORT instead of ARROW_EXPORT for libarrow_cuda (#46030) GH-46052 - [C++][Benchmarking] Don't build grouper benchmark without ARROW_COMPUTE=ON (#46053) GH-46065 - [Release] Don't use --verify-tag for gh release upload in 02-source.sh (#46066) GH-46068 - [Release] Remove needless docs:rc task from 05-binary-upload.sh (#46069) GH-46070 - [C++] Remove duplicate storage_type in JsonExtension (#46071) GH-46080 - [Python][Docs] Provide guidance for tzdata related issues if installing with pip (#46591) GH-46084 - [C++] Always use ARROW_VCPKG to detect vcpkg mode (#46467) GH-46090 - [C++] Set default IPC option to enabled in Meson (#46114) GH-46094 - [C++][Docs] Add note to RleDecoder::Get's doc comment (#46874) GH-46121 - [Python] Add missing column_index argument to ArrowReaderProperties::read_dictionary's Cython binding (#46122) GH-46127 - [CI][Release] Make 02-source.sh test passable on fork (#46143) GH-46146 - [C++] Merge metadata in SchemaBuidler::AddMetadata (#46654) GH-46149 - [C++] Opening dataset fails with sshfs-3.7.3 due to F_RDADVISE error (#46346) GH-46157 - [C++] Move test utility RunEndEncodeTableColumns that uses REE to test_util_internal on acero instead of common gtest_util (#46161) GH-46174 - [Python] Failing tests in python minimal builds (#46175) GH-46192 - [C++] Add substrait dep to third party download script (#46191) GH-46197 - [C++] Tests use legacy timezones (#46201) GH-46214 - [C++] Improve S3 client initialization (#46723) GH-46224 - [C++][Acero] Fix the hang in asof join (#46300) GH-46231 - [C++][CMake] Fix arrow_bundled_dependencies to be externally accessible by FetchContent (#46232) GH-46233 - [C++] Fix missing nested braces in QueuedTask initialization (#46234) GH-46236 - [Release][Packaging] Fix dev/release/post-03-binary.sh errors (#46237) GH-46238 - [Release][Python] Use array to avoid empty argument in dev/release/post-11-python.sh (#46239) GH-46240 - [Release][Packaging] Fix a bug that existing APT repositories' metadata are lost (#46287) GH-46242 - [Release] Don't show gpg signature when getting release time (#46243) GH-46259 - [CI] Remove deprecated flag from mamba info (#46260) GH-46262 - [CI][Ruby] Don't update GCC of MSYS2 (#46278) GH-46268 - [C++] Improve ArrayData docstrings (#46271) GH-46270 - [C++][Parquet] Clarify GeoStatistics docstring (#46649) GH-46284 - [Release][Packaging] Add missing APT metadata for .ddeb (#46288) GH-46296 - [Swift] Add support for reading struct (#46302) GH-46299 - [C++][Compute] Don't use static inline const for default options (#46303) GH-46304 - [Release][Packaging] Use optimized debug build for .deb (#46392) GH-46306 - [C++][Parquet] Should use LoadEnumSafe for geo enum (#46307) GH-46314 - [C++][Parquet] Fix valgrind error when collecting parameterized tests for MakeWKBPoint (#46320) GH-46326 - [C++][Parquet] Fix stack overflow in rapidjson value comparison to integer (#46327) GH-46333 - [CI] Always pass --yes to mamba clean (#46341) GH-46333 - [CI] Explicitly pass --yes to mamba clean (#46334) GH-46343 - [CI][Python] Remove workaround for gdb packaging issue (#46848) GH-46343 - [CI] Avoid installing gdb 16.3 on python 3.10 jobs to fix CI (#46511) GH-46344 - [CI][Python] Skip doctest for s3.get_file_info to avoid bucket restrictions (#46345) GH-46351 - [Archery][Docs] Fix the cli argument parsing bug in docker subcommand (#46352) GH-46355 - [Python] Fix table.to_struct_array with an empty table (#46357) GH-46359 - [C++][Thirdparty] Bump Apache ORC to 2.1.2 (#46360) GH-46362 - [CGLib][Packaging] Use -fPIE explicitly for g-ir-scanner (#46366) GH-46363 - [CI][Packaging] Use mono from community repository on Alpine instead of from testing (#46364) GH-46394 - [C++][R] gcc-UBSAN errors on CRAN (#46397) GH-46395 - [C++][Statistics] Use EqualOptions for min and max in arrow::ArrayStatistics::Equals() (#46422) GH-46407 - [C++] Fix IPC serialization of sliced list arrays (#46408) GH-46414 - [C++] Fix GCS filesystem getFileInfo method (#46416) GH-46417 - [C++][Parquet] Fix UB in LoadEnumSafe for EdgeInterpolationAlgorithm (#46418) GH-46419 - [C++] Remove duplicate declaration and sync arg names on acero test_util_internal functions (#45400) GH-46420 - [C++][Dataset] Fix DatasetWriter deadlock on writting batch greater than max_rows_queued (#46139) GH-46424 - [C++][Parquet] Fix erroneous unit test skip (#46425) GH-46435 - [Parquet][C++] Fix uninitialized value in writer test (#46533) GH-46442 - [R] hms::as_hms tests fail on some of our crossbow builds (#46443) GH-46456 - [GLib] Add missing since: tag (#46457) GH-46478 - [C++] Implement recent JSON changes into Meson configuration (#46479) GH-46481 - [C++][Python] Allow nullable schema in FlightInfo (#46489) GH-46512 - [CI][C++] Install the llvm package explicitly on MSYS2 (#46525) GH-46516 - [CI][Python] Force Cython>3.1.1 for docs builds (#46770) GH-46523 - [GLib] Fix compiler warning: use gsize instead of int (#46524) GH-46538 - [CI][Packaging][AlmaLinux8] Ensure pip3 (#46539) GH-46564 - [C++] Export ARROW_VCPKG in ArrowConfig.cmake (#46565) GH-46576 - [C++] Suppress codecvt_utf8 deprecation warning (#46622) GH-46589 - [C++] Fix utf8_is_digit to support full Unicode digit range (#46590) GH-46593 - [CI][Integration] Disable nested log grouping (#46594) GH-46598 - [Dev] Use language name for alias (#46602) GH-46599 - [C++][Doc][Parquet] Update supported types documentation (#46620) GH-46605 - [CI][Release][C#] Update download URL for dotnet on verification script (#46612) GH-46606 - [Python] Do not require numpy when normalizing slice (#46732) GH-46609 - [Release][CI] Use System GTest for macos verification (#46823) GH-46610 - [CI][Release] Use Python 3.12 on AlmaLinux 8 (#46621) GH-46611 - [Python][C++] Allow building float16 arrays without numpy (#46618) GH-46623 - [C++][Compute] Fix the failure of large memory test in arrow-compute-row-test (#46635) GH-46636 - [R] Fix evaluation of external objects not in global environment in case_when() (#46667) GH-46659 - [C++] Fix export of extension arrays with binary view/string view storage (#46660) GH-46673 - [CI][R][Docs] Accept empty INSTALL_ARGS again (#46682) GH-46674 - [C++] Construct Array from ExtensionType Scalar (#46675) GH-46684 - [C++] Fix Meson configuration issue on Windows (#46685) GH-46688 - [Ruby] Fix a typo (#46689) GH-46691 - [CI][Packaging] Update platform tag on generated wheel name to match newest auditwheel naming (#46705) GH-46693 - [CI] Update GitHub hosted runner from deprecated windows-2019 to windows-2022 (#46694) GH-46704 - [C++] Fix OSS-Fuzz build failure (#46706) GH-46708 - [C++][Gandiva] Added zero return values for castDECIMAL_utf8 (#46709) GH-46710 - [C++] Fix ownership and lifetime issues in Dataset Writer (#46711) GH-46717 - [R][Docs] Add missing "internal" keywords for internal function (#46722) GH-46724 - [C++][Parquet] OSSFuzz: Prevent from Bad-cast in handling statistics (#46725) GH-46729 - [Python] Allow constructing InMemoryDataset from RecordBatchReader (#46731) GH-46736 - [CI] Disable Parquet in conan-minimum (#46744) GH-46761 - [C++] Add executable detection on FreeBSD (#46759) GH-46764 - [C++][Gandiva] Fix wrong .bc depends (#46765) GH-46777 - [C++] Use SimplifyIsIn only when the value_set of the expression is lower than a threshold (#46859) GH-46782 - [Docs] Link to same version of docs from Implementations page GH-46805 - [CI][Dev] Fix caching for R hooks in lint job (#46812) GH-46809 - [CI][Packaging] Stop trying to add headers from arrow/compu… (#46810) GH-46811 - [C++][Python] Fix crash on FileReaderImpl::GetRecordBatchReader (#46931) GH-46816 - [Docs] Fix links to Swift docs and source (#46817) GH-46827 - [C++] Update Meson Configuration for compute shared lib (#46839) GH-46831 - [C++][R] Remove some pending references to CMake < 3.25 (docs + minor CMake references) (#46834) GH-46841 - [C++][Gandiva] Fix date trunc edge case (#46842) GH-46863 - [CI][C++] Suppress a false positive UBSAN error in AWS SDK for C++ (#46870) GH-46871 - [C++][Parquet] Restore implementation of 3 arrow::FileReader::GetRecordBatchReader() functions (#46868) GH-46879 - [CI][Packaging][Linux] Don't check example build with old CMake (#46880) GH-46888 - [C++] Remove override of default buildtype in Meson config (#46919) GH-46915 - [C++][Compute] Initialize Compute kernels on benchmarks that require extra kernels (#46922) GH-46916 - [R] Test for negative fractional dates fails on older R versions due to change in base R as.Date() (#46917) GH-46920 - [FlightRPC] Fix Flight SQL ColumnMetadata retrieval (#46921) GH-46934 - [C++][Parquet] Trying to fix ub in AttachStatistics (#46940) GH-46947 - [R][Packaging] Add src/arrow/flight/sql/odbc to source excludes (#46948) GH-46964 - [CI][Packaging][Conan] Ensure using upper case for config suffix (#46967) GH-46986 - [CI][C++] Fix a build error with C++20 (#46987) GH-46988 - [C++][Parquet] Fix FLBA DecodeArrow multiply overflow (#46991) GH-46989 - [CI][R] Use Ubuntu 20.04 instead of OpenSUSE for R 4.1 (#46990) GH-46995 - [CI][R][C++] Use system memory allocator in sanitizer jobs (#47007) GH-46998 - [C++] Fix mockfs.cc compiling error with C++23 (#46999) GH-47015 - [CI][C++] Use mold on conda-cpp to work around issues with GNU ld (#47028) GH-47033 - [C++][Compute] Never use custom gtest main with MSVC (#47049) GH-47037 - [CI][C++] Fix Fedora 39 CI jobs (#47038) GH-47061 - [Release] Fix wrong variable name for signing (#47062) GH-47063 - [Release] Define missing RELEASE_TARBALL (#47064) GH-47065 - [Release] Fix timeout key in verify_rc.yml (#47066) GH-47067 - [Release] Fix wrong GitHub Actions context in verify_rc.yml (#47068) GH-47069 - [Release] Add missing "needs: target" (#47070) GH-47071 - [Release] Dereference all hard links in source archive (#47072) GH-47074 - [Release] Use reproducible mtime for csharp/ in source archive (#47076) GH-47078 - [Release] Ensure using cloned apache/arrow for reproducible check (#47079) GH-47092 - [Release] Binary verification CI jobs are failing New Features and Improvements GH-25025 - [C++] Move non core compute kernels into separate shared library (#46261) GH-26818 - [C++][Python] Preserve order when writing dataset multi-threaded (#44470) GH-35419 - [GLib] Add GArrowFixedShapeTensorDataType (#46305) GH-35644 - [MATLAB] Add tests verifying arrow.array.<Type>Array.fromMATLAB() throws an exception if given an array with the wrong type. (#47020) GH-36753 - [C++] Properly pretty-print and diff HalfFloatArrays (#46857) GH-37027 - [C++] Add float16 kernels to if-else and vector-replace functions (#46446) GH-37561 - [Ruby] Add empty chunked array tests for Arrow::Table#each_raw_records (#46862) GH-37577 - [MATLAB] Create a superclass for DateType-related MATLAB tests (#46923) GH-37677 - [C++][FlightRPC] Allow FlightInfo.schema to be nullable GH-37891 - [C++][Parquet] Refine several classes in Parquet encryption (#46202) GH-37891 - [C++] Followup Buffer change to use sptr move (#46027) GH-38214 - [MATLAB] Add a common arrow.tabular.Tabular MATLAB interface (#47014) GH-38369 - [MATLAB] Create utility functions for simplifying management of Proxy instances for Arrays (#46907) GH-38903 - [R][Docs] Improve documentation of col_types (#46145) GH-38914 - [Python] Add EncryptionConfiguration.uniform_encryption (#46347) GH-39294 - [C++][Python] DLPack on Tensor class (#42118) GH-39759 - [Docs] Update pydata-sphinx-theme to 0.16.1 (#46943) GH-40278 - [C++] Support casting string to duration in CSV converter (#46035) GH-40343 - [C++] Move S3FileSystem to the registry (#41559) GH-40754 - [Python] Expose tls_ca_file_path to S3FileSystem (#45881) GH-41496 - [Python][Azure][Docs] Turn on azure on debian-docs (#46892) GH-41672 - [Python][Doc] Clarify docstring of FixedSizeListArray.values that it ignores the offset (#46144) GH-41973 - Expose new S3 option check_directory_existence_before_creation - manual rebase (#46619) GH-42012 - [Python] Add Schema with_field or set_field method (#46348) GH-43041 - [C++][Python] Read/write Parquet BYTE_ARRAY as Large/View types directly (#46532) GH-43170 - [Swift] Add StructArray support to ArrowWriter (#43439) GH-43623 - [R] remove libarrow backwards compatibility enforcement (#46491) GH-43807 - [C++][Python] Add UUID extension type conversion support to/from Parquet (#45866) GH-43891 - [C++][Parquet] Faster reading of FIXED_LEN_BYTE_ARRAY data (#46886) GH-44208 - [R] Adding test to ensure bit64's new semantic works with arrow (#46651) GH-44435 - [GLib] Add distinct count support to GArrowArrayStatistics (#46894) GH-44500 - [Python][Parquet] Map Parquet logical types to Arrow extension types by default (#46772) GH-44900 - [Python] Support explicit fsspec+{protocol} and hf:// filesystem URIs (#45089) GH-44953 - [R] Add R bindings for new compute functions (#44971) GH-45028 - [C++][Compute] Allow cast to reorder struct fields (#45246) GH-45083 - [C++] Add HalfFloat kernels for is_nan, is_inf, is_finite, negate, negate_checked, sign (#46866) GH-45195 - [C++] Update bundled AWS SDK for C++ to 1.11.587 (#45306) GH-45229 - [Python] Migrate from scipy.spmatrix to scipy.sparray (#46423) GH-45229 - [Python] skip scipy.sparse roundtrip tests for float16 (#46413) GH-45290 - [Docs][Release] Change show_version_warning_banner substitution (#46883) GH-45522 - [Parquet][C++] Parquet GEOMETRY and GEOGRAPHY logical type implementations (#45459) GH-45531 - [Python] Add the dim_names argument to from_numpy_ndarray (#46170) GH-45619 - [Python] Use f-string instead of string.format (#45629) GH-45643 - [R] Implement hms functions to create and manipulate time of day variables (#46206) GH-45653 - [Python] Scalar subclasses should implement Python protocols (#45818) GH-45664 - [C++] Allow LargeString,LargeBinary,FixedSizeBinary,StringView and BinaryView for RecordBatch::MakeStatisticsArray() (#46031) GH-45713 - [GLib] Add garrow_chunked_array_(import|export)() (#46876) GH-45750 - [C++][Python][Parquet] Implement Content-Defined Chunking for the Parquet writer (#45360) GH-45794 - [C++] Add array directory to Meson configuration (#45795) GH-45796 - [C++] Add integration directory to Meson configuration (#45797) GH-45798 - [C++] Add extension directory to Meson (#45799) GH-45800 - [C++] Implement util configuration in Meson (#45824) GH-45829 - [C++] Add compute directory to Meson configuration (#45830) GH-45833 - [C++] Add JSON directory to Meson configuration (#45834) GH-45865 - [C++] Create dedicated benchmark dependency in Meson (#45909) GH-45908 - [C++][Docs] Rename and expose basic {Array,...}FromJSON helpers as public APIs (#46180) GH-45957 - [C++][Python] Expose allow_delayed_open on S3FileSystem (#46078) GH-45978 - [C++] Bump bundled mimalloc version (#45979) GH-45991 - [C++] Bump bundled nlohmann_json to v3.12.0 (#46112) GH-45992 - [C++] Bump bundled utf8proc version to 2.10.0 (#46032) GH-46019 - [Python] Raise TypeError on feather read_table if columns is not a Sequence (#46038) GH-46054 - [Python][Packaging] Re-enable pandas on Windows free-threaded wheel (#46109) GH-46058 - [Python] Run Python in AppVeyor outside of source directory (#46059) GH-46087 - [FlightSQL] Allow returning column remarks in FlightSQL's CommandGetTables (#46110) GH-46091 - [C++] Use feature options in Meson configuration (#46204) GH-46092 - [C++] Add filesystem related options to Meson (#46101) GH-46104 - GH-45937: [C++][Parquet] Logical type definition for variant GH-46115 - [C++] Implement compression libraries in Meson (#46358) GH-46116 - [C++] Implement IPC directory in Meson (#46117) GH-46118 - [C++] Add tensor directory to Meson (#46119) GH-46130 - [Python] Remove use_legacy_format in favour of setting IpcWriteOptions (#46131) GH-46132 - [C++][Parquet] Remove deprecated parquet APIs from 19.0.0 (#46133) GH-46141 - [C++] Add flight directory to Meson configuration (#46142) GH-46153 - [C++] Implement acero directory in Meson (#46154) GH-46155 - [C++] Implement Tensorflow directory in Meson (#46156) GH-46163 - [C++] Add vendored directory to Meson (#46164) GH-46189 - [C#] Use pooled buffers in ArrowStreamWriter (#46190) GH-46196 - [C++] Remove ARROW_USE_PRECOMPILED_HEADERS and related logic (#46200) GH-46198 - [Python] Remove deprecated PyExtensionType (#46199) GH-46207 - [C++] Rename arrow::util::StringBuilder and move to internal namespace (#46813) GH-46209 - [Documentation][C++][Compute] Add cpp developer documentation for row table (#46210) GH-46215 - [C++][Docs] Add README for Meson subprojects directory (#46216) GH-46217 - [C++][Parquet] Update the timestamp of parquet::encryption::TwoLevelCacheWithExpiration correctly (#46283) GH-46219 - [C++][Parquet] Remove PARQUET_MINIMAL_DEPENDENCY option (#46274) GH-46222 - [Python] Allow to specify footer metadata when opening IPC file for writing (#46354) GH-46241 - [Release][Packaging] Add support for regenerating metadata of APT repositories (#46277) GH-46245 - [Swift] Upgrade FlatBuffers to v25.2.10 (#46246) GH-46250 - [Swift] Update swift-tools-version to 5.10 (#46252) GH-46285 - [C++] Add support for Decimal32/64 and HalfFloat to run_end_encode/run_end_decode (#46286) GH-46289 - [Release][Packaging] Verify APT/Yum repositories keeps working for old versions (#46292) GH-46290 - [Swift] Upgrade grpc-swift to 1.25.0 and swift-protobuf to 1.29.0 (#46291) GH-46318 - [Docs][C++] Add Extension Array/Type documents (#46319) GH-46321 - [C++][Doc] Better explain ArrayData IsValid and GetNullCount (#46332) GH-46336 - [Release][Packaging] Add support for Reproducible Builds for source archive (#46342) GH-46338 - [C++] Add compile step for Meson in cpp_build.sh (#46339) GH-46349 - [Python] Move parquet definitions to pyarrow/includes/libparquet.pxd (#46437) GH-46367 - [C++] Prevent Meson from using git info if built as subproject (#46368) GH-46373 - [Python] Exercise fallback case on tests for parquet.read_table in case dataset is not available (#46550) GH-46376 - [Docs] Replace Xitter link with BlueSky link (#46402) GH-46378 - [Docs] Remove references to autotune from the docs (#46379) GH-46380 - [GLib] Add GArrowFixedShapeDataType#shape (#46381) GH-46386 - [C++] Ensure using our CMake packages not Find*.cmake (#46387) GH-46388 - [C++] Check Snappy::snappy{,-static} in FindSnappyAlt.cmake (#46389) GH-46396 - [C++][Documentation][Statistics] Revise the documentation to clarify that arrow::ArrayStatistics is ignored during arrow::Array comparisons (#46470) GH-46398 - [GLib] Add GArrowFixedShapeTensorDataType#n_dimensions (#46399) GH-46400 - [GLib] Add GArrowFixedShapeDataType#permutation (#46401) GH-46403 - [C++] Add support for limiting element size when printing data (#46536) GH-46433 - [GLib] Add GArrowFixedShapeDataType#dim_names (#46434) GH-46439 - [C++] Use result pattern for all FromJSONString Helpers (#46696) GH-46439 - [C++] Rename internal Converter class in from_string.cc (#46697) GH-46439 - [C++] Remove unneeded namespace prefix in test_util_internal.h (#46695) GH-46444 - [Documentation][C++][Acero] Move internal Swiss table doc into public C++ developer doc (#46445) GH-46450 - [GLib] Add GArrowFixedShapeDataType#strides (#46451) GH-46459 - [C++] Make some arrow/util headers internal (#46721) GH-46462 - [C++][Parquet] Expose currently thrown EncodedStatistics when checking is_stats_set (#46463) GH-46473 - [C++][Docs] Fix typos in decimal comments (#46474) GH-46475 - [Documentation][C++][Compute] Consolidate Acero developer docs (#46476) GH-46477 - [C++] Use vendored flatbuffers in Meson configuration (#46484) GH-46482 - [CI][Dev] Add shellcheck files without change (#46483) GH-46487 - [C++] Refactor lz4 from ExternalProject to FetchContent (#46390) GH-46490 - [CI][Dev] Add shellcheck ci/scripts/install_ccache.sh (#46492) GH-46494 - [CI][Dev] Add shellcheck files without change (#46495) GH-46496 - [CI][Dev] Fix shellcheck SC2086 errors in ci/scripts directory (#46497) GH-46499 - [CI][Crossbow][C++] Use apache/arrow for Meson (#46501) GH-46500 - [CI][Java] Remove CI scripts for Java (#46502) GH-46508 - [C++] Upgrade OpenTelemetry cpp to avoid build error on recent Clang (#46509) GH-46520 - [Docs] Fix variety of warnings and errors in the docs build (#46521) GH-46522 - [C++][FlightRPC] Add Arrow Flight SQL ODBC driver (#40939) GH-46526 - [CI][Dev] Fix shellcheck SC2086 and SC2223 errors ci/scripts directory (#46527) GH-46528 - [CI][Dev] Remove "archery lint" (#46686) GH-46529 - [C++] Convert static inline type trait functions to constexpr (#46559) GH-46537 - [Docs][C++] Add RunEndEncodedArray, FlatArray, and PrimitiveArray API Docs (#46540) GH-46544 - [CI][Dev][Python] Use pre-commit for autopep8 (#46552) GH-46545 - [CI][Dev][Python] Update pre-commit for cython-lint (#46580) GH-46546 - [CI][Dev][Python] Use pre-commit for numpydoc (#46595) GH-46547 - [CI][Dev][R] Use pre-commit for lintr (#46581) GH-46548 - [CI][Dev][R] Use pre-commit for cpplint (#46549) GH-46551 - [C++] Use std::string_view for type schema API (#46553) GH-46556 - [GLib] Add GArrowUUIDDataType (#46558) GH-46569 - [CI][Integration] Use apache/arrow-js for JS (#46570) GH-46572 - [Python] expose filter option to python for join (#46566) GH-46585 - [JS][Dev] Remove dependabot configuration for JS (#46586) GH-46587 - [CI][JS] Remove JS related test CI (#46588) GH-46603 - [JS][Release] Remove JavaScript related release code (#46604) GH-46613 - [GLib] Add GArrowBaseListDataType (#46615) GH-46632 - [R][Docs] Add docs for arrow::one (#46648) GH-46633 - [Docs][C++][Python] Update CombineChunks documentation to specify that binary columns can be combined into multiple chunks (#46638) GH-46642 - [Format] Add footnote clarifying REE layout has O(log n) random access (#46643) GH-46645 - [CI][Dev][R] Use pre-commit for styler (#46664) GH-46652 - [Python][Docs] Update language for row_group_size parameter (#46653) GH-46656 - [CI][Dev] Fix shellcheck SC2034 and SC2086 errors in ci/scripts directory (#46657) GH-46662 - [CI][Dev] Fix shellcheck SC2148 errors in ci/scripts directory (#46663) GH-46665 - [CI][Crossbow][C++] Use apache/arrow for Alpine Linux (#46666) GH-46676 - [C++][Python][Parquet] Allow reading Parquet LIST data as LargeList directly (#46678) GH-46679 - [C++][Meson] Use WrapDB entry for gflags instead of CMake wrapper (#46680) GH-46683 - [C++][Python] Add utf8_zero_fill compute function for sign-aware zero padding (#46815) GH-46699 - [CI][Dev] fix shellcheck errors in the ci/scripts/cpp_test.sh (#46700) GH-46702 - [JS] Remove js/ (#46703) GH-46714 - [C++] Use hidden symbol visibility in Meson configuration (#46715) GH-46719 - [R] Add 32 and 64 bit Decimal types (#46720) GH-46726 - [CI][Dev] fix shellcheck errors in the ci/scripts/conan_build.sh (#46727) GH-46740 - [C++] Update bundled Thrift GH-46745 - [C++] Update bundled Boost to 1.88.0 and Apache Thrift to 0.22.0 (#46912) GH-46746 - [C++] Assume AWS SDK >= 1.11.0 (#46742) GH-46748 - [C++] Initial port on AIX (#46749) GH-46757 - [CI][Packaging][Conan] Synchronize upstream conan (#46758) GH-46763 - [CI][Dev] fix shellcheck errors in the ci/scripts/ccache_setup.sh (#46766) GH-46767 - [C++] Enable EqualOptions::use_atol_ for arrow::Array, arrow::Scalar, arrow::RecordBatch, and arrow::ChuckedArray (#46779) GH-46771 - [Python][C++] Implement pa.arange function to generate array sequences (#46778) GH-46773 - [GLib] Add GArrowFixedSizeListDataType (#46774) GH-46775 - [Docs] Fix navigation issues (#46784) GH-46785 - [CI][Dev][C++] Suppress needless outputs of cpplint with pre-commit (#46786) GH-46787 - [CI][Integration] Use Node.js 20 (#46790) GH-46788 - [C++][Parquet] Enable SIMD for byte stream split with 2 streams (#46789) GH-46791 - [C++] Add Status::OrElse, IntoStatus<T> and ToStatus (#46792) GH-46794 - [CI][Dev] Fix shellcheck errors in the ci/scripts/csharp_test.sh (#46795) GH-46798 - [CI][Dev] Add support for pre-commit 2.17.0 (#46799) GH-46801 - [Dev] Remove some leftovers for Java, Go, JS and Swift on some config files (#46802) GH-46803 - [Swift] Remove swift implementation from apache/arrow after migration to new repository (#46804) GH-46806 - [Ci][Dev][Swift] Remove Swift related settings (#46807) GH-46820 - [CI][Integration] Use Node.js 20 by default (#46821) GH-46833 - [Python] Expose ConfigureManagedIdentityCredential and ConfigureClientSecretCredential to AzureFileSystem on PyArrow (#46837) GH-46843 - [C++] Don't use unity build for bundled AWS SDK for C++ (#46845) GH-46846 - [CI][Dev] Fix shellcheck errors in the ci/scripts/install_dask.sh (#46847) GH-46854 - [CI][MATLAB][Packaging] Add support for MATLAB R2025a in CI and crossbow packaging workflows (#46855) GH-46864 - [C++] Add half-float test for ArrayFromJSONString (#46865) GH-46869 - [C++][Parquet] Deprecate arrow::Status parquet::arrow::FileReadeder::GetRecordBatchReader() (#46932) GH-46877 - [MATLAB] Add arrow.tabular.Table.fromRecordBatches static method (#46885) GH-46881 - [CI][Dev] Fix shellcheck errors in the ci/scripts/install_gcs_testbench.sh (#46882) GH-46895 - [CI][Dev] Fix shellcheck errors in the ci/scripts/install_minio.sh (#46896) GH-46899 - [CI][Dev] Fix shellcheck errors in the ci/scripts/install_numba.sh (#46900) GH-46909 - [CI][Dev] Fix shellcheck errors in the ci/scripts/install_sccache.sh (#46910) GH-46911 - [Packaging] Add support for AlmaLinux 10 (#46933) GH-46952 - [Packaging] Drop support for CentOS Stream 8 (#46953) GH-46959 - [Python][Packaging] Drop support for manylinux2014 (#46965) GH-46968 - [CI][Packaging] Synchronize conan files for 20.0.0 (#46966) GH-46974 - [Integration][Archery] Add support for ARROW_JS_ROOT (#46975) GH-47025 - [C++][Docs] Increase minimum gcc for building from 7.1 to 9 (#47026) GH-47081 - [Release] Revisit reproducible source archive verification" /> |
| <meta property="og:description" content="Apache Arrow 21.0.0 (17 July 2025) This is a major release covering more than 2 months of development. Download Source Artifacts Binary Artifacts For AlmaLinux For Amazon Linux For CentOS For C# For Debian For Python For Ubuntu Git tag Contributors This release includes 400 commits from 82 distinct contributors. $ git shortlog -sn apache-arrow-20.0.0..apache-arrow-21.0.0 78 Sutou Kouhei 37 Raúl Cumplido 33 Hiroyuki Sato 30 William Ayd 22 Antoine Pitrou 19 Bryce Mecum 18 Nic Crane 11 Alenka Frim 10 Dewey Dunnington 9 Jacob Wujciak-Jens 9 dependabot[bot] 8 mwish 7 Jonathan Keane 7 Rossi Sun 6 Sarah Gilmore 5 Arash Andishgar 4 Dongjoon Hyun 4 takuya kodama 3 David Li 3 Eddie Chang 3 Enrico Minack 3 Ian Cook 3 Lester Fan 3 Ziy 3 abandy 2 Abhinav 2 David Sherrier 2 Krisztián Szűcs 2 Rok Mihevc 2 gitmodimo 1 Adam Reeve 1 Akum Kang 1 Alina (Xi) Li 1 Anatolii Tsyplenkov 1 Antoine Prouvost 1 Benjamin Kietzman 1 Brian Hulette 1 Bruno 1 Carsten Haubold 1 ChiLin Chiu 1 Colin 1 DenisTarasyuk 1 Eric Dinse 1 Etienne Bacher 1 Even Rouault 1 Gang Wu 1 Guilherme Martins Crocetti 1 Hadrian Reppas 1 HyunWoo Oh 1 Igor Antropov 1 JB Onofré 1 Joshua 1 Junwang Zhao 1 Kevin Gurney 1 Kevin Wilson 1 Kirill Tsyganov 1 Konstantin Malanchev 1 Kyle Hemker 1 Lukas 1 Mateusz Rzeszutek 1 Matt Topol 1 Michael 1 Michael Chirico 1 NazilaAk 1 Patrick Walsh 1 Ranjit Ranjan 1 Roman Karlstetter 1 Saurabh Singh 1 Thomas Newton 1 Tommy Hughes IV 1 Xingyu Long 1 Zihan Qi 1 bw513 1 dawg 1 koenvo 1 leopardracer 1 lriggs 1 neilechao 1 omahs 1 shu-kitamura 1 yuri@FreeBSD 1 yyossy Patch Committers The following Apache committers merged contributed patches to the repository. $ git shortlog -sn --group=trailer:signed-off-by apache-arrow-20.0.0..apache-arrow-21.0.0 179 Sutou Kouhei 51 Antoine Pitrou 47 Raúl Cumplido 15 Nic Crane 13 AlenkaF 13 Bryce Mecum 12 Jacob Wujciak-Jens 7 David Li 7 Dewey Dunnington 7 mwish 6 Rossi Sun 5 Curt Hagenlocher 5 Jonathan Keane 5 Sarah Gilmore 4 Rok Mihevc 3 Gang Wu 3 Will Ayd 2 Kevin Gurney 2 Krisztian Szucs 1 Benjamin Kietzman Changelog Bug Fixes GH-32276 - [C++][FlightRPC] Add option to align RecordBatch buffers given to IPC reader (#44279) GH-35166 - [C++][Compute] Increase precision of decimals in sum aggregates (#44184) GH-39811 - [R] better documentation for col_types argument in open_delim_dataset (#45719) GH-40756 - [C++] Remove dead Boost urls (#46452) GH-43132 - [CI] Fix pre-commit Rat check (#46541) GH-44366 - [Python][Acero] RecordBatch.filter on expression raises error if result set is empty (#46057) GH-44502 - [R] Negative fractional dates must be converted to integers by floor, not trunc (#46873) GH-44910 - [Swift] Fix IPC stream reader and writer impl (#45029) GH-45292 - [Python] test_dtypes hypotesis test fails sporadically (#46029) GH-45532 - [C++] RunEndEncodedBuilder should clear dimensions after a Finish() call (#45533) GH-45534 - [C++] Test: RunEndEncodeTableColumns should update REE columns' schema types (#45535) GH-45608 - [C++][Flight] Fix compilation for clang (#46264) GH-45716 - [R][CI] Refactor skip_on_python_older_than to not initialize reticulate (#46079) GH-45735 - [C++] Broken tests for extract_regex compute funcion (#45900) GH-45853 - [C++][Dev] Fix Meson compilation issues in Docker builds (#45858) GH-46011 - [C++] Hide DCHECK family from public headers (#46015) GH-46025 - [C++] Use ARROW_CUDA_EXPORT instead of ARROW_EXPORT for libarrow_cuda (#46030) GH-46052 - [C++][Benchmarking] Don't build grouper benchmark without ARROW_COMPUTE=ON (#46053) GH-46065 - [Release] Don't use --verify-tag for gh release upload in 02-source.sh (#46066) GH-46068 - [Release] Remove needless docs:rc task from 05-binary-upload.sh (#46069) GH-46070 - [C++] Remove duplicate storage_type in JsonExtension (#46071) GH-46080 - [Python][Docs] Provide guidance for tzdata related issues if installing with pip (#46591) GH-46084 - [C++] Always use ARROW_VCPKG to detect vcpkg mode (#46467) GH-46090 - [C++] Set default IPC option to enabled in Meson (#46114) GH-46094 - [C++][Docs] Add note to RleDecoder::Get's doc comment (#46874) GH-46121 - [Python] Add missing column_index argument to ArrowReaderProperties::read_dictionary's Cython binding (#46122) GH-46127 - [CI][Release] Make 02-source.sh test passable on fork (#46143) GH-46146 - [C++] Merge metadata in SchemaBuidler::AddMetadata (#46654) GH-46149 - [C++] Opening dataset fails with sshfs-3.7.3 due to F_RDADVISE error (#46346) GH-46157 - [C++] Move test utility RunEndEncodeTableColumns that uses REE to test_util_internal on acero instead of common gtest_util (#46161) GH-46174 - [Python] Failing tests in python minimal builds (#46175) GH-46192 - [C++] Add substrait dep to third party download script (#46191) GH-46197 - [C++] Tests use legacy timezones (#46201) GH-46214 - [C++] Improve S3 client initialization (#46723) GH-46224 - [C++][Acero] Fix the hang in asof join (#46300) GH-46231 - [C++][CMake] Fix arrow_bundled_dependencies to be externally accessible by FetchContent (#46232) GH-46233 - [C++] Fix missing nested braces in QueuedTask initialization (#46234) GH-46236 - [Release][Packaging] Fix dev/release/post-03-binary.sh errors (#46237) GH-46238 - [Release][Python] Use array to avoid empty argument in dev/release/post-11-python.sh (#46239) GH-46240 - [Release][Packaging] Fix a bug that existing APT repositories' metadata are lost (#46287) GH-46242 - [Release] Don't show gpg signature when getting release time (#46243) GH-46259 - [CI] Remove deprecated flag from mamba info (#46260) GH-46262 - [CI][Ruby] Don't update GCC of MSYS2 (#46278) GH-46268 - [C++] Improve ArrayData docstrings (#46271) GH-46270 - [C++][Parquet] Clarify GeoStatistics docstring (#46649) GH-46284 - [Release][Packaging] Add missing APT metadata for .ddeb (#46288) GH-46296 - [Swift] Add support for reading struct (#46302) GH-46299 - [C++][Compute] Don't use static inline const for default options (#46303) GH-46304 - [Release][Packaging] Use optimized debug build for .deb (#46392) GH-46306 - [C++][Parquet] Should use LoadEnumSafe for geo enum (#46307) GH-46314 - [C++][Parquet] Fix valgrind error when collecting parameterized tests for MakeWKBPoint (#46320) GH-46326 - [C++][Parquet] Fix stack overflow in rapidjson value comparison to integer (#46327) GH-46333 - [CI] Always pass --yes to mamba clean (#46341) GH-46333 - [CI] Explicitly pass --yes to mamba clean (#46334) GH-46343 - [CI][Python] Remove workaround for gdb packaging issue (#46848) GH-46343 - [CI] Avoid installing gdb 16.3 on python 3.10 jobs to fix CI (#46511) GH-46344 - [CI][Python] Skip doctest for s3.get_file_info to avoid bucket restrictions (#46345) GH-46351 - [Archery][Docs] Fix the cli argument parsing bug in docker subcommand (#46352) GH-46355 - [Python] Fix table.to_struct_array with an empty table (#46357) GH-46359 - [C++][Thirdparty] Bump Apache ORC to 2.1.2 (#46360) GH-46362 - [CGLib][Packaging] Use -fPIE explicitly for g-ir-scanner (#46366) GH-46363 - [CI][Packaging] Use mono from community repository on Alpine instead of from testing (#46364) GH-46394 - [C++][R] gcc-UBSAN errors on CRAN (#46397) GH-46395 - [C++][Statistics] Use EqualOptions for min and max in arrow::ArrayStatistics::Equals() (#46422) GH-46407 - [C++] Fix IPC serialization of sliced list arrays (#46408) GH-46414 - [C++] Fix GCS filesystem getFileInfo method (#46416) GH-46417 - [C++][Parquet] Fix UB in LoadEnumSafe for EdgeInterpolationAlgorithm (#46418) GH-46419 - [C++] Remove duplicate declaration and sync arg names on acero test_util_internal functions (#45400) GH-46420 - [C++][Dataset] Fix DatasetWriter deadlock on writting batch greater than max_rows_queued (#46139) GH-46424 - [C++][Parquet] Fix erroneous unit test skip (#46425) GH-46435 - [Parquet][C++] Fix uninitialized value in writer test (#46533) GH-46442 - [R] hms::as_hms tests fail on some of our crossbow builds (#46443) GH-46456 - [GLib] Add missing since: tag (#46457) GH-46478 - [C++] Implement recent JSON changes into Meson configuration (#46479) GH-46481 - [C++][Python] Allow nullable schema in FlightInfo (#46489) GH-46512 - [CI][C++] Install the llvm package explicitly on MSYS2 (#46525) GH-46516 - [CI][Python] Force Cython>3.1.1 for docs builds (#46770) GH-46523 - [GLib] Fix compiler warning: use gsize instead of int (#46524) GH-46538 - [CI][Packaging][AlmaLinux8] Ensure pip3 (#46539) GH-46564 - [C++] Export ARROW_VCPKG in ArrowConfig.cmake (#46565) GH-46576 - [C++] Suppress codecvt_utf8 deprecation warning (#46622) GH-46589 - [C++] Fix utf8_is_digit to support full Unicode digit range (#46590) GH-46593 - [CI][Integration] Disable nested log grouping (#46594) GH-46598 - [Dev] Use language name for alias (#46602) GH-46599 - [C++][Doc][Parquet] Update supported types documentation (#46620) GH-46605 - [CI][Release][C#] Update download URL for dotnet on verification script (#46612) GH-46606 - [Python] Do not require numpy when normalizing slice (#46732) GH-46609 - [Release][CI] Use System GTest for macos verification (#46823) GH-46610 - [CI][Release] Use Python 3.12 on AlmaLinux 8 (#46621) GH-46611 - [Python][C++] Allow building float16 arrays without numpy (#46618) GH-46623 - [C++][Compute] Fix the failure of large memory test in arrow-compute-row-test (#46635) GH-46636 - [R] Fix evaluation of external objects not in global environment in case_when() (#46667) GH-46659 - [C++] Fix export of extension arrays with binary view/string view storage (#46660) GH-46673 - [CI][R][Docs] Accept empty INSTALL_ARGS again (#46682) GH-46674 - [C++] Construct Array from ExtensionType Scalar (#46675) GH-46684 - [C++] Fix Meson configuration issue on Windows (#46685) GH-46688 - [Ruby] Fix a typo (#46689) GH-46691 - [CI][Packaging] Update platform tag on generated wheel name to match newest auditwheel naming (#46705) GH-46693 - [CI] Update GitHub hosted runner from deprecated windows-2019 to windows-2022 (#46694) GH-46704 - [C++] Fix OSS-Fuzz build failure (#46706) GH-46708 - [C++][Gandiva] Added zero return values for castDECIMAL_utf8 (#46709) GH-46710 - [C++] Fix ownership and lifetime issues in Dataset Writer (#46711) GH-46717 - [R][Docs] Add missing "internal" keywords for internal function (#46722) GH-46724 - [C++][Parquet] OSSFuzz: Prevent from Bad-cast in handling statistics (#46725) GH-46729 - [Python] Allow constructing InMemoryDataset from RecordBatchReader (#46731) GH-46736 - [CI] Disable Parquet in conan-minimum (#46744) GH-46761 - [C++] Add executable detection on FreeBSD (#46759) GH-46764 - [C++][Gandiva] Fix wrong .bc depends (#46765) GH-46777 - [C++] Use SimplifyIsIn only when the value_set of the expression is lower than a threshold (#46859) GH-46782 - [Docs] Link to same version of docs from Implementations page GH-46805 - [CI][Dev] Fix caching for R hooks in lint job (#46812) GH-46809 - [CI][Packaging] Stop trying to add headers from arrow/compu… (#46810) GH-46811 - [C++][Python] Fix crash on FileReaderImpl::GetRecordBatchReader (#46931) GH-46816 - [Docs] Fix links to Swift docs and source (#46817) GH-46827 - [C++] Update Meson Configuration for compute shared lib (#46839) GH-46831 - [C++][R] Remove some pending references to CMake < 3.25 (docs + minor CMake references) (#46834) GH-46841 - [C++][Gandiva] Fix date trunc edge case (#46842) GH-46863 - [CI][C++] Suppress a false positive UBSAN error in AWS SDK for C++ (#46870) GH-46871 - [C++][Parquet] Restore implementation of 3 arrow::FileReader::GetRecordBatchReader() functions (#46868) GH-46879 - [CI][Packaging][Linux] Don't check example build with old CMake (#46880) GH-46888 - [C++] Remove override of default buildtype in Meson config (#46919) GH-46915 - [C++][Compute] Initialize Compute kernels on benchmarks that require extra kernels (#46922) GH-46916 - [R] Test for negative fractional dates fails on older R versions due to change in base R as.Date() (#46917) GH-46920 - [FlightRPC] Fix Flight SQL ColumnMetadata retrieval (#46921) GH-46934 - [C++][Parquet] Trying to fix ub in AttachStatistics (#46940) GH-46947 - [R][Packaging] Add src/arrow/flight/sql/odbc to source excludes (#46948) GH-46964 - [CI][Packaging][Conan] Ensure using upper case for config suffix (#46967) GH-46986 - [CI][C++] Fix a build error with C++20 (#46987) GH-46988 - [C++][Parquet] Fix FLBA DecodeArrow multiply overflow (#46991) GH-46989 - [CI][R] Use Ubuntu 20.04 instead of OpenSUSE for R 4.1 (#46990) GH-46995 - [CI][R][C++] Use system memory allocator in sanitizer jobs (#47007) GH-46998 - [C++] Fix mockfs.cc compiling error with C++23 (#46999) GH-47015 - [CI][C++] Use mold on conda-cpp to work around issues with GNU ld (#47028) GH-47033 - [C++][Compute] Never use custom gtest main with MSVC (#47049) GH-47037 - [CI][C++] Fix Fedora 39 CI jobs (#47038) GH-47061 - [Release] Fix wrong variable name for signing (#47062) GH-47063 - [Release] Define missing RELEASE_TARBALL (#47064) GH-47065 - [Release] Fix timeout key in verify_rc.yml (#47066) GH-47067 - [Release] Fix wrong GitHub Actions context in verify_rc.yml (#47068) GH-47069 - [Release] Add missing "needs: target" (#47070) GH-47071 - [Release] Dereference all hard links in source archive (#47072) GH-47074 - [Release] Use reproducible mtime for csharp/ in source archive (#47076) GH-47078 - [Release] Ensure using cloned apache/arrow for reproducible check (#47079) GH-47092 - [Release] Binary verification CI jobs are failing New Features and Improvements GH-25025 - [C++] Move non core compute kernels into separate shared library (#46261) GH-26818 - [C++][Python] Preserve order when writing dataset multi-threaded (#44470) GH-35419 - [GLib] Add GArrowFixedShapeTensorDataType (#46305) GH-35644 - [MATLAB] Add tests verifying arrow.array.<Type>Array.fromMATLAB() throws an exception if given an array with the wrong type. (#47020) GH-36753 - [C++] Properly pretty-print and diff HalfFloatArrays (#46857) GH-37027 - [C++] Add float16 kernels to if-else and vector-replace functions (#46446) GH-37561 - [Ruby] Add empty chunked array tests for Arrow::Table#each_raw_records (#46862) GH-37577 - [MATLAB] Create a superclass for DateType-related MATLAB tests (#46923) GH-37677 - [C++][FlightRPC] Allow FlightInfo.schema to be nullable GH-37891 - [C++][Parquet] Refine several classes in Parquet encryption (#46202) GH-37891 - [C++] Followup Buffer change to use sptr move (#46027) GH-38214 - [MATLAB] Add a common arrow.tabular.Tabular MATLAB interface (#47014) GH-38369 - [MATLAB] Create utility functions for simplifying management of Proxy instances for Arrays (#46907) GH-38903 - [R][Docs] Improve documentation of col_types (#46145) GH-38914 - [Python] Add EncryptionConfiguration.uniform_encryption (#46347) GH-39294 - [C++][Python] DLPack on Tensor class (#42118) GH-39759 - [Docs] Update pydata-sphinx-theme to 0.16.1 (#46943) GH-40278 - [C++] Support casting string to duration in CSV converter (#46035) GH-40343 - [C++] Move S3FileSystem to the registry (#41559) GH-40754 - [Python] Expose tls_ca_file_path to S3FileSystem (#45881) GH-41496 - [Python][Azure][Docs] Turn on azure on debian-docs (#46892) GH-41672 - [Python][Doc] Clarify docstring of FixedSizeListArray.values that it ignores the offset (#46144) GH-41973 - Expose new S3 option check_directory_existence_before_creation - manual rebase (#46619) GH-42012 - [Python] Add Schema with_field or set_field method (#46348) GH-43041 - [C++][Python] Read/write Parquet BYTE_ARRAY as Large/View types directly (#46532) GH-43170 - [Swift] Add StructArray support to ArrowWriter (#43439) GH-43623 - [R] remove libarrow backwards compatibility enforcement (#46491) GH-43807 - [C++][Python] Add UUID extension type conversion support to/from Parquet (#45866) GH-43891 - [C++][Parquet] Faster reading of FIXED_LEN_BYTE_ARRAY data (#46886) GH-44208 - [R] Adding test to ensure bit64's new semantic works with arrow (#46651) GH-44435 - [GLib] Add distinct count support to GArrowArrayStatistics (#46894) GH-44500 - [Python][Parquet] Map Parquet logical types to Arrow extension types by default (#46772) GH-44900 - [Python] Support explicit fsspec+{protocol} and hf:// filesystem URIs (#45089) GH-44953 - [R] Add R bindings for new compute functions (#44971) GH-45028 - [C++][Compute] Allow cast to reorder struct fields (#45246) GH-45083 - [C++] Add HalfFloat kernels for is_nan, is_inf, is_finite, negate, negate_checked, sign (#46866) GH-45195 - [C++] Update bundled AWS SDK for C++ to 1.11.587 (#45306) GH-45229 - [Python] Migrate from scipy.spmatrix to scipy.sparray (#46423) GH-45229 - [Python] skip scipy.sparse roundtrip tests for float16 (#46413) GH-45290 - [Docs][Release] Change show_version_warning_banner substitution (#46883) GH-45522 - [Parquet][C++] Parquet GEOMETRY and GEOGRAPHY logical type implementations (#45459) GH-45531 - [Python] Add the dim_names argument to from_numpy_ndarray (#46170) GH-45619 - [Python] Use f-string instead of string.format (#45629) GH-45643 - [R] Implement hms functions to create and manipulate time of day variables (#46206) GH-45653 - [Python] Scalar subclasses should implement Python protocols (#45818) GH-45664 - [C++] Allow LargeString,LargeBinary,FixedSizeBinary,StringView and BinaryView for RecordBatch::MakeStatisticsArray() (#46031) GH-45713 - [GLib] Add garrow_chunked_array_(import|export)() (#46876) GH-45750 - [C++][Python][Parquet] Implement Content-Defined Chunking for the Parquet writer (#45360) GH-45794 - [C++] Add array directory to Meson configuration (#45795) GH-45796 - [C++] Add integration directory to Meson configuration (#45797) GH-45798 - [C++] Add extension directory to Meson (#45799) GH-45800 - [C++] Implement util configuration in Meson (#45824) GH-45829 - [C++] Add compute directory to Meson configuration (#45830) GH-45833 - [C++] Add JSON directory to Meson configuration (#45834) GH-45865 - [C++] Create dedicated benchmark dependency in Meson (#45909) GH-45908 - [C++][Docs] Rename and expose basic {Array,...}FromJSON helpers as public APIs (#46180) GH-45957 - [C++][Python] Expose allow_delayed_open on S3FileSystem (#46078) GH-45978 - [C++] Bump bundled mimalloc version (#45979) GH-45991 - [C++] Bump bundled nlohmann_json to v3.12.0 (#46112) GH-45992 - [C++] Bump bundled utf8proc version to 2.10.0 (#46032) GH-46019 - [Python] Raise TypeError on feather read_table if columns is not a Sequence (#46038) GH-46054 - [Python][Packaging] Re-enable pandas on Windows free-threaded wheel (#46109) GH-46058 - [Python] Run Python in AppVeyor outside of source directory (#46059) GH-46087 - [FlightSQL] Allow returning column remarks in FlightSQL's CommandGetTables (#46110) GH-46091 - [C++] Use feature options in Meson configuration (#46204) GH-46092 - [C++] Add filesystem related options to Meson (#46101) GH-46104 - GH-45937: [C++][Parquet] Logical type definition for variant GH-46115 - [C++] Implement compression libraries in Meson (#46358) GH-46116 - [C++] Implement IPC directory in Meson (#46117) GH-46118 - [C++] Add tensor directory to Meson (#46119) GH-46130 - [Python] Remove use_legacy_format in favour of setting IpcWriteOptions (#46131) GH-46132 - [C++][Parquet] Remove deprecated parquet APIs from 19.0.0 (#46133) GH-46141 - [C++] Add flight directory to Meson configuration (#46142) GH-46153 - [C++] Implement acero directory in Meson (#46154) GH-46155 - [C++] Implement Tensorflow directory in Meson (#46156) GH-46163 - [C++] Add vendored directory to Meson (#46164) GH-46189 - [C#] Use pooled buffers in ArrowStreamWriter (#46190) GH-46196 - [C++] Remove ARROW_USE_PRECOMPILED_HEADERS and related logic (#46200) GH-46198 - [Python] Remove deprecated PyExtensionType (#46199) GH-46207 - [C++] Rename arrow::util::StringBuilder and move to internal namespace (#46813) GH-46209 - [Documentation][C++][Compute] Add cpp developer documentation for row table (#46210) GH-46215 - [C++][Docs] Add README for Meson subprojects directory (#46216) GH-46217 - [C++][Parquet] Update the timestamp of parquet::encryption::TwoLevelCacheWithExpiration correctly (#46283) GH-46219 - [C++][Parquet] Remove PARQUET_MINIMAL_DEPENDENCY option (#46274) GH-46222 - [Python] Allow to specify footer metadata when opening IPC file for writing (#46354) GH-46241 - [Release][Packaging] Add support for regenerating metadata of APT repositories (#46277) GH-46245 - [Swift] Upgrade FlatBuffers to v25.2.10 (#46246) GH-46250 - [Swift] Update swift-tools-version to 5.10 (#46252) GH-46285 - [C++] Add support for Decimal32/64 and HalfFloat to run_end_encode/run_end_decode (#46286) GH-46289 - [Release][Packaging] Verify APT/Yum repositories keeps working for old versions (#46292) GH-46290 - [Swift] Upgrade grpc-swift to 1.25.0 and swift-protobuf to 1.29.0 (#46291) GH-46318 - [Docs][C++] Add Extension Array/Type documents (#46319) GH-46321 - [C++][Doc] Better explain ArrayData IsValid and GetNullCount (#46332) GH-46336 - [Release][Packaging] Add support for Reproducible Builds for source archive (#46342) GH-46338 - [C++] Add compile step for Meson in cpp_build.sh (#46339) GH-46349 - [Python] Move parquet definitions to pyarrow/includes/libparquet.pxd (#46437) GH-46367 - [C++] Prevent Meson from using git info if built as subproject (#46368) GH-46373 - [Python] Exercise fallback case on tests for parquet.read_table in case dataset is not available (#46550) GH-46376 - [Docs] Replace Xitter link with BlueSky link (#46402) GH-46378 - [Docs] Remove references to autotune from the docs (#46379) GH-46380 - [GLib] Add GArrowFixedShapeDataType#shape (#46381) GH-46386 - [C++] Ensure using our CMake packages not Find*.cmake (#46387) GH-46388 - [C++] Check Snappy::snappy{,-static} in FindSnappyAlt.cmake (#46389) GH-46396 - [C++][Documentation][Statistics] Revise the documentation to clarify that arrow::ArrayStatistics is ignored during arrow::Array comparisons (#46470) GH-46398 - [GLib] Add GArrowFixedShapeTensorDataType#n_dimensions (#46399) GH-46400 - [GLib] Add GArrowFixedShapeDataType#permutation (#46401) GH-46403 - [C++] Add support for limiting element size when printing data (#46536) GH-46433 - [GLib] Add GArrowFixedShapeDataType#dim_names (#46434) GH-46439 - [C++] Use result pattern for all FromJSONString Helpers (#46696) GH-46439 - [C++] Rename internal Converter class in from_string.cc (#46697) GH-46439 - [C++] Remove unneeded namespace prefix in test_util_internal.h (#46695) GH-46444 - [Documentation][C++][Acero] Move internal Swiss table doc into public C++ developer doc (#46445) GH-46450 - [GLib] Add GArrowFixedShapeDataType#strides (#46451) GH-46459 - [C++] Make some arrow/util headers internal (#46721) GH-46462 - [C++][Parquet] Expose currently thrown EncodedStatistics when checking is_stats_set (#46463) GH-46473 - [C++][Docs] Fix typos in decimal comments (#46474) GH-46475 - [Documentation][C++][Compute] Consolidate Acero developer docs (#46476) GH-46477 - [C++] Use vendored flatbuffers in Meson configuration (#46484) GH-46482 - [CI][Dev] Add shellcheck files without change (#46483) GH-46487 - [C++] Refactor lz4 from ExternalProject to FetchContent (#46390) GH-46490 - [CI][Dev] Add shellcheck ci/scripts/install_ccache.sh (#46492) GH-46494 - [CI][Dev] Add shellcheck files without change (#46495) GH-46496 - [CI][Dev] Fix shellcheck SC2086 errors in ci/scripts directory (#46497) GH-46499 - [CI][Crossbow][C++] Use apache/arrow for Meson (#46501) GH-46500 - [CI][Java] Remove CI scripts for Java (#46502) GH-46508 - [C++] Upgrade OpenTelemetry cpp to avoid build error on recent Clang (#46509) GH-46520 - [Docs] Fix variety of warnings and errors in the docs build (#46521) GH-46522 - [C++][FlightRPC] Add Arrow Flight SQL ODBC driver (#40939) GH-46526 - [CI][Dev] Fix shellcheck SC2086 and SC2223 errors ci/scripts directory (#46527) GH-46528 - [CI][Dev] Remove "archery lint" (#46686) GH-46529 - [C++] Convert static inline type trait functions to constexpr (#46559) GH-46537 - [Docs][C++] Add RunEndEncodedArray, FlatArray, and PrimitiveArray API Docs (#46540) GH-46544 - [CI][Dev][Python] Use pre-commit for autopep8 (#46552) GH-46545 - [CI][Dev][Python] Update pre-commit for cython-lint (#46580) GH-46546 - [CI][Dev][Python] Use pre-commit for numpydoc (#46595) GH-46547 - [CI][Dev][R] Use pre-commit for lintr (#46581) GH-46548 - [CI][Dev][R] Use pre-commit for cpplint (#46549) GH-46551 - [C++] Use std::string_view for type schema API (#46553) GH-46556 - [GLib] Add GArrowUUIDDataType (#46558) GH-46569 - [CI][Integration] Use apache/arrow-js for JS (#46570) GH-46572 - [Python] expose filter option to python for join (#46566) GH-46585 - [JS][Dev] Remove dependabot configuration for JS (#46586) GH-46587 - [CI][JS] Remove JS related test CI (#46588) GH-46603 - [JS][Release] Remove JavaScript related release code (#46604) GH-46613 - [GLib] Add GArrowBaseListDataType (#46615) GH-46632 - [R][Docs] Add docs for arrow::one (#46648) GH-46633 - [Docs][C++][Python] Update CombineChunks documentation to specify that binary columns can be combined into multiple chunks (#46638) GH-46642 - [Format] Add footnote clarifying REE layout has O(log n) random access (#46643) GH-46645 - [CI][Dev][R] Use pre-commit for styler (#46664) GH-46652 - [Python][Docs] Update language for row_group_size parameter (#46653) GH-46656 - [CI][Dev] Fix shellcheck SC2034 and SC2086 errors in ci/scripts directory (#46657) GH-46662 - [CI][Dev] Fix shellcheck SC2148 errors in ci/scripts directory (#46663) GH-46665 - [CI][Crossbow][C++] Use apache/arrow for Alpine Linux (#46666) GH-46676 - [C++][Python][Parquet] Allow reading Parquet LIST data as LargeList directly (#46678) GH-46679 - [C++][Meson] Use WrapDB entry for gflags instead of CMake wrapper (#46680) GH-46683 - [C++][Python] Add utf8_zero_fill compute function for sign-aware zero padding (#46815) GH-46699 - [CI][Dev] fix shellcheck errors in the ci/scripts/cpp_test.sh (#46700) GH-46702 - [JS] Remove js/ (#46703) GH-46714 - [C++] Use hidden symbol visibility in Meson configuration (#46715) GH-46719 - [R] Add 32 and 64 bit Decimal types (#46720) GH-46726 - [CI][Dev] fix shellcheck errors in the ci/scripts/conan_build.sh (#46727) GH-46740 - [C++] Update bundled Thrift GH-46745 - [C++] Update bundled Boost to 1.88.0 and Apache Thrift to 0.22.0 (#46912) GH-46746 - [C++] Assume AWS SDK >= 1.11.0 (#46742) GH-46748 - [C++] Initial port on AIX (#46749) GH-46757 - [CI][Packaging][Conan] Synchronize upstream conan (#46758) GH-46763 - [CI][Dev] fix shellcheck errors in the ci/scripts/ccache_setup.sh (#46766) GH-46767 - [C++] Enable EqualOptions::use_atol_ for arrow::Array, arrow::Scalar, arrow::RecordBatch, and arrow::ChuckedArray (#46779) GH-46771 - [Python][C++] Implement pa.arange function to generate array sequences (#46778) GH-46773 - [GLib] Add GArrowFixedSizeListDataType (#46774) GH-46775 - [Docs] Fix navigation issues (#46784) GH-46785 - [CI][Dev][C++] Suppress needless outputs of cpplint with pre-commit (#46786) GH-46787 - [CI][Integration] Use Node.js 20 (#46790) GH-46788 - [C++][Parquet] Enable SIMD for byte stream split with 2 streams (#46789) GH-46791 - [C++] Add Status::OrElse, IntoStatus<T> and ToStatus (#46792) GH-46794 - [CI][Dev] Fix shellcheck errors in the ci/scripts/csharp_test.sh (#46795) GH-46798 - [CI][Dev] Add support for pre-commit 2.17.0 (#46799) GH-46801 - [Dev] Remove some leftovers for Java, Go, JS and Swift on some config files (#46802) GH-46803 - [Swift] Remove swift implementation from apache/arrow after migration to new repository (#46804) GH-46806 - [Ci][Dev][Swift] Remove Swift related settings (#46807) GH-46820 - [CI][Integration] Use Node.js 20 by default (#46821) GH-46833 - [Python] Expose ConfigureManagedIdentityCredential and ConfigureClientSecretCredential to AzureFileSystem on PyArrow (#46837) GH-46843 - [C++] Don't use unity build for bundled AWS SDK for C++ (#46845) GH-46846 - [CI][Dev] Fix shellcheck errors in the ci/scripts/install_dask.sh (#46847) GH-46854 - [CI][MATLAB][Packaging] Add support for MATLAB R2025a in CI and crossbow packaging workflows (#46855) GH-46864 - [C++] Add half-float test for ArrayFromJSONString (#46865) GH-46869 - [C++][Parquet] Deprecate arrow::Status parquet::arrow::FileReadeder::GetRecordBatchReader() (#46932) GH-46877 - [MATLAB] Add arrow.tabular.Table.fromRecordBatches static method (#46885) GH-46881 - [CI][Dev] Fix shellcheck errors in the ci/scripts/install_gcs_testbench.sh (#46882) GH-46895 - [CI][Dev] Fix shellcheck errors in the ci/scripts/install_minio.sh (#46896) GH-46899 - [CI][Dev] Fix shellcheck errors in the ci/scripts/install_numba.sh (#46900) GH-46909 - [CI][Dev] Fix shellcheck errors in the ci/scripts/install_sccache.sh (#46910) GH-46911 - [Packaging] Add support for AlmaLinux 10 (#46933) GH-46952 - [Packaging] Drop support for CentOS Stream 8 (#46953) GH-46959 - [Python][Packaging] Drop support for manylinux2014 (#46965) GH-46968 - [CI][Packaging] Synchronize conan files for 20.0.0 (#46966) GH-46974 - [Integration][Archery] Add support for ARROW_JS_ROOT (#46975) GH-47025 - [C++][Docs] Increase minimum gcc for building from 7.1 to 9 (#47026) GH-47081 - [Release] Revisit reproducible source archive verification" /> |
| <link rel="canonical" href="https://arrow.apache.org/release/21.0.0.html" /> |
| <meta property="og:url" content="https://arrow.apache.org/release/21.0.0.html" /> |
| <meta property="og:site_name" content="Apache Arrow" /> |
| <meta property="og:image" content="https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png" /> |
| <meta property="og:type" content="article" /> |
| <meta property="article:published_time" content="2025-07-17T00:00:00-04:00" /> |
| <meta name="twitter:card" content="summary_large_image" /> |
| <meta property="twitter:image" content="https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png" /> |
| <meta property="twitter:title" content="Apache Arrow 21.0.0 Release" /> |
| <script type="application/ld+json"> |
| {"@context":"https://schema.org","@type":"BlogPosting","dateModified":"2025-07-17T00:00:00-04:00","datePublished":"2025-07-17T00:00:00-04:00","description":"Apache Arrow 21.0.0 (17 July 2025) This is a major release covering more than 2 months of development. Download Source Artifacts Binary Artifacts For AlmaLinux For Amazon Linux For CentOS For C# For Debian For Python For Ubuntu Git tag Contributors This release includes 400 commits from 82 distinct contributors. $ git shortlog -sn apache-arrow-20.0.0..apache-arrow-21.0.0 78 Sutou Kouhei 37 Raúl Cumplido 33 Hiroyuki Sato 30 William Ayd 22 Antoine Pitrou 19 Bryce Mecum 18 Nic Crane 11 Alenka Frim 10 Dewey Dunnington 9 Jacob Wujciak-Jens 9 dependabot[bot] 8 mwish 7 Jonathan Keane 7 Rossi Sun 6 Sarah Gilmore 5 Arash Andishgar 4 Dongjoon Hyun 4 takuya kodama 3 David Li 3 Eddie Chang 3 Enrico Minack 3 Ian Cook 3 Lester Fan 3 Ziy 3 abandy 2 Abhinav 2 David Sherrier 2 Krisztián Szűcs 2 Rok Mihevc 2 gitmodimo 1 Adam Reeve 1 Akum Kang 1 Alina (Xi) Li 1 Anatolii Tsyplenkov 1 Antoine Prouvost 1 Benjamin Kietzman 1 Brian Hulette 1 Bruno 1 Carsten Haubold 1 ChiLin Chiu 1 Colin 1 DenisTarasyuk 1 Eric Dinse 1 Etienne Bacher 1 Even Rouault 1 Gang Wu 1 Guilherme Martins Crocetti 1 Hadrian Reppas 1 HyunWoo Oh 1 Igor Antropov 1 JB Onofré 1 Joshua 1 Junwang Zhao 1 Kevin Gurney 1 Kevin Wilson 1 Kirill Tsyganov 1 Konstantin Malanchev 1 Kyle Hemker 1 Lukas 1 Mateusz Rzeszutek 1 Matt Topol 1 Michael 1 Michael Chirico 1 NazilaAk 1 Patrick Walsh 1 Ranjit Ranjan 1 Roman Karlstetter 1 Saurabh Singh 1 Thomas Newton 1 Tommy Hughes IV 1 Xingyu Long 1 Zihan Qi 1 bw513 1 dawg 1 koenvo 1 leopardracer 1 lriggs 1 neilechao 1 omahs 1 shu-kitamura 1 yuri@FreeBSD 1 yyossy Patch Committers The following Apache committers merged contributed patches to the repository. $ git shortlog -sn --group=trailer:signed-off-by apache-arrow-20.0.0..apache-arrow-21.0.0 179 Sutou Kouhei 51 Antoine Pitrou 47 Raúl Cumplido 15 Nic Crane 13 AlenkaF 13 Bryce Mecum 12 Jacob Wujciak-Jens 7 David Li 7 Dewey Dunnington 7 mwish 6 Rossi Sun 5 Curt Hagenlocher 5 Jonathan Keane 5 Sarah Gilmore 4 Rok Mihevc 3 Gang Wu 3 Will Ayd 2 Kevin Gurney 2 Krisztian Szucs 1 Benjamin Kietzman Changelog Bug Fixes GH-32276 - [C++][FlightRPC] Add option to align RecordBatch buffers given to IPC reader (#44279) GH-35166 - [C++][Compute] Increase precision of decimals in sum aggregates (#44184) GH-39811 - [R] better documentation for col_types argument in open_delim_dataset (#45719) GH-40756 - [C++] Remove dead Boost urls (#46452) GH-43132 - [CI] Fix pre-commit Rat check (#46541) GH-44366 - [Python][Acero] RecordBatch.filter on expression raises error if result set is empty (#46057) GH-44502 - [R] Negative fractional dates must be converted to integers by floor, not trunc (#46873) GH-44910 - [Swift] Fix IPC stream reader and writer impl (#45029) GH-45292 - [Python] test_dtypes hypotesis test fails sporadically (#46029) GH-45532 - [C++] RunEndEncodedBuilder should clear dimensions after a Finish() call (#45533) GH-45534 - [C++] Test: RunEndEncodeTableColumns should update REE columns' schema types (#45535) GH-45608 - [C++][Flight] Fix compilation for clang (#46264) GH-45716 - [R][CI] Refactor skip_on_python_older_than to not initialize reticulate (#46079) GH-45735 - [C++] Broken tests for extract_regex compute funcion (#45900) GH-45853 - [C++][Dev] Fix Meson compilation issues in Docker builds (#45858) GH-46011 - [C++] Hide DCHECK family from public headers (#46015) GH-46025 - [C++] Use ARROW_CUDA_EXPORT instead of ARROW_EXPORT for libarrow_cuda (#46030) GH-46052 - [C++][Benchmarking] Don't build grouper benchmark without ARROW_COMPUTE=ON (#46053) GH-46065 - [Release] Don't use --verify-tag for gh release upload in 02-source.sh (#46066) GH-46068 - [Release] Remove needless docs:rc task from 05-binary-upload.sh (#46069) GH-46070 - [C++] Remove duplicate storage_type in JsonExtension (#46071) GH-46080 - [Python][Docs] Provide guidance for tzdata related issues if installing with pip (#46591) GH-46084 - [C++] Always use ARROW_VCPKG to detect vcpkg mode (#46467) GH-46090 - [C++] Set default IPC option to enabled in Meson (#46114) GH-46094 - [C++][Docs] Add note to RleDecoder::Get's doc comment (#46874) GH-46121 - [Python] Add missing column_index argument to ArrowReaderProperties::read_dictionary's Cython binding (#46122) GH-46127 - [CI][Release] Make 02-source.sh test passable on fork (#46143) GH-46146 - [C++] Merge metadata in SchemaBuidler::AddMetadata (#46654) GH-46149 - [C++] Opening dataset fails with sshfs-3.7.3 due to F_RDADVISE error (#46346) GH-46157 - [C++] Move test utility RunEndEncodeTableColumns that uses REE to test_util_internal on acero instead of common gtest_util (#46161) GH-46174 - [Python] Failing tests in python minimal builds (#46175) GH-46192 - [C++] Add substrait dep to third party download script (#46191) GH-46197 - [C++] Tests use legacy timezones (#46201) GH-46214 - [C++] Improve S3 client initialization (#46723) GH-46224 - [C++][Acero] Fix the hang in asof join (#46300) GH-46231 - [C++][CMake] Fix arrow_bundled_dependencies to be externally accessible by FetchContent (#46232) GH-46233 - [C++] Fix missing nested braces in QueuedTask initialization (#46234) GH-46236 - [Release][Packaging] Fix dev/release/post-03-binary.sh errors (#46237) GH-46238 - [Release][Python] Use array to avoid empty argument in dev/release/post-11-python.sh (#46239) GH-46240 - [Release][Packaging] Fix a bug that existing APT repositories' metadata are lost (#46287) GH-46242 - [Release] Don't show gpg signature when getting release time (#46243) GH-46259 - [CI] Remove deprecated flag from mamba info (#46260) GH-46262 - [CI][Ruby] Don't update GCC of MSYS2 (#46278) GH-46268 - [C++] Improve ArrayData docstrings (#46271) GH-46270 - [C++][Parquet] Clarify GeoStatistics docstring (#46649) GH-46284 - [Release][Packaging] Add missing APT metadata for .ddeb (#46288) GH-46296 - [Swift] Add support for reading struct (#46302) GH-46299 - [C++][Compute] Don't use static inline const for default options (#46303) GH-46304 - [Release][Packaging] Use optimized debug build for .deb (#46392) GH-46306 - [C++][Parquet] Should use LoadEnumSafe for geo enum (#46307) GH-46314 - [C++][Parquet] Fix valgrind error when collecting parameterized tests for MakeWKBPoint (#46320) GH-46326 - [C++][Parquet] Fix stack overflow in rapidjson value comparison to integer (#46327) GH-46333 - [CI] Always pass --yes to mamba clean (#46341) GH-46333 - [CI] Explicitly pass --yes to mamba clean (#46334) GH-46343 - [CI][Python] Remove workaround for gdb packaging issue (#46848) GH-46343 - [CI] Avoid installing gdb 16.3 on python 3.10 jobs to fix CI (#46511) GH-46344 - [CI][Python] Skip doctest for s3.get_file_info to avoid bucket restrictions (#46345) GH-46351 - [Archery][Docs] Fix the cli argument parsing bug in docker subcommand (#46352) GH-46355 - [Python] Fix table.to_struct_array with an empty table (#46357) GH-46359 - [C++][Thirdparty] Bump Apache ORC to 2.1.2 (#46360) GH-46362 - [CGLib][Packaging] Use -fPIE explicitly for g-ir-scanner (#46366) GH-46363 - [CI][Packaging] Use mono from community repository on Alpine instead of from testing (#46364) GH-46394 - [C++][R] gcc-UBSAN errors on CRAN (#46397) GH-46395 - [C++][Statistics] Use EqualOptions for min and max in arrow::ArrayStatistics::Equals() (#46422) GH-46407 - [C++] Fix IPC serialization of sliced list arrays (#46408) GH-46414 - [C++] Fix GCS filesystem getFileInfo method (#46416) GH-46417 - [C++][Parquet] Fix UB in LoadEnumSafe for EdgeInterpolationAlgorithm (#46418) GH-46419 - [C++] Remove duplicate declaration and sync arg names on acero test_util_internal functions (#45400) GH-46420 - [C++][Dataset] Fix DatasetWriter deadlock on writting batch greater than max_rows_queued (#46139) GH-46424 - [C++][Parquet] Fix erroneous unit test skip (#46425) GH-46435 - [Parquet][C++] Fix uninitialized value in writer test (#46533) GH-46442 - [R] hms::as_hms tests fail on some of our crossbow builds (#46443) GH-46456 - [GLib] Add missing since: tag (#46457) GH-46478 - [C++] Implement recent JSON changes into Meson configuration (#46479) GH-46481 - [C++][Python] Allow nullable schema in FlightInfo (#46489) GH-46512 - [CI][C++] Install the llvm package explicitly on MSYS2 (#46525) GH-46516 - [CI][Python] Force Cython>3.1.1 for docs builds (#46770) GH-46523 - [GLib] Fix compiler warning: use gsize instead of int (#46524) GH-46538 - [CI][Packaging][AlmaLinux8] Ensure pip3 (#46539) GH-46564 - [C++] Export ARROW_VCPKG in ArrowConfig.cmake (#46565) GH-46576 - [C++] Suppress codecvt_utf8 deprecation warning (#46622) GH-46589 - [C++] Fix utf8_is_digit to support full Unicode digit range (#46590) GH-46593 - [CI][Integration] Disable nested log grouping (#46594) GH-46598 - [Dev] Use language name for alias (#46602) GH-46599 - [C++][Doc][Parquet] Update supported types documentation (#46620) GH-46605 - [CI][Release][C#] Update download URL for dotnet on verification script (#46612) GH-46606 - [Python] Do not require numpy when normalizing slice (#46732) GH-46609 - [Release][CI] Use System GTest for macos verification (#46823) GH-46610 - [CI][Release] Use Python 3.12 on AlmaLinux 8 (#46621) GH-46611 - [Python][C++] Allow building float16 arrays without numpy (#46618) GH-46623 - [C++][Compute] Fix the failure of large memory test in arrow-compute-row-test (#46635) GH-46636 - [R] Fix evaluation of external objects not in global environment in case_when() (#46667) GH-46659 - [C++] Fix export of extension arrays with binary view/string view storage (#46660) GH-46673 - [CI][R][Docs] Accept empty INSTALL_ARGS again (#46682) GH-46674 - [C++] Construct Array from ExtensionType Scalar (#46675) GH-46684 - [C++] Fix Meson configuration issue on Windows (#46685) GH-46688 - [Ruby] Fix a typo (#46689) GH-46691 - [CI][Packaging] Update platform tag on generated wheel name to match newest auditwheel naming (#46705) GH-46693 - [CI] Update GitHub hosted runner from deprecated windows-2019 to windows-2022 (#46694) GH-46704 - [C++] Fix OSS-Fuzz build failure (#46706) GH-46708 - [C++][Gandiva] Added zero return values for castDECIMAL_utf8 (#46709) GH-46710 - [C++] Fix ownership and lifetime issues in Dataset Writer (#46711) GH-46717 - [R][Docs] Add missing "internal" keywords for internal function (#46722) GH-46724 - [C++][Parquet] OSSFuzz: Prevent from Bad-cast in handling statistics (#46725) GH-46729 - [Python] Allow constructing InMemoryDataset from RecordBatchReader (#46731) GH-46736 - [CI] Disable Parquet in conan-minimum (#46744) GH-46761 - [C++] Add executable detection on FreeBSD (#46759) GH-46764 - [C++][Gandiva] Fix wrong .bc depends (#46765) GH-46777 - [C++] Use SimplifyIsIn only when the value_set of the expression is lower than a threshold (#46859) GH-46782 - [Docs] Link to same version of docs from Implementations page GH-46805 - [CI][Dev] Fix caching for R hooks in lint job (#46812) GH-46809 - [CI][Packaging] Stop trying to add headers from arrow/compu… (#46810) GH-46811 - [C++][Python] Fix crash on FileReaderImpl::GetRecordBatchReader (#46931) GH-46816 - [Docs] Fix links to Swift docs and source (#46817) GH-46827 - [C++] Update Meson Configuration for compute shared lib (#46839) GH-46831 - [C++][R] Remove some pending references to CMake < 3.25 (docs + minor CMake references) (#46834) GH-46841 - [C++][Gandiva] Fix date trunc edge case (#46842) GH-46863 - [CI][C++] Suppress a false positive UBSAN error in AWS SDK for C++ (#46870) GH-46871 - [C++][Parquet] Restore implementation of 3 arrow::FileReader::GetRecordBatchReader() functions (#46868) GH-46879 - [CI][Packaging][Linux] Don't check example build with old CMake (#46880) GH-46888 - [C++] Remove override of default buildtype in Meson config (#46919) GH-46915 - [C++][Compute] Initialize Compute kernels on benchmarks that require extra kernels (#46922) GH-46916 - [R] Test for negative fractional dates fails on older R versions due to change in base R as.Date() (#46917) GH-46920 - [FlightRPC] Fix Flight SQL ColumnMetadata retrieval (#46921) GH-46934 - [C++][Parquet] Trying to fix ub in AttachStatistics (#46940) GH-46947 - [R][Packaging] Add src/arrow/flight/sql/odbc to source excludes (#46948) GH-46964 - [CI][Packaging][Conan] Ensure using upper case for config suffix (#46967) GH-46986 - [CI][C++] Fix a build error with C++20 (#46987) GH-46988 - [C++][Parquet] Fix FLBA DecodeArrow multiply overflow (#46991) GH-46989 - [CI][R] Use Ubuntu 20.04 instead of OpenSUSE for R 4.1 (#46990) GH-46995 - [CI][R][C++] Use system memory allocator in sanitizer jobs (#47007) GH-46998 - [C++] Fix mockfs.cc compiling error with C++23 (#46999) GH-47015 - [CI][C++] Use mold on conda-cpp to work around issues with GNU ld (#47028) GH-47033 - [C++][Compute] Never use custom gtest main with MSVC (#47049) GH-47037 - [CI][C++] Fix Fedora 39 CI jobs (#47038) GH-47061 - [Release] Fix wrong variable name for signing (#47062) GH-47063 - [Release] Define missing RELEASE_TARBALL (#47064) GH-47065 - [Release] Fix timeout key in verify_rc.yml (#47066) GH-47067 - [Release] Fix wrong GitHub Actions context in verify_rc.yml (#47068) GH-47069 - [Release] Add missing "needs: target" (#47070) GH-47071 - [Release] Dereference all hard links in source archive (#47072) GH-47074 - [Release] Use reproducible mtime for csharp/ in source archive (#47076) GH-47078 - [Release] Ensure using cloned apache/arrow for reproducible check (#47079) GH-47092 - [Release] Binary verification CI jobs are failing New Features and Improvements GH-25025 - [C++] Move non core compute kernels into separate shared library (#46261) GH-26818 - [C++][Python] Preserve order when writing dataset multi-threaded (#44470) GH-35419 - [GLib] Add GArrowFixedShapeTensorDataType (#46305) GH-35644 - [MATLAB] Add tests verifying arrow.array.<Type>Array.fromMATLAB() throws an exception if given an array with the wrong type. (#47020) GH-36753 - [C++] Properly pretty-print and diff HalfFloatArrays (#46857) GH-37027 - [C++] Add float16 kernels to if-else and vector-replace functions (#46446) GH-37561 - [Ruby] Add empty chunked array tests for Arrow::Table#each_raw_records (#46862) GH-37577 - [MATLAB] Create a superclass for DateType-related MATLAB tests (#46923) GH-37677 - [C++][FlightRPC] Allow FlightInfo.schema to be nullable GH-37891 - [C++][Parquet] Refine several classes in Parquet encryption (#46202) GH-37891 - [C++] Followup Buffer change to use sptr move (#46027) GH-38214 - [MATLAB] Add a common arrow.tabular.Tabular MATLAB interface (#47014) GH-38369 - [MATLAB] Create utility functions for simplifying management of Proxy instances for Arrays (#46907) GH-38903 - [R][Docs] Improve documentation of col_types (#46145) GH-38914 - [Python] Add EncryptionConfiguration.uniform_encryption (#46347) GH-39294 - [C++][Python] DLPack on Tensor class (#42118) GH-39759 - [Docs] Update pydata-sphinx-theme to 0.16.1 (#46943) GH-40278 - [C++] Support casting string to duration in CSV converter (#46035) GH-40343 - [C++] Move S3FileSystem to the registry (#41559) GH-40754 - [Python] Expose tls_ca_file_path to S3FileSystem (#45881) GH-41496 - [Python][Azure][Docs] Turn on azure on debian-docs (#46892) GH-41672 - [Python][Doc] Clarify docstring of FixedSizeListArray.values that it ignores the offset (#46144) GH-41973 - Expose new S3 option check_directory_existence_before_creation - manual rebase (#46619) GH-42012 - [Python] Add Schema with_field or set_field method (#46348) GH-43041 - [C++][Python] Read/write Parquet BYTE_ARRAY as Large/View types directly (#46532) GH-43170 - [Swift] Add StructArray support to ArrowWriter (#43439) GH-43623 - [R] remove libarrow backwards compatibility enforcement (#46491) GH-43807 - [C++][Python] Add UUID extension type conversion support to/from Parquet (#45866) GH-43891 - [C++][Parquet] Faster reading of FIXED_LEN_BYTE_ARRAY data (#46886) GH-44208 - [R] Adding test to ensure bit64's new semantic works with arrow (#46651) GH-44435 - [GLib] Add distinct count support to GArrowArrayStatistics (#46894) GH-44500 - [Python][Parquet] Map Parquet logical types to Arrow extension types by default (#46772) GH-44900 - [Python] Support explicit fsspec+{protocol} and hf:// filesystem URIs (#45089) GH-44953 - [R] Add R bindings for new compute functions (#44971) GH-45028 - [C++][Compute] Allow cast to reorder struct fields (#45246) GH-45083 - [C++] Add HalfFloat kernels for is_nan, is_inf, is_finite, negate, negate_checked, sign (#46866) GH-45195 - [C++] Update bundled AWS SDK for C++ to 1.11.587 (#45306) GH-45229 - [Python] Migrate from scipy.spmatrix to scipy.sparray (#46423) GH-45229 - [Python] skip scipy.sparse roundtrip tests for float16 (#46413) GH-45290 - [Docs][Release] Change show_version_warning_banner substitution (#46883) GH-45522 - [Parquet][C++] Parquet GEOMETRY and GEOGRAPHY logical type implementations (#45459) GH-45531 - [Python] Add the dim_names argument to from_numpy_ndarray (#46170) GH-45619 - [Python] Use f-string instead of string.format (#45629) GH-45643 - [R] Implement hms functions to create and manipulate time of day variables (#46206) GH-45653 - [Python] Scalar subclasses should implement Python protocols (#45818) GH-45664 - [C++] Allow LargeString,LargeBinary,FixedSizeBinary,StringView and BinaryView for RecordBatch::MakeStatisticsArray() (#46031) GH-45713 - [GLib] Add garrow_chunked_array_(import|export)() (#46876) GH-45750 - [C++][Python][Parquet] Implement Content-Defined Chunking for the Parquet writer (#45360) GH-45794 - [C++] Add array directory to Meson configuration (#45795) GH-45796 - [C++] Add integration directory to Meson configuration (#45797) GH-45798 - [C++] Add extension directory to Meson (#45799) GH-45800 - [C++] Implement util configuration in Meson (#45824) GH-45829 - [C++] Add compute directory to Meson configuration (#45830) GH-45833 - [C++] Add JSON directory to Meson configuration (#45834) GH-45865 - [C++] Create dedicated benchmark dependency in Meson (#45909) GH-45908 - [C++][Docs] Rename and expose basic {Array,...}FromJSON helpers as public APIs (#46180) GH-45957 - [C++][Python] Expose allow_delayed_open on S3FileSystem (#46078) GH-45978 - [C++] Bump bundled mimalloc version (#45979) GH-45991 - [C++] Bump bundled nlohmann_json to v3.12.0 (#46112) GH-45992 - [C++] Bump bundled utf8proc version to 2.10.0 (#46032) GH-46019 - [Python] Raise TypeError on feather read_table if columns is not a Sequence (#46038) GH-46054 - [Python][Packaging] Re-enable pandas on Windows free-threaded wheel (#46109) GH-46058 - [Python] Run Python in AppVeyor outside of source directory (#46059) GH-46087 - [FlightSQL] Allow returning column remarks in FlightSQL's CommandGetTables (#46110) GH-46091 - [C++] Use feature options in Meson configuration (#46204) GH-46092 - [C++] Add filesystem related options to Meson (#46101) GH-46104 - GH-45937: [C++][Parquet] Logical type definition for variant GH-46115 - [C++] Implement compression libraries in Meson (#46358) GH-46116 - [C++] Implement IPC directory in Meson (#46117) GH-46118 - [C++] Add tensor directory to Meson (#46119) GH-46130 - [Python] Remove use_legacy_format in favour of setting IpcWriteOptions (#46131) GH-46132 - [C++][Parquet] Remove deprecated parquet APIs from 19.0.0 (#46133) GH-46141 - [C++] Add flight directory to Meson configuration (#46142) GH-46153 - [C++] Implement acero directory in Meson (#46154) GH-46155 - [C++] Implement Tensorflow directory in Meson (#46156) GH-46163 - [C++] Add vendored directory to Meson (#46164) GH-46189 - [C#] Use pooled buffers in ArrowStreamWriter (#46190) GH-46196 - [C++] Remove ARROW_USE_PRECOMPILED_HEADERS and related logic (#46200) GH-46198 - [Python] Remove deprecated PyExtensionType (#46199) GH-46207 - [C++] Rename arrow::util::StringBuilder and move to internal namespace (#46813) GH-46209 - [Documentation][C++][Compute] Add cpp developer documentation for row table (#46210) GH-46215 - [C++][Docs] Add README for Meson subprojects directory (#46216) GH-46217 - [C++][Parquet] Update the timestamp of parquet::encryption::TwoLevelCacheWithExpiration correctly (#46283) GH-46219 - [C++][Parquet] Remove PARQUET_MINIMAL_DEPENDENCY option (#46274) GH-46222 - [Python] Allow to specify footer metadata when opening IPC file for writing (#46354) GH-46241 - [Release][Packaging] Add support for regenerating metadata of APT repositories (#46277) GH-46245 - [Swift] Upgrade FlatBuffers to v25.2.10 (#46246) GH-46250 - [Swift] Update swift-tools-version to 5.10 (#46252) GH-46285 - [C++] Add support for Decimal32/64 and HalfFloat to run_end_encode/run_end_decode (#46286) GH-46289 - [Release][Packaging] Verify APT/Yum repositories keeps working for old versions (#46292) GH-46290 - [Swift] Upgrade grpc-swift to 1.25.0 and swift-protobuf to 1.29.0 (#46291) GH-46318 - [Docs][C++] Add Extension Array/Type documents (#46319) GH-46321 - [C++][Doc] Better explain ArrayData IsValid and GetNullCount (#46332) GH-46336 - [Release][Packaging] Add support for Reproducible Builds for source archive (#46342) GH-46338 - [C++] Add compile step for Meson in cpp_build.sh (#46339) GH-46349 - [Python] Move parquet definitions to pyarrow/includes/libparquet.pxd (#46437) GH-46367 - [C++] Prevent Meson from using git info if built as subproject (#46368) GH-46373 - [Python] Exercise fallback case on tests for parquet.read_table in case dataset is not available (#46550) GH-46376 - [Docs] Replace Xitter link with BlueSky link (#46402) GH-46378 - [Docs] Remove references to autotune from the docs (#46379) GH-46380 - [GLib] Add GArrowFixedShapeDataType#shape (#46381) GH-46386 - [C++] Ensure using our CMake packages not Find*.cmake (#46387) GH-46388 - [C++] Check Snappy::snappy{,-static} in FindSnappyAlt.cmake (#46389) GH-46396 - [C++][Documentation][Statistics] Revise the documentation to clarify that arrow::ArrayStatistics is ignored during arrow::Array comparisons (#46470) GH-46398 - [GLib] Add GArrowFixedShapeTensorDataType#n_dimensions (#46399) GH-46400 - [GLib] Add GArrowFixedShapeDataType#permutation (#46401) GH-46403 - [C++] Add support for limiting element size when printing data (#46536) GH-46433 - [GLib] Add GArrowFixedShapeDataType#dim_names (#46434) GH-46439 - [C++] Use result pattern for all FromJSONString Helpers (#46696) GH-46439 - [C++] Rename internal Converter class in from_string.cc (#46697) GH-46439 - [C++] Remove unneeded namespace prefix in test_util_internal.h (#46695) GH-46444 - [Documentation][C++][Acero] Move internal Swiss table doc into public C++ developer doc (#46445) GH-46450 - [GLib] Add GArrowFixedShapeDataType#strides (#46451) GH-46459 - [C++] Make some arrow/util headers internal (#46721) GH-46462 - [C++][Parquet] Expose currently thrown EncodedStatistics when checking is_stats_set (#46463) GH-46473 - [C++][Docs] Fix typos in decimal comments (#46474) GH-46475 - [Documentation][C++][Compute] Consolidate Acero developer docs (#46476) GH-46477 - [C++] Use vendored flatbuffers in Meson configuration (#46484) GH-46482 - [CI][Dev] Add shellcheck files without change (#46483) GH-46487 - [C++] Refactor lz4 from ExternalProject to FetchContent (#46390) GH-46490 - [CI][Dev] Add shellcheck ci/scripts/install_ccache.sh (#46492) GH-46494 - [CI][Dev] Add shellcheck files without change (#46495) GH-46496 - [CI][Dev] Fix shellcheck SC2086 errors in ci/scripts directory (#46497) GH-46499 - [CI][Crossbow][C++] Use apache/arrow for Meson (#46501) GH-46500 - [CI][Java] Remove CI scripts for Java (#46502) GH-46508 - [C++] Upgrade OpenTelemetry cpp to avoid build error on recent Clang (#46509) GH-46520 - [Docs] Fix variety of warnings and errors in the docs build (#46521) GH-46522 - [C++][FlightRPC] Add Arrow Flight SQL ODBC driver (#40939) GH-46526 - [CI][Dev] Fix shellcheck SC2086 and SC2223 errors ci/scripts directory (#46527) GH-46528 - [CI][Dev] Remove "archery lint" (#46686) GH-46529 - [C++] Convert static inline type trait functions to constexpr (#46559) GH-46537 - [Docs][C++] Add RunEndEncodedArray, FlatArray, and PrimitiveArray API Docs (#46540) GH-46544 - [CI][Dev][Python] Use pre-commit for autopep8 (#46552) GH-46545 - [CI][Dev][Python] Update pre-commit for cython-lint (#46580) GH-46546 - [CI][Dev][Python] Use pre-commit for numpydoc (#46595) GH-46547 - [CI][Dev][R] Use pre-commit for lintr (#46581) GH-46548 - [CI][Dev][R] Use pre-commit for cpplint (#46549) GH-46551 - [C++] Use std::string_view for type schema API (#46553) GH-46556 - [GLib] Add GArrowUUIDDataType (#46558) GH-46569 - [CI][Integration] Use apache/arrow-js for JS (#46570) GH-46572 - [Python] expose filter option to python for join (#46566) GH-46585 - [JS][Dev] Remove dependabot configuration for JS (#46586) GH-46587 - [CI][JS] Remove JS related test CI (#46588) GH-46603 - [JS][Release] Remove JavaScript related release code (#46604) GH-46613 - [GLib] Add GArrowBaseListDataType (#46615) GH-46632 - [R][Docs] Add docs for arrow::one (#46648) GH-46633 - [Docs][C++][Python] Update CombineChunks documentation to specify that binary columns can be combined into multiple chunks (#46638) GH-46642 - [Format] Add footnote clarifying REE layout has O(log n) random access (#46643) GH-46645 - [CI][Dev][R] Use pre-commit for styler (#46664) GH-46652 - [Python][Docs] Update language for row_group_size parameter (#46653) GH-46656 - [CI][Dev] Fix shellcheck SC2034 and SC2086 errors in ci/scripts directory (#46657) GH-46662 - [CI][Dev] Fix shellcheck SC2148 errors in ci/scripts directory (#46663) GH-46665 - [CI][Crossbow][C++] Use apache/arrow for Alpine Linux (#46666) GH-46676 - [C++][Python][Parquet] Allow reading Parquet LIST data as LargeList directly (#46678) GH-46679 - [C++][Meson] Use WrapDB entry for gflags instead of CMake wrapper (#46680) GH-46683 - [C++][Python] Add utf8_zero_fill compute function for sign-aware zero padding (#46815) GH-46699 - [CI][Dev] fix shellcheck errors in the ci/scripts/cpp_test.sh (#46700) GH-46702 - [JS] Remove js/ (#46703) GH-46714 - [C++] Use hidden symbol visibility in Meson configuration (#46715) GH-46719 - [R] Add 32 and 64 bit Decimal types (#46720) GH-46726 - [CI][Dev] fix shellcheck errors in the ci/scripts/conan_build.sh (#46727) GH-46740 - [C++] Update bundled Thrift GH-46745 - [C++] Update bundled Boost to 1.88.0 and Apache Thrift to 0.22.0 (#46912) GH-46746 - [C++] Assume AWS SDK >= 1.11.0 (#46742) GH-46748 - [C++] Initial port on AIX (#46749) GH-46757 - [CI][Packaging][Conan] Synchronize upstream conan (#46758) GH-46763 - [CI][Dev] fix shellcheck errors in the ci/scripts/ccache_setup.sh (#46766) GH-46767 - [C++] Enable EqualOptions::use_atol_ for arrow::Array, arrow::Scalar, arrow::RecordBatch, and arrow::ChuckedArray (#46779) GH-46771 - [Python][C++] Implement pa.arange function to generate array sequences (#46778) GH-46773 - [GLib] Add GArrowFixedSizeListDataType (#46774) GH-46775 - [Docs] Fix navigation issues (#46784) GH-46785 - [CI][Dev][C++] Suppress needless outputs of cpplint with pre-commit (#46786) GH-46787 - [CI][Integration] Use Node.js 20 (#46790) GH-46788 - [C++][Parquet] Enable SIMD for byte stream split with 2 streams (#46789) GH-46791 - [C++] Add Status::OrElse, IntoStatus<T> and ToStatus (#46792) GH-46794 - [CI][Dev] Fix shellcheck errors in the ci/scripts/csharp_test.sh (#46795) GH-46798 - [CI][Dev] Add support for pre-commit 2.17.0 (#46799) GH-46801 - [Dev] Remove some leftovers for Java, Go, JS and Swift on some config files (#46802) GH-46803 - [Swift] Remove swift implementation from apache/arrow after migration to new repository (#46804) GH-46806 - [Ci][Dev][Swift] Remove Swift related settings (#46807) GH-46820 - [CI][Integration] Use Node.js 20 by default (#46821) GH-46833 - [Python] Expose ConfigureManagedIdentityCredential and ConfigureClientSecretCredential to AzureFileSystem on PyArrow (#46837) GH-46843 - [C++] Don't use unity build for bundled AWS SDK for C++ (#46845) GH-46846 - [CI][Dev] Fix shellcheck errors in the ci/scripts/install_dask.sh (#46847) GH-46854 - [CI][MATLAB][Packaging] Add support for MATLAB R2025a in CI and crossbow packaging workflows (#46855) GH-46864 - [C++] Add half-float test for ArrayFromJSONString (#46865) GH-46869 - [C++][Parquet] Deprecate arrow::Status parquet::arrow::FileReadeder::GetRecordBatchReader() (#46932) GH-46877 - [MATLAB] Add arrow.tabular.Table.fromRecordBatches static method (#46885) GH-46881 - [CI][Dev] Fix shellcheck errors in the ci/scripts/install_gcs_testbench.sh (#46882) GH-46895 - [CI][Dev] Fix shellcheck errors in the ci/scripts/install_minio.sh (#46896) GH-46899 - [CI][Dev] Fix shellcheck errors in the ci/scripts/install_numba.sh (#46900) GH-46909 - [CI][Dev] Fix shellcheck errors in the ci/scripts/install_sccache.sh (#46910) GH-46911 - [Packaging] Add support for AlmaLinux 10 (#46933) GH-46952 - [Packaging] Drop support for CentOS Stream 8 (#46953) GH-46959 - [Python][Packaging] Drop support for manylinux2014 (#46965) GH-46968 - [CI][Packaging] Synchronize conan files for 20.0.0 (#46966) GH-46974 - [Integration][Archery] Add support for ARROW_JS_ROOT (#46975) GH-47025 - [C++][Docs] Increase minimum gcc for building from 7.1 to 9 (#47026) GH-47081 - [Release] Revisit reproducible source archive verification","headline":"Apache Arrow 21.0.0 Release","image":"https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png","mainEntityOfPage":{"@type":"WebPage","@id":"https://arrow.apache.org/release/21.0.0.html"},"publisher":{"@type":"Organization","logo":{"@type":"ImageObject","url":"https://arrow.apache.org/img/logo.png"}},"url":"https://arrow.apache.org/release/21.0.0.html"}</script> |
| <!-- End Jekyll SEO tag --> |
| |
| |
| <!-- favicons --> |
| <link rel="icon" type="image/png" sizes="16x16" href="/img/favicon-16x16.png" id="light1"> |
| <link rel="icon" type="image/png" sizes="32x32" href="/img/favicon-32x32.png" id="light2"> |
| <link rel="apple-touch-icon" type="image/png" sizes="180x180" href="/img/apple-touch-icon.png" id="light3"> |
| <link rel="apple-touch-icon" type="image/png" sizes="120x120" href="/img/apple-touch-icon-120x120.png" id="light4"> |
| <link rel="apple-touch-icon" type="image/png" sizes="76x76" href="/img/apple-touch-icon-76x76.png" id="light5"> |
| <link rel="apple-touch-icon" type="image/png" sizes="60x60" href="/img/apple-touch-icon-60x60.png" id="light6"> |
| <!-- dark mode favicons --> |
| <link rel="icon" type="image/png" sizes="16x16" href="/img/favicon-16x16-dark.png" id="dark1"> |
| <link rel="icon" type="image/png" sizes="32x32" href="/img/favicon-32x32-dark.png" id="dark2"> |
| <link rel="apple-touch-icon" type="image/png" sizes="180x180" href="/img/apple-touch-icon-dark.png" id="dark3"> |
| <link rel="apple-touch-icon" type="image/png" sizes="120x120" href="/img/apple-touch-icon-120x120-dark.png" id="dark4"> |
| <link rel="apple-touch-icon" type="image/png" sizes="76x76" href="/img/apple-touch-icon-76x76-dark.png" id="dark5"> |
| <link rel="apple-touch-icon" type="image/png" sizes="60x60" href="/img/apple-touch-icon-60x60-dark.png" id="dark6"> |
| |
| <script> |
| // Switch to the dark-mode favicons if prefers-color-scheme: dark |
| function onUpdate() { |
| light1 = document.querySelector('link#light1'); |
| light2 = document.querySelector('link#light2'); |
| light3 = document.querySelector('link#light3'); |
| light4 = document.querySelector('link#light4'); |
| light5 = document.querySelector('link#light5'); |
| light6 = document.querySelector('link#light6'); |
| |
| dark1 = document.querySelector('link#dark1'); |
| dark2 = document.querySelector('link#dark2'); |
| dark3 = document.querySelector('link#dark3'); |
| dark4 = document.querySelector('link#dark4'); |
| dark5 = document.querySelector('link#dark5'); |
| dark6 = document.querySelector('link#dark6'); |
| |
| if (matcher.matches) { |
| light1.remove(); |
| light2.remove(); |
| light3.remove(); |
| light4.remove(); |
| light5.remove(); |
| light6.remove(); |
| document.head.append(dark1); |
| document.head.append(dark2); |
| document.head.append(dark3); |
| document.head.append(dark4); |
| document.head.append(dark5); |
| document.head.append(dark6); |
| } else { |
| dark1.remove(); |
| dark2.remove(); |
| dark3.remove(); |
| dark4.remove(); |
| dark5.remove(); |
| dark6.remove(); |
| document.head.append(light1); |
| document.head.append(light2); |
| document.head.append(light3); |
| document.head.append(light4); |
| document.head.append(light5); |
| document.head.append(light6); |
| } |
| } |
| matcher = window.matchMedia('(prefers-color-scheme: dark)'); |
| matcher.addListener(onUpdate); |
| onUpdate(); |
| </script> |
| |
| <link href="/css/main.css" rel="stylesheet"> |
| <link href="/css/syntax.css" rel="stylesheet"> |
| <script src="/javascript/main.js"></script> |
| |
| <!-- Matomo --> |
| <script> |
| var _paq = window._paq = window._paq || []; |
| /* tracker methods like "setCustomDimension" should be called before "trackPageView" */ |
| /* We explicitly disable cookie tracking to avoid privacy issues */ |
| _paq.push(['disableCookies']); |
| _paq.push(['trackPageView']); |
| _paq.push(['enableLinkTracking']); |
| (function() { |
| var u="https://analytics.apache.org/"; |
| _paq.push(['setTrackerUrl', u+'matomo.php']); |
| _paq.push(['setSiteId', '20']); |
| var d=document, g=d.createElement('script'), s=d.getElementsByTagName('script')[0]; |
| g.async=true; g.src=u+'matomo.js'; s.parentNode.insertBefore(g,s); |
| })(); |
| </script> |
| <!-- End Matomo Code --> |
| |
| |
| <link type="application/atom+xml" rel="alternate" href="https://arrow.apache.org/feed.xml" title="Apache Arrow" /> |
| </head> |
| |
| |
| <body class="wrap"> |
| <header> |
| <nav class="navbar navbar-expand-md navbar-dark bg-dark"> |
| |
| <a class="navbar-brand no-padding" href="/"><img src="/img/arrow-inverse-300px.png" height="40px"></a> |
| |
| <button class="navbar-toggler ml-auto" type="button" data-toggle="collapse" data-target="#arrow-navbar" aria-controls="arrow-navbar" aria-expanded="false" aria-label="Toggle navigation"> |
| <span class="navbar-toggler-icon"></span> |
| </button> |
| |
| <!-- Collect the nav links, forms, and other content for toggling --> |
| <div class="collapse navbar-collapse justify-content-end" id="arrow-navbar"> |
| <ul class="nav navbar-nav"> |
| <li class="nav-item"><a class="nav-link" href="/overview/" role="button" aria-haspopup="true" aria-expanded="false">Overview</a></li> |
| <li class="nav-item"><a class="nav-link" href="/faq/" role="button" aria-haspopup="true" aria-expanded="false">FAQ</a></li> |
| <li class="nav-item"><a class="nav-link" href="/blog" role="button" aria-haspopup="true" aria-expanded="false">Blog</a></li> |
| <li class="nav-item dropdown"> |
| <a class="nav-link dropdown-toggle" href="#" id="navbarDropdownGetArrow" role="button" data-toggle="dropdown" aria-haspopup="true" aria-expanded="false"> |
| Get Arrow |
| </a> |
| <div class="dropdown-menu" aria-labelledby="navbarDropdownGetArrow"> |
| <a class="dropdown-item" href="/install/">Install</a> |
| <a class="dropdown-item" href="/release/">Releases</a> |
| </div> |
| </li> |
| <li class="nav-item dropdown"> |
| <a class="nav-link dropdown-toggle" href="#" id="navbarDropdownDocumentation" role="button" data-toggle="dropdown" aria-haspopup="true" aria-expanded="false"> |
| Docs |
| </a> |
| <div class="dropdown-menu" aria-labelledby="navbarDropdownDocumentation"> |
| <a class="dropdown-item" href="/docs">Project Docs</a> |
| <a class="dropdown-item" href="/docs/format/Columnar.html">Format</a> |
| <hr> |
| <a class="dropdown-item" href="/docs/c_glib">C GLib</a> |
| <a class="dropdown-item" href="/docs/cpp">C++</a> |
| <a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/csharp/README.md" target="_blank" rel="noopener">C#</a> |
| <a class="dropdown-item" href="https://godoc.org/github.com/apache/arrow/go/arrow" target="_blank" rel="noopener">Go</a> |
| <a class="dropdown-item" href="/docs/java">Java</a> |
| <a class="dropdown-item" href="/docs/js">JavaScript</a> |
| <a class="dropdown-item" href="/julia/">Julia</a> |
| <a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/matlab/README.md" target="_blank" rel="noopener">MATLAB</a> |
| <a class="dropdown-item" href="/docs/python">Python</a> |
| <a class="dropdown-item" href="/docs/r">R</a> |
| <a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/ruby/README.md" target="_blank" rel="noopener">Ruby</a> |
| <a class="dropdown-item" href="https://docs.rs/arrow/latest" target="_blank" rel="noopener">Rust</a> |
| <a class="dropdown-item" href="/swift">Swift</a> |
| </div> |
| </li> |
| <li class="nav-item dropdown"> |
| <a class="nav-link dropdown-toggle" href="#" id="navbarDropdownSource" role="button" data-toggle="dropdown" aria-haspopup="true" aria-expanded="false"> |
| Source |
| </a> |
| <div class="dropdown-menu" aria-labelledby="navbarDropdownSource"> |
| <a class="dropdown-item" href="https://github.com/apache/arrow" target="_blank" rel="noopener">Main Repo</a> |
| <hr> |
| <a class="dropdown-item" href="https://github.com/apache/arrow/tree/main/c_glib" target="_blank" rel="noopener">C GLib</a> |
| <a class="dropdown-item" href="https://github.com/apache/arrow/tree/main/cpp" target="_blank" rel="noopener">C++</a> |
| <a class="dropdown-item" href="https://github.com/apache/arrow/tree/main/csharp" target="_blank" rel="noopener">C#</a> |
| <a class="dropdown-item" href="https://github.com/apache/arrow-go" target="_blank" rel="noopener">Go</a> |
| <a class="dropdown-item" href="https://github.com/apache/arrow-java" target="_blank" rel="noopener">Java</a> |
| <a class="dropdown-item" href="https://github.com/apache/arrow-js" target="_blank" rel="noopener">JavaScript</a> |
| <a class="dropdown-item" href="https://github.com/apache/arrow-julia" target="_blank" rel="noopener">Julia</a> |
| <a class="dropdown-item" href="https://github.com/apache/arrow/tree/main/matlab" target="_blank" rel="noopener">MATLAB</a> |
| <a class="dropdown-item" href="https://github.com/apache/arrow/tree/main/python" target="_blank" rel="noopener">Python</a> |
| <a class="dropdown-item" href="https://github.com/apache/arrow/tree/main/r" target="_blank" rel="noopener">R</a> |
| <a class="dropdown-item" href="https://github.com/apache/arrow/tree/main/ruby" target="_blank" rel="noopener">Ruby</a> |
| <a class="dropdown-item" href="https://github.com/apache/arrow-rs" target="_blank" rel="noopener">Rust</a> |
| <a class="dropdown-item" href="https://github.com/apache/arrow-swift" target="_blank" rel="noopener">Swift</a> |
| </div> |
| </li> |
| <li class="nav-item dropdown"> |
| <a class="nav-link dropdown-toggle" href="#" id="navbarDropdownSubprojects" role="button" data-toggle="dropdown" aria-haspopup="true" aria-expanded="false"> |
| Subprojects |
| </a> |
| <div class="dropdown-menu" aria-labelledby="navbarDropdownSubprojects"> |
| <a class="dropdown-item" href="/adbc">ADBC</a> |
| <a class="dropdown-item" href="/docs/format/Flight.html">Arrow Flight</a> |
| <a class="dropdown-item" href="/docs/format/FlightSql.html">Arrow Flight SQL</a> |
| <a class="dropdown-item" href="https://datafusion.apache.org" target="_blank" rel="noopener">DataFusion</a> |
| <a class="dropdown-item" href="/nanoarrow">nanoarrow</a> |
| </div> |
| </li> |
| <li class="nav-item dropdown"> |
| <a class="nav-link dropdown-toggle" href="#" id="navbarDropdownCommunity" role="button" data-toggle="dropdown" aria-haspopup="true" aria-expanded="false"> |
| Community |
| </a> |
| <div class="dropdown-menu" aria-labelledby="navbarDropdownCommunity"> |
| <a class="dropdown-item" href="/community/">Communication</a> |
| <a class="dropdown-item" href="/docs/developers/index.html">Contributing</a> |
| <a class="dropdown-item" href="https://github.com/apache/arrow/issues" target="_blank" rel="noopener">Issue Tracker</a> |
| <a class="dropdown-item" href="/committers/">Governance</a> |
| <a class="dropdown-item" href="/use_cases/">Use Cases</a> |
| <a class="dropdown-item" href="/powered_by/">Powered By</a> |
| <a class="dropdown-item" href="/visual_identity/">Visual Identity</a> |
| <a class="dropdown-item" href="/security/">Security</a> |
| <a class="dropdown-item" href="https://www.apache.org/foundation/policies/conduct.html" target="_blank" rel="noopener">Code of Conduct</a> |
| </div> |
| </li> |
| <li class="nav-item dropdown"> |
| <a class="nav-link dropdown-toggle" href="#" id="navbarDropdownASF" role="button" data-toggle="dropdown" aria-haspopup="true" aria-expanded="false"> |
| ASF Links |
| </a> |
| <div class="dropdown-menu dropdown-menu-right" aria-labelledby="navbarDropdownASF"> |
| <a class="dropdown-item" href="https://www.apache.org/" target="_blank" rel="noopener">ASF Website</a> |
| <a class="dropdown-item" href="https://www.apache.org/licenses/" target="_blank" rel="noopener">License</a> |
| <a class="dropdown-item" href="https://www.apache.org/foundation/sponsorship.html" target="_blank" rel="noopener">Donate</a> |
| <a class="dropdown-item" href="https://www.apache.org/foundation/thanks.html" target="_blank" rel="noopener">Thanks</a> |
| <a class="dropdown-item" href="https://www.apache.org/security/" target="_blank" rel="noopener">Security</a> |
| </div> |
| </li> |
| </ul> |
| </div> |
| <!-- /.navbar-collapse --> |
| </nav> |
| |
| </header> |
| |
| <div class="container p-4 pt-5"> |
| <main role="main" class="pb-5"> |
| <!-- |
| |
| --> |
| <h1>Apache Arrow 21.0.0 (17 July 2025)</h1> |
| <p>This is a major release covering more than 2 months of development.</p> |
| <h2>Download</h2> |
| <ul> |
| <li><a href="https://www.apache.org/dyn/closer.lua/arrow/arrow-21.0.0/" target="_blank" rel="noopener"><strong>Source Artifacts</strong></a></li> |
| <li> |
| <strong>Binary Artifacts</strong> |
| <ul> |
| <li><a href="https://apache.jfrog.io/artifactory/arrow/almalinux/" target="_blank" rel="noopener">For AlmaLinux</a></li> |
| <li><a href="https://apache.jfrog.io/artifactory/arrow/amazon-linux/" target="_blank" rel="noopener">For Amazon Linux</a></li> |
| <li><a href="https://apache.jfrog.io/artifactory/arrow/centos/" target="_blank" rel="noopener">For CentOS</a></li> |
| <li><a href="https://apache.jfrog.io/artifactory/arrow/nuget/" target="_blank" rel="noopener">For C#</a></li> |
| <li><a href="https://apache.jfrog.io/artifactory/arrow/debian/" target="_blank" rel="noopener">For Debian</a></li> |
| <li><a href="https://apache.jfrog.io/artifactory/arrow/python/21.0.0/" target="_blank" rel="noopener">For Python</a></li> |
| <li><a href="https://apache.jfrog.io/artifactory/arrow/ubuntu/" target="_blank" rel="noopener">For Ubuntu</a></li> |
| </ul> |
| </li> |
| <li><a href="https://github.com/apache/arrow/releases/tag/apache-arrow-21.0.0" target="_blank" rel="noopener">Git tag</a></li> |
| </ul> |
| <h2>Contributors</h2> |
| <p>This release includes 400 commits from 82 distinct contributors.</p> |
| <div class="language-console highlighter-rouge"><div class="highlight"><pre class="highlight"><code data-lang="console"><span class="gp">$</span><span class="w"> </span>git shortlog <span class="nt">-sn</span> apache-arrow-20.0.0..apache-arrow-21.0.0 |
| <span class="go"> 78 Sutou Kouhei |
| 37 Raúl Cumplido |
| 33 Hiroyuki Sato |
| 30 William Ayd |
| 22 Antoine Pitrou |
| 19 Bryce Mecum |
| 18 Nic Crane |
| 11 Alenka Frim |
| 10 Dewey Dunnington |
| 9 Jacob Wujciak-Jens |
| 9 dependabot[bot] |
| 8 mwish |
| 7 Jonathan Keane |
| 7 Rossi Sun |
| 6 Sarah Gilmore |
| 5 Arash Andishgar |
| 4 Dongjoon Hyun |
| 4 takuya kodama |
| 3 David Li |
| 3 Eddie Chang |
| 3 Enrico Minack |
| 3 Ian Cook |
| 3 Lester Fan |
| 3 Ziy |
| 3 abandy |
| 2 Abhinav |
| 2 David Sherrier |
| 2 Krisztián Szűcs |
| 2 Rok Mihevc |
| 2 gitmodimo |
| 1 Adam Reeve |
| 1 Akum Kang |
| 1 Alina (Xi) Li |
| 1 Anatolii Tsyplenkov |
| 1 Antoine Prouvost |
| 1 Benjamin Kietzman |
| 1 Brian Hulette |
| 1 Bruno |
| 1 Carsten Haubold |
| 1 ChiLin Chiu |
| 1 Colin |
| 1 DenisTarasyuk |
| 1 Eric Dinse |
| 1 Etienne Bacher |
| 1 Even Rouault |
| 1 Gang Wu |
| 1 Guilherme Martins Crocetti |
| 1 Hadrian Reppas |
| 1 HyunWoo Oh |
| 1 Igor Antropov |
| 1 JB Onofré |
| 1 Joshua |
| 1 Junwang Zhao |
| 1 Kevin Gurney |
| 1 Kevin Wilson |
| 1 Kirill Tsyganov |
| 1 Konstantin Malanchev |
| 1 Kyle Hemker |
| 1 Lukas |
| 1 Mateusz Rzeszutek |
| 1 Matt Topol |
| 1 Michael |
| 1 Michael Chirico |
| 1 NazilaAk |
| 1 Patrick Walsh |
| 1 Ranjit Ranjan |
| 1 Roman Karlstetter |
| 1 Saurabh Singh |
| 1 Thomas Newton |
| 1 Tommy Hughes IV |
| 1 Xingyu Long |
| 1 Zihan Qi |
| 1 bw513 |
| 1 dawg |
| 1 koenvo |
| 1 leopardracer |
| 1 lriggs |
| 1 neilechao |
| 1 omahs |
| 1 shu-kitamura |
| 1 yuri@FreeBSD |
| 1 yyossy |
| </span></code></pre></div></div> |
| <h2>Patch Committers</h2> |
| <p>The following Apache committers merged contributed patches to the repository.</p> |
| <div class="language-console highlighter-rouge"><div class="highlight"><pre class="highlight"><code data-lang="console"><span class="gp">$</span><span class="w"> </span>git shortlog <span class="nt">-sn</span> <span class="nt">--group</span><span class="o">=</span>trailer:signed-off-by apache-arrow-20.0.0..apache-arrow-21.0.0 |
| <span class="go"> 179 Sutou Kouhei |
| 51 Antoine Pitrou |
| 47 Raúl Cumplido |
| 15 Nic Crane |
| 13 AlenkaF |
| 13 Bryce Mecum |
| 12 Jacob Wujciak-Jens |
| 7 David Li |
| 7 Dewey Dunnington |
| 7 mwish |
| 6 Rossi Sun |
| 5 Curt Hagenlocher |
| 5 Jonathan Keane |
| 5 Sarah Gilmore |
| 4 Rok Mihevc |
| 3 Gang Wu |
| 3 Will Ayd |
| 2 Kevin Gurney |
| 2 Krisztian Szucs |
| 1 Benjamin Kietzman |
| </span></code></pre></div></div> |
| <h2>Changelog</h2> |
| <h3>Bug Fixes</h3> |
| <ul> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/32276" target="_blank" rel="noopener">GH-32276</a> - [C++][FlightRPC] Add option to align RecordBatch buffers given to IPC reader (#44279)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/35166" target="_blank" rel="noopener">GH-35166</a> - [C++][Compute] Increase precision of decimals in sum aggregates (#44184)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/39811" target="_blank" rel="noopener">GH-39811</a> - [R] better documentation for col_types argument in open_delim_dataset (#45719)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/40756" target="_blank" rel="noopener">GH-40756</a> - [C++] Remove dead Boost urls (#46452)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/43132" target="_blank" rel="noopener">GH-43132</a> - [CI] Fix pre-commit Rat check (#46541)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/44366" target="_blank" rel="noopener">GH-44366</a> - [Python][Acero] RecordBatch.filter on expression raises error if result set is empty (#46057)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/44502" target="_blank" rel="noopener">GH-44502</a> - [R] Negative fractional dates must be converted to integers by floor, not trunc (#46873)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/44910" target="_blank" rel="noopener">GH-44910</a> - [Swift] Fix IPC stream reader and writer impl (#45029)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45292" target="_blank" rel="noopener">GH-45292</a> - [Python] test_dtypes hypotesis test fails sporadically (#46029)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45532" target="_blank" rel="noopener">GH-45532</a> - [C++] <code>RunEndEncodedBuilder</code> should clear dimensions after a <code>Finish()</code> call (#45533)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45534" target="_blank" rel="noopener">GH-45534</a> - [C++] Test: <code>RunEndEncodeTableColumns</code> should update REE columns' schema types (#45535)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45608" target="_blank" rel="noopener">GH-45608</a> - [C++][Flight] Fix compilation for clang (#46264)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45716" target="_blank" rel="noopener">GH-45716</a> - [R][CI] Refactor skip_on_python_older_than to not initialize reticulate (#46079)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45735" target="_blank" rel="noopener">GH-45735</a> - [C++] Broken tests for extract_regex compute funcion (#45900)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45853" target="_blank" rel="noopener">GH-45853</a> - [C++][Dev] Fix Meson compilation issues in Docker builds (#45858)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46011" target="_blank" rel="noopener">GH-46011</a> - [C++] Hide DCHECK family from public headers (#46015)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46025" target="_blank" rel="noopener">GH-46025</a> - [C++] Use ARROW_CUDA_EXPORT instead of ARROW_EXPORT for libarrow_cuda (#46030)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46052" target="_blank" rel="noopener">GH-46052</a> - [C++][Benchmarking] Don't build grouper benchmark without ARROW_COMPUTE=ON (#46053)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46065" target="_blank" rel="noopener">GH-46065</a> - [Release] Don't use <code>--verify-tag</code> for <code>gh release upload</code> in <code>02-source.sh</code> (#46066)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46068" target="_blank" rel="noopener">GH-46068</a> - [Release] Remove needless <code>docs:rc</code> task from 05-binary-upload.sh (#46069)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46070" target="_blank" rel="noopener">GH-46070</a> - [C++] Remove duplicate storage_type in JsonExtension (#46071)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46080" target="_blank" rel="noopener">GH-46080</a> - [Python][Docs] Provide guidance for tzdata related issues if installing with pip (#46591)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46084" target="_blank" rel="noopener">GH-46084</a> - [C++] Always use ARROW_VCPKG to detect vcpkg mode (#46467)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46090" target="_blank" rel="noopener">GH-46090</a> - [C++] Set default IPC option to enabled in Meson (#46114)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46094" target="_blank" rel="noopener">GH-46094</a> - [C++][Docs] Add note to RleDecoder::Get's doc comment (#46874)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46121" target="_blank" rel="noopener">GH-46121</a> - [Python] Add missing <code>column_index</code> argument to <code>ArrowReaderProperties::read_dictionary</code>'s Cython binding (#46122)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46127" target="_blank" rel="noopener">GH-46127</a> - [CI][Release] Make 02-source.sh test passable on fork (#46143)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46146" target="_blank" rel="noopener">GH-46146</a> - [C++] Merge metadata in SchemaBuidler::AddMetadata (#46654)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46149" target="_blank" rel="noopener">GH-46149</a> - [C++] Opening dataset fails with sshfs-3.7.3 due to F_RDADVISE error (#46346)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46157" target="_blank" rel="noopener">GH-46157</a> - [C++] Move test utility RunEndEncodeTableColumns that uses REE to test_util_internal on acero instead of common gtest_util (#46161)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46174" target="_blank" rel="noopener">GH-46174</a> - [Python] Failing tests in python minimal builds (#46175)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46192" target="_blank" rel="noopener">GH-46192</a> - [C++] Add <code>substrait</code> dep to third party download script (#46191)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46197" target="_blank" rel="noopener">GH-46197</a> - [C++] Tests use legacy timezones (#46201)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46214" target="_blank" rel="noopener">GH-46214</a> - [C++] Improve S3 client initialization (#46723)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46224" target="_blank" rel="noopener">GH-46224</a> - [C++][Acero] Fix the hang in asof join (#46300)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46231" target="_blank" rel="noopener">GH-46231</a> - [C++][CMake] Fix <code>arrow_bundled_dependencies</code> to be externally accessible by FetchContent (#46232)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46233" target="_blank" rel="noopener">GH-46233</a> - [C++] Fix missing nested braces in QueuedTask initialization (#46234)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46236" target="_blank" rel="noopener">GH-46236</a> - [Release][Packaging] Fix <code>dev/release/post-03-binary.sh</code> errors (#46237)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46238" target="_blank" rel="noopener">GH-46238</a> - [Release][Python] Use array to avoid empty argument in <code>dev/release/post-11-python.sh</code> (#46239)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46240" target="_blank" rel="noopener">GH-46240</a> - [Release][Packaging] Fix a bug that existing APT repositories' metadata are lost (#46287)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46242" target="_blank" rel="noopener">GH-46242</a> - [Release] Don't show gpg signature when getting release time (#46243)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46259" target="_blank" rel="noopener">GH-46259</a> - [CI] Remove deprecated flag from mamba info (#46260)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46262" target="_blank" rel="noopener">GH-46262</a> - [CI][Ruby] Don't update GCC of MSYS2 (#46278)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46268" target="_blank" rel="noopener">GH-46268</a> - [C++] Improve ArrayData docstrings (#46271)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46270" target="_blank" rel="noopener">GH-46270</a> - [C++][Parquet] Clarify GeoStatistics docstring (#46649)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46284" target="_blank" rel="noopener">GH-46284</a> - [Release][Packaging] Add missing APT metadata for .ddeb (#46288)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46296" target="_blank" rel="noopener">GH-46296</a> - [Swift] Add support for reading struct (#46302)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46299" target="_blank" rel="noopener">GH-46299</a> - [C++][Compute] Don't use <code>static inline const</code> for default options (#46303)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46304" target="_blank" rel="noopener">GH-46304</a> - [Release][Packaging] Use optimized debug build for .deb (#46392)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46306" target="_blank" rel="noopener">GH-46306</a> - [C++][Parquet] Should use LoadEnumSafe for geo enum (#46307)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46314" target="_blank" rel="noopener">GH-46314</a> - [C++][Parquet] Fix valgrind error when collecting parameterized tests for MakeWKBPoint (#46320)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46326" target="_blank" rel="noopener">GH-46326</a> - [C++][Parquet] Fix stack overflow in rapidjson value comparison to integer (#46327)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46333" target="_blank" rel="noopener">GH-46333</a> - [CI] Always pass <code>--yes</code> to <code>mamba clean</code> (#46341)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46333" target="_blank" rel="noopener">GH-46333</a> - [CI] Explicitly pass <code>--yes</code> to <code>mamba clean</code> (#46334)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46343" target="_blank" rel="noopener">GH-46343</a> - [CI][Python] Remove workaround for gdb packaging issue (#46848)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46343" target="_blank" rel="noopener">GH-46343</a> - [CI] Avoid installing gdb 16.3 on python 3.10 jobs to fix CI (#46511)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46344" target="_blank" rel="noopener">GH-46344</a> - [CI][Python] Skip doctest for s3.get_file_info to avoid bucket restrictions (#46345)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46351" target="_blank" rel="noopener">GH-46351</a> - [Archery][Docs] Fix the cli argument parsing bug in docker subcommand (#46352)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46355" target="_blank" rel="noopener">GH-46355</a> - [Python] Fix table.to_struct_array with an empty table (#46357)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46359" target="_blank" rel="noopener">GH-46359</a> - [C++][Thirdparty] Bump Apache ORC to 2.1.2 (#46360)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46362" target="_blank" rel="noopener">GH-46362</a> - [CGLib][Packaging] Use -fPIE explicitly for g-ir-scanner (#46366)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46363" target="_blank" rel="noopener">GH-46363</a> - [CI][Packaging] Use mono from community repository on Alpine instead of from testing (#46364)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46394" target="_blank" rel="noopener">GH-46394</a> - [C++][R] gcc-UBSAN errors on CRAN (#46397)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46395" target="_blank" rel="noopener">GH-46395</a> - [C++][Statistics] Use EqualOptions for min and max in arrow::ArrayStatistics::Equals() (#46422)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46407" target="_blank" rel="noopener">GH-46407</a> - [C++] Fix IPC serialization of sliced list arrays (#46408)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46414" target="_blank" rel="noopener">GH-46414</a> - [C++] Fix GCS filesystem getFileInfo method (#46416)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46417" target="_blank" rel="noopener">GH-46417</a> - [C++][Parquet] Fix UB in LoadEnumSafe for EdgeInterpolationAlgorithm (#46418)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46419" target="_blank" rel="noopener">GH-46419</a> - [C++] Remove duplicate declaration and sync arg names on acero test_util_internal functions (#45400)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46420" target="_blank" rel="noopener">GH-46420</a> - [C++][Dataset] Fix DatasetWriter deadlock on writting batch greater than max_rows_queued (#46139)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46424" target="_blank" rel="noopener">GH-46424</a> - [C++][Parquet] Fix erroneous unit test skip (#46425)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46435" target="_blank" rel="noopener">GH-46435</a> - [Parquet][C++] Fix uninitialized value in writer test (#46533)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46442" target="_blank" rel="noopener">GH-46442</a> - [R] hms::as_hms tests fail on some of our crossbow builds (#46443)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46456" target="_blank" rel="noopener">GH-46456</a> - [GLib] Add missing <code>since:</code> tag (#46457)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46478" target="_blank" rel="noopener">GH-46478</a> - [C++] Implement recent JSON changes into Meson configuration (#46479)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46481" target="_blank" rel="noopener">GH-46481</a> - [C++][Python] Allow nullable schema in FlightInfo (#46489)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46512" target="_blank" rel="noopener">GH-46512</a> - [CI][C++] Install the llvm package explicitly on MSYS2 (#46525)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46516" target="_blank" rel="noopener">GH-46516</a> - [CI][Python] Force Cython>3.1.1 for docs builds (#46770)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46523" target="_blank" rel="noopener">GH-46523</a> - [GLib] Fix compiler warning: use gsize instead of int (#46524)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46538" target="_blank" rel="noopener">GH-46538</a> - [CI][Packaging][AlmaLinux8] Ensure pip3 (#46539)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46564" target="_blank" rel="noopener">GH-46564</a> - [C++] Export ARROW_VCPKG in ArrowConfig.cmake (#46565)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46576" target="_blank" rel="noopener">GH-46576</a> - [C++] Suppress <code>codecvt_utf8</code> deprecation warning (#46622)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46589" target="_blank" rel="noopener">GH-46589</a> - [C++] Fix utf8_is_digit to support full Unicode digit range (#46590)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46593" target="_blank" rel="noopener">GH-46593</a> - [CI][Integration] Disable nested log grouping (#46594)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46598" target="_blank" rel="noopener">GH-46598</a> - [Dev] Use language name for alias (#46602)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46599" target="_blank" rel="noopener">GH-46599</a> - [C++][Doc][Parquet] Update supported types documentation (#46620)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46605" target="_blank" rel="noopener">GH-46605</a> - [CI][Release][C#] Update download URL for dotnet on verification script (#46612)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46606" target="_blank" rel="noopener">GH-46606</a> - [Python] Do not require numpy when normalizing slice (#46732)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46609" target="_blank" rel="noopener">GH-46609</a> - [Release][CI] Use System GTest for macos verification (#46823)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46610" target="_blank" rel="noopener">GH-46610</a> - [CI][Release] Use Python 3.12 on AlmaLinux 8 (#46621)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46611" target="_blank" rel="noopener">GH-46611</a> - [Python][C++] Allow building float16 arrays without numpy (#46618)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46623" target="_blank" rel="noopener">GH-46623</a> - [C++][Compute] Fix the failure of large memory test in arrow-compute-row-test (#46635)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46636" target="_blank" rel="noopener">GH-46636</a> - [R] Fix evaluation of external objects not in global environment in <code>case_when()</code> (#46667)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46659" target="_blank" rel="noopener">GH-46659</a> - [C++] Fix export of extension arrays with binary view/string view storage (#46660)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46673" target="_blank" rel="noopener">GH-46673</a> - [CI][R][Docs] Accept empty INSTALL_ARGS again (#46682)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46674" target="_blank" rel="noopener">GH-46674</a> - [C++] Construct Array from ExtensionType Scalar (#46675)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46684" target="_blank" rel="noopener">GH-46684</a> - [C++] Fix Meson configuration issue on Windows (#46685)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46688" target="_blank" rel="noopener">GH-46688</a> - [Ruby] Fix a typo (#46689)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46691" target="_blank" rel="noopener">GH-46691</a> - [CI][Packaging] Update platform tag on generated wheel name to match newest auditwheel naming (#46705)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46693" target="_blank" rel="noopener">GH-46693</a> - [CI] Update GitHub hosted runner from deprecated windows-2019 to windows-2022 (#46694)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46704" target="_blank" rel="noopener">GH-46704</a> - [C++] Fix OSS-Fuzz build failure (#46706)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46708" target="_blank" rel="noopener">GH-46708</a> - [C++][Gandiva] Added zero return values for castDECIMAL_utf8 (#46709)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46710" target="_blank" rel="noopener">GH-46710</a> - [C++] Fix ownership and lifetime issues in Dataset Writer (#46711)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46717" target="_blank" rel="noopener">GH-46717</a> - [R][Docs] Add missing "internal" keywords for internal function (#46722)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46724" target="_blank" rel="noopener">GH-46724</a> - [C++][Parquet] OSSFuzz: Prevent from Bad-cast in handling statistics (#46725)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46729" target="_blank" rel="noopener">GH-46729</a> - [Python] Allow constructing InMemoryDataset from RecordBatchReader (#46731)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46736" target="_blank" rel="noopener">GH-46736</a> - [CI] Disable Parquet in conan-minimum (#46744)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46761" target="_blank" rel="noopener">GH-46761</a> - [C++] Add executable detection on FreeBSD (#46759)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46764" target="_blank" rel="noopener">GH-46764</a> - [C++][Gandiva] Fix wrong <code>.bc</code> depends (#46765)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46777" target="_blank" rel="noopener">GH-46777</a> - [C++] Use SimplifyIsIn only when the value_set of the expression is lower than a threshold (#46859)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46782" target="_blank" rel="noopener">GH-46782</a> - [Docs] Link to same version of docs from Implementations page</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46805" target="_blank" rel="noopener">GH-46805</a> - [CI][Dev] Fix caching for R hooks in lint job (#46812)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46809" target="_blank" rel="noopener">GH-46809</a> - [CI][Packaging] Stop trying to add headers from arrow/compu… (#46810)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46811" target="_blank" rel="noopener">GH-46811</a> - [C++][Python] Fix crash on FileReaderImpl::GetRecordBatchReader (#46931)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46816" target="_blank" rel="noopener">GH-46816</a> - [Docs] Fix links to Swift docs and source (#46817)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46827" target="_blank" rel="noopener">GH-46827</a> - [C++] Update Meson Configuration for compute shared lib (#46839)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46831" target="_blank" rel="noopener">GH-46831</a> - [C++][R] Remove some pending references to CMake < 3.25 (docs + minor CMake references) (#46834)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46841" target="_blank" rel="noopener">GH-46841</a> - [C++][Gandiva] Fix date trunc edge case (#46842)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46863" target="_blank" rel="noopener">GH-46863</a> - [CI][C++] Suppress a false positive UBSAN error in AWS SDK for C++ (#46870)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46871" target="_blank" rel="noopener">GH-46871</a> - [C++][Parquet] Restore implementation of 3 arrow::FileReader::GetRecordBatchReader() functions (#46868)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46879" target="_blank" rel="noopener">GH-46879</a> - [CI][Packaging][Linux] Don't check example build with old CMake (#46880)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46888" target="_blank" rel="noopener">GH-46888</a> - [C++] Remove override of default buildtype in Meson config (#46919)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46915" target="_blank" rel="noopener">GH-46915</a> - [C++][Compute] Initialize Compute kernels on benchmarks that require extra kernels (#46922)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46916" target="_blank" rel="noopener">GH-46916</a> - [R] Test for negative fractional dates fails on older R versions due to change in base R as.Date() (#46917)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46920" target="_blank" rel="noopener">GH-46920</a> - [FlightRPC] Fix Flight SQL ColumnMetadata retrieval (#46921)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46934" target="_blank" rel="noopener">GH-46934</a> - [C++][Parquet] Trying to fix ub in AttachStatistics (#46940)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46947" target="_blank" rel="noopener">GH-46947</a> - [R][Packaging] Add src/arrow/flight/sql/odbc to source excludes (#46948)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46964" target="_blank" rel="noopener">GH-46964</a> - [CI][Packaging][Conan] Ensure using upper case for config suffix (#46967)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46986" target="_blank" rel="noopener">GH-46986</a> - [CI][C++] Fix a build error with C++20 (#46987)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46988" target="_blank" rel="noopener">GH-46988</a> - [C++][Parquet] Fix FLBA DecodeArrow multiply overflow (#46991)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46989" target="_blank" rel="noopener">GH-46989</a> - [CI][R] Use Ubuntu 20.04 instead of OpenSUSE for R 4.1 (#46990)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46995" target="_blank" rel="noopener">GH-46995</a> - [CI][R][C++] Use system memory allocator in sanitizer jobs (#47007)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46998" target="_blank" rel="noopener">GH-46998</a> - [C++] Fix mockfs.cc compiling error with C++23 (#46999)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/47015" target="_blank" rel="noopener">GH-47015</a> - [CI][C++] Use mold on conda-cpp to work around issues with GNU ld (#47028)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/47033" target="_blank" rel="noopener">GH-47033</a> - [C++][Compute] Never use custom gtest main with MSVC (#47049)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/47037" target="_blank" rel="noopener">GH-47037</a> - [CI][C++] Fix Fedora 39 CI jobs (#47038)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/47061" target="_blank" rel="noopener">GH-47061</a> - [Release] Fix wrong variable name for signing (#47062)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/47063" target="_blank" rel="noopener">GH-47063</a> - [Release] Define missing RELEASE_TARBALL (#47064)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/47065" target="_blank" rel="noopener">GH-47065</a> - [Release] Fix timeout key in verify_rc.yml (#47066)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/47067" target="_blank" rel="noopener">GH-47067</a> - [Release] Fix wrong GitHub Actions context in verify_rc.yml (#47068)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/47069" target="_blank" rel="noopener">GH-47069</a> - [Release] Add missing "needs: target" (#47070)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/47071" target="_blank" rel="noopener">GH-47071</a> - [Release] Dereference all hard links in source archive (#47072)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/47074" target="_blank" rel="noopener">GH-47074</a> - [Release] Use reproducible mtime for csharp/ in source archive (#47076)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/47078" target="_blank" rel="noopener">GH-47078</a> - [Release] Ensure using cloned apache/arrow for reproducible check (#47079)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/47092" target="_blank" rel="noopener">GH-47092</a> - [Release] Binary verification CI jobs are failing</li> |
| </ul> |
| <h3>New Features and Improvements</h3> |
| <ul> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/25025" target="_blank" rel="noopener">GH-25025</a> - [C++] Move non core compute kernels into separate shared library (#46261)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/26818" target="_blank" rel="noopener">GH-26818</a> - [C++][Python] Preserve order when writing dataset multi-threaded (#44470)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/35419" target="_blank" rel="noopener">GH-35419</a> - [GLib] Add GArrowFixedShapeTensorDataType (#46305)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/35644" target="_blank" rel="noopener">GH-35644</a> - [MATLAB] Add tests verifying <code>arrow.array.<Type>Array.fromMATLAB()</code> throws an exception if given an array with the wrong type. (#47020)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/36753" target="_blank" rel="noopener">GH-36753</a> - [C++] Properly pretty-print and diff HalfFloatArrays (#46857)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/37027" target="_blank" rel="noopener">GH-37027</a> - [C++] Add float16 kernels to if-else and vector-replace functions (#46446)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/37561" target="_blank" rel="noopener">GH-37561</a> - [Ruby] Add empty chunked array tests for Arrow::Table#each_raw_records (#46862)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/37577" target="_blank" rel="noopener">GH-37577</a> - [MATLAB] Create a superclass for <code>DateType</code>-related MATLAB tests (#46923)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/37677" target="_blank" rel="noopener">GH-37677</a> - [C++][FlightRPC] Allow FlightInfo.schema to be nullable</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/37891" target="_blank" rel="noopener">GH-37891</a> - [C++][Parquet] Refine several classes in Parquet encryption (#46202)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/37891" target="_blank" rel="noopener">GH-37891</a> - [C++] Followup Buffer change to use sptr move (#46027)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/38214" target="_blank" rel="noopener">GH-38214</a> - [MATLAB] Add a common <code>arrow.tabular.Tabular</code> MATLAB interface (#47014)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/38369" target="_blank" rel="noopener">GH-38369</a> - [MATLAB] Create utility functions for simplifying management of <code>Proxy</code> instances for <code>Array</code>s (#46907)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/38903" target="_blank" rel="noopener">GH-38903</a> - [R][Docs] Improve documentation of col_types (#46145)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/38914" target="_blank" rel="noopener">GH-38914</a> - [Python] Add EncryptionConfiguration.uniform_encryption (#46347)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/39294" target="_blank" rel="noopener">GH-39294</a> - [C++][Python] DLPack on Tensor class (#42118)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/39759" target="_blank" rel="noopener">GH-39759</a> - [Docs] Update pydata-sphinx-theme to 0.16.1 (#46943)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/40278" target="_blank" rel="noopener">GH-40278</a> - [C++] Support casting string to duration in CSV converter (#46035)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/40343" target="_blank" rel="noopener">GH-40343</a> - [C++] Move S3FileSystem to the registry (#41559)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/40754" target="_blank" rel="noopener">GH-40754</a> - [Python] Expose tls_ca_file_path to S3FileSystem (#45881)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/41496" target="_blank" rel="noopener">GH-41496</a> - [Python][Azure][Docs] Turn on azure on debian-docs (#46892)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/41672" target="_blank" rel="noopener">GH-41672</a> - [Python][Doc] Clarify docstring of FixedSizeListArray.values that it ignores the offset (#46144)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/41973" target="_blank" rel="noopener">GH-41973</a> - Expose new S3 option check_directory_existence_before_creation - manual rebase (#46619)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/42012" target="_blank" rel="noopener">GH-42012</a> - [Python] Add Schema with_field or set_field method (#46348)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/43041" target="_blank" rel="noopener">GH-43041</a> - [C++][Python] Read/write Parquet BYTE_ARRAY as Large/View types directly (#46532)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/43170" target="_blank" rel="noopener">GH-43170</a> - [Swift] Add StructArray support to ArrowWriter (#43439)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/43623" target="_blank" rel="noopener">GH-43623</a> - [R] remove libarrow backwards compatibility enforcement (#46491)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/43807" target="_blank" rel="noopener">GH-43807</a> - [C++][Python] Add UUID extension type conversion support to/from Parquet (#45866)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/43891" target="_blank" rel="noopener">GH-43891</a> - [C++][Parquet] Faster reading of FIXED_LEN_BYTE_ARRAY data (#46886)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/44208" target="_blank" rel="noopener">GH-44208</a> - [R] Adding test to ensure bit64's new semantic works with arrow (#46651)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/44435" target="_blank" rel="noopener">GH-44435</a> - [GLib] Add distinct count support to GArrowArrayStatistics (#46894)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/44500" target="_blank" rel="noopener">GH-44500</a> - [Python][Parquet] Map Parquet logical types to Arrow extension types by default (#46772)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/44900" target="_blank" rel="noopener">GH-44900</a> - [Python] Support explicit <code>fsspec+{protocol}</code> and <code>hf://</code> filesystem URIs (#45089)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/44953" target="_blank" rel="noopener">GH-44953</a> - [R] Add R bindings for new compute functions (#44971)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45028" target="_blank" rel="noopener">GH-45028</a> - [C++][Compute] Allow cast to reorder struct fields (#45246)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45083" target="_blank" rel="noopener">GH-45083</a> - [C++] Add HalfFloat kernels for is_nan, is_inf, is_finite, negate, negate_checked, sign (#46866)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45195" target="_blank" rel="noopener">GH-45195</a> - [C++] Update bundled AWS SDK for C++ to 1.11.587 (#45306)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45229" target="_blank" rel="noopener">GH-45229</a> - [Python] Migrate from scipy.spmatrix to scipy.sparray (#46423)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45229" target="_blank" rel="noopener">GH-45229</a> - [Python] skip scipy.sparse roundtrip tests for float16 (#46413)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45290" target="_blank" rel="noopener">GH-45290</a> - [Docs][Release] Change show_version_warning_banner substitution (#46883)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45522" target="_blank" rel="noopener">GH-45522</a> - [Parquet][C++] Parquet GEOMETRY and GEOGRAPHY logical type implementations (#45459)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45531" target="_blank" rel="noopener">GH-45531</a> - [Python] Add the <code>dim_names</code> argument to <code>from_numpy_ndarray</code> (#46170)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45619" target="_blank" rel="noopener">GH-45619</a> - [Python] Use f-string instead of string.format (#45629)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45643" target="_blank" rel="noopener">GH-45643</a> - [R] Implement hms functions to create and manipulate time of day variables (#46206)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45653" target="_blank" rel="noopener">GH-45653</a> - [Python] Scalar subclasses should implement Python protocols (#45818)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45664" target="_blank" rel="noopener">GH-45664</a> - [C++] Allow LargeString,LargeBinary,FixedSizeBinary,StringView and BinaryView for RecordBatch::MakeStatisticsArray() (#46031)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45713" target="_blank" rel="noopener">GH-45713</a> - [GLib] Add garrow_chunked_array_(import|export)() (#46876)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45750" target="_blank" rel="noopener">GH-45750</a> - [C++][Python][Parquet] Implement Content-Defined Chunking for the Parquet writer (#45360)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45794" target="_blank" rel="noopener">GH-45794</a> - [C++] Add array directory to Meson configuration (#45795)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45796" target="_blank" rel="noopener">GH-45796</a> - [C++] Add integration directory to Meson configuration (#45797)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45798" target="_blank" rel="noopener">GH-45798</a> - [C++] Add extension directory to Meson (#45799)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45800" target="_blank" rel="noopener">GH-45800</a> - [C++] Implement util configuration in Meson (#45824)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45829" target="_blank" rel="noopener">GH-45829</a> - [C++] Add compute directory to Meson configuration (#45830)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45833" target="_blank" rel="noopener">GH-45833</a> - [C++] Add JSON directory to Meson configuration (#45834)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45865" target="_blank" rel="noopener">GH-45865</a> - [C++] Create dedicated benchmark dependency in Meson (#45909)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45908" target="_blank" rel="noopener">GH-45908</a> - [C++][Docs] Rename and expose basic {Array,...}FromJSON helpers as public APIs (#46180)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45957" target="_blank" rel="noopener">GH-45957</a> - [C++][Python] Expose <code>allow_delayed_open</code> on S3FileSystem (#46078)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45978" target="_blank" rel="noopener">GH-45978</a> - [C++] Bump bundled mimalloc version (#45979)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45991" target="_blank" rel="noopener">GH-45991</a> - [C++] Bump bundled nlohmann_json to v3.12.0 (#46112)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/45992" target="_blank" rel="noopener">GH-45992</a> - [C++] Bump bundled utf8proc version to 2.10.0 (#46032)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46019" target="_blank" rel="noopener">GH-46019</a> - [Python] Raise TypeError on feather read_table if columns is not a Sequence (#46038)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46054" target="_blank" rel="noopener">GH-46054</a> - [Python][Packaging] Re-enable pandas on Windows free-threaded wheel (#46109)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46058" target="_blank" rel="noopener">GH-46058</a> - [Python] Run Python in AppVeyor outside of source directory (#46059)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46087" target="_blank" rel="noopener">GH-46087</a> - [FlightSQL] Allow returning column remarks in FlightSQL's CommandGetTables (#46110)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46091" target="_blank" rel="noopener">GH-46091</a> - [C++] Use feature options in Meson configuration (#46204)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46092" target="_blank" rel="noopener">GH-46092</a> - [C++] Add filesystem related options to Meson (#46101)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46104" target="_blank" rel="noopener">GH-46104</a> - GH-45937: [C++][Parquet] Logical type definition for variant</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46115" target="_blank" rel="noopener">GH-46115</a> - [C++] Implement compression libraries in Meson (#46358)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46116" target="_blank" rel="noopener">GH-46116</a> - [C++] Implement IPC directory in Meson (#46117)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46118" target="_blank" rel="noopener">GH-46118</a> - [C++] Add tensor directory to Meson (#46119)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46130" target="_blank" rel="noopener">GH-46130</a> - [Python] Remove <code>use_legacy_format</code> in favour of setting <code>IpcWriteOptions</code> (#46131)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46132" target="_blank" rel="noopener">GH-46132</a> - [C++][Parquet] Remove deprecated parquet APIs from 19.0.0 (#46133)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46141" target="_blank" rel="noopener">GH-46141</a> - [C++] Add flight directory to Meson configuration (#46142)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46153" target="_blank" rel="noopener">GH-46153</a> - [C++] Implement acero directory in Meson (#46154)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46155" target="_blank" rel="noopener">GH-46155</a> - [C++] Implement Tensorflow directory in Meson (#46156)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46163" target="_blank" rel="noopener">GH-46163</a> - [C++] Add vendored directory to Meson (#46164)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46189" target="_blank" rel="noopener">GH-46189</a> - [C#] Use pooled buffers in ArrowStreamWriter (#46190)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46196" target="_blank" rel="noopener">GH-46196</a> - [C++] Remove ARROW_USE_PRECOMPILED_HEADERS and related logic (#46200)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46198" target="_blank" rel="noopener">GH-46198</a> - [Python] Remove deprecated PyExtensionType (#46199)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46207" target="_blank" rel="noopener">GH-46207</a> - [C++] Rename arrow::util::StringBuilder and move to internal namespace (#46813)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46209" target="_blank" rel="noopener">GH-46209</a> - [Documentation][C++][Compute] Add cpp developer documentation for row table (#46210)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46215" target="_blank" rel="noopener">GH-46215</a> - [C++][Docs] Add README for Meson subprojects directory (#46216)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46217" target="_blank" rel="noopener">GH-46217</a> - [C++][Parquet] Update the timestamp of parquet::encryption::TwoLevelCacheWithExpiration correctly (#46283)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46219" target="_blank" rel="noopener">GH-46219</a> - [C++][Parquet] Remove PARQUET_MINIMAL_DEPENDENCY option (#46274)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46222" target="_blank" rel="noopener">GH-46222</a> - [Python] Allow to specify footer metadata when opening IPC file for writing (#46354)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46241" target="_blank" rel="noopener">GH-46241</a> - [Release][Packaging] Add support for regenerating metadata of APT repositories (#46277)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46245" target="_blank" rel="noopener">GH-46245</a> - [Swift] Upgrade <code>FlatBuffers</code> to v25.2.10 (#46246)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46250" target="_blank" rel="noopener">GH-46250</a> - [Swift] Update <code>swift-tools-version</code> to 5.10 (#46252)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46285" target="_blank" rel="noopener">GH-46285</a> - [C++] Add support for Decimal32/64 and HalfFloat to run_end_encode/run_end_decode (#46286)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46289" target="_blank" rel="noopener">GH-46289</a> - [Release][Packaging] Verify APT/Yum repositories keeps working for old versions (#46292)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46290" target="_blank" rel="noopener">GH-46290</a> - [Swift] Upgrade <code>grpc-swift</code> to <code>1.25.0</code> and <code>swift-protobuf</code> to <code>1.29.0</code> (#46291)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46318" target="_blank" rel="noopener">GH-46318</a> - [Docs][C++] Add Extension Array/Type documents (#46319)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46321" target="_blank" rel="noopener">GH-46321</a> - [C++][Doc] Better explain ArrayData IsValid and GetNullCount (#46332)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46336" target="_blank" rel="noopener">GH-46336</a> - [Release][Packaging] Add support for Reproducible Builds for source archive (#46342)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46338" target="_blank" rel="noopener">GH-46338</a> - [C++] Add compile step for Meson in cpp_build.sh (#46339)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46349" target="_blank" rel="noopener">GH-46349</a> - [Python] Move parquet definitions to pyarrow/includes/libparquet.pxd (#46437)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46367" target="_blank" rel="noopener">GH-46367</a> - [C++] Prevent Meson from using git info if built as subproject (#46368)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46373" target="_blank" rel="noopener">GH-46373</a> - [Python] Exercise fallback case on tests for parquet.read_table in case dataset is not available (#46550)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46376" target="_blank" rel="noopener">GH-46376</a> - [Docs] Replace Xitter link with BlueSky link (#46402)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46378" target="_blank" rel="noopener">GH-46378</a> - [Docs] Remove references to autotune from the docs (#46379)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46380" target="_blank" rel="noopener">GH-46380</a> - [GLib] Add GArrowFixedShapeDataType#shape (#46381)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46386" target="_blank" rel="noopener">GH-46386</a> - [C++] Ensure using our CMake packages not Find*.cmake (#46387)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46388" target="_blank" rel="noopener">GH-46388</a> - [C++] Check <code>Snappy::snappy{,-static}</code> in <code>FindSnappyAlt.cmake</code> (#46389)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46396" target="_blank" rel="noopener">GH-46396</a> - [C++][Documentation][Statistics] Revise the documentation to clarify that arrow::ArrayStatistics is ignored during arrow::Array comparisons (#46470)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46398" target="_blank" rel="noopener">GH-46398</a> - [GLib] Add GArrowFixedShapeTensorDataType#n_dimensions (#46399)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46400" target="_blank" rel="noopener">GH-46400</a> - [GLib] Add GArrowFixedShapeDataType#permutation (#46401)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46403" target="_blank" rel="noopener">GH-46403</a> - [C++] Add support for limiting element size when printing data (#46536)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46433" target="_blank" rel="noopener">GH-46433</a> - [GLib] Add GArrowFixedShapeDataType#dim_names (#46434)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46439" target="_blank" rel="noopener">GH-46439</a> - [C++] Use result pattern for all FromJSONString Helpers (#46696)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46439" target="_blank" rel="noopener">GH-46439</a> - [C++] Rename internal Converter class in from_string.cc (#46697)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46439" target="_blank" rel="noopener">GH-46439</a> - [C++] Remove unneeded namespace prefix in test_util_internal.h (#46695)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46444" target="_blank" rel="noopener">GH-46444</a> - [Documentation][C++][Acero] Move internal Swiss table doc into public C++ developer doc (#46445)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46450" target="_blank" rel="noopener">GH-46450</a> - [GLib] Add GArrowFixedShapeDataType#strides (#46451)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46459" target="_blank" rel="noopener">GH-46459</a> - [C++] Make some arrow/util headers internal (#46721)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46462" target="_blank" rel="noopener">GH-46462</a> - [C++][Parquet] Expose currently thrown EncodedStatistics when checking is_stats_set (#46463)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46473" target="_blank" rel="noopener">GH-46473</a> - [C++][Docs] Fix typos in decimal comments (#46474)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46475" target="_blank" rel="noopener">GH-46475</a> - [Documentation][C++][Compute] Consolidate Acero developer docs (#46476)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46477" target="_blank" rel="noopener">GH-46477</a> - [C++] Use vendored flatbuffers in Meson configuration (#46484)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46482" target="_blank" rel="noopener">GH-46482</a> - [CI][Dev] Add shellcheck files without change (#46483)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46487" target="_blank" rel="noopener">GH-46487</a> - [C++] Refactor lz4 from ExternalProject to FetchContent (#46390)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46490" target="_blank" rel="noopener">GH-46490</a> - [CI][Dev] Add shellcheck ci/scripts/install_ccache.sh (#46492)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46494" target="_blank" rel="noopener">GH-46494</a> - [CI][Dev] Add shellcheck files without change (#46495)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46496" target="_blank" rel="noopener">GH-46496</a> - [CI][Dev] Fix shellcheck SC2086 errors in ci/scripts directory (#46497)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46499" target="_blank" rel="noopener">GH-46499</a> - [CI][Crossbow][C++] Use apache/arrow for Meson (#46501)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46500" target="_blank" rel="noopener">GH-46500</a> - [CI][Java] Remove CI scripts for Java (#46502)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46508" target="_blank" rel="noopener">GH-46508</a> - [C++] Upgrade OpenTelemetry cpp to avoid build error on recent Clang (#46509)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46520" target="_blank" rel="noopener">GH-46520</a> - [Docs] Fix variety of warnings and errors in the docs build (#46521)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46522" target="_blank" rel="noopener">GH-46522</a> - [C++][FlightRPC] Add Arrow Flight SQL ODBC driver (#40939)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46526" target="_blank" rel="noopener">GH-46526</a> - [CI][Dev] Fix shellcheck SC2086 and SC2223 errors ci/scripts directory (#46527)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46528" target="_blank" rel="noopener">GH-46528</a> - [CI][Dev] Remove "archery lint" (#46686)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46529" target="_blank" rel="noopener">GH-46529</a> - [C++] Convert static inline type trait functions to constexpr (#46559)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46537" target="_blank" rel="noopener">GH-46537</a> - [Docs][C++] Add RunEndEncodedArray, FlatArray, and PrimitiveArray API Docs (#46540)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46544" target="_blank" rel="noopener">GH-46544</a> - [CI][Dev][Python] Use pre-commit for autopep8 (#46552)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46545" target="_blank" rel="noopener">GH-46545</a> - [CI][Dev][Python] Update pre-commit for cython-lint (#46580)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46546" target="_blank" rel="noopener">GH-46546</a> - [CI][Dev][Python] Use pre-commit for numpydoc (#46595)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46547" target="_blank" rel="noopener">GH-46547</a> - [CI][Dev][R] Use pre-commit for lintr (#46581)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46548" target="_blank" rel="noopener">GH-46548</a> - [CI][Dev][R] Use pre-commit for cpplint (#46549)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46551" target="_blank" rel="noopener">GH-46551</a> - [C++] Use <code>std::string_view</code> for type schema API (#46553)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46556" target="_blank" rel="noopener">GH-46556</a> - [GLib] Add GArrowUUIDDataType (#46558)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46569" target="_blank" rel="noopener">GH-46569</a> - [CI][Integration] Use apache/arrow-js for JS (#46570)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46572" target="_blank" rel="noopener">GH-46572</a> - [Python] expose filter option to python for join (#46566)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46585" target="_blank" rel="noopener">GH-46585</a> - [JS][Dev] Remove dependabot configuration for JS (#46586)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46587" target="_blank" rel="noopener">GH-46587</a> - [CI][JS] Remove JS related test CI (#46588)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46603" target="_blank" rel="noopener">GH-46603</a> - [JS][Release] Remove JavaScript related release code (#46604)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46613" target="_blank" rel="noopener">GH-46613</a> - [GLib] Add GArrowBaseListDataType (#46615)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46632" target="_blank" rel="noopener">GH-46632</a> - [R][Docs] Add docs for arrow::one (#46648)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46633" target="_blank" rel="noopener">GH-46633</a> - [Docs][C++][Python] Update CombineChunks documentation to specify that binary columns can be combined into multiple chunks (#46638)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46642" target="_blank" rel="noopener">GH-46642</a> - [Format] Add footnote clarifying REE layout has O(log n) random access (#46643)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46645" target="_blank" rel="noopener">GH-46645</a> - [CI][Dev][R] Use pre-commit for styler (#46664)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46652" target="_blank" rel="noopener">GH-46652</a> - [Python][Docs] Update language for row_group_size parameter (#46653)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46656" target="_blank" rel="noopener">GH-46656</a> - [CI][Dev] Fix shellcheck SC2034 and SC2086 errors in ci/scripts directory (#46657)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46662" target="_blank" rel="noopener">GH-46662</a> - [CI][Dev] Fix shellcheck SC2148 errors in ci/scripts directory (#46663)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46665" target="_blank" rel="noopener">GH-46665</a> - [CI][Crossbow][C++] Use apache/arrow for Alpine Linux (#46666)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46676" target="_blank" rel="noopener">GH-46676</a> - [C++][Python][Parquet] Allow reading Parquet LIST data as LargeList directly (#46678)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46679" target="_blank" rel="noopener">GH-46679</a> - [C++][Meson] Use WrapDB entry for gflags instead of CMake wrapper (#46680)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46683" target="_blank" rel="noopener">GH-46683</a> - [C++][Python] Add utf8_zero_fill compute function for sign-aware zero padding (#46815)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46699" target="_blank" rel="noopener">GH-46699</a> - [CI][Dev] fix shellcheck errors in the ci/scripts/cpp_test.sh (#46700)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46702" target="_blank" rel="noopener">GH-46702</a> - [JS] Remove js/ (#46703)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46714" target="_blank" rel="noopener">GH-46714</a> - [C++] Use hidden symbol visibility in Meson configuration (#46715)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46719" target="_blank" rel="noopener">GH-46719</a> - [R] Add 32 and 64 bit Decimal types (#46720)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46726" target="_blank" rel="noopener">GH-46726</a> - [CI][Dev] fix shellcheck errors in the ci/scripts/conan_build.sh (#46727)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46740" target="_blank" rel="noopener">GH-46740</a> - [C++] Update bundled Thrift</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46745" target="_blank" rel="noopener">GH-46745</a> - [C++] Update bundled Boost to 1.88.0 and Apache Thrift to 0.22.0 (#46912)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46746" target="_blank" rel="noopener">GH-46746</a> - [C++] Assume AWS SDK >= 1.11.0 (#46742)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46748" target="_blank" rel="noopener">GH-46748</a> - [C++] Initial port on AIX (#46749)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46757" target="_blank" rel="noopener">GH-46757</a> - [CI][Packaging][Conan] Synchronize upstream conan (#46758)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46763" target="_blank" rel="noopener">GH-46763</a> - [CI][Dev] fix shellcheck errors in the ci/scripts/ccache_setup.sh (#46766)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46767" target="_blank" rel="noopener">GH-46767</a> - [C++] Enable EqualOptions::use_atol_ for arrow::Array, arrow::Scalar, arrow::RecordBatch, and arrow::ChuckedArray (#46779)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46771" target="_blank" rel="noopener">GH-46771</a> - [Python][C++] Implement pa.arange function to generate array sequences (#46778)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46773" target="_blank" rel="noopener">GH-46773</a> - [GLib] Add GArrowFixedSizeListDataType (#46774)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46775" target="_blank" rel="noopener">GH-46775</a> - [Docs] Fix navigation issues (#46784)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46785" target="_blank" rel="noopener">GH-46785</a> - [CI][Dev][C++] Suppress needless outputs of cpplint with pre-commit (#46786)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46787" target="_blank" rel="noopener">GH-46787</a> - [CI][Integration] Use Node.js 20 (#46790)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46788" target="_blank" rel="noopener">GH-46788</a> - [C++][Parquet] Enable SIMD for byte stream split with 2 streams (#46789)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46791" target="_blank" rel="noopener">GH-46791</a> - [C++] Add <code>Status::OrElse</code>, <code>IntoStatus<T></code> and <code>ToStatus</code> (#46792)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46794" target="_blank" rel="noopener">GH-46794</a> - [CI][Dev] Fix shellcheck errors in the ci/scripts/csharp_test.sh (#46795)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46798" target="_blank" rel="noopener">GH-46798</a> - [CI][Dev] Add support for pre-commit 2.17.0 (#46799)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46801" target="_blank" rel="noopener">GH-46801</a> - [Dev] Remove some leftovers for Java, Go, JS and Swift on some config files (#46802)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46803" target="_blank" rel="noopener">GH-46803</a> - [Swift] Remove swift implementation from apache/arrow after migration to new repository (#46804)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46806" target="_blank" rel="noopener">GH-46806</a> - [Ci][Dev][Swift] Remove Swift related settings (#46807)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46820" target="_blank" rel="noopener">GH-46820</a> - [CI][Integration] Use Node.js 20 by default (#46821)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46833" target="_blank" rel="noopener">GH-46833</a> - [Python] Expose ConfigureManagedIdentityCredential and ConfigureClientSecretCredential to AzureFileSystem on PyArrow (#46837)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46843" target="_blank" rel="noopener">GH-46843</a> - [C++] Don't use unity build for bundled AWS SDK for C++ (#46845)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46846" target="_blank" rel="noopener">GH-46846</a> - [CI][Dev] Fix shellcheck errors in the ci/scripts/install_dask.sh (#46847)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46854" target="_blank" rel="noopener">GH-46854</a> - [CI][MATLAB][Packaging] Add support for MATLAB <code>R2025a</code> in CI and crossbow packaging workflows (#46855)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46864" target="_blank" rel="noopener">GH-46864</a> - [C++] Add half-float test for <code>ArrayFromJSONString</code> (#46865)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46869" target="_blank" rel="noopener">GH-46869</a> - [C++][Parquet] Deprecate <code>arrow::Status parquet::arrow::FileReadeder::GetRecordBatchReader()</code> (#46932)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46877" target="_blank" rel="noopener">GH-46877</a> - [MATLAB] Add <code>arrow.tabular.Table.fromRecordBatches</code> static method (#46885)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46881" target="_blank" rel="noopener">GH-46881</a> - [CI][Dev] Fix shellcheck errors in the ci/scripts/install_gcs_testbench.sh (#46882)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46895" target="_blank" rel="noopener">GH-46895</a> - [CI][Dev] Fix shellcheck errors in the ci/scripts/install_minio.sh (#46896)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46899" target="_blank" rel="noopener">GH-46899</a> - [CI][Dev] Fix shellcheck errors in the ci/scripts/install_numba.sh (#46900)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46909" target="_blank" rel="noopener">GH-46909</a> - [CI][Dev] Fix shellcheck errors in the ci/scripts/install_sccache.sh (#46910)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46911" target="_blank" rel="noopener">GH-46911</a> - [Packaging] Add support for AlmaLinux 10 (#46933)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46952" target="_blank" rel="noopener">GH-46952</a> - [Packaging] Drop support for CentOS Stream 8 (#46953)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46959" target="_blank" rel="noopener">GH-46959</a> - [Python][Packaging] Drop support for manylinux2014 (#46965)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46968" target="_blank" rel="noopener">GH-46968</a> - [CI][Packaging] Synchronize conan files for 20.0.0 (#46966)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/46974" target="_blank" rel="noopener">GH-46974</a> - [Integration][Archery] Add support for ARROW_JS_ROOT (#46975)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/47025" target="_blank" rel="noopener">GH-47025</a> - [C++][Docs] Increase minimum gcc for building from 7.1 to 9 (#47026)</li> |
| <li> |
| <a href="https://github.com/apache/arrow/issues/47081" target="_blank" rel="noopener">GH-47081</a> - [Release] Revisit reproducible source archive verification</li> |
| </ul> |
| |
| </main> |
| |
| <hr> |
| <footer class="footer"> |
| <div class="row"> |
| <div class="col-md-9"> |
| <p>Apache Arrow, Arrow, Apache, the Apache logo, and the Apache Arrow project logo are either registered trademarks or trademarks of The Apache Software Foundation in the United States and other countries.</p> |
| <p>© 2016-2025 The Apache Software Foundation</p> |
| </div> |
| <div class="col-md-3"> |
| <a class="d-sm-none d-md-inline pr-2" href="https://www.apache.org/events/current-event.html" target="_blank" rel="noopener"> |
| <img src="https://www.apache.org/events/current-event-234x60.png"> |
| </a> |
| </div> |
| </div> |
| </footer> |
| |
| </div> |
| </body> |
| </html> |