blob: 5af306b749008d1f352aaf96c1b418f388e07baa [file] [log] [blame]
<!DOCTYPE html>
<html lang="en-US">
<head>
<meta charset="UTF-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1">
<!-- The above meta tags *must* come first in the head; any other head content must come *after* these tags -->
<title>Apache Arrow 5.0.0 Release | Apache Arrow</title>
<!-- Begin Jekyll SEO tag v2.8.0 -->
<meta name="generator" content="Jekyll v4.4.1" />
<meta property="og:title" content="Apache Arrow 5.0.0 Release" />
<meta property="og:locale" content="en_US" />
<meta name="description" content="Apache Arrow 5.0.0 (29 July 2021) This is a major release covering more than 3 months of development. Download Source Artifacts Binary Artifacts For CentOS For Debian For Python For Ubuntu Git tag Contributors This release includes 684 commits from 99 distinct contributors in 2 Arrow repositories. 77 David Li 43 Krisztián Szűcs 42 Antoine Pitrou 35 Dominik Moritz 33 Nic Crane 27 Weston Pace 27 Sutou Kouhei 27 Andrew Lamb 23 Jonathan Keane 21 Joris Van den Bossche 17 Jorge C. Leitao 17 Ian Cook 16 João Pedro 15 Jiayu Liu 14 Yibo Cai 14 Alessandro Molina 13 Jorge Leitao 12 Ritchie Vink 12 Benjamin Kietzman 10 niranda perera 10 Wakahisa 10 Matthew Topol 9 Anthony Louis 8 Raphael Taylor-Davies 8 Nate Clark 7 Eduardo Ponce 6 Daniël Heres 5 Neal Richardson 5 Navin 5 Jörn Horstmann 5 Andy Grove 4 crystrix 4 Uwe L. Korn 4 Romain Francois 4 Rok 4 Marco Neumann 4 Ádám Lippai 3 liyafan82 3 frank400 3 Ying Zhou 3 Roee Shlomo 3 Michael Edwards 3 Matthijs Brobbel 3 Alenka Frim 2 Zachary Blackwood 2 Wes McKinney 2 Steven 2 P42 2 Michal Nowakiewicz 2 Kazuaki Ishizaki 2 Karik Isichei 2 Jeroen Ooms 2 Gary Pennington 2 Diana Clarke 2 Bryan Cutler 1 sjgupta2 1 sgilmore 1 ptaylor 1 nullptr 1 kazuhiko kikuchi 1 hulunbier 1 baishen 1 Zimo Zhang 1 Yordan Pavlov 1 Will Jones 1 Steven Burns 1 Stephen 1 Shuai Zhang 1 Ray Bell 1 Projjal Chanda 1 Pachamaltese 1 Nick Bruno 1 Micah Kornfield 1 Menno Siekerman 1 Max Meldrum 1 Mauricio Vargas 1 Marc van Heerden 1 Manish Gill 1 Maarten A. Breddels 1 Lorenz Walthert 1 Linan Qiu 1 Laurent Mazare 1 Kornelijus Survila 1 Kevin Gurney 1 Joris Peeters 1 Jinpeng Zhou 1 Jiangtao Peng 1 Hideaki Hayashi 1 Giordon Stark 1 Gang Liao 1 GALI PREM SAGAR 1 Edd Robinson 1 Dorian Kind 1 Dmitry Patsura 1 Chojan Shang 1 Boaz 1 Ben Chambers 1 Alex Baden 1 Adam Johnson Patch Committers The following Apache committers merged contributed patches to Arrow repositories. 96 Antoine Pitrou 69 Sutou Kouhei 63 David Li 59 Krisztián Szűcs 36 Jonathan Keane 33 Benjamin Kietzman 33 Neal Richardson 27 Andrew Lamb 22 Ian Cook 22 Praveen 21 Jorge C. Leitao 19 Yibo Cai 16 Dominik Moritz 16 Joris Van den Bossche 15 Jiayu Liu 12 GitHub 12 Ritchie Vink 11 Micah Kornfield 10 Wakahisa 8 Jorge Leitao 8 Raphael Taylor-Davies 6 Daniël Heres 5 Andy Grove 5 Jörn Horstmann 5 liyafan82 5 Navin 5 Weston Pace 4 Ádám Lippai 4 Marco Neumann 3 Brian Hulette 3 Michael Edwards 3 Roee Shlomo 2 Eric Erhardt 2 Gary Pennington 2 Steven 2 Uwe L. Korn 2 Wes McKinney 1 baishen 1 Ben Chambers 1 Boaz 1 Bryan Cutler 1 Chojan Shang 1 Dmitry Patsura 1 Edd Robinson 1 Gang Liao 1 hulunbier 1 ishizaki 1 Kazuaki Ishizaki 1 kazuhiko kikuchi 1 Kornelijus Survila 1 Laurent Mazare 1 Manish Gill 1 Marc van Heerden 1 Max Meldrum 1 Yordan Pavlov Changelog The following changelog is for the apache/arrow repository. For the Rust implementation of Apache Arrow, see the apache/arrow-rs changelog. Apache Arrow 5.0.0 (2021-07-28) Bug Fixes ARROW-6189 - [Rust] [Parquet] Plain encoded boolean column chunks limited to 2048 values ARROW-6312 - [C++] Declare required Libs.private in arrow.pc package config ARROW-7948 - [Go][Integration] Decimal integration failures ARROW-9594 - [Python] DictionaryArray.to_numpy does not correctly convert null indexes to null values ARROW-10910 - [Python] Segmentation Fault when None given to read_table with legacy dataset ARROW-10958 - [GLib] &quot;Nested data conversions not implemented&quot; through glib, but not through pyarrow ARROW-11077 - [Rust] ParquetFileArrowReader panicks when trying to read nested list ARROW-11146 - [CI][Python] Failing conda-python-3.8-jpype Nightly Build ARROW-11161 - [Python][C++] S3Filesystem: file Content-Type not set correctly? ARROW-11633 - [CI] [Documentation] Maven default skin not found ARROW-11780 - [C++][Python] StructArray.from_arrays() crashes Python interpreter ARROW-11908 - [Rust] Intermittent Flight integration test failures ARROW-12007 - [C++] Loading parquet file returns &quot;Invalid UTF8 payload&quot; error ARROW-12055 - [R] is.na() evaluates to FALSE on Arrow NaN values ARROW-12096 - [Python][C++] Pyarrow Parquet reader overflows INT96 timestamps when converting to Arrow Array (timestamp[ns]) ARROW-12122 - [Python] Cannot install via pip M1 mac ARROW-12142 - [Python] undefined symbol: _ZN5arrow6StatusC1ENS_10StatusCodeERKNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEE ARROW-12150 - [Python] Bad type inference of mixed-precision Decimals ARROW-12232 - [Rust][Datafusion] Error with CAST: Unsupported SQL type Time ARROW-12240 - [Python] invalid-offsetof warning from apple clang-12 ARROW-12377 - [Doc][Java] Java doc build broken ARROW-12407 - [Python] Deprecation warning when building PyArrow ARROW-12431 - [Python] pa.array mask inverted when type is binary and value to be converted is numpy array ARROW-12472 - [Python] read_table fails when passing a PEP519 filesystem object ARROW-12482 - [Doc][Python] Mention CSVStreamingReader pitfalls with type inference ARROW-12491 - [Packaging] Required dependency on LZ4 &gt;= 1.8 missing from CentOS RPM packages ARROW-12503 - [C++] Ensure using &quot;lib/&quot; for jemalloc&#39;s library directory ARROW-12508 - [R] expect_as_vector implementation causes test failure on R &lt;= 3.3 and variables defined outside of test_that break build when no arrow install ARROW-12543 - [CI][Python] Failing conda-python-3.9 Nightly Build ARROW-12568 - [Python][C++] Segfault when casting a sliced ListArray of int64 in v4.0.0 ARROW-12569 - [R] [CI] Run revdep in CI ARROW-12570 - [JS] Fix issues that blocked the v4.0.0 release ARROW-12579 - [Python] Pyarrow 4.0.0 dependency numpy 1.19.4 throws errors on Apple silicon/M1 compilation ARROW-12589 - [C++] Compiling on windows doesn&#39;t work when -DARROW_WITH_BACKTRACE=OFF ARROW-12601 - [R][Packaging] Fix pkg-config check in r/configure ARROW-12604 - [R][Packaging] Dataset, Parquet off in autobrew and CRAN Mac builds ARROW-12605 - [Documentation] Repair line numbers in dataset.rst ARROW-12606 - [C++] Quantile and Mode functions failing on arrays with offset ARROW-12610 - [C++] Skip TestS3FSGeneric TestDeleteDir and TestDeleteDirContents on windows as they are flaky ARROW-12611 - [CI][Python] Nightly test-conda-python-pandas-0.24 is failing due to numpy compat issue ARROW-12613 - [Python] AttributeError when comparing a Scalar with None ARROW-12614 - [C++][Compute] Revert support for Tables in ExecuteScalarExpression ARROW-12617 - [Python] pyarrow.orc.write_table signature reverses that of pyarrow.parquet.write_table ARROW-12620 - [C++] Dataset writing can only include projected columns if input columns are also included ARROW-12622 - [Python] Segfault when reading CSV inside Flight server ARROW-12630 - [Dev][Integration] conda-integration docker build fails ARROW-12639 - [CI][Archery] Archery build fails to create branch ARROW-12640 - [C++] Fix errors from VS 2019 in cpp/src/parquet/types.h ARROW-12642 - [R] LIBARROW_MINIMAL, LIBARROW_DOWNLOAD, NOT_CRAN env vars should not be case-sensitive ARROW-12644 - [C++][Dataset] Support reading date/time-partitioned datasets accounting for URL encoding (Spark) ARROW-12646 - [C++][CI][Packaging][Python] Bump vcpkg version to its latest release ARROW-12663 - [C++] segfault when arrow header is compiled with nvcc 11.2 ARROW-12668 - [C++][Dataset] CountRows occasionally segfaulting ARROW-12670 - [C++] extract_regex gives bizarre behavior after nulls or non-matches ARROW-12672 - [C++] Segfault casting result of &quot;fill_null()&quot; (not bitmap but unknown null_count) ARROW-12679 - [Java] JDBC adapter does not preserve SQL-nullability ARROW-12684 - [Go][Flight] Fix nil dereference in error case ARROW-12708 - [C++] Valgrind errors when calling negate_checked ARROW-12729 - [R] Fix length method for Table, RecordBatch ARROW-12746 - [Go][Flight] Client Auth handler overwrites outgoing metadata ARROW-12756 - [C++] MSVC build fails with latest gtest from vcpkg ARROW-12757 - [Dev][Archery] Warning about RUST variable in &quot;archery docker run&quot; ARROW-12762 - [Python] ListType doesn&#39;t preserve field name after pickle and unpickle ARROW-12769 - [Python] Negative out of range slices yield invalid arrays ARROW-12771 - [C++] Arrow compute hash_count skips following chunked arrays in streaming execution ARROW-12772 - [CI] Merge script test fails due to missing dependency ARROW-12773 - [Docs] Clarify Java support for ORC and Parquet via JNI bindings ARROW-12774 - [C++][Compute] replace_substring_regex() creates invalid arrays =&gt; crash ARROW-12776 - [Archery][Integration] Fix decimal case generation in write_js_test_json ARROW-12779 - [Python][FlightRPC] Flight server segfaults with certain data ARROW-12780 - [CI][C++] MinGW builds failing when trying to build Gandiva ARROW-12790 - [Python] Cannot read from HDFS with blanks in path names ARROW-12793 - [Python] PYARROW_BUILD_TYPE=Debug does not work correctly ARROW-12797 - [JS] Update readme with new links and remove outdated examples ARROW-12798 - [JS] Use == null Comparison ARROW-12799 - [JS] Use Nullish Coalescing Operator (??) For Defaults ARROW-12804 - [C++] Array methods IsNull and IsValid is confused for NullType ARROW-12807 - [C++] Fix merge conflicts with Future refactor/async IPC ARROW-12838 - [Java][Gandiva] Fix JNI CI test for Gandiva ARROW-12842 - [Java][FlightRPC] Error metadata from FlightStatusException is not propagated to client ARROW-12850 - [R] is.nan() evaluates to null on Arrow null values ARROW-12854 - [Dev][Release] Windows wheel verification script fails to download artifacts ARROW-12857 - [C++] hash_aggregate_test not building on master ARROW-12864 - [C++] Remove needless out argument from arrow::internal::InvertBitmap ARROW-12865 - [C++][Python] Python FlightRPC server cannot find RE2 symbols ARROW-12882 - [C++][Gandiva] Fix behavior of convevrt_replace function for empty replacement char ARROW-12887 - [CI] AppVeyor pip install failure during setup ARROW-12906 - [Python] `fill_null` called with a null value seg faults on non fixed-sized types. ARROW-12907 - [Java] Memory leak possible when exception reading from channel happens ARROW-12911 - [Python] Export scalar aggregate options to pc.sum (sum of zero rows gives null; should give 0) ARROW-12917 - [C++][R][pyarrow] Failure importing some decimal types using the C data interface ARROW-12918 - [C++] Build errors with Visual Studio 16.10.31321.278 ARROW-12919 - [Developer Tools] Crossbow comment bot failing to react to comments ARROW-12935 - [C++][CI] Compiler error on some clang versions ARROW-12941 - [C++] csv reader skip_row does not properly update num_rows_seen ARROW-12942 - [C++][Compute] The result of Arrow compute hash_min_max is incorrect if there are new groups in the subsequent chunks ARROW-12956 - [C++] Fix crash on Parquet file (OSS-Fuzz) ARROW-12969 - [C++] match_substring doesn&#39;t match empty needle to empty haystack ARROW-12974 - [R] test-r-without-arrow build fails because of example requiring Arrow ARROW-12983 - [C++][Python] Converter::Extend gets stuck in infinite loop causing OOM if values don&#39;t fit in single chunk ARROW-12987 - [CI] test-ubuntu-18.04 nightly builds are failing due to Gandiva &quot;TestUpper&quot; test failure ARROW-12988 - [CI] The kartothek nightly integration build is failing (test_update_dataset_from_ddf_empty) ARROW-12988 - [CI] The kartothek nightly integration build is failing (test_update_dataset_from_ddf_empty) ARROW-12989 - [CI] &quot;Dev PR&quot; jobs undully cancelled ARROW-12991 - [CI] Travis ARM builds often crash ARROW-12993 - [Python] Address boundary error with invalid Feather file and stackprinter ARROW-12995 - [C++] CSV reader should validate options ARROW-12998 - [C++] Datasets needs dependency on xsimd ARROW-13001 - [Go] Build failure in parquet/internal/bmi on s390x ARROW-13003 - [C++] unaligned access in compute/exec/ cc files ARROW-13008 - [C++] Deprecation warning when compiling minimal example ARROW-13010 - [C++][Compute] Support outputting to slices from kleene kernels ARROW-13018 - [C++][Docs] Use consistent terminology for nulls (min_count) in scalar aggregate kernels ARROW-13026 - [C++][CI] s390x job setup fails ARROW-13037 - [R] Incorrect param when creating Expression crashes R ARROW-13039 - [R] Fix error message handling ARROW-13041 - [C++] Unary kernels can leave uninitialized data under null entries ARROW-13046 - [Release] JS package failing test prior to publish ARROW-13048 - [C++] S3FileSystem fails moving filepaths containing = or + ARROW-13053 - [Python] Build fails on MacOS Big Sur using homebrewed Arrow libraries ARROW-13069 - [Website] Add Daniël to committer list ARROW-13073 - [Developer] archery benchmark list: unexpected keyword &#39;benchmark_filter&#39; ARROW-13080 - [Release] Generate the API docs in ubuntu 20.10 ARROW-13083 - [Python] Wrong SCM version detection both in setup.py and crossbow ARROW-13085 - [Python] Apache Arrow minimal cpp build segfaults with pyarrow libs ARROW-13090 - [Python] Test failure with ffspec 2021.6.0 ARROW-13104 - [C++] ByteStreamSplit implementation uses invalid pointer cast ARROW-13108 - [Python] Pyarrow 4.0.0 crashes upon import on macOS 10.13.6 ARROW-13116 - [R] Test for RecordBatchReader to C-interface fails on arrow-r-minimal due to missing dependencies ARROW-13125 - [R] Throw error when 2+ args passed to desc() in arrange() ARROW-13128 - [C#] TimestampArray conversion logic for nano and micro is wrong ARROW-13135 - [C++] Fix Status propagation in END_PARQUET_CATCH_EXCEPTIONS ARROW-13139 - [C++] ReadaheadGenerator cannot be safely copied/moved ARROW-13145 - [C++][CI] Flight test crashes on MinGW ARROW-13148 - [Dev][Archery] Crossbow build submission fails ARROW-13153 - [C++] `parquet_dataset` loses ordering of files in `_metadata` ARROW-13154 - [C++] Unions can not have 126 and 127 as type_codes ARROW-13169 - [R] [C++] sorted partition keys can cause issues ARROW-13173 - [C++] TestAsyncUtil.ReadaheadFailed asserts occasionally ARROW-13187 - [c++][python] Possibly memory not deallocated when reading in CSV ARROW-13189 - [R] Disable row-level metadata application on datasets ARROW-13203 - [R] Fix optional component checks causing failures ARROW-13207 - [Python][Doc] Dataset documentation still suggests deprecated scan method as the preferred iterative approach ARROW-13216 - [R] Type checks test fails with rtools35 ARROW-13217 - [C++][Gandiva] Correct convert_replace function for invalid chars on string beginning ARROW-13223 - [C++][CI] Fix thread sanitizer failures ARROW-13225 - [Go][Flight] Implement Custom Middleware Interface and Enable Integration Tests ARROW-13229 - [Python] ascii_trim, ascii_ltrim and ascii_rtrim lack options ARROW-13239 - [Doc][Python] Dataset.head function doesn&#39;t mention required argument ARROW-13243 - [R] altrep function call in R 3.5 ARROW-13246 - [C++] CSV skip_rows_after_names can discard data prematurally ARROW-13249 - [Java][CI] Consistent timeout in the Java JNI build ARROW-13253 - [C++][FlightRPC] Segfault when sending record batch &gt;2GB ARROW-13254 - [Python] Processes killed and semaphore objects leaked when reading pandas data ARROW-13265 - [R] cli valgrind errors in nightlies ARROW-13266 - [JS] Improve benchmark names &amp; add suite name to json ARROW-13281 - [C++][Gandiva] Error on timestampDiffMonth function behavior for negative diff values ARROW-13284 - [C++] Wrong pkg_check_modules() option name ARROW-13288 - [Python] Missing default values of kernel options in PyArrow ARROW-13290 - Compilation fails on clang-12 and gcc-11 due to missing include ARROW-13305 - [C++] Unable to install nightly on Ubuntu 21.04 due to CSV options ARROW-13315 - [R] Wrap r_task_group includes with ARROW_R_WITH_ARROW checking ARROW-13321 - [C++][Python] MakeArrayFromScalar doesn&#39;t work for FixedSizeBinaryType ARROW-13324 - [R] Typo in bindings for utf8_reverse and ascii_reverse ARROW-13332 - [C++] TSAN failure in TestAsyncUtil.ReadaheadFailed ARROW-13341 - [C++] Segfault in arrow-compute-plan-test ExecPlanExecution.SourceScalarAggSink ARROW-13350 - [Python][CI] conda-python-3.7-pandas-0.24 nightly build failing in test_extract_datetime_components ARROW-13352 - [C++] Valgrind failure in case_when kernel ARROW-13353 - [Documentation] Build failing with sphinx.util.cfamily.DefinitionError ARROW-13360 - [C++] Missing dependencies in C++ thirdparty offline dependencies versions.txt ARROW-13363 - [R] is.nan() errors on non-floating point data ARROW-13368 - [C++][Doc] Rename project to make_struct in docs ARROW-13381 - [C++] ArrayFromJSON doesn&#39;t work for float value dictionary type ARROW-13382 - [C++] Aggregation over scalars fails autobrew R job ARROW-13384 - [C++] Specify minimum required zstd version in cmake ARROW-13391 - [C++] CSV streaming reader does not include same error information as table reader ARROW-13417 - [C++] The merged generator can sometimes pull from source sync-reentrant ARROW-13419 - [JS] Fix perf tests ARROW-13428 - [C++][Flight] -lssl is missing with bundled gRPC and system shared OpenSSL ARROW-13431 - [Release] Bump go version to 1.15; don&#39;t verify rust source anymore ARROW-13432 - [Release] Fix ssh connection to the binary uploader container ARROW-13450 - [Python][Packaging] Set deployment target to 10.13 for universal2 wheels New Features and Improvements ARROW-2665 - [Python/C++] Add index() method to find first occurence of Python scalar ARROW-3014 - [C++] Minimal writer adapter for ORC file format ARROW-3316 - [R] Multi-threaded conversion from R data.frame to Arrow table / record batch ARROW-5385 - [Go] implement EXTENSION datatype ARROW-5640 - [Go] implement Map array ARROW-6513 - [CI] The conda environment files arrow/ci/conda_env_*.yml should have .txt extension ARROW-6513 - [CI] The conda environment files arrow/ci/conda_env_*.yml should have .txt extension ARROW-7001 - [C++] Develop threading APIs to accommodate nested parallelism ARROW-7114 - [JS][CI] NodeJS build fails on Github Actions Windows node ARROW-7252 - [Rust] [Parquet] Reading UTF-8/JSON/ENUM field results in a lot of vec allocation ARROW-7396 - [Format] Register media types (MIME types) for Apache Arrow formats to IANA ARROW-8421 - [Rust] [Parquet] Implement parquet writer ARROW-8459 - [Dev][Archery] Use a more recent cmake-format ARROW-8527 - [C++][CSV] Add support for ReadOptions::skip_rows &gt;= block_size ARROW-8655 - [C++][Dataset][Python][R] Preserve partitioning information for a discovered Dataset ARROW-8676 - [Rust] Create implementation of IPC RecordBatch body buffer compression from ARROW-300 ARROW-9054 - [C++] Add ScalarAggregateOptions ARROW-9056 - [C++] Support scalar aggregation over scalars ARROW-9140 - [R] Zero-copy Arrow to R where possible ARROW-9295 - [Archery] Support rust clippy in the lint command ARROW-9299 - [Python] Expose ORC metadata() in Python ORCFile ARROW-9313 - [Rust] Use feature enum ARROW-9421 - [C++][Parquet] Redundancies SchemaManifest::GetFieldIndices ARROW-9430 - [C++/Python] Kernel for SetItem(BooleanArray, values) ARROW-9697 - [C++][Dataset] num_rows method for Dataset/Scanner ARROW-10031 - [Java] Support Java benchmark in Archery ARROW-10115 - [C++] CSV empty quoted string is treated as NULL ARROW-10316 - [Python] Consider using __wrapped__ for compute function introspection ARROW-10391 - [Rust] [Parquet] Nested Arrow reader ARROW-10440 - [C++][Dataset][Python] Add a callback to visit file writers just before Finish() ARROW-10550 - [Rust] [Parquet] Write nested types (struct, list) ARROW-10557 - [C++] Add scalar string slicing/substring extract kernel ARROW-10640 - [C++] An &quot;if_else&quot; kernel to combine two arrays based on a mask ARROW-10658 - [Python][Packaging] Wheel builds for Apple Silicon ARROW-10675 - [C++][Python] Support AWS S3 Web identity credentials ARROW-10797 - [C++] Investigate faster random generation for tests and benchmarks ARROW-10926 - [Rust] Add parquet reader / writer for decimal types ARROW-10959 - [C++] Add scalar string join kernel ARROW-11061 - [Rust] Validate array properties against schema ARROW-11173 - Add Map type as reader / writer in FieldReader / FieldWriter ARROW-11199 - [C++][Python] Fix the unit tests for the ORC reader ARROW-11206 - [C++][Compute][Python] Rename &quot;project&quot; kernel to &quot;make_struct&quot; ARROW-11342 - [Python] [Gandiva] Expose ToString and result type information ARROW-11499 - [Packaging] Remove all use of bintray ARROW-11514 - [R][C++] Bindings for paste(), paste0(), str_c() ARROW-11515 - [R] Bindings for strsplit ARROW-11565 - [C++][Gandiva] Modify upper()/lower() to work with UTF8 and add INIT_CAP function ARROW-11608 - [CI] turbodbc integration tests are failing (build isue) ARROW-11660 - [C++] Move RecordBatch::SelectColumns method from R to C++ library ARROW-11673 - [C++] Casting dictionary type to use different index type ARROW-11675 - [CI][C++] Resolve ctest failures on VS 2019 builds ARROW-11705 - [R] Support scalar value recycling in RecordBatch/Table$create() ARROW-11748 - [C++] Ensure Decimal128 and Decimal256&#39;s fields are in native endian order ARROW-11759 - [C++] Kernel to extract datetime components (year, month, day, etc) from timestamp type ARROW-11769 - [R] Pull groups from grouped_df into RecordBatch or Table ARROW-11772 - [C++] Add asynchronous read to ipc::RecordBatchFileReader ARROW-11782 - [GLib][Ruby][Dataset] Remove bindings for internal classes ARROW-11787 - [R] Implement write csv ARROW-11843 - [C++] Add asynchronous read to parquet::arrow::FileReader ARROW-11849 - [R] Use roxygen @examplesIf tag in R docs ARROW-11889 - [C++] Add parallelism to streaming CSV reader ARROW-11909 - [C++] Get rid of MakeIteratorGenerator ARROW-11926 - [R] Pass on the new UCRT CRAN windows builds ARROW-11926 - [R] Pass on the new UCRT CRAN windows builds ARROW-11928 - [C++][Compute] Add ExecNode hierarchy ARROW-11929 - [C++][Compute] Promote Expression to the compute namespace ARROW-11930 - [C++][Dataset][Compute] Refactor Dataset scans to use an ExecNode graph ARROW-11932 - [C++] Provide ArrayBuilder::AppendScalar ARROW-11950 - [C++][Compute] Add unary negative kernel ARROW-11960 - [C++][Gandiva] Support escape in LIKE ARROW-11980 - [Python] Remove &quot;experimental&quot; status from Table.replace_schema_metadata ARROW-11986 - [C++][Gandiva] Implement IN expressions for doubles and floats ARROW-11990 - [C++][Compute] Use Status/Result return consistently to indicate errors ARROW-12004 - [C++] Resultdetail::Empty\ is annoying ARROW-12010 - [C++][Compute] Improve performance of the hash table used in GroupIdentifier ARROW-12016 - [C++] Implement array_sort_indices and sort_indices for BOOL type ARROW-12050 - [C++][Python][FlightRPC] Use StopToken to enable interrupting long Flight operations ARROW-12074 - [C++][Compute] Add scalar arithmetic kernels for decimal inputs ARROW-12083 - [R] schema use in open_dataset ARROW-12166 - [C++][Gandiva] Implements CONVERT_TO(value, type) function ARROW-12184 - [R] Bindings for na.fail, na.omit, na.exclude, na.pass ARROW-12185 - [R] Bindings for any, all ARROW-12198 - [R] bindings for strptime ARROW-12199 - [R] bindings for stddev, variance ARROW-12205 - [C++][Gandiva] Implement TO_TIME([number] secs) and TO_TIMESTAMP([number] secs) function ARROW-12231 - [C++][Dataset] Separate datasets backed by readers from InMemoryDataset ARROW-12253 - [Rust] [Ballista] Implement scalable joins ARROW-12255 - [Rust] [Ballista] Integrate scheduler with DataFusion ARROW-12256 - [Rust] [Ballista] Add DataFrame support ARROW-12257 - [Rust] [Ballista] Publish user guide to Arrow site ARROW-12261 - [Rust] [Ballista] Ballista should not have its own DataFrame API ARROW-12291 - [R] Determine the type of an unevaluated expression ARROW-12310 - [Java] ValueVector#getObject should support covariance for complex types ARROW-12355 - [C++] Implement efficient async CSV scanning ARROW-12362 - [Rust] [DataFusion] topk_query test failure ARROW-12364 - [Python] [Dataset] Add metadata_collector option to ds.write_dataset() ARROW-12378 - [C++][Gandiva] Implement castVARBINARY functions ARROW-12386 - [C++] Support file parallelism in AsyncScanner ARROW-12391 - [Rust][DataFusion] Implement date_trunc() function ARROW-12392 - [C++] Restore asynchronous streaming CSV reader ARROW-12393 - [JS] Optimally use closure compiler ARROW-12403 - [Rust] [Ballista] Integration tests should check that query results are correct ARROW-12415 - [CI] [Python] ERROR: Failed building wheel for pygit2 on ARM64 ARROW-12424 - [Go][Parquet] Add Schema Package ARROW-12428 - [Python] pyarrow.parquet.read_* should use pre_buffer=True ARROW-12434 - [Rust] [Ballista] Show executed plans with metrics ARROW-12442 - [CI] Set job timeouts on GitHub Actions ARROW-12443 - [C++][Gandiva] Implement castVARCHAR function for binary input ARROW-12444 - [RUST] [CI] Remove Rust and point integration tests to arrow-rs repo ARROW-12445 - [Rust] Design and implement packaging process to bundle Rust in signed tar ARROW-12468 - [Python][R] Expose UseAsync to python/R ARROW-12478 - [C++] Support LLVM 12 ARROW-12484 - [CI] Change jinja macros to not require CROSSBOW_TOKEN to upload artifacts in Github Actions ARROW-12489 - [Developer] autotune is broken ARROW-12490 - [Dev] Use miniforge for all platforms ARROW-12492 - [Python] Add an helper method to decode a DictionaryArray back to a plain Array ARROW-12496 - [C++][Dataset] Ensure Scanner tests fully cover async ARROW-12499 - [C++][Compute][R] Add ScalarAggregateOptions to Any and All kernels ARROW-12500 - [C++][Dataset] Consolidate similar tests for file formats ARROW-12501 - [CI][Ruby] Remove needless workaround for MinGW build ARROW-12507 - [CI] Remove duplicated cron/nightly builds ARROW-12512 - [C++][Dataset] Implement CSV writing support ARROW-12514 - [Release] Don&#39;t run Gandiva related Ruby test with ARROW_GANDIVA=OFF ARROW-12517 - [Go] Expose App Metadata in Flight client ARROW-12518 - [Python] Expose Parquet statistics has_null_count / has_distinct_count ARROW-12520 - [R] Minor docs updates ARROW-12522 - [C++] Implement asynchronous/&quot;lazy&quot; variants of ReadRangeCache ARROW-12525 - [JS] Vector toJSON returns an array ARROW-12527 - [Dev] Don&#39;t try getting JIRA information for MINOR PR ARROW-12528 - [JS] Support typed arrays in Table.new ARROW-12530 - [C++] Remove Buffer::mutable_data_ member and use const_cast on data_ only if is_mutable_ is true ARROW-12533 - [C++] Random real generator is slow on Arm64 Linux when built with clang ARROW-12534 - [C++][Gandiva] Implement LEFT and RIGHT functions on Gandiva for string input values ARROW-12537 - [JS] Docs build should not include test sources ARROW-12541 - [Docs] Improve styling/readability of tables in the new doc theme ARROW-12551 - [Java][Release] Java post-release tests fail due to missing testing data ARROW-12554 - Allow duplicates in the value_set for compute::is_in ARROW-12555 - [Java][Release] Java post-release script misses dataset JNI bindings ARROW-12556 - [C++][Gandiva] Implement BYTESUBSTRING functions on Gandiva ARROW-12560 - [C++] Investigate utilizing aggressive thread task creation when adding callback to finished future ARROW-12567 - [C++][Gandiva] Implement LPAD and RPAD functions for string input values ARROW-12567 - [C++][Gandiva] Implement LPAD and RPAD functions for string input values ARROW-12571 - [R][CI] Run nightly R with valgrind ARROW-12575 - [R] Use unary negative kernel ARROW-12577 - [Website] Use Artifactory instead of Bintray in all places ARROW-12578 - [JS] Simplify UTF8 handling in NodeJS ARROW-12581 - [C++][FlightRPC] Benchmark compression with real data ARROW-12584 - [C++][Python] Expose method for benchmarking tools to release unused memory from the allocators ARROW-12591 - [Java][Gandiva] Create single Gandiva jar for MacOS and Linux ARROW-12593 - [Packaging][Ubuntu] Add support for Ubuntu 21.04 ARROW-12597 - [C++] Implement OptionalParallelForAsync ARROW-12598 - [C++][Dataset] Implement row-count for CSV or allow selecting 0 columns from CSV ARROW-12599 - [Doc][Python] Documentation missing for pyarrow.Table ARROW-12600 - [CI] Push docker images from crossbow tasks ARROW-12602 - [R] Add BuildInfo from C++ to arrow_info ARROW-12608 - [C++] Add split_pattern_regex function ARROW-12612 - [C++][Compute] Add Expression to type_fwd.h ARROW-12619 - [Python] pyarrow sdist should not require git ARROW-12621 - [C++][Gandiva] Add alias to sha1 and sha256 functions ARROW-12631 - [Python] pyarrow.dataset.write_table should accept a Scanner to write ARROW-12643 - Add documentation for experimental repos ARROW-12645 - [Python] Fix numpydoc validation ARROW-12648 - [C++][FlightRPC] Allow using TLS in benchmark ARROW-12649 - [Python/Packaging] Move conda-aarch64 to Azure with cross-compilation ARROW-12653 - [Archery] allow me to add a comment to crossbow requests ARROW-12658 - [C++] Bump aws-c-common to v0.5.10 ARROW-12660 - [R] Post-4.0 adjustments for CRAN ARROW-12661 - [C++] CSV add skip rows after column names ARROW-12662 - [Website] Force to use squash merge ARROW-12667 - [Python] Ensure test coverage for conversion of strided numpy arrays ARROW-12675 - [C++] CSV should include line/row numbers in parsing error messages ARROW-12677 - [Python] Add a mask argument to pyarrow.StructArray.from_arrays ARROW-12685 - [C++][Compute] Add unary absolute value kernel ARROW-12686 - [C++][Python][FlightRPC] Support export_to_c in DoGet/inherit from RecordBatchReader ARROW-12687 - [C++][Python][Dataset] Support C Data Interface with Scanner ARROW-12689 - [R] Implement ArrowArrayStream C interface ARROW-12692 - [R] Improve tests and comments for strsplit() bindings ARROW-12694 - [R][CI] rtools35 job failing on 32-bit build tests ARROW-12696 - [R] Improve testing of error messages converted to warnings ARROW-12699 - [CI][Packaging][Java] Generate a jar compatible with Linux and MacOS for all Arrow components ARROW-12701 - [Website][Release] Include Rust contributors, committers, and commits in release notes ARROW-12702 - [JS] Upgrade Webpack and terser ARROW-12703 - [JS] Separate Table from DataFrame ARROW-12704 - [JS] use optional chaining ARROW-12709 - [C++] Add variadic string join kernel ARROW-12713 - [C++] String reverse kernel ARROW-12715 - [C++] SQL-style glob string match kernel ARROW-12716 - [C++] Left/right/center string padding kernels ARROW-12717 - [C++] Substring find position kernel ARROW-12719 - [C++][Python] pyarrow.fs.S3FileSystem pass extra kwargs i.e ACL ARROW-12721 - [CI] Fix path for uploading aarch64 conda artifacts from the nightly builds ARROW-12722 - [R] Raise error when attemping to print table with duplicated naming ARROW-12730 - [MATLAB] Update featherreadmex and featherwritemex to build against latest arrow c++ APIs ARROW-12731 - [R] Use InMemoryDataset for Table/RecordBatch in dplyr code ARROW-12736 - [C++] Eliminate unnecessary copy in FieldPath::Get() ARROW-12738 - [CI] [Gandiva] Nightly build error in azure-conda-osx-clang-py38 (and py39, py*-r*) ARROW-12741 - [CI] Configure GitHub Token for Nightly Builds ARROW-12745 - [C++][Compute] Add floor, ceiling, and truncate kernels ARROW-12749 - [C++] Unnecessary copy cause by constructing RecordBatch/Table/Schema from lvalues ARROW-12750 - [CI] [R] Actually pass parameterized docker options to the templates ARROW-12751 - [C++] Add variadic row-wise min/max kernels (least/greatest) ARROW-12758 - [R] Add examples to more function documentation ARROW-12759 - [C++][Compute] Wrap grouped aggregation in an ExecNode ARROW-12760 - [C++][Python][R] S3FileSystem: IO thread parallelism limited to 8 threads ARROW-12761 - [R] Better error handling for write_to_raw ARROW-12764 - [CI] Fix arguments in Conda Windows builds ARROW-12777 - [R] Convert all inputs to Arrow objects in match_arrow and is_in ARROW-12781 - [R] Implement is.type() functions for dplyr ARROW-12785 - [CI] the r-devdocs build errors when brew installing gcc ARROW-12791 - [R] Better error handling for DatasetFactory$Finish() when no format specified ARROW-12796 - [JS] Support JSON output from benchmarks ARROW-12800 - [JS] Drop IE Support and remove text encoder and decoder polyfills ARROW-12801 - [CI][Packaging][Java] Include all modules in script that generate Arrow jars ARROW-12806 - [Python] test_write_to_dataset_filesystem missing a dataset mark ARROW-12808 - [JS] Document browser support ARROW-12810 - [Python] Run tests with AWS_EC2_METADATA_DISABLED=true ARROW-12812 - [Packaging][Java] Improve JNI jars build ARROW-12824 - [R][CI] Upgrade builds for R 4.1 release ARROW-12827 - [C++] [Dataset] Review error pass-through in the datasets API ARROW-12829 - [GLib][Ruby] Add support for Apache Arrow Flight ARROW-12831 - [CI][macOS] Remove needless Homebrew workaround ARROW-12832 - [JS] Write benchmarks in TypeScript ARROW-12833 - [JS] Construct perf data in JS ARROW-12835 - [C++] Implement case insenstive match in match_substring(_regex) and match_like ARROW-12836 - [C++] Installation on IBM i fails because of CxxFlags ARROW-12841 - [R] Add examples to more function documentation - part 2 ARROW-12843 - [C++][Compute] Add is_inf kernel for floating point arrays ARROW-12848 - [Release] Mail template points to 404 ARROW-12851 - [Go][Parquet] Add Encoding Package Part 1 ARROW-12856 - [C++][Gandiva] Implement castBIT and castBOOLEAN functions on Gandiva ARROW-12859 - [C++] Add ScalarFromJSON for easier testing ARROW-12861 - [C++][Compute] Add sign function kernels ARROW-12867 - [R] Bindings for abs() ARROW-12868 - [R] Bindings for find_substring and find_substring_regex ARROW-12869 - [R] Bindings for utf8_reverse and ascii_reverse ARROW-12870 - [R] Bindings for stringr::str_like ARROW-12875 - [JS] Upgrade Jest and other minor updates ARROW-12883 - [R] [CI] version compatibility fails on R 4.1 ARROW-12891 - [C++][Compute][Dataset] Extract subtree pruning logic to compute:: ARROW-12894 - [R] Bump R version ARROW-12895 - [CI] Use &quot;concurrency&quot; setting on Github Actions ARROW-12898 - [Release][C#] Package upload script is broken ARROW-12900 - [Python][Documentation] an np import in Reading Datasets docs ARROW-12901 - [R] Follow on to more examples ARROW-12909 - [R][Release] Build of ubuntu-docs is failing ARROW-12912 - [Website] Use .asf.yaml for publishing ARROW-12915 - [Release] Build of ubuntu-docs is failing on thrift ARROW-12936 - [C++][Gandiva] Implement ASCII Hive function on Gandiva ARROW-12937 - [C++] Allow specifying default metadata for new S3 files ARROW-12939 - [R] Simplify RTask stop handling ARROW-12940 - [R] Expose C interface as R6 methods ARROW-12948 - [C++] Add string slice replace kernel ARROW-12949 - [C++] Add string starts-with/ends-with kernels ARROW-12950 - [C++] Add substring count kernel ARROW-12951 - [C++] Refactor StringTransform ARROW-12952 - [C++] Add regex count kernel ARROW-12955 - [C++] Add additional type support for if_else kernel ARROW-12957 - [R] rchk issues on cran ARROW-12961 - [C++] MSVC issues warning building PyArrow on Windows ARROW-12962 - [GLib][Ruby] Add Arrow:Scalar ARROW-12964 - [R] Add bindings for ifelse() and if_else() ARROW-12966 - [Python] Expose Python binding for ElementWiseAggregateOptions ARROW-12967 - [R] Add bindings for pmin() and pmax() ARROW-12968 - [R] [CI] Add an rchk job to our nightlies ARROW-12972 - [CI] ][C++] archive_write_add_filter_zstd error on CentOS + ARM64 ARROW-12975 - [C++][Python] if_else kernel doesn&#39;t support upcasting ARROW-12982 - [C++] Re-enable unused-variable warning ARROW-12984 - [C++] Passing options parameter of Count/Index aggregation by reference ARROW-12985 - [Python][Packaging] Unable to install pygit2 in the arm64 wheel builds ARROW-12986 - [C++][Gandiva] Implement new cache eviction policy in Gandiva ARROW-12992 - [R] bindings for substr(), substring(), str_sub() ARROW-12994 - [R] Fix tests that assume UTC local tz ARROW-12996 - [C++] CSV stream reader has no progress indication ARROW-13002 - [C++] Add a check for the utf8proc&#39;s version in CMake ARROW-13005 - [C++] Support filter/take for union data type. ARROW-13006 - [C++][Gandiva] Implement BASE64 and UNBASE64 Hive functions on Gandiva ARROW-13009 - [Doc][Dev] Document builds mailing-list ARROW-13022 - [R] bindings for lubridate&#39;s year, isoyear, quarter, month, day, wday, yday, isoweek, hour, minute, and second functions ARROW-13025 - [C++][Compute] Enhance FunctionOptions with equality, debug representability, and serializability ARROW-13027 - [C++] Fix ASAN stack traces in CI ARROW-13030 - [CI][Go] Setup Arm64 golang CI ARROW-13031 - [JS] Support arm in closure compiler on macOS ARROW-13032 - [Java] Update gauva version ARROW-13034 - [Python][Docs] Update outdated examples for hdfs/azure on the Parquet doc page ARROW-13036 - [Doc] Mention recommended file extension(s) for Arrow IPC ARROW-13042 - [C++] Automatic checks that kernels don&#39;t leave uninitialized data in output ARROW-13043 - [GLib][Ruby] Add GArrowEqualOptions ARROW-13044 - [Java] Union vectors should extend ValueVector ARROW-13045 - [Packaging][RPM][deb] Don&#39;t install system utf8proc if it&#39;s old ARROW-13047 - [Website] Add kiszk to committer list ARROW-13049 - [C++][Gandiva] Implement BIN Hive function on Gandiva ARROW-13050 - [C++][Gandiva] Implement SPACE Hive function on Gandiva ARROW-13054 - [C++] Add option to specify the first day of the week for the &quot;day_of_week&quot; temporal kernel ARROW-13064 - [C++] Add a general &quot;if, ifelse, ..., else&quot; kernel (&quot;CASE WHEN&quot;) ARROW-13065 - [Packaging][RPM] Add missing required LZ4 version information ARROW-13068 - [GLib][Dataset] Change prefix to gadataset_ from gad_ ARROW-13070 - [R] bindings for sd and var ARROW-13072 - [C++] Add bitwise arithmetic compute functions ARROW-13074 - [Python] Start with deprecating ParquetDataset custom attributes ARROW-13075 - [Python] Expose C data interface API for pyarrow.Field ARROW-13076 - [Java] Enable ExtensionType to use StructVector and UnionVector for underlying storage ARROW-13082 - [CI] Forward R argument to ubuntu-docs build ARROW-13086 - [Python] Expose Parquet ArrowReaderProperties::coerce_int96_timestamp_unit_ ARROW-13086 - [Python] Expose Parquet ArrowReaderProperties::coerce_int96_timestamp_unit_ ARROW-13091 - [Python] Add compression_level argument to IpcWriteOptions constructor ARROW-13092 - [C++] CreateDir should fail if the target exists and is not a directory ARROW-13095 - [C++] Implement trigonometric compute functions ARROW-13096 - [C++] Implement logarithm compute functions ARROW-13097 - [C++] Provide a simple reflection utility for s ARROW-13098 - [Dev][Archery] Reorganize docker submodule to its own subpackage ARROW-13100 - [MATLAB] Integrate GoogleTest with MATLAB Interface C++ Code ARROW-13101 - [Python][Doc] pyarrow.FixedSizeListArray does not appear in the documentation ARROW-13110 - [C++] Deadlock can happen when using BackgroundGenerator without transferring callbacks ARROW-13113 - [R] use RTasks to manage parallel in converting arrow to R ARROW-13117 - [R] Retain schema in new Expressions ARROW-13119 - [R] Set empty schema in scalar Expressions ARROW-13124 - [Ruby] Add support for memory view ARROW-13127 - [R] Valgrind nightly errors ARROW-13136 - [C++] Add a &quot;coalesce&quot; variadic scalar kernel ARROW-13137 - [C++][Documentation] Make in-table references consistent ARROW-13140 - [C++/Python] Upgrade libthrift pin in the nightlies ARROW-13142 - [Python] Use vector append when converting from list of non-strided numpy arrays ARROW-13147 - [Java] Respect the rounding policy when allocating vector buffers ARROW-13157 - [C++] Add find_substring_regex kernel and implement ignore_case for find_substring ARROW-13158 - [Python] Fix repr and contains of StructScalar with duplicate field names ARROW-13162 - [C++][Gandiva] Add new alias for extract date functions in Gandiva registry ARROW-13171 - [R] Add binding for str_pad() ARROW-13190 - [C++] [Gandiva] Change behavior of INITCAP function ARROW-13194 - [Java][Document] Create prose document about Java algorithms ARROW-13195 - [R] Problem with rlang reverse dependency checks ARROW-13199 - [R] add ubuntu 21.04 to nightly builds ARROW-13200 - [R] Add binding for case_when() ARROW-13201 - [R] Add binding for coalesce() ARROW-13210 - [Python][CI] Fix vcpkg caching mechanism for the macOS wheels ARROW-13211 - [C++][CI] Remove outdated Github Actions ARM builds ARROW-13212 - [Release] Support deploying to test PyPI in the python post release script ARROW-13215 - [R] [CI] Add ENV TZ to docker files ARROW-13218 - [Doc] Document/clarify conventions for timestamp storage ARROW-13219 - [C++][GLib] Demote/deprecate CompareOptions ARROW-13224 - [Python][Doc] Documentation missing for pyarrow.dataset.write_dataset ARROW-13226 - [Python] Add a general purpose cython trampolining utility ARROW-13228 - [C++] S3 CreateBucket fails because AWS treats us-east-1 differently than other regions ARROW-13230 - Add CSV Writer documentation ARROW-13234 - [C++] Add string padding option to determine which side the extra space goes on ARROW-13235 - [C++] Make type_name equal to options class name for all FunctionOptionTypes ARROW-13236 - [Python] Improve repr of pyarrow.compute.FunctionOptions ARROW-13238 - [C++][Dataset][Compute] Substitute ExecPlan impl for dataset scans ARROW-13242 - [C++] Improve decimal random generation ARROW-13244 - [C++] Add facility to get current thread id ARROW-13258 - [Python] Improve the repr of ParquetFileFragment ARROW-13262 - [R] transmute() fails after pulling data into R ARROW-13273 - [C++] Don&#39;t use .pc only in CMake paths for Requires.private ARROW-13274 - [JS] Remove Webpack ARROW-13275 - [JS] Fix perf tests ARROW-13276 - [GLib][Ruby][Flight] Add support for ListFlights ARROW-13277 - [JS] Add declaration maps ARROW-13280 - [R] Bindings for log and trig functions ARROW-13282 - [C++] Remove obsolete generated files ARROW-13283 - [Developer Tools] Support passing through memory limits in archery docker run ARROW-13286 - [CI] Require docker-compose 1.27.0 or later ARROW-13289 - [C++] Log functions don&#39;t have int kernels ARROW-13291 - [GLib][CI] Require gobject-introspection 3.4.5 or later ARROW-13296 - [C++] Provide reflection-compatible enum replacement ARROW-13299 - [JS] Upgrade ix and rxjs ARROW-13303 - [JS] Revise bundles ARROW-13306 - [Java][JDBC] use ResultSetMetaData.getColumnLabel instead of ResultSetMetaData.getColumnName ARROW-13313 - [C++][Compute] Add ScalarAggregateNode ARROW-13320 - [Website] Add MIME types to FAQ ARROW-13323 - [Archery] Validate docker compose configuration ARROW-13343 - [R] Update NEWS.md for 5.0 ARROW-13346 - [C++] Remove compile time parsing from EnumType ARROW-13355 - [R] ensure that sf is installed in our revdep job ARROW-13357 - [R] bindings for sign() ARROW-13365 - [R] bindings for floor/ceiling/truncate ARROW-13385 - [C++][Compute] Document out-of-source addition to the FunctionRegistry ARROW-13386 - [R][C++] CSV streaming changes break Rtools 35 32-bit build ARROW-13418 - [R] typo in python.r ARROW-13461 - [Python][Packaging] Build M1 wheels for python 3.8 PARQUET-1798 - [C++] Review logic around automatic assignment of field_id&#39;s PARQUET-1998 - [C++] Implement LZ4_RAW compression PARQUET-2056 - [C++] Add ability for retrieving dictionary and indices separately for ColumnReader" />
<meta property="og:description" content="Apache Arrow 5.0.0 (29 July 2021) This is a major release covering more than 3 months of development. Download Source Artifacts Binary Artifacts For CentOS For Debian For Python For Ubuntu Git tag Contributors This release includes 684 commits from 99 distinct contributors in 2 Arrow repositories. 77 David Li 43 Krisztián Szűcs 42 Antoine Pitrou 35 Dominik Moritz 33 Nic Crane 27 Weston Pace 27 Sutou Kouhei 27 Andrew Lamb 23 Jonathan Keane 21 Joris Van den Bossche 17 Jorge C. Leitao 17 Ian Cook 16 João Pedro 15 Jiayu Liu 14 Yibo Cai 14 Alessandro Molina 13 Jorge Leitao 12 Ritchie Vink 12 Benjamin Kietzman 10 niranda perera 10 Wakahisa 10 Matthew Topol 9 Anthony Louis 8 Raphael Taylor-Davies 8 Nate Clark 7 Eduardo Ponce 6 Daniël Heres 5 Neal Richardson 5 Navin 5 Jörn Horstmann 5 Andy Grove 4 crystrix 4 Uwe L. Korn 4 Romain Francois 4 Rok 4 Marco Neumann 4 Ádám Lippai 3 liyafan82 3 frank400 3 Ying Zhou 3 Roee Shlomo 3 Michael Edwards 3 Matthijs Brobbel 3 Alenka Frim 2 Zachary Blackwood 2 Wes McKinney 2 Steven 2 P42 2 Michal Nowakiewicz 2 Kazuaki Ishizaki 2 Karik Isichei 2 Jeroen Ooms 2 Gary Pennington 2 Diana Clarke 2 Bryan Cutler 1 sjgupta2 1 sgilmore 1 ptaylor 1 nullptr 1 kazuhiko kikuchi 1 hulunbier 1 baishen 1 Zimo Zhang 1 Yordan Pavlov 1 Will Jones 1 Steven Burns 1 Stephen 1 Shuai Zhang 1 Ray Bell 1 Projjal Chanda 1 Pachamaltese 1 Nick Bruno 1 Micah Kornfield 1 Menno Siekerman 1 Max Meldrum 1 Mauricio Vargas 1 Marc van Heerden 1 Manish Gill 1 Maarten A. Breddels 1 Lorenz Walthert 1 Linan Qiu 1 Laurent Mazare 1 Kornelijus Survila 1 Kevin Gurney 1 Joris Peeters 1 Jinpeng Zhou 1 Jiangtao Peng 1 Hideaki Hayashi 1 Giordon Stark 1 Gang Liao 1 GALI PREM SAGAR 1 Edd Robinson 1 Dorian Kind 1 Dmitry Patsura 1 Chojan Shang 1 Boaz 1 Ben Chambers 1 Alex Baden 1 Adam Johnson Patch Committers The following Apache committers merged contributed patches to Arrow repositories. 96 Antoine Pitrou 69 Sutou Kouhei 63 David Li 59 Krisztián Szűcs 36 Jonathan Keane 33 Benjamin Kietzman 33 Neal Richardson 27 Andrew Lamb 22 Ian Cook 22 Praveen 21 Jorge C. Leitao 19 Yibo Cai 16 Dominik Moritz 16 Joris Van den Bossche 15 Jiayu Liu 12 GitHub 12 Ritchie Vink 11 Micah Kornfield 10 Wakahisa 8 Jorge Leitao 8 Raphael Taylor-Davies 6 Daniël Heres 5 Andy Grove 5 Jörn Horstmann 5 liyafan82 5 Navin 5 Weston Pace 4 Ádám Lippai 4 Marco Neumann 3 Brian Hulette 3 Michael Edwards 3 Roee Shlomo 2 Eric Erhardt 2 Gary Pennington 2 Steven 2 Uwe L. Korn 2 Wes McKinney 1 baishen 1 Ben Chambers 1 Boaz 1 Bryan Cutler 1 Chojan Shang 1 Dmitry Patsura 1 Edd Robinson 1 Gang Liao 1 hulunbier 1 ishizaki 1 Kazuaki Ishizaki 1 kazuhiko kikuchi 1 Kornelijus Survila 1 Laurent Mazare 1 Manish Gill 1 Marc van Heerden 1 Max Meldrum 1 Yordan Pavlov Changelog The following changelog is for the apache/arrow repository. For the Rust implementation of Apache Arrow, see the apache/arrow-rs changelog. Apache Arrow 5.0.0 (2021-07-28) Bug Fixes ARROW-6189 - [Rust] [Parquet] Plain encoded boolean column chunks limited to 2048 values ARROW-6312 - [C++] Declare required Libs.private in arrow.pc package config ARROW-7948 - [Go][Integration] Decimal integration failures ARROW-9594 - [Python] DictionaryArray.to_numpy does not correctly convert null indexes to null values ARROW-10910 - [Python] Segmentation Fault when None given to read_table with legacy dataset ARROW-10958 - [GLib] &quot;Nested data conversions not implemented&quot; through glib, but not through pyarrow ARROW-11077 - [Rust] ParquetFileArrowReader panicks when trying to read nested list ARROW-11146 - [CI][Python] Failing conda-python-3.8-jpype Nightly Build ARROW-11161 - [Python][C++] S3Filesystem: file Content-Type not set correctly? ARROW-11633 - [CI] [Documentation] Maven default skin not found ARROW-11780 - [C++][Python] StructArray.from_arrays() crashes Python interpreter ARROW-11908 - [Rust] Intermittent Flight integration test failures ARROW-12007 - [C++] Loading parquet file returns &quot;Invalid UTF8 payload&quot; error ARROW-12055 - [R] is.na() evaluates to FALSE on Arrow NaN values ARROW-12096 - [Python][C++] Pyarrow Parquet reader overflows INT96 timestamps when converting to Arrow Array (timestamp[ns]) ARROW-12122 - [Python] Cannot install via pip M1 mac ARROW-12142 - [Python] undefined symbol: _ZN5arrow6StatusC1ENS_10StatusCodeERKNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEE ARROW-12150 - [Python] Bad type inference of mixed-precision Decimals ARROW-12232 - [Rust][Datafusion] Error with CAST: Unsupported SQL type Time ARROW-12240 - [Python] invalid-offsetof warning from apple clang-12 ARROW-12377 - [Doc][Java] Java doc build broken ARROW-12407 - [Python] Deprecation warning when building PyArrow ARROW-12431 - [Python] pa.array mask inverted when type is binary and value to be converted is numpy array ARROW-12472 - [Python] read_table fails when passing a PEP519 filesystem object ARROW-12482 - [Doc][Python] Mention CSVStreamingReader pitfalls with type inference ARROW-12491 - [Packaging] Required dependency on LZ4 &gt;= 1.8 missing from CentOS RPM packages ARROW-12503 - [C++] Ensure using &quot;lib/&quot; for jemalloc&#39;s library directory ARROW-12508 - [R] expect_as_vector implementation causes test failure on R &lt;= 3.3 and variables defined outside of test_that break build when no arrow install ARROW-12543 - [CI][Python] Failing conda-python-3.9 Nightly Build ARROW-12568 - [Python][C++] Segfault when casting a sliced ListArray of int64 in v4.0.0 ARROW-12569 - [R] [CI] Run revdep in CI ARROW-12570 - [JS] Fix issues that blocked the v4.0.0 release ARROW-12579 - [Python] Pyarrow 4.0.0 dependency numpy 1.19.4 throws errors on Apple silicon/M1 compilation ARROW-12589 - [C++] Compiling on windows doesn&#39;t work when -DARROW_WITH_BACKTRACE=OFF ARROW-12601 - [R][Packaging] Fix pkg-config check in r/configure ARROW-12604 - [R][Packaging] Dataset, Parquet off in autobrew and CRAN Mac builds ARROW-12605 - [Documentation] Repair line numbers in dataset.rst ARROW-12606 - [C++] Quantile and Mode functions failing on arrays with offset ARROW-12610 - [C++] Skip TestS3FSGeneric TestDeleteDir and TestDeleteDirContents on windows as they are flaky ARROW-12611 - [CI][Python] Nightly test-conda-python-pandas-0.24 is failing due to numpy compat issue ARROW-12613 - [Python] AttributeError when comparing a Scalar with None ARROW-12614 - [C++][Compute] Revert support for Tables in ExecuteScalarExpression ARROW-12617 - [Python] pyarrow.orc.write_table signature reverses that of pyarrow.parquet.write_table ARROW-12620 - [C++] Dataset writing can only include projected columns if input columns are also included ARROW-12622 - [Python] Segfault when reading CSV inside Flight server ARROW-12630 - [Dev][Integration] conda-integration docker build fails ARROW-12639 - [CI][Archery] Archery build fails to create branch ARROW-12640 - [C++] Fix errors from VS 2019 in cpp/src/parquet/types.h ARROW-12642 - [R] LIBARROW_MINIMAL, LIBARROW_DOWNLOAD, NOT_CRAN env vars should not be case-sensitive ARROW-12644 - [C++][Dataset] Support reading date/time-partitioned datasets accounting for URL encoding (Spark) ARROW-12646 - [C++][CI][Packaging][Python] Bump vcpkg version to its latest release ARROW-12663 - [C++] segfault when arrow header is compiled with nvcc 11.2 ARROW-12668 - [C++][Dataset] CountRows occasionally segfaulting ARROW-12670 - [C++] extract_regex gives bizarre behavior after nulls or non-matches ARROW-12672 - [C++] Segfault casting result of &quot;fill_null()&quot; (not bitmap but unknown null_count) ARROW-12679 - [Java] JDBC adapter does not preserve SQL-nullability ARROW-12684 - [Go][Flight] Fix nil dereference in error case ARROW-12708 - [C++] Valgrind errors when calling negate_checked ARROW-12729 - [R] Fix length method for Table, RecordBatch ARROW-12746 - [Go][Flight] Client Auth handler overwrites outgoing metadata ARROW-12756 - [C++] MSVC build fails with latest gtest from vcpkg ARROW-12757 - [Dev][Archery] Warning about RUST variable in &quot;archery docker run&quot; ARROW-12762 - [Python] ListType doesn&#39;t preserve field name after pickle and unpickle ARROW-12769 - [Python] Negative out of range slices yield invalid arrays ARROW-12771 - [C++] Arrow compute hash_count skips following chunked arrays in streaming execution ARROW-12772 - [CI] Merge script test fails due to missing dependency ARROW-12773 - [Docs] Clarify Java support for ORC and Parquet via JNI bindings ARROW-12774 - [C++][Compute] replace_substring_regex() creates invalid arrays =&gt; crash ARROW-12776 - [Archery][Integration] Fix decimal case generation in write_js_test_json ARROW-12779 - [Python][FlightRPC] Flight server segfaults with certain data ARROW-12780 - [CI][C++] MinGW builds failing when trying to build Gandiva ARROW-12790 - [Python] Cannot read from HDFS with blanks in path names ARROW-12793 - [Python] PYARROW_BUILD_TYPE=Debug does not work correctly ARROW-12797 - [JS] Update readme with new links and remove outdated examples ARROW-12798 - [JS] Use == null Comparison ARROW-12799 - [JS] Use Nullish Coalescing Operator (??) For Defaults ARROW-12804 - [C++] Array methods IsNull and IsValid is confused for NullType ARROW-12807 - [C++] Fix merge conflicts with Future refactor/async IPC ARROW-12838 - [Java][Gandiva] Fix JNI CI test for Gandiva ARROW-12842 - [Java][FlightRPC] Error metadata from FlightStatusException is not propagated to client ARROW-12850 - [R] is.nan() evaluates to null on Arrow null values ARROW-12854 - [Dev][Release] Windows wheel verification script fails to download artifacts ARROW-12857 - [C++] hash_aggregate_test not building on master ARROW-12864 - [C++] Remove needless out argument from arrow::internal::InvertBitmap ARROW-12865 - [C++][Python] Python FlightRPC server cannot find RE2 symbols ARROW-12882 - [C++][Gandiva] Fix behavior of convevrt_replace function for empty replacement char ARROW-12887 - [CI] AppVeyor pip install failure during setup ARROW-12906 - [Python] `fill_null` called with a null value seg faults on non fixed-sized types. ARROW-12907 - [Java] Memory leak possible when exception reading from channel happens ARROW-12911 - [Python] Export scalar aggregate options to pc.sum (sum of zero rows gives null; should give 0) ARROW-12917 - [C++][R][pyarrow] Failure importing some decimal types using the C data interface ARROW-12918 - [C++] Build errors with Visual Studio 16.10.31321.278 ARROW-12919 - [Developer Tools] Crossbow comment bot failing to react to comments ARROW-12935 - [C++][CI] Compiler error on some clang versions ARROW-12941 - [C++] csv reader skip_row does not properly update num_rows_seen ARROW-12942 - [C++][Compute] The result of Arrow compute hash_min_max is incorrect if there are new groups in the subsequent chunks ARROW-12956 - [C++] Fix crash on Parquet file (OSS-Fuzz) ARROW-12969 - [C++] match_substring doesn&#39;t match empty needle to empty haystack ARROW-12974 - [R] test-r-without-arrow build fails because of example requiring Arrow ARROW-12983 - [C++][Python] Converter::Extend gets stuck in infinite loop causing OOM if values don&#39;t fit in single chunk ARROW-12987 - [CI] test-ubuntu-18.04 nightly builds are failing due to Gandiva &quot;TestUpper&quot; test failure ARROW-12988 - [CI] The kartothek nightly integration build is failing (test_update_dataset_from_ddf_empty) ARROW-12988 - [CI] The kartothek nightly integration build is failing (test_update_dataset_from_ddf_empty) ARROW-12989 - [CI] &quot;Dev PR&quot; jobs undully cancelled ARROW-12991 - [CI] Travis ARM builds often crash ARROW-12993 - [Python] Address boundary error with invalid Feather file and stackprinter ARROW-12995 - [C++] CSV reader should validate options ARROW-12998 - [C++] Datasets needs dependency on xsimd ARROW-13001 - [Go] Build failure in parquet/internal/bmi on s390x ARROW-13003 - [C++] unaligned access in compute/exec/ cc files ARROW-13008 - [C++] Deprecation warning when compiling minimal example ARROW-13010 - [C++][Compute] Support outputting to slices from kleene kernels ARROW-13018 - [C++][Docs] Use consistent terminology for nulls (min_count) in scalar aggregate kernels ARROW-13026 - [C++][CI] s390x job setup fails ARROW-13037 - [R] Incorrect param when creating Expression crashes R ARROW-13039 - [R] Fix error message handling ARROW-13041 - [C++] Unary kernels can leave uninitialized data under null entries ARROW-13046 - [Release] JS package failing test prior to publish ARROW-13048 - [C++] S3FileSystem fails moving filepaths containing = or + ARROW-13053 - [Python] Build fails on MacOS Big Sur using homebrewed Arrow libraries ARROW-13069 - [Website] Add Daniël to committer list ARROW-13073 - [Developer] archery benchmark list: unexpected keyword &#39;benchmark_filter&#39; ARROW-13080 - [Release] Generate the API docs in ubuntu 20.10 ARROW-13083 - [Python] Wrong SCM version detection both in setup.py and crossbow ARROW-13085 - [Python] Apache Arrow minimal cpp build segfaults with pyarrow libs ARROW-13090 - [Python] Test failure with ffspec 2021.6.0 ARROW-13104 - [C++] ByteStreamSplit implementation uses invalid pointer cast ARROW-13108 - [Python] Pyarrow 4.0.0 crashes upon import on macOS 10.13.6 ARROW-13116 - [R] Test for RecordBatchReader to C-interface fails on arrow-r-minimal due to missing dependencies ARROW-13125 - [R] Throw error when 2+ args passed to desc() in arrange() ARROW-13128 - [C#] TimestampArray conversion logic for nano and micro is wrong ARROW-13135 - [C++] Fix Status propagation in END_PARQUET_CATCH_EXCEPTIONS ARROW-13139 - [C++] ReadaheadGenerator cannot be safely copied/moved ARROW-13145 - [C++][CI] Flight test crashes on MinGW ARROW-13148 - [Dev][Archery] Crossbow build submission fails ARROW-13153 - [C++] `parquet_dataset` loses ordering of files in `_metadata` ARROW-13154 - [C++] Unions can not have 126 and 127 as type_codes ARROW-13169 - [R] [C++] sorted partition keys can cause issues ARROW-13173 - [C++] TestAsyncUtil.ReadaheadFailed asserts occasionally ARROW-13187 - [c++][python] Possibly memory not deallocated when reading in CSV ARROW-13189 - [R] Disable row-level metadata application on datasets ARROW-13203 - [R] Fix optional component checks causing failures ARROW-13207 - [Python][Doc] Dataset documentation still suggests deprecated scan method as the preferred iterative approach ARROW-13216 - [R] Type checks test fails with rtools35 ARROW-13217 - [C++][Gandiva] Correct convert_replace function for invalid chars on string beginning ARROW-13223 - [C++][CI] Fix thread sanitizer failures ARROW-13225 - [Go][Flight] Implement Custom Middleware Interface and Enable Integration Tests ARROW-13229 - [Python] ascii_trim, ascii_ltrim and ascii_rtrim lack options ARROW-13239 - [Doc][Python] Dataset.head function doesn&#39;t mention required argument ARROW-13243 - [R] altrep function call in R 3.5 ARROW-13246 - [C++] CSV skip_rows_after_names can discard data prematurally ARROW-13249 - [Java][CI] Consistent timeout in the Java JNI build ARROW-13253 - [C++][FlightRPC] Segfault when sending record batch &gt;2GB ARROW-13254 - [Python] Processes killed and semaphore objects leaked when reading pandas data ARROW-13265 - [R] cli valgrind errors in nightlies ARROW-13266 - [JS] Improve benchmark names &amp; add suite name to json ARROW-13281 - [C++][Gandiva] Error on timestampDiffMonth function behavior for negative diff values ARROW-13284 - [C++] Wrong pkg_check_modules() option name ARROW-13288 - [Python] Missing default values of kernel options in PyArrow ARROW-13290 - Compilation fails on clang-12 and gcc-11 due to missing include ARROW-13305 - [C++] Unable to install nightly on Ubuntu 21.04 due to CSV options ARROW-13315 - [R] Wrap r_task_group includes with ARROW_R_WITH_ARROW checking ARROW-13321 - [C++][Python] MakeArrayFromScalar doesn&#39;t work for FixedSizeBinaryType ARROW-13324 - [R] Typo in bindings for utf8_reverse and ascii_reverse ARROW-13332 - [C++] TSAN failure in TestAsyncUtil.ReadaheadFailed ARROW-13341 - [C++] Segfault in arrow-compute-plan-test ExecPlanExecution.SourceScalarAggSink ARROW-13350 - [Python][CI] conda-python-3.7-pandas-0.24 nightly build failing in test_extract_datetime_components ARROW-13352 - [C++] Valgrind failure in case_when kernel ARROW-13353 - [Documentation] Build failing with sphinx.util.cfamily.DefinitionError ARROW-13360 - [C++] Missing dependencies in C++ thirdparty offline dependencies versions.txt ARROW-13363 - [R] is.nan() errors on non-floating point data ARROW-13368 - [C++][Doc] Rename project to make_struct in docs ARROW-13381 - [C++] ArrayFromJSON doesn&#39;t work for float value dictionary type ARROW-13382 - [C++] Aggregation over scalars fails autobrew R job ARROW-13384 - [C++] Specify minimum required zstd version in cmake ARROW-13391 - [C++] CSV streaming reader does not include same error information as table reader ARROW-13417 - [C++] The merged generator can sometimes pull from source sync-reentrant ARROW-13419 - [JS] Fix perf tests ARROW-13428 - [C++][Flight] -lssl is missing with bundled gRPC and system shared OpenSSL ARROW-13431 - [Release] Bump go version to 1.15; don&#39;t verify rust source anymore ARROW-13432 - [Release] Fix ssh connection to the binary uploader container ARROW-13450 - [Python][Packaging] Set deployment target to 10.13 for universal2 wheels New Features and Improvements ARROW-2665 - [Python/C++] Add index() method to find first occurence of Python scalar ARROW-3014 - [C++] Minimal writer adapter for ORC file format ARROW-3316 - [R] Multi-threaded conversion from R data.frame to Arrow table / record batch ARROW-5385 - [Go] implement EXTENSION datatype ARROW-5640 - [Go] implement Map array ARROW-6513 - [CI] The conda environment files arrow/ci/conda_env_*.yml should have .txt extension ARROW-6513 - [CI] The conda environment files arrow/ci/conda_env_*.yml should have .txt extension ARROW-7001 - [C++] Develop threading APIs to accommodate nested parallelism ARROW-7114 - [JS][CI] NodeJS build fails on Github Actions Windows node ARROW-7252 - [Rust] [Parquet] Reading UTF-8/JSON/ENUM field results in a lot of vec allocation ARROW-7396 - [Format] Register media types (MIME types) for Apache Arrow formats to IANA ARROW-8421 - [Rust] [Parquet] Implement parquet writer ARROW-8459 - [Dev][Archery] Use a more recent cmake-format ARROW-8527 - [C++][CSV] Add support for ReadOptions::skip_rows &gt;= block_size ARROW-8655 - [C++][Dataset][Python][R] Preserve partitioning information for a discovered Dataset ARROW-8676 - [Rust] Create implementation of IPC RecordBatch body buffer compression from ARROW-300 ARROW-9054 - [C++] Add ScalarAggregateOptions ARROW-9056 - [C++] Support scalar aggregation over scalars ARROW-9140 - [R] Zero-copy Arrow to R where possible ARROW-9295 - [Archery] Support rust clippy in the lint command ARROW-9299 - [Python] Expose ORC metadata() in Python ORCFile ARROW-9313 - [Rust] Use feature enum ARROW-9421 - [C++][Parquet] Redundancies SchemaManifest::GetFieldIndices ARROW-9430 - [C++/Python] Kernel for SetItem(BooleanArray, values) ARROW-9697 - [C++][Dataset] num_rows method for Dataset/Scanner ARROW-10031 - [Java] Support Java benchmark in Archery ARROW-10115 - [C++] CSV empty quoted string is treated as NULL ARROW-10316 - [Python] Consider using __wrapped__ for compute function introspection ARROW-10391 - [Rust] [Parquet] Nested Arrow reader ARROW-10440 - [C++][Dataset][Python] Add a callback to visit file writers just before Finish() ARROW-10550 - [Rust] [Parquet] Write nested types (struct, list) ARROW-10557 - [C++] Add scalar string slicing/substring extract kernel ARROW-10640 - [C++] An &quot;if_else&quot; kernel to combine two arrays based on a mask ARROW-10658 - [Python][Packaging] Wheel builds for Apple Silicon ARROW-10675 - [C++][Python] Support AWS S3 Web identity credentials ARROW-10797 - [C++] Investigate faster random generation for tests and benchmarks ARROW-10926 - [Rust] Add parquet reader / writer for decimal types ARROW-10959 - [C++] Add scalar string join kernel ARROW-11061 - [Rust] Validate array properties against schema ARROW-11173 - Add Map type as reader / writer in FieldReader / FieldWriter ARROW-11199 - [C++][Python] Fix the unit tests for the ORC reader ARROW-11206 - [C++][Compute][Python] Rename &quot;project&quot; kernel to &quot;make_struct&quot; ARROW-11342 - [Python] [Gandiva] Expose ToString and result type information ARROW-11499 - [Packaging] Remove all use of bintray ARROW-11514 - [R][C++] Bindings for paste(), paste0(), str_c() ARROW-11515 - [R] Bindings for strsplit ARROW-11565 - [C++][Gandiva] Modify upper()/lower() to work with UTF8 and add INIT_CAP function ARROW-11608 - [CI] turbodbc integration tests are failing (build isue) ARROW-11660 - [C++] Move RecordBatch::SelectColumns method from R to C++ library ARROW-11673 - [C++] Casting dictionary type to use different index type ARROW-11675 - [CI][C++] Resolve ctest failures on VS 2019 builds ARROW-11705 - [R] Support scalar value recycling in RecordBatch/Table$create() ARROW-11748 - [C++] Ensure Decimal128 and Decimal256&#39;s fields are in native endian order ARROW-11759 - [C++] Kernel to extract datetime components (year, month, day, etc) from timestamp type ARROW-11769 - [R] Pull groups from grouped_df into RecordBatch or Table ARROW-11772 - [C++] Add asynchronous read to ipc::RecordBatchFileReader ARROW-11782 - [GLib][Ruby][Dataset] Remove bindings for internal classes ARROW-11787 - [R] Implement write csv ARROW-11843 - [C++] Add asynchronous read to parquet::arrow::FileReader ARROW-11849 - [R] Use roxygen @examplesIf tag in R docs ARROW-11889 - [C++] Add parallelism to streaming CSV reader ARROW-11909 - [C++] Get rid of MakeIteratorGenerator ARROW-11926 - [R] Pass on the new UCRT CRAN windows builds ARROW-11926 - [R] Pass on the new UCRT CRAN windows builds ARROW-11928 - [C++][Compute] Add ExecNode hierarchy ARROW-11929 - [C++][Compute] Promote Expression to the compute namespace ARROW-11930 - [C++][Dataset][Compute] Refactor Dataset scans to use an ExecNode graph ARROW-11932 - [C++] Provide ArrayBuilder::AppendScalar ARROW-11950 - [C++][Compute] Add unary negative kernel ARROW-11960 - [C++][Gandiva] Support escape in LIKE ARROW-11980 - [Python] Remove &quot;experimental&quot; status from Table.replace_schema_metadata ARROW-11986 - [C++][Gandiva] Implement IN expressions for doubles and floats ARROW-11990 - [C++][Compute] Use Status/Result return consistently to indicate errors ARROW-12004 - [C++] Resultdetail::Empty\ is annoying ARROW-12010 - [C++][Compute] Improve performance of the hash table used in GroupIdentifier ARROW-12016 - [C++] Implement array_sort_indices and sort_indices for BOOL type ARROW-12050 - [C++][Python][FlightRPC] Use StopToken to enable interrupting long Flight operations ARROW-12074 - [C++][Compute] Add scalar arithmetic kernels for decimal inputs ARROW-12083 - [R] schema use in open_dataset ARROW-12166 - [C++][Gandiva] Implements CONVERT_TO(value, type) function ARROW-12184 - [R] Bindings for na.fail, na.omit, na.exclude, na.pass ARROW-12185 - [R] Bindings for any, all ARROW-12198 - [R] bindings for strptime ARROW-12199 - [R] bindings for stddev, variance ARROW-12205 - [C++][Gandiva] Implement TO_TIME([number] secs) and TO_TIMESTAMP([number] secs) function ARROW-12231 - [C++][Dataset] Separate datasets backed by readers from InMemoryDataset ARROW-12253 - [Rust] [Ballista] Implement scalable joins ARROW-12255 - [Rust] [Ballista] Integrate scheduler with DataFusion ARROW-12256 - [Rust] [Ballista] Add DataFrame support ARROW-12257 - [Rust] [Ballista] Publish user guide to Arrow site ARROW-12261 - [Rust] [Ballista] Ballista should not have its own DataFrame API ARROW-12291 - [R] Determine the type of an unevaluated expression ARROW-12310 - [Java] ValueVector#getObject should support covariance for complex types ARROW-12355 - [C++] Implement efficient async CSV scanning ARROW-12362 - [Rust] [DataFusion] topk_query test failure ARROW-12364 - [Python] [Dataset] Add metadata_collector option to ds.write_dataset() ARROW-12378 - [C++][Gandiva] Implement castVARBINARY functions ARROW-12386 - [C++] Support file parallelism in AsyncScanner ARROW-12391 - [Rust][DataFusion] Implement date_trunc() function ARROW-12392 - [C++] Restore asynchronous streaming CSV reader ARROW-12393 - [JS] Optimally use closure compiler ARROW-12403 - [Rust] [Ballista] Integration tests should check that query results are correct ARROW-12415 - [CI] [Python] ERROR: Failed building wheel for pygit2 on ARM64 ARROW-12424 - [Go][Parquet] Add Schema Package ARROW-12428 - [Python] pyarrow.parquet.read_* should use pre_buffer=True ARROW-12434 - [Rust] [Ballista] Show executed plans with metrics ARROW-12442 - [CI] Set job timeouts on GitHub Actions ARROW-12443 - [C++][Gandiva] Implement castVARCHAR function for binary input ARROW-12444 - [RUST] [CI] Remove Rust and point integration tests to arrow-rs repo ARROW-12445 - [Rust] Design and implement packaging process to bundle Rust in signed tar ARROW-12468 - [Python][R] Expose UseAsync to python/R ARROW-12478 - [C++] Support LLVM 12 ARROW-12484 - [CI] Change jinja macros to not require CROSSBOW_TOKEN to upload artifacts in Github Actions ARROW-12489 - [Developer] autotune is broken ARROW-12490 - [Dev] Use miniforge for all platforms ARROW-12492 - [Python] Add an helper method to decode a DictionaryArray back to a plain Array ARROW-12496 - [C++][Dataset] Ensure Scanner tests fully cover async ARROW-12499 - [C++][Compute][R] Add ScalarAggregateOptions to Any and All kernels ARROW-12500 - [C++][Dataset] Consolidate similar tests for file formats ARROW-12501 - [CI][Ruby] Remove needless workaround for MinGW build ARROW-12507 - [CI] Remove duplicated cron/nightly builds ARROW-12512 - [C++][Dataset] Implement CSV writing support ARROW-12514 - [Release] Don&#39;t run Gandiva related Ruby test with ARROW_GANDIVA=OFF ARROW-12517 - [Go] Expose App Metadata in Flight client ARROW-12518 - [Python] Expose Parquet statistics has_null_count / has_distinct_count ARROW-12520 - [R] Minor docs updates ARROW-12522 - [C++] Implement asynchronous/&quot;lazy&quot; variants of ReadRangeCache ARROW-12525 - [JS] Vector toJSON returns an array ARROW-12527 - [Dev] Don&#39;t try getting JIRA information for MINOR PR ARROW-12528 - [JS] Support typed arrays in Table.new ARROW-12530 - [C++] Remove Buffer::mutable_data_ member and use const_cast on data_ only if is_mutable_ is true ARROW-12533 - [C++] Random real generator is slow on Arm64 Linux when built with clang ARROW-12534 - [C++][Gandiva] Implement LEFT and RIGHT functions on Gandiva for string input values ARROW-12537 - [JS] Docs build should not include test sources ARROW-12541 - [Docs] Improve styling/readability of tables in the new doc theme ARROW-12551 - [Java][Release] Java post-release tests fail due to missing testing data ARROW-12554 - Allow duplicates in the value_set for compute::is_in ARROW-12555 - [Java][Release] Java post-release script misses dataset JNI bindings ARROW-12556 - [C++][Gandiva] Implement BYTESUBSTRING functions on Gandiva ARROW-12560 - [C++] Investigate utilizing aggressive thread task creation when adding callback to finished future ARROW-12567 - [C++][Gandiva] Implement LPAD and RPAD functions for string input values ARROW-12567 - [C++][Gandiva] Implement LPAD and RPAD functions for string input values ARROW-12571 - [R][CI] Run nightly R with valgrind ARROW-12575 - [R] Use unary negative kernel ARROW-12577 - [Website] Use Artifactory instead of Bintray in all places ARROW-12578 - [JS] Simplify UTF8 handling in NodeJS ARROW-12581 - [C++][FlightRPC] Benchmark compression with real data ARROW-12584 - [C++][Python] Expose method for benchmarking tools to release unused memory from the allocators ARROW-12591 - [Java][Gandiva] Create single Gandiva jar for MacOS and Linux ARROW-12593 - [Packaging][Ubuntu] Add support for Ubuntu 21.04 ARROW-12597 - [C++] Implement OptionalParallelForAsync ARROW-12598 - [C++][Dataset] Implement row-count for CSV or allow selecting 0 columns from CSV ARROW-12599 - [Doc][Python] Documentation missing for pyarrow.Table ARROW-12600 - [CI] Push docker images from crossbow tasks ARROW-12602 - [R] Add BuildInfo from C++ to arrow_info ARROW-12608 - [C++] Add split_pattern_regex function ARROW-12612 - [C++][Compute] Add Expression to type_fwd.h ARROW-12619 - [Python] pyarrow sdist should not require git ARROW-12621 - [C++][Gandiva] Add alias to sha1 and sha256 functions ARROW-12631 - [Python] pyarrow.dataset.write_table should accept a Scanner to write ARROW-12643 - Add documentation for experimental repos ARROW-12645 - [Python] Fix numpydoc validation ARROW-12648 - [C++][FlightRPC] Allow using TLS in benchmark ARROW-12649 - [Python/Packaging] Move conda-aarch64 to Azure with cross-compilation ARROW-12653 - [Archery] allow me to add a comment to crossbow requests ARROW-12658 - [C++] Bump aws-c-common to v0.5.10 ARROW-12660 - [R] Post-4.0 adjustments for CRAN ARROW-12661 - [C++] CSV add skip rows after column names ARROW-12662 - [Website] Force to use squash merge ARROW-12667 - [Python] Ensure test coverage for conversion of strided numpy arrays ARROW-12675 - [C++] CSV should include line/row numbers in parsing error messages ARROW-12677 - [Python] Add a mask argument to pyarrow.StructArray.from_arrays ARROW-12685 - [C++][Compute] Add unary absolute value kernel ARROW-12686 - [C++][Python][FlightRPC] Support export_to_c in DoGet/inherit from RecordBatchReader ARROW-12687 - [C++][Python][Dataset] Support C Data Interface with Scanner ARROW-12689 - [R] Implement ArrowArrayStream C interface ARROW-12692 - [R] Improve tests and comments for strsplit() bindings ARROW-12694 - [R][CI] rtools35 job failing on 32-bit build tests ARROW-12696 - [R] Improve testing of error messages converted to warnings ARROW-12699 - [CI][Packaging][Java] Generate a jar compatible with Linux and MacOS for all Arrow components ARROW-12701 - [Website][Release] Include Rust contributors, committers, and commits in release notes ARROW-12702 - [JS] Upgrade Webpack and terser ARROW-12703 - [JS] Separate Table from DataFrame ARROW-12704 - [JS] use optional chaining ARROW-12709 - [C++] Add variadic string join kernel ARROW-12713 - [C++] String reverse kernel ARROW-12715 - [C++] SQL-style glob string match kernel ARROW-12716 - [C++] Left/right/center string padding kernels ARROW-12717 - [C++] Substring find position kernel ARROW-12719 - [C++][Python] pyarrow.fs.S3FileSystem pass extra kwargs i.e ACL ARROW-12721 - [CI] Fix path for uploading aarch64 conda artifacts from the nightly builds ARROW-12722 - [R] Raise error when attemping to print table with duplicated naming ARROW-12730 - [MATLAB] Update featherreadmex and featherwritemex to build against latest arrow c++ APIs ARROW-12731 - [R] Use InMemoryDataset for Table/RecordBatch in dplyr code ARROW-12736 - [C++] Eliminate unnecessary copy in FieldPath::Get() ARROW-12738 - [CI] [Gandiva] Nightly build error in azure-conda-osx-clang-py38 (and py39, py*-r*) ARROW-12741 - [CI] Configure GitHub Token for Nightly Builds ARROW-12745 - [C++][Compute] Add floor, ceiling, and truncate kernels ARROW-12749 - [C++] Unnecessary copy cause by constructing RecordBatch/Table/Schema from lvalues ARROW-12750 - [CI] [R] Actually pass parameterized docker options to the templates ARROW-12751 - [C++] Add variadic row-wise min/max kernels (least/greatest) ARROW-12758 - [R] Add examples to more function documentation ARROW-12759 - [C++][Compute] Wrap grouped aggregation in an ExecNode ARROW-12760 - [C++][Python][R] S3FileSystem: IO thread parallelism limited to 8 threads ARROW-12761 - [R] Better error handling for write_to_raw ARROW-12764 - [CI] Fix arguments in Conda Windows builds ARROW-12777 - [R] Convert all inputs to Arrow objects in match_arrow and is_in ARROW-12781 - [R] Implement is.type() functions for dplyr ARROW-12785 - [CI] the r-devdocs build errors when brew installing gcc ARROW-12791 - [R] Better error handling for DatasetFactory$Finish() when no format specified ARROW-12796 - [JS] Support JSON output from benchmarks ARROW-12800 - [JS] Drop IE Support and remove text encoder and decoder polyfills ARROW-12801 - [CI][Packaging][Java] Include all modules in script that generate Arrow jars ARROW-12806 - [Python] test_write_to_dataset_filesystem missing a dataset mark ARROW-12808 - [JS] Document browser support ARROW-12810 - [Python] Run tests with AWS_EC2_METADATA_DISABLED=true ARROW-12812 - [Packaging][Java] Improve JNI jars build ARROW-12824 - [R][CI] Upgrade builds for R 4.1 release ARROW-12827 - [C++] [Dataset] Review error pass-through in the datasets API ARROW-12829 - [GLib][Ruby] Add support for Apache Arrow Flight ARROW-12831 - [CI][macOS] Remove needless Homebrew workaround ARROW-12832 - [JS] Write benchmarks in TypeScript ARROW-12833 - [JS] Construct perf data in JS ARROW-12835 - [C++] Implement case insenstive match in match_substring(_regex) and match_like ARROW-12836 - [C++] Installation on IBM i fails because of CxxFlags ARROW-12841 - [R] Add examples to more function documentation - part 2 ARROW-12843 - [C++][Compute] Add is_inf kernel for floating point arrays ARROW-12848 - [Release] Mail template points to 404 ARROW-12851 - [Go][Parquet] Add Encoding Package Part 1 ARROW-12856 - [C++][Gandiva] Implement castBIT and castBOOLEAN functions on Gandiva ARROW-12859 - [C++] Add ScalarFromJSON for easier testing ARROW-12861 - [C++][Compute] Add sign function kernels ARROW-12867 - [R] Bindings for abs() ARROW-12868 - [R] Bindings for find_substring and find_substring_regex ARROW-12869 - [R] Bindings for utf8_reverse and ascii_reverse ARROW-12870 - [R] Bindings for stringr::str_like ARROW-12875 - [JS] Upgrade Jest and other minor updates ARROW-12883 - [R] [CI] version compatibility fails on R 4.1 ARROW-12891 - [C++][Compute][Dataset] Extract subtree pruning logic to compute:: ARROW-12894 - [R] Bump R version ARROW-12895 - [CI] Use &quot;concurrency&quot; setting on Github Actions ARROW-12898 - [Release][C#] Package upload script is broken ARROW-12900 - [Python][Documentation] an np import in Reading Datasets docs ARROW-12901 - [R] Follow on to more examples ARROW-12909 - [R][Release] Build of ubuntu-docs is failing ARROW-12912 - [Website] Use .asf.yaml for publishing ARROW-12915 - [Release] Build of ubuntu-docs is failing on thrift ARROW-12936 - [C++][Gandiva] Implement ASCII Hive function on Gandiva ARROW-12937 - [C++] Allow specifying default metadata for new S3 files ARROW-12939 - [R] Simplify RTask stop handling ARROW-12940 - [R] Expose C interface as R6 methods ARROW-12948 - [C++] Add string slice replace kernel ARROW-12949 - [C++] Add string starts-with/ends-with kernels ARROW-12950 - [C++] Add substring count kernel ARROW-12951 - [C++] Refactor StringTransform ARROW-12952 - [C++] Add regex count kernel ARROW-12955 - [C++] Add additional type support for if_else kernel ARROW-12957 - [R] rchk issues on cran ARROW-12961 - [C++] MSVC issues warning building PyArrow on Windows ARROW-12962 - [GLib][Ruby] Add Arrow:Scalar ARROW-12964 - [R] Add bindings for ifelse() and if_else() ARROW-12966 - [Python] Expose Python binding for ElementWiseAggregateOptions ARROW-12967 - [R] Add bindings for pmin() and pmax() ARROW-12968 - [R] [CI] Add an rchk job to our nightlies ARROW-12972 - [CI] ][C++] archive_write_add_filter_zstd error on CentOS + ARM64 ARROW-12975 - [C++][Python] if_else kernel doesn&#39;t support upcasting ARROW-12982 - [C++] Re-enable unused-variable warning ARROW-12984 - [C++] Passing options parameter of Count/Index aggregation by reference ARROW-12985 - [Python][Packaging] Unable to install pygit2 in the arm64 wheel builds ARROW-12986 - [C++][Gandiva] Implement new cache eviction policy in Gandiva ARROW-12992 - [R] bindings for substr(), substring(), str_sub() ARROW-12994 - [R] Fix tests that assume UTC local tz ARROW-12996 - [C++] CSV stream reader has no progress indication ARROW-13002 - [C++] Add a check for the utf8proc&#39;s version in CMake ARROW-13005 - [C++] Support filter/take for union data type. ARROW-13006 - [C++][Gandiva] Implement BASE64 and UNBASE64 Hive functions on Gandiva ARROW-13009 - [Doc][Dev] Document builds mailing-list ARROW-13022 - [R] bindings for lubridate&#39;s year, isoyear, quarter, month, day, wday, yday, isoweek, hour, minute, and second functions ARROW-13025 - [C++][Compute] Enhance FunctionOptions with equality, debug representability, and serializability ARROW-13027 - [C++] Fix ASAN stack traces in CI ARROW-13030 - [CI][Go] Setup Arm64 golang CI ARROW-13031 - [JS] Support arm in closure compiler on macOS ARROW-13032 - [Java] Update gauva version ARROW-13034 - [Python][Docs] Update outdated examples for hdfs/azure on the Parquet doc page ARROW-13036 - [Doc] Mention recommended file extension(s) for Arrow IPC ARROW-13042 - [C++] Automatic checks that kernels don&#39;t leave uninitialized data in output ARROW-13043 - [GLib][Ruby] Add GArrowEqualOptions ARROW-13044 - [Java] Union vectors should extend ValueVector ARROW-13045 - [Packaging][RPM][deb] Don&#39;t install system utf8proc if it&#39;s old ARROW-13047 - [Website] Add kiszk to committer list ARROW-13049 - [C++][Gandiva] Implement BIN Hive function on Gandiva ARROW-13050 - [C++][Gandiva] Implement SPACE Hive function on Gandiva ARROW-13054 - [C++] Add option to specify the first day of the week for the &quot;day_of_week&quot; temporal kernel ARROW-13064 - [C++] Add a general &quot;if, ifelse, ..., else&quot; kernel (&quot;CASE WHEN&quot;) ARROW-13065 - [Packaging][RPM] Add missing required LZ4 version information ARROW-13068 - [GLib][Dataset] Change prefix to gadataset_ from gad_ ARROW-13070 - [R] bindings for sd and var ARROW-13072 - [C++] Add bitwise arithmetic compute functions ARROW-13074 - [Python] Start with deprecating ParquetDataset custom attributes ARROW-13075 - [Python] Expose C data interface API for pyarrow.Field ARROW-13076 - [Java] Enable ExtensionType to use StructVector and UnionVector for underlying storage ARROW-13082 - [CI] Forward R argument to ubuntu-docs build ARROW-13086 - [Python] Expose Parquet ArrowReaderProperties::coerce_int96_timestamp_unit_ ARROW-13086 - [Python] Expose Parquet ArrowReaderProperties::coerce_int96_timestamp_unit_ ARROW-13091 - [Python] Add compression_level argument to IpcWriteOptions constructor ARROW-13092 - [C++] CreateDir should fail if the target exists and is not a directory ARROW-13095 - [C++] Implement trigonometric compute functions ARROW-13096 - [C++] Implement logarithm compute functions ARROW-13097 - [C++] Provide a simple reflection utility for s ARROW-13098 - [Dev][Archery] Reorganize docker submodule to its own subpackage ARROW-13100 - [MATLAB] Integrate GoogleTest with MATLAB Interface C++ Code ARROW-13101 - [Python][Doc] pyarrow.FixedSizeListArray does not appear in the documentation ARROW-13110 - [C++] Deadlock can happen when using BackgroundGenerator without transferring callbacks ARROW-13113 - [R] use RTasks to manage parallel in converting arrow to R ARROW-13117 - [R] Retain schema in new Expressions ARROW-13119 - [R] Set empty schema in scalar Expressions ARROW-13124 - [Ruby] Add support for memory view ARROW-13127 - [R] Valgrind nightly errors ARROW-13136 - [C++] Add a &quot;coalesce&quot; variadic scalar kernel ARROW-13137 - [C++][Documentation] Make in-table references consistent ARROW-13140 - [C++/Python] Upgrade libthrift pin in the nightlies ARROW-13142 - [Python] Use vector append when converting from list of non-strided numpy arrays ARROW-13147 - [Java] Respect the rounding policy when allocating vector buffers ARROW-13157 - [C++] Add find_substring_regex kernel and implement ignore_case for find_substring ARROW-13158 - [Python] Fix repr and contains of StructScalar with duplicate field names ARROW-13162 - [C++][Gandiva] Add new alias for extract date functions in Gandiva registry ARROW-13171 - [R] Add binding for str_pad() ARROW-13190 - [C++] [Gandiva] Change behavior of INITCAP function ARROW-13194 - [Java][Document] Create prose document about Java algorithms ARROW-13195 - [R] Problem with rlang reverse dependency checks ARROW-13199 - [R] add ubuntu 21.04 to nightly builds ARROW-13200 - [R] Add binding for case_when() ARROW-13201 - [R] Add binding for coalesce() ARROW-13210 - [Python][CI] Fix vcpkg caching mechanism for the macOS wheels ARROW-13211 - [C++][CI] Remove outdated Github Actions ARM builds ARROW-13212 - [Release] Support deploying to test PyPI in the python post release script ARROW-13215 - [R] [CI] Add ENV TZ to docker files ARROW-13218 - [Doc] Document/clarify conventions for timestamp storage ARROW-13219 - [C++][GLib] Demote/deprecate CompareOptions ARROW-13224 - [Python][Doc] Documentation missing for pyarrow.dataset.write_dataset ARROW-13226 - [Python] Add a general purpose cython trampolining utility ARROW-13228 - [C++] S3 CreateBucket fails because AWS treats us-east-1 differently than other regions ARROW-13230 - Add CSV Writer documentation ARROW-13234 - [C++] Add string padding option to determine which side the extra space goes on ARROW-13235 - [C++] Make type_name equal to options class name for all FunctionOptionTypes ARROW-13236 - [Python] Improve repr of pyarrow.compute.FunctionOptions ARROW-13238 - [C++][Dataset][Compute] Substitute ExecPlan impl for dataset scans ARROW-13242 - [C++] Improve decimal random generation ARROW-13244 - [C++] Add facility to get current thread id ARROW-13258 - [Python] Improve the repr of ParquetFileFragment ARROW-13262 - [R] transmute() fails after pulling data into R ARROW-13273 - [C++] Don&#39;t use .pc only in CMake paths for Requires.private ARROW-13274 - [JS] Remove Webpack ARROW-13275 - [JS] Fix perf tests ARROW-13276 - [GLib][Ruby][Flight] Add support for ListFlights ARROW-13277 - [JS] Add declaration maps ARROW-13280 - [R] Bindings for log and trig functions ARROW-13282 - [C++] Remove obsolete generated files ARROW-13283 - [Developer Tools] Support passing through memory limits in archery docker run ARROW-13286 - [CI] Require docker-compose 1.27.0 or later ARROW-13289 - [C++] Log functions don&#39;t have int kernels ARROW-13291 - [GLib][CI] Require gobject-introspection 3.4.5 or later ARROW-13296 - [C++] Provide reflection-compatible enum replacement ARROW-13299 - [JS] Upgrade ix and rxjs ARROW-13303 - [JS] Revise bundles ARROW-13306 - [Java][JDBC] use ResultSetMetaData.getColumnLabel instead of ResultSetMetaData.getColumnName ARROW-13313 - [C++][Compute] Add ScalarAggregateNode ARROW-13320 - [Website] Add MIME types to FAQ ARROW-13323 - [Archery] Validate docker compose configuration ARROW-13343 - [R] Update NEWS.md for 5.0 ARROW-13346 - [C++] Remove compile time parsing from EnumType ARROW-13355 - [R] ensure that sf is installed in our revdep job ARROW-13357 - [R] bindings for sign() ARROW-13365 - [R] bindings for floor/ceiling/truncate ARROW-13385 - [C++][Compute] Document out-of-source addition to the FunctionRegistry ARROW-13386 - [R][C++] CSV streaming changes break Rtools 35 32-bit build ARROW-13418 - [R] typo in python.r ARROW-13461 - [Python][Packaging] Build M1 wheels for python 3.8 PARQUET-1798 - [C++] Review logic around automatic assignment of field_id&#39;s PARQUET-1998 - [C++] Implement LZ4_RAW compression PARQUET-2056 - [C++] Add ability for retrieving dictionary and indices separately for ColumnReader" />
<link rel="canonical" href="https://arrow.apache.org/release/5.0.0.html" />
<meta property="og:url" content="https://arrow.apache.org/release/5.0.0.html" />
<meta property="og:site_name" content="Apache Arrow" />
<meta property="og:image" content="https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png" />
<meta property="og:type" content="article" />
<meta property="article:published_time" content="2021-07-29T00:00:00-04:00" />
<meta name="twitter:card" content="summary_large_image" />
<meta property="twitter:image" content="https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png" />
<meta property="twitter:title" content="Apache Arrow 5.0.0 Release" />
<script type="application/ld+json">
{"@context":"https://schema.org","@type":"BlogPosting","dateModified":"2021-07-29T00:00:00-04:00","datePublished":"2021-07-29T00:00:00-04:00","description":"Apache Arrow 5.0.0 (29 July 2021) This is a major release covering more than 3 months of development. Download Source Artifacts Binary Artifacts For CentOS For Debian For Python For Ubuntu Git tag Contributors This release includes 684 commits from 99 distinct contributors in 2 Arrow repositories. 77 David Li 43 Krisztián Szűcs 42 Antoine Pitrou 35 Dominik Moritz 33 Nic Crane 27 Weston Pace 27 Sutou Kouhei 27 Andrew Lamb 23 Jonathan Keane 21 Joris Van den Bossche 17 Jorge C. Leitao 17 Ian Cook 16 João Pedro 15 Jiayu Liu 14 Yibo Cai 14 Alessandro Molina 13 Jorge Leitao 12 Ritchie Vink 12 Benjamin Kietzman 10 niranda perera 10 Wakahisa 10 Matthew Topol 9 Anthony Louis 8 Raphael Taylor-Davies 8 Nate Clark 7 Eduardo Ponce 6 Daniël Heres 5 Neal Richardson 5 Navin 5 Jörn Horstmann 5 Andy Grove 4 crystrix 4 Uwe L. Korn 4 Romain Francois 4 Rok 4 Marco Neumann 4 Ádám Lippai 3 liyafan82 3 frank400 3 Ying Zhou 3 Roee Shlomo 3 Michael Edwards 3 Matthijs Brobbel 3 Alenka Frim 2 Zachary Blackwood 2 Wes McKinney 2 Steven 2 P42 2 Michal Nowakiewicz 2 Kazuaki Ishizaki 2 Karik Isichei 2 Jeroen Ooms 2 Gary Pennington 2 Diana Clarke 2 Bryan Cutler 1 sjgupta2 1 sgilmore 1 ptaylor 1 nullptr 1 kazuhiko kikuchi 1 hulunbier 1 baishen 1 Zimo Zhang 1 Yordan Pavlov 1 Will Jones 1 Steven Burns 1 Stephen 1 Shuai Zhang 1 Ray Bell 1 Projjal Chanda 1 Pachamaltese 1 Nick Bruno 1 Micah Kornfield 1 Menno Siekerman 1 Max Meldrum 1 Mauricio Vargas 1 Marc van Heerden 1 Manish Gill 1 Maarten A. Breddels 1 Lorenz Walthert 1 Linan Qiu 1 Laurent Mazare 1 Kornelijus Survila 1 Kevin Gurney 1 Joris Peeters 1 Jinpeng Zhou 1 Jiangtao Peng 1 Hideaki Hayashi 1 Giordon Stark 1 Gang Liao 1 GALI PREM SAGAR 1 Edd Robinson 1 Dorian Kind 1 Dmitry Patsura 1 Chojan Shang 1 Boaz 1 Ben Chambers 1 Alex Baden 1 Adam Johnson Patch Committers The following Apache committers merged contributed patches to Arrow repositories. 96 Antoine Pitrou 69 Sutou Kouhei 63 David Li 59 Krisztián Szűcs 36 Jonathan Keane 33 Benjamin Kietzman 33 Neal Richardson 27 Andrew Lamb 22 Ian Cook 22 Praveen 21 Jorge C. Leitao 19 Yibo Cai 16 Dominik Moritz 16 Joris Van den Bossche 15 Jiayu Liu 12 GitHub 12 Ritchie Vink 11 Micah Kornfield 10 Wakahisa 8 Jorge Leitao 8 Raphael Taylor-Davies 6 Daniël Heres 5 Andy Grove 5 Jörn Horstmann 5 liyafan82 5 Navin 5 Weston Pace 4 Ádám Lippai 4 Marco Neumann 3 Brian Hulette 3 Michael Edwards 3 Roee Shlomo 2 Eric Erhardt 2 Gary Pennington 2 Steven 2 Uwe L. Korn 2 Wes McKinney 1 baishen 1 Ben Chambers 1 Boaz 1 Bryan Cutler 1 Chojan Shang 1 Dmitry Patsura 1 Edd Robinson 1 Gang Liao 1 hulunbier 1 ishizaki 1 Kazuaki Ishizaki 1 kazuhiko kikuchi 1 Kornelijus Survila 1 Laurent Mazare 1 Manish Gill 1 Marc van Heerden 1 Max Meldrum 1 Yordan Pavlov Changelog The following changelog is for the apache/arrow repository. For the Rust implementation of Apache Arrow, see the apache/arrow-rs changelog. Apache Arrow 5.0.0 (2021-07-28) Bug Fixes ARROW-6189 - [Rust] [Parquet] Plain encoded boolean column chunks limited to 2048 values ARROW-6312 - [C++] Declare required Libs.private in arrow.pc package config ARROW-7948 - [Go][Integration] Decimal integration failures ARROW-9594 - [Python] DictionaryArray.to_numpy does not correctly convert null indexes to null values ARROW-10910 - [Python] Segmentation Fault when None given to read_table with legacy dataset ARROW-10958 - [GLib] &quot;Nested data conversions not implemented&quot; through glib, but not through pyarrow ARROW-11077 - [Rust] ParquetFileArrowReader panicks when trying to read nested list ARROW-11146 - [CI][Python] Failing conda-python-3.8-jpype Nightly Build ARROW-11161 - [Python][C++] S3Filesystem: file Content-Type not set correctly? ARROW-11633 - [CI] [Documentation] Maven default skin not found ARROW-11780 - [C++][Python] StructArray.from_arrays() crashes Python interpreter ARROW-11908 - [Rust] Intermittent Flight integration test failures ARROW-12007 - [C++] Loading parquet file returns &quot;Invalid UTF8 payload&quot; error ARROW-12055 - [R] is.na() evaluates to FALSE on Arrow NaN values ARROW-12096 - [Python][C++] Pyarrow Parquet reader overflows INT96 timestamps when converting to Arrow Array (timestamp[ns]) ARROW-12122 - [Python] Cannot install via pip M1 mac ARROW-12142 - [Python] undefined symbol: _ZN5arrow6StatusC1ENS_10StatusCodeERKNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEE ARROW-12150 - [Python] Bad type inference of mixed-precision Decimals ARROW-12232 - [Rust][Datafusion] Error with CAST: Unsupported SQL type Time ARROW-12240 - [Python] invalid-offsetof warning from apple clang-12 ARROW-12377 - [Doc][Java] Java doc build broken ARROW-12407 - [Python] Deprecation warning when building PyArrow ARROW-12431 - [Python] pa.array mask inverted when type is binary and value to be converted is numpy array ARROW-12472 - [Python] read_table fails when passing a PEP519 filesystem object ARROW-12482 - [Doc][Python] Mention CSVStreamingReader pitfalls with type inference ARROW-12491 - [Packaging] Required dependency on LZ4 &gt;= 1.8 missing from CentOS RPM packages ARROW-12503 - [C++] Ensure using &quot;lib/&quot; for jemalloc&#39;s library directory ARROW-12508 - [R] expect_as_vector implementation causes test failure on R &lt;= 3.3 and variables defined outside of test_that break build when no arrow install ARROW-12543 - [CI][Python] Failing conda-python-3.9 Nightly Build ARROW-12568 - [Python][C++] Segfault when casting a sliced ListArray of int64 in v4.0.0 ARROW-12569 - [R] [CI] Run revdep in CI ARROW-12570 - [JS] Fix issues that blocked the v4.0.0 release ARROW-12579 - [Python] Pyarrow 4.0.0 dependency numpy 1.19.4 throws errors on Apple silicon/M1 compilation ARROW-12589 - [C++] Compiling on windows doesn&#39;t work when -DARROW_WITH_BACKTRACE=OFF ARROW-12601 - [R][Packaging] Fix pkg-config check in r/configure ARROW-12604 - [R][Packaging] Dataset, Parquet off in autobrew and CRAN Mac builds ARROW-12605 - [Documentation] Repair line numbers in dataset.rst ARROW-12606 - [C++] Quantile and Mode functions failing on arrays with offset ARROW-12610 - [C++] Skip TestS3FSGeneric TestDeleteDir and TestDeleteDirContents on windows as they are flaky ARROW-12611 - [CI][Python] Nightly test-conda-python-pandas-0.24 is failing due to numpy compat issue ARROW-12613 - [Python] AttributeError when comparing a Scalar with None ARROW-12614 - [C++][Compute] Revert support for Tables in ExecuteScalarExpression ARROW-12617 - [Python] pyarrow.orc.write_table signature reverses that of pyarrow.parquet.write_table ARROW-12620 - [C++] Dataset writing can only include projected columns if input columns are also included ARROW-12622 - [Python] Segfault when reading CSV inside Flight server ARROW-12630 - [Dev][Integration] conda-integration docker build fails ARROW-12639 - [CI][Archery] Archery build fails to create branch ARROW-12640 - [C++] Fix errors from VS 2019 in cpp/src/parquet/types.h ARROW-12642 - [R] LIBARROW_MINIMAL, LIBARROW_DOWNLOAD, NOT_CRAN env vars should not be case-sensitive ARROW-12644 - [C++][Dataset] Support reading date/time-partitioned datasets accounting for URL encoding (Spark) ARROW-12646 - [C++][CI][Packaging][Python] Bump vcpkg version to its latest release ARROW-12663 - [C++] segfault when arrow header is compiled with nvcc 11.2 ARROW-12668 - [C++][Dataset] CountRows occasionally segfaulting ARROW-12670 - [C++] extract_regex gives bizarre behavior after nulls or non-matches ARROW-12672 - [C++] Segfault casting result of &quot;fill_null()&quot; (not bitmap but unknown null_count) ARROW-12679 - [Java] JDBC adapter does not preserve SQL-nullability ARROW-12684 - [Go][Flight] Fix nil dereference in error case ARROW-12708 - [C++] Valgrind errors when calling negate_checked ARROW-12729 - [R] Fix length method for Table, RecordBatch ARROW-12746 - [Go][Flight] Client Auth handler overwrites outgoing metadata ARROW-12756 - [C++] MSVC build fails with latest gtest from vcpkg ARROW-12757 - [Dev][Archery] Warning about RUST variable in &quot;archery docker run&quot; ARROW-12762 - [Python] ListType doesn&#39;t preserve field name after pickle and unpickle ARROW-12769 - [Python] Negative out of range slices yield invalid arrays ARROW-12771 - [C++] Arrow compute hash_count skips following chunked arrays in streaming execution ARROW-12772 - [CI] Merge script test fails due to missing dependency ARROW-12773 - [Docs] Clarify Java support for ORC and Parquet via JNI bindings ARROW-12774 - [C++][Compute] replace_substring_regex() creates invalid arrays =&gt; crash ARROW-12776 - [Archery][Integration] Fix decimal case generation in write_js_test_json ARROW-12779 - [Python][FlightRPC] Flight server segfaults with certain data ARROW-12780 - [CI][C++] MinGW builds failing when trying to build Gandiva ARROW-12790 - [Python] Cannot read from HDFS with blanks in path names ARROW-12793 - [Python] PYARROW_BUILD_TYPE=Debug does not work correctly ARROW-12797 - [JS] Update readme with new links and remove outdated examples ARROW-12798 - [JS] Use == null Comparison ARROW-12799 - [JS] Use Nullish Coalescing Operator (??) For Defaults ARROW-12804 - [C++] Array methods IsNull and IsValid is confused for NullType ARROW-12807 - [C++] Fix merge conflicts with Future refactor/async IPC ARROW-12838 - [Java][Gandiva] Fix JNI CI test for Gandiva ARROW-12842 - [Java][FlightRPC] Error metadata from FlightStatusException is not propagated to client ARROW-12850 - [R] is.nan() evaluates to null on Arrow null values ARROW-12854 - [Dev][Release] Windows wheel verification script fails to download artifacts ARROW-12857 - [C++] hash_aggregate_test not building on master ARROW-12864 - [C++] Remove needless out argument from arrow::internal::InvertBitmap ARROW-12865 - [C++][Python] Python FlightRPC server cannot find RE2 symbols ARROW-12882 - [C++][Gandiva] Fix behavior of convevrt_replace function for empty replacement char ARROW-12887 - [CI] AppVeyor pip install failure during setup ARROW-12906 - [Python] `fill_null` called with a null value seg faults on non fixed-sized types. ARROW-12907 - [Java] Memory leak possible when exception reading from channel happens ARROW-12911 - [Python] Export scalar aggregate options to pc.sum (sum of zero rows gives null; should give 0) ARROW-12917 - [C++][R][pyarrow] Failure importing some decimal types using the C data interface ARROW-12918 - [C++] Build errors with Visual Studio 16.10.31321.278 ARROW-12919 - [Developer Tools] Crossbow comment bot failing to react to comments ARROW-12935 - [C++][CI] Compiler error on some clang versions ARROW-12941 - [C++] csv reader skip_row does not properly update num_rows_seen ARROW-12942 - [C++][Compute] The result of Arrow compute hash_min_max is incorrect if there are new groups in the subsequent chunks ARROW-12956 - [C++] Fix crash on Parquet file (OSS-Fuzz) ARROW-12969 - [C++] match_substring doesn&#39;t match empty needle to empty haystack ARROW-12974 - [R] test-r-without-arrow build fails because of example requiring Arrow ARROW-12983 - [C++][Python] Converter::Extend gets stuck in infinite loop causing OOM if values don&#39;t fit in single chunk ARROW-12987 - [CI] test-ubuntu-18.04 nightly builds are failing due to Gandiva &quot;TestUpper&quot; test failure ARROW-12988 - [CI] The kartothek nightly integration build is failing (test_update_dataset_from_ddf_empty) ARROW-12988 - [CI] The kartothek nightly integration build is failing (test_update_dataset_from_ddf_empty) ARROW-12989 - [CI] &quot;Dev PR&quot; jobs undully cancelled ARROW-12991 - [CI] Travis ARM builds often crash ARROW-12993 - [Python] Address boundary error with invalid Feather file and stackprinter ARROW-12995 - [C++] CSV reader should validate options ARROW-12998 - [C++] Datasets needs dependency on xsimd ARROW-13001 - [Go] Build failure in parquet/internal/bmi on s390x ARROW-13003 - [C++] unaligned access in compute/exec/ cc files ARROW-13008 - [C++] Deprecation warning when compiling minimal example ARROW-13010 - [C++][Compute] Support outputting to slices from kleene kernels ARROW-13018 - [C++][Docs] Use consistent terminology for nulls (min_count) in scalar aggregate kernels ARROW-13026 - [C++][CI] s390x job setup fails ARROW-13037 - [R] Incorrect param when creating Expression crashes R ARROW-13039 - [R] Fix error message handling ARROW-13041 - [C++] Unary kernels can leave uninitialized data under null entries ARROW-13046 - [Release] JS package failing test prior to publish ARROW-13048 - [C++] S3FileSystem fails moving filepaths containing = or + ARROW-13053 - [Python] Build fails on MacOS Big Sur using homebrewed Arrow libraries ARROW-13069 - [Website] Add Daniël to committer list ARROW-13073 - [Developer] archery benchmark list: unexpected keyword &#39;benchmark_filter&#39; ARROW-13080 - [Release] Generate the API docs in ubuntu 20.10 ARROW-13083 - [Python] Wrong SCM version detection both in setup.py and crossbow ARROW-13085 - [Python] Apache Arrow minimal cpp build segfaults with pyarrow libs ARROW-13090 - [Python] Test failure with ffspec 2021.6.0 ARROW-13104 - [C++] ByteStreamSplit implementation uses invalid pointer cast ARROW-13108 - [Python] Pyarrow 4.0.0 crashes upon import on macOS 10.13.6 ARROW-13116 - [R] Test for RecordBatchReader to C-interface fails on arrow-r-minimal due to missing dependencies ARROW-13125 - [R] Throw error when 2+ args passed to desc() in arrange() ARROW-13128 - [C#] TimestampArray conversion logic for nano and micro is wrong ARROW-13135 - [C++] Fix Status propagation in END_PARQUET_CATCH_EXCEPTIONS ARROW-13139 - [C++] ReadaheadGenerator cannot be safely copied/moved ARROW-13145 - [C++][CI] Flight test crashes on MinGW ARROW-13148 - [Dev][Archery] Crossbow build submission fails ARROW-13153 - [C++] `parquet_dataset` loses ordering of files in `_metadata` ARROW-13154 - [C++] Unions can not have 126 and 127 as type_codes ARROW-13169 - [R] [C++] sorted partition keys can cause issues ARROW-13173 - [C++] TestAsyncUtil.ReadaheadFailed asserts occasionally ARROW-13187 - [c++][python] Possibly memory not deallocated when reading in CSV ARROW-13189 - [R] Disable row-level metadata application on datasets ARROW-13203 - [R] Fix optional component checks causing failures ARROW-13207 - [Python][Doc] Dataset documentation still suggests deprecated scan method as the preferred iterative approach ARROW-13216 - [R] Type checks test fails with rtools35 ARROW-13217 - [C++][Gandiva] Correct convert_replace function for invalid chars on string beginning ARROW-13223 - [C++][CI] Fix thread sanitizer failures ARROW-13225 - [Go][Flight] Implement Custom Middleware Interface and Enable Integration Tests ARROW-13229 - [Python] ascii_trim, ascii_ltrim and ascii_rtrim lack options ARROW-13239 - [Doc][Python] Dataset.head function doesn&#39;t mention required argument ARROW-13243 - [R] altrep function call in R 3.5 ARROW-13246 - [C++] CSV skip_rows_after_names can discard data prematurally ARROW-13249 - [Java][CI] Consistent timeout in the Java JNI build ARROW-13253 - [C++][FlightRPC] Segfault when sending record batch &gt;2GB ARROW-13254 - [Python] Processes killed and semaphore objects leaked when reading pandas data ARROW-13265 - [R] cli valgrind errors in nightlies ARROW-13266 - [JS] Improve benchmark names &amp; add suite name to json ARROW-13281 - [C++][Gandiva] Error on timestampDiffMonth function behavior for negative diff values ARROW-13284 - [C++] Wrong pkg_check_modules() option name ARROW-13288 - [Python] Missing default values of kernel options in PyArrow ARROW-13290 - Compilation fails on clang-12 and gcc-11 due to missing include ARROW-13305 - [C++] Unable to install nightly on Ubuntu 21.04 due to CSV options ARROW-13315 - [R] Wrap r_task_group includes with ARROW_R_WITH_ARROW checking ARROW-13321 - [C++][Python] MakeArrayFromScalar doesn&#39;t work for FixedSizeBinaryType ARROW-13324 - [R] Typo in bindings for utf8_reverse and ascii_reverse ARROW-13332 - [C++] TSAN failure in TestAsyncUtil.ReadaheadFailed ARROW-13341 - [C++] Segfault in arrow-compute-plan-test ExecPlanExecution.SourceScalarAggSink ARROW-13350 - [Python][CI] conda-python-3.7-pandas-0.24 nightly build failing in test_extract_datetime_components ARROW-13352 - [C++] Valgrind failure in case_when kernel ARROW-13353 - [Documentation] Build failing with sphinx.util.cfamily.DefinitionError ARROW-13360 - [C++] Missing dependencies in C++ thirdparty offline dependencies versions.txt ARROW-13363 - [R] is.nan() errors on non-floating point data ARROW-13368 - [C++][Doc] Rename project to make_struct in docs ARROW-13381 - [C++] ArrayFromJSON doesn&#39;t work for float value dictionary type ARROW-13382 - [C++] Aggregation over scalars fails autobrew R job ARROW-13384 - [C++] Specify minimum required zstd version in cmake ARROW-13391 - [C++] CSV streaming reader does not include same error information as table reader ARROW-13417 - [C++] The merged generator can sometimes pull from source sync-reentrant ARROW-13419 - [JS] Fix perf tests ARROW-13428 - [C++][Flight] -lssl is missing with bundled gRPC and system shared OpenSSL ARROW-13431 - [Release] Bump go version to 1.15; don&#39;t verify rust source anymore ARROW-13432 - [Release] Fix ssh connection to the binary uploader container ARROW-13450 - [Python][Packaging] Set deployment target to 10.13 for universal2 wheels New Features and Improvements ARROW-2665 - [Python/C++] Add index() method to find first occurence of Python scalar ARROW-3014 - [C++] Minimal writer adapter for ORC file format ARROW-3316 - [R] Multi-threaded conversion from R data.frame to Arrow table / record batch ARROW-5385 - [Go] implement EXTENSION datatype ARROW-5640 - [Go] implement Map array ARROW-6513 - [CI] The conda environment files arrow/ci/conda_env_*.yml should have .txt extension ARROW-6513 - [CI] The conda environment files arrow/ci/conda_env_*.yml should have .txt extension ARROW-7001 - [C++] Develop threading APIs to accommodate nested parallelism ARROW-7114 - [JS][CI] NodeJS build fails on Github Actions Windows node ARROW-7252 - [Rust] [Parquet] Reading UTF-8/JSON/ENUM field results in a lot of vec allocation ARROW-7396 - [Format] Register media types (MIME types) for Apache Arrow formats to IANA ARROW-8421 - [Rust] [Parquet] Implement parquet writer ARROW-8459 - [Dev][Archery] Use a more recent cmake-format ARROW-8527 - [C++][CSV] Add support for ReadOptions::skip_rows &gt;= block_size ARROW-8655 - [C++][Dataset][Python][R] Preserve partitioning information for a discovered Dataset ARROW-8676 - [Rust] Create implementation of IPC RecordBatch body buffer compression from ARROW-300 ARROW-9054 - [C++] Add ScalarAggregateOptions ARROW-9056 - [C++] Support scalar aggregation over scalars ARROW-9140 - [R] Zero-copy Arrow to R where possible ARROW-9295 - [Archery] Support rust clippy in the lint command ARROW-9299 - [Python] Expose ORC metadata() in Python ORCFile ARROW-9313 - [Rust] Use feature enum ARROW-9421 - [C++][Parquet] Redundancies SchemaManifest::GetFieldIndices ARROW-9430 - [C++/Python] Kernel for SetItem(BooleanArray, values) ARROW-9697 - [C++][Dataset] num_rows method for Dataset/Scanner ARROW-10031 - [Java] Support Java benchmark in Archery ARROW-10115 - [C++] CSV empty quoted string is treated as NULL ARROW-10316 - [Python] Consider using __wrapped__ for compute function introspection ARROW-10391 - [Rust] [Parquet] Nested Arrow reader ARROW-10440 - [C++][Dataset][Python] Add a callback to visit file writers just before Finish() ARROW-10550 - [Rust] [Parquet] Write nested types (struct, list) ARROW-10557 - [C++] Add scalar string slicing/substring extract kernel ARROW-10640 - [C++] An &quot;if_else&quot; kernel to combine two arrays based on a mask ARROW-10658 - [Python][Packaging] Wheel builds for Apple Silicon ARROW-10675 - [C++][Python] Support AWS S3 Web identity credentials ARROW-10797 - [C++] Investigate faster random generation for tests and benchmarks ARROW-10926 - [Rust] Add parquet reader / writer for decimal types ARROW-10959 - [C++] Add scalar string join kernel ARROW-11061 - [Rust] Validate array properties against schema ARROW-11173 - Add Map type as reader / writer in FieldReader / FieldWriter ARROW-11199 - [C++][Python] Fix the unit tests for the ORC reader ARROW-11206 - [C++][Compute][Python] Rename &quot;project&quot; kernel to &quot;make_struct&quot; ARROW-11342 - [Python] [Gandiva] Expose ToString and result type information ARROW-11499 - [Packaging] Remove all use of bintray ARROW-11514 - [R][C++] Bindings for paste(), paste0(), str_c() ARROW-11515 - [R] Bindings for strsplit ARROW-11565 - [C++][Gandiva] Modify upper()/lower() to work with UTF8 and add INIT_CAP function ARROW-11608 - [CI] turbodbc integration tests are failing (build isue) ARROW-11660 - [C++] Move RecordBatch::SelectColumns method from R to C++ library ARROW-11673 - [C++] Casting dictionary type to use different index type ARROW-11675 - [CI][C++] Resolve ctest failures on VS 2019 builds ARROW-11705 - [R] Support scalar value recycling in RecordBatch/Table$create() ARROW-11748 - [C++] Ensure Decimal128 and Decimal256&#39;s fields are in native endian order ARROW-11759 - [C++] Kernel to extract datetime components (year, month, day, etc) from timestamp type ARROW-11769 - [R] Pull groups from grouped_df into RecordBatch or Table ARROW-11772 - [C++] Add asynchronous read to ipc::RecordBatchFileReader ARROW-11782 - [GLib][Ruby][Dataset] Remove bindings for internal classes ARROW-11787 - [R] Implement write csv ARROW-11843 - [C++] Add asynchronous read to parquet::arrow::FileReader ARROW-11849 - [R] Use roxygen @examplesIf tag in R docs ARROW-11889 - [C++] Add parallelism to streaming CSV reader ARROW-11909 - [C++] Get rid of MakeIteratorGenerator ARROW-11926 - [R] Pass on the new UCRT CRAN windows builds ARROW-11926 - [R] Pass on the new UCRT CRAN windows builds ARROW-11928 - [C++][Compute] Add ExecNode hierarchy ARROW-11929 - [C++][Compute] Promote Expression to the compute namespace ARROW-11930 - [C++][Dataset][Compute] Refactor Dataset scans to use an ExecNode graph ARROW-11932 - [C++] Provide ArrayBuilder::AppendScalar ARROW-11950 - [C++][Compute] Add unary negative kernel ARROW-11960 - [C++][Gandiva] Support escape in LIKE ARROW-11980 - [Python] Remove &quot;experimental&quot; status from Table.replace_schema_metadata ARROW-11986 - [C++][Gandiva] Implement IN expressions for doubles and floats ARROW-11990 - [C++][Compute] Use Status/Result return consistently to indicate errors ARROW-12004 - [C++] Resultdetail::Empty\\ is annoying ARROW-12010 - [C++][Compute] Improve performance of the hash table used in GroupIdentifier ARROW-12016 - [C++] Implement array_sort_indices and sort_indices for BOOL type ARROW-12050 - [C++][Python][FlightRPC] Use StopToken to enable interrupting long Flight operations ARROW-12074 - [C++][Compute] Add scalar arithmetic kernels for decimal inputs ARROW-12083 - [R] schema use in open_dataset ARROW-12166 - [C++][Gandiva] Implements CONVERT_TO(value, type) function ARROW-12184 - [R] Bindings for na.fail, na.omit, na.exclude, na.pass ARROW-12185 - [R] Bindings for any, all ARROW-12198 - [R] bindings for strptime ARROW-12199 - [R] bindings for stddev, variance ARROW-12205 - [C++][Gandiva] Implement TO_TIME([number] secs) and TO_TIMESTAMP([number] secs) function ARROW-12231 - [C++][Dataset] Separate datasets backed by readers from InMemoryDataset ARROW-12253 - [Rust] [Ballista] Implement scalable joins ARROW-12255 - [Rust] [Ballista] Integrate scheduler with DataFusion ARROW-12256 - [Rust] [Ballista] Add DataFrame support ARROW-12257 - [Rust] [Ballista] Publish user guide to Arrow site ARROW-12261 - [Rust] [Ballista] Ballista should not have its own DataFrame API ARROW-12291 - [R] Determine the type of an unevaluated expression ARROW-12310 - [Java] ValueVector#getObject should support covariance for complex types ARROW-12355 - [C++] Implement efficient async CSV scanning ARROW-12362 - [Rust] [DataFusion] topk_query test failure ARROW-12364 - [Python] [Dataset] Add metadata_collector option to ds.write_dataset() ARROW-12378 - [C++][Gandiva] Implement castVARBINARY functions ARROW-12386 - [C++] Support file parallelism in AsyncScanner ARROW-12391 - [Rust][DataFusion] Implement date_trunc() function ARROW-12392 - [C++] Restore asynchronous streaming CSV reader ARROW-12393 - [JS] Optimally use closure compiler ARROW-12403 - [Rust] [Ballista] Integration tests should check that query results are correct ARROW-12415 - [CI] [Python] ERROR: Failed building wheel for pygit2 on ARM64 ARROW-12424 - [Go][Parquet] Add Schema Package ARROW-12428 - [Python] pyarrow.parquet.read_* should use pre_buffer=True ARROW-12434 - [Rust] [Ballista] Show executed plans with metrics ARROW-12442 - [CI] Set job timeouts on GitHub Actions ARROW-12443 - [C++][Gandiva] Implement castVARCHAR function for binary input ARROW-12444 - [RUST] [CI] Remove Rust and point integration tests to arrow-rs repo ARROW-12445 - [Rust] Design and implement packaging process to bundle Rust in signed tar ARROW-12468 - [Python][R] Expose UseAsync to python/R ARROW-12478 - [C++] Support LLVM 12 ARROW-12484 - [CI] Change jinja macros to not require CROSSBOW_TOKEN to upload artifacts in Github Actions ARROW-12489 - [Developer] autotune is broken ARROW-12490 - [Dev] Use miniforge for all platforms ARROW-12492 - [Python] Add an helper method to decode a DictionaryArray back to a plain Array ARROW-12496 - [C++][Dataset] Ensure Scanner tests fully cover async ARROW-12499 - [C++][Compute][R] Add ScalarAggregateOptions to Any and All kernels ARROW-12500 - [C++][Dataset] Consolidate similar tests for file formats ARROW-12501 - [CI][Ruby] Remove needless workaround for MinGW build ARROW-12507 - [CI] Remove duplicated cron/nightly builds ARROW-12512 - [C++][Dataset] Implement CSV writing support ARROW-12514 - [Release] Don&#39;t run Gandiva related Ruby test with ARROW_GANDIVA=OFF ARROW-12517 - [Go] Expose App Metadata in Flight client ARROW-12518 - [Python] Expose Parquet statistics has_null_count / has_distinct_count ARROW-12520 - [R] Minor docs updates ARROW-12522 - [C++] Implement asynchronous/&quot;lazy&quot; variants of ReadRangeCache ARROW-12525 - [JS] Vector toJSON returns an array ARROW-12527 - [Dev] Don&#39;t try getting JIRA information for MINOR PR ARROW-12528 - [JS] Support typed arrays in Table.new ARROW-12530 - [C++] Remove Buffer::mutable_data_ member and use const_cast on data_ only if is_mutable_ is true ARROW-12533 - [C++] Random real generator is slow on Arm64 Linux when built with clang ARROW-12534 - [C++][Gandiva] Implement LEFT and RIGHT functions on Gandiva for string input values ARROW-12537 - [JS] Docs build should not include test sources ARROW-12541 - [Docs] Improve styling/readability of tables in the new doc theme ARROW-12551 - [Java][Release] Java post-release tests fail due to missing testing data ARROW-12554 - Allow duplicates in the value_set for compute::is_in ARROW-12555 - [Java][Release] Java post-release script misses dataset JNI bindings ARROW-12556 - [C++][Gandiva] Implement BYTESUBSTRING functions on Gandiva ARROW-12560 - [C++] Investigate utilizing aggressive thread task creation when adding callback to finished future ARROW-12567 - [C++][Gandiva] Implement LPAD and RPAD functions for string input values ARROW-12567 - [C++][Gandiva] Implement LPAD and RPAD functions for string input values ARROW-12571 - [R][CI] Run nightly R with valgrind ARROW-12575 - [R] Use unary negative kernel ARROW-12577 - [Website] Use Artifactory instead of Bintray in all places ARROW-12578 - [JS] Simplify UTF8 handling in NodeJS ARROW-12581 - [C++][FlightRPC] Benchmark compression with real data ARROW-12584 - [C++][Python] Expose method for benchmarking tools to release unused memory from the allocators ARROW-12591 - [Java][Gandiva] Create single Gandiva jar for MacOS and Linux ARROW-12593 - [Packaging][Ubuntu] Add support for Ubuntu 21.04 ARROW-12597 - [C++] Implement OptionalParallelForAsync ARROW-12598 - [C++][Dataset] Implement row-count for CSV or allow selecting 0 columns from CSV ARROW-12599 - [Doc][Python] Documentation missing for pyarrow.Table ARROW-12600 - [CI] Push docker images from crossbow tasks ARROW-12602 - [R] Add BuildInfo from C++ to arrow_info ARROW-12608 - [C++] Add split_pattern_regex function ARROW-12612 - [C++][Compute] Add Expression to type_fwd.h ARROW-12619 - [Python] pyarrow sdist should not require git ARROW-12621 - [C++][Gandiva] Add alias to sha1 and sha256 functions ARROW-12631 - [Python] pyarrow.dataset.write_table should accept a Scanner to write ARROW-12643 - Add documentation for experimental repos ARROW-12645 - [Python] Fix numpydoc validation ARROW-12648 - [C++][FlightRPC] Allow using TLS in benchmark ARROW-12649 - [Python/Packaging] Move conda-aarch64 to Azure with cross-compilation ARROW-12653 - [Archery] allow me to add a comment to crossbow requests ARROW-12658 - [C++] Bump aws-c-common to v0.5.10 ARROW-12660 - [R] Post-4.0 adjustments for CRAN ARROW-12661 - [C++] CSV add skip rows after column names ARROW-12662 - [Website] Force to use squash merge ARROW-12667 - [Python] Ensure test coverage for conversion of strided numpy arrays ARROW-12675 - [C++] CSV should include line/row numbers in parsing error messages ARROW-12677 - [Python] Add a mask argument to pyarrow.StructArray.from_arrays ARROW-12685 - [C++][Compute] Add unary absolute value kernel ARROW-12686 - [C++][Python][FlightRPC] Support export_to_c in DoGet/inherit from RecordBatchReader ARROW-12687 - [C++][Python][Dataset] Support C Data Interface with Scanner ARROW-12689 - [R] Implement ArrowArrayStream C interface ARROW-12692 - [R] Improve tests and comments for strsplit() bindings ARROW-12694 - [R][CI] rtools35 job failing on 32-bit build tests ARROW-12696 - [R] Improve testing of error messages converted to warnings ARROW-12699 - [CI][Packaging][Java] Generate a jar compatible with Linux and MacOS for all Arrow components ARROW-12701 - [Website][Release] Include Rust contributors, committers, and commits in release notes ARROW-12702 - [JS] Upgrade Webpack and terser ARROW-12703 - [JS] Separate Table from DataFrame ARROW-12704 - [JS] use optional chaining ARROW-12709 - [C++] Add variadic string join kernel ARROW-12713 - [C++] String reverse kernel ARROW-12715 - [C++] SQL-style glob string match kernel ARROW-12716 - [C++] Left/right/center string padding kernels ARROW-12717 - [C++] Substring find position kernel ARROW-12719 - [C++][Python] pyarrow.fs.S3FileSystem pass extra kwargs i.e ACL ARROW-12721 - [CI] Fix path for uploading aarch64 conda artifacts from the nightly builds ARROW-12722 - [R] Raise error when attemping to print table with duplicated naming ARROW-12730 - [MATLAB] Update featherreadmex and featherwritemex to build against latest arrow c++ APIs ARROW-12731 - [R] Use InMemoryDataset for Table/RecordBatch in dplyr code ARROW-12736 - [C++] Eliminate unnecessary copy in FieldPath::Get() ARROW-12738 - [CI] [Gandiva] Nightly build error in azure-conda-osx-clang-py38 (and py39, py*-r*) ARROW-12741 - [CI] Configure GitHub Token for Nightly Builds ARROW-12745 - [C++][Compute] Add floor, ceiling, and truncate kernels ARROW-12749 - [C++] Unnecessary copy cause by constructing RecordBatch/Table/Schema from lvalues ARROW-12750 - [CI] [R] Actually pass parameterized docker options to the templates ARROW-12751 - [C++] Add variadic row-wise min/max kernels (least/greatest) ARROW-12758 - [R] Add examples to more function documentation ARROW-12759 - [C++][Compute] Wrap grouped aggregation in an ExecNode ARROW-12760 - [C++][Python][R] S3FileSystem: IO thread parallelism limited to 8 threads ARROW-12761 - [R] Better error handling for write_to_raw ARROW-12764 - [CI] Fix arguments in Conda Windows builds ARROW-12777 - [R] Convert all inputs to Arrow objects in match_arrow and is_in ARROW-12781 - [R] Implement is.type() functions for dplyr ARROW-12785 - [CI] the r-devdocs build errors when brew installing gcc ARROW-12791 - [R] Better error handling for DatasetFactory$Finish() when no format specified ARROW-12796 - [JS] Support JSON output from benchmarks ARROW-12800 - [JS] Drop IE Support and remove text encoder and decoder polyfills ARROW-12801 - [CI][Packaging][Java] Include all modules in script that generate Arrow jars ARROW-12806 - [Python] test_write_to_dataset_filesystem missing a dataset mark ARROW-12808 - [JS] Document browser support ARROW-12810 - [Python] Run tests with AWS_EC2_METADATA_DISABLED=true ARROW-12812 - [Packaging][Java] Improve JNI jars build ARROW-12824 - [R][CI] Upgrade builds for R 4.1 release ARROW-12827 - [C++] [Dataset] Review error pass-through in the datasets API ARROW-12829 - [GLib][Ruby] Add support for Apache Arrow Flight ARROW-12831 - [CI][macOS] Remove needless Homebrew workaround ARROW-12832 - [JS] Write benchmarks in TypeScript ARROW-12833 - [JS] Construct perf data in JS ARROW-12835 - [C++] Implement case insenstive match in match_substring(_regex) and match_like ARROW-12836 - [C++] Installation on IBM i fails because of CxxFlags ARROW-12841 - [R] Add examples to more function documentation - part 2 ARROW-12843 - [C++][Compute] Add is_inf kernel for floating point arrays ARROW-12848 - [Release] Mail template points to 404 ARROW-12851 - [Go][Parquet] Add Encoding Package Part 1 ARROW-12856 - [C++][Gandiva] Implement castBIT and castBOOLEAN functions on Gandiva ARROW-12859 - [C++] Add ScalarFromJSON for easier testing ARROW-12861 - [C++][Compute] Add sign function kernels ARROW-12867 - [R] Bindings for abs() ARROW-12868 - [R] Bindings for find_substring and find_substring_regex ARROW-12869 - [R] Bindings for utf8_reverse and ascii_reverse ARROW-12870 - [R] Bindings for stringr::str_like ARROW-12875 - [JS] Upgrade Jest and other minor updates ARROW-12883 - [R] [CI] version compatibility fails on R 4.1 ARROW-12891 - [C++][Compute][Dataset] Extract subtree pruning logic to compute:: ARROW-12894 - [R] Bump R version ARROW-12895 - [CI] Use &quot;concurrency&quot; setting on Github Actions ARROW-12898 - [Release][C#] Package upload script is broken ARROW-12900 - [Python][Documentation] an np import in Reading Datasets docs ARROW-12901 - [R] Follow on to more examples ARROW-12909 - [R][Release] Build of ubuntu-docs is failing ARROW-12912 - [Website] Use .asf.yaml for publishing ARROW-12915 - [Release] Build of ubuntu-docs is failing on thrift ARROW-12936 - [C++][Gandiva] Implement ASCII Hive function on Gandiva ARROW-12937 - [C++] Allow specifying default metadata for new S3 files ARROW-12939 - [R] Simplify RTask stop handling ARROW-12940 - [R] Expose C interface as R6 methods ARROW-12948 - [C++] Add string slice replace kernel ARROW-12949 - [C++] Add string starts-with/ends-with kernels ARROW-12950 - [C++] Add substring count kernel ARROW-12951 - [C++] Refactor StringTransform ARROW-12952 - [C++] Add regex count kernel ARROW-12955 - [C++] Add additional type support for if_else kernel ARROW-12957 - [R] rchk issues on cran ARROW-12961 - [C++] MSVC issues warning building PyArrow on Windows ARROW-12962 - [GLib][Ruby] Add Arrow:Scalar ARROW-12964 - [R] Add bindings for ifelse() and if_else() ARROW-12966 - [Python] Expose Python binding for ElementWiseAggregateOptions ARROW-12967 - [R] Add bindings for pmin() and pmax() ARROW-12968 - [R] [CI] Add an rchk job to our nightlies ARROW-12972 - [CI] ][C++] archive_write_add_filter_zstd error on CentOS + ARM64 ARROW-12975 - [C++][Python] if_else kernel doesn&#39;t support upcasting ARROW-12982 - [C++] Re-enable unused-variable warning ARROW-12984 - [C++] Passing options parameter of Count/Index aggregation by reference ARROW-12985 - [Python][Packaging] Unable to install pygit2 in the arm64 wheel builds ARROW-12986 - [C++][Gandiva] Implement new cache eviction policy in Gandiva ARROW-12992 - [R] bindings for substr(), substring(), str_sub() ARROW-12994 - [R] Fix tests that assume UTC local tz ARROW-12996 - [C++] CSV stream reader has no progress indication ARROW-13002 - [C++] Add a check for the utf8proc&#39;s version in CMake ARROW-13005 - [C++] Support filter/take for union data type. ARROW-13006 - [C++][Gandiva] Implement BASE64 and UNBASE64 Hive functions on Gandiva ARROW-13009 - [Doc][Dev] Document builds mailing-list ARROW-13022 - [R] bindings for lubridate&#39;s year, isoyear, quarter, month, day, wday, yday, isoweek, hour, minute, and second functions ARROW-13025 - [C++][Compute] Enhance FunctionOptions with equality, debug representability, and serializability ARROW-13027 - [C++] Fix ASAN stack traces in CI ARROW-13030 - [CI][Go] Setup Arm64 golang CI ARROW-13031 - [JS] Support arm in closure compiler on macOS ARROW-13032 - [Java] Update gauva version ARROW-13034 - [Python][Docs] Update outdated examples for hdfs/azure on the Parquet doc page ARROW-13036 - [Doc] Mention recommended file extension(s) for Arrow IPC ARROW-13042 - [C++] Automatic checks that kernels don&#39;t leave uninitialized data in output ARROW-13043 - [GLib][Ruby] Add GArrowEqualOptions ARROW-13044 - [Java] Union vectors should extend ValueVector ARROW-13045 - [Packaging][RPM][deb] Don&#39;t install system utf8proc if it&#39;s old ARROW-13047 - [Website] Add kiszk to committer list ARROW-13049 - [C++][Gandiva] Implement BIN Hive function on Gandiva ARROW-13050 - [C++][Gandiva] Implement SPACE Hive function on Gandiva ARROW-13054 - [C++] Add option to specify the first day of the week for the &quot;day_of_week&quot; temporal kernel ARROW-13064 - [C++] Add a general &quot;if, ifelse, ..., else&quot; kernel (&quot;CASE WHEN&quot;) ARROW-13065 - [Packaging][RPM] Add missing required LZ4 version information ARROW-13068 - [GLib][Dataset] Change prefix to gadataset_ from gad_ ARROW-13070 - [R] bindings for sd and var ARROW-13072 - [C++] Add bitwise arithmetic compute functions ARROW-13074 - [Python] Start with deprecating ParquetDataset custom attributes ARROW-13075 - [Python] Expose C data interface API for pyarrow.Field ARROW-13076 - [Java] Enable ExtensionType to use StructVector and UnionVector for underlying storage ARROW-13082 - [CI] Forward R argument to ubuntu-docs build ARROW-13086 - [Python] Expose Parquet ArrowReaderProperties::coerce_int96_timestamp_unit_ ARROW-13086 - [Python] Expose Parquet ArrowReaderProperties::coerce_int96_timestamp_unit_ ARROW-13091 - [Python] Add compression_level argument to IpcWriteOptions constructor ARROW-13092 - [C++] CreateDir should fail if the target exists and is not a directory ARROW-13095 - [C++] Implement trigonometric compute functions ARROW-13096 - [C++] Implement logarithm compute functions ARROW-13097 - [C++] Provide a simple reflection utility for s ARROW-13098 - [Dev][Archery] Reorganize docker submodule to its own subpackage ARROW-13100 - [MATLAB] Integrate GoogleTest with MATLAB Interface C++ Code ARROW-13101 - [Python][Doc] pyarrow.FixedSizeListArray does not appear in the documentation ARROW-13110 - [C++] Deadlock can happen when using BackgroundGenerator without transferring callbacks ARROW-13113 - [R] use RTasks to manage parallel in converting arrow to R ARROW-13117 - [R] Retain schema in new Expressions ARROW-13119 - [R] Set empty schema in scalar Expressions ARROW-13124 - [Ruby] Add support for memory view ARROW-13127 - [R] Valgrind nightly errors ARROW-13136 - [C++] Add a &quot;coalesce&quot; variadic scalar kernel ARROW-13137 - [C++][Documentation] Make in-table references consistent ARROW-13140 - [C++/Python] Upgrade libthrift pin in the nightlies ARROW-13142 - [Python] Use vector append when converting from list of non-strided numpy arrays ARROW-13147 - [Java] Respect the rounding policy when allocating vector buffers ARROW-13157 - [C++] Add find_substring_regex kernel and implement ignore_case for find_substring ARROW-13158 - [Python] Fix repr and contains of StructScalar with duplicate field names ARROW-13162 - [C++][Gandiva] Add new alias for extract date functions in Gandiva registry ARROW-13171 - [R] Add binding for str_pad() ARROW-13190 - [C++] [Gandiva] Change behavior of INITCAP function ARROW-13194 - [Java][Document] Create prose document about Java algorithms ARROW-13195 - [R] Problem with rlang reverse dependency checks ARROW-13199 - [R] add ubuntu 21.04 to nightly builds ARROW-13200 - [R] Add binding for case_when() ARROW-13201 - [R] Add binding for coalesce() ARROW-13210 - [Python][CI] Fix vcpkg caching mechanism for the macOS wheels ARROW-13211 - [C++][CI] Remove outdated Github Actions ARM builds ARROW-13212 - [Release] Support deploying to test PyPI in the python post release script ARROW-13215 - [R] [CI] Add ENV TZ to docker files ARROW-13218 - [Doc] Document/clarify conventions for timestamp storage ARROW-13219 - [C++][GLib] Demote/deprecate CompareOptions ARROW-13224 - [Python][Doc] Documentation missing for pyarrow.dataset.write_dataset ARROW-13226 - [Python] Add a general purpose cython trampolining utility ARROW-13228 - [C++] S3 CreateBucket fails because AWS treats us-east-1 differently than other regions ARROW-13230 - Add CSV Writer documentation ARROW-13234 - [C++] Add string padding option to determine which side the extra space goes on ARROW-13235 - [C++] Make type_name equal to options class name for all FunctionOptionTypes ARROW-13236 - [Python] Improve repr of pyarrow.compute.FunctionOptions ARROW-13238 - [C++][Dataset][Compute] Substitute ExecPlan impl for dataset scans ARROW-13242 - [C++] Improve decimal random generation ARROW-13244 - [C++] Add facility to get current thread id ARROW-13258 - [Python] Improve the repr of ParquetFileFragment ARROW-13262 - [R] transmute() fails after pulling data into R ARROW-13273 - [C++] Don&#39;t use .pc only in CMake paths for Requires.private ARROW-13274 - [JS] Remove Webpack ARROW-13275 - [JS] Fix perf tests ARROW-13276 - [GLib][Ruby][Flight] Add support for ListFlights ARROW-13277 - [JS] Add declaration maps ARROW-13280 - [R] Bindings for log and trig functions ARROW-13282 - [C++] Remove obsolete generated files ARROW-13283 - [Developer Tools] Support passing through memory limits in archery docker run ARROW-13286 - [CI] Require docker-compose 1.27.0 or later ARROW-13289 - [C++] Log functions don&#39;t have int kernels ARROW-13291 - [GLib][CI] Require gobject-introspection 3.4.5 or later ARROW-13296 - [C++] Provide reflection-compatible enum replacement ARROW-13299 - [JS] Upgrade ix and rxjs ARROW-13303 - [JS] Revise bundles ARROW-13306 - [Java][JDBC] use ResultSetMetaData.getColumnLabel instead of ResultSetMetaData.getColumnName ARROW-13313 - [C++][Compute] Add ScalarAggregateNode ARROW-13320 - [Website] Add MIME types to FAQ ARROW-13323 - [Archery] Validate docker compose configuration ARROW-13343 - [R] Update NEWS.md for 5.0 ARROW-13346 - [C++] Remove compile time parsing from EnumType ARROW-13355 - [R] ensure that sf is installed in our revdep job ARROW-13357 - [R] bindings for sign() ARROW-13365 - [R] bindings for floor/ceiling/truncate ARROW-13385 - [C++][Compute] Document out-of-source addition to the FunctionRegistry ARROW-13386 - [R][C++] CSV streaming changes break Rtools 35 32-bit build ARROW-13418 - [R] typo in python.r ARROW-13461 - [Python][Packaging] Build M1 wheels for python 3.8 PARQUET-1798 - [C++] Review logic around automatic assignment of field_id&#39;s PARQUET-1998 - [C++] Implement LZ4_RAW compression PARQUET-2056 - [C++] Add ability for retrieving dictionary and indices separately for ColumnReader","headline":"Apache Arrow 5.0.0 Release","image":"https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png","mainEntityOfPage":{"@type":"WebPage","@id":"https://arrow.apache.org/release/5.0.0.html"},"publisher":{"@type":"Organization","logo":{"@type":"ImageObject","url":"https://arrow.apache.org/img/logo.png"}},"url":"https://arrow.apache.org/release/5.0.0.html"}</script>
<!-- End Jekyll SEO tag -->
<!-- favicons -->
<link rel="icon" type="image/png" sizes="16x16" href="/img/favicon-16x16.png" id="light1">
<link rel="icon" type="image/png" sizes="32x32" href="/img/favicon-32x32.png" id="light2">
<link rel="apple-touch-icon" type="image/png" sizes="180x180" href="/img/apple-touch-icon.png" id="light3">
<link rel="apple-touch-icon" type="image/png" sizes="120x120" href="/img/apple-touch-icon-120x120.png" id="light4">
<link rel="apple-touch-icon" type="image/png" sizes="76x76" href="/img/apple-touch-icon-76x76.png" id="light5">
<link rel="apple-touch-icon" type="image/png" sizes="60x60" href="/img/apple-touch-icon-60x60.png" id="light6">
<!-- dark mode favicons -->
<link rel="icon" type="image/png" sizes="16x16" href="/img/favicon-16x16-dark.png" id="dark1">
<link rel="icon" type="image/png" sizes="32x32" href="/img/favicon-32x32-dark.png" id="dark2">
<link rel="apple-touch-icon" type="image/png" sizes="180x180" href="/img/apple-touch-icon-dark.png" id="dark3">
<link rel="apple-touch-icon" type="image/png" sizes="120x120" href="/img/apple-touch-icon-120x120-dark.png" id="dark4">
<link rel="apple-touch-icon" type="image/png" sizes="76x76" href="/img/apple-touch-icon-76x76-dark.png" id="dark5">
<link rel="apple-touch-icon" type="image/png" sizes="60x60" href="/img/apple-touch-icon-60x60-dark.png" id="dark6">
<script>
// Switch to the dark-mode favicons if prefers-color-scheme: dark
function onUpdate() {
light1 = document.querySelector('link#light1');
light2 = document.querySelector('link#light2');
light3 = document.querySelector('link#light3');
light4 = document.querySelector('link#light4');
light5 = document.querySelector('link#light5');
light6 = document.querySelector('link#light6');
dark1 = document.querySelector('link#dark1');
dark2 = document.querySelector('link#dark2');
dark3 = document.querySelector('link#dark3');
dark4 = document.querySelector('link#dark4');
dark5 = document.querySelector('link#dark5');
dark6 = document.querySelector('link#dark6');
if (matcher.matches) {
light1.remove();
light2.remove();
light3.remove();
light4.remove();
light5.remove();
light6.remove();
document.head.append(dark1);
document.head.append(dark2);
document.head.append(dark3);
document.head.append(dark4);
document.head.append(dark5);
document.head.append(dark6);
} else {
dark1.remove();
dark2.remove();
dark3.remove();
dark4.remove();
dark5.remove();
dark6.remove();
document.head.append(light1);
document.head.append(light2);
document.head.append(light3);
document.head.append(light4);
document.head.append(light5);
document.head.append(light6);
}
}
matcher = window.matchMedia('(prefers-color-scheme: dark)');
matcher.addListener(onUpdate);
onUpdate();
</script>
<link href="/css/main.css" rel="stylesheet">
<link href="/css/syntax.css" rel="stylesheet">
<script src="/javascript/main.js"></script>
<!-- Matomo -->
<script>
var _paq = window._paq = window._paq || [];
/* tracker methods like "setCustomDimension" should be called before "trackPageView" */
/* We explicitly disable cookie tracking to avoid privacy issues */
_paq.push(['disableCookies']);
_paq.push(['trackPageView']);
_paq.push(['enableLinkTracking']);
(function() {
var u="https://analytics.apache.org/";
_paq.push(['setTrackerUrl', u+'matomo.php']);
_paq.push(['setSiteId', '20']);
var d=document, g=d.createElement('script'), s=d.getElementsByTagName('script')[0];
g.async=true; g.src=u+'matomo.js'; s.parentNode.insertBefore(g,s);
})();
</script>
<!-- End Matomo Code -->
<link type="application/atom+xml" rel="alternate" href="https://arrow.apache.org/feed.xml" title="Apache Arrow" />
</head>
<body class="wrap">
<header>
<nav class="navbar navbar-expand-md navbar-dark bg-dark">
<a class="navbar-brand no-padding" href="/"><img src="/img/arrow-inverse-300px.png" height="40px"></a>
<button class="navbar-toggler ml-auto" type="button" data-toggle="collapse" data-target="#arrow-navbar" aria-controls="arrow-navbar" aria-expanded="false" aria-label="Toggle navigation">
<span class="navbar-toggler-icon"></span>
</button>
<!-- Collect the nav links, forms, and other content for toggling -->
<div class="collapse navbar-collapse justify-content-end" id="arrow-navbar">
<ul class="nav navbar-nav">
<li class="nav-item"><a class="nav-link" href="/overview/" role="button" aria-haspopup="true" aria-expanded="false">Overview</a></li>
<li class="nav-item"><a class="nav-link" href="/faq/" role="button" aria-haspopup="true" aria-expanded="false">FAQ</a></li>
<li class="nav-item"><a class="nav-link" href="/blog" role="button" aria-haspopup="true" aria-expanded="false">Blog</a></li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#" id="navbarDropdownGetArrow" role="button" data-toggle="dropdown" aria-haspopup="true" aria-expanded="false">
Get Arrow
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownGetArrow">
<a class="dropdown-item" href="/install/">Install</a>
<a class="dropdown-item" href="/release/">Releases</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#" id="navbarDropdownDocumentation" role="button" data-toggle="dropdown" aria-haspopup="true" aria-expanded="false">
Docs
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownDocumentation">
<a class="dropdown-item" href="/docs">Project Docs</a>
<a class="dropdown-item" href="/docs/format/Columnar.html">Format</a>
<hr>
<a class="dropdown-item" href="/docs/c_glib">C GLib</a>
<a class="dropdown-item" href="/docs/cpp">C++</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/csharp/README.md" target="_blank" rel="noopener">C#</a>
<a class="dropdown-item" href="https://godoc.org/github.com/apache/arrow/go/arrow" target="_blank" rel="noopener">Go</a>
<a class="dropdown-item" href="/docs/java">Java</a>
<a class="dropdown-item" href="/docs/js">JavaScript</a>
<a class="dropdown-item" href="/julia/">Julia</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/matlab/README.md" target="_blank" rel="noopener">MATLAB</a>
<a class="dropdown-item" href="/docs/python">Python</a>
<a class="dropdown-item" href="/docs/r">R</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/ruby/README.md" target="_blank" rel="noopener">Ruby</a>
<a class="dropdown-item" href="https://docs.rs/arrow/latest" target="_blank" rel="noopener">Rust</a>
<a class="dropdown-item" href="/swift">Swift</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#" id="navbarDropdownSource" role="button" data-toggle="dropdown" aria-haspopup="true" aria-expanded="false">
Source
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownSource">
<a class="dropdown-item" href="https://github.com/apache/arrow" target="_blank" rel="noopener">Main Repo</a>
<hr>
<a class="dropdown-item" href="https://github.com/apache/arrow/tree/main/c_glib" target="_blank" rel="noopener">C GLib</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/tree/main/cpp" target="_blank" rel="noopener">C++</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/tree/main/csharp" target="_blank" rel="noopener">C#</a>
<a class="dropdown-item" href="https://github.com/apache/arrow-go" target="_blank" rel="noopener">Go</a>
<a class="dropdown-item" href="https://github.com/apache/arrow-java" target="_blank" rel="noopener">Java</a>
<a class="dropdown-item" href="https://github.com/apache/arrow-js" target="_blank" rel="noopener">JavaScript</a>
<a class="dropdown-item" href="https://github.com/apache/arrow-julia" target="_blank" rel="noopener">Julia</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/tree/main/matlab" target="_blank" rel="noopener">MATLAB</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/tree/main/python" target="_blank" rel="noopener">Python</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/tree/main/r" target="_blank" rel="noopener">R</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/tree/main/ruby" target="_blank" rel="noopener">Ruby</a>
<a class="dropdown-item" href="https://github.com/apache/arrow-rs" target="_blank" rel="noopener">Rust</a>
<a class="dropdown-item" href="https://github.com/apache/arrow-swift" target="_blank" rel="noopener">Swift</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#" id="navbarDropdownSubprojects" role="button" data-toggle="dropdown" aria-haspopup="true" aria-expanded="false">
Subprojects
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownSubprojects">
<a class="dropdown-item" href="/adbc">ADBC</a>
<a class="dropdown-item" href="/docs/format/Flight.html">Arrow Flight</a>
<a class="dropdown-item" href="/docs/format/FlightSql.html">Arrow Flight SQL</a>
<a class="dropdown-item" href="https://datafusion.apache.org" target="_blank" rel="noopener">DataFusion</a>
<a class="dropdown-item" href="/nanoarrow">nanoarrow</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#" id="navbarDropdownCommunity" role="button" data-toggle="dropdown" aria-haspopup="true" aria-expanded="false">
Community
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownCommunity">
<a class="dropdown-item" href="/community/">Communication</a>
<a class="dropdown-item" href="/docs/developers/index.html">Contributing</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/issues" target="_blank" rel="noopener">Issue Tracker</a>
<a class="dropdown-item" href="/committers/">Governance</a>
<a class="dropdown-item" href="/use_cases/">Use Cases</a>
<a class="dropdown-item" href="/powered_by/">Powered By</a>
<a class="dropdown-item" href="/visual_identity/">Visual Identity</a>
<a class="dropdown-item" href="/security/">Security</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/policies/conduct.html" target="_blank" rel="noopener">Code of Conduct</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#" id="navbarDropdownASF" role="button" data-toggle="dropdown" aria-haspopup="true" aria-expanded="false">
ASF Links
</a>
<div class="dropdown-menu dropdown-menu-right" aria-labelledby="navbarDropdownASF">
<a class="dropdown-item" href="https://www.apache.org/" target="_blank" rel="noopener">ASF Website</a>
<a class="dropdown-item" href="https://www.apache.org/licenses/" target="_blank" rel="noopener">License</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/sponsorship.html" target="_blank" rel="noopener">Donate</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/thanks.html" target="_blank" rel="noopener">Thanks</a>
<a class="dropdown-item" href="https://www.apache.org/security/" target="_blank" rel="noopener">Security</a>
</div>
</li>
</ul>
</div>
<!-- /.navbar-collapse -->
</nav>
</header>
<div class="container p-4 pt-5">
<main role="main" class="pb-5">
<!--
-->
<h1>Apache Arrow 5.0.0 (29 July 2021)</h1>
<p>This is a major release covering more than 3 months of development.</p>
<h2>Download</h2>
<ul>
<li><a href="https://www.apache.org/dyn/closer.lua/arrow/arrow-5.0.0/" target="_blank" rel="noopener"><strong>Source Artifacts</strong></a></li>
<li>
<strong>Binary Artifacts</strong>
<ul>
<li><a href="https://apache.jfrog.io/artifactory/arrow/centos/" target="_blank" rel="noopener">For CentOS</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/debian/" target="_blank" rel="noopener">For Debian</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/python/5.0.0/" target="_blank" rel="noopener">For Python</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/ubuntu/" target="_blank" rel="noopener">For Ubuntu</a></li>
</ul>
</li>
<li><a href="https://github.com/apache/arrow/releases/tag/apache-arrow-5.0.0" target="_blank" rel="noopener">Git tag</a></li>
</ul>
<h2>Contributors</h2>
<p>This release includes 684 commits from 99 distinct contributors in 2 Arrow repositories.</p>
<div class="language-console highlighter-rouge"><div class="highlight"><pre class="highlight"><code data-lang="console"><span class="go">77 David Li
43 Krisztián Szűcs
42 Antoine Pitrou
35 Dominik Moritz
33 Nic Crane
27 Weston Pace
27 Sutou Kouhei
27 Andrew Lamb
23 Jonathan Keane
21 Joris Van den Bossche
17 Jorge C. Leitao
17 Ian Cook
16 João Pedro
15 Jiayu Liu
14 Yibo Cai
14 Alessandro Molina
13 Jorge Leitao
12 Ritchie Vink
12 Benjamin Kietzman
10 niranda perera
10 Wakahisa
10 Matthew Topol
9 Anthony Louis
8 Raphael Taylor-Davies
8 Nate Clark
7 Eduardo Ponce
6 Daniël Heres
5 Neal Richardson
5 Navin
5 Jörn Horstmann
5 Andy Grove
4 crystrix
4 Uwe L. Korn
4 Romain Francois
4 Rok
4 Marco Neumann
4 Ádám Lippai
3 liyafan82
3 frank400
3 Ying Zhou
3 Roee Shlomo
3 Michael Edwards
3 Matthijs Brobbel
3 Alenka Frim
2 Zachary Blackwood
2 Wes McKinney
2 Steven
2 P42
2 Michal Nowakiewicz
2 Kazuaki Ishizaki
2 Karik Isichei
2 Jeroen Ooms
2 Gary Pennington
2 Diana Clarke
2 Bryan Cutler
1 sjgupta2
1 sgilmore
1 ptaylor
1 nullptr
1 kazuhiko kikuchi
1 hulunbier
1 baishen
1 Zimo Zhang
1 Yordan Pavlov
1 Will Jones
1 Steven Burns
1 Stephen
1 Shuai Zhang
1 Ray Bell
1 Projjal Chanda
1 Pachamaltese
1 Nick Bruno
1 Micah Kornfield
1 Menno Siekerman
1 Max Meldrum
1 Mauricio Vargas
1 Marc van Heerden
1 Manish Gill
1 Maarten A. Breddels
1 Lorenz Walthert
1 Linan Qiu
1 Laurent Mazare
1 Kornelijus Survila
1 Kevin Gurney
1 Joris Peeters
1 Jinpeng Zhou
1 Jiangtao Peng
1 Hideaki Hayashi
1 Giordon Stark
1 Gang Liao
1 GALI PREM SAGAR
1 Edd Robinson
1 Dorian Kind
1 Dmitry Patsura
1 Chojan Shang
1 Boaz
1 Ben Chambers
1 Alex Baden
1 Adam Johnson
</span></code></pre></div></div>
<h2>Patch Committers</h2>
<p>The following Apache committers merged contributed patches to Arrow repositories.</p>
<div class="language-console highlighter-rouge"><div class="highlight"><pre class="highlight"><code data-lang="console"><span class="go">96 Antoine Pitrou
69 Sutou Kouhei
63 David Li
59 Krisztián Szűcs
36 Jonathan Keane
33 Benjamin Kietzman
33 Neal Richardson
27 Andrew Lamb
22 Ian Cook
22 Praveen
21 Jorge C. Leitao
19 Yibo Cai
16 Dominik Moritz
16 Joris Van den Bossche
15 Jiayu Liu
12 GitHub
12 Ritchie Vink
11 Micah Kornfield
10 Wakahisa
8 Jorge Leitao
8 Raphael Taylor-Davies
6 Daniël Heres
5 Andy Grove
5 Jörn Horstmann
5 liyafan82
5 Navin
5 Weston Pace
4 Ádám Lippai
4 Marco Neumann
3 Brian Hulette
3 Michael Edwards
3 Roee Shlomo
2 Eric Erhardt
2 Gary Pennington
2 Steven
2 Uwe L. Korn
2 Wes McKinney
1 baishen
1 Ben Chambers
1 Boaz
1 Bryan Cutler
1 Chojan Shang
1 Dmitry Patsura
1 Edd Robinson
1 Gang Liao
1 hulunbier
1 ishizaki
1 Kazuaki Ishizaki
1 kazuhiko kikuchi
1 Kornelijus Survila
1 Laurent Mazare
1 Manish Gill
1 Marc van Heerden
1 Max Meldrum
1 Yordan Pavlov
</span></code></pre></div></div>
<h2>Changelog</h2>
<p>The following changelog is for the <code>apache/arrow</code> repository. For the Rust
implementation of Apache Arrow, see the <a href="https://github.com/apache/arrow-rs/blob/5.0.0/CHANGELOG.md" target="_blank" rel="noopener"><code>apache/arrow-rs</code> changelog</a>.</p>
<h2>Apache Arrow 5.0.0 (2021-07-28)</h2>
<h3>Bug Fixes</h3>
<ul>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-6189" target="_blank" rel="noopener">ARROW-6189</a> - [Rust] [Parquet] Plain encoded boolean column chunks limited to 2048 values</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-6312" target="_blank" rel="noopener">ARROW-6312</a> - [C++] Declare required Libs.private in arrow.pc package config</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-7948" target="_blank" rel="noopener">ARROW-7948</a> - [Go][Integration] Decimal integration failures</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-9594" target="_blank" rel="noopener">ARROW-9594</a> - [Python] DictionaryArray.to_numpy does not correctly convert null indexes to null values</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-10910" target="_blank" rel="noopener">ARROW-10910</a> - [Python] Segmentation Fault when None given to read_table with legacy dataset</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-10958" target="_blank" rel="noopener">ARROW-10958</a> - [GLib] "Nested data conversions not implemented" through glib, but not through pyarrow</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11077" target="_blank" rel="noopener">ARROW-11077</a> - [Rust] ParquetFileArrowReader panicks when trying to read nested list</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11146" target="_blank" rel="noopener">ARROW-11146</a> - [CI][Python] Failing conda-python-3.8-jpype Nightly Build</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11161" target="_blank" rel="noopener">ARROW-11161</a> - [Python][C++] S3Filesystem: file Content-Type not set correctly?</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11633" target="_blank" rel="noopener">ARROW-11633</a> - [CI] [Documentation] Maven default skin not found</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11780" target="_blank" rel="noopener">ARROW-11780</a> - [C++][Python] StructArray.from_arrays() crashes Python interpreter</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11908" target="_blank" rel="noopener">ARROW-11908</a> - [Rust] Intermittent Flight integration test failures</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12007" target="_blank" rel="noopener">ARROW-12007</a> - [C++] Loading parquet file returns "Invalid UTF8 payload" error</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12055" target="_blank" rel="noopener">ARROW-12055</a> - [R] is.na() evaluates to FALSE on Arrow NaN values</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12096" target="_blank" rel="noopener">ARROW-12096</a> - [Python][C++] Pyarrow Parquet reader overflows INT96 timestamps when converting to Arrow Array (timestamp[ns])</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12122" target="_blank" rel="noopener">ARROW-12122</a> - [Python] Cannot install via pip M1 mac</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12142" target="_blank" rel="noopener">ARROW-12142</a> - [Python] undefined symbol: _ZN5arrow6StatusC1ENS_10StatusCodeERKNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEE</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12150" target="_blank" rel="noopener">ARROW-12150</a> - [Python] Bad type inference of mixed-precision Decimals</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12232" target="_blank" rel="noopener">ARROW-12232</a> - [Rust][Datafusion] Error with CAST: Unsupported SQL type Time</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12240" target="_blank" rel="noopener">ARROW-12240</a> - [Python] invalid-offsetof warning from apple clang-12</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12377" target="_blank" rel="noopener">ARROW-12377</a> - [Doc][Java] Java doc build broken</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12407" target="_blank" rel="noopener">ARROW-12407</a> - [Python] Deprecation warning when building PyArrow</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12431" target="_blank" rel="noopener">ARROW-12431</a> - [Python] pa.array mask inverted when type is binary and value to be converted is numpy array</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12472" target="_blank" rel="noopener">ARROW-12472</a> - [Python] read_table fails when passing a PEP519 filesystem object</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12482" target="_blank" rel="noopener">ARROW-12482</a> - [Doc][Python] Mention CSVStreamingReader pitfalls with type inference</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12491" target="_blank" rel="noopener">ARROW-12491</a> - [Packaging] Required dependency on LZ4 &gt;= 1.8 missing from CentOS RPM packages</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12503" target="_blank" rel="noopener">ARROW-12503</a> - [C++] Ensure using "lib/" for jemalloc's library directory</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12508" target="_blank" rel="noopener">ARROW-12508</a> - [R] expect_as_vector implementation causes test failure on R &lt;= 3.3 and variables defined outside of test_that break build when no arrow install</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12543" target="_blank" rel="noopener">ARROW-12543</a> - [CI][Python] Failing conda-python-3.9 Nightly Build</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12568" target="_blank" rel="noopener">ARROW-12568</a> - [Python][C++] Segfault when casting a sliced ListArray of int64 in v4.0.0</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12569" target="_blank" rel="noopener">ARROW-12569</a> - [R] [CI] Run revdep in CI</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12570" target="_blank" rel="noopener">ARROW-12570</a> - [JS] Fix issues that blocked the v4.0.0 release</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12579" target="_blank" rel="noopener">ARROW-12579</a> - [Python] Pyarrow 4.0.0 dependency numpy 1.19.4 throws errors on Apple silicon/M1 compilation</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12589" target="_blank" rel="noopener">ARROW-12589</a> - [C++] Compiling on windows doesn't work when -DARROW_WITH_BACKTRACE=OFF</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12601" target="_blank" rel="noopener">ARROW-12601</a> - [R][Packaging] Fix pkg-config check in r/configure</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12604" target="_blank" rel="noopener">ARROW-12604</a> - [R][Packaging] Dataset, Parquet off in autobrew and CRAN Mac builds</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12605" target="_blank" rel="noopener">ARROW-12605</a> - [Documentation] Repair line numbers in dataset.rst</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12606" target="_blank" rel="noopener">ARROW-12606</a> - [C++] Quantile and Mode functions failing on arrays with offset</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12610" target="_blank" rel="noopener">ARROW-12610</a> - [C++] Skip TestS3FSGeneric TestDeleteDir and TestDeleteDirContents on windows as they are flaky</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12611" target="_blank" rel="noopener">ARROW-12611</a> - [CI][Python] Nightly test-conda-python-pandas-0.24 is failing due to numpy compat issue</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12613" target="_blank" rel="noopener">ARROW-12613</a> - [Python] AttributeError when comparing a Scalar with None</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12614" target="_blank" rel="noopener">ARROW-12614</a> - [C++][Compute] Revert support for Tables in ExecuteScalarExpression</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12617" target="_blank" rel="noopener">ARROW-12617</a> - [Python] pyarrow.orc.write_table signature reverses that of pyarrow.parquet.write_table</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12620" target="_blank" rel="noopener">ARROW-12620</a> - [C++] Dataset writing can only include projected columns if input columns are also included</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12622" target="_blank" rel="noopener">ARROW-12622</a> - [Python] Segfault when reading CSV inside Flight server</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12630" target="_blank" rel="noopener">ARROW-12630</a> - [Dev][Integration] conda-integration docker build fails</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12639" target="_blank" rel="noopener">ARROW-12639</a> - [CI][Archery] Archery build fails to create branch</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12640" target="_blank" rel="noopener">ARROW-12640</a> - [C++] Fix errors from VS 2019 in cpp/src/parquet/types.h</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12642" target="_blank" rel="noopener">ARROW-12642</a> - [R] LIBARROW_MINIMAL, LIBARROW_DOWNLOAD, NOT_CRAN env vars should not be case-sensitive</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12644" target="_blank" rel="noopener">ARROW-12644</a> - [C++][Dataset] Support reading date/time-partitioned datasets accounting for URL encoding (Spark)</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12646" target="_blank" rel="noopener">ARROW-12646</a> - [C++][CI][Packaging][Python] Bump vcpkg version to its latest release</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12663" target="_blank" rel="noopener">ARROW-12663</a> - [C++] segfault when arrow header is compiled with nvcc 11.2</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12668" target="_blank" rel="noopener">ARROW-12668</a> - [C++][Dataset] CountRows occasionally segfaulting</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12670" target="_blank" rel="noopener">ARROW-12670</a> - [C++] extract_regex gives bizarre behavior after nulls or non-matches</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12672" target="_blank" rel="noopener">ARROW-12672</a> - [C++] Segfault casting result of "fill_null()" (not bitmap but unknown null_count)</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12679" target="_blank" rel="noopener">ARROW-12679</a> - [Java] JDBC adapter does not preserve SQL-nullability</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12684" target="_blank" rel="noopener">ARROW-12684</a> - [Go][Flight] Fix nil dereference in error case</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12708" target="_blank" rel="noopener">ARROW-12708</a> - [C++] Valgrind errors when calling negate_checked</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12729" target="_blank" rel="noopener">ARROW-12729</a> - [R] Fix length method for Table, RecordBatch</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12746" target="_blank" rel="noopener">ARROW-12746</a> - [Go][Flight] Client Auth handler overwrites outgoing metadata</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12756" target="_blank" rel="noopener">ARROW-12756</a> - [C++] MSVC build fails with latest gtest from vcpkg</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12757" target="_blank" rel="noopener">ARROW-12757</a> - [Dev][Archery] Warning about RUST variable in "archery docker run"</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12762" target="_blank" rel="noopener">ARROW-12762</a> - [Python] ListType doesn't preserve field name after pickle and unpickle</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12769" target="_blank" rel="noopener">ARROW-12769</a> - [Python] Negative out of range slices yield invalid arrays</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12771" target="_blank" rel="noopener">ARROW-12771</a> - [C++] Arrow compute hash_count skips following chunked arrays in streaming execution</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12772" target="_blank" rel="noopener">ARROW-12772</a> - [CI] Merge script test fails due to missing dependency</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12773" target="_blank" rel="noopener">ARROW-12773</a> - [Docs] Clarify Java support for ORC and Parquet via JNI bindings</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12774" target="_blank" rel="noopener">ARROW-12774</a> - [C++][Compute] replace_substring_regex() creates invalid arrays =&gt; crash</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12776" target="_blank" rel="noopener">ARROW-12776</a> - [Archery][Integration] Fix decimal case generation in write_js_test_json</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12779" target="_blank" rel="noopener">ARROW-12779</a> - [Python][FlightRPC] Flight server segfaults with certain data</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12780" target="_blank" rel="noopener">ARROW-12780</a> - [CI][C++] MinGW builds failing when trying to build Gandiva</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12790" target="_blank" rel="noopener">ARROW-12790</a> - [Python] Cannot read from HDFS with blanks in path names</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12793" target="_blank" rel="noopener">ARROW-12793</a> - [Python] PYARROW_BUILD_TYPE=Debug does not work correctly</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12797" target="_blank" rel="noopener">ARROW-12797</a> - [JS] Update readme with new links and remove outdated examples</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12798" target="_blank" rel="noopener">ARROW-12798</a> - [JS] Use == null Comparison</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12799" target="_blank" rel="noopener">ARROW-12799</a> - [JS] Use Nullish Coalescing Operator (??) For Defaults</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12804" target="_blank" rel="noopener">ARROW-12804</a> - [C++] Array methods IsNull and IsValid is confused for NullType</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12807" target="_blank" rel="noopener">ARROW-12807</a> - [C++] Fix merge conflicts with Future refactor/async IPC</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12838" target="_blank" rel="noopener">ARROW-12838</a> - [Java][Gandiva] Fix JNI CI test for Gandiva</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12842" target="_blank" rel="noopener">ARROW-12842</a> - [Java][FlightRPC] Error metadata from FlightStatusException is not propagated to client</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12850" target="_blank" rel="noopener">ARROW-12850</a> - [R] is.nan() evaluates to null on Arrow null values</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12854" target="_blank" rel="noopener">ARROW-12854</a> - [Dev][Release] Windows wheel verification script fails to download artifacts</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12857" target="_blank" rel="noopener">ARROW-12857</a> - [C++] hash_aggregate_test not building on master</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12864" target="_blank" rel="noopener">ARROW-12864</a> - [C++] Remove needless out argument from arrow::internal::InvertBitmap</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12865" target="_blank" rel="noopener">ARROW-12865</a> - [C++][Python] Python FlightRPC server cannot find RE2 symbols</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12882" target="_blank" rel="noopener">ARROW-12882</a> - [C++][Gandiva] Fix behavior of convevrt_replace function for empty replacement char</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12887" target="_blank" rel="noopener">ARROW-12887</a> - [CI] AppVeyor pip install failure during setup</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12906" target="_blank" rel="noopener">ARROW-12906</a> - [Python] `fill_null` called with a null value seg faults on non fixed-sized types.</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12907" target="_blank" rel="noopener">ARROW-12907</a> - [Java] Memory leak possible when exception reading from channel happens</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12911" target="_blank" rel="noopener">ARROW-12911</a> - [Python] Export scalar aggregate options to pc.sum (sum of zero rows gives null; should give 0)</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12917" target="_blank" rel="noopener">ARROW-12917</a> - [C++][R][pyarrow] Failure importing some decimal types using the C data interface</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12918" target="_blank" rel="noopener">ARROW-12918</a> - [C++] Build errors with Visual Studio 16.10.31321.278</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12919" target="_blank" rel="noopener">ARROW-12919</a> - [Developer Tools] Crossbow comment bot failing to react to comments</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12935" target="_blank" rel="noopener">ARROW-12935</a> - [C++][CI] Compiler error on some clang versions</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12941" target="_blank" rel="noopener">ARROW-12941</a> - [C++] csv reader skip_row does not properly update num_rows_seen</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12942" target="_blank" rel="noopener">ARROW-12942</a> - [C++][Compute] The result of Arrow compute hash_min_max is incorrect if there are new groups in the subsequent chunks</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12956" target="_blank" rel="noopener">ARROW-12956</a> - [C++] Fix crash on Parquet file (OSS-Fuzz)</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12969" target="_blank" rel="noopener">ARROW-12969</a> - [C++] match_substring doesn't match empty needle to empty haystack</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12974" target="_blank" rel="noopener">ARROW-12974</a> - [R] test-r-without-arrow build fails because of example requiring Arrow</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12983" target="_blank" rel="noopener">ARROW-12983</a> - [C++][Python] Converter::Extend gets stuck in infinite loop causing OOM if values don't fit in single chunk</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12987" target="_blank" rel="noopener">ARROW-12987</a> - [CI] test-ubuntu-18.04 nightly builds are failing due to Gandiva "TestUpper" test failure</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12988" target="_blank" rel="noopener">ARROW-12988</a> - [CI] The kartothek nightly integration build is failing (test_update_dataset_from_ddf_empty)</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12988" target="_blank" rel="noopener">ARROW-12988</a> - [CI] The kartothek nightly integration build is failing (test_update_dataset_from_ddf_empty)</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12989" target="_blank" rel="noopener">ARROW-12989</a> - [CI] "Dev PR" jobs undully cancelled</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12991" target="_blank" rel="noopener">ARROW-12991</a> - [CI] Travis ARM builds often crash</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12993" target="_blank" rel="noopener">ARROW-12993</a> - [Python] Address boundary error with invalid Feather file and stackprinter</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12995" target="_blank" rel="noopener">ARROW-12995</a> - [C++] CSV reader should validate options</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12998" target="_blank" rel="noopener">ARROW-12998</a> - [C++] Datasets needs dependency on xsimd</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13001" target="_blank" rel="noopener">ARROW-13001</a> - [Go] Build failure in parquet/internal/bmi on s390x</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13003" target="_blank" rel="noopener">ARROW-13003</a> - [C++] unaligned access in compute/exec/ cc files</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13008" target="_blank" rel="noopener">ARROW-13008</a> - [C++] Deprecation warning when compiling minimal example</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13010" target="_blank" rel="noopener">ARROW-13010</a> - [C++][Compute] Support outputting to slices from kleene kernels</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13018" target="_blank" rel="noopener">ARROW-13018</a> - [C++][Docs] Use consistent terminology for nulls (min_count) in scalar aggregate kernels</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13026" target="_blank" rel="noopener">ARROW-13026</a> - [C++][CI] s390x job setup fails</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13037" target="_blank" rel="noopener">ARROW-13037</a> - [R] Incorrect param when creating Expression crashes R</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13039" target="_blank" rel="noopener">ARROW-13039</a> - [R] Fix error message handling</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13041" target="_blank" rel="noopener">ARROW-13041</a> - [C++] Unary kernels can leave uninitialized data under null entries</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13046" target="_blank" rel="noopener">ARROW-13046</a> - [Release] JS package failing test prior to publish</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13048" target="_blank" rel="noopener">ARROW-13048</a> - [C++] S3FileSystem fails moving filepaths containing = or +</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13053" target="_blank" rel="noopener">ARROW-13053</a> - [Python] Build fails on MacOS Big Sur using homebrewed Arrow libraries</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13069" target="_blank" rel="noopener">ARROW-13069</a> - [Website] Add Daniël to committer list</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13073" target="_blank" rel="noopener">ARROW-13073</a> - [Developer] archery benchmark list: unexpected keyword 'benchmark_filter'</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13080" target="_blank" rel="noopener">ARROW-13080</a> - [Release] Generate the API docs in ubuntu 20.10</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13083" target="_blank" rel="noopener">ARROW-13083</a> - [Python] Wrong SCM version detection both in setup.py and crossbow</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13085" target="_blank" rel="noopener">ARROW-13085</a> - [Python] Apache Arrow minimal cpp build segfaults with pyarrow libs</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13090" target="_blank" rel="noopener">ARROW-13090</a> - [Python] Test failure with ffspec 2021.6.0</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13104" target="_blank" rel="noopener">ARROW-13104</a> - [C++] ByteStreamSplit implementation uses invalid pointer cast</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13108" target="_blank" rel="noopener">ARROW-13108</a> - [Python] Pyarrow 4.0.0 crashes upon import on macOS 10.13.6</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13116" target="_blank" rel="noopener">ARROW-13116</a> - [R] Test for RecordBatchReader to C-interface fails on arrow-r-minimal due to missing dependencies</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13125" target="_blank" rel="noopener">ARROW-13125</a> - [R] Throw error when 2+ args passed to desc() in arrange()</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13128" target="_blank" rel="noopener">ARROW-13128</a> - [C#] TimestampArray conversion logic for nano and micro is wrong</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13135" target="_blank" rel="noopener">ARROW-13135</a> - [C++] Fix Status propagation in END_PARQUET_CATCH_EXCEPTIONS</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13139" target="_blank" rel="noopener">ARROW-13139</a> - [C++] ReadaheadGenerator cannot be safely copied/moved</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13145" target="_blank" rel="noopener">ARROW-13145</a> - [C++][CI] Flight test crashes on MinGW</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13148" target="_blank" rel="noopener">ARROW-13148</a> - [Dev][Archery] Crossbow build submission fails</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13153" target="_blank" rel="noopener">ARROW-13153</a> - [C++] `parquet_dataset` loses ordering of files in `_metadata`</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13154" target="_blank" rel="noopener">ARROW-13154</a> - [C++] Unions can not have 126 and 127 as type_codes</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13169" target="_blank" rel="noopener">ARROW-13169</a> - [R] [C++] sorted partition keys can cause issues</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13173" target="_blank" rel="noopener">ARROW-13173</a> - [C++] TestAsyncUtil.ReadaheadFailed asserts occasionally</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13187" target="_blank" rel="noopener">ARROW-13187</a> - [c++][python] Possibly memory not deallocated when reading in CSV</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13189" target="_blank" rel="noopener">ARROW-13189</a> - [R] Disable row-level metadata application on datasets</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13203" target="_blank" rel="noopener">ARROW-13203</a> - [R] Fix optional component checks causing failures</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13207" target="_blank" rel="noopener">ARROW-13207</a> - [Python][Doc] Dataset documentation still suggests deprecated scan method as the preferred iterative approach</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13216" target="_blank" rel="noopener">ARROW-13216</a> - [R] Type checks test fails with rtools35</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13217" target="_blank" rel="noopener">ARROW-13217</a> - [C++][Gandiva] Correct convert_replace function for invalid chars on string beginning</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13223" target="_blank" rel="noopener">ARROW-13223</a> - [C++][CI] Fix thread sanitizer failures</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13225" target="_blank" rel="noopener">ARROW-13225</a> - [Go][Flight] Implement Custom Middleware Interface and Enable Integration Tests</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13229" target="_blank" rel="noopener">ARROW-13229</a> - [Python] ascii_trim, ascii_ltrim and ascii_rtrim lack options</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13239" target="_blank" rel="noopener">ARROW-13239</a> - [Doc][Python] Dataset.head function doesn't mention required argument</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13243" target="_blank" rel="noopener">ARROW-13243</a> - [R] altrep function call in R 3.5</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13246" target="_blank" rel="noopener">ARROW-13246</a> - [C++] CSV skip_rows_after_names can discard data prematurally</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13249" target="_blank" rel="noopener">ARROW-13249</a> - [Java][CI] Consistent timeout in the Java JNI build</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13253" target="_blank" rel="noopener">ARROW-13253</a> - [C++][FlightRPC] Segfault when sending record batch &gt;2GB</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13254" target="_blank" rel="noopener">ARROW-13254</a> - [Python] Processes killed and semaphore objects leaked when reading pandas data</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13265" target="_blank" rel="noopener">ARROW-13265</a> - [R] cli valgrind errors in nightlies</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13266" target="_blank" rel="noopener">ARROW-13266</a> - [JS] Improve benchmark names &amp; add suite name to json</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13281" target="_blank" rel="noopener">ARROW-13281</a> - [C++][Gandiva] Error on timestampDiffMonth function behavior for negative diff values</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13284" target="_blank" rel="noopener">ARROW-13284</a> - [C++] Wrong pkg_check_modules() option name</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13288" target="_blank" rel="noopener">ARROW-13288</a> - [Python] Missing default values of kernel options in PyArrow</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13290" target="_blank" rel="noopener">ARROW-13290</a> - Compilation fails on clang-12 and gcc-11 due to missing include</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13305" target="_blank" rel="noopener">ARROW-13305</a> - [C++] Unable to install nightly on Ubuntu 21.04 due to CSV options</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13315" target="_blank" rel="noopener">ARROW-13315</a> - [R] Wrap r_task_group includes with ARROW_R_WITH_ARROW checking</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13321" target="_blank" rel="noopener">ARROW-13321</a> - [C++][Python] MakeArrayFromScalar doesn't work for FixedSizeBinaryType</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13324" target="_blank" rel="noopener">ARROW-13324</a> - [R] Typo in bindings for utf8_reverse and ascii_reverse</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13332" target="_blank" rel="noopener">ARROW-13332</a> - [C++] TSAN failure in TestAsyncUtil.ReadaheadFailed</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13341" target="_blank" rel="noopener">ARROW-13341</a> - [C++] Segfault in arrow-compute-plan-test ExecPlanExecution.SourceScalarAggSink</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13350" target="_blank" rel="noopener">ARROW-13350</a> - [Python][CI] conda-python-3.7-pandas-0.24 nightly build failing in test_extract_datetime_components</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13352" target="_blank" rel="noopener">ARROW-13352</a> - [C++] Valgrind failure in case_when kernel</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13353" target="_blank" rel="noopener">ARROW-13353</a> - [Documentation] Build failing with sphinx.util.cfamily.DefinitionError</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13360" target="_blank" rel="noopener">ARROW-13360</a> - [C++] Missing dependencies in C++ thirdparty offline dependencies versions.txt</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13363" target="_blank" rel="noopener">ARROW-13363</a> - [R] is.nan() errors on non-floating point data</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13368" target="_blank" rel="noopener">ARROW-13368</a> - [C++][Doc] Rename project to make_struct in docs</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13381" target="_blank" rel="noopener">ARROW-13381</a> - [C++] ArrayFromJSON doesn't work for float value dictionary type</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13382" target="_blank" rel="noopener">ARROW-13382</a> - [C++] Aggregation over scalars fails autobrew R job</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13384" target="_blank" rel="noopener">ARROW-13384</a> - [C++] Specify minimum required zstd version in cmake</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13391" target="_blank" rel="noopener">ARROW-13391</a> - [C++] CSV streaming reader does not include same error information as table reader</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13417" target="_blank" rel="noopener">ARROW-13417</a> - [C++] The merged generator can sometimes pull from source sync-reentrant</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13419" target="_blank" rel="noopener">ARROW-13419</a> - [JS] Fix perf tests</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13428" target="_blank" rel="noopener">ARROW-13428</a> - [C++][Flight] -lssl is missing with bundled gRPC and system shared OpenSSL</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13431" target="_blank" rel="noopener">ARROW-13431</a> - [Release] Bump go version to 1.15; don't verify rust source anymore</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13432" target="_blank" rel="noopener">ARROW-13432</a> - [Release] Fix ssh connection to the binary uploader container</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13450" target="_blank" rel="noopener">ARROW-13450</a> - [Python][Packaging] Set deployment target to 10.13 for universal2 wheels</li>
</ul>
<h3>New Features and Improvements</h3>
<ul>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-2665" target="_blank" rel="noopener">ARROW-2665</a> - [Python/C++] Add index() method to find first occurence of Python scalar</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-3014" target="_blank" rel="noopener">ARROW-3014</a> - [C++] Minimal writer adapter for ORC file format</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-3316" target="_blank" rel="noopener">ARROW-3316</a> - [R] Multi-threaded conversion from R data.frame to Arrow table / record batch</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-5385" target="_blank" rel="noopener">ARROW-5385</a> - [Go] implement EXTENSION datatype</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-5640" target="_blank" rel="noopener">ARROW-5640</a> - [Go] implement Map array</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-6513" target="_blank" rel="noopener">ARROW-6513</a> - [CI] The conda environment files arrow/ci/conda_env_*.yml should have .txt extension</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-6513" target="_blank" rel="noopener">ARROW-6513</a> - [CI] The conda environment files arrow/ci/conda_env_*.yml should have .txt extension</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-7001" target="_blank" rel="noopener">ARROW-7001</a> - [C++] Develop threading APIs to accommodate nested parallelism</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-7114" target="_blank" rel="noopener">ARROW-7114</a> - [JS][CI] NodeJS build fails on Github Actions Windows node</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-7252" target="_blank" rel="noopener">ARROW-7252</a> - [Rust] [Parquet] Reading UTF-8/JSON/ENUM field results in a lot of vec allocation</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-7396" target="_blank" rel="noopener">ARROW-7396</a> - [Format] Register media types (MIME types) for Apache Arrow formats to IANA</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-8421" target="_blank" rel="noopener">ARROW-8421</a> - [Rust] [Parquet] Implement parquet writer</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-8459" target="_blank" rel="noopener">ARROW-8459</a> - [Dev][Archery] Use a more recent cmake-format</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-8527" target="_blank" rel="noopener">ARROW-8527</a> - [C++][CSV] Add support for ReadOptions::skip_rows &gt;= block_size</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-8655" target="_blank" rel="noopener">ARROW-8655</a> - [C++][Dataset][Python][R] Preserve partitioning information for a discovered Dataset</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-8676" target="_blank" rel="noopener">ARROW-8676</a> - [Rust] Create implementation of IPC RecordBatch body buffer compression from ARROW-300</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-9054" target="_blank" rel="noopener">ARROW-9054</a> - [C++] Add ScalarAggregateOptions</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-9056" target="_blank" rel="noopener">ARROW-9056</a> - [C++] Support scalar aggregation over scalars</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-9140" target="_blank" rel="noopener">ARROW-9140</a> - [R] Zero-copy Arrow to R where possible</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-9295" target="_blank" rel="noopener">ARROW-9295</a> - [Archery] Support rust clippy in the lint command</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-9299" target="_blank" rel="noopener">ARROW-9299</a> - [Python] Expose ORC metadata() in Python ORCFile</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-9313" target="_blank" rel="noopener">ARROW-9313</a> - [Rust] Use feature enum</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-9421" target="_blank" rel="noopener">ARROW-9421</a> - [C++][Parquet] Redundancies SchemaManifest::GetFieldIndices</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-9430" target="_blank" rel="noopener">ARROW-9430</a> - [C++/Python] Kernel for SetItem(BooleanArray, values)</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-9697" target="_blank" rel="noopener">ARROW-9697</a> - [C++][Dataset] num_rows method for Dataset/Scanner</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-10031" target="_blank" rel="noopener">ARROW-10031</a> - [Java] Support Java benchmark in Archery</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-10115" target="_blank" rel="noopener">ARROW-10115</a> - [C++] CSV empty quoted string is treated as NULL</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-10316" target="_blank" rel="noopener">ARROW-10316</a> - [Python] Consider using __wrapped__ for compute function introspection</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-10391" target="_blank" rel="noopener">ARROW-10391</a> - [Rust] [Parquet] Nested Arrow reader</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-10440" target="_blank" rel="noopener">ARROW-10440</a> - [C++][Dataset][Python] Add a callback to visit file writers just before Finish()</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-10550" target="_blank" rel="noopener">ARROW-10550</a> - [Rust] [Parquet] Write nested types (struct, list)</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-10557" target="_blank" rel="noopener">ARROW-10557</a> - [C++] Add scalar string slicing/substring extract kernel</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-10640" target="_blank" rel="noopener">ARROW-10640</a> - [C++] An "if_else" kernel to combine two arrays based on a mask</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-10658" target="_blank" rel="noopener">ARROW-10658</a> - [Python][Packaging] Wheel builds for Apple Silicon</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-10675" target="_blank" rel="noopener">ARROW-10675</a> - [C++][Python] Support AWS S3 Web identity credentials</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-10797" target="_blank" rel="noopener">ARROW-10797</a> - [C++] Investigate faster random generation for tests and benchmarks</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-10926" target="_blank" rel="noopener">ARROW-10926</a> - [Rust] Add parquet reader / writer for decimal types</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-10959" target="_blank" rel="noopener">ARROW-10959</a> - [C++] Add scalar string join kernel</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11061" target="_blank" rel="noopener">ARROW-11061</a> - [Rust] Validate array properties against schema</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11173" target="_blank" rel="noopener">ARROW-11173</a> - Add Map type as reader / writer in FieldReader / FieldWriter</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11199" target="_blank" rel="noopener">ARROW-11199</a> - [C++][Python] Fix the unit tests for the ORC reader</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11206" target="_blank" rel="noopener">ARROW-11206</a> - [C++][Compute][Python] Rename "project" kernel to "make_struct"</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11342" target="_blank" rel="noopener">ARROW-11342</a> - [Python] [Gandiva] Expose ToString and result type information</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11499" target="_blank" rel="noopener">ARROW-11499</a> - [Packaging] Remove all use of bintray</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11514" target="_blank" rel="noopener">ARROW-11514</a> - [R][C++] Bindings for paste(), paste0(), str_c()</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11515" target="_blank" rel="noopener">ARROW-11515</a> - [R] Bindings for strsplit</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11565" target="_blank" rel="noopener">ARROW-11565</a> - [C++][Gandiva] Modify upper()/lower() to work with UTF8 and add INIT_CAP function</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11608" target="_blank" rel="noopener">ARROW-11608</a> - [CI] turbodbc integration tests are failing (build isue)</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11660" target="_blank" rel="noopener">ARROW-11660</a> - [C++] Move RecordBatch::SelectColumns method from R to C++ library</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11673" target="_blank" rel="noopener">ARROW-11673</a> - [C++] Casting dictionary type to use different index type</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11675" target="_blank" rel="noopener">ARROW-11675</a> - [CI][C++] Resolve ctest failures on VS 2019 builds</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11705" target="_blank" rel="noopener">ARROW-11705</a> - [R] Support scalar value recycling in RecordBatch/Table$create()</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11748" target="_blank" rel="noopener">ARROW-11748</a> - [C++] Ensure Decimal128 and Decimal256's fields are in native endian order</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11759" target="_blank" rel="noopener">ARROW-11759</a> - [C++] Kernel to extract datetime components (year, month, day, etc) from timestamp type</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11769" target="_blank" rel="noopener">ARROW-11769</a> - [R] Pull groups from grouped_df into RecordBatch or Table</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11772" target="_blank" rel="noopener">ARROW-11772</a> - [C++] Add asynchronous read to ipc::RecordBatchFileReader</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11782" target="_blank" rel="noopener">ARROW-11782</a> - [GLib][Ruby][Dataset] Remove bindings for internal classes</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11787" target="_blank" rel="noopener">ARROW-11787</a> - [R] Implement write csv</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11843" target="_blank" rel="noopener">ARROW-11843</a> - [C++] Add asynchronous read to parquet::arrow::FileReader</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11849" target="_blank" rel="noopener">ARROW-11849</a> - [R] Use roxygen @examplesIf tag in R docs</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11889" target="_blank" rel="noopener">ARROW-11889</a> - [C++] Add parallelism to streaming CSV reader</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11909" target="_blank" rel="noopener">ARROW-11909</a> - [C++] Get rid of MakeIteratorGenerator</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11926" target="_blank" rel="noopener">ARROW-11926</a> - [R] Pass on the new UCRT CRAN windows builds</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11926" target="_blank" rel="noopener">ARROW-11926</a> - [R] Pass on the new UCRT CRAN windows builds</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11928" target="_blank" rel="noopener">ARROW-11928</a> - [C++][Compute] Add ExecNode hierarchy</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11929" target="_blank" rel="noopener">ARROW-11929</a> - [C++][Compute] Promote Expression to the compute namespace</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11930" target="_blank" rel="noopener">ARROW-11930</a> - [C++][Dataset][Compute] Refactor Dataset scans to use an ExecNode graph</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11932" target="_blank" rel="noopener">ARROW-11932</a> - [C++] Provide ArrayBuilder::AppendScalar</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11950" target="_blank" rel="noopener">ARROW-11950</a> - [C++][Compute] Add unary negative kernel</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11960" target="_blank" rel="noopener">ARROW-11960</a> - [C++][Gandiva] Support escape in LIKE</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11980" target="_blank" rel="noopener">ARROW-11980</a> - [Python] Remove "experimental" status from Table.replace_schema_metadata</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11986" target="_blank" rel="noopener">ARROW-11986</a> - [C++][Gandiva] Implement IN expressions for doubles and floats</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-11990" target="_blank" rel="noopener">ARROW-11990</a> - [C++][Compute] Use Status/Result return consistently to indicate errors</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12004" target="_blank" rel="noopener">ARROW-12004</a> - [C++] Result<a href="detail::Empty%5C">detail::Empty\</a> is annoying</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12010" target="_blank" rel="noopener">ARROW-12010</a> - [C++][Compute] Improve performance of the hash table used in GroupIdentifier</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12016" target="_blank" rel="noopener">ARROW-12016</a> - [C++] Implement array_sort_indices and sort_indices for BOOL type</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12050" target="_blank" rel="noopener">ARROW-12050</a> - [C++][Python][FlightRPC] Use StopToken to enable interrupting long Flight operations</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12074" target="_blank" rel="noopener">ARROW-12074</a> - [C++][Compute] Add scalar arithmetic kernels for decimal inputs</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12083" target="_blank" rel="noopener">ARROW-12083</a> - [R] schema use in open_dataset</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12166" target="_blank" rel="noopener">ARROW-12166</a> - [C++][Gandiva] Implements CONVERT_TO(value, type) function</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12184" target="_blank" rel="noopener">ARROW-12184</a> - [R] Bindings for na.fail, na.omit, na.exclude, na.pass</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12185" target="_blank" rel="noopener">ARROW-12185</a> - [R] Bindings for any, all</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12198" target="_blank" rel="noopener">ARROW-12198</a> - [R] bindings for strptime</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12199" target="_blank" rel="noopener">ARROW-12199</a> - [R] bindings for stddev, variance</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12205" target="_blank" rel="noopener">ARROW-12205</a> - [C++][Gandiva] Implement TO_TIME([number] secs) and TO_TIMESTAMP([number] secs) function</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12231" target="_blank" rel="noopener">ARROW-12231</a> - [C++][Dataset] Separate datasets backed by readers from InMemoryDataset</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12253" target="_blank" rel="noopener">ARROW-12253</a> - [Rust] [Ballista] Implement scalable joins</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12255" target="_blank" rel="noopener">ARROW-12255</a> - [Rust] [Ballista] Integrate scheduler with DataFusion</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12256" target="_blank" rel="noopener">ARROW-12256</a> - [Rust] [Ballista] Add DataFrame support</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12257" target="_blank" rel="noopener">ARROW-12257</a> - [Rust] [Ballista] Publish user guide to Arrow site</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12261" target="_blank" rel="noopener">ARROW-12261</a> - [Rust] [Ballista] Ballista should not have its own DataFrame API</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12291" target="_blank" rel="noopener">ARROW-12291</a> - [R] Determine the type of an unevaluated expression</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12310" target="_blank" rel="noopener">ARROW-12310</a> - [Java] ValueVector#getObject should support covariance for complex types</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12355" target="_blank" rel="noopener">ARROW-12355</a> - [C++] Implement efficient async CSV scanning</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12362" target="_blank" rel="noopener">ARROW-12362</a> - [Rust] [DataFusion] topk_query test failure</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12364" target="_blank" rel="noopener">ARROW-12364</a> - [Python] [Dataset] Add metadata_collector option to ds.write_dataset()</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12378" target="_blank" rel="noopener">ARROW-12378</a> - [C++][Gandiva] Implement castVARBINARY functions</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12386" target="_blank" rel="noopener">ARROW-12386</a> - [C++] Support file parallelism in AsyncScanner</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12391" target="_blank" rel="noopener">ARROW-12391</a> - [Rust][DataFusion] Implement date_trunc() function</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12392" target="_blank" rel="noopener">ARROW-12392</a> - [C++] Restore asynchronous streaming CSV reader</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12393" target="_blank" rel="noopener">ARROW-12393</a> - [JS] Optimally use closure compiler</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12403" target="_blank" rel="noopener">ARROW-12403</a> - [Rust] [Ballista] Integration tests should check that query results are correct</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12415" target="_blank" rel="noopener">ARROW-12415</a> - [CI] [Python] ERROR: Failed building wheel for pygit2 on ARM64</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12424" target="_blank" rel="noopener">ARROW-12424</a> - [Go][Parquet] Add Schema Package</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12428" target="_blank" rel="noopener">ARROW-12428</a> - [Python] pyarrow.parquet.read_* should use pre_buffer=True</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12434" target="_blank" rel="noopener">ARROW-12434</a> - [Rust] [Ballista] Show executed plans with metrics</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12442" target="_blank" rel="noopener">ARROW-12442</a> - [CI] Set job timeouts on GitHub Actions</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12443" target="_blank" rel="noopener">ARROW-12443</a> - [C++][Gandiva] Implement castVARCHAR function for binary input</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12444" target="_blank" rel="noopener">ARROW-12444</a> - [RUST] [CI] Remove Rust and point integration tests to arrow-rs repo</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12445" target="_blank" rel="noopener">ARROW-12445</a> - [Rust] Design and implement packaging process to bundle Rust in signed tar</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12468" target="_blank" rel="noopener">ARROW-12468</a> - [Python][R] Expose UseAsync to python/R</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12478" target="_blank" rel="noopener">ARROW-12478</a> - [C++] Support LLVM 12</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12484" target="_blank" rel="noopener">ARROW-12484</a> - [CI] Change jinja macros to not require CROSSBOW_TOKEN to upload artifacts in Github Actions</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12489" target="_blank" rel="noopener">ARROW-12489</a> - [Developer] autotune is broken</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12490" target="_blank" rel="noopener">ARROW-12490</a> - [Dev] Use miniforge for all platforms</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12492" target="_blank" rel="noopener">ARROW-12492</a> - [Python] Add an helper method to decode a DictionaryArray back to a plain Array</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12496" target="_blank" rel="noopener">ARROW-12496</a> - [C++][Dataset] Ensure Scanner tests fully cover async</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12499" target="_blank" rel="noopener">ARROW-12499</a> - [C++][Compute][R] Add ScalarAggregateOptions to Any and All kernels</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12500" target="_blank" rel="noopener">ARROW-12500</a> - [C++][Dataset] Consolidate similar tests for file formats</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12501" target="_blank" rel="noopener">ARROW-12501</a> - [CI][Ruby] Remove needless workaround for MinGW build</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12507" target="_blank" rel="noopener">ARROW-12507</a> - [CI] Remove duplicated cron/nightly builds</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12512" target="_blank" rel="noopener">ARROW-12512</a> - [C++][Dataset] Implement CSV writing support</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12514" target="_blank" rel="noopener">ARROW-12514</a> - [Release] Don't run Gandiva related Ruby test with ARROW_GANDIVA=OFF</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12517" target="_blank" rel="noopener">ARROW-12517</a> - [Go] Expose App Metadata in Flight client</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12518" target="_blank" rel="noopener">ARROW-12518</a> - [Python] Expose Parquet statistics has_null_count / has_distinct_count</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12520" target="_blank" rel="noopener">ARROW-12520</a> - [R] Minor docs updates</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12522" target="_blank" rel="noopener">ARROW-12522</a> - [C++] Implement asynchronous/"lazy" variants of ReadRangeCache</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12525" target="_blank" rel="noopener">ARROW-12525</a> - [JS] Vector toJSON returns an array</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12527" target="_blank" rel="noopener">ARROW-12527</a> - [Dev] Don't try getting JIRA information for MINOR PR</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12528" target="_blank" rel="noopener">ARROW-12528</a> - [JS] Support typed arrays in Table.new</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12530" target="_blank" rel="noopener">ARROW-12530</a> - [C++] Remove Buffer::mutable_data_ member and use const_cast on data_ only if is_mutable_ is true</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12533" target="_blank" rel="noopener">ARROW-12533</a> - [C++] Random real generator is slow on Arm64 Linux when built with clang</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12534" target="_blank" rel="noopener">ARROW-12534</a> - [C++][Gandiva] Implement LEFT and RIGHT functions on Gandiva for string input values</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12537" target="_blank" rel="noopener">ARROW-12537</a> - [JS] Docs build should not include test sources</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12541" target="_blank" rel="noopener">ARROW-12541</a> - [Docs] Improve styling/readability of tables in the new doc theme</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12551" target="_blank" rel="noopener">ARROW-12551</a> - [Java][Release] Java post-release tests fail due to missing testing data</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12554" target="_blank" rel="noopener">ARROW-12554</a> - Allow duplicates in the value_set for compute::is_in</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12555" target="_blank" rel="noopener">ARROW-12555</a> - [Java][Release] Java post-release script misses dataset JNI bindings</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12556" target="_blank" rel="noopener">ARROW-12556</a> - [C++][Gandiva] Implement BYTESUBSTRING functions on Gandiva</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12560" target="_blank" rel="noopener">ARROW-12560</a> - [C++] Investigate utilizing aggressive thread task creation when adding callback to finished future</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12567" target="_blank" rel="noopener">ARROW-12567</a> - [C++][Gandiva] Implement LPAD and RPAD functions for string input values</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12567" target="_blank" rel="noopener">ARROW-12567</a> - [C++][Gandiva] Implement LPAD and RPAD functions for string input values</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12571" target="_blank" rel="noopener">ARROW-12571</a> - [R][CI] Run nightly R with valgrind</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12575" target="_blank" rel="noopener">ARROW-12575</a> - [R] Use unary negative kernel</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12577" target="_blank" rel="noopener">ARROW-12577</a> - [Website] Use Artifactory instead of Bintray in all places</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12578" target="_blank" rel="noopener">ARROW-12578</a> - [JS] Simplify UTF8 handling in NodeJS</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12581" target="_blank" rel="noopener">ARROW-12581</a> - [C++][FlightRPC] Benchmark compression with real data</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12584" target="_blank" rel="noopener">ARROW-12584</a> - [C++][Python] Expose method for benchmarking tools to release unused memory from the allocators</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12591" target="_blank" rel="noopener">ARROW-12591</a> - [Java][Gandiva] Create single Gandiva jar for MacOS and Linux</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12593" target="_blank" rel="noopener">ARROW-12593</a> - [Packaging][Ubuntu] Add support for Ubuntu 21.04</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12597" target="_blank" rel="noopener">ARROW-12597</a> - [C++] Implement OptionalParallelForAsync</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12598" target="_blank" rel="noopener">ARROW-12598</a> - [C++][Dataset] Implement row-count for CSV or allow selecting 0 columns from CSV</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12599" target="_blank" rel="noopener">ARROW-12599</a> - [Doc][Python] Documentation missing for pyarrow.Table</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12600" target="_blank" rel="noopener">ARROW-12600</a> - [CI] Push docker images from crossbow tasks</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12602" target="_blank" rel="noopener">ARROW-12602</a> - [R] Add BuildInfo from C++ to arrow_info</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12608" target="_blank" rel="noopener">ARROW-12608</a> - [C++] Add split_pattern_regex function</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12612" target="_blank" rel="noopener">ARROW-12612</a> - [C++][Compute] Add Expression to type_fwd.h</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12619" target="_blank" rel="noopener">ARROW-12619</a> - [Python] pyarrow sdist should not require git</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12621" target="_blank" rel="noopener">ARROW-12621</a> - [C++][Gandiva] Add alias to sha1 and sha256 functions</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12631" target="_blank" rel="noopener">ARROW-12631</a> - [Python] pyarrow.dataset.write_table should accept a Scanner to write</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12643" target="_blank" rel="noopener">ARROW-12643</a> - Add documentation for experimental repos</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12645" target="_blank" rel="noopener">ARROW-12645</a> - [Python] Fix numpydoc validation</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12648" target="_blank" rel="noopener">ARROW-12648</a> - [C++][FlightRPC] Allow using TLS in benchmark</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12649" target="_blank" rel="noopener">ARROW-12649</a> - [Python/Packaging] Move conda-aarch64 to Azure with cross-compilation</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12653" target="_blank" rel="noopener">ARROW-12653</a> - [Archery] allow me to add a comment to crossbow requests</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12658" target="_blank" rel="noopener">ARROW-12658</a> - [C++] Bump aws-c-common to v0.5.10</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12660" target="_blank" rel="noopener">ARROW-12660</a> - [R] Post-4.0 adjustments for CRAN</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12661" target="_blank" rel="noopener">ARROW-12661</a> - [C++] CSV add skip rows after column names</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12662" target="_blank" rel="noopener">ARROW-12662</a> - [Website] Force to use squash merge</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12667" target="_blank" rel="noopener">ARROW-12667</a> - [Python] Ensure test coverage for conversion of strided numpy arrays</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12675" target="_blank" rel="noopener">ARROW-12675</a> - [C++] CSV should include line/row numbers in parsing error messages</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12677" target="_blank" rel="noopener">ARROW-12677</a> - [Python] Add a mask argument to pyarrow.StructArray.from_arrays</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12685" target="_blank" rel="noopener">ARROW-12685</a> - [C++][Compute] Add unary absolute value kernel</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12686" target="_blank" rel="noopener">ARROW-12686</a> - [C++][Python][FlightRPC] Support export_to_c in DoGet/inherit from RecordBatchReader</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12687" target="_blank" rel="noopener">ARROW-12687</a> - [C++][Python][Dataset] Support C Data Interface with Scanner</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12689" target="_blank" rel="noopener">ARROW-12689</a> - [R] Implement ArrowArrayStream C interface</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12692" target="_blank" rel="noopener">ARROW-12692</a> - [R] Improve tests and comments for strsplit() bindings</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12694" target="_blank" rel="noopener">ARROW-12694</a> - [R][CI] rtools35 job failing on 32-bit build tests</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12696" target="_blank" rel="noopener">ARROW-12696</a> - [R] Improve testing of error messages converted to warnings</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12699" target="_blank" rel="noopener">ARROW-12699</a> - [CI][Packaging][Java] Generate a jar compatible with Linux and MacOS for all Arrow components</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12701" target="_blank" rel="noopener">ARROW-12701</a> - [Website][Release] Include Rust contributors, committers, and commits in release notes</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12702" target="_blank" rel="noopener">ARROW-12702</a> - [JS] Upgrade Webpack and terser</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12703" target="_blank" rel="noopener">ARROW-12703</a> - [JS] Separate Table from DataFrame</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12704" target="_blank" rel="noopener">ARROW-12704</a> - [JS] use optional chaining</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12709" target="_blank" rel="noopener">ARROW-12709</a> - [C++] Add variadic string join kernel</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12713" target="_blank" rel="noopener">ARROW-12713</a> - [C++] String reverse kernel</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12715" target="_blank" rel="noopener">ARROW-12715</a> - [C++] SQL-style glob string match kernel</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12716" target="_blank" rel="noopener">ARROW-12716</a> - [C++] Left/right/center string padding kernels</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12717" target="_blank" rel="noopener">ARROW-12717</a> - [C++] Substring find position kernel</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12719" target="_blank" rel="noopener">ARROW-12719</a> - [C++][Python] pyarrow.fs.S3FileSystem pass extra kwargs i.e ACL</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12721" target="_blank" rel="noopener">ARROW-12721</a> - [CI] Fix path for uploading aarch64 conda artifacts from the nightly builds</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12722" target="_blank" rel="noopener">ARROW-12722</a> - [R] Raise error when attemping to print table with duplicated naming</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12730" target="_blank" rel="noopener">ARROW-12730</a> - [MATLAB] Update featherreadmex and featherwritemex to build against latest arrow c++ APIs</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12731" target="_blank" rel="noopener">ARROW-12731</a> - [R] Use InMemoryDataset for Table/RecordBatch in dplyr code</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12736" target="_blank" rel="noopener">ARROW-12736</a> - [C++] Eliminate unnecessary copy in FieldPath::Get()</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12738" target="_blank" rel="noopener">ARROW-12738</a> - [CI] [Gandiva] Nightly build error in azure-conda-osx-clang-py38 (and py39, py*-r*)</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12741" target="_blank" rel="noopener">ARROW-12741</a> - [CI] Configure GitHub Token for Nightly Builds</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12745" target="_blank" rel="noopener">ARROW-12745</a> - [C++][Compute] Add floor, ceiling, and truncate kernels</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12749" target="_blank" rel="noopener">ARROW-12749</a> - [C++] Unnecessary copy cause by constructing RecordBatch/Table/Schema from lvalues</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12750" target="_blank" rel="noopener">ARROW-12750</a> - [CI] [R] Actually pass parameterized docker options to the templates</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12751" target="_blank" rel="noopener">ARROW-12751</a> - [C++] Add variadic row-wise min/max kernels (least/greatest)</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12758" target="_blank" rel="noopener">ARROW-12758</a> - [R] Add examples to more function documentation</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12759" target="_blank" rel="noopener">ARROW-12759</a> - [C++][Compute] Wrap grouped aggregation in an ExecNode</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12760" target="_blank" rel="noopener">ARROW-12760</a> - [C++][Python][R] S3FileSystem: IO thread parallelism limited to 8 threads</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12761" target="_blank" rel="noopener">ARROW-12761</a> - [R] Better error handling for write_to_raw</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12764" target="_blank" rel="noopener">ARROW-12764</a> - [CI] Fix arguments in Conda Windows builds</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12777" target="_blank" rel="noopener">ARROW-12777</a> - [R] Convert all inputs to Arrow objects in match_arrow and is_in</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12781" target="_blank" rel="noopener">ARROW-12781</a> - [R] Implement is.type() functions for dplyr</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12785" target="_blank" rel="noopener">ARROW-12785</a> - [CI] the r-devdocs build errors when brew installing gcc</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12791" target="_blank" rel="noopener">ARROW-12791</a> - [R] Better error handling for DatasetFactory$Finish() when no format specified</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12796" target="_blank" rel="noopener">ARROW-12796</a> - [JS] Support JSON output from benchmarks</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12800" target="_blank" rel="noopener">ARROW-12800</a> - [JS] Drop IE Support and remove text encoder and decoder polyfills</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12801" target="_blank" rel="noopener">ARROW-12801</a> - [CI][Packaging][Java] Include all modules in script that generate Arrow jars</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12806" target="_blank" rel="noopener">ARROW-12806</a> - [Python] test_write_to_dataset_filesystem missing a dataset mark</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12808" target="_blank" rel="noopener">ARROW-12808</a> - [JS] Document browser support</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12810" target="_blank" rel="noopener">ARROW-12810</a> - [Python] Run tests with AWS_EC2_METADATA_DISABLED=true</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12812" target="_blank" rel="noopener">ARROW-12812</a> - [Packaging][Java] Improve JNI jars build</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12824" target="_blank" rel="noopener">ARROW-12824</a> - [R][CI] Upgrade builds for R 4.1 release</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12827" target="_blank" rel="noopener">ARROW-12827</a> - [C++] [Dataset] Review error pass-through in the datasets API</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12829" target="_blank" rel="noopener">ARROW-12829</a> - [GLib][Ruby] Add support for Apache Arrow Flight</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12831" target="_blank" rel="noopener">ARROW-12831</a> - [CI][macOS] Remove needless Homebrew workaround</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12832" target="_blank" rel="noopener">ARROW-12832</a> - [JS] Write benchmarks in TypeScript</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12833" target="_blank" rel="noopener">ARROW-12833</a> - [JS] Construct perf data in JS</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12835" target="_blank" rel="noopener">ARROW-12835</a> - [C++] Implement case insenstive match in match_substring(_regex) and match_like</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12836" target="_blank" rel="noopener">ARROW-12836</a> - [C++] Installation on IBM i fails because of CxxFlags</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12841" target="_blank" rel="noopener">ARROW-12841</a> - [R] Add examples to more function documentation - part 2</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12843" target="_blank" rel="noopener">ARROW-12843</a> - [C++][Compute] Add is_inf kernel for floating point arrays</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12848" target="_blank" rel="noopener">ARROW-12848</a> - [Release] Mail template points to 404</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12851" target="_blank" rel="noopener">ARROW-12851</a> - [Go][Parquet] Add Encoding Package Part 1</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12856" target="_blank" rel="noopener">ARROW-12856</a> - [C++][Gandiva] Implement castBIT and castBOOLEAN functions on Gandiva</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12859" target="_blank" rel="noopener">ARROW-12859</a> - [C++] Add ScalarFromJSON for easier testing</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12861" target="_blank" rel="noopener">ARROW-12861</a> - [C++][Compute] Add sign function kernels</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12867" target="_blank" rel="noopener">ARROW-12867</a> - [R] Bindings for abs()</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12868" target="_blank" rel="noopener">ARROW-12868</a> - [R] Bindings for find_substring and find_substring_regex</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12869" target="_blank" rel="noopener">ARROW-12869</a> - [R] Bindings for utf8_reverse and ascii_reverse</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12870" target="_blank" rel="noopener">ARROW-12870</a> - [R] Bindings for stringr::str_like</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12875" target="_blank" rel="noopener">ARROW-12875</a> - [JS] Upgrade Jest and other minor updates</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12883" target="_blank" rel="noopener">ARROW-12883</a> - [R] [CI] version compatibility fails on R 4.1</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12891" target="_blank" rel="noopener">ARROW-12891</a> - [C++][Compute][Dataset] Extract subtree pruning logic to compute::</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12894" target="_blank" rel="noopener">ARROW-12894</a> - [R] Bump R version</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12895" target="_blank" rel="noopener">ARROW-12895</a> - [CI] Use "concurrency" setting on Github Actions</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12898" target="_blank" rel="noopener">ARROW-12898</a> - [Release][C#] Package upload script is broken</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12900" target="_blank" rel="noopener">ARROW-12900</a> - [Python][Documentation] an np import in Reading Datasets docs</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12901" target="_blank" rel="noopener">ARROW-12901</a> - [R] Follow on to more examples</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12909" target="_blank" rel="noopener">ARROW-12909</a> - [R][Release] Build of ubuntu-docs is failing</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12912" target="_blank" rel="noopener">ARROW-12912</a> - [Website] Use .asf.yaml for publishing</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12915" target="_blank" rel="noopener">ARROW-12915</a> - [Release] Build of ubuntu-docs is failing on thrift</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12936" target="_blank" rel="noopener">ARROW-12936</a> - [C++][Gandiva] Implement ASCII Hive function on Gandiva</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12937" target="_blank" rel="noopener">ARROW-12937</a> - [C++] Allow specifying default metadata for new S3 files</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12939" target="_blank" rel="noopener">ARROW-12939</a> - [R] Simplify RTask stop handling</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12940" target="_blank" rel="noopener">ARROW-12940</a> - [R] Expose C interface as R6 methods</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12948" target="_blank" rel="noopener">ARROW-12948</a> - [C++] Add string slice replace kernel</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12949" target="_blank" rel="noopener">ARROW-12949</a> - [C++] Add string starts-with/ends-with kernels</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12950" target="_blank" rel="noopener">ARROW-12950</a> - [C++] Add substring count kernel</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12951" target="_blank" rel="noopener">ARROW-12951</a> - [C++] Refactor StringTransform</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12952" target="_blank" rel="noopener">ARROW-12952</a> - [C++] Add regex count kernel</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12955" target="_blank" rel="noopener">ARROW-12955</a> - [C++] Add additional type support for if_else kernel</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12957" target="_blank" rel="noopener">ARROW-12957</a> - [R] rchk issues on cran</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12961" target="_blank" rel="noopener">ARROW-12961</a> - [C++] MSVC issues warning building PyArrow on Windows</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12962" target="_blank" rel="noopener">ARROW-12962</a> - [GLib][Ruby] Add Arrow:Scalar</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12964" target="_blank" rel="noopener">ARROW-12964</a> - [R] Add bindings for ifelse() and if_else()</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12966" target="_blank" rel="noopener">ARROW-12966</a> - [Python] Expose Python binding for ElementWiseAggregateOptions</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12967" target="_blank" rel="noopener">ARROW-12967</a> - [R] Add bindings for pmin() and pmax()</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12968" target="_blank" rel="noopener">ARROW-12968</a> - [R] [CI] Add an rchk job to our nightlies</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12972" target="_blank" rel="noopener">ARROW-12972</a> - [CI] ][C++] archive_write_add_filter_zstd error on CentOS + ARM64</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12975" target="_blank" rel="noopener">ARROW-12975</a> - [C++][Python] if_else kernel doesn't support upcasting</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12982" target="_blank" rel="noopener">ARROW-12982</a> - [C++] Re-enable unused-variable warning</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12984" target="_blank" rel="noopener">ARROW-12984</a> - [C++] Passing options parameter of Count/Index aggregation by reference</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12985" target="_blank" rel="noopener">ARROW-12985</a> - [Python][Packaging] Unable to install pygit2 in the arm64 wheel builds</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12986" target="_blank" rel="noopener">ARROW-12986</a> - [C++][Gandiva] Implement new cache eviction policy in Gandiva</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12992" target="_blank" rel="noopener">ARROW-12992</a> - [R] bindings for substr(), substring(), str_sub()</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12994" target="_blank" rel="noopener">ARROW-12994</a> - [R] Fix tests that assume UTC local tz</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-12996" target="_blank" rel="noopener">ARROW-12996</a> - [C++] CSV stream reader has no progress indication</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13002" target="_blank" rel="noopener">ARROW-13002</a> - [C++] Add a check for the utf8proc's version in CMake</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13005" target="_blank" rel="noopener">ARROW-13005</a> - [C++] Support filter/take for union data type.</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13006" target="_blank" rel="noopener">ARROW-13006</a> - [C++][Gandiva] Implement BASE64 and UNBASE64 Hive functions on Gandiva</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13009" target="_blank" rel="noopener">ARROW-13009</a> - [Doc][Dev] Document builds mailing-list</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13022" target="_blank" rel="noopener">ARROW-13022</a> - [R] bindings for lubridate's year, isoyear, quarter, month, day, wday, yday, isoweek, hour, minute, and second functions</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13025" target="_blank" rel="noopener">ARROW-13025</a> - [C++][Compute] Enhance FunctionOptions with equality, debug representability, and serializability</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13027" target="_blank" rel="noopener">ARROW-13027</a> - [C++] Fix ASAN stack traces in CI</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13030" target="_blank" rel="noopener">ARROW-13030</a> - [CI][Go] Setup Arm64 golang CI</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13031" target="_blank" rel="noopener">ARROW-13031</a> - [JS] Support arm in closure compiler on macOS</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13032" target="_blank" rel="noopener">ARROW-13032</a> - [Java] Update gauva version</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13034" target="_blank" rel="noopener">ARROW-13034</a> - [Python][Docs] Update outdated examples for hdfs/azure on the Parquet doc page</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13036" target="_blank" rel="noopener">ARROW-13036</a> - [Doc] Mention recommended file extension(s) for Arrow IPC</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13042" target="_blank" rel="noopener">ARROW-13042</a> - [C++] Automatic checks that kernels don't leave uninitialized data in output</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13043" target="_blank" rel="noopener">ARROW-13043</a> - [GLib][Ruby] Add GArrowEqualOptions</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13044" target="_blank" rel="noopener">ARROW-13044</a> - [Java] Union vectors should extend ValueVector</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13045" target="_blank" rel="noopener">ARROW-13045</a> - [Packaging][RPM][deb] Don't install system utf8proc if it's old</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13047" target="_blank" rel="noopener">ARROW-13047</a> - [Website] Add kiszk to committer list</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13049" target="_blank" rel="noopener">ARROW-13049</a> - [C++][Gandiva] Implement BIN Hive function on Gandiva</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13050" target="_blank" rel="noopener">ARROW-13050</a> - [C++][Gandiva] Implement SPACE Hive function on Gandiva</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13054" target="_blank" rel="noopener">ARROW-13054</a> - [C++] Add option to specify the first day of the week for the "day_of_week" temporal kernel</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13064" target="_blank" rel="noopener">ARROW-13064</a> - [C++] Add a general "if, ifelse, ..., else" kernel ("CASE WHEN")</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13065" target="_blank" rel="noopener">ARROW-13065</a> - [Packaging][RPM] Add missing required LZ4 version information</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13068" target="_blank" rel="noopener">ARROW-13068</a> - [GLib][Dataset] Change prefix to gadataset_ from gad_</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13070" target="_blank" rel="noopener">ARROW-13070</a> - [R] bindings for sd and var</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13072" target="_blank" rel="noopener">ARROW-13072</a> - [C++] Add bitwise arithmetic compute functions</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13074" target="_blank" rel="noopener">ARROW-13074</a> - [Python] Start with deprecating ParquetDataset custom attributes</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13075" target="_blank" rel="noopener">ARROW-13075</a> - [Python] Expose C data interface API for pyarrow.Field</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13076" target="_blank" rel="noopener">ARROW-13076</a> - [Java] Enable ExtensionType to use StructVector and UnionVector for underlying storage</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13082" target="_blank" rel="noopener">ARROW-13082</a> - [CI] Forward R argument to ubuntu-docs build</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13086" target="_blank" rel="noopener">ARROW-13086</a> - [Python] Expose Parquet ArrowReaderProperties::coerce_int96_timestamp_unit_</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13086" target="_blank" rel="noopener">ARROW-13086</a> - [Python] Expose Parquet ArrowReaderProperties::coerce_int96_timestamp_unit_</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13091" target="_blank" rel="noopener">ARROW-13091</a> - [Python] Add compression_level argument to IpcWriteOptions constructor</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13092" target="_blank" rel="noopener">ARROW-13092</a> - [C++] CreateDir should fail if the target exists and is not a directory</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13095" target="_blank" rel="noopener">ARROW-13095</a> - [C++] Implement trigonometric compute functions</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13096" target="_blank" rel="noopener">ARROW-13096</a> - [C++] Implement logarithm compute functions</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13097" target="_blank" rel="noopener">ARROW-13097</a> - [C++] Provide a simple reflection utility for s</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13098" target="_blank" rel="noopener">ARROW-13098</a> - [Dev][Archery] Reorganize docker submodule to its own subpackage</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13100" target="_blank" rel="noopener">ARROW-13100</a> - [MATLAB] Integrate GoogleTest with MATLAB Interface C++ Code</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13101" target="_blank" rel="noopener">ARROW-13101</a> - [Python][Doc] pyarrow.FixedSizeListArray does not appear in the documentation</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13110" target="_blank" rel="noopener">ARROW-13110</a> - [C++] Deadlock can happen when using BackgroundGenerator without transferring callbacks</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13113" target="_blank" rel="noopener">ARROW-13113</a> - [R] use RTasks to manage parallel in converting arrow to R</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13117" target="_blank" rel="noopener">ARROW-13117</a> - [R] Retain schema in new Expressions</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13119" target="_blank" rel="noopener">ARROW-13119</a> - [R] Set empty schema in scalar Expressions</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13124" target="_blank" rel="noopener">ARROW-13124</a> - [Ruby] Add support for memory view</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13127" target="_blank" rel="noopener">ARROW-13127</a> - [R] Valgrind nightly errors</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13136" target="_blank" rel="noopener">ARROW-13136</a> - [C++] Add a "coalesce" variadic scalar kernel</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13137" target="_blank" rel="noopener">ARROW-13137</a> - [C++][Documentation] Make in-table references consistent</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13140" target="_blank" rel="noopener">ARROW-13140</a> - [C++/Python] Upgrade libthrift pin in the nightlies</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13142" target="_blank" rel="noopener">ARROW-13142</a> - [Python] Use vector append when converting from list of non-strided numpy arrays</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13147" target="_blank" rel="noopener">ARROW-13147</a> - [Java] Respect the rounding policy when allocating vector buffers</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13157" target="_blank" rel="noopener">ARROW-13157</a> - [C++] Add find_substring_regex kernel and implement ignore_case for find_substring</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13158" target="_blank" rel="noopener">ARROW-13158</a> - [Python] Fix repr and contains of StructScalar with duplicate field names</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13162" target="_blank" rel="noopener">ARROW-13162</a> - [C++][Gandiva] Add new alias for extract date functions in Gandiva registry</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13171" target="_blank" rel="noopener">ARROW-13171</a> - [R] Add binding for str_pad()</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13190" target="_blank" rel="noopener">ARROW-13190</a> - [C++] [Gandiva] Change behavior of INITCAP function</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13194" target="_blank" rel="noopener">ARROW-13194</a> - [Java][Document] Create prose document about Java algorithms</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13195" target="_blank" rel="noopener">ARROW-13195</a> - [R] Problem with rlang reverse dependency checks</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13199" target="_blank" rel="noopener">ARROW-13199</a> - [R] add ubuntu 21.04 to nightly builds</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13200" target="_blank" rel="noopener">ARROW-13200</a> - [R] Add binding for case_when()</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13201" target="_blank" rel="noopener">ARROW-13201</a> - [R] Add binding for coalesce()</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13210" target="_blank" rel="noopener">ARROW-13210</a> - [Python][CI] Fix vcpkg caching mechanism for the macOS wheels</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13211" target="_blank" rel="noopener">ARROW-13211</a> - [C++][CI] Remove outdated Github Actions ARM builds</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13212" target="_blank" rel="noopener">ARROW-13212</a> - [Release] Support deploying to test PyPI in the python post release script</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13215" target="_blank" rel="noopener">ARROW-13215</a> - [R] [CI] Add ENV TZ to docker files</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13218" target="_blank" rel="noopener">ARROW-13218</a> - [Doc] Document/clarify conventions for timestamp storage</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13219" target="_blank" rel="noopener">ARROW-13219</a> - [C++][GLib] Demote/deprecate CompareOptions</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13224" target="_blank" rel="noopener">ARROW-13224</a> - [Python][Doc] Documentation missing for pyarrow.dataset.write_dataset</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13226" target="_blank" rel="noopener">ARROW-13226</a> - [Python] Add a general purpose cython trampolining utility</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13228" target="_blank" rel="noopener">ARROW-13228</a> - [C++] S3 CreateBucket fails because AWS treats us-east-1 differently than other regions</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13230" target="_blank" rel="noopener">ARROW-13230</a> - Add CSV Writer documentation</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13234" target="_blank" rel="noopener">ARROW-13234</a> - [C++] Add string padding option to determine which side the extra space goes on</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13235" target="_blank" rel="noopener">ARROW-13235</a> - [C++] Make type_name equal to options class name for all FunctionOptionTypes</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13236" target="_blank" rel="noopener">ARROW-13236</a> - [Python] Improve repr of pyarrow.compute.FunctionOptions</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13238" target="_blank" rel="noopener">ARROW-13238</a> - [C++][Dataset][Compute] Substitute ExecPlan impl for dataset scans</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13242" target="_blank" rel="noopener">ARROW-13242</a> - [C++] Improve decimal random generation</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13244" target="_blank" rel="noopener">ARROW-13244</a> - [C++] Add facility to get current thread id</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13258" target="_blank" rel="noopener">ARROW-13258</a> - [Python] Improve the repr of ParquetFileFragment</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13262" target="_blank" rel="noopener">ARROW-13262</a> - [R] transmute() fails after pulling data into R</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13273" target="_blank" rel="noopener">ARROW-13273</a> - [C++] Don't use .pc only in CMake paths for Requires.private</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13274" target="_blank" rel="noopener">ARROW-13274</a> - [JS] Remove Webpack</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13275" target="_blank" rel="noopener">ARROW-13275</a> - [JS] Fix perf tests</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13276" target="_blank" rel="noopener">ARROW-13276</a> - [GLib][Ruby][Flight] Add support for ListFlights</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13277" target="_blank" rel="noopener">ARROW-13277</a> - [JS] Add declaration maps</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13280" target="_blank" rel="noopener">ARROW-13280</a> - [R] Bindings for log and trig functions</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13282" target="_blank" rel="noopener">ARROW-13282</a> - [C++] Remove obsolete generated files</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13283" target="_blank" rel="noopener">ARROW-13283</a> - [Developer Tools] Support passing through memory limits in archery docker run</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13286" target="_blank" rel="noopener">ARROW-13286</a> - [CI] Require docker-compose 1.27.0 or later</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13289" target="_blank" rel="noopener">ARROW-13289</a> - [C++] Log functions don't have int kernels</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13291" target="_blank" rel="noopener">ARROW-13291</a> - [GLib][CI] Require gobject-introspection 3.4.5 or later</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13296" target="_blank" rel="noopener">ARROW-13296</a> - [C++] Provide reflection-compatible enum replacement</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13299" target="_blank" rel="noopener">ARROW-13299</a> - [JS] Upgrade ix and rxjs</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13303" target="_blank" rel="noopener">ARROW-13303</a> - [JS] Revise bundles</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13306" target="_blank" rel="noopener">ARROW-13306</a> - [Java][JDBC] use ResultSetMetaData.getColumnLabel instead of ResultSetMetaData.getColumnName</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13313" target="_blank" rel="noopener">ARROW-13313</a> - [C++][Compute] Add ScalarAggregateNode</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13320" target="_blank" rel="noopener">ARROW-13320</a> - [Website] Add MIME types to FAQ</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13323" target="_blank" rel="noopener">ARROW-13323</a> - [Archery] Validate docker compose configuration</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13343" target="_blank" rel="noopener">ARROW-13343</a> - [R] Update NEWS.md for 5.0</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13346" target="_blank" rel="noopener">ARROW-13346</a> - [C++] Remove compile time parsing from EnumType</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13355" target="_blank" rel="noopener">ARROW-13355</a> - [R] ensure that sf is installed in our revdep job</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13357" target="_blank" rel="noopener">ARROW-13357</a> - [R] bindings for sign()</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13365" target="_blank" rel="noopener">ARROW-13365</a> - [R] bindings for floor/ceiling/truncate</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13385" target="_blank" rel="noopener">ARROW-13385</a> - [C++][Compute] Document out-of-source addition to the FunctionRegistry</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13386" target="_blank" rel="noopener">ARROW-13386</a> - [R][C++] CSV streaming changes break Rtools 35 32-bit build</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13418" target="_blank" rel="noopener">ARROW-13418</a> - [R] typo in python.r</li>
<li>
<a href="https://issues.apache.org/jira/browse/ARROW-13461" target="_blank" rel="noopener">ARROW-13461</a> - [Python][Packaging] Build M1 wheels for python 3.8</li>
<li>
<a href="https://issues.apache.org/jira/browse/PARQUET-1798" target="_blank" rel="noopener">PARQUET-1798</a> - [C++] Review logic around automatic assignment of field_id's</li>
<li>
<a href="https://issues.apache.org/jira/browse/PARQUET-1998" target="_blank" rel="noopener">PARQUET-1998</a> - [C++] Implement LZ4_RAW compression</li>
<li>
<a href="https://issues.apache.org/jira/browse/PARQUET-2056" target="_blank" rel="noopener">PARQUET-2056</a> - [C++] Add ability for retrieving dictionary and indices separately for ColumnReader</li>
</ul>
</main>
<hr>
<footer class="footer">
<div class="row">
<div class="col-md-9">
<p>Apache Arrow, Arrow, Apache, the Apache logo, and the Apache Arrow project logo are either registered trademarks or trademarks of The Apache Software Foundation in the United States and other countries.</p>
<p>© 2016-2025 The Apache Software Foundation</p>
</div>
<div class="col-md-3">
<a class="d-sm-none d-md-inline pr-2" href="https://www.apache.org/events/current-event.html" target="_blank" rel="noopener">
<img src="https://www.apache.org/events/current-event-234x60.png">
</a>
</div>
</div>
</footer>
</div>
</body>
</html>