<!DOCTYPE html>
<html lang="en-US">
<head>
<meta charset="UTF-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1">
<!-- The above meta tags *must* come first in the head; any other head content must come *after* these tags -->
<title>Apache Arrow 4.0.0 Release | Apache Arrow</title>
<!-- Begin Jekyll SEO tag v2.8.0 -->
<meta name="generator" content="Jekyll v4.3.3" />
<meta property="og:title" content="Apache Arrow 4.0.0 Release" />
<meta property="og:locale" content="en_US" />
<meta name="description" content="Apache Arrow 4.0.0 (26 April 2021) This is a major release covering more than 3 months of development. Download Source Artifacts Binary Artifacts For CentOS For Debian For Python For Ubuntu Git tag Contributors This release includes 719 commits from 114 distinct contributors. $ git shortlog -sn apache-arrow-3.0.0..apache-arrow-4.0.0 65 Antoine Pitrou 47 Andrew Lamb 41 Heres, Daniel 40 David Li 37 Sutou Kouhei 33 Neal Richardson 30 Weston Pace 28 Jorge C. Leitao 26 Krisztián Szűcs 25 Ian Cook 21 Dominik Moritz 20 Andy Grove 19 Yibo Cai 18 Joris Van den Bossche 17 Neville Dipale 17 Jonathan Keane 17 Ritchie Vink 12 Mike Seddon 12 Benjamin Kietzman 11 Mauricio Vargas 10 Qingping Hou 10 Diana Clarke 8 Micah Kornfield 7 Matthew Topol 7 Dmitry Patsura 5 Projjal Chanda 5 Kenta Murata 4 Anthony Louis 4 Ximo Guanter 4 liyafan82 3 Andre Braga Reis 3 Kazuaki Ishizaki 3 Maarten A. Breddels 3 Uwe L. Korn 3 ptaylor 3 Steven Fackler 3 Sagnik Chakraborty 3 Nic Crane 2 Marc Prud&#39;hommeaux 2 Raphael Taylor-Davies 2 João Pedro 2 Yordan Pavlov 2 emkornfield 2 Max Burke 2 Florian Müller 2 Ben Chambers 2 mqy 2 Christoph Schulze 2 Manoj Karthick 2 Sathis Kumar 2 Ryan Jennings 2 Ruan Pearce-Authers 2 Tao He 2 Eric Burden 2 Tyrel Rink 2 Romain Francois 2 Rok 1 witchard 1 Adam Lippai 1 Albert Villanova del Moral 1 Alessandro Molina 1 Ali 1 Andrew Wieteska 1 Bob Tinsman 1 Brian Hulette 1 Bryan Cutler 1 Clcanny 1 Daniel Russo 1 Daniël Heres 1 Eduardo Ponce 1 Evan Chan 1 FawnD2 1 Felix Zhu 1 Fernando Herrera 1 Fiona La 1 François Saint-Jacques 1 GALI PREM SAGAR 1 Gert Hulselmans 1 Ha Thi Tham 1 Hongze Zhang 1 Ilya Biryukov 1 Ivan Smirnov 1 James Winegar 1 Joe Roberts 1 Johannes Müller 1 Jörn Horstmann 1 Mahmut Bulut 1 Marco Gorelli 1 Marko Mikulicic 1 Markus Silberstein Hont 1 Martin Nowak 1 Matt Brubeck 1 Matt Summersgill 1 Max Meldrum 1 Nathaniel Bauernfeind 1 Nga Tran 1 Nick Bruno 1 Rok Mihevc 1 Roman Karlstetter 1 Sam Albers 1 Simon Bertron 1 Szangin 1 
Truc Lam Nguyen 1 Weichen Xu 1 Ying Zhou 1 frank400 1 ivan 1 jpeeter 1 martinblostein 1 nmcdonnell-kx 1 pierwill 1 sjgupta2 1 sundy-li 1 ARF1 Patch Committers The following Apache committers merged contributed patches to the repository. $ git shortlog -csn apache-arrow-3.0.0..apache-arrow-4.0.0 157 Andrew Lamb 101 Antoine Pitrou 93 Neal Richardson 88 Krisztián Szűcs 72 Sutou Kouhei 41 David Li 30 Benjamin Kietzman 25 Neville Dipale 22 Micah Kornfield 19 Jorge C. Leitao 16 Andy Grove 14 Praveen 11 Joris Van den Bossche 9 GitHub 8 Yibo Cai 4 Uwe L. Korn 3 Sebastien Binet 2 liyafan82 1 Kenta Murata 1 Eric Erhardt 1 Chao Sun 1 Bryan Cutler Changelog Apache Arrow 4.0.0 (2021-04-26) New Features and Improvements ARROW-951 - [JS] Fix generated API documentation ARROW-2229 - [C++] Write CSV files from RecordBatch, Table ARROW-3690 - [Rust] Add Rust to the format integration testing ARROW-6103 - [Java] Stop using the maven release plugin ARROW-6248 - [Python] Use FileNotFoundError in HadoopFileSystem.open() in Python 3 ARROW-6455 - [C++] Implement ExtensionType for non-UTF8 Unicode data ARROW-6604 - [C++] Add support for nested types to MakeArrayFromScalar ARROW-7215 - [C++][Gandiva] Implement castVARCHAR(numeric_type) functions in Gandiva ARROW-7364 - [Rust] Add cast options to cast kernel ARROW-7633 - [C++][CI] Create fuzz targets for tensors and sparse tensors ARROW-7808 - [Java][Dataset] Implement Datasets Java API ARROW-7906 - [C++][Python] Full functionality for ORC format ARROW-8049 - [C++] Upgrade bundled Thrift version to 0.13.0 ARROW-8282 - [C++/Python][Dataset] Support schema evolution for integer columns ARROW-8284 - [C++][Dataset] Schema evolution for timestamp columns ARROW-8630 - [C++][Dataset] Pass schema including all materialized fields to catch CSV edge cases ARROW-8631 - [C++][Dataset] Add ConvertOptions and ReadOptions to CsvFileFormat ARROW-8658 - [C++][Dataset] Implement subtree pruning for FileSystemDataset::GetFragments ARROW-8732 - [C++] Let 
Futures support cancellation ARROW-8771 - [C++] Add boost/process library to build support ARROW-8796 - [Rust] Allow parquet to be written directly to memory ARROW-8797 - [C++] Support Flight RPC among diffent endian platforms ARROW-8900 - [C++] Respect HTTP(S)_PROXY for S3 Filesystems and/or expose proxy options as parameters ARROW-8919 - [C++] Add “DispatchBest” APIs to compute::Function that selects a kernel that may require implicit casts to invoke ARROW-9128 - [C++] Implement string space trimming kernels: trim, ltrim, and rtrim ARROW-9149 - [C++] Improve configurability of RandomArrayGenerator::ArrayOf ARROW-9196 - [C++] Make temporal casts work on Scalar inputs ARROW-9318 - [C++][Parquet] Encryption key management tools ARROW-9731 - [C++][Dataset] Port “head” method from R to C++ Dataset Scanner ARROW-9749 - [C++][Dataset] Extract format-specific scan options from FileFormat ARROW-9777 - [Rust] Implement IPC changes to catch up to 1.0.0 format ARROW-9856 - [R] Add bindings for string compute functions ARROW-10014 - [C++] TaskGroup::Finish should execute tasks ARROW-10089 - [R] inject base class for Array, ChunkedArray and Scalar ARROW-10183 - [C++] Create a ForEach library function that runs on an iterator of futures ARROW-10195 - [C++] Add string struct extract kernel using re2 ARROW-10250 - [FlightRPC][C++] Remove default constructor for FlightClientOptions ARROW-10255 - [JS] Reorganize imports and exports to be more friendly to ESM tree-shaking ARROW-10297 - [Rust] Parameter for parquet-read to output data in json format ARROW-10299 - [Rust] Support reading and writing V5 of IPC metadata ARROW-10305 - [R] Filter with regular expressions ARROW-10306 - [C++] Add string replacement kernel ARROW-10349 - [Python] Build and publish aarch64 wheels ARROW-10354 - [Rust] [DataFusion] Add support for regex extract ARROW-10360 - [CI] Bump github actions cache version ARROW-10372 - [C++][Dataset] Read compressed CSVs ARROW-10406 - [C++] Unify dictionaries when writing 
IPC file in a single shot ARROW-10420 - [C++] FileSystem::OpenInput{File,Stream} should accept a MemoryPool ARROW-10421 - [R] Feather reader/writer should accept a MemoryPool ARROW-10438 - [C++][Dataset] Partitioning::Format on nulls ARROW-10520 - [C++][R] Implement add/remove/replace for RecordBatch ARROW-10570 - [R] Use Converter API to convert SEXP to Array/ChunkedArray ARROW-10580 - [C++] When Validating, ensure DenseUnionArray offsets are increasing ARROW-10606 - [C++][Compute] Support casts to and from Decimal256 type. ARROW-10655 - [C++] Add LRU cache facility ARROW-10734 - [R] Build and test on Solaris ARROW-10735 - [R] Remove arrow-without-arrow wrapping ARROW-10766 - [Rust] Compute nested definition and repetition for list arrays ARROW-10797 - [C++] Investigate faster random generation for tests and benchmarks ARROW-10816 - [Rust] [DataFusion] Implement INTERVAL ARROW-10831 - [C++][Compute] Implemement quantile kernel ARROW-10846 - [C++] Add async filesystem operations ARROW-10880 - [Java] Support compressing RecordBatch IPC buffers by LZ4 ARROW-10882 - [Python][Dataset] Writing dataset from python iterator of record batches ARROW-10895 - [C++][Gandiva] Implement bool to varchar cast function in Gandiva ARROW-10903 - [Rust] Implement FromIter&lt;Option&lt;Vec&lt;u8&gt;&gt;&gt; constructor for FixedSizeBinaryArray ARROW-11022 - [Rust] [DataFusion] Upgrade to tokio 1.0 ARROW-11070 - [C++] Implement power / exponentiation compute kernel ARROW-11074 - [Rust][DataFusion] Implement predicate push-down for parquet tables ARROW-11081 - [Java] Make IPC option immutable ARROW-11108 - [Rust] Improve performance of MutableBuffer ARROW-11141 - [Rust]: Miri checks ARROW-11149 - [Rust] create_batch_empty - support List, LargeList ARROW-11150 - [Rust] Set up bi-weekly Rust sync call and update website ARROW-11154 - [CI][C++] Move homebrew crossbow tests off of Travis-CI ARROW-11156 - [Rust][DataFusion] Create hashes vectorized in hash join ARROW-11174 - [C++][Dataset] 
Make Expressions available for projection ARROW-11179 - [Format] Make comments in fb files friendly to rust doc ARROW-11183 - [Rust] [Parquet] LogicalType::TIMESTAMP_NANOS missing ARROW-11191 - [C++] Use FnOnce for TaskGroup’s tasks instead of std::function ARROW-11216 - [Rust] Improve documentation for StringDictionaryBuilder ARROW-11220 - [Rust] DF Implement GROUP BY support for Boolean ARROW-11222 - [Rust] [Arrow] catch up with flatbuffers 0.8.1 ARROW-11246 - DF - Add type to Unexpected accumulator state message ARROW-11254 - [Rust][DataFusion] Add SIMD and snmalloc flags as options to benchmarks ARROW-11260 - [C++][Dataset] Don’t require dictionaries for reading dataset with schema-based Partitioning ARROW-11265 - [Rust] Made bool not convertable to bytes ARROW-11268 - [Rust][DataFusion] Support specifying repartitions in MemTable ARROW-11270 - [Rust] Use slices for simple array data buffer access ARROW-11279 - [Rust][Parquet] ArrowWriter Definition Levels Memory Usage ARROW-11284 - [R] Support dplyr verb transmute() ARROW-11289 - [Rust] [DataFusion] Support GROUP BY for Dictionary columns ARROW-11290 - [Rust][DataFusion] Address hash aggregate performance with high number of groups ARROW-11291 - [Rust] implement extend for MutableBuffer (from iterator) ARROW-11300 - [Rust][DataFusion] Improve hash aggregate performance with large number of groups in ARROW-11308 - [Rust] [Parquet] Add Arrow decimal array writer ARROW-11309 - [Release][C#] Use .NET 3.1 for verification ARROW-11310 - [Rust] Implement arrow JSON writer ARROW-11314 - [Release][APT][Yum] Add support for verifying arm64 packages ARROW-11317 - [Rust] Test the prettyprint feature in CI ARROW-11318 - [Rust] Support pretty printing timestamp, date, and time types ARROW-11319 - [Rust] [DataFusion] Improve test comparisons to record batch ARROW-11321 - [Rust][DataFusion] Fix DataFusion compilation error ARROW-11325 - [Packaging][C#] Release Apache.Arrow.Flight and Apache.Arrow.Flight.AspNetCore ARROW-11329 
- [Rust] Do not rebuild the library on every change ARROW-11330 - [Rust][DataFusion] Add ExpressionVisitor pattern ARROW-11332 - [Rust] Use MutableBuffer in take_string instead of Vec ARROW-11333 - [Rust] Suport creating arbitrary nested empty arrays ARROW-11336 - [C++][Doc] Improve Developing on Windows docs ARROW-11338 - [R] Bindings for quantile and median ARROW-11340 - [C++] Add vcpkg.json manifest to cpp project root ARROW-11343 - [DataFusion] Simplified example ARROW-11346 - [C++][Compute] Implement quantile kernel benchmark ARROW-11349 - [Rust] Add from_iter_values to create arrays from T instead of Option&lt;T&gt; ARROW-11350 - [C++] Bump dependency versions ARROW-11354 - [Rust] Speed-up casts of dates and times ARROW-11355 - [Rust] Align Date type with spec ARROW-11358 - [Rust] Add benchmark for concatenating small arrays ARROW-11360 - [Rust] [DataFusion] Improve CSV “No files found” error message ARROW-11361 - [Rust] Build buffers from iterator of booleans ARROW-11362 - [Rust][DataFusion] Use iterator APIs in to_array_of_size to improve performance ARROW-11365 - [Rust] [Parquet] Implement parsers for v2 of the text schema ARROW-11366 - [Rust][DataFusion] Add Constant Folding / Support boolean literal in equality expression ARROW-11367 - [C++] Implement approximante quantile utility ARROW-11369 - [DataFusion] Split expressions.rs ARROW-11372 - Support RC verification on macOS-ARM64 ARROW-11373 - [Python][Docs] Add example of specifying type for a column when reading csv file ARROW-11374 - [Python] Make legacy pyarrow.filesystem / pyarrow.serialize warnings more visisble ARROW-11375 - [Rust] CI fails due to deprecation warning in clippy ARROW-11377 - [C++][CI] Add ThreadSanitizer nightly build ARROW-11383 - [Rust] use trusted len on bit ops ARROW-11386 - [Release] Fix post documents update script ARROW-11389 - [Rust] Inconsistent comments for datatypes ARROW-11395 - [DataFusion] Support custom optimizations ARROW-11401 - [Rust][DataFusion] Pass slices 
instead of Vec in DataFrame API ARROW-11404 - [Rust][DataFusion] Upgrade to aHash 0.7 ARROW-11405 - [DataFusion] Support multiple custom nodes ARROW-11406 - [CI][C++] Fix caching on Travis-CI builds ARROW-11408 - Add window support to datafusion readme ARROW-11411 - [Packaging][Linux] Disable arm64 nightly builds ARROW-11414 - [Rust] Reduce copies in Schema::try_merge ARROW-11417 - [Integration] Add integration test for buffer compression ARROW-11418 - [Doc] Add IPC buffer compression to support matrix ARROW-11421 - [Rust][DataFusion] Support group by Date32 ARROW-11422 - [C#] Add support for decimals ARROW-11423 - [R] value_counts and some StructArray methods ARROW-11425 - [C++][Compute] Improve quantile kernel for integers ARROW-11426 - [Rust][DataFusion] EXTRACT support ARROW-11428 - [Rust] Add power kernel ARROW-11429 - Make string comparisson kernels generic over Utf8 and LargeUtf8 ARROW-11430 - [Rust] Kernel to combine two arrays based on boolean mask ARROW-11431 - [Rust] [DataFusion] Add support for the SQL HAVING clause ARROW-11435 - Allow creating ParquetPartition from external crate ARROW-11436 - [Rust] Allow non-sized iterators in Primitive::from_iter ARROW-11437 - [Rust] Simplify benches ARROW-11438 - Unsupported ast node Value(Boolean(true)) in sqltorel ARROW-11439 - [Rust] Add year support to temporal kernel ARROW-11440 - [Rust] [DataFusion] Add method to CsvExec to get CSV schema ARROW-11442 - [Rust] Expose the logic used to interpret date/times ARROW-11443 - [Rust] Write datetime information for Date64 Type in csv writer ARROW-11444 - [Rust][DataFusion] Pass slices instead of &amp;Vec to functions ARROW-11446 - [DataFusion] Support scalars in builtin functions ARROW-11447 - [Rust] Add shift kernel ARROW-11449 - [CI][R][Windows] Use ccache ARROW-11457 - [Rust] Make string comparisson kernels generic over Utf8 and LargeUtf8 ARROW-11459 - [Rust] Allow ListArray of primitives to be built from iterator ARROW-11462 - [Developer] Remove needless quote from 
the default DOCKER_VOLUME_PREFIX ARROW-11463 - [Python] Allow configuration of IpcWriterOptions 64Bit from PyArrow ARROW-11466 - [Flight][Go] Add BasicAuth and BearerToken handlers for Go ARROW-11467 - [R] Fix reference to json_table_reader() in R docs ARROW-11468 - [R] Allow user to pass schema to read_json_arrow() ARROW-11474 - [C++] Update bundled re2 version ARROW-11476 - [Rust][DataFusion] Test running of TPCH benchmarks in CI ARROW-11477 - [R][Doc] Reorganize and improve README and vignette content ARROW-11478 - [R] Consider ways to make arrow.skip_nul option more user-friendly ARROW-11479 - [Rust][Parquet] Add method to return compressed size of row group ARROW-11481 - [Rust] More cast implementations ARROW-11484 - [Rust] Derive Clone for ExecutionContext ARROW-11486 - [Website] Use Jekyll 4 and webpack to support Ruby 3.0 or later ARROW-11489 - [Rust][DataFusion] Make DataFrame Send+Sync ARROW-11491 - [Rust] Support json schema inference for nested list and struct ARROW-11493 - [CI][Packaging][deb][RPM] Test built packages ARROW-11500 - [R] Allow bundled build script to run on Solaris ARROW-11501 - [C++] endianness check does not work on Solaris ARROW-11504 - [Rust] verify Datatype in ListArray::from(ArrayDataRef) ARROW-11505 - [Rust] Add support for LargeUtf8 in csv-writer ARROW-11507 - [R] Bindings for GetRuntimeInfo ARROW-11510 - [Python] Add note that pip &gt;= 19.0 is required to get binary packages ARROW-11511 - [Rust] Replace Arc&lt;ArrayData&gt; by ArrayData ARROW-11512 - [Packaging][deb] Add missing gRPC dependency for Ubuntu 21.04 ARROW-11513 - [R] Bindings for sub/gsub ARROW-11516 - [R] Allow all C++ compute functions to be called by name in dplyr ARROW-11539 - [Developer][Archery] Change items_per_seconds units ARROW-11541 - [C++][Compute] Implement approximate quantile kernel ARROW-11542 - [Rust] json reader should not crash when reading nested list ARROW-11544 - [Rust] [DataFusion] Implement as_any for AggregateExpr ARROW-11545 - [Rust] 
[DataFusion] SendableRecordBatchStream should implement Sync ARROW-11556 - [C++] Minor benchmark improvements ARROW-11557 - [Rust] Add table de-registration to DataFusion ExecutionContext ARROW-11559 - [C++] Improve flatbuffers verification limits ARROW-11561 - [Rust][DataFusion] Add Send + Sync to MemTable::load ARROW-11563 - [Rust] Support Cast(Utf8, TimeStamp(Nanoseconds, None)) ARROW-11568 - [C++][Compute] Mode kernel performance is bad in some conditions ARROW-11570 - [Rust] ScalarValue - support Date64 ARROW-11571 - [CI] Cancel stale Github Actions workflow runs ARROW-11572 - [Rust] Add a kernel for division by single scalar ARROW-11573 - [Developer][Archery] Google benchmark now reports run type ARROW-11574 - [Rust][DataFusion] Upgrade sqlparser to 0.8 to support parsing all TPC-H queries ARROW-11575 - [Developer][Archery] Expose execution time in benchmark results ARROW-11576 - [Rust] Remove unused variable in example ARROW-11580 - [C++] Add CMake option ARROW_DEPENDENCY_SOURCE=VCPKG ARROW-11589 - [R] Add methods for modifying Schemas ARROW-11590 - [C++] Move CSV background generator to IO thread pool ARROW-11591 - [C++][Compute] Prototype version of hash aggregation ARROW-11592 - [Rust] Typo in comment ARROW-11594 - [Rust] Support pretty printing with NullArrays ARROW-11597 - [Rust] Split datatypes in a module ARROW-11598 - [Rust] Split buffer.rs in smaller files ARROW-11599 - [Rust] Add function to create array with all nulls ARROW-11601 - [C++][Dataset] Expose pre-buffering in ParquetFileFormatReaderOptions ARROW-11606 - [Rust] [DataFusion] Need guidance on HashAggregateExec reconstruction ARROW-11610 - [C++] Download boost from sourceforge instead of bintray ARROW-11612 - [C++] Rebuild trimmed boost bundle for 1.75.0 ARROW-11613 - [R] Move nightly C++ builds off of bintray ARROW-11616 - [Rust][DataFusion] Expose collect_partitioned for DataFrame ARROW-11621 - [CI][Gandiva][Linux] Fix Crossbow 
setup failure ARROW-11626 - [Rust][DataFusion] Move DataFusion examples to own project to reduce nr dependencies ARROW-11627 - [Rust] Typed allocator ARROW-11637 - [CI][Conda] Update nightly clean target platforms and packages list ARROW-11641 - [CI] Use docker buildkit’s inline cache to reuse build cache across different hosts ARROW-11649 - [R] Add support for null_fallback to R ARROW-11651 - [Rust][DataFusion] Implement Postgres Length Functions ARROW-11653 - Ascii/unicode functions ARROW-11655 - Pad/trim functions ARROW-11656 - Left over functions/fixes ARROW-11659 - [R] Preserve group_by .drop argument ARROW-11662 - [C++] Support sorting for decimal data type. ARROW-11664 - [Rust] Cast to LargeUtf8 ARROW-11665 - [Python] Document precision and scale parameters of decimal128() ARROW-11666 - [Integration] Add endianness “gold” integration file for decimal256 ARROW-11667 - [Rust] Add docs for utf8 comparison functions ARROW-11669 - [Rust] [DataFusion] Remove concurrency field from GlobalLimitExec ARROW-11671 - [Rust][DataFusion] Clean up docs on Expr ARROW-11677 - [C++][Dataset] Write documentation ARROW-11680 - [C++] Add vendored version of folly’s spsc queue ARROW-11683 - [R] Support dplyr::mutate() ARROW-11685 - [C++] Typo in future_test.cc ARROW-11688 - [Rust] Casts between utf8 and large-utf8 ARROW-11690 - [Rust][DataFusion] Avoid Expr::clone in Expr builder methods ARROW-11692 - [Rust][DataFusion] Improve documentation on Optimizer ARROW-11693 - [C++] Add string length kernel ARROW-11700 - [R] Internationalize error handling in tidy eval ARROW-11701 - [R] Implement dplyr::relocate() ARROW-11703 - [R] Implement dplyr::arrange() ARROW-11704 - [R] Wire up dplyr::mutate() for datasets ARROW-11707 - Support CSV schema inference without IO ARROW-11708 - Clean up Rust 2021 linting warning ARROW-11709 - [Rust][DataFusion] Move `expressions` and `inputs` into LogicalPlan rather than helpers in util ARROW-11710 - [Rust][DataFusion] Implement ExprRewriter to avoid tree 
traversal redundancy ARROW-11719 - Support merged schema for memory table ARROW-11721 - json schema inference should return Schema type instead of SchemaRef ARROW-11722 - Improve error message in FFI ARROW-11724 - [C++] Namespace collisions with protobuf 3.15 ARROW-11725 - [Rust][DataFusion] Make use of the new divide_scalar kernel in arrow ARROW-11727 - [C++][FlightRPC] Use TDigest to estimate latency quantiles in benchmark ARROW-11730 - [C++] Add implicit Future(Status) constructor for convenience ARROW-11733 - [Rust][DataFusion] Support hash repartitioning ARROW-11734 - [C++] vendored safe-math.h does not compile on Solaris ARROW-11735 - [R] Allow Parquet and Arrow Dataset to be optional components ARROW-11736 - [R] Allow string compute functions to be optional ARROW-11737 - [C++] Patch vendored xxhash for Solaris ARROW-11738 - [Rust][DataFusion] Concat Functions ARROW-11740 - [C++] posix_memalign not declared in scope on Solaris ARROW-11742 - [Rust] [DataFusion] Add Expr::is_null and Expr::is_not_null functions ARROW-11744 - [C++] Add xsimd dependency ARROW-11745 - [C++] Improve configurability of random data generation ARROW-11750 - [Python][Dataset] Add support for project expressions ARROW-11752 - [R] Replace usage of testthat::expect_is() ARROW-11753 - [Rust][DataFusion] Add test for Join Statement: Schema contains duplicate unqualified field name ARROW-11754 - [R] Support dplyr::compute() ARROW-11761 - [C++] Increase public API testing ARROW-11766 - [R] Better handling for missing compression codecs on Linux ARROW-11768 - [C++][CI] Make s390x build non-optional ARROW-11773 - [Rust] Allow json writer to write out JSON arrays as well as newline formatted objects ARROW-11774 - [R] one-line install from source on macOS ARROW-11775 - [Rust][DataFusion] Feature Flags for Dependencies ARROW-11777 - [Rust] impl AsRef for StringBuilder/BinaryBuilder ARROW-11778 - Cast from large-utf8 to numerical arrays ARROW-11779 - [Rust] make alloc module public ARROW-11790 - 
[Rust][DataFusion] Change plan builder signature to take Vec&lt;Expr&gt; rather than &amp;[Expr] ARROW-11794 - [Go] Add concurrent-safe ipc.FileReader.RecordAt(i) ARROW-11795 - [MATLAB] Migrate MATLAB Interface for Apache Arrow design doc to Markdown ARROW-11797 - [C++][Dataset] Provide Scanner methods to yield/visit scanned batches ARROW-11798 - [Integration] Update testing submodule ARROW-11799 - [Rust] String and Binary arrays created with incorrect length from unbound iterator ARROW-11801 - [C++] Remove bad header guard in filesystem/type_fwd.h ARROW-11803 - [Rust] [Parquet] Support v2 LogicalType ARROW-11806 - [Rust][DataFusion] Optimize inner join creation of indices ARROW-11820 - Added macro create_native to construct impl ARROW-11822 - Support case sensitive for function ARROW-11824 - [Rust] [Parquet] Use logical types in Arrow writer ARROW-11825 - [Rust][DataFusion] Add mimalloc as option to benchmarks ARROW-11833 - [C++] Vendored fast_float errors for emscripten (architecture flag missing) ARROW-11837 - [C++][Dataset] Expose originating fragment as a property of ScanTask ARROW-11838 - [C++] Support reading IPC data with shared dictionaries ARROW-11839 - [C++] Rewrite bit-unpacking optimizations using xsimd ARROW-11842 - [Rust][Parquet] Use more efficient clone_from in get_batch_with_dict ARROW-11852 - [Documentation] Update CONTRIBUTING to explain Contributor role ARROW-11856 - [C++] Remove unused reference to RecordBatchStreamWriter ARROW-11858 - [GLib] Gandiva Filter in GLib ARROW-11859 - [GLib] GArrowArray: concatenate is missing ARROW-11864 - [R] Document arrow.int64_downcast option ARROW-11870 - [Dev] Automatically run merge script in venv ARROW-11876 - [Website] Update governance page ARROW-11877 - [C++] Add initial microbenchmarks for Dataset internals ARROW-11879 - [Rust][DataFusion] ExecutionContext::sql should optimize query plan ARROW-11883 - [C++] Add ConcatMap, MergeMap, and an async-reentrant version of Map ARROW-11887 - [C++] Add 
asynchronous read to streaming CSV reader ARROW-11894 - [Rust][DataFusion] Change flight server example to use DataFrame API ARROW-11895 - [Rust][DataFusion] Add support for extra column statistics ARROW-11898 - [Rust] Pretty print columns ARROW-11899 - [Java] Refactor the compression codec implementation into core/Arrow specific parts ARROW-11900 - [Website] Add Yibo to committer list ARROW-11906 - [R] Make FeatherReader print method more informative ARROW-11907 - [C++] Use our own executor in S3FileSystem ARROW-11910 - [Packaging][Ubuntu] Drop support for 16.04 ARROW-11911 - [Website] Add protobuf vs arrow to FAQ ARROW-11912 - [R] Remove args from FeatherReader$create ARROW-11913 - [Rust] Improve performance of StringBuilder ARROW-11920 - [R] Add r/libarrow to make clean ARROW-11921 - [R] Set LC_COLLATE in r/data-raw/codegen.R ARROW-11924 - [C++] Provide streaming output from GetFileInfo ARROW-11925 - [R] Add `between` method for arrow_dplyr_query ARROW-11927 - [Rust][DataFusion] Support limit push down ARROW-11931 - [Go][CI] Bump CI to use Go 1.15 ARROW-11935 - [C++] Add push generator ARROW-11944 - [Developer] Achery benchmark diff regression: cannot compare jsons ARROW-11949 - [Ruby] Accept raw Ruby objects as sort key and options ARROW-11951 - [Rust] Remove OffsetSize::prefix ARROW-11952 - [Rust] Make ArrayData –&gt; GenericListArray fallable instead of `panic!` ARROW-11954 - [C++] arrow/util/io_util.cc does not compile on Solaris ARROW-11955 - [Rust][DataFusion] Support Union ARROW-11958 - [GLib] GArrowChunkedArray: combine is missing ARROW-11959 - [Rust][DataFusion] Fix logging of optimized plan ARROW-11962 - [Rust][DataFusion] Update Datafusion Docs / readme ARROW-11969 - [Rust][DataFusion] Improve Examples in documentation ARROW-11972 - [C++][Dataset] Extract IpcFragmentScanOptions, ParquetFragmentScanOptions ARROW-11973 - [Rust] Boolean AND/OR kernels should follow sql behaviour regarding null values ARROW-11977 - [Rust] Add documentation examples for 
sort kernel ARROW-11982 - [Rust] Donate Ballista Distributed Compute Platform ARROW-11984 - [C++][Gandiva] Implement SHA1 and SHA256 functions ARROW-11987 - [C++][Gandiva] Implement trigonometric functions on Gandiva ARROW-11988 - [C++][Gandiva] Implements the last_day function ARROW-11992 - [Rust][Parquet] Add upgrade notes on 4.0 rename of LogicalType #9731 ARROW-11993 - [C++] Don’t download xsimd if ARROW_SIMD_LEVEL=NONE ARROW-11996 - [R] Make r/configure run successfully on Solaris ARROW-11999 - [Java] Support parallel vector element search with user-specified comparator ARROW-12000 - [Documentation] Add note about deviation from style guide on struct/classes ARROW-12005 - [R] Fix a bash typo in configure ARROW-12017 - [R] [Documentation] Make proper developing arrow docs ARROW-12019 - [Rust] [Parquet] Update README for 2.6.0 support ARROW-12020 - [Rust][DataFusion] Adding SHOW TABLES and SHOW COLUMNS + partial information_schema support to DataFusion ARROW-12031 - [C++][CSV] infer CSV timestamps columns with fractional seconds ARROW-12032 - [Rust] Optimize comparison kernels using trusted_len iterator for bools ARROW-12034 - [Docs] Formalize Minor PRs ARROW-12037 - [Rust] [DataFusion] Support catalogs and schemas for table namespacing ARROW-12038 - [Rust][DataFusion] Upgrade hashbrown to 0.11 ARROW-12039 - [CI][C++][Gandiva] Fix gandiva nightly linux build failure ARROW-12040 - [R] [CI] [C++] test-r-rstudio-r-base-3.6-opensuse15 timing out during tests ARROW-12043 - [Rust] [Parquet] Write fixed size binary arrays ARROW-12045 - First Chunk of ported Parquet Code ARROW-12047 - [Rust] Clippy parquet ARROW-12048 - [Rust][DataFusion] Support Common Table Expressions ARROW-12052 - [Rust] Implement child data in C FFI ARROW-12056 - [C++] Create sequencing AsyncGenerator ARROW-12058 - [Python] Enable arithmetic operations on Expressions ARROW-12068 - [Python] Stop using distutils ARROW-12069 - [C++][Gandiva]Implement IN expressions for Decimal types ARROW-12070 - 
[GLib] Drop support for GNU Autotools ARROW-12071 - [GLib] Keep input stream reference of GArrowJSONReader ARROW-12075 - [Rust][DataFusion] Add CTE to list of supported features ARROW-12081 - [R] Bindings for utf8_length ARROW-12082 - [R][Dataset] Allow create dataset from vector of file paths ARROW-12094 - [C++][R] Fix/workaround re2 building on clang/libc++ ARROW-12097 - [C++] Modify BackgroundGenerator so it creates fewer threads ARROW-12098 - [R] Catch cpp build failures on linux ARROW-12104 - Next Chunk of ported Code ARROW-12106 - [Rust][DataFusion] Support `SELECT * from information_schema.tables` ARROW-12107 - [Rust][DataFusion] Support `SELECT * from information_schema.columns` ARROW-12108 - [Rust][DataFusion] Support `SHOW TABLES` ARROW-12109 - [Rust][DataFusion] Support `SHOW COLUMNS` ARROW-12110 - [Java] Implement ZSTD buffer compression for java ARROW-12111 - [Java] place files generated by flatc under source control ARROW-12116 - [Rust] Fix or ignore 1.51 clippy lints ARROW-12119 - [Rust][DataFusion] Improve performance of to_array_of_size ARROW-12120 - [Rust] Generate random arrays and batches ARROW-12121 - [Rust] [Parquet] Arrow writer benchmarks ARROW-12123 - [Rust][DataFusion] Use smallvec for indices for better join performance ARROW-12128 - [CI][Crossbow] Remove (or fix) test-ubuntu-16.04-cpp job ARROW-12131 - [CI][GLib] Ensure upgrading MSYS2 ARROW-12133 - [C++][Gandiva] Add option to disable setting mcpu flag to host cpu during llvm ir compilation ARROW-12134 - [C++] Add regex string match kernel ARROW-12136 - [Rust][DataFusion] Reduce default batch_size to 8192 ARROW-12139 - [Python][Packaging] Use vcpkg to build macOS wheels ARROW-12141 - [R] Bindings for grepl ARROW-12143 - [CI] R builds should timeout and fail after some threshold and dump the output. 
ARROW-12146 - [C++][Gandiva] Implement CONVERT_FROM(expression, ‘UTF8’, replacement char) function ARROW-12151 - [Docs] Add Jira component + summary conventions to the docs ARROW-12153 - [Rust] [Parquet] Return file metadata after writing Parquet file ARROW-12160 - [Rust] Add an `into_inner()` method to ipc::writer::StreamWriter ARROW-12164 - [Java] Make BaseAllocator.Config public ARROW-12165 - [Rust] Inline append functions in builders for performance ARROW-12168 - [Go][IPC] Implement Compression handling for IPC ARROW-12170 - [Rust][DataFusion] Introduce repartition optimization ARROW-12173 - [GLib] Remove #include &lt;config.h&gt; ARROW-12176 - parquet/low-level-api/reader-writer.cc has some typos. ARROW-12187 - [C++][FlightRPC] Enable compression in Flight benchmark ARROW-12188 - [Docs] Switch to pydata-sphinx-theme for the main sphinx docs ARROW-12190 - [Rust][DataFusion] Implement partitioned hash join ARROW-12192 - [Website] Use downloadable URL for archive download ARROW-12193 - [Dev][Release] Use downloadable URL for archive download ARROW-12194 - [Rust] [Parquet] Update zstd version ARROW-12197 - [R] dplyr bindings for cast, dictionary_encode ARROW-12200 - [R] Export and document list_compute_functions ARROW-12204 - [Rust][CI] Reduce size of rust build artifacts in integration test ARROW-12206 - [Python] Fix Table docstrings ARROW-12208 - [C++] Add the ability to run async tasks without using the CPU thread pool ARROW-12210 - [Rust][DataFusion] Document SHOW TABLES / SHOW COLUMNS / InformationSchema ARROW-12214 - [Rust][DataFusion] Add some tests for limit ARROW-12215 - [C++] fixed size binary columns cannot be null in CSV reader ARROW-12217 - [C++] Cleanup cpp examples source file names ARROW-12222 - [Dev][Packaging] Include build url in the crossbow console report ARROW-12224 - [Rust] Use stable rust for no default test, clean up CI tests ARROW-12228 - [CI] Create base image for conda environments ARROW-12236 - [R][CI] Add check that all docs pages are 
listed in _pkgdown.yml ARROW-12237 - [Packaging][Debian] Add support for bulleye ARROW-12238 - [JS] Remove trailing spaces ARROW-12239 - [JS] Switch to yarn ARROW-12242 - [Python][Doc] Tweak nightly build instructions ARROW-12246 - [CI] Sync conda recipes with upstream feedstock ARROW-12248 - [C++] Allow static builds to change memory allocators ARROW-12249 - [R] [CI] Fix test-r-install-local nightlies ARROW-12251 - [Rust] [Ballista] Add Ballista tests to CI ARROW-12263 - [Dev][Packaging] Move Crossbow to Archery ARROW-12269 - [JS] Move to eslint ARROW-12274 - [JS] Document how to run tests without building ARROW-12277 - [Rust][DataFusion] Min/Max are not supported for timestamp types ARROW-12278 - [Rust][DataFusion]Use Timestamp(Nanosecond, None) for SQL TIMESTAMP Type ARROW-12280 - [Developer] Remove @-mentions from commit messages in merge tool ARROW-12281 - [JS] Remove shx, trash, and rimraf ARROW-12283 - [R] Bindings for basic type convert functions in dplyr verbs ARROW-12286 - [C++] Create AsyncGenerator from Future&lt;AsyncGenerator&lt;T&gt;&gt; ARROW-12287 - [C++] Create enumerating generator ARROW-12288 - [C++] Create Scanner interface ARROW-12289 - [C++] Create basic AsyncScanner implementation ARROW-12303 - [JS] Use iterators instead of generators in critical code paths ARROW-12304 - [R] Update news and polish docs for 4.0 ARROW-12305 - [JS] Benchmark test data generate.py assumes python 2 ARROW-12309 - [JS] Make es2015 bundles the default ARROW-12316 - [C++] Switch default memory allocator from jemalloc to mimalloc on macOS ARROW-12317 - [Rust] JSON writer does not support time, date or interval types ARROW-12320 - [CI] REPO arg missing from conda-cpp-valgrind ARROW-12323 - [C++][Gandiva] Implement castTIME(timestamp) function ARROW-12325 - [C++] [CI] Nightly gandiva build failing due to failure of compiler to move return value ARROW-12326 - [C++] Avoid needless c-ares detection ARROW-12328 - [Rust] [Ballista] Fix code formatting ARROW-12329 - [Rust] 
[Ballista] Add README ARROW-12332 - [Rust] [Ballista] Api server for scheduler ARROW-12333 - [JS] Remove jest-environment-node-debug and do not emit from typescript by default ARROW-12335 - [Rust] [Ballista] Bump DataFusion version ARROW-12337 - add DoubleEndedIterator and ExactSizeIterator traits ARROW-12351 - [CI][Ruby] Use ruby/setup-ruby instead of actions/setup-ruby ARROW-12352 - [CI][R][Windows] Remove needless workaround for MSYS2 ARROW-12353 - [Packaging][deb] Rename -archive-keyring to -apt-source ARROW-12354 - [Packaging][RPM] Use apache.jfrog.io/artifactory/ instead of apache.bintray.com/ ARROW-12356 - [Website] Update install page instructions to point to artifactory ARROW-12361 - [Rust] [DataFusion] Allow users to override physical optimization rules ARROW-12367 - [C++] Stop producing when PushGenerator was destroyed ARROW-12370 - [R] Bindings for power kernel ARROW-12374 - [CI][C++][cron] Use Ubuntu 20.04 instead of 16.04 ARROW-12375 - [Release] Remove rebase post-release scripts ARROW-12376 - [Dev] archery trigger-bot should use logger.exception ARROW-12380 - [Rust][Ballista] Add scheduler ui ARROW-12381 - [Packaging][Python] macOS wheels are built with wrong package kind ARROW-12383 - [JS] Update direct deps ARROW-12384 - [JS] Improve code style ARROW-12389 - [R] [Docs] Add note about autocasting ARROW-12395 - [C++]: Create RunInSerialExecutor benchmark ARROW-12396 - [Python][Docs] Clarify serialization docstrings about deprecated status ARROW-12397 - [Rust] [DataFusion] Simplify readme example #10038 ARROW-12398 - [Rust] Remove double bound checks in iterators ARROW-12400 - [Rust] Re-enable transform module tests ARROW-12402 - [Rust] [DataFusion] Implement SQL metrics framework ARROW-12406 - [R] fix checkbashims violation in configure ARROW-12409 - [R] Remove LazyData from DESCRIPTION ARROW-12419 - [Java] flatc is not used in mvn ARROW-12420 - [C++/Dataset] Reading null columns as dictionary not longer possible ARROW-12423 - [Docs] Codecov badge in 
main Readme only applies to Rust ARROW-12425 - [Rust] new_null_array doesn’t allocate keys buffer for dictionary arrays ARROW-12432 - [Rust] [DataFusion] Add metrics for SortExec ARROW-12436 - [Rust][Ballista] Add watch capabilities to config backend trait ARROW-12467 - [C++][Gandiva] Add support for LLVM12 ARROW-12477 - [Release] Download linux aarch64 miniforge in verify-release-candidate.sh ARROW-12485 - [C++] Use mimalloc as the default memory allocator on macOS ARROW-12488 - [GLib] Use g_memdup2() with GLib 2.68 or later ARROW-12494 - [C++] ORC adapter fails to compile on GCC 4.8 PARQUET-1846 - [C++] Remove deprecated IO classes and related functions PARQUET-1899 - [C++] Deprecated ReadBatchSpaced in parquet/column_reader PARQUET-1990 - [C++] ConvertedType::NA is written out in some cases PARQUET-1993 - [C++] Expose when prefetching completes PARQUET-1998 - [C++] Implement LZ4_RAW compression Bug Fixes ARROW-4784 - [C++][CI] Re-enable flaky mingw tests. ARROW-6818 - [Doc] Format docs confusing ARROW-7288 - [C++][R] read_parquet() freezes on Windows with Japanese locale ARROW-7830 - [C++] Parquet library version doesn’t change with releases ARROW-9451 - [Python] Unsigned integer types will accept string values in pyarrow.array ARROW-9634 - [C++][Python] Restore non-UTC time zones when reading Parquet file that was previously Arrow ARROW-9878 - [Python] table to_pandas self_destruct=True + split_blocks=True cannot prevent doubling memory ARROW-10038 - [C++] SetCpuThreadPoolCapacity(1) spins up nCPUs threads ARROW-10056 - [C++] Increase flatbuffers max_tables parameter in order to read wide tables ARROW-10364 - [Dev][Archery] Test is failed with semver 2.13.0 ARROW-10370 - [Python] Spurious s3fs-related test failures ARROW-10403 - [C++] Implement unique kernel for dictionary type ARROW-10405 - [C++] IsIn kernel should be able to lookup dictionary in string ARROW-10457 - [CI] Fix Spark branch-3.0 integration tests ARROW-10489 - [C++] Unable to configure or make 
with intel compiler ARROW-10514 - [C++][Parquet] Data inconsistency in parquet-reader output modes ARROW-10953 - [R] Validate when creating Table with schema ARROW-11066 - [Java] Is there a bug in flight AddWritableBuffer ARROW-11134 - [C++][CI] ARM64 job on Travis-CI doesn’t run tests ARROW-11147 - [Python][CI] Parquet tests failing in nightly build with Dask master ARROW-11180 - [Developer] cmake-format pre-commit hook doesn’t run ARROW-11192 - [Documentation] Describe opening Visual Studio so it inherits a working env ARROW-11223 - [Java] BaseVariableWidthVector/BaseLargeVariableWidthVector setNull and getBufferSizeFor is buggy ARROW-11235 - [Python] S3 test failures inside non-default regions ARROW-11239 - [Rust] array::transform::tests::test_struct failed ARROW-11269 - [Rust] Unable to read Parquet file because of mismatch in column-derived and embedded schemas ARROW-11277 - [C++] Fix compilation error in dataset expressions on macOS 10.11 ARROW-11299 - [Python] build warning in python ARROW-11303 - [Release][C++] Enable mimalloc in the windows verification script ARROW-11305 - [Rust]: parquet-rowcount binary tries to open itself as a parquet file ARROW-11311 - [Rust] unset_bit is toggling bits, not unsetting them ARROW-11313 - [Rust] Size hint of iterators is incorrect ARROW-11315 - [Packaging][APT][arm64] Add missing gir1.2 files ARROW-11320 - [C++] Spurious test failure when creating temporary dir ARROW-11322 - [Rust] Arrow `memory` made private is a breaking API change ARROW-11323 - [Rust][DataFusion] ComputeError(“concat requires input of at least one array”)) with queries with ORDER BY or GROUP BY that return no ARROW-11328 - [R] Collecting zero columns from a dataset returns entire dataset ARROW-11334 
- [Python][CI] Nightly pandas builds failing because of internal pandas change ARROW-11337 - [C++] Compilation error with ThreadSanitizer ARROW-11357 - [Rust] take primitive implementation is unsound ARROW-11376 - [C++] ThreadedTaskGroup failure with Thread Sanitizer enabled ARROW-11379 - [C++][Dataset] Reading dataset with filtering on timestamp partition field crashes ARROW-11387 - [Rust] Arrow 3.0.0 release with simd feature doesn’t compile without feature=avx512. ARROW-11391 - [C++] HdfsOutputStream::Write unsafely truncates integers exceeding INT32_MAX ARROW-11394 - [Rust] Slice + Concat incorrect for structs ARROW-11400 - [Python] Pickled ParquetFileFragment has invalid partition_expresion with dictionary type in pyarrow 2.0 ARROW-11403 - [Developer] archery benchmark list: unexpected keyword ‘benchmark_filter’ ARROW-11412 - [Python] Expressions not working with logical boolean operators (and, or, not) ARROW-11427 - [C++] Arrow uses AVX512 instructions even when not supported by the OS ARROW-11448 - [C++] tdigest build failure on Windows with Visual Studio ARROW-11451 - [C++] Fix gcc-4.8 build error ARROW-11452 - [Rust] Parquet reader cannot read file where a struct column has the same name as struct member columns ARROW-11461 - [Flight][Go] GetSchema does not work with Java Flight Server ARROW-11464 - [Python] pyarrow.parquet.read_pandas doesn’t conform to its docs ARROW-11470 - [C++] Overflow occurs on integer multiplications in ComputeRowMajorStrides, ComputeColumnMajorStrides, and CheckTensorStridesValidity ARROW-11472 - [Python][CI] Kartothek integrations build is failing with numpy 1.20 ARROW-11480 - [Python] Segmentation fault reading parquet with date filter with INT96 column ARROW-11483 - [Java][C++][Integration] C++ integration test creates JSON files incompatible with Java 
ARROW-11488 - [Rust]: StructBuilder’s Drop impl leaks memory ARROW-11490 - [C++] BM_ArrowBinaryDict/EncodeLowLevel is not deterministic ARROW-11494 - [Rust] Fix take bench ARROW-11497 - [Python] pyarrow parquet writer for list does not conform with Apache Parquet specification ARROW-11538 - [Python] Segfault reading Parquet dataset with Timestamp filter ARROW-11547 - [Packaging][Conda][Drone] Nightly builds are failed by undefined variable error ARROW-11548 - [C++] RandomArrayGenerator::List size mismatch ARROW-11551 - [C++][Gandiva] castTIMESTAMP(utf8) function doesn’t show error out for invalid inputs ARROW-11560 - [FlightRPC][C++][Python] Interrupting a Flight server results in abort ARROW-11567 - [C++][Compute] Variance kernel has precision issue ARROW-11577 - [Rust] Concat kernel panics on slices of string arrays ARROW-11582 - [R] write_dataset “format” argument default and validation could be better ARROW-11586 - [Rust] [Datafusion] Invalid SQL sometimes panics ARROW-11595 - [C++][NIGHTLY:test-conda-cpp-valgrind] GenerateBitsUnrolled triggers valgrind on uninit inputs ARROW-11596 - [Python][Dataset] SIGSEGV when executing scan tasks with Python executors ARROW-11603 - [Rust] Fix clippy error ARROW-11607 - [Python] Error when reading table with list values from parquet ARROW-11614 - [C++][Gandiva] Fix round() logic to return positive zero when argument is zero ARROW-11617 - [C++][Gandiva] Fix nested if-else optimisation in gandiva ARROW-11620 - [Rust] [DataFusion] Inconsistent use of Box and Arc for TableProvider ARROW-11630 - [Rust] Introduce partial_sort and limit option for sort kernel ARROW-11632 - [Rust] csv::Reader doesn’t propagate schema metadata to RecordBatches ARROW-11639 - [C++][Gandiva] Fix signbit compilation issue in Ubuntu nightly build ARROW-11642 - [C++] Incorrect preprocessor directive for Windows in JVM detection ARROW-11657 - [R] group_by with .drop specified errors ARROW-11658 - [R] Handle mutate/rename inside group_by ARROW-11663 - 
[DataFusion] Master does not compile ARROW-11668 - [C++] Sporadic UBSAN error in FutureStessTest.TryAddCallback ARROW-11672 - [R] Fix string function test failure on R 3.3 ARROW-11681 - [Rust] IPC writers shouldn’t unwrap in destructors ARROW-11686 - [C++] flight-test-integration-client sometimes exits by SIGABRT but does not print the stack trace ARROW-11687 - [Rust][DataFusion] RepartitionExec Hanging ARROW-11694 - [C++] Array Take may dereference absent null bitmap ARROW-11695 - [C++][FlightRPC][Packaging] Update support for disabling TLS server verification for recent gRPC versions ARROW-11717 - [Integration] Intermittent (but frequent) flight integration failures with auth:basic_proto ARROW-11718 - [Rust] IPC writers shouldn’t implicitly finish on drop ARROW-11741 - [C++] Decimal cast failure on big-endian ARROW-11743 - [R] Use pkgdown’s new found ability to autolink Jiras ARROW-11746 - [Developer][Archery] Fix prefer real time check ARROW-11756 - [R] passing a partition as a schema leads to segfaults ARROW-11758 - [C++][Compute] Summation kernel round-off error ARROW-11767 - [C++] Scalar::hash may segfault for null scalars ARROW-11771 - [Developer][Archery] Move benchmark tests (so CI runs them) ARROW-11784 - [Rust][DataFusion] CoalesceBatchesStream doesn’t honor Stream interface ARROW-11785 - [R] Fallback when filtering Table with unsupported expression fails ARROW-11786 - [C++] CMake output noisy ARROW-11788 - [Java] Appending Empty List Vector yields NPE ARROW-11791 - [Rust][DataFusion] RepartitionExec Blocking ARROW-11802 - [Rust][DataFusion] Mixing of crossbeam channel and async tasks can lead to deadlock ARROW-11819 - [Rust] Add link to the doc ARROW-11821 - [Rust] Edit Rust README ARROW-11830 - [C++] gRPC compilation tests occur every time ARROW-11832 - [R] Handle conversion of extra nested struct column ARROW-11836 - Target libarrow_bundled_dependencies.a is not already created but is already required. 
ARROW-11845 - [Rust] Debug implementation of Date32Array panics if array contains negative values ARROW-11850 - [GLib] GARROW_VERSION_0_16 macro is missing ARROW-11855 - [C++] [Python] Memory leak in to_pandas when converting chunked struct array ARROW-11857 - [Python] Resource temporarily unavailable when using the new Dataset API with Pandas ARROW-11860 - [Rust] [DataFusion] Add DataFusion logos ARROW-11866 - [C++] Arrow Flight SetShutdownOnSignals cause potential mutex deadlock in gRPC ARROW-11872 - [C++] Array Validation of GPU buffers fails due to incorrect validation check ARROW-11880 - [R] Handle empty or NULL transmute() args properly ARROW-11881 - [Rust][DataFusion] Fix Clippy Lint ARROW-11896 - [Rust] Hang / failure in CI on AMD64 Debian 10 Rust stable test workspace ARROW-11904 - [C++] “pure virtual method called” crash at the end of arrow-csv-test ARROW-11905 - [C++] SIMD info always returning none on MacOS ARROW-11914 - [R] [CI] r-sanitizer nightly is broken ARROW-11918 - [R] [Documentation] Docs cleanups ARROW-11923 - [CI] Update branch name for dask dev integration tests ARROW-11937 - [C++] GZip codec hangs if flushed twice ARROW-11941 - [Dev] “DEBUG=1 merge_arrow_pr.py” updates Jira issue ARROW-11942 - [C++] If tasks are submitted quickly the thread pool may fail to spin up new threads ARROW-11945 - [R] filter doesn’t accept negative numbers as valid ARROW-11956 - [C++] Fix system re2 dependency detection for static library ARROW-11965 - [R][Docs] Fix install.packages command in R dev docs ARROW-11970 - [C++][CI] Fix Valgrind failures ARROW-11971 - [Packaging] Vcpkg patch doesn’t apply on windows due to line endings ARROW-11975 - [CI][GLib] Failed to update gcc ARROW-11976 - [C++] Sporadic TSAN error in TestThreadPool.SetCapacity ARROW-11983 - [Python] ImportError calling pyarrow from_pandas within ThreadPool ARROW-11997 - [Python] concat_tables crashes python interpreter ARROW-12003 - [R] Fix NOTE re undefined global function group_by_drop_default 
ARROW-12006 - [Java] Fix checkstyle config to work on Windows ARROW-12012 - [Java] [JDBC] BinaryConsumer cannot reallocate memory correctly ARROW-12013 - [C++][FlightRPC] Failed to detect gRPC version ARROW-12015 - [Rust] [DataFusion] Integrate doc-comment crate to ensure readme examples remain valid ARROW-12028 - [Rust][DataFusion] Unsupported GROUP BY for Timestamp(Millisecond, None) ARROW-12029 - Remove args from FeatherReader$create v2 ARROW-12033 - [Docs] Fix link in developers/benchmarks.html ARROW-12041 - [C++] Fix string description of tensor IPC messages ARROW-12051 - [GLib] Intermittent CI failure in test_add_column_type(TestCSVReader::#read::options) ARROW-12057 - [Python] Remove direct usage of pandas’ Block subclasses ARROW-12065 - [C++][Python] Segfault reading JSON file ARROW-12067 - [Python][Doc] Document pyarrow_(un)wrap_scalar ARROW-12073 - [R] Fix R CMD check NOTE about ‘X_____X’ ARROW-12076 - [Rust] Fix build ARROW-12077 - [C++] Out-of-bounds write in ListArray::FromArrays ARROW-12086 - [C++] offline builds do not use ARROW_$LIBRARY_URL to search for packages ARROW-12088 - [Python][C++] Warning about offsetof in pyarrow.dataset.RecordBatchIterator ARROW-12089 - [Doc] Fix warnings when building Sphinx docs ARROW-12100 - [C#] Cannot round-trip record batch with PyArrow ARROW-12103 - [C++] “load of misaligned address” in Parquet reader ARROW-12112 - [CI] No space left on device - AMD64 Conda Integration test ARROW-12113 - [R] Fix rlang deprecation warning from check_select_helpers() ARROW-12130 - [C++] Arm64 build failed if -DARROW_SIMD_LEVEL=NONE ARROW-12138 - [Go][IPC] ARROW-12140 - [C++][CI] Valgrind failure on Grouper tests ARROW-12145 - [Developer][Archery] Flaky test: test_static_runner_from_json ARROW-12149 - [Dev] Archery benchmark test case is failing ARROW-12154 - [C++][Gandiva] Fix gandiva crash in certain OS/CPU combinations ARROW-12155 - [R] Require Table 
columns to be same length ARROW-12161 - [C++][R] Async streaming CSV reader deadlocking when being run synchronously from datasets ARROW-12169 - [C++] Fix compressed file reading with an empty stream at end of file ARROW-12171 - [Rust] Clippy error ARROW-12172 - [Python][Packaging] Pass python version as setuptools pretend version in the macOS wheel builds ARROW-12178 - [CI] Update setuptools in the ubuntu images ARROW-12186 - [Rust][DataFusion] Fix regexp_match test ARROW-12209 - [JS] @apache-arrow/ts nor apache-arrow does not compile ARROW-12220 - [C++][CI] Thread sanitizer failure ARROW-12226 - [C++] ASAN error in s3fs_test.cc ARROW-12227 - [R] Fix RE2 and median nightly build failures ARROW-12235 - [Rust][DataFusion] LIMIT returns incorrect results when used with several small partitions ARROW-12241 - [Python] Parallel csv reader cancellation test kills pytest ARROW-12250 - [Rust] Failing test arrow::arrow_writer::tests::fixed_size_binary_single_column ARROW-12254 - [Rust][DataFusion] Limit keeps polling input after limit is reached ARROW-12258 - [R] Never do as.data.frame() on collect(as_data_frame = FALSE) ARROW-12262 - [Doc][C++][Python] Docs built and pushed with S3 and Flight disabled ARROW-12267 - [Rust] JSON writer does not support timestamp types ARROW-12273 - [JS] Coveralls does not work anymore ARROW-12279 - [Rust][DataFusion] Add test for null handling in hash join (ARROW-12266) ARROW-12294 - [Rust] Fix Boolean Kleene Kernels with no Remainder ARROW-12299 - [Python] pq.write_to_dataset does not recognize S3FileSystem ARROW-12300 - [C++] ArrowCUDA erroneously links to CUDA Runtime while only using CUDA Driver API ARROW-12313 - [Rust] [Ballista] Benchmark documentation out of date ARROW-12314 - [Python] pq.read_pandas with use_legacy_dataset=False does not accept columns as a set (kartothek integration failure) ARROW-12327 - [Dev] Use pull request’s 
head remote when submitting crossbow jobs via the comment bot ARROW-12330 - [Developer] Restore values in counters column of Archery benchmark ARROW-12334 - [Rust] [Ballista] Aggregate queries producing incorrect results ARROW-12342 - [Packaging] Fix tabulation in crossbow templates for submitting nightly builds ARROW-12357 - [Archery] Error running “crossbow submit …” ARROW-12377 - [Doc][Java] Java doc build broken ARROW-12379 - [C++][CI] Thread sanitizer failure in SerialExecutor ARROW-12382 - [C++][CI] Conda nightly jobs fail due to not bundling xsimd ARROW-12385 - [R] [CI] fix cran picking in CI ARROW-12390 - [Rust] Inline from_trusted_len_iter, try_from_trusted_len_iter, extend_from_slice ARROW-12401 - [R] Fix guard around dataset___Scanner__TakeRows ARROW-12405 - [Packaging] Fix apt artifact patterns and artifact uploading from travis ARROW-12408 - [R] Delete Scan() bindings ARROW-12421 - [Rust] [DataFusion] topk_query test fails in master ARROW-12429 - [C++] MergedGeneratorTestFixture is incorrectly instantiated ARROW-12433 - [Rust] Builds failing due to new flatbuffer release introducing const generics ARROW-12437 - [Rust] [Ballista] Ballista plans must not include RepartitionExec ARROW-12440 - [Release] Various packaging, release script and release verification script fixes ARROW-12466 - [Python] Comparing array to None raises error ARROW-12475 - [C++] Build warning from thread_pool_benchmark.cc ARROW-12487 - [C++][Dataset] ScanBatches() hangs if there’s an error during scanning ARROW-12495 - [C++][Python] NumPy buffer sets is_mutable_ to true but does not set mutable_data_ when the NumPy array is writable PARQUET-1655 - [C++] Decimal comparisons used for min/max statistics are not correct PARQUET-2008 - [C++] Wrong information written in RowGroup::total_byte_size" />
<meta property="og:description" content="Apache Arrow 4.0.0 (26 April 2021) This is a major release covering more than 3 months of development. Download Source Artifacts Binary Artifacts For CentOS For Debian For Python For Ubuntu Git tag Contributors This release includes 719 commits from 114 distinct contributors. $ git shortlog -sn apache-arrow-3.0.0..apache-arrow-4.0.0 65 Antoine Pitrou 47 Andrew Lamb 41 Heres, Daniel 40 David Li 37 Sutou Kouhei 33 Neal Richardson 30 Weston Pace 28 Jorge C. Leitao 26 Krisztián Szűcs 25 Ian Cook 21 Dominik Moritz 20 Andy Grove 19 Yibo Cai 18 Joris Van den Bossche 17 Neville Dipale 17 Jonathan Keane 17 Ritchie Vink 12 Mike Seddon 12 Benjamin Kietzman 11 Mauricio Vargas 10 Qingping Hou 10 Diana Clarke 8 Micah Kornfield 7 Matthew Topol 7 Dmitry Patsura 5 Projjal Chanda 5 Kenta Murata 4 Anthony Louis 4 Ximo Guanter 4 liyafan82 3 Andre Braga Reis 3 Kazuaki Ishizaki 3 Maarten A. Breddels 3 Uwe L. Korn 3 ptaylor 3 Steven Fackler 3 Sagnik Chakraborty 3 Nic Crane 2 Marc Prud&#39;hommeaux 2 Raphael Taylor-Davies 2 João Pedro 2 Yordan Pavlov 2 emkornfield 2 Max Burke 2 Florian Müller 2 Ben Chambers 2 mqy 2 Christoph Schulze 2 Manoj Karthick 2 Sathis Kumar 2 Ryan Jennings 2 Ruan Pearce-Authers 2 Tao He 2 Eric Burden 2 Tyrel Rink 2 Romain Francois 2 Rok 1 witchard 1 Adam Lippai 1 Albert Villanova del Moral 1 Alessandro Molina 1 Ali 1 Andrew Wieteska 1 Bob Tinsman 1 Brian Hulette 1 Bryan Cutler 1 Clcanny 1 Daniel Russo 1 Daniël Heres 1 Eduardo Ponce 1 Evan Chan 1 FawnD2 1 Felix Zhu 1 Fernando Herrera 1 Fiona La 1 François Saint-Jacques 1 GALI PREM SAGAR 1 Gert Hulselmans 1 Ha Thi Tham 1 Hongze Zhang 1 Ilya Biryukov 1 Ivan Smirnov 1 James Winegar 1 Joe Roberts 1 Johannes Müller 1 Jörn Horstmann 1 Mahmut Bulut 1 Marco Gorelli 1 Marko Mikulicic 1 Markus Silberstein Hont 1 Martin Nowak 1 Matt Brubeck 1 Matt Summersgill 1 Max Meldrum 1 Nathaniel Bauernfeind 1 Nga Tran 1 Nick Bruno 1 Rok Mihevc 1 Roman Karlstetter 1 Sam Albers 1 Simon Bertron 1 
Szangin 1 Truc Lam Nguyen 1 Weichen Xu 1 Ying Zhou 1 frank400 1 ivan 1 jpeeter 1 martinblostein 1 nmcdonnell-kx 1 pierwill 1 sjgupta2 1 sundy-li 1 ARF1 Patch Committers The following Apache committers merged contributed patches to the repository. $ git shortlog -csn apache-arrow-3.0.0..apache-arrow-4.0.0 157 Andrew Lamb 101 Antoine Pitrou 93 Neal Richardson 88 Krisztián Szűcs 72 Sutou Kouhei 41 David Li 30 Benjamin Kietzman 25 Neville Dipale 22 Micah Kornfield 19 Jorge C. Leitao 16 Andy Grove 14 Praveen 11 Joris Van den Bossche 9 GitHub 8 Yibo Cai 4 Uwe L. Korn 3 Sebastien Binet 2 liyafan82 1 Kenta Murata 1 Eric Erhardt 1 Chao Sun 1 Bryan Cutler Changelog Apache Arrow 4.0.0 (2021-04-26) New Features and Improvements ARROW-951 - [JS] Fix generated API documentation ARROW-2229 - [C++] Write CSV files from RecordBatch, Table ARROW-3690 - [Rust] Add Rust to the format integration testing ARROW-6103 - [Java] Stop using the maven release plugin ARROW-6248 - [Python] Use FileNotFoundError in HadoopFileSystem.open() in Python 3 ARROW-6455 - [C++] Implement ExtensionType for non-UTF8 Unicode data ARROW-6604 - [C++] Add support for nested types to MakeArrayFromScalar ARROW-7215 - [C++][Gandiva] Implement castVARCHAR(numeric_type) functions in Gandiva ARROW-7364 - [Rust] Add cast options to cast kernel ARROW-7633 - [C++][CI] Create fuzz targets for tensors and sparse tensors ARROW-7808 - [Java][Dataset] Implement Datasets Java API ARROW-7906 - [C++][Python] Full functionality for ORC format ARROW-8049 - [C++] Upgrade bundled Thrift version to 0.13.0 ARROW-8282 - [C++/Python][Dataset] Support schema evolution for integer columns ARROW-8284 - [C++][Dataset] Schema evolution for timestamp columns ARROW-8630 - [C++][Dataset] Pass schema including all materialized fields to catch CSV edge cases ARROW-8631 - [C++][Dataset] Add ConvertOptions and ReadOptions to CsvFileFormat ARROW-8658 - [C++][Dataset] Implement subtree pruning for FileSystemDataset::GetFragments ARROW-8732 - [C++] 
Let Futures support cancellation ARROW-8771 - [C++] Add boost/process library to build support ARROW-8796 - [Rust] Allow parquet to be written directly to memory ARROW-8797 - [C++] Support Flight RPC among different endian platforms ARROW-8900 - [C++] Respect HTTP(S)_PROXY for S3 Filesystems and/or expose proxy options as parameters ARROW-8919 - [C++] Add “DispatchBest” APIs to compute::Function that selects a kernel that may require implicit casts to invoke ARROW-9128 - [C++] Implement string space trimming kernels: trim, ltrim, and rtrim ARROW-9149 - [C++] Improve configurability of RandomArrayGenerator::ArrayOf ARROW-9196 - [C++] Make temporal casts work on Scalar inputs ARROW-9318 - [C++][Parquet] Encryption key management tools ARROW-9731 - [C++][Dataset] Port “head” method from R to C++ Dataset Scanner ARROW-9749 - [C++][Dataset] Extract format-specific scan options from FileFormat ARROW-9777 - [Rust] Implement IPC changes to catch up to 1.0.0 format ARROW-9856 - [R] Add bindings for string compute functions ARROW-10014 - [C++] TaskGroup::Finish should execute tasks ARROW-10089 - [R] inject base class for Array, ChunkedArray and Scalar ARROW-10183 - [C++] Create a ForEach library function that runs on an iterator of futures ARROW-10195 - [C++] Add string struct extract kernel using re2 ARROW-10250 - [FlightRPC][C++] Remove default constructor for FlightClientOptions ARROW-10255 - [JS] Reorganize imports and exports to be more friendly to ESM tree-shaking ARROW-10297 - [Rust] Parameter for parquet-read to output data in json format ARROW-10299 - [Rust] Support reading and writing V5 of IPC metadata ARROW-10305 - [R] Filter with regular expressions ARROW-10306 - [C++] Add string replacement kernel ARROW-10349 - [Python] Build and publish aarch64 wheels ARROW-10354 - [Rust] [DataFusion] Add support for regex extract ARROW-10360 - [CI] Bump github actions cache version ARROW-10372 - [C++][Dataset] Read compressed CSVs ARROW-10406 - [C++] Unify dictionaries when 
writing IPC file in a single shot ARROW-10420 - [C++] FileSystem::OpenInput{File,Stream} should accept a MemoryPool ARROW-10421 - [R] Feather reader/writer should accept a MemoryPool ARROW-10438 - [C++][Dataset] Partitioning::Format on nulls ARROW-10520 - [C++][R] Implement add/remove/replace for RecordBatch ARROW-10570 - [R] Use Converter API to convert SEXP to Array/ChunkedArray ARROW-10580 - [C++] When Validating, ensure DenseUnionArray offsets are increasing ARROW-10606 - [C++][Compute] Support casts to and from Decimal256 type. ARROW-10655 - [C++] Add LRU cache facility ARROW-10734 - [R] Build and test on Solaris ARROW-10735 - [R] Remove arrow-without-arrow wrapping ARROW-10766 - [Rust] Compute nested definition and repetition for list arrays ARROW-10797 - [C++] Investigate faster random generation for tests and benchmarks ARROW-10816 - [Rust] [DataFusion] Implement INTERVAL ARROW-10831 - [C++][Compute] Implement quantile kernel ARROW-10846 - [C++] Add async filesystem operations ARROW-10880 - [Java] Support compressing RecordBatch IPC buffers by LZ4 ARROW-10882 - [Python][Dataset] Writing dataset from python iterator of record batches ARROW-10895 - [C++][Gandiva] Implement bool to varchar cast function in Gandiva ARROW-10903 - [Rust] Implement FromIter&lt;Option&lt;Vec&lt;u8&gt;&gt;&gt; constructor for FixedSizeBinaryArray ARROW-11022 - [Rust] [DataFusion] Upgrade to tokio 1.0 ARROW-11070 - [C++] Implement power / exponentiation compute kernel ARROW-11074 - [Rust][DataFusion] Implement predicate push-down for parquet tables ARROW-11081 - [Java] Make IPC option immutable ARROW-11108 - [Rust] Improve performance of MutableBuffer ARROW-11141 - [Rust]: Miri checks ARROW-11149 - [Rust] create_batch_empty - support List, LargeList ARROW-11150 - [Rust] Set up bi-weekly Rust sync call and update website ARROW-11154 - [CI][C++] Move homebrew crossbow tests off of Travis-CI ARROW-11156 - [Rust][DataFusion] Create hashes vectorized in hash join ARROW-11174 - 
[C++][Dataset] Make Expressions available for projection ARROW-11179 - [Format] Make comments in fb files friendly to rust doc ARROW-11183 - [Rust] [Parquet] LogicalType::TIMESTAMP_NANOS missing ARROW-11191 - [C++] Use FnOnce for TaskGroup’s tasks instead of std::function ARROW-11216 - [Rust] Improve documentation for StringDictionaryBuilder ARROW-11220 - [Rust] DF Implement GROUP BY support for Boolean ARROW-11222 - [Rust] [Arrow] catch up with flatbuffers 0.8.1 ARROW-11246 - DF - Add type to Unexpected accumulator state message ARROW-11254 - [Rust][DataFusion] Add SIMD and snmalloc flags as options to benchmarks ARROW-11260 - [C++][Dataset] Don’t require dictionaries for reading dataset with schema-based Partitioning ARROW-11265 - [Rust] Made bool not convertable to bytes ARROW-11268 - [Rust][DataFusion] Support specifying repartitions in MemTable ARROW-11270 - [Rust] Use slices for simple array data buffer access ARROW-11279 - [Rust][Parquet] ArrowWriter Definition Levels Memory Usage ARROW-11284 - [R] Support dplyr verb transmute() ARROW-11289 - [Rust] [DataFusion] Support GROUP BY for Dictionary columns ARROW-11290 - [Rust][DataFusion] Address hash aggregate performance with high number of groups ARROW-11291 - [Rust] implement extend for MutableBuffer (from iterator) ARROW-11300 - [Rust][DataFusion] Improve hash aggregate performance with large number of groups in ARROW-11308 - [Rust] [Parquet] Add Arrow decimal array writer ARROW-11309 - [Release][C#] Use .NET 3.1 for verification ARROW-11310 - [Rust] Implement arrow JSON writer ARROW-11314 - [Release][APT][Yum] Add support for verifying arm64 packages ARROW-11317 - [Rust] Test the prettyprint feature in CI ARROW-11318 - [Rust] Support pretty printing timestamp, date, and time types ARROW-11319 - [Rust] [DataFusion] Improve test comparisons to record batch ARROW-11321 - [Rust][DataFusion] Fix DataFusion compilation error ARROW-11325 - [Packaging][C#] Release Apache.Arrow.Flight and 
Apache.Arrow.Flight.AspNetCore ARROW-11329 - [Rust] Do not rebuild the library on every change ARROW-11330 - [Rust][DataFusion] Add ExpressionVisitor pattern ARROW-11332 - [Rust] Use MutableBuffer in take_string instead of Vec ARROW-11333 - [Rust] Support creating arbitrary nested empty arrays ARROW-11336 - [C++][Doc] Improve Developing on Windows docs ARROW-11338 - [R] Bindings for quantile and median ARROW-11340 - [C++] Add vcpkg.json manifest to cpp project root ARROW-11343 - [DataFusion] Simplified example ARROW-11346 - [C++][Compute] Implement quantile kernel benchmark ARROW-11349 - [Rust] Add from_iter_values to create arrays from T instead of Option&lt;T&gt; ARROW-11350 - [C++] Bump dependency versions ARROW-11354 - [Rust] Speed-up casts of dates and times ARROW-11355 - [Rust] Align Date type with spec ARROW-11358 - [Rust] Add benchmark for concatenating small arrays ARROW-11360 - [Rust] [DataFusion] Improve CSV “No files found” error message ARROW-11361 - [Rust] Build buffers from iterator of booleans ARROW-11362 - [Rust][DataFusion] Use iterator APIs in to_array_of_size to improve performance ARROW-11365 - [Rust] [Parquet] Implement parsers for v2 of the text schema ARROW-11366 - [Rust][DataFusion] Add Constant Folding / Support boolean literal in equality expression ARROW-11367 - [C++] Implement approximate quantile utility ARROW-11369 - [DataFusion] Split expressions.rs ARROW-11372 - Support RC verification on macOS-ARM64 ARROW-11373 - [Python][Docs] Add example of specifying type for a column when reading csv file ARROW-11374 - [Python] Make legacy pyarrow.filesystem / pyarrow.serialize warnings more visible ARROW-11375 - [Rust] CI fails due to deprecation warning in clippy ARROW-11377 - [C++][CI] Add ThreadSanitizer nightly build ARROW-11383 - [Rust] use trusted len on bit ops ARROW-11386 - [Release] Fix post documents update script ARROW-11389 - [Rust] Inconsistent comments for datatypes ARROW-11395 - [DataFusion] Support custom optimizations 
ARROW-11401 - [Rust][DataFusion] Pass slices instead of Vec in DataFrame API ARROW-11404 - [Rust][DataFusion] Upgrade to aHash 0.7 ARROW-11405 - [DataFusion] Support multiple custom nodes ARROW-11406 - [CI][C++] Fix caching on Travis-CI builds ARROW-11408 - Add window support to datafusion readme ARROW-11411 - [Packaging][Linux] Disable arm64 nightly builds ARROW-11414 - [Rust] Reduce copies in Schema::try_merge ARROW-11417 - [Integration] Add integration test for buffer compression ARROW-11418 - [Doc] Add IPC buffer compression to support matrix ARROW-11421 - [Rust][DataFusion] Support group by Date32 ARROW-11422 - [C#] Add support for decimals ARROW-11423 - [R] value_counts and some StructArray methods ARROW-11425 - [C++][Compute] Improve quantile kernel for integers ARROW-11426 - [Rust][DataFusion] EXTRACT support ARROW-11428 - [Rust] Add power kernel ARROW-11429 - Make string comparisson kernels generic over Utf8 and LargeUtf8 ARROW-11430 - [Rust] Kernel to combine two arrays based on boolean mask ARROW-11431 - [Rust] [DataFusion] Add support for the SQL HAVING clause ARROW-11435 - Allow creating ParquetPartition from external crate ARROW-11436 - [Rust] Allow non-sized iterators in Primitive::from_iter ARROW-11437 - [Rust] Simplify benches ARROW-11438 - Unsupported ast node Value(Boolean(true)) in sqltorel ARROW-11439 - [Rust] Add year support to temporal kernel ARROW-11440 - [Rust] [DataFusion] Add method to CsvExec to get CSV schema ARROW-11442 - [Rust] Expose the logic used to interpret date/times ARROW-11443 - [Rust] Write datetime information for Date64 Type in csv writer ARROW-11444 - [Rust][DataFusion] Pass slices instead of &amp;Vec to functions ARROW-11446 - [DataFusion] Support scalars in builtin functions ARROW-11447 - [Rust] Add shift kernel ARROW-11449 - [CI][R][Windows] Use ccache ARROW-11457 - [Rust] Make string comparisson kernels generic over Utf8 and LargeUtf8 ARROW-11459 - [Rust] Allow ListArray of primitives to be built from iterator 
ARROW-11462 - [Developer] Remove needless quote from the default DOCKER_VOLUME_PREFIX ARROW-11463 - [Python] Allow configuration of IpcWriterOptions 64Bit from PyArrow ARROW-11466 - [Flight][Go] Add BasicAuth and BearerToken handlers for Go ARROW-11467 - [R] Fix reference to json_table_reader() in R docs ARROW-11468 - [R] Allow user to pass schema to read_json_arrow() ARROW-11474 - [C++] Update bundled re2 version ARROW-11476 - [Rust][DataFusion] Test running of TPCH benchmarks in CI ARROW-11477 - [R][Doc] Reorganize and improve README and vignette content ARROW-11478 - [R] Consider ways to make arrow.skip_nul option more user-friendly ARROW-11479 - [Rust][Parquet] Add method to return compressed size of row group ARROW-11481 - [Rust] More cast implementations ARROW-11484 - [Rust] Derive Clone for ExecutionContext ARROW-11486 - [Website] Use Jekyll 4 and webpack to support Ruby 3.0 or later ARROW-11489 - [Rust][DataFusion] Make DataFrame Send+Sync ARROW-11491 - [Rust] Support json schema inference for nested list and struct ARROW-11493 - [CI][Packaging][deb][RPM] Test built packages ARROW-11500 - [R] Allow bundled build script to run on Solaris ARROW-11501 - [C++] endianness check does not work on Solaris ARROW-11504 - [Rust] verify Datatype in ListArray::from(ArrayDataRef) ARROW-11505 - [Rust] Add support for LargeUtf8 in csv-writer ARROW-11507 - [R] Bindings for GetRuntimeInfo ARROW-11510 - [Python] Add note that pip &gt;= 19.0 is required to get binary packages ARROW-11511 - [Rust] Replace Arc&lt;ArrayData&gt; by ArrayData ARROW-11512 - [Packaging][deb] Add missing gRPC dependency for Ubuntu 21.04 ARROW-11513 - [R] Bindings for sub/gsub ARROW-11516 - [R] Allow all C++ compute functions to be called by name in dplyr ARROW-11539 - [Developer][Archery] Change items_per_seconds units ARROW-11541 - [C++][Compute] Implement approximate quantile kernel ARROW-11542 - [Rust] json reader should not crash when reading nested list ARROW-11544 - [Rust] [DataFusion] Implement 
as_any for AggregateExpr ARROW-11545 - [Rust] [DataFusion] SendableRecordBatchStream should implement Sync ARROW-11556 - [C++] Minor benchmark improvements ARROW-11557 - [Rust] Add table de-registration to DataFusion ExecutionContext ARROW-11559 - [C++] Improve flatbuffers verification limits ARROW-11561 - [Rust][DataFusion] Add Send + Sync to MemTable::load ARROW-11563 - [Rust] Support Cast(Utf8, TimeStamp(Nanoseconds, None)) ARROW-11568 - [C++][Compute] Mode kernel performance is bad in some conditions ARROW-11570 - [Rust] ScalarValue - support Date64 ARROW-11571 - [CI] Cancel stale Github Actions workflow runs ARROW-11572 - [Rust] Add a kernel for division by single scalar ARROW-11573 - [Developer][Archery] Google benchmark now reports run type ARROW-11574 - [Rust][DataFusion] Upgrade sqlparser to 0.8 to support parsing all TPC-H queries ARROW-11575 - [Developer][Archery] Expose execution time in benchmark results ARROW-11576 - [Rust] Remove unused variable in example ARROW-11580 - [C++] Add CMake option ARROW_DEPENDENCY_SOURCE=VCPKG ARROW-11589 - [R] Add methods for modifying Schemas ARROW-11590 - [C++] Move CSV background generator to IO thread pool ARROW-11591 - [C++][Compute] Prototype version of hash aggregation ARROW-11592 - [Rust] Typo in comment ARROW-11594 - [Rust] Support pretty printing with NullArrays ARROW-11597 - [Rust] Split datatypes in a module ARROW-11598 - [Rust] Split buffer.rs in smaller files ARROW-11599 - [Rust] Add function to create array with all nulls ARROW-11601 - [C++][Dataset] Expose pre-buffering in ParquetFileFormatReaderOptions ARROW-11606 - [Rust] [DataFusion] Need guidance on HashAggregateExec reconstruction ARROW-11610 - [C++] Download boost from sourceforge instead of bintray ARROW-11612 - [C++] Rebuild trimmed boost bundle for 1.75.0 ARROW-11613 - [R] Move nightly C++ builds off of bintray ARROW-11616 - [Rust][DataFusion] Expose collect_partitioned for DataFrame 
ARROW-11621 - [CI][Gandiva][Linux] Fix Crossbow setup failure ARROW-11626 - [Rust][DataFusion] Move DataFusion examples to own project to reduce nr dependencies ARROW-11627 - [Rust] Typed allocator ARROW-11637 - [CI][Conda] Update nightly clean target platforms and packages list ARROW-11641 - [CI] Use docker buildkit’s inline cache to reuse build cache across different hosts ARROW-11649 - [R] Add support for null_fallback to R ARROW-11651 - [Rust][DataFusion] Implement Postgres Length Functions ARROW-11653 - Ascii/unicode functions ARROW-11655 - Pad/trim functions ARROW-11656 - Left over functions/fixes ARROW-11659 - [R] Preserve group_by .drop argument ARROW-11662 - [C++] Support sorting for decimal data type. ARROW-11664 - [Rust] Cast to LargeUtf8 ARROW-11665 - [Python] Document precision and scale parameters of decimal128() ARROW-11666 - [Integration] Add endianness “gold” integration file for decimal256 ARROW-11667 - [Rust] Add docs for utf8 comparison functions ARROW-11669 - [Rust] [DataFusion] Remove concurrency field from GlobalLimitExec ARROW-11671 - [Rust][DataFusion] Clean up docs on Expr ARROW-11677 - [C++][Dataset] Write documentation ARROW-11680 - [C++] Add vendored version of folly’s spsc queue ARROW-11683 - [R] Support dplyr::mutate() ARROW-11685 - [C++] Typo in future_test.cc ARROW-11688 - [Rust] Casts between utf8 and large-utf8 ARROW-11690 - [Rust][DataFusion] Avoid Expr::clone in Expr builder methods ARROW-11692 - [Rust][DataFusion] Improve documentation on Optimizer ARROW-11693 - [C++] Add string length kernel ARROW-11700 - [R] Internationalize error handling in tidy eval ARROW-11701 - [R] Implement dplyr::relocate() ARROW-11703 - [R] Implement dplyr::arrange() ARROW-11704 - [R] Wire up dplyr::mutate() for datasets ARROW-11707 - Support CSV schema inference without IO ARROW-11708 - Clean up Rust 2021 linting warning ARROW-11709 - [Rust][DataFusion] Move `expressions` and `inputs` into LogicalPlan rather than helpers in util ARROW-11710 - 
[Rust][DataFusion] Implement ExprRewriter to avoid tree traversal redundancy ARROW-11719 - Support merged schema for memory table ARROW-11721 - json schema inference should return Schema type instead of SchemaRef ARROW-11722 - Improve error message in FFI ARROW-11724 - [C++] Namespace collisions with protobuf 3.15 ARROW-11725 - [Rust][DataFusion] Make use of the new divide_scalar kernel in arrow ARROW-11727 - [C++][FlightRPC] Use TDigest to estimate latency quantiles in benchmark ARROW-11730 - [C++] Add implicit Future(Status) constructor for convenience ARROW-11733 - [Rust][DataFusion] Support hash repartitioning ARROW-11734 - [C++] vendored safe-math.h does not compile on Solaris ARROW-11735 - [R] Allow Parquet and Arrow Dataset to be optional components ARROW-11736 - [R] Allow string compute functions to be optional ARROW-11737 - [C++] Patch vendored xxhash for Solaris ARROW-11738 - [Rust][DataFusion] Concat Functions ARROW-11740 - [C++] posix_memalign not declared in scope on Solaris ARROW-11742 - [Rust] [DataFusion] Add Expr::is_null and Expr::is_not_null functions ARROW-11744 - [C++] Add xsimd dependency ARROW-11745 - [C++] Improve configurability of random data generation ARROW-11750 - [Python][Dataset] Add support for project expressions ARROW-11752 - [R] Replace usage of testthat::expect_is() ARROW-11753 - [Rust][DataFusion] Add test for Join Statement: Schema contains duplicate unqualified field name ARROW-11754 - [R] Support dplyr::compute() ARROW-11761 - [C++] Increase public API testing ARROW-11766 - [R] Better handling for missing compression codecs on Linux ARROW-11768 - [C++][CI] Make s390x build non-optional ARROW-11773 - [Rust] Allow json writer to write out JSON arrays as well as newline formatted objects ARROW-11774 - [R] one-line install from source on macOS ARROW-11775 - [Rust][DataFusion] Feature Flags for Dependencies ARROW-11777 - [Rust] impl AsRef for StringBuilder/BinaryBuilder ARROW-11778 - Cast from large-utf8 to numerical arrays 
ARROW-11779 - [Rust] make alloc module public ARROW-11790 - [Rust][DataFusion] Change plan builder signature to take Vec&lt;Expr&gt; rather than &amp;[Expr] ARROW-11794 - [Go] Add concurrent-safe ipc.FileReader.RecordAt(i) ARROW-11795 - [MATLAB] Migrate MATLAB Interface for Apache Arrow design doc to Markdown ARROW-11797 - [C++][Dataset] Provide Scanner methods to yield/visit scanned batches ARROW-11798 - [Integration] Update testing submodule ARROW-11799 - [Rust] String and Binary arrays created with incorrect length from unbound iterator ARROW-11801 - [C++] Remove bad header guard in filesystem/type_fwd.h ARROW-11803 - [Rust] [Parquet] Support v2 LogicalType ARROW-11806 - [Rust][DataFusion] Optimize inner join creation of indices ARROW-11820 - Added macro create_native to construct impl ARROW-11822 - Support case sensitive for function ARROW-11824 - [Rust] [Parquet] Use logical types in Arrow writer ARROW-11825 - [Rust][DataFusion] Add mimalloc as option to benchmarks ARROW-11833 - [C++] Vendored fast_float errors for emscripten (architecture flag missing) ARROW-11837 - [C++][Dataset] Expose originating fragment as a property of ScanTask ARROW-11838 - [C++] Support reading IPC data with shared dictionaries ARROW-11839 - [C++] Rewrite bit-unpacking optimizations using xsimd ARROW-11842 - [Rust][Parquet] Use more efficient clone_from in get_batch_with_dict ARROW-11852 - [Documentation] Update CONTRIBUTING to explain Contributor role ARROW-11856 - [C++] Remove unused reference to RecordBatchStreamWriter ARROW-11858 - [GLib] Gandiva Filter in GLib ARROW-11859 - [GLib] GArrowArray: concatenate is missing ARROW-11864 - [R] Document arrow.int64_downcast option ARROW-11870 - [Dev] Automatically run merge script in venv ARROW-11876 - [Website] Update governance page ARROW-11877 - [C++] Add initial microbenchmarks for Dataset internals ARROW-11879 - [Rust][DataFusion] ExecutionContext::sql should optimize query plan ARROW-11883 - [C++] Add ConcatMap, MergeMap, and an 
async-reentrant version of Map ARROW-11887 - [C++] Add asynchronous read to streaming CSV reader ARROW-11894 - [Rust][DataFusion] Change flight server example to use DataFrame API ARROW-11895 - [Rust][DataFusion] Add support for extra column statistics ARROW-11898 - [Rust] Pretty print columns ARROW-11899 - [Java] Refactor the compression codec implementation into core/Arrow specific parts ARROW-11900 - [Website] Add Yibo to committer list ARROW-11906 - [R] Make FeatherReader print method more informative ARROW-11907 - [C++] Use our own executor in S3FileSystem ARROW-11910 - [Packaging][Ubuntu] Drop support for 16.04 ARROW-11911 - [Website] Add protobuf vs arrow to FAQ ARROW-11912 - [R] Remove args from FeatherReader$create ARROW-11913 - [Rust] Improve performance of StringBuilder ARROW-11920 - [R] Add r/libarrow to make clean ARROW-11921 - [R] Set LC_COLLATE in r/data-raw/codegen.R ARROW-11924 - [C++] Provide streaming output from GetFileInfo ARROW-11925 - [R] Add `between` method for arrow_dplyr_query ARROW-11927 - [Rust][DataFusion] Support limit push down ARROW-11931 - [Go][CI] Bump CI to use Go 1.15 ARROW-11935 - [C++] Add push generator ARROW-11944 - [Developer] Achery benchmark diff regression: cannot compare jsons ARROW-11949 - [Ruby] Accept raw Ruby objects as sort key and options ARROW-11951 - [Rust] Remove OffsetSize::prefix ARROW-11952 - [Rust] Make ArrayData –&gt; GenericListArray fallable instead of `panic!` ARROW-11954 - [C++] arrow/util/io_util.cc does not compile on Solaris ARROW-11955 - [Rust][DataFusion] Support Union ARROW-11958 - [GLib] GArrowChunkedArray: combine is missing ARROW-11959 - [Rust][DataFusion] Fix logging of optimized plan ARROW-11962 - [Rust][DataFusion] Update Datafusion Docs / readme ARROW-11969 - [Rust][DataFusion] Improve Examples in documentation ARROW-11972 - [C++][Dataset] Extract IpcFragmentScanOptions, ParquetFragmentScanOptions ARROW-11973 - [Rust] Boolean AND/OR kernels should follow sql behaviour regarding null values 
ARROW-11977 - [Rust] Add documentation examples for sort kernel ARROW-11982 - [Rust] Donate Ballista Distributed Compute Platform ARROW-11984 - [C++][Gandiva] Implement SHA1 and SHA256 functions ARROW-11987 - [C++][Gandiva] Implement trigonometric functions on Gandiva ARROW-11988 - [C++][Gandiva] Implements the last_day function ARROW-11992 - [Rust][Parquet] Add upgrade notes on 4.0 rename of LogicalType #9731 ARROW-11993 - [C++] Don’t download xsimd if ARROW_SIMD_LEVEL=NONE ARROW-11996 - [R] Make r/configure run successfully on Solaris ARROW-11999 - [Java] Support parallel vector element search with user-specified comparator ARROW-12000 - [Documentation] Add note about deviation from style guide on struct/classes ARROW-12005 - [R] Fix a bash typo in configure ARROW-12017 - [R] [Documentation] Make proper developing arrow docs ARROW-12019 - [Rust] [Parquet] Update README for 2.6.0 support ARROW-12020 - [Rust][DataFusion] Adding SHOW TABLES and SHOW COLUMNS + partial information_schema support to DataFusion ARROW-12031 - [C++][CSV] infer CSV timestamps columns with fractional seconds ARROW-12032 - [Rust] Optimize comparison kernels using trusted_len iterator for bools ARROW-12034 - [Docs] Formalize Minor PRs ARROW-12037 - [Rust] [DataFusion] Support catalogs and schemas for table namespacing ARROW-12038 - [Rust][DataFusion] Upgrade hashbrown to 0.11 ARROW-12039 - [CI][C++][Gandiva] Fix gandiva nightly linux build failure ARROW-12040 - [R] [CI] [C++] test-r-rstudio-r-base-3.6-opensuse15 timing out during tests ARROW-12043 - [Rust] [Parquet] Write fixed size binary arrays ARROW-12045 - First Chunk of ported Parquet Code ARROW-12047 - [Rust] Clippy parquet ARROW-12048 - [Rust][DataFusion] Support Common Table Expressions ARROW-12052 - [Rust] Implement child data in C FFI ARROW-12056 - [C++] Create sequencing AsyncGenerator ARROW-12058 - [Python] Enable arithmetic operations on Expressions ARROW-12068 - [Python] Stop using distutils ARROW-12069 - [C++][Gandiva]Implement 
IN expressions for Decimal types ARROW-12070 - [GLib] Drop support for GNU Autotools ARROW-12071 - [GLib] Keep input stream reference of GArrowJSONReader ARROW-12075 - [Rust][DataFusion] Add CTE to list of supported features ARROW-12081 - [R] Bindings for utf8_length ARROW-12082 - [R][Dataset] Allow create dataset from vector of file paths ARROW-12094 - [C++][R] Fix/workaround re2 building on clang/libc++ ARROW-12097 - [C++] Modify BackgroundGenerator so it creates fewer threads ARROW-12098 - [R] Catch cpp build failures on linux ARROW-12104 - Next Chunk of ported Code ARROW-12106 - [Rust][DataFusion] Support `SELECT * from information_schema.tables` ARROW-12107 - [Rust][DataFusion] Support `SELECT * from information_schema.columns` ARROW-12108 - [Rust][DataFusion] Support `SHOW TABLES` ARROW-12109 - [Rust][DataFusion] Support `SHOW COLUMNS` ARROW-12110 - [Java] Implement ZSTD buffer compression for java ARROW-12111 - [Java] place files generated by flatc under source control ARROW-12116 - [Rust] Fix or ignore 1.51 clippy lints ARROW-12119 - [Rust][DataFusion] Improve performance of to_array_of_size ARROW-12120 - [Rust] Generate random arrays and batches ARROW-12121 - [Rust] [Parquet] Arrow writer benchmarks ARROW-12123 - [Rust][DataFusion] Use smallvec for indices for better join performance ARROW-12128 - [CI][Crossbow] Remove (or fix) test-ubuntu-16.04-cpp job ARROW-12131 - [CI][GLib] Ensure upgrading MSYS2 ARROW-12133 - [C++][Gandiva] Add option to disable setting mcpu flag to host cpu during llvm ir compilation ARROW-12134 - [C++] Add regex string match kernel ARROW-12136 - [Rust][DataFusion] Reduce default batch_size to 8192 ARROW-12139 - [Python][Packaging] Use vcpkg to build macOS wheels ARROW-12141 - [R] Bindings for grepl ARROW-12143 - [CI] R builds should timeout and fail after some threshold and dump the output. 
ARROW-12146 - [C++][Gandiva] Implement CONVERT_FROM(expression, ‘UTF8’, replacement char) function ARROW-12151 - [Docs] Add Jira component + summary conventions to the docs ARROW-12153 - [Rust] [Parquet] Return file metadata after writing Parquet file ARROW-12160 - [Rust] Add an `into_inner()` method to ipc::writer::StreamWriter ARROW-12164 - [Java] Make BaseAllocator.Config public ARROW-12165 - [Rust] Inline append functions in builders for performance ARROW-12168 - [Go][IPC] Implement Compression handling for IPC ARROW-12170 - [Rust][DataFusion] Introduce repartition optimization ARROW-12173 - [GLib] Remove #include &lt;config.h&gt; ARROW-12176 - parquet/low-level-api/reader-writer.cc has some typos. ARROW-12187 - [C++][FlightRPC] Enable compression in Flight benchmark ARROW-12188 - [Docs] Switch to pydata-sphinx-theme for the main sphinx docs ARROW-12190 - [Rust][DataFusion] Implement partitioned hash join ARROW-12192 - [Website] Use downloadable URL for archive download ARROW-12193 - [Dev][Release] Use downloadable URL for archive download ARROW-12194 - [Rust] [Parquet] Update zstd version ARROW-12197 - [R] dplyr bindings for cast, dictionary_encode ARROW-12200 - [R] Export and document list_compute_functions ARROW-12204 - [Rust][CI] Reduce size of rust build artifacts in integration test ARROW-12206 - [Python] Fix Table docstrings ARROW-12208 - [C++] Add the ability to run async tasks without using the CPU thread pool ARROW-12210 - [Rust][DataFusion] Document SHOW TABLES / SHOW COLUMNS / InformationSchema ARROW-12214 - [Rust][DataFusion] Add some tests for limit ARROW-12215 - [C++] fixed size binary columns cannot be null in CSV reader ARROW-12217 - [C++] Cleanup cpp examples source file names ARROW-12222 - [Dev][Packaging] Include build url in the crossbow console report ARROW-12224 - [Rust] Use stable rust for no default test, clean up CI tests ARROW-12228 - [CI] Create base image for conda environments ARROW-12236 - [R][CI] Add check that all docs pages are 
listed in _pkgdown.yml ARROW-12237 - [Packaging][Debian] Add support for bulleye ARROW-12238 - [JS] Remove trailing spaces ARROW-12239 - [JS] Switch to yarn ARROW-12242 - [Python][Doc] Tweak nightly build instructions ARROW-12246 - [CI] Sync conda recipes with upstream feedstock ARROW-12248 - [C++] Allow static builds to change memory allocators ARROW-12249 - [R] [CI] Fix test-r-install-local nightlies ARROW-12251 - [Rust] [Ballista] Add Ballista tests to CI ARROW-12263 - [Dev][Packaging] Move Crossbow to Archery ARROW-12269 - [JS] Move to eslint ARROW-12274 - [JS] Document how to run tests without building ARROW-12277 - [Rust][DataFusion] Min/Max are not supported for timestamp types ARROW-12278 - [Rust][DataFusion]Use Timestamp(Nanosecond, None) for SQL TIMESTAMP Type ARROW-12280 - [Developer] Remove @-mentions from commit messages in merge tool ARROW-12281 - [JS] Remove shx, trash, and rimraf ARROW-12283 - [R] Bindings for basic type convert functions in dplyr verbs ARROW-12286 - [C++] Create AsyncGenerator from Future&lt;AsyncGenerator&lt;T&gt;&gt; ARROW-12287 - [C++] Create enumerating generator ARROW-12288 - [C++] Create Scanner interface ARROW-12289 - [C++] Create basic AsyncScanner implementation ARROW-12303 - [JS] Use iterators instead of generators in critical code paths ARROW-12304 - [R] Update news and polish docs for 4.0 ARROW-12305 - [JS] Benchmark test data generate.py assumes python 2 ARROW-12309 - [JS] Make es2015 bundles the default ARROW-12316 - [C++] Switch default memory allocator from jemalloc to mimalloc on macOS ARROW-12317 - [Rust] JSON writer does not support time, date or interval types ARROW-12320 - [CI] REPO arg missing from conda-cpp-valgrind ARROW-12323 - [C++][Gandiva] Implement castTIME(timestamp) function ARROW-12325 - [C++] [CI] Nightly gandiva build failing due to failure of compiler to move return value ARROW-12326 - [C++] Avoid needless c-ares detection ARROW-12328 - [Rust] [Ballista] Fix code formatting ARROW-12329 - [Rust] 
[Ballista] Add README ARROW-12332 - [Rust] [Ballista] Api server for scheduler ARROW-12333 - [JS] Remove jest-environment-node-debug and do not emit from typescript by default ARROW-12335 - [Rust] [Ballista] Bump DataFusion version ARROW-12337 - add DoubleEndedIterator and ExactSizeIterator traits ARROW-12351 - [CI][Ruby] Use ruby/setup-ruby instead of actions/setup-ruby ARROW-12352 - [CI][R][Windows] Remove needless workaround for MSYS2 ARROW-12353 - [Packaging][deb] Rename -archive-keyring to -apt-source ARROW-12354 - [Packaging][RPM] Use apache.jfrog.io/artifactory/ instead of apache.bintray.com/ ARROW-12356 - [Website] Update install page instructions to point to artifactory ARROW-12361 - [Rust] [DataFusion] Allow users to override physical optimization rules ARROW-12367 - [C++] Stop producing when PushGenerator was destroyed ARROW-12370 - [R] Bindings for power kernel ARROW-12374 - [CI][C++][cron] Use Ubuntu 20.04 instead of 16.04 ARROW-12375 - [Release] Remove rebase post-release scripts ARROW-12376 - [Dev] archery trigger-bot should use logger.exception ARROW-12380 - [Rust][Ballista] Add scheduler ui ARROW-12381 - [Packaging][Python] macOS wheels are built with wrong package kind ARROW-12383 - [JS] Update direct deps ARROW-12384 - [JS] Improve code style ARROW-12389 - [R] [Docs] Add note about autocasting ARROW-12395 - [C++]: Create RunInSerialExecutor benchmark ARROW-12396 - [Python][Docs] Clarify serialization docstrings about deprecated status ARROW-12397 - [Rust] [DataFusion] Simplify readme example #10038 ARROW-12398 - [Rust] Remove double bound checks in iterators ARROW-12400 - [Rust] Re-enable transform module tests ARROW-12402 - [Rust] [DataFusion] Implement SQL metrics framework ARROW-12406 - [R] fix checkbashims violation in configure ARROW-12409 - [R] Remove LazyData from DESCRIPTION ARROW-12419 - [Java] flatc is not used in mvn ARROW-12420 - [C++/Dataset] Reading null columns as dictionary not longer possible ARROW-12423 - [Docs] Codecov badge in 
main Readme only applies to Rust ARROW-12425 - [Rust] new_null_array doesn’t allocate keys buffer for dictionary arrays ARROW-12432 - [Rust] [DataFusion] Add metrics for SortExec ARROW-12436 - [Rust][Ballista] Add watch capabilities to config backend trait ARROW-12467 - [C++][Gandiva] Add support for LLVM12 ARROW-12477 - [Release] Download linux aarch64 miniforge in verify-release-candidate.sh ARROW-12485 - [C++] Use mimalloc as the default memory allocator on macOS ARROW-12488 - [GLib] Use g_memdup2() with GLib 2.68 or later ARROW-12494 - [C++] ORC adapter fails to compile on GCC 4.8 PARQUET-1846 - [C++] Remove deprecated IO classes and related functions PARQUET-1899 - [C++] Deprecated ReadBatchSpaced in parquet/column_reader PARQUET-1990 - [C++] ConvertedType::NA is written out in some cases PARQUET-1993 - [C++] Expose when prefetching completes PARQUET-1998 - [C++] Implement LZ4_RAW compression Bug Fixes ARROW-4784 - [C++][CI] Re-enable flaky mingw tests. ARROW-6818 - [Doc] Format docs confusing ARROW-7288 - [C++][R] read_parquet() freezes on Windows with Japanese locale ARROW-7830 - [C++] Parquet library version doesn’t change with releases ARROW-9451 - [Python] Unsigned integer types will accept string values in pyarrow.array ARROW-9634 - [C++][Python] Restore non-UTC time zones when reading Parquet file that was previously Arrow ARROW-9878 - [Python] table to_pandas self_destruct=True + split_blocks=True cannot prevent doubling memory ARROW-10038 - [C++] SetCpuThreadPoolCapacity(1) spins up nCPUs threads ARROW-10056 - [C++] Increase flatbuffers max_tables parameter in order to read wide tables ARROW-10364 - [Dev][Archery] Test is failed with semver 2.13.0 ARROW-10370 - [Python] Spurious s3fs-related test failures ARROW-10403 - [C++] Implement unique kernel for dictionary type ARROW-10405 - [C++] IsIn kernel should be able to lookup dictionary in string ARROW-10457 - [CI] Fix Spark branch-3.0 integration tests ARROW-10489 - [C++] Unable to configure or make 
with intel compiler ARROW-10514 - [C++][Parquet] Data inconsistency in parquet-reader output modes ARROW-10953 - [R] Validate when creating Table with schema ARROW-11066 - [Java] Is there a bug in flight AddWritableBuffer ARROW-11134 - [C++][CI] ARM64 job on Travis-CI doesn’t run tests ARROW-11147 - [Python][CI] Parquet tests failing in nightly build with Dask master ARROW-11180 - [Developer] cmake-format pre-commit hook doesn’t run ARROW-11192 - [Documentation] Describe opening Visual Studio so it inherits a working env ARROW-11223 - [Java] BaseVariableWidthVector/BaseLargeVariableWidthVector setNull and getBufferSizeFor is buggy ARROW-11235 - [Python] S3 test failures inside non-default regions ARROW-11239 - [Rust] array::transform::tests::test_struct failed ARROW-11269 - [Rust] Unable to read Parquet file because of mismatch in column-derived and embedded schemas ARROW-11277 - [C++] Fix compilation error in dataset expressions on macOS 10.11 ARROW-11299 - [Python] build warning in python ARROW-11303 - [Release][C++] Enable mimalloc in the windows verification script ARROW-11305 - [Rust]: parquet-rowcount binary tries to open itself as a parquet file ARROW-11311 - [Rust] unset_bit is toggling bits, not unsetting them ARROW-11313 - [Rust] Size hint of iterators is incorrect ARROW-11315 - [Packaging][APT][arm64] Add missing gir1.2 files ARROW-11320 - [C++] Spurious test failure when creating temporary dir ARROW-11322 - [Rust] Arrow `memory` made private is a breaking API change ARROW-11323 - [Rust][DataFusion] ComputeError(“concat requires input of at least one array”)) with queries with ORDER BY or GROUP BY that return no ARROW-11328 - [R] Collecting zero columns from a dataset returns entire dataset ARROW-11334 
- [Python][CI] Nightly pandas builds failing because of internal pandas change ARROW-11337 - [C++] Compilation error with ThreadSanitizer ARROW-11357 - [Rust] take primitive implementation is unsound ARROW-11376 - [C++] ThreadedTaskGroup failure with Thread Sanitizer enabled ARROW-11379 - [C++][Dataset] Reading dataset with filtering on timestamp partition field crashes ARROW-11387 - [Rust] Arrow 3.0.0 release with simd feature doesn’t compile without feature=avx512. ARROW-11391 - [C++] HdfsOutputStream::Write unsafely truncates integers exceeding INT32_MAX ARROW-11394 - [Rust] Slice + Concat incorrect for structs ARROW-11400 - [Python] Pickled ParquetFileFragment has invalid partition_expresion with dictionary type in pyarrow 2.0 ARROW-11403 - [Developer] archery benchmark list: unexpected keyword ‘benchmark_filter’ ARROW-11412 - [Python] Expressions not working with logical boolean operators (and, or, not) ARROW-11427 - [C++] Arrow uses AVX512 instructions even when not supported by the OS ARROW-11448 - [C++] tdigest build failure on Windows with Visual Studio ARROW-11451 - [C++] Fix gcc-4.8 build error ARROW-11452 - [Rust] Parquet reader cannot read file where a struct column has the same name as struct member columns ARROW-11461 - [Flight][Go] GetSchema does not work with Java Flight Server ARROW-11464 - [Python] pyarrow.parquet.read_pandas doesn’t conform to its docs ARROW-11470 - [C++] Overflow occurs on integer multiplications in ComputeRowMajorStrides, ComputeColumnMajorStrides, and CheckTensorStridesValidity ARROW-11472 - [Python][CI] Kartothek integrations build is failing with numpy 1.20 ARROW-11480 - [Python] Segmentation fault reading parquet with date filter with INT96 column ARROW-11483 - [Java][C++][Integration] C++ integration test creates JSON files incompatible with Java 
ARROW-11488 - [Rust]: StructBuilder’s Drop impl leaks memory ARROW-11490 - [C++] BM_ArrowBinaryDict/EncodeLowLevel is not deterministic ARROW-11494 - [Rust] Fix take bench ARROW-11497 - [Python] pyarrow parquet writer for list does not conform with Apache Parquet specification ARROW-11538 - [Python] Segfault reading Parquet dataset with Timestamp filter ARROW-11547 - [Packaging][Conda][Drone] Nightly builds are failed by undefined variable error ARROW-11548 - [C++] RandomArrayGenerator::List size mismatch ARROW-11551 - [C++][Gandiva] castTIMESTAMP(utf8) function doesn’t show error out for invalid inputs ARROW-11560 - [FlightRPC][C++][Python] Interrupting a Flight server results in abort ARROW-11567 - [C++][Compute] Variance kernel has precision issue ARROW-11577 - [Rust] Concat kernel panics on slices of string arrays ARROW-11582 - [R] write_dataset “format” argument default and validation could be better ARROW-11586 - [Rust] [Datafusion] Invalid SQL sometimes panics ARROW-11595 - [C++][NIGHTLY:test-conda-cpp-valgrind] GenerateBitsUnrolled triggers valgrind on uninit inputs ARROW-11596 - [Python][Dataset] SIGSEGV when executing scan tasks with Python executors ARROW-11603 - [Rust] Fix clippy error ARROW-11607 - [Python] Error when reading table with list values from parquet ARROW-11614 - [C++][Gandiva] Fix round() logic to return positive zero when argument is zero ARROW-11617 - [C++][Gandiva] Fix nested if-else optimisation in gandiva ARROW-11620 - [Rust] [DataFusion] Inconsistent use of Box and Arc for TableProvider ARROW-11630 - [Rust] Introduce partial_sort and limit option for sort kernel ARROW-11632 - [Rust] csv::Reader doesn’t propagate schema metadata to RecordBatches ARROW-11639 - [C++][Gandiva] Fix signbit compilation issue in Ubuntu nightly build ARROW-11642 - [C++] Incorrect preprocessor directive for Windows in JVM detection ARROW-11657 - [R] group_by with .drop specified errors ARROW-11658 - [R] Handle mutate/rename inside group_by ARROW-11663 - 
[DataFusion] Master does not compile ARROW-11668 - [C++] Sporadic UBSAN error in FutureStessTest.TryAddCallback ARROW-11672 - [R] Fix string function test failure on R 3.3 ARROW-11681 - [Rust] IPC writers shouldn’t unwrap in destructors ARROW-11686 - [C++]flight-test-integration-client sometimes exits by SIGABRT but does not print the stack trace ARROW-11687 - [Rust][DataFusion] RepartitionExec Hanging ARROW-11694 - [C++] Array Take may dereference absent null bitmap ARROW-11695 - [C++][FlightRPC][Packaging] Update support for disabling TLS server verification for recent gRPC versions ARROW-11717 - [Integration] Intermittent (but frequent) flight integration failures with auth:basic_proto ARROW-11718 - [Rust] IPC writers shouldn’t implicitly finish on drop ARROW-11741 - [C++] Decimal cast failure on big-endian ARROW-11743 - [R] Use pkgdown’s new found ability to autolink Jiras ARROW-11746 - [Developer][Archery] Fix prefer real time check ARROW-11756 - [R] passing a partition as a schema leads to segfaults ARROW-11758 - [C++][Compute] Summation kernel round-off error ARROW-11767 - [C++] Scalar::hash may segfault for null scalars ARROW-11771 - [Developer][Archery] Move benchmark tests (so CI runs them) ARROW-11784 - [Rust][DataFusion] CoalesceBatchesStream doesn’t honor Stream interface ARROW-11785 - [R] Fallback when filtering Table with unsupported expression fails ARROW-11786 - [C++] CMake output noisy ARROW-11788 - [Java] Appending Empty List Vector yields NPE ARROW-11791 - [Rust][DataFusion] RepartitionExec Blocking ARROW-11802 - [Rust][DataFusion] Mixing of crossbeam channel and async tasks can lead to deadlock ARROW-11819 - [Rust] Add link to the doc ARROW-11821 - [Rust] Edit Rust README ARROW-11830 - [C++] gRPC compilation tests occur every time ARROW-11832 - [R] Handle conversion of extra nested struct column ARROW-11836 - Target libarrow_bundled_dependencies.a is not alreay created but is already required. 
ARROW-11845 - [Rust] Debug implementation of Date32Array panics if array contains negative values ARROW-11850 - [GLib] GARROW_VERSION_0_16 macro is missing ARROW-11855 - [C++] [Python] Memory leak in to_pandas when converting chunked struct array ARROW-11857 - [Python] Resource temporarily unavailable when using the new Dataset API with Pandas ARROW-11860 - [Rust] [DataFusion] Add DataFusion logos ARROW-11866 - [C++] Arrow Flight SetShutdownOnSignals cause potential mutex deadlock in gRPC ARROW-11872 - [C++] Array Validation of GPU buffers fails due to incorrect validation check ARROW-11880 - [R] Handle empty or NULL transmute() args properly ARROW-11881 - [Rust][DataFusion] Fix Clippy Lint ARROW-11896 - [Rust] Hang / failure in CI on AMD64 Debian 10 Rust stable test workspace ARROW-11904 - [C++] “pure virtual method called” crash at the end of arrow-csv-test ARROW-11905 - [C++] SIMD info always returning none on MacOS ARROW-11914 - [R] [CI] r-sanitizer nightly is broken ARROW-11918 - [R] [Documentation] Docs cleanups ARROW-11923 - [CI] Update branch name for dask dev integration tests ARROW-11937 - [C++] GZip codec hangs if flushed twice ARROW-11941 - [Dev] “DEBUG=1 merge_arrow_pr.py” updates Jira issue ARROW-11942 - [C++] If tasks are submitted quickly the thread pool may fail to spin up new threads ARROW-11945 - [R] filter doesn’t accept negative numbers as valid ARROW-11956 - [C++] Fix system re2 dependency detection for static library ARROW-11965 - [R][Docs] Fix install.packages command in R dev docs ARROW-11970 - [C++][CI] Fix Valgrind failures ARROW-11971 - [Packaging] Vcpkg patch doesn’t apply on windows due to line endings ARROW-11975 - [CI][GLib] Failed to update gcc ARROW-11976 - [C++] Sporadic TSAN error in TestThreadPool.SetCapacity ARROW-11983 - [Python] ImportError calling pyarrow from_pandas within ThreadPool ARROW-11997 - [Python] concat_tables crashes python interpreter ARROW-12003 - [R] Fix NOTE re undefined global function group_by_drop_default 
ARROW-12006 - [Java] Fix checkstyle config to work on Windows ARROW-12012 - [Java] [JDBC] BinaryConsumer cannot reallocate memory correctly ARROW-12013 - [C++][FlightRPC] Failed to detect gRPC version ARROW-12015 - [Rust] [DataFusion] Integrate doc-comment crate to ensure readme examples remain valid ARROW-12028 - [Rust][DataFusion] Unsupported GROUP BY for Timestamp(Millisecond, None) ARROW-12029 - Remove args from FeatherReader$create v2 ARROW-12033 - [Docs] Fix link in developers/benchmarks.html ARROW-12041 - [C++] Fix string description of tensor IPC messages ARROW-12051 - [GLib] Intermittent CI failure in test_add_column_type(TestCSVReader::#read::options) ARROW-12057 - [Python] Remove direct usage of pandas’ Block subclasses ARROW-12065 - [C++][Python] Segfault reading JSON file ARROW-12067 - [Python][Doc] Document pyarrow_(un)wrap_scalar ARROW-12073 - [R] Fix R CMD check NOTE about ‘X_____X’ ARROW-12076 - [Rust] Fix build ARROW-12077 - [C++] Out-of-bounds write in ListArray::FromArrays ARROW-12086 - [C++] offline builds does not use ARROW_$LIBRARY_URL to search for packages ARROW-12088 - [Python][C++] Warning about offsetof in pyarrow.dataset.RecordBatchIterator ARROW-12089 - [Doc] Fix warnings when building Sphinx docs ARROW-12100 - [C#] Cannot round-trip record batch with PyArrow ARROW-12103 - [C++] “load of misaligned address” in Parquet reader ARROW-12112 - [CI] No space left on device - AMD64 Conda Integration test ARROW-12112 - [CI] No space left on device - AMD64 Conda Integration test ARROW-12113 - [R] Fix rlang deprecation warning from check_select_helpers() ARROW-12130 - [C++] Arm64 buid failed if -DARROW_SIMD_LEVEL=NONE ARROW-12138 - [Go][IPC] ARROW-12140 - [C++][CI] Valgrind failure on Grouper tests ARROW-12145 - [Developer][Archery] Flaky test: test_static_runner_from_json ARROW-12149 - [Dev] Archery benchmark test case is failing ARROW-12154 - [C++][Gandiva] Fix gandiva crash in certain OS/CPU combinations ARROW-12155 - [R] Require Table 
columns to be same length ARROW-12161 - [C++][R] Async streaming CSV reader deadlocking when being run synchronously from datasets ARROW-12161 - [C++][R] Async streaming CSV reader deadlocking when being run synchronously from datasets ARROW-12169 - [C++] Fix compressed file reading with an empty stream at end of file ARROW-12171 - [Rust] Clippy error ARROW-12172 - [Python][Packaging] Pass python version as setuptools pretend version in the macOS wheel builds ARROW-12178 - [CI] Update setuptools in the ubuntu images ARROW-12186 - [Rust][DataFusion] Fix regexp_match test ARROW-12209 - [JS] @apache-arrow/ts nor apache-arrow does not compile ARROW-12220 - [C++][CI] Thread sanitizer failure ARROW-12226 - [C++] ASAN error in s3fs_test.cc ARROW-12227 - [R] Fix RE2 and median nightly build failures ARROW-12235 - [Rust][DataFusion] LIMIT returns incorrect results when used with several small partitions ARROW-12241 - [Python] Parallel csv reader cancellation test kills pytest ARROW-12250 - [Rust] Failing test arrow::arrow_writer::tests::fixed_size_binary_single_column ARROW-12254 - [Rust][DataFusion] Limit keeps polling input after limit is reached ARROW-12258 - [R] Never do as.data.frame() on collect(as_data_frame = FALSE) ARROW-12262 - [Doc][C++][Python] Docs built and pushed with S3 and Flight disabled ARROW-12267 - [Rust] JSON writer does not support timestamp types ARROW-12273 - [JS] Coveralls does not work anymore ARROW-12279 - [Rust][DataFusion] Add test for null handling in hash join (ARROW-12266) ARROW-12294 - [Rust] Fix Boolean Kleene Kernels with no Remainder ARROW-12299 - [Python] pq.write_to_dataset does not recognize S3FileSystem ARROW-12300 - [C++] ArrowCUDA erroneously links to CUDA Runtime while only using CUDA Driver API ARROW-12313 - [Rust] [Ballista] Benchmark documentation out of date ARROW-12314 - [Python] pq.read_pandas with use_legacy_dataset=False does not accept columns as a set (kartothek integration failure) ARROW-12327 - [Dev] Use pull request’s 
head remote when submitting crossbow jobs via the comment bot ARROW-12330 - [Developer] Restore values in counters column of Archery benchmark ARROW-12334 - [Rust] [Ballista] Aggregate queries producing incorrect results ARROW-12342 - [Packaging] Fix tabulation in crossbow templates for submitting nightly builds ARROW-12357 - [Archery] Error running “crossbow submit …” ARROW-12377 - [Doc][Java] Java doc build broken ARROW-12379 - [C++][CI] Thread sanitizer failure in SerialExecutor ARROW-12382 - [C++][CI] Conda nightly jobs fail due to not bundling xsimd ARROW-12385 - [R] [CI] fix cran picking in CI ARROW-12390 - [Rust] Inline Inline from_trusted_len_iter, try_from_trusted_len_iter, extend_from_slice ARROW-12401 - [R] Fix guard around dataset___Scanner__TakeRows ARROW-12405 - [Packaging] Fix apt artifact patterns and artifact uploading from travis ARROW-12408 - [R] Delete Scan() bindings ARROW-12421 - [Rust] [DataFusion] topk_query test fails in master ARROW-12421 - [Rust] [DataFusion] topk_query test fails in master ARROW-12429 - [C++] MergedGeneratorTestFixture is incorrectly instantiated ARROW-12433 - [Rust] Builds failing due to new flatbuffer release introducing const generics ARROW-12437 - [Rust] [Ballista] Ballista plans must not include RepartitionExec ARROW-12440 - [Release] Various packaging, release script and release verification script fixes ARROW-12466 - [Python] Comparing array to None raises error ARROW-12475 - [C++] Build warning from thread_pool_benchmark.cc ARROW-12487 - [C++][Dataset] ScanBatches() hangs if there’s an error during scanning ARROW-12495 - [C++][Python] NumPy buffer sets is_mutable_ to true but does not set mutable_data_ when the NumPy array is writable PARQUET-1655 - [C++] Decimal comparisons used for min/max statistics are not correct PARQUET-2008 - [C++] Wrong information written in RowGroup::total_byte_size" />
<link rel="canonical" href="https://arrow.apache.org/release/4.0.0.html" />
<meta property="og:url" content="https://arrow.apache.org/release/4.0.0.html" />
<meta property="og:site_name" content="Apache Arrow" />
<meta property="og:image" content="https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png" />
<meta property="og:type" content="article" />
<meta property="article:published_time" content="2024-04-29T17:30:49-04:00" />
<meta name="twitter:card" content="summary_large_image" />
<meta property="twitter:image" content="https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png" />
<meta property="twitter:title" content="Apache Arrow 4.0.0 Release" />
<meta name="twitter:site" content="@ApacheArrow" />
<script type="application/ld+json">
{"@context":"https://schema.org","@type":"BlogPosting","dateModified":"2024-04-29T17:30:49-04:00","datePublished":"2024-04-29T17:30:49-04:00","description":"Apache Arrow 4.0.0 (26 April 2021) This is a major release covering more than 3 months of development. Download Source Artifacts Binary Artifacts For CentOS For Debian For Python For Ubuntu Git tag Contributors This release includes 719 commits from 114 distinct contributors. $ git shortlog -sn apache-arrow-3.0.0..apache-arrow-4.0.0 65 Antoine Pitrou 47 Andrew Lamb 41 Heres, Daniel 40 David Li 37 Sutou Kouhei 33 Neal Richardson 30 Weston Pace 28 Jorge C. Leitao 26 Krisztián Szűcs 25 Ian Cook 21 Dominik Moritz 20 Andy Grove 19 Yibo Cai 18 Joris Van den Bossche 17 Neville Dipale 17 Jonathan Keane 17 Ritchie Vink 12 Mike Seddon 12 Benjamin Kietzman 11 Mauricio Vargas 10 Qingping Hou 10 Diana Clarke 8 Micah Kornfield 7 Matthew Topol 7 Dmitry Patsura 5 Projjal Chanda 5 Kenta Murata 4 Anthony Louis 4 Ximo Guanter 4 liyafan82 3 Andre Braga Reis 3 Kazuaki Ishizaki 3 Maarten A. Breddels 3 Uwe L. 
Korn 3 ptaylor 3 Steven Fackler 3 Sagnik Chakraborty 3 Nic Crane 2 Marc Prud&#39;hommeaux 2 Raphael Taylor-Davies 2 João Pedro 2 Yordan Pavlov 2 emkornfield 2 Max Burke 2 Florian Müller 2 Ben Chambers 2 mqy 2 Christoph Schulze 2 Manoj Karthick 2 Sathis Kumar 2 Ryan Jennings 2 Ruan Pearce-Authers 2 Tao He 2 Eric Burden 2 Tyrel Rink 2 Romain Francois 2 Rok 1 witchard 1 Adam Lippai 1 Albert Villanova del Moral 1 Alessandro Molina 1 Ali 1 Andrew Wieteska 1 Bob Tinsman 1 Brian Hulette 1 Bryan Cutler 1 Clcanny 1 Daniel Russo 1 Daniël Heres 1 Eduardo Ponce 1 Evan Chan 1 FawnD2 1 Felix Zhu 1 Fernando Herrera 1 Fiona La 1 François Saint-Jacques 1 GALI PREM SAGAR 1 Gert Hulselmans 1 Ha Thi Tham 1 Hongze Zhang 1 Ilya Biryukov 1 Ivan Smirnov 1 James Winegar 1 Joe Roberts 1 Johannes Müller 1 Jörn Horstmann 1 Mahmut Bulut 1 Marco Gorelli 1 Marko Mikulicic 1 Markus Silberstein Hont 1 Martin Nowak 1 Matt Brubeck 1 Matt Summersgill 1 Max Meldrum 1 Nathaniel Bauernfeind 1 Nga Tran 1 Nick Bruno 1 Rok Mihevc 1 Roman Karlstetter 1 Sam Albers 1 Simon Bertron 1 Szangin 1 Truc Lam Nguyen 1 Weichen Xu 1 Ying Zhou 1 frank400 1 ivan 1 jpeeter 1 martinblostein 1 nmcdonnell-kx 1 pierwill 1 sjgupta2 1 sundy-li 1 ARF1 Patch Committers The following Apache committers merged contributed patches to the repository. $ git shortlog -csn apache-arrow-3.0.0..apache-arrow-4.0.0 157 Andrew Lamb 101 Antoine Pitrou 93 Neal Richardson 88 Krisztián Szűcs 72 Sutou Kouhei 41 David Li 30 Benjamin Kietzman 25 Neville Dipale 22 Micah Kornfield 19 Jorge C. Leitao 16 Andy Grove 14 Praveen 11 Joris Van den Bossche 9 GitHub 8 Yibo Cai 4 Uwe L. 
Korn 3 Sebastien Binet 2 liyafan82 1 Kenta Murata 1 Eric Erhardt 1 Chao Sun 1 Bryan Cutler Changelog Apache Arrow 4.0.0 (2021-04-26) New Features and Improvements ARROW-951 - [JS] Fix generated API documentation ARROW-2229 - [C++] Write CSV files from RecordBatch, Table ARROW-3690 - [Rust] Add Rust to the format integration testing ARROW-6103 - [Java] Stop using the maven release plugin ARROW-6248 - [Python] Use FileNotFoundError in HadoopFileSystem.open() in Python 3 ARROW-6455 - [C++] Implement ExtensionType for non-UTF8 Unicode data ARROW-6604 - [C++] Add support for nested types to MakeArrayFromScalar ARROW-7215 - [C++][Gandiva] Implement castVARCHAR(numeric_type) functions in Gandiva ARROW-7364 - [Rust] Add cast options to cast kernel ARROW-7633 - [C++][CI] Create fuzz targets for tensors and sparse tensors ARROW-7808 - [Java][Dataset] Implement Datasets Java API ARROW-7906 - [C++][Python] Full functionality for ORC format ARROW-8049 - [C++] Upgrade bundled Thrift version to 0.13.0 ARROW-8282 - [C++/Python][Dataset] Support schema evolution for integer columns ARROW-8284 - [C++][Dataset] Schema evolution for timestamp columns ARROW-8630 - [C++][Dataset] Pass schema including all materialized fields to catch CSV edge cases ARROW-8631 - [C++][Dataset] Add ConvertOptions and ReadOptions to CsvFileFormat ARROW-8658 - [C++][Dataset] Implement subtree pruning for FileSystemDataset::GetFragments ARROW-8732 - [C++] Let Futures support cancellation ARROW-8771 - [C++] Add boost/process library to build support ARROW-8796 - [Rust] Allow parquet to be written directly to memory ARROW-8797 - [C++] Support Flight RPC among diffent endian platforms ARROW-8900 - [C++] Respect HTTP(S)_PROXY for S3 Filesystems and/or expose proxy options as parameters ARROW-8919 - [C++] Add “DispatchBest” APIs to compute::Function that selects a kernel that may require implicit casts to invoke ARROW-9128 - [C++] Implement string space trimming kernels: trim, ltrim, and rtrim ARROW-9149 - [C++] 
Improve configurability of RandomArrayGenerator::ArrayOf ARROW-9196 - [C++] Make temporal casts work on Scalar inputs ARROW-9318 - [C++][Parquet] Encryption key management tools ARROW-9731 - [C++][Dataset] Port “head” method from R to C++ Dataset Scanner ARROW-9749 - [C++][Dataset] Extract format-specific scan options from FileFormat ARROW-9777 - [Rust] Implement IPC changes to catch up to 1.0.0 format ARROW-9856 - [R] Add bindings for string compute functions ARROW-10014 - [C++] TaskGroup::Finish should execute tasks ARROW-10089 - [R] inject base class for Array, ChunkedArray and Scalar ARROW-10183 - [C++] Create a ForEach library function that runs on an iterator of futures ARROW-10195 - [C++] Add string struct extract kernel using re2 ARROW-10250 - [FlightRPC][C++] Remove default constructor for FlightClientOptions ARROW-10255 - [JS] Reorganize imports and exports to be more friendly to ESM tree-shaking ARROW-10297 - [Rust] Parameter for parquet-read to output data in json format ARROW-10299 - [Rust] Support reading and writing V5 of IPC metadata ARROW-10305 - [R] Filter with regular expressions ARROW-10306 - [C++] Add string replacement kernel ARROW-10349 - [Python] Build and publish aarch64 wheels ARROW-10354 - [Rust] [DataFusion] Add support for regex extract ARROW-10360 - [CI] Bump github actions cache version ARROW-10372 - [C++][Dataset] Read compressed CSVs ARROW-10406 - [C++] Unify dictionaries when writing IPC file in a single shot ARROW-10420 - [C++] FileSystem::OpenInput{File,Stream} should accept a MemoryPool ARROW-10421 - [R] Feather reader/writer should accept a MemoryPool ARROW-10438 - [C++][Dataset] Partitioning::Format on nulls ARROW-10520 - [C++][R] Implement add/remove/replace for RecordBatch ARROW-10570 - [R] Use Converter API to convert SEXP to Array/ChunkedArray ARROW-10580 - [C++] When Validating, ensure DenseUnionArray offsets are increasing ARROW-10606 - [C++][Compute] Support casts to and from Decimal256 type. 
ARROW-10655 - [C++] Add LRU cache facility ARROW-10734 - [R] Build and test on Solaris ARROW-10735 - [R] Remove arrow-without-arrow wrapping ARROW-10766 - [Rust] Compute nested definition and repetition for list arrays ARROW-10797 - [C++] Investigate faster random generation for tests and benchmarks ARROW-10816 - [Rust] [DataFusion] Implement INTERVAL ARROW-10831 - [C++][Compute] Implemement quantile kernel ARROW-10846 - [C++] Add async filesystem operations ARROW-10880 - [Java] Support compressing RecordBatch IPC buffers by LZ4 ARROW-10882 - [Python][Dataset] Writing dataset from python iterator of record batches ARROW-10895 - [C++][Gandiva] Implement bool to varchar cast function in Gandiva ARROW-10903 - [Rust] Implement FromIter&lt;Option&lt;Vec&lt;u8&gt;&gt;&gt; constructor for FixedSizeBinaryArray ARROW-11022 - [Rust] [DataFusion] Upgrade to tokio 1.0 ARROW-11070 - [C++] Implement power / exponentiation compute kernel ARROW-11074 - [Rust][DataFusion] Implement predicate push-down for parquet tables ARROW-11081 - [Java] Make IPC option immutable ARROW-11108 - [Rust] Improve performance of MutableBuffer ARROW-11141 - [Rust]: Miri checks ARROW-11149 - [Rust] create_batch_empty - support List, LargeList ARROW-11150 - [Rust] Set up bi-weekly Rust sync call and update website ARROW-11154 - [CI][C++] Move homebrew crossbow tests off of Travis-CI ARROW-11156 - [Rust][DataFusion] Create hashes vectorized in hash join ARROW-11174 - [C++][Dataset] Make Expressions available for projection ARROW-11179 - [Format] Make comments in fb files friendly to rust doc ARROW-11183 - [Rust] [Parquet] LogicalType::TIMESTAMP_NANOS missing ARROW-11191 - [C++] Use FnOnce for TaskGroup’s tasks instead of std::function ARROW-11216 - [Rust] Improve documentation for StringDictionaryBuilder ARROW-11220 - [Rust] DF Implement GROUP BY support for Boolean ARROW-11222 - [Rust] [Arrow] catch up with flatbuffers 0.8.1 ARROW-11246 - DF - Add type to Unexpected accumulator state message ARROW-11254 
- [Rust][DataFusion] Add SIMD and snmalloc flags as options to benchmarks ARROW-11260 - [C++][Dataset] Don’t require dictionaries for reading dataset with schema-based Partitioning ARROW-11265 - [Rust] Made bool not convertable to bytes ARROW-11268 - [Rust][DataFusion] Support specifying repartitions in MemTable ARROW-11270 - [Rust] Use slices for simple array data buffer access ARROW-11279 - [Rust][Parquet] ArrowWriter Definition Levels Memory Usage ARROW-11284 - [R] Support dplyr verb transmute() ARROW-11289 - [Rust] [DataFusion] Support GROUP BY for Dictionary columns ARROW-11290 - [Rust][DataFusion] Address hash aggregate performance with high number of groups ARROW-11291 - [Rust] implement extend for MutableBuffer (from iterator) ARROW-11300 - [Rust][DataFusion] Improve hash aggregate performance with large number of groups in ARROW-11308 - [Rust] [Parquet] Add Arrow decimal array writer ARROW-11309 - [Release][C#] Use .NET 3.1 for verification ARROW-11310 - [Rust] Implement arrow JSON writer ARROW-11314 - [Release][APT][Yum] Add support for verifying arm64 packages ARROW-11317 - [Rust] Test the prettyprint feature in CI ARROW-11318 - [Rust] Support pretty printing timestamp, date, and time types ARROW-11319 - [Rust] [DataFusion] Improve test comparisons to record batch ARROW-11321 - [Rust][DataFusion] Fix DataFusion compilation error ARROW-11325 - [Packaging][C#] Release Apache.Arrow.Flight and Apache.Arrow.Flight.AspNetCore ARROW-11329 - [Rust] Do not rebuild the library on every change ARROW-11330 - [Rust][DataFusion] Add ExpressionVisitor pattern ARROW-11332 - [Rust] Use MutableBuffer in take_string instead of Vec ARROW-11333 - [Rust] Suport creating arbitrary nested empty arrays ARROW-11336 - [C++][Doc] Improve Developing on Windows docs ARROW-11338 - [R] Bindings for quantile and median ARROW-11340 - [C++] Add vcpkg.json manifest to cpp project root ARROW-11343 - [DataFusion] Simplified example ARROW-11346 - [C++][Compute] Implement quantile kernel 
benchmark ARROW-11349 - [Rust] Add from_iter_values to create arrays from T instead of Option&lt;T&gt; ARROW-11350 - [C++] Bump dependency versions ARROW-11354 - [Rust] Speed-up casts of dates and times ARROW-11355 - [Rust] Align Date type with spec ARROW-11358 - [Rust] Add benchmark for concatenating small arrays ARROW-11360 - [Rust] [DataFusion] Improve CSV “No files found” error message ARROW-11361 - [Rust] Build buffers from iterator of booleans ARROW-11362 - [Rust][DataFusion] Use iterator APIs in to_array_of_size to improve performance ARROW-11365 - [Rust] [Parquet] Implement parsers for v2 of the text schema ARROW-11366 - [Rust][DataFusion] Add Constant Folding / Support boolean literal in equality expression ARROW-11367 - [C++] Implement approximante quantile utility ARROW-11369 - [DataFusion] Split expressions.rs ARROW-11372 - Support RC verification on macOS-ARM64 ARROW-11373 - [Python][Docs] Add example of specifying type for a column when reading csv file ARROW-11374 - [Python] Make legacy pyarrow.filesystem / pyarrow.serialize warnings more visisble ARROW-11375 - [Rust] CI fails due to deprecation warning in clippy ARROW-11377 - [C++][CI] Add ThreadSanitizer nightly build ARROW-11383 - [Rust] use trusted len on bit ops ARROW-11386 - [Release] Fix post documents update script ARROW-11389 - [Rust] Inconsistent comments for datatypes ARROW-11395 - [DataFusion] Support custom optimizations ARROW-11401 - [Rust][DataFusion] Pass slices instead of Vec in DataFrame API ARROW-11404 - [Rust][DataFusion] Upgrade to aHash 0.7 ARROW-11405 - [DataFusion] Support multiple custom nodes ARROW-11406 - [CI][C++] Fix caching on Travis-CI builds ARROW-11408 - Add window support to datafusion readme ARROW-11411 - [Packaging][Linux] Disable arm64 nightly builds ARROW-11414 - [Rust] Reduce copies in Schema::try_merge ARROW-11417 - [Integration] Add integration test for buffer compression ARROW-11418 - [Doc] Add IPC buffer compression to support matrix ARROW-11421 - 
[Rust][DataFusion] Support group by Date32 ARROW-11422 - [C#] Add support for decimals ARROW-11423 - [R] value_counts and some StructArray methods ARROW-11425 - [C++][Compute] Improve quantile kernel for integers ARROW-11426 - [Rust][DataFusion] EXTRACT support ARROW-11428 - [Rust] Add power kernel ARROW-11429 - Make string comparisson kernels generic over Utf8 and LargeUtf8 ARROW-11430 - [Rust] Kernel to combine two arrays based on boolean mask ARROW-11431 - [Rust] [DataFusion] Add support for the SQL HAVING clause ARROW-11435 - Allow creating ParquetPartition from external crate ARROW-11436 - [Rust] Allow non-sized iterators in Primitive::from_iter ARROW-11437 - [Rust] Simplify benches ARROW-11438 - Unsupported ast node Value(Boolean(true)) in sqltorel ARROW-11439 - [Rust] Add year support to temporal kernel ARROW-11440 - [Rust] [DataFusion] Add method to CsvExec to get CSV schema ARROW-11442 - [Rust] Expose the logic used to interpret date/times ARROW-11443 - [Rust] Write datetime information for Date64 Type in csv writer ARROW-11444 - [Rust][DataFusion] Pass slices instead of &amp;Vec to functions ARROW-11446 - [DataFusion] Support scalars in builtin functions ARROW-11447 - [Rust] Add shift kernel ARROW-11449 - [CI][R][Windows] Use ccache ARROW-11457 - [Rust] Make string comparisson kernels generic over Utf8 and LargeUtf8 ARROW-11459 - [Rust] Allow ListArray of primitives to be built from iterator ARROW-11462 - [Developer] Remove needless quote from the default DOCKER_VOLUME_PREFIX ARROW-11463 - [Python] Allow configuration of IpcWriterOptions 64Bit from PyArrow ARROW-11466 - [Flight][Go] Add BasicAuth and BearerToken handlers for Go ARROW-11467 - [R] Fix reference to json_table_reader() in R docs ARROW-11468 - [R] Allow user to pass schema to read_json_arrow() ARROW-11474 - [C++] Update bundled re2 version ARROW-11476 - [Rust][DataFusion] Test running of TPCH benchmarks in CI ARROW-11477 - [R][Doc] Reorganize and improve README and vignette content ARROW-11478 
- [R] Consider ways to make arrow.skip_nul option more user-friendly ARROW-11479 - [Rust][Parquet] Add method to return compressed size of row group ARROW-11481 - [Rust] More cast implementations ARROW-11484 - [Rust] Derive Clone for ExecutionContext ARROW-11486 - [Website] Use Jekyll 4 and webpack to support Ruby 3.0 or later ARROW-11489 - [Rust][DataFusion] Make DataFrame Send+Sync ARROW-11491 - [Rust] Support json schema inference for nested list and struct ARROW-11493 - [CI][Packaging][deb][RPM] Test built packages ARROW-11500 - [R] Allow bundled build script to run on Solaris ARROW-11501 - [C++] endianness check does not work on Solaris ARROW-11504 - [Rust] verify Datatype in ListArray::from(ArrayDataRef) ARROW-11505 - [Rust] Add support for LargeUtf8 in csv-writer ARROW-11507 - [R] Bindings for GetRuntimeInfo ARROW-11510 - [Python] Add note that pip &gt;= 19.0 is required to get binary packages ARROW-11511 - [Rust] Replace Arc&lt;ArrayData&gt; by ArrayData ARROW-11512 - [Packaging][deb] Add missing gRPC dependency for Ubuntu 21.04 ARROW-11513 - [R] Bindings for sub/gsub ARROW-11516 - [R] Allow all C++ compute functions to be called by name in dplyr ARROW-11539 - [Developer][Archery] Change items_per_seconds units ARROW-11541 - [C++][Compute] Implement approximate quantile kernel ARROW-11542 - [Rust] json reader should not crash when reading nested list ARROW-11544 - [Rust] [DataFusion] Implement as_any for AggregateExpr ARROW-11545 - [Rust] [DataFusion] SendableRecordBatchStream should implement Sync ARROW-11556 - [C++] Minor benchmark improvements ARROW-11557 - [Rust] Add table de-registration to DataFusion ExecutionContext ARROW-11559 - [C++] Improve flatbuffers verification limits ARROW-11559 - [C++] Improve flatbuffers verification limits ARROW-11561 - [Rust][DataFusion] Add Send + Sync to MemTable::load ARROW-11563 - [Rust] Support Cast(Utf8, TimeStamp(Nanoseconds, None)) ARROW-11568 - [C++][Compute] Mode kernel performance is bad in some conditions 
ARROW-11570 - [Rust] ScalarValue - support Date64 ARROW-11571 - [CI] Cancel stale Github Actions workflow runs ARROW-11572 - [Rust] Add a kernel for division by single scalar ARROW-11573 - [Developer][Archery] Google benchmark now reports run type ARROW-11574 - [Rust][DataFusion] Upgrade sqlparser to 0.8 to support parsing all TPC-H queries ARROW-11575 - [Developer][Archery] Expose execution time in benchmark results ARROW-11576 - [Rust] Remove unused variable in example ARROW-11580 - [C++] Add CMake option ARROW_DEPENDENCY_SOURCE=VCPKG ARROW-11589 - [R] Add methods for modifying Schemas ARROW-11590 - [C++] Move CSV background generator to IO thread pool ARROW-11591 - [C++][Compute] Prototype version of hash aggregation ARROW-11592 - [Rust] Typo in comment ARROW-11594 - [Rust] Support pretty printing with NullArrays ARROW-11597 - [Rust] Split datatypes in a module ARROW-11598 - [Rust] Split buffer.rs in smaller files ARROW-11599 - [Rust] Add function to create array with all nulls ARROW-11601 - [C++][Dataset] Expose pre-buffering in ParquetFileFormatReaderOptions ARROW-11606 - [Rust] [DataFusion] Need guidance on HashAggregateExec reconstruction ARROW-11610 - [C++] Download boost from sourceforge instead of bintray ARROW-11612 - [C++] Rebuild trimmed boost bundle for 1.75.0 ARROW-11613 - [R] Move nightly C++ builds off of bintray ARROW-11616 - [Rust][DataFusion] Expose collect_partitioned for DataFrame ARROW-11621 - [CI][Gandiva][Linux] Fix Crossbow setup failure ARROW-11626 - [Rust][DataFusion] Move DataFusion examples to own project to reduce nr dependencies ARROW-11627 - [Rust] Typed allocator ARROW-11637 - [CI][Conda] Update nightly clean target platforms and packages list ARROW-11641 - [CI] Use docker buildkit’s inline cache to reuse build cache across different hosts ARROW-11649 - [R] Add support for null_fallback to R ARROW-11651 - [Rust][DataFusion] Implement Postgres Length Functions ARROW-11653 - Ascii/unicode functions ARROW-11655 - Pad/trim functions 
ARROW-11656 - Left over functions/fixes ARROW-11659 - [R] Preserve group_by .drop argument ARROW-11662 - [C++] Support sorting for decimal data type. ARROW-11664 - [Rust] Cast to LargeUtf8 ARROW-11665 - [Python] Document precision and scale parameters of decimal128() ARROW-11666 - [Integration] Add endianness “gold” integration file for decimal256 ARROW-11667 - [Rust] Add docs for utf8 comparison functions ARROW-11669 - [Rust] [DataFusion] Remove concurrency field from GlobalLimitExec ARROW-11671 - [Rust][DataFusion] Clean up docs on Expr ARROW-11677 - [C++][Dataset] Write documentation ARROW-11680 - [C++] Add vendored version of folly’s spsc queue ARROW-11683 - [R] Support dplyr::mutate() ARROW-11685 - [C++] Typo in future_test.cc ARROW-11688 - [Rust] Casts between utf8 and large-utf8 ARROW-11690 - [Rust][DataFusion] Avoid Expr::clone in Expr builder methods ARROW-11692 - [Rust][DataFusion] Improve documentation on Optimizer ARROW-11693 - [C++] Add string length kernel ARROW-11700 - [R] Internationalize error handling in tidy eval ARROW-11701 - [R] Implement dplyr::relocate() ARROW-11703 - [R] Implement dplyr::arrange() ARROW-11704 - [R] Wire up dplyr::mutate() for datasets ARROW-11707 - Support CSV schema inference without IO ARROW-11708 - Clean up Rust 2021 linting warning ARROW-11709 - [Rust][DataFusion] Move `expressions` and `inputs` into LogicalPlan rather than helpers in util ARROW-11710 - [Rust][DataFusion] Implement ExprRewriter to avoid tree traversal redundancy ARROW-11719 - Support merged schema for memory table ARROW-11721 - json schema inference should return Schema type instead of SchemaRef ARROW-11722 - Improve error message in FFI ARROW-11724 - [C++] Namespace collisions with protobuf 3.15 ARROW-11725 - [Rust][DataFusion] Make use of the new divide_scalar kernel in arrow ARROW-11727 - [C++][FlightRPC] Use TDigest to estimate latency quantiles in benchmark ARROW-11730 - [C++] Add implicit Future(Status) constructor for convenience ARROW-11733 - 
[Rust][DataFusion] Support hash repartitioning ARROW-11734 - [C++] vendored safe-math.h does not compile on Solaris ARROW-11735 - [R] Allow Parquet and Arrow Dataset to be optional components ARROW-11736 - [R] Allow string compute functions to be optional ARROW-11737 - [C++] Patch vendored xxhash for Solaris ARROW-11738 - [Rust][DataFusion] Concat Functions ARROW-11740 - [C++] posix_memalign not declared in scope on Solaris ARROW-11742 - [Rust] [DataFusion] Add Expr::is_null and Expr::is_not_null functions ARROW-11744 - [C++] Add xsimd dependency ARROW-11745 - [C++] Improve configurability of random data generation ARROW-11750 - [Python][Dataset] Add support for project expressions ARROW-11752 - [R] Replace usage of testthat::expect_is() ARROW-11753 - [Rust][DataFusion] Add test for Join Statement: Schema contains duplicate unqualified field name ARROW-11754 - [R] Support dplyr::compute() ARROW-11761 - [C++] Increase public API testing ARROW-11766 - [R] Better handling for missing compression codecs on Linux ARROW-11768 - [C++][CI] Make s390x build non-optional ARROW-11773 - [Rust] Allow json writer to write out JSON arrays as well as newline formatted objects ARROW-11774 - [R] one-line install from source on macOS ARROW-11775 - [Rust][DataFusion] Feature Flags for Dependencies ARROW-11777 - [Rust] impl AsRef for StringBuilder/BinaryBuilder ARROW-11778 - Cast from large-utf8 to numerical arrays ARROW-11779 - [Rust] make alloc module public ARROW-11790 - [Rust][DataFusion] Change plan builder signature to take Vec&lt;Expr&gt; rather than &amp;[Expr] ARROW-11794 - [Go] Add concurrent-safe ipc.FileReader.RecordAt(i) ARROW-11795 - [MATLAB] Migrate MATLAB Interface for Apache Arrow design doc to Markdown ARROW-11797 - [C++][Dataset] Provide Scanner methods to yield/visit scanned batches ARROW-11798 - [Integration] Update testing submodule ARROW-11799 - [Rust] String and Binary arrays created with incorrect length from unbound iterator ARROW-11801 - [C++] Remove bad 
header guard in filesystem/type_fwd.h ARROW-11803 - [Rust] [Parquet] Support v2 LogicalType ARROW-11806 - [Rust][DataFusion] Optimize inner join creation of indices ARROW-11820 - Added macro create_native to construct impl ARROW-11822 - Support case sensitive for function ARROW-11824 - [Rust] [Parquet] Use logical types in Arrow writer ARROW-11825 - [Rust][DataFusion] Add mimalloc as option to benchmarks ARROW-11833 - [C++] Vendored fast_float errors for emscripten (architecture flag missing) ARROW-11837 - [C++][Dataset] Expose originating fragment as a property of ScanTask ARROW-11838 - [C++] Support reading IPC data with shared dictionaries ARROW-11839 - [C++] Rewrite bit-unpacking optimizations using xsimd ARROW-11842 - [Rust][Parquet] Use more efficient clone_from in get_batch_with_dict ARROW-11852 - [Documentation] Update CONTRIBUTING to explain Contributor role ARROW-11856 - [C++] Remove unused reference to RecordBatchStreamWriter ARROW-11858 - [GLib] Gandiva Filter in GLib ARROW-11859 - [GLib] GArrowArray: concatenate is missing ARROW-11864 - [R] Document arrow.int64_downcast option ARROW-11870 - [Dev] Automatically run merge script in venv ARROW-11876 - [Website] Update governance page ARROW-11877 - [C++] Add initial microbenchmarks for Dataset internals ARROW-11879 - [Rust][DataFusion] ExecutionContext::sql should optimize query plan ARROW-11883 - [C++] Add ConcatMap, MergeMap, and an async-reentrant version of Map ARROW-11887 - [C++] Add asynchronous read to streaming CSV reader ARROW-11894 - [Rust][DataFusion] Change flight server example to use DataFrame API ARROW-11895 - [Rust][DataFusion] Add support for extra column statistics ARROW-11898 - [Rust] Pretty print columns ARROW-11899 - [Java] Refactor the compression codec implementation into core/Arrow specific parts ARROW-11900 - [Website] Add Yibo to committer list ARROW-11906 - [R] Make FeatherReader print method more informative ARROW-11907 - [C++] Use our own executor in S3FileSystem ARROW-11910 - 
[Packaging][Ubuntu] Drop support for 16.04 ARROW-11911 - [Website] Add protobuf vs arrow to FAQ ARROW-11912 - [R] Remove args from FeatherReader$create ARROW-11913 - [Rust] Improve performance of StringBuilder ARROW-11920 - [R] Add r/libarrow to make clean ARROW-11921 - [R] Set LC_COLLATE in r/data-raw/codegen.R ARROW-11924 - [C++] Provide streaming output from GetFileInfo ARROW-11925 - [R] Add `between` method for arrow_dplyr_query ARROW-11927 - [Rust][DataFusion] Support limit push down ARROW-11931 - [Go][CI] Bump CI to use Go 1.15 ARROW-11935 - [C++] Add push generator ARROW-11944 - [Developer] Achery benchmark diff regression: cannot compare jsons ARROW-11949 - [Ruby] Accept raw Ruby objects as sort key and options ARROW-11951 - [Rust] Remove OffsetSize::prefix ARROW-11952 - [Rust] Make ArrayData –&gt; GenericListArray fallable instead of `panic!` ARROW-11954 - [C++] arrow/util/io_util.cc does not compile on Solaris ARROW-11955 - [Rust][DataFusion] Support Union ARROW-11958 - [GLib] GArrowChunkedArray: combine is missing ARROW-11959 - [Rust][DataFusion] Fix logging of optimized plan ARROW-11962 - [Rust][DataFusion] Update Datafusion Docs / readme ARROW-11969 - [Rust][DataFusion] Improve Examples in documentation ARROW-11972 - [C++][Dataset] Extract IpcFragmentScanOptions, ParquetFragmentScanOptions ARROW-11973 - [Rust] Boolean AND/OR kernels should follow sql behaviour regarding null values ARROW-11977 - [Rust] Add documentation examples for sort kernel ARROW-11982 - [Rust] Donate Ballista Distributed Compute Platform ARROW-11984 - [C++][Gandiva] Implement SHA1 and SHA256 functions ARROW-11987 - [C++][Gandiva] Implement trigonometric functions on Gandiva ARROW-11988 - [C++][Gandiva] Implements the last_day function ARROW-11992 - [Rust][Parquet] Add upgrade notes on 4.0 rename of LogicalType #9731 ARROW-11993 - [C++] Don’t download xsimd if ARROW_SIMD_LEVEL=NONE ARROW-11996 - [R] Make r/configure run successfully on Solaris ARROW-11999 - [Java] Support parallel 
vector element search with user-specified comparator ARROW-12000 - [Documentation] Add note about deviation from style guide on struct/classes ARROW-12005 - [R] Fix a bash typo in configure ARROW-12017 - [R] [Documentation] Make proper developing arrow docs ARROW-12019 - [Rust] [Parquet] Update README for 2.6.0 support ARROW-12020 - [Rust][DataFusion] Adding SHOW TABLES and SHOW COLUMNS + partial information_schema support to DataFusion ARROW-12031 - [C++][CSV] infer CSV timestamps columns with fractional seconds ARROW-12032 - [Rust] Optimize comparison kernels using trusted_len iterator for bools ARROW-12034 - [Docs] Formalize Minor PRs ARROW-12037 - [Rust] [DataFusion] Support catalogs and schemas for table namespacing ARROW-12038 - [Rust][DataFusion] Upgrade hashbrown to 0.11 ARROW-12039 - [CI][C++][Gandiva] Fix gandiva nightly linux build failure ARROW-12040 - [R] [CI] [C++] test-r-rstudio-r-base-3.6-opensuse15 timing out during tests ARROW-12043 - [Rust] [Parquet] Write fixed size binary arrays ARROW-12045 - First Chunk of ported Parquet Code ARROW-12047 - [Rust] Clippy parquet ARROW-12048 - [Rust][DataFusion] Support Common Table Expressions ARROW-12052 - [Rust] Implement child data in C FFI ARROW-12056 - [C++] Create sequencing AsyncGenerator ARROW-12058 - [Python] Enable arithmetic operations on Expressions ARROW-12068 - [Python] Stop using distutils ARROW-12069 - [C++][Gandiva]Implement IN expressions for Decimal types ARROW-12070 - [GLib] Drop support for GNU Autotools ARROW-12071 - [GLib] Keep input stream reference of GArrowJSONReader ARROW-12075 - [Rust][DataFusion] Add CTE to list of supported features ARROW-12081 - [R] Bindings for utf8_length ARROW-12082 - [R][Dataset] Allow create dataset from vector of file paths ARROW-12094 - [C++][R] Fix/workaround re2 building on clang/libc++ ARROW-12097 - [C++] Modify BackgroundGenerator so it creates fewer threads ARROW-12098 - [R] Catch cpp build failures on linux ARROW-12104 - Next Chunk of ported Code 
ARROW-12106 - [Rust][DataFusion] Support `SELECT * from information_schema.tables` ARROW-12107 - [Rust][DataFusion] Support `SELECT * from information_schema.columns` ARROW-12108 - [Rust][DataFusion] Support `SHOW TABLES` ARROW-12109 - [Rust][DataFusion] Support `SHOW COLUMNS` ARROW-12110 - [Java] Implement ZSTD buffer compression for java ARROW-12111 - [Java] place files generated by flatc under source control ARROW-12116 - [Rust] Fix or ignore 1.51 clippy lints ARROW-12119 - [Rust][DataFusion] Improve performance of to_array_of_size ARROW-12120 - [Rust] Generate random arrays and batches ARROW-12121 - [Rust] [Parquet] Arrow writer benchmarks ARROW-12123 - [Rust][DataFusion] Use smallvec for indices for better join performance ARROW-12128 - [CI][Crossbow] Remove (or fix) test-ubuntu-16.04-cpp job ARROW-12131 - [CI][GLib] Ensure upgrading MSYS2 ARROW-12133 - [C++][Gandiva] Add option to disable setting mcpu flag to host cpu during llvm ir compilation ARROW-12134 - [C++] Add regex string match kernel ARROW-12136 - [Rust][DataFusion] Reduce default batch_size to 8192 ARROW-12139 - [Python][Packaging] Use vcpkg to build macOS wheels ARROW-12141 - [R] Bindings for grepl ARROW-12143 - [CI] R builds should timeout and fail after some threshold and dump the output. ARROW-12146 - [C++][Gandiva] Implement CONVERT_FROM(expression, ‘UTF8’, replacement char) function ARROW-12151 - [Docs] Add Jira component + summary conventions to the docs ARROW-12153 - [Rust] [Parquet] Return file metadata after writing Parquet file ARROW-12160 - [Rust] Add an `into_inner()` method to ipc::writer::StreamWriter ARROW-12164 - [Java] Make BaseAllocator.Config public ARROW-12165 - [Rust] Inline append functions in builders for performance ARROW-12168 - [Go][IPC] Implement Compression handling for IPC ARROW-12170 - [Rust][DataFusion] Introduce repartition optimization ARROW-12173 - [GLib] Remove #include &lt;config.h&gt; ARROW-12176 - parquet/low-level-api/reader-writer.cc has some typos. 
ARROW-12187 - [C++][FlightRPC] Enable compression in Flight benchmark ARROW-12188 - [Docs] Switch to pydata-sphinx-theme for the main sphinx docs ARROW-12190 - [Rust][DataFusion] Implement partitioned hash join ARROW-12192 - [Website] Use downloadable URL for archive download ARROW-12193 - [Dev][Release] Use downloadable URL for archive download ARROW-12194 - [Rust] [Parquet] Update zstd version ARROW-12197 - [R] dplyr bindings for cast, dictionary_encode ARROW-12200 - [R] Export and document list_compute_functions ARROW-12204 - [Rust][CI] Reduce size of rust build artifacts in integration test ARROW-12206 - [Python] Fix Table docstrings ARROW-12208 - [C++] Add the ability to run async tasks without using the CPU thread pool ARROW-12210 - [Rust][DataFusion] Document SHOW TABLES / SHOW COLUMNS / InformationSchema ARROW-12214 - [Rust][DataFusion] Add some tests for limit ARROW-12215 - [C++] fixed size binary columns cannot be null in CSV reader ARROW-12217 - [C++] Cleanup cpp examples source file names ARROW-12222 - [Dev][Packaging] Include build url in the crossbow console report ARROW-12224 - [Rust] Use stable rust for no default test, clean up CI tests ARROW-12228 - [CI] Create base image for conda environments ARROW-12236 - [R][CI] Add check that all docs pages are listed in _pkgdown.yml ARROW-12237 - [Packaging][Debian] Add support for bulleye ARROW-12238 - [JS] Remove trailing spaces ARROW-12239 - [JS] Switch to yarn ARROW-12242 - [Python][Doc] Tweak nightly build instructions ARROW-12246 - [CI] Sync conda recipes with upstream feedstock ARROW-12248 - [C++] Allow static builds to change memory allocators ARROW-12249 - [R] [CI] Fix test-r-install-local nightlies ARROW-12251 - [Rust] [Ballista] Add Ballista tests to CI ARROW-12263 - [Dev][Packaging] Move Crossbow to Archery ARROW-12269 - [JS] Move to eslint ARROW-12274 - [JS] Document how to run tests without building ARROW-12277 - [Rust][DataFusion] Min/Max are not supported for timestamp types ARROW-12278 - 
[Rust][DataFusion]Use Timestamp(Nanosecond, None) for SQL TIMESTAMP Type ARROW-12280 - [Developer] Remove @-mentions from commit messages in merge tool ARROW-12281 - [JS] Remove shx, trash, and rimraf ARROW-12283 - [R] Bindings for basic type convert functions in dplyr verbs ARROW-12286 - [C++] Create AsyncGenerator from Future&lt;AsyncGenerator&lt;T&gt;&gt; ARROW-12287 - [C++] Create enumerating generator ARROW-12288 - [C++] Create Scanner interface ARROW-12289 - [C++] Create basic AsyncScanner implementation ARROW-12303 - [JS] Use iterators instead of generators in critical code paths ARROW-12304 - [R] Update news and polish docs for 4.0 ARROW-12305 - [JS] Benchmark test data generate.py assumes python 2 ARROW-12309 - [JS] Make es2015 bundles the default ARROW-12316 - [C++] Switch default memory allocator from jemalloc to mimalloc on macOS ARROW-12317 - [Rust] JSON writer does not support time, date or interval types ARROW-12320 - [CI] REPO arg missing from conda-cpp-valgrind ARROW-12323 - [C++][Gandiva] Implement castTIME(timestamp) function ARROW-12325 - [C++] [CI] Nightly gandiva build failing due to failure of compiler to move return value ARROW-12326 - [C++] Avoid needless c-ares detection ARROW-12328 - [Rust] [Ballista] Fix code formatting ARROW-12329 - [Rust] [Ballista] Add README ARROW-12332 - [Rust] [Ballista] Api server for scheduler ARROW-12333 - [JS] Remove jest-environment-node-debug and do not emit from typescript by default ARROW-12335 - [Rust] [Ballista] Bump DataFusion version ARROW-12337 - add DoubleEndedIterator and ExactSizeIterator traits ARROW-12351 - [CI][Ruby] Use ruby/setup-ruby instead of actions/setup-ruby ARROW-12352 - [CI][R][Windows] Remove needless workaround for MSYS2 ARROW-12353 - [Packaging][deb] Rename -archive-keyring to -apt-source ARROW-12354 - [Packaging][RPM] Use apache.jfrog.io/artifactory/ instead of apache.bintray.com/ ARROW-12356 - [Website] Update install page instructions to point to artifactory ARROW-12361 - [Rust] 
[DataFusion] Allow users to override physical optimization rules ARROW-12367 - [C++] Stop producing when PushGenerator was destroyed ARROW-12370 - [R] Bindings for power kernel ARROW-12374 - [CI][C++][cron] Use Ubuntu 20.04 instead of 16.04 ARROW-12375 - [Release] Remove rebase post-release scripts ARROW-12376 - [Dev] archery trigger-bot should use logger.exception ARROW-12380 - [Rust][Ballista] Add scheduler ui ARROW-12381 - [Packaging][Python] macOS wheels are built with wrong package kind ARROW-12383 - [JS] Update direct deps ARROW-12384 - [JS] Improve code style ARROW-12389 - [R] [Docs] Add note about autocasting ARROW-12395 - [C++]: Create RunInSerialExecutor benchmark ARROW-12396 - [Python][Docs] Clarify serialization docstrings about deprecated status ARROW-12397 - [Rust] [DataFusion] Simplify readme example #10038 ARROW-12398 - [Rust] Remove double bound checks in iterators ARROW-12400 - [Rust] Re-enable transform module tests ARROW-12402 - [Rust] [DataFusion] Implement SQL metrics framework ARROW-12406 - [R] fix checkbashims violation in configure ARROW-12409 - [R] Remove LazyData from DESCRIPTION ARROW-12419 - [Java] flatc is not used in mvn ARROW-12420 - [C++/Dataset] Reading null columns as dictionary not longer possible ARROW-12423 - [Docs] Codecov badge in main Readme only applies to Rust ARROW-12425 - [Rust] new_null_array doesn’t allocate keys buffer for dictionary arrays ARROW-12432 - [Rust] [DataFusion] Add metrics for SortExec ARROW-12436 - [Rust][Ballista] Add watch capabilities to config backend trait ARROW-12467 - [C++][Gandiva] Add support for LLVM12 ARROW-12477 - [Release] Download linux aarch64 miniforge in verify-release-candidate.sh ARROW-12485 - [C++] Use mimalloc as the default memory allocator on macOS ARROW-12488 - [GLib] Use g_memdup2() with GLib 2.68 or later ARROW-12494 - [C++] ORC adapter fails to compile on GCC 4.8 PARQUET-1846 - [C++] Remove deprecated IO classes and related functions PARQUET-1899 - [C++] Deprecated 
ReadBatchSpaced in parquet/column_reader PARQUET-1990 - [C++] ConvertedType::NA is written out in some cases PARQUET-1993 - [C++] Expose when prefetching completes PARQUET-1998 - [C++] Implement LZ4_RAW compression Bug Fixes ARROW-4784 - [C++][CI] Re-enable flaky mingw tests. ARROW-6818 - [Doc] Format docs confusing ARROW-7288 - [C++][R] read_parquet() freezes on Windows with Japanese locale ARROW-7830 - [C++] Parquet library version doesn’t change with releases ARROW-9451 - [Python] Unsigned integer types will accept string values in pyarrow.array ARROW-9634 - [C++][Python] Restore non-UTC time zones when reading Parquet file that was previously Arrow ARROW-9878 - [Python] table to_pandas self_destruct=True + split_blocks=True cannot prevent doubling memory ARROW-10038 - [C++] SetCpuThreadPoolCapacity(1) spins up nCPUs threads ARROW-10056 - [C++] Increase flatbuffers max_tables parameter in order to read wide tables ARROW-10364 - [Dev][Archery] Test is failed with semver 2.13.0 ARROW-10370 - [Python] Spurious s3fs-related test failures ARROW-10403 - [C++] Implement unique kernel for dictionary type ARROW-10405 - [C++] IsIn kernel should be able to lookup dictionary in string ARROW-10457 - [CI] Fix Spark branch-3.0 integration tests ARROW-10489 - [C++] Unable to configure or make with intel compiler ARROW-10514 - [C++][Parquet] Data inconsistency in parquet-reader output modes ARROW-10953 - [R] Validate when creating Table with schema ARROW-11066 - [Java] Is there a bug in flight AddWritableBuffer ARROW-11134 - [C++][CI] ARM64 job on Travis-CI doesn’t run tests ARROW-11147 - [Python][CI] Parquet tests failing in nightly build with Dask master ARROW-11180 - [Developer] cmake-format pre-commit hook doesn’t run
ARROW-11192 - [Documentation] Describe opening Visual Studio so it inherits a working env ARROW-11223 - [Java] BaseVariableWidthVector/BaseLargeVariableWidthVector setNull and getBufferSizeFor is buggy ARROW-11235 - [Python] S3 test failures inside non-default regions ARROW-11239 - [Rust] array::transform::tests::test_struct failed ARROW-11269 - [Rust] Unable to read Parquet file because of mismatch in column-derived and embedded schemas ARROW-11277 - [C++] Fix compilation error in dataset expressions on macOS 10.11 ARROW-11299 - [Python] build warning in python ARROW-11303 - [Release][C++] Enable mimalloc in the windows verification script ARROW-11305 - [Rust]: parquet-rowcount binary tries to open itself as a parquet file ARROW-11311 - [Rust] unset_bit is toggling bits, not unsetting them ARROW-11313 - [Rust] Size hint of iterators is incorrect ARROW-11315 - [Packaging][APT][arm64] Add missing gir1.2 files ARROW-11320 - [C++] Spurious test failure when creating temporary dir ARROW-11322 - [Rust] Arrow `memory` made private is a breaking API change ARROW-11323 - [Rust][DataFusion] ComputeError(“concat requires input of at least one array”)) with queries with ORDER BY or GROUP BY that return no ARROW-11328 - [R] Collecting zero columns from a dataset returns entire dataset ARROW-11334 - [Python][CI] Nightly pandas builds failing because of internal pandas change ARROW-11337 - [C++] Compilation error with ThreadSanitizer ARROW-11357 - [Rust] take primitive implementation is unsound ARROW-11376 - [C++] ThreadedTaskGroup failure with Thread Sanitizer enabled ARROW-11379 - [C++][Dataset] Reading dataset with filtering on timestamp partition field crashes ARROW-11387 - [Rust] Arrow 3.0.0 release with simd feature doesn’t compile without feature=avx512. 
ARROW-11391 - [C++] HdfsOutputStream::Write unsafely truncates integers exceeding INT32_MAX ARROW-11394 - [Rust] Slice + Concat incorrect for structs ARROW-11400 - [Python] Pickled ParquetFileFragment has invalid partition_expresion with dictionary type in pyarrow 2.0 ARROW-11403 - [Developer] archery benchmark list: unexpected keyword ‘benchmark_filter’ ARROW-11412 - [Python] Expressions not working with logical boolean operators (and, or, not) ARROW-11427 - [C++] Arrow uses AVX512 instructions even when not supported by the OS ARROW-11448 - [C++] tdigest build failure on Windows with Visual Studio ARROW-11451 - [C++] Fix gcc-4.8 build error ARROW-11452 - [Rust] Parquet reader cannot read file where a struct column has the same name as struct member columns ARROW-11461 - [Flight][Go] GetSchema does not work with Java Flight Server ARROW-11464 - [Python] pyarrow.parquet.read_pandas doesn’t conform to its docs ARROW-11470 - [C++] Overflow occurs on integer multiplications in ComputeRowMajorStrides, ComputeColumnMajorStrides, and CheckTensorStridesValidity ARROW-11472 - [Python][CI] Kartothek integrations build is failing with numpy 1.20 ARROW-11480 - [Python] Segmentation fault reading parquet with date filter with INT96 column ARROW-11483 - [Java][C++][Integration] C++ integration test creates JSON files incompatible with Java ARROW-11488 - [Rust]: StructBuilder’s Drop impl leaks memory ARROW-11490 - [C++] BM_ArrowBinaryDict/EncodeLowLevel is not deterministic ARROW-11494 - [Rust] Fix take bench ARROW-11497 - [Python] pyarrow parquet writer for list does not conform with Apache Parquet specification ARROW-11538 - [Python] Segfault reading Parquet dataset with Timestamp filter ARROW-11547 - [Packaging][Conda][Drone] Nightly builds are failed by undefined variable error ARROW-11548 - [C++]
RandomArrayGenerator::List size mismatch ARROW-11551 - [C++][Gandiva] castTIMESTAMP(utf8) function doesn’t show error out for invalid inputs ARROW-11560 - [FlightRPC][C++][Python] Interrupting a Flight server results in abort ARROW-11567 - [C++][Compute] Variance kernel has precision issue ARROW-11577 - [Rust] Concat kernel panics on slices of string arrays ARROW-11582 - [R] write_dataset “format” argument default and validation could be better ARROW-11586 - [Rust] [Datafusion] Invalid SQL sometimes panics ARROW-11595 - [C++][NIGHTLY:test-conda-cpp-valgrind] GenerateBitsUnrolled triggers valgrind on uninit inputs ARROW-11596 - [Python][Dataset] SIGSEGV when executing scan tasks with Python executors ARROW-11603 - [Rust] Fix clippy error ARROW-11607 - [Python] Error when reading table with list values from parquet ARROW-11614 - [C++][Gandiva] Fix round() logic to return positive zero when argument is zero ARROW-11617 - [C++][Gandiva] Fix nested if-else optimisation in gandiva ARROW-11620 - [Rust] [DataFusion] Inconsistent use of Box and Arc for TableProvider ARROW-11630 - [Rust] Introduce partial_sort and limit option for sort kernel ARROW-11632 - [Rust] csv::Reader doesn’t propagate schema metadata to RecordBatches ARROW-11639 - [C++][Gandiva] Fix signbit compilation issue in Ubuntu nightly build ARROW-11642 - [C++] Incorrect preprocessor directive for Windows in JVM detection ARROW-11657 - [R] group_by with .drop specified errors ARROW-11658 - [R] Handle mutate/rename inside group_by ARROW-11663 - [DataFusion] Master does not compile ARROW-11668 - [C++] Sporadic UBSAN error in FutureStessTest.TryAddCallback ARROW-11672 - [R] Fix string function test failure on R 3.3 ARROW-11681 - [Rust] IPC writers shouldn’t unwrap in destructors ARROW-11686 - [C++]flight-test-integration-client sometimes exits by SIGABRT but does not print the stack trace ARROW-11687 - [Rust][DataFusion] RepartitionExec Hanging ARROW-11694 - [C++] Array Take may dereference absent null bitmap 
ARROW-11695 - [C++][FlightRPC][Packaging] Update support for disabling TLS server verification for recent gRPC versions ARROW-11717 - [Integration] Intermittent (but frequent) flight integration failures with auth:basic_proto ARROW-11718 - [Rust] IPC writers shouldn’t implicitly finish on drop ARROW-11741 - [C++] Decimal cast failure on big-endian ARROW-11743 - [R] Use pkgdown’s new found ability to autolink Jiras ARROW-11746 - [Developer][Archery] Fix prefer real time check ARROW-11756 - [R] passing a partition as a schema leads to segfaults ARROW-11758 - [C++][Compute] Summation kernel round-off error ARROW-11767 - [C++] Scalar::hash may segfault for null scalars ARROW-11771 - [Developer][Archery] Move benchmark tests (so CI runs them) ARROW-11784 - [Rust][DataFusion] CoalesceBatchesStream doesn’t honor Stream interface ARROW-11785 - [R] Fallback when filtering Table with unsupported expression fails ARROW-11786 - [C++] CMake output noisy ARROW-11788 - [Java] Appending Empty List Vector yields NPE ARROW-11791 - [Rust][DataFusion] RepartitionExec Blocking ARROW-11802 - [Rust][DataFusion] Mixing of crossbeam channel and async tasks can lead to deadlock ARROW-11819 - [Rust] Add link to the doc ARROW-11821 - [Rust] Edit Rust README ARROW-11830 - [C++] gRPC compilation tests occur every time ARROW-11832 - [R] Handle conversion of extra nested struct column ARROW-11836 - Target libarrow_bundled_dependencies.a is not alreay created but is already required. 
ARROW-11845 - [Rust] Debug implementation of Date32Array panics if array contains negative values ARROW-11850 - [GLib] GARROW_VERSION_0_16 macro is missing ARROW-11855 - [C++] [Python] Memory leak in to_pandas when converting chunked struct array ARROW-11857 - [Python] Resource temporarily unavailable when using the new Dataset API with Pandas ARROW-11860 - [Rust] [DataFusion] Add DataFusion logos ARROW-11866 - [C++] Arrow Flight SetShutdownOnSignals cause potential mutex deadlock in gRPC ARROW-11872 - [C++] Array Validation of GPU buffers fails due to incorrect validation check ARROW-11880 - [R] Handle empty or NULL transmute() args properly ARROW-11881 - [Rust][DataFusion] Fix Clippy Lint ARROW-11896 - [Rust] Hang / failure in CI on AMD64 Debian 10 Rust stable test workspace ARROW-11904 - [C++] “pure virtual method called” crash at the end of arrow-csv-test ARROW-11905 - [C++] SIMD info always returning none on MacOS ARROW-11914 - [R] [CI] r-sanitizer nightly is broken ARROW-11918 - [R] [Documentation] Docs cleanups ARROW-11923 - [CI] Update branch name for dask dev integration tests ARROW-11937 - [C++] GZip codec hangs if flushed twice ARROW-11941 - [Dev] “DEBUG=1 merge_arrow_pr.py” updates Jira issue ARROW-11942 - [C++] If tasks are submitted quickly the thread pool may fail to spin up new threads ARROW-11945 - [R] filter doesn’t accept negative numbers as valid ARROW-11956 - [C++] Fix system re2 dependency detection for static library ARROW-11965 - [R][Docs] Fix install.packages command in R dev docs ARROW-11970 - [C++][CI] Fix Valgrind failures ARROW-11971 - [Packaging] Vcpkg patch doesn’t apply on windows due to line endings ARROW-11975 - [CI][GLib] Failed to update gcc ARROW-11976 - [C++] Sporadic TSAN error in TestThreadPool.SetCapacity ARROW-11983 - [Python] ImportError calling pyarrow from_pandas within ThreadPool ARROW-11997 - [Python] concat_tables crashes python interpreter ARROW-12003 - [R] Fix NOTE re undefined global function group_by_drop_default 
ARROW-12006 - [Java] Fix checkstyle config to work on Windows ARROW-12012 - [Java] [JDBC] BinaryConsumer cannot reallocate memory correctly ARROW-12013 - [C++][FlightRPC] Failed to detect gRPC version ARROW-12015 - [Rust] [DataFusion] Integrate doc-comment crate to ensure readme examples remain valid ARROW-12028 - [Rust][DataFusion] Unsupported GROUP BY for Timestamp(Millisecond, None) ARROW-12029 - Remove args from FeatherReader$create v2 ARROW-12033 - [Docs] Fix link in developers/benchmarks.html ARROW-12041 - [C++] Fix string description of tensor IPC messages ARROW-12051 - [GLib] Intermittent CI failure in test_add_column_type(TestCSVReader::#read::options) ARROW-12057 - [Python] Remove direct usage of pandas’ Block subclasses ARROW-12065 - [C++][Python] Segfault reading JSON file ARROW-12067 - [Python][Doc] Document pyarrow_(un)wrap_scalar ARROW-12073 - [R] Fix R CMD check NOTE about ‘X_____X’ ARROW-12076 - [Rust] Fix build ARROW-12077 - [C++] Out-of-bounds write in ListArray::FromArrays ARROW-12086 - [C++] offline builds does not use ARROW_$LIBRARY_URL to search for packages ARROW-12088 - [Python][C++] Warning about offsetof in pyarrow.dataset.RecordBatchIterator ARROW-12089 - [Doc] Fix warnings when building Sphinx docs ARROW-12100 - [C#] Cannot round-trip record batch with PyArrow ARROW-12103 - [C++] “load of misaligned address” in Parquet reader ARROW-12112 - [CI] No space left on device - AMD64 Conda Integration test ARROW-12113 - [R] Fix rlang deprecation warning from check_select_helpers() ARROW-12130 - [C++] Arm64 buid failed if -DARROW_SIMD_LEVEL=NONE ARROW-12138 - [Go][IPC] ARROW-12140 - [C++][CI] Valgrind failure on Grouper tests ARROW-12145 - [Developer][Archery] Flaky test: test_static_runner_from_json ARROW-12149 - [Dev] Archery benchmark test case is failing ARROW-12154 - [C++][Gandiva] Fix gandiva crash in certain OS/CPU combinations ARROW-12155 - [R] Require Table
columns to be same length ARROW-12161 - [C++][R] Async streaming CSV reader deadlocking when being run synchronously from datasets ARROW-12169 - [C++] Fix compressed file reading with an empty stream at end of file ARROW-12171 - [Rust] Clippy error ARROW-12172 - [Python][Packaging] Pass python version as setuptools pretend version in the macOS wheel builds ARROW-12178 - [CI] Update setuptools in the ubuntu images ARROW-12186 - [Rust][DataFusion] Fix regexp_match test ARROW-12209 - [JS] @apache-arrow/ts nor apache-arrow does not compile ARROW-12220 - [C++][CI] Thread sanitizer failure ARROW-12226 - [C++] ASAN error in s3fs_test.cc ARROW-12227 - [R] Fix RE2 and median nightly build failures ARROW-12235 - [Rust][DataFusion] LIMIT returns incorrect results when used with several small partitions ARROW-12241 - [Python] Parallel csv reader cancellation test kills pytest ARROW-12250 - [Rust] Failing test arrow::arrow_writer::tests::fixed_size_binary_single_column ARROW-12254 - [Rust][DataFusion] Limit keeps polling input after limit is reached ARROW-12258 - [R] Never do as.data.frame() on collect(as_data_frame = FALSE) ARROW-12262 - [Doc][C++][Python] Docs built and pushed with S3 and Flight disabled ARROW-12267 - [Rust] JSON writer does not support timestamp types ARROW-12273 - [JS] Coveralls does not work anymore ARROW-12279 - [Rust][DataFusion] Add test for null handling in hash join (ARROW-12266) ARROW-12294 - [Rust] Fix Boolean Kleene Kernels with no Remainder ARROW-12299 - [Python] pq.write_to_dataset does not recognize S3FileSystem ARROW-12300 - [C++] ArrowCUDA erroneously links to CUDA Runtime while only using CUDA Driver API ARROW-12313 - [Rust] [Ballista] Benchmark documentation out of date ARROW-12314 - [Python] pq.read_pandas with use_legacy_dataset=False does not accept columns as a set (kartothek integration failure) ARROW-12327 - [Dev] Use pull request’s
head remote when submitting crossbow jobs via the comment bot ARROW-12330 - [Developer] Restore values in counters column of Archery benchmark ARROW-12334 - [Rust] [Ballista] Aggregate queries producing incorrect results ARROW-12342 - [Packaging] Fix tabulation in crossbow templates for submitting nightly builds ARROW-12357 - [Archery] Error running “crossbow submit …” ARROW-12377 - [Doc][Java] Java doc build broken ARROW-12379 - [C++][CI] Thread sanitizer failure in SerialExecutor ARROW-12382 - [C++][CI] Conda nightly jobs fail due to not bundling xsimd ARROW-12385 - [R] [CI] fix cran picking in CI ARROW-12390 - [Rust] Inline Inline from_trusted_len_iter, try_from_trusted_len_iter, extend_from_slice ARROW-12401 - [R] Fix guard around dataset___Scanner__TakeRows ARROW-12405 - [Packaging] Fix apt artifact patterns and artifact uploading from travis ARROW-12408 - [R] Delete Scan() bindings ARROW-12421 - [Rust] [DataFusion] topk_query test fails in master ARROW-12429 - [C++] MergedGeneratorTestFixture is incorrectly instantiated ARROW-12433 - [Rust] Builds failing due to new flatbuffer release introducing const generics ARROW-12437 - [Rust] [Ballista] Ballista plans must not include RepartitionExec ARROW-12440 - [Release] Various packaging, release script and release verification script fixes ARROW-12466 - [Python] Comparing array to None raises error ARROW-12475 - [C++] Build warning from thread_pool_benchmark.cc ARROW-12487 - [C++][Dataset] ScanBatches() hangs if there’s an error during scanning ARROW-12495 - [C++][Python] NumPy buffer sets is_mutable_ to true but does not set mutable_data_ when the NumPy array is writable PARQUET-1655 - [C++] Decimal comparisons used for min/max statistics are not correct PARQUET-2008 - [C++] Wrong information written in RowGroup::total_byte_size","headline":"Apache Arrow 4.0.0 
Release","image":"https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png","mainEntityOfPage":{"@type":"WebPage","@id":"https://arrow.apache.org/release/4.0.0.html"},"publisher":{"@type":"Organization","logo":{"@type":"ImageObject","url":"https://arrow.apache.org/img/logo.png"}},"url":"https://arrow.apache.org/release/4.0.0.html"}</script>
<!-- End Jekyll SEO tag -->
<!-- favicons -->
<link rel="icon" type="image/png" sizes="16x16" href="/img/favicon-16x16.png" id="light1">
<link rel="icon" type="image/png" sizes="32x32" href="/img/favicon-32x32.png" id="light2">
<link rel="apple-touch-icon" type="image/png" sizes="180x180" href="/img/apple-touch-icon.png" id="light3">
<link rel="apple-touch-icon" type="image/png" sizes="120x120" href="/img/apple-touch-icon-120x120.png" id="light4">
<link rel="apple-touch-icon" type="image/png" sizes="76x76" href="/img/apple-touch-icon-76x76.png" id="light5">
<link rel="apple-touch-icon" type="image/png" sizes="60x60" href="/img/apple-touch-icon-60x60.png" id="light6">
<!-- dark mode favicons -->
<link rel="icon" type="image/png" sizes="16x16" href="/img/favicon-16x16-dark.png" id="dark1">
<link rel="icon" type="image/png" sizes="32x32" href="/img/favicon-32x32-dark.png" id="dark2">
<link rel="apple-touch-icon" type="image/png" sizes="180x180" href="/img/apple-touch-icon-dark.png" id="dark3">
<link rel="apple-touch-icon" type="image/png" sizes="120x120" href="/img/apple-touch-icon-120x120-dark.png" id="dark4">
<link rel="apple-touch-icon" type="image/png" sizes="76x76" href="/img/apple-touch-icon-76x76-dark.png" id="dark5">
<link rel="apple-touch-icon" type="image/png" sizes="60x60" href="/img/apple-touch-icon-60x60-dark.png" id="dark6">
<script>
// Switch to the dark-mode favicons if prefers-color-scheme: dark
// Look up every favicon link element once, while all of them are still
// attached: querySelector cannot find an element again after remove()
// has detached it, so repeated lookups inside onUpdate() would return null.
var lightIcons = ['light1', 'light2', 'light3', 'light4', 'light5', 'light6'].map(function(id) {
return document.querySelector('link#' + id);
});
var darkIcons = ['dark1', 'dark2', 'dark3', 'dark4', 'dark5', 'dark6'].map(function(id) {
return document.querySelector('link#' + id);
});
var matcher = window.matchMedia('(prefers-color-scheme: dark)');
function onUpdate() {
// Detach the set that does not match the current color scheme,
// then (re)attach the set that does.
var show = matcher.matches ? darkIcons : lightIcons;
var hide = matcher.matches ? lightIcons : darkIcons;
hide.forEach(function(icon) { icon.remove(); });
show.forEach(function(icon) { document.head.append(icon); });
}
// addListener is deprecated; use addEventListener where it is available
if (matcher.addEventListener) {
matcher.addEventListener('change', onUpdate);
} else {
matcher.addListener(onUpdate);
}
onUpdate();
</script>
<link rel="stylesheet" href="//fonts.googleapis.com/css?family=Lato:300,300italic,400,400italic,700,700italic,900">
<link href="/css/main.css" rel="stylesheet">
<link href="/css/syntax.css" rel="stylesheet">
<script src="/javascript/main.js"></script>
<!-- Matomo -->
<script>
var _paq = window._paq = window._paq || [];
/* tracker methods like "setCustomDimension" should be called before "trackPageView" */
/* We explicitly disable cookie tracking to avoid privacy issues */
_paq.push(['disableCookies']);
_paq.push(['trackPageView']);
_paq.push(['enableLinkTracking']);
(function() {
var u="https://analytics.apache.org/";
_paq.push(['setTrackerUrl', u+'matomo.php']);
_paq.push(['setSiteId', '20']);
var d=document, g=d.createElement('script'), s=d.getElementsByTagName('script')[0];
g.async=true; g.src=u+'matomo.js'; s.parentNode.insertBefore(g,s);
})();
</script>
<!-- End Matomo Code -->
</head>
<body class="wrap">
<header>
<nav class="navbar navbar-expand-md navbar-dark bg-dark">
<a class="navbar-brand no-padding" href="/"><img src="/img/arrow-inverse-300px.png" height="40px"/></a>
<button class="navbar-toggler ml-auto" type="button" data-toggle="collapse" data-target="#arrow-navbar" aria-controls="arrow-navbar" aria-expanded="false" aria-label="Toggle navigation">
<span class="navbar-toggler-icon"></span>
</button>
<!-- Collect the nav links, forms, and other content for toggling -->
<div class="collapse navbar-collapse justify-content-end" id="arrow-navbar">
<ul class="nav navbar-nav">
<li class="nav-item"><a class="nav-link" href="/overview/" role="button" aria-haspopup="true" aria-expanded="false">Overview</a></li>
<li class="nav-item"><a class="nav-link" href="/faq/" role="button" aria-haspopup="true" aria-expanded="false">FAQ</a></li>
<li class="nav-item"><a class="nav-link" href="/blog" role="button" aria-haspopup="true" aria-expanded="false">Blog</a></li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownGetArrow" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Get Arrow
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownGetArrow">
<a class="dropdown-item" href="/install/">Install</a>
<a class="dropdown-item" href="/release/">Releases</a>
<a class="dropdown-item" href="https://github.com/apache/arrow">Source Code</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownDocumentation" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Documentation
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownDocumentation">
<a class="dropdown-item" href="/docs">Project Docs</a>
<a class="dropdown-item" href="/docs/format/Columnar.html">Format</a>
<hr/>
<a class="dropdown-item" href="/docs/c_glib">C GLib</a>
<a class="dropdown-item" href="/docs/cpp">C++</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/csharp/README.md">C#</a>
<a class="dropdown-item" href="https://godoc.org/github.com/apache/arrow/go/arrow">Go</a>
<a class="dropdown-item" href="/docs/java">Java</a>
<a class="dropdown-item" href="/docs/js">JavaScript</a>
<a class="dropdown-item" href="/julia/">Julia</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/matlab/README.md">MATLAB</a>
<a class="dropdown-item" href="/docs/python">Python</a>
<a class="dropdown-item" href="/docs/r">R</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/ruby/README.md">Ruby</a>
<a class="dropdown-item" href="https://docs.rs/arrow/latest">Rust</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownSubprojects" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Subprojects
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownSubprojects">
<a class="dropdown-item" href="/adbc">ADBC</a>
<a class="dropdown-item" href="/docs/format/Flight.html">Arrow Flight</a>
<a class="dropdown-item" href="/docs/format/FlightSql.html">Arrow Flight SQL</a>
<a class="dropdown-item" href="https://datafusion.apache.org">DataFusion</a>
<a class="dropdown-item" href="/nanoarrow">nanoarrow</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownCommunity" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Community
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownCommunity">
<a class="dropdown-item" href="/community/">Communication</a>
<a class="dropdown-item" href="/docs/developers/index.html">Contributing</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/issues">Issue Tracker</a>
<a class="dropdown-item" href="/committers/">Governance</a>
<a class="dropdown-item" href="/use_cases/">Use Cases</a>
<a class="dropdown-item" href="/powered_by/">Powered By</a>
<a class="dropdown-item" href="/visual_identity/">Visual Identity</a>
<a class="dropdown-item" href="/security/">Security</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/policies/conduct.html">Code of Conduct</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownASF" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
ASF Links
</a>
<div class="dropdown-menu dropdown-menu-right" aria-labelledby="navbarDropdownASF">
<a class="dropdown-item" href="https://www.apache.org/">ASF Website</a>
<a class="dropdown-item" href="https://www.apache.org/licenses/">License</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/sponsorship.html">Donate</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/thanks.html">Thanks</a>
<a class="dropdown-item" href="https://www.apache.org/security/">Security</a>
</div>
</li>
</ul>
</div><!-- /.navbar-collapse -->
</nav>
</header>
<div class="container p-4 pt-5">
<main role="main" class="pb-5">
<h1 id="apache-arrow-400-26-april-2021">Apache Arrow 4.0.0 (26 April 2021)</h1>
<p>This is a major release covering more than 3 months of development.</p>
<h2 id="download">Download</h2>
<ul>
<li><a href="https://www.apache.org/dyn/closer.lua/arrow/arrow-4.0.0/"><strong>Source Artifacts</strong></a></li>
<li><strong>Binary Artifacts</strong>
<ul>
<li><a href="https://apache.jfrog.io/artifactory/arrow/centos/">For CentOS</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/debian/">For Debian</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/python/4.0.0/">For Python</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/ubuntu/">For Ubuntu</a></li>
</ul>
</li>
<li><a href="https://github.com/apache/arrow/releases/tag/apache-arrow-4.0.0">Git tag</a></li>
</ul>
<h2 id="contributors">Contributors</h2>
<p>This release includes 719 commits from 114 distinct contributors.</p>
<div class="language-console highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="gp">$</span><span class="w"> </span>git shortlog <span class="nt">-sn</span> apache-arrow-3.0.0..apache-arrow-4.0.0
<span class="go"> 65 Antoine Pitrou
47 Andrew Lamb
41 Heres, Daniel
40 David Li
37 Sutou Kouhei
33 Neal Richardson
30 Weston Pace
28 Jorge C. Leitao
26 Krisztián Szűcs
25 Ian Cook
21 Dominik Moritz
20 Andy Grove
19 Yibo Cai
18 Joris Van den Bossche
17 Neville Dipale
17 Jonathan Keane
17 Ritchie Vink
12 Mike Seddon
12 Benjamin Kietzman
11 Mauricio Vargas
10 Qingping Hou
10 Diana Clarke
8 Micah Kornfield
7 Matthew Topol
7 Dmitry Patsura
5 Projjal Chanda
5 Kenta Murata
4 Anthony Louis
4 Ximo Guanter
4 liyafan82
3 Andre Braga Reis
3 Kazuaki Ishizaki
3 Maarten A. Breddels
3 Uwe L. Korn
3 ptaylor
3 Steven Fackler
3 Sagnik Chakraborty
3 Nic Crane
2 Marc Prud'hommeaux
2 Raphael Taylor-Davies
2 João Pedro
2 Yordan Pavlov
2 emkornfield
2 Max Burke
2 Florian Müller
2 Ben Chambers
2 mqy
2 Christoph Schulze
2 Manoj Karthick
2 Sathis Kumar
2 Ryan Jennings
2 Ruan Pearce-Authers
2 Tao He
2 Eric Burden
2 Tyrel Rink
2 Romain Francois
2 Rok
1 witchard
1 Adam Lippai
1 Albert Villanova del Moral
1 Alessandro Molina
1 Ali
1 Andrew Wieteska
1 Bob Tinsman
1 Brian Hulette
1 Bryan Cutler
1 Clcanny
1 Daniel Russo
1 Daniël Heres
1 Eduardo Ponce
1 Evan Chan
1 FawnD2
1 Felix Zhu
1 Fernando Herrera
1 Fiona La
1 François Saint-Jacques
1 GALI PREM SAGAR
1 Gert Hulselmans
1 Ha Thi Tham
1 Hongze Zhang
1 Ilya Biryukov
1 Ivan Smirnov
1 James Winegar
1 Joe Roberts
1 Johannes Müller
1 Jörn Horstmann
1 Mahmut Bulut
1 Marco Gorelli
1 Marko Mikulicic
1 Markus Silberstein Hont
1 Martin Nowak
1 Matt Brubeck
1 Matt Summersgill
1 Max Meldrum
1 Nathaniel Bauernfeind
1 Nga Tran
1 Nick Bruno
1 Rok Mihevc
1 Roman Karlstetter
1 Sam Albers
1 Simon Bertron
1 Szangin
1 Truc Lam Nguyen
1 Weichen Xu
1 Ying Zhou
1 frank400
1 ivan
1 jpeeter
1 martinblostein
1 nmcdonnell-kx
1 pierwill
1 sjgupta2
1 sundy-li
1 ARF1
</span></code></pre></div></div>
<h2 id="patch-committers">Patch Committers</h2>
<p>The following Apache committers merged contributed patches to the repository.</p>
<div class="language-console highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="gp">$</span><span class="w"> </span>git shortlog <span class="nt">-csn</span> apache-arrow-3.0.0..apache-arrow-4.0.0
<span class="go"> 157 Andrew Lamb
101 Antoine Pitrou
93 Neal Richardson
88 Krisztián Szűcs
72 Sutou Kouhei
41 David Li
30 Benjamin Kietzman
25 Neville Dipale
22 Micah Kornfield
19 Jorge C. Leitao
16 Andy Grove
14 Praveen
11 Joris Van den Bossche
9 GitHub
8 Yibo Cai
4 Uwe L. Korn
3 Sebastien Binet
2 liyafan82
1 Kenta Murata
1 Eric Erhardt
1 Chao Sun
1 Bryan Cutler
</span></code></pre></div></div>
<h2 id="changelog">Changelog</h2>
<h2 id="apache-arrow-400-2021-04-26">Apache Arrow 4.0.0 (2021-04-26)</h2>
<h3 id="new-features-and-improvements">New Features and Improvements</h3>
<ul>
<li><a href="https://issues.apache.org/jira/browse/ARROW-951">ARROW-951</a> - [JS] Fix generated API documentation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-2229">ARROW-2229</a> - [C++] Write CSV files from RecordBatch, Table</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3690">ARROW-3690</a> - [Rust] Add Rust to the format integration testing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6103">ARROW-6103</a> - [Java] Stop using the maven release plugin</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6248">ARROW-6248</a> - [Python] Use FileNotFoundError in HadoopFileSystem.open() in Python 3</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6455">ARROW-6455</a> - [C++] Implement ExtensionType for non-UTF8 Unicode data</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6604">ARROW-6604</a> - [C++] Add support for nested types to MakeArrayFromScalar</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7215">ARROW-7215</a> - [C++][Gandiva] Implement castVARCHAR(numeric_type) functions in Gandiva</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7364">ARROW-7364</a> - [Rust] Add cast options to cast kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7633">ARROW-7633</a> - [C++][CI] Create fuzz targets for tensors and sparse tensors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7808">ARROW-7808</a> - [Java][Dataset] Implement Datasets Java API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7906">ARROW-7906</a> - [C++][Python] Full functionality for ORC format</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8049">ARROW-8049</a> - [C++] Upgrade bundled Thrift version to 0.13.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8282">ARROW-8282</a> - [C++/Python][Dataset] Support schema evolution for integer columns</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8284">ARROW-8284</a> - [C++][Dataset] Schema evolution for timestamp columns</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8630">ARROW-8630</a> - [C++][Dataset] Pass schema including all materialized fields to catch CSV edge cases</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8631">ARROW-8631</a> - [C++][Dataset] Add ConvertOptions and ReadOptions to CsvFileFormat</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8658">ARROW-8658</a> - [C++][Dataset] Implement subtree pruning for FileSystemDataset::GetFragments</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8732">ARROW-8732</a> - [C++] Let Futures support cancellation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8771">ARROW-8771</a> - [C++] Add boost/process library to build support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8796">ARROW-8796</a> - [Rust] Allow parquet to be written directly to memory</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8797">ARROW-8797</a> - [C++] Support Flight RPC among different endian platforms</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8900">ARROW-8900</a> - [C++] Respect HTTP(S)_PROXY for S3 Filesystems and/or expose proxy options as parameters</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8919">ARROW-8919</a> - [C++] Add “DispatchBest” APIs to compute::Function that select a kernel that may require implicit casts to invoke</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9128">ARROW-9128</a> - [C++] Implement string space trimming kernels: trim, ltrim, and rtrim</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9149">ARROW-9149</a> - [C++] Improve configurability of RandomArrayGenerator::ArrayOf</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9196">ARROW-9196</a> - [C++] Make temporal casts work on Scalar inputs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9318">ARROW-9318</a> - [C++][Parquet] Encryption key management tools</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9731">ARROW-9731</a> - [C++][Dataset] Port “head” method from R to C++ Dataset Scanner</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9749">ARROW-9749</a> - [C++][Dataset] Extract format-specific scan options from FileFormat</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9777">ARROW-9777</a> - [Rust] Implement IPC changes to catch up to 1.0.0 format</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9856">ARROW-9856</a> - [R] Add bindings for string compute functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10014">ARROW-10014</a> - [C++] TaskGroup::Finish should execute tasks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10089">ARROW-10089</a> - [R] inject base class for Array, ChunkedArray and Scalar</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10183">ARROW-10183</a> - [C++] Create a ForEach library function that runs on an iterator of futures</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10195">ARROW-10195</a> - [C++] Add string struct extract kernel using re2</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10250">ARROW-10250</a> - [FlightRPC][C++] Remove default constructor for FlightClientOptions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10255">ARROW-10255</a> - [JS] Reorganize imports and exports to be more friendly to ESM tree-shaking</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10297">ARROW-10297</a> - [Rust] Parameter for parquet-read to output data in json format</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10299">ARROW-10299</a> - [Rust] Support reading and writing V5 of IPC metadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10305">ARROW-10305</a> - [R] Filter with regular expressions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10306">ARROW-10306</a> - [C++] Add string replacement kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10349">ARROW-10349</a> - [Python] Build and publish aarch64 wheels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10354">ARROW-10354</a> - [Rust] [DataFusion] Add support for regex extract</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10360">ARROW-10360</a> - [CI] Bump github actions cache version</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10372">ARROW-10372</a> - [C++][Dataset] Read compressed CSVs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10406">ARROW-10406</a> - [C++] Unify dictionaries when writing IPC file in a single shot</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10420">ARROW-10420</a> - [C++] FileSystem::OpenInput{File,Stream} should accept a MemoryPool</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10421">ARROW-10421</a> - [R] Feather reader/writer should accept a MemoryPool</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10438">ARROW-10438</a> - [C++][Dataset] Partitioning::Format on nulls</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10520">ARROW-10520</a> - [C++][R] Implement add/remove/replace for RecordBatch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10570">ARROW-10570</a> - [R] Use Converter API to convert SEXP to Array/ChunkedArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10580">ARROW-10580</a> - [C++] When Validating, ensure DenseUnionArray offsets are increasing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10606">ARROW-10606</a> - [C++][Compute] Support casts to and from Decimal256 type.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10655">ARROW-10655</a> - [C++] Add LRU cache facility</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10734">ARROW-10734</a> - [R] Build and test on Solaris</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10735">ARROW-10735</a> - [R] Remove arrow-without-arrow wrapping</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10766">ARROW-10766</a> - [Rust] Compute nested definition and repetition for list arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10797">ARROW-10797</a> - [C++] Investigate faster random generation for tests and benchmarks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10816">ARROW-10816</a> - [Rust] [DataFusion] Implement INTERVAL</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10831">ARROW-10831</a> - [C++][Compute] Implement quantile kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10846">ARROW-10846</a> - [C++] Add async filesystem operations</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10880">ARROW-10880</a> - [Java] Support compressing RecordBatch IPC buffers by LZ4</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10882">ARROW-10882</a> - [Python][Dataset] Writing dataset from python iterator of record batches</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10895">ARROW-10895</a> - [C++][Gandiva] Implement bool to varchar cast function in Gandiva</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10903">ARROW-10903</a> - [Rust] Implement FromIter&lt;Option&lt;Vec&lt;u8&gt;&gt;&gt; constructor for FixedSizeBinaryArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11022">ARROW-11022</a> - [Rust] [DataFusion] Upgrade to tokio 1.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11070">ARROW-11070</a> - [C++] Implement power / exponentiation compute kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11074">ARROW-11074</a> - [Rust][DataFusion] Implement predicate push-down for parquet tables</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11081">ARROW-11081</a> - [Java] Make IPC option immutable</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11108">ARROW-11108</a> - [Rust] Improve performance of MutableBuffer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11141">ARROW-11141</a> - [Rust] Miri checks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11149">ARROW-11149</a> - [Rust] create_batch_empty - support List, LargeList</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11150">ARROW-11150</a> - [Rust] Set up bi-weekly Rust sync call and update website</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11154">ARROW-11154</a> - [CI][C++] Move homebrew crossbow tests off of Travis-CI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11156">ARROW-11156</a> - [Rust][DataFusion] Create hashes vectorized in hash join</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11174">ARROW-11174</a> - [C++][Dataset] Make Expressions available for projection</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11179">ARROW-11179</a> - [Format] Make comments in fb files friendly to rust doc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11183">ARROW-11183</a> - [Rust] [Parquet] LogicalType::TIMESTAMP_NANOS missing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11191">ARROW-11191</a> - [C++] Use FnOnce for TaskGroup’s tasks instead of std::function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11216">ARROW-11216</a> - [Rust] Improve documentation for StringDictionaryBuilder</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11220">ARROW-11220</a> - [Rust] DF Implement GROUP BY support for Boolean</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11222">ARROW-11222</a> - [Rust] [Arrow] catch up with flatbuffers 0.8.1</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11246">ARROW-11246</a> - DF - Add type to Unexpected accumulator state message</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11254">ARROW-11254</a> - [Rust][DataFusion] Add SIMD and snmalloc flags as options to benchmarks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11260">ARROW-11260</a> - [C++][Dataset] Don’t require dictionaries for reading dataset with schema-based Partitioning</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11265">ARROW-11265</a> - [Rust] Made bool not convertible to bytes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11268">ARROW-11268</a> - [Rust][DataFusion] Support specifying repartitions in MemTable</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11270">ARROW-11270</a> - [Rust] Use slices for simple array data buffer access</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11279">ARROW-11279</a> - [Rust][Parquet] ArrowWriter Definition Levels Memory Usage</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11284">ARROW-11284</a> - [R] Support dplyr verb transmute()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11289">ARROW-11289</a> - [Rust] [DataFusion] Support GROUP BY for Dictionary columns</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11290">ARROW-11290</a> - [Rust][DataFusion] Address hash aggregate performance with high number of groups</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11291">ARROW-11291</a> - [Rust] implement extend for MutableBuffer (from iterator)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11300">ARROW-11300</a> - [Rust][DataFusion] Improve hash aggregate performance with large number of groups in</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11308">ARROW-11308</a> - [Rust] [Parquet] Add Arrow decimal array writer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11309">ARROW-11309</a> - [Release][C#] Use .NET 3.1 for verification</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11310">ARROW-11310</a> - [Rust] Implement arrow JSON writer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11314">ARROW-11314</a> - [Release][APT][Yum] Add support for verifying arm64 packages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11317">ARROW-11317</a> - [Rust] Test the prettyprint feature in CI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11318">ARROW-11318</a> - [Rust] Support pretty printing timestamp, date, and time types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11319">ARROW-11319</a> - [Rust] [DataFusion] Improve test comparisons to record batch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11321">ARROW-11321</a> - [Rust][DataFusion] Fix DataFusion compilation error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11325">ARROW-11325</a> - [Packaging][C#] Release Apache.Arrow.Flight and Apache.Arrow.Flight.AspNetCore</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11329">ARROW-11329</a> - [Rust] Do not rebuild the library on every change</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11330">ARROW-11330</a> - [Rust][DataFusion] Add ExpressionVisitor pattern</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11332">ARROW-11332</a> - [Rust] Use MutableBuffer in take_string instead of Vec</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11333">ARROW-11333</a> - [Rust] Support creating arbitrary nested empty arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11336">ARROW-11336</a> - [C++][Doc] Improve Developing on Windows docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11338">ARROW-11338</a> - [R] Bindings for quantile and median</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11340">ARROW-11340</a> - [C++] Add vcpkg.json manifest to cpp project root</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11343">ARROW-11343</a> - [DataFusion] Simplified example</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11346">ARROW-11346</a> - [C++][Compute] Implement quantile kernel benchmark</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11349">ARROW-11349</a> - [Rust] Add from_iter_values to create arrays from T instead of Option&lt;T&gt;</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11350">ARROW-11350</a> - [C++] Bump dependency versions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11354">ARROW-11354</a> - [Rust] Speed-up casts of dates and times</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11355">ARROW-11355</a> - [Rust] Align Date type with spec</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11358">ARROW-11358</a> - [Rust] Add benchmark for concatenating small arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11360">ARROW-11360</a> - [Rust] [DataFusion] Improve CSV “No files found” error message</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11361">ARROW-11361</a> - [Rust] Build buffers from iterator of booleans</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11362">ARROW-11362</a> - [Rust][DataFusion] Use iterator APIs in to_array_of_size to improve performance</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11365">ARROW-11365</a> - [Rust] [Parquet] Implement parsers for v2 of the text schema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11366">ARROW-11366</a> - [Rust][DataFusion] Add Constant Folding / Support boolean literal in equality expression</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11367">ARROW-11367</a> - [C++] Implement approximate quantile utility</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11369">ARROW-11369</a> - [DataFusion] Split expressions.rs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11372">ARROW-11372</a> - Support RC verification on macOS-ARM64</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11373">ARROW-11373</a> - [Python][Docs] Add example of specifying type for a column when reading csv file</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11374">ARROW-11374</a> - [Python] Make legacy pyarrow.filesystem / pyarrow.serialize warnings more visible</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11375">ARROW-11375</a> - [Rust] CI fails due to deprecation warning in clippy</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11377">ARROW-11377</a> - [C++][CI] Add ThreadSanitizer nightly build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11383">ARROW-11383</a> - [Rust] use trusted len on bit ops</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11386">ARROW-11386</a> - [Release] Fix post documents update script</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11389">ARROW-11389</a> - [Rust] Inconsistent comments for datatypes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11395">ARROW-11395</a> - [DataFusion] Support custom optimizations</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11401">ARROW-11401</a> - [Rust][DataFusion] Pass slices instead of Vec in DataFrame API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11404">ARROW-11404</a> - [Rust][DataFusion] Upgrade to aHash 0.7</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11405">ARROW-11405</a> - [DataFusion] Support multiple custom nodes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11406">ARROW-11406</a> - [CI][C++] Fix caching on Travis-CI builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11408">ARROW-11408</a> - Add window support to datafusion readme</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11411">ARROW-11411</a> - [Packaging][Linux] Disable arm64 nightly builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11414">ARROW-11414</a> - [Rust] Reduce copies in Schema::try_merge</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11417">ARROW-11417</a> - [Integration] Add integration test for buffer compression</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11418">ARROW-11418</a> - [Doc] Add IPC buffer compression to support matrix</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11421">ARROW-11421</a> - [Rust][DataFusion] Support group by Date32</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11422">ARROW-11422</a> - [C#] Add support for decimals</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11423">ARROW-11423</a> - [R] value_counts and some StructArray methods</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11425">ARROW-11425</a> - [C++][Compute] Improve quantile kernel for integers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11426">ARROW-11426</a> - [Rust][DataFusion] EXTRACT support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11428">ARROW-11428</a> - [Rust] Add power kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11429">ARROW-11429</a> - Make string comparison kernels generic over Utf8 and LargeUtf8</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11430">ARROW-11430</a> - [Rust] Kernel to combine two arrays based on boolean mask</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11431">ARROW-11431</a> - [Rust] [DataFusion] Add support for the SQL HAVING clause</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11435">ARROW-11435</a> - Allow creating ParquetPartition from external crate</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11436">ARROW-11436</a> - [Rust] Allow non-sized iterators in Primitive::from_iter</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11437">ARROW-11437</a> - [Rust] Simplify benches</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11438">ARROW-11438</a> - Unsupported ast node Value(Boolean(true)) in sqltorel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11439">ARROW-11439</a> - [Rust] Add year support to temporal kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11440">ARROW-11440</a> - [Rust] [DataFusion] Add method to CsvExec to get CSV schema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11442">ARROW-11442</a> - [Rust] Expose the logic used to interpret date/times</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11443">ARROW-11443</a> - [Rust] Write datetime information for Date64 Type in csv writer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11444">ARROW-11444</a> - [Rust][DataFusion] Pass slices instead of &amp;Vec to functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11446">ARROW-11446</a> - [DataFusion] Support scalars in builtin functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11447">ARROW-11447</a> - [Rust] Add shift kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11449">ARROW-11449</a> - [CI][R][Windows] Use ccache</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11457">ARROW-11457</a> - [Rust] Make string comparison kernels generic over Utf8 and LargeUtf8</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11459">ARROW-11459</a> - [Rust] Allow ListArray of primitives to be built from iterator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11462">ARROW-11462</a> - [Developer] Remove needless quote from the default DOCKER_VOLUME_PREFIX</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11463">ARROW-11463</a> - [Python] Allow configuration of IpcWriterOptions 64Bit from PyArrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11466">ARROW-11466</a> - [Flight][Go] Add BasicAuth and BearerToken handlers for Go</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11467">ARROW-11467</a> - [R] Fix reference to json_table_reader() in R docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11468">ARROW-11468</a> - [R] Allow user to pass schema to read_json_arrow()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11474">ARROW-11474</a> - [C++] Update bundled re2 version</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11476">ARROW-11476</a> - [Rust][DataFusion] Test running of TPCH benchmarks in CI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11477">ARROW-11477</a> - [R][Doc] Reorganize and improve README and vignette content</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11478">ARROW-11478</a> - [R] Consider ways to make arrow.skip_nul option more user-friendly</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11479">ARROW-11479</a> - [Rust][Parquet] Add method to return compressed size of row group</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11481">ARROW-11481</a> - [Rust] More cast implementations</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11484">ARROW-11484</a> - [Rust] Derive Clone for ExecutionContext</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11486">ARROW-11486</a> - [Website] Use Jekyll 4 and webpack to support Ruby 3.0 or later</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11489">ARROW-11489</a> - [Rust][DataFusion] Make DataFrame Send+Sync</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11491">ARROW-11491</a> - [Rust] Support json schema inference for nested list and struct</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11493">ARROW-11493</a> - [CI][Packaging][deb][RPM] Test built packages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11500">ARROW-11500</a> - [R] Allow bundled build script to run on Solaris</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11501">ARROW-11501</a> - [C++] endianness check does not work on Solaris</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11504">ARROW-11504</a> - [Rust] verify Datatype in ListArray::from(ArrayDataRef)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11505">ARROW-11505</a> - [Rust] Add support for LargeUtf8 in csv-writer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11507">ARROW-11507</a> - [R] Bindings for GetRuntimeInfo</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11510">ARROW-11510</a> - [Python] Add note that pip &gt;= 19.0 is required to get binary packages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11511">ARROW-11511</a> - [Rust] Replace Arc&lt;ArrayData&gt; by ArrayData</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11512">ARROW-11512</a> - [Packaging][deb] Add missing gRPC dependency for Ubuntu 21.04</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11513">ARROW-11513</a> - [R] Bindings for sub/gsub</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11516">ARROW-11516</a> - [R] Allow all C++ compute functions to be called by name in dplyr</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11539">ARROW-11539</a> - [Developer][Archery] Change items_per_seconds units</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11541">ARROW-11541</a> - [C++][Compute] Implement approximate quantile kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11542">ARROW-11542</a> - [Rust] json reader should not crash when reading nested list</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11544">ARROW-11544</a> - [Rust] [DataFusion] Implement as_any for AggregateExpr</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11545">ARROW-11545</a> - [Rust] [DataFusion] SendableRecordBatchStream should implement Sync</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11556">ARROW-11556</a> - [C++] Minor benchmark improvements</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11557">ARROW-11557</a> - [Rust] Add table de-registration to DataFusion ExecutionContext</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11559">ARROW-11559</a> - [C++] Improve flatbuffers verification limits</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11561">ARROW-11561</a> - [Rust][DataFusion] Add Send + Sync to MemTable::load</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11563">ARROW-11563</a> - [Rust] Support Cast(Utf8, TimeStamp(Nanoseconds, None))</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11568">ARROW-11568</a> - [C++][Compute] Mode kernel performance is bad in some conditions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11570">ARROW-11570</a> - [Rust] ScalarValue - support Date64</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11571">ARROW-11571</a> - [CI] Cancel stale GitHub Actions workflow runs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11572">ARROW-11572</a> - [Rust] Add a kernel for division by single scalar</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11573">ARROW-11573</a> - [Developer][Archery] Google benchmark now reports run type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11574">ARROW-11574</a> - [Rust][DataFusion] Upgrade sqlparser to 0.8 to support parsing all TPC-H queries</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11575">ARROW-11575</a> - [Developer][Archery] Expose execution time in benchmark results</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11576">ARROW-11576</a> - [Rust] Remove unused variable in example</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11580">ARROW-11580</a> - [C++] Add CMake option ARROW_DEPENDENCY_SOURCE=VCPKG</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11589">ARROW-11589</a> - [R] Add methods for modifying Schemas</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11590">ARROW-11590</a> - [C++] Move CSV background generator to IO thread pool</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11591">ARROW-11591</a> - [C++][Compute] Prototype version of hash aggregation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11592">ARROW-11592</a> - [Rust] Typo in comment</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11594">ARROW-11594</a> - [Rust] Support pretty printing with NullArrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11597">ARROW-11597</a> - [Rust] Split datatypes in a module</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11598">ARROW-11598</a> - [Rust] Split buffer.rs in smaller files</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11599">ARROW-11599</a> - [Rust] Add function to create array with all nulls</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11601">ARROW-11601</a> - [C++][Dataset] Expose pre-buffering in ParquetFileFormatReaderOptions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11606">ARROW-11606</a> - [Rust] [DataFusion] Need guidance on HashAggregateExec reconstruction</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11610">ARROW-11610</a> - [C++] Download boost from sourceforge instead of bintray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11612">ARROW-11612</a> - [C++] Rebuild trimmed boost bundle for 1.75.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11613">ARROW-11613</a> - [R] Move nightly C++ builds off of bintray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11616">ARROW-11616</a> - [Rust][DataFusion] Expose collect_partitioned for DataFrame</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11621">ARROW-11621</a> - [CI][Gandiva][Linux] Fix Crossbow setup failure</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11626">ARROW-11626</a> - [Rust][DataFusion] Move DataFusion examples to own project to reduce nr dependencies</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11627">ARROW-11627</a> - [Rust] Typed allocator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11637">ARROW-11637</a> - [CI][Conda] Update nightly clean target platforms and packages list</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11641">ARROW-11641</a> - [CI] Use docker buildkit’s inline cache to reuse build cache across different hosts</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11649">ARROW-11649</a> - [R] Add support for null_fallback to R</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11651">ARROW-11651</a> - [Rust][DataFusion] Implement Postgres Length Functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11653">ARROW-11653</a> - Ascii/unicode functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11655">ARROW-11655</a> - Pad/trim functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11656">ARROW-11656</a> - Left over functions/fixes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11659">ARROW-11659</a> - [R] Preserve group_by .drop argument</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11662">ARROW-11662</a> - [C++] Support sorting for decimal data type.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11664">ARROW-11664</a> - [Rust] Cast to LargeUtf8</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11665">ARROW-11665</a> - [Python] Document precision and scale parameters of decimal128()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11666">ARROW-11666</a> - [Integration] Add endianness “gold” integration file for decimal256</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11667">ARROW-11667</a> - [Rust] Add docs for utf8 comparison functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11669">ARROW-11669</a> - [Rust] [DataFusion] Remove concurrency field from GlobalLimitExec</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11671">ARROW-11671</a> - [Rust][DataFusion] Clean up docs on Expr</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11677">ARROW-11677</a> - [C++][Dataset] Write documentation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11680">ARROW-11680</a> - [C++] Add vendored version of folly’s spsc queue</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11683">ARROW-11683</a> - [R] Support dplyr::mutate()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11685">ARROW-11685</a> - [C++] Typo in future_test.cc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11688">ARROW-11688</a> - [Rust] Casts between utf8 and large-utf8</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11690">ARROW-11690</a> - [Rust][DataFusion] Avoid Expr::clone in Expr builder methods</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11692">ARROW-11692</a> - [Rust][DataFusion] Improve documentation on Optimizer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11693">ARROW-11693</a> - [C++] Add string length kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11700">ARROW-11700</a> - [R] Internationalize error handling in tidy eval</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11701">ARROW-11701</a> - [R] Implement dplyr::relocate()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11703">ARROW-11703</a> - [R] Implement dplyr::arrange()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11704">ARROW-11704</a> - [R] Wire up dplyr::mutate() for datasets</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11707">ARROW-11707</a> - Support CSV schema inference without IO</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11708">ARROW-11708</a> - Clean up Rust 2021 linting warning</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11709">ARROW-11709</a> - [Rust][DataFusion] Move `expressions` and `inputs` into LogicalPlan rather than helpers in util</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11710">ARROW-11710</a> - [Rust][DataFusion] Implement ExprRewriter to avoid tree traversal redundancy</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11719">ARROW-11719</a> - Support merged schema for memory table</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11721">ARROW-11721</a> - json schema inference should return Schema type instead of SchemaRef</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11722">ARROW-11722</a> - Improve error message in FFI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11724">ARROW-11724</a> - [C++] Namespace collisions with protobuf 3.15</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11725">ARROW-11725</a> - [Rust][DataFusion] Make use of the new divide_scalar kernel in arrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11727">ARROW-11727</a> - [C++][FlightRPC] Use TDigest to estimate latency quantiles in benchmark</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11730">ARROW-11730</a> - [C++] Add implicit Future(Status) constructor for convenience</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11733">ARROW-11733</a> - [Rust][DataFusion] Support hash repartitioning</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11734">ARROW-11734</a> - [C++] vendored safe-math.h does not compile on Solaris</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11735">ARROW-11735</a> - [R] Allow Parquet and Arrow Dataset to be optional components</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11736">ARROW-11736</a> - [R] Allow string compute functions to be optional</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11737">ARROW-11737</a> - [C++] Patch vendored xxhash for Solaris</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11738">ARROW-11738</a> - [Rust][DataFusion] Concat Functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11740">ARROW-11740</a> - [C++] posix_memalign not declared in scope on Solaris</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11742">ARROW-11742</a> - [Rust] [DataFusion] Add Expr::is_null and Expr::is_not_null functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11744">ARROW-11744</a> - [C++] Add xsimd dependency</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11745">ARROW-11745</a> - [C++] Improve configurability of random data generation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11750">ARROW-11750</a> - [Python][Dataset] Add support for project expressions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11752">ARROW-11752</a> - [R] Replace usage of testthat::expect_is()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11753">ARROW-11753</a> - [Rust][DataFusion] Add test for Join Statement: Schema contains duplicate unqualified field name</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11754">ARROW-11754</a> - [R] Support dplyr::compute()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11761">ARROW-11761</a> - [C++] Increase public API testing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11766">ARROW-11766</a> - [R] Better handling for missing compression codecs on Linux</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11768">ARROW-11768</a> - [C++][CI] Make s390x build non-optional</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11773">ARROW-11773</a> - [Rust] Allow json writer to write out JSON arrays as well as newline formatted objects</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11774">ARROW-11774</a> - [R] one-line install from source on macOS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11775">ARROW-11775</a> - [Rust][DataFusion] Feature Flags for Dependencies</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11777">ARROW-11777</a> - [Rust] impl AsRef for StringBuilder/BinaryBuilder</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11778">ARROW-11778</a> - Cast from large-utf8 to numerical arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11779">ARROW-11779</a> - [Rust] make alloc module public</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11790">ARROW-11790</a> - [Rust][DataFusion] Change plan builder signature to take Vec&lt;Expr&gt; rather than &amp;[Expr]</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11794">ARROW-11794</a> - [Go] Add concurrent-safe ipc.FileReader.RecordAt(i)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11795">ARROW-11795</a> - [MATLAB] Migrate MATLAB Interface for Apache Arrow design doc to Markdown</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11797">ARROW-11797</a> - [C++][Dataset] Provide Scanner methods to yield/visit scanned batches</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11798">ARROW-11798</a> - [Integration] Update testing submodule</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11799">ARROW-11799</a> - [Rust] String and Binary arrays created with incorrect length from unbound iterator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11801">ARROW-11801</a> - [C++] Remove bad header guard in filesystem/type_fwd.h</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11803">ARROW-11803</a> - [Rust] [Parquet] Support v2 LogicalType</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11806">ARROW-11806</a> - [Rust][DataFusion] Optimize inner join creation of indices</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11820">ARROW-11820</a> - Added macro create_native to construct impl</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11822">ARROW-11822</a> - Support case sensitive for function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11824">ARROW-11824</a> - [Rust] [Parquet] Use logical types in Arrow writer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11825">ARROW-11825</a> - [Rust][DataFusion] Add mimalloc as option to benchmarks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11833">ARROW-11833</a> - [C++] Vendored fast_float errors for emscripten (architecture flag missing)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11837">ARROW-11837</a> - [C++][Dataset] Expose originating fragment as a property of ScanTask</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11838">ARROW-11838</a> - [C++] Support reading IPC data with shared dictionaries</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11839">ARROW-11839</a> - [C++] Rewrite bit-unpacking optimizations using xsimd</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11842">ARROW-11842</a> - [Rust][Parquet] Use more efficient clone_from in get_batch_with_dict</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11852">ARROW-11852</a> - [Documentation] Update CONTRIBUTING to explain Contributor role</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11856">ARROW-11856</a> - [C++] Remove unused reference to RecordBatchStreamWriter</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11858">ARROW-11858</a> - [GLib] Gandiva Filter in GLib</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11859">ARROW-11859</a> - [GLib] GArrowArray: concatenate is missing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11864">ARROW-11864</a> - [R] Document arrow.int64_downcast option</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11870">ARROW-11870</a> - [Dev] Automatically run merge script in venv</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11876">ARROW-11876</a> - [Website] Update governance page</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11877">ARROW-11877</a> - [C++] Add initial microbenchmarks for Dataset internals</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11879">ARROW-11879</a> - [Rust][DataFusion] ExecutionContext::sql should optimize query plan</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11883">ARROW-11883</a> - [C++] Add ConcatMap, MergeMap, and an async-reentrant version of Map</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11887">ARROW-11887</a> - [C++] Add asynchronous read to streaming CSV reader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11894">ARROW-11894</a> - [Rust][DataFusion] Change flight server example to use DataFrame API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11895">ARROW-11895</a> - [Rust][DataFusion] Add support for extra column statistics</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11898">ARROW-11898</a> - [Rust] Pretty print columns</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11899">ARROW-11899</a> - [Java] Refactor the compression codec implementation into core/Arrow specific parts</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11900">ARROW-11900</a> - [Website] Add Yibo to committer list</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11906">ARROW-11906</a> - [R] Make FeatherReader print method more informative</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11907">ARROW-11907</a> - [C++] Use our own executor in S3FileSystem</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11910">ARROW-11910</a> - [Packaging][Ubuntu] Drop support for 16.04</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11911">ARROW-11911</a> - [Website] Add protobuf vs arrow to FAQ</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11912">ARROW-11912</a> - [R] Remove args from FeatherReader$create</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11913">ARROW-11913</a> - [Rust] Improve performance of StringBuilder</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11920">ARROW-11920</a> - [R] Add r/libarrow to make clean</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11921">ARROW-11921</a> - [R] Set LC_COLLATE in r/data-raw/codegen.R</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11924">ARROW-11924</a> - [C++] Provide streaming output from GetFileInfo</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11925">ARROW-11925</a> - [R] Add `between` method for arrow_dplyr_query</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11927">ARROW-11927</a> - [Rust][DataFusion] Support limit push down</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11931">ARROW-11931</a> - [Go][CI] Bump CI to use Go 1.15</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11935">ARROW-11935</a> - [C++] Add push generator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11944">ARROW-11944</a> - [Developer] Archery benchmark diff regression: cannot compare jsons</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11949">ARROW-11949</a> - [Ruby] Accept raw Ruby objects as sort key and options</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11951">ARROW-11951</a> - [Rust] Remove OffsetSize::prefix</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11952">ARROW-11952</a> - [Rust] Make ArrayData -&gt; GenericListArray fallible instead of `panic!`</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11954">ARROW-11954</a> - [C++] arrow/util/io_util.cc does not compile on Solaris</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11955">ARROW-11955</a> - [Rust][DataFusion] Support Union</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11958">ARROW-11958</a> - [GLib] GArrowChunkedArray: combine is missing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11959">ARROW-11959</a> - [Rust][DataFusion] Fix logging of optimized plan</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11962">ARROW-11962</a> - [Rust][DataFusion] Update Datafusion Docs / readme</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11969">ARROW-11969</a> - [Rust][DataFusion] Improve Examples in documentation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11972">ARROW-11972</a> - [C++][Dataset] Extract IpcFragmentScanOptions, ParquetFragmentScanOptions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11973">ARROW-11973</a> - [Rust] Boolean AND/OR kernels should follow sql behaviour regarding null values</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11977">ARROW-11977</a> - [Rust] Add documentation examples for sort kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11982">ARROW-11982</a> - [Rust] Donate Ballista Distributed Compute Platform</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11984">ARROW-11984</a> - [C++][Gandiva] Implement SHA1 and SHA256 functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11987">ARROW-11987</a> - [C++][Gandiva] Implement trigonometric functions on Gandiva</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11988">ARROW-11988</a> - [C++][Gandiva] Implements the last_day function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11992">ARROW-11992</a> - [Rust][Parquet] Add upgrade notes on 4.0 rename of LogicalType #9731</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11993">ARROW-11993</a> - [C++] Don’t download xsimd if ARROW_SIMD_LEVEL=NONE</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11996">ARROW-11996</a> - [R] Make r/configure run successfully on Solaris</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11999">ARROW-11999</a> - [Java] Support parallel vector element search with user-specified comparator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12000">ARROW-12000</a> - [Documentation] Add note about deviation from style guide on struct/classes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12005">ARROW-12005</a> - [R] Fix a bash typo in configure</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12017">ARROW-12017</a> - [R] [Documentation] Make proper developing arrow docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12019">ARROW-12019</a> - [Rust] [Parquet] Update README for 2.6.0 support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12020">ARROW-12020</a> - [Rust][DataFusion] Adding SHOW TABLES and SHOW COLUMNS + partial information_schema support to DataFusion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12031">ARROW-12031</a> - [C++][CSV] infer CSV timestamps columns with fractional seconds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12032">ARROW-12032</a> - [Rust] Optimize comparison kernels using trusted_len iterator for bools</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12034">ARROW-12034</a> - [Docs] Formalize Minor PRs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12037">ARROW-12037</a> - [Rust] [DataFusion] Support catalogs and schemas for table namespacing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12038">ARROW-12038</a> - [Rust][DataFusion] Upgrade hashbrown to 0.11</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12039">ARROW-12039</a> - [CI][C++][Gandiva] Fix gandiva nightly linux build failure</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12040">ARROW-12040</a> - [R] [CI] [C++] test-r-rstudio-r-base-3.6-opensuse15 timing out during tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12043">ARROW-12043</a> - [Rust] [Parquet] Write fixed size binary arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12045">ARROW-12045</a> - First Chunk of ported Parquet Code</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12047">ARROW-12047</a> - [Rust] Clippy parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12048">ARROW-12048</a> - [Rust][DataFusion] Support Common Table Expressions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12052">ARROW-12052</a> - [Rust] Implement child data in C FFI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12056">ARROW-12056</a> - [C++] Create sequencing AsyncGenerator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12058">ARROW-12058</a> - [Python] Enable arithmetic operations on Expressions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12068">ARROW-12068</a> - [Python] Stop using distutils</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12069">ARROW-12069</a> - [C++][Gandiva] Implement IN expressions for Decimal types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12070">ARROW-12070</a> - [GLib] Drop support for GNU Autotools</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12071">ARROW-12071</a> - [GLib] Keep input stream reference of GArrowJSONReader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12075">ARROW-12075</a> - [Rust][DataFusion] Add CTE to list of supported features</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12081">ARROW-12081</a> - [R] Bindings for utf8_length</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12082">ARROW-12082</a> - [R][Dataset] Allow create dataset from vector of file paths</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12094">ARROW-12094</a> - [C++][R] Fix/workaround re2 building on clang/libc++</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12097">ARROW-12097</a> - [C++] Modify BackgroundGenerator so it creates fewer threads</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12098">ARROW-12098</a> - [R] Catch cpp build failures on linux</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12104">ARROW-12104</a> - Next Chunk of ported Code</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12106">ARROW-12106</a> - [Rust][DataFusion] Support `SELECT * from information_schema.tables`</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12107">ARROW-12107</a> - [Rust][DataFusion] Support `SELECT * from information_schema.columns`</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12108">ARROW-12108</a> - [Rust][DataFusion] Support `SHOW TABLES`</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12109">ARROW-12109</a> - [Rust][DataFusion] Support `SHOW COLUMNS`</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12110">ARROW-12110</a> - [Java] Implement ZSTD buffer compression for java</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12111">ARROW-12111</a> - [Java] place files generated by flatc under source control</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12116">ARROW-12116</a> - [Rust] Fix or ignore 1.51 clippy lints</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12119">ARROW-12119</a> - [Rust][DataFusion] Improve performance of to_array_of_size</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12120">ARROW-12120</a> - [Rust] Generate random arrays and batches</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12121">ARROW-12121</a> - [Rust] [Parquet] Arrow writer benchmarks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12123">ARROW-12123</a> - [Rust][DataFusion] Use smallvec for indices for better join performance</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12128">ARROW-12128</a> - [CI][Crossbow] Remove (or fix) test-ubuntu-16.04-cpp job</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12131">ARROW-12131</a> - [CI][GLib] Ensure upgrading MSYS2</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12133">ARROW-12133</a> - [C++][Gandiva] Add option to disable setting mcpu flag to host cpu during llvm ir compilation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12134">ARROW-12134</a> - [C++] Add regex string match kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12136">ARROW-12136</a> - [Rust][DataFusion] Reduce default batch_size to 8192</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12139">ARROW-12139</a> - [Python][Packaging] Use vcpkg to build macOS wheels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12141">ARROW-12141</a> - [R] Bindings for grepl</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12143">ARROW-12143</a> - [CI] R builds should timeout and fail after some threshold and dump the output.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12146">ARROW-12146</a> - [C++][Gandiva] Implement CONVERT_FROM(expression, ‘UTF8’, replacement char) function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12151">ARROW-12151</a> - [Docs] Add Jira component + summary conventions to the docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12153">ARROW-12153</a> - [Rust] [Parquet] Return file metadata after writing Parquet file</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12160">ARROW-12160</a> - [Rust] Add an `into_inner()` method to ipc::writer::StreamWriter</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12164">ARROW-12164</a> - [Java] Make BaseAllocator.Config public</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12165">ARROW-12165</a> - [Rust] Inline append functions in builders for performance</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12168">ARROW-12168</a> - [Go][IPC] Implement Compression handling for IPC</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12170">ARROW-12170</a> - [Rust][DataFusion] Introduce repartition optimization</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12173">ARROW-12173</a> - [GLib] Remove #include &lt;config.h&gt;</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12176">ARROW-12176</a> - parquet/low-level-api/reader-writer.cc has some typos.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12187">ARROW-12187</a> - [C++][FlightRPC] Enable compression in Flight benchmark</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12188">ARROW-12188</a> - [Docs] Switch to pydata-sphinx-theme for the main sphinx docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12190">ARROW-12190</a> - [Rust][DataFusion] Implement partitioned hash join</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12192">ARROW-12192</a> - [Website] Use downloadable URL for archive download</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12193">ARROW-12193</a> - [Dev][Release] Use downloadable URL for archive download</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12194">ARROW-12194</a> - [Rust] [Parquet] Update zstd version</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12197">ARROW-12197</a> - [R] dplyr bindings for cast, dictionary_encode</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12200">ARROW-12200</a> - [R] Export and document list_compute_functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12204">ARROW-12204</a> - [Rust][CI] Reduce size of rust build artifacts in integration test</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12206">ARROW-12206</a> - [Python] Fix Table docstrings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12208">ARROW-12208</a> - [C++] Add the ability to run async tasks without using the CPU thread pool</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12210">ARROW-12210</a> - [Rust][DataFusion] Document SHOW TABLES / SHOW COLUMNS / InformationSchema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12214">ARROW-12214</a> - [Rust][DataFusion] Add some tests for limit</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12215">ARROW-12215</a> - [C++] fixed size binary columns cannot be null in CSV reader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12217">ARROW-12217</a> - [C++] Cleanup cpp examples source file names</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12222">ARROW-12222</a> - [Dev][Packaging] Include build url in the crossbow console report</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12224">ARROW-12224</a> - [Rust] Use stable rust for no default test, clean up CI tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12228">ARROW-12228</a> - [CI] Create base image for conda environments</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12236">ARROW-12236</a> - [R][CI] Add check that all docs pages are listed in _pkgdown.yml</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12237">ARROW-12237</a> - [Packaging][Debian] Add support for bullseye</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12238">ARROW-12238</a> - [JS] Remove trailing spaces</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12239">ARROW-12239</a> - [JS] Switch to yarn</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12242">ARROW-12242</a> - [Python][Doc] Tweak nightly build instructions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12246">ARROW-12246</a> - [CI] Sync conda recipes with upstream feedstock</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12248">ARROW-12248</a> - [C++] Allow static builds to change memory allocators</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12249">ARROW-12249</a> - [R] [CI] Fix test-r-install-local nightlies</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12251">ARROW-12251</a> - [Rust] [Ballista] Add Ballista tests to CI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12263">ARROW-12263</a> - [Dev][Packaging] Move Crossbow to Archery</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12269">ARROW-12269</a> - [JS] Move to eslint</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12274">ARROW-12274</a> - [JS] Document how to run tests without building</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12277">ARROW-12277</a> - [Rust][DataFusion] Min/Max are not supported for timestamp types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12278">ARROW-12278</a> - [Rust][DataFusion] Use Timestamp(Nanosecond, None) for SQL TIMESTAMP Type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12280">ARROW-12280</a> - [Developer] Remove @-mentions from commit messages in merge tool</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12281">ARROW-12281</a> - [JS] Remove shx, trash, and rimraf</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12283">ARROW-12283</a> - [R] Bindings for basic type convert functions in dplyr verbs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12286">ARROW-12286</a> - [C++] Create AsyncGenerator from Future&lt;AsyncGenerator&lt;T&gt;&gt;</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12287">ARROW-12287</a> - [C++] Create enumerating generator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12288">ARROW-12288</a> - [C++] Create Scanner interface</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12289">ARROW-12289</a> - [C++] Create basic AsyncScanner implementation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12303">ARROW-12303</a> - [JS] Use iterators instead of generators in critical code paths</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12304">ARROW-12304</a> - [R] Update news and polish docs for 4.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12305">ARROW-12305</a> - [JS] Benchmark test data generate.py assumes python 2</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12309">ARROW-12309</a> - [JS] Make es2015 bundles the default</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12316">ARROW-12316</a> - [C++] Switch default memory allocator from jemalloc to mimalloc on macOS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12317">ARROW-12317</a> - [Rust] JSON writer does not support time, date or interval types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12320">ARROW-12320</a> - [CI] REPO arg missing from conda-cpp-valgrind</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12323">ARROW-12323</a> - [C++][Gandiva] Implement castTIME(timestamp) function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12325">ARROW-12325</a> - [C++] [CI] Nightly gandiva build failing due to failure of compiler to move return value</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12326">ARROW-12326</a> - [C++] Avoid needless c-ares detection</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12328">ARROW-12328</a> - [Rust] [Ballista] Fix code formatting</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12329">ARROW-12329</a> - [Rust] [Ballista] Add README</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12332">ARROW-12332</a> - [Rust] [Ballista] Api server for scheduler</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12333">ARROW-12333</a> - [JS] Remove jest-environment-node-debug and do not emit from typescript by default</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12335">ARROW-12335</a> - [Rust] [Ballista] Bump DataFusion version</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12337">ARROW-12337</a> - add DoubleEndedIterator and ExactSizeIterator traits</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12351">ARROW-12351</a> - [CI][Ruby] Use ruby/setup-ruby instead of actions/setup-ruby</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12352">ARROW-12352</a> - [CI][R][Windows] Remove needless workaround for MSYS2</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12353">ARROW-12353</a> - [Packaging][deb] Rename -archive-keyring to -apt-source</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12354">ARROW-12354</a> - [Packaging][RPM] Use apache.jfrog.io/artifactory/ instead of apache.bintray.com/</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12356">ARROW-12356</a> - [Website] Update install page instructions to point to artifactory</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12361">ARROW-12361</a> - [Rust] [DataFusion] Allow users to override physical optimization rules</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12367">ARROW-12367</a> - [C++] Stop producing when PushGenerator was destroyed</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12370">ARROW-12370</a> - [R] Bindings for power kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12374">ARROW-12374</a> - [CI][C++][cron] Use Ubuntu 20.04 instead of 16.04</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12375">ARROW-12375</a> - [Release] Remove rebase post-release scripts</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12376">ARROW-12376</a> - [Dev] archery trigger-bot should use logger.exception</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12380">ARROW-12380</a> - [Rust][Ballista] Add scheduler ui</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12381">ARROW-12381</a> - [Packaging][Python] macOS wheels are built with wrong package kind</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12383">ARROW-12383</a> - [JS] Update direct deps</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12384">ARROW-12384</a> - [JS] Improve code style</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12389">ARROW-12389</a> - [R] [Docs] Add note about autocasting</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12395">ARROW-12395</a> - [C++] Create RunInSerialExecutor benchmark</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12396">ARROW-12396</a> - [Python][Docs] Clarify serialization docstrings about deprecated status</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12397">ARROW-12397</a> - [Rust] [DataFusion] Simplify readme example #10038</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12398">ARROW-12398</a> - [Rust] Remove double bound checks in iterators</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12400">ARROW-12400</a> - [Rust] Re-enable transform module tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12402">ARROW-12402</a> - [Rust] [DataFusion] Implement SQL metrics framework</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12406">ARROW-12406</a> - [R] fix checkbashims violation in configure</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12409">ARROW-12409</a> - [R] Remove LazyData from DESCRIPTION</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12419">ARROW-12419</a> - [Java] flatc is not used in mvn</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12420">ARROW-12420</a> - [C++/Dataset] Reading null columns as dictionary no longer possible</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12423">ARROW-12423</a> - [Docs] Codecov badge in main Readme only applies to Rust</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12425">ARROW-12425</a> - [Rust] new_null_array doesn’t allocate keys buffer for dictionary arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12432">ARROW-12432</a> - [Rust] [DataFusion] Add metrics for SortExec</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12436">ARROW-12436</a> - [Rust][Ballista] Add watch capabilities to config backend trait</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12467">ARROW-12467</a> - [C++][Gandiva] Add support for LLVM12</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12477">ARROW-12477</a> - [Release] Download linux aarch64 miniforge in verify-release-candidate.sh</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12485">ARROW-12485</a> - [C++] Use mimalloc as the default memory allocator on macOS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12488">ARROW-12488</a> - [GLib] Use g_memdup2() with GLib 2.68 or later</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12494">ARROW-12494</a> - [C++] ORC adapter fails to compile on GCC 4.8</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1846">PARQUET-1846</a> - [C++] Remove deprecated IO classes and related functions</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1899">PARQUET-1899</a> - [C++] Deprecated ReadBatchSpaced in parquet/column_reader</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1990">PARQUET-1990</a> - [C++] ConvertedType::NA is written out in some cases</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1993">PARQUET-1993</a> - [C++] Expose when prefetching completes</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1998">PARQUET-1998</a> - [C++] Implement LZ4_RAW compression</li>
</ul>
<h3 id="bug-fixes">Bug Fixes</h3>
<ul>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4784">ARROW-4784</a> - [C++][CI] Re-enable flaky mingw tests.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6818">ARROW-6818</a> - [Doc] Format docs confusing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7288">ARROW-7288</a> - [C++][R] read_parquet() freezes on Windows with Japanese locale</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7830">ARROW-7830</a> - [C++] Parquet library version doesn’t change with releases</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9451">ARROW-9451</a> - [Python] Unsigned integer types will accept string values in pyarrow.array</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9634">ARROW-9634</a> - [C++][Python] Restore non-UTC time zones when reading Parquet file that was previously Arrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9878">ARROW-9878</a> - [Python] table to_pandas self_destruct=True + split_blocks=True cannot prevent doubling memory</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10038">ARROW-10038</a> - [C++] SetCpuThreadPoolCapacity(1) spins up nCPUs threads</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10056">ARROW-10056</a> - [C++] Increase flatbuffers max_tables parameter in order to read wide tables</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10364">ARROW-10364</a> - [Dev][Archery] Test fails with semver 2.13.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10370">ARROW-10370</a> - [Python] Spurious s3fs-related test failures</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10403">ARROW-10403</a> - [C++] Implement unique kernel for dictionary type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10405">ARROW-10405</a> - [C++] IsIn kernel should be able to lookup dictionary in string</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10457">ARROW-10457</a> - [CI] Fix Spark branch-3.0 integration tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10489">ARROW-10489</a> - [C++] Unable to configure or make with intel compiler</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10514">ARROW-10514</a> - [C++][Parquet] Data inconsistency in parquet-reader output modes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10953">ARROW-10953</a> - [R] Validate when creating Table with schema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11066">ARROW-11066</a> - [Java] Is there a bug in flight AddWritableBuffer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11134">ARROW-11134</a> - [C++][CI] ARM64 job on Travis-CI doesn’t run tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11147">ARROW-11147</a> - [Python][CI] Parquet tests failing in nightly build with Dask master</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11180">ARROW-11180</a> - [Developer] cmake-format pre-commit hook doesn’t run</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11192">ARROW-11192</a> - [Documentation] Describe opening Visual Studio so it inherits a working env</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11223">ARROW-11223</a> - [Java] BaseVariableWidthVector/BaseLargeVariableWidthVector setNull and getBufferSizeFor is buggy</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11235">ARROW-11235</a> - [Python] S3 test failures inside non-default regions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11239">ARROW-11239</a> - [Rust] array::transform::tests::test_struct failed</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11269">ARROW-11269</a> - [Rust] Unable to read Parquet file because of mismatch in column-derived and embedded schemas</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11277">ARROW-11277</a> - [C++] Fix compilation error in dataset expressions on macOS 10.11</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11299">ARROW-11299</a> - [Python] build warning in python</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11303">ARROW-11303</a> - [Release][C++] Enable mimalloc in the windows verification script</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11305">ARROW-11305</a> - [Rust] parquet-rowcount binary tries to open itself as a parquet file</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11311">ARROW-11311</a> - [Rust] unset_bit is toggling bits, not unsetting them</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11313">ARROW-11313</a> - [Rust] Size hint of iterators is incorrect</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11315">ARROW-11315</a> - [Packaging][APT][arm64] Add missing gir1.2 files</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11320">ARROW-11320</a> - [C++] Spurious test failure when creating temporary dir</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11322">ARROW-11322</a> - [Rust] Arrow `memory` made private is a breaking API change</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11323">ARROW-11323</a> - [Rust][DataFusion] ComputeError(“concat requires input of at least one array”)) with queries with ORDER BY or GROUP BY that return no</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11328">ARROW-11328</a> - [R] Collecting zero columns from a dataset returns entire dataset</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11334">ARROW-11334</a> - [Python][CI] Nightly pandas builds failing because of internal pandas change</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11337">ARROW-11337</a> - [C++] Compilation error with ThreadSanitizer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11357">ARROW-11357</a> - [Rust] take primitive implementation is unsound</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11376">ARROW-11376</a> - [C++] ThreadedTaskGroup failure with Thread Sanitizer enabled</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11379">ARROW-11379</a> - [C++][Dataset] Reading dataset with filtering on timestamp partition field crashes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11387">ARROW-11387</a> - [Rust] Arrow 3.0.0 release with simd feature doesn’t compile without feature=avx512.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11391">ARROW-11391</a> - [C++] HdfsOutputStream::Write unsafely truncates integers exceeding INT32_MAX</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11394">ARROW-11394</a> - [Rust] Slice + Concat incorrect for structs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11400">ARROW-11400</a> - [Python] Pickled ParquetFileFragment has invalid partition_expression with dictionary type in pyarrow 2.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11403">ARROW-11403</a> - [Developer] archery benchmark list: unexpected keyword ‘benchmark_filter’</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11412">ARROW-11412</a> - [Python] Expressions not working with logical boolean operators (and, or, not)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11427">ARROW-11427</a> - [C++] Arrow uses AVX512 instructions even when not supported by the OS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11448">ARROW-11448</a> - [C++] tdigest build failure on Windows with Visual Studio</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11451">ARROW-11451</a> - [C++] Fix gcc-4.8 build error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11452">ARROW-11452</a> - [Rust] Parquet reader cannot read file where a struct column has the same name as struct member columns</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11461">ARROW-11461</a> - [Flight][Go] GetSchema does not work with Java Flight Server</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11464">ARROW-11464</a> - [Python] pyarrow.parquet.read_pandas doesn’t conform to its docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11470">ARROW-11470</a> - [C++] Overflow occurs on integer multiplications in ComputeRowMajorStrides, ComputeColumnMajorStrides, and CheckTensorStridesValidity</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11472">ARROW-11472</a> - [Python][CI] Kartothek integrations build is failing with numpy 1.20</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11480">ARROW-11480</a> - [Python] Segmentation fault reading parquet with date filter with INT96 column</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11483">ARROW-11483</a> - [Java][C++][Integration] C++ integration test creates JSON files incompatible with Java</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11488">ARROW-11488</a> - [Rust] StructBuilder’s Drop impl leaks memory</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11490">ARROW-11490</a> - [C++] BM_ArrowBinaryDict/EncodeLowLevel is not deterministic</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11494">ARROW-11494</a> - [Rust] Fix take bench</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11497">ARROW-11497</a> - [Python] pyarrow parquet writer for list does not conform with Apache Parquet specification</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11538">ARROW-11538</a> - [Python] Segfault reading Parquet dataset with Timestamp filter</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11547">ARROW-11547</a> - [Packaging][Conda][Drone] Nightly builds fail with undefined variable error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11548">ARROW-11548</a> - [C++] RandomArrayGenerator::List size mismatch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11551">ARROW-11551</a> - [C++][Gandiva] castTIMESTAMP(utf8) function doesn’t error out for invalid inputs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11560">ARROW-11560</a> - [FlightRPC][C++][Python] Interrupting a Flight server results in abort</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11567">ARROW-11567</a> - [C++][Compute] Variance kernel has precision issue</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11577">ARROW-11577</a> - [Rust] Concat kernel panics on slices of string arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11582">ARROW-11582</a> - [R] write_dataset “format” argument default and validation could be better</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11586">ARROW-11586</a> - [Rust] [Datafusion] Invalid SQL sometimes panics</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11595">ARROW-11595</a> - [C++][NIGHTLY:test-conda-cpp-valgrind] GenerateBitsUnrolled triggers valgrind on uninit inputs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11596">ARROW-11596</a> - [Python][Dataset] SIGSEGV when executing scan tasks with Python executors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11603">ARROW-11603</a> - [Rust] Fix clippy error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11607">ARROW-11607</a> - [Python] Error when reading table with list values from parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11614">ARROW-11614</a> - [C++][Gandiva] Fix round() logic to return positive zero when argument is zero</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11617">ARROW-11617</a> - [C++][Gandiva] Fix nested if-else optimisation in gandiva</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11620">ARROW-11620</a> - [Rust] [DataFusion] Inconsistent use of Box and Arc for TableProvider</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11630">ARROW-11630</a> - [Rust] Introduce partial_sort and limit option for sort kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11632">ARROW-11632</a> - [Rust] csv::Reader doesn’t propagate schema metadata to RecordBatches</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11639">ARROW-11639</a> - [C++][Gandiva] Fix signbit compilation issue in Ubuntu nightly build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11642">ARROW-11642</a> - [C++] Incorrect preprocessor directive for Windows in JVM detection</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11657">ARROW-11657</a> - [R] group_by with .drop specified errors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11658">ARROW-11658</a> - [R] Handle mutate/rename inside group_by</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11663">ARROW-11663</a> - [DataFusion] Master does not compile</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11668">ARROW-11668</a> - [C++] Sporadic UBSAN error in FutureStressTest.TryAddCallback</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11672">ARROW-11672</a> - [R] Fix string function test failure on R 3.3</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11681">ARROW-11681</a> - [Rust] IPC writers shouldn’t unwrap in destructors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11686">ARROW-11686</a> - [C++] flight-test-integration-client sometimes exits by SIGABRT but does not print the stack trace</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11687">ARROW-11687</a> - [Rust][DataFusion] RepartitionExec Hanging</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11694">ARROW-11694</a> - [C++] Array Take may dereference absent null bitmap</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11695">ARROW-11695</a> - [C++][FlightRPC][Packaging] Update support for disabling TLS server verification for recent gRPC versions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11717">ARROW-11717</a> - [Integration] Intermittent (but frequent) flight integration failures with auth:basic_proto</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11718">ARROW-11718</a> - [Rust] IPC writers shouldn’t implicitly finish on drop</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11741">ARROW-11741</a> - [C++] Decimal cast failure on big-endian</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11743">ARROW-11743</a> - [R] Use pkgdown’s new found ability to autolink Jiras</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11746">ARROW-11746</a> - [Developer][Archery] Fix prefer real time check</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11756">ARROW-11756</a> - [R] passing a partition as a schema leads to segfaults</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11758">ARROW-11758</a> - [C++][Compute] Summation kernel round-off error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11767">ARROW-11767</a> - [C++] Scalar::hash may segfault for null scalars</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11771">ARROW-11771</a> - [Developer][Archery] Move benchmark tests (so CI runs them)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11784">ARROW-11784</a> - [Rust][DataFusion] CoalesceBatchesStream doesn’t honor Stream interface</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11785">ARROW-11785</a> - [R] Fallback when filtering Table with unsupported expression fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11786">ARROW-11786</a> - [C++] CMake output noisy</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11788">ARROW-11788</a> - [Java] Appending Empty List Vector yields NPE</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11791">ARROW-11791</a> - [Rust][DataFusion] RepartitionExec Blocking</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11802">ARROW-11802</a> - [Rust][DataFusion] Mixing of crossbeam channel and async tasks can lead to deadlock</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11819">ARROW-11819</a> - [Rust] Add link to the doc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11821">ARROW-11821</a> - [Rust] Edit Rust README</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11830">ARROW-11830</a> - [C++] gRPC compilation tests occur every time</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11832">ARROW-11832</a> - [R] Handle conversion of extra nested struct column</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11836">ARROW-11836</a> - Target libarrow_bundled_dependencies.a is not already created but is already required.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11845">ARROW-11845</a> - [Rust] Debug implementation of Date32Array panics if array contains negative values</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11850">ARROW-11850</a> - [GLib] GARROW_VERSION_0_16 macro is missing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11855">ARROW-11855</a> - [C++] [Python] Memory leak in to_pandas when converting chunked struct array</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11857">ARROW-11857</a> - [Python] Resource temporarily unavailable when using the new Dataset API with Pandas</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11860">ARROW-11860</a> - [Rust] [DataFusion] Add DataFusion logos</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11866">ARROW-11866</a> - [C++] Arrow Flight SetShutdownOnSignals cause potential mutex deadlock in gRPC</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11872">ARROW-11872</a> - [C++] Array Validation of GPU buffers fails due to incorrect validation check</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11880">ARROW-11880</a> - [R] Handle empty or NULL transmute() args properly</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11881">ARROW-11881</a> - [Rust][DataFusion] Fix Clippy Lint</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11896">ARROW-11896</a> - [Rust] Hang / failure in CI on AMD64 Debian 10 Rust stable test workspace</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11904">ARROW-11904</a> - [C++] “pure virtual method called” crash at the end of arrow-csv-test</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11905">ARROW-11905</a> - [C++] SIMD info always returning none on MacOS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11914">ARROW-11914</a> - [R] [CI] r-sanitizer nightly is broken</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11918">ARROW-11918</a> - [R] [Documentation] Docs cleanups</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11923">ARROW-11923</a> - [CI] Update branch name for dask dev integration tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11937">ARROW-11937</a> - [C++] GZip codec hangs if flushed twice</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11941">ARROW-11941</a> - [Dev] “DEBUG=1 merge_arrow_pr.py” updates Jira issue</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11942">ARROW-11942</a> - [C++] If tasks are submitted quickly the thread pool may fail to spin up new threads</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11945">ARROW-11945</a> - [R] filter doesn’t accept negative numbers as valid</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11956">ARROW-11956</a> - [C++] Fix system re2 dependency detection for static library</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11965">ARROW-11965</a> - [R][Docs] Fix install.packages command in R dev docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11970">ARROW-11970</a> - [C++][CI] Fix Valgrind failures</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11971">ARROW-11971</a> - [Packaging] Vcpkg patch doesn’t apply on windows due to line endings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11975">ARROW-11975</a> - [CI][GLib] Failed to update gcc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11976">ARROW-11976</a> - [C++] Sporadic TSAN error in TestThreadPool.SetCapacity</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11983">ARROW-11983</a> - [Python] ImportError calling pyarrow from_pandas within ThreadPool</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11997">ARROW-11997</a> - [Python] concat_tables crashes python interpreter</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12003">ARROW-12003</a> - [R] Fix NOTE re undefined global function group_by_drop_default</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12006">ARROW-12006</a> - [Java] Fix checkstyle config to work on Windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12012">ARROW-12012</a> - [Java] [JDBC] BinaryConsumer cannot reallocate memory correctly</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12013">ARROW-12013</a> - [C++][FlightRPC] Failed to detect gRPC version</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12015">ARROW-12015</a> - [Rust] [DataFusion] Integrate doc-comment crate to ensure readme examples remain valid</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12028">ARROW-12028</a> - [Rust][DataFusion] Unsupported GROUP BY for Timestamp(Millisecond, None)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12029">ARROW-12029</a> - Remove args from FeatherReader$create v2</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12033">ARROW-12033</a> - [Docs] Fix link in developers/benchmarks.html</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12041">ARROW-12041</a> - [C++] Fix string description of tensor IPC messages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12051">ARROW-12051</a> - [GLib] Intermittent CI failure in test_add_column_type(TestCSVReader::#read::options)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12057">ARROW-12057</a> - [Python] Remove direct usage of pandas’ Block subclasses</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12065">ARROW-12065</a> - [C++][Python] Segfault reading JSON file</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12067">ARROW-12067</a> - [Python][Doc] Document pyarrow_(un)wrap_scalar</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12073">ARROW-12073</a> - [R] Fix R CMD check NOTE about ‘X_____X’</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12076">ARROW-12076</a> - [Rust] Fix build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12077">ARROW-12077</a> - [C++] Out-of-bounds write in ListArray::FromArrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12086">ARROW-12086</a> - [C++] Offline builds do not use ARROW_$LIBRARY_URL to search for packages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12088">ARROW-12088</a> - [Python][C++] Warning about offsetof in pyarrow.dataset.RecordBatchIterator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12089">ARROW-12089</a> - [Doc] Fix warnings when building Sphinx docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12100">ARROW-12100</a> - [C#] Cannot round-trip record batch with PyArrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12103">ARROW-12103</a> - [C++] “load of misaligned address” in Parquet reader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12112">ARROW-12112</a> - [CI] No space left on device - AMD64 Conda Integration test</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12113">ARROW-12113</a> - [R] Fix rlang deprecation warning from check_select_helpers()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12130">ARROW-12130</a> - [C++] Arm64 build failed if -DARROW_SIMD_LEVEL=NONE</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12138">ARROW-12138</a> - [Go][IPC]</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12140">ARROW-12140</a> - [C++][CI] Valgrind failure on Grouper tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12145">ARROW-12145</a> - [Developer][Archery] Flaky test: test_static_runner_from_json</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12149">ARROW-12149</a> - [Dev] Archery benchmark test case is failing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12154">ARROW-12154</a> - [C++][Gandiva] Fix gandiva crash in certain OS/CPU combinations</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12155">ARROW-12155</a> - [R] Require Table columns to be same length</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12161">ARROW-12161</a> - [C++][R] Async streaming CSV reader deadlocking when being run synchronously from datasets</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12169">ARROW-12169</a> - [C++] Fix compressed file reading with an empty stream at end of file</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12171">ARROW-12171</a> - [Rust] Clippy error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12172">ARROW-12172</a> - [Python][Packaging] Pass python version as setuptools pretend version in the macOS wheel builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12178">ARROW-12178</a> - [CI] Update setuptools in the ubuntu images</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12186">ARROW-12186</a> - [Rust][DataFusion] Fix regexp_match test</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12209">ARROW-12209</a> - [JS] Neither @apache-arrow/ts nor apache-arrow compiles</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12220">ARROW-12220</a> - [C++][CI] Thread sanitizer failure</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12226">ARROW-12226</a> - [C++] ASAN error in s3fs_test.cc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12227">ARROW-12227</a> - [R] Fix RE2 and median nightly build failures</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12235">ARROW-12235</a> - [Rust][DataFusion] LIMIT returns incorrect results when used with several small partitions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12241">ARROW-12241</a> - [Python] Parallel csv reader cancellation test kills pytest</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12250">ARROW-12250</a> - [Rust] Failing test arrow::arrow_writer::tests::fixed_size_binary_single_column</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12254">ARROW-12254</a> - [Rust][DataFusion] Limit keeps polling input after limit is reached</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12258">ARROW-12258</a> - [R] Never do as.data.frame() on collect(as_data_frame = FALSE)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12262">ARROW-12262</a> - [Doc][C++][Python] Docs built and pushed with S3 and Flight disabled</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12267">ARROW-12267</a> - [Rust] JSON writer does not support timestamp types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12273">ARROW-12273</a> - [JS] Coveralls does not work anymore</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12279">ARROW-12279</a> - [Rust][DataFusion] Add test for null handling in hash join (ARROW-12266)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12294">ARROW-12294</a> - [Rust] Fix Boolean Kleene Kernels with no Remainder</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12299">ARROW-12299</a> - [Python] pq.write_to_dataset does not recognize S3FileSystem</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12300">ARROW-12300</a> - [C++] ArrowCUDA erroneously links to CUDA Runtime while only using CUDA Driver API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12313">ARROW-12313</a> - [Rust] [Ballista] Benchmark documentation out of date</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12314">ARROW-12314</a> - [Python] pq.read_pandas with use_legacy_dataset=False does not accept columns as a set (kartothek integration failure)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12327">ARROW-12327</a> - [Dev] Use pull request’s head remote when submitting crossbow jobs via the comment bot</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12330">ARROW-12330</a> - [Developer] Restore values in counters column of Archery benchmark</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12334">ARROW-12334</a> - [Rust] [Ballista] Aggregate queries producing incorrect results</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12342">ARROW-12342</a> - [Packaging] Fix tabulation in crossbow templates for submitting nightly builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12357">ARROW-12357</a> - [Archery] Error running “crossbow submit …”</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12377">ARROW-12377</a> - [Doc][Java] Java doc build broken</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12379">ARROW-12379</a> - [C++][CI] Thread sanitizer failure in SerialExecutor</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12382">ARROW-12382</a> - [C++][CI] Conda nightly jobs fail due to not bundling xsimd</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12385">ARROW-12385</a> - [R] [CI] fix cran picking in CI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12390">ARROW-12390</a> - [Rust] Inline from_trusted_len_iter, try_from_trusted_len_iter, extend_from_slice</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12401">ARROW-12401</a> - [R] Fix guard around dataset___Scanner__TakeRows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12405">ARROW-12405</a> - [Packaging] Fix apt artifact patterns and artifact uploading from travis</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12408">ARROW-12408</a> - [R] Delete Scan() bindings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12421">ARROW-12421</a> - [Rust] [DataFusion] topk_query test fails in master</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12429">ARROW-12429</a> - [C++] MergedGeneratorTestFixture is incorrectly instantiated</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12433">ARROW-12433</a> - [Rust] Builds failing due to new flatbuffer release introducing const generics</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12437">ARROW-12437</a> - [Rust] [Ballista] Ballista plans must not include RepartitionExec</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12440">ARROW-12440</a> - [Release] Various packaging, release script and release verification script fixes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12466">ARROW-12466</a> - [Python] Comparing array to None raises error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12475">ARROW-12475</a> - [C++] Build warning from thread_pool_benchmark.cc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12487">ARROW-12487</a> - [C++][Dataset] ScanBatches() hangs if there’s an error during scanning</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12495">ARROW-12495</a> - [C++][Python] NumPy buffer sets is_mutable_ to true but does not set mutable_data_ when the NumPy array is writable</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1655">PARQUET-1655</a> - [C++] Decimal comparisons used for min/max statistics are not correct</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-2008">PARQUET-2008</a> - [C++] Wrong information written in RowGroup::total_byte_size</li>
</ul>
</main>
<hr/>
<footer class="footer">
<div class="row">
<div class="col-md-9">
<p>Apache Arrow, Arrow, Apache, the Apache feather logo, and the Apache Arrow project logo are either registered trademarks or trademarks of The Apache Software Foundation in the United States and other countries.</p>
<p>&copy; 2016-2024 The Apache Software Foundation</p>
</div>
<div class="col-md-3">
<a class="d-sm-none d-md-inline pr-2" href="https://www.apache.org/events/current-event.html">
<img src="https://www.apache.org/events/current-event-234x60.png"/>
</a>
</div>
</div>
</footer>
</div>
</body>
</html>