<!DOCTYPE html>
<html lang="en-US">
<head>
<meta charset="UTF-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1">
<!-- The above meta tags *must* come first in the head; any other head content must come *after* these tags -->
<title>Apache Arrow 2.0.0 Release | Apache Arrow</title>
<!-- Begin Jekyll SEO tag v2.8.0 -->
<meta name="generator" content="Jekyll v4.3.3" />
<meta property="og:title" content="Apache Arrow 2.0.0 Release" />
<meta property="og:locale" content="en_US" />
<meta name="description" content="Apache Arrow 2.0.0 (19 October 2020) This is a major release covering more than 3 months of development. Download Source Artifacts Binary Artifacts For CentOS For Debian For Python For Ubuntu Git tag Contributors This release includes 511 commits from 81 distinct contributors. $ git shortlog -sn apache-arrow-1.0.0..apache-arrow-2.0.0 68 Jorge C. Leitao 48 Antoine Pitrou 40 Krisztián Szűcs 34 alamb 33 Neal Richardson 30 Andy Grove 25 Benjamin Kietzman 25 Joris Van den Bossche 19 Sutou Kouhei 13 Uwe L. Korn 12 Micah Kornfield 10 Frank Du 10 Jörn Horstmann 9 Neville Dipale 9 Romain Francois 9 arw2019 8 Yibo Cai 8 liyafan82 7 Sagnik Chakraborty 6 David Li 5 Kazuaki Ishizaki 5 Mahmut Bulut 4 Mingyu Zhong 4 fredgan 3 Bryan Cutler 3 wqc200 2 Daniel Russo 2 Diana Clarke 2 James Duong 2 Kenta Murata 2 Patrick Woody 2 Projjal Chanda 2 naman1996 2 ptaylor 2 tianchen 1 Adam Szmigin 1 Ali McMaster 1 Andrew Stevenson 1 Ben Kimock 1 Brian Dunlay 1 Christoph Schulze 1 Derek Marsh 1 Dominik Moritz 1 Eric Erhardt 1 Ezra 1 Fernando José Herrera Elizalde 1 FredGan 1 Hongze Zhang 1 Jim Klucar 1 Josiah 1 Kyle Strand 1 Laurent Goujon 1 Lawrence Chan 1 Mark Rushakoff 1 Matt Corley 1 Matthew Topol 1 Matthias 1 Morgan Cassels 1 Ofek 1 Patrick Pai 1 Paul 1 PoojaChandak 1 Prashanth Govindarajan 1 Pratik raj 1 Revital Sur 1 Ruan Pearce-Authers 1 Ryan Murray 1 Simon Bertron 1 Steve Suh 1 Tanguy Fautre 1 Tobias Mayer 1 Troels Nielsen 1 Vivian Kong 1 Wes McKinney 1 Xavier Lange 1 Yordan Pavlov 1 kanga333 1 karldw 1 mubai 1 offthewall123 1 zanmato1984 Patch Committers The following Apache committers merged contributed patches to the repository. $ git shortlog -csn apache-arrow-1.0.0..apache-arrow-2.0.0 127 Andy Grove 92 Antoine Pitrou 56 Krisztián Szűcs 51 Neal Richardson 44 Sutou Kouhei 18 Joris Van den Bossche 18 Micah Kornfield 17 Benjamin Kietzman 17 Wes McKinney 16 Neville Dipale 12 Jorge C. 
Leitao 10 Praveen 7 Paddy Horan 4 David Li 4 Eric Erhardt 4 Sebastien Binet 4 Uwe L. Korn 4 liyafan82 3 GitHub 1 Bryan Cutler 1 Chao Sun 1 tianchen Changelog Apache Arrow 2.0.0 (2020-10-19) Bug Fixes ARROW-2367 - [Python] ListArray has trouble with sizes greater than kMaximumCapacity ARROW-4189 - [CI] [Rust] Fix broken cargo coverage ARROW-4917 - [C++] orc_ep fails in cpp-alpine docker ARROW-5578 - [C++][Flight] Flight does not build out of the box on Alpine Linux ARROW-7226 - [JSON][Python] Json loader fails on example in documentation. ARROW-7384 - [Website] Fix search indexing warning reported by Google ARROW-7517 - [C++] Builder does not honour dictionary type provided during initialization ARROW-7663 - [Python] from_pandas gives TypeError instead of ArrowTypeError in some cases ARROW-7903 - [Rust] [DataFusion] Upgrade SQLParser dependency for DataFusion ARROW-7957 - [Python] ParquetDataset cannot take HadoopFileSystem as filesystem ARROW-8265 - [Rust] [DataFusion] Table API collect() should not require context ARROW-8394 - [JS] Typescript compiler errors for arrow d.ts files, when using es2015-esm package ARROW-8735 - [Rust] [Parquet] Parquet crate fails to compile on Arm architecture ARROW-8749 - [C++] IpcFormatWriter writes dictionary batches with wrong ID ARROW-8773 - [Python] pyarrow schema.empty_table() does not preserve nullability of fields ARROW-9028 - [R] Should be able to convert an empty table ARROW-9096 - [Python] Pandas roundtrip with object-dtype column labels with integer values: data type “integer” not understood ARROW-9177 - [C++][Parquet] Tracking issue for cross-implementation LZ4 Parquet compression compatibility ARROW-9414 - [C++] apt package includes headers for S3 interface, but no support ARROW-9462 - [Go] The Indentation after the first Record arrjson writer is missing ARROW-9463 - [Go] The writer is double closed in TestReadWrite ARROW-9490 - [Python] pyarrow array creation for specific set of numpy scalars fails ARROW-9495 - [C++] 
Equality assertions don’t handle Inf / -Inf properly ARROW-9520 - [Rust] [DataFusion] Can’t alias an aggregate expression ARROW-9528 - [Python] Honor tzinfo information when converting from datetime to pyarrow ARROW-9532 - [Python] Building pyarrow for MacPorts on macOS ARROW-9535 - [Python] Remove symlink fixes from conda recipe ARROW-9536 - Missing parameters in PlasmaOutOfMemoryException.java ARROW-9541 - [C++] CMakeLists requires UTF8PROC_STATIC when building static library ARROW-9544 - [R] version argument of write_parquet not working ARROW-9546 - [Python] Clean up Pandas Metadata Conversion test ARROW-9548 - [Go] Test output files in tmp directory are not removed correctly ARROW-9549 - [Rust] Parquet no longer builds ARROW-9554 - [Java] FixedWidthInPlaceVectorSorter sometimes produces wrong result ARROW-9556 - [Python][C++] Segfaults in UnionArray with null values ARROW-9560 - [Packaging] conda recipes failing due to missing conda-forge.yml ARROW-9569 - [CI][R] Fix rtools35 builds for msys2 key change ARROW-9570 - [Doc] Clean up sphinx sidebar ARROW-9573 - [Python] Parquet doesn’t load when partitioned column starts with ‘_’ ARROW-9574 - [R] Cleanups for CRAN 1.0.0 release ARROW-9575 - [R] gcc-UBSAN failure on CRAN ARROW-9577 - [Python][C++] posix_madvise error on Debian in pyarrow 1.0.0 ARROW-9583 - [Rust] Offset is mishandled in arithmetic and boolean compute kernels ARROW-9588 - [C++] clang/win: Copy constructor of ParquetInvalidOrCorruptedFileException not correctly triggered ARROW-9589 - [C++/R] arrow_exports.h contains structs declared as class ARROW-9592 - [CI] Update homebrew before calling brew bundle ARROW-9596 - [CI][Crossbow] Fix homebrew-cpp again, again ARROW-9597 - [C++] AddAlias in compute::FunctionRegistry should be synchronized ARROW-9598 - [C++][Parquet] Spaced definition levels is not assigned correctly. 
ARROW-9599 - [CI] Appveyor toolchain build fails because CMake detects different C and C++ compilers ARROW-9600 - [Rust] When used as a crate dependency, arrow-flight is rebuilt on every invocation of cargo build ARROW-9600 - [Rust] When used as a crate dependency, arrow-flight is rebuilt on every invocation of cargo build ARROW-9602 - [R] Improve cmake detection in Linux build ARROW-9603 - [C++][Parquet] Write Arrow relies on unspecified behavior for nested types ARROW-9606 - [C++][Dataset] in expressions don’t work with &gt;1 partition levels ARROW-9609 - [C++] CSV datasets don’t materialize virtual columns ARROW-9621 - [Python] test_move_file() is failed with fsspec 0.8.0 ARROW-9622 - [Java] ComplexCopier fails if a structvector has a child UnionVector with nulls ARROW-9628 - [Rust] Clippy PR test failing intermittently on Rust / AMD64 MacOS ARROW-9629 - [Python] Kartothek integration tests failing due to missing freezegun module ARROW-9631 - [Rust] Arrow crate should not depend on flight ARROW-9631 - [Rust] Arrow crate should not depend on flight ARROW-9642 - [C++] Let MakeBuilder refer DictionaryType’s index_type for deciding the starting bit width of the indices ARROW-9643 - [C++] Illegal instruction on haswell cpu ARROW-9644 - [C++][Dataset] Do not check for ignore_prefixes in the base path ARROW-9652 - [Rust][DataFusion] Panic trying to select * from a CSV (panicked at ‘index out of bounds: the len is 0 but the index is 0) ARROW-9653 - [Rust][DataFusion] Multi-column Group by: Invalid Argument Error ARROW-9659 - [C++] RecordBatchStreamReader throws on CUDA device buffers ARROW-9660 - [C++] IPC - dictionaries in maps ARROW-9666 - [Python][wheel][Windows] library missing failure by ARROW-9412 ARROW-9670 - [C++][FlightRPC] Close()ing a DoPut with an ongoing read locks up the client ARROW-9684 - [C++] Fix undefined behaviour on invalid IPC / Parquet input (OSS-Fuzz) ARROW-9692 - [Python] distutils import warning ARROW-9693 - [CI][Docs] Nightly docs build fails 
ARROW-9696 - [Rust] [Datafusion] nested binary expressions broken ARROW-9698 - [C++] Revert “Add -NDEBUG flag to arrow.pc” ARROW-9700 - [Python] create_library_symlinks doesn’t work in macos ARROW-9712 - [Rust] [DataFusion] ParquetScanExec panics on error ARROW-9714 - [Rust] [DataFusion] TypeCoercionRule not implemented for Limit or Sort ARROW-9716 - [Rust] [DataFusion] MergeExec should have concurrency limit ARROW-9726 - [Rust] [DataFusion] ParquetScanExec launches threads too early ARROW-9727 - [C++] Fix crash on invalid IPC input (OSS-Fuzz) ARROW-9729 - [Java] Error Prone causes other annotation processors to not work with Eclipse ARROW-9733 - [Rust][DataFusion] Aggregates COUNT/MIN/MAX don’t work on VARCHAR columns ARROW-9734 - [Rust] [DataFusion] TableProvider.scan executing partitions prematurely ARROW-9741 - [Rust] [DataFusion] Incorrect count in TPC-H query 1 result set ARROW-9743 - [R] Sanitize paths in open_dataset ARROW-9744 - [Python] Failed to install on aarch64 ARROW-9764 - [CI][Java] Push wrong Docker image ARROW-9768 - [Python] Pyarrow allows for unsafe conversions of datetime objects to timestamp nanoseconds ARROW-9768 - [Python] Pyarrow allows for unsafe conversions of datetime objects to timestamp nanoseconds ARROW-9778 - [Rust] [DataFusion] Logical and physical schemas’ nullability does not match in 8 out of 20 end-to-end tests ARROW-9783 - [Rust] [DataFusion] Logical aggregate expressions require explicit data type ARROW-9785 - [Python] pyarrow/tests/test_fs.py::test_s3_options too slow ARROW-9789 - [C++] Don’t install jemalloc in parallel ARROW-9790 - [Rust] [Parquet] ParquetFileArrowReader fails to decode all pages if batches fall exactly on row group boundaries ARROW-9790 - [Rust] [Parquet] ParquetFileArrowReader fails to decode all pages if batches fall exactly on row group boundaries ARROW-9793 - [Rust] [DataFusion] Tests failing in master ARROW-9797 - [Rust] AMD64 Conda Integration Tests is failing for the Master branch ARROW-9799 - 
[Rust] [DataFusion] Implementation of physical binary expression get_type method is incorrect ARROW-9800 - [Rust] [Parquet] “min” and “max” written to standard out when writing columns ARROW-9809 - [Rust] [DataFusion] logical schema = physical schema is not true ARROW-9814 - [Python] Crash in test_parquet.py::test_read_partitioned_directory_s3fs ARROW-9815 - [Rust] [DataFusion] Deadlock in creation of physical plan with two udfs ARROW-9815 - [Rust] [DataFusion] Deadlock in creation of physical plan with two udfs ARROW-9815 - [Rust] [DataFusion] Deadlock in creation of physical plan with two udfs ARROW-9816 - [C++] Escape quotes in config.h ARROW-9827 - [Python] pandas.read_parquet fails for wide parquet files and pyarrow 1.0.X ARROW-9831 - [Rust] [DataFusion] Fix compilation error ARROW-9840 - [Python] Python fs documentation out of date with code ARROW-9846 - [Rust] Master branch broken build ARROW-9851 - [C++] Valgrind errors due to unrecognized instructions ARROW-9852 - [C++] Fix crash on invalid IPC input (OSS-Fuzz) ARROW-9852 - [C++] Fix crash on invalid IPC input (OSS-Fuzz) ARROW-9855 - [R] Fix bad merge/Rcpp conflict ARROW-9859 - [C++] S3 FileSystemFromUri with special char in secret key fails ARROW-9864 - [Python] pathlib.Path not supported in write_to_dataset with partition columns ARROW-9874 - [C++] NewStreamWriter / NewFileWriter don’t own output stream ARROW-9876 - [CI][C++] Travis ARM jobs timeout ARROW-9877 - [C++][CI] homebrew-cpp fails due to avx512 ARROW-9879 - [Python] ChunkedArray.__getitem__ doesn’t work with numpy scalars ARROW-9882 - [C++/Python] Update conda-forge-pinning to 3 for OSX conda packages ARROW-9883 - [R] Fix linuxlibs.R install script for R &lt; 3.6 ARROW-9888 - [Rust] [DataFusion] ExecutionContext can not be shared between threads ARROW-9889 - [Rust][DataFusion] Datafusion CLI: CREATE EXTERNAL TABLE errors with “Unsupported logical plan variant” ARROW-9897 - [C++][Gandiva] Add to_date() function from pattern ARROW-9898 - 
[C++][Gandiva] Error handling in castINT fails in some enviroments ARROW-9906 - [Python] Crash in test_parquet.py::test_parquet_writer_filesystem_s3_uri (closing NativeFile from S3FileSystem) ARROW-9913 - [C++] Outputs of Decimal128::FromString depend on presence of one another ARROW-9920 - [Python] pyarrow.concat_arrays segfaults when passing it a chunked array ARROW-9922 - [Rust] Add `try_from(Vec&lt;Option&lt;(&amp;str, ArrayRef)&gt;&gt;)` to StructArray ARROW-9924 - [Python] Performance regression reading individual Parquet files using Dataset interface ARROW-9931 - [C++] Fix undefined behaviour on invalid IPC (OSS-Fuzz) ARROW-9932 - [R] Arrow 1.0.1 R package fails to install on R3.4 over linux ARROW-9936 - [Python] Fix / test relative file paths in pyarrow.parquet ARROW-9937 - [Rust] [DataFusion] Average is not correct ARROW-9943 - [C++] Arrow metadata not applied recursively when reading Parquet file ARROW-9946 - [R] ParquetFileWriter segfaults when `sink` is a string ARROW-9953 - [R] Declare minimum version for bit64 ARROW-9962 - [Python] Conversion to pandas with index column using fixed timezone fails ARROW-9968 - [C++] UBSAN link failure with __int8_t ARROW-9969 - [C++] RecordBatchBuilder yields invalid result with dictionary fields ARROW-9970 - [Go] checkptr failures in sum methods ARROW-9972 - [CI] Work around grpc-re2 clash on Homebrew ARROW-9973 - [Java] JDBC DateConsumer does not allow dates before epoch ARROW-9976 - [Python] ArrowCapacityError when doing Table.from_pandas with large dataframe ARROW-9990 - [Rust] [DataFusion] NOT is not plannable ARROW-9993 - [Python] Tzinfo - string roundtrip fails on pytz.StaticTzInfo objects ARROW-9994 - [C++][Python] Auto chunking nested array containing binary-like fields result malformed output ARROW-9996 - [C++] Dictionary is unset when calling DictionaryArray.GetScalar for null values ARROW-10003 - [C++] Create directories in CopyFiles when copying within the same filesystem ARROW-10008 - [Python] 
pyarrow.parquet.read_table fails with predicate pushdown on categorical data with use_legacy_dataset=False ARROW-10011 - [C++] Make FindRE2.cmake re-entrant ARROW-10012 - [C++] Sporadic failures in CopyFiles test ARROW-10013 - [C++][CI] Flight test failure in TestFlightClient.GenericOptions ARROW-10017 - [Java] LargeMemoryUtil.checkedCastToInt has buggy logic ARROW-10022 - [C++] [Compute] core dumped on some scalar-arithmetic-benchmark ARROW-10027 - [Python] Incorrect null column returned when using a dataset filter expression. ARROW-10034 - [Rust] Master build broken ARROW-10041 - [Rust] Possible to create LargeStringArray with DataType::Utf8 ARROW-10047 - [CI] Conda integration tests failing with cmake error ARROW-10048 - [Rust] Error in aggregate of min/max for strings ARROW-10049 - [C++/Python] Sync conda recipe with conda-forge ARROW-10060 - [Rust] [DataFusion] MergeExec currently discards partitions with errors ARROW-10062 - [Rust]: Fix for null elems for DoubleEndedIter for DictArray ARROW-10073 - [Python] Test test_parquet_nested_storage relies on dict item ordering ARROW-10081 - [C++/Python] Fix bash syntax in drone.io conda builds ARROW-10085 - [C++] S3 tests fail on AppVeyor ARROW-10087 - [CI] Fix nightly docs job ARROW-10098 - [R][Doc] Fix copy_files doc mismatch ARROW-10104 - [Python] Separate tests into its own conda package ARROW-10114 - [R] Segfault in to_dataframe_parallel with deeply nested structs ARROW-10116 - [Python][Packaging] Fix gRPC linking error in macOS wheels builds ARROW-10119 - [C++] Fix Parquet crashes on invalid input (OSS-Fuzz) ARROW-10121 - [C++][Python] Variable dictionaries do not survive roundtrip to IPC stream ARROW-10124 - [R] Write functions don’t follow umask setting ARROW-10125 - [R] Int64 downcast check doesn’t consider all chunks ARROW-10130 - [C++][Dataset] ParquetFileFragment::SplitByRowGroup does not preserve “complete_metadata” status ARROW-10136 - [Rust][Arrow] Nulls are transformed into “” after filtering for 
StringArray ARROW-10137 - [R] Fix cpp helper that breaks if libarrow is not present ARROW-10147 - [Python] Constructing pandas metadata fails if an Index name is not JSON-serializable by default ARROW-10150 - [C++] Fix crashes on invalid Parquet file (OSS-Fuzz) ARROW-10169 - [Rust] Nulls should be rendered as “” rather than default value when pretty printing arrays ARROW-10174 - [Java] Reading of Dictionary encoded struct vector fails ARROW-10175 - [CI] Nightly hdfs integration test job fails ARROW-10176 - [CI] Nightly valgrind job fails ARROW-10178 - [CI] Fix spark master integration test build setup ARROW-10179 - [Rust] Labeler is not labeling ARROW-10181 - [Rust] Arrow tests fail to compile on Raspberry Pi (32 bit) ARROW-10188 - [Rust] [DataFusion] Some examples are broken ARROW-10189 - [Doc] C data interface example for i32 uses `l`, not `i`, in the format ARROW-10192 - [C++][Python] Segfault when converting nested struct array with dictionary field to pandas series ARROW-10193 - [Python] Segfault when converting to fixed size binary array ARROW-10200 - [Java][CI] Fix failure of Java CI on s390x ARROW-10204 - [RUST] [Datafusion] Test failure in aggregate_grouped_empty with simd feature enabled ARROW-10214 - [Python] UnicodeDecodeError when printing schema with binary metadata ARROW-10226 - [Rust] [Parquet] Parquet reader reading wrong columns in some batches within a parquet file ARROW-10230 - [JS][Doc] JavaScript documentation fails to build ARROW-10232 - FixedSizeListArray is incorrectly written/read to/from parquet ARROW-10234 - [C++][Gandiva] Fix logic of round() for floats/decimals in Gandiva ARROW-10237 - [C++] Duplicate values in a dictionary result in corrupted parquet ARROW-10238 - [C#] List&lt;Struct&gt; is broken ARROW-10239 - [C++] aws-sdk-cpp apparently requires zlib too ARROW-10244 - [Python][Docs] Add docs on using pyarrow.dataset.parquet_dataset ARROW-10248 - [C++][Dataset] Dataset writing does not write schema metadata ARROW-10262 - [C++] Some 
TypeClass in Scalar classes seem incorrect ARROW-10270 - [R] Fix CSV timestamp_parsers test on R-devel ARROW-10271 - [Rust] packed_simd is broken and continued under a new project ARROW-10279 - [Release][Python] Fix verification script to align with the new macos wheel platform tags ARROW-10280 - [Packaging][Python] Fix macOS wheel artifact patterns ARROW-10281 - [Python] Fix warnings when running tests ARROW-10284 - [Python] Pyarrow is raising deprecation warning about filesystems on import ARROW-10285 - [Python] pyarrow.orc submodule is using deprecated functionality ARROW-10286 - [C++][Flight] Misleading CMake errors ARROW-10288 - [C++] Compilation fails on i386 ARROW-10290 - [C++] List POP_BACK is not available in older CMake versions ARROW-10293 - [Rust] [DataFusion] Fix benchmarks ARROW-10296 - [R] Data saved as integer64 loaded as integer New Features and Improvements ARROW-983 - [C++] Implement InputStream and OutputStream classes for interacting with socket connections ARROW-1105 - [C++] SQLite record batch reader ARROW-1509 - [Python] Write serialized object as a stream of encapsulated IPC messages ARROW-1669 - [C++] Consider adding Abseil (Google C++11 standard library extensions) to toolchain ARROW-1797 - [C++] Implement binary arithmetic kernels for numeric arrays ARROW-2164 - [C++] Clean up unnecessary decimal module refs ARROW-3080 - [Python] Unify Arrow to Python object conversion paths ARROW-3757 - [R] R bindings for Flight RPC client ARROW-3872 - [R] Add ad hoc test of feather compatibility ARROW-4046 - [Python/CI] Exercise large memory tests ARROW-4248 - [C++][Plasma] Build on Windows / Visual Studio ARROW-4685 - [C++] Update Boost to 1.69 in manylinux1 docker image ARROW-4927 - [Rust] Update top level README to describe current functionality ARROW-4957 - [Rust] [DataFusion] Implement get_supertype correctly ARROW-4965 - [Python] Timestamp array type detection should use tzname of datetime.datetime objects ARROW-5034 - [C#] ArrowStreamWriter 
should expose synchronous Write methods ARROW-5123 - [Rust] derive RecordWriter from struct definitions ARROW-6075 - [FlightRPC] Handle uncaught exceptions in middleware ARROW-6281 - [Python] Produce chunked arrays for nested types in pyarrow.array ARROW-6282 - [Format] Support lossy compression ARROW-6437 - [R] Add AWS SDK to system dependencies for macOS and Windows ARROW-6535 - [C++] Status::WithMessage should accept variadic parameters ARROW-6537 - [R] Pass column_types to CSV reader ARROW-6972 - [C#] Should support StructField arrays ARROW-6982 - [R] Add bindings for compare and boolean kernels ARROW-7136 - [Rust][CI] Pre-install the rust dependencies in the dockerfile ARROW-7218 - [Python] Conversion from boolean numpy scalars not working ARROW-7302 - [C++] CSV: allow converting a column to a specific dictionary type ARROW-7372 - [C++] Allow creating dictionary array from simple JSON ARROW-7871 - [Python] Expose more compute kernels ARROW-7960 - [C++][Parquet] Add support for schema translation from parquet nodes back to arrow for missing types ARROW-8001 - [R][Dataset] Bindings for dataset writing ARROW-8002 - [C++][Dataset] Dataset writing should let you (re)partition the data ARROW-8048 - [Python] Run memory leak tests nightly as follow up to ARROW-4120 ARROW-8172 - [C++] ArrayFromJSON for dictionary arrays ARROW-8205 - [Rust] [DataFusion] DataFusion should enforce unique field names in a schema ARROW-8253 - [Rust] [DataFusion] Improve ergonomics of registering UDFs ARROW-8262 - [Rust] [DataFusion] Add example that uses LogicalPlanBuilder ARROW-8289 - [Rust] [Parquet] Implement minimal Arrow Parquet writer as starting point for full writer ARROW-8296 - [C++][Dataset] IpcFileFormat should support writing files with compressed buffers ARROW-8355 - [Python] Reduce the number of pandas dependent test cases in test_feather ARROW-8359 - [C++/Python] Enable aarch64/ppc64le build in conda recipes ARROW-8383 - [Rust] Easier random access to DictionaryArray keys and 
values ARROW-8402 - [Java] Support ValidateFull methods in Java ARROW-8423 - [Rust] [Parquet] Serialize arrow schema into metadata when writing parquet ARROW-8426 - [Rust] [Parquet] Add support for writing dictionary types ARROW-8493 - [C++] Create unified schema resolution code for Array reconstruction. ARROW-8494 - [C++] Implement basic array-by-array reassembly logic ARROW-8581 - [C#] Date32/64Array.Builder should accept DateTime, not DateTimeOffset ARROW-8601 - [Go][Flight] Implement Flight Writer interface ARROW-8601 - [Go][Flight] Implement Flight Writer interface ARROW-8618 - [C++] ASSIGN_OR_RAISE should move its argument ARROW-8678 - [C++][Parquet] Remove legacy arrow to level translation. ARROW-8712 - [R] Expose strptime timestamp parsing in read_csv conversion options ARROW-8774 - [Rust] [DataFusion] Improve threading model ARROW-8810 - [R] Add documentation about Parquet format, appending to stream format ARROW-8824 - [Rust] [DataFusion] Implement new SQL parser ARROW-8828 - [Rust] Implement SQL tokenizer ARROW-8829 - [Rust] Implement SQL parser ARROW-9010 - [Java] Framework and interface changes for RecordBatch IPC buffer compression ARROW-9065 - [C++] Support parsing date32 in dataset partition folders ARROW-9068 - [C++][Dataset] Simplify Partitioning interface ARROW-9078 - [C++] Parquet writing of extension type with nested storage type fails ARROW-9104 - [C++] Parquet encryption tests should write files to a temporary directory instead of the testing submodule’s directory ARROW-9107 - [C++][Dataset] Time-based types support ARROW-9147 - [C++][Dataset] Support null -&gt; other type promotion in Dataset scanning ARROW-9205 - [Documentation] Fix typos in Columnar.rst ARROW-9266 - [Python][Packaging] Enable S3 support in macOS wheels ARROW-9271 - [R] Preserve data frame metadata in round trip ARROW-9286 - [C++] Add function “aliases” to compute::FunctionRegistry ARROW-9328 - [C++][Gandiva] Add LTRIM, RTRIM, BTRIM functions for string ARROW-9338 - [Rust] 
Add instructions for running clippy locally ARROW-9344 - [C++][Flight] measure latency quantile in flight benchmark ARROW-9358 - [Integration] Reconsider generated_large_batch.json ARROW-9371 - [Java] Run vector tests for both allocators ARROW-9377 - [Java] Support unsigned dictionary indices ARROW-9387 - [R] Use new C++ table select method ARROW-9388 - [C++] Division kernels ARROW-9394 - [Python] Support pickling of Scalars ARROW-9398 - [C++] Register the SIMD sum variants under function instance instead a SIMD function ARROW-9402 - [C++] Add portable wrappers for __builtin_add_overflow and friends ARROW-9405 - [R] Switch to cpp11 ARROW-9412 - [C++] Add non-BUNDLED dependencies to exported INSTALL_INTERFACE_LIBS of arrow_static and test that it works ARROW-9429 - [Python] ChunkedArray.to_numpy ARROW-9454 - [GLib] Add binding of some dictionary builders ARROW-9465 - [Python] Improve ergonomics of compute functions ARROW-9469 - [Python] Make more objects weakrefable ARROW-9487 - [Developer] Cover the archery release utilities with unittests ARROW-9488 - [Release] Use the new changelog generation when updating the website ARROW-9507 - [Rust] [DataFusion] PhysicalExpr should implement Display trait ARROW-9508 - [Release][APT][Yum] Enable verification for arm64 binaries ARROW-9516 - [Rust][DataFusion] Refactor physical expressions to not care about their names nor indexes ARROW-9517 - [C++][Python] Allow session_token argument when initializing S3FileSystem ARROW-9518 - [Python] Deprecate pyarrow serialization ARROW-9521 - [Rust] CsvReadOptions should allow file extension to be specified ARROW-9523 - [Rust] improve performance of filter kernel ARROW-9534 - [Rust] [DataFusion] Implement functions for creating literal expressions for all types ARROW-9550 - [Rust] [DataFusion] Remove Rc&lt;RefCell&lt;_&gt;&gt; from hash aggregate operator ARROW-9553 - [Rust] Release script doesn’t bump parquet crate’s arrow dependency version ARROW-9557 - [R] Iterating over parquet 
columns is slow in R ARROW-9559 - [Rust] [DataFusion] Revert privatization of exprlist_to_fields ARROW-9563 - [Dev][Release] Use archery’s changelog generator when creating release notes for the website ARROW-9568 - [CI] Use official msys action on GHA ARROW-9576 - [Python][Doc] Fix error in code example for extension types ARROW-9580 - [JS] Docs have superfluous () ARROW-9581 - [Dev][Release] Bump next snapshot versions to 2.0.0 ARROW-9582 - [Rust] Implement Array::memory_size() ARROW-9585 - [Rust] Remove duplicated to-do line in DataFusion readme ARROW-9587 - [FlightRPC][Java] Clean up DoPut/FlightStream memory handling ARROW-9593 - [Python] Add custom pickle reducers for DictionaryScalar ARROW-9604 - [C++] Add benchmark for aggregate min/max compute kernels ARROW-9605 - [C++] Optimize performance for aggregate min/max compute kernels ARROW-9607 - [C++][Gandiva] Add bitwise_and(), bitwise_or() and bitwise_not() functions for integers ARROW-9608 - [Rust] Remove arrow flight from parquet’s feature gating ARROW-9615 - [Rust] Add kernel to compute length of string array ARROW-9617 - [Rust] [DataFusion] Add length of string array ARROW-9618 - [Rust] [DataFusion] Make it easier to write optimizers ARROW-9619 - [Rust] [DataFusion] Add predicate push-down ARROW-9632 - [Rust] Add a “new” method for ExecutionContextSchemaProvider ARROW-9638 - [C++][Compute] Implement mode(most frequent number) kernel ARROW-9639 - [Ruby] Add dependency version check ARROW-9640 - [C++][Gandiva] Implement round() for integers and long integers ARROW-9641 - [C++][Gandiva] Implement round() for floating point and double floating point numbers ARROW-9645 - [Python] Deprecate the legacy pyarrow.filesystem interface ARROW-9646 - [C++][Dataset] Add support for writing parquet datasets ARROW-9650 - [Packaging][APT] Drop support for Ubuntu 19.10 ARROW-9654 - [Rust][DataFusion] Add an EXPLAIN command to the datafusion CLI ARROW-9656 - [Rust][DataFusion] Slightly confusing error message when 
unsupported type is provided to CREATE EXTERNAL TABLE ARROW-9658 - [Python][Dataset] Bindings for dataset writing ARROW-9665 - [R] head/tail/take for Datasets ARROW-9667 - [CI][Crossbow] Segfault in 2 nightly R builds ARROW-9671 - [C++] BasicDecimal128 constructor interprets uint64_t integers with highest bit set as negative ARROW-9673 - [Rust] Add a param “dialect” for DFParser::parse_sql ARROW-9678 - [Rust] [DataFusion] Improve projection push down to remove unused columns ARROW-9679 - [Rust] [DataFusion] HashAggregate walks map many times building final batch ARROW-9681 - [Java] Failed Arrow Memory - Core on big-endian platform ARROW-9683 - [Rust][DataFusion] Implement Debug for ExecutionPlan trait ARROW-9691 - [Rust] [DataFusion] Make sql_statement_to_plan public ARROW-9695 - [Rust][DataFusion] Improve documentation on LogicalPlan variants ARROW-9699 - [C++][Compute] Improve mode kernel performance for small integer types ARROW-9701 - [Java][CI] Add a test job on s390x ARROW-9702 - [C++] Move bpacking simd to runtime path ARROW-9703 - [Developer][Archery] Restartable cherry-picking process for creating maintenance branches ARROW-9706 - [Java] Tests in TestLargeListVector fails on big endian platform ARROW-9710 - [C++] Generalize Decimal ToString in preparation for Decimal256 ARROW-9711 - [Rust] Add benchmark based on TPC-H ARROW-9713 - [Rust][DataFusion] Remove explicit panics ARROW-9715 - [R] changelog/doc updates for 1.0.1 ARROW-9718 - [Python] Make pyarrow.parquet work with the new filesystem interfaces ARROW-9721 - [Packaging][Python] Update wheel dependency files ARROW-9722 - [Rust]: Shorten key lifetime for reverse lookup for dictionary arrays ARROW-9723 - [C++] Expected behaviour of “mode” kernel with NaNs ? 
ARROW-9725 - [Rust] [DataFusion] LimitExec and SortExec should use MergeExec ARROW-9737 - [C++][Gandiva] Add bitwise_xor() for integers ARROW-9739 - [CI][Ruby] Don’t install gem documents ARROW-9742 - [Rust] Create one standard DataFrame API ARROW-9751 - [Rust] [DataFusion] Extend UDFs to accept more than one type per argument ARROW-9752 - [Rust] [DataFusion] Add support for Aggregate UDFs ARROW-9753 - [Rust] [DataFusion] Remove the use of Mutex in ExecutionPlan trait ARROW-9754 - [Rust] [DataFusion] Implement async in DataFusion traits ARROW-9757 - [Rust] [DataFusion] Use “pub use” to expose a clean public API ARROW-9758 - [Rust] [DataFusion] Implement extension API for DataFusion ARROW-9759 - [Rust] [DataFusion] Implement DataFrame::sort ARROW-9760 - [Rust] [DataFusion] Implement DataFrame::explain ARROW-9761 - [C++] Add experimental pull-based iterator structures to C interface implementation ARROW-9762 - [Rust] [DataFusion] ExecutionContext::sql should return DataFrame ARROW-9769 - [Python] Remove skip for in-memory fsspec in test_move_file ARROW-9775 - [C++] Automatic S3 region selection ARROW-9781 - [C++] Fix uninitialized value warnings ARROW-9782 - [C++][Dataset] Ability to write “.feather” files with IpcFileFormat ARROW-9784 - [Rust] [DataFusion] Improve instructions for running tpch benchmark ARROW-9786 - [R] Unvendor cpp11 before release ARROW-9788 - Handle naming inconsistencies between SQL, DataFrame API and struct names ARROW-9792 - [Rust] [DataFusion] Logical aggregate functions should not return Result ARROW-9794 - [C++] Add functionality to cpu_info to discriminate between Intel vs AMD x86 ARROW-9795 - [C++][Gandiva] Implement castTIMESTAMP(int64) in Gandiva ARROW-9806 - [R] More compute kernel bindings ARROW-9807 - [R] News update/version bump post-1.0.1 ARROW-9808 - [Python] parquet.read_table docstring wrong use_legacy_dataset explanation ARROW-9811 - [C++] Unchecked floating point division by 0 should succeed ARROW-9813 - [C++] Disable semantic 
interposition ARROW-9819 - [C++] Bump mimalloc to 1.6.4 ARROW-9821 - [Rust][DataFusion] User Defined PlanNode / Operator API ARROW-9823 - [CI][C++][MinGW] Enable S3 ARROW-9832 - [Rust] [DataFusion] Refactor PhysicalPlan to remove Partition ARROW-9833 - [Rust] [DataFusion] Refactor TableProvider.scan to return ExecutionPlan ARROW-9834 - [Rust] [DataFusion] Remove Partition trait ARROW-9835 - [Rust] [DataFusion] Remove FunctionMeta ARROW-9836 - [Rust] [DataFusion] Improve API for usage of UDFs ARROW-9837 - [Rust] Add provider for variable ARROW-9838 - [Rust] [DataFusion] DefaultPhysicalPlanner should insert explicit MergeExec nodes ARROW-9839 - [Rust] [DataFusion] Add ability to downcast ExecutionPlan to specific operator ARROW-9841 - [Rust] Update checked-in flatbuffer files ARROW-9844 - [Go][CI] Add Travis CI job for Go on s390x ARROW-9845 - [Rust] [Parquet] serde_json is only used in tests but isn’t in dev-dependencies ARROW-9848 - [Rust] Implement changes to ensure flatbuffer alignment ARROW-9849 - [Rust] [DataFusion] Make UDFs not need a Field ARROW-9850 - [Go] Defer should not be used in the loop ARROW-9853 - [RUST] Implement “take” kernel for dictionary arrays ARROW-9854 - [R] Support reading/writing data to/from S3 ARROW-9858 - [C++][Python][Docs] Expand user guide for FileSystem ARROW-9863 - [C++] [PARQUET] Optimize meta data recovery of ApplicationVersion ARROW-9867 - [C++][Dataset] FileSystemDataset should expose its filesystem ARROW-9868 - [C++] Provide utility for copying files between filesystems ARROW-9869 - [R] Implement full S3FileSystem/S3Options constructor ARROW-9870 - [R] Friendly interface for filesystems (S3) ARROW-9871 - [C++] Add uppercase support to ARROW_USER_SIMD_LEVEL. 
ARROW-9873 - [C++][Compute] Improve mode kernel for integers within limited value range ARROW-9875 - [Python] Let FileSystem.get_file_info accept a single path ARROW-9884 - [R] Bindings for writing datasets to Parquet ARROW-9885 - [Rust] [DataFusion] Simplify code of type coercion for binary types ARROW-9886 - [Rust] [DataFusion] Simplify code to test cast ARROW-9887 - [Rust] [DataFusion] Add support for complex return types of built-in functions ARROW-9890 - [R] Add zstandard compression codec in macOS build ARROW-9891 - [Rust] [DataFusion] Make math functions support f32 ARROW-9892 - [Rust] [DataFusion] Add support for concat ARROW-9893 - [Python] Bindings for writing datasets to Parquet ARROW-9895 - [RUST] Improve sort kernels ARROW-9899 - [Rust] [DataFusion] Switch from Box&lt;Schema&gt; –&gt; SchemaRef (Arc&lt;Schema&gt;) to be consistent with the rest of Arrow ARROW-9900 - [Rust][DataFusion] Use Arc&lt;&gt; instead of Box&lt;&gt; in LogicalPlan ARROW-9901 - [C++] Add hand-crafted Parquet to Arrow reconstruction test for nested reading ARROW-9902 - [Rust] [DataFusion] Add support for array() ARROW-9904 - [C++] Unroll the loop manually for CountSetBits ARROW-9908 - [Rust] Support temporal data types in JSON reader ARROW-9910 - [Rust] [DataFusion] Type coercion of Variadic is wrong ARROW-9914 - [Rust][DataFusion] Document the SQL -&gt; Arrow type mapping ARROW-9916 - [RUST] Avoid cloning ArrayData in several places ARROW-9917 - [Python][Compute] Add bindings for mode kernel ARROW-9919 - [Rust] [DataFusion] Math functions ARROW-9921 - [Rust] Add `from(Vec&lt;Option&lt;&amp;str&gt;&gt;)` to [Large]StringArray ARROW-9925 - [GLib] Add low level value readers for GArrowListArray family ARROW-9926 - [GLib] Use placement new for GArrowRecordBatchFileReader ARROW-9928 - [C++] Speed up integer parsing slightly ARROW-9929 - [Developer] Autotune cmake-format ARROW-9933 - [Developer] Add drone as a CI provider for crossbow ARROW-9934 - [Rust] Shape and stride check in 
tensor ARROW-9941 - [Python] Better string representation for extension types ARROW-9944 - [Rust] Implement TO_TIMESTAMP function ARROW-9949 - [C++] Generalize Decimal128::FromString for reuse in Decimal256 ARROW-9950 - [Rust] [DataFusion] Allow UDF usage without registry ARROW-9952 - [Python] Use pyarrow.dataset writing for pq.write_to_dataset ARROW-9954 - [Rust] [DataFusion] Simplify code of aggregate planning ARROW-9956 - [C++][Gandiva] Implement Binary string function in Gandiva ARROW-9957 - [Rust] Remove unmaintained tempdir dependency ARROW-9961 - [Rust][DataFusion] to_timestamp function parses timestamp without timezone offset as UTC rather than local ARROW-9964 - [C++] CSV date support ARROW-9965 - [Java] Buffer capacity calculations are slow for fixed-width vectors ARROW-9966 - [Rust] Speedup aggregate kernels ARROW-9967 - [Python] Add compute module docs ARROW-9971 - [Rust] Speedup take ARROW-9977 - [Rust] Add min/max for [Large]String ARROW-9979 - [Rust] Fix arrow crate clippy lints ARROW-9980 - [Rust] Fix parquet crate clippy lints ARROW-9981 - [Rust] Allow configuring flight IPC with IpcWriteOptions ARROW-9983 - [C++][Dataset][Python] Use larger default batch size than 32K for Datasets API ARROW-9984 - [Rust] [DataFusion] DRY of function to string ARROW-9986 - [Rust][DataFusion] TO_TIMESTAMP function erroneously requires fractional seconds when no timezone is present ARROW-9987 - [Rust] [DataFusion] Improve docs of `Expr`. 
ARROW-9988 - [Rust] [DataFusion] Added std::ops to logical expressions ARROW-9992 - [C++][Python] Refactor python to arrow conversions based on a reusable conversion API ARROW-9998 - [Python] Support pickling DictionaryScalar ARROW-9999 - [Python] Support constructing dictionary array directly through pa.array() ARROW-10000 - [C++][Python] Support constructing StructArray from list of key-value pairs ARROW-10001 - [Rust] [DataFusion] Add developer guide to README ARROW-10010 - [Rust] Speedup arithmetic ARROW-10015 - [Rust] Implement SIMD for aggregate kernel sum ARROW-10016 - [Rust] [DataFusion] Implement IsNull and IsNotNull ARROW-10018 - [CI] Disable Sphinx and API documentation build since it takes 6 hours on master ARROW-10019 - [Rust] Add substring kernel ARROW-10023 - [Gandiva][C++] Implementing Split part function in gandiva ARROW-10024 - [C++][Parquet] Create nested reading benchmarks ARROW-10028 - [Rust] Simplify macro def_numeric_from_vec ARROW-10030 - [Rust] Support fromIter and toIter ARROW-10035 - [C++] Bump versions of vendored code ARROW-10037 - [C++] Workaround to force find AWS SDK to look for shared libraries ARROW-10040 - [Rust] Create a way to slice unaligned offset buffers ARROW-10043 - [Rust] [DataFusion] Introduce support for DISTINCT by partially implementing COUNT(DISTINCT) ARROW-10044 - [Rust] Improve README ARROW-10046 - [Rust] [DataFusion] Made `*Iterator` implement Iterator ARROW-10050 - [C++][Gandiva] Implement concat() in Gandiva for up to 10 arguments ARROW-10051 - [C++][Compute] Make aggregate kernel merge state mutable ARROW-10054 - [Python] Slice methods should return empty arrays instead of crashing ARROW-10055 - [Rust] Implement DoubleEndedIterator for NullableIter ARROW-10057 - [C++] Add Parquet-Arrow roundtrip tests for nested data ARROW-10058 - [C++] Investigate performance of LevelsToBitmap without BMI2 ARROW-10059 - [R][Doc] Give more advice on how to set up C++ build ARROW-10063 - [Archery][CI] Fetch main branch in 
archery build only when it is a pull request ARROW-10064 - [C++] Resolve compile warnings on Apple Clang 12 ARROW-10065 - [Rust] DRY downcasted Arrays ARROW-10066 - [C++] Make sure that default AWS region is respected ARROW-10068 - [C++] Add bundled external project for aws-sdk-cpp ARROW-10069 - [Java] Support running Java benchmarks from command line ARROW-10070 - [C++][Compute] Implement stdev aggregate kernel ARROW-10071 - [R] segfault with ArrowObject from previous session, or saved ARROW-10074 - [C++] Don’t use string_view.to_string() ARROW-10075 - [C++] Don’t use nonstd::nullopt; this breaks our vendoring abstraction. ARROW-10076 - [C++] Use TemporaryDir for all tests that don’t already use it. ARROW-10077 - [C++] Potential overflow in bit_stream_utils.h multiplication. ARROW-10083 - [C++] Improve Parquet fuzz seed corpus ARROW-10084 - [Rust] [DataFusion] Add length of large string array ARROW-10086 - [Rust] Migrate min_large_string -&gt; min_string kernels ARROW-10090 - [C++][Compute] Improve mode kernel ARROW-10092 - [Dev][Go] Add grpc generated go files to rat exclusion list ARROW-10093 - [R] Add ability to opt-out of int64 -&gt; int demotion ARROW-10095 - [Rust] [Parquet] Update for IPC changes ARROW-10096 - [Rust] [DataFusion] Remove unused code ARROW-10099 - [C++][Dataset] Also allow integer partition fields to be dictionary encoded ARROW-10100 - [C++][Dataset] Ability to read/subset a ParquetFileFragment with given set of row group ids ARROW-10102 - [C++] Generalize BasicDecimal128::operator*= for reuse in Decimal256 ARROW-10103 - [Rust] Add a Contains kernel ARROW-10105 - [FlightRPC] Add client option to disable certificate validation with TLS ARROW-10120 - [C++][Parquet] Create reading benchmarks for 2-level nested data ARROW-10127 - [Format] Update specification to support 256-bit Decimal types ARROW-10129 - [Rust] Cargo build is rebuilding dependencies on arrow changes ARROW-10134 - [C++][Dataset] Add ParquetFileFragment::num_row_groups property 
ARROW-10139 - [C++] Add support for building arrow_testing without building tests ARROW-10148 - [Rust] Add documentation to lib.rs ARROW-10151 - [Python] Add support MapArray to_pandas conversion ARROW-10155 - [Rust] [DataFusion] Add documentation to lib.rs ARROW-10156 - [Rust] Auto-label PRs ARROW-10157 - [Rust] Add more documentation about take ARROW-10160 - [Rust] Improve documentation of DictionaryType ARROW-10161 - [Rust] [DataFusion] Simplify expression tests ARROW-10162 - [Rust] Support display of DictionaryArrays in pretty printing ARROW-10164 - [Rust] Add support for DictionaryArray types to cast kernels ARROW-10167 - [Rust] Support display of DictionaryArrays in sql.rs ARROW-10168 - [Rust] [Parquet] Extend arrow schema conversion to projected fields ARROW-10171 - [Rust] [DataFusion] Add `ExecutionContext::from&lt;ExecutionContextState&gt;` ARROW-10190 - [Website] Add Jorge to list of committers ARROW-10191 - [Rust] [Parquet] Add roundtrip tests for single column batches ARROW-10196 - [C++] Add Future::DeferNotOk() ARROW-10199 - [Rust][Parquet] Release Parquet at crates.io to remove debug prints ARROW-10201 - [C++][CI] Disable S3 in arm64 job on Travis CI ARROW-10202 - [CI][Windows] Use sf.net mirror for MSYS2 ARROW-10205 - [Java][FlightRPC] Add client option to disable server verification ARROW-10206 - [Python][C++][FlightRPC] Add client option to disable server validation ARROW-10215 - [Rust] [DataFusion] Rename “Source” typedef ARROW-10217 - [CI] Run fewer GitHub Actions jobs ARROW-10225 - [Rust] [Parquet] Fix null bitmap comparisons in roundtrip tests ARROW-10227 - [Ruby] Use a table size as the default for parquet chunk_size ARROW-10229 - [C++][Parquet] Remove left over ARROW_LOG statement. 
ARROW-10231 - [CI] Unable to download minio in arm32v7 docker image ARROW-10233 - [Rust] Make array_value_to_string available in all Arrow builds ARROW-10235 - [Rust][DataFusion] Improve documentation for type coercion ARROW-10240 - [Rust] [Datafusion] Optionally load tpch data into memory before running benchmark query ARROW-10251 - [Rust] [DataFusion] MemTable::load() should load partitions in parallel ARROW-10252 - [Python] Add option to skip inclusion of Arrow headers in Python installation ARROW-10256 - [C++][Flight] Disable -Werror carefully ARROW-10257 - [R] Prepare news/docs for 2.0 release ARROW-10260 - [Python] Missing MapType to Pandas dtype ARROW-10263 - [C++][Compute] Improve numerical stability of variances merging ARROW-10265 - [CI] Use smaller build when cache doesn’t exist on Travis CI ARROW-10266 - [CI][macOS] Ensure using Python 3.8 with Homebrew ARROW-10267 - [Python] Skip flight test if disable_server_verification feature is not available ARROW-10272 - [Packaging][Python] Pin newer multibuild version to avoid updating homebrew ARROW-10273 - [CI][Homebrew] Fix “brew audit” usage ARROW-10287 - [C++] Avoid std::random_device whenever possible ARROW-10289 - [Rust] Support reading dictionary streams ARROW-10295 - [Rust] [DataFusion] Simplify accumulators ARROW-10310 - [C++][Gandiva] Add single argument round() in Gandiva PARQUET-1845 - [C++] Int96 memory images in test cases assume only little-endian PARQUET-1878 - [C++] lz4 codec is not compatible with Hadoop Lz4Codec PARQUET-1904 - [C++] Export file_offset in RowGroupMetaData" />
<meta property="og:description" content="Apache Arrow 2.0.0 (19 October 2020) This is a major release covering more than 3 months of development. Download Source Artifacts Binary Artifacts For CentOS For Debian For Python For Ubuntu Git tag Contributors This release includes 511 commits from 81 distinct contributors. $ git shortlog -sn apache-arrow-1.0.0..apache-arrow-2.0.0 68 Jorge C. Leitao 48 Antoine Pitrou 40 Krisztián Szűcs 34 alamb 33 Neal Richardson 30 Andy Grove 25 Benjamin Kietzman 25 Joris Van den Bossche 19 Sutou Kouhei 13 Uwe L. Korn 12 Micah Kornfield 10 Frank Du 10 Jörn Horstmann 9 Neville Dipale 9 Romain Francois 9 arw2019 8 Yibo Cai 8 liyafan82 7 Sagnik Chakraborty 6 David Li 5 Kazuaki Ishizaki 5 Mahmut Bulut 4 Mingyu Zhong 4 fredgan 3 Bryan Cutler 3 wqc200 2 Daniel Russo 2 Diana Clarke 2 James Duong 2 Kenta Murata 2 Patrick Woody 2 Projjal Chanda 2 naman1996 2 ptaylor 2 tianchen 1 Adam Szmigin 1 Ali McMaster 1 Andrew Stevenson 1 Ben Kimock 1 Brian Dunlay 1 Christoph Schulze 1 Derek Marsh 1 Dominik Moritz 1 Eric Erhardt 1 Ezra 1 Fernando José Herrera Elizalde 1 FredGan 1 Hongze Zhang 1 Jim Klucar 1 Josiah 1 Kyle Strand 1 Laurent Goujon 1 Lawrence Chan 1 Mark Rushakoff 1 Matt Corley 1 Matthew Topol 1 Matthias 1 Morgan Cassels 1 Ofek 1 Patrick Pai 1 Paul 1 PoojaChandak 1 Prashanth Govindarajan 1 Pratik raj 1 Revital Sur 1 Ruan Pearce-Authers 1 Ryan Murray 1 Simon Bertron 1 Steve Suh 1 Tanguy Fautre 1 Tobias Mayer 1 Troels Nielsen 1 Vivian Kong 1 Wes McKinney 1 Xavier Lange 1 Yordan Pavlov 1 kanga333 1 karldw 1 mubai 1 offthewall123 1 zanmato1984 Patch Committers The following Apache committers merged contributed patches to the repository. $ git shortlog -csn apache-arrow-1.0.0..apache-arrow-2.0.0 127 Andy Grove 92 Antoine Pitrou 56 Krisztián Szűcs 51 Neal Richardson 44 Sutou Kouhei 18 Joris Van den Bossche 18 Micah Kornfield 17 Benjamin Kietzman 17 Wes McKinney 16 Neville Dipale 12 Jorge C. 
Leitao 10 Praveen 7 Paddy Horan 4 David Li 4 Eric Erhardt 4 Sebastien Binet 4 Uwe L. Korn 4 liyafan82 3 GitHub 1 Bryan Cutler 1 Chao Sun 1 tianchen Changelog Apache Arrow 2.0.0 (2020-10-19) Bug Fixes ARROW-2367 - [Python] ListArray has trouble with sizes greater than kMaximumCapacity ARROW-4189 - [CI] [Rust] Fix broken cargo coverage ARROW-4917 - [C++] orc_ep fails in cpp-alpine docker ARROW-5578 - [C++][Flight] Flight does not build out of the box on Alpine Linux ARROW-7226 - [JSON][Python] Json loader fails on example in documentation. ARROW-7384 - [Website] Fix search indexing warning reported by Google ARROW-7517 - [C++] Builder does not honour dictionary type provided during initialization ARROW-7663 - [Python] from_pandas gives TypeError instead of ArrowTypeError in some cases ARROW-7903 - [Rust] [DataFusion] Upgrade SQLParser dependency for DataFusion ARROW-7957 - [Python] ParquetDataset cannot take HadoopFileSystem as filesystem ARROW-8265 - [Rust] [DataFusion] Table API collect() should not require context ARROW-8394 - [JS] Typescript compiler errors for arrow d.ts files, when using es2015-esm package ARROW-8735 - [Rust] [Parquet] Parquet crate fails to compile on Arm architecture ARROW-8749 - [C++] IpcFormatWriter writes dictionary batches with wrong ID ARROW-8773 - [Python] pyarrow schema.empty_table() does not preserve nullability of fields ARROW-9028 - [R] Should be able to convert an empty table ARROW-9096 - [Python] Pandas roundtrip with object-dtype column labels with integer values: data type “integer” not understood ARROW-9177 - [C++][Parquet] Tracking issue for cross-implementation LZ4 Parquet compression compatibility ARROW-9414 - [C++] apt package includes headers for S3 interface, but no support ARROW-9462 - [Go] The Indentation after the first Record arrjson writer is missing ARROW-9463 - [Go] The writer is double closed in TestReadWrite ARROW-9490 - [Python] pyarrow array creation for specific set of numpy scalars fails ARROW-9495 - [C++] 
Equality assertions don’t handle Inf / -Inf properly ARROW-9520 - [Rust] [DataFusion] Can’t alias an aggregate expression ARROW-9528 - [Python] Honor tzinfo information when converting from datetime to pyarrow ARROW-9532 - [Python] Building pyarrow for MacPorts on macOS ARROW-9535 - [Python] Remove symlink fixes from conda recipe ARROW-9536 - Missing parameters in PlasmaOutOfMemoryException.java ARROW-9541 - [C++] CMakeLists requires UTF8PROC_STATIC when building static library ARROW-9544 - [R] version argument of write_parquet not working ARROW-9546 - [Python] Clean up Pandas Metadata Conversion test ARROW-9548 - [Go] Test output files in tmp directory are not removed correctly ARROW-9549 - [Rust] Parquet no longer builds ARROW-9554 - [Java] FixedWidthInPlaceVectorSorter sometimes produces wrong result ARROW-9556 - [Python][C++] Segfaults in UnionArray with null values ARROW-9560 - [Packaging] conda recipes failing due to missing conda-forge.yml ARROW-9569 - [CI][R] Fix rtools35 builds for msys2 key change ARROW-9570 - [Doc] Clean up sphinx sidebar ARROW-9573 - [Python] Parquet doesn’t load when partitioned column starts with ‘_’ ARROW-9574 - [R] Cleanups for CRAN 1.0.0 release ARROW-9575 - [R] gcc-UBSAN failure on CRAN ARROW-9577 - [Python][C++] posix_madvise error on Debian in pyarrow 1.0.0 ARROW-9583 - [Rust] Offset is mishandled in arithmetic and boolean compute kernels ARROW-9588 - [C++] clang/win: Copy constructor of ParquetInvalidOrCorruptedFileException not correctly triggered ARROW-9589 - [C++/R] arrow_exports.h contains structs declared as class ARROW-9592 - [CI] Update homebrew before calling brew bundle ARROW-9596 - [CI][Crossbow] Fix homebrew-cpp again, again ARROW-9597 - [C++] AddAlias in compute::FunctionRegistry should be synchronized ARROW-9598 - [C++][Parquet] Spaced definition levels is not assigned correctly. 
ARROW-9599 - [CI] Appveyor toolchain build fails because CMake detects different C and C++ compilers ARROW-9600 - [Rust] When used as a crate dependency, arrow-flight is rebuilt on every invocation of cargo build ARROW-9602 - [R] Improve cmake detection in Linux build ARROW-9603 - [C++][Parquet] Write Arrow relies on unspecified behavior for nested types ARROW-9606 - [C++][Dataset] in expressions don’t work with &gt;1 partition levels ARROW-9609 - [C++] CSV datasets don’t materialize virtual columns ARROW-9621 - [Python] test_move_file() is failed with fsspec 0.8.0 ARROW-9622 - [Java] ComplexCopier fails if a structvector has a child UnionVector with nulls ARROW-9628 - [Rust] Clippy PR test failing intermittently on Rust / AMD64 MacOS ARROW-9629 - [Python] Kartothek integration tests failing due to missing freezegun module ARROW-9631 - [Rust] Arrow crate should not depend on flight ARROW-9642 - [C++] Let MakeBuilder refer DictionaryType’s index_type for deciding the starting bit width of the indices ARROW-9643 - [C++] Illegal instruction on haswell cpu ARROW-9644 - [C++][Dataset] Do not check for ignore_prefixes in the base path ARROW-9652 - [Rust][DataFusion] Panic trying to select * from a CSV (panicked at ‘index out of bounds: the len is 0 but the index is 0) ARROW-9653 - [Rust][DataFusion] Multi-column Group by: Invalid Argument Error ARROW-9659 - [C++] RecordBatchStreamReader throws on CUDA device buffers ARROW-9660 - [C++] IPC - dictionaries in maps ARROW-9666 - [Python][wheel][Windows] library missing failure by ARROW-9412 ARROW-9670 - [C++][FlightRPC] Close()ing a DoPut with an ongoing read locks up the client ARROW-9684 - [C++] Fix undefined behaviour on invalid IPC / Parquet input (OSS-Fuzz) ARROW-9692 - [Python] distutils import warning ARROW-9693 - [CI][Docs] Nightly docs build fails 
ARROW-9696 - [Rust] [Datafusion] nested binary expressions broken ARROW-9698 - [C++] Revert “Add -NDEBUG flag to arrow.pc” ARROW-9700 - [Python] create_library_symlinks doesn’t work in macos ARROW-9712 - [Rust] [DataFusion] ParquetScanExec panics on error ARROW-9714 - [Rust] [DataFusion] TypeCoercionRule not implemented for Limit or Sort ARROW-9716 - [Rust] [DataFusion] MergeExec should have concurrency limit ARROW-9726 - [Rust] [DataFusion] ParquetScanExec launches threads too early ARROW-9727 - [C++] Fix crash on invalid IPC input (OSS-Fuzz) ARROW-9729 - [Java] Error Prone causes other annotation processors to not work with Eclipse ARROW-9733 - [Rust][DataFusion] Aggregates COUNT/MIN/MAX don’t work on VARCHAR columns ARROW-9734 - [Rust] [DataFusion] TableProvider.scan executing partitions prematurely ARROW-9741 - [Rust] [DataFusion] Incorrect count in TPC-H query 1 result set ARROW-9743 - [R] Sanitize paths in open_dataset ARROW-9744 - [Python] Failed to install on aarch64 ARROW-9764 - [CI][Java] Push wrong Docker image ARROW-9768 - [Python] Pyarrow allows for unsafe conversions of datetime objects to timestamp nanoseconds ARROW-9778 - [Rust] [DataFusion] Logical and physical schemas’ nullability does not match in 8 out of 20 end-to-end tests ARROW-9783 - [Rust] [DataFusion] Logical aggregate expressions require explicit data type ARROW-9785 - [Python] pyarrow/tests/test_fs.py::test_s3_options too slow ARROW-9789 - [C++] Don’t install jemalloc in parallel ARROW-9790 - [Rust] [Parquet] ParquetFileArrowReader fails to decode all pages if batches fall exactly on row group boundaries ARROW-9793 - [Rust] [DataFusion] Tests failing in master ARROW-9797 - [Rust] AMD64 Conda Integration Tests is failing for the Master branch ARROW-9799 - 
[Rust] [DataFusion] Implementation of physical binary expression get_type method is incorrect ARROW-9800 - [Rust] [Parquet] “min” and “max” written to standard out when writing columns ARROW-9809 - [Rust] [DataFusion] logical schema = physical schema is not true ARROW-9814 - [Python] Crash in test_parquet.py::test_read_partitioned_directory_s3fs ARROW-9815 - [Rust] [DataFusion] Deadlock in creation of physical plan with two udfs ARROW-9816 - [C++] Escape quotes in config.h ARROW-9827 - [Python] pandas.read_parquet fails for wide parquet files and pyarrow 1.0.X ARROW-9831 - [Rust] [DataFusion] Fix compilation error ARROW-9840 - [Python] Python fs documentation out of date with code ARROW-9846 - [Rust] Master branch broken build ARROW-9851 - [C++] Valgrind errors due to unrecognized instructions ARROW-9852 - [C++] Fix crash on invalid IPC input (OSS-Fuzz) ARROW-9855 - [R] Fix bad merge/Rcpp conflict ARROW-9859 - [C++] S3 FileSystemFromUri with special char in secret key fails ARROW-9864 - [Python] pathlib.Path not supported in write_to_dataset with partition columns ARROW-9874 - [C++] NewStreamWriter / NewFileWriter don’t own output stream ARROW-9876 - [CI][C++] Travis ARM jobs timeout ARROW-9877 - [C++][CI] homebrew-cpp fails due to avx512 ARROW-9879 - [Python] ChunkedArray.__getitem__ doesn’t work with numpy scalars ARROW-9882 - [C++/Python] Update conda-forge-pinning to 3 for OSX conda packages ARROW-9883 - [R] Fix linuxlibs.R install script for R &lt; 3.6 ARROW-9888 - [Rust] [DataFusion] ExecutionContext can not be shared between threads ARROW-9889 - [Rust][DataFusion] Datafusion CLI: CREATE EXTERNAL TABLE errors with “Unsupported logical plan variant” ARROW-9897 - [C++][Gandiva] Add to_date() function from pattern ARROW-9898 - 
[C++][Gandiva] Error handling in castINT fails in some environments ARROW-9906 - [Python] Crash in test_parquet.py::test_parquet_writer_filesystem_s3_uri (closing NativeFile from S3FileSystem) ARROW-9913 - [C++] Outputs of Decimal128::FromString depend on presence of one another ARROW-9920 - [Python] pyarrow.concat_arrays segfaults when passing it a chunked array ARROW-9922 - [Rust] Add `try_from(Vec&lt;Option&lt;(&amp;str, ArrayRef)&gt;&gt;)` to StructArray ARROW-9924 - [Python] Performance regression reading individual Parquet files using Dataset interface ARROW-9931 - [C++] Fix undefined behaviour on invalid IPC (OSS-Fuzz) ARROW-9932 - [R] Arrow 1.0.1 R package fails to install on R3.4 over linux ARROW-9936 - [Python] Fix / test relative file paths in pyarrow.parquet ARROW-9937 - [Rust] [DataFusion] Average is not correct ARROW-9943 - [C++] Arrow metadata not applied recursively when reading Parquet file ARROW-9946 - [R] ParquetFileWriter segfaults when `sink` is a string ARROW-9953 - [R] Declare minimum version for bit64 ARROW-9962 - [Python] Conversion to pandas with index column using fixed timezone fails ARROW-9968 - [C++] UBSAN link failure with __int8_t ARROW-9969 - [C++] RecordBatchBuilder yields invalid result with dictionary fields ARROW-9970 - [Go] checkptr failures in sum methods ARROW-9972 - [CI] Work around grpc-re2 clash on Homebrew ARROW-9973 - [Java] JDBC DateConsumer does not allow dates before epoch ARROW-9976 - [Python] ArrowCapacityError when doing Table.from_pandas with large dataframe ARROW-9990 - [Rust] [DataFusion] NOT is not plannable ARROW-9993 - [Python] Tzinfo - string roundtrip fails on pytz.StaticTzInfo objects ARROW-9994 - [C++][Python] Auto chunking nested array containing binary-like fields result malformed output ARROW-9996 - [C++] Dictionary is unset when calling DictionaryArray.GetScalar for null values ARROW-10003 - [C++] Create directories in CopyFiles when copying within the same filesystem ARROW-10008 - [Python] 
pyarrow.parquet.read_table fails with predicate pushdown on categorical data with use_legacy_dataset=False ARROW-10011 - [C++] Make FindRE2.cmake re-entrant ARROW-10012 - [C++] Sporadic failures in CopyFiles test ARROW-10013 - [C++][CI] Flight test failure in TestFlightClient.GenericOptions ARROW-10017 - [Java] LargeMemoryUtil.checkedCastToInt has buggy logic ARROW-10022 - [C++] [Compute] core dumped on some scalar-arithmetic-benchmark ARROW-10027 - [Python] Incorrect null column returned when using a dataset filter expression. ARROW-10034 - [Rust] Master build broken ARROW-10041 - [Rust] Possible to create LargeStringArray with DataType::Utf8 ARROW-10047 - [CI] Conda integration tests failing with cmake error ARROW-10048 - [Rust] Error in aggregate of min/max for strings ARROW-10049 - [C++/Python] Sync conda recipe with conda-forge ARROW-10060 - [Rust] [DataFusion] MergeExec currently discards partitions with errors ARROW-10062 - [Rust]: Fix for null elems for DoubleEndedIter for DictArray ARROW-10073 - [Python] Test test_parquet_nested_storage relies on dict item ordering ARROW-10081 - [C++/Python] Fix bash syntax in drone.io conda builds ARROW-10085 - [C++] S3 tests fail on AppVeyor ARROW-10087 - [CI] Fix nightly docs job ARROW-10098 - [R][Doc] Fix copy_files doc mismatch ARROW-10104 - [Python] Separate tests into its own conda package ARROW-10114 - [R] Segfault in to_dataframe_parallel with deeply nested structs ARROW-10116 - [Python][Packaging] Fix gRPC linking error in macOS wheels builds ARROW-10119 - [C++] Fix Parquet crashes on invalid input (OSS-Fuzz) ARROW-10121 - [C++][Python] Variable dictionaries do not survive roundtrip to IPC stream ARROW-10124 - [R] Write functions don’t follow umask setting ARROW-10125 - [R] Int64 downcast check doesn’t consider all chunks ARROW-10130 - [C++][Dataset] ParquetFileFragment::SplitByRowGroup does not preserve “complete_metadata” status ARROW-10136 - [Rust][Arrow] Nulls are transformed into “” after filtering for 
StringArray ARROW-10137 - [R] Fix cpp helper that breaks if libarrow is not present ARROW-10147 - [Python] Constructing pandas metadata fails if an Index name is not JSON-serializable by default ARROW-10150 - [C++] Fix crashes on invalid Parquet file (OSS-Fuzz) ARROW-10169 - [Rust] Nulls should be rendered as “” rather than default value when pretty printing arrays ARROW-10174 - [Java] Reading of Dictionary encoded struct vector fails ARROW-10175 - [CI] Nightly hdfs integration test job fails ARROW-10176 - [CI] Nightly valgrind job fails ARROW-10178 - [CI] Fix spark master integration test build setup ARROW-10179 - [Rust] Labeler is not labeling ARROW-10181 - [Rust] Arrow tests fail to compile on Raspberry Pi (32 bit) ARROW-10188 - [Rust] [DataFusion] Some examples are broken ARROW-10189 - [Doc] C data interface example for i32 uses `l`, not `i`, in the format ARROW-10192 - [C++][Python] Segfault when converting nested struct array with dictionary field to pandas series ARROW-10193 - [Python] Segfault when converting to fixed size binary array ARROW-10200 - [Java][CI] Fix failure of Java CI on s390x ARROW-10204 - [RUST] [Datafusion] Test failure in aggregate_grouped_empty with simd feature enabled ARROW-10214 - [Python] UnicodeDecodeError when printing schema with binary metadata ARROW-10226 - [Rust] [Parquet] Parquet reader reading wrong columns in some batches within a parquet file ARROW-10230 - [JS][Doc] JavaScript documentation fails to build ARROW-10232 - FixedSizeListArray is incorrectly written/read to/from parquet ARROW-10234 - [C++][Gandiva] Fix logic of round() for floats/decimals in Gandiva ARROW-10237 - [C++] Duplicate values in a dictionary result in corrupted parquet ARROW-10238 - [C#] List&lt;Struct&gt; is broken ARROW-10239 - [C++] aws-sdk-cpp apparently requires zlib too ARROW-10244 - [Python][Docs] Add docs on using pyarrow.dataset.parquet_dataset ARROW-10248 - [C++][Dataset] Dataset writing does not write schema metadata ARROW-10262 - [C++] Some 
TypeClass in Scalar classes seem incorrect ARROW-10270 - [R] Fix CSV timestamp_parsers test on R-devel ARROW-10271 - [Rust] packed_simd is broken and continued under a new project ARROW-10279 - [Release][Python] Fix verification script to align with the new macos wheel platform tags ARROW-10280 - [Packaging][Python] Fix macOS wheel artifact patterns ARROW-10281 - [Python] Fix warnings when running tests ARROW-10284 - [Python] Pyarrow is raising deprecation warning about filesystems on import ARROW-10285 - [Python] pyarrow.orc submodule is using deprecated functionality ARROW-10286 - [C++][Flight] Misleading CMake errors ARROW-10288 - [C++] Compilation fails on i386 ARROW-10290 - [C++] List POP_BACK is not available in older CMake versions ARROW-10293 - [Rust] [DataFusion] Fix benchmarks ARROW-10296 - [R] Data saved as integer64 loaded as integer New Features and Improvements ARROW-983 - [C++] Implement InputStream and OutputStream classes for interacting with socket connections ARROW-1105 - [C++] SQLite record batch reader ARROW-1509 - [Python] Write serialized object as a stream of encapsulated IPC messages ARROW-1669 - [C++] Consider adding Abseil (Google C++11 standard library extensions) to toolchain ARROW-1797 - [C++] Implement binary arithmetic kernels for numeric arrays ARROW-2164 - [C++] Clean up unnecessary decimal module refs ARROW-3080 - [Python] Unify Arrow to Python object conversion paths ARROW-3757 - [R] R bindings for Flight RPC client ARROW-3872 - [R] Add ad hoc test of feather compatibility ARROW-4046 - [Python/CI] Exercise large memory tests ARROW-4248 - [C++][Plasma] Build on Windows / Visual Studio ARROW-4685 - [C++] Update Boost to 1.69 in manylinux1 docker image ARROW-4927 - [Rust] Update top level README to describe current functionality ARROW-4957 - [Rust] [DataFusion] Implement get_supertype correctly ARROW-4965 - [Python] Timestamp array type detection should use tzname of datetime.datetime objects ARROW-5034 - [C#] ArrowStreamWriter 
should expose synchronous Write methods ARROW-5123 - [Rust] derive RecordWriter from struct definitions ARROW-6075 - [FlightRPC] Handle uncaught exceptions in middleware ARROW-6281 - [Python] Produce chunked arrays for nested types in pyarrow.array ARROW-6282 - [Format] Support lossy compression ARROW-6437 - [R] Add AWS SDK to system dependencies for macOS and Windows ARROW-6535 - [C++] Status::WithMessage should accept variadic parameters ARROW-6537 - [R] Pass column_types to CSV reader ARROW-6972 - [C#] Should support StructField arrays ARROW-6982 - [R] Add bindings for compare and boolean kernels ARROW-7136 - [Rust][CI] Pre-install the rust dependencies in the dockerfile ARROW-7218 - [Python] Conversion from boolean numpy scalars not working ARROW-7302 - [C++] CSV: allow converting a column to a specific dictionary type ARROW-7372 - [C++] Allow creating dictionary array from simple JSON ARROW-7871 - [Python] Expose more compute kernels ARROW-7960 - [C++][Parquet] Add support for schema translation from parquet nodes back to arrow for missing types ARROW-8001 - [R][Dataset] Bindings for dataset writing ARROW-8002 - [C++][Dataset] Dataset writing should let you (re)partition the data ARROW-8048 - [Python] Run memory leak tests nightly as follow up to ARROW-4120 ARROW-8172 - [C++] ArrayFromJSON for dictionary arrays ARROW-8205 - [Rust] [DataFusion] DataFusion should enforce unique field names in a schema ARROW-8253 - [Rust] [DataFusion] Improve ergonomics of registering UDFs ARROW-8262 - [Rust] [DataFusion] Add example that uses LogicalPlanBuilder ARROW-8289 - [Rust] [Parquet] Implement minimal Arrow Parquet writer as starting point for full writer ARROW-8296 - [C++][Dataset] IpcFileFormat should support writing files with compressed buffers ARROW-8355 - [Python] Reduce the number of pandas dependent test cases in test_feather ARROW-8359 - [C++/Python] Enable aarch64/ppc64le build in conda recipes ARROW-8383 - [Rust] Easier random access to DictionaryArray keys and 
values ARROW-8402 - [Java] Support ValidateFull methods in Java ARROW-8423 - [Rust] [Parquet] Serialize arrow schema into metadata when writing parquet ARROW-8426 - [Rust] [Parquet] Add support for writing dictionary types ARROW-8493 - [C++] Create unified schema resolution code for Array reconstruction. ARROW-8494 - [C++] Implement basic array-by-array reassembly logic ARROW-8581 - [C#] Date32/64Array.Builder should accept DateTime, not DateTimeOffset ARROW-8601 - [Go][Flight] Implement Flight Writer interface ARROW-8618 - [C++] ASSIGN_OR_RAISE should move its argument ARROW-8678 - [C++][Parquet] Remove legacy arrow to level translation. ARROW-8712 - [R] Expose strptime timestamp parsing in read_csv conversion options ARROW-8774 - [Rust] [DataFusion] Improve threading model ARROW-8810 - [R] Add documentation about Parquet format, appending to stream format ARROW-8824 - [Rust] [DataFusion] Implement new SQL parser ARROW-8828 - [Rust] Implement SQL tokenizer ARROW-8829 - [Rust] Implement SQL parser ARROW-9010 - [Java] Framework and interface changes for RecordBatch IPC buffer compression ARROW-9065 - [C++] Support parsing date32 in dataset partition folders ARROW-9068 - [C++][Dataset] Simplify Partitioning interface ARROW-9078 - [C++] Parquet writing of extension type with nested storage type fails ARROW-9104 - [C++] Parquet encryption tests should write files to a temporary directory instead of the testing submodule’s directory ARROW-9107 - [C++][Dataset] Time-based types support ARROW-9147 - [C++][Dataset] Support null -&gt; other type promotion in Dataset scanning ARROW-9205 - [Documentation] Fix typos in Columnar.rst ARROW-9266 - [Python][Packaging] Enable S3 support in macOS wheels ARROW-9271 - [R] Preserve data frame metadata in round trip ARROW-9286 - [C++] Add function “aliases” to compute::FunctionRegistry ARROW-9328 - [C++][Gandiva] Add LTRIM, RTRIM, BTRIM functions for string ARROW-9338 - [Rust] 
Add instructions for running clippy locally ARROW-9344 - [C++][Flight] measure latency quantile in flight benchmark ARROW-9358 - [Integration] Reconsider generated_large_batch.json ARROW-9371 - [Java] Run vector tests for both allocators ARROW-9377 - [Java] Support unsigned dictionary indices ARROW-9387 - [R] Use new C++ table select method ARROW-9388 - [C++] Division kernels ARROW-9394 - [Python] Support pickling of Scalars ARROW-9398 - [C++] Register the SIMD sum variants under function instance instead a SIMD function ARROW-9402 - [C++] Add portable wrappers for __builtin_add_overflow and friends ARROW-9405 - [R] Switch to cpp11 ARROW-9412 - [C++] Add non-BUNDLED dependencies to exported INSTALL_INTERFACE_LIBS of arrow_static and test that it works ARROW-9429 - [Python] ChunkedArray.to_numpy ARROW-9454 - [GLib] Add binding of some dictionary builders ARROW-9465 - [Python] Improve ergonomics of compute functions ARROW-9469 - [Python] Make more objects weakrefable ARROW-9487 - [Developer] Cover the archery release utilities with unittests ARROW-9488 - [Release] Use the new changelog generation when updating the website ARROW-9507 - [Rust] [DataFusion] PhysicalExpr should implement Display trait ARROW-9508 - [Release][APT][Yum] Enable verification for arm64 binaries ARROW-9516 - [Rust][DataFusion] Refactor physical expressions to not care about their names nor indexes ARROW-9517 - [C++][Python] Allow session_token argument when initializing S3FileSystem ARROW-9518 - [Python] Deprecate pyarrow serialization ARROW-9521 - [Rust] CsvReadOptions should allow file extension to be specified ARROW-9523 - [Rust] improve performance of filter kernel ARROW-9534 - [Rust] [DataFusion] Implement functions for creating literal expressions for all types ARROW-9550 - [Rust] [DataFusion] Remove Rc&lt;RefCell&lt;_&gt;&gt; from hash aggregate operator ARROW-9553 - [Rust] Release script doesn’t bump parquet crate’s arrow dependency version ARROW-9557 - [R] Iterating over parquet 
columns is slow in R ARROW-9559 - [Rust] [DataFusion] Revert privatization of exprlist_to_fields ARROW-9563 - [Dev][Release] Use archery’s changelog generator when creating release notes for the website ARROW-9568 - [CI] Use official msys action on GHA ARROW-9576 - [Python][Doc] Fix error in code example for extension types ARROW-9580 - [JS] Docs have superfluous () ARROW-9581 - [Dev][Release] Bump next snapshot versions to 2.0.0 ARROW-9582 - [Rust] Implement Array::memory_size() ARROW-9585 - [Rust] Remove duplicated to-do line in DataFusion readme ARROW-9587 - [FlightRPC][Java] Clean up DoPut/FlightStream memory handling ARROW-9593 - [Python] Add custom pickle reducers for DictionaryScalar ARROW-9604 - [C++] Add benchmark for aggregate min/max compute kernels ARROW-9605 - [C++] Optimize performance for aggregate min/max compute kernels ARROW-9607 - [C++][Gandiva] Add bitwise_and(), bitwise_or() and bitwise_not() functions for integers ARROW-9608 - [Rust] Remove arrow flight from parquet’s feature gating ARROW-9615 - [Rust] Add kernel to compute length of string array ARROW-9617 - [Rust] [DataFusion] Add length of string array ARROW-9618 - [Rust] [DataFusion] Make it easier to write optimizers ARROW-9619 - [Rust] [DataFusion] Add predicate push-down ARROW-9632 - [Rust] Add a “new” method for ExecutionContextSchemaProvider ARROW-9638 - [C++][Compute] Implement mode(most frequent number) kernel ARROW-9639 - [Ruby] Add dependency version check ARROW-9640 - [C++][Gandiva] Implement round() for integers and long integers ARROW-9641 - [C++][Gandiva] Implement round() for floating point and double floating point numbers ARROW-9645 - [Python] Deprecate the legacy pyarrow.filesystem interface ARROW-9646 - [C++][Dataset] Add support for writing parquet datasets ARROW-9650 - [Packaging][APT] Drop support for Ubuntu 19.10 ARROW-9654 - [Rust][DataFusion] Add an EXPLAIN command to the datafusion CLI ARROW-9656 - [Rust][DataFusion] Slightly confusing error message when 
unsupported type is provided to CREATE EXTERNAL TABLE ARROW-9658 - [Python][Dataset] Bindings for dataset writing ARROW-9665 - [R] head/tail/take for Datasets ARROW-9667 - [CI][Crossbow] Segfault in 2 nightly R builds ARROW-9671 - [C++] BasicDecimal128 constructor interprets uint64_t integers with highest bit set as negative ARROW-9673 - [Rust] Add a param “dialect” for DFParser::parse_sql ARROW-9678 - [Rust] [DataFusion] Improve projection push down to remove unused columns ARROW-9679 - [Rust] [DataFusion] HashAggregate walks map many times building final batch ARROW-9681 - [Java] Failed Arrow Memory - Core on big-endian platform ARROW-9683 - [Rust][DataFusion] Implement Debug for ExecutionPlan trait ARROW-9691 - [Rust] [DataFusion] Make sql_statement_to_plan public ARROW-9695 - [Rust][DataFusion] Improve documentation on LogicalPlan variants ARROW-9699 - [C++][Compute] Improve mode kernel performance for small integer types ARROW-9701 - [Java][CI] Add a test job on s390x ARROW-9702 - [C++] Move bpacking simd to runtime path ARROW-9703 - [Developer][Archery] Restartable cherry-picking process for creating maintenance branches ARROW-9706 - [Java] Tests in TestLargeListVector fails on big endian platform ARROW-9710 - [C++] Generalize Decimal ToString in preparation for Decimal256 ARROW-9711 - [Rust] Add benchmark based on TPC-H ARROW-9713 - [Rust][DataFusion] Remove explicit panics ARROW-9715 - [R] changelog/doc updates for 1.0.1 ARROW-9718 - [Python] Make pyarrow.parquet work with the new filesystem interfaces ARROW-9721 - [Packaging][Python] Update wheel dependency files ARROW-9722 - [Rust]: Shorten key lifetime for reverse lookup for dictionary arrays ARROW-9723 - [C++] Expected behaviour of “mode” kernel with NaNs ? 
ARROW-9725 - [Rust] [DataFusion] LimitExec and SortExec should use MergeExec ARROW-9737 - [C++][Gandiva] Add bitwise_xor() for integers ARROW-9739 - [CI][Ruby] Don’t install gem documents ARROW-9742 - [Rust] Create one standard DataFrame API ARROW-9751 - [Rust] [DataFusion] Extend UDFs to accept more than one type per argument ARROW-9752 - [Rust] [DataFusion] Add support for Aggregate UDFs ARROW-9753 - [Rust] [DataFusion] Remove the use of Mutex in ExecutionPlan trait ARROW-9754 - [Rust] [DataFusion] Implement async in DataFusion traits ARROW-9757 - [Rust] [DataFusion] Use “pub use” to expose a clean public API ARROW-9758 - [Rust] [DataFusion] Implement extension API for DataFusion ARROW-9759 - [Rust] [DataFusion] Implement DataFrame::sort ARROW-9760 - [Rust] [DataFusion] Implement DataFrame::explain ARROW-9761 - [C++] Add experimental pull-based iterator structures to C interface implementation ARROW-9762 - [Rust] [DataFusion] ExecutionContext::sql should return DataFrame ARROW-9769 - [Python] Remove skip for in-memory fsspec in test_move_file ARROW-9775 - [C++] Automatic S3 region selection ARROW-9781 - [C++] Fix uninitialized value warnings ARROW-9782 - [C++][Dataset] Ability to write “.feather” files with IpcFileFormat ARROW-9784 - [Rust] [DataFusion] Improve instructions for running tpch benchmark ARROW-9786 - [R] Unvendor cpp11 before release ARROW-9788 - Handle naming inconsistencies between SQL, DataFrame API and struct names ARROW-9792 - [Rust] [DataFusion] Logical aggregate functions should not return Result ARROW-9794 - [C++] Add functionality to cpu_info to discriminate between Intel vs AMD x86 ARROW-9795 - [C++][Gandiva] Implement castTIMESTAMP(int64) in Gandiva ARROW-9806 - [R] More compute kernel bindings ARROW-9807 - [R] News update/version bump post-1.0.1 ARROW-9808 - [Python] parquet.read_table docstring wrong use_legacy_dataset explanation ARROW-9811 - [C++] Unchecked floating point division by 0 should succeed ARROW-9813 - [C++] Disable semantic 
interposition ARROW-9819 - [C++] Bump mimalloc to 1.6.4 ARROW-9821 - [Rust][DataFusion] User Defined PlanNode / Operator API ARROW-9823 - [CI][C++][MinGW] Enable S3 ARROW-9832 - [Rust] [DataFusion] Refactor PhysicalPlan to remove Partition ARROW-9833 - [Rust] [DataFusion] Refactor TableProvider.scan to return ExecutionPlan ARROW-9834 - [Rust] [DataFusion] Remove Partition trait ARROW-9835 - [Rust] [DataFusion] Remove FunctionMeta ARROW-9836 - [Rust] [DataFusion] Improve API for usage of UDFs ARROW-9837 - [Rust] Add provider for variable ARROW-9838 - [Rust] [DataFusion] DefaultPhysicalPlanner should insert explicit MergeExec nodes ARROW-9839 - [Rust] [DataFusion] Add ability to downcast ExecutionPlan to specific operator ARROW-9841 - [Rust] Update checked-in flatbuffer files ARROW-9844 - [Go][CI] Add Travis CI job for Go on s390x ARROW-9845 - [Rust] [Parquet] serde_json is only used in tests but isn’t in dev-dependencies ARROW-9848 - [Rust] Implement changes to ensure flatbuffer alignment ARROW-9849 - [Rust] [DataFusion] Make UDFs not need a Field ARROW-9850 - [Go] Defer should not be used in the loop ARROW-9853 - [RUST] Implement “take” kernel for dictionary arrays ARROW-9854 - [R] Support reading/writing data to/from S3 ARROW-9858 - [C++][Python][Docs] Expand user guide for FileSystem ARROW-9863 - [C++] [PARQUET] Optimize meta data recovery of ApplicationVersion ARROW-9867 - [C++][Dataset] FileSystemDataset should expose its filesystem ARROW-9868 - [C++] Provide utility for copying files between filesystems ARROW-9869 - [R] Implement full S3FileSystem/S3Options constructor ARROW-9870 - [R] Friendly interface for filesystems (S3) ARROW-9871 - [C++] Add uppercase support to ARROW_USER_SIMD_LEVEL. 
ARROW-9873 - [C++][Compute] Improve mode kernel for intergers within limited value range ARROW-9875 - [Python] Let FileSystem.get_file_info accept a single path ARROW-9884 - [R] Bindings for writing datasets to Parquet ARROW-9885 - [Rust] [DataFusion] Simplify code of type coercion for binary types ARROW-9886 - [Rust] [DataFusion] Simplify code to test cast ARROW-9887 - [Rust] [DataFusion] Add support for complex return types of built-in functions ARROW-9890 - [R] Add zstandard compression codec in macOS build ARROW-9891 - [Rust] [DataFusion] Make math functions support f32 ARROW-9892 - [Rust] [DataFusion] Add support for concat ARROW-9893 - [Python] Bindings for writing datasets to Parquet ARROW-9895 - [RUST] Improve sort kernels ARROW-9899 - [Rust] [DataFusion] Switch from Box&lt;Schema&gt; –&gt; SchemaRef (Arc&lt;Schema&gt;) to be consistent with the rest of Arrow ARROW-9900 - [Rust][DataFusion] Use Arc&lt;&gt; instead of Box&lt;&gt; in LogicalPlan ARROW-9901 - [C++] Add hand-crafted Parquet to Arrow reconstruction test for nested reading ARROW-9902 - [Rust] [DataFusion] Add support for array() ARROW-9904 - [C++] Unroll the loop manually for CountSetBits ARROW-9908 - [Rust] Support temporal data types in JSON reader ARROW-9910 - [Rust] [DataFusion] Type coercion of Variadic is wrong ARROW-9914 - [Rust][DataFusion] Document the SQL -&gt; Arrow type mapping ARROW-9916 - [RUST] Avoid cloning ArrayData in several places ARROW-9917 - [Python][Compute] Add bindings for mode kernel ARROW-9919 - [Rust] [DataFusion] Math functions ARROW-9921 - [Rust] Add `from(Vec&lt;Option&lt;&amp;str&gt;&gt;)` to [Large]StringArray ARROW-9925 - [GLib] Add low level value readers for GArrowListArray family ARROW-9926 - [GLib] Use placement new for GArrowRecordBatchFileReader ARROW-9928 - [C++] Speed up integer parsing slightly ARROW-9929 - [Developer] Autotune cmake-format ARROW-9933 - [Developer] Add drone as a CI provider for crossbow ARROW-9934 - [Rust] Shape and stride check in 
tensor ARROW-9941 - [Python] Better string representation for extension types ARROW-9944 - [Rust] Implement TO_TIMESTAMP function ARROW-9949 - [C++] Generalize Decimal128::FromString for reuse in Decimal256 ARROW-9950 - [Rust] [DataFusion] Allow UDF usage without registry ARROW-9952 - [Python] Use pyarrow.dataset writing for pq.write_to_dataset ARROW-9954 - [Rust] [DataFusion] Simplify code of aggregate planning ARROW-9956 - [C++][Gandiva] Implement Binary string function in Gandiva ARROW-9957 - [Rust] Remove unmaintained tempdir dependency ARROW-9961 - [Rust][DataFusion] to_timestamp function parses timestamp without timezone offset as UTC rather than local ARROW-9964 - [C++] CSV date support ARROW-9965 - [Java] Buffer capacity calculations are slow for fixed-width vectors ARROW-9966 - [Rust] Speedup aggregate kernels ARROW-9967 - [Python] Add compute module docs ARROW-9971 - [Rust] Speedup take ARROW-9977 - [Rust] Add min/max for [Large]String ARROW-9979 - [Rust] Fix arrow crate clippy lints ARROW-9980 - [Rust] Fix parquet crate clippy lints ARROW-9981 - [Rust] Allow configuring flight IPC with IpcWriteOptions ARROW-9983 - [C++][Dataset][Python] Use larger default batch size than 32K for Datasets API ARROW-9984 - [Rust] [DataFusion] DRY of function to string ARROW-9986 - [Rust][DataFusion] TO_TIMESTAMP function erroneously requires fractional seconds when no timezone is present ARROW-9987 - [Rust] [DataFusion] Improve docs of `Expr`. 
ARROW-9988 - [Rust] [DataFusion] Added std::ops to logical expressions ARROW-9992 - [C++][Python] Refactor python to arrow conversions based on a reusable conversion API ARROW-9998 - [Python] Support pickling DictionaryScalar ARROW-9999 - [Python] Support constructing dictionary array directly through pa.array() ARROW-10000 - [C++][Python] Support constructing StructArray from list of key-value pairs ARROW-10001 - [Rust] [DataFusion] Add developer guide to README ARROW-10010 - [Rust] Speedup arithmetic ARROW-10015 - [Rust] Implement SIMD for aggregate kernel sum ARROW-10016 - [Rust] [DataFusion] Implement IsNull and IsNotNull ARROW-10018 - [CI] Disable Sphinx and API documentation build since it takes 6 hours on master ARROW-10019 - [Rust] Add substring kernel ARROW-10023 - [Gandiva][C++] Implementing Split part function in gandiva ARROW-10024 - [C++][Parquet] Create nested reading benchmarks ARROW-10028 - [Rust] Simplify macro def_numeric_from_vec ARROW-10030 - [Rust] Support fromIter and toIter ARROW-10035 - [C++] Bump versions of vendored code ARROW-10037 - [C++] Workaround to force find AWS SDK to look for shared libraries ARROW-10040 - [Rust] Create a way to slice unalligned offset buffers ARROW-10043 - [Rust] [DataFusion] Introduce support for DISTINCT by partially implementing COUNT(DISTINCT) ARROW-10044 - [Rust] Improve README ARROW-10046 - [Rust] [DataFusion] Made `*Iterator` implement Iterator ARROW-10050 - [C++][Gandiva] Implement concat() in Gandiva for up to 10 arguments ARROW-10051 - [C++][Compute] Make aggregate kernel merge state mutable ARROW-10054 - [Python] Slice methods should return empty arrays instead of crashing ARROW-10055 - [Rust] Implement DoubleEndedIterator for NullableIter ARROW-10057 - [C++] Add Parquet-Arrow roundtrip tests for nested data ARROW-10058 - [C++] Investigate performance of LevelsToBitmap without BMI2 ARROW-10059 - [R][Doc] Give more advice on how to set up C++ build ARROW-10063 - [Archery][CI] Fetch main branch in 
archery build only when it is a pull request ARROW-10064 - [C++] Resolve compile warnings on Apple Clang 12 ARROW-10065 - [Rust] DRY downcasted Arrays ARROW-10066 - [C++] Make sure that default AWS region is respected ARROW-10068 - [C++] Add bundled external project for aws-sdk-cpp ARROW-10069 - [Java] Support running Java benchmarks from command line ARROW-10070 - [C++][Compute] Implement stdev aggregate kernel ARROW-10071 - [R] segfault with ArrowObject from previous session, or saved ARROW-10074 - [C++] Don’t use string_view.to_string() ARROW-10075 - [C++] Don’t use nonstd::nullopt this breaks out vendoring abstraction. ARROW-10076 - [C++] Use TemporaryDir for all tests that don’t already use it. ARROW-10077 - [C++] Potential overflow in bit_stream_utils.h multiplication. ARROW-10083 - [C++] Improve Parquet fuzz seed corpus ARROW-10084 - [Rust] [DataFusion] Add length of large string array ARROW-10086 - [Rust] Migrate min_large_string -&gt; min_string kernels ARROW-10090 - [C++][Compute] Improve mode kernel ARROW-10092 - [Dev][Go] Add grpc generated go files to rat exclusion list ARROW-10093 - [R] Add ability to opt-out of int64 -&gt; int demotion ARROW-10095 - [Rust] [Parquet] Update for IPC changes ARROW-10096 - [Rust] [DataFusion] Remove unused code ARROW-10099 - [C++][Dataset] Also allow integer partition fields to be dictionary encoded ARROW-10100 - [C++][Dataset] Ability to read/subset a ParquetFileFragment with given set of row group ids ARROW-10102 - [C++] Generalize BasicDecimal128::operator*= for reuse in Decimal256 ARROW-10103 - [Rust] Add a Contains kernel ARROW-10105 - [FlightRPC] Add client option to disable certificate validation with TLS ARROW-10120 - [C++][Parquet] Create reading benchmarks for 2-level nested data ARROW-10127 - [Format] Update specification to support 256-bit Decimal types ARROW-10129 - [Rust] Cargo build is rebuilding dependencies on arrow changes ARROW-10134 - [C++][Dataset] Add ParquetFileFragment::num_row_groups property 
ARROW-10139 - [C++] Add support for building arrow_testing without building tests ARROW-10148 - [Rust] Add documentation to lib.rs ARROW-10151 - [Python] Add support MapArray to_pandas conversion ARROW-10155 - [Rust] [DataFusion] Add documentation to lib.rs ARROW-10156 - [Rust] Auto-label PRs ARROW-10157 - [Rust] Add more documentation about take ARROW-10160 - [Rust] Improve documentation of DictionaryType ARROW-10161 - [Rust] [DataFusion] Simplify expression tests ARROW-10162 - [Rust] Support display of DictionaryArrays in pretty printing ARROW-10164 - [Rust] Add support for DictionaryArray types to cast kernels ARROW-10167 - [Rust] Support display of DictionaryArrays in sql.rs ARROW-10168 - [Rust] [Parquet] Extend arrow schema conversion to projected fields ARROW-10171 - [Rust] [DataFusion] Add `ExecutionContext::from&lt;ExecutionContextState&gt;` ARROW-10190 - [Website] Add Jorge to list of committers ARROW-10191 - [Rust] [Parquet] Add roundtrip tests for single column batches ARROW-10196 - [C++] Add Future::DeferNotOk() ARROW-10199 - [Rust][Parquet] Release Parquet at crates.io to remove debug prints ARROW-10201 - [C++][CI] Disable S3 in arm64 job on Travis CI ARROW-10202 - [CI][Windows] Use sf.net mirror for MSYS2 ARROW-10205 - [Java][FlightRPC] Add client option to disable server verification ARROW-10206 - [Python][C++][FlightRPC] Add client option to disable server validation ARROW-10215 - [Rust] [DataFusion] Rename “Source” typedef ARROW-10217 - [CI] Run fewer GitHub Actions jobs ARROW-10225 - [Rust] [Parquet] Fix null bitmap comparisons in roundtrip tests ARROW-10227 - [Ruby] Use a table size as the default for parquet chunk_size ARROW-10229 - [C++][Parquet] Remove left over ARROW_LOG statement. 
ARROW-10231 - [CI] Unable to download minio in arm32v7 docker image ARROW-10233 - [Rust] Make array_value_to_string available in all Arrow builds ARROW-10235 - [Rust][DataFusion] Improve documentation for type coercion ARROW-10240 - [Rust] [Datafusion] Optionally load tpch data into memory before running benchmark query ARROW-10251 - [Rust] [DataFusion] MemTable::load() should load partitions in parallel ARROW-10252 - [Python] Add option to skip inclusion of Arrow headers in Python installation ARROW-10256 - [C++][Flight] Disable -Werror carefully ARROW-10257 - [R] Prepare news/docs for 2.0 release ARROW-10260 - [Python] Missing MapType to Pandas dtype ARROW-10263 - [C++][Compute] Improve numerical stability of variances merging ARROW-10265 - [CI] Use smaler build when cache doesn’t exit on Travis CI ARROW-10266 - [CI][macOS] Ensure using Python 3.8 with Homebrew ARROW-10267 - [Python] Skip flight test if disable_server_verification feature is not available ARROW-10272 - [Packaging][Python] Pin newer multibuild version to avoid updating homebrew ARROW-10273 - [CI][Homebrew] Fix “brew audit” usage ARROW-10287 - [C++] Avoid std::random_device whenever possible ARROW-10289 - [Rust] Support reading dictionary streams ARROW-10295 - [Rust] [DataFusion] Simplify accumulators ARROW-10310 - [C++][Gandiva] Add single argument round() in Gandiva PARQUET-1845 - [C++] Int96 memory images in test cases assume only little-endian PARQUET-1878 - [C++] lz4 codec is not compatible with Hadoop Lz4Codec PARQUET-1904 - [C++] Export file_offset in RowGroupMetaData" />
<link rel="canonical" href="https://arrow.apache.org/release/2.0.0.html" />
<meta property="og:url" content="https://arrow.apache.org/release/2.0.0.html" />
<meta property="og:site_name" content="Apache Arrow" />
<meta property="og:image" content="https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png" />
<meta property="og:type" content="article" />
<meta property="article:published_time" content="2024-05-07T06:07:05-04:00" />
<meta name="twitter:card" content="summary_large_image" />
<meta property="twitter:image" content="https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png" />
<meta property="twitter:title" content="Apache Arrow 2.0.0 Release" />
<meta name="twitter:site" content="@ApacheArrow" />
<script type="application/ld+json">
{"@context":"https://schema.org","@type":"BlogPosting","dateModified":"2024-05-07T06:07:05-04:00","datePublished":"2024-05-07T06:07:05-04:00","description":"Apache Arrow 2.0.0 (19 October 2020) This is a major release covering more than 3 months of development. Download Source Artifacts Binary Artifacts For CentOS For Debian For Python For Ubuntu Git tag Contributors This release includes 511 commits from 81 distinct contributors. $ git shortlog -sn apache-arrow-1.0.0..apache-arrow-2.0.0 68 Jorge C. Leitao 48 Antoine Pitrou 40 Krisztián Szűcs 34 alamb 33 Neal Richardson 30 Andy Grove 25 Benjamin Kietzman 25 Joris Van den Bossche 19 Sutou Kouhei 13 Uwe L. Korn 12 Micah Kornfield 10 Frank Du 10 Jörn Horstmann 9 Neville Dipale 9 Romain Francois 9 arw2019 8 Yibo Cai 8 liyafan82 7 Sagnik Chakraborty 6 David Li 5 Kazuaki Ishizaki 5 Mahmut Bulut 4 Mingyu Zhong 4 fredgan 3 Bryan Cutler 3 wqc200 2 Daniel Russo 2 Diana Clarke 2 James Duong 2 Kenta Murata 2 Patrick Woody 2 Projjal Chanda 2 naman1996 2 ptaylor 2 tianchen 1 Adam Szmigin 1 Ali McMaster 1 Andrew Stevenson 1 Ben Kimock 1 Brian Dunlay 1 Christoph Schulze 1 Derek Marsh 1 Dominik Moritz 1 Eric Erhardt 1 Ezra 1 Fernando José Herrera Elizalde 1 FredGan 1 Hongze Zhang 1 Jim Klucar 1 Josiah 1 Kyle Strand 1 Laurent Goujon 1 Lawrence Chan 1 Mark Rushakoff 1 Matt Corley 1 Matthew Topol 1 Matthias 1 Morgan Cassels 1 Ofek 1 Patrick Pai 1 Paul 1 PoojaChandak 1 Prashanth Govindarajan 1 Pratik raj 1 Revital Sur 1 Ruan Pearce-Authers 1 Ryan Murray 1 Simon Bertron 1 Steve Suh 1 Tanguy Fautre 1 Tobias Mayer 1 Troels Nielsen 1 Vivian Kong 1 Wes McKinney 1 Xavier Lange 1 Yordan Pavlov 1 kanga333 1 karldw 1 mubai 1 offthewall123 1 zanmato1984 Patch Committers The following Apache committers merged contributed patches to the repository. 
$ git shortlog -csn apache-arrow-1.0.0..apache-arrow-2.0.0 127 Andy Grove 92 Antoine Pitrou 56 Krisztián Szűcs 51 Neal Richardson 44 Sutou Kouhei 18 Joris Van den Bossche 18 Micah Kornfield 17 Benjamin Kietzman 17 Wes McKinney 16 Neville Dipale 12 Jorge C. Leitao 10 Praveen 7 Paddy Horan 4 David Li 4 Eric Erhardt 4 Sebastien Binet 4 Uwe L. Korn 4 liyafan82 3 GitHub 1 Bryan Cutler 1 Chao Sun 1 tianchen Changelog Apache Arrow 2.0.0 (2020-10-19) Bug Fixes ARROW-2367 - [Python] ListArray has trouble with sizes greater than kMaximumCapacity ARROW-4189 - [CI] [Rust] Fix broken cargo coverage ARROW-4917 - [C++] orc_ep fails in cpp-alpine docker ARROW-5578 - [C++][Flight] Flight does not build out of the box on Alpine Linux ARROW-7226 - [JSON][Python] Json loader fails on example in documentation. ARROW-7384 - [Website] Fix search indexing warning reported by Google ARROW-7517 - [C++] Builder does not honour dictionary type provided during initialization ARROW-7663 - [Python] from_pandas gives TypeError instead of ArrowTypeError in some cases ARROW-7903 - [Rust] [DataFusion] Upgrade SQLParser dependency for DataFusion ARROW-7957 - [Python] ParquetDataset cannot take HadoopFileSystem as filesystem ARROW-8265 - [Rust] [DataFusion] Table API collect() should not require context ARROW-8394 - [JS] Typescript compiler errors for arrow d.ts files, when using es2015-esm package ARROW-8735 - [Rust] [Parquet] Parquet crate fails to compile on Arm architecture ARROW-8749 - [C++] IpcFormatWriter writes dictionary batches with wrong ID ARROW-8773 - [Python] pyarrow schema.empty_table() does not preserve nullability of fields ARROW-9028 - [R] Should be able to convert an empty table ARROW-9096 - [Python] Pandas roundtrip with object-dtype column labels with integer values: data type “integer” not understood ARROW-9177 - [C++][Parquet] Tracking issue for cross-implementation LZ4 Parquet compression compatibility ARROW-9414 - [C++] apt package includes headers for S3 interface, but no 
support ARROW-9462 - [Go] The Indentation after the first Record arrjson writer is missing ARROW-9463 - [Go] The writer is double closed in TestReadWrite ARROW-9490 - [Python] pyarrow array creation for specific set of numpy scalars fails ARROW-9495 - [C++] Equality assertions don’t handle Inf / -Inf properly ARROW-9520 - [Rust] [DataFusion] Can’t alias an aggregate expression ARROW-9528 - [Python] Honor tzinfo information when converting from datetime to pyarrow ARROW-9532 - [Python] Building pyarrow for MacPorts on macOS ARROW-9535 - [Python] Remove symlink fixes from conda recipe ARROW-9536 - Missing parameters in PlasmaOutOfMemoryException.java ARROW-9541 - [C++] CMakeLists requires UTF8PROC_STATIC when building static library ARROW-9544 - [R] version argument of write_parquet not working ARROW-9546 - [Python] Clean up Pandas Metadata Conversion test ARROW-9548 - [Go] Test output files in tmp directory are not removed correctly ARROW-9549 - [Rust] Parquet no longer builds ARROW-9554 - [Java] FixedWidthInPlaceVectorSorter sometimes produces wrong result ARROW-9556 - [Python][C++] Segfaults in UnionArray with null values ARROW-9560 - [Packaging] conda recipes failing due to missing conda-forge.yml ARROW-9569 - [CI][R] Fix rtools35 builds for msys2 key change ARROW-9570 - [Doc] Clean up sphinx sidebar ARROW-9573 - [Python] Parquet doesn’t load when partitioned column starts with ‘_’ ARROW-9574 - [R] Cleanups for CRAN 1.0.0 release ARROW-9575 - [R] gcc-UBSAN failure on CRAN ARROW-9577 - [Python][C++] posix_madvise error on Debian in pyarrow 1.0.0 ARROW-9583 - [Rust] Offset is mishandled in arithmetic and boolean compute kernels ARROW-9588 - [C++] clang/win: Copy constructor of ParquetInvalidOrCorruptedFileException not correctly triggered ARROW-9589 - [C++/R] arrow_exports.h contains structs declared as class ARROW-9592 - [CI] Update homebrew before calling brew bundle ARROW-9596 - [CI][Crossbow] Fix homebrew-cpp again, again ARROW-9597 - [C++] AddAlias in 
compute::FunctionRegistry should be synchronized ARROW-9598 - [C++][Parquet] Spaced definition levels is not assigned correctly. ARROW-9599 - [CI] Appveyor toolchain build fails because CMake detects different C and C++ compilers ARROW-9600 - [Rust] When used as a crate dependency, arrow-flight is rebuilt on every invocation of cargo build ARROW-9602 - [R] Improve cmake detection in Linux build ARROW-9603 - [C++][Parquet] Write Arrow relies on unspecified behavior for nested types ARROW-9606 - [C++][Dataset] in expressions don’t work with &gt;1 partition levels ARROW-9609 - [C++] CSV datasets don’t materialize virtual columns ARROW-9621 - [Python] test_move_file() is failed with fsspec 0.8.0 ARROW-9622 - [Java] ComplexCopier fails if a structvector has a child UnionVector with nulls ARROW-9628 - [Rust] Clippy PR test failing intermittently on Rust / AMD64 MacOS ARROW-9629 - [Python] Kartothek integration tests failing due to missing freezegun module ARROW-9631 - [Rust] Arrow crate should not depend on flight ARROW-9642 - [C++] Let MakeBuilder refer DictionaryType’s index_type for deciding the starting bit width of the indices ARROW-9643 - [C++] Illegal instruction on haswell cpu ARROW-9644 - [C++][Dataset] Do not check for ignore_prefixes in the base path ARROW-9652 - [Rust][DataFusion] Panic trying to select * from a CSV (panicked at ‘index out of bounds: the len is 0 but the index is 0) ARROW-9653 - [Rust][DataFusion] Multi-column Group by: Invalid Argument Error ARROW-9659 - [C++] RecordBatchStreamReader throws on CUDA device buffers ARROW-9660 - [C++] IPC - dictionaries in maps ARROW-9666 - [Python][wheel][Windows] library missing failure by ARROW-9412 ARROW-9670 - [C++][FlightRPC] Close()ing a DoPut with an ongoing read locks up the client ARROW-9684 - [C++] Fix undefined behaviour on 
invalid IPC / Parquet input (OSS-Fuzz) ARROW-9692 - [Python] distutils import warning ARROW-9693 - [CI][Docs] Nightly docs build fails ARROW-9696 - [Rust] [Datafusion] nested binary expressions broken ARROW-9698 - [C++] Revert “Add -NDEBUG flag to arrow.pc” ARROW-9700 - [Python] create_library_symlinks doesn’t work in macos ARROW-9712 - [Rust] [DataFusion] ParquetScanExec panics on error ARROW-9714 - [Rust] [DataFusion] TypeCoercionRule not implemented for Limit or Sort ARROW-9716 - [Rust] [DataFusion] MergeExec should have concurrency limit ARROW-9726 - [Rust] [DataFusion] ParquetScanExec launches threads too early ARROW-9727 - [C++] Fix crash on invalid IPC input (OSS-Fuzz) ARROW-9729 - [Java] Error Prone causes other annotation processors to not work with Eclipse ARROW-9733 - [Rust][DataFusion] Aggregates COUNT/MIN/MAX don’t work on VARCHAR columns ARROW-9734 - [Rust] [DataFusion] TableProvider.scan executing partitions prematurely ARROW-9741 - [Rust] [DataFusion] Incorrect count in TPC-H query 1 result set ARROW-9743 - [R] Sanitize paths in open_dataset ARROW-9744 - [Python] Failed to install on aarch64 ARROW-9764 - [CI][Java] Push wrong Docker image ARROW-9768 - [Python] Pyarrow allows for unsafe conversions of datetime objects to timestamp nanoseconds ARROW-9778 - [Rust] [DataFusion] Logical and physical schemas’ nullability does not match in 8 out of 20 end-to-end tests ARROW-9783 - [Rust] [DataFusion] Logical aggregate expressions require explicit data type ARROW-9785 - [Python] pyarrow/tests/test_fs.py::test_s3_options too slow ARROW-9789 - [C++] Don’t install jemalloc in parallel ARROW-9790 - [Rust] [Parquet] ParquetFileArrowReader fails to decode all pages if batches fall exactly on row group boundaries ARROW-9793 - [Rust]
[DataFusion] Tests failing in master ARROW-9797 - [Rust] AMD64 Conda Integration Tests is failing for the Master branch ARROW-9799 - [Rust] [DataFusion] Implementation of physical binary expression get_type method is incorrect ARROW-9800 - [Rust] [Parquet] “min” and “max” written to standard out when writing columns ARROW-9809 - [Rust] [DataFusion] logical schema = physical schema is not true ARROW-9814 - [Python] Crash in test_parquet.py::test_read_partitioned_directory_s3fs ARROW-9815 - [Rust] [DataFusion] Deadlock in creation of physical plan with two udfs ARROW-9816 - [C++] Escape quotes in config.h ARROW-9827 - [Python] pandas.read_parquet fails for wide parquet files and pyarrow 1.0.X ARROW-9831 - [Rust] [DataFusion] Fix compilation error ARROW-9840 - [Python] Python fs documentation out of date with code ARROW-9846 - [Rust] Master branch broken build ARROW-9851 - [C++] Valgrind errors due to unrecognized instructions ARROW-9852 - [C++] Fix crash on invalid IPC input (OSS-Fuzz) ARROW-9855 - [R] Fix bad merge/Rcpp conflict ARROW-9859 - [C++] S3 FileSystemFromUri with special char in secret key fails ARROW-9864 - [Python] pathlib.Path not supported in write_to_dataset with partition columns ARROW-9874 - [C++] NewStreamWriter / NewFileWriter don’t own output stream ARROW-9876 - [CI][C++] Travis ARM jobs timeout ARROW-9877 - [C++][CI] homebrew-cpp fails due to avx512 ARROW-9879 - [Python] ChunkedArray.__getitem__ doesn’t work with numpy scalars ARROW-9882 - [C++/Python] Update conda-forge-pinning to 3 for OSX conda packages ARROW-9883 - [R] Fix linuxlibs.R install script for R &lt; 3.6 ARROW-9888 - [Rust] [DataFusion] ExecutionContext can not be shared between threads ARROW-9889 - [Rust][DataFusion] Datafusion CLI: CREATE EXTERNAL TABLE
errors with “Unsupported logical plan variant” ARROW-9897 - [C++][Gandiva] Add to_date() function from pattern ARROW-9898 - [C++][Gandiva] Error handling in castINT fails in some enviroments ARROW-9906 - [Python] Crash in test_parquet.py::test_parquet_writer_filesystem_s3_uri (closing NativeFile from S3FileSystem) ARROW-9913 - [C++] Outputs of Decimal128::FromString depend on presence of one another ARROW-9920 - [Python] pyarrow.concat_arrays segfaults when passing it a chunked array ARROW-9922 - [Rust] Add `try_from(Vec&lt;Option&lt;(&amp;str, ArrayRef)&gt;&gt;)` to StructArray ARROW-9924 - [Python] Performance regression reading individual Parquet files using Dataset interface ARROW-9931 - [C++] Fix undefined behaviour on invalid IPC (OSS-Fuzz) ARROW-9932 - [R] Arrow 1.0.1 R package fails to install on R3.4 over linux ARROW-9936 - [Python] Fix / test relative file paths in pyarrow.parquet ARROW-9937 - [Rust] [DataFusion] Average is not correct ARROW-9943 - [C++] Arrow metadata not applied recursively when reading Parquet file ARROW-9946 - [R] ParquetFileWriter segfaults when `sink` is a string ARROW-9953 - [R] Declare minimum version for bit64 ARROW-9962 - [Python] Conversion to pandas with index column using fixed timezone fails ARROW-9968 - [C++] UBSAN link failure with __int8_t ARROW-9969 - [C++] RecordBatchBuilder yields invalid result with dictionary fields ARROW-9970 - [Go] checkptr failures in sum methods ARROW-9972 - [CI] Work around grpc-re2 clash on Homebrew ARROW-9973 - [Java] JDBC DateConsumer does not allow dates before epoch ARROW-9976 - [Python] ArrowCapacityError when doing Table.from_pandas with large dataframe ARROW-9990 - [Rust] [DataFusion] NOT is not plannable ARROW-9993 - [Python] Tzinfo - string roundtrip fails on pytz.StaticTzInfo objects ARROW-9994 - [C++][Python] Auto chunking nested array containing binary-like fields result malformed output ARROW-9996 - [C++] Dictionary is unset when calling DictionaryArray.GetScalar for null values 
ARROW-10003 - [C++] Create directories in CopyFiles when copying within the same filesystem ARROW-10008 - [Python] pyarrow.parquet.read_table fails with predicate pushdown on categorical data with use_legacy_dataset=False ARROW-10011 - [C++] Make FindRE2.cmake re-entrant ARROW-10012 - [C++] Sporadic failures in CopyFiles test ARROW-10013 - [C++][CI] Flight test failure in TestFlightClient.GenericOptions ARROW-10017 - [Java] LargeMemoryUtil.checkedCastToInt has buggy logic ARROW-10022 - [C++] [Compute] core dumped on some scalar-arithmetic-benchmark ARROW-10027 - [Python] Incorrect null column returned when using a dataset filter expression. ARROW-10034 - [Rust] Master build broken ARROW-10041 - [Rust] Possible to create LargeStringArray with DataType::Utf8 ARROW-10047 - [CI] Conda integration tests failing with cmake error ARROW-10048 - [Rust] Error in aggregate of min/max for strings ARROW-10049 - [C++/Python] Sync conda recipe with conda-forge ARROW-10060 - [Rust] [DataFusion] MergeExec currently discards partitions with errors ARROW-10062 - [Rust]: Fix for null elems for DoubleEndedIter for DictArray ARROW-10073 - [Python] Test test_parquet_nested_storage relies on dict item ordering ARROW-10081 - [C++/Python] Fix bash syntax in drone.io conda builds ARROW-10085 - [C++] S3 tests fail on AppVeyor ARROW-10087 - [CI] Fix nightly docs job ARROW-10098 - [R][Doc] Fix copy_files doc mismatch ARROW-10104 - [Python] Separate tests into its own conda package ARROW-10114 - [R] Segfault in to_dataframe_parallel with deeply nested structs ARROW-10116 - [Python][Packaging] Fix gRPC linking error in macOS wheels builds ARROW-10119 - [C++] Fix Parquet crashes on invalid input (OSS-Fuzz) ARROW-10121 - [C++][Python] Variable dictionaries do not survive roundtrip to IPC stream ARROW-10124 - [R] Write functions don’t follow umask setting ARROW-10125 - [R] Int64 downcast check doesn’t consider all chunks ARROW-10130 - [C++][Dataset] ParquetFileFragment::SplitByRowGroup does not 
preserve “complete_metadata” status ARROW-10136 - [Rust][Arrow] Nulls are transformed into “” after filtering for StringArray ARROW-10137 - [R] Fix cpp helper that breaks if libarrow is not present ARROW-10147 - [Python] Constructing pandas metadata fails if an Index name is not JSON-serializable by default ARROW-10150 - [C++] Fix crashes on invalid Parquet file (OSS-Fuzz) ARROW-10169 - [Rust] Nulls should be rendered as “” rather than default value when pretty printing arrays ARROW-10174 - [Java] Reading of Dictionary encoded struct vector fails ARROW-10175 - [CI] Nightly hdfs integration test job fails ARROW-10176 - [CI] Nightly valgrind job fails ARROW-10178 - [CI] Fix spark master integration test build setup ARROW-10179 - [Rust] Labeler is not labeling ARROW-10181 - [Rust] Arrow tests fail to compile on Raspberry Pi (32 bit) ARROW-10188 - [Rust] [DataFusion] Some examples are broken ARROW-10189 - [Doc] C data interface example for i32 uses `l`, not `i`, in the format ARROW-10192 - [C++][Python] Segfault when converting nested struct array with dictionary field to pandas series ARROW-10193 - [Python] Segfault when converting to fixed size binary array ARROW-10200 - [Java][CI] Fix failure of Java CI on s390x ARROW-10204 - [RUST] [Datafusion] Test failure in aggregate_grouped_empty with simd feature enabled ARROW-10214 - [Python] UnicodeDecodeError when printing schema with binary metadata ARROW-10226 - [Rust] [Parquet] Parquet reader reading wrong columns in some batches within a parquet file ARROW-10230 - [JS][Doc] JavaScript documentation fails to build ARROW-10232 - FixedSizeListArray is incorrectly written/read to/from parquet ARROW-10234 - [C++][Gandiva] Fix logic of round() for floats/decimals in Gandiva ARROW-10237 - [C++] Duplicate values in a dictionary result in corrupted parquet ARROW-10238 - [C#] List&lt;Struct&gt; is broken ARROW-10239 - [C++] aws-sdk-cpp apparently requires zlib too ARROW-10244 - [Python][Docs] Add docs on using 
pyarrow.dataset.parquet_dataset ARROW-10248 - [C++][Dataset] Dataset writing does not write schema metadata ARROW-10262 - [C++] Some TypeClass in Scalar classes seem incorrect ARROW-10270 - [R] Fix CSV timestamp_parsers test on R-devel ARROW-10271 - [Rust] packed_simd is broken and continued under a new project ARROW-10279 - [Release][Python] Fix verification script to align with the new macos wheel platform tags ARROW-10280 - [Packaging][Python] Fix macOS wheel artifact patterns ARROW-10281 - [Python] Fix warnings when running tests ARROW-10284 - [Python] Pyarrow is raising deprecation warning about filesystems on import ARROW-10285 - [Python] pyarrow.orc submodule is using deprecated functionality ARROW-10286 - [C++][Flight] Misleading CMake errors ARROW-10288 - [C++] Compilation fails on i386 ARROW-10290 - [C++] List POP_BACK is not available in older CMake versions ARROW-10293 - [Rust] [DataFusion] Fix benchmarks ARROW-10296 - [R] Data saved as integer64 loaded as integer New Features and Improvements ARROW-983 - [C++] Implement InputStream and OutputStream classes for interacting with socket connections ARROW-1105 - [C++] SQLite record batch reader ARROW-1509 - [Python] Write serialized object as a stream of encapsulated IPC messages ARROW-1669 - [C++] Consider adding Abseil (Google C++11 standard library extensions) to toolchain ARROW-1797 - [C++] Implement binary arithmetic kernels for numeric arrays ARROW-2164 - [C++] Clean up unnecessary decimal module refs ARROW-3080 - [Python] Unify Arrow to Python object conversion paths ARROW-3757 - [R] R bindings for Flight RPC client ARROW-3872 - [R] Add ad hoc test of feather compatibility ARROW-4046 - [Python/CI] Exercise large memory tests ARROW-4248 - [C++][Plasma] Build on Windows / Visual Studio ARROW-4685 - [C++] Update Boost to 1.69 in manylinux1 docker image ARROW-4927 - [Rust] Update top level README to describe current functionality ARROW-4957 - [Rust] [DataFusion] Implement get_supertype correctly 
ARROW-4965 - [Python] Timestamp array type detection should use tzname of datetime.datetime objects ARROW-5034 - [C#] ArrowStreamWriter should expose synchronous Write methods ARROW-5123 - [Rust] derive RecordWriter from struct definitions ARROW-6075 - [FlightRPC] Handle uncaught exceptions in middleware ARROW-6281 - [Python] Produce chunked arrays for nested types in pyarrow.array ARROW-6282 - [Format] Support lossy compression ARROW-6437 - [R] Add AWS SDK to system dependencies for macOS and Windows ARROW-6535 - [C++] Status::WithMessage should accept variadic parameters ARROW-6537 - [R] Pass column_types to CSV reader ARROW-6972 - [C#] Should support StructField arrays ARROW-6982 - [R] Add bindings for compare and boolean kernels ARROW-7136 - [Rust][CI] Pre-install the rust dependencies in the dockerfile ARROW-7218 - [Python] Conversion from boolean numpy scalars not working ARROW-7302 - [C++] CSV: allow converting a column to a specific dictionary type ARROW-7372 - [C++] Allow creating dictionary array from simple JSON ARROW-7871 - [Python] Expose more compute kernels ARROW-7960 - [C++][Parquet] Add support for schema translation from parquet nodes back to arrow for missing types ARROW-8001 - [R][Dataset] Bindings for dataset writing ARROW-8002 - [C++][Dataset] Dataset writing should let you (re)partition the data ARROW-8048 - [Python] Run memory leak tests nightly as follow up to ARROW-4120 ARROW-8172 - [C++] ArrayFromJSON for dictionary arrays ARROW-8205 - [Rust] [DataFusion] DataFusion should enforce unique field names in a schema ARROW-8253 - [Rust] [DataFusion] Improve ergonomics of registering UDFs ARROW-8262 - [Rust] [DataFusion] Add example that uses LogicalPlanBuilder ARROW-8289 - [Rust] [Parquet] Implement minimal Arrow Parquet writer as starting point for full writer ARROW-8296 - [C++][Dataset] IpcFileFormat should support writing files with compressed buffers ARROW-8355 - [Python] Reduce the number of pandas dependent test cases in test_feather 
ARROW-8359 - [C++/Python] Enable aarch64/ppc64le build in conda recipes ARROW-8383 - [Rust] Easier random access to DictionaryArray keys and values ARROW-8402 - [Java] Support ValidateFull methods in Java ARROW-8423 - [Rust] [Parquet] Serialize arrow schema into metadata when writing parquet ARROW-8426 - [Rust] [Parquet] Add support for writing dictionary types ARROW-8493 - [C++] Create unified schema resolution code for Array reconstruction. ARROW-8494 - [C++] Implement basic array-by-array reassembly logic ARROW-8581 - [C#] Date32/64Array.Builder should accept DateTime, not DateTimeOffset ARROW-8601 - [Go][Flight] Implement Flight Writer interface ARROW-8618 - [C++] ASSIGN_OR_RAISE should move its argument ARROW-8678 - [C++][Parquet] Remove legacy arrow to level translation. ARROW-8712 - [R] Expose strptime timestamp parsing in read_csv conversion options ARROW-8774 - [Rust] [DataFusion] Improve threading model ARROW-8810 - [R] Add documentation about Parquet format, appending to stream format ARROW-8824 - [Rust] [DataFusion] Implement new SQL parser ARROW-8828 - [Rust] Implement SQL tokenizer ARROW-8829 - [Rust] Implement SQL parser ARROW-9010 - [Java] Framework and interface changes for RecordBatch IPC buffer compression ARROW-9065 - [C++] Support parsing date32 in dataset partition folders ARROW-9068 - [C++][Dataset] Simplify Partitioning interface ARROW-9078 - [C++] Parquet writing of extension type with nested storage type fails ARROW-9104 - [C++] Parquet encryption tests should write files to a temporary directory instead of the testing submodule’s directory ARROW-9107 - [C++][Dataset] Time-based types support ARROW-9147 - [C++][Dataset] Support null -&gt; other type promotion in Dataset scanning ARROW-9205 - [Documentation] Fix typos in Columnar.rst ARROW-9266 - [Python][Packaging] Enable S3 support in macOS wheels ARROW-9271 - [R] Preserve data frame metadata in round trip ARROW-9286 - [C++] Add
function “aliases” to compute::FunctionRegistry ARROW-9328 - [C++][Gandiva] Add LTRIM, RTRIM, BTRIM functions for string ARROW-9338 - [Rust] Add instructions for running clippy locally ARROW-9344 - [C++][Flight] measure latency quantile in flight benchmark ARROW-9358 - [Integration] Reconsider generated_large_batch.json ARROW-9371 - [Java] Run vector tests for both allocators ARROW-9377 - [Java] Support unsigned dictionary indices ARROW-9387 - [R] Use new C++ table select method ARROW-9388 - [C++] Division kernels ARROW-9394 - [Python] Support pickling of Scalars ARROW-9398 - [C++] Register the SIMD sum variants under function instance instead a SIMD function ARROW-9402 - [C++] Add portable wrappers for __builtin_add_overflow and friends ARROW-9405 - [R] Switch to cpp11 ARROW-9412 - [C++] Add non-BUNDLED dependencies to exported INSTALL_INTERFACE_LIBS of arrow_static and test that it works ARROW-9429 - [Python] ChunkedArray.to_numpy ARROW-9454 - [GLib] Add binding of some dictionary builders ARROW-9465 - [Python] Improve ergonomics of compute functions ARROW-9469 - [Python] Make more objects weakrefable ARROW-9487 - [Developer] Cover the archery release utilities with unittests ARROW-9488 - [Release] Use the new changelog generation when updating the website ARROW-9507 - [Rust] [DataFusion] PhysicalExpr should implement Display trait ARROW-9508 - [Release][APT][Yum] Enable verification for arm64 binaries ARROW-9516 - [Rust][DataFusion] Refactor physical expressions to not care about their names nor indexes ARROW-9517 - [C++][Python] Allow session_token argument when initializing S3FileSystem ARROW-9518 - [Python] Deprecate pyarrow serialization ARROW-9521 - [Rust] CsvReadOptions should allow file extension to be specified ARROW-9523 - [Rust] improve performance of filter kernel ARROW-9534 - [Rust] [DataFusion] Implement functions for creating literal expressions for all types ARROW-9550 - [Rust] [DataFusion] Remove Rc&lt;RefCell&lt;_&gt;&gt; from hash aggregate 
operator ARROW-9553 - [Rust] Release script doesn’t bump parquet crate’s arrow dependency version ARROW-9557 - [R] Iterating over parquet columns is slow in R ARROW-9559 - [Rust] [DataFusion] Revert privatization of exprlist_to_fields ARROW-9563 - [Dev][Release] Use archery’s changelog generator when creating release notes for the website ARROW-9568 - [CI] Use official msys action on GHA ARROW-9576 - [Python][Doc] Fix error in code example for extension types ARROW-9580 - [JS] Docs have superfluous () ARROW-9581 - [Dev][Release] Bump next snapshot versions to 2.0.0 ARROW-9582 - [Rust] Implement Array::memory_size() ARROW-9585 - [Rust] Remove duplicated to-do line in DataFusion readme ARROW-9587 - [FlightRPC][Java] Clean up DoPut/FlightStream memory handling ARROW-9593 - [Python] Add custom pickle reducers for DictionaryScalar ARROW-9604 - [C++] Add benchmark for aggregate min/max compute kernels ARROW-9605 - [C++] Optimize performance for aggregate min/max compute kernels ARROW-9607 - [C++][Gandiva] Add bitwise_and(), bitwise_or() and bitwise_not() functions for integers ARROW-9608 - [Rust] Remove arrow flight from parquet’s feature gating ARROW-9615 - [Rust] Add kernel to compute length of string array ARROW-9617 - [Rust] [DataFusion] Add length of string array ARROW-9618 - [Rust] [DataFusion] Make it easier to write optimizers ARROW-9619 - [Rust] [DataFusion] Add predicate push-down ARROW-9632 - [Rust] Add a “new” method for ExecutionContextSchemaProvider ARROW-9638 - [C++][Compute] Implement mode(most frequent number) kernel ARROW-9639 - [Ruby] Add dependency version check ARROW-9640 - [C++][Gandiva] Implement round() for integers and long integers ARROW-9641 - [C++][Gandiva] Implement round() for floating point and double floating point numbers ARROW-9645 - [Python] Deprecate the legacy pyarrow.filesystem interface ARROW-9646 - [C++][Dataset] Add support for writing parquet datasets ARROW-9650 - [Packaging][APT] Drop support for Ubuntu 19.10 ARROW-9654 - 
[Rust][DataFusion] Add an EXPLAIN command to the datafusion CLI ARROW-9656 - [Rust][DataFusion] Slightly confusing error message when unsupported type is provided to CREATE EXTERNAL TABLE ARROW-9658 - [Python][Dataset] Bindings for dataset writing ARROW-9665 - [R] head/tail/take for Datasets ARROW-9667 - [CI][Crossbow] Segfault in 2 nightly R builds ARROW-9671 - [C++] BasicDecimal128 constructor interprets uint64_t integers with highest bit set as negative ARROW-9673 - [Rust] Add a param “dialect” for DFParser::parse_sql ARROW-9678 - [Rust] [DataFusion] Improve projection push down to remove unused columns ARROW-9679 - [Rust] [DataFusion] HashAggregate walks map many times building final batch ARROW-9681 - [Java] Failed Arrow Memory - Core on big-endian platform ARROW-9683 - [Rust][DataFusion] Implement Debug for ExecutionPlan trait ARROW-9691 - [Rust] [DataFusion] Make sql_statement_to_plan public ARROW-9695 - [Rust][DataFusion] Improve documentation on LogicalPlan variants ARROW-9699 - [C++][Compute] Improve mode kernel performance for small integer types ARROW-9701 - [Java][CI] Add a test job on s390x ARROW-9702 - [C++] Move bpacking simd to runtime path ARROW-9703 - [Developer][Archery] Restartable cherry-picking process for creating maintenance branches ARROW-9706 - [Java] Tests in TestLargeListVector fails on big endian platform ARROW-9710 - [C++] Generalize Decimal ToString in preparation for Decimal256 ARROW-9711 - [Rust] Add benchmark based on TPC-H ARROW-9713 - [Rust][DataFusion] Remove explicit panics ARROW-9715 - [R] changelog/doc updates for 1.0.1 ARROW-9718 - [Python] Make pyarrow.parquet work with the new filesystem interfaces ARROW-9721 - [Packaging][Python] Update wheel dependency files ARROW-9722 - [Rust]: Shorten key lifetime for reverse lookup for dictionary arrays ARROW-9723 - [C++] Expected behaviour of “mode” kernel with NaNs ? 
ARROW-9725 - [Rust] [DataFusion] LimitExec and SortExec should use MergeExec ARROW-9737 - [C++][Gandiva] Add bitwise_xor() for integers ARROW-9739 - [CI][Ruby] Don’t install gem documents ARROW-9742 - [Rust] Create one standard DataFrame API ARROW-9751 - [Rust] [DataFusion] Extend UDFs to accept more than one type per argument ARROW-9752 - [Rust] [DataFusion] Add support for Aggregate UDFs ARROW-9753 - [Rust] [DataFusion] Remove the use of Mutex in ExecutionPlan trait ARROW-9754 - [Rust] [DataFusion] Implement async in DataFusion traits ARROW-9757 - [Rust] [DataFusion] Use “pub use” to expose a clean public API ARROW-9758 - [Rust] [DataFusion] Implement extension API for DataFusion ARROW-9759 - [Rust] [DataFusion] Implement DataFrame::sort ARROW-9760 - [Rust] [DataFusion] Implement DataFrame::explain ARROW-9761 - [C++] Add experimental pull-based iterator structures to C interface implementation ARROW-9762 - [Rust] [DataFusion] ExecutionContext::sql should return DataFrame ARROW-9769 - [Python] Remove skip for in-memory fsspec in test_move_file ARROW-9775 - [C++] Automatic S3 region selection ARROW-9781 - [C++] Fix uninitialized value warnings ARROW-9782 - [C++][Dataset] Ability to write “.feather” files with IpcFileFormat ARROW-9784 - [Rust] [DataFusion] Improve instructions for running tpch benchmark ARROW-9786 - [R] Unvendor cpp11 before release ARROW-9788 - Handle naming inconsistencies between SQL, DataFrame API and struct names ARROW-9792 - [Rust] [DataFusion] Logical aggregate functions should not return Result ARROW-9794 - [C++] Add functionality to cpu_info to discriminate between Intel vs AMD x86 ARROW-9795 - [C++][Gandiva] Implement castTIMESTAMP(int64) in Gandiva ARROW-9806 - [R] More compute kernel bindings ARROW-9807 - [R] News update/version bump post-1.0.1 ARROW-9808 - [Python] parquet.read_table docstring wrong use_legacy_dataset explanation ARROW-9811 - [C++] Unchecked floating point division by 0 should succeed ARROW-9813 - [C++] Disable semantic 
interposition ARROW-9819 - [C++] Bump mimalloc to 1.6.4 ARROW-9821 - [Rust][DataFusion] User Defined PlanNode / Operator API ARROW-9823 - [CI][C++][MinGW] Enable S3 ARROW-9832 - [Rust] [DataFusion] Refactor PhysicalPlan to remove Partition ARROW-9833 - [Rust] [DataFusion] Refactor TableProvider.scan to return ExecutionPlan ARROW-9834 - [Rust] [DataFusion] Remove Partition trait ARROW-9835 - [Rust] [DataFusion] Remove FunctionMeta ARROW-9836 - [Rust] [DataFusion] Improve API for usage of UDFs ARROW-9837 - [Rust] Add provider for variable ARROW-9838 - [Rust] [DataFusion] DefaultPhysicalPlanner should insert explicit MergeExec nodes ARROW-9839 - [Rust] [DataFusion] Add ability to downcast ExecutionPlan to specific operator ARROW-9841 - [Rust] Update checked-in flatbuffer files ARROW-9844 - [Go][CI] Add Travis CI job for Go on s390x ARROW-9845 - [Rust] [Parquet] serde_json is only used in tests but isn’t in dev-dependencies ARROW-9848 - [Rust] Implement changes to ensure flatbuffer alignment ARROW-9849 - [Rust] [DataFusion] Make UDFs not need a Field ARROW-9850 - [Go] Defer should not be used in the loop ARROW-9853 - [RUST] Implement “take” kernel for dictionary arrays ARROW-9854 - [R] Support reading/writing data to/from S3 ARROW-9858 - [C++][Python][Docs] Expand user guide for FileSystem ARROW-9863 - [C++] [PARQUET] Optimize meta data recovery of ApplicationVersion ARROW-9867 - [C++][Dataset] FileSystemDataset should expose its filesystem ARROW-9868 - [C++] Provide utility for copying files between filesystems ARROW-9869 - [R] Implement full S3FileSystem/S3Options constructor ARROW-9870 - [R] Friendly interface for filesystems (S3) ARROW-9871 - [C++] Add uppercase support to ARROW_USER_SIMD_LEVEL.
ARROW-9873 - [C++][Compute] Improve mode kernel for intergers within limited value range ARROW-9875 - [Python] Let FileSystem.get_file_info accept a single path ARROW-9884 - [R] Bindings for writing datasets to Parquet ARROW-9885 - [Rust] [DataFusion] Simplify code of type coercion for binary types ARROW-9886 - [Rust] [DataFusion] Simplify code to test cast ARROW-9887 - [Rust] [DataFusion] Add support for complex return types of built-in functions ARROW-9890 - [R] Add zstandard compression codec in macOS build ARROW-9891 - [Rust] [DataFusion] Make math functions support f32 ARROW-9892 - [Rust] [DataFusion] Add support for concat ARROW-9893 - [Python] Bindings for writing datasets to Parquet ARROW-9895 - [RUST] Improve sort kernels ARROW-9899 - [Rust] [DataFusion] Switch from Box&lt;Schema&gt; –&gt; SchemaRef (Arc&lt;Schema&gt;) to be consistent with the rest of Arrow ARROW-9900 - [Rust][DataFusion] Use Arc&lt;&gt; instead of Box&lt;&gt; in LogicalPlan ARROW-9901 - [C++] Add hand-crafted Parquet to Arrow reconstruction test for nested reading ARROW-9902 - [Rust] [DataFusion] Add support for array() ARROW-9904 - [C++] Unroll the loop manually for CountSetBits ARROW-9908 - [Rust] Support temporal data types in JSON reader ARROW-9910 - [Rust] [DataFusion] Type coercion of Variadic is wrong ARROW-9914 - [Rust][DataFusion] Document the SQL -&gt; Arrow type mapping ARROW-9916 - [RUST] Avoid cloning ArrayData in several places ARROW-9917 - [Python][Compute] Add bindings for mode kernel ARROW-9919 - [Rust] [DataFusion] Math functions ARROW-9921 - [Rust] Add `from(Vec&lt;Option&lt;&amp;str&gt;&gt;)` to [Large]StringArray ARROW-9925 - [GLib] Add low level value readers for GArrowListArray family ARROW-9926 - [GLib] Use placement new for GArrowRecordBatchFileReader ARROW-9928 - [C++] Speed up integer parsing slightly ARROW-9929 - [Developer] Autotune cmake-format ARROW-9933 - [Developer] Add drone as a CI provider for crossbow ARROW-9934 - [Rust] Shape and stride check in 
tensor ARROW-9941 - [Python] Better string representation for extension types ARROW-9944 - [Rust] Implement TO_TIMESTAMP function ARROW-9949 - [C++] Generalize Decimal128::FromString for reuse in Decimal256 ARROW-9950 - [Rust] [DataFusion] Allow UDF usage without registry ARROW-9952 - [Python] Use pyarrow.dataset writing for pq.write_to_dataset ARROW-9954 - [Rust] [DataFusion] Simplify code of aggregate planning ARROW-9956 - [C++][Gandiva] Implement Binary string function in Gandiva ARROW-9957 - [Rust] Remove unmaintained tempdir dependency ARROW-9961 - [Rust][DataFusion] to_timestamp function parses timestamp without timezone offset as UTC rather than local ARROW-9964 - [C++] CSV date support ARROW-9965 - [Java] Buffer capacity calculations are slow for fixed-width vectors ARROW-9966 - [Rust] Speedup aggregate kernels ARROW-9967 - [Python] Add compute module docs ARROW-9971 - [Rust] Speedup take ARROW-9977 - [Rust] Add min/max for [Large]String ARROW-9979 - [Rust] Fix arrow crate clippy lints ARROW-9980 - [Rust] Fix parquet crate clippy lints ARROW-9981 - [Rust] Allow configuring flight IPC with IpcWriteOptions ARROW-9983 - [C++][Dataset][Python] Use larger default batch size than 32K for Datasets API ARROW-9984 - [Rust] [DataFusion] DRY of function to string ARROW-9986 - [Rust][DataFusion] TO_TIMESTAMP function erroneously requires fractional seconds when no timezone is present ARROW-9987 - [Rust] [DataFusion] Improve docs of `Expr`. 
ARROW-9988 - [Rust] [DataFusion] Added std::ops to logical expressions ARROW-9992 - [C++][Python] Refactor python to arrow conversions based on a reusable conversion API ARROW-9998 - [Python] Support pickling DictionaryScalar ARROW-9999 - [Python] Support constructing dictionary array directly through pa.array() ARROW-10000 - [C++][Python] Support constructing StructArray from list of key-value pairs ARROW-10001 - [Rust] [DataFusion] Add developer guide to README ARROW-10010 - [Rust] Speedup arithmetic ARROW-10015 - [Rust] Implement SIMD for aggregate kernel sum ARROW-10016 - [Rust] [DataFusion] Implement IsNull and IsNotNull ARROW-10018 - [CI] Disable Sphinx and API documentation build since it takes 6 hours on master ARROW-10019 - [Rust] Add substring kernel ARROW-10023 - [Gandiva][C++] Implementing Split part function in gandiva ARROW-10024 - [C++][Parquet] Create nested reading benchmarks ARROW-10028 - [Rust] Simplify macro def_numeric_from_vec ARROW-10030 - [Rust] Support fromIter and toIter ARROW-10035 - [C++] Bump versions of vendored code ARROW-10037 - [C++] Workaround to force find AWS SDK to look for shared libraries ARROW-10040 - [Rust] Create a way to slice unalligned offset buffers ARROW-10043 - [Rust] [DataFusion] Introduce support for DISTINCT by partially implementing COUNT(DISTINCT) ARROW-10044 - [Rust] Improve README ARROW-10046 - [Rust] [DataFusion] Made `*Iterator` implement Iterator ARROW-10050 - [C++][Gandiva] Implement concat() in Gandiva for up to 10 arguments ARROW-10051 - [C++][Compute] Make aggregate kernel merge state mutable ARROW-10054 - [Python] Slice methods should return empty arrays instead of crashing ARROW-10055 - [Rust] Implement DoubleEndedIterator for NullableIter ARROW-10057 - [C++] Add Parquet-Arrow roundtrip tests for nested data ARROW-10058 - [C++] Investigate performance of LevelsToBitmap without BMI2 ARROW-10059 - [R][Doc] Give more advice on how to set up C++ build ARROW-10063 - [Archery][CI] Fetch main branch in 
archery build only when it is a pull request ARROW-10064 - [C++] Resolve compile warnings on Apple Clang 12 ARROW-10065 - [Rust] DRY downcasted Arrays ARROW-10066 - [C++] Make sure that default AWS region is respected ARROW-10068 - [C++] Add bundled external project for aws-sdk-cpp ARROW-10069 - [Java] Support running Java benchmarks from command line ARROW-10070 - [C++][Compute] Implement stdev aggregate kernel ARROW-10071 - [R] segfault with ArrowObject from previous session, or saved ARROW-10074 - [C++] Don’t use string_view.to_string() ARROW-10075 - [C++] Don’t use nonstd::nullopt this breaks out vendoring abstraction. ARROW-10076 - [C++] Use TemporaryDir for all tests that don’t already use it. ARROW-10077 - [C++] Potential overflow in bit_stream_utils.h multiplication. ARROW-10083 - [C++] Improve Parquet fuzz seed corpus ARROW-10084 - [Rust] [DataFusion] Add length of large string array ARROW-10086 - [Rust] Migrate min_large_string -&gt; min_string kernels ARROW-10090 - [C++][Compute] Improve mode kernel ARROW-10092 - [Dev][Go] Add grpc generated go files to rat exclusion list ARROW-10093 - [R] Add ability to opt-out of int64 -&gt; int demotion ARROW-10095 - [Rust] [Parquet] Update for IPC changes ARROW-10096 - [Rust] [DataFusion] Remove unused code ARROW-10099 - [C++][Dataset] Also allow integer partition fields to be dictionary encoded ARROW-10100 - [C++][Dataset] Ability to read/subset a ParquetFileFragment with given set of row group ids ARROW-10102 - [C++] Generalize BasicDecimal128::operator*= for reuse in Decimal256 ARROW-10103 - [Rust] Add a Contains kernel ARROW-10105 - [FlightRPC] Add client option to disable certificate validation with TLS ARROW-10120 - [C++][Parquet] Create reading benchmarks for 2-level nested data ARROW-10127 - [Format] Update specification to support 256-bit Decimal types ARROW-10129 - [Rust] Cargo build is rebuilding dependencies on arrow changes ARROW-10134 - [C++][Dataset] Add ParquetFileFragment::num_row_groups property 
ARROW-10139 - [C++] Add support for building arrow_testing without building tests ARROW-10148 - [Rust] Add documentation to lib.rs ARROW-10151 - [Python] Add support MapArray to_pandas conversion ARROW-10155 - [Rust] [DataFusion] Add documentation to lib.rs ARROW-10156 - [Rust] Auto-label PRs ARROW-10157 - [Rust] Add more documentation about take ARROW-10160 - [Rust] Improve documentation of DictionaryType ARROW-10161 - [Rust] [DataFusion] Simplify expression tests ARROW-10162 - [Rust] Support display of DictionaryArrays in pretty printing ARROW-10164 - [Rust] Add support for DictionaryArray types to cast kernels ARROW-10167 - [Rust] Support display of DictionaryArrays in sql.rs ARROW-10168 - [Rust] [Parquet] Extend arrow schema conversion to projected fields ARROW-10171 - [Rust] [DataFusion] Add `ExecutionContext::from&lt;ExecutionContextState&gt;` ARROW-10190 - [Website] Add Jorge to list of committers ARROW-10191 - [Rust] [Parquet] Add roundtrip tests for single column batches ARROW-10196 - [C++] Add Future::DeferNotOk() ARROW-10199 - [Rust][Parquet] Release Parquet at crates.io to remove debug prints ARROW-10201 - [C++][CI] Disable S3 in arm64 job on Travis CI ARROW-10202 - [CI][Windows] Use sf.net mirror for MSYS2 ARROW-10205 - [Java][FlightRPC] Add client option to disable server verification ARROW-10206 - [Python][C++][FlightRPC] Add client option to disable server validation ARROW-10215 - [Rust] [DataFusion] Rename “Source” typedef ARROW-10217 - [CI] Run fewer GitHub Actions jobs ARROW-10225 - [Rust] [Parquet] Fix null bitmap comparisons in roundtrip tests ARROW-10227 - [Ruby] Use a table size as the default for parquet chunk_size ARROW-10229 - [C++][Parquet] Remove left over ARROW_LOG statement. 
ARROW-10231 - [CI] Unable to download minio in arm32v7 docker image ARROW-10233 - [Rust] Make array_value_to_string available in all Arrow builds ARROW-10235 - [Rust][DataFusion] Improve documentation for type coercion ARROW-10240 - [Rust] [Datafusion] Optionally load tpch data into memory before running benchmark query ARROW-10251 - [Rust] [DataFusion] MemTable::load() should load partitions in parallel ARROW-10252 - [Python] Add option to skip inclusion of Arrow headers in Python installation ARROW-10256 - [C++][Flight] Disable -Werror carefully ARROW-10257 - [R] Prepare news/docs for 2.0 release ARROW-10260 - [Python] Missing MapType to Pandas dtype ARROW-10263 - [C++][Compute] Improve numerical stability of variances merging ARROW-10265 - [CI] Use smaler build when cache doesn’t exit on Travis CI ARROW-10266 - [CI][macOS] Ensure using Python 3.8 with Homebrew ARROW-10267 - [Python] Skip flight test if disable_server_verification feature is not available ARROW-10272 - [Packaging][Python] Pin newer multibuild version to avoid updating homebrew ARROW-10273 - [CI][Homebrew] Fix “brew audit” usage ARROW-10287 - [C++] Avoid std::random_device whenever possible ARROW-10289 - [Rust] Support reading dictionary streams ARROW-10295 - [Rust] [DataFusion] Simplify accumulators ARROW-10310 - [C++][Gandiva] Add single argument round() in Gandiva PARQUET-1845 - [C++] Int96 memory images in test cases assume only little-endian PARQUET-1878 - [C++] lz4 codec is not compatible with Hadoop Lz4Codec PARQUET-1904 - [C++] Export file_offset in RowGroupMetaData","headline":"Apache Arrow 2.0.0 Release","image":"https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png","mainEntityOfPage":{"@type":"WebPage","@id":"https://arrow.apache.org/release/2.0.0.html"},"publisher":{"@type":"Organization","logo":{"@type":"ImageObject","url":"https://arrow.apache.org/img/logo.png"}},"url":"https://arrow.apache.org/release/2.0.0.html"}</script>
<!-- End Jekyll SEO tag -->
<!-- favicons -->
<link rel="icon" type="image/png" sizes="16x16" href="/img/favicon-16x16.png" id="light1">
<link rel="icon" type="image/png" sizes="32x32" href="/img/favicon-32x32.png" id="light2">
<link rel="apple-touch-icon" type="image/png" sizes="180x180" href="/img/apple-touch-icon.png" id="light3">
<link rel="apple-touch-icon" type="image/png" sizes="120x120" href="/img/apple-touch-icon-120x120.png" id="light4">
<link rel="apple-touch-icon" type="image/png" sizes="76x76" href="/img/apple-touch-icon-76x76.png" id="light5">
<link rel="apple-touch-icon" type="image/png" sizes="60x60" href="/img/apple-touch-icon-60x60.png" id="light6">
<!-- dark mode favicons -->
<link rel="icon" type="image/png" sizes="16x16" href="/img/favicon-16x16-dark.png" id="dark1">
<link rel="icon" type="image/png" sizes="32x32" href="/img/favicon-32x32-dark.png" id="dark2">
<link rel="apple-touch-icon" type="image/png" sizes="180x180" href="/img/apple-touch-icon-dark.png" id="dark3">
<link rel="apple-touch-icon" type="image/png" sizes="120x120" href="/img/apple-touch-icon-120x120-dark.png" id="dark4">
<link rel="apple-touch-icon" type="image/png" sizes="76x76" href="/img/apple-touch-icon-76x76-dark.png" id="dark5">
<link rel="apple-touch-icon" type="image/png" sizes="60x60" href="/img/apple-touch-icon-60x60-dark.png" id="dark6">
<script>
// Switch to the dark-mode favicons if prefers-color-scheme: dark
function onUpdate() {
light1 = document.querySelector('link#light1');
light2 = document.querySelector('link#light2');
light3 = document.querySelector('link#light3');
light4 = document.querySelector('link#light4');
light5 = document.querySelector('link#light5');
light6 = document.querySelector('link#light6');
dark1 = document.querySelector('link#dark1');
dark2 = document.querySelector('link#dark2');
dark3 = document.querySelector('link#dark3');
dark4 = document.querySelector('link#dark4');
dark5 = document.querySelector('link#dark5');
dark6 = document.querySelector('link#dark6');
if (matcher.matches) {
light1.remove();
light2.remove();
light3.remove();
light4.remove();
light5.remove();
light6.remove();
document.head.append(dark1);
document.head.append(dark2);
document.head.append(dark3);
document.head.append(dark4);
document.head.append(dark5);
document.head.append(dark6);
} else {
dark1.remove();
dark2.remove();
dark3.remove();
dark4.remove();
dark5.remove();
dark6.remove();
document.head.append(light1);
document.head.append(light2);
document.head.append(light3);
document.head.append(light4);
document.head.append(light5);
document.head.append(light6);
}
}
matcher = window.matchMedia('(prefers-color-scheme: dark)');
matcher.addListener(onUpdate);
onUpdate();
</script>
<link rel="stylesheet" href="//fonts.googleapis.com/css?family=Lato:300,300italic,400,400italic,700,700italic,900">
<link href="/css/main.css" rel="stylesheet">
<link href="/css/syntax.css" rel="stylesheet">
<script src="/javascript/main.js"></script>
<!-- Matomo -->
<script>
var _paq = window._paq = window._paq || [];
/* tracker methods like "setCustomDimension" should be called before "trackPageView" */
/* We explicitly disable cookie tracking to avoid privacy issues */
_paq.push(['disableCookies']);
_paq.push(['trackPageView']);
_paq.push(['enableLinkTracking']);
(function() {
var u="https://analytics.apache.org/";
_paq.push(['setTrackerUrl', u+'matomo.php']);
_paq.push(['setSiteId', '20']);
var d=document, g=d.createElement('script'), s=d.getElementsByTagName('script')[0];
g.async=true; g.src=u+'matomo.js'; s.parentNode.insertBefore(g,s);
})();
</script>
<!-- End Matomo Code -->
</head>
<body class="wrap">
<header>
<nav class="navbar navbar-expand-md navbar-dark bg-dark">
<a class="navbar-brand no-padding" href="/"><img src="/img/arrow-inverse-300px.png" height="40px"/></a>
<button class="navbar-toggler ml-auto" type="button" data-toggle="collapse" data-target="#arrow-navbar" aria-controls="arrow-navbar" aria-expanded="false" aria-label="Toggle navigation">
<span class="navbar-toggler-icon"></span>
</button>
<!-- Collect the nav links, forms, and other content for toggling -->
<div class="collapse navbar-collapse justify-content-end" id="arrow-navbar">
<ul class="nav navbar-nav">
<li class="nav-item"><a class="nav-link" href="/overview/" role="button" aria-haspopup="true" aria-expanded="false">Overview</a></li>
<li class="nav-item"><a class="nav-link" href="/faq/" role="button" aria-haspopup="true" aria-expanded="false">FAQ</a></li>
<li class="nav-item"><a class="nav-link" href="/blog" role="button" aria-haspopup="true" aria-expanded="false">Blog</a></li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownGetArrow" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Get Arrow
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownGetArrow">
<a class="dropdown-item" href="/install/">Install</a>
<a class="dropdown-item" href="/release/">Releases</a>
<a class="dropdown-item" href="https://github.com/apache/arrow">Source Code</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownDocumentation" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Documentation
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownDocumentation">
<a class="dropdown-item" href="/docs">Project Docs</a>
<a class="dropdown-item" href="/docs/format/Columnar.html">Format</a>
<hr/>
<a class="dropdown-item" href="/docs/c_glib">C GLib</a>
<a class="dropdown-item" href="/docs/cpp">C++</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/csharp/README.md">C#</a>
<a class="dropdown-item" href="https://godoc.org/github.com/apache/arrow/go/arrow">Go</a>
<a class="dropdown-item" href="/docs/java">Java</a>
<a class="dropdown-item" href="/docs/js">JavaScript</a>
<a class="dropdown-item" href="/julia/">Julia</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/matlab/README.md">MATLAB</a>
<a class="dropdown-item" href="/docs/python">Python</a>
<a class="dropdown-item" href="/docs/r">R</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/ruby/README.md">Ruby</a>
<a class="dropdown-item" href="https://docs.rs/arrow/latest">Rust</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownSubprojects" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Subprojects
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownSubprojects">
<a class="dropdown-item" href="/adbc">ADBC</a>
<a class="dropdown-item" href="/docs/format/Flight.html">Arrow Flight</a>
<a class="dropdown-item" href="/docs/format/FlightSql.html">Arrow Flight SQL</a>
<a class="dropdown-item" href="https://datafusion.apache.org">DataFusion</a>
<a class="dropdown-item" href="/nanoarrow">nanoarrow</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownCommunity" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Community
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownCommunity">
<a class="dropdown-item" href="/community/">Communication</a>
<a class="dropdown-item" href="/docs/developers/index.html">Contributing</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/issues">Issue Tracker</a>
<a class="dropdown-item" href="/committers/">Governance</a>
<a class="dropdown-item" href="/use_cases/">Use Cases</a>
<a class="dropdown-item" href="/powered_by/">Powered By</a>
<a class="dropdown-item" href="/visual_identity/">Visual Identity</a>
<a class="dropdown-item" href="/security/">Security</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/policies/conduct.html">Code of Conduct</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownASF" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
ASF Links
</a>
<div class="dropdown-menu dropdown-menu-right" aria-labelledby="navbarDropdownASF">
<a class="dropdown-item" href="https://www.apache.org/">ASF Website</a>
<a class="dropdown-item" href="https://www.apache.org/licenses/">License</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/sponsorship.html">Donate</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/thanks.html">Thanks</a>
<a class="dropdown-item" href="https://www.apache.org/security/">Security</a>
</div>
</li>
</ul>
</div><!-- /.navbar-collapse -->
</nav>
</header>
<div class="container p-4 pt-5">
<main role="main" class="pb-5">
<h1 id="apache-arrow-200-19-october-2020">Apache Arrow 2.0.0 (19 October 2020)</h1>
<p>This is a major release covering more than 3 months of development.</p>
<h2 id="download">Download</h2>
<ul>
<li><a href="https://www.apache.org/dyn/closer.lua/arrow/arrow-2.0.0/"><strong>Source Artifacts</strong></a></li>
<li><strong>Binary Artifacts</strong>
<ul>
<li><a href="https://apache.jfrog.io/artifactory/arrow/centos/">For CentOS</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/debian/">For Debian</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/python/2.0.0/">For Python</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/ubuntu/">For Ubuntu</a></li>
</ul>
</li>
<li><a href="https://github.com/apache/arrow/releases/tag/apache-arrow-2.0.0">Git tag</a></li>
</ul>
<h2 id="contributors">Contributors</h2>
<p>This release includes 511 commits from 81 distinct contributors.</p>
<div class="language-console highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="gp">$</span><span class="w"> </span>git shortlog <span class="nt">-sn</span> apache-arrow-1.0.0..apache-arrow-2.0.0
<span class="go"> 68 Jorge C. Leitao
48 Antoine Pitrou
40 Krisztián Szűcs
34 alamb
33 Neal Richardson
30 Andy Grove
25 Benjamin Kietzman
25 Joris Van den Bossche
19 Sutou Kouhei
13 Uwe L. Korn
12 Micah Kornfield
10 Frank Du
10 Jörn Horstmann
9 Neville Dipale
9 Romain Francois
9 arw2019
8 Yibo Cai
8 liyafan82
7 Sagnik Chakraborty
6 David Li
5 Kazuaki Ishizaki
5 Mahmut Bulut
4 Mingyu Zhong
4 fredgan
3 Bryan Cutler
3 wqc200
2 Daniel Russo
2 Diana Clarke
2 James Duong
2 Kenta Murata
2 Patrick Woody
2 Projjal Chanda
2 naman1996
2 ptaylor
2 tianchen
1 Adam Szmigin
1 Ali McMaster
1 Andrew Stevenson
1 Ben Kimock
1 Brian Dunlay
1 Christoph Schulze
1 Derek Marsh
1 Dominik Moritz
1 Eric Erhardt
1 Ezra
1 Fernando José Herrera Elizalde
1 FredGan
1 Hongze Zhang
1 Jim Klucar
1 Josiah
1 Kyle Strand
1 Laurent Goujon
1 Lawrence Chan
1 Mark Rushakoff
1 Matt Corley
1 Matthew Topol
1 Matthias
1 Morgan Cassels
1 Ofek
1 Patrick Pai
1 Paul
1 PoojaChandak
1 Prashanth Govindarajan
1 Pratik raj
1 Revital Sur
1 Ruan Pearce-Authers
1 Ryan Murray
1 Simon Bertron
1 Steve Suh
1 Tanguy Fautre
1 Tobias Mayer
1 Troels Nielsen
1 Vivian Kong
1 Wes McKinney
1 Xavier Lange
1 Yordan Pavlov
1 kanga333
1 karldw
1 mubai
1 offthewall123
1 zanmato1984
</span></code></pre></div></div>
<h2 id="patch-committers">Patch Committers</h2>
<p>The following Apache committers merged contributed patches to the repository.</p>
<div class="language-console highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="gp">$</span><span class="w"> </span>git shortlog <span class="nt">-csn</span> apache-arrow-1.0.0..apache-arrow-2.0.0
<span class="go"> 127 Andy Grove
92 Antoine Pitrou
56 Krisztián Szűcs
51 Neal Richardson
44 Sutou Kouhei
18 Joris Van den Bossche
18 Micah Kornfield
17 Benjamin Kietzman
17 Wes McKinney
16 Neville Dipale
12 Jorge C. Leitao
10 Praveen
7 Paddy Horan
4 David Li
4 Eric Erhardt
4 Sebastien Binet
4 Uwe L. Korn
4 liyafan82
3 GitHub
1 Bryan Cutler
1 Chao Sun
1 tianchen
</span></code></pre></div></div>
<h2 id="changelog">Changelog</h2>
<h2 id="apache-arrow-200-2020-10-19">Apache Arrow 2.0.0 (2020-10-19)</h2>
<h3 id="bug-fixes">Bug Fixes</h3>
<ul>
<li><a href="https://issues.apache.org/jira/browse/ARROW-2367">ARROW-2367</a> - [Python] ListArray has trouble with sizes greater than kMaximumCapacity</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4189">ARROW-4189</a> - [CI] [Rust] Fix broken cargo coverage</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4917">ARROW-4917</a> - [C++] orc_ep fails in cpp-alpine docker</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5578">ARROW-5578</a> - [C++][Flight] Flight does not build out of the box on Alpine Linux</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7226">ARROW-7226</a> - [JSON][Python] Json loader fails on example in documentation.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7384">ARROW-7384</a> - [Website] Fix search indexing warning reported by Google</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7517">ARROW-7517</a> - [C++] Builder does not honour dictionary type provided during initialization</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7663">ARROW-7663</a> - [Python] from_pandas gives TypeError instead of ArrowTypeError in some cases</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7903">ARROW-7903</a> - [Rust] [DataFusion] Upgrade SQLParser dependency for DataFusion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7957">ARROW-7957</a> - [Python] ParquetDataset cannot take HadoopFileSystem as filesystem</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8265">ARROW-8265</a> - [Rust] [DataFusion] Table API collect() should not require context</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8394">ARROW-8394</a> - [JS] Typescript compiler errors for arrow d.ts files, when using es2015-esm package</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8735">ARROW-8735</a> - [Rust] [Parquet] Parquet crate fails to compile on Arm architecture</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8749">ARROW-8749</a> - [C++] IpcFormatWriter writes dictionary batches with wrong ID</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8773">ARROW-8773</a> - [Python] pyarrow schema.empty_table() does not preserve nullability of fields</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9028">ARROW-9028</a> - [R] Should be able to convert an empty table</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9096">ARROW-9096</a> - [Python] Pandas roundtrip with object-dtype column labels with integer values: data type “integer” not understood</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9177">ARROW-9177</a> - [C++][Parquet] Tracking issue for cross-implementation LZ4 Parquet compression compatibility</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9414">ARROW-9414</a> - [C++] apt package includes headers for S3 interface, but no support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9462">ARROW-9462</a> - [Go] The Indentation after the first Record arrjson writer is missing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9463">ARROW-9463</a> - [Go] The writer is double closed in TestReadWrite</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9490">ARROW-9490</a> - [Python] pyarrow array creation for specific set of numpy scalars fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9495">ARROW-9495</a> - [C++] Equality assertions don’t handle Inf / -Inf properly</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9520">ARROW-9520</a> - [Rust] [DataFusion] Can’t alias an aggregate expression</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9528">ARROW-9528</a> - [Python] Honor tzinfo information when converting from datetime to pyarrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9532">ARROW-9532</a> - [Python] Building pyarrow for MacPorts on macOS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9535">ARROW-9535</a> - [Python] Remove symlink fixes from conda recipe</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9536">ARROW-9536</a> - Missing parameters in PlasmaOutOfMemoryException.java</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9541">ARROW-9541</a> - [C++] CMakeLists requires UTF8PROC_STATIC when building static library</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9544">ARROW-9544</a> - [R] version argument of write_parquet not working</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9546">ARROW-9546</a> - [Python] Clean up Pandas Metadata Conversion test</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9548">ARROW-9548</a> - [Go] Test output files in tmp directory are not removed correctly</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9549">ARROW-9549</a> - [Rust] Parquet no longer builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9554">ARROW-9554</a> - [Java] FixedWidthInPlaceVectorSorter sometimes produces wrong result</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9556">ARROW-9556</a> - [Python][C++] Segfaults in UnionArray with null values</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9560">ARROW-9560</a> - [Packaging] conda recipes failing due to missing conda-forge.yml</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9569">ARROW-9569</a> - [CI][R] Fix rtools35 builds for msys2 key change</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9570">ARROW-9570</a> - [Doc] Clean up sphinx sidebar</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9573">ARROW-9573</a> - [Python] Parquet doesn’t load when partitioned column starts with ‘_’</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9574">ARROW-9574</a> - [R] Cleanups for CRAN 1.0.0 release</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9575">ARROW-9575</a> - [R] gcc-UBSAN failure on CRAN</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9577">ARROW-9577</a> - [Python][C++] posix_madvise error on Debian in pyarrow 1.0.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9583">ARROW-9583</a> - [Rust] Offset is mishandled in arithmetic and boolean compute kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9588">ARROW-9588</a> - [C++] clang/win: Copy constructor of ParquetInvalidOrCorruptedFileException not correctly triggered</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9589">ARROW-9589</a> - [C++/R] arrow_exports.h contains structs declared as class</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9592">ARROW-9592</a> - [CI] Update homebrew before calling brew bundle</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9596">ARROW-9596</a> - [CI][Crossbow] Fix homebrew-cpp again, again</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9597">ARROW-9597</a> - [C++] AddAlias in compute::FunctionRegistry should be synchronized</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9598">ARROW-9598</a> - [C++][Parquet] Spaced definition levels is not assigned correctly.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9599">ARROW-9599</a> - [CI] Appveyor toolchain build fails because CMake detects different C and C++ compilers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9600">ARROW-9600</a> - [Rust] When used as a crate dependency, arrow-flight is rebuilt on every invocation of cargo build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9602">ARROW-9602</a> - [R] Improve cmake detection in Linux build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9603">ARROW-9603</a> - [C++][Parquet] Write Arrow relies on unspecified behavior for nested types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9606">ARROW-9606</a> - [C++][Dataset] in expressions don’t work with &gt;1 partition levels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9609">ARROW-9609</a> - [C++] CSV datasets don’t materialize virtual columns</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9621">ARROW-9621</a> - [Python] test_move_file() is failed with fsspec 0.8.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9622">ARROW-9622</a> - [Java] ComplexCopier fails if a structvector has a child UnionVector with nulls</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9628">ARROW-9628</a> - [Rust] Clippy PR test failing intermittently on Rust / AMD64 MacOS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9629">ARROW-9629</a> - [Python] Kartothek integration tests failing due to missing freezegun module</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9631">ARROW-9631</a> - [Rust] Arrow crate should not depend on flight</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9642">ARROW-9642</a> - [C++] Let MakeBuilder refer DictionaryType’s index_type for deciding the starting bit width of the indices</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9643">ARROW-9643</a> - [C++] Illegal instruction on haswell cpu</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9644">ARROW-9644</a> - [C++][Dataset] Do not check for ignore_prefixes in the base path</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9652">ARROW-9652</a> - [Rust][DataFusion] Panic trying to select * from a CSV (panicked at ‘index out of bounds: the len is 0 but the index is 0)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9653">ARROW-9653</a> - [Rust][DataFusion] Multi-column Group by: Invalid Argument Error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9659">ARROW-9659</a> - [C++] RecordBatchStreamReader throws on CUDA device buffers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9660">ARROW-9660</a> - [C++] IPC - dictionaries in maps</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9666">ARROW-9666</a> - [Python][wheel][Windows] library missing failure by ARROW-9412</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9670">ARROW-9670</a> - [C++][FlightRPC] Close()ing a DoPut with an ongoing read locks up the client</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9684">ARROW-9684</a> - [C++] Fix undefined behaviour on invalid IPC / Parquet input (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9692">ARROW-9692</a> - [Python] distutils import warning</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9693">ARROW-9693</a> - [CI][Docs] Nightly docs build fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9696">ARROW-9696</a> - [Rust] [Datafusion] nested binary expressions broken</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9698">ARROW-9698</a> - [C++] Revert “Add -NDEBUG flag to arrow.pc”</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9700">ARROW-9700</a> - [Python] create_library_symlinks doesn’t work in macos</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9712">ARROW-9712</a> - [Rust] [DataFusion] ParquetScanExec panics on error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9714">ARROW-9714</a> - [Rust] [DataFusion] TypeCoercionRule not implemented for Limit or Sort</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9716">ARROW-9716</a> - [Rust] [DataFusion] MergeExec should have concurrency limit</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9726">ARROW-9726</a> - [Rust] [DataFusion] ParquetScanExec launches threads too early</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9727">ARROW-9727</a> - [C++] Fix crash on invalid IPC input (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9729">ARROW-9729</a> - [Java] Error Prone causes other annotation processors to not work with Eclipse</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9733">ARROW-9733</a> - [Rust][DataFusion] Aggregates COUNT/MIN/MAX don’t work on VARCHAR columns</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9734">ARROW-9734</a> - [Rust] [DataFusion] TableProvider.scan executing partitions prematurely</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9741">ARROW-9741</a> - [Rust] [DataFusion] Incorrect count in TPC-H query 1 result set</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9743">ARROW-9743</a> - [R] Sanitize paths in open_dataset</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9744">ARROW-9744</a> - [Python] Failed to install on aarch64</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9764">ARROW-9764</a> - [CI][Java] Push wrong Docker image</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9768">ARROW-9768</a> - [Python] Pyarrow allows for unsafe conversions of datetime objects to timestamp nanoseconds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9778">ARROW-9778</a> - [Rust] [DataFusion] Logical and physical schemas’ nullability does not match in 8 out of 20 end-to-end tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9783">ARROW-9783</a> - [Rust] [DataFusion] Logical aggregate expressions require explicit data type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9785">ARROW-9785</a> - [Python] pyarrow/tests/test_fs.py::test_s3_options too slow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9789">ARROW-9789</a> - [C++] Don’t install jemalloc in parallel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9790">ARROW-9790</a> - [Rust] [Parquet] ParquetFileArrowReader fails to decode all pages if batches fall exactly on row group boundaries</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9793">ARROW-9793</a> - [Rust] [DataFusion] Tests failing in master</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9797">ARROW-9797</a> - [Rust] AMD64 Conda Integration Tests is failing for the Master branch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9799">ARROW-9799</a> - [Rust] [DataFusion] Implementation of physical binary expression get_type method is incorrect</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9800">ARROW-9800</a> - [Rust] [Parquet] “min” and “max” written to standard out when writing columns</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9809">ARROW-9809</a> - [Rust] [DataFusion] logical schema = physical schema is not true</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9814">ARROW-9814</a> - [Python] Crash in test_parquet.py::test_read_partitioned_directory_s3fs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9815">ARROW-9815</a> - [Rust] [DataFusion] Deadlock in creation of physical plan with two udfs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9816">ARROW-9816</a> - [C++] Escape quotes in config.h</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9827">ARROW-9827</a> - [Python] pandas.read_parquet fails for wide parquet files and pyarrow 1.0.X</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9831">ARROW-9831</a> - [Rust] [DataFusion] Fix compilation error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9840">ARROW-9840</a> - [Python] Python fs documentation out of date with code</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9846">ARROW-9846</a> - [Rust] Master branch broken build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9851">ARROW-9851</a> - [C++] Valgrind errors due to unrecognized instructions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9852">ARROW-9852</a> - [C++] Fix crash on invalid IPC input (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9855">ARROW-9855</a> - [R] Fix bad merge/Rcpp conflict</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9859">ARROW-9859</a> - [C++] S3 FileSystemFromUri with special char in secret key fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9864">ARROW-9864</a> - [Python] pathlib.Path not supported in write_to_dataset with partition columns</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9874">ARROW-9874</a> - [C++] NewStreamWriter / NewFileWriter don’t own output stream</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9876">ARROW-9876</a> - [CI][C++] Travis ARM jobs timeout</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9877">ARROW-9877</a> - [C++][CI] homebrew-cpp fails due to avx512</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9879">ARROW-9879</a> - [Python] ChunkedArray.__getitem__ doesn’t work with numpy scalars</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9882">ARROW-9882</a> - [C++/Python] Update conda-forge-pinning to 3 for OSX conda packages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9883">ARROW-9883</a> - [R] Fix linuxlibs.R install script for R &lt; 3.6</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9888">ARROW-9888</a> - [Rust] [DataFusion] ExecutionContext can not be shared between threads</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9889">ARROW-9889</a> - [Rust][DataFusion] Datafusion CLI: CREATE EXTERNAL TABLE errors with “Unsupported logical plan variant”</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9897">ARROW-9897</a> - [C++][Gandiva] Add to_date() function from pattern</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9898">ARROW-9898</a> - [C++][Gandiva] Error handling in castINT fails in some environments</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9906">ARROW-9906</a> - [Python] Crash in test_parquet.py::test_parquet_writer_filesystem_s3_uri (closing NativeFile from S3FileSystem)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9913">ARROW-9913</a> - [C++] Outputs of Decimal128::FromString depend on presence of one another</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9920">ARROW-9920</a> - [Python] pyarrow.concat_arrays segfaults when passing it a chunked array</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9922">ARROW-9922</a> - [Rust] Add `try_from(Vec&lt;Option&lt;(&amp;str, ArrayRef)&gt;&gt;)` to StructArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9924">ARROW-9924</a> - [Python] Performance regression reading individual Parquet files using Dataset interface</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9931">ARROW-9931</a> - [C++] Fix undefined behaviour on invalid IPC (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9932">ARROW-9932</a> - [R] Arrow 1.0.1 R package fails to install on R3.4 over linux</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9936">ARROW-9936</a> - [Python] Fix / test relative file paths in pyarrow.parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9937">ARROW-9937</a> - [Rust] [DataFusion] Average is not correct</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9943">ARROW-9943</a> - [C++] Arrow metadata not applied recursively when reading Parquet file</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9946">ARROW-9946</a> - [R] ParquetFileWriter segfaults when `sink` is a string</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9953">ARROW-9953</a> - [R] Declare minimum version for bit64</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9962">ARROW-9962</a> - [Python] Conversion to pandas with index column using fixed timezone fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9968">ARROW-9968</a> - [C++] UBSAN link failure with __int8_t</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9969">ARROW-9969</a> - [C++] RecordBatchBuilder yields invalid result with dictionary fields</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9970">ARROW-9970</a> - [Go] checkptr failures in sum methods</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9972">ARROW-9972</a> - [CI] Work around grpc-re2 clash on Homebrew</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9973">ARROW-9973</a> - [Java] JDBC DateConsumer does not allow dates before epoch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9976">ARROW-9976</a> - [Python] ArrowCapacityError when doing Table.from_pandas with large dataframe</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9990">ARROW-9990</a> - [Rust] [DataFusion] NOT is not plannable</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9993">ARROW-9993</a> - [Python] Tzinfo - string roundtrip fails on pytz.StaticTzInfo objects</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9994">ARROW-9994</a> - [C++][Python] Auto chunking nested array containing binary-like fields results in malformed output</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9996">ARROW-9996</a> - [C++] Dictionary is unset when calling DictionaryArray.GetScalar for null values</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10003">ARROW-10003</a> - [C++] Create directories in CopyFiles when copying within the same filesystem</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10008">ARROW-10008</a> - [Python] pyarrow.parquet.read_table fails with predicate pushdown on categorical data with use_legacy_dataset=False</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10011">ARROW-10011</a> - [C++] Make FindRE2.cmake re-entrant</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10012">ARROW-10012</a> - [C++] Sporadic failures in CopyFiles test</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10013">ARROW-10013</a> - [C++][CI] Flight test failure in TestFlightClient.GenericOptions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10017">ARROW-10017</a> - [Java] LargeMemoryUtil.checkedCastToInt has buggy logic</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10022">ARROW-10022</a> - [C++] [Compute] core dumped on some scalar-arithmetic-benchmark</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10027">ARROW-10027</a> - [Python] Incorrect null column returned when using a dataset filter expression.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10034">ARROW-10034</a> - [Rust] Master build broken</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10041">ARROW-10041</a> - [Rust] Possible to create LargeStringArray with DataType::Utf8</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10047">ARROW-10047</a> - [CI] Conda integration tests failing with cmake error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10048">ARROW-10048</a> - [Rust] Error in aggregate of min/max for strings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10049">ARROW-10049</a> - [C++/Python] Sync conda recipe with conda-forge</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10060">ARROW-10060</a> - [Rust] [DataFusion] MergeExec currently discards partitions with errors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10062">ARROW-10062</a> - [Rust]: Fix for null elems for DoubleEndedIter for DictArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10073">ARROW-10073</a> - [Python] Test test_parquet_nested_storage relies on dict item ordering</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10081">ARROW-10081</a> - [C++/Python] Fix bash syntax in drone.io conda builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10085">ARROW-10085</a> - [C++] S3 tests fail on AppVeyor</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10087">ARROW-10087</a> - [CI] Fix nightly docs job</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10098">ARROW-10098</a> - [R][Doc] Fix copy_files doc mismatch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10104">ARROW-10104</a> - [Python] Separate tests into its own conda package</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10114">ARROW-10114</a> - [R] Segfault in to_dataframe_parallel with deeply nested structs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10116">ARROW-10116</a> - [Python][Packaging] Fix gRPC linking error in macOS wheels builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10119">ARROW-10119</a> - [C++] Fix Parquet crashes on invalid input (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10121">ARROW-10121</a> - [C++][Python] Variable dictionaries do not survive roundtrip to IPC stream</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10124">ARROW-10124</a> - [R] Write functions don’t follow umask setting</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10125">ARROW-10125</a> - [R] Int64 downcast check doesn’t consider all chunks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10130">ARROW-10130</a> - [C++][Dataset] ParquetFileFragment::SplitByRowGroup does not preserve “complete_metadata” status</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10136">ARROW-10136</a> - [Rust][Arrow] Nulls are transformed into “” after filtering for StringArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10137">ARROW-10137</a> - [R] Fix cpp helper that breaks if libarrow is not present</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10147">ARROW-10147</a> - [Python] Constructing pandas metadata fails if an Index name is not JSON-serializable by default</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10150">ARROW-10150</a> - [C++] Fix crashes on invalid Parquet file (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10169">ARROW-10169</a> - [Rust] Nulls should be rendered as “” rather than default value when pretty printing arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10174">ARROW-10174</a> - [Java] Reading of Dictionary encoded struct vector fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10175">ARROW-10175</a> - [CI] Nightly hdfs integration test job fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10176">ARROW-10176</a> - [CI] Nightly valgrind job fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10178">ARROW-10178</a> - [CI] Fix spark master integration test build setup</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10179">ARROW-10179</a> - [Rust] Labeler is not labeling</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10181">ARROW-10181</a> - [Rust] Arrow tests fail to compile on Raspberry Pi (32 bit)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10188">ARROW-10188</a> - [Rust] [DataFusion] Some examples are broken</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10189">ARROW-10189</a> - [Doc] C data interface example for i32 uses `l`, not `i`, in the format</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10192">ARROW-10192</a> - [C++][Python] Segfault when converting nested struct array with dictionary field to pandas series</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10193">ARROW-10193</a> - [Python] Segfault when converting to fixed size binary array</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10200">ARROW-10200</a> - [Java][CI] Fix failure of Java CI on s390x</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10204">ARROW-10204</a> - [RUST] [Datafusion] Test failure in aggregate_grouped_empty with simd feature enabled</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10214">ARROW-10214</a> - [Python] UnicodeDecodeError when printing schema with binary metadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10226">ARROW-10226</a> - [Rust] [Parquet] Parquet reader reading wrong columns in some batches within a parquet file</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10230">ARROW-10230</a> - [JS][Doc] JavaScript documentation fails to build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10232">ARROW-10232</a> - FixedSizeListArray is incorrectly written/read to/from parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10234">ARROW-10234</a> - [C++][Gandiva] Fix logic of round() for floats/decimals in Gandiva</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10237">ARROW-10237</a> - [C++] Duplicate values in a dictionary result in corrupted parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10238">ARROW-10238</a> - [C#] List&lt;Struct&gt; is broken</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10239">ARROW-10239</a> - [C++] aws-sdk-cpp apparently requires zlib too</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10244">ARROW-10244</a> - [Python][Docs] Add docs on using pyarrow.dataset.parquet_dataset</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10248">ARROW-10248</a> - [C++][Dataset] Dataset writing does not write schema metadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10262">ARROW-10262</a> - [C++] Some TypeClass in Scalar classes seem incorrect</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10270">ARROW-10270</a> - [R] Fix CSV timestamp_parsers test on R-devel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10271">ARROW-10271</a> - [Rust] packed_simd is broken and continued under a new project</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10279">ARROW-10279</a> - [Release][Python] Fix verification script to align with the new macos wheel platform tags</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10280">ARROW-10280</a> - [Packaging][Python] Fix macOS wheel artifact patterns</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10281">ARROW-10281</a> - [Python] Fix warnings when running tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10284">ARROW-10284</a> - [Python] Pyarrow is raising deprecation warning about filesystems on import</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10285">ARROW-10285</a> - [Python] pyarrow.orc submodule is using deprecated functionality</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10286">ARROW-10286</a> - [C++][Flight] Misleading CMake errors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10288">ARROW-10288</a> - [C++] Compilation fails on i386</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10290">ARROW-10290</a> - [C++] List POP_BACK is not available in older CMake versions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10293">ARROW-10293</a> - [Rust] [DataFusion] Fix benchmarks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10296">ARROW-10296</a> - [R] Data saved as integer64 loaded as integer</li>
</ul>
<h3 id="new-features-and-improvements">New Features and Improvements</h3>
<ul>
<li><a href="https://issues.apache.org/jira/browse/ARROW-983">ARROW-983</a> - [C++] Implement InputStream and OutputStream classes for interacting with socket connections</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-1105">ARROW-1105</a> - [C++] SQLite record batch reader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-1509">ARROW-1509</a> - [Python] Write serialized object as a stream of encapsulated IPC messages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-1669">ARROW-1669</a> - [C++] Consider adding Abseil (Google C++11 standard library extensions) to toolchain</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-1797">ARROW-1797</a> - [C++] Implement binary arithmetic kernels for numeric arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-2164">ARROW-2164</a> - [C++] Clean up unnecessary decimal module refs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3080">ARROW-3080</a> - [Python] Unify Arrow to Python object conversion paths</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3757">ARROW-3757</a> - [R] R bindings for Flight RPC client</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3872">ARROW-3872</a> - [R] Add ad hoc test of feather compatibility</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4046">ARROW-4046</a> - [Python/CI] Exercise large memory tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4248">ARROW-4248</a> - [C++][Plasma] Build on Windows / Visual Studio</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4685">ARROW-4685</a> - [C++] Update Boost to 1.69 in manylinux1 docker image</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4927">ARROW-4927</a> - [Rust] Update top level README to describe current functionality</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4957">ARROW-4957</a> - [Rust] [DataFusion] Implement get_supertype correctly</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4965">ARROW-4965</a> - [Python] Timestamp array type detection should use tzname of datetime.datetime objects</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5034">ARROW-5034</a> - [C#] ArrowStreamWriter should expose synchronous Write methods</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5123">ARROW-5123</a> - [Rust] derive RecordWriter from struct definitions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6075">ARROW-6075</a> - [FlightRPC] Handle uncaught exceptions in middleware</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6281">ARROW-6281</a> - [Python] Produce chunked arrays for nested types in pyarrow.array</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6282">ARROW-6282</a> - [Format] Support lossy compression</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6437">ARROW-6437</a> - [R] Add AWS SDK to system dependencies for macOS and Windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6535">ARROW-6535</a> - [C++] Status::WithMessage should accept variadic parameters</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6537">ARROW-6537</a> - [R] Pass column_types to CSV reader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6972">ARROW-6972</a> - [C#] Should support StructField arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6982">ARROW-6982</a> - [R] Add bindings for compare and boolean kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7136">ARROW-7136</a> - [Rust][CI] Pre-install the rust dependencies in the dockerfile</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7218">ARROW-7218</a> - [Python] Conversion from boolean numpy scalars not working</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7302">ARROW-7302</a> - [C++] CSV: allow converting a column to a specific dictionary type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7372">ARROW-7372</a> - [C++] Allow creating dictionary array from simple JSON</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7871">ARROW-7871</a> - [Python] Expose more compute kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7960">ARROW-7960</a> - [C++][Parquet] Add support for schema translation from parquet nodes back to arrow for missing types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8001">ARROW-8001</a> - [R][Dataset] Bindings for dataset writing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8002">ARROW-8002</a> - [C++][Dataset] Dataset writing should let you (re)partition the data</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8048">ARROW-8048</a> - [Python] Run memory leak tests nightly as follow up to ARROW-4120</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8172">ARROW-8172</a> - [C++] ArrayFromJSON for dictionary arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8205">ARROW-8205</a> - [Rust] [DataFusion] DataFusion should enforce unique field names in a schema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8253">ARROW-8253</a> - [Rust] [DataFusion] Improve ergonomics of registering UDFs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8262">ARROW-8262</a> - [Rust] [DataFusion] Add example that uses LogicalPlanBuilder</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8289">ARROW-8289</a> - [Rust] [Parquet] Implement minimal Arrow Parquet writer as starting point for full writer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8296">ARROW-8296</a> - [C++][Dataset] IpcFileFormat should support writing files with compressed buffers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8355">ARROW-8355</a> - [Python] Reduce the number of pandas dependent test cases in test_feather</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8359">ARROW-8359</a> - [C++/Python] Enable aarch64/ppc64le build in conda recipes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8383">ARROW-8383</a> - [Rust] Easier random access to DictionaryArray keys and values</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8402">ARROW-8402</a> - [Java] Support ValidateFull methods in Java</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8423">ARROW-8423</a> - [Rust] [Parquet] Serialize arrow schema into metadata when writing parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8426">ARROW-8426</a> - [Rust] [Parquet] Add support for writing dictionary types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8493">ARROW-8493</a> - [C++] Create unified schema resolution code for Array reconstruction.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8494">ARROW-8494</a> - [C++] Implement basic array-by-array reassembly logic</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8581">ARROW-8581</a> - [C#] Date32/64Array.Builder should accept DateTime, not DateTimeOffset</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8601">ARROW-8601</a> - [Go][Flight] Implement Flight Writer interface</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8618">ARROW-8618</a> - [C++] ASSIGN_OR_RAISE should move its argument</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8678">ARROW-8678</a> - [C++][Parquet] Remove legacy arrow to level translation.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8712">ARROW-8712</a> - [R] Expose strptime timestamp parsing in read_csv conversion options</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8774">ARROW-8774</a> - [Rust] [DataFusion] Improve threading model</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8810">ARROW-8810</a> - [R] Add documentation about Parquet format, appending to stream format</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8824">ARROW-8824</a> - [Rust] [DataFusion] Implement new SQL parser</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8828">ARROW-8828</a> - [Rust] Implement SQL tokenizer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8829">ARROW-8829</a> - [Rust] Implement SQL parser</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9010">ARROW-9010</a> - [Java] Framework and interface changes for RecordBatch IPC buffer compression</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9065">ARROW-9065</a> - [C++] Support parsing date32 in dataset partition folders</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9068">ARROW-9068</a> - [C++][Dataset] Simplify Partitioning interface</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9078">ARROW-9078</a> - [C++] Parquet writing of extension type with nested storage type fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9104">ARROW-9104</a> - [C++] Parquet encryption tests should write files to a temporary directory instead of the testing submodule’s directory</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9107">ARROW-9107</a> - [C++][Dataset] Time-based types support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9147">ARROW-9147</a> - [C++][Dataset] Support null -&gt; other type promotion in Dataset scanning</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9205">ARROW-9205</a> - [Documentation] Fix typos in Columnar.rst</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9266">ARROW-9266</a> - [Python][Packaging] Enable S3 support in macOS wheels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9271">ARROW-9271</a> - [R] Preserve data frame metadata in round trip</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9286">ARROW-9286</a> - [C++] Add function “aliases” to compute::FunctionRegistry</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9328">ARROW-9328</a> - [C++][Gandiva] Add LTRIM, RTRIM, BTRIM functions for string</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9338">ARROW-9338</a> - [Rust] Add instructions for running clippy locally</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9344">ARROW-9344</a> - [C++][Flight] measure latency quantile in flight benchmark</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9358">ARROW-9358</a> - [Integration] Reconsider generated_large_batch.json</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9371">ARROW-9371</a> - [Java] Run vector tests for both allocators</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9377">ARROW-9377</a> - [Java] Support unsigned dictionary indices</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9387">ARROW-9387</a> - [R] Use new C++ table select method</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9388">ARROW-9388</a> - [C++] Division kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9394">ARROW-9394</a> - [Python] Support pickling of Scalars</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9398">ARROW-9398</a> - [C++] Register the SIMD sum variants under function instance instead of a SIMD function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9402">ARROW-9402</a> - [C++] Add portable wrappers for __builtin_add_overflow and friends</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9405">ARROW-9405</a> - [R] Switch to cpp11</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9412">ARROW-9412</a> - [C++] Add non-BUNDLED dependencies to exported INSTALL_INTERFACE_LIBS of arrow_static and test that it works</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9429">ARROW-9429</a> - [Python] ChunkedArray.to_numpy</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9454">ARROW-9454</a> - [GLib] Add binding of some dictionary builders</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9465">ARROW-9465</a> - [Python] Improve ergonomics of compute functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9469">ARROW-9469</a> - [Python] Make more objects weakrefable</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9487">ARROW-9487</a> - [Developer] Cover the archery release utilities with unittests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9488">ARROW-9488</a> - [Release] Use the new changelog generation when updating the website</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9507">ARROW-9507</a> - [Rust] [DataFusion] PhysicalExpr should implement Display trait</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9508">ARROW-9508</a> - [Release][APT][Yum] Enable verification for arm64 binaries</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9516">ARROW-9516</a> - [Rust][DataFusion] Refactor physical expressions to not care about their names nor indexes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9517">ARROW-9517</a> - [C++][Python] Allow session_token argument when initializing S3FileSystem</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9518">ARROW-9518</a> - [Python] Deprecate pyarrow serialization</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9521">ARROW-9521</a> - [Rust] CsvReadOptions should allow file extension to be specified</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9523">ARROW-9523</a> - [Rust] improve performance of filter kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9534">ARROW-9534</a> - [Rust] [DataFusion] Implement functions for creating literal expressions for all types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9550">ARROW-9550</a> - [Rust] [DataFusion] Remove Rc&lt;RefCell&lt;_&gt;&gt; from hash aggregate operator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9553">ARROW-9553</a> - [Rust] Release script doesn’t bump parquet crate’s arrow dependency version</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9557">ARROW-9557</a> - [R] Iterating over parquet columns is slow in R</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9559">ARROW-9559</a> - [Rust] [DataFusion] Revert privatization of exprlist_to_fields</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9563">ARROW-9563</a> - [Dev][Release] Use archery’s changelog generator when creating release notes for the website</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9568">ARROW-9568</a> - [CI] Use official msys action on GHA</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9576">ARROW-9576</a> - [Python][Doc] Fix error in code example for extension types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9580">ARROW-9580</a> - [JS] Docs have superfluous ()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9581">ARROW-9581</a> - [Dev][Release] Bump next snapshot versions to 2.0.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9582">ARROW-9582</a> - [Rust] Implement Array::memory_size()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9585">ARROW-9585</a> - [Rust] Remove duplicated to-do line in DataFusion readme</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9587">ARROW-9587</a> - [FlightRPC][Java] Clean up DoPut/FlightStream memory handling</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9593">ARROW-9593</a> - [Python] Add custom pickle reducers for DictionaryScalar</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9604">ARROW-9604</a> - [C++] Add benchmark for aggregate min/max compute kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9605">ARROW-9605</a> - [C++] Optimize performance for aggregate min/max compute kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9607">ARROW-9607</a> - [C++][Gandiva] Add bitwise_and(), bitwise_or() and bitwise_not() functions for integers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9608">ARROW-9608</a> - [Rust] Remove arrow flight from parquet’s feature gating</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9615">ARROW-9615</a> - [Rust] Add kernel to compute length of string array</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9617">ARROW-9617</a> - [Rust] [DataFusion] Add length of string array</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9618">ARROW-9618</a> - [Rust] [DataFusion] Make it easier to write optimizers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9619">ARROW-9619</a> - [Rust] [DataFusion] Add predicate push-down</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9632">ARROW-9632</a> - [Rust] Add a “new” method for ExecutionContextSchemaProvider</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9638">ARROW-9638</a> - [C++][Compute] Implement mode(most frequent number) kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9639">ARROW-9639</a> - [Ruby] Add dependency version check</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9640">ARROW-9640</a> - [C++][Gandiva] Implement round() for integers and long integers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9641">ARROW-9641</a> - [C++][Gandiva] Implement round() for floating point and double floating point numbers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9645">ARROW-9645</a> - [Python] Deprecate the legacy pyarrow.filesystem interface</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9646">ARROW-9646</a> - [C++][Dataset] Add support for writing parquet datasets</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9650">ARROW-9650</a> - [Packaging][APT] Drop support for Ubuntu 19.10</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9654">ARROW-9654</a> - [Rust][DataFusion] Add an EXPLAIN command to the datafusion CLI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9656">ARROW-9656</a> - [Rust][DataFusion] Slightly confusing error message when unsupported type is provided to CREATE EXTERNAL TABLE</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9658">ARROW-9658</a> - [Python][Dataset] Bindings for dataset writing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9665">ARROW-9665</a> - [R] head/tail/take for Datasets</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9667">ARROW-9667</a> - [CI][Crossbow] Segfault in 2 nightly R builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9671">ARROW-9671</a> - [C++] BasicDecimal128 constructor interprets uint64_t integers with highest bit set as negative</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9673">ARROW-9673</a> - [Rust] Add a param “dialect” for DFParser::parse_sql</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9678">ARROW-9678</a> - [Rust] [DataFusion] Improve projection push down to remove unused columns</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9679">ARROW-9679</a> - [Rust] [DataFusion] HashAggregate walks map many times building final batch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9681">ARROW-9681</a> - [Java] Failed Arrow Memory - Core on big-endian platform</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9683">ARROW-9683</a> - [Rust][DataFusion] Implement Debug for ExecutionPlan trait</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9691">ARROW-9691</a> - [Rust] [DataFusion] Make sql_statement_to_plan public</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9695">ARROW-9695</a> - [Rust][DataFusion] Improve documentation on LogicalPlan variants</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9699">ARROW-9699</a> - [C++][Compute] Improve mode kernel performance for small integer types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9701">ARROW-9701</a> - [Java][CI] Add a test job on s390x</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9702">ARROW-9702</a> - [C++] Move bpacking simd to runtime path</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9703">ARROW-9703</a> - [Developer][Archery] Restartable cherry-picking process for creating maintenance branches</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9706">ARROW-9706</a> - [Java] Tests in TestLargeListVector fail on big endian platform</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9710">ARROW-9710</a> - [C++] Generalize Decimal ToString in preparation for Decimal256</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9711">ARROW-9711</a> - [Rust] Add benchmark based on TPC-H</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9713">ARROW-9713</a> - [Rust][DataFusion] Remove explicit panics</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9715">ARROW-9715</a> - [R] changelog/doc updates for 1.0.1</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9718">ARROW-9718</a> - [Python] Make pyarrow.parquet work with the new filesystem interfaces</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9721">ARROW-9721</a> - [Packaging][Python] Update wheel dependency files</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9722">ARROW-9722</a> - [Rust] Shorten key lifetime for reverse lookup for dictionary arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9723">ARROW-9723</a> - [C++] Expected behaviour of “mode” kernel with NaNs?</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9725">ARROW-9725</a> - [Rust] [DataFusion] LimitExec and SortExec should use MergeExec</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9737">ARROW-9737</a> - [C++][Gandiva] Add bitwise_xor() for integers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9739">ARROW-9739</a> - [CI][Ruby] Don’t install gem documents</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9742">ARROW-9742</a> - [Rust] Create one standard DataFrame API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9751">ARROW-9751</a> - [Rust] [DataFusion] Extend UDFs to accept more than one type per argument</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9752">ARROW-9752</a> - [Rust] [DataFusion] Add support for Aggregate UDFs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9753">ARROW-9753</a> - [Rust] [DataFusion] Remove the use of Mutex in ExecutionPlan trait</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9754">ARROW-9754</a> - [Rust] [DataFusion] Implement async in DataFusion traits</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9757">ARROW-9757</a> - [Rust] [DataFusion] Use “pub use” to expose a clean public API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9758">ARROW-9758</a> - [Rust] [DataFusion] Implement extension API for DataFusion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9759">ARROW-9759</a> - [Rust] [DataFusion] Implement DataFrame::sort</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9760">ARROW-9760</a> - [Rust] [DataFusion] Implement DataFrame::explain</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9761">ARROW-9761</a> - [C++] Add experimental pull-based iterator structures to C interface implementation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9762">ARROW-9762</a> - [Rust] [DataFusion] ExecutionContext::sql should return DataFrame</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9769">ARROW-9769</a> - [Python] Remove skip for in-memory fsspec in test_move_file</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9775">ARROW-9775</a> - [C++] Automatic S3 region selection</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9781">ARROW-9781</a> - [C++] Fix uninitialized value warnings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9782">ARROW-9782</a> - [C++][Dataset] Ability to write “.feather” files with IpcFileFormat</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9784">ARROW-9784</a> - [Rust] [DataFusion] Improve instructions for running tpch benchmark</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9786">ARROW-9786</a> - [R] Unvendor cpp11 before release</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9788">ARROW-9788</a> - Handle naming inconsistencies between SQL, DataFrame API and struct names</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9792">ARROW-9792</a> - [Rust] [DataFusion] Logical aggregate functions should not return Result</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9794">ARROW-9794</a> - [C++] Add functionality to cpu_info to discriminate between Intel vs AMD x86</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9795">ARROW-9795</a> - [C++][Gandiva] Implement castTIMESTAMP(int64) in Gandiva</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9806">ARROW-9806</a> - [R] More compute kernel bindings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9807">ARROW-9807</a> - [R] News update/version bump post-1.0.1</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9808">ARROW-9808</a> - [Python] parquet.read_table docstring wrong use_legacy_dataset explanation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9811">ARROW-9811</a> - [C++] Unchecked floating point division by 0 should succeed</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9813">ARROW-9813</a> - [C++] Disable semantic interposition</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9819">ARROW-9819</a> - [C++] Bump mimalloc to 1.6.4</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9821">ARROW-9821</a> - [Rust][DataFusion] User Defined PlanNode / Operator API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9823">ARROW-9823</a> - [CI][C++][MinGW] Enable S3</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9832">ARROW-9832</a> - [Rust] [DataFusion] Refactor PhysicalPlan to remove Partition</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9833">ARROW-9833</a> - [Rust] [DataFusion] Refactor TableProvider.scan to return ExecutionPlan</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9834">ARROW-9834</a> - [Rust] [DataFusion] Remove Partition trait</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9835">ARROW-9835</a> - [Rust] [DataFusion] Remove FunctionMeta</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9836">ARROW-9836</a> - [Rust] [DataFusion] Improve API for usage of UDFs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9837">ARROW-9837</a> - [Rust] Add provider for variable</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9838">ARROW-9838</a> - [Rust] [DataFusion] DefaultPhysicalPlanner should insert explicit MergeExec nodes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9839">ARROW-9839</a> - [Rust] [DataFusion] Add ability to downcast ExecutionPlan to specific operator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9841">ARROW-9841</a> - [Rust] Update checked-in flatbuffer files</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9844">ARROW-9844</a> - [Go][CI] Add Travis CI job for Go on s390x</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9845">ARROW-9845</a> - [Rust] [Parquet] serde_json is only used in tests but isn’t in dev-dependencies</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9848">ARROW-9848</a> - [Rust] Implement changes to ensure flatbuffer alignment</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9849">ARROW-9849</a> - [Rust] [DataFusion] Make UDFs not need a Field</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9850">ARROW-9850</a> - [Go] Defer should not be used in the loop</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9853">ARROW-9853</a> - [Rust] Implement “take” kernel for dictionary arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9854">ARROW-9854</a> - [R] Support reading/writing data to/from S3</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9858">ARROW-9858</a> - [C++][Python][Docs] Expand user guide for FileSystem</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9863">ARROW-9863</a> - [C++] [PARQUET] Optimize meta data recovery of ApplicationVersion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9867">ARROW-9867</a> - [C++][Dataset] FileSystemDataset should expose its filesystem</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9868">ARROW-9868</a> - [C++] Provide utility for copying files between filesystems</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9869">ARROW-9869</a> - [R] Implement full S3FileSystem/S3Options constructor</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9870">ARROW-9870</a> - [R] Friendly interface for filesystems (S3)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9871">ARROW-9871</a> - [C++] Add uppercase support to ARROW_USER_SIMD_LEVEL.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9873">ARROW-9873</a> - [C++][Compute] Improve mode kernel for integers within limited value range</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9875">ARROW-9875</a> - [Python] Let FileSystem.get_file_info accept a single path</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9884">ARROW-9884</a> - [R] Bindings for writing datasets to Parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9885">ARROW-9885</a> - [Rust] [DataFusion] Simplify code of type coercion for binary types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9886">ARROW-9886</a> - [Rust] [DataFusion] Simplify code to test cast</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9887">ARROW-9887</a> - [Rust] [DataFusion] Add support for complex return types of built-in functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9890">ARROW-9890</a> - [R] Add zstandard compression codec in macOS build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9891">ARROW-9891</a> - [Rust] [DataFusion] Make math functions support f32</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9892">ARROW-9892</a> - [Rust] [DataFusion] Add support for concat</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9893">ARROW-9893</a> - [Python] Bindings for writing datasets to Parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9895">ARROW-9895</a> - [Rust] Improve sort kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9899">ARROW-9899</a> - [Rust] [DataFusion] Switch from Box&lt;Schema&gt; -&gt; SchemaRef (Arc&lt;Schema&gt;) to be consistent with the rest of Arrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9900">ARROW-9900</a> - [Rust][DataFusion] Use Arc&lt;&gt; instead of Box&lt;&gt; in LogicalPlan</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9901">ARROW-9901</a> - [C++] Add hand-crafted Parquet to Arrow reconstruction test for nested reading</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9902">ARROW-9902</a> - [Rust] [DataFusion] Add support for array()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9904">ARROW-9904</a> - [C++] Unroll the loop manually for CountSetBits</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9908">ARROW-9908</a> - [Rust] Support temporal data types in JSON reader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9910">ARROW-9910</a> - [Rust] [DataFusion] Type coercion of Variadic is wrong</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9914">ARROW-9914</a> - [Rust][DataFusion] Document the SQL -&gt; Arrow type mapping</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9916">ARROW-9916</a> - [Rust] Avoid cloning ArrayData in several places</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9917">ARROW-9917</a> - [Python][Compute] Add bindings for mode kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9919">ARROW-9919</a> - [Rust] [DataFusion] Math functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9921">ARROW-9921</a> - [Rust] Add `from(Vec&lt;Option&lt;&amp;str&gt;&gt;)` to [Large]StringArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9925">ARROW-9925</a> - [GLib] Add low level value readers for GArrowListArray family</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9926">ARROW-9926</a> - [GLib] Use placement new for GArrowRecordBatchFileReader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9928">ARROW-9928</a> - [C++] Speed up integer parsing slightly</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9929">ARROW-9929</a> - [Developer] Autotune cmake-format</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9933">ARROW-9933</a> - [Developer] Add drone as a CI provider for crossbow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9934">ARROW-9934</a> - [Rust] Shape and stride check in tensor</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9941">ARROW-9941</a> - [Python] Better string representation for extension types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9944">ARROW-9944</a> - [Rust] Implement TO_TIMESTAMP function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9949">ARROW-9949</a> - [C++] Generalize Decimal128::FromString for reuse in Decimal256</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9950">ARROW-9950</a> - [Rust] [DataFusion] Allow UDF usage without registry</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9952">ARROW-9952</a> - [Python] Use pyarrow.dataset writing for pq.write_to_dataset</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9954">ARROW-9954</a> - [Rust] [DataFusion] Simplify code of aggregate planning</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9956">ARROW-9956</a> - [C++][Gandiva] Implement Binary string function in Gandiva</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9957">ARROW-9957</a> - [Rust] Remove unmaintained tempdir dependency</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9961">ARROW-9961</a> - [Rust][DataFusion] to_timestamp function parses timestamp without timezone offset as UTC rather than local</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9964">ARROW-9964</a> - [C++] CSV date support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9965">ARROW-9965</a> - [Java] Buffer capacity calculations are slow for fixed-width vectors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9966">ARROW-9966</a> - [Rust] Speedup aggregate kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9967">ARROW-9967</a> - [Python] Add compute module docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9971">ARROW-9971</a> - [Rust] Speedup take</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9977">ARROW-9977</a> - [Rust] Add min/max for [Large]String</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9979">ARROW-9979</a> - [Rust] Fix arrow crate clippy lints</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9980">ARROW-9980</a> - [Rust] Fix parquet crate clippy lints</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9981">ARROW-9981</a> - [Rust] Allow configuring flight IPC with IpcWriteOptions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9983">ARROW-9983</a> - [C++][Dataset][Python] Use larger default batch size than 32K for Datasets API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9984">ARROW-9984</a> - [Rust] [DataFusion] DRY of function to string</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9986">ARROW-9986</a> - [Rust][DataFusion] TO_TIMESTAMP function erroneously requires fractional seconds when no timezone is present</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9987">ARROW-9987</a> - [Rust] [DataFusion] Improve docs of `Expr`.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9988">ARROW-9988</a> - [Rust] [DataFusion] Added std::ops to logical expressions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9992">ARROW-9992</a> - [C++][Python] Refactor python to arrow conversions based on a reusable conversion API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9998">ARROW-9998</a> - [Python] Support pickling DictionaryScalar</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9999">ARROW-9999</a> - [Python] Support constructing dictionary array directly through pa.array()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10000">ARROW-10000</a> - [C++][Python] Support constructing StructArray from list of key-value pairs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10001">ARROW-10001</a> - [Rust] [DataFusion] Add developer guide to README</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10010">ARROW-10010</a> - [Rust] Speedup arithmetic</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10015">ARROW-10015</a> - [Rust] Implement SIMD for aggregate kernel sum</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10016">ARROW-10016</a> - [Rust] [DataFusion] Implement IsNull and IsNotNull</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10018">ARROW-10018</a> - [CI] Disable Sphinx and API documentation build since it takes 6 hours on master</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10019">ARROW-10019</a> - [Rust] Add substring kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10023">ARROW-10023</a> - [Gandiva][C++] Implementing Split part function in gandiva</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10024">ARROW-10024</a> - [C++][Parquet] Create nested reading benchmarks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10028">ARROW-10028</a> - [Rust] Simplify macro def_numeric_from_vec</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10030">ARROW-10030</a> - [Rust] Support fromIter and toIter</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10035">ARROW-10035</a> - [C++] Bump versions of vendored code</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10037">ARROW-10037</a> - [C++] Workaround to force find AWS SDK to look for shared libraries</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10040">ARROW-10040</a> - [Rust] Create a way to slice unaligned offset buffers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10043">ARROW-10043</a> - [Rust] [DataFusion] Introduce support for DISTINCT by partially implementing COUNT(DISTINCT)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10044">ARROW-10044</a> - [Rust] Improve README</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10046">ARROW-10046</a> - [Rust] [DataFusion] Made `*Iterator` implement Iterator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10050">ARROW-10050</a> - [C++][Gandiva] Implement concat() in Gandiva for up to 10 arguments</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10051">ARROW-10051</a> - [C++][Compute] Make aggregate kernel merge state mutable</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10054">ARROW-10054</a> - [Python] Slice methods should return empty arrays instead of crashing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10055">ARROW-10055</a> - [Rust] Implement DoubleEndedIterator for NullableIter</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10057">ARROW-10057</a> - [C++] Add Parquet-Arrow roundtrip tests for nested data</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10058">ARROW-10058</a> - [C++] Investigate performance of LevelsToBitmap without BMI2</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10059">ARROW-10059</a> - [R][Doc] Give more advice on how to set up C++ build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10063">ARROW-10063</a> - [Archery][CI] Fetch main branch in archery build only when it is a pull request</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10064">ARROW-10064</a> - [C++] Resolve compile warnings on Apple Clang 12</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10065">ARROW-10065</a> - [Rust] DRY downcasted Arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10066">ARROW-10066</a> - [C++] Make sure that default AWS region is respected</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10068">ARROW-10068</a> - [C++] Add bundled external project for aws-sdk-cpp</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10069">ARROW-10069</a> - [Java] Support running Java benchmarks from command line</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10070">ARROW-10070</a> - [C++][Compute] Implement stdev aggregate kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10071">ARROW-10071</a> - [R] segfault with ArrowObject from previous session, or saved</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10074">ARROW-10074</a> - [C++] Don’t use string_view.to_string()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10075">ARROW-10075</a> - [C++] Don’t use nonstd::nullopt; this breaks our vendoring abstraction.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10076">ARROW-10076</a> - [C++] Use TemporaryDir for all tests that don’t already use it.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10077">ARROW-10077</a> - [C++] Potential overflow in bit_stream_utils.h multiplication.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10083">ARROW-10083</a> - [C++] Improve Parquet fuzz seed corpus</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10084">ARROW-10084</a> - [Rust] [DataFusion] Add length of large string array</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10086">ARROW-10086</a> - [Rust] Migrate min_large_string -&gt; min_string kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10090">ARROW-10090</a> - [C++][Compute] Improve mode kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10092">ARROW-10092</a> - [Dev][Go] Add grpc generated go files to rat exclusion list</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10093">ARROW-10093</a> - [R] Add ability to opt-out of int64 -&gt; int demotion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10095">ARROW-10095</a> - [Rust] [Parquet] Update for IPC changes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10096">ARROW-10096</a> - [Rust] [DataFusion] Remove unused code</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10099">ARROW-10099</a> - [C++][Dataset] Also allow integer partition fields to be dictionary encoded</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10100">ARROW-10100</a> - [C++][Dataset] Ability to read/subset a ParquetFileFragment with given set of row group ids</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10102">ARROW-10102</a> - [C++] Generalize BasicDecimal128::operator*= for reuse in Decimal256</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10103">ARROW-10103</a> - [Rust] Add a Contains kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10105">ARROW-10105</a> - [FlightRPC] Add client option to disable certificate validation with TLS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10120">ARROW-10120</a> - [C++][Parquet] Create reading benchmarks for 2-level nested data</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10127">ARROW-10127</a> - [Format] Update specification to support 256-bit Decimal types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10129">ARROW-10129</a> - [Rust] Cargo build is rebuilding dependencies on arrow changes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10134">ARROW-10134</a> - [C++][Dataset] Add ParquetFileFragment::num_row_groups property</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10139">ARROW-10139</a> - [C++] Add support for building arrow_testing without building tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10148">ARROW-10148</a> - [Rust] Add documentation to lib.rs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10151">ARROW-10151</a> - [Python] Add support for MapArray to_pandas conversion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10155">ARROW-10155</a> - [Rust] [DataFusion] Add documentation to lib.rs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10156">ARROW-10156</a> - [Rust] Auto-label PRs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10157">ARROW-10157</a> - [Rust] Add more documentation about take</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10160">ARROW-10160</a> - [Rust] Improve documentation of DictionaryType</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10161">ARROW-10161</a> - [Rust] [DataFusion] Simplify expression tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10162">ARROW-10162</a> - [Rust] Support display of DictionaryArrays in pretty printing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10164">ARROW-10164</a> - [Rust] Add support for DictionaryArray types to cast kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10167">ARROW-10167</a> - [Rust] Support display of DictionaryArrays in sql.rs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10168">ARROW-10168</a> - [Rust] [Parquet] Extend arrow schema conversion to projected fields</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10171">ARROW-10171</a> - [Rust] [DataFusion] Add `ExecutionContext::from&lt;ExecutionContextState&gt;`</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10190">ARROW-10190</a> - [Website] Add Jorge to list of committers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10191">ARROW-10191</a> - [Rust] [Parquet] Add roundtrip tests for single column batches</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10196">ARROW-10196</a> - [C++] Add Future::DeferNotOk()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10199">ARROW-10199</a> - [Rust][Parquet] Release Parquet at crates.io to remove debug prints</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10201">ARROW-10201</a> - [C++][CI] Disable S3 in arm64 job on Travis CI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10202">ARROW-10202</a> - [CI][Windows] Use sf.net mirror for MSYS2</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10205">ARROW-10205</a> - [Java][FlightRPC] Add client option to disable server verification</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10206">ARROW-10206</a> - [Python][C++][FlightRPC] Add client option to disable server validation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10215">ARROW-10215</a> - [Rust] [DataFusion] Rename “Source” typedef</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10217">ARROW-10217</a> - [CI] Run fewer GitHub Actions jobs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10225">ARROW-10225</a> - [Rust] [Parquet] Fix null bitmap comparisons in roundtrip tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10227">ARROW-10227</a> - [Ruby] Use a table size as the default for parquet chunk_size</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10229">ARROW-10229</a> - [C++][Parquet] Remove leftover ARROW_LOG statement</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10231">ARROW-10231</a> - [CI] Unable to download minio in arm32v7 docker image</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10233">ARROW-10233</a> - [Rust] Make array_value_to_string available in all Arrow builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10235">ARROW-10235</a> - [Rust][DataFusion] Improve documentation for type coercion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10240">ARROW-10240</a> - [Rust] [DataFusion] Optionally load TPC-H data into memory before running benchmark query</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10251">ARROW-10251</a> - [Rust] [DataFusion] MemTable::load() should load partitions in parallel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10252">ARROW-10252</a> - [Python] Add option to skip inclusion of Arrow headers in Python installation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10256">ARROW-10256</a> - [C++][Flight] Disable -Werror carefully</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10257">ARROW-10257</a> - [R] Prepare news/docs for 2.0 release</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10260">ARROW-10260</a> - [Python] Missing MapType to Pandas dtype</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10263">ARROW-10263</a> - [C++][Compute] Improve numerical stability of variances merging</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10265">ARROW-10265</a> - [CI] Use smaller build when cache doesn’t exist on Travis CI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10266">ARROW-10266</a> - [CI][macOS] Ensure using Python 3.8 with Homebrew</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10267">ARROW-10267</a> - [Python] Skip flight test if disable_server_verification feature is not available</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10272">ARROW-10272</a> - [Packaging][Python] Pin newer multibuild version to avoid updating homebrew</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10273">ARROW-10273</a> - [CI][Homebrew] Fix “brew audit” usage</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10287">ARROW-10287</a> - [C++] Avoid std::random_device whenever possible</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10289">ARROW-10289</a> - [Rust] Support reading dictionary streams</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10295">ARROW-10295</a> - [Rust] [DataFusion] Simplify accumulators</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10310">ARROW-10310</a> - [C++][Gandiva] Add single argument round() in Gandiva</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1845">PARQUET-1845</a> - [C++] Int96 memory images in test cases assume only little-endian</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1878">PARQUET-1878</a> - [C++] lz4 codec is not compatible with Hadoop Lz4Codec</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1904">PARQUET-1904</a> - [C++] Export file_offset in RowGroupMetaData</li>
</ul>
</main>
<hr/>
<footer class="footer">
<div class="row">
<div class="col-md-9">
<p>Apache Arrow, Arrow, Apache, the Apache feather logo, and the Apache Arrow project logo are either registered trademarks or trademarks of The Apache Software Foundation in the United States and other countries.</p>
<p>&copy; 2016-2024 The Apache Software Foundation</p>
</div>
</div>
</footer>
</div>
</body>
</html>