blob: 29c151cce702ca85892d6e5a38309a9e024fbf06 [file] [log] [blame]
<!DOCTYPE html>
<html lang="en-US">
<head>
<meta charset="UTF-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1">
<!-- The above meta tags *must* come first in the head; any other head content must come *after* these tags -->
<title>Apache Arrow 0.17.0 Release | Apache Arrow</title>
<!-- Begin Jekyll SEO tag v2.8.0 -->
<meta name="generator" content="Jekyll v4.3.3" />
<meta property="og:title" content="Apache Arrow 0.17.0 Release" />
<meta property="og:locale" content="en_US" />
<meta name="description" content="Apache Arrow 0.17.0 (20 April 2020) This is a major release covering more than 2 months of development. Download Source Artifacts Binary Artifacts For CentOS For Debian For Python For Ubuntu Git tag Contributors This release includes 569 commits from 79 distinct contributors. $ git shortlog -sn apache-arrow-0.16.0..apache-arrow-0.17.0 80 Antoine Pitrou 78 Krisztián Szűcs 58 Wes McKinney 55 Neal Richardson 39 Sutou Kouhei 30 Benjamin Kietzman 26 Joris Van den Bossche 20 Andy Grove 13 liyafan82 11 David Li 11 François Saint-Jacques 8 Micah Kornfield 8 Projjal Chanda 7 Max Burke 7 Prudhvi Porandla 6 Kenta Murata 6 Uwe L. Korn 6 Yibo Cai 6 tianchen 5 Frank Du 5 Martin Radev 4 Rok 3 Paddy Horan 3 Takashi Hashida 3 Yosuke Shiro 3 Zhuo Peng 2 Bryan Cutler 2 David Kegley 2 Jacek Pliszka 2 Ji Kunshang 2 Jorge C. Leitao 2 Jörn Horstmann 2 Kazuaki Ishizaki 2 Markus Westerlind 2 Morgan Cassels 2 Ryan Murray 2 Sam Albers 2 Siyuan 2 Stephanie Wang 2 Yuqi Gu 2 gawain.bolton 1 Adam Hooper 1 Andreas Zimmerer 1 Bryant Biggs 1 Chao Sun 1 Dan Nugent 1 Dominik Durner 1 Eyal Farago-Hagag 1 Grant Monroe 1 Hatem Helal 1 Ismaël Mejía 1 John Muehlhausen 1 Ken Suenobu 1 Kevin Conaway 1 Li, Jiajia 1 Mahmut Bulut 1 Marius van Niekerk 1 Michael Chirico 1 NAITOH Jun 1 Neville Dipale 1 Omega Gamage 1 Onur Satici 1 Palmer Lao 1 Paul Balanca 1 Prudhvi 1 Ravindra Wagh 1 Richard Artoul 1 Sebastien Binet 1 TP Boudreau 1 Tarek Allam 1 Tobias Mayer 1 Yuan Zhou 1 Zherui Cao 1 andy-thomason 1 caleboverman 1 mubai 1 psuman 1 ravindra-wagh 1 wenjun.swj Patch Committers The following Apache committers merged contributed patches to the repository. $ git shortlog -csn apache-arrow-0.16.0..apache-arrow-0.17.0 128 Wes McKinney 109 Krisztián Szűcs 86 Antoine Pitrou 61 Neal Richardson 60 Sutou Kouhei 33 Benjamin Kietzman 23 Andy Grove 14 François Saint-Jacques 10 Micah Kornfield 7 Paddy Horan 7 Praveen 5 David Li 5 Sebastien Binet 3 GitHub 3 Joris Van den Bossche 3 Neville Dipale 3 Pindikura Ravindra 3 Yosuke Shiro 2 Bryan Cutler 2 Philipp Moritz 1 Brian Hulette 1 Chao Sun Changelog New Features and Improvements ARROW-1560 - [C++] Kernel implementations for “match” function ARROW-1571 - [C++] Implement argsort kernels (sort indices) for integers using O(n) counting sort ARROW-1581 - [Packaging] Tooling to make nightly wheels available for install ARROW-1582 - [Python] Set up + document nightly conda builds for macOS ARROW-1636 - [Format] Integration tests for null type ARROW-2447 - [C++] Create a device abstraction ARROW-3054 - [Packaging] Tooling to enable nightly conda packages to be updated to some anaconda.org channel ARROW-3410 - [C++][Dataset] Streaming CSV reader interface for memory-constrainted environments ARROW-3750 - [R] Pass various wrapped Arrow objects created in Python into R with zero copy via reticulate ARROW-4120 - [Python] Define process for testing procedures that check for no macro-level memory leaks ARROW-4226 - [Format][C++] Add CSF sparse tensor support ARROW-4304 - [Rust] Enhance documentation for arrow ARROW-4428 - [R] Feature flags for R build ARROW-4482 - [Website] Add blog archive page ARROW-4815 - [Rust] [DataFusion] Add support for * in SQL projection ARROW-5357 - [Rust] Add capacity field in Buffer ARROW-5405 - [Documentation] Move integration testing documentation to Sphinx docs, add instructions for JavaScript ARROW-5497 - [Release] Build and publish R/Java/JS docs ARROW-5501 - [R] Reorganize read/write file/stream functions ARROW-5510 - [Format] Feather V2 based on Arrow IPC file format, with compression support ARROW-5563 - [Format] Update integration test JSON format documentation ARROW-5585 - [Go] rename arrow.TypeEquals into arrow.TypeEqual ARROW-5742 - [CI] Add daily / weekly Valgrind build ARROW-5757 - [Python] Stop supporting Python 2.7 ARROW-590 - [Integration] Add integration tests for Union types ARROW-5949 - [Rust] Implement DictionaryArray ARROW-6165 - [Integration] Use multiprocessing to run integration tests on multiple CPU cores ARROW-6176 - [Python] Allow to subclass ExtensionArray to attach to custom extension type ARROW-6393 - [C++] Add EqualOptions support in SparseTensor::Equals ARROW-6479 - [C++] inline errors from external projects’ build logs ARROW-6510 - [Python][Filesystem] Expose nanosecond resolution mtime ARROW-6666 - [Rust] [DataFusion] Implement string literal expression ARROW-6724 - [C++] Add simpler static ctor for BufferOutputStream than the current Create function ARROW-6821 - [C++][Parquet] Do not require Thrift compiler when building (but still require library) ARROW-6823 - [C++][Python][R] Support metadata in the feather format? ARROW-6837 - [C++/Python] access File Footer custom_metadata ARROW-6841 - [C++] Upgrade to LLVM 8 ARROW-6875 - [FlightRPC] Implement Criteria for ListFlights RPC / list_flights method ARROW-6915 - [Developer] Do not overwrite minor release version with merge script, even if not specified by committer ARROW-6947 - [Rust] [DataFusion] Add support for scalar UDFs ARROW-6996 - [Python] Expose boolean filter kernel on Table ARROW-7044 - [Release] Create a post release script for the home-brew formulas ARROW-7048 - [Java] Support for combining multiple vectors under VectorSchemaRoot ARROW-7063 - [C++] Schema print method prints too much metadata ARROW-7073 - [Java] Support concating vectors values in batch ARROW-7080 - [Python][Parquet][C++] Expose parquet field_id in Schema objects ARROW-7091 - [C++] Move all factories to type_fwd.h ARROW-7119 - [C++][CI] Use scripts/util_coredump.sh to show automatic backtraces ARROW-7201 - [GLib][Gandiva] Add support for BooleanNode ARROW-7202 - [R][CI] Improve rwinlib building on CI to stop re-downloading dependencies ARROW-7222 - [Python][Release] Wipe any existing generated Python API documentation when updating website ARROW-7233 - [C++] Add Result APIs to IPC module ARROW-7256 - [C++] Remove ARROW_MEMORY_POOL_DEFAULT macro ARROW-7330 - [C++] Add Result to APIs to arrow/gpu ARROW-7332 - [C++][Parquet] Explicitly catch status exceptions in PARQUET_CATCH_NOT_OK ARROW-7336 - [C++] Implement MinMax options to not skip nulls ARROW-7338 - [C++] Improve InMemoryDataSource to support generator instead of static list ARROW-7365 - [Python] Support FixedSizeList type in conversion to numpy/pandas ARROW-7400 - [Java] Avoids the worst case for quick sort ARROW-7412 - [C++][Dataset] Ensure that dataset code is robust to schemas with duplicate field names ARROW-7419 - [Python] Support SparseCSCMatrix ARROW-7427 - [Python] Support SparseCSFTensor ARROW-7428 - [Format][C++] Add serialization for CSF sparse tensors ARROW-7444 - [GLib] Add LocalFileSystem support ARROW-7462 - [C++] Add CpuInfo detection for Arm64 Architecture ARROW-7491 - [Java] Improve the performance of aligning ARROW-7515 - [C++] Rename nonexistent and non_existent to not_found ARROW-7524 - [C++][CI] Build parquet support in the VS2019 GitHub Actions job ARROW-7530 - [Developer] Do not include list of commits from PR in squashed summary message ARROW-7534 - [Java] Create a new java/contrib module ARROW-7547 - [C++] [Python] [Dataset] Additional reader options in ParquetFileFormat ARROW-7587 - [C++][Compute] Add Top-k kernel ARROW-7608 - [C++][Dataset] Expose more informational properties ARROW-7615 - [CI][Gandiva] Ensure that the gandiva_jni library has only a whitelisted set of shared dependencies as part of Travis CI job ARROW-7616 - [Java] Support comparing value ranges for dense union vector ARROW-7625 - [GLib] Parquet GLib and Red Parquet (Ruby) do not allow specifying compression type ARROW-7641 - [R] Make dataset vignette have executable code ARROW-7662 - [R] Support creating ListArray from R list ARROW-7664 - [C++] Extract localfs default from FileSystemFromUri ARROW-7675 - [R][CI] Move Windows CI from Appveyor to GHA ARROW-7679 - [R] Cleaner interface for creating UnionDataset ARROW-7684 - [Rust] Provide example of Flight server for DataFusion ARROW-7685 - [Developer] Add support for GitHub Actions to Crossbow ARROW-7691 - [C++] Verify missing fields when walking Flatbuffers data ARROW-7708 - [Release] Include PARQUET commits from git changelog in release changelogs ARROW-7712 - [CI][Crossbow] Fix or delete fuzzit jobs ARROW-7720 - [C++][Python] Add check_metadata argument to Table.equals ARROW-7725 - [C++] Add infrastructure for unity builds and precompiled headers ARROW-7726 - [CI] [C++] Use boost binaries on Windows GHA build ARROW-7729 - [Python][CI] Pin pandas version to 0.25 in the dask integration test ARROW-7733 - [Developer] Install locally a new enough version of Go for release verification script ARROW-7735 - [Release] conda-forge channel is missing for verifying wheels ARROW-7736 - [Release] Binary verification sometimes fails with transient error ARROW-7739 - [GLib] Use placement new to initialize shared_ptr object in private structs ARROW-7741 - [C++][Parquet] Incorporate new level generation logic in parquet write path with a flag to revert back to old logic ARROW-7742 - [GLib] Add support for MapArray ARROW-7745 - [Doc] [C++] Update Parquet documentation ARROW-7749 - [C++] Link some more tests together ARROW-7750 - [Release] Make the source release verification script restartable ARROW-7751 - [Release] macOS wheel verification also needs arrow-testing ARROW-7752 - [Release] Enable and test dataset in the verification script ARROW-7754 - [C++] Result is slow ARROW-7761 - [C++] Add S3 support to fs::FileSystemFromUri ARROW-7764 - [C++] Builders allocate a null bitmap buffer even if there is no nulls ARROW-7771 - [Developer] Use ARROW_TMPDIR environment variable in the verification scripts instead of TMPDIR ARROW-7774 - [Packaging][Python] Update macos and windows wheel filenames ARROW-7787 - [Rust] Add collect to Table API ARROW-7788 - [C++] Add schema conversion support for map type ARROW-7790 - [Website] Update how to install Linux packages ARROW-7795 - [Rust - DataFusion] Support boolean negation (NOT) ARROW-7796 - [R] write_* functions should invisibly return their inputs ARROW-7799 - [R][CI] Remove flatbuffers from homebrew formulae ARROW-7804 - [C++][R] Compile error on macOS 10.11 ARROW-7812 - [Packaging][Python] Upgrade LLVM in manylinux1 docker image ARROW-7817 - [CI] macOS R autobrew nightly failed on installing dependency from source ARROW-7819 - [C++][Gandiva] Add DumpIR to Filter/Projector classes ARROW-7824 - [C++][Dataset] Provide Dataset writing to IPC format ARROW-7828 - [Release] Remove SSH keys for internal use ARROW-7829 - [R] Test R bindings on clang ARROW-7833 - [R] Make install_arrow() actually install arrow ARROW-7834 - [Release] Post release task for updating the documentations ARROW-7839 - [Python][Dataset] Add IPC format to python bindings ARROW-7846 - [Python][Dev] Remove last dependencies on six ARROW-7849 - [Packaging][Python] Remove the remaining py27 crossbow wheel tasks from the nightlies ARROW-7858 - [C++][Python] Support casting an Extension type to its storage type ARROW-7859 - [R] Minor patches for CRAN submission 0.16.0.2 ARROW-7862 - [R] Linux installation should run quieter by default ARROW-7863 - [C++][Python][CI] Ensure running HDFS related tests ARROW-7864 - [R] Make sure bundled installation works even if there are system packages ARROW-7865 - [R] Test builds on latest Linux versions ARROW-7868 - [Crossbow] Reduce GitHub API query parallelism ARROW-7869 - [Python] Boost::system and boost::filesystem not necessary anymore in Python wheels ARROW-7872 - [Python] Support conversion of list-of-struct in Array/Table.to_pandas ARROW-7874 - [Python][Archery] Validate docstrings with numpydoc ARROW-7876 - [R] Installation fails in the documentation generation image ARROW-7877 - [Packaging] Fix crossbow deployment to github artifacts ARROW-7879 - [C++][Doc] Add doc for the Device API ARROW-7880 - [CI][R] R sanitizer job is not really working ARROW-7881 - [C++] Fix pedantic warnings ARROW-7882 - [C++][Gandiva] Optimise like function for substring pattern ARROW-7886 - [C++][Dataset] Consolidate Source and Dataset ARROW-7888 - [Python] Allow using a more modern version of jpype in pyarrow.jvm ARROW-7890 - [C++] Add Promise / Future implementation ARROW-7891 - [C++] RecordBatch-&gt;Equals should also have a check_metadata argument ARROW-7892 - [Python] Expose FilesystemSource.format attribute ARROW-7895 - [Python] Remove more python 2.7 cruft ARROW-7896 - [C++] Refactor from #include guards to #pragma once ARROW-7897 - [Packaging] Temporarily disable artifact uploading until we fix the deployment issues ARROW-7898 - [Python] Reduce the number docstring violations using numpydoc ARROW-7904 - [C++] Decide about Field/Schema metadata printing parameters and how much to show by default ARROW-7907 - [Python] Conversion to pandas of empty table with timestamp type aborts ARROW-7912 - [Format] C data interface ARROW-7913 - [C++][Python][R] C++ implementation of C data interface ARROW-7915 - [CI] [Python] Run tests with Python development mode enabled ARROW-7916 - [C++][Dataset] Project IPC record batches to materialized fields ARROW-7917 - [CMake] FindPythonInterp should check for python3 ARROW-7919 - [R] install_arrow() should conda install if appropriate ARROW-7920 - [R] Fill in some missing input validation ARROW-7921 - [Go] Add Reset method to various components and clean up comments ARROW-7927 - [C++] Fix ‘cpu_info.cc’ compilation warning ARROW-7929 - [C++] CMake target names differ from upstream provided names ARROW-7930 - [Python][CI] Test jpype integration in CI ARROW-7932 - [Rust] [Parquet] Implement array reader for temporal types ARROW-7934 - [C++] Fix UriEscape for empty string ARROW-7935 - [Java] Remove Netty dependency for BufferAllocator and ReferenceManager ARROW-7937 - [Python][Packaging] Remove boost from the macos wheels ARROW-7941 - [Rust] [DataFusion] Logical plan should support unresolved column references ARROW-7943 - [C++][Parquet] Add a new level builder capable of handling nested data ARROW-7947 - [Rust] [Flight] [DataFusion] Implement example for get_schema ARROW-7949 - [Developer] Update to ‘.gitignore’ to not track user specific ‘cpp/Brewfile.lock.json’ file ARROW-7951 - [Python][Parquet] Expose BYTE_STREAM_SPLIT to pyarrow ARROW-7959 - [Ruby] Add support for Ruby 2.3 again ARROW-7963 - [C++][Python][Dataset] Expose listing fragments ARROW-7965 - [Python] Refine higher level dataset API ARROW-7966 - [Integration][Flight][C++] Client should verify each batch independently ARROW-7969 - [Packaging] Use cURL to upload artifacts ARROW-7970 - [Packaging][Python] Use system boost to build the macos wheels ARROW-7971 - [Rust] Create rowcount utility ARROW-7977 - [C++] Rename fs::FileStats to fs::FileInfo ARROW-7979 - [C++] Implement experimental buffer compression in IPC messages ARROW-7982 - [C++] Let ArrayDataVisitor accept void-returning functions ARROW-7983 - [CI][R] Nightly builds should be more verbose when they fail ARROW-7984 - [R] Check for valid inputs in more places ARROW-7986 - [Python] pa.Array.from_pandas cannot convert pandas.Series containing pyspark.ml.linalg.SparseVector ARROW-7987 - [CI][R] Fix for verbose nightly builds ARROW-7988 - [R] Fix on.exit calls in reticulate bindings ARROW-7991 - [C++][Plasma] Allow option for evicting if full when creating an object ARROW-7993 - [Java] Support decimal type in ComplexCopier ARROW-7994 - [CI][C++] Move AppVeyor MinGW builds to GitHub Actions ARROW-7995 - [C++] IO: coalescing and caching read ranges ARROW-7998 - [C++][Plasma] Make Seal requests synchronous ARROW-8005 - [Website] Review and adjust any usages of Apache dist system from website / tools ARROW-8014 - [C++] Provide CMake targets to test only within a given label ARROW-8016 - [Developer] Fix deprecation warning in PR merge tool ARROW-8018 - [C++][Parquet]Parquet Modular Encryption ARROW-8024 - [R] Bindings for BinaryType and FixedBinaryType ARROW-8026 - [Python] Support memoryview in addition to string value types for constructing string and binary type arrays ARROW-8027 - [Developer][Integration] Add integration tests for duplicate field names ARROW-8028 - [Go] Allow duplicate field names in schemas and nested types ARROW-8030 - [C++][Plasma] Fix inconsistent comment style ARROW-8039 - [Python][Dataset] Support using dataset API in pyarrow.parquet with a minimal ParquetDataset shim ARROW-8044 - [CI][NIGHTLY:gandiva-jar-osx] pygit2 needs libgit2 v1.0.x ARROW-8055 - [GLib][Ruby] Add some metadata bindings to GArrowSchema ARROW-8058 - [C++][Python][Dataset] Provide an option to toggle validation and schema inference in FileSystemDatasetFactoryOptions ARROW-8059 - [Python] Make FileSystem objects serializable ARROW-8060 - [Python] Make dataset Expression objects serializable ARROW-8061 - [C++][Dataset] Ability to specify granularity of ParquetFileFragment (support row groups) ARROW-8063 - [Python] Add user guide documentation for Datasets API ARROW-8064 - [Dev] Implement Comment bot via Github actions ARROW-8072 - [C++][Plasma] Add const constraint when parsing data ARROW-8077 - [Python] Add wheel build script and Crossbow configuration for Windows on Python 3.5 ARROW-8079 - [Python] Implement a wrapper for KeyValueMetadata, duck-typing dict where relevant ARROW-8080 - [C++] Add AVX512 build option ARROW-8082 - [Java][Plasma] Add JNI list() interface ARROW-8083 - [GLib] Add support for Peek() to GIOInputStream ARROW-8086 - [Java] Support writing decimal from big endian byte array in UnionListWriter ARROW-8087 - [C++][Dataset] Order of keys with HivePartitioning is lost in resulting schema ARROW-8096 - [C++][Gandiva] Create null node of Interval type ARROW-8097 - [Dev] Comment bot’s crossbow command acts on the master branch ARROW-8103 - [R] Make default Linux build more minimal ARROW-8104 - [C++] Don’t install bundled Thrift ARROW-8107 - [Packaging][APT] Use HTTPS for LLVM APT repository for Debian GNU/Linux stretch ARROW-8109 - [Packaging][APT] Drop support for Ubuntu Disco ARROW-8117 - [Rust] [Datafusion] Allow CAST from number to timestamp ARROW-8118 - [R] dim method for FileSystemDataset ARROW-8120 - [Packaging][APT] Add support for Ubuntu Focal ARROW-8123 - [Rust] [DataFusion] Create LogicalPlanBuilder ARROW-8124 - [Rust] Update library dependencies ARROW-8126 - [C++][Compute] Add Top-K kernel benchmark ARROW-8129 - [C++][Compute] Refine compare sorting kernel ARROW-8130 - [C++][Gandiva] Fix Dex visitor in llvm_generator to handle interval type ARROW-8140 - [Developer] Follow NullType -&gt; NullField change ARROW-8141 - [C++] Optimize BM_PlainDecodingBoolean performance using AVX512 Intrinsics API ARROW-8145 - [C++] Rename GetTargetInfos ARROW-8146 - [C++] Add per-filesystem facility to sanitize a path ARROW-8150 - [Rust] Allow writing custom FileMetaData k/v pairs ARROW-8151 - [Benchmarking][Dataset] Benchmark Parquet read performance with S3File ARROW-8153 - [Packaging] Update the conda feedstock files and upload artifacts to Anaconda ARROW-8158 - [Java] Getting length of data buffer and base variable width vector ARROW-8164 - [C++][Dataset] Let datasets be viewable with non-identical schema ARROW-8165 - [Packaging] Make nightly wheels available on a PyPI server ARROW-8167 - [CI] Add support for skipping builds with skip pattern in pull request title ARROW-8168 - [Java][Plasma] Improve Java Plasma client off-heap memory usage ARROW-8177 - [Rust] Make schema_to_fb_offset public ARROW-8178 - [C++] Upgrade to Flatbuffers 1.12 ARROW-8179 - [R] Windows build script tweaking for nightly packaging on GHA ARROW-8181 - [Java][FlightRPC] Expose transport error metadata ARROW-8182 - [Packaging] Increment the version number detected from the latest git tag ARROW-8183 - [c++][FlightRPC] Expose transport error metadata ARROW-8184 - [Packaging] Use arrow-nightlies organization name on Anaconda and Gemfury to host the nightlies ARROW-8185 - [Packaging] Document the available nightly wheels and conda packages ARROW-8187 - [R] Make test assertions robust to i18n ARROW-8191 - [Packaging][APT] Fix cmake removal in Debian GNU/Linux Stretch ARROW-8192 - [C++] script for unpack avx512 intrinsics code ARROW-8194 - [CI] Github Actions Windows job should run tests in parallel ARROW-8195 - [CI] Remove Boost download step in Github Actions ARROW-8198 - [C++] Diffing should handle null arrays ARROW-8200 - [GLib] Rename garrow_file_system_target_info{,s}() to …_file_info{,s}() ARROW-8203 - [C#] “dotnet pack” is failed ARROW-8204 - [Rust] [DataFusion] Add support for aliased expressions in SQL ARROW-8207 - [Packaging][wheel] Use LLVM 8 in manylinux2010 and manylinux2014 ARROW-8215 - [CI][GLib] Meson install fails in the macOS build ARROW-8218 - [C++] Parallelize decompression at field level in experimental IPC compression code ARROW-8220 - [Python] Make dataset FileFormat objects serializable ARROW-8222 - [C++] Use bcp to make a slim boost for bundled build ARROW-8224 - [C++] Remove APIs deprecated prior to 0.16.0 ARROW-8225 - [Rust] IPC reader must respect continuation markers ARROW-8227 - [C++] Refine SIMD feature definitions ARROW-8231 - [Rust] Parse key_value_metadata from parquet FileMetaData into arrow schema metadata ARROW-8232 - [Python] Deprecate pa.open_file and pa.open_stream in favor of pa.ipc.open_file/open_stream ARROW-8241 - [Rust] Add convenience methods to Schema ARROW-8242 - [C++] Flight fails to compile on GCC 4.8 ARROW-8243 - [Rust] [DataFusion] Fix inconsistent API in LogicalPlanBuilder ARROW-8244 - [Python][Parquet] Add `write_to_dataset` option to populate the “file_path” metadata fields ARROW-8246 - [C++] Add -Wa,-mbig-obj when compiling with MinGW to avoid linking errors ARROW-8247 - [Python] Expose Parquet writing “engine” setting in pyarrow.parquet.write_table ARROW-8249 - [Rust] [DataFusion] Make Table and LogicalPlanBuilder APIs more consistent ARROW-8252 - [CI][Ruby] Add Ubuntu 20.04 ARROW-8256 - [Rust] [DataFusion] Update CLI documentation for 0.17.0 release ARROW-8264 - [Rust] [DataFusion] Create utility for printing record batches ARROW-8266 - [C++] Add backup mirrors for external project source downloads ARROW-8267 - [CI][GLib] Failed to build on Ubuntu 16.04 ARROW-8271 - [Packaging] Allow wheel upload failures to gemfury ARROW-8275 - [Python][Docs] Review Feather + IPC file documentation per “Feather V2” changes ARROW-8277 - [Python] RecordBatch interface improvements ARROW-8279 - [C++] Do not export symbols from Codec implementations, remove need for PIMPL pattern ARROW-8288 - [Python] Expose with_ modifiers on DataType ARROW-8290 - [Python][Dataset] Improve ergonomy of the FileSystemDataset constructor ARROW-8291 - [Packaging] Conda nightly builds can’t locate Numpy ARROW-8292 - [Python][Dataset] Passthrough schema to Factory.finish() in dataset() function ARROW-8294 - [Format][Flight] Add DoExchange RPC to Flight protocol ARROW-8295 - [C++][Dataset] IpcFileFormat should expliclity push down column projection ARROW-8299 - [C++] Reusable “optional ParallelFor” function for optional use of multithreading ARROW-8300 - [R] Documentation and changelog updates for 0.17 ARROW-8307 - [Python] Expose use_memory_map option in pyarrow.feather APIs ARROW-8308 - [Rust] [Flight] Implement DoExchange on examples ARROW-8309 - [CI] C++/Java/Rust workflows should trigger on changes to Flight.proto ARROW-8311 - [C++] Add push style stream format reader ARROW-8316 - [CI] Set docker-compose to use docker-cli instead of docker-py for building images ARROW-8319 - [CI] Install thrift compiler in the debian build ARROW-8320 - [Documentation][Format] Clarify (lack of) alignment requirements in C data interface ARROW-8321 - [CI] Use bundled thrift in Fedora 30 build ARROW-8322 - [CI] Fix C# workflow file syntax ARROW-8325 - [R][CI] Stop including boost in R windows bundle ARROW-8329 - [Documentation][C++] Undocumented FilterOptions argument in Filter kernel ARROW-8330 - [Documentation] The post release script generates the documentation with a development version ARROW-8332 - [C++] Require Thrift compiler to use system libthrift for Parquet build ARROW-8335 - [Release] Add crossbow jobs to run release verification ARROW-8336 - [Packaging][deb] Use libthrift-dev on Debian 10 and Ubuntu 19.10 or later ARROW-8341 - [Packaging][deb] Fail to build by no disk space ARROW-8343 - [GLib] Add GArrowRecordBatchIterator ARROW-8347 - [C++] Add Result APIs to Array methods ARROW-8351 - [R][CI] Store the Rtools-built Arrow C++ library as a build artifact ARROW-8352 - [R] Add install_pyarrow() ARROW-8356 - [Developer] Support * wildcards with “crossbow submit” via GitHub actions ARROW-8361 - [C++] Add Result APIs to Buffer methods and functions ARROW-8362 - [Crossbow] Ensure that the locally generated version is used in the docker tasks ARROW-8367 - [C++] Deprecate Buffer::FromString(…, pool) ARROW-8368 - [Format] In C interface, clarify resource management for consumers needing only a subset of child fields in ArrowArray ARROW-8370 - [C++] Add Result to type / schema APIs ARROW-8371 - [Crossbow] Implement and exercise sanity checks for tasks.yml ARROW-8372 - [C++] Add Result to table / record batch APIs ARROW-8375 - [CI][R] Make Windows tests more verbose in case of segfault ARROW-8376 - [R] Add experimental interface to ScanTask/RecordBatch iterators ARROW-8387 - [Rust] Make schema_to_fb public ARROW-8389 - [Integration] Run tests in parallel ARROW-8390 - [R] Expose schema unification features ARROW-8393 - [C++][Gandiva] Make gandiva function registry case-insensitive ARROW-8396 - [Rust] Remove libc from dependencies ARROW-8398 - [Python] Remove deprecation warnings originating from python tests ARROW-8401 - [C++] Add AVX2/AVX512 version of ByteStreamSplitDecode/ByteStreamSplitEncode ARROW-8403 - [C++] Add ToString() to ChunkedArray, Table and RecordBatch ARROW-8407 - [Rust] Add rustdoc for Dictionary type ARROW-8408 - [Python] Add memory_map= toggle to pyarrow.feather.read_feather ARROW-8409 - [R] Add arrow::cpu_count, arrow::set_cpu_count wrapper functions a la Python ARROW-8412 - [C++][Gandiva] Fix gandiva date_diff function definitions ARROW-8433 - [R] Add feather alias for ipc format in dataset API ARROW-8443 - [Gandiva][C++] Fix round/truncate to no-op for special cases ARROW-8444 - [Documentation] Fix spelling errors across the codebase ARROW-8449 - [R] Use CMAKE_UNITY_BUILD everywhere ARROW-8450 - [Integration][C++] Implement large list/binary/utf8 integration ARROW-8457 - [C++] bridge test does not take care of endianness ARROW-8458 - [C++] Prefer the original mirrors for the bundled thirdparty dependencies ARROW-8461 - [Packaging][deb] Use zstd package for Ubuntu Xenial ARROW-8463 - [CI] Balance the nightly test builds between CircleCI, Azure and Github ARROW-8467 - [C++] Test cases using ArrayFromJSON assume only a little-endian platform ARROW-8477 - [C++] Enable reading and writing of long filenames for Windows ARROW-8496 - [C++] Refine ByteStreamSplitDecodeScalar ARROW-8499 - [C++][Dataset] In ScannerBuilder, batch_size will not work if projecter is not empty ARROW-8506 - [c++] Miss tests to verify expected_buffer with bit_width &gt; 8 in RLE ARROW-8515 - [C++] Bitmap ToString should have an option of grouping by bytes ARROW-8517 - [Developer][Release] Update Crossbow RC verification setup for changes since 0.16.0 ARROW-8523 - [C++] Optimize BitmapReader PARQUET-1663 - [C++] Provide API to check the presence of complex data types PARQUET-1716 - [C++] Add support for BYTE_STREAM_SPLIT encoding PARQUET-1770 - [C++][CI] Add fuzz target for reading Parquet files PARQUET-1785 - [C++] Improve code reusability in encoding-test.cc PARQUET-1786 - [C++] Use simd to improve BYTE_STREAM_SPLIT decoding performance PARQUET-1806 - [C++] [CI] Improve fuzzing seed corpus PARQUET-1825 - [C++] Fix compilation error in column_io_benchmark.cc PARQUET-1828 - [C++] Add a SSE2 path for the ByteStreamSplit encoder implementation PARQUET-1840 - [C++] DecodeSpaced copies more values then necessary PARQUET-458 - [C++] Implement support for DataPageV2 Bug Fixes ARROW-1907 - [C++/Python] Feather format cannot accommodate string columns containing more than a total of 2GB of data ARROW-2255 - [Developer][Integration] Serialize schema- and field-level custom metadata in integration test JSON format ARROW-2587 - [Python] Unable to write StructArrays with multiple children to parquet ARROW-3004 - [Documentation] Builds docs for master rather than a pinned commit ARROW-3543 - [R] Better support for timestamp format and time zones in R ARROW-5265 - [Python/CI] Add integration test with kartothek ARROW-5473 - [C++] Build failure on googletest_ep on Windows when using Ninja ARROW-5981 - [C++] DictionaryBuilder initialization with Array can fail silently ARROW-6528 - [C++] Spurious Flight test failures (port allocation failure) ARROW-6738 - [Java] Fix problems with current union comparison logic ARROW-6757 - [Python] Creating csv.ParseOptions() causes “Windows fatal exception: access violation” with Visual Studio 2017 ARROW-6871 - [Java] Enhance TransferPair related parameters check and tests ARROW-6872 - [C++][Python] Empty table with dictionary-columns raises ArrowNotImplementedError ARROW-6895 - [C++][Parquet] parquet::arrow::ColumnReader: ByteArrayDictionaryRecordReader repeats returned values when calling `NextBatch()` ARROW-7008 - [Python] pyarrow.chunked_array([array]) fails on array with all-None buffers ARROW-7049 - [C++] warnings building on mingw-w64 ARROW-7301 - [Java] Sql type DATE should correspond to DateDayVector ARROW-7335 - [C++][Gandiva] Add castBIGINT, extractDay interval_day functions in Gandiva ARROW-7390 - [C++][Dataset] Concurrency race in Projector::Project ARROW-7405 - [Java] ListVector isEmpty API is incorrect ARROW-7466 - [CI][Java] Fix gandiva-jar-osx nightly build failure ARROW-7467 - [Java] ComplexCopier does incorrect copy for Map nullable info ARROW-7520 - [R] Writing many batches causes a crash ARROW-7546 - [Java] Use new implementation to concat vectors values in batch ARROW-7624 - [Rust] Soundness issues via `Buffer` methods ARROW-7628 - [Python] Better document some read_csv corner cases ARROW-7631 - [C++][Gandiva] return zero if there is an overflow while converting a decimal to a lower precision/scale ARROW-7680 - [C++][Dataset] Partition discovery is not working with windows path ARROW-7701 - [C++] [CI] Flight test error on macOS ARROW-7713 - [Java] TastLeak was put at the wrong location ARROW-7722 - [Java][FlightRPC] Memory leak ARROW-7734 - [C++] Segfault when comparing status with and without detail ARROW-7740 - [C++] Array internals corruption in StructArray::Flatten ARROW-7755 - [Python] Windows wheel cannot be installed on Python 3.8 ARROW-7758 - [Python] Wrong conversion of timestamps that are out of bounds for pandas (eg 0000-01-01) ARROW-7760 - [Release] Fix verify-release-candidate.sh since pip3 seems to no longer be in miniconda ARROW-7762 - [Python] Exceptions in ParquetWriter get ignored ARROW-7766 - [Python][Packaging] Windows py38 wheels are built with wrong ABI tag ARROW-7772 - [R][C++][Dataset] Unable to filter on date32 object with date64 scalar ARROW-7775 - [Rust] Don’t let safe code arbitrarily transmute readers and writers ARROW-7777 - [Go] StructBuilder/ListBuilder index out of range panic ARROW-7780 - [Release] Fix Windows wheel RC verification script given lack of “m” ABI tag in Python 3.8 ARROW-7781 - [C++][Dataset] Filtering on a non-existent column gives a segfault ARROW-7783 - [C++] ARROW_DATASET should enable ARROW_COMPUTE ARROW-7785 - [C++] sparse_tensor.cc is extremely slow to compile ARROW-7786 - [R] Wire up check_metadata in Table.Equals method ARROW-7789 - [R] Can’t initialize arrow objects when R.oo package is loaded ARROW-7791 - [C++][Parquet] Fix building error “cannot bind lvalue” ARROW-7792 - [R] read_* functions should close connection to file ARROW-7793 - [Java] If there is a leak the base allocator should release the excess memory to parent before throwing exception ARROW-7794 - [Rust] cargo publish fails for arrow-flight due to relative path to Flight.proto ARROW-7797 - [Release][Rust] Fix arrow-flight’s version in datafusion crate ARROW-7802 - [C++] Support for LargeBinary and LargeString in the hash kernel ARROW-7806 - [Python] Implement to_pandas for lists of LargeBinary/String ARROW-7807 - [R] Installation on RHEL 7 Cannot call io___MemoryMappedFile__Open() ARROW-7809 - [R] vignette does not run on Win 10 nor ubuntu ARROW-7813 - [Rust] Fix undefined behaviour and and remove unsafe ARROW-7815 - [C++] Fix crashes on corrupt IPC input (OSS-Fuzz) ARROW-7827 - [Python] conda-forge pyarrow package does not have s3 enabled ARROW-7832 - [R] Patches to 0.16.0 release ARROW-7836 - [Rust] “allocate_aligned”/”reallocate” need to initialize memory to avoid UB ARROW-7837 - [Java] bug in BaseVariableWidthVector.copyFromSafe results with an index out of bounds exception ARROW-7838 - [C++] Installed plasma-store-server fails finding Boost ARROW-7841 - [C++] HADOOP_HOME doesn’t work to find libhdfs.so ARROW-7844 - [R] array_to_vector is not thread safe ARROW-7848 - Add doc for MapType ARROW-7852 - [Python] 0.16.0 wheels not compatible with older numpy ARROW-7857 - [Python] Failing test with pandas master for extension type conversion ARROW-7861 - [C++][Parquet] Add fuzz regression corpus for parquet reader ARROW-7884 - [C++][Python] Crash in pq.read_table() ARROW-7887 - [Rust] Filter kernel does not support temporal types ARROW-7889 - [Rust] Datafusion CLI does not support registering Parquet files ARROW-7899 - [Integration][Java] null type integration test ARROW-7908 - [R] Can’t install package without setting LIBARROW_DOWNLOAD=true ARROW-7922 - [CI][Crossbow] Nightly macOS wheel builds fail (brew bundle edition) ARROW-7923 - [CI][Crossbow] macOS autobrew fails on homebrew-versions ARROW-7926 - [Developer] “archery lint” target is not ergonomic for running a single check like IWYU ARROW-7928 - [Python] Example of flight server and client not working ARROW-7931 - [C++] Fix crash on corrupt Map array input (OSS-Fuzz) ARROW-7936 - [Python] FileSystem.from_uri test fails on python 3.5 ARROW-7940 - [C++] Unable to generate cmake build with settings other than default ARROW-7958 - [Java] Update Avro to version 1.9.2 ARROW-7962 - [R][Dataset] Followup to “Consolidate Source and Dataset classes” ARROW-7968 - [C++] orc_ep build fails on 64-bit Raspbian ARROW-7973 - [Developer][C++] ResourceWarnings in run_cpplint.py ARROW-7974 - [Developer][C++] ResourceWarning in “make check-format” ARROW-7975 - [C++] Do not include padding bytes in “Buffer” IPC metadata accounting ARROW-7978 - [Developer] GitHub Actions “lint” task is running include-what-you-use and failing ARROW-7980 - [Python] Deserialization with pyarrow fails for certain Timestamp-based data frame ARROW-7981 - [C++][Dataset] Fails to compile on gcc 5.4 ARROW-7985 - [C++] ListBuilder.Finish fails if underlying value builder is empty and .Reserve’d ARROW-7990 - [C++][Developer] Add “archery lint” option for running “iwyu.sh all” ARROW-7992 - [C++] MSVC warning causing Appveyor failure in sort_to_indices.cc ARROW-7996 - [Python] Error serializing empty pandas DataFrame with pyarrow ARROW-7997 - [Python] Schema equals method with inconsistent docs in pyarrow ARROW-7999 - [C++] Fix crash on corrupt Map array input (OSS-Fuzz) ARROW-8000 - [C++] gcc 4.8 build failures ARROW-8003 - [C++] -DBZip2_SOURCE=BUNDLED fails when building with clang ARROW-8006 - [C++] Unsafe arrow dictionary recovered from parquet ARROW-8007 - [Python] Remove unused and defunct assert_get_object_equal in plasma tests ARROW-8008 - [C++/Python] Framework Python is preferred even though not the activated one ARROW-8009 - [Java] Fix the hash code methods for BitVector ARROW-8011 - [C++] Some buffers not resized when reading from Parquet ARROW-8013 - [Python][Packaging] Fix manylinux wheels ARROW-8021 - [Python] Appveyor does not appear to be including pandas in test runs ARROW-8036 - [C++] Compilation failure with gtest 1.10.0 ARROW-8042 - [Python] pyarrow.ChunkedArray docstring is incorrect regarding zero-length ChunkedArray having no chunks ARROW-8057 - [Python] Don’t check Schema metadata in __eq__ and __ne__ ARROW-8070 - [C++] Cast segfaults on unsupported cast from list to utf8 ARROW-8071 - [GLib] Build error with configure ARROW-8088 - [C++][Dataset] Partition columns with specified dictionary type result in all nulls ARROW-8091 - [CI][Crossbow] Fix nightly homebrew and R failures ARROW-8092 - [CI][Crossbow] OSX wheels fail on bundled bzip2 ARROW-8094 - [CI][Crossbow] Nightly valgrind test fails ARROW-8095 - [CI][Crossbow] Nightly turbodbc job fails ARROW-8098 - [go] Checkptr Failures on Go 1.14 ARROW-8101 - [FlightRPC][Java] Can’t read/write only an empty null array ARROW-8102 - [Dev] Crossbow’s version detection doesn’t work in the comment bot’s scenario ARROW-8105 - [Python] pyarrow.array segfaults when passed masked array with shrunken mask ARROW-8106 - [Python] Builds on master broken by pandas 1.0.2 release ARROW-8110 - [C#] BuildArrays fails if NestedType is included ARROW-8112 - [FlightRPC][C++] Some status codes don’t round-trip through gRPC ARROW-8119 - [Dev] Make Yaml optional dependency for archery ARROW-8122 - [Python] Empty numpy arrays with shape cannot be deserialized ARROW-8125 - [C++] “arrow-tests” target broken with ninja build ARROW-8127 - [C++] [Parquet] Incorrect column chunk metadata for multipage batch writes ARROW-8128 - [C#] NestedType children serialized on wrong length ARROW-8132 - [C++] arrow-s3fs-test failing on master ARROW-8133 - [CI] Github Actions sometimes fail to checkout Arrow ARROW-8136 - [C++][Python] Creating dataset from relative path no longer working ARROW-8139 - [C++] FileSystem enum causes attributes warning ARROW-8142 - [C++] Casting a chunked array with 0 chunks critical failure ARROW-8144 - [CI] Cmake 3.2 nightly build fails ARROW-8159 - [Python] pyarrow.Schema.from_pandas doesn’t support ExtensionDtype ARROW-8166 - [C++] AVX512 intrinsics fail to compile with clang-8 on Ubuntu 18.04 ARROW-8176 - [FlightRPC][Integration] Have Flight services bind to port 0 in integration ARROW-8186 - [Python] Dataset expression != returns bool instead of expression for invalid value ARROW-8188 - [R] Adapt to latest checks in R-devel ARROW-8193 - [C++] arrow-future-test fails to compile on gcc 4.8 ARROW-8197 - [Rust] DataFusion “create_physical_plan” returns incorrect schema? ARROW-8206 - [R] Minor fix for backwards compatibility on Linux installation ARROW-8209 - [Python] Accessing duplicate column of Table by name gives wrong error ARROW-8213 - [Python][Dataset] Opening a dataset with a local incorrect path gives confusing error message ARROW-8216 - [R][C++][Dataset] Filtering returns all-missing rows where the filtering column is missing ARROW-8217 - [R][C++] Fix crashing test in test-dataset.R on 32-bit Windows from ARROW-7979 ARROW-8219 - [Rust] sqlparser crate needs to be bumped to version 0.2.5 ARROW-8233 - [CI] Build timeouts on “AMD64 Windows MinGW 64 GLib &amp; Ruby “ ARROW-8237 - [Python] Review Developer build instructions for conda and non-conda users ARROW-8238 - [C++][Compute] Failed to build compute tests on windows with msvc2015 ARROW-8239 - [Java] fix param checks in splitAndTransfer method ARROW-8245 - [Python][Parquet] Skip hidden directories when reading partitioned parquet files ARROW-8255 - [Rust] [DataFusion] COUNT(*) results in confusing error ARROW-8259 - [Rust] [DataFusion] ProjectionPushDownRule does not rewrite LIMIT ARROW-8268 - [Ruby] Test failure due to lack of built ZSTD support ARROW-8269 - [Python] Failure in “nopandas” build in test_parquet_row_group_fragments ARROW-8270 - [Python][Flight] Example Flight server with TLS’s certificate and key is not working ARROW-8272 - [CI][Python] Test failure on Ubuntu 16.04 ARROW-8274 - [C++] Use LZ4 frame format for “LZ4” compression in IPC write ARROW-8276 - [C++][Dataset] Scanning a Fragment does not take into account the partition columns ARROW-8280 - [C++] MinGW builds failing due to CARES-related toolchain issue ARROW-8286 - [Python] Creating dataset from pathlib results in UnionDataset instead of FileSystemDataset ARROW-8298 - [C++][CI] MinGW builds fail building grpc ARROW-8303 - [Python] Fix test failure caused by non-deterministic dict key ordering on Python 3.5 ARROW-8304 - [Flight][Python] Flight client with TLS root certificate is reporting error on do_get() ARROW-8305 - [Java] ExtensionTypeVector should make sure underlyingVector not null ARROW-8310 - [C++] Minio’s exceptions not recognized by IsConnectError() ARROW-8315 - [Python][Dataset] Don’t rely on ordered dict keys in test_dataset.py ARROW-8323 - [C++] Pin gRPC at v1.27 to avoid compilation error in its headers ARROW-8326 - [C++] Don’t use deprecated TYPED_TEST_CASE ARROW-8327 - [FlightRPC][Java] gRPC trailers may be null ARROW-8331 - [C++] arrow-compute-filter-benchmark fails to compile ARROW-8333 - [C++][CI] Always compile benchmarks in some C++ CI entry ARROW-8334 - [C++] [Gandiva] Missing DATE32 in LLVM Types / Simple D32 Compute Functions ARROW-8342 - [Python] dask and kartothek integration tests are failing ARROW-8345 - [Python] feather.read_table should not require pandas ARROW-8346 - [CI][Ruby] GLib/Ruby macOS build fails on zlib ARROW-8349 - [CI][NIGHTLY:gandiva-jar-osx] Use latest pygit2 ARROW-8353 - [C++] is_nullable maybe not initialized in parquet writer ARROW-8354 - [R] Fix segfault in Table to Array conversion ARROW-8357 - [Rust] [DataFusion] Dockerfile for CLI is missing format dir ARROW-8358 - [C++] Fix -Wrange-loop-construct warnings in clang-11 ARROW-8360 - [C++][Gandiva] Fixes date32 support for date/time functions ARROW-8365 - [C++] Error when writing files to S3 larger than 5 GB ARROW-8366 - [Rust] Need to revert recent arrow-flight build change ARROW-8369 - [CI] Fix crossbow wildcard groups ARROW-8373 - [GLib] Problems resolving gobject-introspection, arrow in Meson builds ARROW-8380 - [RUST] StringDictionaryBuilder not publicly exported from arrow::array ARROW-8384 - [C++][Python] arrow/filesystem/hdfs.h and Python wrapper does not have an option for setting a path to a Kerberos ticket ARROW-8386 - [Python] pyarrow.jvm raises error for empty Arrays ARROW-8388 - [C++] GCC 4.8 fails to move on return ARROW-8397 - [C++] Fail to compile aggregate_test.cc on Ubuntu 16.04 ARROW-8406 - [Python] test_fs fails when run from a different drive on Windows ARROW-8410 - [C++] CMake fails on aarch64 systems that do not support -march=armv8-a+crc+crypto ARROW-8414 - [Python] Non-deterministic row order failure in test_parquet.py ARROW-8415 - [C++][Packaging] fix gandiva linux job ARROW-8416 - [Python] Provide a “feather” alias in the dataset API ARROW-8420 - [C++] CMake fails to configure on armv7l platform (e.g. Raspberry Pi 3) ARROW-8427 - [C++][Dataset] Do not ignore file paths with underscore/dot when full path was specified ARROW-8428 - [C++][NIGHTLY:gandiva-jar-trusty] GCC 4.8 failures in C++ unit tests ARROW-8429 - [C++] Fix Buffer::CopySlice on 0-sized buffer ARROW-8432 - [Python][CI] Failure to download Hadoop ARROW-8437 - [C++] Remove std::move return value from MakeRandomNullBitmap test utility ARROW-8438 - [C++] arrow-io-memory-benchmark crashes ARROW-8439 - [Python] Filesystem docs are outdated ARROW-8441 - [C++] Fix crashes on invalid input (OSS-Fuzz) ARROW-8442 - [Python] NullType.to_pandas_dtype inconsisent with dtype returned in to_pandas/to_numpy ARROW-8460 - [Packaging][deb] Ubuntu Focal build is failed ARROW-8465 - [Packaging][Python] Windows py35 wheel build fails because of boost ARROW-8466 - [Packaging] The python unittests are not running in the windows wheel builds ARROW-8468 - [Document] Fix the incorrect null bits description ARROW-8469 - [Dev] Fix nightly docker tests on azure ARROW-8478 - [Java] Rollback contrib package changes. ARROW-8498 - [Python] Schema.from_pandas fails on extension type, while Table.from_pandas works ARROW-8511 - [Developer][Release] Windows release verification script does not halt if C++ compilation fails PARQUET-1780 - [C++] Set ColumnMetadata.encoding_stats field PARQUET-1788 - [C++] ColumnWriter has undefined behavior when writing arrow chunks PARQUET-1797 - [C++] Fix fuzzing errors PARQUET-1799 - [C++] Stream API: Relax schema checking when reading PARQUET-1810 - [C++] Fix undefined behaviour on invalid enum values (OSS-Fuzz) PARQUET-1813 - [C++] Remove logging statement in unit test PARQUET-1819 - [C++] Fix crashes on corrupt IPC input (OSS-Fuzz) PARQUET-1823 - [C++] Invalid RowGroup returned when reading with parquet::arrow::FileReader-&gt;RowGroup(i)-&gt;Column(j) PARQUET-1829 - [C++] Fix crashes on invalid input (OSS-Fuzz) PARQUET-1831 - [C++] Fix crashes on invalid input (OSS-Fuzz) PARQUET-1835 - [C++] Fix crashes on invalid input (OSS-Fuzz)" />
<meta property="og:description" content="Apache Arrow 0.17.0 (20 April 2020) This is a major release covering more than 2 months of development. Download Source Artifacts Binary Artifacts For CentOS For Debian For Python For Ubuntu Git tag Contributors This release includes 569 commits from 79 distinct contributors. $ git shortlog -sn apache-arrow-0.16.0..apache-arrow-0.17.0 80 Antoine Pitrou 78 Krisztián Szűcs 58 Wes McKinney 55 Neal Richardson 39 Sutou Kouhei 30 Benjamin Kietzman 26 Joris Van den Bossche 20 Andy Grove 13 liyafan82 11 David Li 11 François Saint-Jacques 8 Micah Kornfield 8 Projjal Chanda 7 Max Burke 7 Prudhvi Porandla 6 Kenta Murata 6 Uwe L. Korn 6 Yibo Cai 6 tianchen 5 Frank Du 5 Martin Radev 4 Rok 3 Paddy Horan 3 Takashi Hashida 3 Yosuke Shiro 3 Zhuo Peng 2 Bryan Cutler 2 David Kegley 2 Jacek Pliszka 2 Ji Kunshang 2 Jorge C. Leitao 2 Jörn Horstmann 2 Kazuaki Ishizaki 2 Markus Westerlind 2 Morgan Cassels 2 Ryan Murray 2 Sam Albers 2 Siyuan 2 Stephanie Wang 2 Yuqi Gu 2 gawain.bolton 1 Adam Hooper 1 Andreas Zimmerer 1 Bryant Biggs 1 Chao Sun 1 Dan Nugent 1 Dominik Durner 1 Eyal Farago-Hagag 1 Grant Monroe 1 Hatem Helal 1 Ismaël Mejía 1 John Muehlhausen 1 Ken Suenobu 1 Kevin Conaway 1 Li, Jiajia 1 Mahmut Bulut 1 Marius van Niekerk 1 Michael Chirico 1 NAITOH Jun 1 Neville Dipale 1 Omega Gamage 1 Onur Satici 1 Palmer Lao 1 Paul Balanca 1 Prudhvi 1 Ravindra Wagh 1 Richard Artoul 1 Sebastien Binet 1 TP Boudreau 1 Tarek Allam 1 Tobias Mayer 1 Yuan Zhou 1 Zherui Cao 1 andy-thomason 1 caleboverman 1 mubai 1 psuman 1 ravindra-wagh 1 wenjun.swj Patch Committers The following Apache committers merged contributed patches to the repository. $ git shortlog -csn apache-arrow-0.16.0..apache-arrow-0.17.0 128 Wes McKinney 109 Krisztián Szűcs 86 Antoine Pitrou 61 Neal Richardson 60 Sutou Kouhei 33 Benjamin Kietzman 23 Andy Grove 14 François Saint-Jacques 10 Micah Kornfield 7 Paddy Horan 7 Praveen 5 David Li 5 Sebastien Binet 3 GitHub 3 Joris Van den Bossche 3 Neville Dipale 3 Pindikura Ravindra 3 Yosuke Shiro 2 Bryan Cutler 2 Philipp Moritz 1 Brian Hulette 1 Chao Sun Changelog New Features and Improvements ARROW-1560 - [C++] Kernel implementations for “match” function ARROW-1571 - [C++] Implement argsort kernels (sort indices) for integers using O(n) counting sort ARROW-1581 - [Packaging] Tooling to make nightly wheels available for install ARROW-1582 - [Python] Set up + document nightly conda builds for macOS ARROW-1636 - [Format] Integration tests for null type ARROW-2447 - [C++] Create a device abstraction ARROW-3054 - [Packaging] Tooling to enable nightly conda packages to be updated to some anaconda.org channel ARROW-3410 - [C++][Dataset] Streaming CSV reader interface for memory-constrainted environments ARROW-3750 - [R] Pass various wrapped Arrow objects created in Python into R with zero copy via reticulate ARROW-4120 - [Python] Define process for testing procedures that check for no macro-level memory leaks ARROW-4226 - [Format][C++] Add CSF sparse tensor support ARROW-4304 - [Rust] Enhance documentation for arrow ARROW-4428 - [R] Feature flags for R build ARROW-4482 - [Website] Add blog archive page ARROW-4815 - [Rust] [DataFusion] Add support for * in SQL projection ARROW-5357 - [Rust] Add capacity field in Buffer ARROW-5405 - [Documentation] Move integration testing documentation to Sphinx docs, add instructions for JavaScript ARROW-5497 - [Release] Build and publish R/Java/JS docs ARROW-5501 - [R] Reorganize read/write file/stream functions ARROW-5510 - [Format] Feather V2 based on Arrow IPC file format, with compression support ARROW-5563 - [Format] Update integration test JSON format documentation ARROW-5585 - [Go] rename arrow.TypeEquals into arrow.TypeEqual ARROW-5742 - [CI] Add daily / weekly Valgrind build ARROW-5757 - [Python] Stop supporting Python 2.7 ARROW-590 - [Integration] Add integration tests for Union types ARROW-5949 - [Rust] Implement DictionaryArray ARROW-6165 - [Integration] Use multiprocessing to run integration tests on multiple CPU cores ARROW-6176 - [Python] Allow to subclass ExtensionArray to attach to custom extension type ARROW-6393 - [C++] Add EqualOptions support in SparseTensor::Equals ARROW-6479 - [C++] inline errors from external projects’ build logs ARROW-6510 - [Python][Filesystem] Expose nanosecond resolution mtime ARROW-6666 - [Rust] [DataFusion] Implement string literal expression ARROW-6724 - [C++] Add simpler static ctor for BufferOutputStream than the current Create function ARROW-6821 - [C++][Parquet] Do not require Thrift compiler when building (but still require library) ARROW-6823 - [C++][Python][R] Support metadata in the feather format? ARROW-6837 - [C++/Python] access File Footer custom_metadata ARROW-6841 - [C++] Upgrade to LLVM 8 ARROW-6875 - [FlightRPC] Implement Criteria for ListFlights RPC / list_flights method ARROW-6915 - [Developer] Do not overwrite minor release version with merge script, even if not specified by committer ARROW-6947 - [Rust] [DataFusion] Add support for scalar UDFs ARROW-6996 - [Python] Expose boolean filter kernel on Table ARROW-7044 - [Release] Create a post release script for the home-brew formulas ARROW-7048 - [Java] Support for combining multiple vectors under VectorSchemaRoot ARROW-7063 - [C++] Schema print method prints too much metadata ARROW-7073 - [Java] Support concating vectors values in batch ARROW-7080 - [Python][Parquet][C++] Expose parquet field_id in Schema objects ARROW-7091 - [C++] Move all factories to type_fwd.h ARROW-7119 - [C++][CI] Use scripts/util_coredump.sh to show automatic backtraces ARROW-7201 - [GLib][Gandiva] Add support for BooleanNode ARROW-7202 - [R][CI] Improve rwinlib building on CI to stop re-downloading dependencies ARROW-7222 - [Python][Release] Wipe any existing generated Python API documentation when updating website ARROW-7233 - [C++] Add Result APIs to IPC module ARROW-7256 - [C++] Remove ARROW_MEMORY_POOL_DEFAULT macro ARROW-7330 - [C++] Add Result to APIs to arrow/gpu ARROW-7332 - [C++][Parquet] Explicitly catch status exceptions in PARQUET_CATCH_NOT_OK ARROW-7336 - [C++] Implement MinMax options to not skip nulls ARROW-7338 - [C++] Improve InMemoryDataSource to support generator instead of static list ARROW-7365 - [Python] Support FixedSizeList type in conversion to numpy/pandas ARROW-7400 - [Java] Avoids the worst case for quick sort ARROW-7412 - [C++][Dataset] Ensure that dataset code is robust to schemas with duplicate field names ARROW-7419 - [Python] Support SparseCSCMatrix ARROW-7427 - [Python] Support SparseCSFTensor ARROW-7428 - [Format][C++] Add serialization for CSF sparse tensors ARROW-7444 - [GLib] Add LocalFileSystem support ARROW-7462 - [C++] Add CpuInfo detection for Arm64 Architecture ARROW-7491 - [Java] Improve the performance of aligning ARROW-7515 - [C++] Rename nonexistent and non_existent to not_found ARROW-7524 - [C++][CI] Build parquet support in the VS2019 GitHub Actions job ARROW-7530 - [Developer] Do not include list of commits from PR in squashed summary message ARROW-7534 - [Java] Create a new java/contrib module ARROW-7547 - [C++] [Python] [Dataset] Additional reader options in ParquetFileFormat ARROW-7587 - [C++][Compute] Add Top-k kernel ARROW-7608 - [C++][Dataset] Expose more informational properties ARROW-7615 - [CI][Gandiva] Ensure that the gandiva_jni library has only a whitelisted set of shared dependencies as part of Travis CI job ARROW-7616 - [Java] Support comparing value ranges for dense union vector ARROW-7625 - [GLib] Parquet GLib and Red Parquet (Ruby) do not allow specifying compression type ARROW-7641 - [R] Make dataset vignette have executable code ARROW-7662 - [R] Support creating ListArray from R list ARROW-7664 - [C++] Extract localfs default from FileSystemFromUri ARROW-7675 - [R][CI] Move Windows CI from Appveyor to GHA ARROW-7679 - [R] Cleaner interface for creating UnionDataset ARROW-7684 - [Rust] Provide example of Flight server for DataFusion ARROW-7685 - [Developer] Add support for GitHub Actions to Crossbow ARROW-7691 - [C++] Verify missing fields when walking Flatbuffers data ARROW-7708 - [Release] Include PARQUET commits from git changelog in release changelogs ARROW-7712 - [CI][Crossbow] Fix or delete fuzzit jobs ARROW-7720 - [C++][Python] Add check_metadata argument to Table.equals ARROW-7725 - [C++] Add infrastructure for unity builds and precompiled headers ARROW-7726 - [CI] [C++] Use boost binaries on Windows GHA build ARROW-7729 - [Python][CI] Pin pandas version to 0.25 in the dask integration test ARROW-7733 - [Developer] Install locally a new enough version of Go for release verification script ARROW-7735 - [Release] conda-forge channel is missing for verifying wheels ARROW-7736 - [Release] Binary verification sometimes fails with transient error ARROW-7739 - [GLib] Use placement new to initialize shared_ptr object in private structs ARROW-7741 - [C++][Parquet] Incorporate new level generation logic in parquet write path with a flag to revert back to old logic ARROW-7742 - [GLib] Add support for MapArray ARROW-7745 - [Doc] [C++] Update Parquet documentation ARROW-7749 - [C++] Link some more tests together ARROW-7750 - [Release] Make the source release verification script restartable ARROW-7751 - [Release] macOS wheel verification also needs arrow-testing ARROW-7752 - [Release] Enable and test dataset in the verification script ARROW-7754 - [C++] Result is slow ARROW-7761 - [C++] Add S3 support to fs::FileSystemFromUri ARROW-7764 - [C++] Builders allocate a null bitmap buffer even if there is no nulls ARROW-7771 - [Developer] Use ARROW_TMPDIR environment variable in the verification scripts instead of TMPDIR ARROW-7774 - [Packaging][Python] Update macos and windows wheel filenames ARROW-7787 - [Rust] Add collect to Table API ARROW-7788 - [C++] Add schema conversion support for map type ARROW-7790 - [Website] Update how to install Linux packages ARROW-7795 - [Rust - DataFusion] Support boolean negation (NOT) ARROW-7796 - [R] write_* functions should invisibly return their inputs ARROW-7799 - [R][CI] Remove flatbuffers from homebrew formulae ARROW-7804 - [C++][R] Compile error on macOS 10.11 ARROW-7812 - [Packaging][Python] Upgrade LLVM in manylinux1 docker image ARROW-7817 - [CI] macOS R autobrew nightly failed on installing dependency from source ARROW-7819 - [C++][Gandiva] Add DumpIR to Filter/Projector classes ARROW-7824 - [C++][Dataset] Provide Dataset writing to IPC format ARROW-7828 - [Release] Remove SSH keys for internal use ARROW-7829 - [R] Test R bindings on clang ARROW-7833 - [R] Make install_arrow() actually install arrow ARROW-7834 - [Release] Post release task for updating the documentations ARROW-7839 - [Python][Dataset] Add IPC format to python bindings ARROW-7846 - [Python][Dev] Remove last dependencies on six ARROW-7849 - [Packaging][Python] Remove the remaining py27 crossbow wheel tasks from the nightlies ARROW-7858 - [C++][Python] Support casting an Extension type to its storage type ARROW-7859 - [R] Minor patches for CRAN submission 0.16.0.2 ARROW-7862 - [R] Linux installation should run quieter by default ARROW-7863 - [C++][Python][CI] Ensure running HDFS related tests ARROW-7864 - [R] Make sure bundled installation works even if there are system packages ARROW-7865 - [R] Test builds on latest Linux versions ARROW-7868 - [Crossbow] Reduce GitHub API query parallelism ARROW-7869 - [Python] Boost::system and boost::filesystem not necessary anymore in Python wheels ARROW-7872 - [Python] Support conversion of list-of-struct in Array/Table.to_pandas ARROW-7874 - [Python][Archery] Validate docstrings with numpydoc ARROW-7876 - [R] Installation fails in the documentation generation image ARROW-7877 - [Packaging] Fix crossbow deployment to github artifacts ARROW-7879 - [C++][Doc] Add doc for the Device API ARROW-7880 - [CI][R] R sanitizer job is not really working ARROW-7881 - [C++] Fix pedantic warnings ARROW-7882 - [C++][Gandiva] Optimise like function for substring pattern ARROW-7886 - [C++][Dataset] Consolidate Source and Dataset ARROW-7888 - [Python] Allow using a more modern version of jpype in pyarrow.jvm ARROW-7890 - [C++] Add Promise / Future implementation ARROW-7891 - [C++] RecordBatch-&gt;Equals should also have a check_metadata argument ARROW-7892 - [Python] Expose FilesystemSource.format attribute ARROW-7895 - [Python] Remove more python 2.7 cruft ARROW-7896 - [C++] Refactor from #include guards to #pragma once ARROW-7897 - [Packaging] Temporarily disable artifact uploading until we fix the deployment issues ARROW-7898 - [Python] Reduce the number docstring violations using numpydoc ARROW-7904 - [C++] Decide about Field/Schema metadata printing parameters and how much to show by default ARROW-7907 - [Python] Conversion to pandas of empty table with timestamp type aborts ARROW-7912 - [Format] C data interface ARROW-7913 - [C++][Python][R] C++ implementation of C data interface ARROW-7915 - [CI] [Python] Run tests with Python development mode enabled ARROW-7916 - [C++][Dataset] Project IPC record batches to materialized fields ARROW-7917 - [CMake] FindPythonInterp should check for python3 ARROW-7919 - [R] install_arrow() should conda install if appropriate ARROW-7920 - [R] Fill in some missing input validation ARROW-7921 - [Go] Add Reset method to various components and clean up comments ARROW-7927 - [C++] Fix ‘cpu_info.cc’ compilation warning ARROW-7929 - [C++] CMake target names differ from upstream provided names ARROW-7930 - [Python][CI] Test jpype integration in CI ARROW-7932 - [Rust] [Parquet] Implement array reader for temporal types ARROW-7934 - [C++] Fix UriEscape for empty string ARROW-7935 - [Java] Remove Netty dependency for BufferAllocator and ReferenceManager ARROW-7937 - [Python][Packaging] Remove boost from the macos wheels ARROW-7941 - [Rust] [DataFusion] Logical plan should support unresolved column references ARROW-7943 - [C++][Parquet] Add a new level builder capable of handling nested data ARROW-7947 - [Rust] [Flight] [DataFusion] Implement example for get_schema ARROW-7949 - [Developer] Update to ‘.gitignore’ to not track user specific ‘cpp/Brewfile.lock.json’ file ARROW-7951 - [Python][Parquet] Expose BYTE_STREAM_SPLIT to pyarrow ARROW-7959 - [Ruby] Add support for Ruby 2.3 again ARROW-7963 - [C++][Python][Dataset] Expose listing fragments ARROW-7965 - [Python] Refine higher level dataset API ARROW-7966 - [Integration][Flight][C++] Client should verify each batch independently ARROW-7969 - [Packaging] Use cURL to upload artifacts ARROW-7970 - [Packaging][Python] Use system boost to build the macos wheels ARROW-7971 - [Rust] Create rowcount utility ARROW-7977 - [C++] Rename fs::FileStats to fs::FileInfo ARROW-7979 - [C++] Implement experimental buffer compression in IPC messages ARROW-7982 - [C++] Let ArrayDataVisitor accept void-returning functions ARROW-7983 - [CI][R] Nightly builds should be more verbose when they fail ARROW-7984 - [R] Check for valid inputs in more places ARROW-7986 - [Python] pa.Array.from_pandas cannot convert pandas.Series containing pyspark.ml.linalg.SparseVector ARROW-7987 - [CI][R] Fix for verbose nightly builds ARROW-7988 - [R] Fix on.exit calls in reticulate bindings ARROW-7991 - [C++][Plasma] Allow option for evicting if full when creating an object ARROW-7993 - [Java] Support decimal type in ComplexCopier ARROW-7994 - [CI][C++] Move AppVeyor MinGW builds to GitHub Actions ARROW-7995 - [C++] IO: coalescing and caching read ranges ARROW-7998 - [C++][Plasma] Make Seal requests synchronous ARROW-8005 - [Website] Review and adjust any usages of Apache dist system from website / tools ARROW-8014 - [C++] Provide CMake targets to test only within a given label ARROW-8016 - [Developer] Fix deprecation warning in PR merge tool ARROW-8018 - [C++][Parquet]Parquet Modular Encryption ARROW-8024 - [R] Bindings for BinaryType and FixedBinaryType ARROW-8026 - [Python] Support memoryview in addition to string value types for constructing string and binary type arrays ARROW-8027 - [Developer][Integration] Add integration tests for duplicate field names ARROW-8028 - [Go] Allow duplicate field names in schemas and nested types ARROW-8030 - [C++][Plasma] Fix inconsistent comment style ARROW-8039 - [Python][Dataset] Support using dataset API in pyarrow.parquet with a minimal ParquetDataset shim ARROW-8044 - [CI][NIGHTLY:gandiva-jar-osx] pygit2 needs libgit2 v1.0.x ARROW-8055 - [GLib][Ruby] Add some metadata bindings to GArrowSchema ARROW-8058 - [C++][Python][Dataset] Provide an option to toggle validation and schema inference in FileSystemDatasetFactoryOptions ARROW-8059 - [Python] Make FileSystem objects serializable ARROW-8060 - [Python] Make dataset Expression objects serializable ARROW-8061 - [C++][Dataset] Ability to specify granularity of ParquetFileFragment (support row groups) ARROW-8063 - [Python] Add user guide documentation for Datasets API ARROW-8064 - [Dev] Implement Comment bot via Github actions ARROW-8072 - [C++][Plasma] Add const constraint when parsing data ARROW-8077 - [Python] Add wheel build script and Crossbow configuration for Windows on Python 3.5 ARROW-8079 - [Python] Implement a wrapper for KeyValueMetadata, duck-typing dict where relevant ARROW-8080 - [C++] Add AVX512 build option ARROW-8082 - [Java][Plasma] Add JNI list() interface ARROW-8083 - [GLib] Add support for Peek() to GIOInputStream ARROW-8086 - [Java] Support writing decimal from big endian byte array in UnionListWriter ARROW-8087 - [C++][Dataset] Order of keys with HivePartitioning is lost in resulting schema ARROW-8096 - [C++][Gandiva] Create null node of Interval type ARROW-8097 - [Dev] Comment bot’s crossbow command acts on the master branch ARROW-8103 - [R] Make default Linux build more minimal ARROW-8104 - [C++] Don’t install bundled Thrift ARROW-8107 - [Packaging][APT] Use HTTPS for LLVM APT repository for Debian GNU/Linux stretch ARROW-8109 - [Packaging][APT] Drop support for Ubuntu Disco ARROW-8117 - [Rust] [Datafusion] Allow CAST from number to timestamp ARROW-8118 - [R] dim method for FileSystemDataset ARROW-8120 - [Packaging][APT] Add support for Ubuntu Focal ARROW-8123 - [Rust] [DataFusion] Create LogicalPlanBuilder ARROW-8124 - [Rust] Update library dependencies ARROW-8126 - [C++][Compute] Add Top-K kernel benchmark ARROW-8129 - [C++][Compute] Refine compare sorting kernel ARROW-8130 - [C++][Gandiva] Fix Dex visitor in llvm_generator to handle interval type ARROW-8140 - [Developer] Follow NullType -&gt; NullField change ARROW-8141 - [C++] Optimize BM_PlainDecodingBoolean performance using AVX512 Intrinsics API ARROW-8145 - [C++] Rename GetTargetInfos ARROW-8146 - [C++] Add per-filesystem facility to sanitize a path ARROW-8150 - [Rust] Allow writing custom FileMetaData k/v pairs ARROW-8151 - [Benchmarking][Dataset] Benchmark Parquet read performance with S3File ARROW-8153 - [Packaging] Update the conda feedstock files and upload artifacts to Anaconda ARROW-8158 - [Java] Getting length of data buffer and base variable width vector ARROW-8164 - [C++][Dataset] Let datasets be viewable with non-identical schema ARROW-8165 - [Packaging] Make nightly wheels available on a PyPI server ARROW-8167 - [CI] Add support for skipping builds with skip pattern in pull request title ARROW-8168 - [Java][Plasma] Improve Java Plasma client off-heap memory usage ARROW-8177 - [Rust] Make schema_to_fb_offset public ARROW-8178 - [C++] Upgrade to Flatbuffers 1.12 ARROW-8179 - [R] Windows build script tweaking for nightly packaging on GHA ARROW-8181 - [Java][FlightRPC] Expose transport error metadata ARROW-8182 - [Packaging] Increment the version number detected from the latest git tag ARROW-8183 - [c++][FlightRPC] Expose transport error metadata ARROW-8184 - [Packaging] Use arrow-nightlies organization name on Anaconda and Gemfury to host the nightlies ARROW-8185 - [Packaging] Document the available nightly wheels and conda packages ARROW-8187 - [R] Make test assertions robust to i18n ARROW-8191 - [Packaging][APT] Fix cmake removal in Debian GNU/Linux Stretch ARROW-8192 - [C++] script for unpack avx512 intrinsics code ARROW-8194 - [CI] Github Actions Windows job should run tests in parallel ARROW-8195 - [CI] Remove Boost download step in Github Actions ARROW-8198 - [C++] Diffing should handle null arrays ARROW-8200 - [GLib] Rename garrow_file_system_target_info{,s}() to …_file_info{,s}() ARROW-8203 - [C#] “dotnet pack” is failed ARROW-8204 - [Rust] [DataFusion] Add support for aliased expressions in SQL ARROW-8207 - [Packaging][wheel] Use LLVM 8 in manylinux2010 and manylinux2014 ARROW-8215 - [CI][GLib] Meson install fails in the macOS build ARROW-8218 - [C++] Parallelize decompression at field level in experimental IPC compression code ARROW-8220 - [Python] Make dataset FileFormat objects serializable ARROW-8222 - [C++] Use bcp to make a slim boost for bundled build ARROW-8224 - [C++] Remove APIs deprecated prior to 0.16.0 ARROW-8225 - [Rust] IPC reader must respect continuation markers ARROW-8227 - [C++] Refine SIMD feature definitions ARROW-8231 - [Rust] Parse key_value_metadata from parquet FileMetaData into arrow schema metadata ARROW-8232 - [Python] Deprecate pa.open_file and pa.open_stream in favor of pa.ipc.open_file/open_stream ARROW-8241 - [Rust] Add convenience methods to Schema ARROW-8242 - [C++] Flight fails to compile on GCC 4.8 ARROW-8243 - [Rust] [DataFusion] Fix inconsistent API in LogicalPlanBuilder ARROW-8244 - [Python][Parquet] Add `write_to_dataset` option to populate the “file_path” metadata fields ARROW-8246 - [C++] Add -Wa,-mbig-obj when compiling with MinGW to avoid linking errors ARROW-8247 - [Python] Expose Parquet writing “engine” setting in pyarrow.parquet.write_table ARROW-8249 - [Rust] [DataFusion] Make Table and LogicalPlanBuilder APIs more consistent ARROW-8252 - [CI][Ruby] Add Ubuntu 20.04 ARROW-8256 - [Rust] [DataFusion] Update CLI documentation for 0.17.0 release ARROW-8264 - [Rust] [DataFusion] Create utility for printing record batches ARROW-8266 - [C++] Add backup mirrors for external project source downloads ARROW-8267 - [CI][GLib] Failed to build on Ubuntu 16.04 ARROW-8271 - [Packaging] Allow wheel upload failures to gemfury ARROW-8275 - [Python][Docs] Review Feather + IPC file documentation per “Feather V2” changes ARROW-8277 - [Python] RecordBatch interface improvements ARROW-8279 - [C++] Do not export symbols from Codec implementations, remove need for PIMPL pattern ARROW-8288 - [Python] Expose with_ modifiers on DataType ARROW-8290 - [Python][Dataset] Improve ergonomy of the FileSystemDataset constructor ARROW-8291 - [Packaging] Conda nightly builds can’t locate Numpy ARROW-8292 - [Python][Dataset] Passthrough schema to Factory.finish() in dataset() function ARROW-8294 - [Format][Flight] Add DoExchange RPC to Flight protocol ARROW-8295 - [C++][Dataset] IpcFileFormat should expliclity push down column projection ARROW-8299 - [C++] Reusable “optional ParallelFor” function for optional use of multithreading ARROW-8300 - [R] Documentation and changelog updates for 0.17 ARROW-8307 - [Python] Expose use_memory_map option in pyarrow.feather APIs ARROW-8308 - [Rust] [Flight] Implement DoExchange on examples ARROW-8309 - [CI] C++/Java/Rust workflows should trigger on changes to Flight.proto ARROW-8311 - [C++] Add push style stream format reader ARROW-8316 - [CI] Set docker-compose to use docker-cli instead of docker-py for building images ARROW-8319 - [CI] Install thrift compiler in the debian build ARROW-8320 - [Documentation][Format] Clarify (lack of) alignment requirements in C data interface ARROW-8321 - [CI] Use bundled thrift in Fedora 30 build ARROW-8322 - [CI] Fix C# workflow file syntax ARROW-8325 - [R][CI] Stop including boost in R windows bundle ARROW-8329 - [Documentation][C++] Undocumented FilterOptions argument in Filter kernel ARROW-8330 - [Documentation] The post release script generates the documentation with a development version ARROW-8332 - [C++] Require Thrift compiler to use system libthrift for Parquet build ARROW-8335 - [Release] Add crossbow jobs to run release verification ARROW-8336 - [Packaging][deb] Use libthrift-dev on Debian 10 and Ubuntu 19.10 or later ARROW-8341 - [Packaging][deb] Fail to build by no disk space ARROW-8343 - [GLib] Add GArrowRecordBatchIterator ARROW-8347 - [C++] Add Result APIs to Array methods ARROW-8351 - [R][CI] Store the Rtools-built Arrow C++ library as a build artifact ARROW-8352 - [R] Add install_pyarrow() ARROW-8356 - [Developer] Support * wildcards with “crossbow submit” via GitHub actions ARROW-8361 - [C++] Add Result APIs to Buffer methods and functions ARROW-8362 - [Crossbow] Ensure that the locally generated version is used in the docker tasks ARROW-8367 - [C++] Deprecate Buffer::FromString(…, pool) ARROW-8368 - [Format] In C interface, clarify resource management for consumers needing only a subset of child fields in ArrowArray ARROW-8370 - [C++] Add Result to type / schema APIs ARROW-8371 - [Crossbow] Implement and exercise sanity checks for tasks.yml ARROW-8372 - [C++] Add Result to table / record batch APIs ARROW-8375 - [CI][R] Make Windows tests more verbose in case of segfault ARROW-8376 - [R] Add experimental interface to ScanTask/RecordBatch iterators ARROW-8387 - [Rust] Make schema_to_fb public ARROW-8389 - [Integration] Run tests in parallel ARROW-8390 - [R] Expose schema unification features ARROW-8393 - [C++][Gandiva] Make gandiva function registry case-insensitive ARROW-8396 - [Rust] Remove libc from dependencies ARROW-8398 - [Python] Remove deprecation warnings originating from python tests ARROW-8401 - [C++] Add AVX2/AVX512 version of ByteStreamSplitDecode/ByteStreamSplitEncode ARROW-8403 - [C++] Add ToString() to ChunkedArray, Table and RecordBatch ARROW-8407 - [Rust] Add rustdoc for Dictionary type ARROW-8408 - [Python] Add memory_map= toggle to pyarrow.feather.read_feather ARROW-8409 - [R] Add arrow::cpu_count, arrow::set_cpu_count wrapper functions a la Python ARROW-8412 - [C++][Gandiva] Fix gandiva date_diff function definitions ARROW-8433 - [R] Add feather alias for ipc format in dataset API ARROW-8443 - [Gandiva][C++] Fix round/truncate to no-op for special cases ARROW-8444 - [Documentation] Fix spelling errors across the codebase ARROW-8449 - [R] Use CMAKE_UNITY_BUILD everywhere ARROW-8450 - [Integration][C++] Implement large list/binary/utf8 integration ARROW-8457 - [C++] bridge test does not take care of endianness ARROW-8458 - [C++] Prefer the original mirrors for the bundled thirdparty dependencies ARROW-8461 - [Packaging][deb] Use zstd package for Ubuntu Xenial ARROW-8463 - [CI] Balance the nightly test builds between CircleCI, Azure and Github ARROW-8467 - [C++] Test cases using ArrayFromJSON assume only a little-endian platform ARROW-8477 - [C++] Enable reading and writing of long filenames for Windows ARROW-8496 - [C++] Refine ByteStreamSplitDecodeScalar ARROW-8499 - [C++][Dataset] In ScannerBuilder, batch_size will not work if projecter is not empty ARROW-8506 - [c++] Miss tests to verify expected_buffer with bit_width &gt; 8 in RLE ARROW-8515 - [C++] Bitmap ToString should have an option of grouping by bytes ARROW-8517 - [Developer][Release] Update Crossbow RC verification setup for changes since 0.16.0 ARROW-8523 - [C++] Optimize BitmapReader PARQUET-1663 - [C++] Provide API to check the presence of complex data types PARQUET-1716 - [C++] Add support for BYTE_STREAM_SPLIT encoding PARQUET-1770 - [C++][CI] Add fuzz target for reading Parquet files PARQUET-1785 - [C++] Improve code reusability in encoding-test.cc PARQUET-1786 - [C++] Use simd to improve BYTE_STREAM_SPLIT decoding performance PARQUET-1806 - [C++] [CI] Improve fuzzing seed corpus PARQUET-1825 - [C++] Fix compilation error in column_io_benchmark.cc PARQUET-1828 - [C++] Add a SSE2 path for the ByteStreamSplit encoder implementation PARQUET-1840 - [C++] DecodeSpaced copies more values then necessary PARQUET-458 - [C++] Implement support for DataPageV2 Bug Fixes ARROW-1907 - [C++/Python] Feather format cannot accommodate string columns containing more than a total of 2GB of data ARROW-2255 - [Developer][Integration] Serialize schema- and field-level custom metadata in integration test JSON format ARROW-2587 - [Python] Unable to write StructArrays with multiple children to parquet ARROW-3004 - [Documentation] Builds docs for master rather than a pinned commit ARROW-3543 - [R] Better support for timestamp format and time zones in R ARROW-5265 - [Python/CI] Add integration test with kartothek ARROW-5473 - [C++] Build failure on googletest_ep on Windows when using Ninja ARROW-5981 - [C++] DictionaryBuilder initialization with Array can fail silently ARROW-6528 - [C++] Spurious Flight test failures (port allocation failure) ARROW-6738 - [Java] Fix problems with current union comparison logic ARROW-6757 - [Python] Creating csv.ParseOptions() causes “Windows fatal exception: access violation” with Visual Studio 2017 ARROW-6871 - [Java] Enhance TransferPair related parameters check and tests ARROW-6872 - [C++][Python] Empty table with dictionary-columns raises ArrowNotImplementedError ARROW-6895 - [C++][Parquet] parquet::arrow::ColumnReader: ByteArrayDictionaryRecordReader repeats returned values when calling `NextBatch()` ARROW-7008 - [Python] pyarrow.chunked_array([array]) fails on array with all-None buffers ARROW-7049 - [C++] warnings building on mingw-w64 ARROW-7301 - [Java] Sql type DATE should correspond to DateDayVector ARROW-7335 - [C++][Gandiva] Add castBIGINT, extractDay interval_day functions in Gandiva ARROW-7390 - [C++][Dataset] Concurrency race in Projector::Project ARROW-7405 - [Java] ListVector isEmpty API is incorrect ARROW-7466 - [CI][Java] Fix gandiva-jar-osx nightly build failure ARROW-7467 - [Java] ComplexCopier does incorrect copy for Map nullable info ARROW-7520 - [R] Writing many batches causes a crash ARROW-7546 - [Java] Use new implementation to concat vectors values in batch ARROW-7624 - [Rust] Soundness issues via `Buffer` methods ARROW-7628 - [Python] Better document some read_csv corner cases ARROW-7631 - [C++][Gandiva] return zero if there is an overflow while converting a decimal to a lower precision/scale ARROW-7680 - [C++][Dataset] Partition discovery is not working with windows path ARROW-7701 - [C++] [CI] Flight test error on macOS ARROW-7713 - [Java] TastLeak was put at the wrong location ARROW-7722 - [Java][FlightRPC] Memory leak ARROW-7734 - [C++] Segfault when comparing status with and without detail ARROW-7740 - [C++] Array internals corruption in StructArray::Flatten ARROW-7755 - [Python] Windows wheel cannot be installed on Python 3.8 ARROW-7758 - [Python] Wrong conversion of timestamps that are out of bounds for pandas (eg 0000-01-01) ARROW-7760 - [Release] Fix verify-release-candidate.sh since pip3 seems to no longer be in miniconda ARROW-7762 - [Python] Exceptions in ParquetWriter get ignored ARROW-7766 - [Python][Packaging] Windows py38 wheels are built with wrong ABI tag ARROW-7772 - [R][C++][Dataset] Unable to filter on date32 object with date64 scalar ARROW-7775 - [Rust] Don’t let safe code arbitrarily transmute readers and writers ARROW-7777 - [Go] StructBuilder/ListBuilder index out of range panic ARROW-7780 - [Release] Fix Windows wheel RC verification script given lack of “m” ABI tag in Python 3.8 ARROW-7781 - [C++][Dataset] Filtering on a non-existent column gives a segfault ARROW-7783 - [C++] ARROW_DATASET should enable ARROW_COMPUTE ARROW-7785 - [C++] sparse_tensor.cc is extremely slow to compile ARROW-7786 - [R] Wire up check_metadata in Table.Equals method ARROW-7789 - [R] Can’t initialize arrow objects when R.oo package is loaded ARROW-7791 - [C++][Parquet] Fix building error “cannot bind lvalue” ARROW-7792 - [R] read_* functions should close connection to file ARROW-7793 - [Java] If there is a leak the base allocator should release the excess memory to parent before throwing exception ARROW-7794 - [Rust] cargo publish fails for arrow-flight due to relative path to Flight.proto ARROW-7797 - [Release][Rust] Fix arrow-flight’s version in datafusion crate ARROW-7802 - [C++] Support for LargeBinary and LargeString in the hash kernel ARROW-7806 - [Python] Implement to_pandas for lists of LargeBinary/String ARROW-7807 - [R] Installation on RHEL 7 Cannot call io___MemoryMappedFile__Open() ARROW-7809 - [R] vignette does not run on Win 10 nor ubuntu ARROW-7813 - [Rust] Fix undefined behaviour and and remove unsafe ARROW-7815 - [C++] Fix crashes on corrupt IPC input (OSS-Fuzz) ARROW-7827 - [Python] conda-forge pyarrow package does not have s3 enabled ARROW-7832 - [R] Patches to 0.16.0 release ARROW-7836 - [Rust] “allocate_aligned”/”reallocate” need to initialize memory to avoid UB ARROW-7837 - [Java] bug in BaseVariableWidthVector.copyFromSafe results with an index out of bounds exception ARROW-7838 - [C++] Installed plasma-store-server fails finding Boost ARROW-7841 - [C++] HADOOP_HOME doesn’t work to find libhdfs.so ARROW-7844 - [R] array_to_vector is not thread safe ARROW-7848 - Add doc for MapType ARROW-7852 - [Python] 0.16.0 wheels not compatible with older numpy ARROW-7857 - [Python] Failing test with pandas master for extension type conversion ARROW-7861 - [C++][Parquet] Add fuzz regression corpus for parquet reader ARROW-7884 - [C++][Python] Crash in pq.read_table() ARROW-7887 - [Rust] Filter kernel does not support temporal types ARROW-7889 - [Rust] Datafusion CLI does not support registering Parquet files ARROW-7899 - [Integration][Java] null type integration test ARROW-7908 - [R] Can’t install package without setting LIBARROW_DOWNLOAD=true ARROW-7922 - [CI][Crossbow] Nightly macOS wheel builds fail (brew bundle edition) ARROW-7923 - [CI][Crossbow] macOS autobrew fails on homebrew-versions ARROW-7926 - [Developer] “archery lint” target is not ergonomic for running a single check like IWYU ARROW-7928 - [Python] Example of flight server and client not working ARROW-7931 - [C++] Fix crash on corrupt Map array input (OSS-Fuzz) ARROW-7936 - [Python] FileSystem.from_uri test fails on python 3.5 ARROW-7940 - [C++] Unable to generate cmake build with settings other than default ARROW-7958 - [Java] Update Avro to version 1.9.2 ARROW-7962 - [R][Dataset] Followup to “Consolidate Source and Dataset classes” ARROW-7968 - [C++] orc_ep build fails on 64-bit Raspbian ARROW-7973 - [Developer][C++] ResourceWarnings in run_cpplint.py ARROW-7974 - [Developer][C++] ResourceWarning in “make check-format” ARROW-7975 - [C++] Do not include padding bytes in “Buffer” IPC metadata accounting ARROW-7978 - [Developer] GitHub Actions “lint” task is running include-what-you-use and failing ARROW-7980 - [Python] Deserialization with pyarrow fails for certain Timestamp-based data frame ARROW-7981 - [C++][Dataset] Fails to compile on gcc 5.4 ARROW-7985 - [C++] ListBuilder.Finish fails if underlying value builder is empty and .Reserve’d ARROW-7990 - [C++][Developer] Add “archery lint” option for running “iwyu.sh all” ARROW-7992 - [C++] MSVC warning causing Appveyor failure in sort_to_indices.cc ARROW-7996 - [Python] Error serializing empty pandas DataFrame with pyarrow ARROW-7997 - [Python] Schema equals method with inconsistent docs in pyarrow ARROW-7999 - [C++] Fix crash on corrupt Map array input (OSS-Fuzz) ARROW-8000 - [C++] gcc 4.8 build failures ARROW-8003 - [C++] -DBZip2_SOURCE=BUNDLED fails when building with clang ARROW-8006 - [C++] Unsafe arrow dictionary recovered from parquet ARROW-8007 - [Python] Remove unused and defunct assert_get_object_equal in plasma tests ARROW-8008 - [C++/Python] Framework Python is preferred even though not the activated one ARROW-8009 - [Java] Fix the hash code methods for BitVector ARROW-8011 - [C++] Some buffers not resized when reading from Parquet ARROW-8013 - [Python][Packaging] Fix manylinux wheels ARROW-8021 - [Python] Appveyor does not appear to be including pandas in test runs ARROW-8036 - [C++] Compilation failure with gtest 1.10.0 ARROW-8042 - [Python] pyarrow.ChunkedArray docstring is incorrect regarding zero-length ChunkedArray having no chunks ARROW-8057 - [Python] Don’t check Schema metadata in __eq__ and __ne__ ARROW-8070 - [C++] Cast segfaults on unsupported cast from list to utf8 ARROW-8071 - [GLib] Build error with configure ARROW-8088 - [C++][Dataset] Partition columns with specified dictionary type result in all nulls ARROW-8091 - [CI][Crossbow] Fix nightly homebrew and R failures ARROW-8092 - [CI][Crossbow] OSX wheels fail on bundled bzip2 ARROW-8094 - [CI][Crossbow] Nightly valgrind test fails ARROW-8095 - [CI][Crossbow] Nightly turbodbc job fails ARROW-8098 - [go] Checkptr Failures on Go 1.14 ARROW-8101 - [FlightRPC][Java] Can’t read/write only an empty null array ARROW-8102 - [Dev] Crossbow’s version detection doesn’t work in the comment bot’s scenario ARROW-8105 - [Python] pyarrow.array segfaults when passed masked array with shrunken mask ARROW-8106 - [Python] Builds on master broken by pandas 1.0.2 release ARROW-8110 - [C#] BuildArrays fails if NestedType is included ARROW-8112 - [FlightRPC][C++] Some status codes don’t round-trip through gRPC ARROW-8119 - [Dev] Make Yaml optional dependency for archery ARROW-8122 - [Python] Empty numpy arrays with shape cannot be deserialized ARROW-8125 - [C++] “arrow-tests” target broken with ninja build ARROW-8127 - [C++] [Parquet] Incorrect column chunk metadata for multipage batch writes ARROW-8128 - [C#] NestedType children serialized on wrong length ARROW-8132 - [C++] arrow-s3fs-test failing on master ARROW-8133 - [CI] Github Actions sometimes fail to checkout Arrow ARROW-8136 - [C++][Python] Creating dataset from relative path no longer working ARROW-8139 - [C++] FileSystem enum causes attributes warning ARROW-8142 - [C++] Casting a chunked array with 0 chunks critical failure ARROW-8144 - [CI] Cmake 3.2 nightly build fails ARROW-8159 - [Python] pyarrow.Schema.from_pandas doesn’t support ExtensionDtype ARROW-8166 - [C++] AVX512 intrinsics fail to compile with clang-8 on Ubuntu 18.04 ARROW-8176 - [FlightRPC][Integration] Have Flight services bind to port 0 in integration ARROW-8186 - [Python] Dataset expression != returns bool instead of expression for invalid value ARROW-8188 - [R] Adapt to latest checks in R-devel ARROW-8193 - [C++] arrow-future-test fails to compile on gcc 4.8 ARROW-8197 - [Rust] DataFusion “create_physical_plan” returns incorrect schema? ARROW-8206 - [R] Minor fix for backwards compatibility on Linux installation ARROW-8209 - [Python] Accessing duplicate column of Table by name gives wrong error ARROW-8213 - [Python][Dataset] Opening a dataset with a local incorrect path gives confusing error message ARROW-8216 - [R][C++][Dataset] Filtering returns all-missing rows where the filtering column is missing ARROW-8217 - [R][C++] Fix crashing test in test-dataset.R on 32-bit Windows from ARROW-7979 ARROW-8219 - [Rust] sqlparser crate needs to be bumped to version 0.2.5 ARROW-8233 - [CI] Build timeouts on “AMD64 Windows MinGW 64 GLib &amp; Ruby “ ARROW-8237 - [Python] Review Developer build instructions for conda and non-conda users ARROW-8238 - [C++][Compute] Failed to build compute tests on windows with msvc2015 ARROW-8239 - [Java] fix param checks in splitAndTransfer method ARROW-8245 - [Python][Parquet] Skip hidden directories when reading partitioned parquet files ARROW-8255 - [Rust] [DataFusion] COUNT(*) results in confusing error ARROW-8259 - [Rust] [DataFusion] ProjectionPushDownRule does not rewrite LIMIT ARROW-8268 - [Ruby] Test failure due to lack of built ZSTD support ARROW-8269 - [Python] Failure in “nopandas” build in test_parquet_row_group_fragments ARROW-8270 - [Python][Flight] Example Flight server with TLS’s certificate and key is not working ARROW-8272 - [CI][Python] Test failure on Ubuntu 16.04 ARROW-8274 - [C++] Use LZ4 frame format for “LZ4” compression in IPC write ARROW-8276 - [C++][Dataset] Scanning a Fragment does not take into account the partition columns ARROW-8280 - [C++] MinGW builds failing due to CARES-related toolchain issue ARROW-8286 - [Python] Creating dataset from pathlib results in UnionDataset instead of FileSystemDataset ARROW-8298 - [C++][CI] MinGW builds fail building grpc ARROW-8303 - [Python] Fix test failure caused by non-deterministic dict key ordering on Python 3.5 ARROW-8304 - [Flight][Python] Flight client with TLS root certificate is reporting error on do_get() ARROW-8305 - [Java] ExtensionTypeVector should make sure underlyingVector not null ARROW-8310 - [C++] Minio’s exceptions not recognized by IsConnectError() ARROW-8315 - [Python][Dataset] Don’t rely on ordered dict keys in test_dataset.py ARROW-8323 - [C++] Pin gRPC at v1.27 to avoid compilation error in its headers ARROW-8326 - [C++] Don’t use deprecated TYPED_TEST_CASE ARROW-8327 - [FlightRPC][Java] gRPC trailers may be null ARROW-8331 - [C++] arrow-compute-filter-benchmark fails to compile ARROW-8333 - [C++][CI] Always compile benchmarks in some C++ CI entry ARROW-8334 - [C++] [Gandiva] Missing DATE32 in LLVM Types / Simple D32 Compute Functions ARROW-8342 - [Python] dask and kartothek integration tests are failing ARROW-8345 - [Python] feather.read_table should not require pandas ARROW-8346 - [CI][Ruby] GLib/Ruby macOS build fails on zlib ARROW-8349 - [CI][NIGHTLY:gandiva-jar-osx] Use latest pygit2 ARROW-8353 - [C++] is_nullable maybe not initialized in parquet writer ARROW-8354 - [R] Fix segfault in Table to Array conversion ARROW-8357 - [Rust] [DataFusion] Dockerfile for CLI is missing format dir ARROW-8358 - [C++] Fix -Wrange-loop-construct warnings in clang-11 ARROW-8360 - [C++][Gandiva] Fixes date32 support for date/time functions ARROW-8365 - [C++] Error when writing files to S3 larger than 5 GB ARROW-8366 - [Rust] Need to revert recent arrow-flight build change ARROW-8369 - [CI] Fix crossbow wildcard groups ARROW-8373 - [GLib] Problems resolving gobject-introspection, arrow in Meson builds ARROW-8380 - [RUST] StringDictionaryBuilder not publicly exported from arrow::array ARROW-8384 - [C++][Python] arrow/filesystem/hdfs.h and Python wrapper does not have an option for setting a path to a Kerberos ticket ARROW-8386 - [Python] pyarrow.jvm raises error for empty Arrays ARROW-8388 - [C++] GCC 4.8 fails to move on return ARROW-8397 - [C++] Fail to compile aggregate_test.cc on Ubuntu 16.04 ARROW-8406 - [Python] test_fs fails when run from a different drive on Windows ARROW-8410 - [C++] CMake fails on aarch64 systems that do not support -march=armv8-a+crc+crypto ARROW-8414 - [Python] Non-deterministic row order failure in test_parquet.py ARROW-8415 - [C++][Packaging] fix gandiva linux job ARROW-8416 - [Python] Provide a “feather” alias in the dataset API ARROW-8420 - [C++] CMake fails to configure on armv7l platform (e.g. Raspberry Pi 3) ARROW-8427 - [C++][Dataset] Do not ignore file paths with underscore/dot when full path was specified ARROW-8428 - [C++][NIGHTLY:gandiva-jar-trusty] GCC 4.8 failures in C++ unit tests ARROW-8429 - [C++] Fix Buffer::CopySlice on 0-sized buffer ARROW-8432 - [Python][CI] Failure to download Hadoop ARROW-8437 - [C++] Remove std::move return value from MakeRandomNullBitmap test utility ARROW-8438 - [C++] arrow-io-memory-benchmark crashes ARROW-8439 - [Python] Filesystem docs are outdated ARROW-8441 - [C++] Fix crashes on invalid input (OSS-Fuzz) ARROW-8442 - [Python] NullType.to_pandas_dtype inconsisent with dtype returned in to_pandas/to_numpy ARROW-8460 - [Packaging][deb] Ubuntu Focal build is failed ARROW-8465 - [Packaging][Python] Windows py35 wheel build fails because of boost ARROW-8466 - [Packaging] The python unittests are not running in the windows wheel builds ARROW-8468 - [Document] Fix the incorrect null bits description ARROW-8469 - [Dev] Fix nightly docker tests on azure ARROW-8478 - [Java] Rollback contrib package changes. ARROW-8498 - [Python] Schema.from_pandas fails on extension type, while Table.from_pandas works ARROW-8511 - [Developer][Release] Windows release verification script does not halt if C++ compilation fails PARQUET-1780 - [C++] Set ColumnMetadata.encoding_stats field PARQUET-1788 - [C++] ColumnWriter has undefined behavior when writing arrow chunks PARQUET-1797 - [C++] Fix fuzzing errors PARQUET-1799 - [C++] Stream API: Relax schema checking when reading PARQUET-1810 - [C++] Fix undefined behaviour on invalid enum values (OSS-Fuzz) PARQUET-1813 - [C++] Remove logging statement in unit test PARQUET-1819 - [C++] Fix crashes on corrupt IPC input (OSS-Fuzz) PARQUET-1823 - [C++] Invalid RowGroup returned when reading with parquet::arrow::FileReader-&gt;RowGroup(i)-&gt;Column(j) PARQUET-1829 - [C++] Fix crashes on invalid input (OSS-Fuzz) PARQUET-1831 - [C++] Fix crashes on invalid input (OSS-Fuzz) PARQUET-1835 - [C++] Fix crashes on invalid input (OSS-Fuzz)" />
<link rel="canonical" href="https://arrow.apache.org/release/0.17.0.html" />
<meta property="og:url" content="https://arrow.apache.org/release/0.17.0.html" />
<meta property="og:site_name" content="Apache Arrow" />
<meta property="og:image" content="https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png" />
<meta property="og:type" content="article" />
<meta property="article:published_time" content="2024-04-06T11:31:22-04:00" />
<meta name="twitter:card" content="summary_large_image" />
<meta property="twitter:image" content="https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png" />
<meta property="twitter:title" content="Apache Arrow 0.17.0 Release" />
<meta name="twitter:site" content="@ApacheArrow" />
<script type="application/ld+json">
{"@context":"https://schema.org","@type":"BlogPosting","dateModified":"2024-04-06T11:31:22-04:00","datePublished":"2024-04-06T11:31:22-04:00","description":"Apache Arrow 0.17.0 (20 April 2020) This is a major release covering more than 2 months of development. Download Source Artifacts Binary Artifacts For CentOS For Debian For Python For Ubuntu Git tag Contributors This release includes 569 commits from 79 distinct contributors. $ git shortlog -sn apache-arrow-0.16.0..apache-arrow-0.17.0 80 Antoine Pitrou 78 Krisztián Szűcs 58 Wes McKinney 55 Neal Richardson 39 Sutou Kouhei 30 Benjamin Kietzman 26 Joris Van den Bossche 20 Andy Grove 13 liyafan82 11 David Li 11 François Saint-Jacques 8 Micah Kornfield 8 Projjal Chanda 7 Max Burke 7 Prudhvi Porandla 6 Kenta Murata 6 Uwe L. Korn 6 Yibo Cai 6 tianchen 5 Frank Du 5 Martin Radev 4 Rok 3 Paddy Horan 3 Takashi Hashida 3 Yosuke Shiro 3 Zhuo Peng 2 Bryan Cutler 2 David Kegley 2 Jacek Pliszka 2 Ji Kunshang 2 Jorge C. Leitao 2 Jörn Horstmann 2 Kazuaki Ishizaki 2 Markus Westerlind 2 Morgan Cassels 2 Ryan Murray 2 Sam Albers 2 Siyuan 2 Stephanie Wang 2 Yuqi Gu 2 gawain.bolton 1 Adam Hooper 1 Andreas Zimmerer 1 Bryant Biggs 1 Chao Sun 1 Dan Nugent 1 Dominik Durner 1 Eyal Farago-Hagag 1 Grant Monroe 1 Hatem Helal 1 Ismaël Mejía 1 John Muehlhausen 1 Ken Suenobu 1 Kevin Conaway 1 Li, Jiajia 1 Mahmut Bulut 1 Marius van Niekerk 1 Michael Chirico 1 NAITOH Jun 1 Neville Dipale 1 Omega Gamage 1 Onur Satici 1 Palmer Lao 1 Paul Balanca 1 Prudhvi 1 Ravindra Wagh 1 Richard Artoul 1 Sebastien Binet 1 TP Boudreau 1 Tarek Allam 1 Tobias Mayer 1 Yuan Zhou 1 Zherui Cao 1 andy-thomason 1 caleboverman 1 mubai 1 psuman 1 ravindra-wagh 1 wenjun.swj Patch Committers The following Apache committers merged contributed patches to the repository. $ git shortlog -csn apache-arrow-0.16.0..apache-arrow-0.17.0 128 Wes McKinney 109 Krisztián Szűcs 86 Antoine Pitrou 61 Neal Richardson 60 Sutou Kouhei 33 Benjamin Kietzman 23 Andy Grove 14 François Saint-Jacques 10 Micah Kornfield 7 Paddy Horan 7 Praveen 5 David Li 5 Sebastien Binet 3 GitHub 3 Joris Van den Bossche 3 Neville Dipale 3 Pindikura Ravindra 3 Yosuke Shiro 2 Bryan Cutler 2 Philipp Moritz 1 Brian Hulette 1 Chao Sun Changelog New Features and Improvements ARROW-1560 - [C++] Kernel implementations for “match” function ARROW-1571 - [C++] Implement argsort kernels (sort indices) for integers using O(n) counting sort ARROW-1581 - [Packaging] Tooling to make nightly wheels available for install ARROW-1582 - [Python] Set up + document nightly conda builds for macOS ARROW-1636 - [Format] Integration tests for null type ARROW-2447 - [C++] Create a device abstraction ARROW-3054 - [Packaging] Tooling to enable nightly conda packages to be updated to some anaconda.org channel ARROW-3410 - [C++][Dataset] Streaming CSV reader interface for memory-constrainted environments ARROW-3750 - [R] Pass various wrapped Arrow objects created in Python into R with zero copy via reticulate ARROW-4120 - [Python] Define process for testing procedures that check for no macro-level memory leaks ARROW-4226 - [Format][C++] Add CSF sparse tensor support ARROW-4304 - [Rust] Enhance documentation for arrow ARROW-4428 - [R] Feature flags for R build ARROW-4482 - [Website] Add blog archive page ARROW-4815 - [Rust] [DataFusion] Add support for * in SQL projection ARROW-5357 - [Rust] Add capacity field in Buffer ARROW-5405 - [Documentation] Move integration testing documentation to Sphinx docs, add instructions for JavaScript ARROW-5497 - [Release] Build and publish R/Java/JS docs ARROW-5501 - [R] Reorganize read/write file/stream functions ARROW-5510 - [Format] Feather V2 based on Arrow IPC file format, with compression support ARROW-5563 - [Format] Update integration test JSON format documentation ARROW-5585 - [Go] rename arrow.TypeEquals into arrow.TypeEqual ARROW-5742 - [CI] Add daily / weekly Valgrind build ARROW-5757 - [Python] Stop supporting Python 2.7 ARROW-590 - [Integration] Add integration tests for Union types ARROW-5949 - [Rust] Implement DictionaryArray ARROW-6165 - [Integration] Use multiprocessing to run integration tests on multiple CPU cores ARROW-6176 - [Python] Allow to subclass ExtensionArray to attach to custom extension type ARROW-6393 - [C++] Add EqualOptions support in SparseTensor::Equals ARROW-6479 - [C++] inline errors from external projects’ build logs ARROW-6510 - [Python][Filesystem] Expose nanosecond resolution mtime ARROW-6666 - [Rust] [DataFusion] Implement string literal expression ARROW-6724 - [C++] Add simpler static ctor for BufferOutputStream than the current Create function ARROW-6821 - [C++][Parquet] Do not require Thrift compiler when building (but still require library) ARROW-6823 - [C++][Python][R] Support metadata in the feather format? ARROW-6837 - [C++/Python] access File Footer custom_metadata ARROW-6841 - [C++] Upgrade to LLVM 8 ARROW-6875 - [FlightRPC] Implement Criteria for ListFlights RPC / list_flights method ARROW-6915 - [Developer] Do not overwrite minor release version with merge script, even if not specified by committer ARROW-6947 - [Rust] [DataFusion] Add support for scalar UDFs ARROW-6996 - [Python] Expose boolean filter kernel on Table ARROW-7044 - [Release] Create a post release script for the home-brew formulas ARROW-7048 - [Java] Support for combining multiple vectors under VectorSchemaRoot ARROW-7063 - [C++] Schema print method prints too much metadata ARROW-7073 - [Java] Support concating vectors values in batch ARROW-7080 - [Python][Parquet][C++] Expose parquet field_id in Schema objects ARROW-7091 - [C++] Move all factories to type_fwd.h ARROW-7119 - [C++][CI] Use scripts/util_coredump.sh to show automatic backtraces ARROW-7201 - [GLib][Gandiva] Add support for BooleanNode ARROW-7202 - [R][CI] Improve rwinlib building on CI to stop re-downloading dependencies ARROW-7222 - [Python][Release] Wipe any existing generated Python API documentation when updating website ARROW-7233 - [C++] Add Result APIs to IPC module ARROW-7256 - [C++] Remove ARROW_MEMORY_POOL_DEFAULT macro ARROW-7330 - [C++] Add Result to APIs to arrow/gpu ARROW-7332 - [C++][Parquet] Explicitly catch status exceptions in PARQUET_CATCH_NOT_OK ARROW-7336 - [C++] Implement MinMax options to not skip nulls ARROW-7338 - [C++] Improve InMemoryDataSource to support generator instead of static list ARROW-7365 - [Python] Support FixedSizeList type in conversion to numpy/pandas ARROW-7400 - [Java] Avoids the worst case for quick sort ARROW-7412 - [C++][Dataset] Ensure that dataset code is robust to schemas with duplicate field names ARROW-7419 - [Python] Support SparseCSCMatrix ARROW-7427 - [Python] Support SparseCSFTensor ARROW-7428 - [Format][C++] Add serialization for CSF sparse tensors ARROW-7444 - [GLib] Add LocalFileSystem support ARROW-7462 - [C++] Add CpuInfo detection for Arm64 Architecture ARROW-7491 - [Java] Improve the performance of aligning ARROW-7515 - [C++] Rename nonexistent and non_existent to not_found ARROW-7524 - [C++][CI] Build parquet support in the VS2019 GitHub Actions job ARROW-7530 - [Developer] Do not include list of commits from PR in squashed summary message ARROW-7534 - [Java] Create a new java/contrib module ARROW-7547 - [C++] [Python] [Dataset] Additional reader options in ParquetFileFormat ARROW-7587 - [C++][Compute] Add Top-k kernel ARROW-7608 - [C++][Dataset] Expose more informational properties ARROW-7615 - [CI][Gandiva] Ensure that the gandiva_jni library has only a whitelisted set of shared dependencies as part of Travis CI job ARROW-7616 - [Java] Support comparing value ranges for dense union vector ARROW-7625 - [GLib] Parquet GLib and Red Parquet (Ruby) do not allow specifying compression type ARROW-7641 - [R] Make dataset vignette have executable code ARROW-7662 - [R] Support creating ListArray from R list ARROW-7664 - [C++] Extract localfs default from FileSystemFromUri ARROW-7675 - [R][CI] Move Windows CI from Appveyor to GHA ARROW-7679 - [R] Cleaner interface for creating UnionDataset ARROW-7684 - [Rust] Provide example of Flight server for DataFusion ARROW-7685 - [Developer] Add support for GitHub Actions to Crossbow ARROW-7691 - [C++] Verify missing fields when walking Flatbuffers data ARROW-7708 - [Release] Include PARQUET commits from git changelog in release changelogs ARROW-7712 - [CI][Crossbow] Fix or delete fuzzit jobs ARROW-7720 - [C++][Python] Add check_metadata argument to Table.equals ARROW-7725 - [C++] Add infrastructure for unity builds and precompiled headers ARROW-7726 - [CI] [C++] Use boost binaries on Windows GHA build ARROW-7729 - [Python][CI] Pin pandas version to 0.25 in the dask integration test ARROW-7733 - [Developer] Install locally a new enough version of Go for release verification script ARROW-7735 - [Release] conda-forge channel is missing for verifying wheels ARROW-7736 - [Release] Binary verification sometimes fails with transient error ARROW-7739 - [GLib] Use placement new to initialize shared_ptr object in private structs ARROW-7741 - [C++][Parquet] Incorporate new level generation logic in parquet write path with a flag to revert back to old logic ARROW-7742 - [GLib] Add support for MapArray ARROW-7745 - [Doc] [C++] Update Parquet documentation ARROW-7749 - [C++] Link some more tests together ARROW-7750 - [Release] Make the source release verification script restartable ARROW-7751 - [Release] macOS wheel verification also needs arrow-testing ARROW-7752 - [Release] Enable and test dataset in the verification script ARROW-7754 - [C++] Result is slow ARROW-7761 - [C++] Add S3 support to fs::FileSystemFromUri ARROW-7764 - [C++] Builders allocate a null bitmap buffer even if there is no nulls ARROW-7771 - [Developer] Use ARROW_TMPDIR environment variable in the verification scripts instead of TMPDIR ARROW-7774 - [Packaging][Python] Update macos and windows wheel filenames ARROW-7787 - [Rust] Add collect to Table API ARROW-7788 - [C++] Add schema conversion support for map type ARROW-7790 - [Website] Update how to install Linux packages ARROW-7795 - [Rust - DataFusion] Support boolean negation (NOT) ARROW-7796 - [R] write_* functions should invisibly return their inputs ARROW-7799 - [R][CI] Remove flatbuffers from homebrew formulae ARROW-7804 - [C++][R] Compile error on macOS 10.11 ARROW-7812 - [Packaging][Python] Upgrade LLVM in manylinux1 docker image ARROW-7817 - [CI] macOS R autobrew nightly failed on installing dependency from source ARROW-7819 - [C++][Gandiva] Add DumpIR to Filter/Projector classes ARROW-7824 - [C++][Dataset] Provide Dataset writing to IPC format ARROW-7828 - [Release] Remove SSH keys for internal use ARROW-7829 - [R] Test R bindings on clang ARROW-7833 - [R] Make install_arrow() actually install arrow ARROW-7834 - [Release] Post release task for updating the documentations ARROW-7839 - [Python][Dataset] Add IPC format to python bindings ARROW-7846 - [Python][Dev] Remove last dependencies on six ARROW-7849 - [Packaging][Python] Remove the remaining py27 crossbow wheel tasks from the nightlies ARROW-7858 - [C++][Python] Support casting an Extension type to its storage type ARROW-7859 - [R] Minor patches for CRAN submission 0.16.0.2 ARROW-7862 - [R] Linux installation should run quieter by default ARROW-7863 - [C++][Python][CI] Ensure running HDFS related tests ARROW-7864 - [R] Make sure bundled installation works even if there are system packages ARROW-7865 - [R] Test builds on latest Linux versions ARROW-7868 - [Crossbow] Reduce GitHub API query parallelism ARROW-7869 - [Python] Boost::system and boost::filesystem not necessary anymore in Python wheels ARROW-7872 - [Python] Support conversion of list-of-struct in Array/Table.to_pandas ARROW-7874 - [Python][Archery] Validate docstrings with numpydoc ARROW-7876 - [R] Installation fails in the documentation generation image ARROW-7877 - [Packaging] Fix crossbow deployment to github artifacts ARROW-7879 - [C++][Doc] Add doc for the Device API ARROW-7880 - [CI][R] R sanitizer job is not really working ARROW-7881 - [C++] Fix pedantic warnings ARROW-7882 - [C++][Gandiva] Optimise like function for substring pattern ARROW-7886 - [C++][Dataset] Consolidate Source and Dataset ARROW-7888 - [Python] Allow using a more modern version of jpype in pyarrow.jvm ARROW-7890 - [C++] Add Promise / Future implementation ARROW-7891 - [C++] RecordBatch-&gt;Equals should also have a check_metadata argument ARROW-7892 - [Python] Expose FilesystemSource.format attribute ARROW-7895 - [Python] Remove more python 2.7 cruft ARROW-7896 - [C++] Refactor from #include guards to #pragma once ARROW-7897 - [Packaging] Temporarily disable artifact uploading until we fix the deployment issues ARROW-7898 - [Python] Reduce the number docstring violations using numpydoc ARROW-7904 - [C++] Decide about Field/Schema metadata printing parameters and how much to show by default ARROW-7907 - [Python] Conversion to pandas of empty table with timestamp type aborts ARROW-7912 - [Format] C data interface ARROW-7913 - [C++][Python][R] C++ implementation of C data interface ARROW-7915 - [CI] [Python] Run tests with Python development mode enabled ARROW-7916 - [C++][Dataset] Project IPC record batches to materialized fields ARROW-7917 - [CMake] FindPythonInterp should check for python3 ARROW-7919 - [R] install_arrow() should conda install if appropriate ARROW-7920 - [R] Fill in some missing input validation ARROW-7921 - [Go] Add Reset method to various components and clean up comments ARROW-7927 - [C++] Fix ‘cpu_info.cc’ compilation warning ARROW-7929 - [C++] CMake target names differ from upstream provided names ARROW-7930 - [Python][CI] Test jpype integration in CI ARROW-7932 - [Rust] [Parquet] Implement array reader for temporal types ARROW-7934 - [C++] Fix UriEscape for empty string ARROW-7935 - [Java] Remove Netty dependency for BufferAllocator and ReferenceManager ARROW-7937 - [Python][Packaging] Remove boost from the macos wheels ARROW-7941 - [Rust] [DataFusion] Logical plan should support unresolved column references ARROW-7943 - [C++][Parquet] Add a new level builder capable of handling nested data ARROW-7947 - [Rust] [Flight] [DataFusion] Implement example for get_schema ARROW-7949 - [Developer] Update to ‘.gitignore’ to not track user specific ‘cpp/Brewfile.lock.json’ file ARROW-7951 - [Python][Parquet] Expose BYTE_STREAM_SPLIT to pyarrow ARROW-7959 - [Ruby] Add support for Ruby 2.3 again ARROW-7963 - [C++][Python][Dataset] Expose listing fragments ARROW-7965 - [Python] Refine higher level dataset API ARROW-7966 - [Integration][Flight][C++] Client should verify each batch independently ARROW-7969 - [Packaging] Use cURL to upload artifacts ARROW-7970 - [Packaging][Python] Use system boost to build the macos wheels ARROW-7971 - [Rust] Create rowcount utility ARROW-7977 - [C++] Rename fs::FileStats to fs::FileInfo ARROW-7979 - [C++] Implement experimental buffer compression in IPC messages ARROW-7982 - [C++] Let ArrayDataVisitor accept void-returning functions ARROW-7983 - [CI][R] Nightly builds should be more verbose when they fail ARROW-7984 - [R] Check for valid inputs in more places ARROW-7986 - [Python] pa.Array.from_pandas cannot convert pandas.Series containing pyspark.ml.linalg.SparseVector ARROW-7987 - [CI][R] Fix for verbose nightly builds ARROW-7988 - [R] Fix on.exit calls in reticulate bindings ARROW-7991 - [C++][Plasma] Allow option for evicting if full when creating an object ARROW-7993 - [Java] Support decimal type in ComplexCopier ARROW-7994 - [CI][C++] Move AppVeyor MinGW builds to GitHub Actions ARROW-7995 - [C++] IO: coalescing and caching read ranges ARROW-7998 - [C++][Plasma] Make Seal requests synchronous ARROW-8005 - [Website] Review and adjust any usages of Apache dist system from website / tools ARROW-8014 - [C++] Provide CMake targets to test only within a given label ARROW-8016 - [Developer] Fix deprecation warning in PR merge tool ARROW-8018 - [C++][Parquet]Parquet Modular Encryption ARROW-8024 - [R] Bindings for BinaryType and FixedBinaryType ARROW-8026 - [Python] Support memoryview in addition to string value types for constructing string and binary type arrays ARROW-8027 - [Developer][Integration] Add integration tests for duplicate field names ARROW-8028 - [Go] Allow duplicate field names in schemas and nested types ARROW-8030 - [C++][Plasma] Fix inconsistent comment style ARROW-8039 - [Python][Dataset] Support using dataset API in pyarrow.parquet with a minimal ParquetDataset shim ARROW-8044 - [CI][NIGHTLY:gandiva-jar-osx] pygit2 needs libgit2 v1.0.x ARROW-8055 - [GLib][Ruby] Add some metadata bindings to GArrowSchema ARROW-8058 - [C++][Python][Dataset] Provide an option to toggle validation and schema inference in FileSystemDatasetFactoryOptions ARROW-8059 - [Python] Make FileSystem objects serializable ARROW-8060 - [Python] Make dataset Expression objects serializable ARROW-8061 - [C++][Dataset] Ability to specify granularity of ParquetFileFragment (support row groups) ARROW-8063 - [Python] Add user guide documentation for Datasets API ARROW-8064 - [Dev] Implement Comment bot via Github actions ARROW-8072 - [C++][Plasma] Add const constraint when parsing data ARROW-8077 - [Python] Add wheel build script and Crossbow configuration for Windows on Python 3.5 ARROW-8079 - [Python] Implement a wrapper for KeyValueMetadata, duck-typing dict where relevant ARROW-8080 - [C++] Add AVX512 build option ARROW-8082 - [Java][Plasma] Add JNI list() interface ARROW-8083 - [GLib] Add support for Peek() to GIOInputStream ARROW-8086 - [Java] Support writing decimal from big endian byte array in UnionListWriter ARROW-8087 - [C++][Dataset] Order of keys with HivePartitioning is lost in resulting schema ARROW-8096 - [C++][Gandiva] Create null node of Interval type ARROW-8097 - [Dev] Comment bot’s crossbow command acts on the master branch ARROW-8103 - [R] Make default Linux build more minimal ARROW-8104 - [C++] Don’t install bundled Thrift ARROW-8107 - [Packaging][APT] Use HTTPS for LLVM APT repository for Debian GNU/Linux stretch ARROW-8109 - [Packaging][APT] Drop support for Ubuntu Disco ARROW-8117 - [Rust] [Datafusion] Allow CAST from number to timestamp ARROW-8118 - [R] dim method for FileSystemDataset ARROW-8120 - [Packaging][APT] Add support for Ubuntu Focal ARROW-8123 - [Rust] [DataFusion] Create LogicalPlanBuilder ARROW-8124 - [Rust] Update library dependencies ARROW-8126 - [C++][Compute] Add Top-K kernel benchmark ARROW-8129 - [C++][Compute] Refine compare sorting kernel ARROW-8130 - [C++][Gandiva] Fix Dex visitor in llvm_generator to handle interval type ARROW-8140 - [Developer] Follow NullType -&gt; NullField change ARROW-8141 - [C++] Optimize BM_PlainDecodingBoolean performance using AVX512 Intrinsics API ARROW-8145 - [C++] Rename GetTargetInfos ARROW-8146 - [C++] Add per-filesystem facility to sanitize a path ARROW-8150 - [Rust] Allow writing custom FileMetaData k/v pairs ARROW-8151 - [Benchmarking][Dataset] Benchmark Parquet read performance with S3File ARROW-8153 - [Packaging] Update the conda feedstock files and upload artifacts to Anaconda ARROW-8158 - [Java] Getting length of data buffer and base variable width vector ARROW-8164 - [C++][Dataset] Let datasets be viewable with non-identical schema ARROW-8165 - [Packaging] Make nightly wheels available on a PyPI server ARROW-8167 - [CI] Add support for skipping builds with skip pattern in pull request title ARROW-8168 - [Java][Plasma] Improve Java Plasma client off-heap memory usage ARROW-8177 - [Rust] Make schema_to_fb_offset public ARROW-8178 - [C++] Upgrade to Flatbuffers 1.12 ARROW-8179 - [R] Windows build script tweaking for nightly packaging on GHA ARROW-8181 - [Java][FlightRPC] Expose transport error metadata ARROW-8182 - [Packaging] Increment the version number detected from the latest git tag ARROW-8183 - [c++][FlightRPC] Expose transport error metadata ARROW-8184 - [Packaging] Use arrow-nightlies organization name on Anaconda and Gemfury to host the nightlies ARROW-8185 - [Packaging] Document the available nightly wheels and conda packages ARROW-8187 - [R] Make test assertions robust to i18n ARROW-8191 - [Packaging][APT] Fix cmake removal in Debian GNU/Linux Stretch ARROW-8192 - [C++] script for unpack avx512 intrinsics code ARROW-8194 - [CI] Github Actions Windows job should run tests in parallel ARROW-8195 - [CI] Remove Boost download step in Github Actions ARROW-8198 - [C++] Diffing should handle null arrays ARROW-8200 - [GLib] Rename garrow_file_system_target_info{,s}() to …_file_info{,s}() ARROW-8203 - [C#] “dotnet pack” is failed ARROW-8204 - [Rust] [DataFusion] Add support for aliased expressions in SQL ARROW-8207 - [Packaging][wheel] Use LLVM 8 in manylinux2010 and manylinux2014 ARROW-8215 - [CI][GLib] Meson install fails in the macOS build ARROW-8218 - [C++] Parallelize decompression at field level in experimental IPC compression code ARROW-8220 - [Python] Make dataset FileFormat objects serializable ARROW-8222 - [C++] Use bcp to make a slim boost for bundled build ARROW-8224 - [C++] Remove APIs deprecated prior to 0.16.0 ARROW-8225 - [Rust] IPC reader must respect continuation markers ARROW-8227 - [C++] Refine SIMD feature definitions ARROW-8231 - [Rust] Parse key_value_metadata from parquet FileMetaData into arrow schema metadata ARROW-8232 - [Python] Deprecate pa.open_file and pa.open_stream in favor of pa.ipc.open_file/open_stream ARROW-8241 - [Rust] Add convenience methods to Schema ARROW-8242 - [C++] Flight fails to compile on GCC 4.8 ARROW-8243 - [Rust] [DataFusion] Fix inconsistent API in LogicalPlanBuilder ARROW-8244 - [Python][Parquet] Add `write_to_dataset` option to populate the “file_path” metadata fields ARROW-8246 - [C++] Add -Wa,-mbig-obj when compiling with MinGW to avoid linking errors ARROW-8247 - [Python] Expose Parquet writing “engine” setting in pyarrow.parquet.write_table ARROW-8249 - [Rust] [DataFusion] Make Table and LogicalPlanBuilder APIs more consistent ARROW-8252 - [CI][Ruby] Add Ubuntu 20.04 ARROW-8256 - [Rust] [DataFusion] Update CLI documentation for 0.17.0 release ARROW-8264 - [Rust] [DataFusion] Create utility for printing record batches ARROW-8266 - [C++] Add backup mirrors for external project source downloads ARROW-8267 - [CI][GLib] Failed to build on Ubuntu 16.04 ARROW-8271 - [Packaging] Allow wheel upload failures to gemfury ARROW-8275 - [Python][Docs] Review Feather + IPC file documentation per “Feather V2” changes ARROW-8277 - [Python] RecordBatch interface improvements ARROW-8279 - [C++] Do not export symbols from Codec implementations, remove need for PIMPL pattern ARROW-8288 - [Python] Expose with_ modifiers on DataType ARROW-8290 - [Python][Dataset] Improve ergonomy of the FileSystemDataset constructor ARROW-8291 - [Packaging] Conda nightly builds can’t locate Numpy ARROW-8292 - [Python][Dataset] Passthrough schema to Factory.finish() in dataset() function ARROW-8294 - [Format][Flight] Add DoExchange RPC to Flight protocol ARROW-8295 - [C++][Dataset] IpcFileFormat should expliclity push down column projection ARROW-8299 - [C++] Reusable “optional ParallelFor” function for optional use of multithreading ARROW-8300 - [R] Documentation and changelog updates for 0.17 ARROW-8307 - [Python] Expose use_memory_map option in pyarrow.feather APIs ARROW-8308 - [Rust] [Flight] Implement DoExchange on examples ARROW-8309 - [CI] C++/Java/Rust workflows should trigger on changes to Flight.proto ARROW-8311 - [C++] Add push style stream format reader ARROW-8316 - [CI] Set docker-compose to use docker-cli instead of docker-py for building images ARROW-8319 - [CI] Install thrift compiler in the debian build ARROW-8320 - [Documentation][Format] Clarify (lack of) alignment requirements in C data interface ARROW-8321 - [CI] Use bundled thrift in Fedora 30 build ARROW-8322 - [CI] Fix C# workflow file syntax ARROW-8325 - [R][CI] Stop including boost in R windows bundle ARROW-8329 - [Documentation][C++] Undocumented FilterOptions argument in Filter kernel ARROW-8330 - [Documentation] The post release script generates the documentation with a development version ARROW-8332 - [C++] Require Thrift compiler to use system libthrift for Parquet build ARROW-8335 - [Release] Add crossbow jobs to run release verification ARROW-8336 - [Packaging][deb] Use libthrift-dev on Debian 10 and Ubuntu 19.10 or later ARROW-8341 - [Packaging][deb] Fail to build by no disk space ARROW-8343 - [GLib] Add GArrowRecordBatchIterator ARROW-8347 - [C++] Add Result APIs to Array methods ARROW-8351 - [R][CI] Store the Rtools-built Arrow C++ library as a build artifact ARROW-8352 - [R] Add install_pyarrow() ARROW-8356 - [Developer] Support * wildcards with “crossbow submit” via GitHub actions ARROW-8361 - [C++] Add Result APIs to Buffer methods and functions ARROW-8362 - [Crossbow] Ensure that the locally generated version is used in the docker tasks ARROW-8367 - [C++] Deprecate Buffer::FromString(…, pool) ARROW-8368 - [Format] In C interface, clarify resource management for consumers needing only a subset of child fields in ArrowArray ARROW-8370 - [C++] Add Result to type / schema APIs ARROW-8371 - [Crossbow] Implement and exercise sanity checks for tasks.yml ARROW-8372 - [C++] Add Result to table / record batch APIs ARROW-8375 - [CI][R] Make Windows tests more verbose in case of segfault ARROW-8376 - [R] Add experimental interface to ScanTask/RecordBatch iterators ARROW-8387 - [Rust] Make schema_to_fb public ARROW-8389 - [Integration] Run tests in parallel ARROW-8390 - [R] Expose schema unification features ARROW-8393 - [C++][Gandiva] Make gandiva function registry case-insensitive ARROW-8396 - [Rust] Remove libc from dependencies ARROW-8398 - [Python] Remove deprecation warnings originating from python tests ARROW-8401 - [C++] Add AVX2/AVX512 version of ByteStreamSplitDecode/ByteStreamSplitEncode ARROW-8403 - [C++] Add ToString() to ChunkedArray, Table and RecordBatch ARROW-8407 - [Rust] Add rustdoc for Dictionary type ARROW-8408 - [Python] Add memory_map= toggle to pyarrow.feather.read_feather ARROW-8409 - [R] Add arrow::cpu_count, arrow::set_cpu_count wrapper functions a la Python ARROW-8412 - [C++][Gandiva] Fix gandiva date_diff function definitions ARROW-8433 - [R] Add feather alias for ipc format in dataset API ARROW-8443 - [Gandiva][C++] Fix round/truncate to no-op for special cases ARROW-8444 - [Documentation] Fix spelling errors across the codebase ARROW-8449 - [R] Use CMAKE_UNITY_BUILD everywhere ARROW-8450 - [Integration][C++] Implement large list/binary/utf8 integration ARROW-8457 - [C++] bridge test does not take care of endianness ARROW-8458 - [C++] Prefer the original mirrors for the bundled thirdparty dependencies ARROW-8461 - [Packaging][deb] Use zstd package for Ubuntu Xenial ARROW-8463 - [CI] Balance the nightly test builds between CircleCI, Azure and Github ARROW-8467 - [C++] Test cases using ArrayFromJSON assume only a little-endian platform ARROW-8477 - [C++] Enable reading and writing of long filenames for Windows ARROW-8496 - [C++] Refine ByteStreamSplitDecodeScalar ARROW-8499 - [C++][Dataset] In ScannerBuilder, batch_size will not work if projecter is not empty ARROW-8506 - [c++] Miss tests to verify expected_buffer with bit_width &gt; 8 in RLE ARROW-8515 - [C++] Bitmap ToString should have an option of grouping by bytes ARROW-8517 - [Developer][Release] Update Crossbow RC verification setup for changes since 0.16.0 ARROW-8523 - [C++] Optimize BitmapReader PARQUET-1663 - [C++] Provide API to check the presence of complex data types PARQUET-1716 - [C++] Add support for BYTE_STREAM_SPLIT encoding PARQUET-1770 - [C++][CI] Add fuzz target for reading Parquet files PARQUET-1785 - [C++] Improve code reusability in encoding-test.cc PARQUET-1786 - [C++] Use simd to improve BYTE_STREAM_SPLIT decoding performance PARQUET-1806 - [C++] [CI] Improve fuzzing seed corpus PARQUET-1825 - [C++] Fix compilation error in column_io_benchmark.cc PARQUET-1828 - [C++] Add a SSE2 path for the ByteStreamSplit encoder implementation PARQUET-1840 - [C++] DecodeSpaced copies more values then necessary PARQUET-458 - [C++] Implement support for DataPageV2 Bug Fixes ARROW-1907 - [C++/Python] Feather format cannot accommodate string columns containing more than a total of 2GB of data ARROW-2255 - [Developer][Integration] Serialize schema- and field-level custom metadata in integration test JSON format ARROW-2587 - [Python] Unable to write StructArrays with multiple children to parquet ARROW-3004 - [Documentation] Builds docs for master rather than a pinned commit ARROW-3543 - [R] Better support for timestamp format and time zones in R ARROW-5265 - [Python/CI] Add integration test with kartothek ARROW-5473 - [C++] Build failure on googletest_ep on Windows when using Ninja ARROW-5981 - [C++] DictionaryBuilder initialization with Array can fail silently ARROW-6528 - [C++] Spurious Flight test failures (port allocation failure) ARROW-6738 - [Java] Fix problems with current union comparison logic ARROW-6757 - [Python] Creating csv.ParseOptions() causes “Windows fatal exception: access violation” with Visual Studio 2017 ARROW-6871 - [Java] Enhance TransferPair related parameters check and tests ARROW-6872 - [C++][Python] Empty table with dictionary-columns raises ArrowNotImplementedError ARROW-6895 - [C++][Parquet] parquet::arrow::ColumnReader: ByteArrayDictionaryRecordReader repeats returned values when calling `NextBatch()` ARROW-7008 - [Python] pyarrow.chunked_array([array]) fails on array with all-None buffers ARROW-7049 - [C++] warnings building on mingw-w64 ARROW-7301 - [Java] Sql type DATE should correspond to DateDayVector ARROW-7335 - [C++][Gandiva] Add castBIGINT, extractDay interval_day functions in Gandiva ARROW-7390 - [C++][Dataset] Concurrency race in Projector::Project ARROW-7405 - [Java] ListVector isEmpty API is incorrect ARROW-7466 - [CI][Java] Fix gandiva-jar-osx nightly build failure ARROW-7467 - [Java] ComplexCopier does incorrect copy for Map nullable info ARROW-7520 - [R] Writing many batches causes a crash ARROW-7546 - [Java] Use new implementation to concat vectors values in batch ARROW-7624 - [Rust] Soundness issues via `Buffer` methods ARROW-7628 - [Python] Better document some read_csv corner cases ARROW-7631 - [C++][Gandiva] return zero if there is an overflow while converting a decimal to a lower precision/scale ARROW-7680 - [C++][Dataset] Partition discovery is not working with windows path ARROW-7701 - [C++] [CI] Flight test error on macOS ARROW-7713 - [Java] TastLeak was put at the wrong location ARROW-7722 - [Java][FlightRPC] Memory leak ARROW-7734 - [C++] Segfault when comparing status with and without detail ARROW-7740 - [C++] Array internals corruption in StructArray::Flatten ARROW-7755 - [Python] Windows wheel cannot be installed on Python 3.8 ARROW-7758 - [Python] Wrong conversion of timestamps that are out of bounds for pandas (eg 0000-01-01) ARROW-7760 - [Release] Fix verify-release-candidate.sh since pip3 seems to no longer be in miniconda ARROW-7762 - [Python] Exceptions in ParquetWriter get ignored ARROW-7766 - [Python][Packaging] Windows py38 wheels are built with wrong ABI tag ARROW-7772 - [R][C++][Dataset] Unable to filter on date32 object with date64 scalar ARROW-7775 - [Rust] Don’t let safe code arbitrarily transmute readers and writers ARROW-7777 - [Go] StructBuilder/ListBuilder index out of range panic ARROW-7780 - [Release] Fix Windows wheel RC verification script given lack of “m” ABI tag in Python 3.8 ARROW-7781 - [C++][Dataset] Filtering on a non-existent column gives a segfault ARROW-7783 - [C++] ARROW_DATASET should enable ARROW_COMPUTE ARROW-7785 - [C++] sparse_tensor.cc is extremely slow to compile ARROW-7786 - [R] Wire up check_metadata in Table.Equals method ARROW-7789 - [R] Can’t initialize arrow objects when R.oo package is loaded ARROW-7791 - [C++][Parquet] Fix building error “cannot bind lvalue” ARROW-7792 - [R] read_* functions should close connection to file ARROW-7793 - [Java] If there is a leak the base allocator should release the excess memory to parent before throwing exception ARROW-7794 - [Rust] cargo publish fails for arrow-flight due to relative path to Flight.proto ARROW-7797 - [Release][Rust] Fix arrow-flight’s version in datafusion crate ARROW-7802 - [C++] Support for LargeBinary and LargeString in the hash kernel ARROW-7806 - [Python] Implement to_pandas for lists of LargeBinary/String ARROW-7807 - [R] Installation on RHEL 7 Cannot call io___MemoryMappedFile__Open() ARROW-7809 - [R] vignette does not run on Win 10 nor ubuntu ARROW-7813 - [Rust] Fix undefined behaviour and and remove unsafe ARROW-7815 - [C++] Fix crashes on corrupt IPC input (OSS-Fuzz) ARROW-7827 - [Python] conda-forge pyarrow package does not have s3 enabled ARROW-7832 - [R] Patches to 0.16.0 release ARROW-7836 - [Rust] “allocate_aligned”/”reallocate” need to initialize memory to avoid UB ARROW-7837 - [Java] bug in BaseVariableWidthVector.copyFromSafe results with an index out of bounds exception ARROW-7838 - [C++] Installed plasma-store-server fails finding Boost ARROW-7841 - [C++] HADOOP_HOME doesn’t work to find libhdfs.so ARROW-7844 - [R] array_to_vector is not thread safe ARROW-7848 - Add doc for MapType ARROW-7852 - [Python] 0.16.0 wheels not compatible with older numpy ARROW-7857 - [Python] Failing test with pandas master for extension type conversion ARROW-7861 - [C++][Parquet] Add fuzz regression corpus for parquet reader ARROW-7884 - [C++][Python] Crash in pq.read_table() ARROW-7887 - [Rust] Filter kernel does not support temporal types ARROW-7889 - [Rust] Datafusion CLI does not support registering Parquet files ARROW-7899 - [Integration][Java] null type integration test ARROW-7908 - [R] Can’t install package without setting LIBARROW_DOWNLOAD=true ARROW-7922 - [CI][Crossbow] Nightly macOS wheel builds fail (brew bundle edition) ARROW-7923 - [CI][Crossbow] macOS autobrew fails on homebrew-versions ARROW-7926 - [Developer] “archery lint” target is not ergonomic for running a single check like IWYU ARROW-7928 - [Python] Example of flight server and client not working ARROW-7931 - [C++] Fix crash on corrupt Map array input (OSS-Fuzz) ARROW-7936 - [Python] FileSystem.from_uri test fails on python 3.5 ARROW-7940 - [C++] Unable to generate cmake build with settings other than default ARROW-7958 - [Java] Update Avro to version 1.9.2 ARROW-7962 - [R][Dataset] Followup to “Consolidate Source and Dataset classes” ARROW-7968 - [C++] orc_ep build fails on 64-bit Raspbian ARROW-7973 - [Developer][C++] ResourceWarnings in run_cpplint.py ARROW-7974 - [Developer][C++] ResourceWarning in “make check-format” ARROW-7975 - [C++] Do not include padding bytes in “Buffer” IPC metadata accounting ARROW-7978 - [Developer] GitHub Actions “lint” task is running include-what-you-use and failing ARROW-7980 - [Python] Deserialization with pyarrow fails for certain Timestamp-based data frame ARROW-7981 - [C++][Dataset] Fails to compile on gcc 5.4 ARROW-7985 - [C++] ListBuilder.Finish fails if underlying value builder is empty and .Reserve’d ARROW-7990 - [C++][Developer] Add “archery lint” option for running “iwyu.sh all” ARROW-7992 - [C++] MSVC warning causing Appveyor failure in sort_to_indices.cc ARROW-7996 - [Python] Error serializing empty pandas DataFrame with pyarrow ARROW-7997 - [Python] Schema equals method with inconsistent docs in pyarrow ARROW-7999 - [C++] Fix crash on corrupt Map array input (OSS-Fuzz) ARROW-8000 - [C++] gcc 4.8 build failures ARROW-8003 - [C++] -DBZip2_SOURCE=BUNDLED fails when building with clang ARROW-8006 - [C++] Unsafe arrow dictionary recovered from parquet ARROW-8007 - [Python] Remove unused and defunct assert_get_object_equal in plasma tests ARROW-8008 - [C++/Python] Framework Python is preferred even though not the activated one ARROW-8009 - [Java] Fix the hash code methods for BitVector ARROW-8011 - [C++] Some buffers not resized when reading from Parquet ARROW-8013 - [Python][Packaging] Fix manylinux wheels ARROW-8021 - [Python] Appveyor does not appear to be including pandas in test runs ARROW-8036 - [C++] Compilation failure with gtest 1.10.0 ARROW-8042 - [Python] pyarrow.ChunkedArray docstring is incorrect regarding zero-length ChunkedArray having no chunks ARROW-8057 - [Python] Don’t check Schema metadata in __eq__ and __ne__ ARROW-8070 - [C++] Cast segfaults on unsupported cast from list to utf8 ARROW-8071 - [GLib] Build error with configure ARROW-8088 - [C++][Dataset] Partition columns with specified dictionary type result in all nulls ARROW-8091 - [CI][Crossbow] Fix nightly homebrew and R failures ARROW-8092 - [CI][Crossbow] OSX wheels fail on bundled bzip2 ARROW-8094 - [CI][Crossbow] Nightly valgrind test fails ARROW-8095 - [CI][Crossbow] Nightly turbodbc job fails ARROW-8098 - [go] Checkptr Failures on Go 1.14 ARROW-8101 - [FlightRPC][Java] Can’t read/write only an empty null array ARROW-8102 - [Dev] Crossbow’s version detection doesn’t work in the comment bot’s scenario ARROW-8105 - [Python] pyarrow.array segfaults when passed masked array with shrunken mask ARROW-8106 - [Python] Builds on master broken by pandas 1.0.2 release ARROW-8110 - [C#] BuildArrays fails if NestedType is included ARROW-8112 - [FlightRPC][C++] Some status codes don’t round-trip through gRPC ARROW-8119 - [Dev] Make Yaml optional dependency for archery ARROW-8122 - [Python] Empty numpy arrays with shape cannot be deserialized ARROW-8125 - [C++] “arrow-tests” target broken with ninja build ARROW-8127 - [C++] [Parquet] Incorrect column chunk metadata for multipage batch writes ARROW-8128 - [C#] NestedType children serialized on wrong length ARROW-8132 - [C++] arrow-s3fs-test failing on master ARROW-8133 - [CI] Github Actions sometimes fail to checkout Arrow ARROW-8136 - [C++][Python] Creating dataset from relative path no longer working ARROW-8139 - [C++] FileSystem enum causes attributes warning ARROW-8142 - [C++] Casting a chunked array with 0 chunks critical failure ARROW-8144 - [CI] Cmake 3.2 nightly build fails ARROW-8159 - [Python] pyarrow.Schema.from_pandas doesn’t support ExtensionDtype ARROW-8166 - [C++] AVX512 intrinsics fail to compile with clang-8 on Ubuntu 18.04 ARROW-8176 - [FlightRPC][Integration] Have Flight services bind to port 0 in integration ARROW-8186 - [Python] Dataset expression != returns bool instead of expression for invalid value ARROW-8188 - [R] Adapt to latest checks in R-devel ARROW-8193 - [C++] arrow-future-test fails to compile on gcc 4.8 ARROW-8197 - [Rust] DataFusion “create_physical_plan” returns incorrect schema? ARROW-8206 - [R] Minor fix for backwards compatibility on Linux installation ARROW-8209 - [Python] Accessing duplicate column of Table by name gives wrong error ARROW-8213 - [Python][Dataset] Opening a dataset with a local incorrect path gives confusing error message ARROW-8216 - [R][C++][Dataset] Filtering returns all-missing rows where the filtering column is missing ARROW-8217 - [R][C++] Fix crashing test in test-dataset.R on 32-bit Windows from ARROW-7979 ARROW-8219 - [Rust] sqlparser crate needs to be bumped to version 0.2.5 ARROW-8233 - [CI] Build timeouts on “AMD64 Windows MinGW 64 GLib &amp; Ruby “ ARROW-8237 - [Python] Review Developer build instructions for conda and non-conda users ARROW-8238 - [C++][Compute] Failed to build compute tests on windows with msvc2015 ARROW-8239 - [Java] fix param checks in splitAndTransfer method ARROW-8245 - [Python][Parquet] Skip hidden directories when reading partitioned parquet files ARROW-8255 - [Rust] [DataFusion] COUNT(*) results in confusing error ARROW-8259 - [Rust] [DataFusion] ProjectionPushDownRule does not rewrite LIMIT ARROW-8268 - [Ruby] Test failure due to lack of built ZSTD support ARROW-8269 - [Python] Failure in “nopandas” build in test_parquet_row_group_fragments ARROW-8270 - [Python][Flight] Example Flight server with TLS’s certificate and key is not working ARROW-8272 - [CI][Python] Test failure on Ubuntu 16.04 ARROW-8274 - [C++] Use LZ4 frame format for “LZ4” compression in IPC write ARROW-8276 - [C++][Dataset] Scanning a Fragment does not take into account the partition columns ARROW-8280 - [C++] MinGW builds failing due to CARES-related toolchain issue ARROW-8286 - [Python] Creating dataset from pathlib results in UnionDataset instead of FileSystemDataset ARROW-8298 - [C++][CI] MinGW builds fail building grpc ARROW-8303 - [Python] Fix test failure caused by non-deterministic dict key ordering on Python 3.5 ARROW-8304 - [Flight][Python] Flight client with TLS root certificate is reporting error on do_get() ARROW-8305 - [Java] ExtensionTypeVector should make sure underlyingVector not null ARROW-8310 - [C++] Minio’s exceptions not recognized by IsConnectError() ARROW-8315 - [Python][Dataset] Don’t rely on ordered dict keys in test_dataset.py ARROW-8323 - [C++] Pin gRPC at v1.27 to avoid compilation error in its headers ARROW-8326 - [C++] Don’t use deprecated TYPED_TEST_CASE ARROW-8327 - [FlightRPC][Java] gRPC trailers may be null ARROW-8331 - [C++] arrow-compute-filter-benchmark fails to compile ARROW-8333 - [C++][CI] Always compile benchmarks in some C++ CI entry ARROW-8334 - [C++] [Gandiva] Missing DATE32 in LLVM Types / Simple D32 Compute Functions ARROW-8342 - [Python] dask and kartothek integration tests are failing ARROW-8345 - [Python] feather.read_table should not require pandas ARROW-8346 - [CI][Ruby] GLib/Ruby macOS build fails on zlib ARROW-8349 - [CI][NIGHTLY:gandiva-jar-osx] Use latest pygit2 ARROW-8353 - [C++] is_nullable maybe not initialized in parquet writer ARROW-8354 - [R] Fix segfault in Table to Array conversion ARROW-8357 - [Rust] [DataFusion] Dockerfile for CLI is missing format dir ARROW-8358 - [C++] Fix -Wrange-loop-construct warnings in clang-11 ARROW-8360 - [C++][Gandiva] Fixes date32 support for date/time functions ARROW-8365 - [C++] Error when writing files to S3 larger than 5 GB ARROW-8366 - [Rust] Need to revert recent arrow-flight build change ARROW-8369 - [CI] Fix crossbow wildcard groups ARROW-8373 - [GLib] Problems resolving gobject-introspection, arrow in Meson builds ARROW-8380 - [RUST] StringDictionaryBuilder not publicly exported from arrow::array ARROW-8384 - [C++][Python] arrow/filesystem/hdfs.h and Python wrapper does not have an option for setting a path to a Kerberos ticket ARROW-8386 - [Python] pyarrow.jvm raises error for empty Arrays ARROW-8388 - [C++] GCC 4.8 fails to move on return ARROW-8397 - [C++] Fail to compile aggregate_test.cc on Ubuntu 16.04 ARROW-8406 - [Python] test_fs fails when run from a different drive on Windows ARROW-8410 - [C++] CMake fails on aarch64 systems that do not support -march=armv8-a+crc+crypto ARROW-8414 - [Python] Non-deterministic row order failure in test_parquet.py ARROW-8415 - [C++][Packaging] fix gandiva linux job ARROW-8416 - [Python] Provide a “feather” alias in the dataset API ARROW-8420 - [C++] CMake fails to configure on armv7l platform (e.g. Raspberry Pi 3) ARROW-8427 - [C++][Dataset] Do not ignore file paths with underscore/dot when full path was specified ARROW-8428 - [C++][NIGHTLY:gandiva-jar-trusty] GCC 4.8 failures in C++ unit tests ARROW-8429 - [C++] Fix Buffer::CopySlice on 0-sized buffer ARROW-8432 - [Python][CI] Failure to download Hadoop ARROW-8437 - [C++] Remove std::move return value from MakeRandomNullBitmap test utility ARROW-8438 - [C++] arrow-io-memory-benchmark crashes ARROW-8439 - [Python] Filesystem docs are outdated ARROW-8441 - [C++] Fix crashes on invalid input (OSS-Fuzz) ARROW-8442 - [Python] NullType.to_pandas_dtype inconsisent with dtype returned in to_pandas/to_numpy ARROW-8460 - [Packaging][deb] Ubuntu Focal build is failed ARROW-8465 - [Packaging][Python] Windows py35 wheel build fails because of boost ARROW-8466 - [Packaging] The python unittests are not running in the windows wheel builds ARROW-8468 - [Document] Fix the incorrect null bits description ARROW-8469 - [Dev] Fix nightly docker tests on azure ARROW-8478 - [Java] Rollback contrib package changes. ARROW-8498 - [Python] Schema.from_pandas fails on extension type, while Table.from_pandas works ARROW-8511 - [Developer][Release] Windows release verification script does not halt if C++ compilation fails PARQUET-1780 - [C++] Set ColumnMetadata.encoding_stats field PARQUET-1788 - [C++] ColumnWriter has undefined behavior when writing arrow chunks PARQUET-1797 - [C++] Fix fuzzing errors PARQUET-1799 - [C++] Stream API: Relax schema checking when reading PARQUET-1810 - [C++] Fix undefined behaviour on invalid enum values (OSS-Fuzz) PARQUET-1813 - [C++] Remove logging statement in unit test PARQUET-1819 - [C++] Fix crashes on corrupt IPC input (OSS-Fuzz) PARQUET-1823 - [C++] Invalid RowGroup returned when reading with parquet::arrow::FileReader-&gt;RowGroup(i)-&gt;Column(j) PARQUET-1829 - [C++] Fix crashes on invalid input (OSS-Fuzz) PARQUET-1831 - [C++] Fix crashes on invalid input (OSS-Fuzz) PARQUET-1835 - [C++] Fix crashes on invalid input (OSS-Fuzz)","headline":"Apache Arrow 0.17.0 Release","image":"https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png","mainEntityOfPage":{"@type":"WebPage","@id":"https://arrow.apache.org/release/0.17.0.html"},"publisher":{"@type":"Organization","logo":{"@type":"ImageObject","url":"https://arrow.apache.org/img/logo.png"}},"url":"https://arrow.apache.org/release/0.17.0.html"}</script>
<!-- End Jekyll SEO tag -->
<!-- favicons -->
<link rel="icon" type="image/png" sizes="16x16" href="/img/favicon-16x16.png" id="light1">
<link rel="icon" type="image/png" sizes="32x32" href="/img/favicon-32x32.png" id="light2">
<link rel="apple-touch-icon" type="image/png" sizes="180x180" href="/img/apple-touch-icon.png" id="light3">
<link rel="apple-touch-icon" type="image/png" sizes="120x120" href="/img/apple-touch-icon-120x120.png" id="light4">
<link rel="apple-touch-icon" type="image/png" sizes="76x76" href="/img/apple-touch-icon-76x76.png" id="light5">
<link rel="apple-touch-icon" type="image/png" sizes="60x60" href="/img/apple-touch-icon-60x60.png" id="light6">
<!-- dark mode favicons -->
<link rel="icon" type="image/png" sizes="16x16" href="/img/favicon-16x16-dark.png" id="dark1">
<link rel="icon" type="image/png" sizes="32x32" href="/img/favicon-32x32-dark.png" id="dark2">
<link rel="apple-touch-icon" type="image/png" sizes="180x180" href="/img/apple-touch-icon-dark.png" id="dark3">
<link rel="apple-touch-icon" type="image/png" sizes="120x120" href="/img/apple-touch-icon-120x120-dark.png" id="dark4">
<link rel="apple-touch-icon" type="image/png" sizes="76x76" href="/img/apple-touch-icon-76x76-dark.png" id="dark5">
<link rel="apple-touch-icon" type="image/png" sizes="60x60" href="/img/apple-touch-icon-60x60-dark.png" id="dark6">
<script>
// Switch to the dark-mode favicons if prefers-color-scheme: dark
function onUpdate() {
light1 = document.querySelector('link#light1');
light2 = document.querySelector('link#light2');
light3 = document.querySelector('link#light3');
light4 = document.querySelector('link#light4');
light5 = document.querySelector('link#light5');
light6 = document.querySelector('link#light6');
dark1 = document.querySelector('link#dark1');
dark2 = document.querySelector('link#dark2');
dark3 = document.querySelector('link#dark3');
dark4 = document.querySelector('link#dark4');
dark5 = document.querySelector('link#dark5');
dark6 = document.querySelector('link#dark6');
if (matcher.matches) {
light1.remove();
light2.remove();
light3.remove();
light4.remove();
light5.remove();
light6.remove();
document.head.append(dark1);
document.head.append(dark2);
document.head.append(dark3);
document.head.append(dark4);
document.head.append(dark5);
document.head.append(dark6);
} else {
dark1.remove();
dark2.remove();
dark3.remove();
dark4.remove();
dark5.remove();
dark6.remove();
document.head.append(light1);
document.head.append(light2);
document.head.append(light3);
document.head.append(light4);
document.head.append(light5);
document.head.append(light6);
}
}
matcher = window.matchMedia('(prefers-color-scheme: dark)');
matcher.addListener(onUpdate);
onUpdate();
</script>
<link rel="stylesheet" href="//fonts.googleapis.com/css?family=Lato:300,300italic,400,400italic,700,700italic,900">
<link href="/css/main.css" rel="stylesheet">
<link href="/css/syntax.css" rel="stylesheet">
<script src="/javascript/main.js"></script>
<!-- Matomo -->
<script>
var _paq = window._paq = window._paq || [];
/* tracker methods like "setCustomDimension" should be called before "trackPageView" */
/* We explicitly disable cookie tracking to avoid privacy issues */
_paq.push(['disableCookies']);
_paq.push(['trackPageView']);
_paq.push(['enableLinkTracking']);
(function() {
var u="https://analytics.apache.org/";
_paq.push(['setTrackerUrl', u+'matomo.php']);
_paq.push(['setSiteId', '20']);
var d=document, g=d.createElement('script'), s=d.getElementsByTagName('script')[0];
g.async=true; g.src=u+'matomo.js'; s.parentNode.insertBefore(g,s);
})();
</script>
<!-- End Matomo Code -->
</head>
<body class="wrap">
<header>
<nav class="navbar navbar-expand-md navbar-dark bg-dark">
<a class="navbar-brand no-padding" href="/"><img src="/img/arrow-inverse-300px.png" height="40px"/></a>
<button class="navbar-toggler ml-auto" type="button" data-toggle="collapse" data-target="#arrow-navbar" aria-controls="arrow-navbar" aria-expanded="false" aria-label="Toggle navigation">
<span class="navbar-toggler-icon"></span>
</button>
<!-- Collect the nav links, forms, and other content for toggling -->
<div class="collapse navbar-collapse justify-content-end" id="arrow-navbar">
<ul class="nav navbar-nav">
<li class="nav-item"><a class="nav-link" href="/overview/" role="button" aria-haspopup="true" aria-expanded="false">Overview</a></li>
<li class="nav-item"><a class="nav-link" href="/faq/" role="button" aria-haspopup="true" aria-expanded="false">FAQ</a></li>
<li class="nav-item"><a class="nav-link" href="/blog" role="button" aria-haspopup="true" aria-expanded="false">Blog</a></li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownGetArrow" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Get Arrow
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownGetArrow">
<a class="dropdown-item" href="/install/">Install</a>
<a class="dropdown-item" href="/release/">Releases</a>
<a class="dropdown-item" href="https://github.com/apache/arrow">Source Code</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownDocumentation" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Documentation
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownDocumentation">
<a class="dropdown-item" href="/docs">Project Docs</a>
<a class="dropdown-item" href="/docs/format/Columnar.html">Format</a>
<hr/>
<a class="dropdown-item" href="/docs/c_glib">C GLib</a>
<a class="dropdown-item" href="/docs/cpp">C++</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/csharp/README.md">C#</a>
<a class="dropdown-item" href="https://godoc.org/github.com/apache/arrow/go/arrow">Go</a>
<a class="dropdown-item" href="/docs/java">Java</a>
<a class="dropdown-item" href="/docs/js">JavaScript</a>
<a class="dropdown-item" href="/julia/">Julia</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/matlab/README.md">MATLAB</a>
<a class="dropdown-item" href="/docs/python">Python</a>
<a class="dropdown-item" href="/docs/r">R</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/ruby/README.md">Ruby</a>
<a class="dropdown-item" href="https://docs.rs/arrow/latest">Rust</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownSubprojects" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Subprojects
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownSubprojects">
<a class="dropdown-item" href="/adbc">ADBC</a>
<a class="dropdown-item" href="/docs/format/Flight.html">Arrow Flight</a>
<a class="dropdown-item" href="/docs/format/FlightSql.html">Arrow Flight SQL</a>
<a class="dropdown-item" href="/datafusion">DataFusion</a>
<a class="dropdown-item" href="/nanoarrow">nanoarrow</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownCommunity" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Community
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownCommunity">
<a class="dropdown-item" href="/community/">Communication</a>
<a class="dropdown-item" href="/docs/developers/index.html">Contributing</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/issues">Issue Tracker</a>
<a class="dropdown-item" href="/committers/">Governance</a>
<a class="dropdown-item" href="/use_cases/">Use Cases</a>
<a class="dropdown-item" href="/powered_by/">Powered By</a>
<a class="dropdown-item" href="/visual_identity/">Visual Identity</a>
<a class="dropdown-item" href="/security/">Security</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/policies/conduct.html">Code of Conduct</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownASF" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
ASF Links
</a>
<div class="dropdown-menu dropdown-menu-right" aria-labelledby="navbarDropdownASF">
<a class="dropdown-item" href="https://www.apache.org/">ASF Website</a>
<a class="dropdown-item" href="https://www.apache.org/licenses/">License</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/sponsorship.html">Donate</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/thanks.html">Thanks</a>
<a class="dropdown-item" href="https://www.apache.org/security/">Security</a>
</div>
</li>
</ul>
</div><!-- /.navbar-collapse -->
</nav>
</header>
<div class="container p-4 pt-5">
<main role="main" class="pb-5">
<!--
-->
<h1 id="apache-arrow-0170-20-april-2020">Apache Arrow 0.17.0 (20 April 2020)</h1>
<p>This is a major release covering more than 2 months of development.</p>
<h2 id="download">Download</h2>
<ul>
<li><a href="https://www.apache.org/dyn/closer.lua/arrow/arrow-0.17.0/"><strong>Source Artifacts</strong></a></li>
<li><strong>Binary Artifacts</strong>
<ul>
<li><a href="https://apache.jfrog.io/artifactory/arrow/centos/">For CentOS</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/debian/">For Debian</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/python/0.17.0/">For Python</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/ubuntu/">For Ubuntu</a></li>
</ul>
</li>
<li><a href="https://github.com/apache/arrow/releases/tag/apache-arrow-0.17.0">Git tag</a></li>
</ul>
<h2 id="contributors">Contributors</h2>
<p>This release includes 569 commits from 79 distinct contributors.</p>
<div class="language-console highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="gp">$</span><span class="w"> </span>git shortlog <span class="nt">-sn</span> apache-arrow-0.16.0..apache-arrow-0.17.0
<span class="go"> 80 Antoine Pitrou
78 Krisztián Szűcs
58 Wes McKinney
55 Neal Richardson
39 Sutou Kouhei
30 Benjamin Kietzman
26 Joris Van den Bossche
20 Andy Grove
13 liyafan82
11 David Li
11 François Saint-Jacques
8 Micah Kornfield
8 Projjal Chanda
7 Max Burke
7 Prudhvi Porandla
6 Kenta Murata
6 Uwe L. Korn
6 Yibo Cai
6 tianchen
5 Frank Du
5 Martin Radev
4 Rok
3 Paddy Horan
3 Takashi Hashida
3 Yosuke Shiro
3 Zhuo Peng
2 Bryan Cutler
2 David Kegley
2 Jacek Pliszka
2 Ji Kunshang
2 Jorge C. Leitao
2 Jörn Horstmann
2 Kazuaki Ishizaki
2 Markus Westerlind
2 Morgan Cassels
2 Ryan Murray
2 Sam Albers
2 Siyuan
2 Stephanie Wang
2 Yuqi Gu
2 gawain.bolton
1 Adam Hooper
1 Andreas Zimmerer
1 Bryant Biggs
1 Chao Sun
1 Dan Nugent
1 Dominik Durner
1 Eyal Farago-Hagag
1 Grant Monroe
1 Hatem Helal
1 Ismaël Mejía
1 John Muehlhausen
1 Ken Suenobu
1 Kevin Conaway
1 Li, Jiajia
1 Mahmut Bulut
1 Marius van Niekerk
1 Michael Chirico
1 NAITOH Jun
1 Neville Dipale
1 Omega Gamage
1 Onur Satici
1 Palmer Lao
1 Paul Balanca
1 Prudhvi
1 Ravindra Wagh
1 Richard Artoul
1 Sebastien Binet
1 TP Boudreau
1 Tarek Allam
1 Tobias Mayer
1 Yuan Zhou
1 Zherui Cao
1 andy-thomason
1 caleboverman
1 mubai
1 psuman
1 ravindra-wagh
1 wenjun.swj
</span></code></pre></div></div>
<h2 id="patch-committers">Patch Committers</h2>
<p>The following Apache committers merged contributed patches to the repository.</p>
<div class="language-console highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="gp">$</span><span class="w"> </span>git shortlog <span class="nt">-csn</span> apache-arrow-0.16.0..apache-arrow-0.17.0
<span class="go"> 128 Wes McKinney
109 Krisztián Szűcs
86 Antoine Pitrou
61 Neal Richardson
60 Sutou Kouhei
33 Benjamin Kietzman
23 Andy Grove
14 François Saint-Jacques
10 Micah Kornfield
7 Paddy Horan
7 Praveen
5 David Li
5 Sebastien Binet
3 GitHub
3 Joris Van den Bossche
3 Neville Dipale
3 Pindikura Ravindra
3 Yosuke Shiro
2 Bryan Cutler
2 Philipp Moritz
1 Brian Hulette
1 Chao Sun
</span></code></pre></div></div>
<h2 id="changelog">Changelog</h2>
<h3 id="new-features-and-improvements">New Features and Improvements</h3>
<ul>
<li><a href="https://issues.apache.org/jira/browse/ARROW-1560">ARROW-1560</a> - [C++] Kernel implementations for “match” function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-1571">ARROW-1571</a> - [C++] Implement argsort kernels (sort indices) for integers using O(n) counting sort</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-1581">ARROW-1581</a> - [Packaging] Tooling to make nightly wheels available for install</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-1582">ARROW-1582</a> - [Python] Set up + document nightly conda builds for macOS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-1636">ARROW-1636</a> - [Format] Integration tests for null type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-2447">ARROW-2447</a> - [C++] Create a device abstraction</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3054">ARROW-3054</a> - [Packaging] Tooling to enable nightly conda packages to be updated to some anaconda.org channel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3410">ARROW-3410</a> - [C++][Dataset] Streaming CSV reader interface for memory-constrainted environments</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3750">ARROW-3750</a> - [R] Pass various wrapped Arrow objects created in Python into R with zero copy via reticulate</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4120">ARROW-4120</a> - [Python] Define process for testing procedures that check for no macro-level memory leaks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4226">ARROW-4226</a> - [Format][C++] Add CSF sparse tensor support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4304">ARROW-4304</a> - [Rust] Enhance documentation for arrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4428">ARROW-4428</a> - [R] Feature flags for R build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4482">ARROW-4482</a> - [Website] Add blog archive page</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4815">ARROW-4815</a> - [Rust] [DataFusion] Add support for * in SQL projection</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5357">ARROW-5357</a> - [Rust] Add capacity field in Buffer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5405">ARROW-5405</a> - [Documentation] Move integration testing documentation to Sphinx docs, add instructions for JavaScript</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5497">ARROW-5497</a> - [Release] Build and publish R/Java/JS docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5501">ARROW-5501</a> - [R] Reorganize read/write file/stream functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5510">ARROW-5510</a> - [Format] Feather V2 based on Arrow IPC file format, with compression support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5563">ARROW-5563</a> - [Format] Update integration test JSON format documentation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5585">ARROW-5585</a> - [Go] rename arrow.TypeEquals into arrow.TypeEqual</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5742">ARROW-5742</a> - [CI] Add daily / weekly Valgrind build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5757">ARROW-5757</a> - [Python] Stop supporting Python 2.7</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-590">ARROW-590</a> - [Integration] Add integration tests for Union types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5949">ARROW-5949</a> - [Rust] Implement DictionaryArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6165">ARROW-6165</a> - [Integration] Use multiprocessing to run integration tests on multiple CPU cores</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6176">ARROW-6176</a> - [Python] Allow to subclass ExtensionArray to attach to custom extension type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6393">ARROW-6393</a> - [C++] Add EqualOptions support in SparseTensor::Equals</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6479">ARROW-6479</a> - [C++] inline errors from external projects’ build logs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6510">ARROW-6510</a> - [Python][Filesystem] Expose nanosecond resolution mtime</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6666">ARROW-6666</a> - [Rust] [DataFusion] Implement string literal expression</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6724">ARROW-6724</a> - [C++] Add simpler static ctor for BufferOutputStream than the current Create function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6821">ARROW-6821</a> - [C++][Parquet] Do not require Thrift compiler when building (but still require library)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6823">ARROW-6823</a> - [C++][Python][R] Support metadata in the feather format?</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6837">ARROW-6837</a> - [C++/Python] access File Footer custom_metadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6841">ARROW-6841</a> - [C++] Upgrade to LLVM 8</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6875">ARROW-6875</a> - [FlightRPC] Implement Criteria for ListFlights RPC / list_flights method</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6915">ARROW-6915</a> - [Developer] Do not overwrite minor release version with merge script, even if not specified by committer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6947">ARROW-6947</a> - [Rust] [DataFusion] Add support for scalar UDFs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6996">ARROW-6996</a> - [Python] Expose boolean filter kernel on Table</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7044">ARROW-7044</a> - [Release] Create a post release script for the home-brew formulas</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7048">ARROW-7048</a> - [Java] Support for combining multiple vectors under VectorSchemaRoot</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7063">ARROW-7063</a> - [C++] Schema print method prints too much metadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7073">ARROW-7073</a> - [Java] Support concating vectors values in batch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7080">ARROW-7080</a> - [Python][Parquet][C++] Expose parquet field_id in Schema objects</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7091">ARROW-7091</a> - [C++] Move all factories to type_fwd.h</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7119">ARROW-7119</a> - [C++][CI] Use scripts/util_coredump.sh to show automatic backtraces</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7201">ARROW-7201</a> - [GLib][Gandiva] Add support for BooleanNode</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7202">ARROW-7202</a> - [R][CI] Improve rwinlib building on CI to stop re-downloading dependencies</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7222">ARROW-7222</a> - [Python][Release] Wipe any existing generated Python API documentation when updating website</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7233">ARROW-7233</a> - [C++] Add Result<T> APIs to IPC module</T></li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7256">ARROW-7256</a> - [C++] Remove ARROW_MEMORY_POOL_DEFAULT macro</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7330">ARROW-7330</a> - [C++] Add Result<T> to APIs to arrow/gpu</T></li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7332">ARROW-7332</a> - [C++][Parquet] Explicitly catch status exceptions in PARQUET_CATCH_NOT_OK</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7336">ARROW-7336</a> - [C++] Implement MinMax options to not skip nulls</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7338">ARROW-7338</a> - [C++] Improve InMemoryDataSource to support generator instead of static list</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7365">ARROW-7365</a> - [Python] Support FixedSizeList type in conversion to numpy/pandas</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7400">ARROW-7400</a> - [Java] Avoids the worst case for quick sort</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7412">ARROW-7412</a> - [C++][Dataset] Ensure that dataset code is robust to schemas with duplicate field names</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7419">ARROW-7419</a> - [Python] Support SparseCSCMatrix</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7427">ARROW-7427</a> - [Python] Support SparseCSFTensor</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7428">ARROW-7428</a> - [Format][C++] Add serialization for CSF sparse tensors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7444">ARROW-7444</a> - [GLib] Add LocalFileSystem support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7462">ARROW-7462</a> - [C++] Add CpuInfo detection for Arm64 Architecture</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7491">ARROW-7491</a> - [Java] Improve the performance of aligning</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7515">ARROW-7515</a> - [C++] Rename nonexistent and non_existent to not_found</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7524">ARROW-7524</a> - [C++][CI] Build parquet support in the VS2019 GitHub Actions job</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7530">ARROW-7530</a> - [Developer] Do not include list of commits from PR in squashed summary message</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7534">ARROW-7534</a> - [Java] Create a new java/contrib module</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7547">ARROW-7547</a> - [C++] [Python] [Dataset] Additional reader options in ParquetFileFormat</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7587">ARROW-7587</a> - [C++][Compute] Add Top-k kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7608">ARROW-7608</a> - [C++][Dataset] Expose more informational properties</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7615">ARROW-7615</a> - [CI][Gandiva] Ensure that the gandiva_jni library has only a whitelisted set of shared dependencies as part of Travis CI job</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7616">ARROW-7616</a> - [Java] Support comparing value ranges for dense union vector</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7625">ARROW-7625</a> - [GLib] Parquet GLib and Red Parquet (Ruby) do not allow specifying compression type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7641">ARROW-7641</a> - [R] Make dataset vignette have executable code</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7662">ARROW-7662</a> - [R] Support creating ListArray from R list</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7664">ARROW-7664</a> - [C++] Extract localfs default from FileSystemFromUri</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7675">ARROW-7675</a> - [R][CI] Move Windows CI from Appveyor to GHA</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7679">ARROW-7679</a> - [R] Cleaner interface for creating UnionDataset</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7684">ARROW-7684</a> - [Rust] Provide example of Flight server for DataFusion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7685">ARROW-7685</a> - [Developer] Add support for GitHub Actions to Crossbow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7691">ARROW-7691</a> - [C++] Verify missing fields when walking Flatbuffers data</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7708">ARROW-7708</a> - [Release] Include PARQUET commits from git changelog in release changelogs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7712">ARROW-7712</a> - [CI][Crossbow] Fix or delete fuzzit jobs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7720">ARROW-7720</a> - [C++][Python] Add check_metadata argument to Table.equals</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7725">ARROW-7725</a> - [C++] Add infrastructure for unity builds and precompiled headers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7726">ARROW-7726</a> - [CI] [C++] Use boost binaries on Windows GHA build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7729">ARROW-7729</a> - [Python][CI] Pin pandas version to 0.25 in the dask integration test</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7733">ARROW-7733</a> - [Developer] Install locally a new enough version of Go for release verification script</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7735">ARROW-7735</a> - [Release] conda-forge channel is missing for verifying wheels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7736">ARROW-7736</a> - [Release] Binary verification sometimes fails with transient error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7739">ARROW-7739</a> - [GLib] Use placement new to initialize shared_ptr object in private structs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7741">ARROW-7741</a> - [C++][Parquet] Incorporate new level generation logic in parquet write path with a flag to revert back to old logic</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7742">ARROW-7742</a> - [GLib] Add support for MapArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7745">ARROW-7745</a> - [Doc] [C++] Update Parquet documentation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7749">ARROW-7749</a> - [C++] Link some more tests together</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7750">ARROW-7750</a> - [Release] Make the source release verification script restartable</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7751">ARROW-7751</a> - [Release] macOS wheel verification also needs arrow-testing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7752">ARROW-7752</a> - [Release] Enable and test dataset in the verification script</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7754">ARROW-7754</a> - [C++] Result<T> is slow</T></li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7761">ARROW-7761</a> - [C++] Add S3 support to fs::FileSystemFromUri</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7764">ARROW-7764</a> - [C++] Builders allocate a null bitmap buffer even if there is no nulls</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7771">ARROW-7771</a> - [Developer] Use ARROW_TMPDIR environment variable in the verification scripts instead of TMPDIR</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7774">ARROW-7774</a> - [Packaging][Python] Update macos and windows wheel filenames</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7787">ARROW-7787</a> - [Rust] Add collect to Table API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7788">ARROW-7788</a> - [C++] Add schema conversion support for map type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7790">ARROW-7790</a> - [Website] Update how to install Linux packages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7795">ARROW-7795</a> - [Rust - DataFusion] Support boolean negation (NOT)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7796">ARROW-7796</a> - [R] write_* functions should invisibly return their inputs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7799">ARROW-7799</a> - [R][CI] Remove flatbuffers from homebrew formulae</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7804">ARROW-7804</a> - [C++][R] Compile error on macOS 10.11</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7812">ARROW-7812</a> - [Packaging][Python] Upgrade LLVM in manylinux1 docker image</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7817">ARROW-7817</a> - [CI] macOS R autobrew nightly failed on installing dependency from source</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7819">ARROW-7819</a> - [C++][Gandiva] Add DumpIR to Filter/Projector classes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7824">ARROW-7824</a> - [C++][Dataset] Provide Dataset writing to IPC format</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7828">ARROW-7828</a> - [Release] Remove SSH keys for internal use</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7829">ARROW-7829</a> - [R] Test R bindings on clang</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7833">ARROW-7833</a> - [R] Make install_arrow() actually install arrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7834">ARROW-7834</a> - [Release] Post release task for updating the documentations</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7839">ARROW-7839</a> - [Python][Dataset] Add IPC format to python bindings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7846">ARROW-7846</a> - [Python][Dev] Remove last dependencies on six</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7849">ARROW-7849</a> - [Packaging][Python] Remove the remaining py27 crossbow wheel tasks from the nightlies</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7858">ARROW-7858</a> - [C++][Python] Support casting an Extension type to its storage type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7859">ARROW-7859</a> - [R] Minor patches for CRAN submission 0.16.0.2</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7862">ARROW-7862</a> - [R] Linux installation should run quieter by default</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7863">ARROW-7863</a> - [C++][Python][CI] Ensure running HDFS related tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7864">ARROW-7864</a> - [R] Make sure bundled installation works even if there are system packages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7865">ARROW-7865</a> - [R] Test builds on latest Linux versions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7868">ARROW-7868</a> - [Crossbow] Reduce GitHub API query parallelism</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7869">ARROW-7869</a> - [Python] Boost::system and boost::filesystem not necessary anymore in Python wheels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7872">ARROW-7872</a> - [Python] Support conversion of list-of-struct in Array/Table.to_pandas</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7874">ARROW-7874</a> - [Python][Archery] Validate docstrings with numpydoc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7876">ARROW-7876</a> - [R] Installation fails in the documentation generation image</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7877">ARROW-7877</a> - [Packaging] Fix crossbow deployment to github artifacts</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7879">ARROW-7879</a> - [C++][Doc] Add doc for the Device API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7880">ARROW-7880</a> - [CI][R] R sanitizer job is not really working</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7881">ARROW-7881</a> - [C++] Fix pedantic warnings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7882">ARROW-7882</a> - [C++][Gandiva] Optimise like function for substring pattern</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7886">ARROW-7886</a> - [C++][Dataset] Consolidate Source and Dataset</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7888">ARROW-7888</a> - [Python] Allow using a more modern version of jpype in pyarrow.jvm</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7890">ARROW-7890</a> - [C++] Add Promise / Future implementation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7891">ARROW-7891</a> - [C++] RecordBatch-&gt;Equals should also have a check_metadata argument</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7892">ARROW-7892</a> - [Python] Expose FilesystemSource.format attribute</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7895">ARROW-7895</a> - [Python] Remove more python 2.7 cruft</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7896">ARROW-7896</a> - [C++] Refactor from #include guards to #pragma once</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7897">ARROW-7897</a> - [Packaging] Temporarily disable artifact uploading until we fix the deployment issues</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7898">ARROW-7898</a> - [Python] Reduce the number docstring violations using numpydoc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7904">ARROW-7904</a> - [C++] Decide about Field/Schema metadata printing parameters and how much to show by default</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7907">ARROW-7907</a> - [Python] Conversion to pandas of empty table with timestamp type aborts</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7912">ARROW-7912</a> - [Format] C data interface</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7913">ARROW-7913</a> - [C++][Python][R] C++ implementation of C data interface</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7915">ARROW-7915</a> - [CI] [Python] Run tests with Python development mode enabled</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7916">ARROW-7916</a> - [C++][Dataset] Project IPC record batches to materialized fields</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7917">ARROW-7917</a> - [CMake] FindPythonInterp should check for python3</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7919">ARROW-7919</a> - [R] install_arrow() should conda install if appropriate</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7920">ARROW-7920</a> - [R] Fill in some missing input validation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7921">ARROW-7921</a> - [Go] Add Reset method to various components and clean up comments</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7927">ARROW-7927</a> - [C++] Fix ‘cpu_info.cc’ compilation warning</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7929">ARROW-7929</a> - [C++] CMake target names differ from upstream provided names</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7930">ARROW-7930</a> - [Python][CI] Test jpype integration in CI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7932">ARROW-7932</a> - [Rust] [Parquet] Implement array reader for temporal types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7934">ARROW-7934</a> - [C++] Fix UriEscape for empty string</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7935">ARROW-7935</a> - [Java] Remove Netty dependency for BufferAllocator and ReferenceManager</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7937">ARROW-7937</a> - [Python][Packaging] Remove boost from the macos wheels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7941">ARROW-7941</a> - [Rust] [DataFusion] Logical plan should support unresolved column references</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7943">ARROW-7943</a> - [C++][Parquet] Add a new level builder capable of handling nested data</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7947">ARROW-7947</a> - [Rust] [Flight] [DataFusion] Implement example for get_schema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7949">ARROW-7949</a> - [Developer] Update to ‘.gitignore’ to not track user specific ‘cpp/Brewfile.lock.json’ file</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7951">ARROW-7951</a> - [Python][Parquet] Expose BYTE_STREAM_SPLIT to pyarrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7959">ARROW-7959</a> - [Ruby] Add support for Ruby 2.3 again</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7963">ARROW-7963</a> - [C++][Python][Dataset] Expose listing fragments</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7965">ARROW-7965</a> - [Python] Refine higher level dataset API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7966">ARROW-7966</a> - [Integration][Flight][C++] Client should verify each batch independently</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7969">ARROW-7969</a> - [Packaging] Use cURL to upload artifacts</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7970">ARROW-7970</a> - [Packaging][Python] Use system boost to build the macos wheels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7971">ARROW-7971</a> - [Rust] Create rowcount utility</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7977">ARROW-7977</a> - [C++] Rename fs::FileStats to fs::FileInfo</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7979">ARROW-7979</a> - [C++] Implement experimental buffer compression in IPC messages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7982">ARROW-7982</a> - [C++] Let ArrayDataVisitor accept void-returning functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7983">ARROW-7983</a> - [CI][R] Nightly builds should be more verbose when they fail</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7984">ARROW-7984</a> - [R] Check for valid inputs in more places</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7986">ARROW-7986</a> - [Python] pa.Array.from_pandas cannot convert pandas.Series containing pyspark.ml.linalg.SparseVector</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7987">ARROW-7987</a> - [CI][R] Fix for verbose nightly builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7988">ARROW-7988</a> - [R] Fix on.exit calls in reticulate bindings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7991">ARROW-7991</a> - [C++][Plasma] Allow option for evicting if full when creating an object</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7993">ARROW-7993</a> - [Java] Support decimal type in ComplexCopier</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7994">ARROW-7994</a> - [CI][C++] Move AppVeyor MinGW builds to GitHub Actions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7995">ARROW-7995</a> - [C++] IO: coalescing and caching read ranges</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7998">ARROW-7998</a> - [C++][Plasma] Make Seal requests synchronous</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8005">ARROW-8005</a> - [Website] Review and adjust any usages of Apache dist system from website / tools</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8014">ARROW-8014</a> - [C++] Provide CMake targets to test only within a given label</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8016">ARROW-8016</a> - [Developer] Fix deprecation warning in PR merge tool</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8018">ARROW-8018</a> - [C++][Parquet]Parquet Modular Encryption</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8024">ARROW-8024</a> - [R] Bindings for BinaryType and FixedBinaryType</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8026">ARROW-8026</a> - [Python] Support memoryview in addition to string value types for constructing string and binary type arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8027">ARROW-8027</a> - [Developer][Integration] Add integration tests for duplicate field names</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8028">ARROW-8028</a> - [Go] Allow duplicate field names in schemas and nested types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8030">ARROW-8030</a> - [C++][Plasma] Fix inconsistent comment style</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8039">ARROW-8039</a> - [Python][Dataset] Support using dataset API in pyarrow.parquet with a minimal ParquetDataset shim</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8044">ARROW-8044</a> - [CI][NIGHTLY:gandiva-jar-osx] pygit2 needs libgit2 v1.0.x</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8055">ARROW-8055</a> - [GLib][Ruby] Add some metadata bindings to GArrowSchema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8058">ARROW-8058</a> - [C++][Python][Dataset] Provide an option to toggle validation and schema inference in FileSystemDatasetFactoryOptions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8059">ARROW-8059</a> - [Python] Make FileSystem objects serializable</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8060">ARROW-8060</a> - [Python] Make dataset Expression objects serializable</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8061">ARROW-8061</a> - [C++][Dataset] Ability to specify granularity of ParquetFileFragment (support row groups)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8063">ARROW-8063</a> - [Python] Add user guide documentation for Datasets API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8064">ARROW-8064</a> - [Dev] Implement Comment bot via Github actions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8072">ARROW-8072</a> - [C++][Plasma] Add const constraint when parsing data</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8077">ARROW-8077</a> - [Python] Add wheel build script and Crossbow configuration for Windows on Python 3.5</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8079">ARROW-8079</a> - [Python] Implement a wrapper for KeyValueMetadata, duck-typing dict where relevant</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8080">ARROW-8080</a> - [C++] Add AVX512 build option</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8082">ARROW-8082</a> - [Java][Plasma] Add JNI list() interface</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8083">ARROW-8083</a> - [GLib] Add support for Peek() to GIOInputStream</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8086">ARROW-8086</a> - [Java] Support writing decimal from big endian byte array in UnionListWriter</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8087">ARROW-8087</a> - [C++][Dataset] Order of keys with HivePartitioning is lost in resulting schema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8096">ARROW-8096</a> - [C++][Gandiva] Create null node of Interval type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8097">ARROW-8097</a> - [Dev] Comment bot’s crossbow command acts on the master branch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8103">ARROW-8103</a> - [R] Make default Linux build more minimal</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8104">ARROW-8104</a> - [C++] Don’t install bundled Thrift</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8107">ARROW-8107</a> - [Packaging][APT] Use HTTPS for LLVM APT repository for Debian GNU/Linux stretch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8109">ARROW-8109</a> - [Packaging][APT] Drop support for Ubuntu Disco</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8117">ARROW-8117</a> - [Rust] [Datafusion] Allow CAST from number to timestamp</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8118">ARROW-8118</a> - [R] dim method for FileSystemDataset</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8120">ARROW-8120</a> - [Packaging][APT] Add support for Ubuntu Focal</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8123">ARROW-8123</a> - [Rust] [DataFusion] Create LogicalPlanBuilder</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8124">ARROW-8124</a> - [Rust] Update library dependencies</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8126">ARROW-8126</a> - [C++][Compute] Add Top-K kernel benchmark</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8129">ARROW-8129</a> - [C++][Compute] Refine compare sorting kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8130">ARROW-8130</a> - [C++][Gandiva] Fix Dex visitor in llvm_generator to handle interval type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8140">ARROW-8140</a> - [Developer] Follow NullType -&gt; NullField change</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8141">ARROW-8141</a> - [C++] Optimize BM_PlainDecodingBoolean performance using AVX512 Intrinsics API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8145">ARROW-8145</a> - [C++] Rename GetTargetInfos</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8146">ARROW-8146</a> - [C++] Add per-filesystem facility to sanitize a path</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8150">ARROW-8150</a> - [Rust] Allow writing custom FileMetaData k/v pairs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8151">ARROW-8151</a> - [Benchmarking][Dataset] Benchmark Parquet read performance with S3File</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8153">ARROW-8153</a> - [Packaging] Update the conda feedstock files and upload artifacts to Anaconda</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8158">ARROW-8158</a> - [Java] Getting length of data buffer and base variable width vector</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8164">ARROW-8164</a> - [C++][Dataset] Let datasets be viewable with non-identical schema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8165">ARROW-8165</a> - [Packaging] Make nightly wheels available on a PyPI server</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8167">ARROW-8167</a> - [CI] Add support for skipping builds with skip pattern in pull request title</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8168">ARROW-8168</a> - [Java][Plasma] Improve Java Plasma client off-heap memory usage</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8177">ARROW-8177</a> - [Rust] Make schema_to_fb_offset public</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8178">ARROW-8178</a> - [C++] Upgrade to Flatbuffers 1.12</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8179">ARROW-8179</a> - [R] Windows build script tweaking for nightly packaging on GHA</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8181">ARROW-8181</a> - [Java][FlightRPC] Expose transport error metadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8182">ARROW-8182</a> - [Packaging] Increment the version number detected from the latest git tag</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8183">ARROW-8183</a> - [c++][FlightRPC] Expose transport error metadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8184">ARROW-8184</a> - [Packaging] Use arrow-nightlies organization name on Anaconda and Gemfury to host the nightlies</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8185">ARROW-8185</a> - [Packaging] Document the available nightly wheels and conda packages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8187">ARROW-8187</a> - [R] Make test assertions robust to i18n</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8191">ARROW-8191</a> - [Packaging][APT] Fix cmake removal in Debian GNU/Linux Stretch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8192">ARROW-8192</a> - [C++] script for unpack avx512 intrinsics code</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8194">ARROW-8194</a> - [CI] Github Actions Windows job should run tests in parallel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8195">ARROW-8195</a> - [CI] Remove Boost download step in Github Actions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8198">ARROW-8198</a> - [C++] Diffing should handle null arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8200">ARROW-8200</a> - [GLib] Rename garrow_file_system_target_info{,s}() to …_file_info{,s}()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8203">ARROW-8203</a> - [C#] “dotnet pack” is failed</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8204">ARROW-8204</a> - [Rust] [DataFusion] Add support for aliased expressions in SQL</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8207">ARROW-8207</a> - [Packaging][wheel] Use LLVM 8 in manylinux2010 and manylinux2014</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8215">ARROW-8215</a> - [CI][GLib] Meson install fails in the macOS build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8218">ARROW-8218</a> - [C++] Parallelize decompression at field level in experimental IPC compression code</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8220">ARROW-8220</a> - [Python] Make dataset FileFormat objects serializable</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8222">ARROW-8222</a> - [C++] Use bcp to make a slim boost for bundled build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8224">ARROW-8224</a> - [C++] Remove APIs deprecated prior to 0.16.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8225">ARROW-8225</a> - [Rust] IPC reader must respect continuation markers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8227">ARROW-8227</a> - [C++] Refine SIMD feature definitions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8231">ARROW-8231</a> - [Rust] Parse key_value_metadata from parquet FileMetaData into arrow schema metadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8232">ARROW-8232</a> - [Python] Deprecate pa.open_file and pa.open_stream in favor of pa.ipc.open_file/open_stream</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8241">ARROW-8241</a> - [Rust] Add convenience methods to Schema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8242">ARROW-8242</a> - [C++] Flight fails to compile on GCC 4.8</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8243">ARROW-8243</a> - [Rust] [DataFusion] Fix inconsistent API in LogicalPlanBuilder</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8244">ARROW-8244</a> - [Python][Parquet] Add `write_to_dataset` option to populate the “file_path” metadata fields</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8246">ARROW-8246</a> - [C++] Add -Wa,-mbig-obj when compiling with MinGW to avoid linking errors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8247">ARROW-8247</a> - [Python] Expose Parquet writing “engine” setting in pyarrow.parquet.write_table</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8249">ARROW-8249</a> - [Rust] [DataFusion] Make Table and LogicalPlanBuilder APIs more consistent</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8252">ARROW-8252</a> - [CI][Ruby] Add Ubuntu 20.04</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8256">ARROW-8256</a> - [Rust] [DataFusion] Update CLI documentation for 0.17.0 release</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8264">ARROW-8264</a> - [Rust] [DataFusion] Create utility for printing record batches</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8266">ARROW-8266</a> - [C++] Add backup mirrors for external project source downloads</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8267">ARROW-8267</a> - [CI][GLib] Failed to build on Ubuntu 16.04</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8271">ARROW-8271</a> - [Packaging] Allow wheel upload failures to gemfury</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8275">ARROW-8275</a> - [Python][Docs] Review Feather + IPC file documentation per “Feather V2” changes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8277">ARROW-8277</a> - [Python] RecordBatch interface improvements</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8279">ARROW-8279</a> - [C++] Do not export symbols from Codec implementations, remove need for PIMPL pattern</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8288">ARROW-8288</a> - [Python] Expose with_ modifiers on DataType</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8290">ARROW-8290</a> - [Python][Dataset] Improve ergonomy of the FileSystemDataset constructor</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8291">ARROW-8291</a> - [Packaging] Conda nightly builds can’t locate Numpy</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8292">ARROW-8292</a> - [Python][Dataset] Passthrough schema to Factory.finish() in dataset() function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8294">ARROW-8294</a> - [Format][Flight] Add DoExchange RPC to Flight protocol</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8295">ARROW-8295</a> - [C++][Dataset] IpcFileFormat should expliclity push down column projection</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8299">ARROW-8299</a> - [C++] Reusable “optional ParallelFor” function for optional use of multithreading</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8300">ARROW-8300</a> - [R] Documentation and changelog updates for 0.17</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8307">ARROW-8307</a> - [Python] Expose use_memory_map option in pyarrow.feather APIs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8308">ARROW-8308</a> - [Rust] [Flight] Implement DoExchange on examples</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8309">ARROW-8309</a> - [CI] C++/Java/Rust workflows should trigger on changes to Flight.proto</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8311">ARROW-8311</a> - [C++] Add push style stream format reader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8316">ARROW-8316</a> - [CI] Set docker-compose to use docker-cli instead of docker-py for building images</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8319">ARROW-8319</a> - [CI] Install thrift compiler in the debian build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8320">ARROW-8320</a> - [Documentation][Format] Clarify (lack of) alignment requirements in C data interface</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8321">ARROW-8321</a> - [CI] Use bundled thrift in Fedora 30 build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8322">ARROW-8322</a> - [CI] Fix C# workflow file syntax</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8325">ARROW-8325</a> - [R][CI] Stop including boost in R windows bundle</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8329">ARROW-8329</a> - [Documentation][C++] Undocumented FilterOptions argument in Filter kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8330">ARROW-8330</a> - [Documentation] The post release script generates the documentation with a development version</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8332">ARROW-8332</a> - [C++] Require Thrift compiler to use system libthrift for Parquet build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8335">ARROW-8335</a> - [Release] Add crossbow jobs to run release verification</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8336">ARROW-8336</a> - [Packaging][deb] Use libthrift-dev on Debian 10 and Ubuntu 19.10 or later</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8341">ARROW-8341</a> - [Packaging][deb] Fail to build by no disk space</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8343">ARROW-8343</a> - [GLib] Add GArrowRecordBatchIterator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8347">ARROW-8347</a> - [C++] Add Result<T> APIs to Array methods</T></li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8351">ARROW-8351</a> - [R][CI] Store the Rtools-built Arrow C++ library as a build artifact</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8352">ARROW-8352</a> - [R] Add install_pyarrow()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8356">ARROW-8356</a> - [Developer] Support * wildcards with “crossbow submit” via GitHub actions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8361">ARROW-8361</a> - [C++] Add Result<T> APIs to Buffer methods and functions</T></li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8362">ARROW-8362</a> - [Crossbow] Ensure that the locally generated version is used in the docker tasks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8367">ARROW-8367</a> - [C++] Deprecate Buffer::FromString(…, pool)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8368">ARROW-8368</a> - [Format] In C interface, clarify resource management for consumers needing only a subset of child fields in ArrowArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8370">ARROW-8370</a> - [C++] Add Result<T> to type / schema APIs</T></li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8371">ARROW-8371</a> - [Crossbow] Implement and exercise sanity checks for tasks.yml</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8372">ARROW-8372</a> - [C++] Add Result<T> to table / record batch APIs</T></li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8375">ARROW-8375</a> - [CI][R] Make Windows tests more verbose in case of segfault</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8376">ARROW-8376</a> - [R] Add experimental interface to ScanTask/RecordBatch iterators</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8387">ARROW-8387</a> - [Rust] Make schema_to_fb public</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8389">ARROW-8389</a> - [Integration] Run tests in parallel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8390">ARROW-8390</a> - [R] Expose schema unification features</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8393">ARROW-8393</a> - [C++][Gandiva] Make gandiva function registry case-insensitive</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8396">ARROW-8396</a> - [Rust] Remove libc from dependencies</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8398">ARROW-8398</a> - [Python] Remove deprecation warnings originating from python tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8401">ARROW-8401</a> - [C++] Add AVX2/AVX512 version of ByteStreamSplitDecode/ByteStreamSplitEncode</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8403">ARROW-8403</a> - [C++] Add ToString() to ChunkedArray, Table and RecordBatch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8407">ARROW-8407</a> - [Rust] Add rustdoc for Dictionary type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8408">ARROW-8408</a> - [Python] Add memory_map= toggle to pyarrow.feather.read_feather</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8409">ARROW-8409</a> - [R] Add arrow::cpu_count, arrow::set_cpu_count wrapper functions a la Python</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8412">ARROW-8412</a> - [C++][Gandiva] Fix gandiva date_diff function definitions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8433">ARROW-8433</a> - [R] Add feather alias for ipc format in dataset API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8443">ARROW-8443</a> - [Gandiva][C++] Fix round/truncate to no-op for special cases</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8444">ARROW-8444</a> - [Documentation] Fix spelling errors across the codebase</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8449">ARROW-8449</a> - [R] Use CMAKE_UNITY_BUILD everywhere</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8450">ARROW-8450</a> - [Integration][C++] Implement large list/binary/utf8 integration</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8457">ARROW-8457</a> - [C++] bridge test does not take care of endianness</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8458">ARROW-8458</a> - [C++] Prefer the original mirrors for the bundled thirdparty dependencies</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8461">ARROW-8461</a> - [Packaging][deb] Use zstd package for Ubuntu Xenial</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8463">ARROW-8463</a> - [CI] Balance the nightly test builds between CircleCI, Azure and Github</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8467">ARROW-8467</a> - [C++] Test cases using ArrayFromJSON assume only a little-endian platform</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8477">ARROW-8477</a> - [C++] Enable reading and writing of long filenames for Windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8496">ARROW-8496</a> - [C++] Refine ByteStreamSplitDecodeScalar</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8499">ARROW-8499</a> - [C++][Dataset] In ScannerBuilder, batch_size will not work if projecter is not empty</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8506">ARROW-8506</a> - [c++] Miss tests to verify expected_buffer with bit_width &gt; 8 in RLE</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8515">ARROW-8515</a> - [C++] Bitmap ToString should have an option of grouping by bytes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8517">ARROW-8517</a> - [Developer][Release] Update Crossbow RC verification setup for changes since 0.16.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8523">ARROW-8523</a> - [C++] Optimize BitmapReader</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1663">PARQUET-1663</a> - [C++] Provide API to check the presence of complex data types</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1716">PARQUET-1716</a> - [C++] Add support for BYTE_STREAM_SPLIT encoding</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1770">PARQUET-1770</a> - [C++][CI] Add fuzz target for reading Parquet files</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1785">PARQUET-1785</a> - [C++] Improve code reusability in encoding-test.cc</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1786">PARQUET-1786</a> - [C++] Use simd to improve BYTE_STREAM_SPLIT decoding performance</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1806">PARQUET-1806</a> - [C++] [CI] Improve fuzzing seed corpus</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1825">PARQUET-1825</a> - [C++] Fix compilation error in column_io_benchmark.cc</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1828">PARQUET-1828</a> - [C++] Add a SSE2 path for the ByteStreamSplit encoder implementation</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1840">PARQUET-1840</a> - [C++] DecodeSpaced copies more values then necessary</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-458">PARQUET-458</a> - [C++] Implement support for DataPageV2</li>
</ul>
<h3 id="bug-fixes">Bug Fixes</h3>
<ul>
<li><a href="https://issues.apache.org/jira/browse/ARROW-1907">ARROW-1907</a> - [C++/Python] Feather format cannot accommodate string columns containing more than a total of 2GB of data</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-2255">ARROW-2255</a> - [Developer][Integration] Serialize schema- and field-level custom metadata in integration test JSON format</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-2587">ARROW-2587</a> - [Python] Unable to write StructArrays with multiple children to parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3004">ARROW-3004</a> - [Documentation] Builds docs for master rather than a pinned commit</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3543">ARROW-3543</a> - [R] Better support for timestamp format and time zones in R</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5265">ARROW-5265</a> - [Python/CI] Add integration test with kartothek</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5473">ARROW-5473</a> - [C++] Build failure on googletest_ep on Windows when using Ninja</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5981">ARROW-5981</a> - [C++] DictionaryBuilder<T> initialization with Array can fail silently</T></li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6528">ARROW-6528</a> - [C++] Spurious Flight test failures (port allocation failure)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6738">ARROW-6738</a> - [Java] Fix problems with current union comparison logic</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6757">ARROW-6757</a> - [Python] Creating csv.ParseOptions() causes “Windows fatal exception: access violation” with Visual Studio 2017</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6871">ARROW-6871</a> - [Java] Enhance TransferPair related parameters check and tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6872">ARROW-6872</a> - [C++][Python] Empty table with dictionary-columns raises ArrowNotImplementedError</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6895">ARROW-6895</a> - [C++][Parquet] parquet::arrow::ColumnReader: ByteArrayDictionaryRecordReader repeats returned values when calling `NextBatch()`</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7008">ARROW-7008</a> - [Python] pyarrow.chunked_array([array]) fails on array with all-None buffers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7049">ARROW-7049</a> - [C++] warnings building on mingw-w64</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7301">ARROW-7301</a> - [Java] Sql type DATE should correspond to DateDayVector</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7335">ARROW-7335</a> - [C++][Gandiva] Add castBIGINT, extractDay interval_day functions in Gandiva</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7390">ARROW-7390</a> - [C++][Dataset] Concurrency race in Projector::Project</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7405">ARROW-7405</a> - [Java] ListVector isEmpty API is incorrect</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7466">ARROW-7466</a> - [CI][Java] Fix gandiva-jar-osx nightly build failure</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7467">ARROW-7467</a> - [Java] ComplexCopier does incorrect copy for Map nullable info</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7520">ARROW-7520</a> - [R] Writing many batches causes a crash</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7546">ARROW-7546</a> - [Java] Use new implementation to concat vectors values in batch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7624">ARROW-7624</a> - [Rust] Soundness issues via `Buffer` methods</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7628">ARROW-7628</a> - [Python] Better document some read_csv corner cases</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7631">ARROW-7631</a> - [C++][Gandiva] return zero if there is an overflow while converting a decimal to a lower precision/scale</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7680">ARROW-7680</a> - [C++][Dataset] Partition discovery is not working with windows path</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7701">ARROW-7701</a> - [C++] [CI] Flight test error on macOS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7713">ARROW-7713</a> - [Java] TastLeak was put at the wrong location</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7722">ARROW-7722</a> - [Java][FlightRPC] Memory leak</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7734">ARROW-7734</a> - [C++] Segfault when comparing status with and without detail</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7740">ARROW-7740</a> - [C++] Array internals corruption in StructArray::Flatten</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7755">ARROW-7755</a> - [Python] Windows wheel cannot be installed on Python 3.8</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7758">ARROW-7758</a> - [Python] Wrong conversion of timestamps that are out of bounds for pandas (eg 0000-01-01)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7760">ARROW-7760</a> - [Release] Fix verify-release-candidate.sh since pip3 seems to no longer be in miniconda</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7762">ARROW-7762</a> - [Python] Exceptions in ParquetWriter get ignored</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7766">ARROW-7766</a> - [Python][Packaging] Windows py38 wheels are built with wrong ABI tag</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7772">ARROW-7772</a> - [R][C++][Dataset] Unable to filter on date32 object with date64 scalar</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7775">ARROW-7775</a> - [Rust] Don’t let safe code arbitrarily transmute readers and writers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7777">ARROW-7777</a> - [Go] StructBuilder/ListBuilder index out of range panic</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7780">ARROW-7780</a> - [Release] Fix Windows wheel RC verification script given lack of “m” ABI tag in Python 3.8</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7781">ARROW-7781</a> - [C++][Dataset] Filtering on a non-existent column gives a segfault</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7783">ARROW-7783</a> - [C++] ARROW_DATASET should enable ARROW_COMPUTE</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7785">ARROW-7785</a> - [C++] sparse_tensor.cc is extremely slow to compile</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7786">ARROW-7786</a> - [R] Wire up check_metadata in Table.Equals method</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7789">ARROW-7789</a> - [R] Can’t initialize arrow objects when R.oo package is loaded</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7791">ARROW-7791</a> - [C++][Parquet] Fix building error “cannot bind lvalue”</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7792">ARROW-7792</a> - [R] read_* functions should close connection to file</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7793">ARROW-7793</a> - [Java] If there is a leak the base allocator should release the excess memory to parent before throwing exception</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7794">ARROW-7794</a> - [Rust] cargo publish fails for arrow-flight due to relative path to Flight.proto</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7797">ARROW-7797</a> - [Release][Rust] Fix arrow-flight’s version in datafusion crate</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7802">ARROW-7802</a> - [C++] Support for LargeBinary and LargeString in the hash kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7806">ARROW-7806</a> - [Python] Implement to_pandas for lists of LargeBinary/String</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7807">ARROW-7807</a> - [R] Installation on RHEL 7 Cannot call io___MemoryMappedFile__Open()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7809">ARROW-7809</a> - [R] vignette does not run on Win 10 nor ubuntu</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7813">ARROW-7813</a> - [Rust] Fix undefined behaviour and and remove unsafe</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7815">ARROW-7815</a> - [C++] Fix crashes on corrupt IPC input (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7827">ARROW-7827</a> - [Python] conda-forge pyarrow package does not have s3 enabled</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7832">ARROW-7832</a> - [R] Patches to 0.16.0 release</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7836">ARROW-7836</a> - [Rust] “allocate_aligned”/”reallocate” need to initialize memory to avoid UB</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7837">ARROW-7837</a> - [Java] bug in BaseVariableWidthVector.copyFromSafe results with an index out of bounds exception</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7838">ARROW-7838</a> - [C++] Installed plasma-store-server fails finding Boost</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7841">ARROW-7841</a> - [C++] HADOOP_HOME doesn’t work to find libhdfs.so</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7844">ARROW-7844</a> - [R] array_to_vector is not thread safe</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7848">ARROW-7848</a> - Add doc for MapType</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7852">ARROW-7852</a> - [Python] 0.16.0 wheels not compatible with older numpy</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7857">ARROW-7857</a> - [Python] Failing test with pandas master for extension type conversion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7861">ARROW-7861</a> - [C++][Parquet] Add fuzz regression corpus for parquet reader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7884">ARROW-7884</a> - [C++][Python] Crash in pq.read_table()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7887">ARROW-7887</a> - [Rust] Filter kernel does not support temporal types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7889">ARROW-7889</a> - [Rust] Datafusion CLI does not support registering Parquet files</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7899">ARROW-7899</a> - [Integration][Java] null type integration test</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7908">ARROW-7908</a> - [R] Can’t install package without setting LIBARROW_DOWNLOAD=true</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7922">ARROW-7922</a> - [CI][Crossbow] Nightly macOS wheel builds fail (brew bundle edition)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7923">ARROW-7923</a> - [CI][Crossbow] macOS autobrew fails on homebrew-versions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7926">ARROW-7926</a> - [Developer] “archery lint” target is not ergonomic for running a single check like IWYU</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7928">ARROW-7928</a> - [Python] Example of flight server and client not working</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7931">ARROW-7931</a> - [C++] Fix crash on corrupt Map array input (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7936">ARROW-7936</a> - [Python] FileSystem.from_uri test fails on python 3.5</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7940">ARROW-7940</a> - [C++] Unable to generate cmake build with settings other than default</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7958">ARROW-7958</a> - [Java] Update Avro to version 1.9.2</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7962">ARROW-7962</a> - [R][Dataset] Followup to “Consolidate Source and Dataset classes”</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7968">ARROW-7968</a> - [C++] orc_ep build fails on 64-bit Raspbian</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7973">ARROW-7973</a> - [Developer][C++] ResourceWarnings in run_cpplint.py</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7974">ARROW-7974</a> - [Developer][C++] ResourceWarning in “make check-format”</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7975">ARROW-7975</a> - [C++] Do not include padding bytes in “Buffer” IPC metadata accounting</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7978">ARROW-7978</a> - [Developer] GitHub Actions “lint” task is running include-what-you-use and failing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7980">ARROW-7980</a> - [Python] Deserialization with pyarrow fails for certain Timestamp-based data frame</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7981">ARROW-7981</a> - [C++][Dataset] Fails to compile on gcc 5.4</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7985">ARROW-7985</a> - [C++] ListBuilder.Finish fails if underlying value builder is empty and .Reserve’d</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7990">ARROW-7990</a> - [C++][Developer] Add “archery lint” option for running “iwyu.sh all”</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7992">ARROW-7992</a> - [C++] MSVC warning causing Appveyor failure in sort_to_indices.cc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7996">ARROW-7996</a> - [Python] Error serializing empty pandas DataFrame with pyarrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7997">ARROW-7997</a> - [Python] Schema equals method with inconsistent docs in pyarrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7999">ARROW-7999</a> - [C++] Fix crash on corrupt Map array input (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8000">ARROW-8000</a> - [C++] gcc 4.8 build failures</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8003">ARROW-8003</a> - [C++] -DBZip2_SOURCE=BUNDLED fails when building with clang</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8006">ARROW-8006</a> - [C++] Unsafe arrow dictionary recovered from parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8007">ARROW-8007</a> - [Python] Remove unused and defunct assert_get_object_equal in plasma tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8008">ARROW-8008</a> - [C++/Python] Framework Python is preferred even though not the activated one</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8009">ARROW-8009</a> - [Java] Fix the hash code methods for BitVector</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8011">ARROW-8011</a> - [C++] Some buffers not resized when reading from Parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8013">ARROW-8013</a> - [Python][Packaging] Fix manylinux wheels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8021">ARROW-8021</a> - [Python] Appveyor does not appear to be including pandas in test runs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8036">ARROW-8036</a> - [C++] Compilation failure with gtest 1.10.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8042">ARROW-8042</a> - [Python] pyarrow.ChunkedArray docstring is incorrect regarding zero-length ChunkedArray having no chunks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8057">ARROW-8057</a> - [Python] Don’t check Schema metadata in __eq__ and __ne__</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8070">ARROW-8070</a> - [C++] Cast segfaults on unsupported cast from list<binary> to utf8</binary></li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8071">ARROW-8071</a> - [GLib] Build error with configure</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8088">ARROW-8088</a> - [C++][Dataset] Partition columns with specified dictionary type result in all nulls</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8091">ARROW-8091</a> - [CI][Crossbow] Fix nightly homebrew and R failures</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8092">ARROW-8092</a> - [CI][Crossbow] OSX wheels fail on bundled bzip2</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8094">ARROW-8094</a> - [CI][Crossbow] Nightly valgrind test fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8095">ARROW-8095</a> - [CI][Crossbow] Nightly turbodbc job fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8098">ARROW-8098</a> - [go] Checkptr Failures on Go 1.14</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8101">ARROW-8101</a> - [FlightRPC][Java] Can’t read/write only an empty null array</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8102">ARROW-8102</a> - [Dev] Crossbow’s version detection doesn’t work in the comment bot’s scenario</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8105">ARROW-8105</a> - [Python] pyarrow.array segfaults when passed masked array with shrunken mask</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8106">ARROW-8106</a> - [Python] Builds on master broken by pandas 1.0.2 release</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8110">ARROW-8110</a> - [C#] BuildArrays fails if NestedType is included</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8112">ARROW-8112</a> - [FlightRPC][C++] Some status codes don’t round-trip through gRPC</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8119">ARROW-8119</a> - [Dev] Make Yaml optional dependency for archery</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8122">ARROW-8122</a> - [Python] Empty numpy arrays with shape cannot be deserialized</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8125">ARROW-8125</a> - [C++] “arrow-tests” target broken with ninja build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8127">ARROW-8127</a> - [C++] [Parquet] Incorrect column chunk metadata for multipage batch writes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8128">ARROW-8128</a> - [C#] NestedType children serialized on wrong length</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8132">ARROW-8132</a> - [C++] arrow-s3fs-test failing on master</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8133">ARROW-8133</a> - [CI] Github Actions sometimes fail to checkout Arrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8136">ARROW-8136</a> - [C++][Python] Creating dataset from relative path no longer working</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8139">ARROW-8139</a> - [C++] FileSystem enum causes attributes warning</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8142">ARROW-8142</a> - [C++] Casting a chunked array with 0 chunks critical failure</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8144">ARROW-8144</a> - [CI] Cmake 3.2 nightly build fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8159">ARROW-8159</a> - [Python] pyarrow.Schema.from_pandas doesn’t support ExtensionDtype</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8166">ARROW-8166</a> - [C++] AVX512 intrinsics fail to compile with clang-8 on Ubuntu 18.04</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8176">ARROW-8176</a> - [FlightRPC][Integration] Have Flight services bind to port 0 in integration</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8186">ARROW-8186</a> - [Python] Dataset expression != returns bool instead of expression for invalid value</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8188">ARROW-8188</a> - [R] Adapt to latest checks in R-devel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8193">ARROW-8193</a> - [C++] arrow-future-test fails to compile on gcc 4.8</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8197">ARROW-8197</a> - [Rust] DataFusion “create_physical_plan” returns incorrect schema?</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8206">ARROW-8206</a> - [R] Minor fix for backwards compatibility on Linux installation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8209">ARROW-8209</a> - [Python] Accessing duplicate column of Table by name gives wrong error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8213">ARROW-8213</a> - [Python][Dataset] Opening a dataset with a local incorrect path gives confusing error message</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8216">ARROW-8216</a> - [R][C++][Dataset] Filtering returns all-missing rows where the filtering column is missing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8217">ARROW-8217</a> - [R][C++] Fix crashing test in test-dataset.R on 32-bit Windows from ARROW-7979</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8219">ARROW-8219</a> - [Rust] sqlparser crate needs to be bumped to version 0.2.5</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8233">ARROW-8233</a> - [CI] Build timeouts on “AMD64 Windows MinGW 64 GLib &amp; Ruby “</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8237">ARROW-8237</a> - [Python] Review Developer build instructions for conda and non-conda users</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8238">ARROW-8238</a> - [C++][Compute] Failed to build compute tests on windows with msvc2015</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8239">ARROW-8239</a> - [Java] fix param checks in splitAndTransfer method</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8245">ARROW-8245</a> - [Python][Parquet] Skip hidden directories when reading partitioned parquet files</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8255">ARROW-8255</a> - [Rust] [DataFusion] COUNT(*) results in confusing error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8259">ARROW-8259</a> - [Rust] [DataFusion] ProjectionPushDownRule does not rewrite LIMIT</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8268">ARROW-8268</a> - [Ruby] Test failure due to lack of built ZSTD support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8269">ARROW-8269</a> - [Python] Failure in “nopandas” build in test_parquet_row_group_fragments</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8270">ARROW-8270</a> - [Python][Flight] Example Flight server with TLS’s certificate and key is not working</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8272">ARROW-8272</a> - [CI][Python] Test failure on Ubuntu 16.04</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8274">ARROW-8274</a> - [C++] Use LZ4 frame format for “LZ4” compression in IPC write</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8276">ARROW-8276</a> - [C++][Dataset] Scanning a Fragment does not take into account the partition columns</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8280">ARROW-8280</a> - [C++] MinGW builds failing due to CARES-related toolchain issue</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8286">ARROW-8286</a> - [Python] Creating dataset from pathlib results in UnionDataset instead of FileSystemDataset</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8298">ARROW-8298</a> - [C++][CI] MinGW builds fail building grpc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8303">ARROW-8303</a> - [Python] Fix test failure caused by non-deterministic dict key ordering on Python 3.5</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8304">ARROW-8304</a> - [Flight][Python] Flight client with TLS root certificate is reporting error on do_get()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8305">ARROW-8305</a> - [Java] ExtensionTypeVector should make sure underlyingVector not null</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8310">ARROW-8310</a> - [C++] Minio’s exceptions not recognized by IsConnectError()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8315">ARROW-8315</a> - [Python][Dataset] Don’t rely on ordered dict keys in test_dataset.py</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8323">ARROW-8323</a> - [C++] Pin gRPC at v1.27 to avoid compilation error in its headers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8326">ARROW-8326</a> - [C++] Don’t use deprecated TYPED_TEST_CASE</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8327">ARROW-8327</a> - [FlightRPC][Java] gRPC trailers may be null</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8331">ARROW-8331</a> - [C++] arrow-compute-filter-benchmark fails to compile</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8333">ARROW-8333</a> - [C++][CI] Always compile benchmarks in some C++ CI entry</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8334">ARROW-8334</a> - [C++] [Gandiva] Missing DATE32 in LLVM Types / Simple D32 Compute Functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8342">ARROW-8342</a> - [Python] dask and kartothek integration tests are failing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8345">ARROW-8345</a> - [Python] feather.read_table should not require pandas</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8346">ARROW-8346</a> - [CI][Ruby] GLib/Ruby macOS build fails on zlib</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8349">ARROW-8349</a> - [CI][NIGHTLY:gandiva-jar-osx] Use latest pygit2</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8353">ARROW-8353</a> - [C++] is_nullable maybe not initialized in parquet writer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8354">ARROW-8354</a> - [R] Fix segfault in Table to Array conversion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8357">ARROW-8357</a> - [Rust] [DataFusion] Dockerfile for CLI is missing format dir</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8358">ARROW-8358</a> - [C++] Fix -Wrange-loop-construct warnings in clang-11</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8360">ARROW-8360</a> - [C++][Gandiva] Fixes date32 support for date/time functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8365">ARROW-8365</a> - [C++] Error when writing files to S3 larger than 5 GB</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8366">ARROW-8366</a> - [Rust] Need to revert recent arrow-flight build change</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8369">ARROW-8369</a> - [CI] Fix crossbow wildcard groups</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8373">ARROW-8373</a> - [GLib] Problems resolving gobject-introspection, arrow in Meson builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8380">ARROW-8380</a> - [RUST] StringDictionaryBuilder not publicly exported from arrow::array</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8384">ARROW-8384</a> - [C++][Python] arrow/filesystem/hdfs.h and Python wrapper does not have an option for setting a path to a Kerberos ticket</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8386">ARROW-8386</a> - [Python] pyarrow.jvm raises error for empty Arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8388">ARROW-8388</a> - [C++] GCC 4.8 fails to move on return</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8397">ARROW-8397</a> - [C++] Fail to compile aggregate_test.cc on Ubuntu 16.04</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8406">ARROW-8406</a> - [Python] test_fs fails when run from a different drive on Windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8410">ARROW-8410</a> - [C++] CMake fails on aarch64 systems that do not support -march=armv8-a+crc+crypto</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8414">ARROW-8414</a> - [Python] Non-deterministic row order failure in test_parquet.py</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8415">ARROW-8415</a> - [C++][Packaging] fix gandiva linux job</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8416">ARROW-8416</a> - [Python] Provide a “feather” alias in the dataset API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8420">ARROW-8420</a> - [C++] CMake fails to configure on armv7l platform (e.g. Raspberry Pi 3)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8427">ARROW-8427</a> - [C++][Dataset] Do not ignore file paths with underscore/dot when full path was specified</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8428">ARROW-8428</a> - [C++][NIGHTLY:gandiva-jar-trusty] GCC 4.8 failures in C++ unit tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8429">ARROW-8429</a> - [C++] Fix Buffer::CopySlice on 0-sized buffer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8432">ARROW-8432</a> - [Python][CI] Failure to download Hadoop</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8437">ARROW-8437</a> - [C++] Remove std::move return value from MakeRandomNullBitmap test utility</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8438">ARROW-8438</a> - [C++] arrow-io-memory-benchmark crashes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8439">ARROW-8439</a> - [Python] Filesystem docs are outdated</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8441">ARROW-8441</a> - [C++] Fix crashes on invalid input (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8442">ARROW-8442</a> - [Python] NullType.to_pandas_dtype inconsisent with dtype returned in to_pandas/to_numpy</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8460">ARROW-8460</a> - [Packaging][deb] Ubuntu Focal build is failed</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8465">ARROW-8465</a> - [Packaging][Python] Windows py35 wheel build fails because of boost</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8466">ARROW-8466</a> - [Packaging] The python unittests are not running in the windows wheel builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8468">ARROW-8468</a> - [Document] Fix the incorrect null bits description</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8469">ARROW-8469</a> - [Dev] Fix nightly docker tests on azure</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8478">ARROW-8478</a> - [Java] Rollback contrib package changes.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8498">ARROW-8498</a> - [Python] Schema.from_pandas fails on extension type, while Table.from_pandas works</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8511">ARROW-8511</a> - [Developer][Release] Windows release verification script does not halt if C++ compilation fails</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1780">PARQUET-1780</a> - [C++] Set ColumnMetadata.encoding_stats field</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1788">PARQUET-1788</a> - [C++] ColumnWriter has undefined behavior when writing arrow chunks</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1797">PARQUET-1797</a> - [C++] Fix fuzzing errors</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1799">PARQUET-1799</a> - [C++] Stream API: Relax schema checking when reading</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1810">PARQUET-1810</a> - [C++] Fix undefined behaviour on invalid enum values (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1813">PARQUET-1813</a> - [C++] Remove logging statement in unit test</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1819">PARQUET-1819</a> - [C++] Fix crashes on corrupt IPC input (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1823">PARQUET-1823</a> - [C++] Invalid RowGroup returned when reading with parquet::arrow::FileReader-&gt;RowGroup(i)-&gt;Column(j)</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1829">PARQUET-1829</a> - [C++] Fix crashes on invalid input (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1831">PARQUET-1831</a> - [C++] Fix crashes on invalid input (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1835">PARQUET-1835</a> - [C++] Fix crashes on invalid input (OSS-Fuzz)</li>
</ul>
</main>
<hr/>
<footer class="footer">
<div class="row">
<div class="col-md-9">
<p>Apache Arrow, Arrow, Apache, the Apache feather logo, and the Apache Arrow project logo are either registered trademarks or trademarks of The Apache Software Foundation in the United States and other countries.</p>
<p>&copy; 2016-2024 The Apache Software Foundation</p>
</div>
<div class="col-md-3">
<a class="d-sm-none d-md-inline pr-2" href="https://www.apache.org/events/current-event.html">
<img src="https://www.apache.org/events/current-event-234x60.png"/>
</a>
</div>
</div>
</footer>
</div>
</body>
</html>