blob: 6fd4e7cdebf8685cfd2513e1151aa173f71c0d2f [file] [log] [blame]
<!DOCTYPE html>
<html lang="en-US">
<head>
<meta charset="UTF-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1">
<!-- The above meta tags *must* come first in the head; any other head content must come *after* these tags -->
<title>Apache Arrow 1.0.0 Release | Apache Arrow</title>
<!-- Begin Jekyll SEO tag v2.8.0 -->
<meta name="generator" content="Jekyll v4.3.3" />
<meta property="og:title" content="Apache Arrow 1.0.0 Release" />
<meta property="og:locale" content="en_US" />
<meta name="description" content="Apache Arrow 1.0.0 (24 July 2020) This is a major release covering more than 3 months of development. Download Source Artifacts Binary Artifacts For CentOS For Debian For Python For Ubuntu Git tag Contributors This release includes 771 commits from 100 distinct contributors. $ git shortlog -sn apache-arrow-0.17.0..apache-arrow-1.0.0 125 Wes McKinney 72 Neal Richardson 61 Antoine Pitrou 56 Sutou Kouhei 55 Krisztián Szűcs 30 Joris Van den Bossche 24 Benjamin Kietzman 23 David Li 21 Kazuaki Ishizaki 19 Qingping Hou 18 Andy Grove 18 François Saint-Jacques 17 Neville Dipale 16 Yibo Cai 16 liyafan82 15 Ryan Murray 14 Micah Kornfield 10 Uwe L. Korn 9 Paddy Horan 8 Jorge C. Leitao 8 Projjal Chanda 7 Frank Du 7 Romain Francois 6 Kenta Murata 6 Maarten A. Breddels 6 Max Burke 5 Mahmut Bulut 4 Mark Hildreth 4 Yuan Zhou 3 Adam Szmigin 3 Raphael Taylor-Davies 3 Zhuo Peng 3 rdettai 2 Bobby Wagner 2 Eric Erhardt 2 Prudhvi Porandla 2 R. Tyler Croy 2 Sagnik Chakraborty 2 Vamsi 2 Yordan Pavlov 2 c-jamie 2 dota17 2 mayuropensource 2 tianchen 2 tianchen92 1 Alex Baden 1 Ben Kimock 1 Bryan Cutler 1 Chad Dougherty 1 Christian Hudon 1 Cuong Nguyen 1 David Anthoff 1 Dmitry Kalinkin 1 Drazen Urch 1 Erin Ryan 1 Eyal Leshem 1 Frank Smith 1 German I Ramirez-Espinoza 1 Hongze Zhang 1 Ilija Puaca 1 Itamar Turner-Trauring 1 Jacek Pliszka 1 Jens Glaser 1 Ji Liu 1 Jörn Horstmann 1 Kyle Brandt 1 Kyle McCarthy 1 Laurent Goujon 1 Li, Jiajia 1 Maarten Breddels 1 Masaki Kozuki 1 Mohamed Zenadi 1 Nick Poorman 1 Pand9 1 Paul Dix 1 Paul Taylor 1 Praveen 1 Robert Borkowski 1 Rok 1 Sergey Fedoseev 1 Shuai Zhang 1 Simon Watts 1 Stéphane Campinas 1 Suvayu Ali 1 Sven Wagner-Boysen 1 TP Boudreau 1 Takuya Kato 1 Tobias Mayer 1 Wakahisa 1 Weston Steimel 1 Yuqi Gu 1 Zachary Gramana 1 Ze&#39;ev Maor 1 alamb 1 alimcmaster1 1 amol 1 chenguoping 1 ritchie46 1 rvernica 1 xuyiming Patch Committers The following Apache committers merged contributed patches to the repository. $ git shortlog -csn apache-arrow-0.17.0..apache-arrow-1.0.0 231 Wes McKinney 85 Antoine Pitrou 74 Krisztián Szűcs 68 Neal Richardson 67 Sutou Kouhei 44 François Saint-Jacques 35 Andy Grove 34 Neville Dipale 21 Benjamin Kietzman 17 GitHub 17 Micah Kornfield 12 Chao Sun 12 Paddy Horan 8 David Li 8 Praveen 7 Joris Van den Bossche 5 Bryan Cutler 5 Eric Erhardt 5 Uwe L. Korn 4 Yosuke Shiro 3 Jacques Nadeau 3 Pindikura Ravindra 3 Sebastien Binet 2 liyafan82 1 Kenta Murata Changelog Apache Arrow 1.0.0 (2020-07-24) Bug Fixes ARROW-1692 - [Python, Java] UnionArray round trip not working ARROW-3329 - [Python] Error casting decimal(38, 4) to int64 ARROW-3861 - [Python] ParquetDataset().read columns argument always returns partition column ARROW-4018 - [C++] RLE decoder may not big-endian compatible ARROW-4309 - [Documentation] Add a docker-compose entry which builds the documentation with CUDA enabled ARROW-4600 - [Ruby] Arrow::DictionaryArray#[] should returns the item in the indices array ARROW-5158 - [Packaging][Wheel] Symlink libraries in wheels ARROW-5310 - [Python] better error message on creating ParquetDataset from empty directory ARROW-5359 - [Python] timestamp_as_object support for pa.Table.to_pandas in pyarrow ARROW-5572 - [Python] raise error message when passing invalid filter in parquet reading ARROW-5666 - [Python] Underscores in partition (string) values are dropped when reading dataset ARROW-5744 - [C++] Do not error in Table::CombineChunks for BinaryArray types that overflow 2GB limit ARROW-5875 - [FlightRPC] Test RPC features in integration tests ARROW-6235 - [R] Conversion from arrow::BinaryArray to R character vector not implemented ARROW-6523 - [C++][Dataset] arrow_dataset target does not depend on anything ARROW-6848 - [C++] Specify -std=c++11 instead of -std=gnu++11 when building ARROW-7018 - [R] Non-UTF-8 data in Arrow &lt;–&gt; R conversion ARROW-7028 - [R] Date roundtrip results in different R storage mode ARROW-7084 - [C++] ArrayRangeEquals should check for full type equality? ARROW-7173 - [Integration] Add test to verify Map field names can be arbitrary ARROW-7208 - [Python] Passing directory to ParquetFile class gives confusing error message ARROW-7273 - [Python] Non-nullable null field is allowed / crashes when writing to parquet ARROW-7480 - [Rust] [DataFusion] Query fails/incorrect when aggregated + grouped columns don’t match the selected columns ARROW-7610 - [Java] Finish support for 64 bit int allocations ARROW-7654 - [Python] Ability to set column_types to a Schema in csv.ConvertOptions is undocumented ARROW-7681 - [Rust] Explicitly seeking a BufReader will discard the internal buffer ARROW-7702 - [C++][Dataset] Provide (optional) deterministic order of batches ARROW-7782 - [Python] Losing index information when using write_to_dataset with partition_cols ARROW-7840 - [Java] [Integration] Java executables fail ARROW-7925 - [C++][Documentation] Instructions about running IWYU and other tasks in cpp/development.rst have gone stale ARROW-7939 - [Python] crashes when reading parquet file compressed with snappy ARROW-7967 - [CI][Crossbow] Pin macOS version in autobrew job to match CRAN ARROW-8050 - [Python][Packaging] Do not include generated Cython source files in wheel packages ARROW-8078 - [Python] Missing links in the docs regarding field and schema DataTypes ARROW-8115 - [Python] Conversion when mixing NaT and datetime objects not working ARROW-8251 - [Python] pandas.ExtensionDtype does not survive round trip with write_to_dataset ARROW-8344 - [C#] StringArray.Builder.Clear() corrupts subsequently-built array contents ARROW-8360 - [C++][Gandiva] Fixes date32 support for date/time functions ARROW-8374 - [R] Table to vector of DictonaryType will error when Arrays don’t have the same Dictionary per array ARROW-8392 - [Java] Fix overflow related corner cases for vector value comparison ARROW-8448 - [Package] Can’t build apt packages with ubuntu-focal ARROW-8455 - [Rust] [Parquet] Arrow column read on partially compatible files ARROW-8455 - [Rust] [Parquet] Arrow column read on partially compatible files ARROW-8471 - [C++][Integration] Regression to /u?int64/ as JSON::number ARROW-8472 - [Go][Integration] Represent 64 bit integers as JSON::string ARROW-8473 - [Rust] “Statistics support” in rust/parquet readme is incorrect ARROW-8480 - [Rust] There is no check for allocation failure ARROW-8503 - [Packaging][deb] Can’t build apache-arrow-archive-keyring for RC ARROW-8505 - [Release][C#] “sourcelink test” is failed by Apache.Arrow.AssemblyInfo.cs ARROW-8508 - [Rust] ListBuilder of FixedSizeListBuilder creates wrong offsets ARROW-8510 - [C++] arrow/dataset/file_base.cc fails to compile with internal compiler error with “Visual Studio 15 2017 Win64” generator ARROW-8511 - [Developer][Release] Windows release verification script does not halt if C++ compilation fails ARROW-8514 - [Developer] Windows wheel verification script does not check Python 3.5 ARROW-8529 - [C++] Fix usage of NextCounts() in GetBatchWithDict[Spaced] ARROW-8535 - [Rust] Arrow crate does not specify arrow-flight version ARROW-8536 - [Rust] Failed to locate format/Flight.proto in any parent directory ARROW-8537 - [C++] Performance regression from ARROW-8523 ARROW-8539 - [CI] “AMD64 MacOS 10.15 GLib &amp; Ruby” fails ARROW-8554 - [C++][Benchmark] Fix building error “cannot bind lvalue” ARROW-8556 - [R] zstd symbol not found if there are multiple installations of zstd ARROW-8566 - [R] error when writing POSIXct to spark ARROW-8568 - [C++][Python] Crash on decimal cast in debug mode ARROW-8577 - [Plasma] PlasmaClient::Connect() of CUDA enabled build is always failed on no CUDA device machine ARROW-8583 - [C++][Doc] Undocumented parameter in Dataset namespace ARROW-8584 - [Packaging][C++] Protobuf link error in deb builds ARROW-8585 - [Packaging][Python] Windows wheels fail to build because of link error ARROW-8586 - [R] installation failure on CentOS 7 ARROW-8587 - [C++] Compilation error when linking arrow-flight-perf-server ARROW-8592 - [C++] Docs still list LLVM 7 as compiler used ARROW-8593 - [C++] Parquet file_serialize_test.cc fails to build with musl libc ARROW-8598 - [Rust] simd_compare_op creates buffer of incorrect length when item count is not a multiple of T::lanes() ARROW-8602 - [CMake] Fix ws2_32 link issue when cross-compiling on Linux ARROW-8603 - [Documentation] Fix Sphinx doxygen comment ARROW-8604 - [R][CI] Update CI to use R 4.0 ARROW-8608 - [C++] Update vendored mpark/variant.h to latest to fix NVCC compilation issues ARROW-8609 - [C++] ORC JNI bridge crashed on null arrow buffer ARROW-8610 - [Rust] DivideByZero when running arrow crate when simd feature is disabled ARROW-8613 - [C++][Dataset] Raise error for unparsable partition value ARROW-8615 - [R] Error better and insist on RandomAccessFile in read_feather ARROW-8617 - [Rust] simd_load_set_invalid does not exist on aarch64 ARROW-8632 - [C++] Fix conversion error warning in array_union_test.cc ARROW-8641 - [Python] Regression in feather: no longer supports permutation in column selection ARROW-8643 - [Python] Tests with pandas master failing due to freq assertion ARROW-8644 - [Python] Dask integration tests failing due to change in not including partition columns ARROW-8646 - [Java] Allow UnionListWriter to write null values ARROW-8649 - [Java] [Website] Java documentation on website is hidden ARROW-8657 - [Python][C++][Parquet] Forward compatibility issue from 0.16 to 0.17 when using version=’2.0’ ARROW-8663 - [Documentation] Small correction to building.rst ARROW-8680 - [Rust] ComplexObjectArrayReader incorrect null value shuffling ARROW-8684 - [Python] “SystemError: Bad call flags in _PyMethodDef_RawFastCallDict” in Python 3.7.7 on macOS when using pyarrow wheel ARROW-8689 - [C++] S3 benchmarks fail linking ARROW-8693 - [Python] Dataset.get_fragments is missing an implicit cast when filtering ARROW-8694 - [Python][Parquet] parquet.read_schema() fails when loading wide table created from Pandas DataFrame ARROW-8701 - [Rust] Unresolved import `crate::compute::util::simd_load_set_invalid` on Raspberry Pi ARROW-8704 - [C++] Fix Parquet crash on invalid input (OSS-Fuzz) ARROW-8705 - [Java] ComplexCopier is skipping null values ARROW-8706 - [C++][Parquet] Tracking JIRA for PARQUET-1857 (unencrypted INT16_MAX Parquet row group limit) ARROW-8710 - [Rust] Continuation marker not written correctly in IPC writer, and stream not flushed ARROW-8722 - [Dev] “archery docker run -e” doesn’t work ARROW-8726 - [C++][Dataset] Mis-specified DirectoryPartitioning incorrectly uses the file name as value ARROW-8728 - [C++] Bitmap operation may cause buffer overflow ARROW-8729 - [C++][Dataset] Only selecting a partition column results in empty table ARROW-8734 - [R] improve nightly build installation ARROW-8741 - [Python][Packaging] Keep VS2015 with for the windows wheels ARROW-8750 - [Python] pyarrow.feather.write_feather does not default to lz4 compression if it’s available ARROW-8768 - [R][CI] Fix nightly as-cran spurious failure ARROW-8775 - [C++][FlightRPC] Integration client doesn’t run integration tests ARROW-8776 - [FlightRPC][C++] Flight/C++ middleware don’t receive headers on failed calls to Java servers ARROW-8798 - [C++] Fix Parquet crashes on invalid input (OSS-Fuzz) ARROW-8799 - [C++][Dataset] Reading list column as nested dictionary segfaults ARROW-8801 - [Python] Memory leak on read from parquet file with UTC timestamps using pandas ARROW-8802 - [C++][Dataset] Schema metadata are lost when reading a subset of columns ARROW-8803 - [Java] Row count should be set before loading buffers in VectorLoader ARROW-8808 - [Rust] Divide by zero in arrays/builder.rs ARROW-8809 - [Rust] schema mismatch in integration test ARROW-8811 - [Java] Fix build on master ARROW-8820 - [C++][Gandiva] fix date_trunc functions to return date types ARROW-8821 - [Rust] nested binary expression with Like, NotLike and Not operator results in type cast error ARROW-8825 - [C++] Cannot compiled pass with Wunused-parameter flag ARROW-8826 - [Crossbow] remote URL should always have .git ARROW-8832 - [Python] AttributeError: module ‘pyarrow.fs’ has no attribute ‘S3FileSystem’ ARROW-8848 - [CI][C/Glib] MinGW build error ARROW-8848 - [CI][C/Glib] MinGW build error ARROW-8858 - [FlightRPC] Ensure headers are uniformly exposed ARROW-8860 - [C++] IPC/Feather decompression broken for nested arrays ARROW-8862 - [C++] NumericBuilder does not use MemoryPool passed to CTOR ARROW-8863 - [C++] Array subclass constructors must set ArrayData::null_count to 0 when there is no validity bitmap ARROW-8869 - [Rust] [DataFusion] Type Coercion optimizer rule does not support new scan nodes ARROW-8871 - [C++] Gandiva build failure ARROW-8872 - [CI] Travis-CI jobs fail (can’t open file ‘ci/detect-changes.py’) ARROW-8874 - [C++][Dataset] Scanner::ToTable race when ScanTask exit early with an error ARROW-8878 - [R] try_download is confused when download.file.method isn’t default ARROW-8882 - [C#] Add .editorconfig to C# code ARROW-8888 - [Python] Heuristic in dataframe_to_arrays that decides to multithread convert cause slow conversions ARROW-8889 - [Python] Python 3.7 SIGSEGV when comparing RecordBatch to None ARROW-8892 - [C++][CI] CI builds for MSVC do not build benchmarks ARROW-8909 - [Java] Out of order writes using setSafe ARROW-8911 - [C++] Slicing a ChunkedArray with zero chunks segfaults ARROW-8924 - [C++][Gandiva] castDATE_date32() may cause overflow ARROW-8925 - [Rust] [DataFusion] CsvExec::schema() returns incorrect results ARROW-8930 - [C++] libz.so linking error with liborc.a ARROW-8932 - [C++] symbol resolution failures with liborc.a ARROW-8946 - [Python] Add tests for parquet.write_metadata metadata_collector ARROW-8948 - [Java][Integration] enable duplicate field names integration tests ARROW-8951 - [C++] Fix compiler warning in compute/kernels/scalar_cast_temporal.cc ARROW-8954 - [Website] ca-certificates should be listed in installation instructions ARROW-8957 - [FlightRPC][C++] Fail to build due to IpcOptions ARROW-8959 - [Rust] Broken build due to new benchmark crate using old API ARROW-8962 - [C++] Linking failure with clang-4.0 ARROW-8968 - [C++][Gandiva] Show link warning message on s390x ARROW-8975 - [FlightRPC][C++] Fix flaky MacOS tests ARROW-8977 - [R] Table$create with schema crashes with some dictionary index types ARROW-8978 - [C++][Compute] “Conditional jump or move depends on uninitialised value(s)” Valgrind warning ARROW-8980 - [Python] Metadata grows exponentially when using schema from disk ARROW-8982 - [CI] Remove allow_failures for s390x in TravisCI ARROW-8986 - [Archery][ursabot] Fix benchmark diff checkout of origin/master ARROW-9000 - [Java] build crashes with JDK14 ARROW-9009 - [C++][Dataset] ARROW:schema should be removed from schema’s metadata when reading Parquet files ARROW-9013 - [C++] Validate enum-style CMake options ARROW-9020 - [Python] read_json won’t respect explicit_schema in parse_options ARROW-9024 - [C++/Python] Install anaconda-client in conda-clean job ARROW-9026 - [C++/Python] Force package removal from arrow-nightlies conda repository ARROW-9037 - [C++][C] unable to import array with null count == -1 (which could be exported) ARROW-9057 - [Rust] Projection should work on InMemoryScan without error ARROW-9059 - [Rust] Documentation for slicing array data has the wrong sign ARROW-9066 - [Python] Raise correct error in isnull() ARROW-9071 - [C++] MakeArrayOfNull makes invalid ListArray ARROW-9077 - [C++] Fix aggregate/scalar-compare benchmark null_percent calculation ARROW-9080 - [C++] arrow::AllocateBuffer returns a Result&lt;unique_ptr&lt;Buffer&gt;&gt; ARROW-9082 - [Rust] - Stream reader fail when steam not ended with (optional) 0xFFFFFFFF 0x00000000” ARROW-9084 - [C++] CMake is unable to find zstd target when ZSTD_SOURCE=SYSTEM ARROW-9085 - [C++][CI] Appveyor CI test failures ARROW-9087 - [C++] Missing HDFS options parsing ARROW-9098 - RecordBatch::ToStructArray cannot handle record batches with 0 column ARROW-9105 - [C++] ParquetFileFragment scanning doesn’t handle filter on partition field ARROW-9120 - [C++] Lint and Format C++ files with “codegen” in file name ARROW-9121 - [C++] Do not wipe the filesystem when path is empty ARROW-9122 - [C++] Adapt ascii_lower/ascii_upper bulk transforms to work on sliced arrays ARROW-9126 - [C++] Trimmed Boost bundle fails to build on Windows ARROW-9127 - [Rust] Update thrift library dependencies ARROW-9134 - [Python] Parquet partitioning degrades Int32 to float64 ARROW-9141 - [R] Update cross-package documentation links ARROW-9142 - [C++] random::RandomArrayGenerator::Boolean “probability” misdocumented / incorrect ARROW-9143 - [C++] RecordBatch::Slice erroneously sets non-nullable field’s internal null_count to unknown ARROW-9146 - [C++][Dataset] Scanning a Fragment with a filter + mismatching schema shouldn’t abort ARROW-9151 - [R][CI] Fix Rtools 4.0 build: pacman sync ARROW-9160 - [C++] Implement string/binary contains for exact matches ARROW-9174 - [Go] Tests panic with 386 (x86) builds ARROW-9183 - [C++] Failed to build arrow-cpp with gcc 4.9.2 ARROW-9184 - [Rust][Datafusion] table scan without projection should return all columns ARROW-9194 - [C++] Array::GetScalar not implemented for decimal type ARROW-9195 - [Java] Wrong usage of Unsafe.get from bytearray in ByteFunctionsHelper class ARROW-9209 - [C++] Benchmarks fail to build ARROW_IPC=OFF and ARROW_BUILD_TESTS=OFF ARROW-9219 - [R] coerce_timestamps in Parquet write options does not work ARROW-9221 - ArrowBuf#setBytes(int, ByteBuffer) doesn’t check the byte buffer’s endianness ARROW-9223 - [Python] Fix to_pandas() export for timestamps within structs ARROW-9230 - [FlightRPC][Python] flight.connect() doesn’t pass through all arguments ARROW-9233 - [C++] is_null on NullArray should be true for all values ARROW-9236 - [Rust] CSV WriterBuilder never writes header ARROW-9237 - [R] 0.17 install on Arch Linux ARROW-9238 - [C++][CI] A few test coverages of round-robin in ipc and flight ARROW-9252 - [Integration] GitHub Actions integration test job does not test against “gold” 0.14.1 files in apache/arrow-testing ARROW-9260 - [CI] “ARM64v8 Ubuntu 20.04 C++” fails ARROW-9260 - [CI] “ARM64v8 Ubuntu 20.04 C++” fails ARROW-9261 - [Python][Packaging] S3FileSystem curl errors in manylinux wheels ARROW-9274 - [Rust] [Integration Testing] Read i64 from json files as strings ARROW-9282 - [R] Remove usage of _EXTPTR_PTR ARROW-9284 - [Java] getMinorTypeForArrowType returns sparse minor type for dense union types ARROW-9288 - [C++][Dataset] Discovery of partition field as dictionary type segfaulting with HivePartitioning ARROW-9297 - [C++][Dataset] Dataset scanner cannot handle large binary column (&gt; 2 GB) ARROW-9298 - [C++] Fix crashes on invalid input (OSS-Fuzz) ARROW-9303 - [R] Linux static build should always bundle dependencies ARROW-9305 - [Python] Dependency load failure in Windows wheel build ARROW-9315 - [Java] Fix the failure of testAllocationManagerType ARROW-9317 - [Java] A few testcases for arrow-memory ARROW-9326 - [Python] Setuptools 49.1.0 appears to break our Python 3.6 builds ARROW-9326 - [Python] Setuptools 49.1.0 appears to break our Python 3.6 builds ARROW-9326 - [Python] Setuptools 49.1.0 appears to break our Python 3.6 builds ARROW-9330 - [C++] Fix crashes on corrupt IPC input (OSS-Fuzz) ARROW-9334 - [Dev][Archery] Push ancestor docker images ARROW-9336 - [Ruby] Creating RecordBatch with structs missing keys results in a malformed table ARROW-9343 - [C++][Gandiva] CastINT/Float functions from string should handle leading/trailing white spaces ARROW-9347 - [Python] Tests fail with latest fsspec ARROW-9350 - [C++][CI] Nightly valgrind job failures ARROW-9351 - [C++][CI] Nightly test-ubuntu-18.04-cpp-cmake32 fails ARROW-9353 - [Python][CI] Nightly dask integration jobs fail ARROW-9354 - [C++] Turbodbc latest fails to build in the integration tests ARROW-9355 - [R] Fix -Wimplicit-int-float-conversion ARROW-9360 - [CI][Crossbow] Nightly homebrew-cpp job times out ARROW-9363 - [C++][Dataset] ParquetDatasetFactory schema: pandas metadata is lost ARROW-9368 - [Python] Rename predicate argument to filter in split_by_row_group() ARROW-9373 - [C++] Fix Parquet crash on invalid input (OSS-Fuzz) ARROW-9380 - [C++] Segfaults in compute::CallFunction ARROW-9384 - [C++] Out-of-memory on invalid IPC input (OSS-Fuzz) ARROW-9385 - [Python] [CI] jpype integration failure ARROW-9389 - [C++] Can’t call isin/match through CallFunction ARROW-9397 - [R] Pass CC/CXX to cmake when building libarrow in Linux build ARROW-9408 - [Integration] Tests do not run in Windows due to numpy 64-bit errors ARROW-9409 - [CI][Crossbow] Nightly conda-r fails ARROW-9410 - [CI][Crossbow] Fix homebrew-cpp again ARROW-9413 - [Rust] Fix clippy lint on master ARROW-9415 - [C++] Arrow does not compile on Power9 ARROW-9416 - [Go] Add test cases for some datatypes ARROW-9417 - [C++][IPC] size in message written in native endian ARROW-9418 - [R] nyc-taxi Parquet files not downloaded in binary mode on Windows ARROW-9419 - [C++] Test that “fill_null” function works with sliced inputs, expand tests ARROW-9428 - [C++] Update documentation for buffer allocation functions ARROW-9436 - [C++][CI] Valgrind errors in fill_null kernel tests ARROW-9438 - [CI] Spark integration tests are failing ARROW-9439 - [C++] Fix crash on invalid IPC input (OSS-Fuzz) ARROW-9440 - [Python] Expose Fill Null Compute Kernel in PyArrow ARROW-9443 - [C++] Bundled bz2 build should only build libbz2 ARROW-9448 - [Java] Circular initialization between ArrowBuf and BaseAllocator leads to null HistoricalLog for empty buffer ARROW-9449 - [R] Strip arrow.so ARROW-9450 - [Python] “pytest pyarrow” takes over 10 seconds to collect tests and start executing ARROW-9456 - [Python] Dataset segfault when not importing pyarrow.parquet ARROW-9458 - [Python] Dataset Scanner is single-threaded only ARROW-9460 - [C++] BinaryContainsExact doesn’t cope with double characters in the pattern ARROW-9461 - [Rust] Reading Date32 and Date64 errors - they are incorrectly converted to RecordBatch ARROW-9476 - [C++][Dataset] HivePartitioning discovery with dictionary types fails for multiple fields ARROW-9486 - [C++][Dataset] Support implicit casting InExpression::set_ to dict ARROW-9497 - [C++][Parquet] Fix failure caused by malformed repetition/definition levels ARROW-9499 - [C++] AdaptiveIntBuilder::AppendNull does not increment the null count ARROW-9500 - [C++] Fix segfault with std::to_string in -O3 builds on gcc 7.5.0 ARROW-9501 - [C++][Gandiva] Add logic in timestampdiff() when end date is last day of a month ARROW-9503 - [Rust] Comparison sliced arrays is wrong ARROW-9504 - [Python] Segmentation fault on ChunkedArray.take ARROW-9506 - [Packaging][Python] Fix macOS wheel build failures ARROW-9512 - [C++] Variadic template unpack inside lambda doesn’t compile with gcc ARROW-9524 - [CI][Gandiva] C++ unit test arrow-ipc-read-write failing in gandiva nightly build ARROW-9527 - [Rust] Remove un-needed dev-dependencies PARQUET-1839 - [C++] values_read not updated in ReadBatchSpaced PARQUET-1857 - [C++][Parquet] ParquetFileReader unable to read files with more than 32767 row groups PARQUET-1865 - [C++] Failure from C++17 feature used in parquet/encoding_benchmark.cc PARQUET-1877 - [C++] Reconcile container size with string size for memory issues PARQUET-1882 - [C++] Writing an all-null column and then reading it with buffered_stream aborts the process New Features and Improvements ARROW-300 - [Format] Add body buffer compression option to IPC message protocol using LZ4 or ZSTD ARROW-842 - [Python] Handle more kinds of null sentinel objects from pandas 0.x ARROW-971 - [C++/Python] Implement Array.isvalid/notnull/isnull as scalar functions ARROW-974 - [Website] Add Use Cases section to the website ARROW-1277 - Completing integration tests for major implemented data types ARROW-1567 - [C++] Implement “fill null” kernels that replace null values with some scalar replacement value ARROW-1570 - [C++] Define API for creating a kernel instance from function of scalar input and output with a particular signature ARROW-1682 - [Python] Add documentation / example for reading a directory of Parquet files on S3 ARROW-1796 - [Python] RowGroup filtering on file level ARROW-2260 - [C++][Plasma] plasma_store should show usage ARROW-2444 - [Python][C++] Better handle reading empty parquet files ARROW-2702 - [Python] Examine usages of Invalid and TypeError errors in numpy_to_arrow.cc to see if we are using the right error type in each instance ARROW-2714 - [C++/Python] Variable step size slicing for arrays ARROW-2912 - [Website] Build more detailed Community landing page a la Apache Spark ARROW-3089 - [Rust] Add ArrayBuilder for different Arrow arrays ARROW-3134 - [C++] Implement n-ary iterator for a collection of chunked arrays with possibly different chunking layouts ARROW-3154 - [Python][C++] Document how to write _metadata, _common_metadata files with Parquet datasets ARROW-3244 - [Python] Multi-file parquet loading without scan ARROW-3275 - [Python] Add documentation about inspecting Parquet file metadata ARROW-3308 - [R] Convert R character vector with data exceeding 2GB to Large type ARROW-3317 - [R] Test/support conversions from data.frame with a single character column exceeding 2GB capacity of BinaryArray ARROW-3446 - [R] Document mapping of Arrow &lt;-&gt; R types ARROW-3509 - [C++] Inconsistent child accessor naming ARROW-3520 - [C++] Implement List Flatten kernel ARROW-3688 - [Rust] Implement PrimitiveArrayBuilder&lt;T&gt;.push_values ARROW-3827 - [Rust] Implement UnionArray ARROW-4022 - [C++] Promote Datum variant out of compute namespace ARROW-4221 - [Format] Add canonical flag in COO sparse index ARROW-4390 - [R] Serialize “labeled” metadata in Feather files, IPC messages ARROW-4412 - [DOCUMENTATION] Add explicit version numbers to the arrow specification documents. ARROW-4427 - [Doc] Move Confluence Wiki pages to the Sphinx docs ARROW-4429 - [Doc] Add git rebase tips to the ‘Contributing’ page in the developer docs ARROW-5035 - [C#] ArrowBuffer.Builder&lt;bool&gt; is broken ARROW-5082 - [Python][Packaging] Reduce size of macOS and manylinux1 wheels ARROW-5143 - [Flight] Enable integration testing of batches with dictionaries ARROW-5279 - [C++] Support reading delta dictionaries in IPC streams ARROW-5377 - [C++] Make IpcPayload public and add GetPayloadSize ARROW-5489 - [C++] Normalize kernels and ChunkedArray behavior ARROW-5548 - [Documentation] http://arrow.apache.org/docs/latest/ is not latest ARROW-5649 - [Integration][C++] Create round trip integration test for extension types ARROW-5708 - [C#] Null support for BooleanArray ARROW-5760 - [C++] Optimize Take implementation ARROW-5854 - [Python] Expose compare kernels on Array class ARROW-6052 - [C++] Divide up arrow/array.h,cc into files in arrow/array/ similar to builder files ARROW-6110 - [Java] Support LargeList Type and add integration test with C++ ARROW-6111 - [Java] Support LargeVarChar and LargeBinary types and add integration test with C++ ARROW-6439 - [R] Implement S3 file-system interface in R ARROW-6456 - [C++] Possible to reduce object code generated in compute/kernels/take.cc? ARROW-6501 - [C++] Remove non_zero_length field from SparseIndex ARROW-6521 - [C++] Add function to arrow:: namespace that returns the current ABI version ARROW-6543 - [R] Support LargeBinary and LargeString types ARROW-6602 - [Doc] Add feature / implementation matrix ARROW-6603 - [C#] ArrayBuilder API to support writing nulls ARROW-6645 - [Python] Faster boundschecking of dictionary indices when converting to Categorical ARROW-6689 - [Rust] [DataFusion] Query execution enhancements for 1.0.0 release ARROW-6691 - [Rust] [DataFusion] Use tokio and Futures instead of spawning threads ARROW-6775 - [C++] [Python] Proposal for several Array utility functions ARROW-6776 - [Python] Need a lite version of pyarrow ARROW-6800 - [C++] Add CMake option to build libraries targeting a C++14 or C++17 toolchain environment ARROW-6839 - [Java] Add APIs to read and write “custom_metadata” field of IPC file footer ARROW-6856 - [C++] Use ArrayData instead of Array for ArrayData::dictionary ARROW-6917 - ARROW-6917: [Archery][Release] Add support for JIRA curation, changelog generation and commit cherry-picking for maintenance releases ARROW-6945 - [Rust] Enable integration tests ARROW-6959 - [C++] Clarify what signatures are preferred for compute kernels ARROW-6978 - [R] Add bindings for sum and mean compute kernels ARROW-6979 - [R] Enable jemalloc in autobrew formula ARROW-6982 - [R] Add bindings for compare and boolean kernels ARROW-7009 - [C++] Refactor filter/take kernels to use Datum instead of overloads ARROW-7010 - [C++] Support lossy casts from decimal128 to float32 and float64/double ARROW-7011 - [C++] Implement casts from float/double to decimal128 ARROW-7012 - [C++] Clarify ChunkedArray chunking strategy and policy ARROW-7068 - [C++] Expose the offsets of a ListArray as a Int32Array ARROW-7075 - [C++] Boolean kernels should not allocate in Call() ARROW-7175 - [Website] Add a security page to track when vulnerabilities are patched ARROW-7229 - [C++] Unify ConcatenateTables APIs ARROW-7230 - [C++] Use vendored std::optional instead of boost::optional in Gandiva ARROW-7237 - [C++] Add Result&lt;T&gt; to APIs to arrow/json ARROW-7243 - [Docs] Add common “implementation status” table to the README of each native language implementation, as well as top level README ARROW-7285 - [C++] ensure C++ implementation meets clarified dictionary spec ARROW-7300 - [C++][Gandiva] Implement functions to cast from strings to integers/floats ARROW-7313 - [C++] Add function for retrieving a scalar from an array slot ARROW-7371 - [GLib] Add Datasets binding ARROW-7375 - [Python] Expose C++ MakeArrayOfNull ARROW-7391 - [Python] Remove unnecessary classes from the binding layer ARROW-7495 - [Java] Remove “empty” concept from ArrowBuf, replace with custom referencemanager ARROW-7605 - [C++] Create and install static library containing all dependencies built by Arrow ARROW-7607 - [C++] Add to cpp/examples minimal examples of using Arrow as a dependency of another CMake project ARROW-7673 - [C++][Dataset] Revisit File discovery failure mode ARROW-7676 - [Packaging][Python] Ensure that the static libraries are not built in the wheel scripts ARROW-7699 - [Java] Support concating dense union vectors in batch ARROW-7705 - [Rust] Initial sort implementation ARROW-7717 - [CI] Have nightly integration test for Spark’s latest release ARROW-7759 - [C++][Dataset] Add CsvFileFormat for CSV support ARROW-7778 - [C++] Support nested dictionaries in JSON integration format ARROW-7784 - [C++] diff.cc is extremely slow to compile ARROW-7801 - [Developer] Add issue_comment workflow to fix lint/style/codegen ARROW-7803 - [R][CI] Autobrew/homebrew tests should not always install from master ARROW-7831 - [Java] unnecessary buffer allocation when calling splitAndTransferTo on variable width vectors ARROW-7831 - [Java] unnecessary buffer allocation when calling splitAndTransferTo on variable width vectors ARROW-7902 - [Integration] Unskip nested dictionary integration tests ARROW-7910 - [C++] Provide function to query page size portably ARROW-7924 - [Rust] Add sort for float types ARROW-7950 - [Python] When initializing pandas API shim, inform user if their installed pandas version is too old ARROW-7955 - [Java] Support large buffer for file/stream IPC ARROW-8020 - [Java] Implement vector validate functionality ARROW-8023 - [Website] Write a blog post about the C data interface ARROW-8025 - [C++] Implement cast to Binary and FixedSizeBinary ARROW-8025 - [C++] Implement cast to Binary and FixedSizeBinary ARROW-8046 - [Developer][Integration] Makefile.docker’s target names are broken ARROW-8062 - [C++][Dataset] Parquet Dataset factory from a _metadata/_common_metadata file ARROW-8065 - [C++][Dataset] Untangle Dataset, Fragment and ScanOptions ARROW-8074 - [C++][Dataset] Support for file-like objects (buffers) in FileSystemDataset? ARROW-8108 - [Java] Extract a common interface for dictionary encoders ARROW-8111 - [C++][CSV] Support MM/DD/YYYY date format ARROW-8114 - [Java][Integration] Enable custom_metadata integration test ARROW-8121 - [Java] Enhance code style checking for Java code (add space after commas, semi-colons and type casts) ARROW-8149 - [C++/Python] Enable CUDA Support in conda recipes ARROW-8157 - [C++][Gandiva] Support building with LLVM 9 ARROW-8162 - [Format][Python] Add serialization for CSF sparse tensors ARROW-8169 - [Java] Improve the performance of JDBC adapter by allocating memory proactively ARROW-8171 - Consider pre-allocating memory for fix-width vector in Avro adapter iterator ARROW-8190 - [C++][Flight] Allow setting IpcWriteOptions and IpcReadOptions in Flight IPC message reader and writer classes ARROW-8229 - [Java] Move ArrowBuf into the Arrow package ARROW-8230 - [Java] Move Netty memory manager into a separate module ARROW-8261 - [Rust] [DataFusion] LogicalPlanBuilder.limit() should take a literal argument ARROW-8263 - [Rust] [DataFusion] Add documentation for supported SQL functions ARROW-8281 - [R] Name collision of arrow.dll on Windows conda ARROW-8283 - [Python][Dataset] Non-existent files are silently dropped in pa.dataset.FileSystemDataset ARROW-8287 - [Rust] Arrow examples should use utility to print results ARROW-8293 - [Python] Run flake8 on python/examples also ARROW-8297 - [FlightRPC][C++] Implement Flight DoExchange for C++ ARROW-8301 - [R] Handle ChunkedArray and Table in C data interface ARROW-8312 - [Java][Gandiva] improve IN expression support ARROW-8314 - [Python] Provide a method to select a subset of columns of a Table ARROW-8318 - [C++][Dataset] Dataset should instantiate Fragment ARROW-8399 - [Rust] Extend memory alignments to include other architectures ARROW-8413 - [C++] Refactor DefLevelsToBitmap ARROW-8422 - [Rust] [Parquet] Implement function to convert Arrow schema to Parquet schema ARROW-8430 - [CI] Configure self-hosted runners for Github Actions ARROW-8434 - [C++] Ipc RecordBatchFileReader deserializes the Schema multiple times ARROW-8440 - [C++] Refine simd header files ARROW-8443 - [Gandiva][C++] Fix round/truncate to no-op for special cases ARROW-8447 - [C++][Dataset] Ensure Scanner::ToTable preserve ordering of ScanTasks ARROW-8467 - [C++] Test cases using ArrayFromJSON assume only a little-endian platform ARROW-8474 - [CI][Crossbow] Skip some nightlies we don’t need to run ARROW-8477 - [C++] Enable reading and writing of long filenames for Windows ARROW-8481 - [Java] Provide an allocation manager based on Unsafe API ARROW-8483 - [Ruby] Arrow::Table documentation improvement ARROW-8485 - [Integration][Java] Implement extension types integration ARROW-8486 - [C++] arrow-utility-test causes failures on a big-endian platform ARROW-8487 - [FlightRPC][C++] Make it possible to target a specific payload size ARROW-8488 - [R] Replace VALUE_OR_STOP with ValueOrStop ARROW-8496 - [C++] Refine ByteStreamSplitDecodeScalar ARROW-8497 - [Archery] Add missing component to builds ARROW-8499 - [C++][Dataset] In ScannerBuilder, batch_size will not work if projecter is not empty ARROW-8500 - [C++] Use selection vectors in Filter implementation for record batches, tables ARROW-8501 - [Packaging][RPM] Upgrade devtoolset to 8 on CentOS 6 ARROW-8502 - [Release][APT][Yum] Ignore all arm64 verifications ARROW-8504 - [C++] Add Run Length Reader ARROW-8506 - [c++] Miss tests to verify expected_buffer with bit_width &gt; 8 in RLE ARROW-8507 - [Release] Detect .git directory automatically in changelog.py ARROW-8509 - [GLib] Add low level record batch read/write functions ARROW-8512 - [C++] Delete unused compute expr prototype code ARROW-8513 - [Python] Expose Take with Table input in Python ARROW-8515 - [C++] Bitmap ToString should have an option of grouping by bytes ARROW-8516 - [Rust] Slow BufferBuilder&lt;BooleanType&gt; inserts within PrimitiveBuilder::append_slice ARROW-8517 - [Developer][Release] Update Crossbow RC verification setup for changes since 0.16.0 ARROW-8520 - [Developer] Use .asf.yaml to direct GitHub notifications to e-mail lists and JIRA ARROW-8521 - [Developer] Group Sub-task, Task, Test, and Wish issue types as “Improvement” in Changelog ARROW-8522 - [Developer] Add environment variable option to toggle whether ephemeral NodeJS is installed in release verification script ARROW-8524 - [CI] Free up space on github actions ARROW-8526 - [Python] Fix non-deterministic row order failure in dataset tests ARROW-8531 - [C++] Deprecate ARROW_USE_SIMD CMake option ARROW-8538 - [Packaging] Remove boost from homebrew formula ARROW-8540 - [C++] Create memory allocation benchmark ARROW-8541 - [Release] Don’t remove previous source releases automatically ARROW-8542 - [Release] Fix checksum url in the website post release script ARROW-8543 - [C++] IO: single pass coalescing algorithm ARROW-8544 - [CI][Crossbow] Add a status.json to the gh-pages summary of nightly builds to get around rate limiting ARROW-8548 - [Website] 0.17 release post ARROW-8549 - [R] Assorted post-0.17 release cleanups ARROW-8550 - [CI] Don’t run cron GHA jobs on forks ARROW-8551 - [CI][Gandiva] Use LLVM 8 to build gandiva linux jar ARROW-8552 - [Rust] support column iteration for parquet row ARROW-8553 - [C++] Optimize unaligned bitmap operations ARROW-8555 - [FlightRPC][Java] Implement Flight DoExchange for Java ARROW-8558 - [Rust] GitHub Actions missing rustfmt ARROW-8559 - [Rust] Consolidate Record Batch reader traits in main arrow crate ARROW-8560 - [Rust] Docs for MutableBuffer resize are incorrect ARROW-8561 - [C++][Gandiva] Stop using deprecated google::protobuf::MessageLite::ByteSize() ARROW-8562 - [C++] IO: Parameterize I/O coalescing using S3 storage metrics ARROW-8563 - [Go] Minor change to make newBuilder public ARROW-8564 - [Website] Add Ubuntu 20.04 LTS to supported package list ARROW-8569 - [CI] Upgrade xcode version for testing homebrew formulae ARROW-8571 - [C++] Switch AppVeyor image to VS 2017 ARROW-8572 - [Python] Expose UnionArray.array and other fields ARROW-8573 - [Rust] Upgrade to Rust 1.44 nightly ARROW-8574 - [Rust] Implement Debug for all plain types ARROW-8575 - [Developer] Add issue_comment workflow to rebase a PR ARROW-8590 - [Rust] Use Arrow pretty print utility in DataFusion ARROW-8591 - [Rust] Reverse lookup for a key in DictionaryArray ARROW-8597 - [Rust] arrow crate lint and readability improvements ARROW-8606 - [CI] Don’t trigger all builds on a change to any file in ci/ ARROW-8607 - [R][CI] Unbreak builds following R 4.0 release ARROW-8611 - [R] Can’t install arrow 0.17 on Ubuntu 18.04 R 3.6.3 ARROW-8612 - [GLib] Add GArrowReadOptions and GArrowWriteOptions ARROW-8616 - [Rust] Turn explicit SIMD off by default ARROW-8619 - [C++] Use distinct Type::type values for interval types ARROW-8622 - [Rust] Parquet crate does not compile on aarch64 ARROW-8623 - [C++][Gandiva] Reduce use of Boost, remove Boost headers from header files ARROW-8624 - [Website] Install page should mention arrow-dataset packages ARROW-8628 - [CI][Dev] Wrap docker-compose commands with archery ARROW-8629 - [Rust] Eliminate indirection of ZST allocations ARROW-8633 - [C++] Add ValidateAscii function ARROW-8634 - [Java] Create an example ARROW-8639 - [C++][Plasma] Require gflags ARROW-8645 - [C++] Missing gflags dependency for plasma ARROW-8647 - [C++][Dataset] Optionally encode partition field values as dictionary type ARROW-8648 - [Rust] Optimize Rust CI Build Times ARROW-8650 - [Rust] [Website] Add documentation to Arrow website ARROW-8651 - [Python][Dataset] Support pickling of Dataset objects ARROW-8655 - [C++][Dataset][Python][R] Preserve partitioning information for a discovered Dataset ARROW-8656 - [Python] Switch to VS2017 in the windows wheel builds ARROW-8659 - [Rust] ListBuilder and FixedSizeListBuilder capacity ARROW-8660 - [C++][Gandiva] Reduce dependence on Boost ARROW-8662 - [CI] Consolidate appveyor scripts ARROW-8664 - [Java] Add skip null check to all Vector types ARROW-8668 - [Packaging][APT][Yum][ARM] Use Travis CI’s ARM machine to build packages ARROW-8669 - [C++] Add IpcWriteOptions argument to GetRecordBatchSize() ARROW-8671 - [C++] Use IPC body compression metadata approved in ARROW-300 ARROW-8671 - [C++] Use IPC body compression metadata approved in ARROW-300 ARROW-8682 - [Ruby][Parquet] Add support for column level compression ARROW-8687 - [Java] Finish move of io.netty.buffer.ArrowBuf ARROW-8690 - [Python] Clean-up dataset+parquet tests now order is determinstic ARROW-8692 - [C++] Avoid memory copies when downloading from S3 ARROW-8695 - [Java] remove references to PlatformDependent in memory module ARROW-8696 - [Java] Convert tests to integration tests ARROW-8699 - [R] Fix automatic r_to_py conversion ARROW-8702 - [Packaging][C#] Build NuGet packages in release process ARROW-8703 - [R] schema$metadata should be properly typed ARROW-8707 - [CI] Docker push fails because of wrong dockerhub credentials ARROW-8708 - [CI] Utilize github actions cache for docker-compose volumes ARROW-8711 - [Python] Expose strptime timestamp parsing in read_csv conversion options ARROW-8717 - [CI][Packaging] Add build dependency on boost to homebrew ARROW-8720 - [C++] Fix checked_pointer_cast ARROW-8721 - [CI] Fix R build matrix ARROW-8723 - [Rust] Remove SIMD specific benchmark code ARROW-8724 - [Packaging][deb][RPM] Use directory in host as build directory ARROW-8725 - [Rust] redundant directory walk in rust parquet datasource code ARROW-8727 - [C++] Do not require struct-initialization of StringConverter&lt;T&gt; to parse strings to other types ARROW-8730 - [Rust] Use slice instead of &amp;Vec for function arguments ARROW-8733 - [C++][Dataset][Python] ParquetFileFragment should provide access to parquet FileMetadata ARROW-8736 - [Rust] [DataFusion] Table API should provide a schema() method ARROW-8740 - [CI] Fix archery option in pandas master cron test ARROW-8742 - [C++][Python] Add flight client support for Mutual TLS ARROW-8743 - [C++][CI] Add a test job on s390x ARROW-8744 - [Rust] ParquetIterator’s next method should be safe to call even after reached end of iteration ARROW-8745 - [C++] Bitmap.ToString causes failures on a big-endian platform ARROW-8747 - [C++] Feather tests with compression cause failure on big-endian platforms ARROW-8751 - [Rust] ParquetFileArrowReader should be able to read empty parquet file without error ARROW-8752 - [Rust] Remove unused hashmap ARROW-8753 - [C++][CI] Add a test job on ARM ARROW-8754 - [C++][CI] enable tests for additional components on big-endian platforms ARROW-8756 - [C++] Bitmap word tests cause failures on a big-endian platform ARROW-8757 - [C++] Plasma header is written in native endian ARROW-8758 - [R] Updates for compatibility with dplyr 1.0 ARROW-8759 - [C++] TestPlasmaSerialization.DeleteReply tests failure on big-endian platforms ARROW-8762 - [C++][Gandiva] Replace Gandiva’s BitmapAnd with common implementation ARROW-8763 - [C++] Create RandomAccessFile::WillNeed-like API ARROW-8764 - [C++] Make ThreadPool configurable in ReadRangeCache ARROW-8766 - [Python] A FileSystem implementation based on Python callbacks ARROW-8769 - [C++] Add convenience methods to access fields by name in StructScalar ARROW-8770 - [C++][CI] enable arrow-csv-test on s390x ARROW-8772 - [C++] Expand SumKernel benchmark to more types ARROW-8777 - [Rust] Parquet.rs does not support reading fixed-size binary fields. ARROW-8778 - [C++][Gandiva] SelectionVector related test failed on big-endian platforms ARROW-8779 - [R] Implement conversion to List&lt;Struct&gt; ARROW-8781 - [CI][C++] Enable ccache on GHA MinGW jobs ARROW-8782 - [Rust] [DataFusion] Add benchmarks based on NYC Taxi data set ARROW-8783 - [Rust] [DataFusion] Logical plan should have ParquetScan and CsvScan entries ARROW-8784 - [Rust] [DataFusion] Remove use of Arc from LogicalPlan ARROW-8785 - [Python][Packaging] Build the windows wheels with MIMALLOC enabled ARROW-8786 - [Packaging][rpm] Use bundled zstd in the CentOS 8 build ARROW-8788 - [C#] Array builders to use bit-packed buffer builder rather than boolean array builder for validity map ARROW-8789 - [Rust] Add separate crate for integration test binaries ARROW-8790 - [C++][CI] Enable arrow-flight-test on s390x ARROW-8791 - [Rust] Creating StringDictionaryBuilder with existing dictionary values ARROW-8792 - [C++] Improved declarative compute function / kernel development framework, normalize calling conventions ARROW-8793 - [C++] BitUtil::SetBitsTo probably doesn’t need to be inline ARROW-8794 - [C++] Expand benchmark coverage for arrow from parquet reading ARROW-8795 - [C++] Limited iOS support ARROW-8800 - [C++] Split arrow::ChunkedArray into arrow/chunked_array.h ARROW-8804 - [R][CI] Followup to Rtools40 upgrade ARROW-8814 - [Dev][Release] Binary upload script keeps raising locale warnings ARROW-8815 - [Dev][Release] Binary upload script should retry on unexpected bintray request error ARROW-8818 - [Rust] Failing to build on master due to Flatbuffers/Union issues ARROW-8822 - [Rust] [DataFusion] Add MemoryScan variant to LogicalPlan ARROW-8827 - [Integration Testing] Initial skeleton for Rust integration tests ARROW-8830 - [GLib] Add support for Tell againt not seekable GIO output stream ARROW-8831 - [Rust] incomplete SIMD implementation in simd_compare_op ARROW-8833 - [Rust] Implement VALIDATE mode in integration test binary ARROW-8834 - [Rust] Implement arrow-file-to-stream for integration testing ARROW-8835 - [Rust] Implement arrow-stream-to-file for integration testing ARROW-8836 - [Website] Update copyright end year automatically ARROW-8837 - [Rust] Add Null type ARROW-8838 - [Rust] File reader fails to read header from valid files ARROW-8839 - [Rust] datafusion logical plan should support scaning csv without provided schema ARROW-8840 - [Rust] datafusion ExecutionError should implement std::error:Error trait ARROW-8841 - [C++] Add benchmark and unittest for PLAIN spaced ARROW-8843 - [C++] Optimize BitmapEquals unaligned case ARROW-8844 - [C++] Optimize TransferBitmap unaligned case ARROW-8846 - [Dev][Python] Autoformat Python sources with Archery ARROW-8847 - [C++] Pass task size / metrics in Executor API ARROW-8851 - [Python][Documentation] Fix FutureWarnings in Python Plasma docs ARROW-8852 - [R] Post-0.17.1 adjustments ARROW-8854 - [Rust] [Integration Testing] Show output from arrow-json-integration-test ARROW-8855 - [Rust] [Integration Testing] data type Date32(Day) not supported ARROW-8856 - [Rust] [Integration Testing] Return empty batch if MessageHeader is NONE ARROW-8864 - [R] Add methods to Table/RecordBatch for consistency with data.frame ARROW-8866 - [C++] Split Type::UNION into Type::SPARSE_UNION and Type::DENSE_UNION ARROW-8867 - [R] Support converting POSIXlt type ARROW-8875 - [C++] use AWS SDK SetResponseStreamFactory to avoid a copy of bytes ARROW-8877 - [Rust] add CSV read option struct to simplify datafusion interface ARROW-8880 - [R][Linux] Make R Binary Install Friendlier ARROW-8881 - [Rust] Add large list and binary support ARROW-8885 - [R] Don’t include everything everywhere ARROW-8886 - [C#] Decide and implement appropriate behaviour for Array builder resize to negative size ARROW-8887 - [Java] Buffer size for complex vectors increases rapidly in case of clear/write loop ARROW-8890 - [R] Fix C++ lint issue ARROW-8895 - [C++] Add C++ unit tests for filter and take functions on temporal type inputs, including timestamps ARROW-8896 - [C++] Reimplement dictionary unpacking in Cast kernels using Take ARROW-8899 - [R] Add R metadata like pandas metadata for round-trip fidelity ARROW-8901 - [C++] Reduce number of take kernels ARROW-8903 - [C++] Implement optimized “unsafe take” for use with selection vectors for kernel execution ARROW-8904 - [Python] Fix usages of deprecated C++ APIs related to child/field ARROW-8906 - [Rust] Support reading multiple CSV files for schema inference ARROW-8907 - [Rust] implement scalar comparison operations ARROW-8912 - [Ruby] Keep reference of Arrow::Buffer’s data for GC ARROW-8913 - [Ruby] Use “field” instead of “child” ARROW-8914 - [C++][Gandiva] Decimal128 related test failed on big-endian platforms ARROW-8915 - [Dev][Archery] Require Click 7 ARROW-8917 - [C++][Compute] Formalize “metafunction” concept ARROW-8918 - [C++] Add cast “metafunction” to FunctionRegistry that addresses dispatching to appropriate type-specific CastFunction ARROW-8922 - [C++] Implement example string scalar kernel function to assist with string kernels buildout per ARROW-555 ARROW-8923 - [C++] Improve usability of arrow::compute::CallFunction by moving ExecContext* argument to end and adding default ARROW-8926 - [C++] Improve docstrings in new public APIs in arrow/compute and fix miscellaneous typos ARROW-8927 - [C++] Support dictionary memos when reading/writing record batches using cuda IPC ARROW-8929 - [C++] Change compute::Arity:VarArgs min_args default to 0 ARROW-8931 - [Rust] Support lexical sort in arrow compute kernel ARROW-8933 - [C++] Reduce generated code in vector_hash.cc ARROW-8934 - [C++] Add timestamp subtract kernel aliased to int64 subtract implementation ARROW-8937 - [C++] Add “parse_strptime” function for string to timestamp conversions using the kernels framework ARROW-8938 - [R] Provide binding for arrow::compute::CallFunction ARROW-8940 - [Java] Fix the performance degradation of integration tests ARROW-8941 - [C++/Python] arrow-nightlies conda repository is full ARROW-8942 - [R] Detect compression in reading CSV/JSON ARROW-8943 - [C++][Dataset] Add support for Partitioning to ParquetDatasetFactory ARROW-8950 - [C++] Make head optional in s3fs ARROW-8958 - [FlightRPC][Python] Implement Flight DoExchange for Python ARROW-8960 - [MINOR] [FORMAT] Fix typos in comments ARROW-8961 - [C++] Add utf8proc library to toolchain ARROW-8963 - [C++][Parquet] Parquet cpp optimize allocate memory ARROW-8965 - [Python][Documentation] Pyarrow documentation for pip nightlies references 404’d location ARROW-8966 - [C++] Move arrow::ArrayData to a separate header file ARROW-8969 - [C++] Reduce generated code in compute/kernels/scalar_compare.cc ARROW-8970 - [C++] Reduce shared library / binary code size (umbrella issue) ARROW-8972 - [Java] Support range value comparison for large varchar/varbinary vectors ARROW-8973 - [Java] Support batch value appending for large varchar/varbinary vectors ARROW-8974 - [C++] Refine TransferBitmap template parameters ARROW-8976 - [C++] compute::CallFunction can’t Filter/Take with ChunkedArray ARROW-8979 - [C++] Implement bitmap word reader and writer ARROW-8984 - [R] Revise install guides now that Windows conda package exists ARROW-8985 - [Format] Add “byte width” field with default of 16 to Decimal Flatbuffers type for forward compatibility ARROW-8989 - [C++] Document available functions in compute::FunctionRegistry ARROW-8993 - [Rust] Support reading non-seekable sources in text readers ARROW-8994 - [C++] Disable include-what-you-use cpplint lint checks ARROW-8996 - [C++] Runtime SIMD path for Aggregate Sum/Mean kernel ARROW-8997 - [Archery] Benchmark formatter should have friendly units ARROW-9004 - [C++][Gandiva] Support building with LLVM 10 ARROW-9005 - [Rust] [DataFusion] Support sort expression ARROW-9007 - [Rust] Support appending arrays by merging array data ARROW-9014 - [Packaging] Bump the minor part of the automatically generated version in crossbow ARROW-9015 - [Java] Make BaseAllocator package private ARROW-9016 - [Java] Remove direct references to Netty/Unsafe Allocators ARROW-9017 - [Python] Refactor the Scalar classes ARROW-9018 - [C++] Remove APIs that were deprecated in 0.17.x and prior ARROW-9021 - [Python] The filesystem keyword in parquet.read_table is not documented ARROW-9022 - [C++] Add/Sub/Mul arithmetic kernels with overflow check ARROW-9029 - [C++] Implement BitBlockCounter interface for blockwise popcounts of validity bitmaps ARROW-9030 - [Python] Clean up some usages of pyarrow.compat, move some common functions/symbols to lib.pyx ARROW-9031 - [R] Implement conversion from Type::UINT64 to R vector ARROW-9032 - [C++] Split arrow/util/bit_util.h into multiple header files ARROW-9034 - [C++] Implement binary (two bitmap) version of BitBlockCounter ARROW-9042 - [C++] Add Subtract and Multiply arithmetic kernels with wrap-around behavior ARROW-9043 - [Go] Temporarily copy LICENSE.txt to go/ ARROW-9043 - [Go] Temporarily copy LICENSE.txt to go/ ARROW-9045 - [C++] Improve and expand Take/Filter benchmarks ARROW-9046 - [C++][R] Put more things in type_fwds ARROW-9047 - [Rust] Setting 0-bits of a 0-length bitset segfaults ARROW-9050 - [Release] Use 1.0.0 as the next version ARROW-9051 - [GLib] Refer Array related objects from Array ARROW-9052 - [CI][MinGW] Enable Gandiva ARROW-9055 - [C++] Add sum/mean kernels for Boolean type ARROW-9058 - [Packaging][wheel] Boost download is failed ARROW-9060 - [GLib] Add support for building Apache Arrow Datasets GLib with non-installed Apache Arrow Datasets ARROW-9061 - [Packaging][APT][Yum][GLib] Add Apache Arrow Datasets GLib ARROW-9062 - [Rust] Support to read JSON into dictionary type ARROW-9067 - [C++] Create reusable branchless / vectorized index boundschecking functions ARROW-9070 - [C++] StructScalar needs field accessor methods ARROW-9073 - [C++] RapidJSON include directory detection doesn’t work with RapidJSONConfig.cmake ARROW-9074 - [GLib] Add missing arrow-json check ARROW-9075 - [C++] Optimize Filter implementation ARROW-9079 - [C++] Write benchmark for arithmetic kernels ARROW-9083 - [R] collect int64, uint32, uint64 as R integer type if not out of bounds ARROW-9086 - [CI][Homebrew] Enable Gandiva ARROW-9088 - [Rust] Recent version of arrow crate does not compile into wasm target ARROW-9089 - [Python] A PyFileSystem handler for fsspec-based filesystems ARROW-9090 - [C++] Bump versions of bundled libraries ARROW-9091 - [C++] Utilize function’s default options when passing no options to CallFunction for a function that requires them ARROW-9093 - [FlightRPC][C++][Python] Allow setting gRPC client options ARROW-9094 - [Python] Bump versions of compiled dependencies in manylinux wheels ARROW-9095 - [Rust] Fix NullArray to comply with spec ARROW-9099 - [C++][Gandiva] Add TRIM function for string ARROW-9100 - [C++] Add ascii_lower kernel ARROW-9101 - [Doc][C++][Python] Document encoding expected by CSV and JSON readers ARROW-9102 - [Packaging] Upload built manylinux docker images ARROW-9106 - [C++] Add C++ foundation to ease file transcoding ARROW-9108 - [C++][Dataset] Add Parquet Statistics conversion for timestamp columns ARROW-9109 - [Python][Packaging] Enable S3 support in manylinux wheels ARROW-9110 - [C++] Fix CPU cache size detection on macOS ARROW-9112 - [R] Update autobrew script location ARROW-9115 - [C++] Process data buffers in batch in ascii_lower / ascii_upper kernels rather than using string_view value iteration ARROW-9116 - [C++] Add BinaryArray::total_values_length() ARROW-9116 - [C++] Add BinaryArray::total_values_length() ARROW-9118 - [C++] Add more general BoundsCheck function that also checks for arbitrary lower limits in integer arrays ARROW-9119 - [C++] Add support for building with system static gRPC ARROW-9123 - [Python][wheel] Use libzstd.a explicitly ARROW-9124 - [Rust][Datafusion] DFParser should consume sql query as &amp;str instead of String ARROW-9125 - [C++] Add missing include for arrow::internal::ZeroMemory() for Valgrind ARROW-9129 - [Python][JPype] Test is failed with JPype 0.7.5 ARROW-9130 - [Python] Add deprecated wrappers functions to a pyarrow/compat.py module for 1.0.0 that will be removed later ARROW-9131 - [C++] Faster ascii_lower and ascii_upper ARROW-9132 - [C++] Implement hash kernels for dictionary data with constant dictionaries ARROW-9133 - [C++] Add utf8_upper and utf8_lower ARROW-9137 - [GLib][Ruby] Allow to read Parquet files in chunks (by RowGroup) ARROW-9138 - [Docs][Format] Make sure format version is hard coded in the docs ARROW-9139 - [Python] parquet read_table should not use_legacy_dataset ARROW-9144 - [CI] OSS-Fuzz build fails because recent changes in the google repository ARROW-9145 - [C++] Add true_count / false_count methods to BooleanArray ARROW-9152 - [C++] Create specialized filter implementation for varbinary types ARROW-9153 - [Python] Add bindings for StructScalar ARROW-9154 - [Developer] Use GitHub issue templates better ARROW-9155 - [Archery] Less precise but faster default settings for “archery benchmark diff” ARROW-9156 - [C++] Reducing the code size of the tensor module ARROW-9157 - [Rust][Datafusion] execution context’s create_physical_plan should take self as immutable reference ARROW-9158 - [Rust][Datafusion] Projection physical plan compilation should preserve nullability ARROW-9159 - [Python] Expose the isnull/isvalid kernels ARROW-9162 - [Python] Expose Add/Subtract/Multiply arithmetic kernels ARROW-9163 - [C++] Add methods to StringArray, LargeStringArray, to validate whether its values are all UTF-8 ARROW-9166 - [Website] Add overview page ARROW-9167 - [Doc][Website] /docs/c_glib/index.html is overwritten ARROW-9168 - [C++][Flight] allow flight benchmark to use separated TCP connections ARROW-9173 - [C++] Document how to use Arrow from a third-party CMake project ARROW-9175 - [FlightRPC][C++][Python] Expose connected peer ARROW-9176 - [Rust] Fix for memory leaks in Arrow allocator ARROW-9178 - [R] Improve documentation about CSV reader ARROW-9179 - [R] Replace usage of iris dataset in tests ARROW-9180 - [Developer] Remove usage of whitelist, blacklist, slave, etc. ARROW-9181 - [C++] Instantiate fewer templates in Cast kernel implementation ARROW-9182 - [C++] Use “applicator” namespace for kernel operator-to-kernel functors, streamline argument unboxing ARROW-9185 - [C++] [Java][Gandiva] Make llvm build optimisation configurable from java ARROW-9188 - [C++] Do not always statically link Brotli libraries ARROW-9189 - [Website] Improve contributor guide ARROW-9190 - [Website][C++] Add blog post on efforts to make building lighter and easier ARROW-9191 - [Rust] Do not panic when int96 milliseconds are negative ARROW-9192 - [CI][Rust] Add support for running clippy ARROW-9193 - [C++] Add method to parse date from null-terminated string ARROW-9197 - [C++] Revamp numeric casts: faster performance and reduced binary size ARROW-9201 - [Archery] Render-human readable table when using “archery benchmark diff” ARROW-9202 - [GLib] Add GArrowDatum ARROW-9203 - [Packaging][deb] Add missing gir1.2-arrow-dataset-1.0.install ARROW-9204 - [C++][Flight] change records_per_stream to int64 in flight benchmark ARROW-9205 - [Documentation] Fix typos in Columnar.rst ARROW-9206 - [C++][Flight] measure latency in flight benchmark ARROW-9207 - [Python][Dataset] Clean-up internal FileSource class ARROW-9210 - [C++] Use OptionalBitBlockCounter in ArrayDataInlineVisitor ARROW-9214 - [C++] Avoid util::optional in favor of separate inlineable functions in arrow/visitor_inline.h ARROW-9216 - [C++][Parquet] Use BitBlockCounter for plain spaced encoding/decoding ARROW-9217 - [C++][Parquet] Cover 0.01% null for the plain spaced encoding/decoding benchmark ARROW-9220 - [C++] Disable relevant compute kernels if ARROW_WITH_UTF8PROC=OFF ARROW-9222 - [Format][Proposal] Remove validity bitmap from Union types ARROW-9224 - [Dev][Archery] Copy local repo on clone failure ARROW-9225 - [C++][Compute] Improve counting sort ARROW-9231 - [Format] Increment MetadataVersion from V4 to V5 ARROW-9234 - [GLib][CUDA] Add support for dictionary memo on reading record batch from buffer ARROW-9241 - [C++] Add forward compatibility checks for Decimal::bitWidth ARROW-9242 - [Java] Add forward compatibility checks for Decimal::bitWidth ARROW-9247 - [Python] Expose BinaryArray::total_values_length in bindings ARROW-9248 - [C++] Add “list_size” function that returns Int32Array/Int64Array giving list cell sizes ARROW-9249 - [C++] Implement “list_parent_indices” vector function ARROW-9250 - [C++] Compact generated code in compute/kernels/scalar_set_lookup.cc using same method as vector_hash.cc ARROW-9251 - [C++] Move JSON testing code for integration tests to libarrow_testing ARROW-9254 - [C++] Factor out some integer casting internals so it can be reused with temporal casts ARROW-9255 - [C++] Use CMake to build bundled Protobuf with CMake &gt;= 3.7 ARROW-9256 - [C++] Incorrect variable name ARROW_CXX_FLAGS ARROW-9258 - [Format] Add V5 MetadataVersion ARROW-9259 - [Format] Permit unsigned dictionary indices in Columnar.rst ARROW-9262 - [Packaging][Linux][CI] Use Ubuntu 18.04 to build ARM64 packages on Travis CI ARROW-9263 - [C++] Benchmark: promote RegressionSetArgs size to L2 ARROW-9264 - [C++] Cleanup Parquet Arrow Schema code ARROW-9265 - [C++] Add support for writing MetadataVersion::V4-compatible IPC messages for compatibility with library versions &lt;= 0.17.1 ARROW-9268 - [C++] Add is{alnum,alpha,…} kernels for strings ARROW-9272 - [C++][Python] Reduce complexity in python to arrow conversion ARROW-9276 - [Dev] Enable ARROW_CUDA when generating API documentations ARROW-9277 - [C++] Fix documentation of Reading CSV files ARROW-9278 - [C++] Implement Union validity bitmap changes from ARROW-9222 ARROW-9280 - [Rust] Write statistics to Parquet files ARROW-9281 - [R] Turn off utf8proc in R builds ARROW-9283 - [Python] Expose C++ build info ARROW-9287 - [C++] Implement support for unsigned dictionary indices ARROW-9289 - [R] Remove deprecated functions ARROW-9290 - [Rust] [Parquet] Add features to allow opting out of dependencies ARROW-9291 - [R] Support fixed size binary/list types ARROW-9292 - [Rust] Update feature matrix with passing tests ARROW-9294 - [GLib] Add GArrowFunction ARROW-9300 - [Java] Separate Netty Memory to its own module ARROW-9306 - [Ruby] Add support for Arrow::RecordBatch.new(raw_table) ARROW-9307 - [Ruby] Add Arrow::RecordBatchIterator#to_a ARROW-9308 - [Format] Add Feature enum to schema.fbs for forward compatibity ARROW-9316 - [C++] Use “Dataset” instead of “Datasets” ARROW-9321 - [C++][Dataset] Allow to “collect” statistics for ParquetFragment row groups if not constructed from _metadata ARROW-9322 - [R] Dataset documentation polishing ARROW-9323 - [Ruby] Add Red Arrow Dataset ARROW-9327 - Fix all clippy errors for arrow crate ARROW-9328 - [C++][Gandiva] Add LTRIM, RTRIM, BTRIM functions for string ARROW-9329 - [C++][Gandiva] Implement castTimestampToDate function ARROW-9331 - [C++] Improve the performance of Tensor-to-SparseTensor conversion ARROW-9333 - [Python] Expose more IPC write options in Python ARROW-9335 - [Website] Update website for 1.0 ARROW-9337 - [R] On C++ library build failure, give an unambiguous message ARROW-9339 - [Rust] Comments on SIMD in Arrow README are incorrect ARROW-9340 - [R] Use CRAN version of decor package ARROW-9341 - [GLib] Use arrow::Datum version Take() ARROW-9345 - [C++][Dataset] Expression with dictionary type should work with operand of value type ARROW-9346 - [C++][Python][Dataset] Add total_byte_size metadata to RowGroupInfo ARROW-9362 - [Java] Add support for writing MetadataVersion::V4-compatible IPC messages for compatibility with library versions &lt;= 0.17.1 ARROW-9365 - [Go] Implement the rest of the typed array builders in NewBuilder ARROW-9370 - [Java] Bump Netty version ARROW-9374 - [C++][Python] Expose MakeArrayFromScalar ARROW-9379 - [Rust] Support unsigned dictionary indices ARROW-9383 - [Python] Support fsspec filesystems in Dataset API through fs handler ARROW-9386 - [Rust] RecordBatch.schema() should not return &amp;Arc&lt;Schema&gt; ARROW-9390 - [C++] Review compute function names ARROW-9390 - [C++] Review compute function names ARROW-9390 - [C++] Review compute function names ARROW-9391 - [Rust] Float32 values interpreted as zero when record batch has one row ARROW-9393 - [Doc] update supported types documentation for Java ARROW-9395 - [Python] Provide configurable MetadataVersion in IPC API and environment variable to set default to V4 when needed ARROW-9399 - [C++] Add forward compatibility checks for unrecognized future MetadataVersion ARROW-9403 - [Python] add .tolist as alias of .to_pylist ARROW-9407 - [Python] Accept pd.NA as missing value in array constructor ARROW-9411 - [Rust] Update dependencies ARROW-9424 - [C++][Parquet] Disable writing files with LZ4 codec ARROW-9425 - [Rust][DataFusion] Make ExecutionContext sharable between threads ARROW-9427 - [Rust][DataFusion] Add pub fn ExecutionContext.tables() ARROW-9437 - [Python][Packaging] Homebrew fails to install build dependencies in the macOS wheel builds ARROW-9442 - [Python] Do not force Validate() to be called in pyarrow_wrap_table ARROW-9445 - [Python] Revert Array.equals changes + expose comparison ops in compute ARROW-9446 - [C++] Export compiler information in BuildInfo ARROW-9447 - [Rust][DataFusion] Allow closures as ScalarUDFs ARROW-9452 - [Rust] [DateFusion] Improve performance of parquet scan ARROW-9470 - [CI][Java] Run Maven in parallel ARROW-9472 - [R] Provide configurable MetadataVersion in IPC API and environment variable to set default to V4 when needed ARROW-9473 - [Doc] Polishing for 1.0 ARROW-9478 - [C++] Improve error message on unsupported cast types ARROW-9484 - [Docs] Update is* functions to be is_* in the compute docs ARROW-9485 - [R] Better shared library stripping ARROW-9493 - [Python][Dataset] Dictionary encode string partition columns by default ARROW-9508 - [Release][APT][Yum] Enable verification for arm64 binaries ARROW-9509 - [Release] Don’t test Gandiva in the windows wheel verification script ARROW-9511 - [Packaging][Release] Set conda packages’ build number to 0 ARROW-9519 - [Rust] Improve error message when getting a field by name from schema ARROW-9529 - [Dev][Release] Improvements to release verification scripts ARROW-9531 - [Packaging][Release] Update conda forge dependency pins ARROW-9534 - [Rust] [DataFusion] Implement functions for creating literal expressions for all types PARQUET-1820 - [C++] Use a column filter hint to inform read prefetching in Arrow reads PARQUET-1843 - [C++] Unnecessary assignment in DictDecoderImpl::Decode PARQUET-1855 - [C++] Improve documentation on MetaData ownership PARQUET-1861 - [Documentation][C++] Explain ReaderProperters.buffer_stream*" />
<meta property="og:description" content="Apache Arrow 1.0.0 (24 July 2020) This is a major release covering more than 3 months of development. Download Source Artifacts Binary Artifacts For CentOS For Debian For Python For Ubuntu Git tag Contributors This release includes 771 commits from 100 distinct contributors. $ git shortlog -sn apache-arrow-0.17.0..apache-arrow-1.0.0 125 Wes McKinney 72 Neal Richardson 61 Antoine Pitrou 56 Sutou Kouhei 55 Krisztián Szűcs 30 Joris Van den Bossche 24 Benjamin Kietzman 23 David Li 21 Kazuaki Ishizaki 19 Qingping Hou 18 Andy Grove 18 François Saint-Jacques 17 Neville Dipale 16 Yibo Cai 16 liyafan82 15 Ryan Murray 14 Micah Kornfield 10 Uwe L. Korn 9 Paddy Horan 8 Jorge C. Leitao 8 Projjal Chanda 7 Frank Du 7 Romain Francois 6 Kenta Murata 6 Maarten A. Breddels 6 Max Burke 5 Mahmut Bulut 4 Mark Hildreth 4 Yuan Zhou 3 Adam Szmigin 3 Raphael Taylor-Davies 3 Zhuo Peng 3 rdettai 2 Bobby Wagner 2 Eric Erhardt 2 Prudhvi Porandla 2 R. Tyler Croy 2 Sagnik Chakraborty 2 Vamsi 2 Yordan Pavlov 2 c-jamie 2 dota17 2 mayuropensource 2 tianchen 2 tianchen92 1 Alex Baden 1 Ben Kimock 1 Bryan Cutler 1 Chad Dougherty 1 Christian Hudon 1 Cuong Nguyen 1 David Anthoff 1 Dmitry Kalinkin 1 Drazen Urch 1 Erin Ryan 1 Eyal Leshem 1 Frank Smith 1 German I Ramirez-Espinoza 1 Hongze Zhang 1 Ilija Puaca 1 Itamar Turner-Trauring 1 Jacek Pliszka 1 Jens Glaser 1 Ji Liu 1 Jörn Horstmann 1 Kyle Brandt 1 Kyle McCarthy 1 Laurent Goujon 1 Li, Jiajia 1 Maarten Breddels 1 Masaki Kozuki 1 Mohamed Zenadi 1 Nick Poorman 1 Pand9 1 Paul Dix 1 Paul Taylor 1 Praveen 1 Robert Borkowski 1 Rok 1 Sergey Fedoseev 1 Shuai Zhang 1 Simon Watts 1 Stéphane Campinas 1 Suvayu Ali 1 Sven Wagner-Boysen 1 TP Boudreau 1 Takuya Kato 1 Tobias Mayer 1 Wakahisa 1 Weston Steimel 1 Yuqi Gu 1 Zachary Gramana 1 Ze&#39;ev Maor 1 alamb 1 alimcmaster1 1 amol 1 chenguoping 1 ritchie46 1 rvernica 1 xuyiming Patch Committers The following Apache committers merged contributed patches to the repository. $ git shortlog -csn apache-arrow-0.17.0..apache-arrow-1.0.0 231 Wes McKinney 85 Antoine Pitrou 74 Krisztián Szűcs 68 Neal Richardson 67 Sutou Kouhei 44 François Saint-Jacques 35 Andy Grove 34 Neville Dipale 21 Benjamin Kietzman 17 GitHub 17 Micah Kornfield 12 Chao Sun 12 Paddy Horan 8 David Li 8 Praveen 7 Joris Van den Bossche 5 Bryan Cutler 5 Eric Erhardt 5 Uwe L. Korn 4 Yosuke Shiro 3 Jacques Nadeau 3 Pindikura Ravindra 3 Sebastien Binet 2 liyafan82 1 Kenta Murata Changelog Apache Arrow 1.0.0 (2020-07-24) Bug Fixes ARROW-1692 - [Python, Java] UnionArray round trip not working ARROW-3329 - [Python] Error casting decimal(38, 4) to int64 ARROW-3861 - [Python] ParquetDataset().read columns argument always returns partition column ARROW-4018 - [C++] RLE decoder may not big-endian compatible ARROW-4309 - [Documentation] Add a docker-compose entry which builds the documentation with CUDA enabled ARROW-4600 - [Ruby] Arrow::DictionaryArray#[] should returns the item in the indices array ARROW-5158 - [Packaging][Wheel] Symlink libraries in wheels ARROW-5310 - [Python] better error message on creating ParquetDataset from empty directory ARROW-5359 - [Python] timestamp_as_object support for pa.Table.to_pandas in pyarrow ARROW-5572 - [Python] raise error message when passing invalid filter in parquet reading ARROW-5666 - [Python] Underscores in partition (string) values are dropped when reading dataset ARROW-5744 - [C++] Do not error in Table::CombineChunks for BinaryArray types that overflow 2GB limit ARROW-5875 - [FlightRPC] Test RPC features in integration tests ARROW-6235 - [R] Conversion from arrow::BinaryArray to R character vector not implemented ARROW-6523 - [C++][Dataset] arrow_dataset target does not depend on anything ARROW-6848 - [C++] Specify -std=c++11 instead of -std=gnu++11 when building ARROW-7018 - [R] Non-UTF-8 data in Arrow &lt;–&gt; R conversion ARROW-7028 - [R] Date roundtrip results in different R storage mode ARROW-7084 - [C++] ArrayRangeEquals should check for full type equality? ARROW-7173 - [Integration] Add test to verify Map field names can be arbitrary ARROW-7208 - [Python] Passing directory to ParquetFile class gives confusing error message ARROW-7273 - [Python] Non-nullable null field is allowed / crashes when writing to parquet ARROW-7480 - [Rust] [DataFusion] Query fails/incorrect when aggregated + grouped columns don’t match the selected columns ARROW-7610 - [Java] Finish support for 64 bit int allocations ARROW-7654 - [Python] Ability to set column_types to a Schema in csv.ConvertOptions is undocumented ARROW-7681 - [Rust] Explicitly seeking a BufReader will discard the internal buffer ARROW-7702 - [C++][Dataset] Provide (optional) deterministic order of batches ARROW-7782 - [Python] Losing index information when using write_to_dataset with partition_cols ARROW-7840 - [Java] [Integration] Java executables fail ARROW-7925 - [C++][Documentation] Instructions about running IWYU and other tasks in cpp/development.rst have gone stale ARROW-7939 - [Python] crashes when reading parquet file compressed with snappy ARROW-7967 - [CI][Crossbow] Pin macOS version in autobrew job to match CRAN ARROW-8050 - [Python][Packaging] Do not include generated Cython source files in wheel packages ARROW-8078 - [Python] Missing links in the docs regarding field and schema DataTypes ARROW-8115 - [Python] Conversion when mixing NaT and datetime objects not working ARROW-8251 - [Python] pandas.ExtensionDtype does not survive round trip with write_to_dataset ARROW-8344 - [C#] StringArray.Builder.Clear() corrupts subsequently-built array contents ARROW-8360 - [C++][Gandiva] Fixes date32 support for date/time functions ARROW-8374 - [R] Table to vector of DictonaryType will error when Arrays don’t have the same Dictionary per array ARROW-8392 - [Java] Fix overflow related corner cases for vector value comparison ARROW-8448 - [Package] Can’t build apt packages with ubuntu-focal ARROW-8455 - [Rust] [Parquet] Arrow column read on partially compatible files ARROW-8455 - [Rust] [Parquet] Arrow column read on partially compatible files ARROW-8471 - [C++][Integration] Regression to /u?int64/ as JSON::number ARROW-8472 - [Go][Integration] Represent 64 bit integers as JSON::string ARROW-8473 - [Rust] “Statistics support” in rust/parquet readme is incorrect ARROW-8480 - [Rust] There is no check for allocation failure ARROW-8503 - [Packaging][deb] Can’t build apache-arrow-archive-keyring for RC ARROW-8505 - [Release][C#] “sourcelink test” is failed by Apache.Arrow.AssemblyInfo.cs ARROW-8508 - [Rust] ListBuilder of FixedSizeListBuilder creates wrong offsets ARROW-8510 - [C++] arrow/dataset/file_base.cc fails to compile with internal compiler error with “Visual Studio 15 2017 Win64” generator ARROW-8511 - [Developer][Release] Windows release verification script does not halt if C++ compilation fails ARROW-8514 - [Developer] Windows wheel verification script does not check Python 3.5 ARROW-8529 - [C++] Fix usage of NextCounts() in GetBatchWithDict[Spaced] ARROW-8535 - [Rust] Arrow crate does not specify arrow-flight version ARROW-8536 - [Rust] Failed to locate format/Flight.proto in any parent directory ARROW-8537 - [C++] Performance regression from ARROW-8523 ARROW-8539 - [CI] “AMD64 MacOS 10.15 GLib &amp; Ruby” fails ARROW-8554 - [C++][Benchmark] Fix building error “cannot bind lvalue” ARROW-8556 - [R] zstd symbol not found if there are multiple installations of zstd ARROW-8566 - [R] error when writing POSIXct to spark ARROW-8568 - [C++][Python] Crash on decimal cast in debug mode ARROW-8577 - [Plasma] PlasmaClient::Connect() of CUDA enabled build is always failed on no CUDA device machine ARROW-8583 - [C++][Doc] Undocumented parameter in Dataset namespace ARROW-8584 - [Packaging][C++] Protobuf link error in deb builds ARROW-8585 - [Packaging][Python] Windows wheels fail to build because of link error ARROW-8586 - [R] installation failure on CentOS 7 ARROW-8587 - [C++] Compilation error when linking arrow-flight-perf-server ARROW-8592 - [C++] Docs still list LLVM 7 as compiler used ARROW-8593 - [C++] Parquet file_serialize_test.cc fails to build with musl libc ARROW-8598 - [Rust] simd_compare_op creates buffer of incorrect length when item count is not a multiple of T::lanes() ARROW-8602 - [CMake] Fix ws2_32 link issue when cross-compiling on Linux ARROW-8603 - [Documentation] Fix Sphinx doxygen comment ARROW-8604 - [R][CI] Update CI to use R 4.0 ARROW-8608 - [C++] Update vendored mpark/variant.h to latest to fix NVCC compilation issues ARROW-8609 - [C++] ORC JNI bridge crashed on null arrow buffer ARROW-8610 - [Rust] DivideByZero when running arrow crate when simd feature is disabled ARROW-8613 - [C++][Dataset] Raise error for unparsable partition value ARROW-8615 - [R] Error better and insist on RandomAccessFile in read_feather ARROW-8617 - [Rust] simd_load_set_invalid does not exist on aarch64 ARROW-8632 - [C++] Fix conversion error warning in array_union_test.cc ARROW-8641 - [Python] Regression in feather: no longer supports permutation in column selection ARROW-8643 - [Python] Tests with pandas master failing due to freq assertion ARROW-8644 - [Python] Dask integration tests failing due to change in not including partition columns ARROW-8646 - [Java] Allow UnionListWriter to write null values ARROW-8649 - [Java] [Website] Java documentation on website is hidden ARROW-8657 - [Python][C++][Parquet] Forward compatibility issue from 0.16 to 0.17 when using version=’2.0’ ARROW-8663 - [Documentation] Small correction to building.rst ARROW-8680 - [Rust] ComplexObjectArrayReader incorrect null value shuffling ARROW-8684 - [Python] “SystemError: Bad call flags in _PyMethodDef_RawFastCallDict” in Python 3.7.7 on macOS when using pyarrow wheel ARROW-8689 - [C++] S3 benchmarks fail linking ARROW-8693 - [Python] Dataset.get_fragments is missing an implicit cast when filtering ARROW-8694 - [Python][Parquet] parquet.read_schema() fails when loading wide table created from Pandas DataFrame ARROW-8701 - [Rust] Unresolved import `crate::compute::util::simd_load_set_invalid` on Raspberry Pi ARROW-8704 - [C++] Fix Parquet crash on invalid input (OSS-Fuzz) ARROW-8705 - [Java] ComplexCopier is skipping null values ARROW-8706 - [C++][Parquet] Tracking JIRA for PARQUET-1857 (unencrypted INT16_MAX Parquet row group limit) ARROW-8710 - [Rust] Continuation marker not written correctly in IPC writer, and stream not flushed ARROW-8722 - [Dev] “archery docker run -e” doesn’t work ARROW-8726 - [C++][Dataset] Mis-specified DirectoryPartitioning incorrectly uses the file name as value ARROW-8728 - [C++] Bitmap operation may cause buffer overflow ARROW-8729 - [C++][Dataset] Only selecting a partition column results in empty table ARROW-8734 - [R] improve nightly build installation ARROW-8741 - [Python][Packaging] Keep VS2015 with for the windows wheels ARROW-8750 - [Python] pyarrow.feather.write_feather does not default to lz4 compression if it’s available ARROW-8768 - [R][CI] Fix nightly as-cran spurious failure ARROW-8775 - [C++][FlightRPC] Integration client doesn’t run integration tests ARROW-8776 - [FlightRPC][C++] Flight/C++ middleware don’t receive headers on failed calls to Java servers ARROW-8798 - [C++] Fix Parquet crashes on invalid input (OSS-Fuzz) ARROW-8799 - [C++][Dataset] Reading list column as nested dictionary segfaults ARROW-8801 - [Python] Memory leak on read from parquet file with UTC timestamps using pandas ARROW-8802 - [C++][Dataset] Schema metadata are lost when reading a subset of columns ARROW-8803 - [Java] Row count should be set before loading buffers in VectorLoader ARROW-8808 - [Rust] Divide by zero in arrays/builder.rs ARROW-8809 - [Rust] schema mismatch in integration test ARROW-8811 - [Java] Fix build on master ARROW-8820 - [C++][Gandiva] fix date_trunc functions to return date types ARROW-8821 - [Rust] nested binary expression with Like, NotLike and Not operator results in type cast error ARROW-8825 - [C++] Cannot compiled pass with Wunused-parameter flag ARROW-8826 - [Crossbow] remote URL should always have .git ARROW-8832 - [Python] AttributeError: module ‘pyarrow.fs’ has no attribute ‘S3FileSystem’ ARROW-8848 - [CI][C/Glib] MinGW build error ARROW-8848 - [CI][C/Glib] MinGW build error ARROW-8858 - [FlightRPC] Ensure headers are uniformly exposed ARROW-8860 - [C++] IPC/Feather decompression broken for nested arrays ARROW-8862 - [C++] NumericBuilder does not use MemoryPool passed to CTOR ARROW-8863 - [C++] Array subclass constructors must set ArrayData::null_count to 0 when there is no validity bitmap ARROW-8869 - [Rust] [DataFusion] Type Coercion optimizer rule does not support new scan nodes ARROW-8871 - [C++] Gandiva build failure ARROW-8872 - [CI] Travis-CI jobs fail (can’t open file ‘ci/detect-changes.py’) ARROW-8874 - [C++][Dataset] Scanner::ToTable race when ScanTask exit early with an error ARROW-8878 - [R] try_download is confused when download.file.method isn’t default ARROW-8882 - [C#] Add .editorconfig to C# code ARROW-8888 - [Python] Heuristic in dataframe_to_arrays that decides to multithread convert cause slow conversions ARROW-8889 - [Python] Python 3.7 SIGSEGV when comparing RecordBatch to None ARROW-8892 - [C++][CI] CI builds for MSVC do not build benchmarks ARROW-8909 - [Java] Out of order writes using setSafe ARROW-8911 - [C++] Slicing a ChunkedArray with zero chunks segfaults ARROW-8924 - [C++][Gandiva] castDATE_date32() may cause overflow ARROW-8925 - [Rust] [DataFusion] CsvExec::schema() returns incorrect results ARROW-8930 - [C++] libz.so linking error with liborc.a ARROW-8932 - [C++] symbol resolution failures with liborc.a ARROW-8946 - [Python] Add tests for parquet.write_metadata metadata_collector ARROW-8948 - [Java][Integration] enable duplicate field names integration tests ARROW-8951 - [C++] Fix compiler warning in compute/kernels/scalar_cast_temporal.cc ARROW-8954 - [Website] ca-certificates should be listed in installation instructions ARROW-8957 - [FlightRPC][C++] Fail to build due to IpcOptions ARROW-8959 - [Rust] Broken build due to new benchmark crate using old API ARROW-8962 - [C++] Linking failure with clang-4.0 ARROW-8968 - [C++][Gandiva] Show link warning message on s390x ARROW-8975 - [FlightRPC][C++] Fix flaky MacOS tests ARROW-8977 - [R] Table$create with schema crashes with some dictionary index types ARROW-8978 - [C++][Compute] “Conditional jump or move depends on uninitialised value(s)” Valgrind warning ARROW-8980 - [Python] Metadata grows exponentially when using schema from disk ARROW-8982 - [CI] Remove allow_failures for s390x in TravisCI ARROW-8986 - [Archery][ursabot] Fix benchmark diff checkout of origin/master ARROW-9000 - [Java] build crashes with JDK14 ARROW-9009 - [C++][Dataset] ARROW:schema should be removed from schema’s metadata when reading Parquet files ARROW-9013 - [C++] Validate enum-style CMake options ARROW-9020 - [Python] read_json won’t respect explicit_schema in parse_options ARROW-9024 - [C++/Python] Install anaconda-client in conda-clean job ARROW-9026 - [C++/Python] Force package removal from arrow-nightlies conda repository ARROW-9037 - [C++][C] unable to import array with null count == -1 (which could be exported) ARROW-9057 - [Rust] Projection should work on InMemoryScan without error ARROW-9059 - [Rust] Documentation for slicing array data has the wrong sign ARROW-9066 - [Python] Raise correct error in isnull() ARROW-9071 - [C++] MakeArrayOfNull makes invalid ListArray ARROW-9077 - [C++] Fix aggregate/scalar-compare benchmark null_percent calculation ARROW-9080 - [C++] arrow::AllocateBuffer returns a Result&lt;unique_ptr&lt;Buffer&gt;&gt; ARROW-9082 - [Rust] - Stream reader fail when steam not ended with (optional) 0xFFFFFFFF 0x00000000” ARROW-9084 - [C++] CMake is unable to find zstd target when ZSTD_SOURCE=SYSTEM ARROW-9085 - [C++][CI] Appveyor CI test failures ARROW-9087 - [C++] Missing HDFS options parsing ARROW-9098 - RecordBatch::ToStructArray cannot handle record batches with 0 column ARROW-9105 - [C++] ParquetFileFragment scanning doesn’t handle filter on partition field ARROW-9120 - [C++] Lint and Format C++ files with “codegen” in file name ARROW-9121 - [C++] Do not wipe the filesystem when path is empty ARROW-9122 - [C++] Adapt ascii_lower/ascii_upper bulk transforms to work on sliced arrays ARROW-9126 - [C++] Trimmed Boost bundle fails to build on Windows ARROW-9127 - [Rust] Update thrift library dependencies ARROW-9134 - [Python] Parquet partitioning degrades Int32 to float64 ARROW-9141 - [R] Update cross-package documentation links ARROW-9142 - [C++] random::RandomArrayGenerator::Boolean “probability” misdocumented / incorrect ARROW-9143 - [C++] RecordBatch::Slice erroneously sets non-nullable field’s internal null_count to unknown ARROW-9146 - [C++][Dataset] Scanning a Fragment with a filter + mismatching schema shouldn’t abort ARROW-9151 - [R][CI] Fix Rtools 4.0 build: pacman sync ARROW-9160 - [C++] Implement string/binary contains for exact matches ARROW-9174 - [Go] Tests panic with 386 (x86) builds ARROW-9183 - [C++] Failed to build arrow-cpp with gcc 4.9.2 ARROW-9184 - [Rust][Datafusion] table scan without projection should return all columns ARROW-9194 - [C++] Array::GetScalar not implemented for decimal type ARROW-9195 - [Java] Wrong usage of Unsafe.get from bytearray in ByteFunctionsHelper class ARROW-9209 - [C++] Benchmarks fail to build ARROW_IPC=OFF and ARROW_BUILD_TESTS=OFF ARROW-9219 - [R] coerce_timestamps in Parquet write options does not work ARROW-9221 - ArrowBuf#setBytes(int, ByteBuffer) doesn’t check the byte buffer’s endianness ARROW-9223 - [Python] Fix to_pandas() export for timestamps within structs ARROW-9230 - [FlightRPC][Python] flight.connect() doesn’t pass through all arguments ARROW-9233 - [C++] is_null on NullArray should be true for all values ARROW-9236 - [Rust] CSV WriterBuilder never writes header ARROW-9237 - [R] 0.17 install on Arch Linux ARROW-9238 - [C++][CI] A few test coverages of round-robin in ipc and flight ARROW-9252 - [Integration] GitHub Actions integration test job does not test against “gold” 0.14.1 files in apache/arrow-testing ARROW-9260 - [CI] “ARM64v8 Ubuntu 20.04 C++” fails ARROW-9260 - [CI] “ARM64v8 Ubuntu 20.04 C++” fails ARROW-9261 - [Python][Packaging] S3FileSystem curl errors in manylinux wheels ARROW-9274 - [Rust] [Integration Testing] Read i64 from json files as strings ARROW-9282 - [R] Remove usage of _EXTPTR_PTR ARROW-9284 - [Java] getMinorTypeForArrowType returns sparse minor type for dense union types ARROW-9288 - [C++][Dataset] Discovery of partition field as dictionary type segfaulting with HivePartitioning ARROW-9297 - [C++][Dataset] Dataset scanner cannot handle large binary column (&gt; 2 GB) ARROW-9298 - [C++] Fix crashes on invalid input (OSS-Fuzz) ARROW-9303 - [R] Linux static build should always bundle dependencies ARROW-9305 - [Python] Dependency load failure in Windows wheel build ARROW-9315 - [Java] Fix the failure of testAllocationManagerType ARROW-9317 - [Java] A few testcases for arrow-memory ARROW-9326 - [Python] Setuptools 49.1.0 appears to break our Python 3.6 builds ARROW-9326 - [Python] Setuptools 49.1.0 appears to break our Python 3.6 builds ARROW-9326 - [Python] Setuptools 49.1.0 appears to break our Python 3.6 builds ARROW-9330 - [C++] Fix crashes on corrupt IPC input (OSS-Fuzz) ARROW-9334 - [Dev][Archery] Push ancestor docker images ARROW-9336 - [Ruby] Creating RecordBatch with structs missing keys results in a malformed table ARROW-9343 - [C++][Gandiva] CastINT/Float functions from string should handle leading/trailing white spaces ARROW-9347 - [Python] Tests fail with latest fsspec ARROW-9350 - [C++][CI] Nightly valgrind job failures ARROW-9351 - [C++][CI] Nightly test-ubuntu-18.04-cpp-cmake32 fails ARROW-9353 - [Python][CI] Nightly dask integration jobs fail ARROW-9354 - [C++] Turbodbc latest fails to build in the integration tests ARROW-9355 - [R] Fix -Wimplicit-int-float-conversion ARROW-9360 - [CI][Crossbow] Nightly homebrew-cpp job times out ARROW-9363 - [C++][Dataset] ParquetDatasetFactory schema: pandas metadata is lost ARROW-9368 - [Python] Rename predicate argument to filter in split_by_row_group() ARROW-9373 - [C++] Fix Parquet crash on invalid input (OSS-Fuzz) ARROW-9380 - [C++] Segfaults in compute::CallFunction ARROW-9384 - [C++] Out-of-memory on invalid IPC input (OSS-Fuzz) ARROW-9385 - [Python] [CI] jpype integration failure ARROW-9389 - [C++] Can’t call isin/match through CallFunction ARROW-9397 - [R] Pass CC/CXX to cmake when building libarrow in Linux build ARROW-9408 - [Integration] Tests do not run in Windows due to numpy 64-bit errors ARROW-9409 - [CI][Crossbow] Nightly conda-r fails ARROW-9410 - [CI][Crossbow] Fix homebrew-cpp again ARROW-9413 - [Rust] Fix clippy lint on master ARROW-9415 - [C++] Arrow does not compile on Power9 ARROW-9416 - [Go] Add test cases for some datatypes ARROW-9417 - [C++][IPC] size in message written in native endian ARROW-9418 - [R] nyc-taxi Parquet files not downloaded in binary mode on Windows ARROW-9419 - [C++] Test that “fill_null” function works with sliced inputs, expand tests ARROW-9428 - [C++] Update documentation for buffer allocation functions ARROW-9436 - [C++][CI] Valgrind errors in fill_null kernel tests ARROW-9438 - [CI] Spark integration tests are failing ARROW-9439 - [C++] Fix crash on invalid IPC input (OSS-Fuzz) ARROW-9440 - [Python] Expose Fill Null Compute Kernel in PyArrow ARROW-9443 - [C++] Bundled bz2 build should only build libbz2 ARROW-9448 - [Java] Circular initialization between ArrowBuf and BaseAllocator leads to null HistoricalLog for empty buffer ARROW-9449 - [R] Strip arrow.so ARROW-9450 - [Python] “pytest pyarrow” takes over 10 seconds to collect tests and start executing ARROW-9456 - [Python] Dataset segfault when not importing pyarrow.parquet ARROW-9458 - [Python] Dataset Scanner is single-threaded only ARROW-9460 - [C++] BinaryContainsExact doesn’t cope with double characters in the pattern ARROW-9461 - [Rust] Reading Date32 and Date64 errors - they are incorrectly converted to RecordBatch ARROW-9476 - [C++][Dataset] HivePartitioning discovery with dictionary types fails for multiple fields ARROW-9486 - [C++][Dataset] Support implicit casting InExpression::set_ to dict ARROW-9497 - [C++][Parquet] Fix failure caused by malformed repetition/definition levels ARROW-9499 - [C++] AdaptiveIntBuilder::AppendNull does not increment the null count ARROW-9500 - [C++] Fix segfault with std::to_string in -O3 builds on gcc 7.5.0 ARROW-9501 - [C++][Gandiva] Add logic in timestampdiff() when end date is last day of a month ARROW-9503 - [Rust] Comparison sliced arrays is wrong ARROW-9504 - [Python] Segmentation fault on ChunkedArray.take ARROW-9506 - [Packaging][Python] Fix macOS wheel build failures ARROW-9512 - [C++] Variadic template unpack inside lambda doesn’t compile with gcc ARROW-9524 - [CI][Gandiva] C++ unit test arrow-ipc-read-write failing in gandiva nightly build ARROW-9527 - [Rust] Remove un-needed dev-dependencies PARQUET-1839 - [C++] values_read not updated in ReadBatchSpaced PARQUET-1857 - [C++][Parquet] ParquetFileReader unable to read files with more than 32767 row groups PARQUET-1865 - [C++] Failure from C++17 feature used in parquet/encoding_benchmark.cc PARQUET-1877 - [C++] Reconcile container size with string size for memory issues PARQUET-1882 - [C++] Writing an all-null column and then reading it with buffered_stream aborts the process New Features and Improvements ARROW-300 - [Format] Add body buffer compression option to IPC message protocol using LZ4 or ZSTD ARROW-842 - [Python] Handle more kinds of null sentinel objects from pandas 0.x ARROW-971 - [C++/Python] Implement Array.isvalid/notnull/isnull as scalar functions ARROW-974 - [Website] Add Use Cases section to the website ARROW-1277 - Completing integration tests for major implemented data types ARROW-1567 - [C++] Implement “fill null” kernels that replace null values with some scalar replacement value ARROW-1570 - [C++] Define API for creating a kernel instance from function of scalar input and output with a particular signature ARROW-1682 - [Python] Add documentation / example for reading a directory of Parquet files on S3 ARROW-1796 - [Python] RowGroup filtering on file level ARROW-2260 - [C++][Plasma] plasma_store should show usage ARROW-2444 - [Python][C++] Better handle reading empty parquet files ARROW-2702 - [Python] Examine usages of Invalid and TypeError errors in numpy_to_arrow.cc to see if we are using the right error type in each instance ARROW-2714 - [C++/Python] Variable step size slicing for arrays ARROW-2912 - [Website] Build more detailed Community landing page a la Apache Spark ARROW-3089 - [Rust] Add ArrayBuilder for different Arrow arrays ARROW-3134 - [C++] Implement n-ary iterator for a collection of chunked arrays with possibly different chunking layouts ARROW-3154 - [Python][C++] Document how to write _metadata, _common_metadata files with Parquet datasets ARROW-3244 - [Python] Multi-file parquet loading without scan ARROW-3275 - [Python] Add documentation about inspecting Parquet file metadata ARROW-3308 - [R] Convert R character vector with data exceeding 2GB to Large type ARROW-3317 - [R] Test/support conversions from data.frame with a single character column exceeding 2GB capacity of BinaryArray ARROW-3446 - [R] Document mapping of Arrow &lt;-&gt; R types ARROW-3509 - [C++] Inconsistent child accessor naming ARROW-3520 - [C++] Implement List Flatten kernel ARROW-3688 - [Rust] Implement PrimitiveArrayBuilder&lt;T&gt;.push_values ARROW-3827 - [Rust] Implement UnionArray ARROW-4022 - [C++] Promote Datum variant out of compute namespace ARROW-4221 - [Format] Add canonical flag in COO sparse index ARROW-4390 - [R] Serialize “labeled” metadata in Feather files, IPC messages ARROW-4412 - [DOCUMENTATION] Add explicit version numbers to the arrow specification documents. ARROW-4427 - [Doc] Move Confluence Wiki pages to the Sphinx docs ARROW-4429 - [Doc] Add git rebase tips to the ‘Contributing’ page in the developer docs ARROW-5035 - [C#] ArrowBuffer.Builder&lt;bool&gt; is broken ARROW-5082 - [Python][Packaging] Reduce size of macOS and manylinux1 wheels ARROW-5143 - [Flight] Enable integration testing of batches with dictionaries ARROW-5279 - [C++] Support reading delta dictionaries in IPC streams ARROW-5377 - [C++] Make IpcPayload public and add GetPayloadSize ARROW-5489 - [C++] Normalize kernels and ChunkedArray behavior ARROW-5548 - [Documentation] http://arrow.apache.org/docs/latest/ is not latest ARROW-5649 - [Integration][C++] Create round trip integration test for extension types ARROW-5708 - [C#] Null support for BooleanArray ARROW-5760 - [C++] Optimize Take implementation ARROW-5854 - [Python] Expose compare kernels on Array class ARROW-6052 - [C++] Divide up arrow/array.h,cc into files in arrow/array/ similar to builder files ARROW-6110 - [Java] Support LargeList Type and add integration test with C++ ARROW-6111 - [Java] Support LargeVarChar and LargeBinary types and add integration test with C++ ARROW-6439 - [R] Implement S3 file-system interface in R ARROW-6456 - [C++] Possible to reduce object code generated in compute/kernels/take.cc? ARROW-6501 - [C++] Remove non_zero_length field from SparseIndex ARROW-6521 - [C++] Add function to arrow:: namespace that returns the current ABI version ARROW-6543 - [R] Support LargeBinary and LargeString types ARROW-6602 - [Doc] Add feature / implementation matrix ARROW-6603 - [C#] ArrayBuilder API to support writing nulls ARROW-6645 - [Python] Faster boundschecking of dictionary indices when converting to Categorical ARROW-6689 - [Rust] [DataFusion] Query execution enhancements for 1.0.0 release ARROW-6691 - [Rust] [DataFusion] Use tokio and Futures instead of spawning threads ARROW-6775 - [C++] [Python] Proposal for several Array utility functions ARROW-6776 - [Python] Need a lite version of pyarrow ARROW-6800 - [C++] Add CMake option to build libraries targeting a C++14 or C++17 toolchain environment ARROW-6839 - [Java] Add APIs to read and write “custom_metadata” field of IPC file footer ARROW-6856 - [C++] Use ArrayData instead of Array for ArrayData::dictionary ARROW-6917 - ARROW-6917: [Archery][Release] Add support for JIRA curation, changelog generation and commit cherry-picking for maintenance releases ARROW-6945 - [Rust] Enable integration tests ARROW-6959 - [C++] Clarify what signatures are preferred for compute kernels ARROW-6978 - [R] Add bindings for sum and mean compute kernels ARROW-6979 - [R] Enable jemalloc in autobrew formula ARROW-6982 - [R] Add bindings for compare and boolean kernels ARROW-7009 - [C++] Refactor filter/take kernels to use Datum instead of overloads ARROW-7010 - [C++] Support lossy casts from decimal128 to float32 and float64/double ARROW-7011 - [C++] Implement casts from float/double to decimal128 ARROW-7012 - [C++] Clarify ChunkedArray chunking strategy and policy ARROW-7068 - [C++] Expose the offsets of a ListArray as a Int32Array ARROW-7075 - [C++] Boolean kernels should not allocate in Call() ARROW-7175 - [Website] Add a security page to track when vulnerabilities are patched ARROW-7229 - [C++] Unify ConcatenateTables APIs ARROW-7230 - [C++] Use vendored std::optional instead of boost::optional in Gandiva ARROW-7237 - [C++] Add Result&lt;T&gt; to APIs to arrow/json ARROW-7243 - [Docs] Add common “implementation status” table to the README of each native language implementation, as well as top level README ARROW-7285 - [C++] ensure C++ implementation meets clarified dictionary spec ARROW-7300 - [C++][Gandiva] Implement functions to cast from strings to integers/floats ARROW-7313 - [C++] Add function for retrieving a scalar from an array slot ARROW-7371 - [GLib] Add Datasets binding ARROW-7375 - [Python] Expose C++ MakeArrayOfNull ARROW-7391 - [Python] Remove unnecessary classes from the binding layer ARROW-7495 - [Java] Remove “empty” concept from ArrowBuf, replace with custom referencemanager ARROW-7605 - [C++] Create and install static library containing all dependencies built by Arrow ARROW-7607 - [C++] Add to cpp/examples minimal examples of using Arrow as a dependency of another CMake project ARROW-7673 - [C++][Dataset] Revisit File discovery failure mode ARROW-7676 - [Packaging][Python] Ensure that the static libraries are not built in the wheel scripts ARROW-7699 - [Java] Support concating dense union vectors in batch ARROW-7705 - [Rust] Initial sort implementation ARROW-7717 - [CI] Have nightly integration test for Spark’s latest release ARROW-7759 - [C++][Dataset] Add CsvFileFormat for CSV support ARROW-7778 - [C++] Support nested dictionaries in JSON integration format ARROW-7784 - [C++] diff.cc is extremely slow to compile ARROW-7801 - [Developer] Add issue_comment workflow to fix lint/style/codegen ARROW-7803 - [R][CI] Autobrew/homebrew tests should not always install from master ARROW-7831 - [Java] unnecessary buffer allocation when calling splitAndTransferTo on variable width vectors ARROW-7831 - [Java] unnecessary buffer allocation when calling splitAndTransferTo on variable width vectors ARROW-7902 - [Integration] Unskip nested dictionary integration tests ARROW-7910 - [C++] Provide function to query page size portably ARROW-7924 - [Rust] Add sort for float types ARROW-7950 - [Python] When initializing pandas API shim, inform user if their installed pandas version is too old ARROW-7955 - [Java] Support large buffer for file/stream IPC ARROW-8020 - [Java] Implement vector validate functionality ARROW-8023 - [Website] Write a blog post about the C data interface ARROW-8025 - [C++] Implement cast to Binary and FixedSizeBinary ARROW-8025 - [C++] Implement cast to Binary and FixedSizeBinary ARROW-8046 - [Developer][Integration] Makefile.docker’s target names are broken ARROW-8062 - [C++][Dataset] Parquet Dataset factory from a _metadata/_common_metadata file ARROW-8065 - [C++][Dataset] Untangle Dataset, Fragment and ScanOptions ARROW-8074 - [C++][Dataset] Support for file-like objects (buffers) in FileSystemDataset? ARROW-8108 - [Java] Extract a common interface for dictionary encoders ARROW-8111 - [C++][CSV] Support MM/DD/YYYY date format ARROW-8114 - [Java][Integration] Enable custom_metadata integration test ARROW-8121 - [Java] Enhance code style checking for Java code (add space after commas, semi-colons and type casts) ARROW-8149 - [C++/Python] Enable CUDA Support in conda recipes ARROW-8157 - [C++][Gandiva] Support building with LLVM 9 ARROW-8162 - [Format][Python] Add serialization for CSF sparse tensors ARROW-8169 - [Java] Improve the performance of JDBC adapter by allocating memory proactively ARROW-8171 - Consider pre-allocating memory for fix-width vector in Avro adapter iterator ARROW-8190 - [C++][Flight] Allow setting IpcWriteOptions and IpcReadOptions in Flight IPC message reader and writer classes ARROW-8229 - [Java] Move ArrowBuf into the Arrow package ARROW-8230 - [Java] Move Netty memory manager into a separate module ARROW-8261 - [Rust] [DataFusion] LogicalPlanBuilder.limit() should take a literal argument ARROW-8263 - [Rust] [DataFusion] Add documentation for supported SQL functions ARROW-8281 - [R] Name collision of arrow.dll on Windows conda ARROW-8283 - [Python][Dataset] Non-existent files are silently dropped in pa.dataset.FileSystemDataset ARROW-8287 - [Rust] Arrow examples should use utility to print results ARROW-8293 - [Python] Run flake8 on python/examples also ARROW-8297 - [FlightRPC][C++] Implement Flight DoExchange for C++ ARROW-8301 - [R] Handle ChunkedArray and Table in C data interface ARROW-8312 - [Java][Gandiva] improve IN expression support ARROW-8314 - [Python] Provide a method to select a subset of columns of a Table ARROW-8318 - [C++][Dataset] Dataset should instantiate Fragment ARROW-8399 - [Rust] Extend memory alignments to include other architectures ARROW-8413 - [C++] Refactor DefLevelsToBitmap ARROW-8422 - [Rust] [Parquet] Implement function to convert Arrow schema to Parquet schema ARROW-8430 - [CI] Configure self-hosted runners for Github Actions ARROW-8434 - [C++] Ipc RecordBatchFileReader deserializes the Schema multiple times ARROW-8440 - [C++] Refine simd header files ARROW-8443 - [Gandiva][C++] Fix round/truncate to no-op for special cases ARROW-8447 - [C++][Dataset] Ensure Scanner::ToTable preserve ordering of ScanTasks ARROW-8467 - [C++] Test cases using ArrayFromJSON assume only a little-endian platform ARROW-8474 - [CI][Crossbow] Skip some nightlies we don’t need to run ARROW-8477 - [C++] Enable reading and writing of long filenames for Windows ARROW-8481 - [Java] Provide an allocation manager based on Unsafe API ARROW-8483 - [Ruby] Arrow::Table documentation improvement ARROW-8485 - [Integration][Java] Implement extension types integration ARROW-8486 - [C++] arrow-utility-test causes failures on a big-endian platform ARROW-8487 - [FlightRPC][C++] Make it possible to target a specific payload size ARROW-8488 - [R] Replace VALUE_OR_STOP with ValueOrStop ARROW-8496 - [C++] Refine ByteStreamSplitDecodeScalar ARROW-8497 - [Archery] Add missing component to builds ARROW-8499 - [C++][Dataset] In ScannerBuilder, batch_size will not work if projecter is not empty ARROW-8500 - [C++] Use selection vectors in Filter implementation for record batches, tables ARROW-8501 - [Packaging][RPM] Upgrade devtoolset to 8 on CentOS 6 ARROW-8502 - [Release][APT][Yum] Ignore all arm64 verifications ARROW-8504 - [C++] Add Run Length Reader ARROW-8506 - [c++] Miss tests to verify expected_buffer with bit_width &gt; 8 in RLE ARROW-8507 - [Release] Detect .git directory automatically in changelog.py ARROW-8509 - [GLib] Add low level record batch read/write functions ARROW-8512 - [C++] Delete unused compute expr prototype code ARROW-8513 - [Python] Expose Take with Table input in Python ARROW-8515 - [C++] Bitmap ToString should have an option of grouping by bytes ARROW-8516 - [Rust] Slow BufferBuilder&lt;BooleanType&gt; inserts within PrimitiveBuilder::append_slice ARROW-8517 - [Developer][Release] Update Crossbow RC verification setup for changes since 0.16.0 ARROW-8520 - [Developer] Use .asf.yaml to direct GitHub notifications to e-mail lists and JIRA ARROW-8521 - [Developer] Group Sub-task, Task, Test, and Wish issue types as “Improvement” in Changelog ARROW-8522 - [Developer] Add environment variable option to toggle whether ephemeral NodeJS is installed in release verification script ARROW-8524 - [CI] Free up space on github actions ARROW-8526 - [Python] Fix non-deterministic row order failure in dataset tests ARROW-8531 - [C++] Deprecate ARROW_USE_SIMD CMake option ARROW-8538 - [Packaging] Remove boost from homebrew formula ARROW-8540 - [C++] Create memory allocation benchmark ARROW-8541 - [Release] Don’t remove previous source releases automatically ARROW-8542 - [Release] Fix checksum url in the website post release script ARROW-8543 - [C++] IO: single pass coalescing algorithm ARROW-8544 - [CI][Crossbow] Add a status.json to the gh-pages summary of nightly builds to get around rate limiting ARROW-8548 - [Website] 0.17 release post ARROW-8549 - [R] Assorted post-0.17 release cleanups ARROW-8550 - [CI] Don’t run cron GHA jobs on forks ARROW-8551 - [CI][Gandiva] Use LLVM 8 to build gandiva linux jar ARROW-8552 - [Rust] support column iteration for parquet row ARROW-8553 - [C++] Optimize unaligned bitmap operations ARROW-8555 - [FlightRPC][Java] Implement Flight DoExchange for Java ARROW-8558 - [Rust] GitHub Actions missing rustfmt ARROW-8559 - [Rust] Consolidate Record Batch reader traits in main arrow crate ARROW-8560 - [Rust] Docs for MutableBuffer resize are incorrect ARROW-8561 - [C++][Gandiva] Stop using deprecated google::protobuf::MessageLite::ByteSize() ARROW-8562 - [C++] IO: Parameterize I/O coalescing using S3 storage metrics ARROW-8563 - [Go] Minor change to make newBuilder public ARROW-8564 - [Website] Add Ubuntu 20.04 LTS to supported package list ARROW-8569 - [CI] Upgrade xcode version for testing homebrew formulae ARROW-8571 - [C++] Switch AppVeyor image to VS 2017 ARROW-8572 - [Python] Expose UnionArray.array and other fields ARROW-8573 - [Rust] Upgrade to Rust 1.44 nightly ARROW-8574 - [Rust] Implement Debug for all plain types ARROW-8575 - [Developer] Add issue_comment workflow to rebase a PR ARROW-8590 - [Rust] Use Arrow pretty print utility in DataFusion ARROW-8591 - [Rust] Reverse lookup for a key in DictionaryArray ARROW-8597 - [Rust] arrow crate lint and readability improvements ARROW-8606 - [CI] Don’t trigger all builds on a change to any file in ci/ ARROW-8607 - [R][CI] Unbreak builds following R 4.0 release ARROW-8611 - [R] Can’t install arrow 0.17 on Ubuntu 18.04 R 3.6.3 ARROW-8612 - [GLib] Add GArrowReadOptions and GArrowWriteOptions ARROW-8616 - [Rust] Turn explicit SIMD off by default ARROW-8619 - [C++] Use distinct Type::type values for interval types ARROW-8622 - [Rust] Parquet crate does not compile on aarch64 ARROW-8623 - [C++][Gandiva] Reduce use of Boost, remove Boost headers from header files ARROW-8624 - [Website] Install page should mention arrow-dataset packages ARROW-8628 - [CI][Dev] Wrap docker-compose commands with archery ARROW-8629 - [Rust] Eliminate indirection of ZST allocations ARROW-8633 - [C++] Add ValidateAscii function ARROW-8634 - [Java] Create an example ARROW-8639 - [C++][Plasma] Require gflags ARROW-8645 - [C++] Missing gflags dependency for plasma ARROW-8647 - [C++][Dataset] Optionally encode partition field values as dictionary type ARROW-8648 - [Rust] Optimize Rust CI Build Times ARROW-8650 - [Rust] [Website] Add documentation to Arrow website ARROW-8651 - [Python][Dataset] Support pickling of Dataset objects ARROW-8655 - [C++][Dataset][Python][R] Preserve partitioning information for a discovered Dataset ARROW-8656 - [Python] Switch to VS2017 in the windows wheel builds ARROW-8659 - [Rust] ListBuilder and FixedSizeListBuilder capacity ARROW-8660 - [C++][Gandiva] Reduce dependence on Boost ARROW-8662 - [CI] Consolidate appveyor scripts ARROW-8664 - [Java] Add skip null check to all Vector types ARROW-8668 - [Packaging][APT][Yum][ARM] Use Travis CI’s ARM machine to build packages ARROW-8669 - [C++] Add IpcWriteOptions argument to GetRecordBatchSize() ARROW-8671 - [C++] Use IPC body compression metadata approved in ARROW-300 ARROW-8671 - [C++] Use IPC body compression metadata approved in ARROW-300 ARROW-8682 - [Ruby][Parquet] Add support for column level compression ARROW-8687 - [Java] Finish move of io.netty.buffer.ArrowBuf ARROW-8690 - [Python] Clean-up dataset+parquet tests now order is determinstic ARROW-8692 - [C++] Avoid memory copies when downloading from S3 ARROW-8695 - [Java] remove references to PlatformDependent in memory module ARROW-8696 - [Java] Convert tests to integration tests ARROW-8699 - [R] Fix automatic r_to_py conversion ARROW-8702 - [Packaging][C#] Build NuGet packages in release process ARROW-8703 - [R] schema$metadata should be properly typed ARROW-8707 - [CI] Docker push fails because of wrong dockerhub credentials ARROW-8708 - [CI] Utilize github actions cache for docker-compose volumes ARROW-8711 - [Python] Expose strptime timestamp parsing in read_csv conversion options ARROW-8717 - [CI][Packaging] Add build dependency on boost to homebrew ARROW-8720 - [C++] Fix checked_pointer_cast ARROW-8721 - [CI] Fix R build matrix ARROW-8723 - [Rust] Remove SIMD specific benchmark code ARROW-8724 - [Packaging][deb][RPM] Use directory in host as build directory ARROW-8725 - [Rust] redundant directory walk in rust parquet datasource code ARROW-8727 - [C++] Do not require struct-initialization of StringConverter&lt;T&gt; to parse strings to other types ARROW-8730 - [Rust] Use slice instead of &amp;Vec for function arguments ARROW-8733 - [C++][Dataset][Python] ParquetFileFragment should provide access to parquet FileMetadata ARROW-8736 - [Rust] [DataFusion] Table API should provide a schema() method ARROW-8740 - [CI] Fix archery option in pandas master cron test ARROW-8742 - [C++][Python] Add flight client support for Mutual TLS ARROW-8743 - [C++][CI] Add a test job on s390x ARROW-8744 - [Rust] ParquetIterator’s next method should be safe to call even after reached end of iteration ARROW-8745 - [C++] Bitmap.ToString causes failures on a big-endian platform ARROW-8747 - [C++] Feather tests with compression cause failure on big-endian platforms ARROW-8751 - [Rust] ParquetFileArrowReader should be able to read empty parquet file without error ARROW-8752 - [Rust] Remove unused hashmap ARROW-8753 - [C++][CI] Add a test job on ARM ARROW-8754 - [C++][CI] enable tests for additional components on big-endian platforms ARROW-8756 - [C++] Bitmap word tests cause failures on a big-endian platform ARROW-8757 - [C++] Plasma header is written in native endian ARROW-8758 - [R] Updates for compatibility with dplyr 1.0 ARROW-8759 - [C++] TestPlasmaSerialization.DeleteReply tests failure on big-endian platforms ARROW-8762 - [C++][Gandiva] Replace Gandiva’s BitmapAnd with common implementation ARROW-8763 - [C++] Create RandomAccessFile::WillNeed-like API ARROW-8764 - [C++] Make ThreadPool configurable in ReadRangeCache ARROW-8766 - [Python] A FileSystem implementation based on Python callbacks ARROW-8769 - [C++] Add convenience methods to access fields by name in StructScalar ARROW-8770 - [C++][CI] enable arrow-csv-test on s390x ARROW-8772 - [C++] Expand SumKernel benchmark to more types ARROW-8777 - [Rust] Parquet.rs does not support reading fixed-size binary fields. ARROW-8778 - [C++][Gandiva] SelectionVector related test failed on big-endian platforms ARROW-8779 - [R] Implement conversion to List&lt;Struct&gt; ARROW-8781 - [CI][C++] Enable ccache on GHA MinGW jobs ARROW-8782 - [Rust] [DataFusion] Add benchmarks based on NYC Taxi data set ARROW-8783 - [Rust] [DataFusion] Logical plan should have ParquetScan and CsvScan entries ARROW-8784 - [Rust] [DataFusion] Remove use of Arc from LogicalPlan ARROW-8785 - [Python][Packaging] Build the windows wheels with MIMALLOC enabled ARROW-8786 - [Packaging][rpm] Use bundled zstd in the CentOS 8 build ARROW-8788 - [C#] Array builders to use bit-packed buffer builder rather than boolean array builder for validity map ARROW-8789 - [Rust] Add separate crate for integration test binaries ARROW-8790 - [C++][CI] Enable arrow-flight-test on s390x ARROW-8791 - [Rust] Creating StringDictionaryBuilder with existing dictionary values ARROW-8792 - [C++] Improved declarative compute function / kernel development framework, normalize calling conventions ARROW-8793 - [C++] BitUtil::SetBitsTo probably doesn’t need to be inline ARROW-8794 - [C++] Expand benchmark coverage for arrow from parquet reading ARROW-8795 - [C++] Limited iOS support ARROW-8800 - [C++] Split arrow::ChunkedArray into arrow/chunked_array.h ARROW-8804 - [R][CI] Followup to Rtools40 upgrade ARROW-8814 - [Dev][Release] Binary upload script keeps raising locale warnings ARROW-8815 - [Dev][Release] Binary upload script should retry on unexpected bintray request error ARROW-8818 - [Rust] Failing to build on master due to Flatbuffers/Union issues ARROW-8822 - [Rust] [DataFusion] Add MemoryScan variant to LogicalPlan ARROW-8827 - [Integration Testing] Initial skeleton for Rust integration tests ARROW-8830 - [GLib] Add support for Tell againt not seekable GIO output stream ARROW-8831 - [Rust] incomplete SIMD implementation in simd_compare_op ARROW-8833 - [Rust] Implement VALIDATE mode in integration test binary ARROW-8834 - [Rust] Implement arrow-file-to-stream for integration testing ARROW-8835 - [Rust] Implement arrow-stream-to-file for integration testing ARROW-8836 - [Website] Update copyright end year automatically ARROW-8837 - [Rust] Add Null type ARROW-8838 - [Rust] File reader fails to read header from valid files ARROW-8839 - [Rust] datafusion logical plan should support scaning csv without provided schema ARROW-8840 - [Rust] datafusion ExecutionError should implement std::error:Error trait ARROW-8841 - [C++] Add benchmark and unittest for PLAIN spaced ARROW-8843 - [C++] Optimize BitmapEquals unaligned case ARROW-8844 - [C++] Optimize TransferBitmap unaligned case ARROW-8846 - [Dev][Python] Autoformat Python sources with Archery ARROW-8847 - [C++] Pass task size / metrics in Executor API ARROW-8851 - [Python][Documentation] Fix FutureWarnings in Python Plasma docs ARROW-8852 - [R] Post-0.17.1 adjustments ARROW-8854 - [Rust] [Integration Testing] Show output from arrow-json-integration-test ARROW-8855 - [Rust] [Integration Testing] data type Date32(Day) not supported ARROW-8856 - [Rust] [Integration Testing] Return empty batch if MessageHeader is NONE ARROW-8864 - [R] Add methods to Table/RecordBatch for consistency with data.frame ARROW-8866 - [C++] Split Type::UNION into Type::SPARSE_UNION and Type::DENSE_UNION ARROW-8867 - [R] Support converting POSIXlt type ARROW-8875 - [C++] use AWS SDK SetResponseStreamFactory to avoid a copy of bytes ARROW-8877 - [Rust] add CSV read option struct to simplify datafusion interface ARROW-8880 - [R][Linux] Make R Binary Install Friendlier ARROW-8881 - [Rust] Add large list and binary support ARROW-8885 - [R] Don’t include everything everywhere ARROW-8886 - [C#] Decide and implement appropriate behaviour for Array builder resize to negative size ARROW-8887 - [Java] Buffer size for complex vectors increases rapidly in case of clear/write loop ARROW-8890 - [R] Fix C++ lint issue ARROW-8895 - [C++] Add C++ unit tests for filter and take functions on temporal type inputs, including timestamps ARROW-8896 - [C++] Reimplement dictionary unpacking in Cast kernels using Take ARROW-8899 - [R] Add R metadata like pandas metadata for round-trip fidelity ARROW-8901 - [C++] Reduce number of take kernels ARROW-8903 - [C++] Implement optimized “unsafe take” for use with selection vectors for kernel execution ARROW-8904 - [Python] Fix usages of deprecated C++ APIs related to child/field ARROW-8906 - [Rust] Support reading multiple CSV files for schema inference ARROW-8907 - [Rust] implement scalar comparison operations ARROW-8912 - [Ruby] Keep reference of Arrow::Buffer’s data for GC ARROW-8913 - [Ruby] Use “field” instead of “child” ARROW-8914 - [C++][Gandiva] Decimal128 related test failed on big-endian platforms ARROW-8915 - [Dev][Archery] Require Click 7 ARROW-8917 - [C++][Compute] Formalize “metafunction” concept ARROW-8918 - [C++] Add cast “metafunction” to FunctionRegistry that addresses dispatching to appropriate type-specific CastFunction ARROW-8922 - [C++] Implement example string scalar kernel function to assist with string kernels buildout per ARROW-555 ARROW-8923 - [C++] Improve usability of arrow::compute::CallFunction by moving ExecContext* argument to end and adding default ARROW-8926 - [C++] Improve docstrings in new public APIs in arrow/compute and fix miscellaneous typos ARROW-8927 - [C++] Support dictionary memos when reading/writing record batches using cuda IPC ARROW-8929 - [C++] Change compute::Arity:VarArgs min_args default to 0 ARROW-8931 - [Rust] Support lexical sort in arrow compute kernel ARROW-8933 - [C++] Reduce generated code in vector_hash.cc ARROW-8934 - [C++] Add timestamp subtract kernel aliased to int64 subtract implementation ARROW-8937 - [C++] Add “parse_strptime” function for string to timestamp conversions using the kernels framework ARROW-8938 - [R] Provide binding for arrow::compute::CallFunction ARROW-8940 - [Java] Fix the performance degradation of integration tests ARROW-8941 - [C++/Python] arrow-nightlies conda repository is full ARROW-8942 - [R] Detect compression in reading CSV/JSON ARROW-8943 - [C++][Dataset] Add support for Partitioning to ParquetDatasetFactory ARROW-8950 - [C++] Make head optional in s3fs ARROW-8958 - [FlightRPC][Python] Implement Flight DoExchange for Python ARROW-8960 - [MINOR] [FORMAT] Fix typos in comments ARROW-8961 - [C++] Add utf8proc library to toolchain ARROW-8963 - [C++][Parquet] Parquet cpp optimize allocate memory ARROW-8965 - [Python][Documentation] Pyarrow documentation for pip nightlies references 404’d location ARROW-8966 - [C++] Move arrow::ArrayData to a separate header file ARROW-8969 - [C++] Reduce generated code in compute/kernels/scalar_compare.cc ARROW-8970 - [C++] Reduce shared library / binary code size (umbrella issue) ARROW-8972 - [Java] Support range value comparison for large varchar/varbinary vectors ARROW-8973 - [Java] Support batch value appending for large varchar/varbinary vectors ARROW-8974 - [C++] Refine TransferBitmap template parameters ARROW-8976 - [C++] compute::CallFunction can’t Filter/Take with ChunkedArray ARROW-8979 - [C++] Implement bitmap word reader and writer ARROW-8984 - [R] Revise install guides now that Windows conda package exists ARROW-8985 - [Format] Add “byte width” field with default of 16 to Decimal Flatbuffers type for forward compatibility ARROW-8989 - [C++] Document available functions in compute::FunctionRegistry ARROW-8993 - [Rust] Support reading non-seekable sources in text readers ARROW-8994 - [C++] Disable include-what-you-use cpplint lint checks ARROW-8996 - [C++] Runtime SIMD path for Aggregate Sum/Mean kernel ARROW-8997 - [Archery] Benchmark formatter should have friendly units ARROW-9004 - [C++][Gandiva] Support building with LLVM 10 ARROW-9005 - [Rust] [DataFusion] Support sort expression ARROW-9007 - [Rust] Support appending arrays by merging array data ARROW-9014 - [Packaging] Bump the minor part of the automatically generated version in crossbow ARROW-9015 - [Java] Make BaseAllocator package private ARROW-9016 - [Java] Remove direct references to Netty/Unsafe Allocators ARROW-9017 - [Python] Refactor the Scalar classes ARROW-9018 - [C++] Remove APIs that were deprecated in 0.17.x and prior ARROW-9021 - [Python] The filesystem keyword in parquet.read_table is not documented ARROW-9022 - [C++] Add/Sub/Mul arithmetic kernels with overflow check ARROW-9029 - [C++] Implement BitBlockCounter interface for blockwise popcounts of validity bitmaps ARROW-9030 - [Python] Clean up some usages of pyarrow.compat, move some common functions/symbols to lib.pyx ARROW-9031 - [R] Implement conversion from Type::UINT64 to R vector ARROW-9032 - [C++] Split arrow/util/bit_util.h into multiple header files ARROW-9034 - [C++] Implement binary (two bitmap) version of BitBlockCounter ARROW-9042 - [C++] Add Subtract and Multiply arithmetic kernels with wrap-around behavior ARROW-9043 - [Go] Temporarily copy LICENSE.txt to go/ ARROW-9043 - [Go] Temporarily copy LICENSE.txt to go/ ARROW-9045 - [C++] Improve and expand Take/Filter benchmarks ARROW-9046 - [C++][R] Put more things in type_fwds ARROW-9047 - [Rust] Setting 0-bits of a 0-length bitset segfaults ARROW-9050 - [Release] Use 1.0.0 as the next version ARROW-9051 - [GLib] Refer Array related objects from Array ARROW-9052 - [CI][MinGW] Enable Gandiva ARROW-9055 - [C++] Add sum/mean kernels for Boolean type ARROW-9058 - [Packaging][wheel] Boost download is failed ARROW-9060 - [GLib] Add support for building Apache Arrow Datasets GLib with non-installed Apache Arrow Datasets ARROW-9061 - [Packaging][APT][Yum][GLib] Add Apache Arrow Datasets GLib ARROW-9062 - [Rust] Support to read JSON into dictionary type ARROW-9067 - [C++] Create reusable branchless / vectorized index boundschecking functions ARROW-9070 - [C++] StructScalar needs field accessor methods ARROW-9073 - [C++] RapidJSON include directory detection doesn’t work with RapidJSONConfig.cmake ARROW-9074 - [GLib] Add missing arrow-json check ARROW-9075 - [C++] Optimize Filter implementation ARROW-9079 - [C++] Write benchmark for arithmetic kernels ARROW-9083 - [R] collect int64, uint32, uint64 as R integer type if not out of bounds ARROW-9086 - [CI][Homebrew] Enable Gandiva ARROW-9088 - [Rust] Recent version of arrow crate does not compile into wasm target ARROW-9089 - [Python] A PyFileSystem handler for fsspec-based filesystems ARROW-9090 - [C++] Bump versions of bundled libraries ARROW-9091 - [C++] Utilize function’s default options when passing no options to CallFunction for a function that requires them ARROW-9093 - [FlightRPC][C++][Python] Allow setting gRPC client options ARROW-9094 - [Python] Bump versions of compiled dependencies in manylinux wheels ARROW-9095 - [Rust] Fix NullArray to comply with spec ARROW-9099 - [C++][Gandiva] Add TRIM function for string ARROW-9100 - [C++] Add ascii_lower kernel ARROW-9101 - [Doc][C++][Python] Document encoding expected by CSV and JSON readers ARROW-9102 - [Packaging] Upload built manylinux docker images ARROW-9106 - [C++] Add C++ foundation to ease file transcoding ARROW-9108 - [C++][Dataset] Add Parquet Statistics conversion for timestamp columns ARROW-9109 - [Python][Packaging] Enable S3 support in manylinux wheels ARROW-9110 - [C++] Fix CPU cache size detection on macOS ARROW-9112 - [R] Update autobrew script location ARROW-9115 - [C++] Process data buffers in batch in ascii_lower / ascii_upper kernels rather than using string_view value iteration ARROW-9116 - [C++] Add BinaryArray::total_values_length() ARROW-9116 - [C++] Add BinaryArray::total_values_length() ARROW-9118 - [C++] Add more general BoundsCheck function that also checks for arbitrary lower limits in integer arrays ARROW-9119 - [C++] Add support for building with system static gRPC ARROW-9123 - [Python][wheel] Use libzstd.a explicitly ARROW-9124 - [Rust][Datafusion] DFParser should consume sql query as &amp;str instead of String ARROW-9125 - [C++] Add missing include for arrow::internal::ZeroMemory() for Valgrind ARROW-9129 - [Python][JPype] Test is failed with JPype 0.7.5 ARROW-9130 - [Python] Add deprecated wrappers functions to a pyarrow/compat.py module for 1.0.0 that will be removed later ARROW-9131 - [C++] Faster ascii_lower and ascii_upper ARROW-9132 - [C++] Implement hash kernels for dictionary data with constant dictionaries ARROW-9133 - [C++] Add utf8_upper and utf8_lower ARROW-9137 - [GLib][Ruby] Allow to read Parquet files in chunks (by RowGroup) ARROW-9138 - [Docs][Format] Make sure format version is hard coded in the docs ARROW-9139 - [Python] parquet read_table should not use_legacy_dataset ARROW-9144 - [CI] OSS-Fuzz build fails because recent changes in the google repository ARROW-9145 - [C++] Add true_count / false_count methods to BooleanArray ARROW-9152 - [C++] Create specialized filter implementation for varbinary types ARROW-9153 - [Python] Add bindings for StructScalar ARROW-9154 - [Developer] Use GitHub issue templates better ARROW-9155 - [Archery] Less precise but faster default settings for “archery benchmark diff” ARROW-9156 - [C++] Reducing the code size of the tensor module ARROW-9157 - [Rust][Datafusion] execution context’s create_physical_plan should take self as immutable reference ARROW-9158 - [Rust][Datafusion] Projection physical plan compilation should preserve nullability ARROW-9159 - [Python] Expose the isnull/isvalid kernels ARROW-9162 - [Python] Expose Add/Subtract/Multiply arithmetic kernels ARROW-9163 - [C++] Add methods to StringArray, LargeStringArray, to validate whether its values are all UTF-8 ARROW-9166 - [Website] Add overview page ARROW-9167 - [Doc][Website] /docs/c_glib/index.html is overwritten ARROW-9168 - [C++][Flight] allow flight benchmark to use separated TCP connections ARROW-9173 - [C++] Document how to use Arrow from a third-party CMake project ARROW-9175 - [FlightRPC][C++][Python] Expose connected peer ARROW-9176 - [Rust] Fix for memory leaks in Arrow allocator ARROW-9178 - [R] Improve documentation about CSV reader ARROW-9179 - [R] Replace usage of iris dataset in tests ARROW-9180 - [Developer] Remove usage of whitelist, blacklist, slave, etc. ARROW-9181 - [C++] Instantiate fewer templates in Cast kernel implementation ARROW-9182 - [C++] Use “applicator” namespace for kernel operator-to-kernel functors, streamline argument unboxing ARROW-9185 - [C++] [Java][Gandiva] Make llvm build optimisation configurable from java ARROW-9188 - [C++] Do not always statically link Brotli libraries ARROW-9189 - [Website] Improve contributor guide ARROW-9190 - [Website][C++] Add blog post on efforts to make building lighter and easier ARROW-9191 - [Rust] Do not panic when int96 milliseconds are negative ARROW-9192 - [CI][Rust] Add support for running clippy ARROW-9193 - [C++] Add method to parse date from null-terminated string ARROW-9197 - [C++] Revamp numeric casts: faster performance and reduced binary size ARROW-9201 - [Archery] Render-human readable table when using “archery benchmark diff” ARROW-9202 - [GLib] Add GArrowDatum ARROW-9203 - [Packaging][deb] Add missing gir1.2-arrow-dataset-1.0.install ARROW-9204 - [C++][Flight] change records_per_stream to int64 in flight benchmark ARROW-9205 - [Documentation] Fix typos in Columnar.rst ARROW-9206 - [C++][Flight] measure latency in flight benchmark ARROW-9207 - [Python][Dataset] Clean-up internal FileSource class ARROW-9210 - [C++] Use OptionalBitBlockCounter in ArrayDataInlineVisitor ARROW-9214 - [C++] Avoid util::optional in favor of separate inlineable functions in arrow/visitor_inline.h ARROW-9216 - [C++][Parquet] Use BitBlockCounter for plain spaced encoding/decoding ARROW-9217 - [C++][Parquet] Cover 0.01% null for the plain spaced encoding/decoding benchmark ARROW-9220 - [C++] Disable relevant compute kernels if ARROW_WITH_UTF8PROC=OFF ARROW-9222 - [Format][Proposal] Remove validity bitmap from Union types ARROW-9224 - [Dev][Archery] Copy local repo on clone failure ARROW-9225 - [C++][Compute] Improve counting sort ARROW-9231 - [Format] Increment MetadataVersion from V4 to V5 ARROW-9234 - [GLib][CUDA] Add support for dictionary memo on reading record batch from buffer ARROW-9241 - [C++] Add forward compatibility checks for Decimal::bitWidth ARROW-9242 - [Java] Add forward compatibility checks for Decimal::bitWidth ARROW-9247 - [Python] Expose BinaryArray::total_values_length in bindings ARROW-9248 - [C++] Add “list_size” function that returns Int32Array/Int64Array giving list cell sizes ARROW-9249 - [C++] Implement “list_parent_indices” vector function ARROW-9250 - [C++] Compact generated code in compute/kernels/scalar_set_lookup.cc using same method as vector_hash.cc ARROW-9251 - [C++] Move JSON testing code for integration tests to libarrow_testing ARROW-9254 - [C++] Factor out some integer casting internals so it can be reused with temporal casts ARROW-9255 - [C++] Use CMake to build bundled Protobuf with CMake &gt;= 3.7 ARROW-9256 - [C++] Incorrect variable name ARROW_CXX_FLAGS ARROW-9258 - [Format] Add V5 MetadataVersion ARROW-9259 - [Format] Permit unsigned dictionary indices in Columnar.rst ARROW-9262 - [Packaging][Linux][CI] Use Ubuntu 18.04 to build ARM64 packages on Travis CI ARROW-9263 - [C++] Benchmark: promote RegressionSetArgs size to L2 ARROW-9264 - [C++] Cleanup Parquet Arrow Schema code ARROW-9265 - [C++] Add support for writing MetadataVersion::V4-compatible IPC messages for compatibility with library versions &lt;= 0.17.1 ARROW-9268 - [C++] Add is{alnum,alpha,…} kernels for strings ARROW-9272 - [C++][Python] Reduce complexity in python to arrow conversion ARROW-9276 - [Dev] Enable ARROW_CUDA when generating API documentations ARROW-9277 - [C++] Fix documentation of Reading CSV files ARROW-9278 - [C++] Implement Union validity bitmap changes from ARROW-9222 ARROW-9280 - [Rust] Write statistics to Parquet files ARROW-9281 - [R] Turn off utf8proc in R builds ARROW-9283 - [Python] Expose C++ build info ARROW-9287 - [C++] Implement support for unsigned dictionary indices ARROW-9289 - [R] Remove deprecated functions ARROW-9290 - [Rust] [Parquet] Add features to allow opting out of dependencies ARROW-9291 - [R] Support fixed size binary/list types ARROW-9292 - [Rust] Update feature matrix with passing tests ARROW-9294 - [GLib] Add GArrowFunction ARROW-9300 - [Java] Separate Netty Memory to its own module ARROW-9306 - [Ruby] Add support for Arrow::RecordBatch.new(raw_table) ARROW-9307 - [Ruby] Add Arrow::RecordBatchIterator#to_a ARROW-9308 - [Format] Add Feature enum to schema.fbs for forward compatibity ARROW-9316 - [C++] Use “Dataset” instead of “Datasets” ARROW-9321 - [C++][Dataset] Allow to “collect” statistics for ParquetFragment row groups if not constructed from _metadata ARROW-9322 - [R] Dataset documentation polishing ARROW-9323 - [Ruby] Add Red Arrow Dataset ARROW-9327 - Fix all clippy errors for arrow crate ARROW-9328 - [C++][Gandiva] Add LTRIM, RTRIM, BTRIM functions for string ARROW-9329 - [C++][Gandiva] Implement castTimestampToDate function ARROW-9331 - [C++] Improve the performance of Tensor-to-SparseTensor conversion ARROW-9333 - [Python] Expose more IPC write options in Python ARROW-9335 - [Website] Update website for 1.0 ARROW-9337 - [R] On C++ library build failure, give an unambiguous message ARROW-9339 - [Rust] Comments on SIMD in Arrow README are incorrect ARROW-9340 - [R] Use CRAN version of decor package ARROW-9341 - [GLib] Use arrow::Datum version Take() ARROW-9345 - [C++][Dataset] Expression with dictionary type should work with operand of value type ARROW-9346 - [C++][Python][Dataset] Add total_byte_size metadata to RowGroupInfo ARROW-9362 - [Java] Add support for writing MetadataVersion::V4-compatible IPC messages for compatibility with library versions &lt;= 0.17.1 ARROW-9365 - [Go] Implement the rest of the typed array builders in NewBuilder ARROW-9370 - [Java] Bump Netty version ARROW-9374 - [C++][Python] Expose MakeArrayFromScalar ARROW-9379 - [Rust] Support unsigned dictionary indices ARROW-9383 - [Python] Support fsspec filesystems in Dataset API through fs handler ARROW-9386 - [Rust] RecordBatch.schema() should not return &amp;Arc&lt;Schema&gt; ARROW-9390 - [C++] Review compute function names ARROW-9390 - [C++] Review compute function names ARROW-9390 - [C++] Review compute function names ARROW-9391 - [Rust] Float32 values interpreted as zero when record batch has one row ARROW-9393 - [Doc] update supported types documentation for Java ARROW-9395 - [Python] Provide configurable MetadataVersion in IPC API and environment variable to set default to V4 when needed ARROW-9399 - [C++] Add forward compatibility checks for unrecognized future MetadataVersion ARROW-9403 - [Python] add .tolist as alias of .to_pylist ARROW-9407 - [Python] Accept pd.NA as missing value in array constructor ARROW-9411 - [Rust] Update dependencies ARROW-9424 - [C++][Parquet] Disable writing files with LZ4 codec ARROW-9425 - [Rust][DataFusion] Make ExecutionContext sharable between threads ARROW-9427 - [Rust][DataFusion] Add pub fn ExecutionContext.tables() ARROW-9437 - [Python][Packaging] Homebrew fails to install build dependencies in the macOS wheel builds ARROW-9442 - [Python] Do not force Validate() to be called in pyarrow_wrap_table ARROW-9445 - [Python] Revert Array.equals changes + expose comparison ops in compute ARROW-9446 - [C++] Export compiler information in BuildInfo ARROW-9447 - [Rust][DataFusion] Allow closures as ScalarUDFs ARROW-9452 - [Rust] [DateFusion] Improve performance of parquet scan ARROW-9470 - [CI][Java] Run Maven in parallel ARROW-9472 - [R] Provide configurable MetadataVersion in IPC API and environment variable to set default to V4 when needed ARROW-9473 - [Doc] Polishing for 1.0 ARROW-9478 - [C++] Improve error message on unsupported cast types ARROW-9484 - [Docs] Update is* functions to be is_* in the compute docs ARROW-9485 - [R] Better shared library stripping ARROW-9493 - [Python][Dataset] Dictionary encode string partition columns by default ARROW-9508 - [Release][APT][Yum] Enable verification for arm64 binaries ARROW-9509 - [Release] Don’t test Gandiva in the windows wheel verification script ARROW-9511 - [Packaging][Release] Set conda packages’ build number to 0 ARROW-9519 - [Rust] Improve error message when getting a field by name from schema ARROW-9529 - [Dev][Release] Improvements to release verification scripts ARROW-9531 - [Packaging][Release] Update conda forge dependency pins ARROW-9534 - [Rust] [DataFusion] Implement functions for creating literal expressions for all types PARQUET-1820 - [C++] Use a column filter hint to inform read prefetching in Arrow reads PARQUET-1843 - [C++] Unnecessary assignment in DictDecoderImpl::Decode PARQUET-1855 - [C++] Improve documentation on MetaData ownership PARQUET-1861 - [Documentation][C++] Explain ReaderProperters.buffer_stream*" />
<link rel="canonical" href="https://arrow.apache.org/release/1.0.0.html" />
<meta property="og:url" content="https://arrow.apache.org/release/1.0.0.html" />
<meta property="og:site_name" content="Apache Arrow" />
<meta property="og:image" content="https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png" />
<meta property="og:type" content="article" />
<meta property="article:published_time" content="2024-04-26T05:50:33-04:00" />
<meta name="twitter:card" content="summary_large_image" />
<meta property="twitter:image" content="https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png" />
<meta property="twitter:title" content="Apache Arrow 1.0.0 Release" />
<meta name="twitter:site" content="@ApacheArrow" />
<script type="application/ld+json">
{"@context":"https://schema.org","@type":"BlogPosting","dateModified":"2024-04-26T05:50:33-04:00","datePublished":"2024-04-26T05:50:33-04:00","description":"Apache Arrow 1.0.0 (24 July 2020) This is a major release covering more than 3 months of development. Download Source Artifacts Binary Artifacts For CentOS For Debian For Python For Ubuntu Git tag Contributors This release includes 771 commits from 100 distinct contributors. $ git shortlog -sn apache-arrow-0.17.0..apache-arrow-1.0.0 125 Wes McKinney 72 Neal Richardson 61 Antoine Pitrou 56 Sutou Kouhei 55 Krisztián Szűcs 30 Joris Van den Bossche 24 Benjamin Kietzman 23 David Li 21 Kazuaki Ishizaki 19 Qingping Hou 18 Andy Grove 18 François Saint-Jacques 17 Neville Dipale 16 Yibo Cai 16 liyafan82 15 Ryan Murray 14 Micah Kornfield 10 Uwe L. Korn 9 Paddy Horan 8 Jorge C. Leitao 8 Projjal Chanda 7 Frank Du 7 Romain Francois 6 Kenta Murata 6 Maarten A. Breddels 6 Max Burke 5 Mahmut Bulut 4 Mark Hildreth 4 Yuan Zhou 3 Adam Szmigin 3 Raphael Taylor-Davies 3 Zhuo Peng 3 rdettai 2 Bobby Wagner 2 Eric Erhardt 2 Prudhvi Porandla 2 R. Tyler Croy 2 Sagnik Chakraborty 2 Vamsi 2 Yordan Pavlov 2 c-jamie 2 dota17 2 mayuropensource 2 tianchen 2 tianchen92 1 Alex Baden 1 Ben Kimock 1 Bryan Cutler 1 Chad Dougherty 1 Christian Hudon 1 Cuong Nguyen 1 David Anthoff 1 Dmitry Kalinkin 1 Drazen Urch 1 Erin Ryan 1 Eyal Leshem 1 Frank Smith 1 German I Ramirez-Espinoza 1 Hongze Zhang 1 Ilija Puaca 1 Itamar Turner-Trauring 1 Jacek Pliszka 1 Jens Glaser 1 Ji Liu 1 Jörn Horstmann 1 Kyle Brandt 1 Kyle McCarthy 1 Laurent Goujon 1 Li, Jiajia 1 Maarten Breddels 1 Masaki Kozuki 1 Mohamed Zenadi 1 Nick Poorman 1 Pand9 1 Paul Dix 1 Paul Taylor 1 Praveen 1 Robert Borkowski 1 Rok 1 Sergey Fedoseev 1 Shuai Zhang 1 Simon Watts 1 Stéphane Campinas 1 Suvayu Ali 1 Sven Wagner-Boysen 1 TP Boudreau 1 Takuya Kato 1 Tobias Mayer 1 Wakahisa 1 Weston Steimel 1 Yuqi Gu 1 Zachary Gramana 1 Ze&#39;ev Maor 1 alamb 1 alimcmaster1 1 amol 1 chenguoping 1 ritchie46 1 rvernica 1 xuyiming Patch Committers The following Apache committers merged contributed patches to the repository. $ git shortlog -csn apache-arrow-0.17.0..apache-arrow-1.0.0 231 Wes McKinney 85 Antoine Pitrou 74 Krisztián Szűcs 68 Neal Richardson 67 Sutou Kouhei 44 François Saint-Jacques 35 Andy Grove 34 Neville Dipale 21 Benjamin Kietzman 17 GitHub 17 Micah Kornfield 12 Chao Sun 12 Paddy Horan 8 David Li 8 Praveen 7 Joris Van den Bossche 5 Bryan Cutler 5 Eric Erhardt 5 Uwe L. Korn 4 Yosuke Shiro 3 Jacques Nadeau 3 Pindikura Ravindra 3 Sebastien Binet 2 liyafan82 1 Kenta Murata Changelog Apache Arrow 1.0.0 (2020-07-24) Bug Fixes ARROW-1692 - [Python, Java] UnionArray round trip not working ARROW-3329 - [Python] Error casting decimal(38, 4) to int64 ARROW-3861 - [Python] ParquetDataset().read columns argument always returns partition column ARROW-4018 - [C++] RLE decoder may not big-endian compatible ARROW-4309 - [Documentation] Add a docker-compose entry which builds the documentation with CUDA enabled ARROW-4600 - [Ruby] Arrow::DictionaryArray#[] should returns the item in the indices array ARROW-5158 - [Packaging][Wheel] Symlink libraries in wheels ARROW-5310 - [Python] better error message on creating ParquetDataset from empty directory ARROW-5359 - [Python] timestamp_as_object support for pa.Table.to_pandas in pyarrow ARROW-5572 - [Python] raise error message when passing invalid filter in parquet reading ARROW-5666 - [Python] Underscores in partition (string) values are dropped when reading dataset ARROW-5744 - [C++] Do not error in Table::CombineChunks for BinaryArray types that overflow 2GB limit ARROW-5875 - [FlightRPC] Test RPC features in integration tests ARROW-6235 - [R] Conversion from arrow::BinaryArray to R character vector not implemented ARROW-6523 - [C++][Dataset] arrow_dataset target does not depend on anything ARROW-6848 - [C++] Specify -std=c++11 instead of -std=gnu++11 when building ARROW-7018 - [R] Non-UTF-8 data in Arrow &lt;–&gt; R conversion ARROW-7028 - [R] Date roundtrip results in different R storage mode ARROW-7084 - [C++] ArrayRangeEquals should check for full type equality? ARROW-7173 - [Integration] Add test to verify Map field names can be arbitrary ARROW-7208 - [Python] Passing directory to ParquetFile class gives confusing error message ARROW-7273 - [Python] Non-nullable null field is allowed / crashes when writing to parquet ARROW-7480 - [Rust] [DataFusion] Query fails/incorrect when aggregated + grouped columns don’t match the selected columns ARROW-7610 - [Java] Finish support for 64 bit int allocations ARROW-7654 - [Python] Ability to set column_types to a Schema in csv.ConvertOptions is undocumented ARROW-7681 - [Rust] Explicitly seeking a BufReader will discard the internal buffer ARROW-7702 - [C++][Dataset] Provide (optional) deterministic order of batches ARROW-7782 - [Python] Losing index information when using write_to_dataset with partition_cols ARROW-7840 - [Java] [Integration] Java executables fail ARROW-7925 - [C++][Documentation] Instructions about running IWYU and other tasks in cpp/development.rst have gone stale ARROW-7939 - [Python] crashes when reading parquet file compressed with snappy ARROW-7967 - [CI][Crossbow] Pin macOS version in autobrew job to match CRAN ARROW-8050 - [Python][Packaging] Do not include generated Cython source files in wheel packages ARROW-8078 - [Python] Missing links in the docs regarding field and schema DataTypes ARROW-8115 - [Python] Conversion when mixing NaT and datetime objects not working ARROW-8251 - [Python] pandas.ExtensionDtype does not survive round trip with write_to_dataset ARROW-8344 - [C#] StringArray.Builder.Clear() corrupts subsequently-built array contents ARROW-8360 - [C++][Gandiva] Fixes date32 support for date/time functions ARROW-8374 - [R] Table to vector of DictonaryType will error when Arrays don’t have the same Dictionary per array ARROW-8392 - [Java] Fix overflow related corner cases for vector value comparison ARROW-8448 - [Package] Can’t build apt packages with ubuntu-focal ARROW-8455 - [Rust] [Parquet] Arrow column read on partially compatible files ARROW-8455 - [Rust] [Parquet] Arrow column read on partially compatible files ARROW-8471 - [C++][Integration] Regression to /u?int64/ as JSON::number ARROW-8472 - [Go][Integration] Represent 64 bit integers as JSON::string ARROW-8473 - [Rust] “Statistics support” in rust/parquet readme is incorrect ARROW-8480 - [Rust] There is no check for allocation failure ARROW-8503 - [Packaging][deb] Can’t build apache-arrow-archive-keyring for RC ARROW-8505 - [Release][C#] “sourcelink test” is failed by Apache.Arrow.AssemblyInfo.cs ARROW-8508 - [Rust] ListBuilder of FixedSizeListBuilder creates wrong offsets ARROW-8510 - [C++] arrow/dataset/file_base.cc fails to compile with internal compiler error with “Visual Studio 15 2017 Win64” generator ARROW-8511 - [Developer][Release] Windows release verification script does not halt if C++ compilation fails ARROW-8514 - [Developer] Windows wheel verification script does not check Python 3.5 ARROW-8529 - [C++] Fix usage of NextCounts() in GetBatchWithDict[Spaced] ARROW-8535 - [Rust] Arrow crate does not specify arrow-flight version ARROW-8536 - [Rust] Failed to locate format/Flight.proto in any parent directory ARROW-8537 - [C++] Performance regression from ARROW-8523 ARROW-8539 - [CI] “AMD64 MacOS 10.15 GLib &amp; Ruby” fails ARROW-8554 - [C++][Benchmark] Fix building error “cannot bind lvalue” ARROW-8556 - [R] zstd symbol not found if there are multiple installations of zstd ARROW-8566 - [R] error when writing POSIXct to spark ARROW-8568 - [C++][Python] Crash on decimal cast in debug mode ARROW-8577 - [Plasma] PlasmaClient::Connect() of CUDA enabled build is always failed on no CUDA device machine ARROW-8583 - [C++][Doc] Undocumented parameter in Dataset namespace ARROW-8584 - [Packaging][C++] Protobuf link error in deb builds ARROW-8585 - [Packaging][Python] Windows wheels fail to build because of link error ARROW-8586 - [R] installation failure on CentOS 7 ARROW-8587 - [C++] Compilation error when linking arrow-flight-perf-server ARROW-8592 - [C++] Docs still list LLVM 7 as compiler used ARROW-8593 - [C++] Parquet file_serialize_test.cc fails to build with musl libc ARROW-8598 - [Rust] simd_compare_op creates buffer of incorrect length when item count is not a multiple of T::lanes() ARROW-8602 - [CMake] Fix ws2_32 link issue when cross-compiling on Linux ARROW-8603 - [Documentation] Fix Sphinx doxygen comment ARROW-8604 - [R][CI] Update CI to use R 4.0 ARROW-8608 - [C++] Update vendored mpark/variant.h to latest to fix NVCC compilation issues ARROW-8609 - [C++] ORC JNI bridge crashed on null arrow buffer ARROW-8610 - [Rust] DivideByZero when running arrow crate when simd feature is disabled ARROW-8613 - [C++][Dataset] Raise error for unparsable partition value ARROW-8615 - [R] Error better and insist on RandomAccessFile in read_feather ARROW-8617 - [Rust] simd_load_set_invalid does not exist on aarch64 ARROW-8632 - [C++] Fix conversion error warning in array_union_test.cc ARROW-8641 - [Python] Regression in feather: no longer supports permutation in column selection ARROW-8643 - [Python] Tests with pandas master failing due to freq assertion ARROW-8644 - [Python] Dask integration tests failing due to change in not including partition columns ARROW-8646 - [Java] Allow UnionListWriter to write null values ARROW-8649 - [Java] [Website] Java documentation on website is hidden ARROW-8657 - [Python][C++][Parquet] Forward compatibility issue from 0.16 to 0.17 when using version=’2.0’ ARROW-8663 - [Documentation] Small correction to building.rst ARROW-8680 - [Rust] ComplexObjectArrayReader incorrect null value shuffling ARROW-8684 - [Python] “SystemError: Bad call flags in _PyMethodDef_RawFastCallDict” in Python 3.7.7 on macOS when using pyarrow wheel ARROW-8689 - [C++] S3 benchmarks fail linking ARROW-8693 - [Python] Dataset.get_fragments is missing an implicit cast when filtering ARROW-8694 - [Python][Parquet] parquet.read_schema() fails when loading wide table created from Pandas DataFrame ARROW-8701 - [Rust] Unresolved import `crate::compute::util::simd_load_set_invalid` on Raspberry Pi ARROW-8704 - [C++] Fix Parquet crash on invalid input (OSS-Fuzz) ARROW-8705 - [Java] ComplexCopier is skipping null values ARROW-8706 - [C++][Parquet] Tracking JIRA for PARQUET-1857 (unencrypted INT16_MAX Parquet row group limit) ARROW-8710 - [Rust] Continuation marker not written correctly in IPC writer, and stream not flushed ARROW-8722 - [Dev] “archery docker run -e” doesn’t work ARROW-8726 - [C++][Dataset] Mis-specified DirectoryPartitioning incorrectly uses the file name as value ARROW-8728 - [C++] Bitmap operation may cause buffer overflow ARROW-8729 - [C++][Dataset] Only selecting a partition column results in empty table ARROW-8734 - [R] improve nightly build installation ARROW-8741 - [Python][Packaging] Keep VS2015 with for the windows wheels ARROW-8750 - [Python] pyarrow.feather.write_feather does not default to lz4 compression if it’s available ARROW-8768 - [R][CI] Fix nightly as-cran spurious failure ARROW-8775 - [C++][FlightRPC] Integration client doesn’t run integration tests ARROW-8776 - [FlightRPC][C++] Flight/C++ middleware don’t receive headers on failed calls to Java servers ARROW-8798 - [C++] Fix Parquet crashes on invalid input (OSS-Fuzz) ARROW-8799 - [C++][Dataset] Reading list column as nested dictionary segfaults ARROW-8801 - [Python] Memory leak on read from parquet file with UTC timestamps using pandas ARROW-8802 - [C++][Dataset] Schema metadata are lost when reading a subset of columns ARROW-8803 - [Java] Row count should be set before loading buffers in VectorLoader ARROW-8808 - [Rust] Divide by zero in arrays/builder.rs ARROW-8809 - [Rust] schema mismatch in integration test ARROW-8811 - [Java] Fix build on master ARROW-8820 - [C++][Gandiva] fix date_trunc functions to return date types ARROW-8821 - [Rust] nested binary expression with Like, NotLike and Not operator results in type cast error ARROW-8825 - [C++] Cannot compiled pass with Wunused-parameter flag ARROW-8826 - [Crossbow] remote URL should always have .git ARROW-8832 - [Python] AttributeError: module ‘pyarrow.fs’ has no attribute ‘S3FileSystem’ ARROW-8848 - [CI][C/Glib] MinGW build error ARROW-8848 - [CI][C/Glib] MinGW build error ARROW-8858 - [FlightRPC] Ensure headers are uniformly exposed ARROW-8860 - [C++] IPC/Feather decompression broken for nested arrays ARROW-8862 - [C++] NumericBuilder does not use MemoryPool passed to CTOR ARROW-8863 - [C++] Array subclass constructors must set ArrayData::null_count to 0 when there is no validity bitmap ARROW-8869 - [Rust] [DataFusion] Type Coercion optimizer rule does not support new scan nodes ARROW-8871 - [C++] Gandiva build failure ARROW-8872 - [CI] Travis-CI jobs fail (can’t open file ‘ci/detect-changes.py’) ARROW-8874 - [C++][Dataset] Scanner::ToTable race when ScanTask exit early with an error ARROW-8878 - [R] try_download is confused when download.file.method isn’t default ARROW-8882 - [C#] Add .editorconfig to C# code ARROW-8888 - [Python] Heuristic in dataframe_to_arrays that decides to multithread convert cause slow conversions ARROW-8889 - [Python] Python 3.7 SIGSEGV when comparing RecordBatch to None ARROW-8892 - [C++][CI] CI builds for MSVC do not build benchmarks ARROW-8909 - [Java] Out of order writes using setSafe ARROW-8911 - [C++] Slicing a ChunkedArray with zero chunks segfaults ARROW-8924 - [C++][Gandiva] castDATE_date32() may cause overflow ARROW-8925 - [Rust] [DataFusion] CsvExec::schema() returns incorrect results ARROW-8930 - [C++] libz.so linking error with liborc.a ARROW-8932 - [C++] symbol resolution failures with liborc.a ARROW-8946 - [Python] Add tests for parquet.write_metadata metadata_collector ARROW-8948 - [Java][Integration] enable duplicate field names integration tests ARROW-8951 - [C++] Fix compiler warning in compute/kernels/scalar_cast_temporal.cc ARROW-8954 - [Website] ca-certificates should be listed in installation instructions ARROW-8957 - [FlightRPC][C++] Fail to build due to IpcOptions ARROW-8959 - [Rust] Broken build due to new benchmark crate using old API ARROW-8962 - [C++] Linking failure with clang-4.0 ARROW-8968 - [C++][Gandiva] Show link warning message on s390x ARROW-8975 - [FlightRPC][C++] Fix flaky MacOS tests ARROW-8977 - [R] Table$create with schema crashes with some dictionary index types ARROW-8978 - [C++][Compute] “Conditional jump or move depends on uninitialised value(s)” Valgrind warning ARROW-8980 - [Python] Metadata grows exponentially when using schema from disk ARROW-8982 - [CI] Remove allow_failures for s390x in TravisCI ARROW-8986 - [Archery][ursabot] Fix benchmark diff checkout of origin/master ARROW-9000 - [Java] build crashes with JDK14 ARROW-9009 - [C++][Dataset] ARROW:schema should be removed from schema’s metadata when reading Parquet files ARROW-9013 - [C++] Validate enum-style CMake options ARROW-9020 - [Python] read_json won’t respect explicit_schema in parse_options ARROW-9024 - [C++/Python] Install anaconda-client in conda-clean job ARROW-9026 - [C++/Python] Force package removal from arrow-nightlies conda repository ARROW-9037 - [C++][C] unable to import array with null count == -1 (which could be exported) ARROW-9057 - [Rust] Projection should work on InMemoryScan without error ARROW-9059 - [Rust] Documentation for slicing array data has the wrong sign ARROW-9066 - [Python] Raise correct error in isnull() ARROW-9071 - [C++] MakeArrayOfNull makes invalid ListArray ARROW-9077 - [C++] Fix aggregate/scalar-compare benchmark null_percent calculation ARROW-9080 - [C++] arrow::AllocateBuffer returns a Result&lt;unique_ptr&lt;Buffer&gt;&gt; ARROW-9082 - [Rust] - Stream reader fail when steam not ended with (optional) 0xFFFFFFFF 0x00000000” ARROW-9084 - [C++] CMake is unable to find zstd target when ZSTD_SOURCE=SYSTEM ARROW-9085 - [C++][CI] Appveyor CI test failures ARROW-9087 - [C++] Missing HDFS options parsing ARROW-9098 - RecordBatch::ToStructArray cannot handle record batches with 0 column ARROW-9105 - [C++] ParquetFileFragment scanning doesn’t handle filter on partition field ARROW-9120 - [C++] Lint and Format C++ files with “codegen” in file name ARROW-9121 - [C++] Do not wipe the filesystem when path is empty ARROW-9122 - [C++] Adapt ascii_lower/ascii_upper bulk transforms to work on sliced arrays ARROW-9126 - [C++] Trimmed Boost bundle fails to build on Windows ARROW-9127 - [Rust] Update thrift library dependencies ARROW-9134 - [Python] Parquet partitioning degrades Int32 to float64 ARROW-9141 - [R] Update cross-package documentation links ARROW-9142 - [C++] random::RandomArrayGenerator::Boolean “probability” misdocumented / incorrect ARROW-9143 - [C++] RecordBatch::Slice erroneously sets non-nullable field’s internal null_count to unknown ARROW-9146 - [C++][Dataset] Scanning a Fragment with a filter + mismatching schema shouldn’t abort ARROW-9151 - [R][CI] Fix Rtools 4.0 build: pacman sync ARROW-9160 - [C++] Implement string/binary contains for exact matches ARROW-9174 - [Go] Tests panic with 386 (x86) builds ARROW-9183 - [C++] Failed to build arrow-cpp with gcc 4.9.2 ARROW-9184 - [Rust][Datafusion] table scan without projection should return all columns ARROW-9194 - [C++] Array::GetScalar not implemented for decimal type ARROW-9195 - [Java] Wrong usage of Unsafe.get from bytearray in ByteFunctionsHelper class ARROW-9209 - [C++] Benchmarks fail to build ARROW_IPC=OFF and ARROW_BUILD_TESTS=OFF ARROW-9219 - [R] coerce_timestamps in Parquet write options does not work ARROW-9221 - ArrowBuf#setBytes(int, ByteBuffer) doesn’t check the byte buffer’s endianness ARROW-9223 - [Python] Fix to_pandas() export for timestamps within structs ARROW-9230 - [FlightRPC][Python] flight.connect() doesn’t pass through all arguments ARROW-9233 - [C++] is_null on NullArray should be true for all values ARROW-9236 - [Rust] CSV WriterBuilder never writes header ARROW-9237 - [R] 0.17 install on Arch Linux ARROW-9238 - [C++][CI] A few test coverages of round-robin in ipc and flight ARROW-9252 - [Integration] GitHub Actions integration test job does not test against “gold” 0.14.1 files in apache/arrow-testing ARROW-9260 - [CI] “ARM64v8 Ubuntu 20.04 C++” fails ARROW-9260 - [CI] “ARM64v8 Ubuntu 20.04 C++” fails ARROW-9261 - [Python][Packaging] S3FileSystem curl errors in manylinux wheels ARROW-9274 - [Rust] [Integration Testing] Read i64 from json files as strings ARROW-9282 - [R] Remove usage of _EXTPTR_PTR ARROW-9284 - [Java] getMinorTypeForArrowType returns sparse minor type for dense union types ARROW-9288 - [C++][Dataset] Discovery of partition field as dictionary type segfaulting with HivePartitioning ARROW-9297 - [C++][Dataset] Dataset scanner cannot handle large binary column (&gt; 2 GB) ARROW-9298 - [C++] Fix crashes on invalid input (OSS-Fuzz) ARROW-9303 - [R] Linux static build should always bundle dependencies ARROW-9305 - [Python] Dependency load failure in Windows wheel build ARROW-9315 - [Java] Fix the failure of testAllocationManagerType ARROW-9317 - [Java] A few testcases for arrow-memory ARROW-9326 - [Python] Setuptools 49.1.0 appears to break our Python 3.6 builds ARROW-9326 - [Python] Setuptools 49.1.0 appears to break our Python 3.6 builds ARROW-9326 - [Python] Setuptools 49.1.0 appears to break our Python 3.6 builds ARROW-9330 - [C++] Fix crashes on corrupt IPC input (OSS-Fuzz) ARROW-9334 - [Dev][Archery] Push ancestor docker images ARROW-9336 - [Ruby] Creating RecordBatch with structs missing keys results in a malformed table ARROW-9343 - [C++][Gandiva] CastINT/Float functions from string should handle leading/trailing white spaces ARROW-9347 - [Python] Tests fail with latest fsspec ARROW-9350 - [C++][CI] Nightly valgrind job failures ARROW-9351 - [C++][CI] Nightly test-ubuntu-18.04-cpp-cmake32 fails ARROW-9353 - [Python][CI] Nightly dask integration jobs fail ARROW-9354 - [C++] Turbodbc latest fails to build in the integration tests ARROW-9355 - [R] Fix -Wimplicit-int-float-conversion ARROW-9360 - [CI][Crossbow] Nightly homebrew-cpp job times out ARROW-9363 - [C++][Dataset] ParquetDatasetFactory schema: pandas metadata is lost ARROW-9368 - [Python] Rename predicate argument to filter in split_by_row_group() ARROW-9373 - [C++] Fix Parquet crash on invalid input (OSS-Fuzz) ARROW-9380 - [C++] Segfaults in compute::CallFunction ARROW-9384 - [C++] Out-of-memory on invalid IPC input (OSS-Fuzz) ARROW-9385 - [Python] [CI] jpype integration failure ARROW-9389 - [C++] Can’t call isin/match through CallFunction ARROW-9397 - [R] Pass CC/CXX to cmake when building libarrow in Linux build ARROW-9408 - [Integration] Tests do not run in Windows due to numpy 64-bit errors ARROW-9409 - [CI][Crossbow] Nightly conda-r fails ARROW-9410 - [CI][Crossbow] Fix homebrew-cpp again ARROW-9413 - [Rust] Fix clippy lint on master ARROW-9415 - [C++] Arrow does not compile on Power9 ARROW-9416 - [Go] Add test cases for some datatypes ARROW-9417 - [C++][IPC] size in message written in native endian ARROW-9418 - [R] nyc-taxi Parquet files not downloaded in binary mode on Windows ARROW-9419 - [C++] Test that “fill_null” function works with sliced inputs, expand tests ARROW-9428 - [C++] Update documentation for buffer allocation functions ARROW-9436 - [C++][CI] Valgrind errors in fill_null kernel tests ARROW-9438 - [CI] Spark integration tests are failing ARROW-9439 - [C++] Fix crash on invalid IPC input (OSS-Fuzz) ARROW-9440 - [Python] Expose Fill Null Compute Kernel in PyArrow ARROW-9443 - [C++] Bundled bz2 build should only build libbz2 ARROW-9448 - [Java] Circular initialization between ArrowBuf and BaseAllocator leads to null HistoricalLog for empty buffer ARROW-9449 - [R] Strip arrow.so ARROW-9450 - [Python] “pytest pyarrow” takes over 10 seconds to collect tests and start executing ARROW-9456 - [Python] Dataset segfault when not importing pyarrow.parquet ARROW-9458 - [Python] Dataset Scanner is single-threaded only ARROW-9460 - [C++] BinaryContainsExact doesn’t cope with double characters in the pattern ARROW-9461 - [Rust] Reading Date32 and Date64 errors - they are incorrectly converted to RecordBatch ARROW-9476 - [C++][Dataset] HivePartitioning discovery with dictionary types fails for multiple fields ARROW-9486 - [C++][Dataset] Support implicit casting InExpression::set_ to dict ARROW-9497 - [C++][Parquet] Fix failure caused by malformed repetition/definition levels ARROW-9499 - [C++] AdaptiveIntBuilder::AppendNull does not increment the null count ARROW-9500 - [C++] Fix segfault with std::to_string in -O3 builds on gcc 7.5.0 ARROW-9501 - [C++][Gandiva] Add logic in timestampdiff() when end date is last day of a month ARROW-9503 - [Rust] Comparison sliced arrays is wrong ARROW-9504 - [Python] Segmentation fault on ChunkedArray.take ARROW-9506 - [Packaging][Python] Fix macOS wheel build failures ARROW-9512 - [C++] Variadic template unpack inside lambda doesn’t compile with gcc ARROW-9524 - [CI][Gandiva] C++ unit test arrow-ipc-read-write failing in gandiva nightly build ARROW-9527 - [Rust] Remove un-needed dev-dependencies PARQUET-1839 - [C++] values_read not updated in ReadBatchSpaced PARQUET-1857 - [C++][Parquet] ParquetFileReader unable to read files with more than 32767 row groups PARQUET-1865 - [C++] Failure from C++17 feature used in parquet/encoding_benchmark.cc PARQUET-1877 - [C++] Reconcile container size with string size for memory issues PARQUET-1882 - [C++] Writing an all-null column and then reading it with buffered_stream aborts the process New Features and Improvements ARROW-300 - [Format] Add body buffer compression option to IPC message protocol using LZ4 or ZSTD ARROW-842 - [Python] Handle more kinds of null sentinel objects from pandas 0.x ARROW-971 - [C++/Python] Implement Array.isvalid/notnull/isnull as scalar functions ARROW-974 - [Website] Add Use Cases section to the website ARROW-1277 - Completing integration tests for major implemented data types ARROW-1567 - [C++] Implement “fill null” kernels that replace null values with some scalar replacement value ARROW-1570 - [C++] Define API for creating a kernel instance from function of scalar input and output with a particular signature ARROW-1682 - [Python] Add documentation / example for reading a directory of Parquet files on S3 ARROW-1796 - [Python] RowGroup filtering on file level ARROW-2260 - [C++][Plasma] plasma_store should show usage ARROW-2444 - [Python][C++] Better handle reading empty parquet files ARROW-2702 - [Python] Examine usages of Invalid and TypeError errors in numpy_to_arrow.cc to see if we are using the right error type in each instance ARROW-2714 - [C++/Python] Variable step size slicing for arrays ARROW-2912 - [Website] Build more detailed Community landing page a la Apache Spark ARROW-3089 - [Rust] Add ArrayBuilder for different Arrow arrays ARROW-3134 - [C++] Implement n-ary iterator for a collection of chunked arrays with possibly different chunking layouts ARROW-3154 - [Python][C++] Document how to write _metadata, _common_metadata files with Parquet datasets ARROW-3244 - [Python] Multi-file parquet loading without scan ARROW-3275 - [Python] Add documentation about inspecting Parquet file metadata ARROW-3308 - [R] Convert R character vector with data exceeding 2GB to Large type ARROW-3317 - [R] Test/support conversions from data.frame with a single character column exceeding 2GB capacity of BinaryArray ARROW-3446 - [R] Document mapping of Arrow &lt;-&gt; R types ARROW-3509 - [C++] Inconsistent child accessor naming ARROW-3520 - [C++] Implement List Flatten kernel ARROW-3688 - [Rust] Implement PrimitiveArrayBuilder&lt;T&gt;.push_values ARROW-3827 - [Rust] Implement UnionArray ARROW-4022 - [C++] Promote Datum variant out of compute namespace ARROW-4221 - [Format] Add canonical flag in COO sparse index ARROW-4390 - [R] Serialize “labeled” metadata in Feather files, IPC messages ARROW-4412 - [DOCUMENTATION] Add explicit version numbers to the arrow specification documents. ARROW-4427 - [Doc] Move Confluence Wiki pages to the Sphinx docs ARROW-4429 - [Doc] Add git rebase tips to the ‘Contributing’ page in the developer docs ARROW-5035 - [C#] ArrowBuffer.Builder&lt;bool&gt; is broken ARROW-5082 - [Python][Packaging] Reduce size of macOS and manylinux1 wheels ARROW-5143 - [Flight] Enable integration testing of batches with dictionaries ARROW-5279 - [C++] Support reading delta dictionaries in IPC streams ARROW-5377 - [C++] Make IpcPayload public and add GetPayloadSize ARROW-5489 - [C++] Normalize kernels and ChunkedArray behavior ARROW-5548 - [Documentation] http://arrow.apache.org/docs/latest/ is not latest ARROW-5649 - [Integration][C++] Create round trip integration test for extension types ARROW-5708 - [C#] Null support for BooleanArray ARROW-5760 - [C++] Optimize Take implementation ARROW-5854 - [Python] Expose compare kernels on Array class ARROW-6052 - [C++] Divide up arrow/array.h,cc into files in arrow/array/ similar to builder files ARROW-6110 - [Java] Support LargeList Type and add integration test with C++ ARROW-6111 - [Java] Support LargeVarChar and LargeBinary types and add integration test with C++ ARROW-6439 - [R] Implement S3 file-system interface in R ARROW-6456 - [C++] Possible to reduce object code generated in compute/kernels/take.cc? ARROW-6501 - [C++] Remove non_zero_length field from SparseIndex ARROW-6521 - [C++] Add function to arrow:: namespace that returns the current ABI version ARROW-6543 - [R] Support LargeBinary and LargeString types ARROW-6602 - [Doc] Add feature / implementation matrix ARROW-6603 - [C#] ArrayBuilder API to support writing nulls ARROW-6645 - [Python] Faster boundschecking of dictionary indices when converting to Categorical ARROW-6689 - [Rust] [DataFusion] Query execution enhancements for 1.0.0 release ARROW-6691 - [Rust] [DataFusion] Use tokio and Futures instead of spawning threads ARROW-6775 - [C++] [Python] Proposal for several Array utility functions ARROW-6776 - [Python] Need a lite version of pyarrow ARROW-6800 - [C++] Add CMake option to build libraries targeting a C++14 or C++17 toolchain environment ARROW-6839 - [Java] Add APIs to read and write “custom_metadata” field of IPC file footer ARROW-6856 - [C++] Use ArrayData instead of Array for ArrayData::dictionary ARROW-6917 - ARROW-6917: [Archery][Release] Add support for JIRA curation, changelog generation and commit cherry-picking for maintenance releases ARROW-6945 - [Rust] Enable integration tests ARROW-6959 - [C++] Clarify what signatures are preferred for compute kernels ARROW-6978 - [R] Add bindings for sum and mean compute kernels ARROW-6979 - [R] Enable jemalloc in autobrew formula ARROW-6982 - [R] Add bindings for compare and boolean kernels ARROW-7009 - [C++] Refactor filter/take kernels to use Datum instead of overloads ARROW-7010 - [C++] Support lossy casts from decimal128 to float32 and float64/double ARROW-7011 - [C++] Implement casts from float/double to decimal128 ARROW-7012 - [C++] Clarify ChunkedArray chunking strategy and policy ARROW-7068 - [C++] Expose the offsets of a ListArray as a Int32Array ARROW-7075 - [C++] Boolean kernels should not allocate in Call() ARROW-7175 - [Website] Add a security page to track when vulnerabilities are patched ARROW-7229 - [C++] Unify ConcatenateTables APIs ARROW-7230 - [C++] Use vendored std::optional instead of boost::optional in Gandiva ARROW-7237 - [C++] Add Result&lt;T&gt; to APIs to arrow/json ARROW-7243 - [Docs] Add common “implementation status” table to the README of each native language implementation, as well as top level README ARROW-7285 - [C++] ensure C++ implementation meets clarified dictionary spec ARROW-7300 - [C++][Gandiva] Implement functions to cast from strings to integers/floats ARROW-7313 - [C++] Add function for retrieving a scalar from an array slot ARROW-7371 - [GLib] Add Datasets binding ARROW-7375 - [Python] Expose C++ MakeArrayOfNull ARROW-7391 - [Python] Remove unnecessary classes from the binding layer ARROW-7495 - [Java] Remove “empty” concept from ArrowBuf, replace with custom referencemanager ARROW-7605 - [C++] Create and install static library containing all dependencies built by Arrow ARROW-7607 - [C++] Add to cpp/examples minimal examples of using Arrow as a dependency of another CMake project ARROW-7673 - [C++][Dataset] Revisit File discovery failure mode ARROW-7676 - [Packaging][Python] Ensure that the static libraries are not built in the wheel scripts ARROW-7699 - [Java] Support concating dense union vectors in batch ARROW-7705 - [Rust] Initial sort implementation ARROW-7717 - [CI] Have nightly integration test for Spark’s latest release ARROW-7759 - [C++][Dataset] Add CsvFileFormat for CSV support ARROW-7778 - [C++] Support nested dictionaries in JSON integration format ARROW-7784 - [C++] diff.cc is extremely slow to compile ARROW-7801 - [Developer] Add issue_comment workflow to fix lint/style/codegen ARROW-7803 - [R][CI] Autobrew/homebrew tests should not always install from master ARROW-7831 - [Java] unnecessary buffer allocation when calling splitAndTransferTo on variable width vectors ARROW-7831 - [Java] unnecessary buffer allocation when calling splitAndTransferTo on variable width vectors ARROW-7902 - [Integration] Unskip nested dictionary integration tests ARROW-7910 - [C++] Provide function to query page size portably ARROW-7924 - [Rust] Add sort for float types ARROW-7950 - [Python] When initializing pandas API shim, inform user if their installed pandas version is too old ARROW-7955 - [Java] Support large buffer for file/stream IPC ARROW-8020 - [Java] Implement vector validate functionality ARROW-8023 - [Website] Write a blog post about the C data interface ARROW-8025 - [C++] Implement cast to Binary and FixedSizeBinary ARROW-8025 - [C++] Implement cast to Binary and FixedSizeBinary ARROW-8046 - [Developer][Integration] Makefile.docker’s target names are broken ARROW-8062 - [C++][Dataset] Parquet Dataset factory from a _metadata/_common_metadata file ARROW-8065 - [C++][Dataset] Untangle Dataset, Fragment and ScanOptions ARROW-8074 - [C++][Dataset] Support for file-like objects (buffers) in FileSystemDataset? ARROW-8108 - [Java] Extract a common interface for dictionary encoders ARROW-8111 - [C++][CSV] Support MM/DD/YYYY date format ARROW-8114 - [Java][Integration] Enable custom_metadata integration test ARROW-8121 - [Java] Enhance code style checking for Java code (add space after commas, semi-colons and type casts) ARROW-8149 - [C++/Python] Enable CUDA Support in conda recipes ARROW-8157 - [C++][Gandiva] Support building with LLVM 9 ARROW-8162 - [Format][Python] Add serialization for CSF sparse tensors ARROW-8169 - [Java] Improve the performance of JDBC adapter by allocating memory proactively ARROW-8171 - Consider pre-allocating memory for fix-width vector in Avro adapter iterator ARROW-8190 - [C++][Flight] Allow setting IpcWriteOptions and IpcReadOptions in Flight IPC message reader and writer classes ARROW-8229 - [Java] Move ArrowBuf into the Arrow package ARROW-8230 - [Java] Move Netty memory manager into a separate module ARROW-8261 - [Rust] [DataFusion] LogicalPlanBuilder.limit() should take a literal argument ARROW-8263 - [Rust] [DataFusion] Add documentation for supported SQL functions ARROW-8281 - [R] Name collision of arrow.dll on Windows conda ARROW-8283 - [Python][Dataset] Non-existent files are silently dropped in pa.dataset.FileSystemDataset ARROW-8287 - [Rust] Arrow examples should use utility to print results ARROW-8293 - [Python] Run flake8 on python/examples also ARROW-8297 - [FlightRPC][C++] Implement Flight DoExchange for C++ ARROW-8301 - [R] Handle ChunkedArray and Table in C data interface ARROW-8312 - [Java][Gandiva] improve IN expression support ARROW-8314 - [Python] Provide a method to select a subset of columns of a Table ARROW-8318 - [C++][Dataset] Dataset should instantiate Fragment ARROW-8399 - [Rust] Extend memory alignments to include other architectures ARROW-8413 - [C++] Refactor DefLevelsToBitmap ARROW-8422 - [Rust] [Parquet] Implement function to convert Arrow schema to Parquet schema ARROW-8430 - [CI] Configure self-hosted runners for Github Actions ARROW-8434 - [C++] Ipc RecordBatchFileReader deserializes the Schema multiple times ARROW-8440 - [C++] Refine simd header files ARROW-8443 - [Gandiva][C++] Fix round/truncate to no-op for special cases ARROW-8447 - [C++][Dataset] Ensure Scanner::ToTable preserve ordering of ScanTasks ARROW-8467 - [C++] Test cases using ArrayFromJSON assume only a little-endian platform ARROW-8474 - [CI][Crossbow] Skip some nightlies we don’t need to run ARROW-8477 - [C++] Enable reading and writing of long filenames for Windows ARROW-8481 - [Java] Provide an allocation manager based on Unsafe API ARROW-8483 - [Ruby] Arrow::Table documentation improvement ARROW-8485 - [Integration][Java] Implement extension types integration ARROW-8486 - [C++] arrow-utility-test causes failures on a big-endian platform ARROW-8487 - [FlightRPC][C++] Make it possible to target a specific payload size ARROW-8488 - [R] Replace VALUE_OR_STOP with ValueOrStop ARROW-8496 - [C++] Refine ByteStreamSplitDecodeScalar ARROW-8497 - [Archery] Add missing component to builds ARROW-8499 - [C++][Dataset] In ScannerBuilder, batch_size will not work if projecter is not empty ARROW-8500 - [C++] Use selection vectors in Filter implementation for record batches, tables ARROW-8501 - [Packaging][RPM] Upgrade devtoolset to 8 on CentOS 6 ARROW-8502 - [Release][APT][Yum] Ignore all arm64 verifications ARROW-8504 - [C++] Add Run Length Reader ARROW-8506 - [c++] Miss tests to verify expected_buffer with bit_width &gt; 8 in RLE ARROW-8507 - [Release] Detect .git directory automatically in changelog.py ARROW-8509 - [GLib] Add low level record batch read/write functions ARROW-8512 - [C++] Delete unused compute expr prototype code ARROW-8513 - [Python] Expose Take with Table input in Python ARROW-8515 - [C++] Bitmap ToString should have an option of grouping by bytes ARROW-8516 - [Rust] Slow BufferBuilder&lt;BooleanType&gt; inserts within PrimitiveBuilder::append_slice ARROW-8517 - [Developer][Release] Update Crossbow RC verification setup for changes since 0.16.0 ARROW-8520 - [Developer] Use .asf.yaml to direct GitHub notifications to e-mail lists and JIRA ARROW-8521 - [Developer] Group Sub-task, Task, Test, and Wish issue types as “Improvement” in Changelog ARROW-8522 - [Developer] Add environment variable option to toggle whether ephemeral NodeJS is installed in release verification script ARROW-8524 - [CI] Free up space on github actions ARROW-8526 - [Python] Fix non-deterministic row order failure in dataset tests ARROW-8531 - [C++] Deprecate ARROW_USE_SIMD CMake option ARROW-8538 - [Packaging] Remove boost from homebrew formula ARROW-8540 - [C++] Create memory allocation benchmark ARROW-8541 - [Release] Don’t remove previous source releases automatically ARROW-8542 - [Release] Fix checksum url in the website post release script ARROW-8543 - [C++] IO: single pass coalescing algorithm ARROW-8544 - [CI][Crossbow] Add a status.json to the gh-pages summary of nightly builds to get around rate limiting ARROW-8548 - [Website] 0.17 release post ARROW-8549 - [R] Assorted post-0.17 release cleanups ARROW-8550 - [CI] Don’t run cron GHA jobs on forks ARROW-8551 - [CI][Gandiva] Use LLVM 8 to build gandiva linux jar ARROW-8552 - [Rust] support column iteration for parquet row ARROW-8553 - [C++] Optimize unaligned bitmap operations ARROW-8555 - [FlightRPC][Java] Implement Flight DoExchange for Java ARROW-8558 - [Rust] GitHub Actions missing rustfmt ARROW-8559 - [Rust] Consolidate Record Batch reader traits in main arrow crate ARROW-8560 - [Rust] Docs for MutableBuffer resize are incorrect ARROW-8561 - [C++][Gandiva] Stop using deprecated google::protobuf::MessageLite::ByteSize() ARROW-8562 - [C++] IO: Parameterize I/O coalescing using S3 storage metrics ARROW-8563 - [Go] Minor change to make newBuilder public ARROW-8564 - [Website] Add Ubuntu 20.04 LTS to supported package list ARROW-8569 - [CI] Upgrade xcode version for testing homebrew formulae ARROW-8571 - [C++] Switch AppVeyor image to VS 2017 ARROW-8572 - [Python] Expose UnionArray.array and other fields ARROW-8573 - [Rust] Upgrade to Rust 1.44 nightly ARROW-8574 - [Rust] Implement Debug for all plain types ARROW-8575 - [Developer] Add issue_comment workflow to rebase a PR ARROW-8590 - [Rust] Use Arrow pretty print utility in DataFusion ARROW-8591 - [Rust] Reverse lookup for a key in DictionaryArray ARROW-8597 - [Rust] arrow crate lint and readability improvements ARROW-8606 - [CI] Don’t trigger all builds on a change to any file in ci/ ARROW-8607 - [R][CI] Unbreak builds following R 4.0 release ARROW-8611 - [R] Can’t install arrow 0.17 on Ubuntu 18.04 R 3.6.3 ARROW-8612 - [GLib] Add GArrowReadOptions and GArrowWriteOptions ARROW-8616 - [Rust] Turn explicit SIMD off by default ARROW-8619 - [C++] Use distinct Type::type values for interval types ARROW-8622 - [Rust] Parquet crate does not compile on aarch64 ARROW-8623 - [C++][Gandiva] Reduce use of Boost, remove Boost headers from header files ARROW-8624 - [Website] Install page should mention arrow-dataset packages ARROW-8628 - [CI][Dev] Wrap docker-compose commands with archery ARROW-8629 - [Rust] Eliminate indirection of ZST allocations ARROW-8633 - [C++] Add ValidateAscii function ARROW-8634 - [Java] Create an example ARROW-8639 - [C++][Plasma] Require gflags ARROW-8645 - [C++] Missing gflags dependency for plasma ARROW-8647 - [C++][Dataset] Optionally encode partition field values as dictionary type ARROW-8648 - [Rust] Optimize Rust CI Build Times ARROW-8650 - [Rust] [Website] Add documentation to Arrow website ARROW-8651 - [Python][Dataset] Support pickling of Dataset objects ARROW-8655 - [C++][Dataset][Python][R] Preserve partitioning information for a discovered Dataset ARROW-8656 - [Python] Switch to VS2017 in the windows wheel builds ARROW-8659 - [Rust] ListBuilder and FixedSizeListBuilder capacity ARROW-8660 - [C++][Gandiva] Reduce dependence on Boost ARROW-8662 - [CI] Consolidate appveyor scripts ARROW-8664 - [Java] Add skip null check to all Vector types ARROW-8668 - [Packaging][APT][Yum][ARM] Use Travis CI’s ARM machine to build packages ARROW-8669 - [C++] Add IpcWriteOptions argument to GetRecordBatchSize() ARROW-8671 - [C++] Use IPC body compression metadata approved in ARROW-300 ARROW-8671 - [C++] Use IPC body compression metadata approved in ARROW-300 ARROW-8682 - [Ruby][Parquet] Add support for column level compression ARROW-8687 - [Java] Finish move of io.netty.buffer.ArrowBuf ARROW-8690 - [Python] Clean-up dataset+parquet tests now order is determinstic ARROW-8692 - [C++] Avoid memory copies when downloading from S3 ARROW-8695 - [Java] remove references to PlatformDependent in memory module ARROW-8696 - [Java] Convert tests to integration tests ARROW-8699 - [R] Fix automatic r_to_py conversion ARROW-8702 - [Packaging][C#] Build NuGet packages in release process ARROW-8703 - [R] schema$metadata should be properly typed ARROW-8707 - [CI] Docker push fails because of wrong dockerhub credentials ARROW-8708 - [CI] Utilize github actions cache for docker-compose volumes ARROW-8711 - [Python] Expose strptime timestamp parsing in read_csv conversion options ARROW-8717 - [CI][Packaging] Add build dependency on boost to homebrew ARROW-8720 - [C++] Fix checked_pointer_cast ARROW-8721 - [CI] Fix R build matrix ARROW-8723 - [Rust] Remove SIMD specific benchmark code ARROW-8724 - [Packaging][deb][RPM] Use directory in host as build directory ARROW-8725 - [Rust] redundant directory walk in rust parquet datasource code ARROW-8727 - [C++] Do not require struct-initialization of StringConverter&lt;T&gt; to parse strings to other types ARROW-8730 - [Rust] Use slice instead of &amp;Vec for function arguments ARROW-8733 - [C++][Dataset][Python] ParquetFileFragment should provide access to parquet FileMetadata ARROW-8736 - [Rust] [DataFusion] Table API should provide a schema() method ARROW-8740 - [CI] Fix archery option in pandas master cron test ARROW-8742 - [C++][Python] Add flight client support for Mutual TLS ARROW-8743 - [C++][CI] Add a test job on s390x ARROW-8744 - [Rust] ParquetIterator’s next method should be safe to call even after reached end of iteration ARROW-8745 - [C++] Bitmap.ToString causes failures on a big-endian platform ARROW-8747 - [C++] Feather tests with compression cause failure on big-endian platforms ARROW-8751 - [Rust] ParquetFileArrowReader should be able to read empty parquet file without error ARROW-8752 - [Rust] Remove unused hashmap ARROW-8753 - [C++][CI] Add a test job on ARM ARROW-8754 - [C++][CI] enable tests for additional components on big-endian platforms ARROW-8756 - [C++] Bitmap word tests cause failures on a big-endian platform ARROW-8757 - [C++] Plasma header is written in native endian ARROW-8758 - [R] Updates for compatibility with dplyr 1.0 ARROW-8759 - [C++] TestPlasmaSerialization.DeleteReply tests failure on big-endian platforms ARROW-8762 - [C++][Gandiva] Replace Gandiva’s BitmapAnd with common implementation ARROW-8763 - [C++] Create RandomAccessFile::WillNeed-like API ARROW-8764 - [C++] Make ThreadPool configurable in ReadRangeCache ARROW-8766 - [Python] A FileSystem implementation based on Python callbacks ARROW-8769 - [C++] Add convenience methods to access fields by name in StructScalar ARROW-8770 - [C++][CI] enable arrow-csv-test on s390x ARROW-8772 - [C++] Expand SumKernel benchmark to more types ARROW-8777 - [Rust] Parquet.rs does not support reading fixed-size binary fields. ARROW-8778 - [C++][Gandiva] SelectionVector related test failed on big-endian platforms ARROW-8779 - [R] Implement conversion to List&lt;Struct&gt; ARROW-8781 - [CI][C++] Enable ccache on GHA MinGW jobs ARROW-8782 - [Rust] [DataFusion] Add benchmarks based on NYC Taxi data set ARROW-8783 - [Rust] [DataFusion] Logical plan should have ParquetScan and CsvScan entries ARROW-8784 - [Rust] [DataFusion] Remove use of Arc from LogicalPlan ARROW-8785 - [Python][Packaging] Build the windows wheels with MIMALLOC enabled ARROW-8786 - [Packaging][rpm] Use bundled zstd in the CentOS 8 build ARROW-8788 - [C#] Array builders to use bit-packed buffer builder rather than boolean array builder for validity map ARROW-8789 - [Rust] Add separate crate for integration test binaries ARROW-8790 - [C++][CI] Enable arrow-flight-test on s390x ARROW-8791 - [Rust] Creating StringDictionaryBuilder with existing dictionary values ARROW-8792 - [C++] Improved declarative compute function / kernel development framework, normalize calling conventions ARROW-8793 - [C++] BitUtil::SetBitsTo probably doesn’t need to be inline ARROW-8794 - [C++] Expand benchmark coverage for arrow from parquet reading ARROW-8795 - [C++] Limited iOS support ARROW-8800 - [C++] Split arrow::ChunkedArray into arrow/chunked_array.h ARROW-8804 - [R][CI] Followup to Rtools40 upgrade ARROW-8814 - [Dev][Release] Binary upload script keeps raising locale warnings ARROW-8815 - [Dev][Release] Binary upload script should retry on unexpected bintray request error ARROW-8818 - [Rust] Failing to build on master due to Flatbuffers/Union issues ARROW-8822 - [Rust] [DataFusion] Add MemoryScan variant to LogicalPlan ARROW-8827 - [Integration Testing] Initial skeleton for Rust integration tests ARROW-8830 - [GLib] Add support for Tell againt not seekable GIO output stream ARROW-8831 - [Rust] incomplete SIMD implementation in simd_compare_op ARROW-8833 - [Rust] Implement VALIDATE mode in integration test binary ARROW-8834 - [Rust] Implement arrow-file-to-stream for integration testing ARROW-8835 - [Rust] Implement arrow-stream-to-file for integration testing ARROW-8836 - [Website] Update copyright end year automatically ARROW-8837 - [Rust] Add Null type ARROW-8838 - [Rust] File reader fails to read header from valid files ARROW-8839 - [Rust] datafusion logical plan should support scaning csv without provided schema ARROW-8840 - [Rust] datafusion ExecutionError should implement std::error:Error trait ARROW-8841 - [C++] Add benchmark and unittest for PLAIN spaced ARROW-8843 - [C++] Optimize BitmapEquals unaligned case ARROW-8844 - [C++] Optimize TransferBitmap unaligned case ARROW-8846 - [Dev][Python] Autoformat Python sources with Archery ARROW-8847 - [C++] Pass task size / metrics in Executor API ARROW-8851 - [Python][Documentation] Fix FutureWarnings in Python Plasma docs ARROW-8852 - [R] Post-0.17.1 adjustments ARROW-8854 - [Rust] [Integration Testing] Show output from arrow-json-integration-test ARROW-8855 - [Rust] [Integration Testing] data type Date32(Day) not supported ARROW-8856 - [Rust] [Integration Testing] Return empty batch if MessageHeader is NONE ARROW-8864 - [R] Add methods to Table/RecordBatch for consistency with data.frame ARROW-8866 - [C++] Split Type::UNION into Type::SPARSE_UNION and Type::DENSE_UNION ARROW-8867 - [R] Support converting POSIXlt type ARROW-8875 - [C++] use AWS SDK SetResponseStreamFactory to avoid a copy of bytes ARROW-8877 - [Rust] add CSV read option struct to simplify datafusion interface ARROW-8880 - [R][Linux] Make R Binary Install Friendlier ARROW-8881 - [Rust] Add large list and binary support ARROW-8885 - [R] Don’t include everything everywhere ARROW-8886 - [C#] Decide and implement appropriate behaviour for Array builder resize to negative size ARROW-8887 - [Java] Buffer size for complex vectors increases rapidly in case of clear/write loop ARROW-8890 - [R] Fix C++ lint issue ARROW-8895 - [C++] Add C++ unit tests for filter and take functions on temporal type inputs, including timestamps ARROW-8896 - [C++] Reimplement dictionary unpacking in Cast kernels using Take ARROW-8899 - [R] Add R metadata like pandas metadata for round-trip fidelity ARROW-8901 - [C++] Reduce number of take kernels ARROW-8903 - [C++] Implement optimized “unsafe take” for use with selection vectors for kernel execution ARROW-8904 - [Python] Fix usages of deprecated C++ APIs related to child/field ARROW-8906 - [Rust] Support reading multiple CSV files for schema inference ARROW-8907 - [Rust] implement scalar comparison operations ARROW-8912 - [Ruby] Keep reference of Arrow::Buffer’s data for GC ARROW-8913 - [Ruby] Use “field” instead of “child” ARROW-8914 - [C++][Gandiva] Decimal128 related test failed on big-endian platforms ARROW-8915 - [Dev][Archery] Require Click 7 ARROW-8917 - [C++][Compute] Formalize “metafunction” concept ARROW-8918 - [C++] Add cast “metafunction” to FunctionRegistry that addresses dispatching to appropriate type-specific CastFunction ARROW-8922 - [C++] Implement example string scalar kernel function to assist with string kernels buildout per ARROW-555 ARROW-8923 - [C++] Improve usability of arrow::compute::CallFunction by moving ExecContext* argument to end and adding default ARROW-8926 - [C++] Improve docstrings in new public APIs in arrow/compute and fix miscellaneous typos ARROW-8927 - [C++] Support dictionary memos when reading/writing record batches using cuda IPC ARROW-8929 - [C++] Change compute::Arity:VarArgs min_args default to 0 ARROW-8931 - [Rust] Support lexical sort in arrow compute kernel ARROW-8933 - [C++] Reduce generated code in vector_hash.cc ARROW-8934 - [C++] Add timestamp subtract kernel aliased to int64 subtract implementation ARROW-8937 - [C++] Add “parse_strptime” function for string to timestamp conversions using the kernels framework ARROW-8938 - [R] Provide binding for arrow::compute::CallFunction ARROW-8940 - [Java] Fix the performance degradation of integration tests ARROW-8941 - [C++/Python] arrow-nightlies conda repository is full ARROW-8942 - [R] Detect compression in reading CSV/JSON ARROW-8943 - [C++][Dataset] Add support for Partitioning to ParquetDatasetFactory ARROW-8950 - [C++] Make head optional in s3fs ARROW-8958 - [FlightRPC][Python] Implement Flight DoExchange for Python ARROW-8960 - [MINOR] [FORMAT] Fix typos in comments ARROW-8961 - [C++] Add utf8proc library to toolchain ARROW-8963 - [C++][Parquet] Parquet cpp optimize allocate memory ARROW-8965 - [Python][Documentation] Pyarrow documentation for pip nightlies references 404’d location ARROW-8966 - [C++] Move arrow::ArrayData to a separate header file ARROW-8969 - [C++] Reduce generated code in compute/kernels/scalar_compare.cc ARROW-8970 - [C++] Reduce shared library / binary code size (umbrella issue) ARROW-8972 - [Java] Support range value comparison for large varchar/varbinary vectors ARROW-8973 - [Java] Support batch value appending for large varchar/varbinary vectors ARROW-8974 - [C++] Refine TransferBitmap template parameters ARROW-8976 - [C++] compute::CallFunction can’t Filter/Take with ChunkedArray ARROW-8979 - [C++] Implement bitmap word reader and writer ARROW-8984 - [R] Revise install guides now that Windows conda package exists ARROW-8985 - [Format] Add “byte width” field with default of 16 to Decimal Flatbuffers type for forward compatibility ARROW-8989 - [C++] Document available functions in compute::FunctionRegistry ARROW-8993 - [Rust] Support reading non-seekable sources in text readers ARROW-8994 - [C++] Disable include-what-you-use cpplint lint checks ARROW-8996 - [C++] Runtime SIMD path for Aggregate Sum/Mean kernel ARROW-8997 - [Archery] Benchmark formatter should have friendly units ARROW-9004 - [C++][Gandiva] Support building with LLVM 10 ARROW-9005 - [Rust] [DataFusion] Support sort expression ARROW-9007 - [Rust] Support appending arrays by merging array data ARROW-9014 - [Packaging] Bump the minor part of the automatically generated version in crossbow ARROW-9015 - [Java] Make BaseAllocator package private ARROW-9016 - [Java] Remove direct references to Netty/Unsafe Allocators ARROW-9017 - [Python] Refactor the Scalar classes ARROW-9018 - [C++] Remove APIs that were deprecated in 0.17.x and prior ARROW-9021 - [Python] The filesystem keyword in parquet.read_table is not documented ARROW-9022 - [C++] Add/Sub/Mul arithmetic kernels with overflow check ARROW-9029 - [C++] Implement BitBlockCounter interface for blockwise popcounts of validity bitmaps ARROW-9030 - [Python] Clean up some usages of pyarrow.compat, move some common functions/symbols to lib.pyx ARROW-9031 - [R] Implement conversion from Type::UINT64 to R vector ARROW-9032 - [C++] Split arrow/util/bit_util.h into multiple header files ARROW-9034 - [C++] Implement binary (two bitmap) version of BitBlockCounter ARROW-9042 - [C++] Add Subtract and Multiply arithmetic kernels with wrap-around behavior ARROW-9043 - [Go] Temporarily copy LICENSE.txt to go/ ARROW-9043 - [Go] Temporarily copy LICENSE.txt to go/ ARROW-9045 - [C++] Improve and expand Take/Filter benchmarks ARROW-9046 - [C++][R] Put more things in type_fwds ARROW-9047 - [Rust] Setting 0-bits of a 0-length bitset segfaults ARROW-9050 - [Release] Use 1.0.0 as the next version ARROW-9051 - [GLib] Refer Array related objects from Array ARROW-9052 - [CI][MinGW] Enable Gandiva ARROW-9055 - [C++] Add sum/mean kernels for Boolean type ARROW-9058 - [Packaging][wheel] Boost download is failed ARROW-9060 - [GLib] Add support for building Apache Arrow Datasets GLib with non-installed Apache Arrow Datasets ARROW-9061 - [Packaging][APT][Yum][GLib] Add Apache Arrow Datasets GLib ARROW-9062 - [Rust] Support to read JSON into dictionary type ARROW-9067 - [C++] Create reusable branchless / vectorized index boundschecking functions ARROW-9070 - [C++] StructScalar needs field accessor methods ARROW-9073 - [C++] RapidJSON include directory detection doesn’t work with RapidJSONConfig.cmake ARROW-9074 - [GLib] Add missing arrow-json check ARROW-9075 - [C++] Optimize Filter implementation ARROW-9079 - [C++] Write benchmark for arithmetic kernels ARROW-9083 - [R] collect int64, uint32, uint64 as R integer type if not out of bounds ARROW-9086 - [CI][Homebrew] Enable Gandiva ARROW-9088 - [Rust] Recent version of arrow crate does not compile into wasm target ARROW-9089 - [Python] A PyFileSystem handler for fsspec-based filesystems ARROW-9090 - [C++] Bump versions of bundled libraries ARROW-9091 - [C++] Utilize function’s default options when passing no options to CallFunction for a function that requires them ARROW-9093 - [FlightRPC][C++][Python] Allow setting gRPC client options ARROW-9094 - [Python] Bump versions of compiled dependencies in manylinux wheels ARROW-9095 - [Rust] Fix NullArray to comply with spec ARROW-9099 - [C++][Gandiva] Add TRIM function for string ARROW-9100 - [C++] Add ascii_lower kernel ARROW-9101 - [Doc][C++][Python] Document encoding expected by CSV and JSON readers ARROW-9102 - [Packaging] Upload built manylinux docker images ARROW-9106 - [C++] Add C++ foundation to ease file transcoding ARROW-9108 - [C++][Dataset] Add Parquet Statistics conversion for timestamp columns ARROW-9109 - [Python][Packaging] Enable S3 support in manylinux wheels ARROW-9110 - [C++] Fix CPU cache size detection on macOS ARROW-9112 - [R] Update autobrew script location ARROW-9115 - [C++] Process data buffers in batch in ascii_lower / ascii_upper kernels rather than using string_view value iteration ARROW-9116 - [C++] Add BinaryArray::total_values_length() ARROW-9116 - [C++] Add BinaryArray::total_values_length() ARROW-9118 - [C++] Add more general BoundsCheck function that also checks for arbitrary lower limits in integer arrays ARROW-9119 - [C++] Add support for building with system static gRPC ARROW-9123 - [Python][wheel] Use libzstd.a explicitly ARROW-9124 - [Rust][Datafusion] DFParser should consume sql query as &amp;str instead of String ARROW-9125 - [C++] Add missing include for arrow::internal::ZeroMemory() for Valgrind ARROW-9129 - [Python][JPype] Test is failed with JPype 0.7.5 ARROW-9130 - [Python] Add deprecated wrappers functions to a pyarrow/compat.py module for 1.0.0 that will be removed later ARROW-9131 - [C++] Faster ascii_lower and ascii_upper ARROW-9132 - [C++] Implement hash kernels for dictionary data with constant dictionaries ARROW-9133 - [C++] Add utf8_upper and utf8_lower ARROW-9137 - [GLib][Ruby] Allow to read Parquet files in chunks (by RowGroup) ARROW-9138 - [Docs][Format] Make sure format version is hard coded in the docs ARROW-9139 - [Python] parquet read_table should not use_legacy_dataset ARROW-9144 - [CI] OSS-Fuzz build fails because recent changes in the google repository ARROW-9145 - [C++] Add true_count / false_count methods to BooleanArray ARROW-9152 - [C++] Create specialized filter implementation for varbinary types ARROW-9153 - [Python] Add bindings for StructScalar ARROW-9154 - [Developer] Use GitHub issue templates better ARROW-9155 - [Archery] Less precise but faster default settings for “archery benchmark diff” ARROW-9156 - [C++] Reducing the code size of the tensor module ARROW-9157 - [Rust][Datafusion] execution context’s create_physical_plan should take self as immutable reference ARROW-9158 - [Rust][Datafusion] Projection physical plan compilation should preserve nullability ARROW-9159 - [Python] Expose the isnull/isvalid kernels ARROW-9162 - [Python] Expose Add/Subtract/Multiply arithmetic kernels ARROW-9163 - [C++] Add methods to StringArray, LargeStringArray, to validate whether its values are all UTF-8 ARROW-9166 - [Website] Add overview page ARROW-9167 - [Doc][Website] /docs/c_glib/index.html is overwritten ARROW-9168 - [C++][Flight] allow flight benchmark to use separated TCP connections ARROW-9173 - [C++] Document how to use Arrow from a third-party CMake project ARROW-9175 - [FlightRPC][C++][Python] Expose connected peer ARROW-9176 - [Rust] Fix for memory leaks in Arrow allocator ARROW-9178 - [R] Improve documentation about CSV reader ARROW-9179 - [R] Replace usage of iris dataset in tests ARROW-9180 - [Developer] Remove usage of whitelist, blacklist, slave, etc. ARROW-9181 - [C++] Instantiate fewer templates in Cast kernel implementation ARROW-9182 - [C++] Use “applicator” namespace for kernel operator-to-kernel functors, streamline argument unboxing ARROW-9185 - [C++] [Java][Gandiva] Make llvm build optimisation configurable from java ARROW-9188 - [C++] Do not always statically link Brotli libraries ARROW-9189 - [Website] Improve contributor guide ARROW-9190 - [Website][C++] Add blog post on efforts to make building lighter and easier ARROW-9191 - [Rust] Do not panic when int96 milliseconds are negative ARROW-9192 - [CI][Rust] Add support for running clippy ARROW-9193 - [C++] Add method to parse date from null-terminated string ARROW-9197 - [C++] Revamp numeric casts: faster performance and reduced binary size ARROW-9201 - [Archery] Render-human readable table when using “archery benchmark diff” ARROW-9202 - [GLib] Add GArrowDatum ARROW-9203 - [Packaging][deb] Add missing gir1.2-arrow-dataset-1.0.install ARROW-9204 - [C++][Flight] change records_per_stream to int64 in flight benchmark ARROW-9205 - [Documentation] Fix typos in Columnar.rst ARROW-9206 - [C++][Flight] measure latency in flight benchmark ARROW-9207 - [Python][Dataset] Clean-up internal FileSource class ARROW-9210 - [C++] Use OptionalBitBlockCounter in ArrayDataInlineVisitor ARROW-9214 - [C++] Avoid util::optional in favor of separate inlineable functions in arrow/visitor_inline.h ARROW-9216 - [C++][Parquet] Use BitBlockCounter for plain spaced encoding/decoding ARROW-9217 - [C++][Parquet] Cover 0.01% null for the plain spaced encoding/decoding benchmark ARROW-9220 - [C++] Disable relevant compute kernels if ARROW_WITH_UTF8PROC=OFF ARROW-9222 - [Format][Proposal] Remove validity bitmap from Union types ARROW-9224 - [Dev][Archery] Copy local repo on clone failure ARROW-9225 - [C++][Compute] Improve counting sort ARROW-9231 - [Format] Increment MetadataVersion from V4 to V5 ARROW-9234 - [GLib][CUDA] Add support for dictionary memo on reading record batch from buffer ARROW-9241 - [C++] Add forward compatibility checks for Decimal::bitWidth ARROW-9242 - [Java] Add forward compatibility checks for Decimal::bitWidth ARROW-9247 - [Python] Expose BinaryArray::total_values_length in bindings ARROW-9248 - [C++] Add “list_size” function that returns Int32Array/Int64Array giving list cell sizes ARROW-9249 - [C++] Implement “list_parent_indices” vector function ARROW-9250 - [C++] Compact generated code in compute/kernels/scalar_set_lookup.cc using same method as vector_hash.cc ARROW-9251 - [C++] Move JSON testing code for integration tests to libarrow_testing ARROW-9254 - [C++] Factor out some integer casting internals so it can be reused with temporal casts ARROW-9255 - [C++] Use CMake to build bundled Protobuf with CMake &gt;= 3.7 ARROW-9256 - [C++] Incorrect variable name ARROW_CXX_FLAGS ARROW-9258 - [Format] Add V5 MetadataVersion ARROW-9259 - [Format] Permit unsigned dictionary indices in Columnar.rst ARROW-9262 - [Packaging][Linux][CI] Use Ubuntu 18.04 to build ARM64 packages on Travis CI ARROW-9263 - [C++] Benchmark: promote RegressionSetArgs size to L2 ARROW-9264 - [C++] Cleanup Parquet Arrow Schema code ARROW-9265 - [C++] Add support for writing MetadataVersion::V4-compatible IPC messages for compatibility with library versions &lt;= 0.17.1 ARROW-9268 - [C++] Add is{alnum,alpha,…} kernels for strings ARROW-9272 - [C++][Python] Reduce complexity in python to arrow conversion ARROW-9276 - [Dev] Enable ARROW_CUDA when generating API documentations ARROW-9277 - [C++] Fix documentation of Reading CSV files ARROW-9278 - [C++] Implement Union validity bitmap changes from ARROW-9222 ARROW-9280 - [Rust] Write statistics to Parquet files ARROW-9281 - [R] Turn off utf8proc in R builds ARROW-9283 - [Python] Expose C++ build info ARROW-9287 - [C++] Implement support for unsigned dictionary indices ARROW-9289 - [R] Remove deprecated functions ARROW-9290 - [Rust] [Parquet] Add features to allow opting out of dependencies ARROW-9291 - [R] Support fixed size binary/list types ARROW-9292 - [Rust] Update feature matrix with passing tests ARROW-9294 - [GLib] Add GArrowFunction ARROW-9300 - [Java] Separate Netty Memory to its own module ARROW-9306 - [Ruby] Add support for Arrow::RecordBatch.new(raw_table) ARROW-9307 - [Ruby] Add Arrow::RecordBatchIterator#to_a ARROW-9308 - [Format] Add Feature enum to schema.fbs for forward compatibity ARROW-9316 - [C++] Use “Dataset” instead of “Datasets” ARROW-9321 - [C++][Dataset] Allow to “collect” statistics for ParquetFragment row groups if not constructed from _metadata ARROW-9322 - [R] Dataset documentation polishing ARROW-9323 - [Ruby] Add Red Arrow Dataset ARROW-9327 - Fix all clippy errors for arrow crate ARROW-9328 - [C++][Gandiva] Add LTRIM, RTRIM, BTRIM functions for string ARROW-9329 - [C++][Gandiva] Implement castTimestampToDate function ARROW-9331 - [C++] Improve the performance of Tensor-to-SparseTensor conversion ARROW-9333 - [Python] Expose more IPC write options in Python ARROW-9335 - [Website] Update website for 1.0 ARROW-9337 - [R] On C++ library build failure, give an unambiguous message ARROW-9339 - [Rust] Comments on SIMD in Arrow README are incorrect ARROW-9340 - [R] Use CRAN version of decor package ARROW-9341 - [GLib] Use arrow::Datum version Take() ARROW-9345 - [C++][Dataset] Expression with dictionary type should work with operand of value type ARROW-9346 - [C++][Python][Dataset] Add total_byte_size metadata to RowGroupInfo ARROW-9362 - [Java] Add support for writing MetadataVersion::V4-compatible IPC messages for compatibility with library versions &lt;= 0.17.1 ARROW-9365 - [Go] Implement the rest of the typed array builders in NewBuilder ARROW-9370 - [Java] Bump Netty version ARROW-9374 - [C++][Python] Expose MakeArrayFromScalar ARROW-9379 - [Rust] Support unsigned dictionary indices ARROW-9383 - [Python] Support fsspec filesystems in Dataset API through fs handler ARROW-9386 - [Rust] RecordBatch.schema() should not return &amp;Arc&lt;Schema&gt; ARROW-9390 - [C++] Review compute function names ARROW-9390 - [C++] Review compute function names ARROW-9390 - [C++] Review compute function names ARROW-9391 - [Rust] Float32 values interpreted as zero when record batch has one row ARROW-9393 - [Doc] update supported types documentation for Java ARROW-9395 - [Python] Provide configurable MetadataVersion in IPC API and environment variable to set default to V4 when needed ARROW-9399 - [C++] Add forward compatibility checks for unrecognized future MetadataVersion ARROW-9403 - [Python] add .tolist as alias of .to_pylist ARROW-9407 - [Python] Accept pd.NA as missing value in array constructor ARROW-9411 - [Rust] Update dependencies ARROW-9424 - [C++][Parquet] Disable writing files with LZ4 codec ARROW-9425 - [Rust][DataFusion] Make ExecutionContext sharable between threads ARROW-9427 - [Rust][DataFusion] Add pub fn ExecutionContext.tables() ARROW-9437 - [Python][Packaging] Homebrew fails to install build dependencies in the macOS wheel builds ARROW-9442 - [Python] Do not force Validate() to be called in pyarrow_wrap_table ARROW-9445 - [Python] Revert Array.equals changes + expose comparison ops in compute ARROW-9446 - [C++] Export compiler information in BuildInfo ARROW-9447 - [Rust][DataFusion] Allow closures as ScalarUDFs ARROW-9452 - [Rust] [DateFusion] Improve performance of parquet scan ARROW-9470 - [CI][Java] Run Maven in parallel ARROW-9472 - [R] Provide configurable MetadataVersion in IPC API and environment variable to set default to V4 when needed ARROW-9473 - [Doc] Polishing for 1.0 ARROW-9478 - [C++] Improve error message on unsupported cast types ARROW-9484 - [Docs] Update is* functions to be is_* in the compute docs ARROW-9485 - [R] Better shared library stripping ARROW-9493 - [Python][Dataset] Dictionary encode string partition columns by default ARROW-9508 - [Release][APT][Yum] Enable verification for arm64 binaries ARROW-9509 - [Release] Don’t test Gandiva in the windows wheel verification script ARROW-9511 - [Packaging][Release] Set conda packages’ build number to 0 ARROW-9519 - [Rust] Improve error message when getting a field by name from schema ARROW-9529 - [Dev][Release] Improvements to release verification scripts ARROW-9531 - [Packaging][Release] Update conda forge dependency pins ARROW-9534 - [Rust] [DataFusion] Implement functions for creating literal expressions for all types PARQUET-1820 - [C++] Use a column filter hint to inform read prefetching in Arrow reads PARQUET-1843 - [C++] Unnecessary assignment in DictDecoderImpl::Decode PARQUET-1855 - [C++] Improve documentation on MetaData ownership PARQUET-1861 - [Documentation][C++] Explain ReaderProperters.buffer_stream*","headline":"Apache Arrow 1.0.0 Release","image":"https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png","mainEntityOfPage":{"@type":"WebPage","@id":"https://arrow.apache.org/release/1.0.0.html"},"publisher":{"@type":"Organization","logo":{"@type":"ImageObject","url":"https://arrow.apache.org/img/logo.png"}},"url":"https://arrow.apache.org/release/1.0.0.html"}</script>
<!-- End Jekyll SEO tag -->
<!-- favicons -->
<link rel="icon" type="image/png" sizes="16x16" href="/img/favicon-16x16.png" id="light1">
<link rel="icon" type="image/png" sizes="32x32" href="/img/favicon-32x32.png" id="light2">
<link rel="apple-touch-icon" type="image/png" sizes="180x180" href="/img/apple-touch-icon.png" id="light3">
<link rel="apple-touch-icon" type="image/png" sizes="120x120" href="/img/apple-touch-icon-120x120.png" id="light4">
<link rel="apple-touch-icon" type="image/png" sizes="76x76" href="/img/apple-touch-icon-76x76.png" id="light5">
<link rel="apple-touch-icon" type="image/png" sizes="60x60" href="/img/apple-touch-icon-60x60.png" id="light6">
<!-- dark mode favicons -->
<link rel="icon" type="image/png" sizes="16x16" href="/img/favicon-16x16-dark.png" id="dark1">
<link rel="icon" type="image/png" sizes="32x32" href="/img/favicon-32x32-dark.png" id="dark2">
<link rel="apple-touch-icon" type="image/png" sizes="180x180" href="/img/apple-touch-icon-dark.png" id="dark3">
<link rel="apple-touch-icon" type="image/png" sizes="120x120" href="/img/apple-touch-icon-120x120-dark.png" id="dark4">
<link rel="apple-touch-icon" type="image/png" sizes="76x76" href="/img/apple-touch-icon-76x76-dark.png" id="dark5">
<link rel="apple-touch-icon" type="image/png" sizes="60x60" href="/img/apple-touch-icon-60x60-dark.png" id="dark6">
<script>
// Switch to the dark-mode favicons if prefers-color-scheme: dark
function onUpdate() {
light1 = document.querySelector('link#light1');
light2 = document.querySelector('link#light2');
light3 = document.querySelector('link#light3');
light4 = document.querySelector('link#light4');
light5 = document.querySelector('link#light5');
light6 = document.querySelector('link#light6');
dark1 = document.querySelector('link#dark1');
dark2 = document.querySelector('link#dark2');
dark3 = document.querySelector('link#dark3');
dark4 = document.querySelector('link#dark4');
dark5 = document.querySelector('link#dark5');
dark6 = document.querySelector('link#dark6');
if (matcher.matches) {
light1.remove();
light2.remove();
light3.remove();
light4.remove();
light5.remove();
light6.remove();
document.head.append(dark1);
document.head.append(dark2);
document.head.append(dark3);
document.head.append(dark4);
document.head.append(dark5);
document.head.append(dark6);
} else {
dark1.remove();
dark2.remove();
dark3.remove();
dark4.remove();
dark5.remove();
dark6.remove();
document.head.append(light1);
document.head.append(light2);
document.head.append(light3);
document.head.append(light4);
document.head.append(light5);
document.head.append(light6);
}
}
matcher = window.matchMedia('(prefers-color-scheme: dark)');
matcher.addListener(onUpdate);
onUpdate();
</script>
<link rel="stylesheet" href="//fonts.googleapis.com/css?family=Lato:300,300italic,400,400italic,700,700italic,900">
<link href="/css/main.css" rel="stylesheet">
<link href="/css/syntax.css" rel="stylesheet">
<script src="/javascript/main.js"></script>
<!-- Matomo -->
<script>
var _paq = window._paq = window._paq || [];
/* tracker methods like "setCustomDimension" should be called before "trackPageView" */
/* We explicitly disable cookie tracking to avoid privacy issues */
_paq.push(['disableCookies']);
_paq.push(['trackPageView']);
_paq.push(['enableLinkTracking']);
(function() {
var u="https://analytics.apache.org/";
_paq.push(['setTrackerUrl', u+'matomo.php']);
_paq.push(['setSiteId', '20']);
var d=document, g=d.createElement('script'), s=d.getElementsByTagName('script')[0];
g.async=true; g.src=u+'matomo.js'; s.parentNode.insertBefore(g,s);
})();
</script>
<!-- End Matomo Code -->
</head>
<body class="wrap">
<header>
<nav class="navbar navbar-expand-md navbar-dark bg-dark">
<a class="navbar-brand no-padding" href="/"><img src="/img/arrow-inverse-300px.png" height="40px"/></a>
<button class="navbar-toggler ml-auto" type="button" data-toggle="collapse" data-target="#arrow-navbar" aria-controls="arrow-navbar" aria-expanded="false" aria-label="Toggle navigation">
<span class="navbar-toggler-icon"></span>
</button>
<!-- Collect the nav links, forms, and other content for toggling -->
<div class="collapse navbar-collapse justify-content-end" id="arrow-navbar">
<ul class="nav navbar-nav">
<li class="nav-item"><a class="nav-link" href="/overview/" role="button" aria-haspopup="true" aria-expanded="false">Overview</a></li>
<li class="nav-item"><a class="nav-link" href="/faq/" role="button" aria-haspopup="true" aria-expanded="false">FAQ</a></li>
<li class="nav-item"><a class="nav-link" href="/blog" role="button" aria-haspopup="true" aria-expanded="false">Blog</a></li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownGetArrow" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Get Arrow
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownGetArrow">
<a class="dropdown-item" href="/install/">Install</a>
<a class="dropdown-item" href="/release/">Releases</a>
<a class="dropdown-item" href="https://github.com/apache/arrow">Source Code</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownDocumentation" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Documentation
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownDocumentation">
<a class="dropdown-item" href="/docs">Project Docs</a>
<a class="dropdown-item" href="/docs/format/Columnar.html">Format</a>
<hr/>
<a class="dropdown-item" href="/docs/c_glib">C GLib</a>
<a class="dropdown-item" href="/docs/cpp">C++</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/csharp/README.md">C#</a>
<a class="dropdown-item" href="https://godoc.org/github.com/apache/arrow/go/arrow">Go</a>
<a class="dropdown-item" href="/docs/java">Java</a>
<a class="dropdown-item" href="/docs/js">JavaScript</a>
<a class="dropdown-item" href="/julia/">Julia</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/matlab/README.md">MATLAB</a>
<a class="dropdown-item" href="/docs/python">Python</a>
<a class="dropdown-item" href="/docs/r">R</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/ruby/README.md">Ruby</a>
<a class="dropdown-item" href="https://docs.rs/arrow/latest">Rust</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownSubprojects" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Subprojects
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownSubprojects">
<a class="dropdown-item" href="/adbc">ADBC</a>
<a class="dropdown-item" href="/docs/format/Flight.html">Arrow Flight</a>
<a class="dropdown-item" href="/docs/format/FlightSql.html">Arrow Flight SQL</a>
<a class="dropdown-item" href="https://datafusion.apache.org">DataFusion</a>
<a class="dropdown-item" href="/nanoarrow">nanoarrow</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownCommunity" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Community
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownCommunity">
<a class="dropdown-item" href="/community/">Communication</a>
<a class="dropdown-item" href="/docs/developers/index.html">Contributing</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/issues">Issue Tracker</a>
<a class="dropdown-item" href="/committers/">Governance</a>
<a class="dropdown-item" href="/use_cases/">Use Cases</a>
<a class="dropdown-item" href="/powered_by/">Powered By</a>
<a class="dropdown-item" href="/visual_identity/">Visual Identity</a>
<a class="dropdown-item" href="/security/">Security</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/policies/conduct.html">Code of Conduct</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownASF" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
ASF Links
</a>
<div class="dropdown-menu dropdown-menu-right" aria-labelledby="navbarDropdownASF">
<a class="dropdown-item" href="https://www.apache.org/">ASF Website</a>
<a class="dropdown-item" href="https://www.apache.org/licenses/">License</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/sponsorship.html">Donate</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/thanks.html">Thanks</a>
<a class="dropdown-item" href="https://www.apache.org/security/">Security</a>
</div>
</li>
</ul>
</div><!-- /.navbar-collapse -->
</nav>
</header>
<div class="container p-4 pt-5">
<main role="main" class="pb-5">
<!--
-->
<h1 id="apache-arrow-100-24-july-2020">Apache Arrow 1.0.0 (24 July 2020)</h1>
<p>This is a major release covering more than 3 months of development.</p>
<h2 id="download">Download</h2>
<ul>
<li><a href="https://www.apache.org/dyn/closer.lua/arrow/arrow-1.0.0/"><strong>Source Artifacts</strong></a></li>
<li><strong>Binary Artifacts</strong>
<ul>
<li><a href="https://apache.jfrog.io/artifactory/arrow/centos/">For CentOS</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/debian/">For Debian</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/python/1.0.0/">For Python</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/ubuntu/">For Ubuntu</a></li>
</ul>
</li>
<li><a href="https://github.com/apache/arrow/releases/tag/apache-arrow-1.0.0">Git tag</a></li>
</ul>
<h2 id="contributors">Contributors</h2>
<p>This release includes 771 commits from 100 distinct contributors.</p>
<div class="language-console highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="gp">$</span><span class="w"> </span>git shortlog <span class="nt">-sn</span> apache-arrow-0.17.0..apache-arrow-1.0.0
<span class="go"> 125 Wes McKinney
72 Neal Richardson
61 Antoine Pitrou
56 Sutou Kouhei
55 Krisztián Szűcs
30 Joris Van den Bossche
24 Benjamin Kietzman
23 David Li
21 Kazuaki Ishizaki
19 Qingping Hou
18 Andy Grove
18 François Saint-Jacques
17 Neville Dipale
16 Yibo Cai
16 liyafan82
15 Ryan Murray
14 Micah Kornfield
10 Uwe L. Korn
9 Paddy Horan
8 Jorge C. Leitao
8 Projjal Chanda
7 Frank Du
7 Romain Francois
6 Kenta Murata
6 Maarten A. Breddels
6 Max Burke
5 Mahmut Bulut
4 Mark Hildreth
4 Yuan Zhou
3 Adam Szmigin
3 Raphael Taylor-Davies
3 Zhuo Peng
3 rdettai
2 Bobby Wagner
2 Eric Erhardt
2 Prudhvi Porandla
2 R. Tyler Croy
2 Sagnik Chakraborty
2 Vamsi
2 Yordan Pavlov
2 c-jamie
2 dota17
2 mayuropensource
2 tianchen
2 tianchen92
1 Alex Baden
1 Ben Kimock
1 Bryan Cutler
1 Chad Dougherty
1 Christian Hudon
1 Cuong Nguyen
1 David Anthoff
1 Dmitry Kalinkin
1 Drazen Urch
1 Erin Ryan
1 Eyal Leshem
1 Frank Smith
1 German I Ramirez-Espinoza
1 Hongze Zhang
1 Ilija Puaca
1 Itamar Turner-Trauring
1 Jacek Pliszka
1 Jens Glaser
1 Ji Liu
1 Jörn Horstmann
1 Kyle Brandt
1 Kyle McCarthy
1 Laurent Goujon
1 Li, Jiajia
1 Maarten Breddels
1 Masaki Kozuki
1 Mohamed Zenadi
1 Nick Poorman
1 Pand9
1 Paul Dix
1 Paul Taylor
1 Praveen
1 Robert Borkowski
1 Rok
1 Sergey Fedoseev
1 Shuai Zhang
1 Simon Watts
1 Stéphane Campinas
1 Suvayu Ali
1 Sven Wagner-Boysen
1 TP Boudreau
1 Takuya Kato
1 Tobias Mayer
1 Wakahisa
1 Weston Steimel
1 Yuqi Gu
1 Zachary Gramana
1 Ze'ev Maor
1 alamb
1 alimcmaster1
1 amol
1 chenguoping
1 ritchie46
1 rvernica
1 xuyiming
</span></code></pre></div></div>
<h2 id="patch-committers">Patch Committers</h2>
<p>The following Apache committers merged contributed patches to the repository.</p>
<div class="language-console highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="gp">$</span><span class="w"> </span>git shortlog <span class="nt">-csn</span> apache-arrow-0.17.0..apache-arrow-1.0.0
<span class="go"> 231 Wes McKinney
85 Antoine Pitrou
74 Krisztián Szűcs
68 Neal Richardson
67 Sutou Kouhei
44 François Saint-Jacques
35 Andy Grove
34 Neville Dipale
21 Benjamin Kietzman
17 GitHub
17 Micah Kornfield
12 Chao Sun
12 Paddy Horan
8 David Li
8 Praveen
7 Joris Van den Bossche
5 Bryan Cutler
5 Eric Erhardt
5 Uwe L. Korn
4 Yosuke Shiro
3 Jacques Nadeau
3 Pindikura Ravindra
3 Sebastien Binet
2 liyafan82
1 Kenta Murata
</span></code></pre></div></div>
<h2 id="changelog">Changelog</h2>
<h2 id="apache-arrow-100-2020-07-24">Apache Arrow 1.0.0 (2020-07-24)</h2>
<h3 id="bug-fixes">Bug Fixes</h3>
<ul>
<li><a href="https://issues.apache.org/jira/browse/ARROW-1692">ARROW-1692</a> - [Python, Java] UnionArray round trip not working</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3329">ARROW-3329</a> - [Python] Error casting decimal(38, 4) to int64</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3861">ARROW-3861</a> - [Python] ParquetDataset().read columns argument always returns partition column</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4018">ARROW-4018</a> - [C++] RLE decoder may not big-endian compatible</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4309">ARROW-4309</a> - [Documentation] Add a docker-compose entry which builds the documentation with CUDA enabled</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4600">ARROW-4600</a> - [Ruby] Arrow::DictionaryArray#[] should returns the item in the indices array</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5158">ARROW-5158</a> - [Packaging][Wheel] Symlink libraries in wheels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5310">ARROW-5310</a> - [Python] better error message on creating ParquetDataset from empty directory</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5359">ARROW-5359</a> - [Python] timestamp_as_object support for pa.Table.to_pandas in pyarrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5572">ARROW-5572</a> - [Python] raise error message when passing invalid filter in parquet reading</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5666">ARROW-5666</a> - [Python] Underscores in partition (string) values are dropped when reading dataset</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5744">ARROW-5744</a> - [C++] Do not error in Table::CombineChunks for BinaryArray types that overflow 2GB limit</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5875">ARROW-5875</a> - [FlightRPC] Test RPC features in integration tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6235">ARROW-6235</a> - [R] Conversion from arrow::BinaryArray to R character vector not implemented</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6523">ARROW-6523</a> - [C++][Dataset] arrow_dataset target does not depend on anything</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6848">ARROW-6848</a> - [C++] Specify -std=c++11 instead of -std=gnu++11 when building</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7018">ARROW-7018</a> - [R] Non-UTF-8 data in Arrow &lt;–&gt; R conversion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7028">ARROW-7028</a> - [R] Date roundtrip results in different R storage mode</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7084">ARROW-7084</a> - [C++] ArrayRangeEquals should check for full type equality?</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7173">ARROW-7173</a> - [Integration] Add test to verify Map field names can be arbitrary</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7208">ARROW-7208</a> - [Python] Passing directory to ParquetFile class gives confusing error message</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7273">ARROW-7273</a> - [Python] Non-nullable null field is allowed / crashes when writing to parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7480">ARROW-7480</a> - [Rust] [DataFusion] Query fails/incorrect when aggregated + grouped columns don’t match the selected columns</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7610">ARROW-7610</a> - [Java] Finish support for 64 bit int allocations</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7654">ARROW-7654</a> - [Python] Ability to set column_types to a Schema in csv.ConvertOptions is undocumented</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7681">ARROW-7681</a> - [Rust] Explicitly seeking a BufReader will discard the internal buffer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7702">ARROW-7702</a> - [C++][Dataset] Provide (optional) deterministic order of batches</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7782">ARROW-7782</a> - [Python] Losing index information when using write_to_dataset with partition_cols</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7840">ARROW-7840</a> - [Java] [Integration] Java executables fail</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7925">ARROW-7925</a> - [C++][Documentation] Instructions about running IWYU and other tasks in cpp/development.rst have gone stale</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7939">ARROW-7939</a> - [Python] crashes when reading parquet file compressed with snappy</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7967">ARROW-7967</a> - [CI][Crossbow] Pin macOS version in autobrew job to match CRAN</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8050">ARROW-8050</a> - [Python][Packaging] Do not include generated Cython source files in wheel packages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8078">ARROW-8078</a> - [Python] Missing links in the docs regarding field and schema DataTypes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8115">ARROW-8115</a> - [Python] Conversion when mixing NaT and datetime objects not working</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8251">ARROW-8251</a> - [Python] pandas.ExtensionDtype does not survive round trip with write_to_dataset</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8344">ARROW-8344</a> - [C#] StringArray.Builder.Clear() corrupts subsequently-built array contents</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8360">ARROW-8360</a> - [C++][Gandiva] Fixes date32 support for date/time functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8374">ARROW-8374</a> - [R] Table to vector of DictonaryType will error when Arrays don’t have the same Dictionary per array</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8392">ARROW-8392</a> - [Java] Fix overflow related corner cases for vector value comparison</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8448">ARROW-8448</a> - [Package] Can’t build apt packages with ubuntu-focal</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8455">ARROW-8455</a> - [Rust] [Parquet] Arrow column read on partially compatible files</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8455">ARROW-8455</a> - [Rust] [Parquet] Arrow column read on partially compatible files</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8471">ARROW-8471</a> - [C++][Integration] Regression to /u?int64/ as JSON::number</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8472">ARROW-8472</a> - [Go][Integration] Represent 64 bit integers as JSON::string</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8473">ARROW-8473</a> - [Rust] “Statistics support” in rust/parquet readme is incorrect</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8480">ARROW-8480</a> - [Rust] There is no check for allocation failure</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8503">ARROW-8503</a> - [Packaging][deb] Can’t build apache-arrow-archive-keyring for RC</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8505">ARROW-8505</a> - [Release][C#] “sourcelink test” is failed by Apache.Arrow.AssemblyInfo.cs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8508">ARROW-8508</a> - [Rust] ListBuilder of FixedSizeListBuilder creates wrong offsets</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8510">ARROW-8510</a> - [C++] arrow/dataset/file_base.cc fails to compile with internal compiler error with “Visual Studio 15 2017 Win64” generator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8511">ARROW-8511</a> - [Developer][Release] Windows release verification script does not halt if C++ compilation fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8514">ARROW-8514</a> - [Developer] Windows wheel verification script does not check Python 3.5</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8529">ARROW-8529</a> - [C++] Fix usage of NextCounts() in GetBatchWithDict[Spaced]</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8535">ARROW-8535</a> - [Rust] Arrow crate does not specify arrow-flight version</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8536">ARROW-8536</a> - [Rust] Failed to locate format/Flight.proto in any parent directory</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8537">ARROW-8537</a> - [C++] Performance regression from ARROW-8523</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8539">ARROW-8539</a> - [CI] “AMD64 MacOS 10.15 GLib &amp; Ruby” fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8554">ARROW-8554</a> - [C++][Benchmark] Fix building error “cannot bind lvalue”</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8556">ARROW-8556</a> - [R] zstd symbol not found if there are multiple installations of zstd</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8566">ARROW-8566</a> - [R] error when writing POSIXct to spark</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8568">ARROW-8568</a> - [C++][Python] Crash on decimal cast in debug mode</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8577">ARROW-8577</a> - [Plasma] PlasmaClient::Connect() of CUDA enabled build is always failed on no CUDA device machine</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8583">ARROW-8583</a> - [C++][Doc] Undocumented parameter in Dataset namespace</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8584">ARROW-8584</a> - [Packaging][C++] Protobuf link error in deb builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8585">ARROW-8585</a> - [Packaging][Python] Windows wheels fail to build because of link error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8586">ARROW-8586</a> - [R] installation failure on CentOS 7</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8587">ARROW-8587</a> - [C++] Compilation error when linking arrow-flight-perf-server</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8592">ARROW-8592</a> - [C++] Docs still list LLVM 7 as compiler used</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8593">ARROW-8593</a> - [C++] Parquet file_serialize_test.cc fails to build with musl libc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8598">ARROW-8598</a> - [Rust] simd_compare_op creates buffer of incorrect length when item count is not a multiple of T::lanes()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8602">ARROW-8602</a> - [CMake] Fix ws2_32 link issue when cross-compiling on Linux</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8603">ARROW-8603</a> - [Documentation] Fix Sphinx doxygen comment</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8604">ARROW-8604</a> - [R][CI] Update CI to use R 4.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8608">ARROW-8608</a> - [C++] Update vendored mpark/variant.h to latest to fix NVCC compilation issues</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8609">ARROW-8609</a> - [C++] ORC JNI bridge crashed on null arrow buffer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8610">ARROW-8610</a> - [Rust] DivideByZero when running arrow crate when simd feature is disabled</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8613">ARROW-8613</a> - [C++][Dataset] Raise error for unparsable partition value</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8615">ARROW-8615</a> - [R] Error better and insist on RandomAccessFile in read_feather</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8617">ARROW-8617</a> - [Rust] simd_load_set_invalid does not exist on aarch64</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8632">ARROW-8632</a> - [C++] Fix conversion error warning in array_union_test.cc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8641">ARROW-8641</a> - [Python] Regression in feather: no longer supports permutation in column selection</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8643">ARROW-8643</a> - [Python] Tests with pandas master failing due to freq assertion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8644">ARROW-8644</a> - [Python] Dask integration tests failing due to change in not including partition columns</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8646">ARROW-8646</a> - [Java] Allow UnionListWriter to write null values</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8649">ARROW-8649</a> - [Java] [Website] Java documentation on website is hidden</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8657">ARROW-8657</a> - [Python][C++][Parquet] Forward compatibility issue from 0.16 to 0.17 when using version=’2.0’</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8663">ARROW-8663</a> - [Documentation] Small correction to building.rst</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8680">ARROW-8680</a> - [Rust] ComplexObjectArrayReader incorrect null value shuffling</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8684">ARROW-8684</a> - [Python] “SystemError: Bad call flags in _PyMethodDef_RawFastCallDict” in Python 3.7.7 on macOS when using pyarrow wheel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8689">ARROW-8689</a> - [C++] S3 benchmarks fail linking</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8693">ARROW-8693</a> - [Python] Dataset.get_fragments is missing an implicit cast when filtering</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8694">ARROW-8694</a> - [Python][Parquet] parquet.read_schema() fails when loading wide table created from Pandas DataFrame</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8701">ARROW-8701</a> - [Rust] Unresolved import `crate::compute::util::simd_load_set_invalid` on Raspberry Pi</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8704">ARROW-8704</a> - [C++] Fix Parquet crash on invalid input (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8705">ARROW-8705</a> - [Java] ComplexCopier is skipping null values</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8706">ARROW-8706</a> - [C++][Parquet] Tracking JIRA for PARQUET-1857 (unencrypted INT16_MAX Parquet row group limit)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8710">ARROW-8710</a> - [Rust] Continuation marker not written correctly in IPC writer, and stream not flushed</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8722">ARROW-8722</a> - [Dev] “archery docker run -e” doesn’t work</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8726">ARROW-8726</a> - [C++][Dataset] Mis-specified DirectoryPartitioning incorrectly uses the file name as value</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8728">ARROW-8728</a> - [C++] Bitmap operation may cause buffer overflow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8729">ARROW-8729</a> - [C++][Dataset] Only selecting a partition column results in empty table</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8734">ARROW-8734</a> - [R] improve nightly build installation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8741">ARROW-8741</a> - [Python][Packaging] Keep VS2015 with for the windows wheels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8750">ARROW-8750</a> - [Python] pyarrow.feather.write_feather does not default to lz4 compression if it’s available</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8768">ARROW-8768</a> - [R][CI] Fix nightly as-cran spurious failure</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8775">ARROW-8775</a> - [C++][FlightRPC] Integration client doesn’t run integration tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8776">ARROW-8776</a> - [FlightRPC][C++] Flight/C++ middleware don’t receive headers on failed calls to Java servers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8798">ARROW-8798</a> - [C++] Fix Parquet crashes on invalid input (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8799">ARROW-8799</a> - [C++][Dataset] Reading list column as nested dictionary segfaults</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8801">ARROW-8801</a> - [Python] Memory leak on read from parquet file with UTC timestamps using pandas</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8802">ARROW-8802</a> - [C++][Dataset] Schema metadata are lost when reading a subset of columns</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8803">ARROW-8803</a> - [Java] Row count should be set before loading buffers in VectorLoader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8808">ARROW-8808</a> - [Rust] Divide by zero in arrays/builder.rs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8809">ARROW-8809</a> - [Rust] schema mismatch in integration test</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8811">ARROW-8811</a> - [Java] Fix build on master</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8820">ARROW-8820</a> - [C++][Gandiva] fix date_trunc functions to return date types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8821">ARROW-8821</a> - [Rust] nested binary expression with Like, NotLike and Not operator results in type cast error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8825">ARROW-8825</a> - [C++] Cannot compiled pass with Wunused-parameter flag</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8826">ARROW-8826</a> - [Crossbow] remote URL should always have .git</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8832">ARROW-8832</a> - [Python] AttributeError: module ‘pyarrow.fs’ has no attribute ‘S3FileSystem’</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8848">ARROW-8848</a> - [CI][C/Glib] MinGW build error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8848">ARROW-8848</a> - [CI][C/Glib] MinGW build error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8858">ARROW-8858</a> - [FlightRPC] Ensure headers are uniformly exposed</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8860">ARROW-8860</a> - [C++] IPC/Feather decompression broken for nested arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8862">ARROW-8862</a> - [C++] NumericBuilder does not use MemoryPool passed to CTOR</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8863">ARROW-8863</a> - [C++] Array subclass constructors must set ArrayData::null_count to 0 when there is no validity bitmap</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8869">ARROW-8869</a> - [Rust] [DataFusion] Type Coercion optimizer rule does not support new scan nodes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8871">ARROW-8871</a> - [C++] Gandiva build failure</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8872">ARROW-8872</a> - [CI] Travis-CI jobs fail (can’t open file ‘ci/detect-changes.py’)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8874">ARROW-8874</a> - [C++][Dataset] Scanner::ToTable race when ScanTask exit early with an error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8878">ARROW-8878</a> - [R] try_download is confused when download.file.method isn’t default</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8882">ARROW-8882</a> - [C#] Add .editorconfig to C# code</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8888">ARROW-8888</a> - [Python] Heuristic in dataframe_to_arrays that decides to multithread convert cause slow conversions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8889">ARROW-8889</a> - [Python] Python 3.7 SIGSEGV when comparing RecordBatch to None</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8892">ARROW-8892</a> - [C++][CI] CI builds for MSVC do not build benchmarks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8909">ARROW-8909</a> - [Java] Out of order writes using setSafe</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8911">ARROW-8911</a> - [C++] Slicing a ChunkedArray with zero chunks segfaults</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8924">ARROW-8924</a> - [C++][Gandiva] castDATE_date32() may cause overflow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8925">ARROW-8925</a> - [Rust] [DataFusion] CsvExec::schema() returns incorrect results</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8930">ARROW-8930</a> - [C++] libz.so linking error with liborc.a</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8932">ARROW-8932</a> - [C++] symbol resolution failures with liborc.a</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8946">ARROW-8946</a> - [Python] Add tests for parquet.write_metadata metadata_collector</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8948">ARROW-8948</a> - [Java][Integration] enable duplicate field names integration tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8951">ARROW-8951</a> - [C++] Fix compiler warning in compute/kernels/scalar_cast_temporal.cc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8954">ARROW-8954</a> - [Website] ca-certificates should be listed in installation instructions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8957">ARROW-8957</a> - [FlightRPC][C++] Fail to build due to IpcOptions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8959">ARROW-8959</a> - [Rust] Broken build due to new benchmark crate using old API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8962">ARROW-8962</a> - [C++] Linking failure with clang-4.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8968">ARROW-8968</a> - [C++][Gandiva] Show link warning message on s390x</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8975">ARROW-8975</a> - [FlightRPC][C++] Fix flaky MacOS tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8977">ARROW-8977</a> - [R] Table$create with schema crashes with some dictionary index types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8978">ARROW-8978</a> - [C++][Compute] “Conditional jump or move depends on uninitialised value(s)” Valgrind warning</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8980">ARROW-8980</a> - [Python] Metadata grows exponentially when using schema from disk</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8982">ARROW-8982</a> - [CI] Remove allow_failures for s390x in TravisCI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8986">ARROW-8986</a> - [Archery][ursabot] Fix benchmark diff checkout of origin/master</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9000">ARROW-9000</a> - [Java] build crashes with JDK14</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9009">ARROW-9009</a> - [C++][Dataset] ARROW:schema should be removed from schema’s metadata when reading Parquet files</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9013">ARROW-9013</a> - [C++] Validate enum-style CMake options</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9020">ARROW-9020</a> - [Python] read_json won’t respect explicit_schema in parse_options</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9024">ARROW-9024</a> - [C++/Python] Install anaconda-client in conda-clean job</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9026">ARROW-9026</a> - [C++/Python] Force package removal from arrow-nightlies conda repository</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9037">ARROW-9037</a> - [C++][C] unable to import array with null count == -1 (which could be exported)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9057">ARROW-9057</a> - [Rust] Projection should work on InMemoryScan without error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9059">ARROW-9059</a> - [Rust] Documentation for slicing array data has the wrong sign</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9066">ARROW-9066</a> - [Python] Raise correct error in isnull()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9071">ARROW-9071</a> - [C++] MakeArrayOfNull makes invalid ListArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9077">ARROW-9077</a> - [C++] Fix aggregate/scalar-compare benchmark null_percent calculation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9080">ARROW-9080</a> - [C++] arrow::AllocateBuffer returns a Result&lt;unique_ptr&lt;Buffer&gt;&gt;</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9082">ARROW-9082</a> - [Rust] - Stream reader fail when steam not ended with (optional) 0xFFFFFFFF 0x00000000”</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9084">ARROW-9084</a> - [C++] CMake is unable to find zstd target when ZSTD_SOURCE=SYSTEM</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9085">ARROW-9085</a> - [C++][CI] Appveyor CI test failures</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9087">ARROW-9087</a> - [C++] Missing HDFS options parsing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9098">ARROW-9098</a> - RecordBatch::ToStructArray cannot handle record batches with 0 column</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9105">ARROW-9105</a> - [C++] ParquetFileFragment scanning doesn’t handle filter on partition field</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9120">ARROW-9120</a> - [C++] Lint and Format C++ files with “codegen” in file name</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9121">ARROW-9121</a> - [C++] Do not wipe the filesystem when path is empty</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9122">ARROW-9122</a> - [C++] Adapt ascii_lower/ascii_upper bulk transforms to work on sliced arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9126">ARROW-9126</a> - [C++] Trimmed Boost bundle fails to build on Windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9127">ARROW-9127</a> - [Rust] Update thrift library dependencies</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9134">ARROW-9134</a> - [Python] Parquet partitioning degrades Int32 to float64</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9141">ARROW-9141</a> - [R] Update cross-package documentation links</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9142">ARROW-9142</a> - [C++] random::RandomArrayGenerator::Boolean “probability” misdocumented / incorrect</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9143">ARROW-9143</a> - [C++] RecordBatch::Slice erroneously sets non-nullable field’s internal null_count to unknown</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9146">ARROW-9146</a> - [C++][Dataset] Scanning a Fragment with a filter + mismatching schema shouldn’t abort</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9151">ARROW-9151</a> - [R][CI] Fix Rtools 4.0 build: pacman sync</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9160">ARROW-9160</a> - [C++] Implement string/binary contains for exact matches</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9174">ARROW-9174</a> - [Go] Tests panic with 386 (x86) builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9183">ARROW-9183</a> - [C++] Failed to build arrow-cpp with gcc 4.9.2</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9184">ARROW-9184</a> - [Rust][Datafusion] table scan without projection should return all columns</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9194">ARROW-9194</a> - [C++] Array::GetScalar not implemented for decimal type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9195">ARROW-9195</a> - [Java] Wrong usage of Unsafe.get from bytearray in ByteFunctionsHelper class</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9209">ARROW-9209</a> - [C++] Benchmarks fail to build ARROW_IPC=OFF and ARROW_BUILD_TESTS=OFF</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9219">ARROW-9219</a> - [R] coerce_timestamps in Parquet write options does not work</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9221">ARROW-9221</a> - ArrowBuf#setBytes(int, ByteBuffer) doesn’t check the byte buffer’s endianness</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9223">ARROW-9223</a> - [Python] Fix to_pandas() export for timestamps within structs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9230">ARROW-9230</a> - [FlightRPC][Python] flight.connect() doesn’t pass through all arguments</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9233">ARROW-9233</a> - [C++] is_null on NullArray should be true for all values</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9236">ARROW-9236</a> - [Rust] CSV WriterBuilder never writes header</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9237">ARROW-9237</a> - [R] 0.17 install on Arch Linux</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9238">ARROW-9238</a> - [C++][CI] A few test coverages of round-robin in ipc and flight</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9252">ARROW-9252</a> - [Integration] GitHub Actions integration test job does not test against “gold” 0.14.1 files in apache/arrow-testing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9260">ARROW-9260</a> - [CI] “ARM64v8 Ubuntu 20.04 C++” fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9260">ARROW-9260</a> - [CI] “ARM64v8 Ubuntu 20.04 C++” fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9261">ARROW-9261</a> - [Python][Packaging] S3FileSystem curl errors in manylinux wheels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9274">ARROW-9274</a> - [Rust] [Integration Testing] Read i64 from json files as strings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9282">ARROW-9282</a> - [R] Remove usage of _EXTPTR_PTR</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9284">ARROW-9284</a> - [Java] getMinorTypeForArrowType returns sparse minor type for dense union types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9288">ARROW-9288</a> - [C++][Dataset] Discovery of partition field as dictionary type segfaulting with HivePartitioning</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9297">ARROW-9297</a> - [C++][Dataset] Dataset scanner cannot handle large binary column (&gt; 2 GB)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9298">ARROW-9298</a> - [C++] Fix crashes on invalid input (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9303">ARROW-9303</a> - [R] Linux static build should always bundle dependencies</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9305">ARROW-9305</a> - [Python] Dependency load failure in Windows wheel build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9315">ARROW-9315</a> - [Java] Fix the failure of testAllocationManagerType</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9317">ARROW-9317</a> - [Java] A few testcases for arrow-memory</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9326">ARROW-9326</a> - [Python] Setuptools 49.1.0 appears to break our Python 3.6 builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9326">ARROW-9326</a> - [Python] Setuptools 49.1.0 appears to break our Python 3.6 builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9326">ARROW-9326</a> - [Python] Setuptools 49.1.0 appears to break our Python 3.6 builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9330">ARROW-9330</a> - [C++] Fix crashes on corrupt IPC input (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9334">ARROW-9334</a> - [Dev][Archery] Push ancestor docker images</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9336">ARROW-9336</a> - [Ruby] Creating RecordBatch with structs missing keys results in a malformed table</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9343">ARROW-9343</a> - [C++][Gandiva] CastINT/Float functions from string should handle leading/trailing white spaces</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9347">ARROW-9347</a> - [Python] Tests fail with latest fsspec</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9350">ARROW-9350</a> - [C++][CI] Nightly valgrind job failures</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9351">ARROW-9351</a> - [C++][CI] Nightly test-ubuntu-18.04-cpp-cmake32 fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9353">ARROW-9353</a> - [Python][CI] Nightly dask integration jobs fail</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9354">ARROW-9354</a> - [C++] Turbodbc latest fails to build in the integration tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9355">ARROW-9355</a> - [R] Fix -Wimplicit-int-float-conversion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9360">ARROW-9360</a> - [CI][Crossbow] Nightly homebrew-cpp job times out</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9363">ARROW-9363</a> - [C++][Dataset] ParquetDatasetFactory schema: pandas metadata is lost</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9368">ARROW-9368</a> - [Python] Rename predicate argument to filter in split_by_row_group()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9373">ARROW-9373</a> - [C++] Fix Parquet crash on invalid input (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9380">ARROW-9380</a> - [C++] Segfaults in compute::CallFunction</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9384">ARROW-9384</a> - [C++] Out-of-memory on invalid IPC input (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9385">ARROW-9385</a> - [Python] [CI] jpype integration failure</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9389">ARROW-9389</a> - [C++] Can’t call isin/match through CallFunction</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9397">ARROW-9397</a> - [R] Pass CC/CXX to cmake when building libarrow in Linux build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9408">ARROW-9408</a> - [Integration] Tests do not run in Windows due to numpy 64-bit errors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9409">ARROW-9409</a> - [CI][Crossbow] Nightly conda-r fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9410">ARROW-9410</a> - [CI][Crossbow] Fix homebrew-cpp again</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9413">ARROW-9413</a> - [Rust] Fix clippy lint on master</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9415">ARROW-9415</a> - [C++] Arrow does not compile on Power9</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9416">ARROW-9416</a> - [Go] Add test cases for some datatypes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9417">ARROW-9417</a> - [C++][IPC] size in message written in native endian</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9418">ARROW-9418</a> - [R] nyc-taxi Parquet files not downloaded in binary mode on Windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9419">ARROW-9419</a> - [C++] Test that “fill_null” function works with sliced inputs, expand tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9428">ARROW-9428</a> - [C++] Update documentation for buffer allocation functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9436">ARROW-9436</a> - [C++][CI] Valgrind errors in fill_null kernel tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9438">ARROW-9438</a> - [CI] Spark integration tests are failing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9439">ARROW-9439</a> - [C++] Fix crash on invalid IPC input (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9440">ARROW-9440</a> - [Python] Expose Fill Null Compute Kernel in PyArrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9443">ARROW-9443</a> - [C++] Bundled bz2 build should only build libbz2</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9448">ARROW-9448</a> - [Java] Circular initialization between ArrowBuf and BaseAllocator leads to null HistoricalLog for empty buffer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9449">ARROW-9449</a> - [R] Strip arrow.so</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9450">ARROW-9450</a> - [Python] “pytest pyarrow” takes over 10 seconds to collect tests and start executing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9456">ARROW-9456</a> - [Python] Dataset segfault when not importing pyarrow.parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9458">ARROW-9458</a> - [Python] Dataset Scanner is single-threaded only</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9460">ARROW-9460</a> - [C++] BinaryContainsExact doesn’t cope with double characters in the pattern</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9461">ARROW-9461</a> - [Rust] Reading Date32 and Date64 errors - they are incorrectly converted to RecordBatch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9476">ARROW-9476</a> - [C++][Dataset] HivePartitioning discovery with dictionary types fails for multiple fields</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9486">ARROW-9486</a> - [C++][Dataset] Support implicit casting InExpression::set_ to dict</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9497">ARROW-9497</a> - [C++][Parquet] Fix failure caused by malformed repetition/definition levels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9499">ARROW-9499</a> - [C++] AdaptiveIntBuilder::AppendNull does not increment the null count</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9500">ARROW-9500</a> - [C++] Fix segfault with std::to_string in -O3 builds on gcc 7.5.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9501">ARROW-9501</a> - [C++][Gandiva] Add logic in timestampdiff() when end date is last day of a month</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9503">ARROW-9503</a> - [Rust] Comparison sliced arrays is wrong</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9504">ARROW-9504</a> - [Python] Segmentation fault on ChunkedArray.take</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9506">ARROW-9506</a> - [Packaging][Python] Fix macOS wheel build failures</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9512">ARROW-9512</a> - [C++] Variadic template unpack inside lambda doesn’t compile with gcc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9524">ARROW-9524</a> - [CI][Gandiva] C++ unit test arrow-ipc-read-write failing in gandiva nightly build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9527">ARROW-9527</a> - [Rust] Remove un-needed dev-dependencies</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1839">PARQUET-1839</a> - [C++] values_read not updated in ReadBatchSpaced</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1857">PARQUET-1857</a> - [C++][Parquet] ParquetFileReader unable to read files with more than 32767 row groups</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1865">PARQUET-1865</a> - [C++] Failure from C++17 feature used in parquet/encoding_benchmark.cc</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1877">PARQUET-1877</a> - [C++] Reconcile container size with string size for memory issues</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1882">PARQUET-1882</a> - [C++] Writing an all-null column and then reading it with buffered_stream aborts the process</li>
</ul>
<h3 id="new-features-and-improvements">New Features and Improvements</h3>
<ul>
<li><a href="https://issues.apache.org/jira/browse/ARROW-300">ARROW-300</a> - [Format] Add body buffer compression option to IPC message protocol using LZ4 or ZSTD</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-842">ARROW-842</a> - [Python] Handle more kinds of null sentinel objects from pandas 0.x</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-971">ARROW-971</a> - [C++/Python] Implement Array.isvalid/notnull/isnull as scalar functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-974">ARROW-974</a> - [Website] Add Use Cases section to the website</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-1277">ARROW-1277</a> - Completing integration tests for major implemented data types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-1567">ARROW-1567</a> - [C++] Implement “fill null” kernels that replace null values with some scalar replacement value</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-1570">ARROW-1570</a> - [C++] Define API for creating a kernel instance from function of scalar input and output with a particular signature</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-1682">ARROW-1682</a> - [Python] Add documentation / example for reading a directory of Parquet files on S3</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-1796">ARROW-1796</a> - [Python] RowGroup filtering on file level</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-2260">ARROW-2260</a> - [C++][Plasma] plasma_store should show usage</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-2444">ARROW-2444</a> - [Python][C++] Better handle reading empty parquet files</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-2702">ARROW-2702</a> - [Python] Examine usages of Invalid and TypeError errors in numpy_to_arrow.cc to see if we are using the right error type in each instance</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-2714">ARROW-2714</a> - [C++/Python] Variable step size slicing for arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-2912">ARROW-2912</a> - [Website] Build more detailed Community landing page a la Apache Spark</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3089">ARROW-3089</a> - [Rust] Add ArrayBuilder for different Arrow arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3134">ARROW-3134</a> - [C++] Implement n-ary iterator for a collection of chunked arrays with possibly different chunking layouts</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3154">ARROW-3154</a> - [Python][C++] Document how to write _metadata, _common_metadata files with Parquet datasets</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3244">ARROW-3244</a> - [Python] Multi-file parquet loading without scan</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3275">ARROW-3275</a> - [Python] Add documentation about inspecting Parquet file metadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3308">ARROW-3308</a> - [R] Convert R character vector with data exceeding 2GB to Large type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3317">ARROW-3317</a> - [R] Test/support conversions from data.frame with a single character column exceeding 2GB capacity of BinaryArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3446">ARROW-3446</a> - [R] Document mapping of Arrow &lt;-&gt; R types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3509">ARROW-3509</a> - [C++] Inconsistent child accessor naming</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3520">ARROW-3520</a> - [C++] Implement List Flatten kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3688">ARROW-3688</a> - [Rust] Implement PrimitiveArrayBuilder&lt;T&gt;.push_values</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3827">ARROW-3827</a> - [Rust] Implement UnionArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4022">ARROW-4022</a> - [C++] Promote Datum variant out of compute namespace</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4221">ARROW-4221</a> - [Format] Add canonical flag in COO sparse index</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4390">ARROW-4390</a> - [R] Serialize “labeled” metadata in Feather files, IPC messages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4412">ARROW-4412</a> - [DOCUMENTATION] Add explicit version numbers to the arrow specification documents.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4427">ARROW-4427</a> - [Doc] Move Confluence Wiki pages to the Sphinx docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4429">ARROW-4429</a> - [Doc] Add git rebase tips to the ‘Contributing’ page in the developer docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5035">ARROW-5035</a> - [C#] ArrowBuffer.Builder&lt;bool&gt; is broken</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5082">ARROW-5082</a> - [Python][Packaging] Reduce size of macOS and manylinux1 wheels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5143">ARROW-5143</a> - [Flight] Enable integration testing of batches with dictionaries</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5279">ARROW-5279</a> - [C++] Support reading delta dictionaries in IPC streams</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5377">ARROW-5377</a> - [C++] Make IpcPayload public and add GetPayloadSize</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5489">ARROW-5489</a> - [C++] Normalize kernels and ChunkedArray behavior</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5548">ARROW-5548</a> - [Documentation] http://arrow.apache.org/docs/latest/ is not latest</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5649">ARROW-5649</a> - [Integration][C++] Create round trip integration test for extension types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5708">ARROW-5708</a> - [C#] Null support for BooleanArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5760">ARROW-5760</a> - [C++] Optimize Take implementation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5854">ARROW-5854</a> - [Python] Expose compare kernels on Array class</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6052">ARROW-6052</a> - [C++] Divide up arrow/array.h,cc into files in arrow/array/ similar to builder files</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6110">ARROW-6110</a> - [Java] Support LargeList Type and add integration test with C++</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6111">ARROW-6111</a> - [Java] Support LargeVarChar and LargeBinary types and add integration test with C++</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6439">ARROW-6439</a> - [R] Implement S3 file-system interface in R</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6456">ARROW-6456</a> - [C++] Possible to reduce object code generated in compute/kernels/take.cc?</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6501">ARROW-6501</a> - [C++] Remove non_zero_length field from SparseIndex</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6521">ARROW-6521</a> - [C++] Add function to arrow:: namespace that returns the current ABI version</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6543">ARROW-6543</a> - [R] Support LargeBinary and LargeString types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6602">ARROW-6602</a> - [Doc] Add feature / implementation matrix</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6603">ARROW-6603</a> - [C#] ArrayBuilder API to support writing nulls</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6645">ARROW-6645</a> - [Python] Faster boundschecking of dictionary indices when converting to Categorical</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6689">ARROW-6689</a> - [Rust] [DataFusion] Query execution enhancements for 1.0.0 release</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6691">ARROW-6691</a> - [Rust] [DataFusion] Use tokio and Futures instead of spawning threads</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6775">ARROW-6775</a> - [C++] [Python] Proposal for several Array utility functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6776">ARROW-6776</a> - [Python] Need a lite version of pyarrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6800">ARROW-6800</a> - [C++] Add CMake option to build libraries targeting a C++14 or C++17 toolchain environment</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6839">ARROW-6839</a> - [Java] Add APIs to read and write “custom_metadata” field of IPC file footer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6856">ARROW-6856</a> - [C++] Use ArrayData instead of Array for ArrayData::dictionary</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6917">ARROW-6917</a> - ARROW-6917: [Archery][Release] Add support for JIRA curation, changelog generation and commit cherry-picking for maintenance releases</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6945">ARROW-6945</a> - [Rust] Enable integration tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6959">ARROW-6959</a> - [C++] Clarify what signatures are preferred for compute kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6978">ARROW-6978</a> - [R] Add bindings for sum and mean compute kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6979">ARROW-6979</a> - [R] Enable jemalloc in autobrew formula</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6982">ARROW-6982</a> - [R] Add bindings for compare and boolean kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7009">ARROW-7009</a> - [C++] Refactor filter/take kernels to use Datum instead of overloads</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7010">ARROW-7010</a> - [C++] Support lossy casts from decimal128 to float32 and float64/double</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7011">ARROW-7011</a> - [C++] Implement casts from float/double to decimal128</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7012">ARROW-7012</a> - [C++] Clarify ChunkedArray chunking strategy and policy</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7068">ARROW-7068</a> - [C++] Expose the offsets of a ListArray as a Int32Array</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7075">ARROW-7075</a> - [C++] Boolean kernels should not allocate in Call()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7175">ARROW-7175</a> - [Website] Add a security page to track when vulnerabilities are patched</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7229">ARROW-7229</a> - [C++] Unify ConcatenateTables APIs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7230">ARROW-7230</a> - [C++] Use vendored std::optional instead of boost::optional in Gandiva</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7237">ARROW-7237</a> - [C++] Add Result&lt;T&gt; to APIs to arrow/json</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7243">ARROW-7243</a> - [Docs] Add common “implementation status” table to the README of each native language implementation, as well as top level README</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7285">ARROW-7285</a> - [C++] ensure C++ implementation meets clarified dictionary spec</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7300">ARROW-7300</a> - [C++][Gandiva] Implement functions to cast from strings to integers/floats</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7313">ARROW-7313</a> - [C++] Add function for retrieving a scalar from an array slot</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7371">ARROW-7371</a> - [GLib] Add Datasets binding</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7375">ARROW-7375</a> - [Python] Expose C++ MakeArrayOfNull</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7391">ARROW-7391</a> - [Python] Remove unnecessary classes from the binding layer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7495">ARROW-7495</a> - [Java] Remove “empty” concept from ArrowBuf, replace with custom referencemanager</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7605">ARROW-7605</a> - [C++] Create and install static library containing all dependencies built by Arrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7607">ARROW-7607</a> - [C++] Add to cpp/examples minimal examples of using Arrow as a dependency of another CMake project</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7673">ARROW-7673</a> - [C++][Dataset] Revisit File discovery failure mode</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7676">ARROW-7676</a> - [Packaging][Python] Ensure that the static libraries are not built in the wheel scripts</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7699">ARROW-7699</a> - [Java] Support concating dense union vectors in batch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7705">ARROW-7705</a> - [Rust] Initial sort implementation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7717">ARROW-7717</a> - [CI] Have nightly integration test for Spark’s latest release</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7759">ARROW-7759</a> - [C++][Dataset] Add CsvFileFormat for CSV support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7778">ARROW-7778</a> - [C++] Support nested dictionaries in JSON integration format</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7784">ARROW-7784</a> - [C++] diff.cc is extremely slow to compile</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7801">ARROW-7801</a> - [Developer] Add issue_comment workflow to fix lint/style/codegen</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7803">ARROW-7803</a> - [R][CI] Autobrew/homebrew tests should not always install from master</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7831">ARROW-7831</a> - [Java] unnecessary buffer allocation when calling splitAndTransferTo on variable width vectors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7831">ARROW-7831</a> - [Java] unnecessary buffer allocation when calling splitAndTransferTo on variable width vectors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7902">ARROW-7902</a> - [Integration] Unskip nested dictionary integration tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7910">ARROW-7910</a> - [C++] Provide function to query page size portably</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7924">ARROW-7924</a> - [Rust] Add sort for float types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7950">ARROW-7950</a> - [Python] When initializing pandas API shim, inform user if their installed pandas version is too old</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7955">ARROW-7955</a> - [Java] Support large buffer for file/stream IPC</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8020">ARROW-8020</a> - [Java] Implement vector validate functionality</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8023">ARROW-8023</a> - [Website] Write a blog post about the C data interface</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8025">ARROW-8025</a> - [C++] Implement cast to Binary and FixedSizeBinary</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8025">ARROW-8025</a> - [C++] Implement cast to Binary and FixedSizeBinary</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8046">ARROW-8046</a> - [Developer][Integration] Makefile.docker’s target names are broken</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8062">ARROW-8062</a> - [C++][Dataset] Parquet Dataset factory from a _metadata/_common_metadata file</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8065">ARROW-8065</a> - [C++][Dataset] Untangle Dataset, Fragment and ScanOptions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8074">ARROW-8074</a> - [C++][Dataset] Support for file-like objects (buffers) in FileSystemDataset?</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8108">ARROW-8108</a> - [Java] Extract a common interface for dictionary encoders</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8111">ARROW-8111</a> - [C++][CSV] Support MM/DD/YYYY date format</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8114">ARROW-8114</a> - [Java][Integration] Enable custom_metadata integration test</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8121">ARROW-8121</a> - [Java] Enhance code style checking for Java code (add space after commas, semi-colons and type casts)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8149">ARROW-8149</a> - [C++/Python] Enable CUDA Support in conda recipes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8157">ARROW-8157</a> - [C++][Gandiva] Support building with LLVM 9</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8162">ARROW-8162</a> - [Format][Python] Add serialization for CSF sparse tensors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8169">ARROW-8169</a> - [Java] Improve the performance of JDBC adapter by allocating memory proactively</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8171">ARROW-8171</a> - Consider pre-allocating memory for fix-width vector in Avro adapter iterator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8190">ARROW-8190</a> - [C++][Flight] Allow setting IpcWriteOptions and IpcReadOptions in Flight IPC message reader and writer classes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8229">ARROW-8229</a> - [Java] Move ArrowBuf into the Arrow package</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8230">ARROW-8230</a> - [Java] Move Netty memory manager into a separate module</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8261">ARROW-8261</a> - [Rust] [DataFusion] LogicalPlanBuilder.limit() should take a literal argument</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8263">ARROW-8263</a> - [Rust] [DataFusion] Add documentation for supported SQL functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8281">ARROW-8281</a> - [R] Name collision of arrow.dll on Windows conda</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8283">ARROW-8283</a> - [Python][Dataset] Non-existent files are silently dropped in pa.dataset.FileSystemDataset</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8287">ARROW-8287</a> - [Rust] Arrow examples should use utility to print results</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8293">ARROW-8293</a> - [Python] Run flake8 on python/examples also</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8297">ARROW-8297</a> - [FlightRPC][C++] Implement Flight DoExchange for C++</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8301">ARROW-8301</a> - [R] Handle ChunkedArray and Table in C data interface</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8312">ARROW-8312</a> - [Java][Gandiva] improve IN expression support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8314">ARROW-8314</a> - [Python] Provide a method to select a subset of columns of a Table</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8318">ARROW-8318</a> - [C++][Dataset] Dataset should instantiate Fragment</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8399">ARROW-8399</a> - [Rust] Extend memory alignments to include other architectures</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8413">ARROW-8413</a> - [C++] Refactor DefLevelsToBitmap</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8422">ARROW-8422</a> - [Rust] [Parquet] Implement function to convert Arrow schema to Parquet schema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8430">ARROW-8430</a> - [CI] Configure self-hosted runners for Github Actions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8434">ARROW-8434</a> - [C++] Ipc RecordBatchFileReader deserializes the Schema multiple times</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8440">ARROW-8440</a> - [C++] Refine simd header files</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8443">ARROW-8443</a> - [Gandiva][C++] Fix round/truncate to no-op for special cases</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8447">ARROW-8447</a> - [C++][Dataset] Ensure Scanner::ToTable preserve ordering of ScanTasks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8467">ARROW-8467</a> - [C++] Test cases using ArrayFromJSON assume only a little-endian platform</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8474">ARROW-8474</a> - [CI][Crossbow] Skip some nightlies we don’t need to run</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8477">ARROW-8477</a> - [C++] Enable reading and writing of long filenames for Windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8481">ARROW-8481</a> - [Java] Provide an allocation manager based on Unsafe API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8483">ARROW-8483</a> - [Ruby] Arrow::Table documentation improvement</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8485">ARROW-8485</a> - [Integration][Java] Implement extension types integration</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8486">ARROW-8486</a> - [C++] arrow-utility-test causes failures on a big-endian platform</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8487">ARROW-8487</a> - [FlightRPC][C++] Make it possible to target a specific payload size</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8488">ARROW-8488</a> - [R] Replace VALUE_OR_STOP with ValueOrStop</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8496">ARROW-8496</a> - [C++] Refine ByteStreamSplitDecodeScalar</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8497">ARROW-8497</a> - [Archery] Add missing component to builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8499">ARROW-8499</a> - [C++][Dataset] In ScannerBuilder, batch_size will not work if projecter is not empty</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8500">ARROW-8500</a> - [C++] Use selection vectors in Filter implementation for record batches, tables</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8501">ARROW-8501</a> - [Packaging][RPM] Upgrade devtoolset to 8 on CentOS 6</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8502">ARROW-8502</a> - [Release][APT][Yum] Ignore all arm64 verifications</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8504">ARROW-8504</a> - [C++] Add Run Length Reader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8506">ARROW-8506</a> - [c++] Miss tests to verify expected_buffer with bit_width &gt; 8 in RLE</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8507">ARROW-8507</a> - [Release] Detect .git directory automatically in changelog.py</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8509">ARROW-8509</a> - [GLib] Add low level record batch read/write functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8512">ARROW-8512</a> - [C++] Delete unused compute expr prototype code</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8513">ARROW-8513</a> - [Python] Expose Take with Table input in Python</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8515">ARROW-8515</a> - [C++] Bitmap ToString should have an option of grouping by bytes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8516">ARROW-8516</a> - [Rust] Slow BufferBuilder&lt;BooleanType&gt; inserts within PrimitiveBuilder::append_slice</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8517">ARROW-8517</a> - [Developer][Release] Update Crossbow RC verification setup for changes since 0.16.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8520">ARROW-8520</a> - [Developer] Use .asf.yaml to direct GitHub notifications to e-mail lists and JIRA</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8521">ARROW-8521</a> - [Developer] Group Sub-task, Task, Test, and Wish issue types as “Improvement” in Changelog</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8522">ARROW-8522</a> - [Developer] Add environment variable option to toggle whether ephemeral NodeJS is installed in release verification script</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8524">ARROW-8524</a> - [CI] Free up space on github actions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8526">ARROW-8526</a> - [Python] Fix non-deterministic row order failure in dataset tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8531">ARROW-8531</a> - [C++] Deprecate ARROW_USE_SIMD CMake option</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8538">ARROW-8538</a> - [Packaging] Remove boost from homebrew formula</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8540">ARROW-8540</a> - [C++] Create memory allocation benchmark</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8541">ARROW-8541</a> - [Release] Don’t remove previous source releases automatically</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8542">ARROW-8542</a> - [Release] Fix checksum url in the website post release script</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8543">ARROW-8543</a> - [C++] IO: single pass coalescing algorithm</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8544">ARROW-8544</a> - [CI][Crossbow] Add a status.json to the gh-pages summary of nightly builds to get around rate limiting</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8548">ARROW-8548</a> - [Website] 0.17 release post</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8549">ARROW-8549</a> - [R] Assorted post-0.17 release cleanups</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8550">ARROW-8550</a> - [CI] Don’t run cron GHA jobs on forks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8551">ARROW-8551</a> - [CI][Gandiva] Use LLVM 8 to build gandiva linux jar</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8552">ARROW-8552</a> - [Rust] support column iteration for parquet row</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8553">ARROW-8553</a> - [C++] Optimize unaligned bitmap operations</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8555">ARROW-8555</a> - [FlightRPC][Java] Implement Flight DoExchange for Java</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8558">ARROW-8558</a> - [Rust] GitHub Actions missing rustfmt</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8559">ARROW-8559</a> - [Rust] Consolidate Record Batch reader traits in main arrow crate</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8560">ARROW-8560</a> - [Rust] Docs for MutableBuffer resize are incorrect</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8561">ARROW-8561</a> - [C++][Gandiva] Stop using deprecated google::protobuf::MessageLite::ByteSize()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8562">ARROW-8562</a> - [C++] IO: Parameterize I/O coalescing using S3 storage metrics</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8563">ARROW-8563</a> - [Go] Minor change to make newBuilder public</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8564">ARROW-8564</a> - [Website] Add Ubuntu 20.04 LTS to supported package list</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8569">ARROW-8569</a> - [CI] Upgrade xcode version for testing homebrew formulae</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8571">ARROW-8571</a> - [C++] Switch AppVeyor image to VS 2017</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8572">ARROW-8572</a> - [Python] Expose UnionArray.array and other fields</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8573">ARROW-8573</a> - [Rust] Upgrade to Rust 1.44 nightly</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8574">ARROW-8574</a> - [Rust] Implement Debug for all plain types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8575">ARROW-8575</a> - [Developer] Add issue_comment workflow to rebase a PR</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8590">ARROW-8590</a> - [Rust] Use Arrow pretty print utility in DataFusion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8591">ARROW-8591</a> - [Rust] Reverse lookup for a key in DictionaryArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8597">ARROW-8597</a> - [Rust] arrow crate lint and readability improvements</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8606">ARROW-8606</a> - [CI] Don’t trigger all builds on a change to any file in ci/</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8607">ARROW-8607</a> - [R][CI] Unbreak builds following R 4.0 release</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8611">ARROW-8611</a> - [R] Can’t install arrow 0.17 on Ubuntu 18.04 R 3.6.3</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8612">ARROW-8612</a> - [GLib] Add GArrowReadOptions and GArrowWriteOptions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8616">ARROW-8616</a> - [Rust] Turn explicit SIMD off by default</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8619">ARROW-8619</a> - [C++] Use distinct Type::type values for interval types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8622">ARROW-8622</a> - [Rust] Parquet crate does not compile on aarch64</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8623">ARROW-8623</a> - [C++][Gandiva] Reduce use of Boost, remove Boost headers from header files</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8624">ARROW-8624</a> - [Website] Install page should mention arrow-dataset packages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8628">ARROW-8628</a> - [CI][Dev] Wrap docker-compose commands with archery</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8629">ARROW-8629</a> - [Rust] Eliminate indirection of ZST allocations</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8633">ARROW-8633</a> - [C++] Add ValidateAscii function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8634">ARROW-8634</a> - [Java] Create an example</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8639">ARROW-8639</a> - [C++][Plasma] Require gflags</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8645">ARROW-8645</a> - [C++] Missing gflags dependency for plasma</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8647">ARROW-8647</a> - [C++][Dataset] Optionally encode partition field values as dictionary type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8648">ARROW-8648</a> - [Rust] Optimize Rust CI Build Times</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8650">ARROW-8650</a> - [Rust] [Website] Add documentation to Arrow website</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8651">ARROW-8651</a> - [Python][Dataset] Support pickling of Dataset objects</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8655">ARROW-8655</a> - [C++][Dataset][Python][R] Preserve partitioning information for a discovered Dataset</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8656">ARROW-8656</a> - [Python] Switch to VS2017 in the windows wheel builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8659">ARROW-8659</a> - [Rust] ListBuilder and FixedSizeListBuilder capacity</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8660">ARROW-8660</a> - [C++][Gandiva] Reduce dependence on Boost</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8662">ARROW-8662</a> - [CI] Consolidate appveyor scripts</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8664">ARROW-8664</a> - [Java] Add skip null check to all Vector types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8668">ARROW-8668</a> - [Packaging][APT][Yum][ARM] Use Travis CI’s ARM machine to build packages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8669">ARROW-8669</a> - [C++] Add IpcWriteOptions argument to GetRecordBatchSize()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8671">ARROW-8671</a> - [C++] Use IPC body compression metadata approved in ARROW-300</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8671">ARROW-8671</a> - [C++] Use IPC body compression metadata approved in ARROW-300</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8682">ARROW-8682</a> - [Ruby][Parquet] Add support for column level compression</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8687">ARROW-8687</a> - [Java] Finish move of io.netty.buffer.ArrowBuf</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8690">ARROW-8690</a> - [Python] Clean-up dataset+parquet tests now order is determinstic</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8692">ARROW-8692</a> - [C++] Avoid memory copies when downloading from S3</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8695">ARROW-8695</a> - [Java] remove references to PlatformDependent in memory module</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8696">ARROW-8696</a> - [Java] Convert tests to integration tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8699">ARROW-8699</a> - [R] Fix automatic r_to_py conversion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8702">ARROW-8702</a> - [Packaging][C#] Build NuGet packages in release process</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8703">ARROW-8703</a> - [R] schema$metadata should be properly typed</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8707">ARROW-8707</a> - [CI] Docker push fails because of wrong dockerhub credentials</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8708">ARROW-8708</a> - [CI] Utilize github actions cache for docker-compose volumes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8711">ARROW-8711</a> - [Python] Expose strptime timestamp parsing in read_csv conversion options</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8717">ARROW-8717</a> - [CI][Packaging] Add build dependency on boost to homebrew</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8720">ARROW-8720</a> - [C++] Fix checked_pointer_cast</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8721">ARROW-8721</a> - [CI] Fix R build matrix</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8723">ARROW-8723</a> - [Rust] Remove SIMD specific benchmark code</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8724">ARROW-8724</a> - [Packaging][deb][RPM] Use directory in host as build directory</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8725">ARROW-8725</a> - [Rust] redundant directory walk in rust parquet datasource code</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8727">ARROW-8727</a> - [C++] Do not require struct-initialization of StringConverter&lt;T&gt; to parse strings to other types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8730">ARROW-8730</a> - [Rust] Use slice instead of &amp;Vec for function arguments</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8733">ARROW-8733</a> - [C++][Dataset][Python] ParquetFileFragment should provide access to parquet FileMetadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8736">ARROW-8736</a> - [Rust] [DataFusion] Table API should provide a schema() method</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8740">ARROW-8740</a> - [CI] Fix archery option in pandas master cron test</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8742">ARROW-8742</a> - [C++][Python] Add flight client support for Mutual TLS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8743">ARROW-8743</a> - [C++][CI] Add a test job on s390x</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8744">ARROW-8744</a> - [Rust] ParquetIterator’s next method should be safe to call even after reached end of iteration</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8745">ARROW-8745</a> - [C++] Bitmap.ToString causes failures on a big-endian platform</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8747">ARROW-8747</a> - [C++] Feather tests with compression cause failure on big-endian platforms</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8751">ARROW-8751</a> - [Rust] ParquetFileArrowReader should be able to read empty parquet file without error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8752">ARROW-8752</a> - [Rust] Remove unused hashmap</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8753">ARROW-8753</a> - [C++][CI] Add a test job on ARM</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8754">ARROW-8754</a> - [C++][CI] enable tests for additional components on big-endian platforms</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8756">ARROW-8756</a> - [C++] Bitmap word tests cause failures on a big-endian platform</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8757">ARROW-8757</a> - [C++] Plasma header is written in native endian</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8758">ARROW-8758</a> - [R] Updates for compatibility with dplyr 1.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8759">ARROW-8759</a> - [C++] TestPlasmaSerialization.DeleteReply tests failure on big-endian platforms</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8762">ARROW-8762</a> - [C++][Gandiva] Replace Gandiva’s BitmapAnd with common implementation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8763">ARROW-8763</a> - [C++] Create RandomAccessFile::WillNeed-like API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8764">ARROW-8764</a> - [C++] Make ThreadPool configurable in ReadRangeCache</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8766">ARROW-8766</a> - [Python] A FileSystem implementation based on Python callbacks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8769">ARROW-8769</a> - [C++] Add convenience methods to access fields by name in StructScalar</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8770">ARROW-8770</a> - [C++][CI] enable arrow-csv-test on s390x</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8772">ARROW-8772</a> - [C++] Expand SumKernel benchmark to more types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8777">ARROW-8777</a> - [Rust] Parquet.rs does not support reading fixed-size binary fields.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8778">ARROW-8778</a> - [C++][Gandiva] SelectionVector related test failed on big-endian platforms</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8779">ARROW-8779</a> - [R] Implement conversion to List&lt;Struct&gt;</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8781">ARROW-8781</a> - [CI][C++] Enable ccache on GHA MinGW jobs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8782">ARROW-8782</a> - [Rust] [DataFusion] Add benchmarks based on NYC Taxi data set</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8783">ARROW-8783</a> - [Rust] [DataFusion] Logical plan should have ParquetScan and CsvScan entries</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8784">ARROW-8784</a> - [Rust] [DataFusion] Remove use of Arc from LogicalPlan</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8785">ARROW-8785</a> - [Python][Packaging] Build the windows wheels with MIMALLOC enabled</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8786">ARROW-8786</a> - [Packaging][rpm] Use bundled zstd in the CentOS 8 build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8788">ARROW-8788</a> - [C#] Array builders to use bit-packed buffer builder rather than boolean array builder for validity map</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8789">ARROW-8789</a> - [Rust] Add separate crate for integration test binaries</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8790">ARROW-8790</a> - [C++][CI] Enable arrow-flight-test on s390x</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8791">ARROW-8791</a> - [Rust] Creating StringDictionaryBuilder with existing dictionary values</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8792">ARROW-8792</a> - [C++] Improved declarative compute function / kernel development framework, normalize calling conventions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8793">ARROW-8793</a> - [C++] BitUtil::SetBitsTo probably doesn’t need to be inline</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8794">ARROW-8794</a> - [C++] Expand benchmark coverage for arrow from parquet reading</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8795">ARROW-8795</a> - [C++] Limited iOS support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8800">ARROW-8800</a> - [C++] Split arrow::ChunkedArray into arrow/chunked_array.h</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8804">ARROW-8804</a> - [R][CI] Followup to Rtools40 upgrade</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8814">ARROW-8814</a> - [Dev][Release] Binary upload script keeps raising locale warnings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8815">ARROW-8815</a> - [Dev][Release] Binary upload script should retry on unexpected bintray request error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8818">ARROW-8818</a> - [Rust] Failing to build on master due to Flatbuffers/Union issues</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8822">ARROW-8822</a> - [Rust] [DataFusion] Add MemoryScan variant to LogicalPlan</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8827">ARROW-8827</a> - [Integration Testing] Initial skeleton for Rust integration tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8830">ARROW-8830</a> - [GLib] Add support for Tell againt not seekable GIO output stream</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8831">ARROW-8831</a> - [Rust] incomplete SIMD implementation in simd_compare_op</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8833">ARROW-8833</a> - [Rust] Implement VALIDATE mode in integration test binary</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8834">ARROW-8834</a> - [Rust] Implement arrow-file-to-stream for integration testing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8835">ARROW-8835</a> - [Rust] Implement arrow-stream-to-file for integration testing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8836">ARROW-8836</a> - [Website] Update copyright end year automatically</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8837">ARROW-8837</a> - [Rust] Add Null type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8838">ARROW-8838</a> - [Rust] File reader fails to read header from valid files</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8839">ARROW-8839</a> - [Rust] datafusion logical plan should support scaning csv without provided schema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8840">ARROW-8840</a> - [Rust] datafusion ExecutionError should implement std::error:Error trait</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8841">ARROW-8841</a> - [C++] Add benchmark and unittest for PLAIN spaced</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8843">ARROW-8843</a> - [C++] Optimize BitmapEquals unaligned case</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8844">ARROW-8844</a> - [C++] Optimize TransferBitmap unaligned case</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8846">ARROW-8846</a> - [Dev][Python] Autoformat Python sources with Archery</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8847">ARROW-8847</a> - [C++] Pass task size / metrics in Executor API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8851">ARROW-8851</a> - [Python][Documentation] Fix FutureWarnings in Python Plasma docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8852">ARROW-8852</a> - [R] Post-0.17.1 adjustments</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8854">ARROW-8854</a> - [Rust] [Integration Testing] Show output from arrow-json-integration-test</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8855">ARROW-8855</a> - [Rust] [Integration Testing] data type Date32(Day) not supported</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8856">ARROW-8856</a> - [Rust] [Integration Testing] Return empty batch if MessageHeader is NONE</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8864">ARROW-8864</a> - [R] Add methods to Table/RecordBatch for consistency with data.frame</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8866">ARROW-8866</a> - [C++] Split Type::UNION into Type::SPARSE_UNION and Type::DENSE_UNION</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8867">ARROW-8867</a> - [R] Support converting POSIXlt type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8875">ARROW-8875</a> - [C++] use AWS SDK SetResponseStreamFactory to avoid a copy of bytes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8877">ARROW-8877</a> - [Rust] add CSV read option struct to simplify datafusion interface</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8880">ARROW-8880</a> - [R][Linux] Make R Binary Install Friendlier</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8881">ARROW-8881</a> - [Rust] Add large list and binary support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8885">ARROW-8885</a> - [R] Don’t include everything everywhere</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8886">ARROW-8886</a> - [C#] Decide and implement appropriate behaviour for Array builder resize to negative size</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8887">ARROW-8887</a> - [Java] Buffer size for complex vectors increases rapidly in case of clear/write loop</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8890">ARROW-8890</a> - [R] Fix C++ lint issue</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8895">ARROW-8895</a> - [C++] Add C++ unit tests for filter and take functions on temporal type inputs, including timestamps</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8896">ARROW-8896</a> - [C++] Reimplement dictionary unpacking in Cast kernels using Take</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8899">ARROW-8899</a> - [R] Add R metadata like pandas metadata for round-trip fidelity</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8901">ARROW-8901</a> - [C++] Reduce number of take kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8903">ARROW-8903</a> - [C++] Implement optimized “unsafe take” for use with selection vectors for kernel execution</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8904">ARROW-8904</a> - [Python] Fix usages of deprecated C++ APIs related to child/field</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8906">ARROW-8906</a> - [Rust] Support reading multiple CSV files for schema inference</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8907">ARROW-8907</a> - [Rust] implement scalar comparison operations</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8912">ARROW-8912</a> - [Ruby] Keep reference of Arrow::Buffer’s data for GC</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8913">ARROW-8913</a> - [Ruby] Use “field” instead of “child”</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8914">ARROW-8914</a> - [C++][Gandiva] Decimal128 related test failed on big-endian platforms</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8915">ARROW-8915</a> - [Dev][Archery] Require Click 7</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8917">ARROW-8917</a> - [C++][Compute] Formalize “metafunction” concept</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8918">ARROW-8918</a> - [C++] Add cast “metafunction” to FunctionRegistry that addresses dispatching to appropriate type-specific CastFunction</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8922">ARROW-8922</a> - [C++] Implement example string scalar kernel function to assist with string kernels buildout per ARROW-555</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8923">ARROW-8923</a> - [C++] Improve usability of arrow::compute::CallFunction by moving ExecContext* argument to end and adding default</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8926">ARROW-8926</a> - [C++] Improve docstrings in new public APIs in arrow/compute and fix miscellaneous typos</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8927">ARROW-8927</a> - [C++] Support dictionary memos when reading/writing record batches using cuda IPC</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8929">ARROW-8929</a> - [C++] Change compute::Arity:VarArgs min_args default to 0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8931">ARROW-8931</a> - [Rust] Support lexical sort in arrow compute kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8933">ARROW-8933</a> - [C++] Reduce generated code in vector_hash.cc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8934">ARROW-8934</a> - [C++] Add timestamp subtract kernel aliased to int64 subtract implementation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8937">ARROW-8937</a> - [C++] Add “parse_strptime” function for string to timestamp conversions using the kernels framework</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8938">ARROW-8938</a> - [R] Provide binding for arrow::compute::CallFunction</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8940">ARROW-8940</a> - [Java] Fix the performance degradation of integration tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8941">ARROW-8941</a> - [C++/Python] arrow-nightlies conda repository is full</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8942">ARROW-8942</a> - [R] Detect compression in reading CSV/JSON</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8943">ARROW-8943</a> - [C++][Dataset] Add support for Partitioning to ParquetDatasetFactory</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8950">ARROW-8950</a> - [C++] Make head optional in s3fs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8958">ARROW-8958</a> - [FlightRPC][Python] Implement Flight DoExchange for Python</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8960">ARROW-8960</a> - [MINOR] [FORMAT] Fix typos in comments</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8961">ARROW-8961</a> - [C++] Add utf8proc library to toolchain</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8963">ARROW-8963</a> - [C++][Parquet] Parquet cpp optimize allocate memory</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8965">ARROW-8965</a> - [Python][Documentation] Pyarrow documentation for pip nightlies references 404’d location</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8966">ARROW-8966</a> - [C++] Move arrow::ArrayData to a separate header file</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8969">ARROW-8969</a> - [C++] Reduce generated code in compute/kernels/scalar_compare.cc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8970">ARROW-8970</a> - [C++] Reduce shared library / binary code size (umbrella issue)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8972">ARROW-8972</a> - [Java] Support range value comparison for large varchar/varbinary vectors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8973">ARROW-8973</a> - [Java] Support batch value appending for large varchar/varbinary vectors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8974">ARROW-8974</a> - [C++] Refine TransferBitmap template parameters</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8976">ARROW-8976</a> - [C++] compute::CallFunction can’t Filter/Take with ChunkedArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8979">ARROW-8979</a> - [C++] Implement bitmap word reader and writer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8984">ARROW-8984</a> - [R] Revise install guides now that Windows conda package exists</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8985">ARROW-8985</a> - [Format] Add “byte width” field with default of 16 to Decimal Flatbuffers type for forward compatibility</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8989">ARROW-8989</a> - [C++] Document available functions in compute::FunctionRegistry</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8993">ARROW-8993</a> - [Rust] Support reading non-seekable sources in text readers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8994">ARROW-8994</a> - [C++] Disable include-what-you-use cpplint lint checks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8996">ARROW-8996</a> - [C++] Runtime SIMD path for Aggregate Sum/Mean kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8997">ARROW-8997</a> - [Archery] Benchmark formatter should have friendly units</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9004">ARROW-9004</a> - [C++][Gandiva] Support building with LLVM 10</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9005">ARROW-9005</a> - [Rust] [DataFusion] Support sort expression</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9007">ARROW-9007</a> - [Rust] Support appending arrays by merging array data</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9014">ARROW-9014</a> - [Packaging] Bump the minor part of the automatically generated version in crossbow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9015">ARROW-9015</a> - [Java] Make BaseAllocator package private</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9016">ARROW-9016</a> - [Java] Remove direct references to Netty/Unsafe Allocators</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9017">ARROW-9017</a> - [Python] Refactor the Scalar classes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9018">ARROW-9018</a> - [C++] Remove APIs that were deprecated in 0.17.x and prior</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9021">ARROW-9021</a> - [Python] The filesystem keyword in parquet.read_table is not documented</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9022">ARROW-9022</a> - [C++] Add/Sub/Mul arithmetic kernels with overflow check</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9029">ARROW-9029</a> - [C++] Implement BitBlockCounter interface for blockwise popcounts of validity bitmaps</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9030">ARROW-9030</a> - [Python] Clean up some usages of pyarrow.compat, move some common functions/symbols to lib.pyx</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9031">ARROW-9031</a> - [R] Implement conversion from Type::UINT64 to R vector</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9032">ARROW-9032</a> - [C++] Split arrow/util/bit_util.h into multiple header files</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9034">ARROW-9034</a> - [C++] Implement binary (two bitmap) version of BitBlockCounter</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9042">ARROW-9042</a> - [C++] Add Subtract and Multiply arithmetic kernels with wrap-around behavior</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9043">ARROW-9043</a> - [Go] Temporarily copy LICENSE.txt to go/</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9043">ARROW-9043</a> - [Go] Temporarily copy LICENSE.txt to go/</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9045">ARROW-9045</a> - [C++] Improve and expand Take/Filter benchmarks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9046">ARROW-9046</a> - [C++][R] Put more things in type_fwds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9047">ARROW-9047</a> - [Rust] Setting 0-bits of a 0-length bitset segfaults</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9050">ARROW-9050</a> - [Release] Use 1.0.0 as the next version</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9051">ARROW-9051</a> - [GLib] Refer Array related objects from Array</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9052">ARROW-9052</a> - [CI][MinGW] Enable Gandiva</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9055">ARROW-9055</a> - [C++] Add sum/mean kernels for Boolean type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9058">ARROW-9058</a> - [Packaging][wheel] Boost download is failed</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9060">ARROW-9060</a> - [GLib] Add support for building Apache Arrow Datasets GLib with non-installed Apache Arrow Datasets</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9061">ARROW-9061</a> - [Packaging][APT][Yum][GLib] Add Apache Arrow Datasets GLib</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9062">ARROW-9062</a> - [Rust] Support to read JSON into dictionary type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9067">ARROW-9067</a> - [C++] Create reusable branchless / vectorized index boundschecking functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9070">ARROW-9070</a> - [C++] StructScalar needs field accessor methods</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9073">ARROW-9073</a> - [C++] RapidJSON include directory detection doesn’t work with RapidJSONConfig.cmake</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9074">ARROW-9074</a> - [GLib] Add missing arrow-json check</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9075">ARROW-9075</a> - [C++] Optimize Filter implementation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9079">ARROW-9079</a> - [C++] Write benchmark for arithmetic kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9083">ARROW-9083</a> - [R] collect int64, uint32, uint64 as R integer type if not out of bounds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9086">ARROW-9086</a> - [CI][Homebrew] Enable Gandiva</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9088">ARROW-9088</a> - [Rust] Recent version of arrow crate does not compile into wasm target</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9089">ARROW-9089</a> - [Python] A PyFileSystem handler for fsspec-based filesystems</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9090">ARROW-9090</a> - [C++] Bump versions of bundled libraries</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9091">ARROW-9091</a> - [C++] Utilize function’s default options when passing no options to CallFunction for a function that requires them</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9093">ARROW-9093</a> - [FlightRPC][C++][Python] Allow setting gRPC client options</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9094">ARROW-9094</a> - [Python] Bump versions of compiled dependencies in manylinux wheels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9095">ARROW-9095</a> - [Rust] Fix NullArray to comply with spec</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9099">ARROW-9099</a> - [C++][Gandiva] Add TRIM function for string</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9100">ARROW-9100</a> - [C++] Add ascii_lower kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9101">ARROW-9101</a> - [Doc][C++][Python] Document encoding expected by CSV and JSON readers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9102">ARROW-9102</a> - [Packaging] Upload built manylinux docker images</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9106">ARROW-9106</a> - [C++] Add C++ foundation to ease file transcoding</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9108">ARROW-9108</a> - [C++][Dataset] Add Parquet Statistics conversion for timestamp columns</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9109">ARROW-9109</a> - [Python][Packaging] Enable S3 support in manylinux wheels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9110">ARROW-9110</a> - [C++] Fix CPU cache size detection on macOS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9112">ARROW-9112</a> - [R] Update autobrew script location</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9115">ARROW-9115</a> - [C++] Process data buffers in batch in ascii_lower / ascii_upper kernels rather than using string_view value iteration</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9116">ARROW-9116</a> - [C++] Add BinaryArray::total_values_length()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9116">ARROW-9116</a> - [C++] Add BinaryArray::total_values_length()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9118">ARROW-9118</a> - [C++] Add more general BoundsCheck function that also checks for arbitrary lower limits in integer arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9119">ARROW-9119</a> - [C++] Add support for building with system static gRPC</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9123">ARROW-9123</a> - [Python][wheel] Use libzstd.a explicitly</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9124">ARROW-9124</a> - [Rust][Datafusion] DFParser should consume sql query as &amp;str instead of String</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9125">ARROW-9125</a> - [C++] Add missing include for arrow::internal::ZeroMemory() for Valgrind</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9129">ARROW-9129</a> - [Python][JPype] Test is failed with JPype 0.7.5</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9130">ARROW-9130</a> - [Python] Add deprecated wrappers functions to a pyarrow/compat.py module for 1.0.0 that will be removed later</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9131">ARROW-9131</a> - [C++] Faster ascii_lower and ascii_upper</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9132">ARROW-9132</a> - [C++] Implement hash kernels for dictionary data with constant dictionaries</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9133">ARROW-9133</a> - [C++] Add utf8_upper and utf8_lower</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9137">ARROW-9137</a> - [GLib][Ruby] Allow to read Parquet files in chunks (by RowGroup)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9138">ARROW-9138</a> - [Docs][Format] Make sure format version is hard coded in the docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9139">ARROW-9139</a> - [Python] parquet read_table should not use_legacy_dataset</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9144">ARROW-9144</a> - [CI] OSS-Fuzz build fails because recent changes in the google repository</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9145">ARROW-9145</a> - [C++] Add true_count / false_count methods to BooleanArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9152">ARROW-9152</a> - [C++] Create specialized filter implementation for varbinary types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9153">ARROW-9153</a> - [Python] Add bindings for StructScalar</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9154">ARROW-9154</a> - [Developer] Use GitHub issue templates better</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9155">ARROW-9155</a> - [Archery] Less precise but faster default settings for “archery benchmark diff”</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9156">ARROW-9156</a> - [C++] Reducing the code size of the tensor module</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9157">ARROW-9157</a> - [Rust][Datafusion] execution context’s create_physical_plan should take self as immutable reference</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9158">ARROW-9158</a> - [Rust][Datafusion] Projection physical plan compilation should preserve nullability</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9159">ARROW-9159</a> - [Python] Expose the isnull/isvalid kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9162">ARROW-9162</a> - [Python] Expose Add/Subtract/Multiply arithmetic kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9163">ARROW-9163</a> - [C++] Add methods to StringArray, LargeStringArray, to validate whether its values are all UTF-8</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9166">ARROW-9166</a> - [Website] Add overview page</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9167">ARROW-9167</a> - [Doc][Website] /docs/c_glib/index.html is overwritten</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9168">ARROW-9168</a> - [C++][Flight] allow flight benchmark to use separated TCP connections</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9173">ARROW-9173</a> - [C++] Document how to use Arrow from a third-party CMake project</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9175">ARROW-9175</a> - [FlightRPC][C++][Python] Expose connected peer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9176">ARROW-9176</a> - [Rust] Fix for memory leaks in Arrow allocator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9178">ARROW-9178</a> - [R] Improve documentation about CSV reader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9179">ARROW-9179</a> - [R] Replace usage of iris dataset in tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9180">ARROW-9180</a> - [Developer] Remove usage of whitelist, blacklist, slave, etc.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9181">ARROW-9181</a> - [C++] Instantiate fewer templates in Cast kernel implementation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9182">ARROW-9182</a> - [C++] Use “applicator” namespace for kernel operator-to-kernel functors, streamline argument unboxing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9185">ARROW-9185</a> - [C++] [Java][Gandiva] Make llvm build optimisation configurable from java</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9188">ARROW-9188</a> - [C++] Do not always statically link Brotli libraries</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9189">ARROW-9189</a> - [Website] Improve contributor guide</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9190">ARROW-9190</a> - [Website][C++] Add blog post on efforts to make building lighter and easier</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9191">ARROW-9191</a> - [Rust] Do not panic when int96 milliseconds are negative</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9192">ARROW-9192</a> - [CI][Rust] Add support for running clippy</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9193">ARROW-9193</a> - [C++] Add method to parse date from null-terminated string</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9197">ARROW-9197</a> - [C++] Revamp numeric casts: faster performance and reduced binary size</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9201">ARROW-9201</a> - [Archery] Render-human readable table when using “archery benchmark diff”</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9202">ARROW-9202</a> - [GLib] Add GArrowDatum</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9203">ARROW-9203</a> - [Packaging][deb] Add missing gir1.2-arrow-dataset-1.0.install</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9204">ARROW-9204</a> - [C++][Flight] change records_per_stream to int64 in flight benchmark</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9205">ARROW-9205</a> - [Documentation] Fix typos in Columnar.rst</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9206">ARROW-9206</a> - [C++][Flight] measure latency in flight benchmark</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9207">ARROW-9207</a> - [Python][Dataset] Clean-up internal FileSource class</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9210">ARROW-9210</a> - [C++] Use OptionalBitBlockCounter in ArrayDataInlineVisitor</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9214">ARROW-9214</a> - [C++] Avoid util::optional in favor of separate inlineable functions in arrow/visitor_inline.h</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9216">ARROW-9216</a> - [C++][Parquet] Use BitBlockCounter for plain spaced encoding/decoding</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9217">ARROW-9217</a> - [C++][Parquet] Cover 0.01% null for the plain spaced encoding/decoding benchmark</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9220">ARROW-9220</a> - [C++] Disable relevant compute kernels if ARROW_WITH_UTF8PROC=OFF</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9222">ARROW-9222</a> - [Format][Proposal] Remove validity bitmap from Union types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9224">ARROW-9224</a> - [Dev][Archery] Copy local repo on clone failure</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9225">ARROW-9225</a> - [C++][Compute] Improve counting sort</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9231">ARROW-9231</a> - [Format] Increment MetadataVersion from V4 to V5</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9234">ARROW-9234</a> - [GLib][CUDA] Add support for dictionary memo on reading record batch from buffer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9241">ARROW-9241</a> - [C++] Add forward compatibility checks for Decimal::bitWidth</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9242">ARROW-9242</a> - [Java] Add forward compatibility checks for Decimal::bitWidth</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9247">ARROW-9247</a> - [Python] Expose BinaryArray::total_values_length in bindings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9248">ARROW-9248</a> - [C++] Add “list_size” function that returns Int32Array/Int64Array giving list cell sizes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9249">ARROW-9249</a> - [C++] Implement “list_parent_indices” vector function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9250">ARROW-9250</a> - [C++] Compact generated code in compute/kernels/scalar_set_lookup.cc using same method as vector_hash.cc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9251">ARROW-9251</a> - [C++] Move JSON testing code for integration tests to libarrow_testing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9254">ARROW-9254</a> - [C++] Factor out some integer casting internals so it can be reused with temporal casts</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9255">ARROW-9255</a> - [C++] Use CMake to build bundled Protobuf with CMake &gt;= 3.7</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9256">ARROW-9256</a> - [C++] Incorrect variable name ARROW_CXX_FLAGS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9258">ARROW-9258</a> - [Format] Add V5 MetadataVersion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9259">ARROW-9259</a> - [Format] Permit unsigned dictionary indices in Columnar.rst</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9262">ARROW-9262</a> - [Packaging][Linux][CI] Use Ubuntu 18.04 to build ARM64 packages on Travis CI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9263">ARROW-9263</a> - [C++] Benchmark: promote RegressionSetArgs size to L2</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9264">ARROW-9264</a> - [C++] Cleanup Parquet Arrow Schema code</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9265">ARROW-9265</a> - [C++] Add support for writing MetadataVersion::V4-compatible IPC messages for compatibility with library versions &lt;= 0.17.1</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9268">ARROW-9268</a> - [C++] Add is{alnum,alpha,…} kernels for strings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9272">ARROW-9272</a> - [C++][Python] Reduce complexity in python to arrow conversion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9276">ARROW-9276</a> - [Dev] Enable ARROW_CUDA when generating API documentations</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9277">ARROW-9277</a> - [C++] Fix documentation of Reading CSV files</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9278">ARROW-9278</a> - [C++] Implement Union validity bitmap changes from ARROW-9222</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9280">ARROW-9280</a> - [Rust] Write statistics to Parquet files</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9281">ARROW-9281</a> - [R] Turn off utf8proc in R builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9283">ARROW-9283</a> - [Python] Expose C++ build info</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9287">ARROW-9287</a> - [C++] Implement support for unsigned dictionary indices</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9289">ARROW-9289</a> - [R] Remove deprecated functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9290">ARROW-9290</a> - [Rust] [Parquet] Add features to allow opting out of dependencies</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9291">ARROW-9291</a> - [R] Support fixed size binary/list types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9292">ARROW-9292</a> - [Rust] Update feature matrix with passing tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9294">ARROW-9294</a> - [GLib] Add GArrowFunction</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9300">ARROW-9300</a> - [Java] Separate Netty Memory to its own module</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9306">ARROW-9306</a> - [Ruby] Add support for Arrow::RecordBatch.new(raw_table)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9307">ARROW-9307</a> - [Ruby] Add Arrow::RecordBatchIterator#to_a</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9308">ARROW-9308</a> - [Format] Add Feature enum to schema.fbs for forward compatibity</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9316">ARROW-9316</a> - [C++] Use “Dataset” instead of “Datasets”</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9321">ARROW-9321</a> - [C++][Dataset] Allow to “collect” statistics for ParquetFragment row groups if not constructed from _metadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9322">ARROW-9322</a> - [R] Dataset documentation polishing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9323">ARROW-9323</a> - [Ruby] Add Red Arrow Dataset</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9327">ARROW-9327</a> - Fix all clippy errors for arrow crate</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9328">ARROW-9328</a> - [C++][Gandiva] Add LTRIM, RTRIM, BTRIM functions for string</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9329">ARROW-9329</a> - [C++][Gandiva] Implement castTimestampToDate function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9331">ARROW-9331</a> - [C++] Improve the performance of Tensor-to-SparseTensor conversion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9333">ARROW-9333</a> - [Python] Expose more IPC write options in Python</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9335">ARROW-9335</a> - [Website] Update website for 1.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9337">ARROW-9337</a> - [R] On C++ library build failure, give an unambiguous message</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9339">ARROW-9339</a> - [Rust] Comments on SIMD in Arrow README are incorrect</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9340">ARROW-9340</a> - [R] Use CRAN version of decor package</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9341">ARROW-9341</a> - [GLib] Use arrow::Datum version Take()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9345">ARROW-9345</a> - [C++][Dataset] Expression with dictionary type should work with operand of value type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9346">ARROW-9346</a> - [C++][Python][Dataset] Add total_byte_size metadata to RowGroupInfo</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9362">ARROW-9362</a> - [Java] Add support for writing MetadataVersion::V4-compatible IPC messages for compatibility with library versions &lt;= 0.17.1</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9365">ARROW-9365</a> - [Go] Implement the rest of the typed array builders in NewBuilder</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9370">ARROW-9370</a> - [Java] Bump Netty version</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9374">ARROW-9374</a> - [C++][Python] Expose MakeArrayFromScalar</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9379">ARROW-9379</a> - [Rust] Support unsigned dictionary indices</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9383">ARROW-9383</a> - [Python] Support fsspec filesystems in Dataset API through fs handler</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9386">ARROW-9386</a> - [Rust] RecordBatch.schema() should not return &amp;Arc&lt;Schema&gt;</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9390">ARROW-9390</a> - [C++] Review compute function names</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9390">ARROW-9390</a> - [C++] Review compute function names</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9390">ARROW-9390</a> - [C++] Review compute function names</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9391">ARROW-9391</a> - [Rust] Float32 values interpreted as zero when record batch has one row</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9393">ARROW-9393</a> - [Doc] update supported types documentation for Java</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9395">ARROW-9395</a> - [Python] Provide configurable MetadataVersion in IPC API and environment variable to set default to V4 when needed</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9399">ARROW-9399</a> - [C++] Add forward compatibility checks for unrecognized future MetadataVersion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9403">ARROW-9403</a> - [Python] add .tolist as alias of .to_pylist</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9407">ARROW-9407</a> - [Python] Accept pd.NA as missing value in array constructor</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9411">ARROW-9411</a> - [Rust] Update dependencies</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9424">ARROW-9424</a> - [C++][Parquet] Disable writing files with LZ4 codec</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9425">ARROW-9425</a> - [Rust][DataFusion] Make ExecutionContext sharable between threads</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9427">ARROW-9427</a> - [Rust][DataFusion] Add pub fn ExecutionContext.tables()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9437">ARROW-9437</a> - [Python][Packaging] Homebrew fails to install build dependencies in the macOS wheel builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9442">ARROW-9442</a> - [Python] Do not force Validate() to be called in pyarrow_wrap_table</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9445">ARROW-9445</a> - [Python] Revert Array.equals changes + expose comparison ops in compute</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9446">ARROW-9446</a> - [C++] Export compiler information in BuildInfo</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9447">ARROW-9447</a> - [Rust][DataFusion] Allow closures as ScalarUDFs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9452">ARROW-9452</a> - [Rust] [DateFusion] Improve performance of parquet scan</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9470">ARROW-9470</a> - [CI][Java] Run Maven in parallel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9472">ARROW-9472</a> - [R] Provide configurable MetadataVersion in IPC API and environment variable to set default to V4 when needed</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9473">ARROW-9473</a> - [Doc] Polishing for 1.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9478">ARROW-9478</a> - [C++] Improve error message on unsupported cast types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9484">ARROW-9484</a> - [Docs] Update is* functions to be is_* in the compute docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9485">ARROW-9485</a> - [R] Better shared library stripping</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9493">ARROW-9493</a> - [Python][Dataset] Dictionary encode string partition columns by default</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9508">ARROW-9508</a> - [Release][APT][Yum] Enable verification for arm64 binaries</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9509">ARROW-9509</a> - [Release] Don’t test Gandiva in the windows wheel verification script</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9511">ARROW-9511</a> - [Packaging][Release] Set conda packages’ build number to 0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9519">ARROW-9519</a> - [Rust] Improve error message when getting a field by name from schema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9529">ARROW-9529</a> - [Dev][Release] Improvements to release verification scripts</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9531">ARROW-9531</a> - [Packaging][Release] Update conda forge dependency pins</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9534">ARROW-9534</a> - [Rust] [DataFusion] Implement functions for creating literal expressions for all types</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1820">PARQUET-1820</a> - [C++] Use a column filter hint to inform read prefetching in Arrow reads</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1843">PARQUET-1843</a> - [C++] Unnecessary assignment in DictDecoderImpl::Decode</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1855">PARQUET-1855</a> - [C++] Improve documentation on MetaData ownership</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1861">PARQUET-1861</a> - [Documentation][C++] Explain ReaderProperters.buffer_stream*</li>
</ul>
</main>
<hr/>
<footer class="footer">
<div class="row">
<div class="col-md-9">
<p>Apache Arrow, Arrow, Apache, the Apache feather logo, and the Apache Arrow project logo are either registered trademarks or trademarks of The Apache Software Foundation in the United States and other countries.</p>
<p>&copy; 2016-2024 The Apache Software Foundation</p>
</div>
<div class="col-md-3">
<a class="d-sm-none d-md-inline pr-2" href="https://www.apache.org/events/current-event.html">
<img src="https://www.apache.org/events/current-event-234x60.png"/>
</a>
</div>
</div>
</footer>
</div>
</body>
</html>