blob: 439f167540efe31a60eb6b9d5aaf6c111b15ab51 [file] [log] [blame]
<!DOCTYPE html>
<html lang="en-US">
<head>
<meta charset="UTF-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1">
<!-- The above meta tags *must* come first in the head; any other head content must come *after* these tags -->
<title>Apache Arrow 3.0.0 Release | Apache Arrow</title>
<!-- Begin Jekyll SEO tag v2.8.0 -->
<meta name="generator" content="Jekyll v4.3.3" />
<meta property="og:title" content="Apache Arrow 3.0.0 Release" />
<meta property="og:locale" content="en_US" />
<meta name="description" content="Apache Arrow 3.0.0 (26 January 2021) This is a major release covering more than 3 months of development. Download Source Artifacts Binary Artifacts For CentOS For Debian For Python For Ubuntu Git tag Contributors This release includes 648 commits from 106 distinct contributors. $ git shortlog -sn apache-arrow-2.0.0..apache-arrow-3.0.0 71 Jorge C. Leitao 64 Sutou Kouhei 48 Antoine Pitrou 48 Heres, Daniel 27 Andy Grove 27 Neville Dipale 24 Joris Van den Bossche 19 Neal Richardson 15 Benjamin Kietzman 14 Carol (Nichols || Goulding) 14 Uwe L. Korn 12 Jonathan Keane 12 Krisztián Szűcs 11 Andrew Lamb 11 Mike Seddon 11 mqy 10 Kazuaki Ishizaki 10 Qingping Hou 8 Jörn Horstmann 8 Yibo Cai 8 alamb 7 Andrew Wieteska 7 Ian Cook 7 Mahmut Bulut 6 Dmitry Patsura 6 Projjal Chanda 6 liyafan82 6 rdettai 5 Daniel Russo 5 David Li 5 Florian Müller 5 Jacob Quinn 5 James Duong 4 Micah Kornfield 4 Weston Pace 3 Jimmy Lu 3 Maarten A. Breddels 3 Terence D. Honles 3 Will Jones 2 Bei 2 Benjamin Wilhelm 2 Christian Lundgren 2 Daniël Heres 2 Evan Chan 2 Fokko Driesprong 2 François Garillot 2 Greg Bowyer 2 Kirill Lykov 2 Kyle Porter 2 Lyndon Bauto 2 Markus Westerlind 2 Sagnik Chakraborty 2 Sean Moran 2 Wes McKinney 2 Yordan Pavlov 2 jensenrichardson 2 wayne 2 Östman Alexander 1 Bruno LE HYARIC 1 Chiyang Wan 1 Christoph Schulze 1 Diana Clarke 1 Eric Erhardt 1 Erik Fonselius 1 Frank Du 1 Frank Smith 1 Gidon Gershinsky 1 H-Plus-Time 1 Hongze Zhang 1 Jake Goulding 1 Jeremy 1 Johan Peltenburg 1 Jorge Leitao 1 Josiah 1 Kopilov Aleksandr 1 Marc Prud&#39;hommeaux 1 Marius van Niekerk 1 Matthew Pollock 1 Matthew Topol 1 Michal 1 Mingyu Zhong 1 Nick Bruno 1 Paddy Horan 1 Pavel Tiunov 1 Prudhvi Porandla 1 Radu Teodorescu 1 Romain Francois 1 Ruan Pearce-Authers 1 Ruihang Xia 1 Sebastian Berg 1 Tao He 1 Tyrel Rink 1 Ulimo 1 Vivian Kong 1 Weiyang Zhao 1 Zhuo Peng 1 arw2019 1 david 1 jonathan.albrecht 1 naman1996 1 rjzamora 1 shawnding(丁晓坤) 1 tching 1 tianchen 1 tifflhl 1 xudingyu Patch Committers The following Apache committers merged contributed patches to the repository. $ git shortlog -csn apache-arrow-2.0.0..apache-arrow-3.0.0 100 Andrew Lamb 88 Antoine Pitrou 76 Sutou Kouhei 70 Jorge C. Leitao 69 Neville Dipale 54 Krisztián Szűcs 44 Neal Richardson 40 Andy Grove 26 Benjamin Kietzman 19 Joris Van den Bossche 12 David Li 8 liyafan82 7 Uwe L. Korn 7 alamb 6 Micah Kornfield 5 Praveen 4 Bryan Cutler 4 Eric Erhardt 2 GitHub 2 Paddy Horan 2 Pindikura Ravindra 2 Wes McKinney 1 Sebastien Binet Changelog Apache Arrow 3.0.0 (2021-01-26) Bug Fixes ARROW-2616 - [Python] Cross-compiling Pyarrow ARROW-6582 - [R] Arrow to R fails with embedded nuls in strings ARROW-7363 - [Python] Add combine_chunks method to ChunkedArray ARROW-7909 - [Website] Add how to install on Red Hat Enterprise Linux ARROW-8258 - [Rust] [Parquet] ArrowReader fails on some timestamp types ARROW-9027 - [Python] Split in multiple files + clean-up pyarrow.parquet tests ARROW-9479 - [JS] Table.from fails for zero-item Lists, FixedSizeLists, Maps. ditto Table.empty ARROW-9636 - [Python] Update documentation about ‘LZO’ compression in parquet.write_table ARROW-9776 - [R] read_feather causes segfault in R if file doesn’t exist ARROW-9897 - [C++][Gandiva] Add to_date() function from pattern ARROW-9897 - [C++][Gandiva] Add to_date() function from pattern ARROW-9898 - [C++][Gandiva] Error handling in castINT fails in some enviroments ARROW-9903 - [R] open_dataset freezes opening feather files on Windows ARROW-9963 - [Python] Recognize datetime.timezone.utc as UTC on conversion python-&gt;pyarrow ARROW-10039 - [Rust] Do not require memory alignment of buffers ARROW-10042 - [Rust] Buffer equalities may be incorrect ARROW-10080 - [R] Arrow does not release unused memory ARROW-10122 - [Python] Selecting one column of multi-index results in a duplicated value column. ARROW-10145 - [C++][Dataset] Assert integer overflow in partitioning falls back to string ARROW-10146 - [Python] Parquet metadata to_dict raises attribute error ARROW-10174 - [Java] Reading of Dictionary encoded struct vector fails ARROW-10177 - [CI][Gandiva] Nightly gandiva-jar-xenial fails ARROW-10186 - [Rust] Tests fail when following instructions in README ARROW-10247 - [C++][Dataset] Cannot write dataset with dictionary column as partition field ARROW-10264 - [C++][Python] Parquet test failing with HadoopFileSystem URI ARROW-10270 - [R] Fix CSV timestamp_parsers test on R-devel ARROW-10283 - [Python] Python deprecation warning for “PY_SSIZE_T_CLEAN will be required for ‘#’ formats” ARROW-10293 - [Rust] [DataFusion] Fix benchmarks ARROW-10294 - [Java] Resolve problems of DecimalVector APIs on ArrowBufs ARROW-10321 - [C++] Building AVX512 code when we should not ARROW-10333 - [Java] Remove split packages in arrow-memory-core and arrow-vectors ARROW-10345 - [C++] NaN breaks sorting ARROW-10346 - [Python] Default S3 region is eu-central-1 even with LANG=C ARROW-10348 - [C++] Fix crash on invalid Parquet file (OSS-Fuzz) ARROW-10350 - [Rust] parquet_derive crate cannot be published to crates.io ARROW-10353 - [C++] Parquet decompresses DataPageV2 pages even if is_compressed==0 ARROW-10358 - [R] Followups to 2.0.0 release ARROW-10365 - [R] Remove duplicate setting of S3 flag on macOS ARROW-10369 - [Dev] Fix archery release utility test cases ARROW-10370 - [Python] Spurious s3fs-related test failures ARROW-10371 - [R] Linux system requirements check needs to support older cmake versions ARROW-10386 - [R] List column class attributes not preserved in roundtrip ARROW-10388 - [Java] Fix Spark integration build failure ARROW-10390 - [Rust] [Parquet] Regression Can not implement custom ParquetWriter because `TryClone` is not publically exported ARROW-10393 - [Rust]: Fix null value reading in jsonreader for both dictionary and stringbuilders ARROW-10394 - [Rust] [Large]BinaryArray can be created from non-binary datatypes ARROW-10397 - [C++] Outdated and confusing comment on dictionary indices ARROW-10399 - [R] Fix performance regression from cpp11::r_string ARROW-10411 - [C++] Fix incorrect child array lengths for Concatenate of FixedSizeList ARROW-10412 - [C++] CMake Build Fails with grpc 1.33.1, “GRPC_CPP_PLUGIN-NOTFOUND: program not found or is not executable” ARROW-10413 - [Rust] [Parquet] Unignore some roundtrip tests that are passing now ARROW-10414 - [R] open_dataset doesn’t work with absolute/expanded paths on Windows ARROW-10426 - [C++] Arrow type large_string cannot be written to Parquet type column descriptor ARROW-10433 - [Python] pyarrow doesn’t work with s3fs&gt;=0.5 ARROW-10434 - [Rust] Debug formatting arrays with lengths greater than 10 and less than 20 produces incorrect values ARROW-10441 - [FlightRPC][Java] FlightClients from FlightGrpcUtils#createFlightClient shutdown gRPC channel when closed ARROW-10446 - [C++][Python] Timezone aware pd.Timestamp’s are incorrectly converted to Timestamp arrys ARROW-10448 - [Rust] PrimitiveArray::new can create arrays not in spec ARROW-10453 - [Rust] [DataFusion] Performance degredation after removing specialization ARROW-10457 - [CI] Fix Spark branch-3.0 integration tests ARROW-10461 - [Rust] Offset related bug in BitChunks::remainder_bits ARROW-10462 - [Python] ParquetDatasetPiece’s path broken when using fsspec fs on Windows ARROW-10463 - [R] Better messaging for currently unsupported CSV options in open_dataset ARROW-10470 - [R] Fix missing file error causing NYC taxi example to fail ARROW-10471 - [CI][Python] Ensure we have tests with s3fs and run those on CI ARROW-10472 - [C++][Python] casting a scalar timestamp to date32 results in Aborted (core dump) ARROW-10475 - [С++][FlightRPC] Arrow Flight Server / Client cannot be initialized with Ipv6 host ARROW-10480 - [Python] Parquet write_table creates gzipped Parquet file, not Parquet with gzip compression ARROW-10482 - [Python] Specifying compression type on a column basis when writing Parquet not working ARROW-10489 - [C++] Unable to configure or make with intel compiler ARROW-10489 - [C++] Unable to configure or make with intel compiler ARROW-10491 - [FlightRPC][Java] Fix NPE when using FlightProducer without interceptors ARROW-10493 - [C++][Parquet] Writing nullable nested strings results in wrong data in file ARROW-10495 - [C++] find_package(Arrow) is broken on Ubuntu 18 ARROW-10496 - [R][CI] Fix conda-r job ARROW-10499 - [C++][Java] Fix ORC Java JNI Crash ARROW-10502 - [C++/Python] CUDA detection messes up nightly conda-win builds ARROW-10503 - [C++] Uriparser will not compile using Intel compiler ARROW-10508 - [Java] Allow FixedSizeListVector to have empty children ARROW-10509 - [C++] Define operator«(ostream, ParquetException) for clang+Windows ARROW-10511 - [Python] Table.to_pandas() failing when timezone-awareness mismatch in metadata ARROW-10518 - Fix cast function issues in gandiva ARROW-10519 - [Python] Deadlock when PyArrow imports Pandas from multiple threads ARROW-10525 - [C++] Fix crash on unsupported IPC stream (OSS-Fuzz) ARROW-10532 - [Python] Mangled pandas_metadata when specified schema has different order as DataFrame columns ARROW-10545 - [C++] Fix crash on invalid Parquet file (OSS-Fuzz) ARROW-10546 - [Python] Deprecate the S3FSWrapper class ARROW-10547 - [Rust][DataFusion] Filter pushdown loses filters if below a user defined node ARROW-10551 - [Rust]: Fix unreproducible benchmarks ARROW-10558 - [Python] Filesystem S3 tests not independent (native s3 influences s3fs) ARROW-10560 - [Python] Crash when creating array with string over 2GB ARROW-10563 - [Packaging][C++] CMake find_package(Arrow 2.0 CONFIG REQUIRED) broken ARROW-10565 - [Python] Table.from_batches and Table.from_pandas have argument Schema_schema in documentation instead of schema ARROW-10568 - [C++][Parquet] Parquet writer crashes process when Tell() does not succeed ARROW-10569 - [C++][Python] Poor Table filtering performance ARROW-10577 - [Rust][DataFusion] Hash Aggregator stream finishes unexpectedly after going to Pending state ARROW-10578 - [C++] Comparison kernels crashing for string array with null string scalar ARROW-10610 - [C++] arrow-utility-test and arrow-csv-test causes failures on a big-endian platform ARROW-10616 - [Developer] Expand PR labeler to all supported languages ARROW-10617 - [Python] RecordBatchStreamReader’s iterator doesn’t work with python 3.8 ARROW-10619 - [C++] Fix crash on unsupported IPC stream (OSS-Fuzz) ARROW-10620 - [Rust][Parquet] move column chunk range logic to metadata.rs ARROW-10621 - [Java] flight-cpre test causes a failure on s390x ARROW-10622 - [R] Nameof&lt;&gt;() is incorrect in r-arrow build environment ARROW-10623 - [R] Version 1.0.1 breaks data.frame attributes when reading file written by 2.0.0 ARROW-10624 - [R] Proactively remove “problems” attributes ARROW-10627 - [Rust] Github master does not compile for WASM target ARROW-10629 - [CI] MinGW builds broken on Github Actions ARROW-10631 - [Rust] Equality of fixed-sized binary is incorrect. ARROW-10642 - [R] Can’t get Table from RecordBatchReader with 0 batches ARROW-10656 - [Rust] New RecordBatch requires exact match of Data Types ARROW-10656 - [Rust] New RecordBatch requires exact match of Data Types ARROW-10661 - [C#] Fix benchmarking project ARROW-10662 - [Java] Avoid integer overflow for Json file reader ARROW-10663 - [C++/Doc] The IsIn kernel ignores the skip_nulls option of SetLookupOptions ARROW-10667 - [Rust] [Parquet] Add a convenience type for writing Parquet to memory ARROW-10668 - [R] Filtering does not work with .data pronoun ARROW-10681 - [Rust] [DataFusion] TPC-H Query 12 fails with scheduler error ARROW-10684 - [Rust] Logical equality should consider parent array nullability ARROW-10690 - [Java] ComplexCopier gives incorrect result for list vector if target vector is non-empty ARROW-10692 - [Rust] Segfault while array buffer append ARROW-10699 - [C++] BitmapUInt64Reader doesn’t work on big-endian ARROW-10701 - [Rust] [Datafusion] Benchmark sort_limit_query_sql fails because order by clause specifies column index instead of expression ARROW-10705 - [Rust] Lifetime annotations in the IPC writer are too strict, preventing code reuse ARROW-10710 - [Rust] Example flight server is broken after tokio upgrade (among other things) ARROW-10711 - [CI] Remove set-env from auto-tune to work with new GHA settings ARROW-10719 - [C#] ArrowStreamWriter doesn’t write schema metadata ARROW-10746 - [C++] Use GTEST_SKIP in parquet encoding tests ARROW-10748 - [Java] TimeStampMilliVector cannot be cast to TimeStampMilliTZVector ARROW-10749 - [C++] Incorrect string format for Datum with the collection type ARROW-10751 - [C++] Add RE2 to minimal build example ARROW-10753 - [Rust] [DataFusion] Negative numbers in SQL WHERE clause not parsed correctly ARROW-10757 - [Rust] [CI] Sporadic failures due to disk filling up ARROW-10760 - [Rust] [DataFusion] Predicate push down does not support joins correctly ARROW-10769 - [CI] Integration tests are failing in master ARROW-10774 - [R] Set minimum cpp11 version ARROW-10777 - [Packaging][Python] PyPI pyarrow source dist (sdist) contains architecture dependent binaries ARROW-10778 - [Python] RowGroupInfo.statistics errors for empty row group ARROW-10779 - [Java] writeNull method in UnionListWriter doesn’t work correctly if validity at that index is already set ARROW-10780 - [R] Update known R installation issues for CentOS 7 ARROW-10791 - [Rust] StreamReader, read_dictionary duplicating schema info ARROW-10801 - [Rust] [Flight] Support sending FlightData for Dictionaries with that of a RecordBatch ARROW-10803 - [R] Support R &gt;= 3.3 and add CI ARROW-10804 - [Rust] Remove UB on parquet crate ARROW-10807 - [Rust][DataFusion] Avoid double hashing ARROW-10810 - [Rust] Speed up comparison kernels ARROW-10811 - [R][CI] Remove nightly centos6 build ARROW-10823 - MutableArrayData with use_null false yields wrong results ARROW-10830 - [Rust] json reader should not hard crash on invalid json ARROW-10833 - [Python] Avoid usage of NumPy’s PyArray_DescrCheck macro ARROW-10834 - [R] Fix print method for SubTreeFileSystem ARROW-10837 - [Rust] Use `Vec&lt;u8&gt;` for hash key instead ARROW-10840 - [C++] Parquet FileMetaData does not have key_value_metadata when built from FileMetaDataBuilder ARROW-10842 - [Rust] decouple IO from json schema inference code ARROW-10844 - [Rust] [DataFusion] join of two DataFrames is not possible ARROW-10850 - [R] Unrecognized compression type: LZ4 ARROW-10852 - [C++] AssertTablesEqual(verbose=true) segfaults if the left array has more rows ARROW-10854 - [Rust] [DataFusion] Simplified logical scans ARROW-10855 - [Python][Numpy] ArrowTypeError after upgrading NumPy to 1.20.0rc1 ARROW-10856 - [R] CentOS 7 not correctly identifying compiler version ARROW-10859 - [Rust] [DataFusion] Make collect not require ExecutionContext ARROW-10860 - [Java] Avoid integer overflow for generated classes in Vector ARROW-10863 - [Python] ExtensionArray.to_pandas not working ARROW-10863 - [Python] ExtensionArray.to_pandas not working ARROW-10875 - simplify simd cfg check ARROW-10876 - [Rust] json reader should validate value type ARROW-10897 - [Rust] Replace Arc&lt;String&gt; by String in DataType::Timestamp ARROW-10907 - [Rust] Cast UTF8 to Date64 Incorrect ARROW-10913 - [Python][Doc] Code block typo in filesystems docs ARROW-10914 - [Rust]: SIMD implementation of arithmetic kernels reads out of bounds ARROW-10915 - [Rust] Make ARROW_TEST_DATA and PARQUET_TEST_DATA absolute dirs ARROW-10921 - `TypeError: ‘coroutine’ object is not iterable` when reading parquet partitions via s3fs &gt;= 0.5 with pyarrow ARROW-10930 - [Python] LargeListType doesn’t have a value_field ARROW-10932 - [C++] BinaryMemoTable::CopyOffsets access out-of-bound address when data is empty ARROW-10932 - [C++] BinaryMemoTable::CopyOffsets access out-of-bound address when data is empty ARROW-10942 - [C++] S3FileSystem::Impl::IsEmptyDirectory fails on Amazon S3 ARROW-10943 - [Rust] Intermittent build failure in parquet encoding ARROW-10954 - [C++][Doc] PlasmaClient is threadSafe now, doc not update ARROW-10955 - [C++] Reading empty json lists results in invalid non-nullable null type ARROW-10960 - [C++][FlightRPC] Missing protobuf data_body should result in default value of empty bytes, not null ARROW-10962 - [Java][FlightRPC] FlightData deserializer should accept missing fields ARROW-10967 - [Rust] Make env vars ARROW_TEST_DATA and PARQUET_TEST_DATA optional ARROW-10990 - [Rust]: SIMD implementation of compare kernels reads out of bounds ARROW-10994 - [Rust] Fix bugs in TPC-H file conversion ARROW-10996 - [Rust] Return error messages via Result for get_arrow_schema_from_metadata ARROW-10999 - [Rust] TPC-H parquet files cannot be read by Apache Spark ARROW-11014 - [Rust] [DataFusion] ParquetExec reports incorrect statistics ARROW-11023 - [C++][CMake] gRPC doesn’t respect CMAKE_CXX_COMPILER ARROW-11024 - [C++][Parquet] Writing List&lt;Struct&gt; to parquet sometimes writes wrong data ARROW-11025 - [Rust] Bench for boolean kernels measure array creation ARROW-11030 - [Rust] [DataFusion] HashJoinExec slow with many batches ARROW-11048 - [Rust] Add bench to MutableBuffer ARROW-11050 - [R] Handle RecordBatch in write_parquet ARROW-11067 - [C++] CSV reader returns nulls for some strings on macOS ARROW-11069 - [C++] Parquet writer incorrect data being written when data type is struct ARROW-11073 - [Rust] Lint Error on CI Tests in /arrow/rust/arrow/src/ipc/reader.rs ARROW-11083 - [CI] Build “Source Release and Merge Script” is broken ARROW-11084 - [Rust] Clippy failing in master ARROW-11085 - [Rust] Rust CI no longer works b/c it uses action-rs: Migrate CI away from action-rs/* ARROW-11092 - [CI] (Temporarily) move offending workflows to separate files ARROW-11102 - [Rust][DataFusion] fmt::Debug for ScalarValue(Utf8) is always quoted ARROW-11113 - [Rust] support as_struct_array cast ARROW-11114 - [Java] Metadata serialization is broken for Field class ARROW-11132 - [CI] Use pip to install crossbow’s dependencies for the comment bot ARROW-11144 - [C++][Python][CI] Fix HDFS nightly build ARROW-11152 - [CI][C++] Fix Homebrew numpy installation on macOS builds ARROW-11162 - [C++] Fix crash on Decimal256 Parquet file (OSS-Fuzz) ARROW-11163 - [C++][Python] Compressed Feather file written with pyarrow 0.17 not readable in pyarrow 2.0.0+ ARROW-11166 - [Python][Compute] Add bindings for ProjectOptions ARROW-11171 - [Go] Build fails on s390x with noasm tag ARROW-11189 - [Developer] Achery benchmark diff cannot compare two jsons ARROW-11190 - [C++][Dataset] Clean up compiler warnings ARROW-11202 - [R][CI] Nightly builds not happening (or artifacts not exported) ARROW-11224 - [R] don’t test metadata serialization on old R versions ARROW-11226 - [Python][CI] Filesystem tests failing with s3fs 0.5.2 ARROW-11227 - [Python][CI] AMD64 Conda Python 3.7 Pandas 0.24 cron job failing in to_pandas extension dtype test ARROW-11229 - [C++][Dataset] Static build is failed ARROW-11230 - [R] Fix build failures on Windows when multiple libarrow binaries found ARROW-11232 - [C++] Table::CombineChunks() returns incorrect results if Table has no column ARROW-11233 - [C++][Flight] Fail to link with bundled gRPC and Abseil ARROW-11237 - [C++] Compiler error with GLog and unity build enabled ARROW-11251 - [CI] Make sure that devtoolset-8 is really installed + being used ARROW-11253 - [R] Make sure that large metadata tests are reproducible ARROW-11255 - [Packaging][Conda][macOS] Fix Python version ARROW-11257 - [C++][Parquet] PyArrow Table contains different data after writing and reloading from Parquet ARROW-11271 - [Rust] [Parquet] List schema to Arrow parser misinterpreting child nullability ARROW-11274 - [Packaging][wheel][Windows] Fix wheels path for Gemfury ARROW-11275 - [Packaging][wheel][Linux] Fix paths for Gemfury ARROW-11277 - [C++] Fix compilation error in dataset expressions on macOS 10.11 ARROW-11283 - [Julia] Fix install link ARROW-11286 - [Release][Yum] Fix minimal build example check ARROW-11287 - [Packaging][RPM] Add missing dependencies ARROW-11299 - [Python] build warning in python ARROW-11301 - [C++] Fix reading LZ4-compressed Parquet files produced by Java Parquet implementation ARROW-11302 - [Release][Python] Remove verification of python 3.5 wheel on macOS ARROW-11303 - [Release][C++] Enable mimalloc in the windows verification script ARROW-11305 - [Rust]: parquet-rowcount binary tries to open itself as a parquet file ARROW-11306 - [Packaging][Ubuntu][16.04] Add missing libprotobuf-dev dependency ARROW-11320 - [C++] Spurious test failure when creating temporary dir ARROW-11322 - [Rust] Arrow `memory` made private is a breaking API change PARQUET-1935 - [C++][Parquet] nullptr access violation when writing arrays of non-nullable values New Features and Improvements ARROW-1846 - [C++] Implement “any” reduction kernel for boolean data ARROW-3850 - [Python] Support MapType and StructType for enhanced PySpark integration ARROW-4193 - [Rust] Add support for decimal data type ARROW-4544 - [Rust] Read nested JSON structs into StructArrays ARROW-4804 - [Rust] Read temporal values from CSV - Parse Date32 and Date64 in CSV reader ARROW-4960 - [R] Add crossbow task for r-arrow-feedstock ARROW-4970 - [C++][Parquet] Implement parquet::FileMetaData::Equals ARROW-5336 - [C++] Implement arrow::Concatenate for dictionary-encoded arrays with unequal dictionaries ARROW-5350 - [Rust] Support filtering on primitive/string lists ARROW-5394 - [C++] Benchmarks for IsIn Kernel ARROW-5679 - [Python] Drop Python 3.5 from support matrix ARROW-5950 - [Rust] [DataFusion] Add ability to log via logger dependency ARROW-6071 - [C++] Implement casting Binary &lt;-&gt; LargeBinary ARROW-6697 - [Rust] [DataFusion] Validate that all parquet partitions have the same schema ARROW-6715 - [Website] Describe “non-free” component is needed for Plasma packages in install page ARROW-6883 - [C++] Support sending delta DictionaryBatch or replacement DictionaryBatch in IPC stream writer class ARROW-6995 - [Packaging][Crossbow] The windows conda artifacts are not uploaded to GitHub releases ARROW-7531 - [C++] Investigate header cost reduction ARROW-7633 - [C++][CI] Create fuzz targets for tensors and sparse tensors ARROW-7800 - [Python] Expose GetRecordBatchReader API in PyArrow ARROW-7842 - [Rust] [Parquet] Implement array reader for list type ARROW-8113 - [C++] Implement a lighter-weight variant ARROW-8199 - [C++] Add support for multi-column sort on Table ARROW-8289 - [Rust] [Parquet] Implement minimal Arrow Parquet writer as starting point for full writer ARROW-8423 - [Rust] [Parquet] Serialize arrow schema into metadata when writing parquet ARROW-8425 - [Rust] [Parquet] Add support for writing temporal types ARROW-8426 - [Rust] [Parquet] Add support for writing dictionary types ARROW-8426 - [Rust] [Parquet] Add support for writing dictionary types ARROW-8853 - [Rust] [Integration Testing] Enable Flight tests ARROW-8876 - [C++] Implement casts from date types to Timestamp ARROW-8883 - [Rust] [Integration Testing] Enable passing tests and update spec doc ARROW-9001 - [R] Box outputs as correct type in call_function ARROW-9128 - [C++] Implement string space trimming kernels: trim, ltrim, and rtrim ARROW-9164 - [C++] Provide APIs for adding “docstrings” to arrow::compute::Function classes that can be accessed by bindings ARROW-9187 - [R] Add bindings for arithmetic kernels ARROW-9296 - [CI][Rust] Enable more clippy lint checks ARROW-9304 - [C++] Add “AppendEmptyValue” builder APIs for use inside StructBuilder::AppendNull ARROW-9361 - [Rust] Move other array types into their own modules ARROW-9400 - [Python] Do not depend on conda-forge static libraries in Windows wheel builds ARROW-9475 - [Java] Clean up usages of BaseAllocator, use BufferAllocator instead ARROW-9489 - [C++] Add fill_null kernel implementation for (array[string], scalar[string]) ARROW-9555 - [Rust] [DataFusion] Add inner (hash) equijoin physical plan ARROW-9564 - [Packaging] Vendor r-arrow-feedstock conda-forge recipe ARROW-9674 - [Rust] Parquet reader should implement Send + Sync ARROW-9704 - [Java] TestEndianness.testLittleEndian fails on big endian platform ARROW-9707 - [Rust] [DataFusion] Re-implement threading model ARROW-9709 - [Java] Test cases in arrow-vector assume little-endian platform ARROW-9728 - [Rust] [Parquet] Compute nested definition and repetition for structs ARROW-9747 - [C++][Java][Format] Support Decimal256 Type ARROW-9771 - [Rust] [DataFusion] Predicate Pushdown Improvement: treat predicates separated by AND separately ARROW-9803 - [Go] Add initial support for s390x ARROW-9804 - [FlightRPC] Authentication Redesign ARROW-9828 - [Rust] [DataFusion] TableProvider trait should support predicate push-down ARROW-9861 - [Java] Failed Arrow Vector on big-endian platform ARROW-9862 - Throw an exception in UnsafeDirectLittleEndian on Big-Endian platform ARROW-9911 - [Rust][DataFusion] SELECT &lt;expression&gt; with no FROM clause should produce a single row of output ARROW-9945 - [C++][Dataset] Refactor Expression::Assume to return a Result ARROW-9991 - [C++] split kernels for strings/binary ARROW-10002 - [Rust] Trait-specialization requires nightly ARROW-10021 - [C++][Compute] Support finding nth frequently used value in mode kernel ARROW-10032 - [Documentation] C++ Windows docs are out of date ARROW-10079 - [Rust]: Benchmark and improve count_set_bits function ARROW-10095 - [Rust] [Parquet] Update for IPC changes ARROW-10097 - [C++] Persist SetLookupState in between usages of IsIn when filtering dataset batches ARROW-10106 - [FlightRPC][Java] Expose onIsReady() callback on OutboundStreamListener ARROW-10108 - [Rust] [Parquet] Fix compiler warning about unused return value ARROW-10109 - [Rust] Add support to produce a C Data interface ARROW-10110 - [Rust] Add support to consume C Data Interface ARROW-10131 - [C++][Dataset] Lazily parse parquet metadata / statistics in ParquetDatasetFactory and ParquetFileFragment ARROW-10135 - [Rust] [Parquet] Refactor file module to help adding sources ARROW-10143 - [C++] ArrayRangeEquals should accept EqualOptions ARROW-10144 - [Flight] Add support for using the TLS_SNI extension ARROW-10149 - [Rust] Add support to external release of un-owned buffers ARROW-10163 - [Rust] [DataFusion] Add DictionaryArray coercion support ARROW-10168 - [Rust] [Parquet] Extend arrow schema conversion to projected fields ARROW-10173 - [Rust][DataFusion] Improve performance of equality to a constant predicate support ARROW-10180 - [C++][Doc] Update dependency management docs following aws-sdk-cpp addition ARROW-10182 - [C++] Add basic continuation support to futures ARROW-10191 - [Rust] [Parquet] Add roundtrip tests for single column batches ARROW-10197 - [Gandiva][python] Execute expression on filtered data ARROW-10203 - [Doc] Capture guidance for endianness support in contributors guide. ARROW-10207 - [C++] Unary kernels that results in a list have no preallocated offset buffer ARROW-10208 - [C++] String split kernels do not propagate nulls correctly on sliced input ARROW-10216 - [Rust] Simd implementation of min/max aggregation kernels for primitive types ARROW-10224 - [Python] Add support for Python 3.9 except macOS wheel and Windows wheel ARROW-10225 - [Rust] [Parquet] Fix null bitmap comparisons in roundtrip tests ARROW-10228 - [Julia] Donate Julia Implementation ARROW-10236 - [Rust] [DataFusion] Make DataFusion casting rules consistent with cast kernel ARROW-10241 - [C++][Compute] Add variance kernel benchmark ARROW-10249 - [Rust]: Support Dictionary types for ListArrays in arrow json reader ARROW-10259 - [Rust] Support field metadata ARROW-10261 - [Rust] [BREAKING] Lists should take Field instead of DataType ARROW-10263 - [C++][Compute] Improve numerical stability of variances merging ARROW-10268 - [Rust] Support writing dictionaries to IPC file and stream ARROW-10269 - [Rust] Update nightly: Oct 2020 Edition ARROW-10277 - [C++] Support comparing scalars approximately ARROW-10289 - [Rust] Support reading dictionary streams ARROW-10292 - [Rust] [DataFusion] Simplify merge ARROW-10295 - [Rust] [DataFusion] Simplify accumulators ARROW-10299 - [Rust] Support reading and writing V5 of IPC metadata ARROW-10300 - [Rust] Improve benchmark documentation for generating/converting TPC-H data ARROW-10301 - [C++] Add “all” boolean reducing kernel ARROW-10302 - [Python] Don’t double-package plasma-store-server ARROW-10304 - [C++][Compute] Optimize variance kernel for integers ARROW-10310 - [C++][Gandiva] Add single argument round() in Gandiva ARROW-10311 - [Release] Update crossbow verification process ARROW-10313 - [C++] Improve UTF8 validation speed and CSV string conversion ARROW-10318 - [C++] Use pimpl idiom in CSV parser ARROW-10319 - [Flight][Go] Add Context to Client Auth Handler functions for Flight ARROW-10320 - [Rust] Convert RecordBatchIterator to a Stream ARROW-10322 - [C++][Dataset] Minimize Expression to a wrapper around compute::Function ARROW-10323 - [Release][wheel] Add missing verification setup step ARROW-10325 - [C++][Compute] Separate aggregate kernel registration ARROW-10328 - [C++] Consider using fast-double-parser ARROW-10330 - [Rust][Datafusion] Implement nullif() function for DataFusion ARROW-10331 - [Rust] [DataFusion] Re-organize errors ARROW-10332 - [Rust] Allow CSV reader to start from a line ARROW-10334 - [Rust] [Parquet] Support reading and writing Arrow NullArray ARROW-10336 - [Rust] Support fromIter and toIter for strings ARROW-10337 - [C++] More liberal parsing of ISO8601 timestamps with fractional seconds ARROW-10338 - [Rust]: Use const fn for applicable methods ARROW-10340 - [Packaging][deb][RPM] Use Python 3.8 for pygit2 ARROW-10356 - [Rust] [DataFusion] Add support for is_in ARROW-10363 - [Python] Remove workaround for CMake bug in manylinux ARROW-10366 - [Rust] [DataFusion] Remove collect from merge ARROW-10375 - [Rust] Remove PrimitiveArrayOps ARROW-10378 - [Rust] Update take() kernel with support for large lists ARROW-10381 - [Rust] Generalize Arrow to support MergeSort ARROW-10382 - [Rust] Fix typos and spelling ARROW-10383 - [Doc] Fix typos and spelling ARROW-10384 - [C++] Fix typos and spelling ARROW-10385 - [C++][Gandiva] Add support for LLVM 11 ARROW-10389 - [Rust][DataFusion] Make the custom source implementation API more explicit ARROW-10392 - [C++][Gandiva] Avoid string copy while evaluating IN expression ARROW-10396 - [Rust] [Parquet] Expose SliceableCursor and FileSource ARROW-10398 - [Rust] [Parquet] Re-export parquet::record::api::Field ARROW-10400 - Propagate TLS client peer_identity when using mutual TLS ARROW-10402 - [Rust] Improve array equality ARROW-10407 - [C++] Division Support in Decimal256 ARROW-10408 - [Java] Upgrade Avro dependency to 1.10 ARROW-10410 - [Rust] Some refactorings ARROW-10416 - [R] Support Tables in Flight ARROW-10422 - [Rust] Removed unused BinaryArrayBuilder ARROW-10424 - [Rust] Simplify code for impl PrimitiveArray ARROW-10428 - [FlightRPC][Java] Add support for HTTP cookies ARROW-10445 - [Rust] Add DoubleEnded to PrimitiveArrayIter ARROW-10449 - [Rust] Make dictionary keys be a PrimitiveArray ARROW-10454 - [Rust][Datafusion] support creating ParquetExec from externally resolved file list and schema ARROW-10455 - [Rust] Fix CI cache misses on windows ARROW-10458 - [Rust] [Datafusion] context.create_logical_plan should not take a mutable self reference ARROW-10464 - [Rust] Implement utility to convert TPC-H tbl files to CSV and Parquet ARROW-10466 - [Rust] [Website] Update implementation status page ARROW-10467 - [FlightRPC][Java] Ability to pass arbitrary client properties to server ARROW-10468 - [C++][Compute] Refactor FunctionExecutor -&gt; KernelExecutor ARROW-10476 - [Rust] Allow string array to be built from iterator of &amp;str ARROW-10477 - [Rust] Add support for iterators over binary arrays ARROW-10478 - [Dev][Release] Correct Java versions to 3.0.0-SNAPSHOT ARROW-10481 - [R] Bindings to add, remove, replace Table columns ARROW-10483 - [C++] Move Executor into a separate header ARROW-10484 - [C++] Future&lt;{void,Status}&gt; could be more generic ARROW-10487 - [FlightRPC][C++] Header-based auth in clients ARROW-10490 - [C++][GLib] Fail to build with Xcode 12.0.1 ARROW-10492 - [Java][JDBC] Allow users to config the mapping between SQL types and Arrow types ARROW-10504 - [C++] Suppress UBSAN pointer-overflow warning in RapidJSON ARROW-10510 - [Rust] [DataFusion] Add benchmarks for COUNT(DISTINCT) ARROW-10515 - [Julia][Doc] Update lists of supported languages to include Julia ARROW-10522 - [R] Allow rename Table and RecordBatch columns with names() ARROW-10526 - [FlightRPC][C++] HTTP cookie handling in clients ARROW-10530 - [R] Optionally use distro package in linuxlibs.R ARROW-10531 - [Rust] [DataFusion] Better display for logical plans: Graphviz and Schema information ARROW-10539 - [Packaging][Python] Use GitHub Actions to build wheels for Windows ARROW-10540 - [Rust] Allow unary kernels of arbitrary array types ARROW-10541 - [C++] Add re2 library to core arrow / ARROW_WITH_RE2 ARROW-10542 - [C#][Flight] Add beginning on flight code for net core ARROW-10543 - [Developer] Update dev instructions to note there may be a timelag ARROW-10552 - [Rust] Remove un-used Result from Buffer ARROW-10559 - [Rust] [DataFusion] Break up logical_plan/mod.rs into smaller modules ARROW-10561 - [Rust] Simplify `MutableBuffer::write` and `MutableBuffer::write_bytes` ARROW-10562 - [Rust] Potential UB on unsafe code ARROW-10566 - [C++] Array validation should work on ArrayData ARROW-10567 - [C++][FlightRPC] Add options to help increase precision of arrow-flight-benchmark ARROW-10572 - [Rust][DataFusion] Use aHash and std::collections hashmap for aggregates / distinct ARROW-10574 - [Python][Parquet] Allow collections for ‘in’ / ‘not in’ filter (in addition to sets) ARROW-10575 - [Rust] Rename union.rs to be cosistent with other arrays ARROW-10581 - [Doc] IPC dictionary reference to relevant section ARROW-10582 - [Rust] [DataFusion] Implement “repartition” operator ARROW-10584 - [Rust] [DataFusion] Implement SQL join support using explicit JOIN ON syntax ARROW-10585 - [Rust] [DataFusion] Add join support to DataFrame and LogicalPlan ARROW-10586 - [Rust] [DataFusion] Add join support to query planner ARROW-10589 - [Rust]: Implement AVX-512 bit and operation ARROW-10590 - [Rust] Remove Date32(Millisecond) from test ARROW-10591 - [Rust] Add support to structArrays for MutableArrayData ARROW-10595 - [Rust] Simplify inner loop of min/max kernels for non-null case ARROW-10596 - [Rust] Improve take benchmark ARROW-10598 - [C++] Improve performance of GenerateBitsUnrolled ARROW-10604 - [Ruby] Support Decimal256 type ARROW-10607 - [C++][Parquet] Support Reading/Writing Decimal256 type in Parquet ARROW-10609 - [Rust] Optimize min/max of non null strings ARROW-10628 - [Rust] Make clippy error on clippy warnings ARROW-10633 - [Rust][DataFusion] Dependency version upgrades ARROW-10634 - [C#][CI] Change the build version from 2.2 to 3.1 in CI ARROW-10636 - [Rust] Remove specialisation from Rust parquet ARROW-10637 - [Rust] Add examples to boolean kernels ARROW-10638 - [Rust] Improve tests of boolean kernels ARROW-10639 - [Rust] Simplify signature of is_null and add example ARROW-10644 - [Python] Consolidate path/filesystem handling in pyarrow.dataset and pyarrow.fs ARROW-10646 - [C++][FlightRPC] Disable flaky test ARROW-10648 - [Java] Prepare Java codebase for source release without requiring any git tags to be created or pushed ARROW-10651 - [C++] alloc-dealloc-mismatch in s3fs.cc ARROW-10652 - [C++][Gandiva] Make gandiva cache size configurable ARROW-10653 - [Rust]: Update toolchain version to bring new features ARROW-10654 - [Rust] Specialize parsing of floats / bools ARROW-10660 - [Rust] Implement AVX-512 bit or operation ARROW-10665 - [Rust] Add fast paths for common utf8 like patterns ARROW-10666 - [Rust] [DataFusion] Support nested SELECT statements ARROW-10669 - [C++][Compute] Support Scalar inputs to boolean kernels ARROW-10672 - [Rust] [DataFusion] Make limit be computed as a stream ARROW-10673 - [Rust] [DataFusion] Make sort be computed on the stream ARROW-10674 - [Rust] Add integration tests for Decimal type ARROW-10677 - [Rust] Fix Bug and Add tests as documentation showing supported csv parsing ARROW-10679 - [Rust] [DataFusion] Implement SQL CASE WHEN physical expression ARROW-10680 - [Rust] [DataFusion] Implement TPC-H Query 12 ARROW-10682 - [Rust] Sort kernel performance tuning ARROW-10685 - [Rust] [DataFusion] Add support for join on filter pushdown optimizer ARROW-10688 - [Rust] [DataFusion] Support CASE WHEN from DataFrame API ARROW-10689 - [Rust] [DataFusion] Support CASE WHEN from SQL ARROW-10693 - [Rust] [DataFusion] Add support for the left join ARROW-10696 - [C++] Investigate a bit run reader that would only return runs of set bits ARROW-10697 - [C++] Consolidate bitmap word readers ARROW-10703 - [Rust] [DataFusion] Make join not collect left on every part ARROW-10704 - [Rust][DataFusion] Remove Nested from expression enum ARROW-10708 - [Packaging][deb] Add support for Ubuntu 20.10 ARROW-10709 - [Python] Difficult to make an efficient zero-copy file reader in Python ARROW-10712 - [Rust] [DataFusion] Add tests to TPC-H benchmarks ARROW-10717 - [Rust] [DataFusion] Add support for right join ARROW-10720 - [C++] Add BasicDecimal256 Rescale Support ARROW-10721 - [C#][CI] Use .NET 3.1 by default ARROW-10722 - [Rust][DataFusion] Reduce overhead in data types in aggregations / joins, improve benchmarks ARROW-10723 - [Packaging][deb][RPM] Enable Parquet encription ARROW-10724 - [Developer Tools] Add labeler to when PRs need rebase ARROW-10725 - [Python][Compute] Exposing bindings for sort options ARROW-10728 - [Rust] [DataFusion] Add SQL support for JOIN with USING clause ARROW-10729 - [Rust] [DataFusion] Add SQL support for JOIN using implicit syntax ARROW-10732 - [Rust] [DataFusion] Add SQL support for table/relation aliases and compound identifiers ARROW-10733 - [R] Improvements to Linux installation troubleshooting ARROW-10740 - [Rust][DataFusion] Remove redundant clones found by clippy ARROW-10741 - Apply clippy lints to source code, remove them from ignore list ARROW-10742 - [Python] Mask not checked when creating array from numpy array ARROW-10745 - [Rust] Allocate padding bytes in filter context ARROW-10747 - [Rust] Optimizations for csv reader ARROW-10750 - [Rust] [DataFusion] Add SQL support for LEFT and RIGHT join ARROW-10752 - [GLib] Add garrow_schema_has_metadata() ARROW-10754 - [GLib] Add support for metadata to GArrowField ARROW-10755 - [Rust] [Parquet] Add support for writing boolean type ARROW-10756 - [Rust] Clippy - fix reduntant clone ARROW-10759 - [Rust][DataFusion] Implement support for casting string to date in sql expressions ARROW-10763 - [Rust] Speed up take kernels ARROW-10765 - [Rust] Optimize take strings for non-null arrays ARROW-10767 - [Rust] Speed up sum kernel with nulls ARROW-10770 - [Rust] Support reading nested JSON lists ARROW-10772 - [Rust] Improve take performance ARROW-10775 - [Rust][DataFusion] Use ahash in hash join ARROW-10776 - [C++] Provide iterator access to primitive elements inside an Array ARROW-10781 - [Rust] [DataFusion] TableProvider should provide row count statistics ARROW-10783 - [Rust] [DataFusion] Implement row count statistics for Parquet TableProvider ARROW-10785 - Further optimize take string ARROW-10786 - [Packaging][RPM] Drop support for CentOS 6 ARROW-10788 - [C++] Make S3 recursive walks parallel ARROW-10789 - [Rust][DataFusion] Make TableProvider dynamically typed ARROW-10790 - [C++][Compute] Investigate ChunkedArray sort performance ARROW-10792 - [Rust] [CI] Modulararize CI for faster and smaller builds ARROW-10795 - [Rust] Fix specialization for arrow datatypes ARROW-10796 - [C++] Investigate RecordBatch sort performance ARROW-10800 - [Rust] [Parquet] Provide access to the elements of parquet::record::{List, Map} ARROW-10802 - [C++] Remove Dictionary[NullType] special casing in parquet column writer ARROW-10808 - [Rust] [DataFusion] Support nested expressions in aggregations ARROW-10809 - [C++] Use Datum for SortIndices() input ARROW-10812 - [Rust] Make BooleanArray not a PrimitiveArray ARROW-10813 - [Rust] [DataFusion] Implement DFSchema ARROW-10814 - [Packaging][deb] Drop support for Debian GNU/Linux Stretch ARROW-10817 - [Rust] [DataFusion] Implement TypedString ARROW-10820 - [Rust] [DataFusion] Complete TPC-H Benchmark Queries ARROW-10821 - [Rust] [Datafusion] implement negative expression ARROW-10822 - [Rust] [Datafusion] support compiling datafusion with simd support ARROW-10824 - [Rust] Added PartialEq for NullArray ARROW-10825 - [Rust] Add support to NullArrays for MutableArrayData ARROW-10826 - [Rust] Add support for FixedSizeBinary to MutableArrayData ARROW-10827 - [Rust] Extend concatenate to all types ARROW-10828 - [Rust][DataFusion] Enable more clippy lints ARROW-10829 - [Rust] [DataFusion] Implement Into&lt;Schema&gt; for DFSchema ARROW-10832 - [Rust] Evaluate latest snapshot flatc ARROW-10836 - [Rust] Extend take kernel to FixedSizeListArray ARROW-10838 - [Rust] [CI] Add CI for wasm32 target ARROW-10839 - [Rust] [DataFusion] Implement BETWEEN Operator ARROW-10843 - [C++] Add support for temporal types in sort family kernels ARROW-10845 - [Python][CI] Add python CI build using numpy nightly ARROW-10849 - [Python] Handle numpy deprecation warnings for builtin type aliases ARROW-10851 - [C++] Reduce code size of vector_sort.cc ARROW-10857 - [Packaging] Follow PowerTools repository name change on CentOS 8 ARROW-10858 - [C++][MSVC] Add missing Boost dependency ARROW-10861 - [Python] Update minimal NumPy version to 1.16.6 ARROW-10864 - [Rust] Use standard ordering for floats ARROW-10865 - [Rust][DataFusion] More ergonomic conversion between Schema, SchemaRef, DFSchema, and DFSchemaRef ARROW-10867 - build failure on aarch64 with -DARROW_PYTHON=ON and gcc ARROW-10869 - [GLib] Add garrow_*_sort_indices() and related options ARROW-10870 - [Julia] Update website with Julia implementation ARROW-10871 - [Julia] Setup Julia CI via GitHub Actions ARROW-10873 - [C++] Apple Silicon is reported as arm64 in CMake ARROW-10874 - [Rust][DataFusion] Add table statistics for MemTable ARROW-10877 - [Rust] [DataFusion] Add benchmark based on kaggle movies ARROW-10878 - [Rust] Simplify extend_from_slice ARROW-10879 - [Packaging][deb] Restore Debian GNU/Linux Buster configuration ARROW-10881 - [C++] EXC_BAD_ACCESS in BaseSetBitRunReader&lt;false&gt;::NextRun ARROW-10885 - [Rust][DataFusion] Optimize join build vs probe based on statistics on row number ARROW-10887 - [C++][Doc] Document IPC API ARROW-10889 - [Rust] Document our approach to unsafe code in README ARROW-10890 - [Rust] [DataFusion] JOIN support ARROW-10891 - [Rust][DataFusion] More clippy lints ARROW-10893 - [Rust] [DataFusion] Easier clippy fixes ARROW-10896 - [C++][CMake] Rename internal RE2 package name to “re2” from “RE2” ARROW-10900 - [Rust][DataFusion] Resolve TableScan provider eagerly ARROW-10904 - [Python] Add support for Python 3.9 macOS wheels ARROW-10905 - [Python] Add support for Python 3.9 windows wheels ARROW-10908 - [Rust] [DataFusion] Update relevant tpch-queries with BETWEEN ARROW-10917 - [Rust][Doc] Update feature matrix ARROW-10918 - [C++][Doc] Document supported Parquet features ARROW-10927 - [Rust] [Parquet] Add Decimal to ArrayBuilderReader for physical type fixed size binary ARROW-10927 - [Rust] [Parquet] Add Decimal to ArrayBuilderReader for physical type fixed size binary ARROW-10927 - [Rust] [Parquet] Add Decimal to ArrayBuilderReader for physical type fixed size binary ARROW-10929 - [Rust] Migrate CI tests to stable rust ARROW-10933 - [Rust] Update docs in regard to stable rust ARROW-10934 - [Python] Tests are failed with fsspec-0.8.5 ARROW-10938 - [Rust] upgrade dependency “flatbuffers” to 0.8 ARROW-10940 - [Rust] Extend sort kernel to ListArray ARROW-10941 - [Doc][C++] Document supported Parquet encryption features ARROW-10944 - [Rust] Implement min/max kernels for BooleanArray ARROW-10946 - [Rust] Make ChunkIter not depend on a buffer ARROW-10947 - [Rust][DataFusion] Refactor UTF8 to Date32 for Performance ARROW-10948 - [C++] Always use GTestConfig.cmake ARROW-10949 - [Rust] Avoid clones in getting values of boolean arrays ARROW-10951 - [Python][CI] Nightly pandas builds failing because of pytest monkeypatch issue ARROW-10952 - [Rust] Add pre-commit hook ARROW-10966 - [C++] Use FnOnce for ThreadPool’s tasks instead of std::function ARROW-10968 - [Rust][DataFusion] Don’t build hash table for right side of the join ARROW-10969 - [Rust][DataFusion] Implement ANSI SQL Functions ARROW-10985 - [Rust] Update unsafe guidelines for adding JIRA references ARROW-10986 - [Rust][DataFusion] Add average statistic to TCP-H benchmark too ARROW-10988 - [C++] Require CMake 3.5 or later ARROW-10989 - [Rust] Use slices for iterating primitive arrays ARROW-10993 - [CI][macOS] Fix Python 3.9 installation by Homebrew ARROW-10995 - [Rust] [DataFusion] Improve parallelism when reading Parquet files ARROW-11004 - [FlightRPC][Python] Header-based auth in clients ARROW-11005 - [Rust] Remove indirection from take kernel and simplify interface ARROW-11008 - [Rust][DataFusion] Simplify count accumulator ARROW-11009 - [Python] Add environment variable to elect default usage of system memory allocator instead of jemalloc/mimalloc ARROW-11010 - [Python] `np.float` deprecation warning in `_pandas_logical_type_map` ARROW-11012 - [Rust] [DataFusion] Make write_csv and write_parquet concurrent ARROW-11015 - [CI][Gandiva] Move gandiva nightly build from travis to github action ARROW-11018 - [Rust][DataFusion] Add null count column statistics ARROW-11026 - [Rust]: Run tests without requiring environment variables ARROW-11028 - [Rust] Somewhat pedantic pattern-matches ARROW-11029 - [Rust] [DataFusion] Document why join order optimization does not work with filter pushdown ARROW-11032 - [C++][FlightRPC] Add benchmark for local RPC through unix socket ARROW-11033 - [Rust] CSV writer performance improvements ARROW-11034 - [Rust] rustfmt cleanup ARROW-11035 - [Rust] Improve performance of cast to utf8 via FromIter ARROW-11037 - [Rust] Improve performance of string fromIter ARROW-11038 - [Rust] Remove `BufferBuilderTrait` and associated Result requirement. ARROW-11039 - [Rust] Improve performance for utf8 to float cast ARROW-11040 - [Rust] Simplify builders with generics ARROW-11042 - [Rust][DataFusion] Increase default batch size ARROW-11043 - [C++] Add “is_nan” kernel ARROW-11046 - [Rust][DataFusion] Add count_distinct to dataframe API ARROW-11049 - [Python] Expose alternate memory pools ARROW-11052 - [Rust] [DataFusion] Implement metrics in join operator ARROW-11053 - [Rust] [DataFusion] Optimize joins with dynamic capacity for output batches ARROW-11054 - Update SQLParser to 0.70 ARROW-11055 - [Rust] [DataFusion] Support date_trunc function ARROW-11058 - [Rust] [DataFusion] Implement “coalesce batches” operator ARROW-11063 - [Rust] Validate null counts when building arrays ARROW-11064 - [Rust][DataFusion] Speed up hash join on smaller batches ARROW-11072 - [Rust] [Parquet] Support int32 and int64 physical types ARROW-11076 - [Rust][DataFusion] Refactor usage of right indices in hash join ARROW-11079 - [R] Catch up on changelog since 2.0 ARROW-11080 - [C++][Dataset] Improvements to implicit casting ARROW-11082 - [Rust] Add FFI for LargeUtf8 ARROW-11086 - [Rust] Extend take to support more index types ARROW-11091 - [Rust][DataFusion] Fix clippy warning in rust 1.49 ARROW-11095 - [Python] Access pyarrow.RecordBatch column by name ARROW-11096 - [Rust] Add FFI for [Large]Binary ARROW-11097 - [Rust] Simplify tests ARROW-11099 - [Rust]: Remove unsafe value_slice method from PrimitiveArray and BooleanArray ARROW-11100 - [Rust] Speed up numeric to string cast using lexical_core ARROW-11101 - [Rust] enable “cargo +nightly fmt” in git pre-commit hook ARROW-11104 - [GLib] Add append_null/append_nulls to GArrowArrayBuilder and use them ARROW-11105 - [Rust] Favor From/Into traits in MutableBuffer ARROW-11108 - [Rust] Improve performance of MutableBuffer ARROW-11109 - [GLib] Add garrow_array_builder_append_empty_value() and values() ARROW-11110 - [Rust] [Datafusion] context.table should not take a mutable self reference ARROW-11111 - [GLib] Add GArrowFixedSizeBinaryArrayBuilder ARROW-11121 - [Developer] Use pull_request_target for PR JIRA integration ARROW-11122 - [Rust] Add FFI for date and time ARROW-11124 - [Doc] Update status matrix for Decimal256 ARROW-11125 - [Rust] Implement logical equality for list arrays ARROW-11126 - [Rust] Document and test ARROW-10656 ARROW-11127 - [C++] Unused cpu_info on non-x86 architecture ARROW-11129 - [Rust][DataFusion] Use tokio thread pool for loading parquet ARROW-11130 - [Website][CentOS 8][RHEL 8] Enable all required repositories by default ARROW-11131 - [Rust] Improve performance of bool_equal ARROW-11136 - [R] Bindings for is.nan ARROW-11137 - [Rust][DataFusion] Fix Clippy needless_range_loop, needless_lifetimes ARROW-11138 - [Rust] [DataFusion] Support ltrim, rtrim ARROW-11139 - [GLib] Add support for extension type ARROW-11155 - [C++][Packaging] Move gandiva crossbow jobs off of Travis-CI ARROW-11158 - [Julia] Implement Decimal256 support ARROW-11159 - [Developer] Consolidate pull request related jobs ARROW-11165 - [Rust] [DataFusion] Document the desired SQL dialect for DataFusion ARROW-11168 - [Rust] Fix cargo doc warnings ARROW-11169 - [Rust] Add a comment explaining where float total_order algorithm came from ARROW-11175 - [R] Small docs fixes ARROW-11176 - [R] Expose memory pool name and document setting it ARROW-11187 - [Rust] [Parquet] Pin specific parquet-format-rs version ARROW-11188 - [Rust] Implement crypto functions from PostgreSQL dialect ARROW-11193 - [Documentation] Add docs for Java ListVector ARROW-11194 - [Rust] Enable SIMD for aarch64 ARROW-11195 - [Rust] [DataFusion] Built-in table providers should expose relevant fields ARROW-11196 - [GLib] Add support for mock, HDFS and S3 file systems with factory function ARROW-11198 - [Packaging][Python] Ensure setuptools version during build supports markdown ARROW-11200 - [Rust] [DateFusion] Physical operators and expressions should have public accessor methods ARROW-11201 - [Rust] create_batch_empty - support more types ARROW-11203 - [Developer][Website] Enable JIRA and pull request integration ARROW-11204 - [C++] Fix build failure with bundled gRPC and Protobuf ARROW-11205 - [GLib][Dataset] Add GADFileFormat and its family ARROW-11209 - [Rust] DF - Provide better error message on unsupported GROUP BY ARROW-11210 - [CI] Restore workflows that had been blocked by INFRA ARROW-11212 - [Packaging][Python] Use vcpkg as dependency source for manylinux and windows wheels ARROW-11213 - [Packaging][Python] Dockerize wheel building on windows ARROW-11215 - [CI] Use named volumes by default for caching in docker-compose ARROW-11216 - [Rust] Improve documentation for StringDictionaryBuilder ARROW-11218 - [R] Make SubTreeFileSystem print method more informative ARROW-11219 - [CI][Ruby][MinGW] Reduce CI time ARROW-11221 - [Rust] DF Implement GROUP BY support for Float32/Float64 ARROW-11231 - [Packaging] Add mimalloc to Linux builds ARROW-11234 - [CI][Ruby][macOS] Reduce CI time ARROW-11236 - [Java] Bump Jackson to 2.11.4 ARROW-11240 - [Packaging][R] Add mimalloc to R packaging ARROW-11242 - [CI] Remove CMake 3.2 job ARROW-11245 - [C++][Gandiva] Add support for LLVM 11.1 ARROW-11247 - [C++] Infer date32 columns in CSV ARROW-11256 - [Packaging][Linux] Don’t buffer packaging output ARROW-11268 - [Rust][DataFusion] Support specifying repartitions in MemTable ARROW-11272 - [Release][wheel] Remove unsupported Python 3.5 and manylinux1 ARROW-11273 - [Release][deb] Remove unsupported Debian GNU/Linux stretch ARROW-11278 - [Release][NodeJS] Don’t touch ~/.bash_profile ARROW-11280 - [Release][APT] Fix minimal build example check ARROW-11281 - [C++] Remove needless runtime RapidJSON dependency ARROW-11282 - [Packaging][deb] Add missing libgflags-dev dependency ARROW-11285 - [Release][APT] Add support for Ubuntu Groovy ARROW-11292 - [Release][JS] Use Node.JS LTS ARROW-11293 - [C++] Don’t require Boost and gflags with find_package(Arrow) ARROW-11307 - [Release][Ubuntu][20.10] Add workaround for dependency issue ARROW-11309 - [Release][C#] Use .NET 3.1 for verification ARROW-11329 - [Rust] Do not rebuild the library on every change ARROW-11372 - Support RC verification on macOS-ARM64 PARQUET-1566 - [C++] Indicate if null count, distinct count are present in column statistics" />
<meta property="og:description" content="Apache Arrow 3.0.0 (26 January 2021) This is a major release covering more than 3 months of development. Download Source Artifacts Binary Artifacts For CentOS For Debian For Python For Ubuntu Git tag Contributors This release includes 648 commits from 106 distinct contributors. $ git shortlog -sn apache-arrow-2.0.0..apache-arrow-3.0.0 71 Jorge C. Leitao 64 Sutou Kouhei 48 Antoine Pitrou 48 Heres, Daniel 27 Andy Grove 27 Neville Dipale 24 Joris Van den Bossche 19 Neal Richardson 15 Benjamin Kietzman 14 Carol (Nichols || Goulding) 14 Uwe L. Korn 12 Jonathan Keane 12 Krisztián Szűcs 11 Andrew Lamb 11 Mike Seddon 11 mqy 10 Kazuaki Ishizaki 10 Qingping Hou 8 Jörn Horstmann 8 Yibo Cai 8 alamb 7 Andrew Wieteska 7 Ian Cook 7 Mahmut Bulut 6 Dmitry Patsura 6 Projjal Chanda 6 liyafan82 6 rdettai 5 Daniel Russo 5 David Li 5 Florian Müller 5 Jacob Quinn 5 James Duong 4 Micah Kornfield 4 Weston Pace 3 Jimmy Lu 3 Maarten A. Breddels 3 Terence D. Honles 3 Will Jones 2 Bei 2 Benjamin Wilhelm 2 Christian Lundgren 2 Daniël Heres 2 Evan Chan 2 Fokko Driesprong 2 François Garillot 2 Greg Bowyer 2 Kirill Lykov 2 Kyle Porter 2 Lyndon Bauto 2 Markus Westerlind 2 Sagnik Chakraborty 2 Sean Moran 2 Wes McKinney 2 Yordan Pavlov 2 jensenrichardson 2 wayne 2 Östman Alexander 1 Bruno LE HYARIC 1 Chiyang Wan 1 Christoph Schulze 1 Diana Clarke 1 Eric Erhardt 1 Erik Fonselius 1 Frank Du 1 Frank Smith 1 Gidon Gershinsky 1 H-Plus-Time 1 Hongze Zhang 1 Jake Goulding 1 Jeremy 1 Johan Peltenburg 1 Jorge Leitao 1 Josiah 1 Kopilov Aleksandr 1 Marc Prud&#39;hommeaux 1 Marius van Niekerk 1 Matthew Pollock 1 Matthew Topol 1 Michal 1 Mingyu Zhong 1 Nick Bruno 1 Paddy Horan 1 Pavel Tiunov 1 Prudhvi Porandla 1 Radu Teodorescu 1 Romain Francois 1 Ruan Pearce-Authers 1 Ruihang Xia 1 Sebastian Berg 1 Tao He 1 Tyrel Rink 1 Ulimo 1 Vivian Kong 1 Weiyang Zhao 1 Zhuo Peng 1 arw2019 1 david 1 jonathan.albrecht 1 naman1996 1 rjzamora 1 shawnding(丁晓坤) 1 tching 1 tianchen 1 tifflhl 1 xudingyu Patch Committers The following Apache committers merged contributed patches to the repository. $ git shortlog -csn apache-arrow-2.0.0..apache-arrow-3.0.0 100 Andrew Lamb 88 Antoine Pitrou 76 Sutou Kouhei 70 Jorge C. Leitao 69 Neville Dipale 54 Krisztián Szűcs 44 Neal Richardson 40 Andy Grove 26 Benjamin Kietzman 19 Joris Van den Bossche 12 David Li 8 liyafan82 7 Uwe L. Korn 7 alamb 6 Micah Kornfield 5 Praveen 4 Bryan Cutler 4 Eric Erhardt 2 GitHub 2 Paddy Horan 2 Pindikura Ravindra 2 Wes McKinney 1 Sebastien Binet Changelog Apache Arrow 3.0.0 (2021-01-26) Bug Fixes ARROW-2616 - [Python] Cross-compiling Pyarrow ARROW-6582 - [R] Arrow to R fails with embedded nuls in strings ARROW-7363 - [Python] Add combine_chunks method to ChunkedArray ARROW-7909 - [Website] Add how to install on Red Hat Enterprise Linux ARROW-8258 - [Rust] [Parquet] ArrowReader fails on some timestamp types ARROW-9027 - [Python] Split in multiple files + clean-up pyarrow.parquet tests ARROW-9479 - [JS] Table.from fails for zero-item Lists, FixedSizeLists, Maps. ditto Table.empty ARROW-9636 - [Python] Update documentation about ‘LZO’ compression in parquet.write_table ARROW-9776 - [R] read_feather causes segfault in R if file doesn’t exist ARROW-9897 - [C++][Gandiva] Add to_date() function from pattern ARROW-9897 - [C++][Gandiva] Add to_date() function from pattern ARROW-9898 - [C++][Gandiva] Error handling in castINT fails in some enviroments ARROW-9903 - [R] open_dataset freezes opening feather files on Windows ARROW-9963 - [Python] Recognize datetime.timezone.utc as UTC on conversion python-&gt;pyarrow ARROW-10039 - [Rust] Do not require memory alignment of buffers ARROW-10042 - [Rust] Buffer equalities may be incorrect ARROW-10080 - [R] Arrow does not release unused memory ARROW-10122 - [Python] Selecting one column of multi-index results in a duplicated value column. ARROW-10145 - [C++][Dataset] Assert integer overflow in partitioning falls back to string ARROW-10146 - [Python] Parquet metadata to_dict raises attribute error ARROW-10174 - [Java] Reading of Dictionary encoded struct vector fails ARROW-10177 - [CI][Gandiva] Nightly gandiva-jar-xenial fails ARROW-10186 - [Rust] Tests fail when following instructions in README ARROW-10247 - [C++][Dataset] Cannot write dataset with dictionary column as partition field ARROW-10264 - [C++][Python] Parquet test failing with HadoopFileSystem URI ARROW-10270 - [R] Fix CSV timestamp_parsers test on R-devel ARROW-10283 - [Python] Python deprecation warning for “PY_SSIZE_T_CLEAN will be required for ‘#’ formats” ARROW-10293 - [Rust] [DataFusion] Fix benchmarks ARROW-10294 - [Java] Resolve problems of DecimalVector APIs on ArrowBufs ARROW-10321 - [C++] Building AVX512 code when we should not ARROW-10333 - [Java] Remove split packages in arrow-memory-core and arrow-vectors ARROW-10345 - [C++] NaN breaks sorting ARROW-10346 - [Python] Default S3 region is eu-central-1 even with LANG=C ARROW-10348 - [C++] Fix crash on invalid Parquet file (OSS-Fuzz) ARROW-10350 - [Rust] parquet_derive crate cannot be published to crates.io ARROW-10353 - [C++] Parquet decompresses DataPageV2 pages even if is_compressed==0 ARROW-10358 - [R] Followups to 2.0.0 release ARROW-10365 - [R] Remove duplicate setting of S3 flag on macOS ARROW-10369 - [Dev] Fix archery release utility test cases ARROW-10370 - [Python] Spurious s3fs-related test failures ARROW-10371 - [R] Linux system requirements check needs to support older cmake versions ARROW-10386 - [R] List column class attributes not preserved in roundtrip ARROW-10388 - [Java] Fix Spark integration build failure ARROW-10390 - [Rust] [Parquet] Regression Can not implement custom ParquetWriter because `TryClone` is not publically exported ARROW-10393 - [Rust]: Fix null value reading in jsonreader for both dictionary and stringbuilders ARROW-10394 - [Rust] [Large]BinaryArray can be created from non-binary datatypes ARROW-10397 - [C++] Outdated and confusing comment on dictionary indices ARROW-10399 - [R] Fix performance regression from cpp11::r_string ARROW-10411 - [C++] Fix incorrect child array lengths for Concatenate of FixedSizeList ARROW-10412 - [C++] CMake Build Fails with grpc 1.33.1, “GRPC_CPP_PLUGIN-NOTFOUND: program not found or is not executable” ARROW-10413 - [Rust] [Parquet] Unignore some roundtrip tests that are passing now ARROW-10414 - [R] open_dataset doesn’t work with absolute/expanded paths on Windows ARROW-10426 - [C++] Arrow type large_string cannot be written to Parquet type column descriptor ARROW-10433 - [Python] pyarrow doesn’t work with s3fs&gt;=0.5 ARROW-10434 - [Rust] Debug formatting arrays with lengths greater than 10 and less than 20 produces incorrect values ARROW-10441 - [FlightRPC][Java] FlightClients from FlightGrpcUtils#createFlightClient shutdown gRPC channel when closed ARROW-10446 - [C++][Python] Timezone aware pd.Timestamp’s are incorrectly converted to Timestamp arrys ARROW-10448 - [Rust] PrimitiveArray::new can create arrays not in spec ARROW-10453 - [Rust] [DataFusion] Performance degredation after removing specialization ARROW-10457 - [CI] Fix Spark branch-3.0 integration tests ARROW-10461 - [Rust] Offset related bug in BitChunks::remainder_bits ARROW-10462 - [Python] ParquetDatasetPiece’s path broken when using fsspec fs on Windows ARROW-10463 - [R] Better messaging for currently unsupported CSV options in open_dataset ARROW-10470 - [R] Fix missing file error causing NYC taxi example to fail ARROW-10471 - [CI][Python] Ensure we have tests with s3fs and run those on CI ARROW-10472 - [C++][Python] casting a scalar timestamp to date32 results in Aborted (core dump) ARROW-10475 - [С++][FlightRPC] Arrow Flight Server / Client cannot be initialized with Ipv6 host ARROW-10480 - [Python] Parquet write_table creates gzipped Parquet file, not Parquet with gzip compression ARROW-10482 - [Python] Specifying compression type on a column basis when writing Parquet not working ARROW-10489 - [C++] Unable to configure or make with intel compiler ARROW-10489 - [C++] Unable to configure or make with intel compiler ARROW-10491 - [FlightRPC][Java] Fix NPE when using FlightProducer without interceptors ARROW-10493 - [C++][Parquet] Writing nullable nested strings results in wrong data in file ARROW-10495 - [C++] find_package(Arrow) is broken on Ubuntu 18 ARROW-10496 - [R][CI] Fix conda-r job ARROW-10499 - [C++][Java] Fix ORC Java JNI Crash ARROW-10502 - [C++/Python] CUDA detection messes up nightly conda-win builds ARROW-10503 - [C++] Uriparser will not compile using Intel compiler ARROW-10508 - [Java] Allow FixedSizeListVector to have empty children ARROW-10509 - [C++] Define operator«(ostream, ParquetException) for clang+Windows ARROW-10511 - [Python] Table.to_pandas() failing when timezone-awareness mismatch in metadata ARROW-10518 - Fix cast function issues in gandiva ARROW-10519 - [Python] Deadlock when PyArrow imports Pandas from multiple threads ARROW-10525 - [C++] Fix crash on unsupported IPC stream (OSS-Fuzz) ARROW-10532 - [Python] Mangled pandas_metadata when specified schema has different order as DataFrame columns ARROW-10545 - [C++] Fix crash on invalid Parquet file (OSS-Fuzz) ARROW-10546 - [Python] Deprecate the S3FSWrapper class ARROW-10547 - [Rust][DataFusion] Filter pushdown loses filters if below a user defined node ARROW-10551 - [Rust]: Fix unreproducible benchmarks ARROW-10558 - [Python] Filesystem S3 tests not independent (native s3 influences s3fs) ARROW-10560 - [Python] Crash when creating array with string over 2GB ARROW-10563 - [Packaging][C++] CMake find_package(Arrow 2.0 CONFIG REQUIRED) broken ARROW-10565 - [Python] Table.from_batches and Table.from_pandas have argument Schema_schema in documentation instead of schema ARROW-10568 - [C++][Parquet] Parquet writer crashes process when Tell() does not succeed ARROW-10569 - [C++][Python] Poor Table filtering performance ARROW-10577 - [Rust][DataFusion] Hash Aggregator stream finishes unexpectedly after going to Pending state ARROW-10578 - [C++] Comparison kernels crashing for string array with null string scalar ARROW-10610 - [C++] arrow-utility-test and arrow-csv-test causes failures on a big-endian platform ARROW-10616 - [Developer] Expand PR labeler to all supported languages ARROW-10617 - [Python] RecordBatchStreamReader’s iterator doesn’t work with python 3.8 ARROW-10619 - [C++] Fix crash on unsupported IPC stream (OSS-Fuzz) ARROW-10620 - [Rust][Parquet] move column chunk range logic to metadata.rs ARROW-10621 - [Java] flight-cpre test causes a failure on s390x ARROW-10622 - [R] Nameof&lt;&gt;() is incorrect in r-arrow build environment ARROW-10623 - [R] Version 1.0.1 breaks data.frame attributes when reading file written by 2.0.0 ARROW-10624 - [R] Proactively remove “problems” attributes ARROW-10627 - [Rust] Github master does not compile for WASM target ARROW-10629 - [CI] MinGW builds broken on Github Actions ARROW-10631 - [Rust] Equality of fixed-sized binary is incorrect. ARROW-10642 - [R] Can’t get Table from RecordBatchReader with 0 batches ARROW-10656 - [Rust] New RecordBatch requires exact match of Data Types ARROW-10656 - [Rust] New RecordBatch requires exact match of Data Types ARROW-10661 - [C#] Fix benchmarking project ARROW-10662 - [Java] Avoid integer overflow for Json file reader ARROW-10663 - [C++/Doc] The IsIn kernel ignores the skip_nulls option of SetLookupOptions ARROW-10667 - [Rust] [Parquet] Add a convenience type for writing Parquet to memory ARROW-10668 - [R] Filtering does not work with .data pronoun ARROW-10681 - [Rust] [DataFusion] TPC-H Query 12 fails with scheduler error ARROW-10684 - [Rust] Logical equality should consider parent array nullability ARROW-10690 - [Java] ComplexCopier gives incorrect result for list vector if target vector is non-empty ARROW-10692 - [Rust] Segfault while array buffer append ARROW-10699 - [C++] BitmapUInt64Reader doesn’t work on big-endian ARROW-10701 - [Rust] [Datafusion] Benchmark sort_limit_query_sql fails because order by clause specifies column index instead of expression ARROW-10705 - [Rust] Lifetime annotations in the IPC writer are too strict, preventing code reuse ARROW-10710 - [Rust] Example flight server is broken after tokio upgrade (among other things) ARROW-10711 - [CI] Remove set-env from auto-tune to work with new GHA settings ARROW-10719 - [C#] ArrowStreamWriter doesn’t write schema metadata ARROW-10746 - [C++] Use GTEST_SKIP in parquet encoding tests ARROW-10748 - [Java] TimeStampMilliVector cannot be cast to TimeStampMilliTZVector ARROW-10749 - [C++] Incorrect string format for Datum with the collection type ARROW-10751 - [C++] Add RE2 to minimal build example ARROW-10753 - [Rust] [DataFusion] Negative numbers in SQL WHERE clause not parsed correctly ARROW-10757 - [Rust] [CI] Sporadic failures due to disk filling up ARROW-10760 - [Rust] [DataFusion] Predicate push down does not support joins correctly ARROW-10769 - [CI] Integration tests are failing in master ARROW-10774 - [R] Set minimum cpp11 version ARROW-10777 - [Packaging][Python] PyPI pyarrow source dist (sdist) contains architecture dependent binaries ARROW-10778 - [Python] RowGroupInfo.statistics errors for empty row group ARROW-10779 - [Java] writeNull method in UnionListWriter doesn’t work correctly if validity at that index is already set ARROW-10780 - [R] Update known R installation issues for CentOS 7 ARROW-10791 - [Rust] StreamReader, read_dictionary duplicating schema info ARROW-10801 - [Rust] [Flight] Support sending FlightData for Dictionaries with that of a RecordBatch ARROW-10803 - [R] Support R &gt;= 3.3 and add CI ARROW-10804 - [Rust] Remove UB on parquet crate ARROW-10807 - [Rust][DataFusion] Avoid double hashing ARROW-10810 - [Rust] Speed up comparison kernels ARROW-10811 - [R][CI] Remove nightly centos6 build ARROW-10823 - MutableArrayData with use_null false yields wrong results ARROW-10830 - [Rust] json reader should not hard crash on invalid json ARROW-10833 - [Python] Avoid usage of NumPy’s PyArray_DescrCheck macro ARROW-10834 - [R] Fix print method for SubTreeFileSystem ARROW-10837 - [Rust] Use `Vec&lt;u8&gt;` for hash key instead ARROW-10840 - [C++] Parquet FileMetaData does not have key_value_metadata when built from FileMetaDataBuilder ARROW-10842 - [Rust] decouple IO from json schema inference code ARROW-10844 - [Rust] [DataFusion] join of two DataFrames is not possible ARROW-10850 - [R] Unrecognized compression type: LZ4 ARROW-10852 - [C++] AssertTablesEqual(verbose=true) segfaults if the left array has more rows ARROW-10854 - [Rust] [DataFusion] Simplified logical scans ARROW-10855 - [Python][Numpy] ArrowTypeError after upgrading NumPy to 1.20.0rc1 ARROW-10856 - [R] CentOS 7 not correctly identifying compiler version ARROW-10859 - [Rust] [DataFusion] Make collect not require ExecutionContext ARROW-10860 - [Java] Avoid integer overflow for generated classes in Vector ARROW-10863 - [Python] ExtensionArray.to_pandas not working ARROW-10863 - [Python] ExtensionArray.to_pandas not working ARROW-10875 - simplify simd cfg check ARROW-10876 - [Rust] json reader should validate value type ARROW-10897 - [Rust] Replace Arc&lt;String&gt; by String in DataType::Timestamp ARROW-10907 - [Rust] Cast UTF8 to Date64 Incorrect ARROW-10913 - [Python][Doc] Code block typo in filesystems docs ARROW-10914 - [Rust]: SIMD implementation of arithmetic kernels reads out of bounds ARROW-10915 - [Rust] Make ARROW_TEST_DATA and PARQUET_TEST_DATA absolute dirs ARROW-10921 - `TypeError: ‘coroutine’ object is not iterable` when reading parquet partitions via s3fs &gt;= 0.5 with pyarrow ARROW-10930 - [Python] LargeListType doesn’t have a value_field ARROW-10932 - [C++] BinaryMemoTable::CopyOffsets access out-of-bound address when data is empty ARROW-10932 - [C++] BinaryMemoTable::CopyOffsets access out-of-bound address when data is empty ARROW-10942 - [C++] S3FileSystem::Impl::IsEmptyDirectory fails on Amazon S3 ARROW-10943 - [Rust] Intermittent build failure in parquet encoding ARROW-10954 - [C++][Doc] PlasmaClient is threadSafe now, doc not update ARROW-10955 - [C++] Reading empty json lists results in invalid non-nullable null type ARROW-10960 - [C++][FlightRPC] Missing protobuf data_body should result in default value of empty bytes, not null ARROW-10962 - [Java][FlightRPC] FlightData deserializer should accept missing fields ARROW-10967 - [Rust] Make env vars ARROW_TEST_DATA and PARQUET_TEST_DATA optional ARROW-10990 - [Rust]: SIMD implementation of compare kernels reads out of bounds ARROW-10994 - [Rust] Fix bugs in TPC-H file conversion ARROW-10996 - [Rust] Return error messages via Result for get_arrow_schema_from_metadata ARROW-10999 - [Rust] TPC-H parquet files cannot be read by Apache Spark ARROW-11014 - [Rust] [DataFusion] ParquetExec reports incorrect statistics ARROW-11023 - [C++][CMake] gRPC doesn’t respect CMAKE_CXX_COMPILER ARROW-11024 - [C++][Parquet] Writing List&lt;Struct&gt; to parquet sometimes writes wrong data ARROW-11025 - [Rust] Bench for boolean kernels measure array creation ARROW-11030 - [Rust] [DataFusion] HashJoinExec slow with many batches ARROW-11048 - [Rust] Add bench to MutableBuffer ARROW-11050 - [R] Handle RecordBatch in write_parquet ARROW-11067 - [C++] CSV reader returns nulls for some strings on macOS ARROW-11069 - [C++] Parquet writer incorrect data being written when data type is struct ARROW-11073 - [Rust] Lint Error on CI Tests in /arrow/rust/arrow/src/ipc/reader.rs ARROW-11083 - [CI] Build “Source Release and Merge Script” is broken ARROW-11084 - [Rust] Clippy failing in master ARROW-11085 - [Rust] Rust CI no longer works b/c it uses action-rs: Migrate CI away from action-rs/* ARROW-11092 - [CI] (Temporarily) move offending workflows to separate files ARROW-11102 - [Rust][DataFusion] fmt::Debug for ScalarValue(Utf8) is always quoted ARROW-11113 - [Rust] support as_struct_array cast ARROW-11114 - [Java] Metadata serialization is broken for Field class ARROW-11132 - [CI] Use pip to install crossbow’s dependencies for the comment bot ARROW-11144 - [C++][Python][CI] Fix HDFS nightly build ARROW-11152 - [CI][C++] Fix Homebrew numpy installation on macOS builds ARROW-11162 - [C++] Fix crash on Decimal256 Parquet file (OSS-Fuzz) ARROW-11163 - [C++][Python] Compressed Feather file written with pyarrow 0.17 not readable in pyarrow 2.0.0+ ARROW-11166 - [Python][Compute] Add bindings for ProjectOptions ARROW-11171 - [Go] Build fails on s390x with noasm tag ARROW-11189 - [Developer] Achery benchmark diff cannot compare two jsons ARROW-11190 - [C++][Dataset] Clean up compiler warnings ARROW-11202 - [R][CI] Nightly builds not happening (or artifacts not exported) ARROW-11224 - [R] don’t test metadata serialization on old R versions ARROW-11226 - [Python][CI] Filesystem tests failing with s3fs 0.5.2 ARROW-11227 - [Python][CI] AMD64 Conda Python 3.7 Pandas 0.24 cron job failing in to_pandas extension dtype test ARROW-11229 - [C++][Dataset] Static build is failed ARROW-11230 - [R] Fix build failures on Windows when multiple libarrow binaries found ARROW-11232 - [C++] Table::CombineChunks() returns incorrect results if Table has no column ARROW-11233 - [C++][Flight] Fail to link with bundled gRPC and Abseil ARROW-11237 - [C++] Compiler error with GLog and unity build enabled ARROW-11251 - [CI] Make sure that devtoolset-8 is really installed + being used ARROW-11253 - [R] Make sure that large metadata tests are reproducible ARROW-11255 - [Packaging][Conda][macOS] Fix Python version ARROW-11257 - [C++][Parquet] PyArrow Table contains different data after writing and reloading from Parquet ARROW-11271 - [Rust] [Parquet] List schema to Arrow parser misinterpreting child nullability ARROW-11274 - [Packaging][wheel][Windows] Fix wheels path for Gemfury ARROW-11275 - [Packaging][wheel][Linux] Fix paths for Gemfury ARROW-11277 - [C++] Fix compilation error in dataset expressions on macOS 10.11 ARROW-11283 - [Julia] Fix install link ARROW-11286 - [Release][Yum] Fix minimal build example check ARROW-11287 - [Packaging][RPM] Add missing dependencies ARROW-11299 - [Python] build warning in python ARROW-11301 - [C++] Fix reading LZ4-compressed Parquet files produced by Java Parquet implementation ARROW-11302 - [Release][Python] Remove verification of python 3.5 wheel on macOS ARROW-11303 - [Release][C++] Enable mimalloc in the windows verification script ARROW-11305 - [Rust]: parquet-rowcount binary tries to open itself as a parquet file ARROW-11306 - [Packaging][Ubuntu][16.04] Add missing libprotobuf-dev dependency ARROW-11320 - [C++] Spurious test failure when creating temporary dir ARROW-11322 - [Rust] Arrow `memory` made private is a breaking API change PARQUET-1935 - [C++][Parquet] nullptr access violation when writing arrays of non-nullable values New Features and Improvements ARROW-1846 - [C++] Implement “any” reduction kernel for boolean data ARROW-3850 - [Python] Support MapType and StructType for enhanced PySpark integration ARROW-4193 - [Rust] Add support for decimal data type ARROW-4544 - [Rust] Read nested JSON structs into StructArrays ARROW-4804 - [Rust] Read temporal values from CSV - Parse Date32 and Date64 in CSV reader ARROW-4960 - [R] Add crossbow task for r-arrow-feedstock ARROW-4970 - [C++][Parquet] Implement parquet::FileMetaData::Equals ARROW-5336 - [C++] Implement arrow::Concatenate for dictionary-encoded arrays with unequal dictionaries ARROW-5350 - [Rust] Support filtering on primitive/string lists ARROW-5394 - [C++] Benchmarks for IsIn Kernel ARROW-5679 - [Python] Drop Python 3.5 from support matrix ARROW-5950 - [Rust] [DataFusion] Add ability to log via logger dependency ARROW-6071 - [C++] Implement casting Binary &lt;-&gt; LargeBinary ARROW-6697 - [Rust] [DataFusion] Validate that all parquet partitions have the same schema ARROW-6715 - [Website] Describe “non-free” component is needed for Plasma packages in install page ARROW-6883 - [C++] Support sending delta DictionaryBatch or replacement DictionaryBatch in IPC stream writer class ARROW-6995 - [Packaging][Crossbow] The windows conda artifacts are not uploaded to GitHub releases ARROW-7531 - [C++] Investigate header cost reduction ARROW-7633 - [C++][CI] Create fuzz targets for tensors and sparse tensors ARROW-7800 - [Python] Expose GetRecordBatchReader API in PyArrow ARROW-7842 - [Rust] [Parquet] Implement array reader for list type ARROW-8113 - [C++] Implement a lighter-weight variant ARROW-8199 - [C++] Add support for multi-column sort on Table ARROW-8289 - [Rust] [Parquet] Implement minimal Arrow Parquet writer as starting point for full writer ARROW-8423 - [Rust] [Parquet] Serialize arrow schema into metadata when writing parquet ARROW-8425 - [Rust] [Parquet] Add support for writing temporal types ARROW-8426 - [Rust] [Parquet] Add support for writing dictionary types ARROW-8426 - [Rust] [Parquet] Add support for writing dictionary types ARROW-8853 - [Rust] [Integration Testing] Enable Flight tests ARROW-8876 - [C++] Implement casts from date types to Timestamp ARROW-8883 - [Rust] [Integration Testing] Enable passing tests and update spec doc ARROW-9001 - [R] Box outputs as correct type in call_function ARROW-9128 - [C++] Implement string space trimming kernels: trim, ltrim, and rtrim ARROW-9164 - [C++] Provide APIs for adding “docstrings” to arrow::compute::Function classes that can be accessed by bindings ARROW-9187 - [R] Add bindings for arithmetic kernels ARROW-9296 - [CI][Rust] Enable more clippy lint checks ARROW-9304 - [C++] Add “AppendEmptyValue” builder APIs for use inside StructBuilder::AppendNull ARROW-9361 - [Rust] Move other array types into their own modules ARROW-9400 - [Python] Do not depend on conda-forge static libraries in Windows wheel builds ARROW-9475 - [Java] Clean up usages of BaseAllocator, use BufferAllocator instead ARROW-9489 - [C++] Add fill_null kernel implementation for (array[string], scalar[string]) ARROW-9555 - [Rust] [DataFusion] Add inner (hash) equijoin physical plan ARROW-9564 - [Packaging] Vendor r-arrow-feedstock conda-forge recipe ARROW-9674 - [Rust] Parquet reader should implement Send + Sync ARROW-9704 - [Java] TestEndianness.testLittleEndian fails on big endian platform ARROW-9707 - [Rust] [DataFusion] Re-implement threading model ARROW-9709 - [Java] Test cases in arrow-vector assume little-endian platform ARROW-9728 - [Rust] [Parquet] Compute nested definition and repetition for structs ARROW-9747 - [C++][Java][Format] Support Decimal256 Type ARROW-9771 - [Rust] [DataFusion] Predicate Pushdown Improvement: treat predicates separated by AND separately ARROW-9803 - [Go] Add initial support for s390x ARROW-9804 - [FlightRPC] Authentication Redesign ARROW-9828 - [Rust] [DataFusion] TableProvider trait should support predicate push-down ARROW-9861 - [Java] Failed Arrow Vector on big-endian platform ARROW-9862 - Throw an exception in UnsafeDirectLittleEndian on Big-Endian platform ARROW-9911 - [Rust][DataFusion] SELECT &lt;expression&gt; with no FROM clause should produce a single row of output ARROW-9945 - [C++][Dataset] Refactor Expression::Assume to return a Result ARROW-9991 - [C++] split kernels for strings/binary ARROW-10002 - [Rust] Trait-specialization requires nightly ARROW-10021 - [C++][Compute] Support finding nth frequently used value in mode kernel ARROW-10032 - [Documentation] C++ Windows docs are out of date ARROW-10079 - [Rust]: Benchmark and improve count_set_bits function ARROW-10095 - [Rust] [Parquet] Update for IPC changes ARROW-10097 - [C++] Persist SetLookupState in between usages of IsIn when filtering dataset batches ARROW-10106 - [FlightRPC][Java] Expose onIsReady() callback on OutboundStreamListener ARROW-10108 - [Rust] [Parquet] Fix compiler warning about unused return value ARROW-10109 - [Rust] Add support to produce a C Data interface ARROW-10110 - [Rust] Add support to consume C Data Interface ARROW-10131 - [C++][Dataset] Lazily parse parquet metadata / statistics in ParquetDatasetFactory and ParquetFileFragment ARROW-10135 - [Rust] [Parquet] Refactor file module to help adding sources ARROW-10143 - [C++] ArrayRangeEquals should accept EqualOptions ARROW-10144 - [Flight] Add support for using the TLS_SNI extension ARROW-10149 - [Rust] Add support to external release of un-owned buffers ARROW-10163 - [Rust] [DataFusion] Add DictionaryArray coercion support ARROW-10168 - [Rust] [Parquet] Extend arrow schema conversion to projected fields ARROW-10173 - [Rust][DataFusion] Improve performance of equality to a constant predicate support ARROW-10180 - [C++][Doc] Update dependency management docs following aws-sdk-cpp addition ARROW-10182 - [C++] Add basic continuation support to futures ARROW-10191 - [Rust] [Parquet] Add roundtrip tests for single column batches ARROW-10197 - [Gandiva][python] Execute expression on filtered data ARROW-10203 - [Doc] Capture guidance for endianness support in contributors guide. ARROW-10207 - [C++] Unary kernels that results in a list have no preallocated offset buffer ARROW-10208 - [C++] String split kernels do not propagate nulls correctly on sliced input ARROW-10216 - [Rust] Simd implementation of min/max aggregation kernels for primitive types ARROW-10224 - [Python] Add support for Python 3.9 except macOS wheel and Windows wheel ARROW-10225 - [Rust] [Parquet] Fix null bitmap comparisons in roundtrip tests ARROW-10228 - [Julia] Donate Julia Implementation ARROW-10236 - [Rust] [DataFusion] Make DataFusion casting rules consistent with cast kernel ARROW-10241 - [C++][Compute] Add variance kernel benchmark ARROW-10249 - [Rust]: Support Dictionary types for ListArrays in arrow json reader ARROW-10259 - [Rust] Support field metadata ARROW-10261 - [Rust] [BREAKING] Lists should take Field instead of DataType ARROW-10263 - [C++][Compute] Improve numerical stability of variances merging ARROW-10268 - [Rust] Support writing dictionaries to IPC file and stream ARROW-10269 - [Rust] Update nightly: Oct 2020 Edition ARROW-10277 - [C++] Support comparing scalars approximately ARROW-10289 - [Rust] Support reading dictionary streams ARROW-10292 - [Rust] [DataFusion] Simplify merge ARROW-10295 - [Rust] [DataFusion] Simplify accumulators ARROW-10299 - [Rust] Support reading and writing V5 of IPC metadata ARROW-10300 - [Rust] Improve benchmark documentation for generating/converting TPC-H data ARROW-10301 - [C++] Add “all” boolean reducing kernel ARROW-10302 - [Python] Don’t double-package plasma-store-server ARROW-10304 - [C++][Compute] Optimize variance kernel for integers ARROW-10310 - [C++][Gandiva] Add single argument round() in Gandiva ARROW-10311 - [Release] Update crossbow verification process ARROW-10313 - [C++] Improve UTF8 validation speed and CSV string conversion ARROW-10318 - [C++] Use pimpl idiom in CSV parser ARROW-10319 - [Flight][Go] Add Context to Client Auth Handler functions for Flight ARROW-10320 - [Rust] Convert RecordBatchIterator to a Stream ARROW-10322 - [C++][Dataset] Minimize Expression to a wrapper around compute::Function ARROW-10323 - [Release][wheel] Add missing verification setup step ARROW-10325 - [C++][Compute] Separate aggregate kernel registration ARROW-10328 - [C++] Consider using fast-double-parser ARROW-10330 - [Rust][Datafusion] Implement nullif() function for DataFusion ARROW-10331 - [Rust] [DataFusion] Re-organize errors ARROW-10332 - [Rust] Allow CSV reader to start from a line ARROW-10334 - [Rust] [Parquet] Support reading and writing Arrow NullArray ARROW-10336 - [Rust] Support fromIter and toIter for strings ARROW-10337 - [C++] More liberal parsing of ISO8601 timestamps with fractional seconds ARROW-10338 - [Rust]: Use const fn for applicable methods ARROW-10340 - [Packaging][deb][RPM] Use Python 3.8 for pygit2 ARROW-10356 - [Rust] [DataFusion] Add support for is_in ARROW-10363 - [Python] Remove workaround for CMake bug in manylinux ARROW-10366 - [Rust] [DataFusion] Remove collect from merge ARROW-10375 - [Rust] Remove PrimitiveArrayOps ARROW-10378 - [Rust] Update take() kernel with support for large lists ARROW-10381 - [Rust] Generalize Arrow to support MergeSort ARROW-10382 - [Rust] Fix typos and spelling ARROW-10383 - [Doc] Fix typos and spelling ARROW-10384 - [C++] Fix typos and spelling ARROW-10385 - [C++][Gandiva] Add support for LLVM 11 ARROW-10389 - [Rust][DataFusion] Make the custom source implementation API more explicit ARROW-10392 - [C++][Gandiva] Avoid string copy while evaluating IN expression ARROW-10396 - [Rust] [Parquet] Expose SliceableCursor and FileSource ARROW-10398 - [Rust] [Parquet] Re-export parquet::record::api::Field ARROW-10400 - Propagate TLS client peer_identity when using mutual TLS ARROW-10402 - [Rust] Improve array equality ARROW-10407 - [C++] Division Support in Decimal256 ARROW-10408 - [Java] Upgrade Avro dependency to 1.10 ARROW-10410 - [Rust] Some refactorings ARROW-10416 - [R] Support Tables in Flight ARROW-10422 - [Rust] Removed unused BinaryArrayBuilder ARROW-10424 - [Rust] Simplify code for impl PrimitiveArray ARROW-10428 - [FlightRPC][Java] Add support for HTTP cookies ARROW-10445 - [Rust] Add DoubleEnded to PrimitiveArrayIter ARROW-10449 - [Rust] Make dictionary keys be a PrimitiveArray ARROW-10454 - [Rust][Datafusion] support creating ParquetExec from externally resolved file list and schema ARROW-10455 - [Rust] Fix CI cache misses on windows ARROW-10458 - [Rust] [Datafusion] context.create_logical_plan should not take a mutable self reference ARROW-10464 - [Rust] Implement utility to convert TPC-H tbl files to CSV and Parquet ARROW-10466 - [Rust] [Website] Update implementation status page ARROW-10467 - [FlightRPC][Java] Ability to pass arbitrary client properties to server ARROW-10468 - [C++][Compute] Refactor FunctionExecutor -&gt; KernelExecutor ARROW-10476 - [Rust] Allow string array to be built from iterator of &amp;str ARROW-10477 - [Rust] Add support for iterators over binary arrays ARROW-10478 - [Dev][Release] Correct Java versions to 3.0.0-SNAPSHOT ARROW-10481 - [R] Bindings to add, remove, replace Table columns ARROW-10483 - [C++] Move Executor into a separate header ARROW-10484 - [C++] Future&lt;{void,Status}&gt; could be more generic ARROW-10487 - [FlightRPC][C++] Header-based auth in clients ARROW-10490 - [C++][GLib] Fail to build with Xcode 12.0.1 ARROW-10492 - [Java][JDBC] Allow users to config the mapping between SQL types and Arrow types ARROW-10504 - [C++] Suppress UBSAN pointer-overflow warning in RapidJSON ARROW-10510 - [Rust] [DataFusion] Add benchmarks for COUNT(DISTINCT) ARROW-10515 - [Julia][Doc] Update lists of supported languages to include Julia ARROW-10522 - [R] Allow rename Table and RecordBatch columns with names() ARROW-10526 - [FlightRPC][C++] HTTP cookie handling in clients ARROW-10530 - [R] Optionally use distro package in linuxlibs.R ARROW-10531 - [Rust] [DataFusion] Better display for logical plans: Graphviz and Schema information ARROW-10539 - [Packaging][Python] Use GitHub Actions to build wheels for Windows ARROW-10540 - [Rust] Allow unary kernels of arbitrary array types ARROW-10541 - [C++] Add re2 library to core arrow / ARROW_WITH_RE2 ARROW-10542 - [C#][Flight] Add beginning on flight code for net core ARROW-10543 - [Developer] Update dev instructions to note there may be a timelag ARROW-10552 - [Rust] Remove un-used Result from Buffer ARROW-10559 - [Rust] [DataFusion] Break up logical_plan/mod.rs into smaller modules ARROW-10561 - [Rust] Simplify `MutableBuffer::write` and `MutableBuffer::write_bytes` ARROW-10562 - [Rust] Potential UB on unsafe code ARROW-10566 - [C++] Array validation should work on ArrayData ARROW-10567 - [C++][FlightRPC] Add options to help increase precision of arrow-flight-benchmark ARROW-10572 - [Rust][DataFusion] Use aHash and std::collections hashmap for aggregates / distinct ARROW-10574 - [Python][Parquet] Allow collections for ‘in’ / ‘not in’ filter (in addition to sets) ARROW-10575 - [Rust] Rename union.rs to be cosistent with other arrays ARROW-10581 - [Doc] IPC dictionary reference to relevant section ARROW-10582 - [Rust] [DataFusion] Implement “repartition” operator ARROW-10584 - [Rust] [DataFusion] Implement SQL join support using explicit JOIN ON syntax ARROW-10585 - [Rust] [DataFusion] Add join support to DataFrame and LogicalPlan ARROW-10586 - [Rust] [DataFusion] Add join support to query planner ARROW-10589 - [Rust]: Implement AVX-512 bit and operation ARROW-10590 - [Rust] Remove Date32(Millisecond) from test ARROW-10591 - [Rust] Add support to structArrays for MutableArrayData ARROW-10595 - [Rust] Simplify inner loop of min/max kernels for non-null case ARROW-10596 - [Rust] Improve take benchmark ARROW-10598 - [C++] Improve performance of GenerateBitsUnrolled ARROW-10604 - [Ruby] Support Decimal256 type ARROW-10607 - [C++][Parquet] Support Reading/Writing Decimal256 type in Parquet ARROW-10609 - [Rust] Optimize min/max of non null strings ARROW-10628 - [Rust] Make clippy error on clippy warnings ARROW-10633 - [Rust][DataFusion] Dependency version upgrades ARROW-10634 - [C#][CI] Change the build version from 2.2 to 3.1 in CI ARROW-10636 - [Rust] Remove specialisation from Rust parquet ARROW-10637 - [Rust] Add examples to boolean kernels ARROW-10638 - [Rust] Improve tests of boolean kernels ARROW-10639 - [Rust] Simplify signature of is_null and add example ARROW-10644 - [Python] Consolidate path/filesystem handling in pyarrow.dataset and pyarrow.fs ARROW-10646 - [C++][FlightRPC] Disable flaky test ARROW-10648 - [Java] Prepare Java codebase for source release without requiring any git tags to be created or pushed ARROW-10651 - [C++] alloc-dealloc-mismatch in s3fs.cc ARROW-10652 - [C++][Gandiva] Make gandiva cache size configurable ARROW-10653 - [Rust]: Update toolchain version to bring new features ARROW-10654 - [Rust] Specialize parsing of floats / bools ARROW-10660 - [Rust] Implement AVX-512 bit or operation ARROW-10665 - [Rust] Add fast paths for common utf8 like patterns ARROW-10666 - [Rust] [DataFusion] Support nested SELECT statements ARROW-10669 - [C++][Compute] Support Scalar inputs to boolean kernels ARROW-10672 - [Rust] [DataFusion] Make limit be computed as a stream ARROW-10673 - [Rust] [DataFusion] Make sort be computed on the stream ARROW-10674 - [Rust] Add integration tests for Decimal type ARROW-10677 - [Rust] Fix Bug and Add tests as documentation showing supported csv parsing ARROW-10679 - [Rust] [DataFusion] Implement SQL CASE WHEN physical expression ARROW-10680 - [Rust] [DataFusion] Implement TPC-H Query 12 ARROW-10682 - [Rust] Sort kernel performance tuning ARROW-10685 - [Rust] [DataFusion] Add support for join on filter pushdown optimizer ARROW-10688 - [Rust] [DataFusion] Support CASE WHEN from DataFrame API ARROW-10689 - [Rust] [DataFusion] Support CASE WHEN from SQL ARROW-10693 - [Rust] [DataFusion] Add support for the left join ARROW-10696 - [C++] Investigate a bit run reader that would only return runs of set bits ARROW-10697 - [C++] Consolidate bitmap word readers ARROW-10703 - [Rust] [DataFusion] Make join not collect left on every part ARROW-10704 - [Rust][DataFusion] Remove Nested from expression enum ARROW-10708 - [Packaging][deb] Add support for Ubuntu 20.10 ARROW-10709 - [Python] Difficult to make an efficient zero-copy file reader in Python ARROW-10712 - [Rust] [DataFusion] Add tests to TPC-H benchmarks ARROW-10717 - [Rust] [DataFusion] Add support for right join ARROW-10720 - [C++] Add BasicDecimal256 Rescale Support ARROW-10721 - [C#][CI] Use .NET 3.1 by default ARROW-10722 - [Rust][DataFusion] Reduce overhead in data types in aggregations / joins, improve benchmarks ARROW-10723 - [Packaging][deb][RPM] Enable Parquet encription ARROW-10724 - [Developer Tools] Add labeler to when PRs need rebase ARROW-10725 - [Python][Compute] Exposing bindings for sort options ARROW-10728 - [Rust] [DataFusion] Add SQL support for JOIN with USING clause ARROW-10729 - [Rust] [DataFusion] Add SQL support for JOIN using implicit syntax ARROW-10732 - [Rust] [DataFusion] Add SQL support for table/relation aliases and compound identifiers ARROW-10733 - [R] Improvements to Linux installation troubleshooting ARROW-10740 - [Rust][DataFusion] Remove redundant clones found by clippy ARROW-10741 - Apply clippy lints to source code, remove them from ignore list ARROW-10742 - [Python] Mask not checked when creating array from numpy array ARROW-10745 - [Rust] Allocate padding bytes in filter context ARROW-10747 - [Rust] Optimizations for csv reader ARROW-10750 - [Rust] [DataFusion] Add SQL support for LEFT and RIGHT join ARROW-10752 - [GLib] Add garrow_schema_has_metadata() ARROW-10754 - [GLib] Add support for metadata to GArrowField ARROW-10755 - [Rust] [Parquet] Add support for writing boolean type ARROW-10756 - [Rust] Clippy - fix reduntant clone ARROW-10759 - [Rust][DataFusion] Implement support for casting string to date in sql expressions ARROW-10763 - [Rust] Speed up take kernels ARROW-10765 - [Rust] Optimize take strings for non-null arrays ARROW-10767 - [Rust] Speed up sum kernel with nulls ARROW-10770 - [Rust] Support reading nested JSON lists ARROW-10772 - [Rust] Improve take performance ARROW-10775 - [Rust][DataFusion] Use ahash in hash join ARROW-10776 - [C++] Provide iterator access to primitive elements inside an Array ARROW-10781 - [Rust] [DataFusion] TableProvider should provide row count statistics ARROW-10783 - [Rust] [DataFusion] Implement row count statistics for Parquet TableProvider ARROW-10785 - Further optimize take string ARROW-10786 - [Packaging][RPM] Drop support for CentOS 6 ARROW-10788 - [C++] Make S3 recursive walks parallel ARROW-10789 - [Rust][DataFusion] Make TableProvider dynamically typed ARROW-10790 - [C++][Compute] Investigate ChunkedArray sort performance ARROW-10792 - [Rust] [CI] Modulararize CI for faster and smaller builds ARROW-10795 - [Rust] Fix specialization for arrow datatypes ARROW-10796 - [C++] Investigate RecordBatch sort performance ARROW-10800 - [Rust] [Parquet] Provide access to the elements of parquet::record::{List, Map} ARROW-10802 - [C++] Remove Dictionary[NullType] special casing in parquet column writer ARROW-10808 - [Rust] [DataFusion] Support nested expressions in aggregations ARROW-10809 - [C++] Use Datum for SortIndices() input ARROW-10812 - [Rust] Make BooleanArray not a PrimitiveArray ARROW-10813 - [Rust] [DataFusion] Implement DFSchema ARROW-10814 - [Packaging][deb] Drop support for Debian GNU/Linux Stretch ARROW-10817 - [Rust] [DataFusion] Implement TypedString ARROW-10820 - [Rust] [DataFusion] Complete TPC-H Benchmark Queries ARROW-10821 - [Rust] [Datafusion] implement negative expression ARROW-10822 - [Rust] [Datafusion] support compiling datafusion with simd support ARROW-10824 - [Rust] Added PartialEq for NullArray ARROW-10825 - [Rust] Add support to NullArrays for MutableArrayData ARROW-10826 - [Rust] Add support for FixedSizeBinary to MutableArrayData ARROW-10827 - [Rust] Extend concatenate to all types ARROW-10828 - [Rust][DataFusion] Enable more clippy lints ARROW-10829 - [Rust] [DataFusion] Implement Into&lt;Schema&gt; for DFSchema ARROW-10832 - [Rust] Evaluate latest snapshot flatc ARROW-10836 - [Rust] Extend take kernel to FixedSizeListArray ARROW-10838 - [Rust] [CI] Add CI for wasm32 target ARROW-10839 - [Rust] [DataFusion] Implement BETWEEN Operator ARROW-10843 - [C++] Add support for temporal types in sort family kernels ARROW-10845 - [Python][CI] Add python CI build using numpy nightly ARROW-10849 - [Python] Handle numpy deprecation warnings for builtin type aliases ARROW-10851 - [C++] Reduce code size of vector_sort.cc ARROW-10857 - [Packaging] Follow PowerTools repository name change on CentOS 8 ARROW-10858 - [C++][MSVC] Add missing Boost dependency ARROW-10861 - [Python] Update minimal NumPy version to 1.16.6 ARROW-10864 - [Rust] Use standard ordering for floats ARROW-10865 - [Rust][DataFusion] More ergonomic conversion between Schema, SchemaRef, DFSchema, and DFSchemaRef ARROW-10867 - build failure on aarch64 with -DARROW_PYTHON=ON and gcc ARROW-10869 - [GLib] Add garrow_*_sort_indices() and related options ARROW-10870 - [Julia] Update website with Julia implementation ARROW-10871 - [Julia] Setup Julia CI via GitHub Actions ARROW-10873 - [C++] Apple Silicon is reported as arm64 in CMake ARROW-10874 - [Rust][DataFusion] Add table statistics for MemTable ARROW-10877 - [Rust] [DataFusion] Add benchmark based on kaggle movies ARROW-10878 - [Rust] Simplify extend_from_slice ARROW-10879 - [Packaging][deb] Restore Debian GNU/Linux Buster configuration ARROW-10881 - [C++] EXC_BAD_ACCESS in BaseSetBitRunReader&lt;false&gt;::NextRun ARROW-10885 - [Rust][DataFusion] Optimize join build vs probe based on statistics on row number ARROW-10887 - [C++][Doc] Document IPC API ARROW-10889 - [Rust] Document our approach to unsafe code in README ARROW-10890 - [Rust] [DataFusion] JOIN support ARROW-10891 - [Rust][DataFusion] More clippy lints ARROW-10893 - [Rust] [DataFusion] Easier clippy fixes ARROW-10896 - [C++][CMake] Rename internal RE2 package name to “re2” from “RE2” ARROW-10900 - [Rust][DataFusion] Resolve TableScan provider eagerly ARROW-10904 - [Python] Add support for Python 3.9 macOS wheels ARROW-10905 - [Python] Add support for Python 3.9 windows wheels ARROW-10908 - [Rust] [DataFusion] Update relevant tpch-queries with BETWEEN ARROW-10917 - [Rust][Doc] Update feature matrix ARROW-10918 - [C++][Doc] Document supported Parquet features ARROW-10927 - [Rust] [Parquet] Add Decimal to ArrayBuilderReader for physical type fixed size binary ARROW-10927 - [Rust] [Parquet] Add Decimal to ArrayBuilderReader for physical type fixed size binary ARROW-10927 - [Rust] [Parquet] Add Decimal to ArrayBuilderReader for physical type fixed size binary ARROW-10929 - [Rust] Migrate CI tests to stable rust ARROW-10933 - [Rust] Update docs in regard to stable rust ARROW-10934 - [Python] Tests are failed with fsspec-0.8.5 ARROW-10938 - [Rust] upgrade dependency “flatbuffers” to 0.8 ARROW-10940 - [Rust] Extend sort kernel to ListArray ARROW-10941 - [Doc][C++] Document supported Parquet encryption features ARROW-10944 - [Rust] Implement min/max kernels for BooleanArray ARROW-10946 - [Rust] Make ChunkIter not depend on a buffer ARROW-10947 - [Rust][DataFusion] Refactor UTF8 to Date32 for Performance ARROW-10948 - [C++] Always use GTestConfig.cmake ARROW-10949 - [Rust] Avoid clones in getting values of boolean arrays ARROW-10951 - [Python][CI] Nightly pandas builds failing because of pytest monkeypatch issue ARROW-10952 - [Rust] Add pre-commit hook ARROW-10966 - [C++] Use FnOnce for ThreadPool’s tasks instead of std::function ARROW-10968 - [Rust][DataFusion] Don’t build hash table for right side of the join ARROW-10969 - [Rust][DataFusion] Implement ANSI SQL Functions ARROW-10985 - [Rust] Update unsafe guidelines for adding JIRA references ARROW-10986 - [Rust][DataFusion] Add average statistic to TCP-H benchmark too ARROW-10988 - [C++] Require CMake 3.5 or later ARROW-10989 - [Rust] Use slices for iterating primitive arrays ARROW-10993 - [CI][macOS] Fix Python 3.9 installation by Homebrew ARROW-10995 - [Rust] [DataFusion] Improve parallelism when reading Parquet files ARROW-11004 - [FlightRPC][Python] Header-based auth in clients ARROW-11005 - [Rust] Remove indirection from take kernel and simplify interface ARROW-11008 - [Rust][DataFusion] Simplify count accumulator ARROW-11009 - [Python] Add environment variable to elect default usage of system memory allocator instead of jemalloc/mimalloc ARROW-11010 - [Python] `np.float` deprecation warning in `_pandas_logical_type_map` ARROW-11012 - [Rust] [DataFusion] Make write_csv and write_parquet concurrent ARROW-11015 - [CI][Gandiva] Move gandiva nightly build from travis to github action ARROW-11018 - [Rust][DataFusion] Add null count column statistics ARROW-11026 - [Rust]: Run tests without requiring environment variables ARROW-11028 - [Rust] Somewhat pedantic pattern-matches ARROW-11029 - [Rust] [DataFusion] Document why join order optimization does not work with filter pushdown ARROW-11032 - [C++][FlightRPC] Add benchmark for local RPC through unix socket ARROW-11033 - [Rust] CSV writer performance improvements ARROW-11034 - [Rust] rustfmt cleanup ARROW-11035 - [Rust] Improve performance of cast to utf8 via FromIter ARROW-11037 - [Rust] Improve performance of string fromIter ARROW-11038 - [Rust] Remove `BufferBuilderTrait` and associated Result requirement. ARROW-11039 - [Rust] Improve performance for utf8 to float cast ARROW-11040 - [Rust] Simplify builders with generics ARROW-11042 - [Rust][DataFusion] Increase default batch size ARROW-11043 - [C++] Add “is_nan” kernel ARROW-11046 - [Rust][DataFusion] Add count_distinct to dataframe API ARROW-11049 - [Python] Expose alternate memory pools ARROW-11052 - [Rust] [DataFusion] Implement metrics in join operator ARROW-11053 - [Rust] [DataFusion] Optimize joins with dynamic capacity for output batches ARROW-11054 - Update SQLParser to 0.70 ARROW-11055 - [Rust] [DataFusion] Support date_trunc function ARROW-11058 - [Rust] [DataFusion] Implement “coalesce batches” operator ARROW-11063 - [Rust] Validate null counts when building arrays ARROW-11064 - [Rust][DataFusion] Speed up hash join on smaller batches ARROW-11072 - [Rust] [Parquet] Support int32 and int64 physical types ARROW-11076 - [Rust][DataFusion] Refactor usage of right indices in hash join ARROW-11079 - [R] Catch up on changelog since 2.0 ARROW-11080 - [C++][Dataset] Improvements to implicit casting ARROW-11082 - [Rust] Add FFI for LargeUtf8 ARROW-11086 - [Rust] Extend take to support more index types ARROW-11091 - [Rust][DataFusion] Fix clippy warning in rust 1.49 ARROW-11095 - [Python] Access pyarrow.RecordBatch column by name ARROW-11096 - [Rust] Add FFI for [Large]Binary ARROW-11097 - [Rust] Simplify tests ARROW-11099 - [Rust]: Remove unsafe value_slice method from PrimitiveArray and BooleanArray ARROW-11100 - [Rust] Speed up numeric to string cast using lexical_core ARROW-11101 - [Rust] enable “cargo +nightly fmt” in git pre-commit hook ARROW-11104 - [GLib] Add append_null/append_nulls to GArrowArrayBuilder and use them ARROW-11105 - [Rust] Favor From/Into traits in MutableBuffer ARROW-11108 - [Rust] Improve performance of MutableBuffer ARROW-11109 - [GLib] Add garrow_array_builder_append_empty_value() and values() ARROW-11110 - [Rust] [Datafusion] context.table should not take a mutable self reference ARROW-11111 - [GLib] Add GArrowFixedSizeBinaryArrayBuilder ARROW-11121 - [Developer] Use pull_request_target for PR JIRA integration ARROW-11122 - [Rust] Add FFI for date and time ARROW-11124 - [Doc] Update status matrix for Decimal256 ARROW-11125 - [Rust] Implement logical equality for list arrays ARROW-11126 - [Rust] Document and test ARROW-10656 ARROW-11127 - [C++] Unused cpu_info on non-x86 architecture ARROW-11129 - [Rust][DataFusion] Use tokio thread pool for loading parquet ARROW-11130 - [Website][CentOS 8][RHEL 8] Enable all required repositories by default ARROW-11131 - [Rust] Improve performance of bool_equal ARROW-11136 - [R] Bindings for is.nan ARROW-11137 - [Rust][DataFusion] Fix Clippy needless_range_loop, needless_lifetimes ARROW-11138 - [Rust] [DataFusion] Support ltrim, rtrim ARROW-11139 - [GLib] Add support for extension type ARROW-11155 - [C++][Packaging] Move gandiva crossbow jobs off of Travis-CI ARROW-11158 - [Julia] Implement Decimal256 support ARROW-11159 - [Developer] Consolidate pull request related jobs ARROW-11165 - [Rust] [DataFusion] Document the desired SQL dialect for DataFusion ARROW-11168 - [Rust] Fix cargo doc warnings ARROW-11169 - [Rust] Add a comment explaining where float total_order algorithm came from ARROW-11175 - [R] Small docs fixes ARROW-11176 - [R] Expose memory pool name and document setting it ARROW-11187 - [Rust] [Parquet] Pin specific parquet-format-rs version ARROW-11188 - [Rust] Implement crypto functions from PostgreSQL dialect ARROW-11193 - [Documentation] Add docs for Java ListVector ARROW-11194 - [Rust] Enable SIMD for aarch64 ARROW-11195 - [Rust] [DataFusion] Built-in table providers should expose relevant fields ARROW-11196 - [GLib] Add support for mock, HDFS and S3 file systems with factory function ARROW-11198 - [Packaging][Python] Ensure setuptools version during build supports markdown ARROW-11200 - [Rust] [DateFusion] Physical operators and expressions should have public accessor methods ARROW-11201 - [Rust] create_batch_empty - support more types ARROW-11203 - [Developer][Website] Enable JIRA and pull request integration ARROW-11204 - [C++] Fix build failure with bundled gRPC and Protobuf ARROW-11205 - [GLib][Dataset] Add GADFileFormat and its family ARROW-11209 - [Rust] DF - Provide better error message on unsupported GROUP BY ARROW-11210 - [CI] Restore workflows that had been blocked by INFRA ARROW-11212 - [Packaging][Python] Use vcpkg as dependency source for manylinux and windows wheels ARROW-11213 - [Packaging][Python] Dockerize wheel building on windows ARROW-11215 - [CI] Use named volumes by default for caching in docker-compose ARROW-11216 - [Rust] Improve documentation for StringDictionaryBuilder ARROW-11218 - [R] Make SubTreeFileSystem print method more informative ARROW-11219 - [CI][Ruby][MinGW] Reduce CI time ARROW-11221 - [Rust] DF Implement GROUP BY support for Float32/Float64 ARROW-11231 - [Packaging] Add mimalloc to Linux builds ARROW-11234 - [CI][Ruby][macOS] Reduce CI time ARROW-11236 - [Java] Bump Jackson to 2.11.4 ARROW-11240 - [Packaging][R] Add mimalloc to R packaging ARROW-11242 - [CI] Remove CMake 3.2 job ARROW-11245 - [C++][Gandiva] Add support for LLVM 11.1 ARROW-11247 - [C++] Infer date32 columns in CSV ARROW-11256 - [Packaging][Linux] Don’t buffer packaging output ARROW-11268 - [Rust][DataFusion] Support specifying repartitions in MemTable ARROW-11272 - [Release][wheel] Remove unsupported Python 3.5 and manylinux1 ARROW-11273 - [Release][deb] Remove unsupported Debian GNU/Linux stretch ARROW-11278 - [Release][NodeJS] Don’t touch ~/.bash_profile ARROW-11280 - [Release][APT] Fix minimal build example check ARROW-11281 - [C++] Remove needless runtime RapidJSON dependency ARROW-11282 - [Packaging][deb] Add missing libgflags-dev dependency ARROW-11285 - [Release][APT] Add support for Ubuntu Groovy ARROW-11292 - [Release][JS] Use Node.JS LTS ARROW-11293 - [C++] Don’t require Boost and gflags with find_package(Arrow) ARROW-11307 - [Release][Ubuntu][20.10] Add workaround for dependency issue ARROW-11309 - [Release][C#] Use .NET 3.1 for verification ARROW-11329 - [Rust] Do not rebuild the library on every change ARROW-11372 - Support RC verification on macOS-ARM64 PARQUET-1566 - [C++] Indicate if null count, distinct count are present in column statistics" />
<link rel="canonical" href="https://arrow.apache.org/release/3.0.0.html" />
<meta property="og:url" content="https://arrow.apache.org/release/3.0.0.html" />
<meta property="og:site_name" content="Apache Arrow" />
<meta property="og:image" content="https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png" />
<meta property="og:type" content="article" />
<meta property="article:published_time" content="2024-04-24T21:04:19-04:00" />
<meta name="twitter:card" content="summary_large_image" />
<meta property="twitter:image" content="https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png" />
<meta property="twitter:title" content="Apache Arrow 3.0.0 Release" />
<meta name="twitter:site" content="@ApacheArrow" />
<script type="application/ld+json">
{"@context":"https://schema.org","@type":"BlogPosting","dateModified":"2024-04-24T21:04:19-04:00","datePublished":"2024-04-24T21:04:19-04:00","description":"Apache Arrow 3.0.0 (26 January 2021) This is a major release covering more than 3 months of development. Download Source Artifacts Binary Artifacts For CentOS For Debian For Python For Ubuntu Git tag Contributors This release includes 648 commits from 106 distinct contributors. $ git shortlog -sn apache-arrow-2.0.0..apache-arrow-3.0.0 71 Jorge C. Leitao 64 Sutou Kouhei 48 Antoine Pitrou 48 Heres, Daniel 27 Andy Grove 27 Neville Dipale 24 Joris Van den Bossche 19 Neal Richardson 15 Benjamin Kietzman 14 Carol (Nichols || Goulding) 14 Uwe L. Korn 12 Jonathan Keane 12 Krisztián Szűcs 11 Andrew Lamb 11 Mike Seddon 11 mqy 10 Kazuaki Ishizaki 10 Qingping Hou 8 Jörn Horstmann 8 Yibo Cai 8 alamb 7 Andrew Wieteska 7 Ian Cook 7 Mahmut Bulut 6 Dmitry Patsura 6 Projjal Chanda 6 liyafan82 6 rdettai 5 Daniel Russo 5 David Li 5 Florian Müller 5 Jacob Quinn 5 James Duong 4 Micah Kornfield 4 Weston Pace 3 Jimmy Lu 3 Maarten A. Breddels 3 Terence D. Honles 3 Will Jones 2 Bei 2 Benjamin Wilhelm 2 Christian Lundgren 2 Daniël Heres 2 Evan Chan 2 Fokko Driesprong 2 François Garillot 2 Greg Bowyer 2 Kirill Lykov 2 Kyle Porter 2 Lyndon Bauto 2 Markus Westerlind 2 Sagnik Chakraborty 2 Sean Moran 2 Wes McKinney 2 Yordan Pavlov 2 jensenrichardson 2 wayne 2 Östman Alexander 1 Bruno LE HYARIC 1 Chiyang Wan 1 Christoph Schulze 1 Diana Clarke 1 Eric Erhardt 1 Erik Fonselius 1 Frank Du 1 Frank Smith 1 Gidon Gershinsky 1 H-Plus-Time 1 Hongze Zhang 1 Jake Goulding 1 Jeremy 1 Johan Peltenburg 1 Jorge Leitao 1 Josiah 1 Kopilov Aleksandr 1 Marc Prud&#39;hommeaux 1 Marius van Niekerk 1 Matthew Pollock 1 Matthew Topol 1 Michal 1 Mingyu Zhong 1 Nick Bruno 1 Paddy Horan 1 Pavel Tiunov 1 Prudhvi Porandla 1 Radu Teodorescu 1 Romain Francois 1 Ruan Pearce-Authers 1 Ruihang Xia 1 Sebastian Berg 1 Tao He 1 Tyrel Rink 1 Ulimo 1 Vivian Kong 1 Weiyang Zhao 1 Zhuo Peng 1 arw2019 1 david 1 jonathan.albrecht 1 naman1996 1 rjzamora 1 shawnding(丁晓坤) 1 tching 1 tianchen 1 tifflhl 1 xudingyu Patch Committers The following Apache committers merged contributed patches to the repository. $ git shortlog -csn apache-arrow-2.0.0..apache-arrow-3.0.0 100 Andrew Lamb 88 Antoine Pitrou 76 Sutou Kouhei 70 Jorge C. Leitao 69 Neville Dipale 54 Krisztián Szűcs 44 Neal Richardson 40 Andy Grove 26 Benjamin Kietzman 19 Joris Van den Bossche 12 David Li 8 liyafan82 7 Uwe L. Korn 7 alamb 6 Micah Kornfield 5 Praveen 4 Bryan Cutler 4 Eric Erhardt 2 GitHub 2 Paddy Horan 2 Pindikura Ravindra 2 Wes McKinney 1 Sebastien Binet Changelog Apache Arrow 3.0.0 (2021-01-26) Bug Fixes ARROW-2616 - [Python] Cross-compiling Pyarrow ARROW-6582 - [R] Arrow to R fails with embedded nuls in strings ARROW-7363 - [Python] Add combine_chunks method to ChunkedArray ARROW-7909 - [Website] Add how to install on Red Hat Enterprise Linux ARROW-8258 - [Rust] [Parquet] ArrowReader fails on some timestamp types ARROW-9027 - [Python] Split in multiple files + clean-up pyarrow.parquet tests ARROW-9479 - [JS] Table.from fails for zero-item Lists, FixedSizeLists, Maps. ditto Table.empty ARROW-9636 - [Python] Update documentation about ‘LZO’ compression in parquet.write_table ARROW-9776 - [R] read_feather causes segfault in R if file doesn’t exist ARROW-9897 - [C++][Gandiva] Add to_date() function from pattern ARROW-9897 - [C++][Gandiva] Add to_date() function from pattern ARROW-9898 - [C++][Gandiva] Error handling in castINT fails in some enviroments ARROW-9903 - [R] open_dataset freezes opening feather files on Windows ARROW-9963 - [Python] Recognize datetime.timezone.utc as UTC on conversion python-&gt;pyarrow ARROW-10039 - [Rust] Do not require memory alignment of buffers ARROW-10042 - [Rust] Buffer equalities may be incorrect ARROW-10080 - [R] Arrow does not release unused memory ARROW-10122 - [Python] Selecting one column of multi-index results in a duplicated value column. ARROW-10145 - [C++][Dataset] Assert integer overflow in partitioning falls back to string ARROW-10146 - [Python] Parquet metadata to_dict raises attribute error ARROW-10174 - [Java] Reading of Dictionary encoded struct vector fails ARROW-10177 - [CI][Gandiva] Nightly gandiva-jar-xenial fails ARROW-10186 - [Rust] Tests fail when following instructions in README ARROW-10247 - [C++][Dataset] Cannot write dataset with dictionary column as partition field ARROW-10264 - [C++][Python] Parquet test failing with HadoopFileSystem URI ARROW-10270 - [R] Fix CSV timestamp_parsers test on R-devel ARROW-10283 - [Python] Python deprecation warning for “PY_SSIZE_T_CLEAN will be required for ‘#’ formats” ARROW-10293 - [Rust] [DataFusion] Fix benchmarks ARROW-10294 - [Java] Resolve problems of DecimalVector APIs on ArrowBufs ARROW-10321 - [C++] Building AVX512 code when we should not ARROW-10333 - [Java] Remove split packages in arrow-memory-core and arrow-vectors ARROW-10345 - [C++] NaN breaks sorting ARROW-10346 - [Python] Default S3 region is eu-central-1 even with LANG=C ARROW-10348 - [C++] Fix crash on invalid Parquet file (OSS-Fuzz) ARROW-10350 - [Rust] parquet_derive crate cannot be published to crates.io ARROW-10353 - [C++] Parquet decompresses DataPageV2 pages even if is_compressed==0 ARROW-10358 - [R] Followups to 2.0.0 release ARROW-10365 - [R] Remove duplicate setting of S3 flag on macOS ARROW-10369 - [Dev] Fix archery release utility test cases ARROW-10370 - [Python] Spurious s3fs-related test failures ARROW-10371 - [R] Linux system requirements check needs to support older cmake versions ARROW-10386 - [R] List column class attributes not preserved in roundtrip ARROW-10388 - [Java] Fix Spark integration build failure ARROW-10390 - [Rust] [Parquet] Regression Can not implement custom ParquetWriter because `TryClone` is not publically exported ARROW-10393 - [Rust]: Fix null value reading in jsonreader for both dictionary and stringbuilders ARROW-10394 - [Rust] [Large]BinaryArray can be created from non-binary datatypes ARROW-10397 - [C++] Outdated and confusing comment on dictionary indices ARROW-10399 - [R] Fix performance regression from cpp11::r_string ARROW-10411 - [C++] Fix incorrect child array lengths for Concatenate of FixedSizeList ARROW-10412 - [C++] CMake Build Fails with grpc 1.33.1, “GRPC_CPP_PLUGIN-NOTFOUND: program not found or is not executable” ARROW-10413 - [Rust] [Parquet] Unignore some roundtrip tests that are passing now ARROW-10414 - [R] open_dataset doesn’t work with absolute/expanded paths on Windows ARROW-10426 - [C++] Arrow type large_string cannot be written to Parquet type column descriptor ARROW-10433 - [Python] pyarrow doesn’t work with s3fs&gt;=0.5 ARROW-10434 - [Rust] Debug formatting arrays with lengths greater than 10 and less than 20 produces incorrect values ARROW-10441 - [FlightRPC][Java] FlightClients from FlightGrpcUtils#createFlightClient shutdown gRPC channel when closed ARROW-10446 - [C++][Python] Timezone aware pd.Timestamp’s are incorrectly converted to Timestamp arrys ARROW-10448 - [Rust] PrimitiveArray::new can create arrays not in spec ARROW-10453 - [Rust] [DataFusion] Performance degredation after removing specialization ARROW-10457 - [CI] Fix Spark branch-3.0 integration tests ARROW-10461 - [Rust] Offset related bug in BitChunks::remainder_bits ARROW-10462 - [Python] ParquetDatasetPiece’s path broken when using fsspec fs on Windows ARROW-10463 - [R] Better messaging for currently unsupported CSV options in open_dataset ARROW-10470 - [R] Fix missing file error causing NYC taxi example to fail ARROW-10471 - [CI][Python] Ensure we have tests with s3fs and run those on CI ARROW-10472 - [C++][Python] casting a scalar timestamp to date32 results in Aborted (core dump) ARROW-10475 - [С++][FlightRPC] Arrow Flight Server / Client cannot be initialized with Ipv6 host ARROW-10480 - [Python] Parquet write_table creates gzipped Parquet file, not Parquet with gzip compression ARROW-10482 - [Python] Specifying compression type on a column basis when writing Parquet not working ARROW-10489 - [C++] Unable to configure or make with intel compiler ARROW-10489 - [C++] Unable to configure or make with intel compiler ARROW-10491 - [FlightRPC][Java] Fix NPE when using FlightProducer without interceptors ARROW-10493 - [C++][Parquet] Writing nullable nested strings results in wrong data in file ARROW-10495 - [C++] find_package(Arrow) is broken on Ubuntu 18 ARROW-10496 - [R][CI] Fix conda-r job ARROW-10499 - [C++][Java] Fix ORC Java JNI Crash ARROW-10502 - [C++/Python] CUDA detection messes up nightly conda-win builds ARROW-10503 - [C++] Uriparser will not compile using Intel compiler ARROW-10508 - [Java] Allow FixedSizeListVector to have empty children ARROW-10509 - [C++] Define operator«(ostream, ParquetException) for clang+Windows ARROW-10511 - [Python] Table.to_pandas() failing when timezone-awareness mismatch in metadata ARROW-10518 - Fix cast function issues in gandiva ARROW-10519 - [Python] Deadlock when PyArrow imports Pandas from multiple threads ARROW-10525 - [C++] Fix crash on unsupported IPC stream (OSS-Fuzz) ARROW-10532 - [Python] Mangled pandas_metadata when specified schema has different order as DataFrame columns ARROW-10545 - [C++] Fix crash on invalid Parquet file (OSS-Fuzz) ARROW-10546 - [Python] Deprecate the S3FSWrapper class ARROW-10547 - [Rust][DataFusion] Filter pushdown loses filters if below a user defined node ARROW-10551 - [Rust]: Fix unreproducible benchmarks ARROW-10558 - [Python] Filesystem S3 tests not independent (native s3 influences s3fs) ARROW-10560 - [Python] Crash when creating array with string over 2GB ARROW-10563 - [Packaging][C++] CMake find_package(Arrow 2.0 CONFIG REQUIRED) broken ARROW-10565 - [Python] Table.from_batches and Table.from_pandas have argument Schema_schema in documentation instead of schema ARROW-10568 - [C++][Parquet] Parquet writer crashes process when Tell() does not succeed ARROW-10569 - [C++][Python] Poor Table filtering performance ARROW-10577 - [Rust][DataFusion] Hash Aggregator stream finishes unexpectedly after going to Pending state ARROW-10578 - [C++] Comparison kernels crashing for string array with null string scalar ARROW-10610 - [C++] arrow-utility-test and arrow-csv-test causes failures on a big-endian platform ARROW-10616 - [Developer] Expand PR labeler to all supported languages ARROW-10617 - [Python] RecordBatchStreamReader’s iterator doesn’t work with python 3.8 ARROW-10619 - [C++] Fix crash on unsupported IPC stream (OSS-Fuzz) ARROW-10620 - [Rust][Parquet] move column chunk range logic to metadata.rs ARROW-10621 - [Java] flight-cpre test causes a failure on s390x ARROW-10622 - [R] Nameof&lt;&gt;() is incorrect in r-arrow build environment ARROW-10623 - [R] Version 1.0.1 breaks data.frame attributes when reading file written by 2.0.0 ARROW-10624 - [R] Proactively remove “problems” attributes ARROW-10627 - [Rust] Github master does not compile for WASM target ARROW-10629 - [CI] MinGW builds broken on Github Actions ARROW-10631 - [Rust] Equality of fixed-sized binary is incorrect. ARROW-10642 - [R] Can’t get Table from RecordBatchReader with 0 batches ARROW-10656 - [Rust] New RecordBatch requires exact match of Data Types ARROW-10656 - [Rust] New RecordBatch requires exact match of Data Types ARROW-10661 - [C#] Fix benchmarking project ARROW-10662 - [Java] Avoid integer overflow for Json file reader ARROW-10663 - [C++/Doc] The IsIn kernel ignores the skip_nulls option of SetLookupOptions ARROW-10667 - [Rust] [Parquet] Add a convenience type for writing Parquet to memory ARROW-10668 - [R] Filtering does not work with .data pronoun ARROW-10681 - [Rust] [DataFusion] TPC-H Query 12 fails with scheduler error ARROW-10684 - [Rust] Logical equality should consider parent array nullability ARROW-10690 - [Java] ComplexCopier gives incorrect result for list vector if target vector is non-empty ARROW-10692 - [Rust] Segfault while array buffer append ARROW-10699 - [C++] BitmapUInt64Reader doesn’t work on big-endian ARROW-10701 - [Rust] [Datafusion] Benchmark sort_limit_query_sql fails because order by clause specifies column index instead of expression ARROW-10705 - [Rust] Lifetime annotations in the IPC writer are too strict, preventing code reuse ARROW-10710 - [Rust] Example flight server is broken after tokio upgrade (among other things) ARROW-10711 - [CI] Remove set-env from auto-tune to work with new GHA settings ARROW-10719 - [C#] ArrowStreamWriter doesn’t write schema metadata ARROW-10746 - [C++] Use GTEST_SKIP in parquet encoding tests ARROW-10748 - [Java] TimeStampMilliVector cannot be cast to TimeStampMilliTZVector ARROW-10749 - [C++] Incorrect string format for Datum with the collection type ARROW-10751 - [C++] Add RE2 to minimal build example ARROW-10753 - [Rust] [DataFusion] Negative numbers in SQL WHERE clause not parsed correctly ARROW-10757 - [Rust] [CI] Sporadic failures due to disk filling up ARROW-10760 - [Rust] [DataFusion] Predicate push down does not support joins correctly ARROW-10769 - [CI] Integration tests are failing in master ARROW-10774 - [R] Set minimum cpp11 version ARROW-10777 - [Packaging][Python] PyPI pyarrow source dist (sdist) contains architecture dependent binaries ARROW-10778 - [Python] RowGroupInfo.statistics errors for empty row group ARROW-10779 - [Java] writeNull method in UnionListWriter doesn’t work correctly if validity at that index is already set ARROW-10780 - [R] Update known R installation issues for CentOS 7 ARROW-10791 - [Rust] StreamReader, read_dictionary duplicating schema info ARROW-10801 - [Rust] [Flight] Support sending FlightData for Dictionaries with that of a RecordBatch ARROW-10803 - [R] Support R &gt;= 3.3 and add CI ARROW-10804 - [Rust] Remove UB on parquet crate ARROW-10807 - [Rust][DataFusion] Avoid double hashing ARROW-10810 - [Rust] Speed up comparison kernels ARROW-10811 - [R][CI] Remove nightly centos6 build ARROW-10823 - MutableArrayData with use_null false yields wrong results ARROW-10830 - [Rust] json reader should not hard crash on invalid json ARROW-10833 - [Python] Avoid usage of NumPy’s PyArray_DescrCheck macro ARROW-10834 - [R] Fix print method for SubTreeFileSystem ARROW-10837 - [Rust] Use `Vec&lt;u8&gt;` for hash key instead ARROW-10840 - [C++] Parquet FileMetaData does not have key_value_metadata when built from FileMetaDataBuilder ARROW-10842 - [Rust] decouple IO from json schema inference code ARROW-10844 - [Rust] [DataFusion] join of two DataFrames is not possible ARROW-10850 - [R] Unrecognized compression type: LZ4 ARROW-10852 - [C++] AssertTablesEqual(verbose=true) segfaults if the left array has more rows ARROW-10854 - [Rust] [DataFusion] Simplified logical scans ARROW-10855 - [Python][Numpy] ArrowTypeError after upgrading NumPy to 1.20.0rc1 ARROW-10856 - [R] CentOS 7 not correctly identifying compiler version ARROW-10859 - [Rust] [DataFusion] Make collect not require ExecutionContext ARROW-10860 - [Java] Avoid integer overflow for generated classes in Vector ARROW-10863 - [Python] ExtensionArray.to_pandas not working ARROW-10863 - [Python] ExtensionArray.to_pandas not working ARROW-10875 - simplify simd cfg check ARROW-10876 - [Rust] json reader should validate value type ARROW-10897 - [Rust] Replace Arc&lt;String&gt; by String in DataType::Timestamp ARROW-10907 - [Rust] Cast UTF8 to Date64 Incorrect ARROW-10913 - [Python][Doc] Code block typo in filesystems docs ARROW-10914 - [Rust]: SIMD implementation of arithmetic kernels reads out of bounds ARROW-10915 - [Rust] Make ARROW_TEST_DATA and PARQUET_TEST_DATA absolute dirs ARROW-10921 - `TypeError: ‘coroutine’ object is not iterable` when reading parquet partitions via s3fs &gt;= 0.5 with pyarrow ARROW-10930 - [Python] LargeListType doesn’t have a value_field ARROW-10932 - [C++] BinaryMemoTable::CopyOffsets access out-of-bound address when data is empty ARROW-10932 - [C++] BinaryMemoTable::CopyOffsets access out-of-bound address when data is empty ARROW-10942 - [C++] S3FileSystem::Impl::IsEmptyDirectory fails on Amazon S3 ARROW-10943 - [Rust] Intermittent build failure in parquet encoding ARROW-10954 - [C++][Doc] PlasmaClient is threadSafe now, doc not update ARROW-10955 - [C++] Reading empty json lists results in invalid non-nullable null type ARROW-10960 - [C++][FlightRPC] Missing protobuf data_body should result in default value of empty bytes, not null ARROW-10962 - [Java][FlightRPC] FlightData deserializer should accept missing fields ARROW-10967 - [Rust] Make env vars ARROW_TEST_DATA and PARQUET_TEST_DATA optional ARROW-10990 - [Rust]: SIMD implementation of compare kernels reads out of bounds ARROW-10994 - [Rust] Fix bugs in TPC-H file conversion ARROW-10996 - [Rust] Return error messages via Result for get_arrow_schema_from_metadata ARROW-10999 - [Rust] TPC-H parquet files cannot be read by Apache Spark ARROW-11014 - [Rust] [DataFusion] ParquetExec reports incorrect statistics ARROW-11023 - [C++][CMake] gRPC doesn’t respect CMAKE_CXX_COMPILER ARROW-11024 - [C++][Parquet] Writing List&lt;Struct&gt; to parquet sometimes writes wrong data ARROW-11025 - [Rust] Bench for boolean kernels measure array creation ARROW-11030 - [Rust] [DataFusion] HashJoinExec slow with many batches ARROW-11048 - [Rust] Add bench to MutableBuffer ARROW-11050 - [R] Handle RecordBatch in write_parquet ARROW-11067 - [C++] CSV reader returns nulls for some strings on macOS ARROW-11069 - [C++] Parquet writer incorrect data being written when data type is struct ARROW-11073 - [Rust] Lint Error on CI Tests in /arrow/rust/arrow/src/ipc/reader.rs ARROW-11083 - [CI] Build “Source Release and Merge Script” is broken ARROW-11084 - [Rust] Clippy failing in master ARROW-11085 - [Rust] Rust CI no longer works b/c it uses action-rs: Migrate CI away from action-rs/* ARROW-11092 - [CI] (Temporarily) move offending workflows to separate files ARROW-11102 - [Rust][DataFusion] fmt::Debug for ScalarValue(Utf8) is always quoted ARROW-11113 - [Rust] support as_struct_array cast ARROW-11114 - [Java] Metadata serialization is broken for Field class ARROW-11132 - [CI] Use pip to install crossbow’s dependencies for the comment bot ARROW-11144 - [C++][Python][CI] Fix HDFS nightly build ARROW-11152 - [CI][C++] Fix Homebrew numpy installation on macOS builds ARROW-11162 - [C++] Fix crash on Decimal256 Parquet file (OSS-Fuzz) ARROW-11163 - [C++][Python] Compressed Feather file written with pyarrow 0.17 not readable in pyarrow 2.0.0+ ARROW-11166 - [Python][Compute] Add bindings for ProjectOptions ARROW-11171 - [Go] Build fails on s390x with noasm tag ARROW-11189 - [Developer] Achery benchmark diff cannot compare two jsons ARROW-11190 - [C++][Dataset] Clean up compiler warnings ARROW-11202 - [R][CI] Nightly builds not happening (or artifacts not exported) ARROW-11224 - [R] don’t test metadata serialization on old R versions ARROW-11226 - [Python][CI] Filesystem tests failing with s3fs 0.5.2 ARROW-11227 - [Python][CI] AMD64 Conda Python 3.7 Pandas 0.24 cron job failing in to_pandas extension dtype test ARROW-11229 - [C++][Dataset] Static build is failed ARROW-11230 - [R] Fix build failures on Windows when multiple libarrow binaries found ARROW-11232 - [C++] Table::CombineChunks() returns incorrect results if Table has no column ARROW-11233 - [C++][Flight] Fail to link with bundled gRPC and Abseil ARROW-11237 - [C++] Compiler error with GLog and unity build enabled ARROW-11251 - [CI] Make sure that devtoolset-8 is really installed + being used ARROW-11253 - [R] Make sure that large metadata tests are reproducible ARROW-11255 - [Packaging][Conda][macOS] Fix Python version ARROW-11257 - [C++][Parquet] PyArrow Table contains different data after writing and reloading from Parquet ARROW-11271 - [Rust] [Parquet] List schema to Arrow parser misinterpreting child nullability ARROW-11274 - [Packaging][wheel][Windows] Fix wheels path for Gemfury ARROW-11275 - [Packaging][wheel][Linux] Fix paths for Gemfury ARROW-11277 - [C++] Fix compilation error in dataset expressions on macOS 10.11 ARROW-11283 - [Julia] Fix install link ARROW-11286 - [Release][Yum] Fix minimal build example check ARROW-11287 - [Packaging][RPM] Add missing dependencies ARROW-11299 - [Python] build warning in python ARROW-11301 - [C++] Fix reading LZ4-compressed Parquet files produced by Java Parquet implementation ARROW-11302 - [Release][Python] Remove verification of python 3.5 wheel on macOS ARROW-11303 - [Release][C++] Enable mimalloc in the windows verification script ARROW-11305 - [Rust]: parquet-rowcount binary tries to open itself as a parquet file ARROW-11306 - [Packaging][Ubuntu][16.04] Add missing libprotobuf-dev dependency ARROW-11320 - [C++] Spurious test failure when creating temporary dir ARROW-11322 - [Rust] Arrow `memory` made private is a breaking API change PARQUET-1935 - [C++][Parquet] nullptr access violation when writing arrays of non-nullable values New Features and Improvements ARROW-1846 - [C++] Implement “any” reduction kernel for boolean data ARROW-3850 - [Python] Support MapType and StructType for enhanced PySpark integration ARROW-4193 - [Rust] Add support for decimal data type ARROW-4544 - [Rust] Read nested JSON structs into StructArrays ARROW-4804 - [Rust] Read temporal values from CSV - Parse Date32 and Date64 in CSV reader ARROW-4960 - [R] Add crossbow task for r-arrow-feedstock ARROW-4970 - [C++][Parquet] Implement parquet::FileMetaData::Equals ARROW-5336 - [C++] Implement arrow::Concatenate for dictionary-encoded arrays with unequal dictionaries ARROW-5350 - [Rust] Support filtering on primitive/string lists ARROW-5394 - [C++] Benchmarks for IsIn Kernel ARROW-5679 - [Python] Drop Python 3.5 from support matrix ARROW-5950 - [Rust] [DataFusion] Add ability to log via logger dependency ARROW-6071 - [C++] Implement casting Binary &lt;-&gt; LargeBinary ARROW-6697 - [Rust] [DataFusion] Validate that all parquet partitions have the same schema ARROW-6715 - [Website] Describe “non-free” component is needed for Plasma packages in install page ARROW-6883 - [C++] Support sending delta DictionaryBatch or replacement DictionaryBatch in IPC stream writer class ARROW-6995 - [Packaging][Crossbow] The windows conda artifacts are not uploaded to GitHub releases ARROW-7531 - [C++] Investigate header cost reduction ARROW-7633 - [C++][CI] Create fuzz targets for tensors and sparse tensors ARROW-7800 - [Python] Expose GetRecordBatchReader API in PyArrow ARROW-7842 - [Rust] [Parquet] Implement array reader for list type ARROW-8113 - [C++] Implement a lighter-weight variant ARROW-8199 - [C++] Add support for multi-column sort on Table ARROW-8289 - [Rust] [Parquet] Implement minimal Arrow Parquet writer as starting point for full writer ARROW-8423 - [Rust] [Parquet] Serialize arrow schema into metadata when writing parquet ARROW-8425 - [Rust] [Parquet] Add support for writing temporal types ARROW-8426 - [Rust] [Parquet] Add support for writing dictionary types ARROW-8426 - [Rust] [Parquet] Add support for writing dictionary types ARROW-8853 - [Rust] [Integration Testing] Enable Flight tests ARROW-8876 - [C++] Implement casts from date types to Timestamp ARROW-8883 - [Rust] [Integration Testing] Enable passing tests and update spec doc ARROW-9001 - [R] Box outputs as correct type in call_function ARROW-9128 - [C++] Implement string space trimming kernels: trim, ltrim, and rtrim ARROW-9164 - [C++] Provide APIs for adding “docstrings” to arrow::compute::Function classes that can be accessed by bindings ARROW-9187 - [R] Add bindings for arithmetic kernels ARROW-9296 - [CI][Rust] Enable more clippy lint checks ARROW-9304 - [C++] Add “AppendEmptyValue” builder APIs for use inside StructBuilder::AppendNull ARROW-9361 - [Rust] Move other array types into their own modules ARROW-9400 - [Python] Do not depend on conda-forge static libraries in Windows wheel builds ARROW-9475 - [Java] Clean up usages of BaseAllocator, use BufferAllocator instead ARROW-9489 - [C++] Add fill_null kernel implementation for (array[string], scalar[string]) ARROW-9555 - [Rust] [DataFusion] Add inner (hash) equijoin physical plan ARROW-9564 - [Packaging] Vendor r-arrow-feedstock conda-forge recipe ARROW-9674 - [Rust] Parquet reader should implement Send + Sync ARROW-9704 - [Java] TestEndianness.testLittleEndian fails on big endian platform ARROW-9707 - [Rust] [DataFusion] Re-implement threading model ARROW-9709 - [Java] Test cases in arrow-vector assume little-endian platform ARROW-9728 - [Rust] [Parquet] Compute nested definition and repetition for structs ARROW-9747 - [C++][Java][Format] Support Decimal256 Type ARROW-9771 - [Rust] [DataFusion] Predicate Pushdown Improvement: treat predicates separated by AND separately ARROW-9803 - [Go] Add initial support for s390x ARROW-9804 - [FlightRPC] Authentication Redesign ARROW-9828 - [Rust] [DataFusion] TableProvider trait should support predicate push-down ARROW-9861 - [Java] Failed Arrow Vector on big-endian platform ARROW-9862 - Throw an exception in UnsafeDirectLittleEndian on Big-Endian platform ARROW-9911 - [Rust][DataFusion] SELECT &lt;expression&gt; with no FROM clause should produce a single row of output ARROW-9945 - [C++][Dataset] Refactor Expression::Assume to return a Result ARROW-9991 - [C++] split kernels for strings/binary ARROW-10002 - [Rust] Trait-specialization requires nightly ARROW-10021 - [C++][Compute] Support finding nth frequently used value in mode kernel ARROW-10032 - [Documentation] C++ Windows docs are out of date ARROW-10079 - [Rust]: Benchmark and improve count_set_bits function ARROW-10095 - [Rust] [Parquet] Update for IPC changes ARROW-10097 - [C++] Persist SetLookupState in between usages of IsIn when filtering dataset batches ARROW-10106 - [FlightRPC][Java] Expose onIsReady() callback on OutboundStreamListener ARROW-10108 - [Rust] [Parquet] Fix compiler warning about unused return value ARROW-10109 - [Rust] Add support to produce a C Data interface ARROW-10110 - [Rust] Add support to consume C Data Interface ARROW-10131 - [C++][Dataset] Lazily parse parquet metadata / statistics in ParquetDatasetFactory and ParquetFileFragment ARROW-10135 - [Rust] [Parquet] Refactor file module to help adding sources ARROW-10143 - [C++] ArrayRangeEquals should accept EqualOptions ARROW-10144 - [Flight] Add support for using the TLS_SNI extension ARROW-10149 - [Rust] Add support to external release of un-owned buffers ARROW-10163 - [Rust] [DataFusion] Add DictionaryArray coercion support ARROW-10168 - [Rust] [Parquet] Extend arrow schema conversion to projected fields ARROW-10173 - [Rust][DataFusion] Improve performance of equality to a constant predicate support ARROW-10180 - [C++][Doc] Update dependency management docs following aws-sdk-cpp addition ARROW-10182 - [C++] Add basic continuation support to futures ARROW-10191 - [Rust] [Parquet] Add roundtrip tests for single column batches ARROW-10197 - [Gandiva][python] Execute expression on filtered data ARROW-10203 - [Doc] Capture guidance for endianness support in contributors guide. ARROW-10207 - [C++] Unary kernels that results in a list have no preallocated offset buffer ARROW-10208 - [C++] String split kernels do not propagate nulls correctly on sliced input ARROW-10216 - [Rust] Simd implementation of min/max aggregation kernels for primitive types ARROW-10224 - [Python] Add support for Python 3.9 except macOS wheel and Windows wheel ARROW-10225 - [Rust] [Parquet] Fix null bitmap comparisons in roundtrip tests ARROW-10228 - [Julia] Donate Julia Implementation ARROW-10236 - [Rust] [DataFusion] Make DataFusion casting rules consistent with cast kernel ARROW-10241 - [C++][Compute] Add variance kernel benchmark ARROW-10249 - [Rust]: Support Dictionary types for ListArrays in arrow json reader ARROW-10259 - [Rust] Support field metadata ARROW-10261 - [Rust] [BREAKING] Lists should take Field instead of DataType ARROW-10263 - [C++][Compute] Improve numerical stability of variances merging ARROW-10268 - [Rust] Support writing dictionaries to IPC file and stream ARROW-10269 - [Rust] Update nightly: Oct 2020 Edition ARROW-10277 - [C++] Support comparing scalars approximately ARROW-10289 - [Rust] Support reading dictionary streams ARROW-10292 - [Rust] [DataFusion] Simplify merge ARROW-10295 - [Rust] [DataFusion] Simplify accumulators ARROW-10299 - [Rust] Support reading and writing V5 of IPC metadata ARROW-10300 - [Rust] Improve benchmark documentation for generating/converting TPC-H data ARROW-10301 - [C++] Add “all” boolean reducing kernel ARROW-10302 - [Python] Don’t double-package plasma-store-server ARROW-10304 - [C++][Compute] Optimize variance kernel for integers ARROW-10310 - [C++][Gandiva] Add single argument round() in Gandiva ARROW-10311 - [Release] Update crossbow verification process ARROW-10313 - [C++] Improve UTF8 validation speed and CSV string conversion ARROW-10318 - [C++] Use pimpl idiom in CSV parser ARROW-10319 - [Flight][Go] Add Context to Client Auth Handler functions for Flight ARROW-10320 - [Rust] Convert RecordBatchIterator to a Stream ARROW-10322 - [C++][Dataset] Minimize Expression to a wrapper around compute::Function ARROW-10323 - [Release][wheel] Add missing verification setup step ARROW-10325 - [C++][Compute] Separate aggregate kernel registration ARROW-10328 - [C++] Consider using fast-double-parser ARROW-10330 - [Rust][Datafusion] Implement nullif() function for DataFusion ARROW-10331 - [Rust] [DataFusion] Re-organize errors ARROW-10332 - [Rust] Allow CSV reader to start from a line ARROW-10334 - [Rust] [Parquet] Support reading and writing Arrow NullArray ARROW-10336 - [Rust] Support fromIter and toIter for strings ARROW-10337 - [C++] More liberal parsing of ISO8601 timestamps with fractional seconds ARROW-10338 - [Rust]: Use const fn for applicable methods ARROW-10340 - [Packaging][deb][RPM] Use Python 3.8 for pygit2 ARROW-10356 - [Rust] [DataFusion] Add support for is_in ARROW-10363 - [Python] Remove workaround for CMake bug in manylinux ARROW-10366 - [Rust] [DataFusion] Remove collect from merge ARROW-10375 - [Rust] Remove PrimitiveArrayOps ARROW-10378 - [Rust] Update take() kernel with support for large lists ARROW-10381 - [Rust] Generalize Arrow to support MergeSort ARROW-10382 - [Rust] Fix typos and spelling ARROW-10383 - [Doc] Fix typos and spelling ARROW-10384 - [C++] Fix typos and spelling ARROW-10385 - [C++][Gandiva] Add support for LLVM 11 ARROW-10389 - [Rust][DataFusion] Make the custom source implementation API more explicit ARROW-10392 - [C++][Gandiva] Avoid string copy while evaluating IN expression ARROW-10396 - [Rust] [Parquet] Expose SliceableCursor and FileSource ARROW-10398 - [Rust] [Parquet] Re-export parquet::record::api::Field ARROW-10400 - Propagate TLS client peer_identity when using mutual TLS ARROW-10402 - [Rust] Improve array equality ARROW-10407 - [C++] Division Support in Decimal256 ARROW-10408 - [Java] Upgrade Avro dependency to 1.10 ARROW-10410 - [Rust] Some refactorings ARROW-10416 - [R] Support Tables in Flight ARROW-10422 - [Rust] Removed unused BinaryArrayBuilder ARROW-10424 - [Rust] Simplify code for impl PrimitiveArray ARROW-10428 - [FlightRPC][Java] Add support for HTTP cookies ARROW-10445 - [Rust] Add DoubleEnded to PrimitiveArrayIter ARROW-10449 - [Rust] Make dictionary keys be a PrimitiveArray ARROW-10454 - [Rust][Datafusion] support creating ParquetExec from externally resolved file list and schema ARROW-10455 - [Rust] Fix CI cache misses on windows ARROW-10458 - [Rust] [Datafusion] context.create_logical_plan should not take a mutable self reference ARROW-10464 - [Rust] Implement utility to convert TPC-H tbl files to CSV and Parquet ARROW-10466 - [Rust] [Website] Update implementation status page ARROW-10467 - [FlightRPC][Java] Ability to pass arbitrary client properties to server ARROW-10468 - [C++][Compute] Refactor FunctionExecutor -&gt; KernelExecutor ARROW-10476 - [Rust] Allow string array to be built from iterator of &amp;str ARROW-10477 - [Rust] Add support for iterators over binary arrays ARROW-10478 - [Dev][Release] Correct Java versions to 3.0.0-SNAPSHOT ARROW-10481 - [R] Bindings to add, remove, replace Table columns ARROW-10483 - [C++] Move Executor into a separate header ARROW-10484 - [C++] Future&lt;{void,Status}&gt; could be more generic ARROW-10487 - [FlightRPC][C++] Header-based auth in clients ARROW-10490 - [C++][GLib] Fail to build with Xcode 12.0.1 ARROW-10492 - [Java][JDBC] Allow users to config the mapping between SQL types and Arrow types ARROW-10504 - [C++] Suppress UBSAN pointer-overflow warning in RapidJSON ARROW-10510 - [Rust] [DataFusion] Add benchmarks for COUNT(DISTINCT) ARROW-10515 - [Julia][Doc] Update lists of supported languages to include Julia ARROW-10522 - [R] Allow rename Table and RecordBatch columns with names() ARROW-10526 - [FlightRPC][C++] HTTP cookie handling in clients ARROW-10530 - [R] Optionally use distro package in linuxlibs.R ARROW-10531 - [Rust] [DataFusion] Better display for logical plans: Graphviz and Schema information ARROW-10539 - [Packaging][Python] Use GitHub Actions to build wheels for Windows ARROW-10540 - [Rust] Allow unary kernels of arbitrary array types ARROW-10541 - [C++] Add re2 library to core arrow / ARROW_WITH_RE2 ARROW-10542 - [C#][Flight] Add beginning on flight code for net core ARROW-10543 - [Developer] Update dev instructions to note there may be a timelag ARROW-10552 - [Rust] Remove un-used Result from Buffer ARROW-10559 - [Rust] [DataFusion] Break up logical_plan/mod.rs into smaller modules ARROW-10561 - [Rust] Simplify `MutableBuffer::write` and `MutableBuffer::write_bytes` ARROW-10562 - [Rust] Potential UB on unsafe code ARROW-10566 - [C++] Array validation should work on ArrayData ARROW-10567 - [C++][FlightRPC] Add options to help increase precision of arrow-flight-benchmark ARROW-10572 - [Rust][DataFusion] Use aHash and std::collections hashmap for aggregates / distinct ARROW-10574 - [Python][Parquet] Allow collections for ‘in’ / ‘not in’ filter (in addition to sets) ARROW-10575 - [Rust] Rename union.rs to be cosistent with other arrays ARROW-10581 - [Doc] IPC dictionary reference to relevant section ARROW-10582 - [Rust] [DataFusion] Implement “repartition” operator ARROW-10584 - [Rust] [DataFusion] Implement SQL join support using explicit JOIN ON syntax ARROW-10585 - [Rust] [DataFusion] Add join support to DataFrame and LogicalPlan ARROW-10586 - [Rust] [DataFusion] Add join support to query planner ARROW-10589 - [Rust]: Implement AVX-512 bit and operation ARROW-10590 - [Rust] Remove Date32(Millisecond) from test ARROW-10591 - [Rust] Add support to structArrays for MutableArrayData ARROW-10595 - [Rust] Simplify inner loop of min/max kernels for non-null case ARROW-10596 - [Rust] Improve take benchmark ARROW-10598 - [C++] Improve performance of GenerateBitsUnrolled ARROW-10604 - [Ruby] Support Decimal256 type ARROW-10607 - [C++][Parquet] Support Reading/Writing Decimal256 type in Parquet ARROW-10609 - [Rust] Optimize min/max of non null strings ARROW-10628 - [Rust] Make clippy error on clippy warnings ARROW-10633 - [Rust][DataFusion] Dependency version upgrades ARROW-10634 - [C#][CI] Change the build version from 2.2 to 3.1 in CI ARROW-10636 - [Rust] Remove specialisation from Rust parquet ARROW-10637 - [Rust] Add examples to boolean kernels ARROW-10638 - [Rust] Improve tests of boolean kernels ARROW-10639 - [Rust] Simplify signature of is_null and add example ARROW-10644 - [Python] Consolidate path/filesystem handling in pyarrow.dataset and pyarrow.fs ARROW-10646 - [C++][FlightRPC] Disable flaky test ARROW-10648 - [Java] Prepare Java codebase for source release without requiring any git tags to be created or pushed ARROW-10651 - [C++] alloc-dealloc-mismatch in s3fs.cc ARROW-10652 - [C++][Gandiva] Make gandiva cache size configurable ARROW-10653 - [Rust]: Update toolchain version to bring new features ARROW-10654 - [Rust] Specialize parsing of floats / bools ARROW-10660 - [Rust] Implement AVX-512 bit or operation ARROW-10665 - [Rust] Add fast paths for common utf8 like patterns ARROW-10666 - [Rust] [DataFusion] Support nested SELECT statements ARROW-10669 - [C++][Compute] Support Scalar inputs to boolean kernels ARROW-10672 - [Rust] [DataFusion] Make limit be computed as a stream ARROW-10673 - [Rust] [DataFusion] Make sort be computed on the stream ARROW-10674 - [Rust] Add integration tests for Decimal type ARROW-10677 - [Rust] Fix Bug and Add tests as documentation showing supported csv parsing ARROW-10679 - [Rust] [DataFusion] Implement SQL CASE WHEN physical expression ARROW-10680 - [Rust] [DataFusion] Implement TPC-H Query 12 ARROW-10682 - [Rust] Sort kernel performance tuning ARROW-10685 - [Rust] [DataFusion] Add support for join on filter pushdown optimizer ARROW-10688 - [Rust] [DataFusion] Support CASE WHEN from DataFrame API ARROW-10689 - [Rust] [DataFusion] Support CASE WHEN from SQL ARROW-10693 - [Rust] [DataFusion] Add support for the left join ARROW-10696 - [C++] Investigate a bit run reader that would only return runs of set bits ARROW-10697 - [C++] Consolidate bitmap word readers ARROW-10703 - [Rust] [DataFusion] Make join not collect left on every part ARROW-10704 - [Rust][DataFusion] Remove Nested from expression enum ARROW-10708 - [Packaging][deb] Add support for Ubuntu 20.10 ARROW-10709 - [Python] Difficult to make an efficient zero-copy file reader in Python ARROW-10712 - [Rust] [DataFusion] Add tests to TPC-H benchmarks ARROW-10717 - [Rust] [DataFusion] Add support for right join ARROW-10720 - [C++] Add BasicDecimal256 Rescale Support ARROW-10721 - [C#][CI] Use .NET 3.1 by default ARROW-10722 - [Rust][DataFusion] Reduce overhead in data types in aggregations / joins, improve benchmarks ARROW-10723 - [Packaging][deb][RPM] Enable Parquet encription ARROW-10724 - [Developer Tools] Add labeler to when PRs need rebase ARROW-10725 - [Python][Compute] Exposing bindings for sort options ARROW-10728 - [Rust] [DataFusion] Add SQL support for JOIN with USING clause ARROW-10729 - [Rust] [DataFusion] Add SQL support for JOIN using implicit syntax ARROW-10732 - [Rust] [DataFusion] Add SQL support for table/relation aliases and compound identifiers ARROW-10733 - [R] Improvements to Linux installation troubleshooting ARROW-10740 - [Rust][DataFusion] Remove redundant clones found by clippy ARROW-10741 - Apply clippy lints to source code, remove them from ignore list ARROW-10742 - [Python] Mask not checked when creating array from numpy array ARROW-10745 - [Rust] Allocate padding bytes in filter context ARROW-10747 - [Rust] Optimizations for csv reader ARROW-10750 - [Rust] [DataFusion] Add SQL support for LEFT and RIGHT join ARROW-10752 - [GLib] Add garrow_schema_has_metadata() ARROW-10754 - [GLib] Add support for metadata to GArrowField ARROW-10755 - [Rust] [Parquet] Add support for writing boolean type ARROW-10756 - [Rust] Clippy - fix reduntant clone ARROW-10759 - [Rust][DataFusion] Implement support for casting string to date in sql expressions ARROW-10763 - [Rust] Speed up take kernels ARROW-10765 - [Rust] Optimize take strings for non-null arrays ARROW-10767 - [Rust] Speed up sum kernel with nulls ARROW-10770 - [Rust] Support reading nested JSON lists ARROW-10772 - [Rust] Improve take performance ARROW-10775 - [Rust][DataFusion] Use ahash in hash join ARROW-10776 - [C++] Provide iterator access to primitive elements inside an Array ARROW-10781 - [Rust] [DataFusion] TableProvider should provide row count statistics ARROW-10783 - [Rust] [DataFusion] Implement row count statistics for Parquet TableProvider ARROW-10785 - Further optimize take string ARROW-10786 - [Packaging][RPM] Drop support for CentOS 6 ARROW-10788 - [C++] Make S3 recursive walks parallel ARROW-10789 - [Rust][DataFusion] Make TableProvider dynamically typed ARROW-10790 - [C++][Compute] Investigate ChunkedArray sort performance ARROW-10792 - [Rust] [CI] Modulararize CI for faster and smaller builds ARROW-10795 - [Rust] Fix specialization for arrow datatypes ARROW-10796 - [C++] Investigate RecordBatch sort performance ARROW-10800 - [Rust] [Parquet] Provide access to the elements of parquet::record::{List, Map} ARROW-10802 - [C++] Remove Dictionary[NullType] special casing in parquet column writer ARROW-10808 - [Rust] [DataFusion] Support nested expressions in aggregations ARROW-10809 - [C++] Use Datum for SortIndices() input ARROW-10812 - [Rust] Make BooleanArray not a PrimitiveArray ARROW-10813 - [Rust] [DataFusion] Implement DFSchema ARROW-10814 - [Packaging][deb] Drop support for Debian GNU/Linux Stretch ARROW-10817 - [Rust] [DataFusion] Implement TypedString ARROW-10820 - [Rust] [DataFusion] Complete TPC-H Benchmark Queries ARROW-10821 - [Rust] [Datafusion] implement negative expression ARROW-10822 - [Rust] [Datafusion] support compiling datafusion with simd support ARROW-10824 - [Rust] Added PartialEq for NullArray ARROW-10825 - [Rust] Add support to NullArrays for MutableArrayData ARROW-10826 - [Rust] Add support for FixedSizeBinary to MutableArrayData ARROW-10827 - [Rust] Extend concatenate to all types ARROW-10828 - [Rust][DataFusion] Enable more clippy lints ARROW-10829 - [Rust] [DataFusion] Implement Into&lt;Schema&gt; for DFSchema ARROW-10832 - [Rust] Evaluate latest snapshot flatc ARROW-10836 - [Rust] Extend take kernel to FixedSizeListArray ARROW-10838 - [Rust] [CI] Add CI for wasm32 target ARROW-10839 - [Rust] [DataFusion] Implement BETWEEN Operator ARROW-10843 - [C++] Add support for temporal types in sort family kernels ARROW-10845 - [Python][CI] Add python CI build using numpy nightly ARROW-10849 - [Python] Handle numpy deprecation warnings for builtin type aliases ARROW-10851 - [C++] Reduce code size of vector_sort.cc ARROW-10857 - [Packaging] Follow PowerTools repository name change on CentOS 8 ARROW-10858 - [C++][MSVC] Add missing Boost dependency ARROW-10861 - [Python] Update minimal NumPy version to 1.16.6 ARROW-10864 - [Rust] Use standard ordering for floats ARROW-10865 - [Rust][DataFusion] More ergonomic conversion between Schema, SchemaRef, DFSchema, and DFSchemaRef ARROW-10867 - build failure on aarch64 with -DARROW_PYTHON=ON and gcc ARROW-10869 - [GLib] Add garrow_*_sort_indices() and related options ARROW-10870 - [Julia] Update website with Julia implementation ARROW-10871 - [Julia] Setup Julia CI via GitHub Actions ARROW-10873 - [C++] Apple Silicon is reported as arm64 in CMake ARROW-10874 - [Rust][DataFusion] Add table statistics for MemTable ARROW-10877 - [Rust] [DataFusion] Add benchmark based on kaggle movies ARROW-10878 - [Rust] Simplify extend_from_slice ARROW-10879 - [Packaging][deb] Restore Debian GNU/Linux Buster configuration ARROW-10881 - [C++] EXC_BAD_ACCESS in BaseSetBitRunReader&lt;false&gt;::NextRun ARROW-10885 - [Rust][DataFusion] Optimize join build vs probe based on statistics on row number ARROW-10887 - [C++][Doc] Document IPC API ARROW-10889 - [Rust] Document our approach to unsafe code in README ARROW-10890 - [Rust] [DataFusion] JOIN support ARROW-10891 - [Rust][DataFusion] More clippy lints ARROW-10893 - [Rust] [DataFusion] Easier clippy fixes ARROW-10896 - [C++][CMake] Rename internal RE2 package name to “re2” from “RE2” ARROW-10900 - [Rust][DataFusion] Resolve TableScan provider eagerly ARROW-10904 - [Python] Add support for Python 3.9 macOS wheels ARROW-10905 - [Python] Add support for Python 3.9 windows wheels ARROW-10908 - [Rust] [DataFusion] Update relevant tpch-queries with BETWEEN ARROW-10917 - [Rust][Doc] Update feature matrix ARROW-10918 - [C++][Doc] Document supported Parquet features ARROW-10927 - [Rust] [Parquet] Add Decimal to ArrayBuilderReader for physical type fixed size binary ARROW-10927 - [Rust] [Parquet] Add Decimal to ArrayBuilderReader for physical type fixed size binary ARROW-10927 - [Rust] [Parquet] Add Decimal to ArrayBuilderReader for physical type fixed size binary ARROW-10929 - [Rust] Migrate CI tests to stable rust ARROW-10933 - [Rust] Update docs in regard to stable rust ARROW-10934 - [Python] Tests are failed with fsspec-0.8.5 ARROW-10938 - [Rust] upgrade dependency “flatbuffers” to 0.8 ARROW-10940 - [Rust] Extend sort kernel to ListArray ARROW-10941 - [Doc][C++] Document supported Parquet encryption features ARROW-10944 - [Rust] Implement min/max kernels for BooleanArray ARROW-10946 - [Rust] Make ChunkIter not depend on a buffer ARROW-10947 - [Rust][DataFusion] Refactor UTF8 to Date32 for Performance ARROW-10948 - [C++] Always use GTestConfig.cmake ARROW-10949 - [Rust] Avoid clones in getting values of boolean arrays ARROW-10951 - [Python][CI] Nightly pandas builds failing because of pytest monkeypatch issue ARROW-10952 - [Rust] Add pre-commit hook ARROW-10966 - [C++] Use FnOnce for ThreadPool’s tasks instead of std::function ARROW-10968 - [Rust][DataFusion] Don’t build hash table for right side of the join ARROW-10969 - [Rust][DataFusion] Implement ANSI SQL Functions ARROW-10985 - [Rust] Update unsafe guidelines for adding JIRA references ARROW-10986 - [Rust][DataFusion] Add average statistic to TCP-H benchmark too ARROW-10988 - [C++] Require CMake 3.5 or later ARROW-10989 - [Rust] Use slices for iterating primitive arrays ARROW-10993 - [CI][macOS] Fix Python 3.9 installation by Homebrew ARROW-10995 - [Rust] [DataFusion] Improve parallelism when reading Parquet files ARROW-11004 - [FlightRPC][Python] Header-based auth in clients ARROW-11005 - [Rust] Remove indirection from take kernel and simplify interface ARROW-11008 - [Rust][DataFusion] Simplify count accumulator ARROW-11009 - [Python] Add environment variable to elect default usage of system memory allocator instead of jemalloc/mimalloc ARROW-11010 - [Python] `np.float` deprecation warning in `_pandas_logical_type_map` ARROW-11012 - [Rust] [DataFusion] Make write_csv and write_parquet concurrent ARROW-11015 - [CI][Gandiva] Move gandiva nightly build from travis to github action ARROW-11018 - [Rust][DataFusion] Add null count column statistics ARROW-11026 - [Rust]: Run tests without requiring environment variables ARROW-11028 - [Rust] Somewhat pedantic pattern-matches ARROW-11029 - [Rust] [DataFusion] Document why join order optimization does not work with filter pushdown ARROW-11032 - [C++][FlightRPC] Add benchmark for local RPC through unix socket ARROW-11033 - [Rust] CSV writer performance improvements ARROW-11034 - [Rust] rustfmt cleanup ARROW-11035 - [Rust] Improve performance of cast to utf8 via FromIter ARROW-11037 - [Rust] Improve performance of string fromIter ARROW-11038 - [Rust] Remove `BufferBuilderTrait` and associated Result requirement. ARROW-11039 - [Rust] Improve performance for utf8 to float cast ARROW-11040 - [Rust] Simplify builders with generics ARROW-11042 - [Rust][DataFusion] Increase default batch size ARROW-11043 - [C++] Add “is_nan” kernel ARROW-11046 - [Rust][DataFusion] Add count_distinct to dataframe API ARROW-11049 - [Python] Expose alternate memory pools ARROW-11052 - [Rust] [DataFusion] Implement metrics in join operator ARROW-11053 - [Rust] [DataFusion] Optimize joins with dynamic capacity for output batches ARROW-11054 - Update SQLParser to 0.70 ARROW-11055 - [Rust] [DataFusion] Support date_trunc function ARROW-11058 - [Rust] [DataFusion] Implement “coalesce batches” operator ARROW-11063 - [Rust] Validate null counts when building arrays ARROW-11064 - [Rust][DataFusion] Speed up hash join on smaller batches ARROW-11072 - [Rust] [Parquet] Support int32 and int64 physical types ARROW-11076 - [Rust][DataFusion] Refactor usage of right indices in hash join ARROW-11079 - [R] Catch up on changelog since 2.0 ARROW-11080 - [C++][Dataset] Improvements to implicit casting ARROW-11082 - [Rust] Add FFI for LargeUtf8 ARROW-11086 - [Rust] Extend take to support more index types ARROW-11091 - [Rust][DataFusion] Fix clippy warning in rust 1.49 ARROW-11095 - [Python] Access pyarrow.RecordBatch column by name ARROW-11096 - [Rust] Add FFI for [Large]Binary ARROW-11097 - [Rust] Simplify tests ARROW-11099 - [Rust]: Remove unsafe value_slice method from PrimitiveArray and BooleanArray ARROW-11100 - [Rust] Speed up numeric to string cast using lexical_core ARROW-11101 - [Rust] enable “cargo +nightly fmt” in git pre-commit hook ARROW-11104 - [GLib] Add append_null/append_nulls to GArrowArrayBuilder and use them ARROW-11105 - [Rust] Favor From/Into traits in MutableBuffer ARROW-11108 - [Rust] Improve performance of MutableBuffer ARROW-11109 - [GLib] Add garrow_array_builder_append_empty_value() and values() ARROW-11110 - [Rust] [Datafusion] context.table should not take a mutable self reference ARROW-11111 - [GLib] Add GArrowFixedSizeBinaryArrayBuilder ARROW-11121 - [Developer] Use pull_request_target for PR JIRA integration ARROW-11122 - [Rust] Add FFI for date and time ARROW-11124 - [Doc] Update status matrix for Decimal256 ARROW-11125 - [Rust] Implement logical equality for list arrays ARROW-11126 - [Rust] Document and test ARROW-10656 ARROW-11127 - [C++] Unused cpu_info on non-x86 architecture ARROW-11129 - [Rust][DataFusion] Use tokio thread pool for loading parquet ARROW-11130 - [Website][CentOS 8][RHEL 8] Enable all required repositories by default ARROW-11131 - [Rust] Improve performance of bool_equal ARROW-11136 - [R] Bindings for is.nan ARROW-11137 - [Rust][DataFusion] Fix Clippy needless_range_loop, needless_lifetimes ARROW-11138 - [Rust] [DataFusion] Support ltrim, rtrim ARROW-11139 - [GLib] Add support for extension type ARROW-11155 - [C++][Packaging] Move gandiva crossbow jobs off of Travis-CI ARROW-11158 - [Julia] Implement Decimal256 support ARROW-11159 - [Developer] Consolidate pull request related jobs ARROW-11165 - [Rust] [DataFusion] Document the desired SQL dialect for DataFusion ARROW-11168 - [Rust] Fix cargo doc warnings ARROW-11169 - [Rust] Add a comment explaining where float total_order algorithm came from ARROW-11175 - [R] Small docs fixes ARROW-11176 - [R] Expose memory pool name and document setting it ARROW-11187 - [Rust] [Parquet] Pin specific parquet-format-rs version ARROW-11188 - [Rust] Implement crypto functions from PostgreSQL dialect ARROW-11193 - [Documentation] Add docs for Java ListVector ARROW-11194 - [Rust] Enable SIMD for aarch64 ARROW-11195 - [Rust] [DataFusion] Built-in table providers should expose relevant fields ARROW-11196 - [GLib] Add support for mock, HDFS and S3 file systems with factory function ARROW-11198 - [Packaging][Python] Ensure setuptools version during build supports markdown ARROW-11200 - [Rust] [DateFusion] Physical operators and expressions should have public accessor methods ARROW-11201 - [Rust] create_batch_empty - support more types ARROW-11203 - [Developer][Website] Enable JIRA and pull request integration ARROW-11204 - [C++] Fix build failure with bundled gRPC and Protobuf ARROW-11205 - [GLib][Dataset] Add GADFileFormat and its family ARROW-11209 - [Rust] DF - Provide better error message on unsupported GROUP BY ARROW-11210 - [CI] Restore workflows that had been blocked by INFRA ARROW-11212 - [Packaging][Python] Use vcpkg as dependency source for manylinux and windows wheels ARROW-11213 - [Packaging][Python] Dockerize wheel building on windows ARROW-11215 - [CI] Use named volumes by default for caching in docker-compose ARROW-11216 - [Rust] Improve documentation for StringDictionaryBuilder ARROW-11218 - [R] Make SubTreeFileSystem print method more informative ARROW-11219 - [CI][Ruby][MinGW] Reduce CI time ARROW-11221 - [Rust] DF Implement GROUP BY support for Float32/Float64 ARROW-11231 - [Packaging] Add mimalloc to Linux builds ARROW-11234 - [CI][Ruby][macOS] Reduce CI time ARROW-11236 - [Java] Bump Jackson to 2.11.4 ARROW-11240 - [Packaging][R] Add mimalloc to R packaging ARROW-11242 - [CI] Remove CMake 3.2 job ARROW-11245 - [C++][Gandiva] Add support for LLVM 11.1 ARROW-11247 - [C++] Infer date32 columns in CSV ARROW-11256 - [Packaging][Linux] Don’t buffer packaging output ARROW-11268 - [Rust][DataFusion] Support specifying repartitions in MemTable ARROW-11272 - [Release][wheel] Remove unsupported Python 3.5 and manylinux1 ARROW-11273 - [Release][deb] Remove unsupported Debian GNU/Linux stretch ARROW-11278 - [Release][NodeJS] Don’t touch ~/.bash_profile ARROW-11280 - [Release][APT] Fix minimal build example check ARROW-11281 - [C++] Remove needless runtime RapidJSON dependency ARROW-11282 - [Packaging][deb] Add missing libgflags-dev dependency ARROW-11285 - [Release][APT] Add support for Ubuntu Groovy ARROW-11292 - [Release][JS] Use Node.JS LTS ARROW-11293 - [C++] Don’t require Boost and gflags with find_package(Arrow) ARROW-11307 - [Release][Ubuntu][20.10] Add workaround for dependency issue ARROW-11309 - [Release][C#] Use .NET 3.1 for verification ARROW-11329 - [Rust] Do not rebuild the library on every change ARROW-11372 - Support RC verification on macOS-ARM64 PARQUET-1566 - [C++] Indicate if null count, distinct count are present in column statistics","headline":"Apache Arrow 3.0.0 Release","image":"https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png","mainEntityOfPage":{"@type":"WebPage","@id":"https://arrow.apache.org/release/3.0.0.html"},"publisher":{"@type":"Organization","logo":{"@type":"ImageObject","url":"https://arrow.apache.org/img/logo.png"}},"url":"https://arrow.apache.org/release/3.0.0.html"}</script>
<!-- End Jekyll SEO tag -->
<!-- favicons -->
<link rel="icon" type="image/png" sizes="16x16" href="/img/favicon-16x16.png" id="light1">
<link rel="icon" type="image/png" sizes="32x32" href="/img/favicon-32x32.png" id="light2">
<link rel="apple-touch-icon" type="image/png" sizes="180x180" href="/img/apple-touch-icon.png" id="light3">
<link rel="apple-touch-icon" type="image/png" sizes="120x120" href="/img/apple-touch-icon-120x120.png" id="light4">
<link rel="apple-touch-icon" type="image/png" sizes="76x76" href="/img/apple-touch-icon-76x76.png" id="light5">
<link rel="apple-touch-icon" type="image/png" sizes="60x60" href="/img/apple-touch-icon-60x60.png" id="light6">
<!-- dark mode favicons -->
<link rel="icon" type="image/png" sizes="16x16" href="/img/favicon-16x16-dark.png" id="dark1">
<link rel="icon" type="image/png" sizes="32x32" href="/img/favicon-32x32-dark.png" id="dark2">
<link rel="apple-touch-icon" type="image/png" sizes="180x180" href="/img/apple-touch-icon-dark.png" id="dark3">
<link rel="apple-touch-icon" type="image/png" sizes="120x120" href="/img/apple-touch-icon-120x120-dark.png" id="dark4">
<link rel="apple-touch-icon" type="image/png" sizes="76x76" href="/img/apple-touch-icon-76x76-dark.png" id="dark5">
<link rel="apple-touch-icon" type="image/png" sizes="60x60" href="/img/apple-touch-icon-60x60-dark.png" id="dark6">
<script>
// Switch to the dark-mode favicons if prefers-color-scheme: dark
function onUpdate() {
light1 = document.querySelector('link#light1');
light2 = document.querySelector('link#light2');
light3 = document.querySelector('link#light3');
light4 = document.querySelector('link#light4');
light5 = document.querySelector('link#light5');
light6 = document.querySelector('link#light6');
dark1 = document.querySelector('link#dark1');
dark2 = document.querySelector('link#dark2');
dark3 = document.querySelector('link#dark3');
dark4 = document.querySelector('link#dark4');
dark5 = document.querySelector('link#dark5');
dark6 = document.querySelector('link#dark6');
if (matcher.matches) {
light1.remove();
light2.remove();
light3.remove();
light4.remove();
light5.remove();
light6.remove();
document.head.append(dark1);
document.head.append(dark2);
document.head.append(dark3);
document.head.append(dark4);
document.head.append(dark5);
document.head.append(dark6);
} else {
dark1.remove();
dark2.remove();
dark3.remove();
dark4.remove();
dark5.remove();
dark6.remove();
document.head.append(light1);
document.head.append(light2);
document.head.append(light3);
document.head.append(light4);
document.head.append(light5);
document.head.append(light6);
}
}
matcher = window.matchMedia('(prefers-color-scheme: dark)');
matcher.addListener(onUpdate);
onUpdate();
</script>
<link rel="stylesheet" href="//fonts.googleapis.com/css?family=Lato:300,300italic,400,400italic,700,700italic,900">
<link href="/css/main.css" rel="stylesheet">
<link href="/css/syntax.css" rel="stylesheet">
<script src="/javascript/main.js"></script>
<!-- Matomo -->
<script>
var _paq = window._paq = window._paq || [];
/* tracker methods like "setCustomDimension" should be called before "trackPageView" */
/* We explicitly disable cookie tracking to avoid privacy issues */
_paq.push(['disableCookies']);
_paq.push(['trackPageView']);
_paq.push(['enableLinkTracking']);
(function() {
var u="https://analytics.apache.org/";
_paq.push(['setTrackerUrl', u+'matomo.php']);
_paq.push(['setSiteId', '20']);
var d=document, g=d.createElement('script'), s=d.getElementsByTagName('script')[0];
g.async=true; g.src=u+'matomo.js'; s.parentNode.insertBefore(g,s);
})();
</script>
<!-- End Matomo Code -->
</head>
<body class="wrap">
<header>
<nav class="navbar navbar-expand-md navbar-dark bg-dark">
<a class="navbar-brand no-padding" href="/"><img src="/img/arrow-inverse-300px.png" height="40px"/></a>
<button class="navbar-toggler ml-auto" type="button" data-toggle="collapse" data-target="#arrow-navbar" aria-controls="arrow-navbar" aria-expanded="false" aria-label="Toggle navigation">
<span class="navbar-toggler-icon"></span>
</button>
<!-- Collect the nav links, forms, and other content for toggling -->
<div class="collapse navbar-collapse justify-content-end" id="arrow-navbar">
<ul class="nav navbar-nav">
<li class="nav-item"><a class="nav-link" href="/overview/" role="button" aria-haspopup="true" aria-expanded="false">Overview</a></li>
<li class="nav-item"><a class="nav-link" href="/faq/" role="button" aria-haspopup="true" aria-expanded="false">FAQ</a></li>
<li class="nav-item"><a class="nav-link" href="/blog" role="button" aria-haspopup="true" aria-expanded="false">Blog</a></li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownGetArrow" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Get Arrow
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownGetArrow">
<a class="dropdown-item" href="/install/">Install</a>
<a class="dropdown-item" href="/release/">Releases</a>
<a class="dropdown-item" href="https://github.com/apache/arrow">Source Code</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownDocumentation" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Documentation
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownDocumentation">
<a class="dropdown-item" href="/docs">Project Docs</a>
<a class="dropdown-item" href="/docs/format/Columnar.html">Format</a>
<hr/>
<a class="dropdown-item" href="/docs/c_glib">C GLib</a>
<a class="dropdown-item" href="/docs/cpp">C++</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/csharp/README.md">C#</a>
<a class="dropdown-item" href="https://godoc.org/github.com/apache/arrow/go/arrow">Go</a>
<a class="dropdown-item" href="/docs/java">Java</a>
<a class="dropdown-item" href="/docs/js">JavaScript</a>
<a class="dropdown-item" href="/julia/">Julia</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/matlab/README.md">MATLAB</a>
<a class="dropdown-item" href="/docs/python">Python</a>
<a class="dropdown-item" href="/docs/r">R</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/ruby/README.md">Ruby</a>
<a class="dropdown-item" href="https://docs.rs/arrow/latest">Rust</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownSubprojects" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Subprojects
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownSubprojects">
<a class="dropdown-item" href="/adbc">ADBC</a>
<a class="dropdown-item" href="/docs/format/Flight.html">Arrow Flight</a>
<a class="dropdown-item" href="/docs/format/FlightSql.html">Arrow Flight SQL</a>
<a class="dropdown-item" href="https://datafusion.apache.org">DataFusion</a>
<a class="dropdown-item" href="/nanoarrow">nanoarrow</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownCommunity" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Community
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownCommunity">
<a class="dropdown-item" href="/community/">Communication</a>
<a class="dropdown-item" href="/docs/developers/index.html">Contributing</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/issues">Issue Tracker</a>
<a class="dropdown-item" href="/committers/">Governance</a>
<a class="dropdown-item" href="/use_cases/">Use Cases</a>
<a class="dropdown-item" href="/powered_by/">Powered By</a>
<a class="dropdown-item" href="/visual_identity/">Visual Identity</a>
<a class="dropdown-item" href="/security/">Security</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/policies/conduct.html">Code of Conduct</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownASF" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
ASF Links
</a>
<div class="dropdown-menu dropdown-menu-right" aria-labelledby="navbarDropdownASF">
<a class="dropdown-item" href="https://www.apache.org/">ASF Website</a>
<a class="dropdown-item" href="https://www.apache.org/licenses/">License</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/sponsorship.html">Donate</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/thanks.html">Thanks</a>
<a class="dropdown-item" href="https://www.apache.org/security/">Security</a>
</div>
</li>
</ul>
</div><!-- /.navbar-collapse -->
</nav>
</header>
<div class="container p-4 pt-5">
<main role="main" class="pb-5">
<!--
-->
<h1 id="apache-arrow-300-26-january-2021">Apache Arrow 3.0.0 (26 January 2021)</h1>
<p>This is a major release covering more than 3 months of development.</p>
<h2 id="download">Download</h2>
<ul>
<li><a href="https://www.apache.org/dyn/closer.lua/arrow/arrow-3.0.0/"><strong>Source Artifacts</strong></a></li>
<li><strong>Binary Artifacts</strong>
<ul>
<li><a href="https://apache.jfrog.io/artifactory/arrow/centos/">For CentOS</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/debian/">For Debian</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/python/3.0.0/">For Python</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/ubuntu/">For Ubuntu</a></li>
</ul>
</li>
<li><a href="https://github.com/apache/arrow/releases/tag/apache-arrow-3.0.0">Git tag</a></li>
</ul>
<h2 id="contributors">Contributors</h2>
<p>This release includes 648 commits from 106 distinct contributors.</p>
<div class="language-console highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="gp">$</span><span class="w"> </span>git shortlog <span class="nt">-sn</span> apache-arrow-2.0.0..apache-arrow-3.0.0
<span class="go"> 71 Jorge C. Leitao
64 Sutou Kouhei
48 Antoine Pitrou
48 Heres, Daniel
27 Andy Grove
27 Neville Dipale
24 Joris Van den Bossche
19 Neal Richardson
15 Benjamin Kietzman
14 Carol (Nichols || Goulding)
14 Uwe L. Korn
12 Jonathan Keane
12 Krisztián Szűcs
11 Andrew Lamb
11 Mike Seddon
11 mqy
10 Kazuaki Ishizaki
10 Qingping Hou
8 Jörn Horstmann
8 Yibo Cai
8 alamb
7 Andrew Wieteska
7 Ian Cook
7 Mahmut Bulut
6 Dmitry Patsura
6 Projjal Chanda
6 liyafan82
6 rdettai
5 Daniel Russo
5 David Li
5 Florian Müller
5 Jacob Quinn
5 James Duong
4 Micah Kornfield
4 Weston Pace
3 Jimmy Lu
3 Maarten A. Breddels
3 Terence D. Honles
3 Will Jones
2 Bei
2 Benjamin Wilhelm
2 Christian Lundgren
2 Daniël Heres
2 Evan Chan
2 Fokko Driesprong
2 François Garillot
2 Greg Bowyer
2 Kirill Lykov
2 Kyle Porter
2 Lyndon Bauto
2 Markus Westerlind
2 Sagnik Chakraborty
2 Sean Moran
2 Wes McKinney
2 Yordan Pavlov
2 jensenrichardson
2 wayne
2 Östman Alexander
1 Bruno LE HYARIC
1 Chiyang Wan
1 Christoph Schulze
1 Diana Clarke
1 Eric Erhardt
1 Erik Fonselius
1 Frank Du
1 Frank Smith
1 Gidon Gershinsky
1 H-Plus-Time
1 Hongze Zhang
1 Jake Goulding
1 Jeremy
1 Johan Peltenburg
1 Jorge Leitao
1 Josiah
1 Kopilov Aleksandr
1 Marc Prud'hommeaux
1 Marius van Niekerk
1 Matthew Pollock
1 Matthew Topol
1 Michal
1 Mingyu Zhong
1 Nick Bruno
1 Paddy Horan
1 Pavel Tiunov
1 Prudhvi Porandla
1 Radu Teodorescu
1 Romain Francois
1 Ruan Pearce-Authers
1 Ruihang Xia
1 Sebastian Berg
1 Tao He
1 Tyrel Rink
1 Ulimo
1 Vivian Kong
1 Weiyang Zhao
1 Zhuo Peng
1 arw2019
1 david
1 jonathan.albrecht
1 naman1996
1 rjzamora
1 shawnding(丁晓坤)
1 tching
1 tianchen
1 tifflhl
1 xudingyu
</span></code></pre></div></div>
<h2 id="patch-committers">Patch Committers</h2>
<p>The following Apache committers merged contributed patches to the repository.</p>
<div class="language-console highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="gp">$</span><span class="w"> </span>git shortlog <span class="nt">-csn</span> apache-arrow-2.0.0..apache-arrow-3.0.0
<span class="go"> 100 Andrew Lamb
88 Antoine Pitrou
76 Sutou Kouhei
70 Jorge C. Leitao
69 Neville Dipale
54 Krisztián Szűcs
44 Neal Richardson
40 Andy Grove
26 Benjamin Kietzman
19 Joris Van den Bossche
12 David Li
8 liyafan82
7 Uwe L. Korn
7 alamb
6 Micah Kornfield
5 Praveen
4 Bryan Cutler
4 Eric Erhardt
2 GitHub
2 Paddy Horan
2 Pindikura Ravindra
2 Wes McKinney
1 Sebastien Binet
</span></code></pre></div></div>
<h2 id="changelog">Changelog</h2>
<h2 id="apache-arrow-300-2021-01-26">Apache Arrow 3.0.0 (2021-01-26)</h2>
<h3 id="bug-fixes">Bug Fixes</h3>
<ul>
<li><a href="https://issues.apache.org/jira/browse/ARROW-2616">ARROW-2616</a> - [Python] Cross-compiling Pyarrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6582">ARROW-6582</a> - [R] Arrow to R fails with embedded nuls in strings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7363">ARROW-7363</a> - [Python] Add combine_chunks method to ChunkedArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7909">ARROW-7909</a> - [Website] Add how to install on Red Hat Enterprise Linux</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8258">ARROW-8258</a> - [Rust] [Parquet] ArrowReader fails on some timestamp types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9027">ARROW-9027</a> - [Python] Split in multiple files + clean-up pyarrow.parquet tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9479">ARROW-9479</a> - [JS] Table.from fails for zero-item Lists, FixedSizeLists, Maps. ditto Table.empty</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9636">ARROW-9636</a> - [Python] Update documentation about ‘LZO’ compression in parquet.write_table</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9776">ARROW-9776</a> - [R] read_feather causes segfault in R if file doesn’t exist</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9897">ARROW-9897</a> - [C++][Gandiva] Add to_date() function from pattern</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9897">ARROW-9897</a> - [C++][Gandiva] Add to_date() function from pattern</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9898">ARROW-9898</a> - [C++][Gandiva] Error handling in castINT fails in some enviroments</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9903">ARROW-9903</a> - [R] open_dataset freezes opening feather files on Windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9963">ARROW-9963</a> - [Python] Recognize datetime.timezone.utc as UTC on conversion python-&gt;pyarrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10039">ARROW-10039</a> - [Rust] Do not require memory alignment of buffers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10042">ARROW-10042</a> - [Rust] Buffer equalities may be incorrect</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10080">ARROW-10080</a> - [R] Arrow does not release unused memory</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10122">ARROW-10122</a> - [Python] Selecting one column of multi-index results in a duplicated value column.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10145">ARROW-10145</a> - [C++][Dataset] Assert integer overflow in partitioning falls back to string</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10146">ARROW-10146</a> - [Python] Parquet metadata to_dict raises attribute error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10174">ARROW-10174</a> - [Java] Reading of Dictionary encoded struct vector fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10177">ARROW-10177</a> - [CI][Gandiva] Nightly gandiva-jar-xenial fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10186">ARROW-10186</a> - [Rust] Tests fail when following instructions in README</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10247">ARROW-10247</a> - [C++][Dataset] Cannot write dataset with dictionary column as partition field</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10264">ARROW-10264</a> - [C++][Python] Parquet test failing with HadoopFileSystem URI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10270">ARROW-10270</a> - [R] Fix CSV timestamp_parsers test on R-devel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10283">ARROW-10283</a> - [Python] Python deprecation warning for “PY_SSIZE_T_CLEAN will be required for ‘#’ formats”</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10293">ARROW-10293</a> - [Rust] [DataFusion] Fix benchmarks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10294">ARROW-10294</a> - [Java] Resolve problems of DecimalVector APIs on ArrowBufs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10321">ARROW-10321</a> - [C++] Building AVX512 code when we should not</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10333">ARROW-10333</a> - [Java] Remove split packages in arrow-memory-core and arrow-vectors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10345">ARROW-10345</a> - [C++] NaN breaks sorting</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10346">ARROW-10346</a> - [Python] Default S3 region is eu-central-1 even with LANG=C</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10348">ARROW-10348</a> - [C++] Fix crash on invalid Parquet file (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10350">ARROW-10350</a> - [Rust] parquet_derive crate cannot be published to crates.io</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10353">ARROW-10353</a> - [C++] Parquet decompresses DataPageV2 pages even if is_compressed==0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10358">ARROW-10358</a> - [R] Followups to 2.0.0 release</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10365">ARROW-10365</a> - [R] Remove duplicate setting of S3 flag on macOS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10369">ARROW-10369</a> - [Dev] Fix archery release utility test cases</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10370">ARROW-10370</a> - [Python] Spurious s3fs-related test failures</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10371">ARROW-10371</a> - [R] Linux system requirements check needs to support older cmake versions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10386">ARROW-10386</a> - [R] List column class attributes not preserved in roundtrip</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10388">ARROW-10388</a> - [Java] Fix Spark integration build failure</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10390">ARROW-10390</a> - [Rust] [Parquet] Regression Can not implement custom ParquetWriter because `TryClone` is not publically exported</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10393">ARROW-10393</a> - [Rust]: Fix null value reading in jsonreader for both dictionary and stringbuilders</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10394">ARROW-10394</a> - [Rust] [Large]BinaryArray can be created from non-binary datatypes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10397">ARROW-10397</a> - [C++] Outdated and confusing comment on dictionary indices</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10399">ARROW-10399</a> - [R] Fix performance regression from cpp11::r_string</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10411">ARROW-10411</a> - [C++] Fix incorrect child array lengths for Concatenate of FixedSizeList</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10412">ARROW-10412</a> - [C++] CMake Build Fails with grpc 1.33.1, “GRPC_CPP_PLUGIN-NOTFOUND: program not found or is not executable”</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10413">ARROW-10413</a> - [Rust] [Parquet] Unignore some roundtrip tests that are passing now</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10414">ARROW-10414</a> - [R] open_dataset doesn’t work with absolute/expanded paths on Windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10426">ARROW-10426</a> - [C++] Arrow type large_string cannot be written to Parquet type column descriptor</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10433">ARROW-10433</a> - [Python] pyarrow doesn’t work with s3fs&gt;=0.5</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10434">ARROW-10434</a> - [Rust] Debug formatting arrays with lengths greater than 10 and less than 20 produces incorrect values</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10441">ARROW-10441</a> - [FlightRPC][Java] FlightClients from FlightGrpcUtils#createFlightClient shutdown gRPC channel when closed</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10446">ARROW-10446</a> - [C++][Python] Timezone aware pd.Timestamp’s are incorrectly converted to Timestamp arrys</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10448">ARROW-10448</a> - [Rust] PrimitiveArray::new can create arrays not in spec</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10453">ARROW-10453</a> - [Rust] [DataFusion] Performance degredation after removing specialization</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10457">ARROW-10457</a> - [CI] Fix Spark branch-3.0 integration tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10461">ARROW-10461</a> - [Rust] Offset related bug in BitChunks::remainder_bits</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10462">ARROW-10462</a> - [Python] ParquetDatasetPiece’s path broken when using fsspec fs on Windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10463">ARROW-10463</a> - [R] Better messaging for currently unsupported CSV options in open_dataset</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10470">ARROW-10470</a> - [R] Fix missing file error causing NYC taxi example to fail</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10471">ARROW-10471</a> - [CI][Python] Ensure we have tests with s3fs and run those on CI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10472">ARROW-10472</a> - [C++][Python] casting a scalar timestamp to date32 results in Aborted (core dump)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10475">ARROW-10475</a> - [С++][FlightRPC] Arrow Flight Server / Client cannot be initialized with Ipv6 host</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10480">ARROW-10480</a> - [Python] Parquet write_table creates gzipped Parquet file, not Parquet with gzip compression</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10482">ARROW-10482</a> - [Python] Specifying compression type on a column basis when writing Parquet not working</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10489">ARROW-10489</a> - [C++] Unable to configure or make with intel compiler</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10489">ARROW-10489</a> - [C++] Unable to configure or make with intel compiler</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10491">ARROW-10491</a> - [FlightRPC][Java] Fix NPE when using FlightProducer without interceptors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10493">ARROW-10493</a> - [C++][Parquet] Writing nullable nested strings results in wrong data in file</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10495">ARROW-10495</a> - [C++] find_package(Arrow) is broken on Ubuntu 18</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10496">ARROW-10496</a> - [R][CI] Fix conda-r job</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10499">ARROW-10499</a> - [C++][Java] Fix ORC Java JNI Crash</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10502">ARROW-10502</a> - [C++/Python] CUDA detection messes up nightly conda-win builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10503">ARROW-10503</a> - [C++] Uriparser will not compile using Intel compiler</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10508">ARROW-10508</a> - [Java] Allow FixedSizeListVector to have empty children</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10509">ARROW-10509</a> - [C++] Define operator«(ostream, ParquetException) for clang+Windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10511">ARROW-10511</a> - [Python] Table.to_pandas() failing when timezone-awareness mismatch in metadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10518">ARROW-10518</a> - Fix cast function issues in gandiva</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10519">ARROW-10519</a> - [Python] Deadlock when PyArrow imports Pandas from multiple threads</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10525">ARROW-10525</a> - [C++] Fix crash on unsupported IPC stream (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10532">ARROW-10532</a> - [Python] Mangled pandas_metadata when specified schema has different order as DataFrame columns</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10545">ARROW-10545</a> - [C++] Fix crash on invalid Parquet file (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10546">ARROW-10546</a> - [Python] Deprecate the S3FSWrapper class</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10547">ARROW-10547</a> - [Rust][DataFusion] Filter pushdown loses filters if below a user defined node</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10551">ARROW-10551</a> - [Rust]: Fix unreproducible benchmarks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10558">ARROW-10558</a> - [Python] Filesystem S3 tests not independent (native s3 influences s3fs)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10560">ARROW-10560</a> - [Python] Crash when creating array with string over 2GB</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10563">ARROW-10563</a> - [Packaging][C++] CMake find_package(Arrow 2.0 CONFIG REQUIRED) broken</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10565">ARROW-10565</a> - [Python] Table.from_batches and Table.from_pandas have argument Schema_schema in documentation instead of schema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10568">ARROW-10568</a> - [C++][Parquet] Parquet writer crashes process when Tell() does not succeed</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10569">ARROW-10569</a> - [C++][Python] Poor Table filtering performance</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10577">ARROW-10577</a> - [Rust][DataFusion] Hash Aggregator stream finishes unexpectedly after going to Pending state</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10578">ARROW-10578</a> - [C++] Comparison kernels crashing for string array with null string scalar</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10610">ARROW-10610</a> - [C++] arrow-utility-test and arrow-csv-test causes failures on a big-endian platform</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10616">ARROW-10616</a> - [Developer] Expand PR labeler to all supported languages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10617">ARROW-10617</a> - [Python] RecordBatchStreamReader’s iterator doesn’t work with python 3.8</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10619">ARROW-10619</a> - [C++] Fix crash on unsupported IPC stream (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10620">ARROW-10620</a> - [Rust][Parquet] move column chunk range logic to metadata.rs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10621">ARROW-10621</a> - [Java] flight-cpre test causes a failure on s390x</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10622">ARROW-10622</a> - [R] Nameof&lt;&gt;() is incorrect in r-arrow build environment</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10623">ARROW-10623</a> - [R] Version 1.0.1 breaks data.frame attributes when reading file written by 2.0.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10624">ARROW-10624</a> - [R] Proactively remove “problems” attributes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10627">ARROW-10627</a> - [Rust] Github master does not compile for WASM target</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10629">ARROW-10629</a> - [CI] MinGW builds broken on Github Actions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10631">ARROW-10631</a> - [Rust] Equality of fixed-sized binary is incorrect.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10642">ARROW-10642</a> - [R] Can’t get Table from RecordBatchReader with 0 batches</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10656">ARROW-10656</a> - [Rust] New RecordBatch requires exact match of Data Types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10656">ARROW-10656</a> - [Rust] New RecordBatch requires exact match of Data Types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10661">ARROW-10661</a> - [C#] Fix benchmarking project</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10662">ARROW-10662</a> - [Java] Avoid integer overflow for Json file reader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10663">ARROW-10663</a> - [C++/Doc] The IsIn kernel ignores the skip_nulls option of SetLookupOptions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10667">ARROW-10667</a> - [Rust] [Parquet] Add a convenience type for writing Parquet to memory</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10668">ARROW-10668</a> - [R] Filtering does not work with .data pronoun</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10681">ARROW-10681</a> - [Rust] [DataFusion] TPC-H Query 12 fails with scheduler error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10684">ARROW-10684</a> - [Rust] Logical equality should consider parent array nullability</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10690">ARROW-10690</a> - [Java] ComplexCopier gives incorrect result for list vector if target vector is non-empty</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10692">ARROW-10692</a> - [Rust] Segfault while array buffer append</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10699">ARROW-10699</a> - [C++] BitmapUInt64Reader doesn’t work on big-endian</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10701">ARROW-10701</a> - [Rust] [Datafusion] Benchmark sort_limit_query_sql fails because order by clause specifies column index instead of expression</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10705">ARROW-10705</a> - [Rust] Lifetime annotations in the IPC writer are too strict, preventing code reuse</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10710">ARROW-10710</a> - [Rust] Example flight server is broken after tokio upgrade (among other things)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10711">ARROW-10711</a> - [CI] Remove set-env from auto-tune to work with new GHA settings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10719">ARROW-10719</a> - [C#] ArrowStreamWriter doesn’t write schema metadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10746">ARROW-10746</a> - [C++] Use GTEST_SKIP in parquet encoding tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10748">ARROW-10748</a> - [Java] TimeStampMilliVector cannot be cast to TimeStampMilliTZVector</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10749">ARROW-10749</a> - [C++] Incorrect string format for Datum with the collection type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10751">ARROW-10751</a> - [C++] Add RE2 to minimal build example</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10753">ARROW-10753</a> - [Rust] [DataFusion] Negative numbers in SQL WHERE clause not parsed correctly</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10757">ARROW-10757</a> - [Rust] [CI] Sporadic failures due to disk filling up</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10760">ARROW-10760</a> - [Rust] [DataFusion] Predicate push down does not support joins correctly</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10769">ARROW-10769</a> - [CI] Integration tests are failing in master</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10774">ARROW-10774</a> - [R] Set minimum cpp11 version</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10777">ARROW-10777</a> - [Packaging][Python] PyPI pyarrow source dist (sdist) contains architecture dependent binaries</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10778">ARROW-10778</a> - [Python] RowGroupInfo.statistics errors for empty row group</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10779">ARROW-10779</a> - [Java] writeNull method in UnionListWriter doesn’t work correctly if validity at that index is already set</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10780">ARROW-10780</a> - [R] Update known R installation issues for CentOS 7</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10791">ARROW-10791</a> - [Rust] StreamReader, read_dictionary duplicating schema info</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10801">ARROW-10801</a> - [Rust] [Flight] Support sending FlightData for Dictionaries with that of a RecordBatch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10803">ARROW-10803</a> - [R] Support R &gt;= 3.3 and add CI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10804">ARROW-10804</a> - [Rust] Remove UB on parquet crate</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10807">ARROW-10807</a> - [Rust][DataFusion] Avoid double hashing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10810">ARROW-10810</a> - [Rust] Speed up comparison kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10811">ARROW-10811</a> - [R][CI] Remove nightly centos6 build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10823">ARROW-10823</a> - MutableArrayData with use_null false yields wrong results</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10830">ARROW-10830</a> - [Rust] json reader should not hard crash on invalid json</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10833">ARROW-10833</a> - [Python] Avoid usage of NumPy’s PyArray_DescrCheck macro</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10834">ARROW-10834</a> - [R] Fix print method for SubTreeFileSystem</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10837">ARROW-10837</a> - [Rust] Use `Vec&lt;u8&gt;` for hash key instead</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10840">ARROW-10840</a> - [C++] Parquet FileMetaData does not have key_value_metadata when built from FileMetaDataBuilder</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10842">ARROW-10842</a> - [Rust] decouple IO from json schema inference code</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10844">ARROW-10844</a> - [Rust] [DataFusion] join of two DataFrames is not possible</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10850">ARROW-10850</a> - [R] Unrecognized compression type: LZ4</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10852">ARROW-10852</a> - [C++] AssertTablesEqual(verbose=true) segfaults if the left array has more rows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10854">ARROW-10854</a> - [Rust] [DataFusion] Simplified logical scans</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10855">ARROW-10855</a> - [Python][Numpy] ArrowTypeError after upgrading NumPy to 1.20.0rc1</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10856">ARROW-10856</a> - [R] CentOS 7 not correctly identifying compiler version</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10859">ARROW-10859</a> - [Rust] [DataFusion] Make collect not require ExecutionContext</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10860">ARROW-10860</a> - [Java] Avoid integer overflow for generated classes in Vector</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10863">ARROW-10863</a> - [Python] ExtensionArray.to_pandas not working</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10863">ARROW-10863</a> - [Python] ExtensionArray.to_pandas not working</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10875">ARROW-10875</a> - simplify simd cfg check</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10876">ARROW-10876</a> - [Rust] json reader should validate value type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10897">ARROW-10897</a> - [Rust] Replace Arc&lt;String&gt; by String in DataType::Timestamp</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10907">ARROW-10907</a> - [Rust] Cast UTF8 to Date64 Incorrect</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10913">ARROW-10913</a> - [Python][Doc] Code block typo in filesystems docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10914">ARROW-10914</a> - [Rust]: SIMD implementation of arithmetic kernels reads out of bounds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10915">ARROW-10915</a> - [Rust] Make ARROW_TEST_DATA and PARQUET_TEST_DATA absolute dirs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10921">ARROW-10921</a> - `TypeError: ‘coroutine’ object is not iterable` when reading parquet partitions via s3fs &gt;= 0.5 with pyarrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10930">ARROW-10930</a> - [Python] LargeListType doesn’t have a value_field</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10932">ARROW-10932</a> - [C++] BinaryMemoTable::CopyOffsets access out-of-bound address when data is empty</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10932">ARROW-10932</a> - [C++] BinaryMemoTable::CopyOffsets access out-of-bound address when data is empty</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10942">ARROW-10942</a> - [C++] S3FileSystem::Impl::IsEmptyDirectory fails on Amazon S3</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10943">ARROW-10943</a> - [Rust] Intermittent build failure in parquet encoding</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10954">ARROW-10954</a> - [C++][Doc] PlasmaClient is threadSafe now, doc not update</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10955">ARROW-10955</a> - [C++] Reading empty json lists results in invalid non-nullable null type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10960">ARROW-10960</a> - [C++][FlightRPC] Missing protobuf data_body should result in default value of empty bytes, not null</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10962">ARROW-10962</a> - [Java][FlightRPC] FlightData deserializer should accept missing fields</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10967">ARROW-10967</a> - [Rust] Make env vars ARROW_TEST_DATA and PARQUET_TEST_DATA optional</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10990">ARROW-10990</a> - [Rust]: SIMD implementation of compare kernels reads out of bounds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10994">ARROW-10994</a> - [Rust] Fix bugs in TPC-H file conversion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10996">ARROW-10996</a> - [Rust] Return error messages via Result for get_arrow_schema_from_metadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10999">ARROW-10999</a> - [Rust] TPC-H parquet files cannot be read by Apache Spark</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11014">ARROW-11014</a> - [Rust] [DataFusion] ParquetExec reports incorrect statistics</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11023">ARROW-11023</a> - [C++][CMake] gRPC doesn’t respect CMAKE_CXX_COMPILER</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11024">ARROW-11024</a> - [C++][Parquet] Writing List&lt;Struct&gt; to parquet sometimes writes wrong data</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11025">ARROW-11025</a> - [Rust] Bench for boolean kernels measure array creation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11030">ARROW-11030</a> - [Rust] [DataFusion] HashJoinExec slow with many batches</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11048">ARROW-11048</a> - [Rust] Add bench to MutableBuffer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11050">ARROW-11050</a> - [R] Handle RecordBatch in write_parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11067">ARROW-11067</a> - [C++] CSV reader returns nulls for some strings on macOS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11069">ARROW-11069</a> - [C++] Parquet writer incorrect data being written when data type is struct</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11073">ARROW-11073</a> - [Rust] Lint Error on CI Tests in /arrow/rust/arrow/src/ipc/reader.rs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11083">ARROW-11083</a> - [CI] Build “Source Release and Merge Script” is broken</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11084">ARROW-11084</a> - [Rust] Clippy failing in master</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11085">ARROW-11085</a> - [Rust] Rust CI no longer works b/c it uses action-rs: Migrate CI away from action-rs/*</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11092">ARROW-11092</a> - [CI] (Temporarily) move offending workflows to separate files</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11102">ARROW-11102</a> - [Rust][DataFusion] fmt::Debug for ScalarValue(Utf8) is always quoted</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11113">ARROW-11113</a> - [Rust] support as_struct_array cast</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11114">ARROW-11114</a> - [Java] Metadata serialization is broken for Field class</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11132">ARROW-11132</a> - [CI] Use pip to install crossbow’s dependencies for the comment bot</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11144">ARROW-11144</a> - [C++][Python][CI] Fix HDFS nightly build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11152">ARROW-11152</a> - [CI][C++] Fix Homebrew numpy installation on macOS builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11162">ARROW-11162</a> - [C++] Fix crash on Decimal256 Parquet file (OSS-Fuzz)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11163">ARROW-11163</a> - [C++][Python] Compressed Feather file written with pyarrow 0.17 not readable in pyarrow 2.0.0+</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11166">ARROW-11166</a> - [Python][Compute] Add bindings for ProjectOptions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11171">ARROW-11171</a> - [Go] Build fails on s390x with noasm tag</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11189">ARROW-11189</a> - [Developer] Achery benchmark diff cannot compare two jsons</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11190">ARROW-11190</a> - [C++][Dataset] Clean up compiler warnings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11202">ARROW-11202</a> - [R][CI] Nightly builds not happening (or artifacts not exported)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11224">ARROW-11224</a> - [R] don’t test metadata serialization on old R versions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11226">ARROW-11226</a> - [Python][CI] Filesystem tests failing with s3fs 0.5.2</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11227">ARROW-11227</a> - [Python][CI] AMD64 Conda Python 3.7 Pandas 0.24 cron job failing in to_pandas extension dtype test</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11229">ARROW-11229</a> - [C++][Dataset] Static build is failed</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11230">ARROW-11230</a> - [R] Fix build failures on Windows when multiple libarrow binaries found</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11232">ARROW-11232</a> - [C++] Table::CombineChunks() returns incorrect results if Table has no column</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11233">ARROW-11233</a> - [C++][Flight] Fail to link with bundled gRPC and Abseil</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11237">ARROW-11237</a> - [C++] Compiler error with GLog and unity build enabled</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11251">ARROW-11251</a> - [CI] Make sure that devtoolset-8 is really installed + being used</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11253">ARROW-11253</a> - [R] Make sure that large metadata tests are reproducible</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11255">ARROW-11255</a> - [Packaging][Conda][macOS] Fix Python version</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11257">ARROW-11257</a> - [C++][Parquet] PyArrow Table contains different data after writing and reloading from Parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11271">ARROW-11271</a> - [Rust] [Parquet] List schema to Arrow parser misinterpreting child nullability</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11274">ARROW-11274</a> - [Packaging][wheel][Windows] Fix wheels path for Gemfury</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11275">ARROW-11275</a> - [Packaging][wheel][Linux] Fix paths for Gemfury</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11277">ARROW-11277</a> - [C++] Fix compilation error in dataset expressions on macOS 10.11</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11283">ARROW-11283</a> - [Julia] Fix install link</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11286">ARROW-11286</a> - [Release][Yum] Fix minimal build example check</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11287">ARROW-11287</a> - [Packaging][RPM] Add missing dependencies</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11299">ARROW-11299</a> - [Python] build warning in python</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11301">ARROW-11301</a> - [C++] Fix reading LZ4-compressed Parquet files produced by Java Parquet implementation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11302">ARROW-11302</a> - [Release][Python] Remove verification of python 3.5 wheel on macOS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11303">ARROW-11303</a> - [Release][C++] Enable mimalloc in the windows verification script</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11305">ARROW-11305</a> - [Rust]: parquet-rowcount binary tries to open itself as a parquet file</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11306">ARROW-11306</a> - [Packaging][Ubuntu][16.04] Add missing libprotobuf-dev dependency</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11320">ARROW-11320</a> - [C++] Spurious test failure when creating temporary dir</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11322">ARROW-11322</a> - [Rust] Arrow `memory` made private is a breaking API change</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1935">PARQUET-1935</a> - [C++][Parquet] nullptr access violation when writing arrays of non-nullable values</li>
</ul>
<h3 id="new-features-and-improvements">New Features and Improvements</h3>
<ul>
<li><a href="https://issues.apache.org/jira/browse/ARROW-1846">ARROW-1846</a> - [C++] Implement “any” reduction kernel for boolean data</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3850">ARROW-3850</a> - [Python] Support MapType and StructType for enhanced PySpark integration</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4193">ARROW-4193</a> - [Rust] Add support for decimal data type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4544">ARROW-4544</a> - [Rust] Read nested JSON structs into StructArrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4804">ARROW-4804</a> - [Rust] Read temporal values from CSV - Parse Date32 and Date64 in CSV reader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4960">ARROW-4960</a> - [R] Add crossbow task for r-arrow-feedstock</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-4970">ARROW-4970</a> - [C++][Parquet] Implement parquet::FileMetaData::Equals</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5336">ARROW-5336</a> - [C++] Implement arrow::Concatenate for dictionary-encoded arrays with unequal dictionaries</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5350">ARROW-5350</a> - [Rust] Support filtering on primitive/string lists</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5394">ARROW-5394</a> - [C++] Benchmarks for IsIn Kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5679">ARROW-5679</a> - [Python] Drop Python 3.5 from support matrix</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5950">ARROW-5950</a> - [Rust] [DataFusion] Add ability to log via logger dependency</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6071">ARROW-6071</a> - [C++] Implement casting Binary &lt;-&gt; LargeBinary</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6697">ARROW-6697</a> - [Rust] [DataFusion] Validate that all parquet partitions have the same schema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6715">ARROW-6715</a> - [Website] Describe “non-free” component is needed for Plasma packages in install page</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6883">ARROW-6883</a> - [C++] Support sending delta DictionaryBatch or replacement DictionaryBatch in IPC stream writer class</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6995">ARROW-6995</a> - [Packaging][Crossbow] The windows conda artifacts are not uploaded to GitHub releases</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7531">ARROW-7531</a> - [C++] Investigate header cost reduction</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7633">ARROW-7633</a> - [C++][CI] Create fuzz targets for tensors and sparse tensors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7800">ARROW-7800</a> - [Python] Expose GetRecordBatchReader API in PyArrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7842">ARROW-7842</a> - [Rust] [Parquet] Implement array reader for list type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8113">ARROW-8113</a> - [C++] Implement a lighter-weight variant</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8199">ARROW-8199</a> - [C++] Add support for multi-column sort on Table</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8289">ARROW-8289</a> - [Rust] [Parquet] Implement minimal Arrow Parquet writer as starting point for full writer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8423">ARROW-8423</a> - [Rust] [Parquet] Serialize arrow schema into metadata when writing parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8425">ARROW-8425</a> - [Rust] [Parquet] Add support for writing temporal types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8426">ARROW-8426</a> - [Rust] [Parquet] Add support for writing dictionary types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8426">ARROW-8426</a> - [Rust] [Parquet] Add support for writing dictionary types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8853">ARROW-8853</a> - [Rust] [Integration Testing] Enable Flight tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8876">ARROW-8876</a> - [C++] Implement casts from date types to Timestamp</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-8883">ARROW-8883</a> - [Rust] [Integration Testing] Enable passing tests and update spec doc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9001">ARROW-9001</a> - [R] Box outputs as correct type in call_function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9128">ARROW-9128</a> - [C++] Implement string space trimming kernels: trim, ltrim, and rtrim</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9164">ARROW-9164</a> - [C++] Provide APIs for adding “docstrings” to arrow::compute::Function classes that can be accessed by bindings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9187">ARROW-9187</a> - [R] Add bindings for arithmetic kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9296">ARROW-9296</a> - [CI][Rust] Enable more clippy lint checks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9304">ARROW-9304</a> - [C++] Add “AppendEmptyValue” builder APIs for use inside StructBuilder::AppendNull</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9361">ARROW-9361</a> - [Rust] Move other array types into their own modules</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9400">ARROW-9400</a> - [Python] Do not depend on conda-forge static libraries in Windows wheel builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9475">ARROW-9475</a> - [Java] Clean up usages of BaseAllocator, use BufferAllocator instead</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9489">ARROW-9489</a> - [C++] Add fill_null kernel implementation for (array[string], scalar[string])</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9555">ARROW-9555</a> - [Rust] [DataFusion] Add inner (hash) equijoin physical plan</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9564">ARROW-9564</a> - [Packaging] Vendor r-arrow-feedstock conda-forge recipe</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9674">ARROW-9674</a> - [Rust] Parquet reader should implement Send + Sync</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9704">ARROW-9704</a> - [Java] TestEndianness.testLittleEndian fails on big endian platform</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9707">ARROW-9707</a> - [Rust] [DataFusion] Re-implement threading model</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9709">ARROW-9709</a> - [Java] Test cases in arrow-vector assume little-endian platform</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9728">ARROW-9728</a> - [Rust] [Parquet] Compute nested definition and repetition for structs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9747">ARROW-9747</a> - [C++][Java][Format] Support Decimal256 Type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9771">ARROW-9771</a> - [Rust] [DataFusion] Predicate Pushdown Improvement: treat predicates separated by AND separately</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9803">ARROW-9803</a> - [Go] Add initial support for s390x</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9804">ARROW-9804</a> - [FlightRPC] Authentication Redesign</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9828">ARROW-9828</a> - [Rust] [DataFusion] TableProvider trait should support predicate push-down</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9861">ARROW-9861</a> - [Java] Failed Arrow Vector on big-endian platform</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9862">ARROW-9862</a> - Throw an exception in UnsafeDirectLittleEndian on Big-Endian platform</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9911">ARROW-9911</a> - [Rust][DataFusion] SELECT &lt;expression&gt; with no FROM clause should produce a single row of output</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9945">ARROW-9945</a> - [C++][Dataset] Refactor Expression::Assume to return a Result</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9991">ARROW-9991</a> - [C++] split kernels for strings/binary</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10002">ARROW-10002</a> - [Rust] Trait-specialization requires nightly</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10021">ARROW-10021</a> - [C++][Compute] Support finding nth frequently used value in mode kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10032">ARROW-10032</a> - [Documentation] C++ Windows docs are out of date</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10079">ARROW-10079</a> - [Rust]: Benchmark and improve count_set_bits function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10095">ARROW-10095</a> - [Rust] [Parquet] Update for IPC changes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10097">ARROW-10097</a> - [C++] Persist SetLookupState in between usages of IsIn when filtering dataset batches</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10106">ARROW-10106</a> - [FlightRPC][Java] Expose onIsReady() callback on OutboundStreamListener</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10108">ARROW-10108</a> - [Rust] [Parquet] Fix compiler warning about unused return value</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10109">ARROW-10109</a> - [Rust] Add support to produce a C Data interface</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10110">ARROW-10110</a> - [Rust] Add support to consume C Data Interface</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10131">ARROW-10131</a> - [C++][Dataset] Lazily parse parquet metadata / statistics in ParquetDatasetFactory and ParquetFileFragment</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10135">ARROW-10135</a> - [Rust] [Parquet] Refactor file module to help adding sources</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10143">ARROW-10143</a> - [C++] ArrayRangeEquals should accept EqualOptions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10144">ARROW-10144</a> - [Flight] Add support for using the TLS_SNI extension</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10149">ARROW-10149</a> - [Rust] Add support to external release of un-owned buffers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10163">ARROW-10163</a> - [Rust] [DataFusion] Add DictionaryArray coercion support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10168">ARROW-10168</a> - [Rust] [Parquet] Extend arrow schema conversion to projected fields</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10173">ARROW-10173</a> - [Rust][DataFusion] Improve performance of equality to a constant predicate support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10180">ARROW-10180</a> - [C++][Doc] Update dependency management docs following aws-sdk-cpp addition</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10182">ARROW-10182</a> - [C++] Add basic continuation support to futures</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10191">ARROW-10191</a> - [Rust] [Parquet] Add roundtrip tests for single column batches</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10197">ARROW-10197</a> - [Gandiva][python] Execute expression on filtered data</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10203">ARROW-10203</a> - [Doc] Capture guidance for endianness support in contributors guide.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10207">ARROW-10207</a> - [C++] Unary kernels that results in a list have no preallocated offset buffer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10208">ARROW-10208</a> - [C++] String split kernels do not propagate nulls correctly on sliced input</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10216">ARROW-10216</a> - [Rust] Simd implementation of min/max aggregation kernels for primitive types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10224">ARROW-10224</a> - [Python] Add support for Python 3.9 except macOS wheel and Windows wheel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10225">ARROW-10225</a> - [Rust] [Parquet] Fix null bitmap comparisons in roundtrip tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10228">ARROW-10228</a> - [Julia] Donate Julia Implementation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10236">ARROW-10236</a> - [Rust] [DataFusion] Make DataFusion casting rules consistent with cast kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10241">ARROW-10241</a> - [C++][Compute] Add variance kernel benchmark</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10249">ARROW-10249</a> - [Rust]: Support Dictionary types for ListArrays in arrow json reader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10259">ARROW-10259</a> - [Rust] Support field metadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10261">ARROW-10261</a> - [Rust] [BREAKING] Lists should take Field instead of DataType</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10263">ARROW-10263</a> - [C++][Compute] Improve numerical stability of variances merging</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10268">ARROW-10268</a> - [Rust] Support writing dictionaries to IPC file and stream</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10269">ARROW-10269</a> - [Rust] Update nightly: Oct 2020 Edition</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10277">ARROW-10277</a> - [C++] Support comparing scalars approximately</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10289">ARROW-10289</a> - [Rust] Support reading dictionary streams</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10292">ARROW-10292</a> - [Rust] [DataFusion] Simplify merge</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10295">ARROW-10295</a> - [Rust] [DataFusion] Simplify accumulators</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10299">ARROW-10299</a> - [Rust] Support reading and writing V5 of IPC metadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10300">ARROW-10300</a> - [Rust] Improve benchmark documentation for generating/converting TPC-H data</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10301">ARROW-10301</a> - [C++] Add “all” boolean reducing kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10302">ARROW-10302</a> - [Python] Don’t double-package plasma-store-server</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10304">ARROW-10304</a> - [C++][Compute] Optimize variance kernel for integers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10310">ARROW-10310</a> - [C++][Gandiva] Add single argument round() in Gandiva</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10311">ARROW-10311</a> - [Release] Update crossbow verification process</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10313">ARROW-10313</a> - [C++] Improve UTF8 validation speed and CSV string conversion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10318">ARROW-10318</a> - [C++] Use pimpl idiom in CSV parser</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10319">ARROW-10319</a> - [Flight][Go] Add Context to Client Auth Handler functions for Flight</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10320">ARROW-10320</a> - [Rust] Convert RecordBatchIterator to a Stream</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10322">ARROW-10322</a> - [C++][Dataset] Minimize Expression to a wrapper around compute::Function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10323">ARROW-10323</a> - [Release][wheel] Add missing verification setup step</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10325">ARROW-10325</a> - [C++][Compute] Separate aggregate kernel registration</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10328">ARROW-10328</a> - [C++] Consider using fast-double-parser</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10330">ARROW-10330</a> - [Rust][Datafusion] Implement nullif() function for DataFusion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10331">ARROW-10331</a> - [Rust] [DataFusion] Re-organize errors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10332">ARROW-10332</a> - [Rust] Allow CSV reader to start from a line</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10334">ARROW-10334</a> - [Rust] [Parquet] Support reading and writing Arrow NullArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10336">ARROW-10336</a> - [Rust] Support fromIter and toIter for strings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10337">ARROW-10337</a> - [C++] More liberal parsing of ISO8601 timestamps with fractional seconds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10338">ARROW-10338</a> - [Rust]: Use const fn for applicable methods</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10340">ARROW-10340</a> - [Packaging][deb][RPM] Use Python 3.8 for pygit2</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10356">ARROW-10356</a> - [Rust] [DataFusion] Add support for is_in</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10363">ARROW-10363</a> - [Python] Remove workaround for CMake bug in manylinux</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10366">ARROW-10366</a> - [Rust] [DataFusion] Remove collect from merge</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10375">ARROW-10375</a> - [Rust] Remove PrimitiveArrayOps</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10378">ARROW-10378</a> - [Rust] Update take() kernel with support for large lists</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10381">ARROW-10381</a> - [Rust] Generalize Arrow to support MergeSort</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10382">ARROW-10382</a> - [Rust] Fix typos and spelling</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10383">ARROW-10383</a> - [Doc] Fix typos and spelling</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10384">ARROW-10384</a> - [C++] Fix typos and spelling</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10385">ARROW-10385</a> - [C++][Gandiva] Add support for LLVM 11</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10389">ARROW-10389</a> - [Rust][DataFusion] Make the custom source implementation API more explicit</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10392">ARROW-10392</a> - [C++][Gandiva] Avoid string copy while evaluating IN expression</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10396">ARROW-10396</a> - [Rust] [Parquet] Expose SliceableCursor and FileSource</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10398">ARROW-10398</a> - [Rust] [Parquet] Re-export parquet::record::api::Field</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10400">ARROW-10400</a> - Propagate TLS client peer_identity when using mutual TLS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10402">ARROW-10402</a> - [Rust] Improve array equality</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10407">ARROW-10407</a> - [C++] Division Support in Decimal256</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10408">ARROW-10408</a> - [Java] Upgrade Avro dependency to 1.10</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10410">ARROW-10410</a> - [Rust] Some refactorings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10416">ARROW-10416</a> - [R] Support Tables in Flight</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10422">ARROW-10422</a> - [Rust] Removed unused BinaryArrayBuilder</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10424">ARROW-10424</a> - [Rust] Simplify code for impl PrimitiveArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10428">ARROW-10428</a> - [FlightRPC][Java] Add support for HTTP cookies</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10445">ARROW-10445</a> - [Rust] Add DoubleEnded to PrimitiveArrayIter</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10449">ARROW-10449</a> - [Rust] Make dictionary keys be a PrimitiveArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10454">ARROW-10454</a> - [Rust][Datafusion] support creating ParquetExec from externally resolved file list and schema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10455">ARROW-10455</a> - [Rust] Fix CI cache misses on windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10458">ARROW-10458</a> - [Rust] [Datafusion] context.create_logical_plan should not take a mutable self reference</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10464">ARROW-10464</a> - [Rust] Implement utility to convert TPC-H tbl files to CSV and Parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10466">ARROW-10466</a> - [Rust] [Website] Update implementation status page</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10467">ARROW-10467</a> - [FlightRPC][Java] Ability to pass arbitrary client properties to server</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10468">ARROW-10468</a> - [C++][Compute] Refactor FunctionExecutor -&gt; KernelExecutor</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10476">ARROW-10476</a> - [Rust] Allow string array to be built from iterator of &amp;str</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10477">ARROW-10477</a> - [Rust] Add support for iterators over binary arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10478">ARROW-10478</a> - [Dev][Release] Correct Java versions to 3.0.0-SNAPSHOT</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10481">ARROW-10481</a> - [R] Bindings to add, remove, replace Table columns</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10483">ARROW-10483</a> - [C++] Move Executor into a separate header</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10484">ARROW-10484</a> - [C++] Future&lt;{void,Status}&gt; could be more generic</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10487">ARROW-10487</a> - [FlightRPC][C++] Header-based auth in clients</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10490">ARROW-10490</a> - [C++][GLib] Fail to build with Xcode 12.0.1</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10492">ARROW-10492</a> - [Java][JDBC] Allow users to config the mapping between SQL types and Arrow types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10504">ARROW-10504</a> - [C++] Suppress UBSAN pointer-overflow warning in RapidJSON</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10510">ARROW-10510</a> - [Rust] [DataFusion] Add benchmarks for COUNT(DISTINCT)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10515">ARROW-10515</a> - [Julia][Doc] Update lists of supported languages to include Julia</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10522">ARROW-10522</a> - [R] Allow rename Table and RecordBatch columns with names()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10526">ARROW-10526</a> - [FlightRPC][C++] HTTP cookie handling in clients</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10530">ARROW-10530</a> - [R] Optionally use distro package in linuxlibs.R</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10531">ARROW-10531</a> - [Rust] [DataFusion] Better display for logical plans: Graphviz and Schema information</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10539">ARROW-10539</a> - [Packaging][Python] Use GitHub Actions to build wheels for Windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10540">ARROW-10540</a> - [Rust] Allow unary kernels of arbitrary array types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10541">ARROW-10541</a> - [C++] Add re2 library to core arrow / ARROW_WITH_RE2</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10542">ARROW-10542</a> - [C#][Flight] Add beginning on flight code for net core</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10543">ARROW-10543</a> - [Developer] Update dev instructions to note there may be a timelag</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10552">ARROW-10552</a> - [Rust] Remove un-used Result from Buffer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10559">ARROW-10559</a> - [Rust] [DataFusion] Break up logical_plan/mod.rs into smaller modules</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10561">ARROW-10561</a> - [Rust] Simplify `MutableBuffer::write` and `MutableBuffer::write_bytes`</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10562">ARROW-10562</a> - [Rust] Potential UB on unsafe code</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10566">ARROW-10566</a> - [C++] Array validation should work on ArrayData</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10567">ARROW-10567</a> - [C++][FlightRPC] Add options to help increase precision of arrow-flight-benchmark</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10572">ARROW-10572</a> - [Rust][DataFusion] Use aHash and std::collections hashmap for aggregates / distinct</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10574">ARROW-10574</a> - [Python][Parquet] Allow collections for ‘in’ / ‘not in’ filter (in addition to sets)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10575">ARROW-10575</a> - [Rust] Rename union.rs to be cosistent with other arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10581">ARROW-10581</a> - [Doc] IPC dictionary reference to relevant section</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10582">ARROW-10582</a> - [Rust] [DataFusion] Implement “repartition” operator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10584">ARROW-10584</a> - [Rust] [DataFusion] Implement SQL join support using explicit JOIN ON syntax</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10585">ARROW-10585</a> - [Rust] [DataFusion] Add join support to DataFrame and LogicalPlan</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10586">ARROW-10586</a> - [Rust] [DataFusion] Add join support to query planner</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10589">ARROW-10589</a> - [Rust]: Implement AVX-512 bit and operation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10590">ARROW-10590</a> - [Rust] Remove Date32(Millisecond) from test</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10591">ARROW-10591</a> - [Rust] Add support to structArrays for MutableArrayData</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10595">ARROW-10595</a> - [Rust] Simplify inner loop of min/max kernels for non-null case</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10596">ARROW-10596</a> - [Rust] Improve take benchmark</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10598">ARROW-10598</a> - [C++] Improve performance of GenerateBitsUnrolled</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10604">ARROW-10604</a> - [Ruby] Support Decimal256 type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10607">ARROW-10607</a> - [C++][Parquet] Support Reading/Writing Decimal256 type in Parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10609">ARROW-10609</a> - [Rust] Optimize min/max of non null strings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10628">ARROW-10628</a> - [Rust] Make clippy error on clippy warnings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10633">ARROW-10633</a> - [Rust][DataFusion] Dependency version upgrades</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10634">ARROW-10634</a> - [C#][CI] Change the build version from 2.2 to 3.1 in CI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10636">ARROW-10636</a> - [Rust] Remove specialisation from Rust parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10637">ARROW-10637</a> - [Rust] Add examples to boolean kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10638">ARROW-10638</a> - [Rust] Improve tests of boolean kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10639">ARROW-10639</a> - [Rust] Simplify signature of is_null and add example</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10644">ARROW-10644</a> - [Python] Consolidate path/filesystem handling in pyarrow.dataset and pyarrow.fs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10646">ARROW-10646</a> - [C++][FlightRPC] Disable flaky test</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10648">ARROW-10648</a> - [Java] Prepare Java codebase for source release without requiring any git tags to be created or pushed</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10651">ARROW-10651</a> - [C++] alloc-dealloc-mismatch in s3fs.cc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10652">ARROW-10652</a> - [C++][Gandiva] Make gandiva cache size configurable</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10653">ARROW-10653</a> - [Rust]: Update toolchain version to bring new features</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10654">ARROW-10654</a> - [Rust] Specialize parsing of floats / bools</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10660">ARROW-10660</a> - [Rust] Implement AVX-512 bit or operation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10665">ARROW-10665</a> - [Rust] Add fast paths for common utf8 like patterns</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10666">ARROW-10666</a> - [Rust] [DataFusion] Support nested SELECT statements</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10669">ARROW-10669</a> - [C++][Compute] Support Scalar inputs to boolean kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10672">ARROW-10672</a> - [Rust] [DataFusion] Make limit be computed as a stream</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10673">ARROW-10673</a> - [Rust] [DataFusion] Make sort be computed on the stream</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10674">ARROW-10674</a> - [Rust] Add integration tests for Decimal type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10677">ARROW-10677</a> - [Rust] Fix Bug and Add tests as documentation showing supported csv parsing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10679">ARROW-10679</a> - [Rust] [DataFusion] Implement SQL CASE WHEN physical expression</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10680">ARROW-10680</a> - [Rust] [DataFusion] Implement TPC-H Query 12</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10682">ARROW-10682</a> - [Rust] Sort kernel performance tuning</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10685">ARROW-10685</a> - [Rust] [DataFusion] Add support for join on filter pushdown optimizer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10688">ARROW-10688</a> - [Rust] [DataFusion] Support CASE WHEN from DataFrame API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10689">ARROW-10689</a> - [Rust] [DataFusion] Support CASE WHEN from SQL</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10693">ARROW-10693</a> - [Rust] [DataFusion] Add support for the left join</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10696">ARROW-10696</a> - [C++] Investigate a bit run reader that would only return runs of set bits</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10697">ARROW-10697</a> - [C++] Consolidate bitmap word readers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10703">ARROW-10703</a> - [Rust] [DataFusion] Make join not collect left on every part</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10704">ARROW-10704</a> - [Rust][DataFusion] Remove Nested from expression enum</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10708">ARROW-10708</a> - [Packaging][deb] Add support for Ubuntu 20.10</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10709">ARROW-10709</a> - [Python] Difficult to make an efficient zero-copy file reader in Python</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10712">ARROW-10712</a> - [Rust] [DataFusion] Add tests to TPC-H benchmarks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10717">ARROW-10717</a> - [Rust] [DataFusion] Add support for right join</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10720">ARROW-10720</a> - [C++] Add BasicDecimal256 Rescale Support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10721">ARROW-10721</a> - [C#][CI] Use .NET 3.1 by default</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10722">ARROW-10722</a> - [Rust][DataFusion] Reduce overhead in data types in aggregations / joins, improve benchmarks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10723">ARROW-10723</a> - [Packaging][deb][RPM] Enable Parquet encription</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10724">ARROW-10724</a> - [Developer Tools] Add labeler to when PRs need rebase</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10725">ARROW-10725</a> - [Python][Compute] Exposing bindings for sort options</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10728">ARROW-10728</a> - [Rust] [DataFusion] Add SQL support for JOIN with USING clause</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10729">ARROW-10729</a> - [Rust] [DataFusion] Add SQL support for JOIN using implicit syntax</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10732">ARROW-10732</a> - [Rust] [DataFusion] Add SQL support for table/relation aliases and compound identifiers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10733">ARROW-10733</a> - [R] Improvements to Linux installation troubleshooting</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10740">ARROW-10740</a> - [Rust][DataFusion] Remove redundant clones found by clippy</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10741">ARROW-10741</a> - Apply clippy lints to source code, remove them from ignore list</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10742">ARROW-10742</a> - [Python] Mask not checked when creating array from numpy array</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10745">ARROW-10745</a> - [Rust] Allocate padding bytes in filter context</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10747">ARROW-10747</a> - [Rust] Optimizations for csv reader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10750">ARROW-10750</a> - [Rust] [DataFusion] Add SQL support for LEFT and RIGHT join</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10752">ARROW-10752</a> - [GLib] Add garrow_schema_has_metadata()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10754">ARROW-10754</a> - [GLib] Add support for metadata to GArrowField</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10755">ARROW-10755</a> - [Rust] [Parquet] Add support for writing boolean type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10756">ARROW-10756</a> - [Rust] Clippy - fix reduntant clone</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10759">ARROW-10759</a> - [Rust][DataFusion] Implement support for casting string to date in sql expressions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10763">ARROW-10763</a> - [Rust] Speed up take kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10765">ARROW-10765</a> - [Rust] Optimize take strings for non-null arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10767">ARROW-10767</a> - [Rust] Speed up sum kernel with nulls</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10770">ARROW-10770</a> - [Rust] Support reading nested JSON lists</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10772">ARROW-10772</a> - [Rust] Improve take performance</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10775">ARROW-10775</a> - [Rust][DataFusion] Use ahash in hash join</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10776">ARROW-10776</a> - [C++] Provide iterator access to primitive elements inside an Array</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10781">ARROW-10781</a> - [Rust] [DataFusion] TableProvider should provide row count statistics</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10783">ARROW-10783</a> - [Rust] [DataFusion] Implement row count statistics for Parquet TableProvider</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10785">ARROW-10785</a> - Further optimize take string</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10786">ARROW-10786</a> - [Packaging][RPM] Drop support for CentOS 6</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10788">ARROW-10788</a> - [C++] Make S3 recursive walks parallel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10789">ARROW-10789</a> - [Rust][DataFusion] Make TableProvider dynamically typed</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10790">ARROW-10790</a> - [C++][Compute] Investigate ChunkedArray sort performance</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10792">ARROW-10792</a> - [Rust] [CI] Modulararize CI for faster and smaller builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10795">ARROW-10795</a> - [Rust] Fix specialization for arrow datatypes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10796">ARROW-10796</a> - [C++] Investigate RecordBatch sort performance</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10800">ARROW-10800</a> - [Rust] [Parquet] Provide access to the elements of parquet::record::{List, Map}</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10802">ARROW-10802</a> - [C++] Remove Dictionary[NullType] special casing in parquet column writer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10808">ARROW-10808</a> - [Rust] [DataFusion] Support nested expressions in aggregations</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10809">ARROW-10809</a> - [C++] Use Datum for SortIndices() input</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10812">ARROW-10812</a> - [Rust] Make BooleanArray not a PrimitiveArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10813">ARROW-10813</a> - [Rust] [DataFusion] Implement DFSchema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10814">ARROW-10814</a> - [Packaging][deb] Drop support for Debian GNU/Linux Stretch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10817">ARROW-10817</a> - [Rust] [DataFusion] Implement TypedString</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10820">ARROW-10820</a> - [Rust] [DataFusion] Complete TPC-H Benchmark Queries</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10821">ARROW-10821</a> - [Rust] [Datafusion] implement negative expression</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10822">ARROW-10822</a> - [Rust] [Datafusion] support compiling datafusion with simd support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10824">ARROW-10824</a> - [Rust] Added PartialEq for NullArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10825">ARROW-10825</a> - [Rust] Add support to NullArrays for MutableArrayData</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10826">ARROW-10826</a> - [Rust] Add support for FixedSizeBinary to MutableArrayData</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10827">ARROW-10827</a> - [Rust] Extend concatenate to all types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10828">ARROW-10828</a> - [Rust][DataFusion] Enable more clippy lints</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10829">ARROW-10829</a> - [Rust] [DataFusion] Implement Into&lt;Schema&gt; for DFSchema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10832">ARROW-10832</a> - [Rust] Evaluate latest snapshot flatc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10836">ARROW-10836</a> - [Rust] Extend take kernel to FixedSizeListArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10838">ARROW-10838</a> - [Rust] [CI] Add CI for wasm32 target</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10839">ARROW-10839</a> - [Rust] [DataFusion] Implement BETWEEN Operator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10843">ARROW-10843</a> - [C++] Add support for temporal types in sort family kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10845">ARROW-10845</a> - [Python][CI] Add python CI build using numpy nightly</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10849">ARROW-10849</a> - [Python] Handle numpy deprecation warnings for builtin type aliases</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10851">ARROW-10851</a> - [C++] Reduce code size of vector_sort.cc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10857">ARROW-10857</a> - [Packaging] Follow PowerTools repository name change on CentOS 8</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10858">ARROW-10858</a> - [C++][MSVC] Add missing Boost dependency</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10861">ARROW-10861</a> - [Python] Update minimal NumPy version to 1.16.6</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10864">ARROW-10864</a> - [Rust] Use standard ordering for floats</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10865">ARROW-10865</a> - [Rust][DataFusion] More ergonomic conversion between Schema, SchemaRef, DFSchema, and DFSchemaRef</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10867">ARROW-10867</a> - build failure on aarch64 with -DARROW_PYTHON=ON and gcc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10869">ARROW-10869</a> - [GLib] Add garrow_*_sort_indices() and related options</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10870">ARROW-10870</a> - [Julia] Update website with Julia implementation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10871">ARROW-10871</a> - [Julia] Setup Julia CI via GitHub Actions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10873">ARROW-10873</a> - [C++] Apple Silicon is reported as arm64 in CMake</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10874">ARROW-10874</a> - [Rust][DataFusion] Add table statistics for MemTable</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10877">ARROW-10877</a> - [Rust] [DataFusion] Add benchmark based on kaggle movies</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10878">ARROW-10878</a> - [Rust] Simplify extend_from_slice</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10879">ARROW-10879</a> - [Packaging][deb] Restore Debian GNU/Linux Buster configuration</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10881">ARROW-10881</a> - [C++] EXC_BAD_ACCESS in BaseSetBitRunReader&lt;false&gt;::NextRun</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10885">ARROW-10885</a> - [Rust][DataFusion] Optimize join build vs probe based on statistics on row number</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10887">ARROW-10887</a> - [C++][Doc] Document IPC API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10889">ARROW-10889</a> - [Rust] Document our approach to unsafe code in README</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10890">ARROW-10890</a> - [Rust] [DataFusion] JOIN support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10891">ARROW-10891</a> - [Rust][DataFusion] More clippy lints</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10893">ARROW-10893</a> - [Rust] [DataFusion] Easier clippy fixes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10896">ARROW-10896</a> - [C++][CMake] Rename internal RE2 package name to “re2” from “RE2”</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10900">ARROW-10900</a> - [Rust][DataFusion] Resolve TableScan provider eagerly</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10904">ARROW-10904</a> - [Python] Add support for Python 3.9 macOS wheels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10905">ARROW-10905</a> - [Python] Add support for Python 3.9 windows wheels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10908">ARROW-10908</a> - [Rust] [DataFusion] Update relevant tpch-queries with BETWEEN</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10917">ARROW-10917</a> - [Rust][Doc] Update feature matrix</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10918">ARROW-10918</a> - [C++][Doc] Document supported Parquet features</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10927">ARROW-10927</a> - [Rust] [Parquet] Add Decimal to ArrayBuilderReader for physical type fixed size binary</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10927">ARROW-10927</a> - [Rust] [Parquet] Add Decimal to ArrayBuilderReader for physical type fixed size binary</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10927">ARROW-10927</a> - [Rust] [Parquet] Add Decimal to ArrayBuilderReader for physical type fixed size binary</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10929">ARROW-10929</a> - [Rust] Migrate CI tests to stable rust</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10933">ARROW-10933</a> - [Rust] Update docs in regard to stable rust</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10934">ARROW-10934</a> - [Python] Tests are failed with fsspec-0.8.5</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10938">ARROW-10938</a> - [Rust] upgrade dependency “flatbuffers” to 0.8</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10940">ARROW-10940</a> - [Rust] Extend sort kernel to ListArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10941">ARROW-10941</a> - [Doc][C++] Document supported Parquet encryption features</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10944">ARROW-10944</a> - [Rust] Implement min/max kernels for BooleanArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10946">ARROW-10946</a> - [Rust] Make ChunkIter not depend on a buffer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10947">ARROW-10947</a> - [Rust][DataFusion] Refactor UTF8 to Date32 for Performance</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10948">ARROW-10948</a> - [C++] Always use GTestConfig.cmake</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10949">ARROW-10949</a> - [Rust] Avoid clones in getting values of boolean arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10951">ARROW-10951</a> - [Python][CI] Nightly pandas builds failing because of pytest monkeypatch issue</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10952">ARROW-10952</a> - [Rust] Add pre-commit hook</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10966">ARROW-10966</a> - [C++] Use FnOnce for ThreadPool’s tasks instead of std::function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10968">ARROW-10968</a> - [Rust][DataFusion] Don’t build hash table for right side of the join</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10969">ARROW-10969</a> - [Rust][DataFusion] Implement ANSI SQL Functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10985">ARROW-10985</a> - [Rust] Update unsafe guidelines for adding JIRA references</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10986">ARROW-10986</a> - [Rust][DataFusion] Add average statistic to TCP-H benchmark too</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10988">ARROW-10988</a> - [C++] Require CMake 3.5 or later</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10989">ARROW-10989</a> - [Rust] Use slices for iterating primitive arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10993">ARROW-10993</a> - [CI][macOS] Fix Python 3.9 installation by Homebrew</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10995">ARROW-10995</a> - [Rust] [DataFusion] Improve parallelism when reading Parquet files</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11004">ARROW-11004</a> - [FlightRPC][Python] Header-based auth in clients</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11005">ARROW-11005</a> - [Rust] Remove indirection from take kernel and simplify interface</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11008">ARROW-11008</a> - [Rust][DataFusion] Simplify count accumulator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11009">ARROW-11009</a> - [Python] Add environment variable to elect default usage of system memory allocator instead of jemalloc/mimalloc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11010">ARROW-11010</a> - [Python] `np.float` deprecation warning in `_pandas_logical_type_map`</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11012">ARROW-11012</a> - [Rust] [DataFusion] Make write_csv and write_parquet concurrent</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11015">ARROW-11015</a> - [CI][Gandiva] Move gandiva nightly build from travis to github action</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11018">ARROW-11018</a> - [Rust][DataFusion] Add null count column statistics</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11026">ARROW-11026</a> - [Rust]: Run tests without requiring environment variables</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11028">ARROW-11028</a> - [Rust] Somewhat pedantic pattern-matches</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11029">ARROW-11029</a> - [Rust] [DataFusion] Document why join order optimization does not work with filter pushdown</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11032">ARROW-11032</a> - [C++][FlightRPC] Add benchmark for local RPC through unix socket</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11033">ARROW-11033</a> - [Rust] CSV writer performance improvements</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11034">ARROW-11034</a> - [Rust] rustfmt cleanup</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11035">ARROW-11035</a> - [Rust] Improve performance of cast to utf8 via FromIter</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11037">ARROW-11037</a> - [Rust] Improve performance of string fromIter</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11038">ARROW-11038</a> - [Rust] Remove `BufferBuilderTrait` and associated Result requirement.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11039">ARROW-11039</a> - [Rust] Improve performance for utf8 to float cast</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11040">ARROW-11040</a> - [Rust] Simplify builders with generics</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11042">ARROW-11042</a> - [Rust][DataFusion] Increase default batch size</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11043">ARROW-11043</a> - [C++] Add “is_nan” kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11046">ARROW-11046</a> - [Rust][DataFusion] Add count_distinct to dataframe API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11049">ARROW-11049</a> - [Python] Expose alternate memory pools</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11052">ARROW-11052</a> - [Rust] [DataFusion] Implement metrics in join operator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11053">ARROW-11053</a> - [Rust] [DataFusion] Optimize joins with dynamic capacity for output batches</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11054">ARROW-11054</a> - Update SQLParser to 0.70</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11055">ARROW-11055</a> - [Rust] [DataFusion] Support date_trunc function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11058">ARROW-11058</a> - [Rust] [DataFusion] Implement “coalesce batches” operator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11063">ARROW-11063</a> - [Rust] Validate null counts when building arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11064">ARROW-11064</a> - [Rust][DataFusion] Speed up hash join on smaller batches</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11072">ARROW-11072</a> - [Rust] [Parquet] Support int32 and int64 physical types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11076">ARROW-11076</a> - [Rust][DataFusion] Refactor usage of right indices in hash join</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11079">ARROW-11079</a> - [R] Catch up on changelog since 2.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11080">ARROW-11080</a> - [C++][Dataset] Improvements to implicit casting</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11082">ARROW-11082</a> - [Rust] Add FFI for LargeUtf8</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11086">ARROW-11086</a> - [Rust] Extend take to support more index types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11091">ARROW-11091</a> - [Rust][DataFusion] Fix clippy warning in rust 1.49</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11095">ARROW-11095</a> - [Python] Access pyarrow.RecordBatch column by name</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11096">ARROW-11096</a> - [Rust] Add FFI for [Large]Binary</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11097">ARROW-11097</a> - [Rust] Simplify tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11099">ARROW-11099</a> - [Rust]: Remove unsafe value_slice method from PrimitiveArray and BooleanArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11100">ARROW-11100</a> - [Rust] Speed up numeric to string cast using lexical_core</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11101">ARROW-11101</a> - [Rust] enable “cargo +nightly fmt” in git pre-commit hook</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11104">ARROW-11104</a> - [GLib] Add append_null/append_nulls to GArrowArrayBuilder and use them</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11105">ARROW-11105</a> - [Rust] Favor From/Into traits in MutableBuffer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11108">ARROW-11108</a> - [Rust] Improve performance of MutableBuffer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11109">ARROW-11109</a> - [GLib] Add garrow_array_builder_append_empty_value() and values()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11110">ARROW-11110</a> - [Rust] [Datafusion] context.table should not take a mutable self reference</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11111">ARROW-11111</a> - [GLib] Add GArrowFixedSizeBinaryArrayBuilder</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11121">ARROW-11121</a> - [Developer] Use pull_request_target for PR JIRA integration</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11122">ARROW-11122</a> - [Rust] Add FFI for date and time</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11124">ARROW-11124</a> - [Doc] Update status matrix for Decimal256</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11125">ARROW-11125</a> - [Rust] Implement logical equality for list arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11126">ARROW-11126</a> - [Rust] Document and test ARROW-10656</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11127">ARROW-11127</a> - [C++] Unused cpu_info on non-x86 architecture</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11129">ARROW-11129</a> - [Rust][DataFusion] Use tokio thread pool for loading parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11130">ARROW-11130</a> - [Website][CentOS 8][RHEL 8] Enable all required repositories by default</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11131">ARROW-11131</a> - [Rust] Improve performance of bool_equal</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11136">ARROW-11136</a> - [R] Bindings for is.nan</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11137">ARROW-11137</a> - [Rust][DataFusion] Fix Clippy needless_range_loop, needless_lifetimes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11138">ARROW-11138</a> - [Rust] [DataFusion] Support ltrim, rtrim</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11139">ARROW-11139</a> - [GLib] Add support for extension type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11155">ARROW-11155</a> - [C++][Packaging] Move gandiva crossbow jobs off of Travis-CI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11158">ARROW-11158</a> - [Julia] Implement Decimal256 support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11159">ARROW-11159</a> - [Developer] Consolidate pull request related jobs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11165">ARROW-11165</a> - [Rust] [DataFusion] Document the desired SQL dialect for DataFusion</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11168">ARROW-11168</a> - [Rust] Fix cargo doc warnings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11169">ARROW-11169</a> - [Rust] Add a comment explaining where float total_order algorithm came from</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11175">ARROW-11175</a> - [R] Small docs fixes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11176">ARROW-11176</a> - [R] Expose memory pool name and document setting it</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11187">ARROW-11187</a> - [Rust] [Parquet] Pin specific parquet-format-rs version</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11188">ARROW-11188</a> - [Rust] Implement crypto functions from PostgreSQL dialect</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11193">ARROW-11193</a> - [Documentation] Add docs for Java ListVector</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11194">ARROW-11194</a> - [Rust] Enable SIMD for aarch64</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11195">ARROW-11195</a> - [Rust] [DataFusion] Built-in table providers should expose relevant fields</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11196">ARROW-11196</a> - [GLib] Add support for mock, HDFS and S3 file systems with factory function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11198">ARROW-11198</a> - [Packaging][Python] Ensure setuptools version during build supports markdown</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11200">ARROW-11200</a> - [Rust] [DateFusion] Physical operators and expressions should have public accessor methods</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11201">ARROW-11201</a> - [Rust] create_batch_empty - support more types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11203">ARROW-11203</a> - [Developer][Website] Enable JIRA and pull request integration</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11204">ARROW-11204</a> - [C++] Fix build failure with bundled gRPC and Protobuf</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11205">ARROW-11205</a> - [GLib][Dataset] Add GADFileFormat and its family</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11209">ARROW-11209</a> - [Rust] DF - Provide better error message on unsupported GROUP BY</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11210">ARROW-11210</a> - [CI] Restore workflows that had been blocked by INFRA</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11212">ARROW-11212</a> - [Packaging][Python] Use vcpkg as dependency source for manylinux and windows wheels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11213">ARROW-11213</a> - [Packaging][Python] Dockerize wheel building on windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11215">ARROW-11215</a> - [CI] Use named volumes by default for caching in docker-compose</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11216">ARROW-11216</a> - [Rust] Improve documentation for StringDictionaryBuilder</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11218">ARROW-11218</a> - [R] Make SubTreeFileSystem print method more informative</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11219">ARROW-11219</a> - [CI][Ruby][MinGW] Reduce CI time</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11221">ARROW-11221</a> - [Rust] DF Implement GROUP BY support for Float32/Float64</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11231">ARROW-11231</a> - [Packaging] Add mimalloc to Linux builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11234">ARROW-11234</a> - [CI][Ruby][macOS] Reduce CI time</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11236">ARROW-11236</a> - [Java] Bump Jackson to 2.11.4</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11240">ARROW-11240</a> - [Packaging][R] Add mimalloc to R packaging</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11242">ARROW-11242</a> - [CI] Remove CMake 3.2 job</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11245">ARROW-11245</a> - [C++][Gandiva] Add support for LLVM 11.1</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11247">ARROW-11247</a> - [C++] Infer date32 columns in CSV</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11256">ARROW-11256</a> - [Packaging][Linux] Don’t buffer packaging output</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11268">ARROW-11268</a> - [Rust][DataFusion] Support specifying repartitions in MemTable</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11272">ARROW-11272</a> - [Release][wheel] Remove unsupported Python 3.5 and manylinux1</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11273">ARROW-11273</a> - [Release][deb] Remove unsupported Debian GNU/Linux stretch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11278">ARROW-11278</a> - [Release][NodeJS] Don’t touch ~/.bash_profile</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11280">ARROW-11280</a> - [Release][APT] Fix minimal build example check</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11281">ARROW-11281</a> - [C++] Remove needless runtime RapidJSON dependency</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11282">ARROW-11282</a> - [Packaging][deb] Add missing libgflags-dev dependency</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11285">ARROW-11285</a> - [Release][APT] Add support for Ubuntu Groovy</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11292">ARROW-11292</a> - [Release][JS] Use Node.JS LTS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11293">ARROW-11293</a> - [C++] Don’t require Boost and gflags with find_package(Arrow)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11307">ARROW-11307</a> - [Release][Ubuntu][20.10] Add workaround for dependency issue</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11309">ARROW-11309</a> - [Release][C#] Use .NET 3.1 for verification</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11329">ARROW-11329</a> - [Rust] Do not rebuild the library on every change</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11372">ARROW-11372</a> - Support RC verification on macOS-ARM64</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-1566">PARQUET-1566</a> - [C++] Indicate if null count, distinct count are present in column statistics</li>
</ul>
</main>
<hr/>
<footer class="footer">
<div class="row">
<div class="col-md-9">
<p>Apache Arrow, Arrow, Apache, the Apache feather logo, and the Apache Arrow project logo are either registered trademarks or trademarks of The Apache Software Foundation in the United States and other countries.</p>
<p>&copy; 2016-2024 The Apache Software Foundation</p>
</div>
<div class="col-md-3">
<a class="d-sm-none d-md-inline pr-2" href="https://www.apache.org/events/current-event.html">
<img src="https://www.apache.org/events/current-event-234x60.png"/>
</a>
</div>
</div>
</footer>
</div>
</body>
</html>