<!DOCTYPE html>
<html lang="en-US">
<head>
<meta charset="UTF-8">
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1">
<!-- The above meta tags *must* come first in the head; any other head content must come *after* these tags -->
<title>Apache Arrow 8.0.0 Release | Apache Arrow</title>
<!-- Begin Jekyll SEO tag v2.8.0 -->
<meta name="generator" content="Jekyll v4.3.3" />
<meta property="og:title" content="Apache Arrow 8.0.0 Release" />
<meta property="og:locale" content="en_US" />
<meta name="description" content="Apache Arrow 8.0.0 (6 May 2022) This is a major release covering more than 3 months of development. Download Source Artifacts Binary Artifacts For AlmaLinux For Amazon Linux For CentOS For C# For Debian For Python For Ubuntu Git tag Contributors This release includes 636 commits from 127 distinct contributors. $ git shortlog -sn apache-arrow-7.0.0..apache-arrow-8.0.0 43 Antoine Pitrou 40 David Li 39 Sutou Kouhei 36 Alenka Frim 29 Weston Pace 25 Rok 22 Dragoș Moldovan-Grünfeld 22 Joris Van den Bossche 20 Jonathan Keane 20 Krisztián Szűcs 19 Raúl Cumplido 17 Will Jones 16 Dewey Dunnington 13 david dali susanibar arce 12 Matthew Topol 10 Nic Crane 10 Yibo Cai 10 Vibhatha Abeykoon 9 Dominik Moritz 9 SHIMA Tatsuya 8 Neal Richardson 8 Sanjiban Sengupta 8 okadakk 8 Jacob Wujciak-Jens 7 Tobias Zagorni 7 Alessandro Molina 7 William Butler 6 Dhruv Vats 5 Yuqi Gu 5 Johnnathan 4 Anthony Louis 4 Projjal Chanda 4 Fiona La 4 Micah Kornfield 4 Bryan Cutler 3 Yaron Gvili 3 Ian Alexander Joiner 3 Vinicius Roque 3 ViniciusSouzaRoque 3 emkornfield 3 Chenxi Li 2 Jorge C. Leitao 2 Jose Almeida 2 assignUser 2 Uwe L. 
Korn 2 Todd Farmer 2 Tim Van Wassenhove 2 Rob Saccoccio 2 Alvin Chunga 2 Stephanie Hazlitt 2 Jeroen van Straten 2 Kyle Barron 2 Sasha Krassovsky 2 Eduardo Ponce 2 Salonijain27 2 liyafan82 2 JabariBooker 2 michalursa 2 Matthijs Brobbel 2 Danielle Navarro 2 Maya Anderson 2 Even Rouault 2 Joost Hoozemans 1 zhixingheyi-tian 1 SAm Albers 1 Alex McRae 1 Alfred Mountfield 1 Ali Amiri 1 Andrew Higgins 1 Andrew Strelsky 1 Andrii Oriekhov 1 Andy Teucher 1 Benjamin Kietzman 1 Benson Muite 1 Bob Matcuk 1 Chris Casola 1 Colin Jermain 1 Corentin 1 Dongjoon Hyun 1 Dooyoung-Hwang 1 Edward Visel 1 Elena Henderson 1 Frederic Branczyk 1 Haowei Yu 1 Hongze Zhang 1 Isaac Brodsky 1 James Duong 1 Johan Mattsson 1 João Pedro 1 Kazuaki Ishizaki 1 Kevin Gurney 1 Kian-Meng Ang 1 Laurent Goujon 1 Leo Di Donato 1 Li Jin 1 Min-Young Wu 1 MrMallIronmaker 1 Nate Clark 1 Nicolas Moreau 1 Nivia 1 Pradeep Garigipati 1 Rafael Telles 1 Romain Francois 1 Ryan Blue 1 Aakash Kumar 1 Sam Albers 1 Siddhant Rao 1 Stephan T. Lavavej 1 Thomas J. Fan 1 Tom Drabas 1 William Ayd 1 Xinyu Zeng 1 Yue Ni 1 Zac Farrell 1 bkmgit 1 burntcustard 1 fabiencelier 1 hankvyang 1 hzuo 1 iurysalino 1 jvictorhuguenin 1 jwijffels 1 karldw 1 marcbernot 1 okadak 1 ptaylor 1 shegden Patch Committers The following Apache committers merged contributed patches to the repository. 
$ git shortlog -csn apache-arrow-7.0.0..apache-arrow-8.0.0 105 David Li 102 Antoine Pitrou 84 Sutou Kouhei 72 Jonathan Keane 47 Krisztián Szűcs 46 Joris Van den Bossche 30 Weston Pace 25 Nic Crane 23 Pindikura Ravindra 22 Matthew Topol 16 Alessandro Molina 15 Yibo Cai 15 Neal Richardson 14 Dominik Moritz 5 Bryan Cutler 5 GitHub 3 Ian Cook 2 liyafan82 2 Micah Kornfield 1 Matt Topol 1 Li Jin 1 Chao Sun Changelog Apache Arrow 8.0.0 (2022-05-06) Bug Fixes ARROW-5248 - [Python] support zoneinfo / dateutil timezones ARROW-7350 - [Python] Decode parquet statistics as scalars ARROW-9664 - [Python] Array/ChunkedArray.to_pandas do not support types_mapper keyword ARROW-11415 - [R] map_batches wouldn’t accept a dataset as an argument ARROW-13168 - [C++][R] Enable runtime timezone database for Windows ARROW-13594 - [CI] Enable nightly turbodbc builds again ARROW-13922 - [Python] Fix ParquetDataset throw error when len(path_or_paths) == 1 ARROW-14047 - [C++] [Parquet] FileReader returns inconsistent results on repeat reads ARROW-14215 - [R][CI] Conda Windows builds failing due to space in library name ARROW-14256 - [CI][Package] Re-enable disabled conda packaging builds ARROW-14389 - [C++][Gandiva] Fix performance bug with LIKE expressions ARROW-14638 - [C++][R] Unknown C compiler / ccache on Arch Linux ARROW-14647 - [JS] fix bignumToNumber for negative numbers ARROW-14665 - [JAVA] fix JdbcToArrow ResultSet iteration bug ARROW-14708 - [C++] Adding missing abseil dependencies to enable static flight build ARROW-14908 - [C++][R] Dataset hash join segfaults on Windows ARROW-14911 - [C++] arrow-compute-hash-join-node-test failed ARROW-14960 - [C++] Add exception to Arrow style guide based on changes in Google style guide that we are not adopting ARROW-15018 - [Python] DataFrame Index modified during Feather serialization round trip ARROW-15092 - [R] Support create_package_with_all_dependencies() on non-linux systems ARROW-15253 - [Python] Error in to_pandas for empty dataframe with 
index with extension type ARROW-15272 - [Java] Add cleanup failures as suppressed in ArrowVectorIterator#create ARROW-15291 - [C++][Python] Segfault in StructArray.to_numpy and to_pandas if it contains an ExtensionArray ARROW-15312 - [R][C++] filtering a Parquet dataset with is.na() misses some rows ARROW-15401 - [Python] Gdb tests are failing on windows and apple M1 ARROW-15426 - [C++][Gandiva] Update InExpressionNode validation ARROW-15444 - [C++] Compilation with GCC 7.5 fails in aggregate_basic.cc ARROW-15465 - [Python] Add some missing parquet marks in dataset tests ARROW-15502 - [Java] Detect exceptional footer size in Arrow file reader ARROW-15504 - [Python][CI] Ensure that optional components are tested ARROW-15509 - [Go][Parquet] Parquet cmds crash ARROW-15511 - [Python][C++] Remove reference management in numpy indexer ARROW-15514 - [C++][Gandiva] Add flag to enable Gandiva Object Code ARROW-15520 - [C++] Qualify arrow_vendored::date::format() for C++20 compatibility ARROW-15533 - [C++] Check ARROW_WITH_OPENTELEMETRY in CI ARROW-15539 - [Archery] Add ARROW_JEMALLOC to build options ARROW-15541 - [Python] Bump the minimum Cython version ARROW-15544 - [Go][Parquet] Fix origin schema base64 decoding ARROW-15546 - [FlightRPC][C++] Remove quotes from cookie header ARROW-15555 - [Release] Don’t push the release tag since it already exists ARROW-15580 - [Python] Make pytz an actual optional dependency of PyArrow ARROW-15593 - [C++] Make after-fork ThreadPool reinitialization thread-safe ARROW-15598 - [C++][Gandiva] Avoid using hardcoded raw pointer addresses in generated code ARROW-15599 - [R] Convert a column as a sub-second timestamp from CSV file with the T col type option ARROW-15603 - [C++] Remove unused variables ARROW-15604 - [C++][CI] Sporadic ThreadSanitizer failure with OpenTracing ARROW-15604 - [C++][CI] Sporadic ThreadSanitizer failure with OpenTracing ARROW-15607 - [C++] Fix incorrect CPUID flag for AVX detection ARROW-15626 - [GLib] Fix a bug that 
GArrowGIOInputStream may not read enough data ARROW-15627 - [R] Fix union dataset unify schema ARROW-15648 - [C++][Gandiva] Fix the size of the Gandiva cache ARROW-15651 - [JavaScript] Structs incorrectly initialise null values ARROW-15652 - [C++] Fix GDB pretty-printing from inside parquet namespace ARROW-15659 - [R] strptime should return NA (not error) with format mismatch ARROW-15664 - [C++] parquet reader Segfaults with illegal SIMD instruction ARROW-15667 - [R] Test development build with ARROW_BUILD_STATIC=OFF ARROW-15674 - [C++][Gandiva] Like function doesn’t properly handle patterns with special characters in certain cases ARROW-15677 - [R] calling invalidate() method on ArrowObjects causes subsequent segfault ARROW-15679 - [R] count should return an ungrouped dataframe ARROW-15688 - [C++] add_checked doesn’t error out on duration overflow ARROW-15699 - [C++][Gandiva] Fix implementation of left and right func… ARROW-15700 - [C++] Compilation error on Ubuntu 18.04 ARROW-15705 - [JavaScript] Allowing appending null on children in a StructBuilder ARROW-15710 - [C++] Intermittent deadlock on arrow-threading-utility-test ARROW-15715 - [Go] ipc trim value offsets on arrays ARROW-15718 - [C++] Increase thread limit to work around thread issues ARROW-15720 - [CI] Fix nightly dask build (skip failing test due to wrong usage of Array.to_pandas) ARROW-15723 - [Python] Segfault orcWriter write table ARROW-15727 - [Python] Allow converting lists of MonthDayNano intervals to Pandas ARROW-15728 - [Python] Reduce entropy for zstd test_ipc ARROW-15743 - [R] skip not connected up to skip_rows on open_dataset despite error messages indicating otherwise ARROW-15746 - [Release][Java] Add missing artifacts to tasks.yml ARROW-15748 - [Python] Round temporal options default unit is day but documented as second. 
Follow-up ARROW-15748 - [Python] Round temporal options default unit is day but documented as second ARROW-15757 - [Python] Missing bindings for existing_data_behavior makes it impossible to maintain old behavior ARROW-15760 - [C++] Avoid hard dependency on git in cmake (download tarballs from github instead) ARROW-15770 - [CI] Not all python tests are running on CI jobs ARROW-15772 - [Go][Flight] Server Basic Auth Middleware/Interceptor wrongly base64 decode ARROW-15778 - [Java] set native endian to schema ARROW-15783 - [Python] Initialize static pandas data on write ARROW-15784 - [C++][Python] Removing flag enable_parallel_column_conversion which is no longer used ARROW-15791 - [Go] ipc FileWriter negative WaitGroup counter ARROW-15794 - [CI][Crossbow] Nightly builds failing due to error in types_mapper ARROW-15815 - [C++][Parquet] Fix undefined behaviour on invalid input ARROW-15819 - [R] R docs version switcher doesn’t work on Safari on MacOS ARROW-15830 - [C++] Ensure target directory exists before running Substrait generation ARROW-15837 - [C++][Python] Clarify documentation for ListArray::offsets() ARROW-15845 - [Python][Packaging] Fix macOS wheel builds ARROW-15847 - [Python][CI] Ensure we have a nightly Python build with parquet encryption disabled ARROW-15847 - [Python] Building with Parquet but without Parquet encryption fails ARROW-15848 - [Gandiva][C++] Fix function istrue and is not true ARROW-15851 - [C++] Enable RE2 when building with gRPC ARROW-15852 - [JS] Fix error thrown by Table.getByteLength() ARROW-15857 - [R] rhub/fedora-clang-devel fails to install ‘sass’ (rmarkdown dependency) ARROW-15863 - [Packaging][C++][Python] Fix conda package builds ARROW-15869 - [C++] Fix Valgrind failure (uninitialized value) ARROW-15888 - [Doc][Python] Modernize development instructions ARROW-15892 - [C++] Dataset APIs require s3:ListBucket Permissions ARROW-15895 - [R] R docs version switcher disappears &amp; reappears with back button on Chrome ARROW-15898 - 
[CI] Clean old conda nightlies more thoroughly ARROW-15905 - [Python][C++] Fix CMake warning when building PyArrow ARROW-15928 - [C++] Fix crashes and implement chunked array support for replace_with_mask function ARROW-15929 - [R] io_thread_count is actually the CPU thread count ARROW-15946 - [Go] Fix memory leak in pqarrow.NewColumnWriter when writing nested data ARROW-15949 - [Python] Do not require Parquet encryption when Parquet is disabled ARROW-15951 - [CI][Python] “Test wheel” step successful despite test error ARROW-15954 - [Java] Remove mac native netty kqueue dependency after upgrade ARROW-15960 - [C++] Fix crash on adaptive int builder edge cases ARROW-15962 - [C++][GANDIVA] Fix unhex errors return ARROW-15965 - [C++][Python] Add Scalar constructor of RoundToMultipleOptions to Python ARROW-15970 - [R][CI] Re-enable DuckDB dev tests ARROW-15973 - [CI] Split nightly reports into three: Tests, Packaging, Release ARROW-15982 - [Python] parquet.read_table fails to parse home directory path ARROW-15985 - [CI] Fix conda-clean failure when there are no files to delete ARROW-15987 - [C++][FlightRPC] Work around arrow-flight-test crash on AppVeyor ARROW-15993 - [CI] Add sphinx-tabs to ci/conda_env_sphinx.txt ARROW-16012 - [C++] Retry S3 request in tests when Minio not fully initialized ARROW-16013 - [C++][Python] Signed overflow when using negative stride in NumPyStridedConverter ARROW-16016 - [C++] Fix recursive ccache invocation error ARROW-16019 - [C++] Minimize chances of Minio connect errors ARROW-16021 - [C++] arrow-compute-hash-join-node-test timeout on MinGW ARROW-16025 - [Python][C++] Fix segmentation fault when closing ORCFileWritter ARROW-16031 - [C++][Gandiva] Fix Soundex errors generate ARROW-16035 - [Java] Handling empty JDBC ResultSet ARROW-16043 - [C++][Filesystem][S3] Add missing empty content for creating directory ARROW-16048 - [Python] Avoid exposing null buffer address to the Python buffer protocol ARROW-16051 - [Gandiva][C++] Fix datediff 
regression build ARROW-16052 - [R] undefined global function %&gt;% ARROW-16060 - [C++] subtract_checked support for timestamp(“s”) and date32 ARROW-16071 - [R] More undefined global functions ARROW-16078 - Upgrade bundled zlib to 1.2.12 ARROW-16099 - [JS] RecordBatches that are compressed should throw an error ARROW-16107 - [Dev][Archery] Fix archery crossbow latest-prefix query ARROW-16110 - [C++] GcsFileSystem::Make ignores IOContext ARROW-16113 - [Python] Partitioning.dictionaries in case of a subset of fields are dictionary encoded ARROW-16131 - [C++] support saving and retrieving custom metadata in batches for IPC file ARROW-16134 - [C++][GANDIVA] Fix Concat_WS errors return ARROW-16136 - [Gandiva][C++] Fix problem of the huge size of AddMappings function ARROW-16139 - [Python] Crash in tests/test_dataset.py::test_write_dataset_s3 ARROW-16143 - [Java] Upgrade jackson dependencies CVE-2020-36518 ARROW-16143 - [Java] Upgrade jackson dependencies CVE-2020-36518 ARROW-16146 - [C++] arrow-gcsfs-test is timing out ARROW-16148 - [C++] TPC-H generator cleanup ARROW-16152 - [C++] Fix segfault with unknown functions in Substrait ARROW-16159 - [C++][Python] Allow FileSystem::DeleteDirContents to succeed if the directory is missing ARROW-16162 - [C++][FlightRPC] Fix Flight build on Ubuntu 18.04 ARROW-16163 - [Go] IPC FileReader leaks memory when used with ZSTD compression ARROW-16165 - [CI][Archery] Fix nightly query to crossbow to send reports ARROW-16169 - [C++][Gandiva] Fix empty string case in convert_fromUTF8_binary() ARROW-16181 - [CI][C++] Valgrind failure in TPCH node tests ARROW-16182 - [C++][CI] TPCH node tests timeout under ThreadSanitizer ARROW-16185 - [C++] Fix uninitialized output data in strptime kernel ARROW-16197 - [Docs] Fix broken link ARROW-16205 - [C++][FlightRPC] Don’t use constexpr std::initializer_list ARROW-16209 - [JS] Support setting arbitrary symbols on Tables ARROW-16215 - [C++][FlightRPC] Fix segfault in Flight test on Windows ARROW-16216 - 
[Python][FlightRPC] Fix test_flight.py when Flight is not available ARROW-16219 - [CI] Fix git config to prevent SCM tools failure ARROW-16223 - [C++] Fix decimal reduce scale rounding ARROW-16225 - [C++][Parquet] Fix length of encryption AAD random byte generation ARROW-16233 - [Python][Packaging] test_zoneinfo_tzinfo_to_string fails with zoneinfo._common.ZoneInfoNotFoundError on packaging wheels on Windows ARROW-16235 - [C++] Fix build failure, compiler warnings from MinGW ARROW-16236 - [Python] [Packaging] test_s3fs_limited_permissions_create_bucket fails with Permission denied on MAC OS wheel builds ARROW-16237 - [Docs] Apache Impala is no longer incubating ARROW-16238 - [C++] Fix nullptr dereference when pre-buffering IPC reads ARROW-16261 - [C++] Fix DeleteDirContents on HDFS with missing_dir_ok=True ARROW-16262 - [CI][Integration] Skip failing tests from kartothek integration ARROW-16278 - [CI] Fix git installation failure on brew ARROW-16278 - [CI] Fix git installation failure on brew ARROW-16278 - [CI] Fix git installation failure on brew ARROW-16293 - [CI][GLib] Make tests stable ARROW-16295 - [CI][Release] Use windows-2019 for verify-rc-source-windows ARROW-16300 - pc.sort_indices with nonexistent column throws malloc error ARROW-16301 - [C#][CI] Fix docker configuration for .NET 6 ARROW-16305 - [C++] Missed reference to ARROW_ENGINE during the rename ARROW-16306 - [CI] Fix Nightly verify rc on ubuntu ARROW-16307 - [Java][FlightRPC] Skip flaky test TestDoExchange.testClientCancel ARROW-16311 - [Java] Do not return table_schema column when it’s not requested ARROW-16312 - [C++][CI] Install tzdata in the windows verification builds ARROW-16313 - [R] Ensure assume_timezone options are always initialized ARROW-16332 - [Release][Java] Add artifacts uploaded verification ARROW-16336 - [Python] ParquetDataset - Hide internal (common_)metadata related warnings from the user ARROW-16374 - [R][C++] skip another snappy test during sanitizer runs ARROW-16375 - 
[R][CI] Pin test-r-devdocs on Windows to R 4.1 ARROW-16393 - [JAVA] Update option spec to accept value for query, catalog, schema and table ARROW-16413 - [Python] Certain dataset APIs hang with a python filesystem ARROW-16417 - [C++][Python] Segfault in test_exec_plan.py / test_joins ARROW-16419 - [Python] Properly wait for ExecPlan to finish ARROW-16442 - [Python][Dataset] Fix fragments of ORC Dataset to use FileFragment class PARQUET-2115 - [C++] Parquet dictionary bit widths are limited to 32 bits PARQUET-2118 - [C++] Don’t assume standard pointers PARQUET-2119 - [C++] Fix DeltaBitPackDecoder fuzzer found issue PARQUET-2123 - [C++] Fix invalid memory access in ScanFileContents PARQUET-2124 - [C++] Remove Parquet Dictionary DCHECK PARQUET-2130 - Fix crash in debug with non-standard key names. PARQUET-2131 - Number values decoded DCHECKs should be exceptions New Features and Improvements ARROW-1888 - [C++] Implement Struct Casts ARROW-3016 - [Docs][C++] Memory profiling with perf ARROW-3039 - [Go] Add support for DictionaryArray ARROW-3998 - [C++] Add TPC-H Generator ARROW-5107 - [Release] Validate non-RC source and binary artifacts ARROW-5598 - [Go] Rename array.Array{,Approx}Equal to array.{,Approx}Equal ARROW-6780 - [C++][Parquet] Support DurationType in writing/reading parquet (written as int64) ARROW-7174 - [Python] Expose parquet dictionary_pagesize_limit write parameter ARROW-7272 - [C++][Java][Dataset] JNI bridge between RecordBatch and VectorSchemaRoot ARROW-7914 - [Python] Allow pandas datetime as index for feather ARROW-9235 - [R] Support for connection class when reading and writing files ARROW-9378 - [Go] Support unsigned dictionary indices ARROW-9947 - [Python] High-level Python API for Parquet encryption of files. 
ARROW-10643 - [Python] Pandas&lt;-&gt;pyarrow roundtrip failing to recreate index for empty dataframe ARROW-10924 - [C++] Validate temporal data in ValidateArrayFull ARROW-11071 - [R][CI] Use processx to set up minio and flight servers in tests ARROW-11259 - [Python] Allow to create field reference to nested field ARROW-11989 - [C++][Python] Improve ChunkedArray’s complexity for the access of elements ARROW-12515 - [Dev][Wiki][Release] Fix and update Windows RC verify script ARROW-12516 - [C++][Gandiva] Implements castINTERVALDAY(varchar) and castINTERVALYEAR(varchar) functions ARROW-12659 - [C++] Support is_valid as a guarantee ARROW-12743 - [R] Add DESCRIPTION fields for dev dependencies ARROW-13185 - [MATLAB] Create a single MEX gateway function which delegates to specific C++ functions ARROW-13204 - [MATLAB] Update documentation for the MATLAB Interface to reflect latest CMake build system changes ARROW-13231 - [Doc] Add ORC documentation ARROW-13260 - [Doc] Host different released versions of the documentation + version switcher ARROW-13337 - [R] Define Math group generics ARROW-13375 - [C++][Gandiva] Implement POSITIVE and NEGATIVE Hive functions on Gandiva ARROW-13409 - [C++][FlightRPC] Expose server shutdown with deadline ARROW-13564 - [Dev] Check individual commit messages for “Co-authored-by:” tags when integrating a pull request ARROW-13616 - [R] Cheat Sheet Structure ARROW-13683 - [R] Test Windows UCRT R ARROW-13703 - [Python][R] Add bindings for new dataset writing options ARROW-13993 - [C++][Compute] Add hash_one aggregate function ARROW-14075 - [C++][CI] Add an appveyor CI job for VisualStudio 2019, non-conda ARROW-14091 - [C++] add(date, duration) -&gt; timestamp kernel ARROW-14093 - [C++] subtract(date, date) -&gt; duration kernel ARROW-14094 - [C++] add(timestamp, duration) -&gt; timestamp kernel ARROW-14095 - [C++] subtract(timestamp, duration) -&gt; timestamp kernel ARROW-14096 - [C++] add(time, duration) -&gt; time kernel ARROW-14097 - [C++] 
subtract(time, duration) -&gt; time kernel ARROW-14098 - [C++] subtract(time, time) -&gt; duration kernel ARROW-14099 - [C++] add(duration, duration) -&gt; duration kernel ARROW-14100 - [C++] subtract(duration, duration) -&gt; duration kernel ARROW-14101 - [C++] multiply(duration, integer) -&gt; duration kernel ARROW-14102 - [C++] divide(duration, integer) -&gt; duration kernel ARROW-14153 - [C++][Dataset] Add support for batch_size in the ORC Scanner ARROW-14168 - [R] Warn only once about arrow function differences ARROW-14169 - [R] altrep for factors ARROW-14199 - [R] bindings for format (where possible) ARROW-14266 - [R] Use WriteNode to write queries ARROW-14279 - [Docs] Initial attempt at describing structure of PyArrow library ARROW-14292 - [C++][Python] Join foundation for Tables ARROW-14293 - [Python] Basic Join functionality in PyArrow ARROW-14322 - [Doc] Add Python doc on how to connect Python to other languages ARROW-14333 - [C++][Compute] Add binary and LargeStringType tests to comparison kernels ARROW-14339 - [Docs] Add canonical url to the pkgdown (R) docs ARROW-14442 - [R] fix behaviour when converting timestamps with “” as tzone ARROW-14444 - [C++] Implement task-based model into the executable-pipelines. 
ARROW-14498 - [Docs] Make it possible to regenerate older docs with additional patch(es) ARROW-14502 - [C++][Gandiva] Add test DayOfMonth ARROW-14506 - [C++] Conda support for google-cloud-cpp ARROW-14553 - [Doc] Java Cookbook Release 1 ARROW-14579 - [Documentation] Document the CI ARROW-14591 - [R] Implement bindings for lubridate duration types ARROW-14612 - [C++] Support for filename-based partitioning ARROW-14631 - [C++][Gandiva] Implement Nextday Function ARROW-14651 - [Release][Archery] Add support for retrying download ARROW-14672 - [Docs] Document how to exchange data between Python and Java ARROW-14679 - [R][C++] Handle suffix argument in joins ARROW-14698 - [Docs][FlightRPC] Add API docs for Flight SQL ARROW-14702 - [Doc][C++] Document threading model ARROW-14745 - [R] Enable true duckdb streaming ARROW-14776 - [Website] Don’t include squashed commits in merge commit message ARROW-14798 - [C++][Python][R] Add container window to PrettyPrintOptions ARROW-14808 - [R] Implement bindings for lubridate::date() ARROW-14810 - [R] Implement bindings for lubridate’s date_decimal() and decimal_date() ARROW-14815 - [R] bindings for lubridate::semester() ARROW-14817 - [R] Implement bindings for lubridate::tz() ARROW-14823 - [R] Implement bindings for lubridate::leap_year ARROW-14824 - [R] Implement bindings for lubridate::epiyear() ARROW-14825 - [C++] Temporal component extraction function for extracting epiyear ARROW-14826 - [R] Implement bindings for lubridate::dst() ARROW-14827 - [C++] Temporal component extraction function for extracting dst indicator ARROW-14893 - [C++] Allow creating GCS filesystem from URI ARROW-14927 - [CI] Upgrade Fedora 33 to Fedora 35 ARROW-14942 - [R] Bindings for lubridate’s dpicoseconds, dnanoseconds, desconds, dmilliseconds, dmicroseconds ARROW-14943 - [R] Bindings for lubridate’s ddays, dhours, dminutes, dmonths, dweeks, dyears ARROW-14944 - [R] Implement lubridate::make_difftime() ARROW-14963 - [Doc] Add copy button extension to 
code-blocks ARROW-14993 - [C++] Benchmark CSV writer ARROW-14997 - [Python][Doc] Add thread_count functions to API docs ARROW-15013 - [R] Expose concatenate at the R level ARROW-15015 - [R] Test / CI flag for ensuring all tests are run? ARROW-15020 - [R] Add bindings for new dataset writing options ARROW-15040 - [R] Enable write_csv_arrow to take a Dataset or arrow_dplyr_query as input ARROW-15061 - [C++] Add logging for kernel functions and exec plan nodes ARROW-15062 - [C++] Add memory information to current spans ARROW-15064 - [C++] Vectorize CheckStringHasNoStructuralChars in CSV writer ARROW-15066 - [C++] Enable use of non-bundled OpenTelemetry ARROW-15067 - [C++] Add tracing spans to the scanner ARROW-15080 - [Python][C++] Enable tuples conversion to interval ARROW-15089 - [C++][Compute] Implement kernel to lookup a MapArray item for a given key ARROW-15098 - [R] Add binding for lubridate::duration() and/or as.difftime() ARROW-15118 - [C++] Avoid bitmap buffer if all inputs are all valid for Scalar Kernels ARROW-15152 - [C++][Compute] Implement hash_list aggregate function ARROW-15156 - [Doc] Implement Tutorials for the Java Documentation ARROW-15157 - [Doc] New Contributors Guide v2 ARROW-15163 - [R] lubridate functions for 8.0.0 ARROW-15167 - [R] Improve efficiency of decimal casting ARROW-15168 - [R] Add S3 generics to create main Arrow objects ARROW-15178 - [Java][Docs] Java Tutorial: Developer Docs for Java ARROW-15180 - Document how to add JNI bindings for C++ features ARROW-15183 - [Python][Docs] Add Missing Dataset Write Options ARROW-15192 - [Java] Allow use of Jackson 2.12 and higher ARROW-15195 - [MATLAB] Enable GitHub Actions CI for MATLAB Interface on macOS ARROW-15197 - [C++] UTF-8 string repeat kernel ARROW-15212 - [C++] Handle suffix argument in joins ARROW-15215 - [C++] Consolidate kernel data-copy utilities between replace_with_mask, case_when, coalesce, choose, fill_null_forward, fill_null_backward ARROW-15223 - [C++] Implement Not Between 
ternary kernel ARROW-15238 - [C++] ARROW_ENGINE module with substrait consumer ARROW-15239 - [C++][Compute] Adding Bloom filter implementation ARROW-15258 - [C++] Easy options to create a source node from a table ARROW-15262 - [C++] Create a ToTable sink node ARROW-15281 - [C++] Implement ability to retrieve fragment filename ARROW-15282 - [C++][FlightRPC] Split data methods from the underlying transport ARROW-15294 - [R] Remove arrow-without-arrow and other Solaris hacks ARROW-15296 - [CI][GO] Add Go staticcheck linting to CI lint job ARROW-15299 - [R] investigate {remotes} dependencies “soft” vs TRUE ARROW-15313 - [C++][Java][FlightRPC] Implement type info method to flight-sql ARROW-15314 - [C++][Java][FlightRPC] Add missing metadata on Arrow schemas returned by Flight SQL ARROW-15321 - [Dev][Python] Also numpydoc-validate Cython-generated methods ARROW-15346 - [Doc][Guide] Arrow codebase - minor corrections ARROW-15347 - [Doc][Guide] Update testing section in new contributors guide ARROW-15348 - [Doc][Guide] Lifecycle of a PR - minor corrections ARROW-15349 - [Doc][Guide] Existing Contributors page - update ARROW-15350 - [Doc][Guide] Add styling and linters info section ARROW-15351 - [Doc][Guide] Additional tutorial for R bindings ARROW-15352 - [Doc][Guide] R package and make clean ARROW-15353 - [Doc][Guide] Intro into CI topic and link to the existing docs ARROW-15364 - [Python] Update filesystem entry in read docstrings to reflect current behaviour ARROW-15366 - [Docs] Automate incrementing of package version for R and non-R version switchers ARROW-15367 - [Python] Improve Classes and Methods Docstrings for 8.0.0 ARROW-15369 - [Doc] Tweak example to use the new support for str pointers ARROW-15374 - [C++][FlightRPC] Add support for MemoryManager in data methods ARROW-15389 - [C++][Dev] Improve Array preview in GDB plugin ARROW-15400 - [Go][CI] Exercise builds on arm machines ARROW-15410 - [C++][Datasets] Improve memory usage of datasets API when scanning 
parquet ARROW-15418 - [Go][Flight] Update gRPC version, hide impl details ARROW-15425 - [C++] Add delta dictionaries in file format to integration tests ARROW-15428 - [Python] Address docstrings in Parquet classes and functions ARROW-15429 - [Python] Address docstrings for ChunkedArray class, methods, attributes and constructor ARROW-15431 - [Python] Address docstrings in Schema ARROW-15432 - [Python] Address CSV docstrings ARROW-15440 - [Go] Implement ‘unpack_bool’ with Arm64 GoLang Assembly ARROW-15450 - [Python][Wheel] Flight test receives SIGKILL during in macOS tests ARROW-15462 - [GLib] Add GArrow{Month,DayTime,MonthDayNano}Interval{Scalar,Array,ArrayBuilder} ARROW-15468 - [R][CI] A crossbow job that tests against DuckDB’s dev branch ARROW-15471 - [R] ExtensionType support in R ARROW-15472 - [Website] Add Flight SQL blog post ARROW-15477 - [C++][Python] Allow to create (FixedSize/Large)ListArray from arrays and type ARROW-15480 - [R] Expand on schema/colnames mismatch error messages ARROW-15483 - [Release] Revamp the verification scripts ARROW-15487 - [FlightRPC][C++][GLib][Python][R] Implement FlightClient::Close ARROW-15489 - [R] Expand RecordBatchReader usability ARROW-15491 - [Website] Rotate PMC chair for 2022 ARROW-15497 - [C++][Homebrew] Use Clang Tools 12 ARROW-15501 - [Java] Support validating decimal vectors ARROW-15503 - [GLib][Release] Avoid deprecation warning ARROW-15505 - [C++][Compute] Support null type in product aggregation ARROW-15506 - [C++][Compute] Support Null type in hash_sum/hash_product/hash_mean ARROW-15510 - [C++][FlightRPC] Add CUDA memory manager support to benchmark ARROW-15515 - [C++] Update ExecPlan example code and documentation with new options ARROW-15517 - [R] Use WriteNode in write_dataset() ARROW-15523 - [Python] Support for Datasets as inputs of Joins ARROW-15524 - [Python] Make joins able to receive Tables as inputs ARROW-15525 - [Python] Make joins able to output a Table as result. 
ARROW-15526 - [Python] Support for Dataset.join ARROW-15527 - [Python] Make Joins able to execute the join operation ARROW-15532 - [C++] Fix unused warning for StringClassifyDoc ARROW-15542 - [GLib][Parquet] Add GParquet*Metadata ARROW-15550 - [C++] Add optional debug memory checks ARROW-15551 - [C++][FlightRPC] Update gRPC TLS options detection for 1.43 ARROW-15552 - [Doc][Format] Remove erroneous mention of base64 ARROW-15556 - [Release] Add a script to update Homebrew packages ARROW-15569 - [Packaging][deb] Use gem instead of apt to install gobject-introspection gem ARROW-15570 - [CI][Nightly] Drop centos-8 R nightly job ARROW-15572 - [Java][Docs] Add Installation section to Java documentation ARROW-15573 - [Java][Doc] Document Apache Arrow memory management ARROW-15574 - [Java][Doc] Review existing documentation ARROW-15575 - [Java][Doc] Datasets Tutorial ARROW-15576 - [Java][Doc] Document VectorSchemaRoots for 2D data ARROW-15577 - [Java][Doc] Add Arrow Flight documentation ARROW-15578 - [Java][Doc] Document C Data Interface and how to interface with other languages ARROW-15579 - [C++] Add MemoryManager::CopyBuffer(const Buffer&amp;) ARROW-15594 - [C++][FlightRPC] Add Deserialize(const Buffer&amp;) to various Flight types ARROW-15595 - [Release][Ruby] Add support for MFA ARROW-15600 - [C++][FlightRPC] Add minimal Flight SQL query example ARROW-15601 - [Docs][Release] Update post release script to move stable docs + keep dev docs ARROW-15605 - [CI][R] Keep using old macos runners on our autobrew CI job ARROW-15606 - [CI][R] Add brew build that exercises the R package ARROW-15609 - [C++][Compute] Support hash_aggregate with only keys ARROW-15611 - [C++] Migrate arrow::ipc::internal::json::ArrayFromJSON to Result&lt;&gt; ARROW-15614 - [C++] Add sqrt binary scalar kernel ARROW-15617 - [Doc][C++] Document environment variables ARROW-15619 - [C++] Temporal component extraction function for extracting is_leap_year indicator ARROW-15623 - [C++][Python] Update 
developers/python.rst (console blocks + “” in archery install) ARROW-15625 - [C++] Convert underscore to hyphen in example executable names ARROW-15629 - [GLib] Add garrow_{,large_}string_array_builder_append_string_len() ARROW-15630 - [Release][MSYS2] Update reverse dependencies too ARROW-15631 - [Packaging][RPM] Add major version to libs packages ARROW-15632 - [R] Prune the bundled libarrow source ARROW-15633 - [R] Skip s3_bucket example that requires network connection ARROW-15634 - [C++][Packaging] Improve compilation speed for java-jars nighlty build for MacOS ARROW-15643 - [C++] Allow selecting subset of fields of a StructArray via cast ARROW-15650 - [MATLAB] Rename the MEX gateway function ARROW-15653 - [R][CI] Fix tests of bundled cpp source ARROW-15656 - [C++][R] Make valgrind builds slightly quicker ARROW-15657 - [C++][Java] Upgrade Apache ORC to 1.7.3 ARROW-15665 - [C++] Fix error_is_null in strptime with invalid inputs ARROW-15665 - [C++] Add error handling option to StrptimeOptions ARROW-15670 - [C++/Python/Packaging] Update conda pinnings and enable GCS on Windows ARROW-15672 - [C++] Enable CSV writer to control the field delimiter ARROW-15673 - [R] Error gracefully if DuckDB isn’t installed ARROW-15680 - [C++] Temporal floor/ceil/round should accept week_starts_monday when rounding to multiple of week ARROW-15682 - [CI] Github starting to migrate “windows-latest” tag from windows 2019 to windows 2022 ARROW-15683 - [Website][Rust][DataFusion] Make a 7.0.0 release announcement blog ARROW-15690 - [Dev] Update GitHub Actions workflows that hardcode master as default ARROW-15692 - [Dev] Update release scripts to use default branch ARROW-15694 - [Dev] Update apache/arrow-site GitHub Actions deploy.yml website deployment workflow to support being triggered when pushing to main ARROW-15697 - [R] Add logo and meta tags to pkgdown site ARROW-15698 - [Integration] Privatized some code in tests ARROW-15701 - [R] month() should allow integer inputs ARROW-15706 - 
[C++][FlightRPC] Implement a UCX transport ARROW-15707 - [C++][FlightRPC] Make Flight tests more reusable across transports ARROW-15708 - [R][CI] skip snappy encoded parquets on clang sanitizer ARROW-15709 - [C++] Compilation of ARROW_ENGINE fails if doing an “inline” build ARROW-15709 - [C++] Revert change ARROW-15709 - [C++] Compilation of ARROW_ENGINE fails if doing an “inline” build ARROW-15712 - [R] Add a type method for Expression objects ARROW-15714 - [C++][Gandiva] Increase the protobuf recursion limit in gandiva protobuf parser ARROW-15717 - [Docs] Add hash_one to the documentation ARROW-15721 - [Docs][FlightRPC] Add Flight/Flight SQL to subprojects ARROW-15722 - [Java] Improve error message for nested types with incorrect children ARROW-15726 - [C++] If a projected_schema is not supplied but a bound projection expression is then we should use that to infer the projected_schema ARROW-15739 - [C++] Bump xsimd to latest version ARROW-15740 - [C++][Compute] Benchmark element wise min/max ARROW-15741 - [Doc][Format] Clarify thread-safety of C stream interface ARROW-15742 - [Go] Implement ‘bitmap_neon’ with Arm64 GoLang Assembly ARROW-15744 - [Gandiva][C++] Add NEGATIVE function for interval types ARROW-15749 - [Ruby] Add support for #values of Month Interval Type ARROW-15750 - [Ruby] Add support for #raw_records of Month Interval Type ARROW-15755 - [Java] Support Java 17 ARROW-15763 - [C++] Improve CSV writer performance ARROW-15766 - [R] Implement bindings for lubridate::duration() ARROW-15769 - [C++] Generate less arithmetic kernels ARROW-15775 - [R] Clean up as.* methods to use build_expr() ARROW-15776 - [Python] Expose IpcReadOptions ARROW-15777 - [Python][Flight] Allow passing IpcReadOptions to FlightCallOptions ARROW-15781 - [Python] Release GIL in ensure_complete_metadata ARROW-15782 - [C++] Fix Findre2Alt.cmake to check RE2_ROOT variable first ARROW-15788 - [C++][FlightRPC] Prepare benchmark for alternative transports ARROW-15789 - [C++] Update 
OpenTelemetry to v1.2.0 ARROW-15795 - [Java] Add a getter for the timeZone in timestamp with timezone vectors ARROW-15796 - [Python] Pickling ParquetFileFragment shouldn’t fetch metadata ARROW-15799 - [R] Update as.Date() to support an origin different from epoch ARROW-15800 - [R] Implement bindings for lubridate::as_date() and lubridate::as_datetime() ARROW-15801 - [R] Implement bindings for lubridate date-time helpers ARROW-15802 - [R] bindings for lubridate::make_datetime() and lubridate::make_date() ARROW-15810 - [CI][Nightly] Check R related image strictly ARROW-15814 - [R][DOCS] Improve documentation for cast() ARROW-15817 - [R] Use TableSourceNode instead of InMemoryDataset ARROW-15818 - [R] Implement initial Substrait consumer in the R bindings ARROW-15820 - [C++][Doc] Add table_source to streaming_execution.rst &amp; clarify parameter name ARROW-15821 - [JS] Fix paths to sourcemaps in directories ARROW-15823 - [C++][Python] Add a method to convert a Table to a RecordBatchReader ARROW-15824 - [Python] Make pyarrow.parquet a package ARROW-15827 - [R] Improve UX of write_dataset(…, max_rows_per_group) ARROW-15831 - [Java] Upgrade Flight dependencies ARROW-15841 - [R] Implement SafeCallIntoR to safely call the R API from another thread ARROW-15844 - [Release][Packaging] Use ASCII format for detached sign ARROW-15846 - [Format] Clarify presence of struct validity bitmap ARROW-15850 - [C++] Engine substrait headers missing from install ARROW-15854 - [C++] Refine CSV writer code ARROW-15860 - [Python] Document RecordBatchReader ARROW-15864 - [Java][Docs] Update Arrow nightly Maven releases documentation ARROW-15866 - [Packaging][Ubuntu] Drop support for Ubuntu 21.04 ARROW-15870 - [Python] Start to raise deprecation warnings for use_legacy_dataset=True in parquet.read_table ARROW-15871 - [Python] Start raising deprecation warnings for ParquetDataset keywords that won’t be supported with the new API ARROW-15873 - [CI] Migrate from Ubuntu 21.04 to 22.04 ARROW-15875 
- [R] Expose ReadMetadata for input streams ARROW-15882 - [Python][CI] Ensure we are running hypothesis tests in the nightly hypothesis build ARROW-15885 - [Ruby] Add support for #values of DayTime Interval Type ARROW-15886 - [Ruby] Add support for #raw_records of DayTimeInterval type ARROW-15890 - [CI][Python] Use venv instead of virtualenv ARROW-15896 - [Python][C++] Add errno detail for filesystem “file not found” errors ARROW-15900 - [C++] Support Substrait reading of a Feather-format local file ARROW-15902 - [Website] Add new committers: Raphael Taylor-Davies, Wang Xudong, Yijie Shen, Kun Liu ARROW-15916 - [Packaging][RPM] Add support for CentOS Stream 8 ARROW-15917 - [Java][Docs] Document how to use Flight artifacts ARROW-15918 - [Ruby][{day:, millisecond:}, …] ) ARROW-15919 - [C++] Add function not commutative with timestamps &amp; duration maths ARROW-15921 - [Format][FlightRPC][C++][Java] Clarify interpretation of FlightEndpoint.locations ARROW-15923 - [Packaging][Linux] Enable GCS support ARROW-15924 - [Ruby] Add support for #values of MonthDayNanoInterval type ARROW-15925 - [Ruby] Add support for #raw_records of MonthDayNanoInterval type ARROW-15931 - [Website] Add explicit Apache LICENSE.txt and NOTICE.txt files to apache/arrow-site repository ARROW-15932 - [C++][FlightRPC] Add more tests to the common Flight suite ARROW-15934 - [Python] Expose write_batch_size in python ARROW-15935 - [Ruby] Add test for Arrow::DictionaryArray#values ARROW-15939 - [Python] Add pickle support for JSON options classes ARROW-15940 - [Gandiva][C++] Add NEGATIVE function for decimal data type ARROW-15941 - [C++] Allow overriding the number of IO threads with an environment variable ARROW-15944 - [Docs][C++] Document dependencies for building on Arch Linux ARROW-15947 - [R] rename_with s3 method for arrow_dplyr_query ARROW-15950 - [Go] Lift BitSetRunReader to internal/bitutils package ARROW-15952 - [C++] Document Visitors and finish Scalar::Accept ARROW-15955 - 
[Packaging][RPM] Add missing json-devel to CentOS Stream 8 build image ARROW-15956 - [Java] Consolidate Flight integration testing code ARROW-15963 - [Go][Parquet] simplify ReaderAtSeeker interface ARROW-15968 - [C++] Update AsyncGenerator semantics to emit a terminal item only after all outstanding futures have completed ARROW-15972 - [Java][Doc] Add Getting Started section ARROW-15974 - [C++] Migrate flight/types.h header definitions to use Result&lt;&gt; ARROW-15975 - [C++] Document type traits and inline visitors ARROW-15976 - [C++] Clean up commenting on execution plan example ARROW-15979 - [C++][Doc] Expose more functions of parquet::WriterProperties in doc ARROW-15984 - [C++] Change RecordBatchReader API to use Result&lt;&gt; ARROW-15989 - [R] rbind &amp; cbind for Table &amp; RecordBatch ARROW-15994 - [C++] Back out taskify changes ARROW-15995 - [GO] Improve ‘sum_float64_neon’ performance ARROW-15998 - [Docs][CI] Use sphinx-design tabs instead of sphinx-tabs ARROW-15999 - [Python] Turn deadlines off for the test using hypothesis ARROW-16007 - [R] grepl bindings return FALSE for NA inputs ARROW-16011 - [R] CI jobs should fail if lintr picked up issues ARROW-16014 - [C++] Create more benchmarks for measuring expression evaluation overhead ARROW-16026 - [C++] Add support for the serial executor to expose an async generator as an iterable ARROW-16032 - [C++] Migrate FlightClient API to Result&lt;&gt; ARROW-16033 - [C++] Pass schema to consuming sink node ARROW-16038 - [R] different behavior from dplyr when mutate’s .keep option is set ARROW-16042 - [GO] Fix header file preprocessor issues ARROW-16044 - [Julia] Remove from apache/arrow ARROW-16046 - [Docs][FlightRPC][Python] Ensure Flight Python API is documented ARROW-16049 - [C++][FlightRPC] Fix Flight SQL’s ColumnMetadata constructor visibility ARROW-16053 - [C++][FlightRPC] Fix flaky test TestAuthHandler.FailUnauthenticatedCalls ARROW-16055 - [C++][Gandiva] Skip unnecessary work during cache hit when using 
object code cache ARROW-16057 - [Python] Address docstrings for RecordBatch class, methods, attributes and constructor ARROW-16058 - [Python] Address docstrings for Table class, methods, attributes and constructor ARROW-16059 - [Python] Address docstrings for Tensor class ARROW-16061 - [R][CI] Speed up windows 3.6 builds ARROW-16062 - [Python] Move libarrow_python include definitions to its own file ARROW-16064 - [Java][C++][FlightRPC] Add missing column metadata for type name on FlightSQL ARROW-16065 - [FlightRPC][Docs] Improve Flight documentation ARROW-16068 - [C++][FlightRPC] Migrate remaining flight API to use Result&lt;&gt; ARROW-16069 - [C++][FlightRPC] Refactor out gRPC error code handling ARROW-16073 - [R] clean-up date time unit testing once tzdb is available on Windows ARROW-16074 - [Docs] Document joins ARROW-16079 - [Python] Address docstrings in Parquet schema and metadata ARROW-16082 - [Flight][Go] Allow specifying a net.Listener ARROW-16098 - [JS] Don’t return null in table and recordbatch iterators ARROW-16102 - [C++] Add support for building with system gRPC and bundled GCS ARROW-16104 - [Packaging] Add support for Ubuntu 22.04 ARROW-16105 - [C++][Gandiva] Add support for LLVM 14 ARROW-16109 - [Python] Add dataset mark to test in order to avoid failure ARROW-16114 - [Docs][Python] Document Parquet FileMetaData ARROW-16117 - [JS] Improve decode UTF8 performance ARROW-16120 - [Python] ParquetDataset deprecation: change Deprecation to FutureWarnings ARROW-16121 - [Python] Deprecate the (common_)metadata(_path) attributes of ParquetDataset ARROW-16122 - [Python] Change use_legacy_dataset default and deprecate no-longer supported keywords in parquet.write_to_dataset ARROW-16128 - [C++][FlightRPC] Fix Flight SQL static build on Windows ARROW-16132 - [Packaging][deb][CUDA] Relax libcuda1 dependency ARROW-16154 - [R] Errors which pass through handle_csv_read_error() and handle_parquet_io_error() need better error tracing ARROW-16156 - [R] Clarify warning 
message for features not turned on in .onAttach() ARROW-16158 - [C++][R] Rename ARROW_ENGINE to ARROW_SUBSTRAIT ARROW-16166 - [C++][Compute] Utilities for assembling join output ARROW-16167 - [JS] refactor get and set visitors ARROW-16173 - [C++] Add benchmarks for temporal functions/kernels ARROW-16176 - [Release][C#] Use .NET 6.0 on Ubuntu 22.04 ARROW-16186 - [C++][GANDIVA] Add alias and tests for decimal, quarter, xor, etc… ARROW-16187 - [Go][Parquet] Properly utilize BufferedStream and buffer size when reading ARROW-16192 - [Go] Remove deprecated aliases for v8 ARROW-16193 - [Go] Replace CPU discovery package with golang.org/x/sys/cpu module ARROW-16198 - [CI][Packaging][Python] Update VCPKG version ARROW-16201 - [R] SafeCallIntoR on 3.4 ARROW-16203 - [Release] Remove all old artifacts on release ARROW-16204 - [C++][Dataset] Default error existing_data_behaviour for writing dataset ignores a single file ARROW-16208 - [JS] Upgrade deps ARROW-16210 - [JS] Implement tableFromJSON and support struct vector in vectorFromArray ARROW-16214 - [GLib][Parquet] Add GParquetFileMetadata ARROW-16229 - [CI] Temporary remove turbodbc tests from nightly tests ARROW-16232 - [C++] Include OpenTelemetry in LICENSE.txt ARROW-16240 - [Python] Support row_group_size/chunk_size keyword in pq.write_to_dataset with use_legacy_dataset=False ARROW-16242 - [Go] xerrors.Errorf and xerrors.Is are deprecated, fix linting ARROW-16245 - [GLib][Parquet] Add GParquetRowGroupMetadata ARROW-16247 - [GLib] Add GArrowGCSFileSystem ARROW-16250 - [GLib][Parquet] Add GParquetColumnChunkMetadata ARROW-16251 - [GLib][Parquet] Add GParquetStatistics and its family ARROW-16252 - [CI][Archery] Highlight number of failed builds on nightly reports ARROW-16256 - [Docs] Document which format version is supported ARROW-16257 - [R] Break-up as_date and as_datetime into individual functions ARROW-16264 - [C++][CI] Valgrind timeout in arrow-compute-hash-join-node-test ARROW-16276 - [R] Release News ARROW-16277 - 
[Python] No builds for macOS arm64. ARROW-16280 - [C++] Avoid copying shared_ptr in Expression::type() ARROW-16282 - [CI] [C#] Verify release on c-sharp has been failing since upgrading ubuntu to 22.04 ARROW-16283 - [Go] Cleanup panics in new Buffered Reader ARROW-16284 - [Python][Packaging] Use delocate-fuse to create universal2 wheels ARROW-16291 - [Java]: Support JSE17 for Java Cookbooks ARROW-16292 - [Java][Doc] Upgrade java documentation for JSE17/JSE18 ARROW-16294 - [C++] Improve performance of parquet readahead ARROW-16296 - [GLib] Add missing casts for GArrowRoundMode ARROW-16303 - [C++] Check EINTR in file IO ARROW-16308 - [CI] Upgrade windows runner version as windows-2016 is deprecated. ARROW-16314 - [Python][CI] Skip running cython tests in windows verification builds ARROW-16325 - [R] Add task for R package with gcc12 ARROW-16327 - [Java][CI]: Add support for Java 17 CI process ARROW-16334 - [Archery][CI] Use build links on nightly report emails instead of branch link ARROW-16338 - [CI] Update azure windows image as vs2017-win2016 is retired ARROW-16347 - [Release] Escape backtick in verification script ARROW-16349 - [Release][Packaging][RPM] Remove ed25519 keys from KEYS ARROW-16350 - [Dev][Archery] Add missing newline in error message comment ARROW-16352 - [GLib] Fix wrong enums.h install location ARROW-16354 - [Packaging][RPM] Update artifacts pattern list ARROW-16355 - [Dev] Update verify-release-candidate.sh to compile cpp in parallel ARROW-16373 - [Docs][CI] Small improvements to CI documentation ARROW-16387 - [C++] Add -Wshorten-64-to-32 to list of CHECKIN warnings tested by clang ARROW-16390 - [C++] Dataset initialization could segfault if called simultaneously ARROW-16408 - [C++] Add support for DATE type in SQLite FlightSQL example ARROW-16411 - [Website] Migrate to Matomo from Google Analytics ARROW-16412 - [Java] Updated README to reference compilation docs ARROW-16416 - [C++] Support cast-function in Substrait ARROW-16428 - [Release] Add 
prefix to ENV variables ARROW-16445 - [R] [Doc] Add a short summary for the Installing the Arrow package on Linux article ARROW-16455 - [CI] [Packaging] Anaconda storage size exceeded for linux-ppc64le" />
<meta property="og:description" content="Apache Arrow 8.0.0 (6 May 2022) This is a major release covering more than 3 months of development. Download Source Artifacts Binary Artifacts For AlmaLinux For Amazon Linux For CentOS For C# For Debian For Python For Ubuntu Git tag Contributors This release includes 636 commits from 127 distinct contributors. $ git shortlog -sn apache-arrow-7.0.0..apache-arrow-8.0.0 43 Antoine Pitrou 40 David Li 39 Sutou Kouhei 36 Alenka Frim 29 Weston Pace 25 Rok 22 Dragoș Moldovan-Grünfeld 22 Joris Van den Bossche 20 Jonathan Keane 20 Krisztián Szűcs 19 Raúl Cumplido 17 Will Jones 16 Dewey Dunnington 13 david dali susanibar arce 12 Matthew Topol 10 Nic Crane 10 Yibo Cai 10 Vibhatha Abeykoon 9 Dominik Moritz 9 SHIMA Tatsuya 8 Neal Richardson 8 Sanjiban Sengupta 8 okadakk 8 Jacob Wujciak-Jens 7 Tobias Zagorni 7 Alessandro Molina 7 William Butler 6 Dhruv Vats 5 Yuqi Gu 5 Johnnathan 4 Anthony Louis 4 Projjal Chanda 4 Fiona La 4 Micah Kornfield 4 Bryan Cutler 3 Yaron Gvili 3 Ian Alexander Joiner 3 Vinicius Roque 3 ViniciusSouzaRoque 3 emkornfield 3 Chenxi Li 2 Jorge C. Leitao 2 Jose Almeida 2 assignUser 2 Uwe L. 
Korn 2 Todd Farmer 2 Tim Van Wassenhove 2 Rob Saccoccio 2 Alvin Chunga 2 Stephanie Hazlitt 2 Jeroen van Straten 2 Kyle Barron 2 Sasha Krassovsky 2 Eduardo Ponce 2 Salonijain27 2 liyafan82 2 JabariBooker 2 michalursa 2 Matthijs Brobbel 2 Danielle Navarro 2 Maya Anderson 2 Even Rouault 2 Joost Hoozemans 1 zhixingheyi-tian 1 SAm Albers 1 Alex McRae 1 Alfred Mountfield 1 Ali Amiri 1 Andrew Higgins 1 Andrew Strelsky 1 Andrii Oriekhov 1 Andy Teucher 1 Benjamin Kietzman 1 Benson Muite 1 Bob Matcuk 1 Chris Casola 1 Colin Jermain 1 Corentin 1 Dongjoon Hyun 1 Dooyoung-Hwang 1 Edward Visel 1 Elena Henderson 1 Frederic Branczyk 1 Haowei Yu 1 Hongze Zhang 1 Isaac Brodsky 1 James Duong 1 Johan Mattsson 1 João Pedro 1 Kazuaki Ishizaki 1 Kevin Gurney 1 Kian-Meng Ang 1 Laurent Goujon 1 Leo Di Donato 1 Li Jin 1 Min-Young Wu 1 MrMallIronmaker 1 Nate Clark 1 Nicolas Moreau 1 Nivia 1 Pradeep Garigipati 1 Rafael Telles 1 Romain Francois 1 Ryan Blue 1 Aakash Kumar 1 Sam Albers 1 Siddhant Rao 1 Stephan T. Lavavej 1 Thomas J. Fan 1 Tom Drabas 1 William Ayd 1 Xinyu Zeng 1 Yue Ni 1 Zac Farrell 1 bkmgit 1 burntcustard 1 fabiencelier 1 hankvyang 1 hzuo 1 iurysalino 1 jvictorhuguenin 1 jwijffels 1 karldw 1 marcbernot 1 okadak 1 ptaylor 1 shegden Patch Committers The following Apache committers merged contributed patches to the repository. 
$ git shortlog -csn apache-arrow-7.0.0..apache-arrow-8.0.0 105 David Li 102 Antoine Pitrou 84 Sutou Kouhei 72 Jonathan Keane 47 Krisztián Szűcs 46 Joris Van den Bossche 30 Weston Pace 25 Nic Crane 23 Pindikura Ravindra 22 Matthew Topol 16 Alessandro Molina 15 Yibo Cai 15 Neal Richardson 14 Dominik Moritz 5 Bryan Cutler 5 GitHub 3 Ian Cook 2 liyafan82 2 Micah Kornfield 1 Matt Topol 1 Li Jin 1 Chao Sun Changelog Apache Arrow 8.0.0 (2022-05-06) Bug Fixes ARROW-5248 - [Python] support zoneinfo / dateutil timezones ARROW-7350 - [Python] Decode parquet statistics as scalars ARROW-9664 - [Python] Array/ChunkedArray.to_pandas do not support types_mapper keyword ARROW-11415 - [R] map_batches wouldn’t accept a dataset as an argument ARROW-13168 - [C++][R] Enable runtime timezone database for Windows ARROW-13594 - [CI] Enable nightly turbodbc builds again ARROW-13922 - [Python] Fix ParquetDataset throw error when len(path_or_paths) == 1 ARROW-14047 - [C++] [Parquet] FileReader returns inconsistent results on repeat reads ARROW-14215 - [R][CI] Conda Windows builds failing due to space in library name ARROW-14256 - [CI][Package] Re-enable disabled conda packaging builds ARROW-14389 - [C++][Gandiva] Fix performance bug with LIKE expressions ARROW-14638 - [C++][R] Unknown C compiler / ccache on Arch Linux ARROW-14647 - [JS] fix bignumToNumber for negative numbers ARROW-14665 - [JAVA] fix JdbcToArrow ResultSet iteration bug ARROW-14708 - [C++] Adding missing abseil dependencies to enable static flight build ARROW-14908 - [C++][R] Dataset hash join segfaults on Windows ARROW-14911 - [C++] arrow-compute-hash-join-node-test failed ARROW-14960 - [C++] Add exception to Arrow style guide based on changes in Google style guide that we are not adopting ARROW-15018 - [Python] DataFrame Index modified during Feather serialization round trip ARROW-15092 - [R] Support create_package_with_all_dependencies() on non-linux systems ARROW-15253 - [Python] Error in to_pandas for empty dataframe with 
index with extension type ARROW-15272 - [Java] Add cleanup failures as suppressed in ArrowVectorIterator#create ARROW-15291 - [C++][Python] Segfault in StructArray.to_numpy and to_pandas if it contains an ExtensionArray ARROW-15312 - [R][C++] filtering a Parquet dataset with is.na() misses some rows ARROW-15401 - [Python] Gdb tests are failing on windows and apple M1 ARROW-15426 - [C++][Gandiva] Update InExpressionNode validation ARROW-15444 - [C++] Compilation with GCC 7.5 fails in aggregate_basic.cc ARROW-15465 - [Python] Add some missing parquet marks in dataset tests ARROW-15502 - [Java] Detect exceptional footer size in Arrow file reader ARROW-15504 - [Python][CI] Ensure that optional components are tested ARROW-15509 - [Go][Parquet] Parquet cmds crash ARROW-15511 - [Python][C++] Remove reference management in numpy indexer ARROW-15514 - [C++][Gandiva] Add flag to enable Gandiva Object Code ARROW-15520 - [C++] Qualify arrow_vendored::date::format() for C++20 compatibility ARROW-15533 - [C++] Check ARROW_WITH_OPENTELEMETRY in CI ARROW-15539 - [Archery] Add ARROW_JEMALLOC to build options ARROW-15541 - [Python] Bump the minimum Cython version ARROW-15544 - [Go][Parquet] Fix origin schema base64 decoding ARROW-15546 - [FlightRPC][C++] Remove quotes from cookie header ARROW-15555 - [Release] Don’t push the release tag since it already exists ARROW-15580 - [Python] Make pytz an actual optional dependency of PyArrow ARROW-15593 - [C++] Make after-fork ThreadPool reinitialization thread-safe ARROW-15598 - [C++][Gandiva] Avoid using hardcoded raw pointer addresses in generated code ARROW-15599 - [R] Convert a column as a sub-second timestamp from CSV file with the T col type option ARROW-15603 - [C++] Remove unused variables ARROW-15604 - [C++][CI] Sporadic ThreadSanitizer failure with OpenTracing ARROW-15604 - [C++][CI] Sporadic ThreadSanitizer failure with OpenTracing ARROW-15607 - [C++] Fix incorrect CPUID flag for AVX detection ARROW-15626 - [GLib] Fix a bug that 
GArrowGIOInputStream may not read enough data ARROW-15627 - [R] Fix union dataset unify schema ARROW-15648 - [C++][Gandiva] Fix the size of the Gandiva cache ARROW-15651 - [JavaScript] Structs incorrectly initialise null values ARROW-15652 - [C++] Fix GDB pretty-printing from inside parquet namespace ARROW-15659 - [R] strptime should return NA (not error) with format mismatch ARROW-15664 - [C++] parquet reader Segfaults with illegal SIMD instruction ARROW-15667 - [R] Test development build with ARROW_BUILD_STATIC=OFF ARROW-15674 - [C++][Gandiva] Like function doesn’t properly handle patterns with special characters in certain cases ARROW-15677 - [R] calling invalidate() method on ArrowObjects causes subsequent segfault ARROW-15679 - [R] count should return an ungrouped dataframe ARROW-15688 - [C++] add_checked doesn’t error out on duration overflow ARROW-15699 - [C++][Gandiva] Fix implementation of left and right func… ARROW-15700 - [C++] Compilation error on Ubuntu 18.04 ARROW-15705 - [JavaScript] Allowing appending null on children in a StructBuilder ARROW-15710 - [C++] Intermittent deadlock on arrow-threading-utility-test ARROW-15715 - [Go] ipc trim value offsets on arrays ARROW-15718 - [C++] Increase thread limit to work around thread issues ARROW-15720 - [CI] Fix nightly dask build (skip failing test due to wrong usage of Array.to_pandas) ARROW-15723 - [Python] Segfault orcWriter write table ARROW-15727 - [Python] Allow converting lists of MonthDayNano intervals to Pandas ARROW-15728 - [Python] Reduce entropy for zstd test_ipc ARROW-15743 - [R] skip not connected up to skip_rows on open_dataset despite error messages indicating otherwise ARROW-15746 - [Release][Java] Add missing artifacts to tasks.yml ARROW-15748 - [Python] Round temporal options default unit is day but documented as second. 
Follow-up ARROW-15748 - [Python] Round temporal options default unit is day but documented as second ARROW-15757 - [Python] Missing bindings for existing_data_behavior makes it impossible to maintain old behavior ARROW-15760 - [C++] Avoid hard dependency on git in cmake (download tarballs from github instead) ARROW-15770 - [CI] Not all python tests are running on CI jobs ARROW-15772 - [Go][Flight] Server Basic Auth Middleware/Interceptor wrongly base64 decode ARROW-15778 - [Java] set native endian to schema ARROW-15783 - [Python] Initialize static pandas data on write ARROW-15784 - [C++][Python] Removing flag enable_parallel_column_conversion which is no longer used ARROW-15791 - [Go] ipc FileWriter negative WaitGroup counter ARROW-15794 - [CI][Crossbow] Nightly builds failing due to error in types_mapper ARROW-15815 - [C++][Parquet] Fix undefined behaviour on invalid input ARROW-15819 - [R] R docs version switcher doesn’t work on Safari on MacOS ARROW-15830 - [C++] Ensure target directory exists before running Substrait generation ARROW-15837 - [C++][Python] Clarify documentation for ListArray::offsets() ARROW-15845 - [Python][Packaging] Fix macOS wheel builds ARROW-15847 - [Python][CI] Ensure we have a nightly Python build with parquet encryption disabled ARROW-15847 - [Python] Building with Parquet but without Parquet encryption fails ARROW-15848 - [Gandiva][C++] Fix function istrue and is not true ARROW-15851 - [C++] Enable RE2 when building with gRPC ARROW-15852 - [JS] Fix error thrown by Table.getByteLength() ARROW-15857 - [R] rhub/fedora-clang-devel fails to install ‘sass’ (rmarkdown dependency) ARROW-15863 - [Packaging][C++][Python] Fix conda package builds ARROW-15869 - [C++] Fix Valgrind failure (uninitialized value) ARROW-15888 - [Doc][Python] Modernize development instructions ARROW-15892 - [C++] Dataset APIs require s3:ListBucket Permissions ARROW-15895 - [R] R docs version switcher disappears &amp; reappears with back button on Chrome ARROW-15898 - 
[CI] Clean old conda nightlies more thoroughly ARROW-15905 - [Python][C++] Fix CMake warning when building PyArrow ARROW-15928 - [C++] Fix crashes and implement chunked array support for replace_with_mask function ARROW-15929 - [R] io_thread_count is actually the CPU thread count ARROW-15946 - [Go] Fix memory leak in pqarrow.NewColumnWriter when writing nested data ARROW-15949 - [Python] Do not require Parquet encryption when Parquet is disabled ARROW-15951 - [CI][Python] “Test wheel” step successful despite test error ARROW-15954 - [Java] Remove mac native netty kqueue dependency after upgrade ARROW-15960 - [C++] Fix crash on adaptive int builder edge cases ARROW-15962 - [C++][GANDIVA] Fix unhex errors return ARROW-15965 - [C++][Python] Add Scalar constructor of RoundToMultipleOptions to Python ARROW-15970 - [R][CI] Re-enable DuckDB dev tests ARROW-15973 - [CI] Split nightly reports into three: Tests, Packaging, Release ARROW-15982 - [Python] parquet.read_table fails to parse home directory path ARROW-15985 - [CI] Fix conda-clean failure when there are no files to delete ARROW-15987 - [C++][FlightRPC] Work around arrow-flight-test crash on AppVeyor ARROW-15993 - [CI] Add sphinx-tabs to ci/conda_env_sphinx.txt ARROW-16012 - [C++] Retry S3 request in tests when Minio not fully initialized ARROW-16013 - [C++][Python] Signed overflow when using negative stride in NumPyStridedConverter ARROW-16016 - [C++] Fix recursive ccache invocation error ARROW-16019 - [C++] Minimize chances of Minio connect errors ARROW-16021 - [C++] arrow-compute-hash-join-node-test timeout on MinGW ARROW-16025 - [Python][C++] Fix segmentation fault when closing ORCFileWritter ARROW-16031 - [C++][Gandiva] Fix Soundex errors generate ARROW-16035 - [Java] Handling empty JDBC ResultSet ARROW-16043 - [C++][Filesystem][S3] Add missing empty content for creating directory ARROW-16048 - [Python] Avoid exposing null buffer address to the Python buffer protocol ARROW-16051 - [Gandiva][C++] Fix datediff 
regression build ARROW-16052 - [R] undefined global function %&gt;% ARROW-16060 - [C++] subtract_checked support for timestamp(“s”) and date32 ARROW-16071 - [R] More undefined global functions ARROW-16078 - Upgrade bundled zlib to 1.2.12 ARROW-16099 - [JS] RecordBatches that are compressed should throw an error ARROW-16107 - [Dev][Archery] Fix archery crossbow latest-prefix query ARROW-16110 - [C++] GcsFileSystem::Make ignores IOContext ARROW-16113 - [Python] Partitioning.dictionaries in case of a subset of fields are dictionary encoded ARROW-16131 - [C++] support saving and retrieving custom metadata in batches for IPC file ARROW-16134 - [C++][GANDIVA] Fix Concat_WS errors return ARROW-16136 - [Gandiva][C++] Fix problem of the huge size of AddMappings function ARROW-16139 - [Python] Crash in tests/test_dataset.py::test_write_dataset_s3 ARROW-16143 - [Java] Upgrade jackson dependencies CVE-2020-36518 ARROW-16143 - [Java] Upgrade jackson dependencies CVE-2020-36518 ARROW-16146 - [C++] arrow-gcsfs-test is timing out ARROW-16148 - [C++] TPC-H generator cleanup ARROW-16152 - [C++] Fix segfault with unknown functions in Substrait ARROW-16159 - [C++][Python] Allow FileSystem::DeleteDirContents to succeed if the directory is missing ARROW-16162 - [C++][FlightRPC] Fix Flight build on Ubuntu 18.04 ARROW-16163 - [Go] IPC FileReader leaks memory when used with ZSTD compression ARROW-16165 - [CI][Archery] Fix nightly query to crossbow to send reports ARROW-16169 - [C++][Gandiva] Fix empty string case in convert_fromUTF8_binary() ARROW-16181 - [CI][C++] Valgrind failure in TPCH node tests ARROW-16182 - [C++][CI] TPCH node tests timeout under ThreadSanitizer ARROW-16185 - [C++] Fix uninitialized output data in strptime kernel ARROW-16197 - [Docs] Fix broken link ARROW-16205 - [C++][FlightRPC] Don’t use constexpr std::initializer_list ARROW-16209 - [JS] Support setting arbitrary symbols on Tables ARROW-16215 - [C++][FlightRPC] Fix segfault in Flight test on Windows ARROW-16216 - 
[Python][FlightRPC] Fix test_flight.py when Flight is not available ARROW-16219 - [CI] Fix git config to prevent SCM tools failure ARROW-16223 - [C++] Fix decimal reduce scale rounding ARROW-16225 - [C++][Parquet] Fix length of encryption AAD random byte generation ARROW-16233 - [Python][Packaging] test_zoneinfo_tzinfo_to_string fails with zoneinfo._common.ZoneInfoNotFoundError on packaging wheels on Windows ARROW-16235 - [C++] Fix build failure, compiler warnings from MinGW ARROW-16236 - [Python] [Packaging] test_s3fs_limited_permissions_create_bucket fails with Permission denied on MAC OS wheel builds ARROW-16237 - [Docs] Apache Impala is no longer incubating ARROW-16238 - [C++] Fix nullptr dereference when pre-buffering IPC reads ARROW-16261 - [C++] Fix DeleteDirContents on HDFS with missing_dir_ok=True ARROW-16262 - [CI][Integration] Skip failing tests from kartothek integration ARROW-16278 - [CI] Fix git installation failure on brew ARROW-16278 - [CI] Fix git installation failure on brew ARROW-16278 - [CI] Fix git installation failure on brew ARROW-16293 - [CI][GLib] Make tests stable ARROW-16295 - [CI][Release] Use windows-2019 for verify-rc-source-windows ARROW-16300 - pc.sort_indices with nonexistent column throws malloc error ARROW-16301 - [C#][CI] Fix docker configuration for .NET 6 ARROW-16305 - [C++] Missed reference to ARROW_ENGINE during the rename ARROW-16306 - [CI] Fix Nightly verify rc on ubuntu ARROW-16307 - [Java][FlightRPC] Skip flaky test TestDoExchange.testClientCancel ARROW-16311 - [Java] Do not return table_schema column when it’s not requested ARROW-16312 - [C++][CI] Install tzdata in the windows verification builds ARROW-16313 - [R] Ensure assume_timezone options are always initialized ARROW-16332 - [Release][Java] Add artifacts uploaded verification ARROW-16336 - [Python] ParquetDataset - Hide internal (common_)metadata related warnings from the user ARROW-16374 - [R][C++] skip another snappy test during sanitizer runs ARROW-16375 - 
[R][CI] Pin test-r-devdocs on Windows to R 4.1 ARROW-16393 - [JAVA] Update option spec to accept value for query, catalog, schema and table ARROW-16413 - [Python] Certain dataset APIs hang with a python filesystem ARROW-16417 - [C++][Python] Segfault in test_exec_plan.py / test_joins ARROW-16419 - [Python] Properly wait for ExecPlan to finish ARROW-16442 - [Python][Dataset] Fix fragments of ORC Dataset to use FileFragment class PARQUET-2115 - [C++] Parquet dictionary bit widths are limited to 32 bits PARQUET-2118 - [C++] Don’t assume standard pointers PARQUET-2119 - [C++] Fix DeltaBitPackDecoder fuzzer found issue PARQUET-2123 - [C++] Fix invalid memory access in ScanFileContents PARQUET-2124 - [C++] Remove Parquet Dictionary DCHECK PARQUET-2130 - Fix crash in debug with non-standard key names. PARQUET-2131 - Number values decoded DCHECKs should be exceptions New Features and Improvements ARROW-1888 - [C++] Implement Struct Casts ARROW-3016 - [Docs][C++] Memory profiling with perf ARROW-3039 - [Go] Add support for DictionaryArray ARROW-3998 - [C++] Add TPC-H Generator ARROW-5107 - [Release] Validate non-RC source and binary artifacts ARROW-5598 - [Go] Rename array.Array{,Approx}Equal to array.{,Approx}Equal ARROW-6780 - [C++][Parquet] Support DurationType in writing/reading parquet (written as int64) ARROW-7174 - [Python] Expose parquet dictionary_pagesize_limit write parameter ARROW-7272 - [C++][Java][Dataset] JNI bridge between RecordBatch and VectorSchemaRoot ARROW-7914 - [Python] Allow pandas datetime as index for feather ARROW-9235 - [R] Support for connection class when reading and writing files ARROW-9378 - [Go] Support unsigned dictionary indices ARROW-9947 - [Python] High-level Python API for Parquet encryption of files. 
ARROW-10643 - [Python] Pandas&lt;-&gt;pyarrow roundtrip failing to recreate index for empty dataframe ARROW-10924 - [C++] Validate temporal data in ValidateArrayFull ARROW-11071 - [R][CI] Use processx to set up minio and flight servers in tests ARROW-11259 - [Python] Allow to create field reference to nested field ARROW-11989 - [C++][Python] Improve ChunkedArray’s complexity for the access of elements ARROW-12515 - [Dev][Wiki][Release] Fix and update Windows RC verify script ARROW-12516 - [C++][Gandiva] Implements castINTERVALDAY(varchar) and castINTERVALYEAR(varchar) functions ARROW-12659 - [C++] Support is_valid as a guarantee ARROW-12743 - [R] Add DESCRIPTION fields for dev dependencies ARROW-13185 - [MATLAB] Create a single MEX gateway function which delegates to specific C++ functions ARROW-13204 - [MATLAB] Update documentation for the MATLAB Interface to reflect latest CMake build system changes ARROW-13231 - [Doc] Add ORC documentation ARROW-13260 - [Doc] Host different released versions of the documentation + version switcher ARROW-13337 - [R] Define Math group generics ARROW-13375 - [C++][Gandiva] Implement POSITIVE and NEGATIVE Hive functions on Gandiva ARROW-13409 - [C++][FlightRPC] Expose server shutdown with deadline ARROW-13564 - [Dev] Check individual commit messages for “Co-authored-by:” tags when integrating a pull request ARROW-13616 - [R] Cheat Sheet Structure ARROW-13683 - [R] Test Windows UCRT R ARROW-13703 - [Python][R] Add bindings for new dataset writing options ARROW-13993 - [C++][Compute] Add hash_one aggregate function ARROW-14075 - [C++][CI] Add an appveyor CI job for VisualStudio 2019, non-conda ARROW-14091 - [C++] add(date, duration) -&gt; timestamp kernel ARROW-14093 - [C++] subtract(date, date) -&gt; duration kernel ARROW-14094 - [C++] add(timestamp, duration) -&gt; timestamp kernel ARROW-14095 - [C++] subtract(timestamp, duration) -&gt; timestamp kernel ARROW-14096 - [C++] add(time, duration) -&gt; time kernel ARROW-14097 - [C++] 
subtract(time, duration) -&gt; time kernel ARROW-14098 - [C++] subtract(time, time) -&gt; duration kernel ARROW-14099 - [C++] add(duration, duration) -&gt; duration kernel ARROW-14100 - [C++] subtract(duration, duration) -&gt; duration kernel ARROW-14101 - [C++] multiply(duration, integer) -&gt; duration kernel ARROW-14102 - [C++] divide(duration, integer) -&gt; duration kernel ARROW-14153 - [C++][Dataset] Add support for batch_size in the ORC Scanner ARROW-14168 - [R] Warn only once about arrow function differences ARROW-14169 - [R] altrep for factors ARROW-14199 - [R] bindings for format (where possible) ARROW-14266 - [R] Use WriteNode to write queries ARROW-14279 - [Docs] Initial attempt at describing structure of PyArrow library ARROW-14292 - [C++][Python] Join foundation for Tables ARROW-14293 - [Python] Basic Join functionality in PyArrow ARROW-14322 - [Doc] Add Python doc on how to connect Python to other languages ARROW-14333 - [C++][Compute] Add binary and LargeStringType tests to comparison kernels ARROW-14339 - [Docs] Add canonical url to the pkgdown (R) docs ARROW-14442 - [R] fix behaviour when converting timestamps with “” as tzone ARROW-14444 - [C++] Implement task-based model into the executable-pipelines. 
ARROW-14498 - [Docs] Make it possible to regenerate older docs with additional patch(es) ARROW-14502 - [C++][Gandiva] Add test DayOfMonth ARROW-14506 - [C++] Conda support for google-cloud-cpp ARROW-14553 - [Doc] Java Cookbook Release 1 ARROW-14579 - [Documentation] Document the CI ARROW-14591 - [R] Implement bindings for lubridate duration types ARROW-14612 - [C++] Support for filename-based partitioning ARROW-14631 - [C++][Gandiva] Implement Nextday Function ARROW-14651 - [Release][Archery] Add support for retrying download ARROW-14672 - [Docs] Document how to exchange data between Python and Java ARROW-14679 - [R][C++] Handle suffix argument in joins ARROW-14698 - [Docs][FlightRPC] Add API docs for Flight SQL ARROW-14702 - [Doc][C++] Document threading model ARROW-14745 - [R] Enable true duckdb streaming ARROW-14776 - [Website] Don’t include squashed commits in merge commit message ARROW-14798 - [C++][Python][R] Add container window to PrettyPrintOptions ARROW-14808 - [R] Implement bindings for lubridate::date() ARROW-14810 - [R] Implement bindings for lubridate’s date_decimal() and decimal_date() ARROW-14815 - [R] bindings for lubridate::semester() ARROW-14817 - [R] Implement bindings for lubridate::tz() ARROW-14823 - [R] Implement bindings for lubridate::leap_year ARROW-14824 - [R] Implement bindings for lubridate::epiyear() ARROW-14825 - [C++] Temporal component extraction function for extracting epiyear ARROW-14826 - [R] Implement bindings for lubridate::dst() ARROW-14827 - [C++] Temporal component extraction function for extracting dst indicator ARROW-14893 - [C++] Allow creating GCS filesystem from URI ARROW-14927 - [CI] Upgrade Fedora 33 to Fedora 35 ARROW-14942 - [R] Bindings for lubridate’s dpicoseconds, dnanoseconds, desconds, dmilliseconds, dmicroseconds ARROW-14943 - [R] Bindings for lubridate’s ddays, dhours, dminutes, dmonths, dweeks, dyears ARROW-14944 - [R] Implement lubridate::make_difftime() ARROW-14963 - [Doc] Add copy button extension to 
code-blocks ARROW-14993 - [C++] Benchmark CSV writer ARROW-14997 - [Python][Doc] Add thread_count functions to API docs ARROW-15013 - [R] Expose concatenate at the R level ARROW-15015 - [R] Test / CI flag for ensuring all tests are run? ARROW-15020 - [R] Add bindings for new dataset writing options ARROW-15040 - [R] Enable write_csv_arrow to take a Dataset or arrow_dplyr_query as input ARROW-15061 - [C++] Add logging for kernel functions and exec plan nodes ARROW-15062 - [C++] Add memory information to current spans ARROW-15064 - [C++] Vectorize CheckStringHasNoStructuralChars in CSV writer ARROW-15066 - [C++] Enable use of non-bundled OpenTelemetry ARROW-15067 - [C++] Add tracing spans to the scanner ARROW-15080 - [Python][C++] Enable tuples conversion to interval ARROW-15089 - [C++][Compute] Implement kernel to lookup a MapArray item for a given key ARROW-15098 - [R] Add binding for lubridate::duration() and/or as.difftime() ARROW-15118 - [C++] Avoid bitmap buffer if all inputs are all valid for Scalar Kernels ARROW-15152 - [C++][Compute] Implement hash_list aggregate function ARROW-15156 - [Doc] Implement Tutorials for the Java Documentation ARROW-15157 - [Doc] New Contributors Guide v2 ARROW-15163 - [R] lubridate functions for 8.0.0 ARROW-15167 - [R] Improve efficiency of decimal casting ARROW-15168 - [R] Add S3 generics to create main Arrow objects ARROW-15178 - [Java][Docs] Java Tutorial: Developer Docs for Java ARROW-15180 - Document how to add JNI bindings for C++ features ARROW-15183 - [Python][Docs] Add Missing Dataset Write Options ARROW-15192 - [Java] Allow use of Jackson 2.12 and higher ARROW-15195 - [MATLAB] Enable GitHub Actions CI for MATLAB Interface on macOS ARROW-15197 - [C++] UTF-8 string repeat kernel ARROW-15212 - [C++] Handle suffix argument in joins ARROW-15215 - [C++] Consolidate kernel data-copy utilities between replace_with_mask, case_when, coalesce, choose, fill_null_forward, fill_null_backward ARROW-15223 - [C++] Implement Not Between 
ternary kernel ARROW-15238 - [C++] ARROW_ENGINE module with substrait consumer ARROW-15239 - [C++][Compute] Adding Bloom filter implementation ARROW-15258 - [C++] Easy options to create a source node from a table ARROW-15262 - [C++] Create a ToTable sink node ARROW-15281 - [C++] Implement ability to retrieve fragment filename ARROW-15282 - [C++][FlightRPC] Split data methods from the underlying transport ARROW-15294 - [R] Remove arrow-without-arrow and other Solaris hacks ARROW-15296 - [CI][GO] Add Go staticcheck linting to CI lint job ARROW-15299 - [R] investigate {remotes} dependencies “soft” vs TRUE ARROW-15313 - [C++][Java][FlightRPC] Implement type info method to flight-sql ARROW-15314 - [C++][Java][FlightRPC] Add missing metadata on Arrow schemas returned by Flight SQL ARROW-15321 - [Dev][Python] Also numpydoc-validate Cython-generated methods ARROW-15346 - [Doc][Guide] Arrow codebase - minor corrections ARROW-15347 - [Doc][Guide] Update testing section in new contributors guide ARROW-15348 - [Doc][Guide] Lifecycle of a PR - minor corrections ARROW-15349 - [Doc][Guide] Existing Contributors page - update ARROW-15350 - [Doc][Guide] Add styling and linters info section ARROW-15351 - [Doc][Guide] Additional tutorial for R bindings ARROW-15352 - [Doc][Guide] R package and make clean ARROW-15353 - [Doc][Guide] Intro into CI topic and link to the existing docs ARROW-15364 - [Python] Update filesystem entry in read docstrings to reflect current behaviour ARROW-15366 - [Docs] Automate incrementing of package version for R and non-R version switchers ARROW-15367 - [Python] Improve Classes and Methods Docstrings for 8.0.0 ARROW-15369 - [Doc] Tweak example to use the new support for str pointers ARROW-15374 - [C++][FlightRPC] Add support for MemoryManager in data methods ARROW-15389 - [C++][Dev] Improve Array preview in GDB plugin ARROW-15400 - [Go][CI] Exercise builds on arm machines ARROW-15410 - [C++][Datasets] Improve memory usage of datasets API when scanning 
parquet ARROW-15418 - [Go][Flight] Update gRPC version, hide impl details ARROW-15425 - [C++] Add delta dictionaries in file format to integration tests ARROW-15428 - [Python] Address docstrings in Parquet classes and functions ARROW-15429 - [Python] Address docstrings for ChunkedArray class, methods, attributes and constructor ARROW-15431 - [Python] Address docstrings in Schema ARROW-15432 - [Python] Address CSV docstrings ARROW-15440 - [Go] Implement ‘unpack_bool’ with Arm64 GoLang Assembly ARROW-15450 - [Python][Wheel] Flight test receives SIGKILL during in macOS tests ARROW-15462 - [GLib] Add GArrow{Month,DayTime,MonthDayNano}Interval{Scalar,Array,ArrayBuilder} ARROW-15468 - [R][CI] A crossbow job that tests against DuckDB’s dev branch ARROW-15471 - [R] ExtensionType support in R ARROW-15472 - [Website] Add Flight SQL blog post ARROW-15477 - [C++][Python] Allow to create (FixedSize/Large)ListArray from arrays and type ARROW-15480 - [R] Expand on schema/colnames mismatch error messages ARROW-15483 - [Release] Revamp the verification scripts ARROW-15487 - [FlightRPC][C++][GLib][Python][R] Implement FlightClient::Close ARROW-15489 - [R] Expand RecordBatchReader usability ARROW-15491 - [Website] Rotate PMC chair for 2022 ARROW-15497 - [C++][Homebrew] Use Clang Tools 12 ARROW-15501 - [Java] Support validating decimal vectors ARROW-15503 - [GLib][Release] Avoid deprecation warning ARROW-15505 - [C++][Compute] Support null type in product aggregation ARROW-15506 - [C++][Compute] Support Null type in hash_sum/hash_product/hash_mean ARROW-15510 - [C++][FlightRPC] Add CUDA memory manager support to benchmark ARROW-15515 - [C++] Update ExecPlan example code and documentation with new options ARROW-15517 - [R] Use WriteNode in write_dataset() ARROW-15523 - [Python] Support for Datasets as inputs of Joins ARROW-15524 - [Python] Make joins able to receive Tables as inputs ARROW-15525 - [Python] Make joins able to output a Table as result. 
ARROW-15526 - [Python] Support for Dataset.join ARROW-15527 - [Python] Make Joins able to execute the join operation ARROW-15532 - [C++] Fix unused warning for StringClassifyDoc ARROW-15542 - [GLib][Parquet] Add GParquet*Metadata ARROW-15550 - [C++] Add optional debug memory checks ARROW-15551 - [C++][FlightRPC] Update gRPC TLS options detection for 1.43 ARROW-15552 - [Doc][Format] Remove erroneous mention of base64 ARROW-15556 - [Release] Add a script to update Homebrew packages ARROW-15569 - [Packaging][deb] Use gem instead of apt to install gobject-introspection gem ARROW-15570 - [CI][Nightly] Drop centos-8 R nightly job ARROW-15572 - [Java][Docs] Add Installation section to Java documentation ARROW-15573 - [Java][Doc] Document Apache Arrow memory management ARROW-15574 - [Java][Doc] Review existing documentation ARROW-15575 - [Java][Doc] Datasets Tutorial ARROW-15576 - [Java][Doc] Document VectorSchemaRoots for 2D data ARROW-15577 - [Java][Doc] Add Arrow Flight documentation ARROW-15578 - [Java][Doc] Document C Data Interface and how to interface with other languages ARROW-15579 - [C++] Add MemoryManager::CopyBuffer(const Buffer&amp;) ARROW-15594 - [C++][FlightRPC] Add Deserialize(const Buffer&amp;) to various Flight types ARROW-15595 - [Release][Ruby] Add support for MFA ARROW-15600 - [C++][FlightRPC] Add minimal Flight SQL query example ARROW-15601 - [Docs][Release] Update post release script to move stable docs + keep dev docs ARROW-15605 - [CI][R] Keep using old macos runners on our autobrew CI job ARROW-15606 - [CI][R] Add brew build that exercises the R package ARROW-15609 - [C++][Compute] Support hash_aggregate with only keys ARROW-15611 - [C++] Migrate arrow::ipc::internal::json::ArrayFromJSON to Result&lt;&gt; ARROW-15614 - [C++] Add sqrt binary scalar kernel ARROW-15617 - [Doc][C++] Document environment variables ARROW-15619 - [C++] Temporal component extraction function for extracting is_leap_year indicator ARROW-15623 - [C++][Python] Update 
developers/python.rst (console blocks + “” in archery install) ARROW-15625 - [C++] Convert underscore to hyphen in example executable names ARROW-15629 - [GLib] Add garrow_{,large_}string_array_builder_append_string_len() ARROW-15630 - [Release][MSYS2] Update reverse dependencies too ARROW-15631 - [Packaging][RPM] Add major version to libs packages ARROW-15632 - [R] Prune the bundled libarrow source ARROW-15633 - [R] Skip s3_bucket example that requires network connection ARROW-15634 - [C++][Packaging] Improve compilation speed for java-jars nighlty build for MacOS ARROW-15643 - [C++] Allow selecting subset of fields of a StructArray via cast ARROW-15650 - [MATLAB] Rename the MEX gateway function ARROW-15653 - [R][CI] Fix tests of bundled cpp source ARROW-15656 - [C++][R] Make valgrind builds slightly quicker ARROW-15657 - [C++][Java] Upgrade Apache ORC to 1.7.3 ARROW-15665 - [C++] Fix error_is_null in strptime with invalid inputs ARROW-15665 - [C++] Add error handling option to StrptimeOptions ARROW-15670 - [C++/Python/Packaging] Update conda pinnings and enable GCS on Windows ARROW-15672 - [C++] Enable CSV writer to control the field delimiter ARROW-15673 - [R] Error gracefully if DuckDB isn’t installed ARROW-15680 - [C++] Temporal floor/ceil/round should accept week_starts_monday when rounding to multiple of week ARROW-15682 - [CI] Github starting to migrate “windows-latest” tag from windows 2019 to windows 2022 ARROW-15683 - [Website][Rust][DataFusion] Make a 7.0.0 release announcement blog ARROW-15690 - [Dev] Update GitHub Actions workflows that hardcode master as default ARROW-15692 - [Dev] Update release scripts to use default branch ARROW-15694 - [Dev] Update apache/arrow-site GitHub Actions deploy.yml website deployment workflow to support being triggered when pushing to main ARROW-15697 - [R] Add logo and meta tags to pkgdown site ARROW-15698 - [Integration] Privatized some code in tests ARROW-15701 - [R] month() should allow integer inputs ARROW-15706 - 
[C++][FlightRPC] Implement a UCX transport ARROW-15707 - [C++][FlightRPC] Make Flight tests more resuable across transports ARROW-15708 - [R][CI] skip snappy encoded parquets on clang sanitizer ARROW-15709 - [C++] Compilation of ARROW_ENGINE fails if doing an “inline” build ARROW-15709 - [C++] Revert change ARROW-15709 - [C++] Compilation of ARROW_ENGINE fails if doing an “inline” build ARROW-15712 - [R] Add a type method for Expression objects ARROW-15714 - [C++][Gandiva] Increase the protobuf recursion limit in gandiva protobuf parser ARROW-15717 - [Docs] Add hash_one to the documentation ARROW-15721 - [Docs][FlightRPC] Add Flight/Flight SQL to subprojects ARROW-15722 - [Java] Improve error message for nested types with incorrect children ARROW-15726 - [C++] If a projected_schema is not supplied but a bound projection expression is then we should use that to infer the projected_schema ARROW-15739 - [C++] Bump xsimd to latest version ARROW-15740 - [C++][Compute] Benchmark element wise min/max ARROW-15741 - [Doc][Format] Clarify thread-safety of C stream interface ARROW-15742 - [Go] Implement ‘bitmap_neon’ with Arm64 GoLang Assembly ARROW-15744 - [Gandiva][C++] Add NEGATIVE function for interval types ARROW-15749 - [Ruby] Add support for #values of Month Interval Type ARROW-15750 - [Ruby] Add support for #raw_records of Month Interval Type ARROW-15755 - [Java] Support Java 17 ARROW-15763 - [C++] Improve CSV writer performance ARROW-15766 - [R] Implement bindings for lubridate::duration() ARROW-15769 - [C++] Generate less arithmetic kernels ARROW-15775 - [R] Clean up as.* methods to use build_expr() ARROW-15776 - [Python] Expose IpcReadOptions ARROW-15777 - [Python][Flight] Allow passing IpcReadOptions to FlightCallOptions ARROW-15781 - [Python] Release GIL in ensure_complete_metadata ARROW-15782 - [C++] Fix Findre2Alt.cmake to check RE2_ROOT variable first ARROW-15788 - [C++][FlightRPC] Prepare benchmark for alternative transports ARROW-15789 - [C++] Update 
OpenTelemetry to v1.2.0 ARROW-15795 - [Java] Add a getter for the timeZone in timestamp with timezone vectors ARROW-15796 - [Python] Pickling ParquetFileFragment shouldn’t fetch metadata ARROW-15799 - [R] Update as.Date() to support an origin different from epoch ARROW-15800 - [R] Implement bindings for lubridate::as_date() and lubridate::as_datetime() ARROW-15801 - [R] Implement bindings for lubridate date-time helpers ARROW-15802 - [R] bindings for lubridate::make_datetime() and lubridate::make_date() ARROW-15810 - [CI][Nightly] Check R related image strictly ARROW-15814 - [R][DOCS] Improve documentation for cast() ARROW-15817 - [R] Use TableSourceNode instead of InMemoryDataset ARROW-15818 - [R] Implement initial Substrait consumer in the R bindings ARROW-15820 - [C++][Doc] Add table_source to streaming_execution.rst &amp; clarify parameter name ARROW-15821 - [JS] Fix paths to sourcemaps in directories ARROW-15823 - [C++][Python] Add a method to convert a Table to a RecordBatchReader ARROW-15824 - [Python] Make pyarrow.parquet a package ARROW-15827 - [R] Improve UX of write_dataset(…, max_rows_per_group) ARROW-15831 - [Java] Upgrade Flight dependencies ARROW-15841 - [R] Implement SafeCallIntoR to safely call the R API from another thread ARROW-15844 - [Release][Packaging] Use ASCII format for detached sign ARROW-15846 - [Format] Clarify presence of struct validity bitmap ARROW-15850 - [C++] Engine substrait headers missing from install ARROW-15854 - [C++] Refine CSV writer code ARROW-15860 - [Python] Document RecordBatchReader ARROW-15864 - [Java][Docs] Update Arrow nightly Maven releases documentation ARROW-15866 - [Packaging][Ubuntu] Drop support for Ubuntu 21.04 ARROW-15870 - [Python] Start to raise deprecation warnings for use_legacy_dataset=True in parquet.read_table ARROW-15871 - [Python] Start raising deprecation warnings for ParquetDataset keywords that won’t be supported with the new API ARROW-15873 - [CI] Migrate from Ubuntu 21.04 to 22.04 ARROW-15875 
- [R] Expose ReadMetadata for input streams ARROW-15882 - [Python][CI] Ensure we are running hypothesis tests in the nightly hypothesis build ARROW-15885 - [Ruby] Add support for #values of DayTime Interval Type ARROW-15886 - [Ruby] Add support for #raw_records of DayTimeInterval type ARROW-15890 - [CI][Python] Use venv instead of virtualenv ARROW-15896 - [Python][C++] Add errno detail for filesystem “file not found” errors ARROW-15900 - [C++] Support Substrait reading of a Feather-format local file ARROW-15902 - [Website] Add new committers: Raphael Taylor-Davies, Wang Xudong, Yijie Shen, Kun Liu ARROW-15916 - [Packaging][RPM] Add support for CentOS Stream 8 ARROW-15917 - [Java][Docs] Document how to use Flight artifacts ARROW-15918 - [Ruby][{day:, millisecond:}, …] ) ARROW-15919 - [C++] Add function not commutative with timestamps &amp; duration maths ARROW-15921 - [Format][FlightRPC][C++][Java] Clarify interpretation of FlightEndpoint.locations ARROW-15923 - [Packaging][Linux] Enable GCS support ARROW-15924 - [Ruby] Add support for #values of MonthDayNanoInterval type ARROW-15925 - [Ruby] Add support for #raw_records of MonthDayNanoInterval type ARROW-15931 - [Website] Add explicit Apache LICENSE.txt and NOTICE.txt files to apache/arrow-site repository ARROW-15932 - [C++][FlightRPC] Add more tests to the common Flight suite ARROW-15934 - [Python] Expose write_batch_size in python ARROW-15935 - [Ruby] Add test for Arrow::DictionaryArray#values ARROW-15939 - [Python] Add pickle support for JSON options classes ARROW-15940 - [Gandiva][C++] Add NEGATIVE function for decimal data type ARROW-15941 - [C++] Allow overriding the number of IO threads with an environment variable ARROW-15944 - [Docs][C++] Document dependencies for building on Arch Linux ARROW-15947 - [R] rename_with s3 method for arrow_dplyr_query ARROW-15950 - [Go] Lift BitSetRunReader to internal/bitutils package ARROW-15952 - [C++] Document Visitors and finish Scalar::Accept ARROW-15955 - 
[Packaging][RPM] Add missing json-devel to CentOS Stream 8 build image ARROW-15956 - [Java] Consolidate Flight integration testing code ARROW-15963 - [Go][Parquet] simplify ReaderAtSeeker interface ARROW-15968 - [C++] Update AsyncGenerator semantics to emit a terminal item only after all outstanding futures have completed ARROW-15972 - [Java][Doc] Add Getting Started section ARROW-15974 - [C++] Migrate flight/types.h header definitions to use Result&lt;&gt; ARROW-15975 - [C++] Document type traits and inline visitors ARROW-15976 - [C++] Clean up commenting on execution plan example ARROW-15979 - [C++][Doc] Expose more functions of parquet::WriterProperties in doc ARROW-15984 - [C++] Change RecordBatchReader API to use Result&lt;&gt; ARROW-15989 - [R] rbind &amp; cbind for Table &amp; RecordBatch ARROW-15994 - [C++] Back out taskify changes ARROW-15995 - [GO] Improve ‘sum_float64_neon’ performance ARROW-15998 - [Docs][CI] Use sphinx-design tabs instead of sphinx-tabs ARROW-15999 - [Python] Turn deadlines off for the test using hypothesis ARROW-16007 - [R] grepl bindings return FALSE for NA inputs ARROW-16011 - [R] CI jobs should fail if lintr picked up issues ARROW-16014 - [C++] Create more benchmarks for measuring expression evaluation overhead ARROW-16026 - [C++] Add support for the serial executor to expose an async generator as an iterable ARROW-16032 - [C++] Migrate FlightClient API to Result&lt;&gt; ARROW-16033 - [C++] Pass schema to consuming sink node ARROW-16038 - [R] different behavior from dplyr when mutate’s .keep option is set ARROW-16042 - [GO] Fix header file preprocessor issues ARROW-16044 - [Julia] Remove from apache/arrow ARROW-16046 - [Docs][FlightRPC][Python] Ensure Flight Python API is documented ARROW-16049 - [C++][FlightRPC] Fix Flight SQL’s ColumnMetadata constructor visibility ARROW-16053 - [C++][FlightRPC] Fix flaky test TestAuthHandler.FailUnauthenticatedCalls ARROW-16055 - [C++][Gandiva] Skip unnecessary work during cache hit when using 
object code cache ARROW-16057 - [Python] Address docstrings for RecordBatch class, methods, attributes and constructor ARROW-16058 - [Python] Address docstrings for Table class, methods, attributes and constructor ARROW-16059 - [Python] Address docstrings for Tensor class ARROW-16061 - [R][CI] Speed up windows 3.6 builds ARROW-16062 - [Python] Move libarrow_python include definitions to its own file ARROW-16064 - [Java][C++][FlightRPC] Add missing column metadata for type name on FlightSQL ARROW-16065 - [FlightRPC][Docs] Improve Flight documentation ARROW-16068 - [C++][FlightRPC] Migrate remaining flight API to use Result&lt;&gt; ARROW-16069 - [C++][FlightRPC] Refactor out gRPC error code handling ARROW-16073 - [R] clean-up date time unit testing once tzdb is available on Windows ARROW-16074 - [Docs] Document joins ARROW-16079 - [Python] Address docstrings in Parquet schema and metadata ARROW-16082 - [Flight][Go] Allow specifying a net.Listener ARROW-16098 - [JS] Don’t return null in table and recordbatch iterators ARROW-16102 - [C++] Add support for building with system gRPC and bundled GCS ARROW-16104 - [Packaging] Add support for Ubuntu 22.04 ARROW-16105 - [C++][Gandiva] Add support for LLVM 14 ARROW-16109 - [Python] Add dataset mark to test in order to avoid failure ARROW-16114 - [Docs][Python] Document Parquet FileMetaData ARROW-16117 - [JS] Improve decode UTF8 performance ARROW-16120 - [Python] ParquetDataset deprecation: change Deprecation to FutureWarnings ARROW-16121 - [Python] Deprecate the (common_)metadata(_path) attributes of ParquetDataset ARROW-16122 - [Python] Change use_legacy_dataset default and deprecate no-longer supported keywords in parquet.write_to_dataset ARROW-16128 - [C++][FlightRPC] Fix Flight SQL static build on Windows ARROW-16132 - [Packaging][deb][CUDA] Relax libcuda1 dependency ARROW-16154 - [R] Errors which pass through handle_csv_read_error() and handle_parquet_io_error() need better error tracing ARROW-16156 - [R] Clarify warning 
message for features not turned on in .onAttach() ARROW-16158 - [C++][R] Rename ARROW_ENGINE to ARROW_SUBSTRAIT ARROW-16166 - [C++][Compute] Utilities for assembling join output ARROW-16167 - [JS] refactor get and set visitors ARROW-16173 - [C++] Add benchmarks for temporal functions/kernels ARROW-16176 - [Release][C#] Use .NET 6.0 on Ubuntu 22.04 ARROW-16186 - [C++][GANDIVA] Add alias and tests for decimal, quarter, xor, etc… ARROW-16187 - [Go][Parquet] Properly utilize BufferedStream and buffer size when reading ARROW-16192 - [Go] Remove deprecated aliases for v8 ARROW-16193 - [Go] Replace CPU discovery package with golang.org/x/sys/cpu module ARROW-16198 - [CI][Packaging][Python] Update VCPKG version ARROW-16201 - [R] SafeCallIntoR on 3.4 ARROW-16203 - [Release] Remove all old artifacts on release ARROW-16204 - [C++][Dataset] Default error existing_data_behaviour for writing dataset ignores a single file ARROW-16208 - [JS] Upgrade deps ARROW-16210 - [JS] Implement tableFromJSON and support struct vector in vectorFromArray ARROW-16214 - [GLib][Parquet] Add GParquetFileMetadata ARROW-16229 - [CI] Temporary remove turbodbc tests from nightly tests ARROW-16232 - [C++] Include OpenTelemetry in LICENSE.txt ARROW-16240 - [Python] Support row_group_size/chunk_size keyword in pq.write_to_dataset with use_legacy_dataset=False ARROW-16242 - [Go] xerrors.Errorf and xerrors.Is are deprecated, fix linting ARROW-16245 - [GLib][Parquet] Add GParquetRowGroupMetadata ARROW-16247 - [GLib] Add GArrowGCSFileSystem ARROW-16250 - [GLib][Parquet] Add GParquetColumnChunkMetadata ARROW-16251 - [GLib][Parquet] Add GParquetStatistics and its family ARROW-16252 - [CI][Archery] Highlight number of failed builds on nightly reports ARROW-16256 - [Docs] Document which format version is supported ARROW-16257 - [R] Break-up as_date and as_datetime into individual functions ARROW-16264 - [C++][CI] Valgrind timeout in arrow-compute-hash-join-node-test ARROW-16276 - [R] Release News ARROW-16277 - 
[Python] No builds for macOS arm64. ARROW-16280 - [C++] Avoid copying shared_ptr in Expression::type() ARROW-16282 - [CI] [C#] Verifiy release on c-sharp has been failing since upgrading ubuntu to 22.04 ARROW-16283 - [Go] Cleanup panics in new Buffered Reader ARROW-16284 - [Python][Packaging] Use delocate-fuse to create universal2 wheels ARROW-16291 - [Java]: Support JSE17 for Java Cookbooks ARROW-16292 - [Java][Doc] Upgrade java documentation for JSE17/JSE18 ARROW-16294 - [C++] Improve performance of parquet readahead ARROW-16296 - [GLib] Add missing casts for GArrowRoundMode ARROW-16303 - [C++] Check EINTR in file IO ARROW-16308 - [CI] Upgrade windows runner version as windows-2016 is deprecated. ARROW-16314 - [Python][CI] Skip running cython tests in windows verification builds ARROW-16325 - [R] Add task for R package with gcc12 ARROW-16327 - [Java][CI]: Add support for Java 17 CI process ARROW-16334 - [Archery][CI] Use build links on nightly report emails instead of branch link ARROW-16338 - [CI] Update azure windows image as vs2017-win2016 is retired ARROW-16347 - [Release] Escape backtick in verification script ARROW-16349 - [Release][Packaging][RPM] Remove ed25519 keys from KEYS ARROW-16350 - [Dev][Archery] Add missing newline in error message comment ARROW-16352 - [GLib] Fix wrong enums.h install location ARROW-16354 - [Packaging][RPM] Update artifacts pattern list ARROW-16355 - [Dev] Update verify-release-candidate.sh to compile cpp in parallel ARROW-16373 - [Docs][CI] Small improvements to CI documentation ARROW-16387 - [C++] Add -Wshorten-64-to-32 to list of CHECKIN warnings tested by clang ARROW-16390 - [C++] Dataset initialization could segfault if called simultaneously ARROW-16408 - [C++] Add support for DATE type in SQLite FlightSQL example ARROW-16411 - [Website] Migrate to Matomo from Google Analitics ARROW-16412 - [Java] Updated README to reference compilation docs ARROW-16416 - [C++] Support cast-function in Substrait ARROW-16428 - [Release] Add 
prefix to ENV variables ARROW-16445 - [R] [Doc] Add a short summary for the Installing the Arrow package on Linux article ARROW-16455 - [CI] [Packaging] Anaconda storage size exceeded for linux-ppc64le" />
<link rel="canonical" href="https://arrow.apache.org/release/8.0.0.html" />
<meta property="og:url" content="https://arrow.apache.org/release/8.0.0.html" />
<meta property="og:site_name" content="Apache Arrow" />
<meta property="og:image" content="https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png" />
<meta property="og:type" content="article" />
<meta property="article:published_time" content="2024-04-29T17:30:49-04:00" />
<meta name="twitter:card" content="summary_large_image" />
<meta property="twitter:image" content="https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png" />
<meta property="twitter:title" content="Apache Arrow 8.0.0 Release" />
<meta name="twitter:site" content="@ApacheArrow" />
<script type="application/ld+json">
{"@context":"https://schema.org","@type":"BlogPosting","dateModified":"2024-04-29T17:30:49-04:00","datePublished":"2024-04-29T17:30:49-04:00","description":"Apache Arrow 8.0.0 (6 May 2022) This is a major release covering more than 3 months of development. Download Source Artifacts Binary Artifacts For AlmaLinux For Amazon Linux For CentOS For C# For Debian For Python For Ubuntu Git tag Contributors This release includes 636 commits from 127 distinct contributors. $ git shortlog -sn apache-arrow-7.0.0..apache-arrow-8.0.0 43 Antoine Pitrou 40 David Li 39 Sutou Kouhei 36 Alenka Frim 29 Weston Pace 25 Rok 22 Dragoș Moldovan-Grünfeld 22 Joris Van den Bossche 20 Jonathan Keane 20 Krisztián Szűcs 19 Raúl Cumplido 17 Will Jones 16 Dewey Dunnington 13 david dali susanibar arce 12 Matthew Topol 10 Nic Crane 10 Yibo Cai 10 Vibhatha Abeykoon 9 Dominik Moritz 9 SHIMA Tatsuya 8 Neal Richardson 8 Sanjiban Sengupta 8 okadakk 8 Jacob Wujciak-Jens 7 Tobias Zagorni 7 Alessandro Molina 7 William Butler 6 Dhruv Vats 5 Yuqi Gu 5 Johnnathan 4 Anthony Louis 4 Projjal Chanda 4 Fiona La 4 Micah Kornfield 4 Bryan Cutler 3 Yaron Gvili 3 Ian Alexander Joiner 3 Vinicius Roque 3 ViniciusSouzaRoque 3 emkornfield 3 Chenxi Li 2 Jorge C. Leitao 2 Jose Almeida 2 assignUser 2 Uwe L. 
Korn 2 Todd Farmer 2 Tim Van Wassenhove 2 Rob Saccoccio 2 Alvin Chunga 2 Stephanie Hazlitt 2 Jeroen van Straten 2 Kyle Barron 2 Sasha Krassovsky 2 Eduardo Ponce 2 Salonijain27 2 liyafan82 2 JabariBooker 2 michalursa 2 Matthijs Brobbel 2 Danielle Navarro 2 Maya Anderson 2 Even Rouault 2 Joost Hoozemans 1 zhixingheyi-tian 1 SAm Albers 1 Alex McRae 1 Alfred Mountfield 1 Ali Amiri 1 Andrew Higgins 1 Andrew Strelsky 1 Andrii Oriekhov 1 Andy Teucher 1 Benjamin Kietzman 1 Benson Muite 1 Bob Matcuk 1 Chris Casola 1 Colin Jermain 1 Corentin 1 Dongjoon Hyun 1 Dooyoung-Hwang 1 Edward Visel 1 Elena Henderson 1 Frederic Branczyk 1 Haowei Yu 1 Hongze Zhang 1 Isaac Brodsky 1 James Duong 1 Johan Mattsson 1 João Pedro 1 Kazuaki Ishizaki 1 Kevin Gurney 1 Kian-Meng Ang 1 Laurent Goujon 1 Leo Di Donato 1 Li Jin 1 Min-Young Wu 1 MrMallIronmaker 1 Nate Clark 1 Nicolas Moreau 1 Nivia 1 Pradeep Garigipati 1 Rafael Telles 1 Romain Francois 1 Ryan Blue 1 Aakash Kumar 1 Sam Albers 1 Siddhant Rao 1 Stephan T. Lavavej 1 Thomas J. Fan 1 Tom Drabas 1 William Ayd 1 Xinyu Zeng 1 Yue Ni 1 Zac Farrell 1 bkmgit 1 burntcustard 1 fabiencelier 1 hankvyang 1 hzuo 1 iurysalino 1 jvictorhuguenin 1 jwijffels 1 karldw 1 marcbernot 1 okadak 1 ptaylor 1 shegden Patch Committers The following Apache committers merged contributed patches to the repository. 
$ git shortlog -csn apache-arrow-7.0.0..apache-arrow-8.0.0 105 David Li 102 Antoine Pitrou 84 Sutou Kouhei 72 Jonathan Keane 47 Krisztián Szűcs 46 Joris Van den Bossche 30 Weston Pace 25 Nic Crane 23 Pindikura Ravindra 22 Matthew Topol 16 Alessandro Molina 15 Yibo Cai 15 Neal Richardson 14 Dominik Moritz 5 Bryan Cutler 5 GitHub 3 Ian Cook 2 liyafan82 2 Micah Kornfield 1 Matt Topol 1 Li Jin 1 Chao Sun Changelog Apache Arrow 8.0.0 (2022-05-06) Bug Fixes ARROW-5248 - [Python] support zoneinfo / dateutil timezones ARROW-7350 - [Python] Decode parquet statistics as scalars ARROW-9664 - [Python] Array/ChunkedArray.to_pandas do not support types_mapper keyword ARROW-11415 - [R] map_batches wouldn’t accept a dataset as an argument ARROW-13168 - [C++][R] Enable runtime timezone database for Windows ARROW-13594 - [CI] Enable nightly turbodbc builds again ARROW-13922 - [Python] Fix ParquetDataset throw error when len(path_or_paths) == 1 ARROW-14047 - [C++] [Parquet] FileReader returns inconsistent results on repeat reads ARROW-14215 - [R][CI] Conda Windows builds failing due to space in library name ARROW-14256 - [CI][Package] Re-enable disabled conda packaging builds ARROW-14389 - [C++][Gandiva] Fix performance bug with LIKE expressions ARROW-14638 - [C++][R] Unknown C compiler / ccache on Arch Linux ARROW-14647 - [JS] fix bignumToNumber for negative numbers ARROW-14665 - [JAVA] fix JdbcToArrow ResultSet iteration bug ARROW-14708 - [C++] Adding missing abseil dependencies to enable static flight build ARROW-14908 - [C++][R] Dataset hash join segfaults on Windows ARROW-14911 - [C++] arrow-compute-hash-join-node-test failed ARROW-14960 - [C++] Add exception to Arrow style guide based on changes in Google style guide that we are not adopting ARROW-15018 - [Python] DataFrame Index modified during Feather serialization round trip ARROW-15092 - [R] Support create_package_with_all_dependencies() on non-linux systems ARROW-15253 - [Python] Error in to_pandas for empty dataframe with 
index with extension type ARROW-15272 - [Java] Add cleanup failures as suppressed in ArrowVectorIterator#create ARROW-15291 - [C++][Python] Segfault in StructArray.to_numpy and to_pandas if it contains an ExtensionArray ARROW-15312 - [R][C++] filtering a Parquet dataset with is.na() misses some rows ARROW-15401 - [Python] Gdb tests are failing on windows and apple M1 ARROW-15426 - [C++][Gandiva] Update InExpressionNode validation ARROW-15444 - [C++] Compilation with GCC 7.5 fails in aggregate_basic.cc ARROW-15465 - [Python] Add some missing parquet marks in dataset tests ARROW-15502 - [Java] Detect exceptional footer size in Arrow file reader ARROW-15504 - [Python][CI] Ensure that optional components are tested ARROW-15509 - [Go][Parquet] Parquet cmds crash ARROW-15511 - [Python][C++] Remove reference management in numpy indexer ARROW-15514 - [C++][Gandiva] Add flag to enable Gandiva Object Code ARROW-15520 - [C++] Qualify arrow_vendored::date::format() for C++20 compatibility ARROW-15533 - [C++] Check ARROW_WITH_OPENTELEMETRY in CI ARROW-15539 - [Archery] Add ARROW_JEMALLOC to build options ARROW-15541 - [Python] Bump the minimum Cython version ARROW-15544 - [Go][Parquet] Fix origin schema base64 decoding ARROW-15546 - [FlightRPC][C++] Remove quotes from cookie header ARROW-15555 - [Release] Don’t push the release tag since it already exists ARROW-15580 - [Python] Make pytz an actual optional dependency of PyArrow ARROW-15593 - [C++] Make after-fork ThreadPool reinitialization thread-safe ARROW-15598 - [C++][Gandiva] Avoid using hardcoded raw pointer addresses in generated code ARROW-15599 - [R] Convert a column as a sub-second timestamp from CSV file with the T col type option ARROW-15603 - [C++] Remove unused variables ARROW-15604 - [C++][CI] Sporadic ThreadSanitizer failure with OpenTracing ARROW-15607 - [C++] Fix incorrect CPUID flag for AVX detection ARROW-15626 - [GLib] Fix a bug that 
GArrowGIOInputStream may not read enough data ARROW-15627 - [R] Fix union dataset unify schema ARROW-15648 - [C++][Gandiva] Fix the size of the Gandiva cache ARROW-15651 - [JavaScript] Structs incorrectly initialise null values ARROW-15652 - [C++] Fix GDB pretty-printing from inside parquet namespace ARROW-15659 - [R] strptime should return NA (not error) with format mismatch ARROW-15664 - [C++] parquet reader Segfaults with illegal SIMD instruction ARROW-15667 - [R] Test development build with ARROW_BUILD_STATIC=OFF ARROW-15674 - [C++][Gandiva] Like function doesn’t properly handle patterns with special characters in certain cases ARROW-15677 - [R] calling invalidate() method on ArrowObjects causes subsequent segfault ARROW-15679 - [R] count should return an ungrouped dataframe ARROW-15688 - [C++] add_checked doesn’t error out on duration overflow ARROW-15699 - [C++][Gandiva] Fix implementation of left and right func… ARROW-15700 - [C++] Compilation error on Ubuntu 18.04 ARROW-15705 - [JavaScript] Allowing appending null on children in a StructBuilder ARROW-15710 - [C++] Intermittent deadlock on arrow-threading-utility-test ARROW-15715 - [Go] ipc trim value offsets on arrays ARROW-15718 - [C++] Increase thread limit to work around thread issues ARROW-15720 - [CI] Fix nightly dask build (skip failing test due to wrong usage of Array.to_pandas) ARROW-15723 - [Python] Segfault orcWriter write table ARROW-15727 - [Python] Allow converting lists of MonthDayNano intervals to Pandas ARROW-15728 - [Python] Reduce entropy for zstd test_ipc ARROW-15743 - [R] skip not connected up to skip_rows on open_dataset despite error messages indicating otherwise ARROW-15746 - [Release][Java] Add missing artifacts to tasks.yml ARROW-15748 - [Python] Round temporal options default unit is day but documented as second. 
Follow-up ARROW-15748 - [Python] Round temporal options default unit is day but documented as second ARROW-15757 - [Python] Missing bindings for existing_data_behavior makes it impossible to maintain old behavior ARROW-15760 - [C++] Avoid hard dependency on git in cmake (download tarballs from github instead) ARROW-15770 - [CI] Not all python tests are running on CI jobs ARROW-15772 - [Go][Flight] Server Basic Auth Middleware/Interceptor wrongly base64 decode ARROW-15778 - [Java] set native endian to schema ARROW-15783 - [Python] Initialize static pandas data on write ARROW-15784 - [C++][Python] Removing flag enable_parallel_column_conversion which is no longer used ARROW-15791 - [Go] ipc FileWriter negative WaitGroup counter ARROW-15794 - [CI][Crossbow] Nightly builds failing due to error in types_mapper ARROW-15815 - [C++][Parquet] Fix undefined behaviour on invalid input ARROW-15819 - [R] R docs version switcher doesn’t work on Safari on MacOS ARROW-15830 - [C++] Ensure target directory exists before running Substrait generation ARROW-15837 - [C++][Python] Clarify documentation for ListArray::offsets() ARROW-15845 - [Python][Packaging] Fix macOS wheel builds ARROW-15847 - [Python][CI] Ensure we have a nightly Python build with parquet encryption disabled ARROW-15847 - [Python] Building with Parquet but without Parquet encryption fails ARROW-15848 - [Gandiva][C++] Fix function istrue and is not true ARROW-15851 - [C++] Enable RE2 when building with gRPC ARROW-15852 - [JS] Fix error thrown by Table.getByteLength() ARROW-15857 - [R] rhub/fedora-clang-devel fails to install ‘sass’ (rmarkdown dependency) ARROW-15863 - [Packaging][C++][Python] Fix conda package builds ARROW-15869 - [C++] Fix Valgrind failure (uninitialized value) ARROW-15888 - [Doc][Python] Modernize development instructions ARROW-15892 - [C++] Dataset APIs require s3:ListBucket Permissions ARROW-15895 - [R] R docs version switcher disappears &amp; reappears with back button on Chrome ARROW-15898 - 
[CI] Clean old conda nightlies more thoroughly ARROW-15905 - [Python][C++] Fix CMake warning when building PyArrow ARROW-15928 - [C++] Fix crashes and implement chunked array support for replace_with_mask function ARROW-15929 - [R] io_thread_count is actually the CPU thread count ARROW-15946 - [Go] Fix memory leak in pqarrow.NewColumnWriter when writing nested data ARROW-15949 - [Python] Do not require Parquet encryption when Parquet is disabled ARROW-15951 - [CI][Python] “Test wheel” step successful despite test error ARROW-15954 - [Java] Remove mac native netty kqueue dependency after upgrade ARROW-15960 - [C++] Fix crash on adaptive int builder edge cases ARROW-15962 - [C++][GANDIVA] Fix unhex errors return ARROW-15965 - [C++][Python] Add Scalar constructor of RoundToMultipleOptions to Python ARROW-15970 - [R][CI] Re-enable DuckDB dev tests ARROW-15973 - [CI] Split nightly reports into three: Tests, Packaging, Release ARROW-15982 - [Python] parquet.read_table fails to parse home directory path ARROW-15985 - [CI] Fix conda-clean failure when there are no files to delete ARROW-15987 - [C++][FlightRPC] Work around arrow-flight-test crash on AppVeyor ARROW-15993 - [CI] Add sphinx-tabs to ci/conda_env_sphinx.txt ARROW-16012 - [C++] Retry S3 request in tests when Minio not fully initialized ARROW-16013 - [C++][Python] Signed overflow when using negative stride in NumPyStridedConverter ARROW-16016 - [C++] Fix recursive ccache invocation error ARROW-16019 - [C++] Minimize chances of Minio connect errors ARROW-16021 - [C++] arrow-compute-hash-join-node-test timeout on MinGW ARROW-16025 - [Python][C++] Fix segmentation fault when closing ORCFileWritter ARROW-16031 - [C++][Gandiva] Fix Soundex errors generate ARROW-16035 - [Java] Handling empty JDBC ResultSet ARROW-16043 - [C++][Filesystem][S3] Add missing empty content for creating directory ARROW-16048 - [Python] Avoid exposing null buffer address to the Python buffer protocol ARROW-16051 - [Gandiva][C++] Fix datediff 
regression build ARROW-16052 - [R] undefined global function %&gt;% ARROW-16060 - [C++] subtract_checked support for timestamp(“s”) and date32 ARROW-16071 - [R] More undefined global functions ARROW-16078 - Upgrade bundled zlib to 1.2.12 ARROW-16099 - [JS] RecordBatches that are compressed should throw an error ARROW-16107 - [Dev][Archery] Fix archery crossbow latest-prefix query ARROW-16110 - [C++] GcsFileSystem::Make ignores IOContext ARROW-16113 - [Python] Partitioning.dictionaries in case of a subset of fields are dictionary encoded ARROW-16131 - [C++] support saving and retrieving custom metadata in batches for IPC file ARROW-16134 - [C++][GANDIVA] Fix Concat_WS errors return ARROW-16136 - [Gandiva][C++] Fix problem of the huge size of AddMappings function ARROW-16139 - [Python] Crash in tests/test_dataset.py::test_write_dataset_s3 ARROW-16143 - [Java] Upgrade jackson dependencies CVE-2020-36518 ARROW-16146 - [C++] arrow-gcsfs-test is timing out ARROW-16148 - [C++] TPC-H generator cleanup ARROW-16152 - [C++] Fix segfault with unknown functions in Substrait ARROW-16159 - [C++][Python] Allow FileSystem::DeleteDirContents to succeed if the directory is missing ARROW-16162 - [C++][FlightRPC] Fix Flight build on Ubuntu 18.04 ARROW-16163 - [Go] IPC FileReader leaks memory when used with ZSTD compression ARROW-16165 - [CI][Archery] Fix nightly query to crossbow to send reports ARROW-16169 - [C++][Gandiva] Fix empty string case in convert_fromUTF8_binary() ARROW-16181 - [CI][C++] Valgrind failure in TPCH node tests ARROW-16182 - [C++][CI] TPCH node tests timeout under ThreadSanitizer ARROW-16185 - [C++] Fix uninitialized output data in strptime kernel ARROW-16197 - [Docs] Fix broken link ARROW-16205 - [C++][FlightRPC] Don’t use constexpr std::initializer_list ARROW-16209 - [JS] Support setting arbitrary symbols on Tables ARROW-16215 - [C++][FlightRPC] Fix segfault in Flight test on Windows ARROW-16216 - 
[Python][FlightRPC] Fix test_flight.py when Flight is not available ARROW-16219 - [CI] Fix git config to prevent SCM tools failure ARROW-16223 - [C++] Fix decimal reduce scale rounding ARROW-16225 - [C++][Parquet] Fix length of encryption AAD random byte generation ARROW-16233 - [Python][Packaging] test_zoneinfo_tzinfo_to_string fails with zoneinfo._common.ZoneInfoNotFoundError on packaging wheels on Windows ARROW-16235 - [C++] Fix build failure, compiler warnings from MinGW ARROW-16236 - [Python] [Packaging] test_s3fs_limited_permissions_create_bucket fails with Permission denied on MAC OS wheel builds ARROW-16237 - [Docs] Apache Impala is no longer incubating ARROW-16238 - [C++] Fix nullptr dereference when pre-buffering IPC reads ARROW-16261 - [C++] Fix DeleteDirContents on HDFS with missing_dir_ok=True ARROW-16262 - [CI][Integration] Skip failing tests from kartothek integration ARROW-16278 - [CI] Fix git installation failure on brew ARROW-16293 - [CI][GLib] Make tests stable ARROW-16295 - [CI][Release] Use windows-2019 for verify-rc-source-windows ARROW-16300 - pc.sort_indices with nonexistent column throws malloc error ARROW-16301 - [C#][CI] Fix docker configuration for .NET 6 ARROW-16305 - [C++] Missed reference to ARROW_ENGINE during the rename ARROW-16306 - [CI] Fix Nightly verify rc on ubuntu ARROW-16307 - [Java][FlightRPC] Skip flaky test TestDoExchange.testClientCancel ARROW-16311 - [Java] Do not return table_schema column when it’s not requested ARROW-16312 - [C++][CI] Install tzdata in the windows verification builds ARROW-16313 - [R] Ensure assume_timezone options are always initialized ARROW-16332 - [Release][Java] Add artifacts uploaded verification ARROW-16336 - [Python] ParquetDataset - Hide internal (common_)metadata related warnings from the user ARROW-16374 - [R][C++] skip another snappy test during sanitizer runs ARROW-16375 - 
[R][CI] Pin test-r-devdocs on Windows to R 4.1 ARROW-16393 - [JAVA] Update option spec to accept value for query, catalog, schema and table ARROW-16413 - [Python] Certain dataset APIs hang with a python filesystem ARROW-16417 - [C++][Python] Segfault in test_exec_plan.py / test_joins ARROW-16419 - [Python] Properly wait for ExecPlan to finish ARROW-16442 - [Python][Dataset] Fix fragments of ORC Dataset to use FileFragment class PARQUET-2115 - [C++] Parquet dictionary bit widths are limited to 32 bits PARQUET-2118 - [C++] Don’t assume standard pointers PARQUET-2119 - [C++] Fix DeltaBitPackDecoder fuzzer found issue PARQUET-2123 - [C++] Fix invalid memory access in ScanFileContents PARQUET-2124 - [C++] Remove Parquet Dictionary DCHECK PARQUET-2130 - Fix crash in debug with non-standard key names. PARQUET-2131 - Number values decoded DCHECKs should be exceptions New Features and Improvements ARROW-1888 - [C++] Implement Struct Casts ARROW-3016 - [Docs][C++] Memory profiling with perf ARROW-3039 - [Go] Add support for DictionaryArray ARROW-3998 - [C++] Add TPC-H Generator ARROW-5107 - [Release] Validate non-RC source and binary artifacts ARROW-5598 - [Go] Rename array.Array{,Approx}Equal to array.{,Approx}Equal ARROW-6780 - [C++][Parquet] Support DurationType in writing/reading parquet (written as int64) ARROW-7174 - [Python] Expose parquet dictionary_pagesize_limit write parameter ARROW-7272 - [C++][Java][Dataset] JNI bridge between RecordBatch and VectorSchemaRoot ARROW-7914 - [Python] Allow pandas datetime as index for feather ARROW-9235 - [R] Support for connection class when reading and writing files ARROW-9378 - [Go] Support unsigned dictionary indices ARROW-9947 - [Python] High-level Python API for Parquet encryption of files. 
ARROW-10643 - [Python] Pandas&lt;-&gt;pyarrow roundtrip failing to recreate index for empty dataframe ARROW-10924 - [C++] Validate temporal data in ValidateArrayFull ARROW-11071 - [R][CI] Use processx to set up minio and flight servers in tests ARROW-11259 - [Python] Allow to create field reference to nested field ARROW-11989 - [C++][Python] Improve ChunkedArray’s complexity for the access of elements ARROW-12515 - [Dev][Wiki][Release] Fix and update Windows RC verify script ARROW-12516 - [C++][Gandiva] Implements castINTERVALDAY(varchar) and castINTERVALYEAR(varchar) functions ARROW-12659 - [C++] Support is_valid as a guarantee ARROW-12743 - [R] Add DESCRIPTION fields for dev dependencies ARROW-13185 - [MATLAB] Create a single MEX gateway function which delegates to specific C++ functions ARROW-13204 - [MATLAB] Update documentation for the MATLAB Interface to reflect latest CMake build system changes ARROW-13231 - [Doc] Add ORC documentation ARROW-13260 - [Doc] Host different released versions of the documentation + version switcher ARROW-13337 - [R] Define Math group generics ARROW-13375 - [C++][Gandiva] Implement POSITIVE and NEGATIVE Hive functions on Gandiva ARROW-13409 - [C++][FlightRPC] Expose server shutdown with deadline ARROW-13564 - [Dev] Check individual commit messages for “Co-authored-by:” tags when integrating a pull request ARROW-13616 - [R] Cheat Sheet Structure ARROW-13683 - [R] Test Windows UCRT R ARROW-13703 - [Python][R] Add bindings for new dataset writing options ARROW-13993 - [C++][Compute] Add hash_one aggregate function ARROW-14075 - [C++][CI] Add an appveyor CI job for VisualStudio 2019, non-conda ARROW-14091 - [C++] add(date, duration) -&gt; timestamp kernel ARROW-14093 - [C++] subtract(date, date) -&gt; duration kernel ARROW-14094 - [C++] add(timestamp, duration) -&gt; timestamp kernel ARROW-14095 - [C++] subtract(timestamp, duration) -&gt; timestamp kernel ARROW-14096 - [C++] add(time, duration) -&gt; time kernel ARROW-14097 - [C++] 
subtract(time, duration) -&gt; time kernel ARROW-14098 - [C++] subtract(time, time) -&gt; duration kernel ARROW-14099 - [C++] add(duration, duration) -&gt; duration kernel ARROW-14100 - [C++] subtract(duration, duration) -&gt; duration kernel ARROW-14101 - [C++] multiply(duration, integer) -&gt; duration kernel ARROW-14102 - [C++] divide(duration, integer) -&gt; duration kernel ARROW-14153 - [C++][Dataset] Add support for batch_size in the ORC Scanner ARROW-14168 - [R] Warn only once about arrow function differences ARROW-14169 - [R] altrep for factors ARROW-14199 - [R] bindings for format (where possible) ARROW-14266 - [R] Use WriteNode to write queries ARROW-14279 - [Docs] Initial attempt at describing structure of PyArrow library ARROW-14292 - [C++][Python] Join foundation for Tables ARROW-14293 - [Python] Basic Join functionality in PyArrow ARROW-14322 - [Doc] Add Python doc on how to connect Python to other languages ARROW-14333 - [C++][Compute] Add binary and LargeStringType tests to comparison kernels ARROW-14339 - [Docs] Add canonical url to the pkgdown (R) docs ARROW-14442 - [R] fix behaviour when converting timestamps with “” as tzone ARROW-14444 - [C++] Implement task-based model into the executable-pipelines. 
ARROW-14498 - [Docs] Make it possible to regenerate older docs with additional patch(es) ARROW-14502 - [C++][Gandiva] Add test DayOfMonth ARROW-14506 - [C++] Conda support for google-cloud-cpp ARROW-14553 - [Doc] Java Cookbook Release 1 ARROW-14579 - [Documentation] Document the CI ARROW-14591 - [R] Implement bindings for lubridate duration types ARROW-14612 - [C++] Support for filename-based partitioning ARROW-14631 - [C++][Gandiva] Implement Nextday Function ARROW-14651 - [Release][Archery] Add support for retrying download ARROW-14672 - [Docs] Document how to exchange data between Python and Java ARROW-14679 - [R][C++] Handle suffix argument in joins ARROW-14698 - [Docs][FlightRPC] Add API docs for Flight SQL ARROW-14702 - [Doc][C++] Document threading model ARROW-14745 - [R] Enable true duckdb streaming ARROW-14776 - [Website] Don’t include squashed commits in merge commit message ARROW-14798 - [C++][Python][R] Add container window to PrettyPrintOptions ARROW-14808 - [R] Implement bindings for lubridate::date() ARROW-14810 - [R] Implement bindings for lubridate’s date_decimal() and decimal_date() ARROW-14815 - [R] bindings for lubridate::semester() ARROW-14817 - [R] Implement bindings for lubridate::tz() ARROW-14823 - [R] Implement bindings for lubridate::leap_year ARROW-14824 - [R] Implement bindings for lubridate::epiyear() ARROW-14825 - [C++] Temporal component extraction function for extracting epiyear ARROW-14826 - [R] Implement bindings for lubridate::dst() ARROW-14827 - [C++] Temporal component extraction function for extracting dst indicator ARROW-14893 - [C++] Allow creating GCS filesystem from URI ARROW-14927 - [CI] Upgrade Fedora 33 to Fedora 35 ARROW-14942 - [R] Bindings for lubridate’s dpicoseconds, dnanoseconds, desconds, dmilliseconds, dmicroseconds ARROW-14943 - [R] Bindings for lubridate’s ddays, dhours, dminutes, dmonths, dweeks, dyears ARROW-14944 - [R] Implement lubridate::make_difftime() ARROW-14963 - [Doc] Add copy button extension to 
code-blocks ARROW-14993 - [C++] Benchmark CSV writer ARROW-14997 - [Python][Doc] Add thread_count functions to API docs ARROW-15013 - [R] Expose concatenate at the R level ARROW-15015 - [R] Test / CI flag for ensuring all tests are run? ARROW-15020 - [R] Add bindings for new dataset writing options ARROW-15040 - [R] Enable write_csv_arrow to take a Dataset or arrow_dplyr_query as input ARROW-15061 - [C++] Add logging for kernel functions and exec plan nodes ARROW-15062 - [C++] Add memory information to current spans ARROW-15064 - [C++] Vectorize CheckStringHasNoStructuralChars in CSV writer ARROW-15066 - [C++] Enable use of non-bundled OpenTelemetry ARROW-15067 - [C++] Add tracing spans to the scanner ARROW-15080 - [Python][C++] Enable tuples conversion to interval ARROW-15089 - [C++][Compute] Implement kernel to lookup a MapArray item for a given key ARROW-15098 - [R] Add binding for lubridate::duration() and/or as.difftime() ARROW-15118 - [C++] Avoid bitmap buffer if all inputs are all valid for Scalar Kernels ARROW-15152 - [C++][Compute] Implement hash_list aggregate function ARROW-15156 - [Doc] Implement Tutorials for the Java Documentation ARROW-15157 - [Doc] New Contributors Guide v2 ARROW-15163 - [R] lubridate functions for 8.0.0 ARROW-15167 - [R] Improve efficiency of decimal casting ARROW-15168 - [R] Add S3 generics to create main Arrow objects ARROW-15178 - [Java][Docs] Java Tutorial: Developer Docs for Java ARROW-15180 - Document how to add JNI bindings for C++ features ARROW-15183 - [Python][Docs] Add Missing Dataset Write Options ARROW-15192 - [Java] Allow use of Jackson 2.12 and higher ARROW-15195 - [MATLAB] Enable GitHub Actions CI for MATLAB Interface on macOS ARROW-15197 - [C++] UTF-8 string repeat kernel ARROW-15212 - [C++] Handle suffix argument in joins ARROW-15215 - [C++] Consolidate kernel data-copy utilities between replace_with_mask, case_when, coalesce, choose, fill_null_forward, fill_null_backward ARROW-15223 - [C++] Implement Not Between 
ternary kernel ARROW-15238 - [C++] ARROW_ENGINE module with substrait consumer ARROW-15239 - [C++][Compute] Adding Bloom filter implementation ARROW-15258 - [C++] Easy options to create a source node from a table ARROW-15262 - [C++] Create a ToTable sink node ARROW-15281 - [C++] Implement ability to retrieve fragment filename ARROW-15282 - [C++][FlightRPC] Split data methods from the underlying transport ARROW-15294 - [R] Remove arrow-without-arrow and other Solaris hacks ARROW-15296 - [CI][GO] Add Go staticcheck linting to CI lint job ARROW-15299 - [R] investigate {remotes} dependencies “soft” vs TRUE ARROW-15313 - [C++][Java][FlightRPC] Implement type info method to flight-sql ARROW-15314 - [C++][Java][FlightRPC] Add missing metadata on Arrow schemas returned by Flight SQL ARROW-15321 - [Dev][Python] Also numpydoc-validate Cython-generated methods ARROW-15346 - [Doc][Guide] Arrow codebase - minor corrections ARROW-15347 - [Doc][Guide] Update testing section in new contributors guide ARROW-15348 - [Doc][Guide] Lifecycle of a PR - minor corrections ARROW-15349 - [Doc][Guide] Existing Contributors page - update ARROW-15350 - [Doc][Guide] Add styling and linters info section ARROW-15351 - [Doc][Guide] Additional tutorial for R bindings ARROW-15352 - [Doc][Guide] R package and make clean ARROW-15353 - [Doc][Guide] Intro into CI topic and link to the existing docs ARROW-15364 - [Python] Update filesystem entry in read docstrings to reflect current behaviour ARROW-15366 - [Docs] Automate incrementing of package version for R and non-R version switchers ARROW-15367 - [Python] Improve Classes and Methods Docstrings for 8.0.0 ARROW-15369 - [Doc] Tweak example to use the new support for str pointers ARROW-15374 - [C++][FlightRPC] Add support for MemoryManager in data methods ARROW-15389 - [C++][Dev] Improve Array preview in GDB plugin ARROW-15400 - [Go][CI] Exercise builds on arm machines ARROW-15410 - [C++][Datasets] Improve memory usage of datasets API when scanning 
parquet ARROW-15418 - [Go][Flight] Update gRPC version, hide impl details ARROW-15425 - [C++] Add delta dictionaries in file format to integration tests ARROW-15428 - [Python] Address docstrings in Parquet classes and functions ARROW-15429 - [Python] Address docstrings for ChunkedArray class, methods, attributes and constructor ARROW-15431 - [Python] Address docstrings in Schema ARROW-15432 - [Python] Address CSV docstrings ARROW-15440 - [Go] Implement ‘unpack_bool’ with Arm64 GoLang Assembly ARROW-15450 - [Python][Wheel] Flight test receives SIGKILL during in macOS tests ARROW-15462 - [GLib] Add GArrow{Month,DayTime,MonthDayNano}Interval{Scalar,Array,ArrayBuilder} ARROW-15468 - [R][CI] A crossbow job that tests against DuckDB’s dev branch ARROW-15471 - [R] ExtensionType support in R ARROW-15472 - [Website] Add Flight SQL blog post ARROW-15477 - [C++][Python] Allow to create (FixedSize/Large)ListArray from arrays and type ARROW-15480 - [R] Expand on schema/colnames mismatch error messages ARROW-15483 - [Release] Revamp the verification scripts ARROW-15487 - [FlightRPC][C++][GLib][Python][R] Implement FlightClient::Close ARROW-15489 - [R] Expand RecordBatchReader usability ARROW-15491 - [Website] Rotate PMC chair for 2022 ARROW-15497 - [C++][Homebrew] Use Clang Tools 12 ARROW-15501 - [Java] Support validating decimal vectors ARROW-15503 - [GLib][Release] Avoid deprecation warning ARROW-15505 - [C++][Compute] Support null type in product aggregation ARROW-15506 - [C++][Compute] Support Null type in hash_sum/hash_product/hash_mean ARROW-15510 - [C++][FlightRPC] Add CUDA memory manager support to benchmark ARROW-15515 - [C++] Update ExecPlan example code and documentation with new options ARROW-15517 - [R] Use WriteNode in write_dataset() ARROW-15523 - [Python] Support for Datasets as inputs of Joins ARROW-15524 - [Python] Make joins able to receive Tables as inputs ARROW-15525 - [Python] Make joins able to output a Table as result. 
ARROW-15526 - [Python] Support for Dataset.join ARROW-15527 - [Python] Make Joins able to execute the join operation ARROW-15532 - [C++] Fix unused warning for StringClassifyDoc ARROW-15542 - [GLib][Parquet] Add GParquet*Metadata ARROW-15550 - [C++] Add optional debug memory checks ARROW-15551 - [C++][FlightRPC] Update gRPC TLS options detection for 1.43 ARROW-15552 - [Doc][Format] Remove erroneous mention of base64 ARROW-15556 - [Release] Add a script to update Homebrew packages ARROW-15569 - [Packaging][deb] Use gem instead of apt to install gobject-introspection gem ARROW-15570 - [CI][Nightly] Drop centos-8 R nightly job ARROW-15572 - [Java][Docs] Add Installation section to Java documentation ARROW-15573 - [Java][Doc] Document Apache Arrow memory management ARROW-15574 - [Java][Doc] Review existing documentation ARROW-15575 - [Java][Doc] Datasets Tutorial ARROW-15576 - [Java][Doc] Document VectorSchemaRoots for 2D data ARROW-15577 - [Java][Doc] Add Arrow Flight documentation ARROW-15578 - [Java][Doc] Document C Data Interface and how to interface with other languages ARROW-15579 - [C++] Add MemoryManager::CopyBuffer(const Buffer&amp;) ARROW-15594 - [C++][FlightRPC] Add Deserialize(const Buffer&amp;) to various Flight types ARROW-15595 - [Release][Ruby] Add support for MFA ARROW-15600 - [C++][FlightRPC] Add minimal Flight SQL query example ARROW-15601 - [Docs][Release] Update post release script to move stable docs + keep dev docs ARROW-15605 - [CI][R] Keep using old macos runners on our autobrew CI job ARROW-15606 - [CI][R] Add brew build that exercises the R package ARROW-15609 - [C++][Compute] Support hash_aggregate with only keys ARROW-15611 - [C++] Migrate arrow::ipc::internal::json::ArrayFromJSON to Result&lt;&gt; ARROW-15614 - [C++] Add sqrt binary scalar kernel ARROW-15617 - [Doc][C++] Document environment variables ARROW-15619 - [C++] Temporal component extraction function for extracting is_leap_year indicator ARROW-15623 - [C++][Python] Update 
developers/python.rst (console blocks + “” in archery install) ARROW-15625 - [C++] Convert underscore to hyphen in example executable names ARROW-15629 - [GLib] Add garrow_{,large_}string_array_builder_append_string_len() ARROW-15630 - [Release][MSYS2] Update reverse dependencies too ARROW-15631 - [Packaging][RPM] Add major version to libs packages ARROW-15632 - [R] Prune the bundled libarrow source ARROW-15633 - [R] Skip s3_bucket example that requires network connection ARROW-15634 - [C++][Packaging] Improve compilation speed for java-jars nighlty build for MacOS ARROW-15643 - [C++] Allow selecting subset of fields of a StructArray via cast ARROW-15650 - [MATLAB] Rename the MEX gateway function ARROW-15653 - [R][CI] Fix tests of bundled cpp source ARROW-15656 - [C++][R] Make valgrind builds slightly quicker ARROW-15657 - [C++][Java] Upgrade Apache ORC to 1.7.3 ARROW-15665 - [C++] Fix error_is_null in strptime with invalid inputs ARROW-15665 - [C++] Add error handling option to StrptimeOptions ARROW-15670 - [C++/Python/Packaging] Update conda pinnings and enable GCS on Windows ARROW-15672 - [C++] Enable CSV writer to control the field delimiter ARROW-15673 - [R] Error gracefully if DuckDB isn’t installed ARROW-15680 - [C++] Temporal floor/ceil/round should accept week_starts_monday when rounding to multiple of week ARROW-15682 - [CI] Github starting to migrate “windows-latest” tag from windows 2019 to windows 2022 ARROW-15683 - [Website][Rust][DataFusion] Make a 7.0.0 release announcement blog ARROW-15690 - [Dev] Update GitHub Actions workflows that hardcode master as default ARROW-15692 - [Dev] Update release scripts to use default branch ARROW-15694 - [Dev] Update apache/arrow-site GitHub Actions deploy.yml website deployment workflow to support being triggered when pushing to main ARROW-15697 - [R] Add logo and meta tags to pkgdown site ARROW-15698 - [Integration] Privatized some code in tests ARROW-15701 - [R] month() should allow integer inputs ARROW-15706 - 
[C++][FlightRPC] Implement a UCX transport ARROW-15707 - [C++][FlightRPC] Make Flight tests more resuable across transports ARROW-15708 - [R][CI] skip snappy encoded parquets on clang sanitizer ARROW-15709 - [C++] Compilation of ARROW_ENGINE fails if doing an “inline” build ARROW-15709 - [C++] Revert change ARROW-15709 - [C++] Compilation of ARROW_ENGINE fails if doing an “inline” build ARROW-15712 - [R] Add a type method for Expression objects ARROW-15714 - [C++][Gandiva] Increase the protobuf recursion limit in gandiva protobuf parser ARROW-15717 - [Docs] Add hash_one to the documentation ARROW-15721 - [Docs][FlightRPC] Add Flight/Flight SQL to subprojects ARROW-15722 - [Java] Improve error message for nested types with incorrect children ARROW-15726 - [C++] If a projected_schema is not supplied but a bound projection expression is then we should use that to infer the projected_schema ARROW-15739 - [C++] Bump xsimd to latest version ARROW-15740 - [C++][Compute] Benchmark element wise min/max ARROW-15741 - [Doc][Format] Clarify thread-safety of C stream interface ARROW-15742 - [Go] Implement ‘bitmap_neon’ with Arm64 GoLang Assembly ARROW-15744 - [Gandiva][C++] Add NEGATIVE function for interval types ARROW-15749 - [Ruby] Add support for #values of Month Interval Type ARROW-15750 - [Ruby] Add support for #raw_records of Month Interval Type ARROW-15755 - [Java] Support Java 17 ARROW-15763 - [C++] Improve CSV writer performance ARROW-15766 - [R] Implement bindings for lubridate::duration() ARROW-15769 - [C++] Generate less arithmetic kernels ARROW-15775 - [R] Clean up as.* methods to use build_expr() ARROW-15776 - [Python] Expose IpcReadOptions ARROW-15777 - [Python][Flight] Allow passing IpcReadOptions to FlightCallOptions ARROW-15781 - [Python] Release GIL in ensure_complete_metadata ARROW-15782 - [C++] Fix Findre2Alt.cmake to check RE2_ROOT variable first ARROW-15788 - [C++][FlightRPC] Prepare benchmark for alternative transports ARROW-15789 - [C++] Update 
OpenTelemetry to v1.2.0 ARROW-15795 - [Java] Add a getter for the timeZone in timestamp with timezone vectors ARROW-15796 - [Python] Pickling ParquetFileFragment shouldn’t fetch metadata ARROW-15799 - [R] Update as.Date() to support an origin different from epoch ARROW-15800 - [R] Implement bindings for lubridate::as_date() and lubridate::as_datetime() ARROW-15801 - [R] Implement bindings for lubridate date-time helpers ARROW-15802 - [R] bindings for lubridate::make_datetime() and lubridate::make_date() ARROW-15810 - [CI][Nightly] Check R related image strictly ARROW-15814 - [R][DOCS] Improve documentation for cast() ARROW-15817 - [R] Use TableSourceNode instead of InMemoryDataset ARROW-15818 - [R] Implement initial Substrait consumer in the R bindings ARROW-15820 - [C++][Doc] Add table_source to streaming_execution.rst &amp; clarify parameter name ARROW-15821 - [JS] Fix paths to sourcemaps in directories ARROW-15823 - [C++][Python] Add a method to convert a Table to a RecordBatchReader ARROW-15824 - [Python] Make pyarrow.parquet a package ARROW-15827 - [R] Improve UX of write_dataset(…, max_rows_per_group) ARROW-15831 - [Java] Upgrade Flight dependencies ARROW-15841 - [R] Implement SafeCallIntoR to safely call the R API from another thread ARROW-15844 - [Release][Packaging] Use ASCII format for detached sign ARROW-15846 - [Format] Clarify presence of struct validity bitmap ARROW-15850 - [C++] Engine substrait headers missing from install ARROW-15854 - [C++] Refine CSV writer code ARROW-15860 - [Python] Document RecordBatchReader ARROW-15864 - [Java][Docs] Update Arrow nightly Maven releases documentation ARROW-15866 - [Packaging][Ubuntu] Drop support for Ubuntu 21.04 ARROW-15870 - [Python] Start to raise deprecation warnings for use_legacy_dataset=True in parquet.read_table ARROW-15871 - [Python] Start raising deprecation warnings for ParquetDataset keywords that won’t be supported with the new API ARROW-15873 - [CI] Migrate from Ubuntu 21.04 to 22.04 ARROW-15875 
- [R] Expose ReadMetadata for input streams ARROW-15882 - [Python][CI] Ensure we are running hypothesis tests in the nightly hypothesis build ARROW-15885 - [Ruby] Add support for #values of DayTime Interval Type ARROW-15886 - [Ruby] Add support for #raw_records of DayTimeInterval type ARROW-15890 - [CI][Python] Use venv instead of virtualenv ARROW-15896 - [Python][C++] Add errno detail for filesystem “file not found” errors ARROW-15900 - [C++] Support Substrait reading of a Feather-format local file ARROW-15902 - [Website] Add new committers: Raphael Taylor-Davies, Wang Xudong, Yijie Shen, Kun Liu ARROW-15916 - [Packaging][RPM] Add support for CentOS Stream 8 ARROW-15917 - [Java][Docs] Document how to use Flight artifacts ARROW-15918 - [Ruby][{day:, millisecond:}, …] ) ARROW-15919 - [C++] Add function not commutative with timestamps &amp; duration maths ARROW-15921 - [Format][FlightRPC][C++][Java] Clarify interpretation of FlightEndpoint.locations ARROW-15923 - [Packaging][Linux] Enable GCS support ARROW-15924 - [Ruby] Add support for #values of MonthDayNanoInterval type ARROW-15925 - [Ruby] Add support for #raw_records of MonthDayNanoInterval type ARROW-15931 - [Website] Add explicit Apache LICENSE.txt and NOTICE.txt files to apache/arrow-site repository ARROW-15932 - [C++][FlightRPC] Add more tests to the common Flight suite ARROW-15934 - [Python] Expose write_batch_size in python ARROW-15935 - [Ruby] Add test for Arrow::DictionaryArray#values ARROW-15939 - [Python] Add pickle support for JSON options classes ARROW-15940 - [Gandiva][C++] Add NEGATIVE function for decimal data type ARROW-15941 - [C++] Allow overriding the number of IO threads with an environment variable ARROW-15944 - [Docs][C++] Document dependencies for building on Arch Linux ARROW-15947 - [R] rename_with s3 method for arrow_dplyr_query ARROW-15950 - [Go] Lift BitSetRunReader to internal/bitutils package ARROW-15952 - [C++] Document Visitors and finish Scalar::Accept ARROW-15955 - 
[Packaging][RPM] Add missing json-devel to CentOS Stream 8 build image ARROW-15956 - [Java] Consolidate Flight integration testing code ARROW-15963 - [Go][Parquet] simplify ReaderAtSeeker interface ARROW-15968 - [C++] Update AsyncGenerator semantics to emit a terminal item only after all outstanding futures have completed ARROW-15972 - [Java][Doc] Add Getting Started section ARROW-15974 - [C++] Migrate flight/types.h header definitions to use Result&lt;&gt; ARROW-15975 - [C++] Document type traits and inline visitors ARROW-15976 - [C++] Clean up commenting on execution plan example ARROW-15979 - [C++][Doc] Expose more functions of parquet::WriterProperties in doc ARROW-15984 - [C++] Change RecordBatchReader API to use Result&lt;&gt; ARROW-15989 - [R] rbind &amp; cbind for Table &amp; RecordBatch ARROW-15994 - [C++] Back out taskify changes ARROW-15995 - [GO] Improve ‘sum_float64_neon’ performance ARROW-15998 - [Docs][CI] Use sphinx-design tabs instead of sphinx-tabs ARROW-15999 - [Python] Turn deadlines off for the test using hypothesis ARROW-16007 - [R] grepl bindings return FALSE for NA inputs ARROW-16011 - [R] CI jobs should fail if lintr picked up issues ARROW-16014 - [C++] Create more benchmarks for measuring expression evaluation overhead ARROW-16026 - [C++] Add support for the serial executor to expose an async generator as an iterable ARROW-16032 - [C++] Migrate FlightClient API to Result&lt;&gt; ARROW-16033 - [C++] Pass schema to consuming sink node ARROW-16038 - [R] different behavior from dplyr when mutate’s .keep option is set ARROW-16042 - [GO] Fix header file preprocessor issues ARROW-16044 - [Julia] Remove from apache/arrow ARROW-16046 - [Docs][FlightRPC][Python] Ensure Flight Python API is documented ARROW-16049 - [C++][FlightRPC] Fix Flight SQL’s ColumnMetadata constructor visibility ARROW-16053 - [C++][FlightRPC] Fix flaky test TestAuthHandler.FailUnauthenticatedCalls ARROW-16055 - [C++][Gandiva] Skip unnecessary work during cache hit when using 
object code cache ARROW-16057 - [Python] Address docstrings for RecordBatch class, methods, attributes and constructor ARROW-16058 - [Python] Address docstrings for Table class, methods, attributes and constructor ARROW-16059 - [Python] Address docstrings for Tensor class ARROW-16061 - [R][CI] Speed up windows 3.6 builds ARROW-16062 - [Python] Move libarrow_python include definitions to its own file ARROW-16064 - [Java][C++][FlightRPC] Add missing column metadata for type name on FlightSQL ARROW-16065 - [FlightRPC][Docs] Improve Flight documentation ARROW-16068 - [C++][FlightRPC] Migrate remaining flight API to use Result&lt;&gt; ARROW-16069 - [C++][FlightRPC] Refactor out gRPC error code handling ARROW-16073 - [R] clean-up date time unit testing once tzdb is available on Windows ARROW-16074 - [Docs] Document joins ARROW-16079 - [Python] Address docstrings in Parquet schema and metadata ARROW-16082 - [Flight][Go] Allow specifying a net.Listener ARROW-16098 - [JS] Don’t return null in table and recordbatch iterators ARROW-16102 - [C++] Add support for building with system gRPC and bundled GCS ARROW-16104 - [Packaging] Add support for Ubuntu 22.04 ARROW-16105 - [C++][Gandiva] Add support for LLVM 14 ARROW-16109 - [Python] Add dataset mark to test in order to avoid failure ARROW-16114 - [Docs][Python] Document Parquet FileMetaData ARROW-16117 - [JS] Improve decode UTF8 performance ARROW-16120 - [Python] ParquetDataset deprecation: change Deprecation to FutureWarnings ARROW-16121 - [Python] Deprecate the (common_)metadata(_path) attributes of ParquetDataset ARROW-16122 - [Python] Change use_legacy_dataset default and deprecate no-longer supported keywords in parquet.write_to_dataset ARROW-16128 - [C++][FlightRPC] Fix Flight SQL static build on Windows ARROW-16132 - [Packaging][deb][CUDA] Relax libcuda1 dependency ARROW-16154 - [R] Errors which pass through handle_csv_read_error() and handle_parquet_io_error() need better error tracing ARROW-16156 - [R] Clarify warning 
message for features not turned on in .onAttach() ARROW-16158 - [C++][R] Rename ARROW_ENGINE to ARROW_SUBSTRAIT ARROW-16166 - [C++][Compute] Utilities for assembling join output ARROW-16167 - [JS] refactor get and set visitors ARROW-16173 - [C++] Add benchmarks for temporal functions/kernels ARROW-16176 - [Release][C#] Use .NET 6.0 on Ubuntu 22.04 ARROW-16186 - [C++][GANDIVA] Add alias and tests for decimal, quarter, xor, etc… ARROW-16187 - [Go][Parquet] Properly utilize BufferedStream and buffer size when reading ARROW-16192 - [Go] Remove deprecated aliases for v8 ARROW-16193 - [Go] Replace CPU discovery package with golang.org/x/sys/cpu module ARROW-16198 - [CI][Packaging][Python] Update VCPKG version ARROW-16201 - [R] SafeCallIntoR on 3.4 ARROW-16203 - [Release] Remove all old artifacts on release ARROW-16204 - [C++][Dataset] Default error existing_data_behaviour for writing dataset ignores a single file ARROW-16208 - [JS] Upgrade deps ARROW-16210 - [JS] Implement tableFromJSON and support struct vector in vectorFromArray ARROW-16214 - [GLib][Parquet] Add GParquetFileMetadata ARROW-16229 - [CI] Temporary remove turbodbc tests from nightly tests ARROW-16232 - [C++] Include OpenTelemetry in LICENSE.txt ARROW-16240 - [Python] Support row_group_size/chunk_size keyword in pq.write_to_dataset with use_legacy_dataset=False ARROW-16242 - [Go] xerrors.Errorf and xerrors.Is are deprecated, fix linting ARROW-16245 - [GLib][Parquet] Add GParquetRowGroupMetadata ARROW-16247 - [GLib] Add GArrowGCSFileSystem ARROW-16250 - [GLib][Parquet] Add GParquetColumnChunkMetadata ARROW-16251 - [GLib][Parquet] Add GParquetStatistics and its family ARROW-16252 - [CI][Archery] Highlight number of failed builds on nightly reports ARROW-16256 - [Docs] Document which format version is supported ARROW-16257 - [R] Break-up as_date and as_datetime into individual functions ARROW-16264 - [C++][CI] Valgrind timeout in arrow-compute-hash-join-node-test ARROW-16276 - [R] Release News ARROW-16277 - 
[Python] No builds for macOS arm64. ARROW-16280 - [C++] Avoid copying shared_ptr in Expression::type() ARROW-16282 - [CI] [C#] Verifiy release on c-sharp has been failing since upgrading ubuntu to 22.04 ARROW-16283 - [Go] Cleanup panics in new Buffered Reader ARROW-16284 - [Python][Packaging] Use delocate-fuse to create universal2 wheels ARROW-16291 - [Java]: Support JSE17 for Java Cookbooks ARROW-16292 - [Java][Doc] Upgrade java documentation for JSE17/JSE18 ARROW-16294 - [C++] Improve performance of parquet readahead ARROW-16296 - [GLib] Add missing casts for GArrowRoundMode ARROW-16303 - [C++] Check EINTR in file IO ARROW-16308 - [CI] Upgrade windows runner version as windows-2016 is deprecated. ARROW-16314 - [Python][CI] Skip running cython tests in windows verification builds ARROW-16325 - [R] Add task for R package with gcc12 ARROW-16327 - [Java][CI]: Add support for Java 17 CI process ARROW-16334 - [Archery][CI] Use build links on nightly report emails instead of branch link ARROW-16338 - [CI] Update azure windows image as vs2017-win2016 is retired ARROW-16347 - [Release] Escape backtick in verification script ARROW-16349 - [Release][Packaging][RPM] Remove ed25519 keys from KEYS ARROW-16350 - [Dev][Archery] Add missing newline in error message comment ARROW-16352 - [GLib] Fix wrong enums.h install location ARROW-16354 - [Packaging][RPM] Update artifacts pattern list ARROW-16355 - [Dev] Update verify-release-candidate.sh to compile cpp in parallel ARROW-16373 - [Docs][CI] Small improvements to CI documentation ARROW-16387 - [C++] Add -Wshorten-64-to-32 to list of CHECKIN warnings tested by clang ARROW-16390 - [C++] Dataset initialization could segfault if called simultaneously ARROW-16408 - [C++] Add support for DATE type in SQLite FlightSQL example ARROW-16411 - [Website] Migrate to Matomo from Google Analitics ARROW-16412 - [Java] Updated README to reference compilation docs ARROW-16416 - [C++] Support cast-function in Substrait ARROW-16428 - [Release] Add 
prefix to ENV variables ARROW-16445 - [R] [Doc] Add a short summary for the Installing the Arrow package on Linux article ARROW-16455 - [CI] [Packaging] Anaconda storage size exceeded for linux-ppc64le","headline":"Apache Arrow 8.0.0 Release","image":"https://arrow.apache.org/img/arrow-logo_horizontal_black-txt_white-bg.png","mainEntityOfPage":{"@type":"WebPage","@id":"https://arrow.apache.org/release/8.0.0.html"},"publisher":{"@type":"Organization","logo":{"@type":"ImageObject","url":"https://arrow.apache.org/img/logo.png"}},"url":"https://arrow.apache.org/release/8.0.0.html"}</script>
<!-- End Jekyll SEO tag -->
<!-- favicons -->
<link rel="icon" type="image/png" sizes="16x16" href="/img/favicon-16x16.png" id="light1">
<link rel="icon" type="image/png" sizes="32x32" href="/img/favicon-32x32.png" id="light2">
<link rel="apple-touch-icon" type="image/png" sizes="180x180" href="/img/apple-touch-icon.png" id="light3">
<link rel="apple-touch-icon" type="image/png" sizes="120x120" href="/img/apple-touch-icon-120x120.png" id="light4">
<link rel="apple-touch-icon" type="image/png" sizes="76x76" href="/img/apple-touch-icon-76x76.png" id="light5">
<link rel="apple-touch-icon" type="image/png" sizes="60x60" href="/img/apple-touch-icon-60x60.png" id="light6">
<!-- dark mode favicons -->
<link rel="icon" type="image/png" sizes="16x16" href="/img/favicon-16x16-dark.png" id="dark1">
<link rel="icon" type="image/png" sizes="32x32" href="/img/favicon-32x32-dark.png" id="dark2">
<link rel="apple-touch-icon" type="image/png" sizes="180x180" href="/img/apple-touch-icon-dark.png" id="dark3">
<link rel="apple-touch-icon" type="image/png" sizes="120x120" href="/img/apple-touch-icon-120x120-dark.png" id="dark4">
<link rel="apple-touch-icon" type="image/png" sizes="76x76" href="/img/apple-touch-icon-76x76-dark.png" id="dark5">
<link rel="apple-touch-icon" type="image/png" sizes="60x60" href="/img/apple-touch-icon-60x60-dark.png" id="dark6">
<script>
// Switch to the dark-mode favicons if prefers-color-scheme: dark
// Look up the links once, while they are all still in the document. After
// remove() has run, querySelector() would return null for the detached set.
light1 = document.querySelector('link#light1');
light2 = document.querySelector('link#light2');
light3 = document.querySelector('link#light3');
light4 = document.querySelector('link#light4');
light5 = document.querySelector('link#light5');
light6 = document.querySelector('link#light6');
dark1 = document.querySelector('link#dark1');
dark2 = document.querySelector('link#dark2');
dark3 = document.querySelector('link#dark3');
dark4 = document.querySelector('link#dark4');
dark5 = document.querySelector('link#dark5');
dark6 = document.querySelector('link#dark6');
function onUpdate() {
if (matcher.matches) {
light1.remove();
light2.remove();
light3.remove();
light4.remove();
light5.remove();
light6.remove();
document.head.append(dark1);
document.head.append(dark2);
document.head.append(dark3);
document.head.append(dark4);
document.head.append(dark5);
document.head.append(dark6);
} else {
dark1.remove();
dark2.remove();
dark3.remove();
dark4.remove();
dark5.remove();
dark6.remove();
document.head.append(light1);
document.head.append(light2);
document.head.append(light3);
document.head.append(light4);
document.head.append(light5);
document.head.append(light6);
}
}
matcher = window.matchMedia('(prefers-color-scheme: dark)');
matcher.addListener(onUpdate);
onUpdate();
</script>
<link rel="stylesheet" href="//fonts.googleapis.com/css?family=Lato:300,300italic,400,400italic,700,700italic,900">
<link href="/css/main.css" rel="stylesheet">
<link href="/css/syntax.css" rel="stylesheet">
<script src="/javascript/main.js"></script>
<!-- Matomo -->
<script>
var _paq = window._paq = window._paq || [];
/* tracker methods like "setCustomDimension" should be called before "trackPageView" */
/* We explicitly disable cookie tracking to avoid privacy issues */
_paq.push(['disableCookies']);
_paq.push(['trackPageView']);
_paq.push(['enableLinkTracking']);
(function() {
var u="https://analytics.apache.org/";
_paq.push(['setTrackerUrl', u+'matomo.php']);
_paq.push(['setSiteId', '20']);
var d=document, g=d.createElement('script'), s=d.getElementsByTagName('script')[0];
g.async=true; g.src=u+'matomo.js'; s.parentNode.insertBefore(g,s);
})();
</script>
<!-- End Matomo Code -->
</head>
<body class="wrap">
<header>
<nav class="navbar navbar-expand-md navbar-dark bg-dark">
<a class="navbar-brand no-padding" href="/"><img src="/img/arrow-inverse-300px.png" height="40" alt="Apache Arrow"/></a>
<button class="navbar-toggler ml-auto" type="button" data-toggle="collapse" data-target="#arrow-navbar" aria-controls="arrow-navbar" aria-expanded="false" aria-label="Toggle navigation">
<span class="navbar-toggler-icon"></span>
</button>
<!-- Collect the nav links, forms, and other content for toggling -->
<div class="collapse navbar-collapse justify-content-end" id="arrow-navbar">
<ul class="nav navbar-nav">
<li class="nav-item"><a class="nav-link" href="/overview/" role="button" aria-haspopup="true" aria-expanded="false">Overview</a></li>
<li class="nav-item"><a class="nav-link" href="/faq/" role="button" aria-haspopup="true" aria-expanded="false">FAQ</a></li>
<li class="nav-item"><a class="nav-link" href="/blog" role="button" aria-haspopup="true" aria-expanded="false">Blog</a></li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownGetArrow" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Get Arrow
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownGetArrow">
<a class="dropdown-item" href="/install/">Install</a>
<a class="dropdown-item" href="/release/">Releases</a>
<a class="dropdown-item" href="https://github.com/apache/arrow">Source Code</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownDocumentation" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Documentation
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownDocumentation">
<a class="dropdown-item" href="/docs">Project Docs</a>
<a class="dropdown-item" href="/docs/format/Columnar.html">Format</a>
<hr/>
<a class="dropdown-item" href="/docs/c_glib">C GLib</a>
<a class="dropdown-item" href="/docs/cpp">C++</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/csharp/README.md">C#</a>
<a class="dropdown-item" href="https://godoc.org/github.com/apache/arrow/go/arrow">Go</a>
<a class="dropdown-item" href="/docs/java">Java</a>
<a class="dropdown-item" href="/docs/js">JavaScript</a>
<a class="dropdown-item" href="/julia/">Julia</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/matlab/README.md">MATLAB</a>
<a class="dropdown-item" href="/docs/python">Python</a>
<a class="dropdown-item" href="/docs/r">R</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/blob/main/ruby/README.md">Ruby</a>
<a class="dropdown-item" href="https://docs.rs/arrow/latest">Rust</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownSubprojects" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Subprojects
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownSubprojects">
<a class="dropdown-item" href="/adbc">ADBC</a>
<a class="dropdown-item" href="/docs/format/Flight.html">Arrow Flight</a>
<a class="dropdown-item" href="/docs/format/FlightSql.html">Arrow Flight SQL</a>
<a class="dropdown-item" href="https://datafusion.apache.org">DataFusion</a>
<a class="dropdown-item" href="/nanoarrow">nanoarrow</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownCommunity" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
Community
</a>
<div class="dropdown-menu" aria-labelledby="navbarDropdownCommunity">
<a class="dropdown-item" href="/community/">Communication</a>
<a class="dropdown-item" href="/docs/developers/index.html">Contributing</a>
<a class="dropdown-item" href="https://github.com/apache/arrow/issues">Issue Tracker</a>
<a class="dropdown-item" href="/committers/">Governance</a>
<a class="dropdown-item" href="/use_cases/">Use Cases</a>
<a class="dropdown-item" href="/powered_by/">Powered By</a>
<a class="dropdown-item" href="/visual_identity/">Visual Identity</a>
<a class="dropdown-item" href="/security/">Security</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/policies/conduct.html">Code of Conduct</a>
</div>
</li>
<li class="nav-item dropdown">
<a class="nav-link dropdown-toggle" href="#"
id="navbarDropdownASF" role="button" data-toggle="dropdown"
aria-haspopup="true" aria-expanded="false">
ASF Links
</a>
<div class="dropdown-menu dropdown-menu-right" aria-labelledby="navbarDropdownASF">
<a class="dropdown-item" href="https://www.apache.org/">ASF Website</a>
<a class="dropdown-item" href="https://www.apache.org/licenses/">License</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/sponsorship.html">Donate</a>
<a class="dropdown-item" href="https://www.apache.org/foundation/thanks.html">Thanks</a>
<a class="dropdown-item" href="https://www.apache.org/security/">Security</a>
</div>
</li>
</ul>
</div><!-- /.navbar-collapse -->
</nav>
</header>
<div class="container p-4 pt-5">
<main role="main" class="pb-5">
<h1 id="apache-arrow-800-6-may-2022">Apache Arrow 8.0.0 (6 May 2022)</h1>
<p>This is a major release covering more than 3 months of development.</p>
<h2 id="download">Download</h2>
<ul>
<li><a href="https://www.apache.org/dyn/closer.lua/arrow/arrow-8.0.0/"><strong>Source Artifacts</strong></a></li>
<li><strong>Binary Artifacts</strong>
<ul>
<li><a href="https://apache.jfrog.io/artifactory/arrow/almalinux/">For AlmaLinux</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/amazon-linux/">For Amazon Linux</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/centos/">For CentOS</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/nuget/">For C#</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/debian/">For Debian</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/python/8.0.0/">For Python</a></li>
<li><a href="https://apache.jfrog.io/artifactory/arrow/ubuntu/">For Ubuntu</a></li>
</ul>
</li>
<li><a href="https://github.com/apache/arrow/releases/tag/apache-arrow-8.0.0">Git tag</a></li>
</ul>
<h2 id="contributors">Contributors</h2>
<p>This release includes 636 commits from 127 distinct contributors.</p>
<div class="language-console highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="gp">$</span><span class="w"> </span>git shortlog <span class="nt">-sn</span> apache-arrow-7.0.0..apache-arrow-8.0.0
<span class="go"> 43 Antoine Pitrou
40 David Li
39 Sutou Kouhei
36 Alenka Frim
29 Weston Pace
25 Rok
22 Dragoș Moldovan-Grünfeld
22 Joris Van den Bossche
20 Jonathan Keane
20 Krisztián Szűcs
19 Raúl Cumplido
17 Will Jones
16 Dewey Dunnington
13 david dali susanibar arce
12 Matthew Topol
10 Nic Crane
10 Yibo Cai
10 Vibhatha Abeykoon
9 Dominik Moritz
9 SHIMA Tatsuya
8 Neal Richardson
8 Sanjiban Sengupta
8 okadakk
8 Jacob Wujciak-Jens
7 Tobias Zagorni
7 Alessandro Molina
7 William Butler
6 Dhruv Vats
5 Yuqi Gu
5 Johnnathan
4 Anthony Louis
4 Projjal Chanda
4 Fiona La
4 Micah Kornfield
4 Bryan Cutler
3 Yaron Gvili
3 Ian Alexander Joiner
3 Vinicius Roque
3 ViniciusSouzaRoque
3 emkornfield
3 Chenxi Li
2 Jorge C. Leitao
2 Jose Almeida
2 assignUser
2 Uwe L. Korn
2 Todd Farmer
2 Tim Van Wassenhove
2 Rob Saccoccio
2 Alvin Chunga
2 Stephanie Hazlitt
2 Jeroen van Straten
2 Kyle Barron
2 Sasha Krassovsky
2 Eduardo Ponce
2 Salonijain27
2 liyafan82
2 JabariBooker
2 michalursa
2 Matthijs Brobbel
2 Danielle Navarro
2 Maya Anderson
2 Even Rouault
2 Joost Hoozemans
1 zhixingheyi-tian
1 SAm Albers
1 Alex McRae
1 Alfred Mountfield
1 Ali Amiri
1 Andrew Higgins
1 Andrew Strelsky
1 Andrii Oriekhov
1 Andy Teucher
1 Benjamin Kietzman
1 Benson Muite
1 Bob Matcuk
1 Chris Casola
1 Colin Jermain
1 Corentin
1 Dongjoon Hyun
1 Dooyoung-Hwang
1 Edward Visel
1 Elena Henderson
1 Frederic Branczyk
1 Haowei Yu
1 Hongze Zhang
1 Isaac Brodsky
1 James Duong
1 Johan Mattsson
1 João Pedro
1 Kazuaki Ishizaki
1 Kevin Gurney
1 Kian-Meng Ang
1 Laurent Goujon
1 Leo Di Donato
1 Li Jin
1 Min-Young Wu
1 MrMallIronmaker
1 Nate Clark
1 Nicolas Moreau
1 Nivia
1 Pradeep Garigipati
1 Rafael Telles
1 Romain Francois
1 Ryan Blue
1 Aakash Kumar
1 Sam Albers
1 Siddhant Rao
1 Stephan T. Lavavej
1 Thomas J. Fan
1 Tom Drabas
1 William Ayd
1 Xinyu Zeng
1 Yue Ni
1 Zac Farrell
1 bkmgit
1 burntcustard
1 fabiencelier
1 hankvyang
1 hzuo
1 iurysalino
1 jvictorhuguenin
1 jwijffels
1 karldw
1 marcbernot
1 okadak
1 ptaylor
1 shegden
</span></code></pre></div></div>
<h2 id="patch-committers">Patch Committers</h2>
<p>The following Apache committers merged contributed patches to the repository.</p>
<div class="language-console highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="gp">$</span><span class="w"> </span>git shortlog <span class="nt">-csn</span> apache-arrow-7.0.0..apache-arrow-8.0.0
<span class="go"> 105 David Li
102 Antoine Pitrou
84 Sutou Kouhei
72 Jonathan Keane
47 Krisztián Szűcs
46 Joris Van den Bossche
30 Weston Pace
25 Nic Crane
23 Pindikura Ravindra
22 Matthew Topol
16 Alessandro Molina
15 Yibo Cai
15 Neal Richardson
14 Dominik Moritz
5 Bryan Cutler
5 GitHub
3 Ian Cook
2 liyafan82
2 Micah Kornfield
1 Matt Topol
1 Li Jin
1 Chao Sun
</span></code></pre></div></div>
<h2 id="changelog">Changelog</h2>
<h2 id="apache-arrow-800-2022-05-06">Apache Arrow 8.0.0 (2022-05-06)</h2>
<h3 id="bug-fixes">Bug Fixes</h3>
<ul>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5248">ARROW-5248</a> - [Python] support zoneinfo / dateutil timezones</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7350">ARROW-7350</a> - [Python] Decode parquet statistics as scalars</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9664">ARROW-9664</a> - [Python] Array/ChunkedArray.to_pandas do not support types_mapper keyword</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11415">ARROW-11415</a> - [R] map_batches wouldn’t accept a dataset as an argument</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-13168">ARROW-13168</a> - [C++][R] Enable runtime timezone database for Windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-13594">ARROW-13594</a> - [CI] Enable nightly turbodbc builds again</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-13922">ARROW-13922</a> - [Python] Fix ParquetDataset throw error when len(path_or_paths) == 1</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14047">ARROW-14047</a> - [C++] [Parquet] FileReader returns inconsistent results on repeat reads</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14215">ARROW-14215</a> - [R][CI] Conda Windows builds failing due to space in library name</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14256">ARROW-14256</a> - [CI][Package] Re-enable disabled conda packaging builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14389">ARROW-14389</a> - [C++][Gandiva] Fix performance bug with LIKE expressions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14638">ARROW-14638</a> - [C++][R] Unknown C compiler / ccache on Arch Linux</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14647">ARROW-14647</a> - [JS] fix bignumToNumber for negative numbers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14665">ARROW-14665</a> - [JAVA] fix JdbcToArrow ResultSet iteration bug</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14708">ARROW-14708</a> - [C++] Adding missing abseil dependencies to enable static flight build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14908">ARROW-14908</a> - [C++][R] Dataset hash join segfaults on Windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14911">ARROW-14911</a> - [C++] arrow-compute-hash-join-node-test failed</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14960">ARROW-14960</a> - [C++] Add exception to Arrow style guide based on changes in Google style guide that we are not adopting</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15018">ARROW-15018</a> - [Python] DataFrame Index modified during Feather serialization round trip</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15092">ARROW-15092</a> - [R] Support create_package_with_all_dependencies() on non-linux systems</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15253">ARROW-15253</a> - [Python] Error in to_pandas for empty dataframe with index with extension type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15272">ARROW-15272</a> - [Java] Add cleanup failures as suppressed in ArrowVectorIterator#create</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15291">ARROW-15291</a> - [C++][Python] Segfault in StructArray.to_numpy and to_pandas if it contains an ExtensionArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15312">ARROW-15312</a> - [R][C++] filtering a Parquet dataset with is.na() misses some rows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15401">ARROW-15401</a> - [Python] Gdb tests are failing on windows and apple M1</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15426">ARROW-15426</a> - [C++][Gandiva] Update InExpressionNode validation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15444">ARROW-15444</a> - [C++] Compilation with GCC 7.5 fails in aggregate_basic.cc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15465">ARROW-15465</a> - [Python] Add some missing parquet marks in dataset tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15502">ARROW-15502</a> - [Java] Detect exceptional footer size in Arrow file reader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15504">ARROW-15504</a> - [Python][CI] Ensure that optional components are tested</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15509">ARROW-15509</a> - [Go][Parquet] Parquet cmds crash</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15511">ARROW-15511</a> - [Python][C++] Remove reference management in numpy indexer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15514">ARROW-15514</a> - [C++][Gandiva] Add flag to enable Gandiva Object Code</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15520">ARROW-15520</a> - [C++] Qualify <code class="language-plaintext highlighter-rouge">arrow_vendored::date::format()</code> for C++20 compatibility</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15533">ARROW-15533</a> - [C++] Check ARROW_WITH_OPENTELEMETRY in CI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15539">ARROW-15539</a> - [Archery] Add ARROW_JEMALLOC to build options</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15541">ARROW-15541</a> - [Python] Bump the minimum Cython version</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15544">ARROW-15544</a> - [Go][Parquet] Fix origin schema base64 decoding</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15546">ARROW-15546</a> - [FlightRPC][C++] Remove quotes from cookie header</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15555">ARROW-15555</a> - [Release] Don’t push the release tag since it already exists</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15580">ARROW-15580</a> - [Python] Make pytz an actual optional dependency of PyArrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15593">ARROW-15593</a> - [C++] Make after-fork ThreadPool reinitialization thread-safe</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15598">ARROW-15598</a> - [C++][Gandiva] Avoid using hardcoded raw pointer addresses in generated code</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15599">ARROW-15599</a> - [R] Convert a column as a sub-second timestamp from CSV file with the <code class="language-plaintext highlighter-rouge">T</code> col type option</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15603">ARROW-15603</a> - [C++] Remove unused variables</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15604">ARROW-15604</a> - [C++][CI] Sporadic ThreadSanitizer failure with OpenTracing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15607">ARROW-15607</a> - [C++] Fix incorrect CPUID flag for AVX detection</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15626">ARROW-15626</a> - [GLib] Fix a bug that GArrowGIOInputStream may not read enough data</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15627">ARROW-15627</a> - [R] Fix union dataset unify schema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15648">ARROW-15648</a> - [C++][Gandiva] Fix the size of the Gandiva cache</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15651">ARROW-15651</a> - [JavaScript] Structs incorrectly initialise null values</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15652">ARROW-15652</a> - [C++] Fix GDB pretty-printing from inside parquet namespace</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15659">ARROW-15659</a> - [R] strptime should return NA (not error) with format mismatch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15664">ARROW-15664</a> - [C++] parquet reader Segfaults with illegal SIMD instruction</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15667">ARROW-15667</a> - [R] Test development build with ARROW_BUILD_STATIC=OFF</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15674">ARROW-15674</a> - [C++][Gandiva] Like function doesn’t properly handle patterns with special characters in certain cases</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15677">ARROW-15677</a> - [R] calling invalidate() method on ArrowObjects causes subsequent segfault</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15679">ARROW-15679</a> - [R] count should return an ungrouped dataframe</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15688">ARROW-15688</a> - [C++] add_checked doesn’t error out on duration overflow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15699">ARROW-15699</a> - [C++][Gandiva] Fix implementation of left and right func…</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15700">ARROW-15700</a> - [C++] Compilation error on Ubuntu 18.04</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15705">ARROW-15705</a> - [JavaScript] Allowing appending null on children in a StructBuilder</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15710">ARROW-15710</a> - [C++] Intermittent deadlock on arrow-threading-utility-test</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15715">ARROW-15715</a> - [Go] ipc trim value offsets on arrays</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15718">ARROW-15718</a> - [C++] Increase thread limit to work around thread issues</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15720">ARROW-15720</a> - [CI] Fix nightly dask build (skip failing test due to wrong usage of Array.to_pandas)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15723">ARROW-15723</a> - [Python] Segfault orcWriter write table</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15727">ARROW-15727</a> - [Python] Allow converting lists of MonthDayNano intervals to Pandas</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15728">ARROW-15728</a> - [Python] Reduce entropy for zstd test_ipc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15743">ARROW-15743</a> - [R] <code class="language-plaintext highlighter-rouge">skip</code> not connected up to <code class="language-plaintext highlighter-rouge">skip_rows</code> on open_dataset despite error messages indicating otherwise</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15746">ARROW-15746</a> - [Release][Java] Add missing artifacts to tasks.yml</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15748">ARROW-15748</a> - [Python] Round temporal options default unit is <code class="language-plaintext highlighter-rouge">day</code> but documented as <code class="language-plaintext highlighter-rouge">second</code>. Follow-up</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15748">ARROW-15748</a> - [Python] Round temporal options default unit is <code class="language-plaintext highlighter-rouge">day</code> but documented as <code class="language-plaintext highlighter-rouge">second</code></li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15757">ARROW-15757</a> - [Python] Missing bindings for existing_data_behavior makes it impossible to maintain old behavior</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15760">ARROW-15760</a> - [C++] Avoid hard dependency on git in cmake (download tarballs from github instead)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15770">ARROW-15770</a> - [CI] Not all python tests are running on CI jobs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15772">ARROW-15772</a> - [Go][Flight] Server Basic Auth Middleware/Interceptor wrongly base64 decode</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15778">ARROW-15778</a> - [Java] set native endian to schema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15783">ARROW-15783</a> - [Python] Initialize static pandas data on write</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15784">ARROW-15784</a> - [C++][Python] Removing flag enable_parallel_column_conversion which is no longer used</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15791">ARROW-15791</a> - [Go] ipc FileWriter negative WaitGroup counter</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15794">ARROW-15794</a> - [CI][Crossbow] Nightly builds failing due to error in types_mapper</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15815">ARROW-15815</a> - [C++][Parquet] Fix undefined behaviour on invalid input</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15819">ARROW-15819</a> - [R] R docs version switcher doesn’t work on Safari on macOS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15830">ARROW-15830</a> - [C++] Ensure target directory exists before running Substrait generation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15837">ARROW-15837</a> - [C++][Python] Clarify documentation for ListArray::offsets()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15845">ARROW-15845</a> - [Python][Packaging] Fix macOS wheel builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15847">ARROW-15847</a> - [Python][CI] Ensure we have a nightly Python build with parquet encryption disabled</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15847">ARROW-15847</a> - [Python] Building with Parquet but without Parquet encryption fails</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15848">ARROW-15848</a> - [Gandiva][C++] Fix function istrue and is not true</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15851">ARROW-15851</a> - [C++] Enable RE2 when building with gRPC</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15852">ARROW-15852</a> - [JS] Fix error thrown by <code class="language-plaintext highlighter-rouge">Table.getByteLength()</code></li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15857">ARROW-15857</a> - [R] rhub/fedora-clang-devel fails to install ‘sass’ (rmarkdown dependency)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15863">ARROW-15863</a> - [Packaging][C++][Python] Fix conda package builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15869">ARROW-15869</a> - [C++] Fix Valgrind failure (uninitialized value)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15888">ARROW-15888</a> - [Doc][Python] Modernize development instructions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15892">ARROW-15892</a> - [C++] Dataset APIs require s3:ListBucket Permissions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15895">ARROW-15895</a> - [R] R docs version switcher disappears &amp; reappears with back button on Chrome</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15898">ARROW-15898</a> - [CI] Clean old conda nightlies more thoroughly</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15905">ARROW-15905</a> - [Python][C++] Fix CMake warning when building PyArrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15928">ARROW-15928</a> - [C++] Fix crashes and implement chunked array support for replace_with_mask function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15929">ARROW-15929</a> - [R] io_thread_count is actually the CPU thread count</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15946">ARROW-15946</a> - [Go] Fix memory leak in pqarrow.NewColumnWriter when writing nested data</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15949">ARROW-15949</a> - [Python] Do not require Parquet encryption when Parquet is disabled</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15951">ARROW-15951</a> - [CI][Python] “Test wheel” step successful despite test error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15954">ARROW-15954</a> - [Java] Remove mac native netty kqueue dependency after upgrade</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15960">ARROW-15960</a> - [C++] Fix crash on adaptive int builder edge cases</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15962">ARROW-15962</a> - [C++][Gandiva] Fix unhex errors return</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15965">ARROW-15965</a> - [C++][Python] Add Scalar constructor of RoundToMultipleOptions to Python</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15970">ARROW-15970</a> - [R][CI] Re-enable DuckDB dev tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15973">ARROW-15973</a> - [CI] Split nightly reports into three: Tests, Packaging, Release</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15982">ARROW-15982</a> - [Python] parquet.read_table fails to parse home directory path</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15985">ARROW-15985</a> - [CI] Fix conda-clean failure when there are no files to delete</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15987">ARROW-15987</a> - [C++][FlightRPC] Work around arrow-flight-test crash on AppVeyor</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15993">ARROW-15993</a> - [CI] Add sphinx-tabs to ci/conda_env_sphinx.txt</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16012">ARROW-16012</a> - [C++] Retry S3 request in tests when Minio not fully initialized</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16013">ARROW-16013</a> - [C++][Python] Signed overflow when using negative stride in NumPyStridedConverter</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16016">ARROW-16016</a> - [C++] Fix recursive ccache invocation error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16019">ARROW-16019</a> - [C++] Minimize chances of Minio connect errors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16021">ARROW-16021</a> - [C++] arrow-compute-hash-join-node-test timeout on MinGW</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16025">ARROW-16025</a> - [Python][C++] Fix segmentation fault when closing ORCFileWriter</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16031">ARROW-16031</a> - [C++][Gandiva] Fix Soundex errors generate</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16035">ARROW-16035</a> - [Java] Handling empty JDBC ResultSet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16043">ARROW-16043</a> - [C++][Filesystem][S3] Add missing empty content for creating directory</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16048">ARROW-16048</a> - [Python] Avoid exposing null buffer address to the Python buffer protocol</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16051">ARROW-16051</a> - [Gandiva][C++] Fix datediff regression build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16052">ARROW-16052</a> - [R] undefined global function %&gt;%</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16060">ARROW-16060</a> - [C++] subtract_checked support for timestamp(“s”) and date32</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16071">ARROW-16071</a> - [R] More undefined global functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16078">ARROW-16078</a> - Upgrade bundled zlib to 1.2.12</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16099">ARROW-16099</a> - [JS] RecordBatches that are compressed should throw an error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16107">ARROW-16107</a> - [Dev][Archery] Fix archery crossbow latest-prefix query</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16110">ARROW-16110</a> - [C++] GcsFileSystem::Make ignores IOContext</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16113">ARROW-16113</a> - [Python] Partitioning.dictionaries in case of a subset of fields are dictionary encoded</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16131">ARROW-16131</a> - [C++] support saving and retrieving custom metadata in batches for IPC file</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16134">ARROW-16134</a> - [C++][Gandiva] Fix Concat_WS errors return</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16136">ARROW-16136</a> - [Gandiva][C++] Fix problem of the huge size of AddMappings function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16139">ARROW-16139</a> - [Python] Crash in tests/test_dataset.py::test_write_dataset_s3</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16143">ARROW-16143</a> - [Java] Upgrade jackson dependencies CVE-2020-36518</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16146">ARROW-16146</a> - [C++] arrow-gcsfs-test is timing out</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16148">ARROW-16148</a> - [C++] TPC-H generator cleanup</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16152">ARROW-16152</a> - [C++] Fix segfault with unknown functions in Substrait</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16159">ARROW-16159</a> - [C++][Python] Allow FileSystem::DeleteDirContents to succeed if the directory is missing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16162">ARROW-16162</a> - [C++][FlightRPC] Fix Flight build on Ubuntu 18.04</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16163">ARROW-16163</a> - [Go] IPC FileReader leaks memory when used with ZSTD compression</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16165">ARROW-16165</a> - [CI][Archery] Fix nightly query to crossbow to send reports</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16169">ARROW-16169</a> - [C++][Gandiva] Fix empty string case in convert_fromUTF8_binary()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16181">ARROW-16181</a> - [CI][C++] Valgrind failure in TPCH node tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16182">ARROW-16182</a> - [C++][CI] TPCH node tests timeout under ThreadSanitizer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16185">ARROW-16185</a> - [C++] Fix uninitialized output data in strptime kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16197">ARROW-16197</a> - [Docs] Fix broken link</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16205">ARROW-16205</a> - [C++][FlightRPC] Don’t use constexpr std::initializer_list</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16209">ARROW-16209</a> - [JS] Support setting arbitrary symbols on Tables</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16215">ARROW-16215</a> - [C++][FlightRPC] Fix segfault in Flight test on Windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16216">ARROW-16216</a> - [Python][FlightRPC] Fix test_flight.py when Flight is not available</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16219">ARROW-16219</a> - [CI] Fix git config to prevent SCM tools failure</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16223">ARROW-16223</a> - [C++] Fix decimal reduce scale rounding</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16225">ARROW-16225</a> - [C++][Parquet] Fix length of encryption AAD random byte generation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16233">ARROW-16233</a> - [Python][Packaging] test_zoneinfo_tzinfo_to_string fails with zoneinfo._common.ZoneInfoNotFoundError on packaging wheels on Windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16235">ARROW-16235</a> - [C++] Fix build failure, compiler warnings from MinGW</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16236">ARROW-16236</a> - [Python][Packaging] test_s3fs_limited_permissions_create_bucket fails with Permission denied on macOS wheel builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16237">ARROW-16237</a> - [Docs] Apache Impala is no longer incubating</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16238">ARROW-16238</a> - [C++] Fix nullptr dereference when pre-buffering IPC reads</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16261">ARROW-16261</a> - [C++] Fix DeleteDirContents on HDFS with missing_dir_ok=True</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16262">ARROW-16262</a> - [CI][Integration] Skip failing tests from kartothek integration</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16278">ARROW-16278</a> - [CI] Fix git installation failure on brew</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16293">ARROW-16293</a> - [CI][GLib] Make tests stable</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16295">ARROW-16295</a> - [CI][Release] Use windows-2019 for verify-rc-source-windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16300">ARROW-16300</a> - pc.sort_indices with nonexistent column throws malloc error</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16301">ARROW-16301</a> - [C#][CI] Fix docker configuration for .NET 6</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16305">ARROW-16305</a> - [C++] Missed reference to ARROW_ENGINE during the rename</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16306">ARROW-16306</a> - [CI] Fix Nightly verify rc on ubuntu</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16307">ARROW-16307</a> - [Java][FlightRPC] Skip flaky test TestDoExchange.testClientCancel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16311">ARROW-16311</a> - [Java] Do not return table_schema column when it’s not requested</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16312">ARROW-16312</a> - [C++][CI] Install tzdata in the windows verification builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16313">ARROW-16313</a> - [R] Ensure assume_timezone options are always initialized</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16332">ARROW-16332</a> - [Release][Java] Add artifacts uploaded verification</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16336">ARROW-16336</a> - [Python] ParquetDataset - Hide internal (common_)metadata related warnings from the user</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16374">ARROW-16374</a> - [R][C++] skip another snappy test during sanitizer runs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16375">ARROW-16375</a> - [R][CI] Pin test-r-devdocs on Windows to R 4.1</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16393">ARROW-16393</a> - [Java] Update option spec to accept value for query, catalog, schema and table</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16413">ARROW-16413</a> - [Python] Certain dataset APIs hang with a python filesystem</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16417">ARROW-16417</a> - [C++][Python] Segfault in test_exec_plan.py / test_joins</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16419">ARROW-16419</a> - [Python] Properly wait for ExecPlan to finish</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16442">ARROW-16442</a> - [Python][Dataset] Fix fragments of ORC Dataset to use FileFragment class</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-2115">PARQUET-2115</a> - [C++] Parquet dictionary bit widths are limited to 32 bits</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-2118">PARQUET-2118</a> - [C++] Don’t assume standard pointers</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-2119">PARQUET-2119</a> - [C++] Fix DeltaBitPackDecoder fuzzer found issue</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-2123">PARQUET-2123</a> - [C++] Fix invalid memory access in ScanFileContents</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-2124">PARQUET-2124</a> - [C++] Remove Parquet Dictionary DCHECK</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-2130">PARQUET-2130</a> - Fix crash in debug with non-standard key names.</li>
<li><a href="https://issues.apache.org/jira/browse/PARQUET-2131">PARQUET-2131</a> - Number values decoded DCHECKs should be exceptions</li>
</ul>
<h3 id="new-features-and-improvements">New Features and Improvements</h3>
<ul>
<li><a href="https://issues.apache.org/jira/browse/ARROW-1888">ARROW-1888</a> - [C++] Implement Struct Casts</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3016">ARROW-3016</a> - [Docs][C++] Memory profiling with perf</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3039">ARROW-3039</a> - [Go] Add support for DictionaryArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-3998">ARROW-3998</a> - [C++] Add TPC-H Generator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5107">ARROW-5107</a> - [Release] Validate non-RC source and binary artifacts</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-5598">ARROW-5598</a> - [Go] Rename array.Array{,Approx}Equal to array.{,Approx}Equal</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-6780">ARROW-6780</a> - [C++][Parquet] Support DurationType in writing/reading parquet (written as int64)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7174">ARROW-7174</a> - [Python] Expose parquet dictionary_pagesize_limit write parameter</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7272">ARROW-7272</a> - [C++][Java][Dataset] JNI bridge between RecordBatch and VectorSchemaRoot</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-7914">ARROW-7914</a> - [Python] Allow pandas datetime as index for feather</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9235">ARROW-9235</a> - [R] Support for <code class="language-plaintext highlighter-rouge">connection</code> class when reading and writing files</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9378">ARROW-9378</a> - [Go] Support unsigned dictionary indices</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-9947">ARROW-9947</a> - [Python] High-level Python API for Parquet encryption of files.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10643">ARROW-10643</a> - [Python] Pandas&lt;-&gt;pyarrow roundtrip failing to recreate index for empty dataframe</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-10924">ARROW-10924</a> - [C++] Validate temporal data in ValidateArrayFull</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11071">ARROW-11071</a> - [R][CI] Use processx to set up minio and flight servers in tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11259">ARROW-11259</a> - [Python] Allow to create field reference to nested field</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-11989">ARROW-11989</a> - [C++][Python] Improve ChunkedArray’s complexity for the access of elements</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12515">ARROW-12515</a> - [Dev][Wiki][Release] Fix and update Windows RC verify script</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12516">ARROW-12516</a> - [C++][Gandiva] Implements castINTERVALDAY(varchar) and castINTERVALYEAR(varchar) functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12659">ARROW-12659</a> - [C++] Support is_valid as a guarantee</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-12743">ARROW-12743</a> - [R] Add DESCRIPTION fields for dev dependencies</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-13185">ARROW-13185</a> - [MATLAB] Create a single MEX gateway function which delegates to specific C++ functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-13204">ARROW-13204</a> - [MATLAB] Update documentation for the MATLAB Interface to reflect latest CMake build system changes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-13231">ARROW-13231</a> - [Doc] Add ORC documentation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-13260">ARROW-13260</a> - [Doc] Host different released versions of the documentation + version switcher</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-13337">ARROW-13337</a> - [R] Define Math group generics</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-13375">ARROW-13375</a> - [C++][Gandiva] Implement POSITIVE and NEGATIVE Hive functions on Gandiva</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-13409">ARROW-13409</a> - [C++][FlightRPC] Expose server shutdown with deadline</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-13564">ARROW-13564</a> - [Dev] Check individual commit messages for “Co-authored-by:” tags when integrating a pull request</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-13616">ARROW-13616</a> - [R] Cheat Sheet Structure</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-13683">ARROW-13683</a> - [R] Test Windows UCRT R</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-13703">ARROW-13703</a> - [Python][R] Add bindings for new dataset writing options</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-13993">ARROW-13993</a> - [C++][Compute] Add hash_one aggregate function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14075">ARROW-14075</a> - [C++][CI] Add an appveyor CI job for VisualStudio 2019, non-conda</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14091">ARROW-14091</a> - [C++] add(date, duration) -&gt; timestamp kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14093">ARROW-14093</a> - [C++] subtract(date, date) -&gt; duration kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14094">ARROW-14094</a> - [C++] add(timestamp, duration) -&gt; timestamp kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14095">ARROW-14095</a> - [C++] subtract(timestamp, duration) -&gt; timestamp kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14096">ARROW-14096</a> - [C++] add(time, duration) -&gt; time kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14097">ARROW-14097</a> - [C++] subtract(time, duration) -&gt; time kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14098">ARROW-14098</a> - [C++] subtract(time, time) -&gt; duration kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14099">ARROW-14099</a> - [C++] add(duration, duration) -&gt; duration kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14100">ARROW-14100</a> - [C++] subtract(duration, duration) -&gt; duration kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14101">ARROW-14101</a> - [C++] multiply(duration, integer) -&gt; duration kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14102">ARROW-14102</a> - [C++] divide(duration, integer) -&gt; duration kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14153">ARROW-14153</a> - [C++][Dataset] Add support for batch_size in the ORC Scanner</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14168">ARROW-14168</a> - [R] Warn only once about arrow function differences</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14169">ARROW-14169</a> - [R] altrep for factors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14199">ARROW-14199</a> - [R] bindings for format (where possible)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14266">ARROW-14266</a> - [R] Use WriteNode to write queries</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14279">ARROW-14279</a> - [Docs] Initial attempt at describing structure of PyArrow library</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14292">ARROW-14292</a> - [C++][Python] Join foundation for Tables</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14293">ARROW-14293</a> - [Python] Basic Join functionality in PyArrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14322">ARROW-14322</a> - [Doc] Add Python doc on how to connect Python to other languages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14333">ARROW-14333</a> - [C++][Compute] Add binary and LargeStringType tests to comparison kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14339">ARROW-14339</a> - [Docs] Add canonical url to the pkgdown (R) docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14442">ARROW-14442</a> - [R] fix behaviour when converting timestamps with “” as tzone</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14444">ARROW-14444</a> - [C++] Implement task-based model into the executable-pipelines.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14498">ARROW-14498</a> - [Docs] Make it possible to regenerate older docs with additional patch(es)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14502">ARROW-14502</a> - [C++][Gandiva] Add test DayOfMonth</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14506">ARROW-14506</a> - [C++] Conda support for google-cloud-cpp</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14553">ARROW-14553</a> - [Doc] Java Cookbook Release 1</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14579">ARROW-14579</a> - [Documentation] Document the CI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14591">ARROW-14591</a> - [R] Implement bindings for lubridate duration types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14612">ARROW-14612</a> - [C++] Support for filename-based partitioning</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14631">ARROW-14631</a> - [C++][Gandiva] Implement Nextday Function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14651">ARROW-14651</a> - [Release][Archery] Add support for retrying download</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14672">ARROW-14672</a> - [Docs] Document how to exchange data between Python and Java</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14679">ARROW-14679</a> - [R][C++] Handle suffix argument in joins</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14698">ARROW-14698</a> - [Docs][FlightRPC] Add API docs for Flight SQL</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14702">ARROW-14702</a> - [Doc][C++] Document threading model</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14745">ARROW-14745</a> - [R] Enable true duckdb streaming</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14776">ARROW-14776</a> - [Website] Don’t include squashed commits in merge commit message</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14798">ARROW-14798</a> - [C++][Python][R] Add container window to PrettyPrintOptions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14808">ARROW-14808</a> - [R] Implement bindings for <code class="language-plaintext highlighter-rouge">lubridate::date()</code></li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14810">ARROW-14810</a> - [R] Implement bindings for lubridate’s <code class="language-plaintext highlighter-rouge">date_decimal()</code> and <code class="language-plaintext highlighter-rouge">decimal_date()</code></li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14815">ARROW-14815</a> - [R] bindings for <code class="language-plaintext highlighter-rouge">lubridate::semester()</code></li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14817">ARROW-14817</a> - [R] Implement bindings for <code class="language-plaintext highlighter-rouge">lubridate::tz()</code></li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14823">ARROW-14823</a> - [R] Implement bindings for lubridate::leap_year</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14824">ARROW-14824</a> - [R] Implement bindings for lubridate::epiyear()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14825">ARROW-14825</a> - [C++] Temporal component extraction function for extracting epiyear</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14826">ARROW-14826</a> - [R] Implement bindings for <code class="language-plaintext highlighter-rouge">lubridate::dst()</code></li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14827">ARROW-14827</a> - [C++] Temporal component extraction function for extracting dst indicator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14893">ARROW-14893</a> - [C++] Allow creating GCS filesystem from URI</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14927">ARROW-14927</a> - [CI] Upgrade Fedora 33 to Fedora 35</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14942">ARROW-14942</a> - [R] Bindings for lubridate’s dpicoseconds, dnanoseconds, dseconds, dmilliseconds, dmicroseconds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14943">ARROW-14943</a> - [R] Bindings for lubridate’s ddays, dhours, dminutes, dmonths, dweeks, dyears</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14944">ARROW-14944</a> - [R] Implement <code class="language-plaintext highlighter-rouge">lubridate::make_difftime()</code></li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14963">ARROW-14963</a> - [Doc] Add copy button extension to code-blocks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14993">ARROW-14993</a> - [C++] Benchmark CSV writer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-14997">ARROW-14997</a> - [Python][Doc] Add thread_count functions to API docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15013">ARROW-15013</a> - [R] Expose concatenate at the R level</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15015">ARROW-15015</a> - [R] Test / CI flag for ensuring all tests are run?</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15020">ARROW-15020</a> - [R] Add bindings for new dataset writing options</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15040">ARROW-15040</a> - [R] Enable write_csv_arrow to take a Dataset or arrow_dplyr_query as input</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15061">ARROW-15061</a> - [C++] Add logging for kernel functions and exec plan nodes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15062">ARROW-15062</a> - [C++] Add memory information to current spans</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15064">ARROW-15064</a> - [C++] Vectorize CheckStringHasNoStructuralChars in CSV writer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15066">ARROW-15066</a> - [C++] Enable use of non-bundled OpenTelemetry</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15067">ARROW-15067</a> - [C++] Add tracing spans to the scanner</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15080">ARROW-15080</a> - [Python][C++] Enable tuples conversion to interval</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15089">ARROW-15089</a> - [C++][Compute] Implement kernel to lookup a MapArray item for a given key</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15098">ARROW-15098</a> - [R] Add binding for <code class="language-plaintext highlighter-rouge">lubridate::duration()</code> and/or <code class="language-plaintext highlighter-rouge">as.difftime()</code></li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15118">ARROW-15118</a> - [C++] Avoid bitmap buffer if all inputs are all valid for Scalar Kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15152">ARROW-15152</a> - [C++][Compute] Implement hash_list aggregate function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15156">ARROW-15156</a> - [Doc] Implement Tutorials for the Java Documentation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15157">ARROW-15157</a> - [Doc] New Contributors Guide v2</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15163">ARROW-15163</a> - [R] lubridate functions for 8.0.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15167">ARROW-15167</a> - [R] Improve efficiency of decimal casting</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15168">ARROW-15168</a> - [R] Add S3 generics to create main Arrow objects</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15178">ARROW-15178</a> - [Java][Docs] Java Tutorial: Developer Docs for Java</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15180">ARROW-15180</a> - Document how to add JNI bindings for C++ features</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15183">ARROW-15183</a> - [Python][Docs] Add Missing Dataset Write Options</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15192">ARROW-15192</a> - [Java] Allow use of Jackson 2.12 and higher</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15195">ARROW-15195</a> - [MATLAB] Enable GitHub Actions CI for MATLAB Interface on macOS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15197">ARROW-15197</a> - [C++] UTF-8 string repeat kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15212">ARROW-15212</a> - [C++] Handle suffix argument in joins</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15215">ARROW-15215</a> - [C++] Consolidate kernel data-copy utilities between replace_with_mask, case_when, coalesce, choose, fill_null_forward, fill_null_backward</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15223">ARROW-15223</a> - [C++] Implement Not Between ternary kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15238">ARROW-15238</a> - [C++] ARROW_ENGINE module with substrait consumer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15239">ARROW-15239</a> - [C++][Compute] Adding Bloom filter implementation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15258">ARROW-15258</a> - [C++] Easy options to create a source node from a table</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15262">ARROW-15262</a> - [C++] Create a ToTable sink node</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15281">ARROW-15281</a> - [C++] Implement ability to retrieve fragment filename</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15282">ARROW-15282</a> - [C++][FlightRPC] Split data methods from the underlying transport</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15294">ARROW-15294</a> - [R] Remove arrow-without-arrow and other Solaris hacks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15296">ARROW-15296</a> - [CI][GO] Add Go staticcheck linting to CI lint job</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15299">ARROW-15299</a> - [R] investigate {remotes} dependencies “soft” vs TRUE</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15313">ARROW-15313</a> - [C++][Java][FlightRPC] Implement type info method to flight-sql</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15314">ARROW-15314</a> - [C++][Java][FlightRPC] Add missing metadata on Arrow schemas returned by Flight SQL</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15321">ARROW-15321</a> - [Dev][Python] Also numpydoc-validate Cython-generated methods</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15346">ARROW-15346</a> - [Doc][Guide] Arrow codebase - minor corrections</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15347">ARROW-15347</a> - [Doc][Guide] Update testing section in new contributors guide</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15348">ARROW-15348</a> - [Doc][Guide] Lifecycle of a PR - minor corrections</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15349">ARROW-15349</a> - [Doc][Guide] Existing Contributors page - update</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15350">ARROW-15350</a> - [Doc][Guide] Add styling and linters info section</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15351">ARROW-15351</a> - [Doc][Guide] Additional tutorial for R bindings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15352">ARROW-15352</a> - [Doc][Guide] R package and make clean</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15353">ARROW-15353</a> - [Doc][Guide] Intro into CI topic and link to the existing docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15364">ARROW-15364</a> - [Python] Update filesystem entry in read docstrings to reflect current behaviour</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15366">ARROW-15366</a> - [Docs] Automate incrementing of package version for R and non-R version switchers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15367">ARROW-15367</a> - [Python] Improve Classes and Methods Docstrings for 8.0.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15369">ARROW-15369</a> - [Doc] Tweak example to use the new support for str pointers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15374">ARROW-15374</a> - [C++][FlightRPC] Add support for MemoryManager in data methods</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15389">ARROW-15389</a> - [C++][Dev] Improve Array preview in GDB plugin</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15400">ARROW-15400</a> - [Go][CI] Exercise builds on arm machines</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15410">ARROW-15410</a> - [C++][Datasets] Improve memory usage of datasets API when scanning parquet</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15418">ARROW-15418</a> - [Go][Flight] Update gRPC version, hide impl details</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15425">ARROW-15425</a> - [C++] Add delta dictionaries in file format to integration tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15428">ARROW-15428</a> - [Python] Address docstrings in Parquet classes and functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15429">ARROW-15429</a> - [Python] Address docstrings for ChunkedArray class, methods, attributes and constructor</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15431">ARROW-15431</a> - [Python] Address docstrings in Schema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15432">ARROW-15432</a> - [Python] Address CSV docstrings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15440">ARROW-15440</a> - [Go] Implement ‘unpack_bool’ with Arm64 GoLang Assembly</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15450">ARROW-15450</a> - [Python][Wheel] Flight test receives SIGKILL during macOS tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15462">ARROW-15462</a> - [GLib] Add GArrow{Month,DayTime,MonthDayNano}Interval{Scalar,Array,ArrayBuilder}</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15468">ARROW-15468</a> - [R][CI] A crossbow job that tests against DuckDB’s dev branch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15471">ARROW-15471</a> - [R] ExtensionType support in R</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15472">ARROW-15472</a> - [Website] Add Flight SQL blog post</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15477">ARROW-15477</a> - [C++][Python] Allow to create (FixedSize/Large)ListArray from arrays and type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15480">ARROW-15480</a> - [R] Expand on schema/colnames mismatch error messages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15483">ARROW-15483</a> - [Release] Revamp the verification scripts</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15487">ARROW-15487</a> - [FlightRPC][C++][GLib][Python][R] Implement FlightClient::Close</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15489">ARROW-15489</a> - [R] Expand RecordBatchReader usability</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15491">ARROW-15491</a> - [Website] Rotate PMC chair for 2022</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15497">ARROW-15497</a> - [C++][Homebrew] Use Clang Tools 12</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15501">ARROW-15501</a> - [Java] Support validating decimal vectors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15503">ARROW-15503</a> - [GLib][Release] Avoid deprecation warning</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15505">ARROW-15505</a> - [C++][Compute] Support null type in product aggregation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15506">ARROW-15506</a> - [C++][Compute] Support Null type in hash_sum/hash_product/hash_mean</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15510">ARROW-15510</a> - [C++][FlightRPC] Add CUDA memory manager support to benchmark</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15515">ARROW-15515</a> - [C++] Update ExecPlan example code and documentation with new options</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15517">ARROW-15517</a> - [R] Use WriteNode in write_dataset()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15523">ARROW-15523</a> - [Python] Support for Datasets as inputs of Joins</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15524">ARROW-15524</a> - [Python] Make joins able to receive Tables as inputs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15525">ARROW-15525</a> - [Python] Make joins able to output a Table as result.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15526">ARROW-15526</a> - [Python] Support for Dataset.join</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15527">ARROW-15527</a> - [Python] Make Joins able to execute the join operation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15532">ARROW-15532</a> - [C++] Fix unused warning for StringClassifyDoc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15542">ARROW-15542</a> - [GLib][Parquet] Add GParquet*Metadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15550">ARROW-15550</a> - [C++] Add optional debug memory checks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15551">ARROW-15551</a> - [C++][FlightRPC] Update gRPC TLS options detection for 1.43</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15552">ARROW-15552</a> - [Doc][Format] Remove erroneous mention of base64</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15556">ARROW-15556</a> - [Release] Add a script to update Homebrew packages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15569">ARROW-15569</a> - [Packaging][deb] Use gem instead of apt to install gobject-introspection gem</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15570">ARROW-15570</a> - [CI][Nightly] Drop centos-8 R nightly job</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15572">ARROW-15572</a> - [Java][Docs] Add Installation section to Java documentation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15573">ARROW-15573</a> - [Java][Doc] Document Apache Arrow memory management</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15574">ARROW-15574</a> - [Java][Doc] Review existing documentation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15575">ARROW-15575</a> - [Java][Doc] Datasets Tutorial</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15576">ARROW-15576</a> - [Java][Doc] Document VectorSchemaRoots for 2D data</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15577">ARROW-15577</a> - [Java][Doc] Add Arrow Flight documentation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15578">ARROW-15578</a> - [Java][Doc] Document C Data Interface and how to interface with other languages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15579">ARROW-15579</a> - [C++] Add MemoryManager::CopyBuffer(const Buffer&amp;)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15594">ARROW-15594</a> - [C++][FlightRPC] Add Deserialize(const Buffer&amp;) to various Flight types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15595">ARROW-15595</a> - [Release][Ruby] Add support for MFA</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15600">ARROW-15600</a> - [C++][FlightRPC] Add minimal Flight SQL query example</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15601">ARROW-15601</a> - [Docs][Release] Update post release script to move stable docs + keep dev docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15605">ARROW-15605</a> - [CI][R] Keep using old macos runners on our autobrew CI job</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15606">ARROW-15606</a> - [CI][R] Add brew build that exercises the R package</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15609">ARROW-15609</a> - [C++][Compute] Support hash_aggregate with only keys</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15611">ARROW-15611</a> - [C++] Migrate arrow::ipc::internal::json::ArrayFromJSON to Result&lt;&gt;</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15614">ARROW-15614</a> - [C++] Add sqrt binary scalar kernel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15617">ARROW-15617</a> - [Doc][C++] Document environment variables</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15619">ARROW-15619</a> - [C++] Temporal component extraction function for extracting is_leap_year indicator</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15623">ARROW-15623</a> - [C++][Python] Update developers/python.rst (console blocks + “” in archery install)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15625">ARROW-15625</a> - [C++] Convert underscore to hyphen in example executable names</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15629">ARROW-15629</a> - [GLib] Add garrow_{,large_}string_array_builder_append_string_len()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15630">ARROW-15630</a> - [Release][MSYS2] Update reverse dependencies too</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15631">ARROW-15631</a> - [Packaging][RPM] Add major version to libs packages</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15632">ARROW-15632</a> - [R] Prune the bundled libarrow source</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15633">ARROW-15633</a> - [R] Skip s3_bucket example that requires network connection</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15634">ARROW-15634</a> - [C++][Packaging] Improve compilation speed for java-jars nightly build for MacOS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15643">ARROW-15643</a> - [C++] Allow selecting subset of fields of a StructArray via cast</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15650">ARROW-15650</a> - [MATLAB] Rename the MEX gateway function</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15653">ARROW-15653</a> - [R][CI] Fix tests of bundled cpp source</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15656">ARROW-15656</a> - [C++][R] Make valgrind builds slightly quicker</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15657">ARROW-15657</a> - [C++][Java] Upgrade Apache ORC to 1.7.3</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15665">ARROW-15665</a> - [C++] Fix error_is_null in strptime with invalid inputs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15665">ARROW-15665</a> - [C++] Add error handling option to StrptimeOptions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15670">ARROW-15670</a> - [C++/Python/Packaging] Update conda pinnings and enable GCS on Windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15672">ARROW-15672</a> - [C++] Enable CSV writer to control the field delimiter</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15673">ARROW-15673</a> - [R] Error gracefully if DuckDB isn’t installed</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15680">ARROW-15680</a> - [C++] Temporal floor/ceil/round should accept week_starts_monday when rounding to multiple of week</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15682">ARROW-15682</a> - [CI] Github starting to migrate “windows-latest” tag from windows 2019 to windows 2022</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15683">ARROW-15683</a> - [Website][Rust][DataFusion] Make a 7.0.0 release announcement blog</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15690">ARROW-15690</a> - [Dev] Update GitHub Actions workflows that hardcode master as default</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15692">ARROW-15692</a> - [Dev] Update release scripts to use default branch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15694">ARROW-15694</a> - [Dev] Update apache/arrow-site GitHub Actions deploy.yml website deployment workflow to support being triggered when pushing to main</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15697">ARROW-15697</a> - [R] Add logo and meta tags to pkgdown site</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15698">ARROW-15698</a> - [Integration] Privatized some code in tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15701">ARROW-15701</a> - [R] month() should allow integer inputs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15706">ARROW-15706</a> - [C++][FlightRPC] Implement a UCX transport</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15707">ARROW-15707</a> - [C++][FlightRPC] Make Flight tests more reusable across transports</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15708">ARROW-15708</a> - [R][CI] skip snappy encoded parquets on clang sanitizer</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15709">ARROW-15709</a> - [C++] Compilation of ARROW_ENGINE fails if doing an “inline” build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15709">ARROW-15709</a> - [C++] Revert change</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15709">ARROW-15709</a> - [C++] Compilation of ARROW_ENGINE fails if doing an “inline” build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15712">ARROW-15712</a> - [R] Add a <code class="language-plaintext highlighter-rouge">type</code> method for <code class="language-plaintext highlighter-rouge">Expression</code> objects</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15714">ARROW-15714</a> - [C++][Gandiva] Increase the protobuf recursion limit in gandiva protobuf parser</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15717">ARROW-15717</a> - [Docs] Add hash_one to the documentation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15721">ARROW-15721</a> - [Docs][FlightRPC] Add Flight/Flight SQL to subprojects</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15722">ARROW-15722</a> - [Java] Improve error message for nested types with incorrect children</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15726">ARROW-15726</a> - [C++] If a projected_schema is not supplied but a bound projection expression is then we should use that to infer the projected_schema</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15739">ARROW-15739</a> - [C++] Bump xsimd to latest version</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15740">ARROW-15740</a> - [C++][Compute] Benchmark element wise min/max</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15741">ARROW-15741</a> - [Doc][Format] Clarify thread-safety of C stream interface</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15742">ARROW-15742</a> - [Go] Implement ‘bitmap_neon’ with Arm64 GoLang Assembly</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15744">ARROW-15744</a> - [Gandiva][C++] Add NEGATIVE function for interval types</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15749">ARROW-15749</a> - [Ruby] Add support for #values of Month Interval Type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15750">ARROW-15750</a> - [Ruby] Add support for #raw_records of Month Interval Type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15755">ARROW-15755</a> - [Java] Support Java 17</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15763">ARROW-15763</a> - [C++] Improve CSV writer performance</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15766">ARROW-15766</a> - [R] Implement bindings for lubridate::duration()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15769">ARROW-15769</a> - [C++] Generate fewer arithmetic kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15775">ARROW-15775</a> - [R] Clean up as.* methods to use build_expr()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15776">ARROW-15776</a> - [Python] Expose IpcReadOptions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15777">ARROW-15777</a> - [Python][Flight] Allow passing IpcReadOptions to FlightCallOptions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15781">ARROW-15781</a> - [Python] Release GIL in ensure_complete_metadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15782">ARROW-15782</a> - [C++] Fix Findre2Alt.cmake to check RE2_ROOT variable first</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15788">ARROW-15788</a> - [C++][FlightRPC] Prepare benchmark for alternative transports</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15789">ARROW-15789</a> - [C++] Update OpenTelemetry to v1.2.0</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15795">ARROW-15795</a> - [Java] Add a getter for the timeZone in timestamp with timezone vectors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15796">ARROW-15796</a> - [Python] Pickling ParquetFileFragment shouldn’t fetch metadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15799">ARROW-15799</a> - [R] Update as.Date() to support an origin different from epoch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15800">ARROW-15800</a> - [R] Implement bindings for <code class="language-plaintext highlighter-rouge">lubridate::as_date()</code> and <code class="language-plaintext highlighter-rouge">lubridate::as_datetime()</code></li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15801">ARROW-15801</a> - [R] Implement bindings for lubridate date-time helpers</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15802">ARROW-15802</a> - [R] bindings for <code class="language-plaintext highlighter-rouge">lubridate::make_datetime()</code> and <code class="language-plaintext highlighter-rouge">lubridate::make_date()</code></li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15810">ARROW-15810</a> - [CI][Nightly] Check R related image strictly</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15814">ARROW-15814</a> - [R][DOCS] Improve documentation for cast()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15817">ARROW-15817</a> - [R] Use TableSourceNode instead of InMemoryDataset</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15818">ARROW-15818</a> - [R] Implement initial Substrait consumer in the R bindings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15820">ARROW-15820</a> - [C++][Doc] Add table_source to streaming_execution.rst &amp; clarify parameter name</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15821">ARROW-15821</a> - [JS] Fix paths to sourcemaps in directories</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15823">ARROW-15823</a> - [C++][Python] Add a method to convert a Table to a RecordBatchReader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15824">ARROW-15824</a> - [Python] Make pyarrow.parquet a package</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15827">ARROW-15827</a> - [R] Improve UX of write_dataset(…, max_rows_per_group)</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15831">ARROW-15831</a> - [Java] Upgrade Flight dependencies</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15841">ARROW-15841</a> - [R] Implement SafeCallIntoR to safely call the R API from another thread</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15844">ARROW-15844</a> - [Release][Packaging] Use ASCII format for detached sign</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15846">ARROW-15846</a> - [Format] Clarify presence of struct validity bitmap</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15850">ARROW-15850</a> - [C++] Engine substrait headers missing from install</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15854">ARROW-15854</a> - [C++] Refine CSV writer code</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15860">ARROW-15860</a> - [Python] Document RecordBatchReader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15864">ARROW-15864</a> - [Java][Docs] Update Arrow nightly Maven releases documentation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15866">ARROW-15866</a> - [Packaging][Ubuntu] Drop support for Ubuntu 21.04</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15870">ARROW-15870</a> - [Python] Start to raise deprecation warnings for use_legacy_dataset=True in parquet.read_table</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15871">ARROW-15871</a> - [Python] Start raising deprecation warnings for ParquetDataset keywords that won’t be supported with the new API</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15873">ARROW-15873</a> - [CI] Migrate from Ubuntu 21.04 to 22.04</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15875">ARROW-15875</a> - [R] Expose ReadMetadata for input streams</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15882">ARROW-15882</a> - [Python][CI] Ensure we are running hypothesis tests in the nightly hypothesis build</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15885">ARROW-15885</a> - [Ruby] Add support for #values of DayTime Interval Type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15886">ARROW-15886</a> - [Ruby] Add support for #raw_records of DayTimeInterval type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15890">ARROW-15890</a> - [CI][Python] Use venv instead of virtualenv</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15896">ARROW-15896</a> - [Python][C++] Add errno detail for filesystem “file not found” errors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15900">ARROW-15900</a> - [C++] Support Substrait reading of a Feather-format local file</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15902">ARROW-15902</a> - [Website] Add new committers: Raphael Taylor-Davies, Wang Xudong, Yijie Shen, Kun Liu</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15916">ARROW-15916</a> - [Packaging][RPM] Add support for CentOS Stream 8</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15917">ARROW-15917</a> - [Java][Docs] Document how to use Flight artifacts</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15918">ARROW-15918</a> - [Ruby][{day:, millisecond:}, …] )</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15919">ARROW-15919</a> - [C++] Add function not commutative with timestamps &amp; duration maths</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15921">ARROW-15921</a> - [Format][FlightRPC][C++][Java] Clarify interpretation of FlightEndpoint.locations</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15923">ARROW-15923</a> - [Packaging][Linux] Enable GCS support</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15924">ARROW-15924</a> - [Ruby] Add support for #values of MonthDayNanoInterval type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15925">ARROW-15925</a> - [Ruby] Add support for #raw_records of MonthDayNanoInterval type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15931">ARROW-15931</a> - [Website] Add explicit Apache LICENSE.txt and NOTICE.txt files to apache/arrow-site repository</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15932">ARROW-15932</a> - [C++][FlightRPC] Add more tests to the common Flight suite</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15934">ARROW-15934</a> - [Python] Expose write_batch_size in python</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15935">ARROW-15935</a> - [Ruby] Add test for Arrow::DictionaryArray#values</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15939">ARROW-15939</a> - [Python] Add pickle support for JSON options classes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15940">ARROW-15940</a> - [Gandiva][C++] Add NEGATIVE function for decimal data type</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15941">ARROW-15941</a> - [C++] Allow overriding the number of IO threads with an environment variable</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15944">ARROW-15944</a> - [Docs][C++] Document dependencies for building on Arch Linux</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15947">ARROW-15947</a> - [R] rename_with s3 method for arrow_dplyr_query</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15950">ARROW-15950</a> - [Go] Lift BitSetRunReader to internal/bitutils package</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15952">ARROW-15952</a> - [C++] Document Visitors and finish Scalar::Accept</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15955">ARROW-15955</a> - [Packaging][RPM] Add missing json-devel to CentOS Stream 8 build image</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15956">ARROW-15956</a> - [Java] Consolidate Flight integration testing code</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15963">ARROW-15963</a> - [Go][Parquet] simplify ReaderAtSeeker interface</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15968">ARROW-15968</a> - [C++] Update AsyncGenerator semantics to emit a terminal item only after all outstanding futures have completed</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15972">ARROW-15972</a> - [Java][Doc] Add Getting Started section</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15974">ARROW-15974</a> - [C++] Migrate flight/types.h header definitions to use Result&lt;&gt;</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15975">ARROW-15975</a> - [C++] Document type traits and inline visitors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15976">ARROW-15976</a> - [C++] Clean up commenting on execution plan example</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15979">ARROW-15979</a> - [C++][Doc] Expose more functions of parquet::WriterProperties in doc</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15984">ARROW-15984</a> - [C++] Change RecordBatchReader API to use Result&lt;&gt;</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15989">ARROW-15989</a> - [R] rbind &amp; cbind for Table &amp; RecordBatch</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15994">ARROW-15994</a> - [C++] Back out taskify changes</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15995">ARROW-15995</a> - [Go] Improve ‘sum_float64_neon’ performance</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15998">ARROW-15998</a> - [Docs][CI] Use sphinx-design tabs instead of sphinx-tabs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-15999">ARROW-15999</a> - [Python] Turn deadlines off for the test using hypothesis</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16007">ARROW-16007</a> - [R] grepl bindings return FALSE for NA inputs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16011">ARROW-16011</a> - [R] CI jobs should fail if lintr picked up issues</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16014">ARROW-16014</a> - [C++] Create more benchmarks for measuring expression evaluation overhead</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16026">ARROW-16026</a> - [C++] Add support for the serial executor to expose an async generator as an iterable</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16032">ARROW-16032</a> - [C++] Migrate FlightClient API to Result&lt;&gt;</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16033">ARROW-16033</a> - [C++] Pass schema to consuming sink node</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16038">ARROW-16038</a> - [R] different behavior from dplyr when mutate’s <code class="language-plaintext highlighter-rouge">.keep</code> option is set</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16042">ARROW-16042</a> - [Go] Fix header file preprocessor issues</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16044">ARROW-16044</a> - [Julia] Remove from apache/arrow</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16046">ARROW-16046</a> - [Docs][FlightRPC][Python] Ensure Flight Python API is documented</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16049">ARROW-16049</a> - [C++][FlightRPC] Fix Flight SQL’s ColumnMetadata constructor visibility</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16053">ARROW-16053</a> - [C++][FlightRPC] Fix flaky test TestAuthHandler.FailUnauthenticatedCalls</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16055">ARROW-16055</a> - [C++][Gandiva] Skip unnecessary work during cache hit when using object code cache</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16057">ARROW-16057</a> - [Python] Address docstrings for RecordBatch class, methods, attributes and constructor</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16058">ARROW-16058</a> - [Python] Address docstrings for Table class, methods, attributes and constructor</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16059">ARROW-16059</a> - [Python] Address docstrings for Tensor class</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16061">ARROW-16061</a> - [R][CI] Speed up windows 3.6 builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16062">ARROW-16062</a> - [Python] Move libarrow_python include definitions to its own file</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16064">ARROW-16064</a> - [Java][C++][FlightRPC] Add missing column metadata for type name on FlightSQL</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16065">ARROW-16065</a> - [FlightRPC][Docs] Improve Flight documentation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16068">ARROW-16068</a> - [C++][FlightRPC] Migrate remaining flight API to use Result&lt;&gt;</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16069">ARROW-16069</a> - [C++][FlightRPC] Refactor out gRPC error code handling</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16073">ARROW-16073</a> - [R] clean-up date time unit testing once tzdb is available on Windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16074">ARROW-16074</a> - [Docs] Document joins</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16079">ARROW-16079</a> - [Python] Address docstrings in Parquet schema and metadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16082">ARROW-16082</a> - [Flight][Go] Allow specifying a net.Listener</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16098">ARROW-16098</a> - [JS] Don’t return null in table and recordbatch iterators</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16102">ARROW-16102</a> - [C++] Add support for building with system gRPC and bundled GCS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16104">ARROW-16104</a> - [Packaging] Add support for Ubuntu 22.04</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16105">ARROW-16105</a> - [C++][Gandiva] Add support for LLVM 14</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16109">ARROW-16109</a> - [Python] Add dataset mark to test in order to avoid failure</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16114">ARROW-16114</a> - [Docs][Python] Document Parquet FileMetaData</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16117">ARROW-16117</a> - [JS] Improve decode UTF8 performance</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16120">ARROW-16120</a> - [Python] ParquetDataset deprecation: change Deprecation to FutureWarnings</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16121">ARROW-16121</a> - [Python] Deprecate the (common_)metadata(_path) attributes of ParquetDataset</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16122">ARROW-16122</a> - [Python] Change use_legacy_dataset default and deprecate no-longer supported keywords in parquet.write_to_dataset</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16128">ARROW-16128</a> - [C++][FlightRPC] Fix Flight SQL static build on Windows</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16132">ARROW-16132</a> - [Packaging][deb][CUDA] Relax libcuda1 dependency</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16154">ARROW-16154</a> - [R] Errors which pass through <code class="language-plaintext highlighter-rouge">handle_csv_read_error()</code> and <code class="language-plaintext highlighter-rouge">handle_parquet_io_error()</code> need better error tracing</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16156">ARROW-16156</a> - [R] Clarify warning message for features not turned on in .onAttach()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16158">ARROW-16158</a> - [C++][R] Rename ARROW_ENGINE to ARROW_SUBSTRAIT</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16166">ARROW-16166</a> - [C++][Compute] Utilities for assembling join output</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16167">ARROW-16167</a> - [JS] refactor get and set visitors</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16173">ARROW-16173</a> - [C++] Add benchmarks for temporal functions/kernels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16176">ARROW-16176</a> - [Release][C#] Use .NET 6.0 on Ubuntu 22.04</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16186">ARROW-16186</a> - [C++][Gandiva] Add alias and tests for decimal, quarter, xor, etc…</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16187">ARROW-16187</a> - [Go][Parquet] Properly utilize BufferedStream and buffer size when reading</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16192">ARROW-16192</a> - [Go] Remove deprecated aliases for v8</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16193">ARROW-16193</a> - [Go] Replace CPU discovery package with golang.org/x/sys/cpu module</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16198">ARROW-16198</a> - [CI][Packaging][Python] Update VCPKG version</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16201">ARROW-16201</a> - [R] SafeCallIntoR on 3.4</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16203">ARROW-16203</a> - [Release] Remove all old artifacts on release</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16204">ARROW-16204</a> - [C++][Dataset] Default error existing_data_behaviour for writing dataset ignores a single file</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16208">ARROW-16208</a> - [JS] Upgrade deps</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16210">ARROW-16210</a> - [JS] Implement tableFromJSON and support struct vector in vectorFromArray</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16214">ARROW-16214</a> - [GLib][Parquet] Add GParquetFileMetadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16229">ARROW-16229</a> - [CI] Temporarily remove turbodbc tests from nightly tests</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16232">ARROW-16232</a> - [C++] Include OpenTelemetry in LICENSE.txt</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16240">ARROW-16240</a> - [Python] Support row_group_size/chunk_size keyword in pq.write_to_dataset with use_legacy_dataset=False</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16242">ARROW-16242</a> - [Go] xerrors.Errorf and xerrors.Is are deprecated, fix linting</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16245">ARROW-16245</a> - [GLib][Parquet] Add GParquetRowGroupMetadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16247">ARROW-16247</a> - [GLib] Add GArrowGCSFileSystem</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16250">ARROW-16250</a> - [GLib][Parquet] Add GParquetColumnChunkMetadata</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16251">ARROW-16251</a> - [GLib][Parquet] Add GParquetStatistics and its family</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16252">ARROW-16252</a> - [CI][Archery] Highlight number of failed builds on nightly reports</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16256">ARROW-16256</a> - [Docs] Document which format version is supported</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16257">ARROW-16257</a> - [R] Break-up as_date and as_datetime into individual functions</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16264">ARROW-16264</a> - [C++][CI] Valgrind timeout in arrow-compute-hash-join-node-test</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16276">ARROW-16276</a> - [R] Release News</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16277">ARROW-16277</a> - [Python] No builds for macOS arm64.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16280">ARROW-16280</a> - [C++] Avoid copying shared_ptr in Expression::type()</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16282">ARROW-16282</a> - [CI][C#] Verify release on c-sharp has been failing since upgrading Ubuntu to 22.04</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16283">ARROW-16283</a> - [Go] Cleanup panics in new Buffered Reader</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16284">ARROW-16284</a> - [Python][Packaging] Use delocate-fuse to create universal2 wheels</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16291">ARROW-16291</a> - [Java] Support JSE17 for Java Cookbooks</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16292">ARROW-16292</a> - [Java][Doc] Upgrade java documentation for JSE17/JSE18</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16294">ARROW-16294</a> - [C++] Improve performance of parquet readahead</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16296">ARROW-16296</a> - [GLib] Add missing casts for GArrowRoundMode</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16303">ARROW-16303</a> - [C++] Check EINTR in file IO</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16308">ARROW-16308</a> - [CI] Upgrade windows runner version as windows-2016 is deprecated.</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16314">ARROW-16314</a> - [Python][CI] Skip running cython tests in windows verification builds</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16325">ARROW-16325</a> - [R] Add task for R package with gcc12</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16327">ARROW-16327</a> - [Java][CI] Add support for Java 17 CI process</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16334">ARROW-16334</a> - [Archery][CI] Use build links on nightly report emails instead of branch link</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16338">ARROW-16338</a> - [CI] Update azure windows image as vs2017-win2016 is retired</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16347">ARROW-16347</a> - [Release] Escape backtick in verification script</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16349">ARROW-16349</a> - [Release][Packaging][RPM] Remove ed25519 keys from KEYS</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16350">ARROW-16350</a> - [Dev][Archery] Add missing newline in error message comment</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16352">ARROW-16352</a> - [GLib] Fix wrong enums.h install location</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16354">ARROW-16354</a> - [Packaging][RPM] Update artifacts pattern list</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16355">ARROW-16355</a> - [Dev] Update verify-release-candidate.sh to compile cpp in parallel</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16373">ARROW-16373</a> - [Docs][CI] Small improvements to CI documentation</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16387">ARROW-16387</a> - [C++] Add -Wshorten-64-to-32 to list of CHECKIN warnings tested by clang</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16390">ARROW-16390</a> - [C++] Dataset initialization could segfault if called simultaneously</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16408">ARROW-16408</a> - [C++] Add support for DATE type in SQLite FlightSQL example</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16411">ARROW-16411</a> - [Website] Migrate to Matomo from Google Analytics</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16412">ARROW-16412</a> - [Java] Updated README to reference compilation docs</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16416">ARROW-16416</a> - [C++] Support cast-function in Substrait</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16428">ARROW-16428</a> - [Release] Add prefix to ENV variables</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16445">ARROW-16445</a> - [R] [Doc] Add a short summary for the Installing the Arrow package on Linux article</li>
<li><a href="https://issues.apache.org/jira/browse/ARROW-16455">ARROW-16455</a> - [CI] [Packaging] Anaconda storage size exceeded for linux-ppc64le</li>
</ul>
</main>
<hr/>
<footer class="footer">
<div class="row">
<div class="col-md-9">
<p>Apache Arrow, Arrow, Apache, the Apache feather logo, and the Apache Arrow project logo are either registered trademarks or trademarks of The Apache Software Foundation in the United States and other countries.</p>
<p>&copy; 2016-2024 The Apache Software Foundation</p>
</div>
<div class="col-md-3">
<a class="d-sm-none d-md-inline pr-2" href="https://www.apache.org/events/current-event.html">
<img src="https://www.apache.org/events/current-event-234x60.png"/>
</a>
</div>
</div>
</footer>
</div>
</body>
</html>