Log - HEAD - spark - Git at Google

b558422 [SPARK-48245][SQL] Fix typo in BadRecordException class doc by Vladimir Golubev · 6 hours ago master
5b965f7 [SPARK-48239][INFRA] Update the release docker image to follow what we use in Github Action jobs by Wenchen Fan · 6 hours ago
57b2077 [SPARK-48240][DOCS] Replace `Local[..]` with `"Local[...]"` in the docs by panbingkun · 2 days ago
f699f55 [SPARK-48237][BUILD] Clean up `dev/pr-deps` at the end of `test-dependencies.sh` script by panbingkun · 2 days ago
d82458f [SPARK-48205][SQL][FOLLOWUP] Add missing tags for the dataSource API by allisonwang-db · 2 days ago
5b3b8a9 [SPARK-48236][BUILD] Add `commons-lang:commons-lang:2.6` back to support legacy Hive UDF jars by Dongjoon Hyun · 2 days ago
726ef8a Revert "[SPARK-48230][BUILD] Remove unused `jodd-core`" by Dongjoon Hyun · 2 days ago
2225aa1 [SPARK-48144][SQL] Fix `canPlanAsBroadcastHashJoin` to respect shuffle join hints by fred-db · 2 days ago
a6632ff [SPARK-48143][SQL] Use lightweight exceptions for control-flow between UnivocityParser and FailureSafeParser by Vladimir Golubev · 3 days ago
5beaf85 [SPARK-47793][TEST][FOLLOWUP] Fix flaky test for Python data source exactly once by Chaoqin Li · 3 days ago
c5b6ec7 [SPARK-47441][YARN] Do not add log link for unmanaged AM in Spark UI by Yuming Wang · 3 days ago
73bb619 [SPARK-48235][SQL] Directly pass join instead of all arguments to getBroadcastBuildSide and getShuffleHashJoinBuildSide by fred-db · 3 days ago
7ef0440 [SPARK-48146][SQL] Fix aggregate function in With expression child assertion by Kelvin Jiang · 3 days ago
259760a [SPARK-48228][PYTHON][CONNECT][FOLLOWUP] Also apply `_validate_pandas_udf` in MapInXXX by Ruifeng Zheng · 3 days ago
256a238 [SPARK-48232][PYTHON][TESTS] Fix 'pyspark.sql.tests.connect.test_connect_session' in Python 3.12 build by Hyukjin Kwon · 3 days ago
d815118 [SPARK-48230][BUILD] Remove unused `jodd-core` by Cheng Pan · 3 days ago
2df494f [SPARK-48158][SQL] Add collation support for XML expressions by Uros Bojanic · 3 days ago
33cac44 [SPARK-47847][CORE] Deprecate `spark.network.remoteReadNioBufferConversion` by Cheng Pan · 3 days ago
8ccc8b9 [SPARK-48201][DOCS][PYTHON] Make some corrections in the docstring of pyspark DataStreamReader methods by Chloe He · 3 days ago
9bb15db [SPARK-48228][PYTHON][CONNECT] Implement the missing function validation in ApplyInXXX by Ruifeng Zheng · 3 days ago
b371e7d [SPARK-48224][SQL] Disallow map keys from being of variant type by Harsh Motwani · 3 days ago
2d609bf [SPARK-47018][BUILD][SQL] Bump built-in Hive to 2.3.10 by Cheng Pan · 3 days ago
1138b2a [MINOR][BUILD] Remove duplicate configuration of maven-compiler-plugin by zml1206 · 3 days ago
32b2827 [SPARK-47834][SQL][CONNECT] Mark deprecated functions with `@deprecated` in `SQLImplicits` by YangJie · 3 days ago
a41d0ae [SPARK-48176][SQL] Adjust name of FIELD_ALREADY_EXISTS error condition by Nicholas Chammas · 3 days ago
9a28188 [SPARK-48222][INFRA][DOCS] Sync Ruby Bundler to 2.4.22 and refresh Gem lock file by Nicholas Chammas · 3 days ago
012d19d [SPARK-48227][PYTHON][DOC] Document the requirement of seed in protos by Ruifeng Zheng · 3 days ago
71f0eda [SPARK-48180][SQL] Improve error when UDTF call with TABLE arg forgets parentheses around multiple PARTITION/ORDER BY exprs by Daniel Tenedorio · 3 days ago
e704b9e [SPARK-48226][BUILD] Add `spark-ganglia-lgpl` to `lint-java` & `spark-ganglia-lgpl` and `jvm-profiler` to `sbt-checkstyle` by panbingkun · 3 days ago
b47d785 [SPARK-48148][CORE] JSON objects should not be modified when read as STRING by Eric Maynard · 3 days ago
e1fb1d7 [SPARK-48216][TESTS] Remove overrides DockerJDBCIntegrationSuite.connectionTimeout to make related tests configurable by Kent Yao · 4 days ago
21333f8 [SPARK-47409][SQL] Add support for collation for StringTrim type of functions/expressions (for UTF8_BINARY & LCASE) by David Milicevic · 4 days ago
3fd38d4 [SPARK-47803][FOLLOWUP] Check nulls when casting nested type to variant by Chenhao Li · 4 days ago
207d675 [SPARK-48211][SQL] DB2: Read SMALLINT as ShortType by Kent Yao · 4 days ago
9e62dba [SPARK-48212][PYTHON][CONNECT][TESTS] Fully enable `PandasUDFParityTests.test_udf_wrong_arg` by Ruifeng Zheng · 4 days ago
ecca1bf [SPARK-47365][PYTHON] Add toArrow() DataFrame method to PySpark by Ian Cook · 4 days ago
027327d [SPARK-47986][CONNECT][PYTHON] Unable to create a new session when the default session is closed by the server by Niranjan Jayakar · 4 days ago
34ee0d8 [SPARK-47421][SQL] Add collation support for URL expressions by Uros Bojanic · 4 days ago
045ec6a [SPARK-48208][SS] Skip providing memory usage metrics from RocksDB if bounded memory usage is enabled by Anish Shrigondekar · 4 days ago
91da4ac [SPARK-47354][SQL] Add collation support for variant expressions by Uros Bojanic · 4 days ago
a4ab82b [SPARK-48186][SQL] Add support for AbstractMapType by Uros Bojanic · 4 days ago
6cc3dc2 [SPARK-48169][SPARK-48143][SQL] Revert BadRecordException optimizations by Vladimir Golubev · 4 days ago
85a6e35 [SPARK-48182][SQL] SQL (java side): Migrate `error/warn/info` with variables to structured logging framework by panbingkun · 4 days ago
5891b20 [SPARK-47186][TESTS][FOLLOWUP] Correct the name of spark.test.docker.connectionTimeout by Kent Yao · 4 days ago
7e79e91 [SPARK-48197][SQL] Avoid assert error for invalid lambda function by Wenchen Fan · 4 days ago
337f980 [SPARK-48204][INFRA] Fix release script for Spark 4.0+ by Wenchen Fan · 4 days ago
4fb6624 [SPARK-48205][PYTHON] Remove the private[sql] modifier for Python data sources by allisonwang-db · 4 days ago
21548a8 [SPARK-47545][CONNECT] Dataset `observe` support for the Scala client by Paddy Xu · 4 days ago
fbfcd40 [SPARK-48203][INFRA] Spin off `pyspark` tests from `build_branch34.yml` Daily CI by Dongjoon Hyun · 4 days ago
70e5d2a [SPARK-48202][INFRA] Spin off `pyspark` tests from `build_branch35.yml` Daily CI by Dongjoon Hyun · 5 days ago
9d79ab4 [SPARK-48200][INFRA] Split `build_python.yml` into per-version cron jobs by Dongjoon Hyun · 5 days ago
e0c406e [SPARK-48198][BUILD] Upgrade jackson to 2.17.1 by panbingkun · 5 days ago
47afe77 [SPARK-48184][PYTHON][CONNECT] Always set the seed of `Dataframe.sample` in Client side by Ruifeng Zheng · 5 days ago
8d70816 [SPARK-48161][SQL] Add collation support for JSON expressions by Uros Bojanic · 5 days ago
8950add [SPARK-48188][SQL] Consistently use normalized plan for cache by Wenchen Fan · 5 days ago
003823b [SPARK-48191][SQL] Support UTF-32 for string encode and decode by Vladimir Golubev · 5 days ago
d7f69e7 [SPARK-48190][PYTHON][PS][TESTS] Introduce a helper function to drop metadata by Ruifeng Zheng · 5 days ago
bd896ca Revert "[SPARK-48163][CONNECT][TESTS] Disable `SparkConnectServiceSuite.SPARK-43923: commands send events - get_resources_command`" by Hyukjin Kwon · 5 days ago
1b966d2 [SPARK-47965][SQL][FOLLOW-UP] Uses `null` as its default value for `OptionalConfigEntry` by Hyukjin Kwon · 5 days ago
fe3ef20 [SPARK-48193][INFRA] Make `maven-deploy-plugin` retry 3 times by panbingkun · 5 days ago
fe8b18b [SPARK-48185][SQL] Fix 'symbolic reference class is not accessible: class sun.util.calendar.ZoneInfo' by Kent Yao · 5 days ago
f5401ba [MINOR][INFRA] Rename builds to have consistent names by Hyukjin Kwon · 5 days ago
f693abc [SPARK-48192][INFRA] Enable TPC-DS tests in forked repository by Hyukjin Kwon · 5 days ago
f3d9b81 [SPARK-48187][INFRA] Run `docs` only in PR builders and `build_non_ansi` Daily CI by Dongjoon Hyun · 5 days ago
52a7f63 [SPARK-48183][PYTHON][DOCS] Update error contribution guide to respect new error class file by Haejoon Lee · 5 days ago
5f88311 [SPARK-47914][SQL] Do not display the splits parameter in Range by guihuawen · 5 days ago
3b1ea0f [MINOR][PYTHON][TESTS] Remove the doc in error message tests to allow other PyArrow versions in tests by Hyukjin Kwon · 5 days ago
6588554 [SPARK-48149][INFRA][FOLLOWUP] Use single quotation mark by Dongjoon Hyun · 5 days ago
84c5b91 [SPARK-48131][CORE][FOLLOWUP] Add a new configuration for the MDC key of Task Name by Gengliang Wang · 5 days ago
6bbf6b1 [SPARK-48126][CORE] Make `spark.log.structuredLogging.enabled` effective by Gengliang Wang · 5 days ago
67ae239 [SPARK-48045][PYTHON] Pandas API groupby with multi-agg-relabel ignores as_index=False by sai · 5 days ago
a15adeb [SPARK-47240][CORE][PART1] Migrate logInfo with variables to structured logging framework by Tuan Pham · 5 days ago
553e1b8 [SPARK-48152][BUILD] Make `spark-profiler` as a part of release and publish to maven central repo by panbingkun · 5 days ago
5e49665 [SPARK-47960][SS] Allow chaining other stateful operators after transformWithState operator by Bhuwan Sahni · 5 days ago
3d9d1f3 [SPARK-48134][CORE] Spark core (java side): Migrate `error/warn/info` with variables to structured logging framework by panbingkun · 5 days ago
e24f896 [SPARK-48037][CORE] Fix SortShuffleWriter lacks shuffle write related metrics resulting in potentially inaccurate data by sychen · 6 days ago
26c5036 [SPARK-48174][INFRA] Merge `connect` back to the original test pipeline by Dongjoon Hyun · 6 days ago
8081868 [SPARK-48035][SQL][FOLLOWUP] Fix try_add/try_multiply being semantic equal to add/multiply by Supun Nakandala · 6 days ago
8f719ad [SPARK-41547][CONNECT][TESTS] Re-eneable Spark Connect function tests with ANSI mode by Martin Grund · 6 days ago
a3eebcf [SPARK-48170][PYTHON][CONNECT][TESTS] Enable `ArrowPythonUDFParityTests.test_err_return_type` by Ruifeng Zheng · 6 days ago
925457ca [SPARK-48169][SQL] Use lazy BadRecordException cause in all parsers and remove the old constructor, which was meant for the migration by Vladimir Golubev · 6 days ago
9e0a87e [SPARK-48165][BUILD] Update `ap-loader` to 3.0-9 by panbingkun · 6 days ago
493493d [SPARK-48173][SQL] CheckAnalysis should see the entire query plan by Wenchen Fan · 6 days ago
148f533 [SPARK-47297][SQL] Add collation support for format expressions by Uros Bojanic · 6 days ago
c326f3c [SPARK-48171][CORE] Clean up the use of deprecated constructors of `o.rocksdb.Logger` by yangjie01 · 6 days ago
326dbb4 [SPARK-48143][SQL] Use lightweight exceptions for control-flow between UnivocityParser and FailureSafeParser by Vladimir Golubev · 6 days ago
08c6bb9 [SPARK-47267][SQL] Add collation support for hash expressions by Uros Bojanic · 6 days ago
c4df12c [SPARK-48113][CONNECT] Allow Plugins to integrate with Spark Connect by Tom van Bussel · 6 days ago
7f8ef96 [SPARK-48166][SQL] Avoid using BadRecordException as user-facing error in VariantExpressionEvalUtils by Vladimir Golubev · 6 days ago
98d4ab7 [SPARK-48058][SPARK-43727][PYTHON][CONNECT][TESTS][FOLLOWUP] Code clean up by Ruifeng Zheng · 6 days ago
eee1791 [SPARK-48090][SS][PYTHON][TESTS] Shorten the traceback in the test checking error message in UDF by Hyukjin Kwon · 6 days ago
b5e39be [SPARK-48027][SQL] InjectRuntimeFilter for multi-level join should check child join type by Angerszhuuuu · 6 days ago
56fe185 [SPARK-48163][CONNECT][TESTS] Disable `SparkConnectServiceSuite.SPARK-43923: commands send events - get_resources_command` by Dongjoon Hyun · 6 days ago
0f58b8e [MINOR][CONNECT][TESTS] Improve test failure error message in StreamingParityTests by Hyukjin Kwon · 6 days ago
3b9f52f [SPARK-48154][PYTHON][CONNECT][TESTS] Enable `PandasUDFGroupedAggParityTests.test_manual` by Ruifeng Zheng · 6 days ago
05b22eb [SPARK-48141][TEST] Update the Oracle docker image version used for test and integration to use Oracle Database 23ai Free by Luca Canali · 6 days ago
8cf602a [SPARK-48150][SQL] try_parse_json output should be declared as nullable by Josh Rosen · 6 days ago
0862f69 [SPARK-48147][SS][CONNECT] Remove client side listeners when local Spark session is deleted by Wei Liu · 6 days ago
0907a15 [SPARK-48153][INFRA] Run `build` job of `build_and_test.yml` only if needed by Dongjoon Hyun · 6 days ago
f918d11 [SPARK-48151][INFRA] `build_and_test.yml` should use `Volcano` 1.7.0 for `branch-3.4/3.5` by Dongjoon Hyun · 6 days ago