Log - HEAD - spark - Git at Google

9e8c4aa [SPARK-46122][SQL] Set `spark.sql.legacy.createHiveTableByDefault` to `false` by default by Dongjoon Hyun · 9 weeks ago
ed5aa56 [SPARK-48055][PYTHON][CONNECT][TESTS] Enable `PandasUDFScalarParityTests.{test_vectorized_udf_empty_partition, test_vectorized_udf_struct_with_empty_partition}` by Ruifeng Zheng · 9 weeks ago
e0af824 [SPARK-48053][PYTHON][CONNECT] SparkSession.createDataFrame should warn for unsupported options by Hyukjin Kwon · 9 weeks ago
87b20b1 [SPARK-47585][SQL] SQL core: Migrate logInfo with variables to structured logging framework by panbingkun · 9 weeks ago
e6217c1 [SPARK-48030][SQL] SPJ: cache rowOrdering and structType for InternalRowComparableWrapper by Xianjin · 9 weeks ago
da92293 [SPARK-48033][SQL] Fix `RuntimeReplaceable` expressions being used in default columns by Richard Chen · 9 weeks ago
fe05eb8 [SPARK-47741] Added stack overflow handling in parser by milastdbx · 9 weeks ago
3fbcb26 [SPARK-48016][SQL] Fix a bug in try_divide function when with decimals by Gengliang Wang · 9 weeks ago
c9ed9df [SPARK-48042][SQL] Use a timestamp formatter with timezone at class level instead of making copies at method level by Kent Yao · 9 weeks ago
c35a21e [SPARK-48044][PYTHON][CONNECT] Cache `DataFrame.isStreaming` by Ruifeng Zheng · 9 weeks ago
f781d15 [SPARK-48046][K8S] Remove `clock` parameter from `DriverServiceFeatureStep` by Dongjoon Hyun · 9 weeks ago
d42c10d [SPARK-47693][TESTS][FOLLOWUP] Reduce CollationBenchmarks time by Kent Yao · 9 weeks ago
ccb0eb6 [SPARK-48038][K8S] Promote driverServiceName to KubernetesDriverConf by Cheng Pan · 9 weeks ago
d913d1b [SPARK-47148][SQL] Avoid to materialize AQE ExchangeQueryStageExec on the cancellation by erenavsarogullari · 9 weeks ago
3f15ad4 [SPARK-47994][SQL] Fix bug with CASE WHEN column filter push down in SQLServer by Stefan Bukorovic · 9 weeks ago
c74f584 [SPARK-48039][PYTHON][CONNECT] Update the error class for `group.apply` by Ruifeng Zheng · 9 weeks ago
7b1147a [SPARK-47567][SQL] Support LOCATE function to work with collated strings by Milan Dankovic · 9 weeks ago
0e52b59 [SPARK-47939][SQL] Implement a new Analyzer rule to move ParameterizedQuery inside ExplainCommand and DescribeQueryCommand by Vladimir Golubev · 9 weeks ago
eaed585 [SPARK-48002][PYTHON][SS][TESTS] Adds sleep before event testing after query termination by Hyukjin Kwon · 9 weeks ago
ff0751a [MINOR][DOCS] Remove space in the middle of configuration name in Arrow-optimized Python UDF page by Hyukjin Kwon · 9 weeks ago
e1445e3 [SPARK-48036][DOCS] Update `sql-ref-ansi-compliance.md` and `sql-ref-identifier.md` by Dongjoon Hyun · 9 weeks ago
9a42610 [SPARK-48029][INFRA] Update the packages name removed in building the spark docker image by panbingkun · 9 weeks ago
59d5946 [SPARK-48034][TESTS] NullPointerException in MapStatusesSerDeserBenchmark by Kent Yao · 9 weeks ago
8f1634e [SPARK-48032][BUILD] Upgrade `commons-codec` to 1.17.0 by panbingkun · 9 weeks ago
3d62dd7 [SPARK-47730][K8S] Support `APP_ID` and `EXECUTOR_ID` placeholders in labels by Xi Chen · 9 weeks ago
8c446f3 [SPARK-47292][SS] safeMapToJValue should consider null typed values by Wei Liu · 9 weeks ago
506b2d5 [SPARK-48014][SQL] Change the makeFromJava error in EvaluatePython to a user-facing error by allisonwang-db · 9 weeks ago
023f07d [SPARK-47933][CONNECT][PYTHON][FOLLOW-UP] Remove `pyspark.sql.classic` reference in `pyspark.ml.stat` by Hyukjin Kwon · 9 weeks ago
0bf3945 [SPARK-48025][SQL][TESTS] Fix org.apache.spark.sql.execution.benchmark.DateTimeBenchmark by Kent Yao · 9 weeks ago
657d9d0 [SPARK-48024][PYTHON][CONNECT][TESTS] Enable `UDFParityTests.test_udf_timestamp_ntz` by Ruifeng Zheng · 9 weeks ago
a80579b [SPARK-48002][PYTHON][SS] Add test for observed metrics in PySpark StreamingQueryListener by Wei Liu · 9 weeks ago
1f9e09c [SPARK-48021][ML][BUILD][FOLLOWUP] add `--add-modules=jdk.incubator.vector` to maven compile args by panbingkun · 9 weeks ago
8b8ea60 [SPARK-47927][SQL] Fix nullability attribute in UDF decoder by Emil Ejbyfeldt · 9 weeks ago
76ce6b0 [SPARK-48019] Fix incorrect behavior in ColumnVector/ColumnarArray with dictionary and nulls by Gene Pang · 9 weeks ago
64d3219 [SPARK-48021][ML][BUILD] Add `--add-modules=jdk.incubator.vector` to `JavaModuleOptions` by panbingkun · 9 weeks ago
356830ad [SPARK-48020][INFRA][PYTHON] Pin 'pandas==2.2.2' by Ruifeng Zheng · 9 weeks ago
b623601 [SPARK-47408][SQL] Fix mathExpressions that use StringType by Mihailo Milosevic · 9 weeks ago
d5712ce [SPARK-48018][SS] Fix null groupId causing missing param error when throwing KafkaException.couldNotReadOffsetRange by micheal-o · 9 weeks ago
2b2a33c [SPARK-48011][CORE] Store LogKey name as a value to avoid generating new string instances by Gengliang Wang · 9 weeks ago
6098bd9 [SPARK-48010][SQL] Avoid repeated calls to conf.resolver in resolveExpression by Nikhil Sheoran · 9 weeks ago
9cf6dc8 [SPARK-47963][CORE] Make the external Spark ecosystem can use structured logging mechanisms by panbingkun · 9 weeks ago
675f5f0b [SPARK-47818][CONNECT][FOLLOW-UP] Introduce plan cache in SparkConnectPlanner to improve performance of Analyze requests by Xi Lyu · 9 weeks ago
78b19d5 [SPARK-48005][PS][CONNECT][TESTS] Enable `DefaultIndexParityTests.test_index_distributed_sequence_cleanup` by Ruifeng Zheng · 9 weeks ago
beda1a4 [SPARK-47440][SQL][FOLLOWUP] Reenable predicate pushdown for syntax with boolean comparison in MsSqlServer by Kent Yao · 9 weeks ago
733e53a [SPARK-47968][SQL] MsSQLServer: Map datatimeoffset to TimestampType by Kent Yao · 9 weeks ago
07b84dd [SPARK-47476][SQL] Support REPLACE function to work with collated strings by Milan Dankovic · 9 weeks ago
4ee528f [SPARK-48007][BUILD][TESTS] Upgrade `mssql.jdbc` to `12.6.1.jre11` by Kent Yao · 9 weeks ago
afe310d [SPARK-47351][SQL] Add collation support for StringToMap & Mask string expressions by Uros Bojanic · 9 weeks ago
6c40214 [SPARK-47350][SQL] Add collation support for SplitPart string expression by Uros Bojanic · 9 weeks ago
b8b6d17 [SPARK-48004][SQL] Add WriteFilesExecBase trait for v1 write by ulysses-you · 9 weeks ago
95d6c61 [SPARK-47355][SQL] Use wildcard imports in CollationTypeCasts by Uros Bojanic · 9 weeks ago
e04ac56 [SPARK-45225][SQL][FOLLOW-UP] XML: Fix nested XSD file path resolution by Sandip Agarwala · 9 weeks ago
3451e66 [SPARK-47993][PYTHON] Drop Python 3.8 by Hyukjin Kwon · 9 weeks ago
2e5825f [SPARK-47858][PYTHON][FOLLOWUP] Excluding Python magic methods from error context target by Haejoon Lee · 9 weeks ago
a715007 [SPARK-48001][CORE] Remove unused `private implicit def arrayToArrayWritable` from `SparkContext` by yangjie01 · 9 weeks ago
7d04d0f [SPARK-47986][CONNECT][PYTHON] Unable to create a new session when the default session is closed by the server by Niranjan Jayakar · 9 weeks ago
b0e03a1 [SPARK-47999][SS] Improve logging around snapshot creation and adding/removing entries from state cache map in HDFS backed state store provider by Anish Shrigondekar · 9 weeks ago
033ca3e [SPARK-47922][SQL] Implement the try_parse_json expression by Harsh Motwani · 9 weeks ago
ea4b7a2 [SPARK-47991][SQL][TEST] Arrange the test cases for window frames and window functions by beliefer · 9 weeks ago
79357c8 [SPARK-47933][CONNECT][PYTHON][FOLLOW-UP] Avoid referencing _to_seq in `pyspark-connect` by Hyukjin Kwon · 9 weeks ago
d540786 [SPARK-47597][STREAMING] Streaming: Migrate logInfo with variables to structured logging framework by Daniel Tenedorio · 9 weeks ago
994775a [SPARK-47955][SQL] Improve `DeduplicateRelations` performance by Peter Toth · 9 weeks ago
e1d0212 [SPARK-45425][DOCS][FOLLOWUP] Add a migration guide for TINYINT type mapping change by Kent Yao · 9 weeks ago
287d020 [SPARK-47989][SQL] MsSQLServer: Fix the scope of spark.sql.legacy.mssqlserver.numericMapping.enabled by Kent Yao · 9 weeks ago
de5c512 [SPARK-47987][PYTHON][CONNECT][TESTS] Enable `ArrowParityTests.test_createDataFrame_empty_partition` by Ruifeng Zheng · 9 weeks ago
5810554 [SPARK-47990][BUILD] Upgrade `zstd-jni` to 1.5.6-3 by panbingkun · 9 weeks ago
24b0c75 [SPARK-47985][PYTHON] Simplify functions with `lit` by Ruifeng Zheng · 9 weeks ago
a84cffd [SPARK-47982][BUILD] Update some code style's plugins to latest version by panbingkun · 9 weeks ago
5f730c8 [SPARK-47984][ML][SQL] Change `MetricsAggregate/V2Aggregator#serialize/deserialize` to call `SparkSerDeUtils#serialize/deserialize` by yangjie01 · 9 weeks ago
7090bc1 [SPARK-47981][BUILD] Upgrade `Arrow` to 16.0.0 by sychen · 9 weeks ago
a066d0c [SPARK-47983][SQL] Demote spark.sql.pyspark.legacy.inferArrayTypeFromFirstElement.enabled to internal by Kent Yao · 9 weeks ago
775bc54 [SPARK-47580][SQL] SQL catalyst: eliminate unnamed variables in error logs by Gengliang Wang · 9 weeks ago
08caa56 [SPARK-47980][SQL][TESTS] Reactivate test 'Empty float/double array columns raise EOFException' by Kent Yao · 9 weeks ago
dab4a04 [SPARK-47973][CORE] Log call site in SparkContext.stop() and later in SparkContext.assertNotStopped() by Bo Zhang · 9 weeks ago
b4624bf [SPARK-47414][SQL] Lowercase collation support for regexp expressions by Uros Bojanic · 9 weeks ago
c6aaa18 Revert "[SPARK-45302][PYTHON] Remove PID communication between Pythonworkers when no demon is used" by Hyukjin Kwon · 9 weeks ago
ea37c86 [SPARK-47962][PYTHON][DOCS] PySpark Dataframe doc test improvement by Wei Liu · 9 weeks ago
d233892 [SPARK-47967][SQL] Make `JdbcUtils.makeGetter` handle reading time type as NTZ correctly by Kent Yao · 9 weeks ago
0fcced6 [SPARK-47979][SQL][TESTS] Use Hive tables explicitly for Hive table capability tests by Dongjoon Hyun · 9 weeks ago
5a1559a [SPARK-47921][CONNECT] Fix ExecuteJobTag creation in ExecuteHolder by allisonwang-db · 9 weeks ago
62dd64a [SPARK-47583][CORE] SQL core: Migrate logError with variables to structured logging framework by Daniel Tenedorio · 9 weeks ago
03d4ea6 [SPARK-47974][BUILD] Remove `install_scala` from `build/mvn` by Cheng Pan · 9 weeks ago
09ed09c [SPARK-47958][TESTS] Change LocalSchedulerBackend to notify scheduler of executor on start by Davin Tjong · 10 weeks ago
0042b67 [SPARK-47965][CORE] Avoid orNull in TypedConfigBuilder and OptionalConfigEntry by Hyukjin Kwon · 10 weeks ago
0f1b644 [SPARK-47971][PYTHON][CONNECT][TESTS] Reenable `PandasUDFGroupedAggParityTests.test_grouped_with_empty_partition` by Ruifeng Zheng · 10 weeks ago
c44493d [SPARK-47764][CORE][SQL] Cleanup shuffle dependencies based on ShuffleCleanupMode by Bo Zhang · 10 weeks ago
461ffa1 [SPARK-47692][SQL] Fix default StringType meaning in implicit casting by Mihailo Milosevic · 10 weeks ago
890f78d [SPARK-47418][SQL] Add hand-crafted implementations for lowercase unicode-aware contains, startsWith and endsWith and optimize UTF8_BINARY_LCASE by Vladimir Golubev · 10 weeks ago
6f01982 [SPARK-47964][PYTHON][CONNECT] Hide SQLContext and HiveContext in pyspark-connect by Hyukjin Kwon · 10 weeks ago
fd695be [SPARK-47903][PYTHON][FOLLOW-UP] Removed changes relating to try_parse_json by Harsh Motwani · 10 weeks ago
cb1e1f5 [SPARK-47969][PYTHON][TESTS] Make `test_creation_index` deterministic by Ruifeng Zheng · 10 weeks ago
3cf0c83 [SPARK-47771][PYTHON][DOCS][TESTS][FOLLOWUP] Make `max_by, min_by` doctests deterministic by Ruifeng Zheng · 10 weeks ago
aa4a84b [SPARK-47933][PYTHON][CONNECT][FOLLOW-UP] Add a check of `__name__` at `_with_origin` by Hyukjin Kwon · 10 weeks ago
b3c11ef Revert "Revert "[SPARK-45302][PYTHON] Remove PID communication between Python workers when no demon is used"" by Hyukjin Kwon · 10 weeks ago
e8f529b Revert "[SPARK-45302][PYTHON] Remove PID communication between Python workers when no demon is used" by Hyukjin Kwon · 10 weeks ago
390fb74 [SPARK-47941][SS][CONNECT] Propagate ForeachBatch worker initialization errors to users for PySpark by Eric Marnadi · 10 weeks ago
c88fabf [SPARK-47604][CORE] Resource managers: Migrate logInfo with variables to structured logging framework by panbingkun · 10 weeks ago
e50737b [SPARK-47864][FOLLOWUP][PYTHON][DOCS] Fix minor typo: "MLLib" -> "MLlib" by Haejoon Lee · 10 weeks ago
66613ba [SPARK-47956][SQL] Sanity check for unresolved LCA reference by Wenchen Fan · 10 weeks ago
2b01755 [SPARK-47948][PYTHON] Upgrade the minimum `Pandas` version to 2.0.0 by Haejoon Lee · 10 weeks ago