1. 9e8c4aa [SPARK-46122][SQL] Set `spark.sql.legacy.createHiveTableByDefault` to `false` by default by Dongjoon Hyun · 7 weeks ago
  2. ed5aa56 [SPARK-48055][PYTHON][CONNECT][TESTS] Enable `PandasUDFScalarParityTests.{test_vectorized_udf_empty_partition, test_vectorized_udf_struct_with_empty_partition}` by Ruifeng Zheng · 7 weeks ago
  3. e0af824 [SPARK-48053][PYTHON][CONNECT] SparkSession.createDataFrame should warn for unsupported options by Hyukjin Kwon · 7 weeks ago
  4. 87b20b1 [SPARK-47585][SQL] SQL core: Migrate logInfo with variables to structured logging framework by panbingkun · 7 weeks ago
  5. e6217c1 [SPARK-48030][SQL] SPJ: cache rowOrdering and structType for InternalRowComparableWrapper by Xianjin · 7 weeks ago
  6. da92293 [SPARK-48033][SQL] Fix `RuntimeReplaceable` expressions being used in default columns by Richard Chen · 7 weeks ago
  7. fe05eb8 [SPARK-47741] Added stack overflow handling in parser by milastdbx · 7 weeks ago
  8. 3fbcb26 [SPARK-48016][SQL] Fix a bug in try_divide function when with decimals by Gengliang Wang · 7 weeks ago
  9. c9ed9df [SPARK-48042][SQL] Use a timestamp formatter with timezone at class level instead of making copies at method level by Kent Yao · 7 weeks ago
  10. c35a21e [SPARK-48044][PYTHON][CONNECT] Cache `DataFrame.isStreaming` by Ruifeng Zheng · 7 weeks ago
  11. f781d15 [SPARK-48046][K8S] Remove `clock` parameter from `DriverServiceFeatureStep` by Dongjoon Hyun · 7 weeks ago
  12. d42c10d [SPARK-47693][TESTS][FOLLOWUP] Reduce CollationBenchmarks time by Kent Yao · 7 weeks ago
  13. ccb0eb6 [SPARK-48038][K8S] Promote driverServiceName to KubernetesDriverConf by Cheng Pan · 7 weeks ago
  14. d913d1b [SPARK-47148][SQL] Avoid to materialize AQE ExchangeQueryStageExec on the cancellation by erenavsarogullari · 7 weeks ago
  15. 3f15ad4 [SPARK-47994][SQL] Fix bug with CASE WHEN column filter push down in SQLServer by Stefan Bukorovic · 7 weeks ago
  16. c74f584 [SPARK-48039][PYTHON][CONNECT] Update the error class for `group.apply` by Ruifeng Zheng · 7 weeks ago
  17. 7b1147a [SPARK-47567][SQL] Support LOCATE function to work with collated strings by Milan Dankovic · 7 weeks ago
  18. 0e52b59 [SPARK-47939][SQL] Implement a new Analyzer rule to move ParameterizedQuery inside ExplainCommand and DescribeQueryCommand by Vladimir Golubev · 7 weeks ago
  19. eaed585 [SPARK-48002][PYTHON][SS][TESTS] Adds sleep before event testing after query termination by Hyukjin Kwon · 8 weeks ago
  20. ff0751a [MINOR][DOCS] Remove space in the middle of configuration name in Arrow-optimized Python UDF page by Hyukjin Kwon · 8 weeks ago
  21. e1445e3 [SPARK-48036][DOCS] Update `sql-ref-ansi-compliance.md` and `sql-ref-identifier.md` by Dongjoon Hyun · 8 weeks ago
  22. 9a42610 [SPARK-48029][INFRA] Update the packages name removed in building the spark docker image by panbingkun · 8 weeks ago
  23. 59d5946 [SPARK-48034][TESTS] NullPointerException in MapStatusesSerDeserBenchmark by Kent Yao · 8 weeks ago
  24. 8f1634e [SPARK-48032][BUILD] Upgrade `commons-codec` to 1.17.0 by panbingkun · 8 weeks ago
  25. 3d62dd7 [SPARK-47730][K8S] Support `APP_ID` and `EXECUTOR_ID` placeholders in labels by Xi Chen · 8 weeks ago
  26. 8c446f3 [SPARK-47292][SS] safeMapToJValue should consider null typed values by Wei Liu · 8 weeks ago
  27. 506b2d5 [SPARK-48014][SQL] Change the makeFromJava error in EvaluatePython to a user-facing error by allisonwang-db · 8 weeks ago
  28. 023f07d [SPARK-47933][CONNECT][PYTHON][FOLLOW-UP] Remove `pyspark.sql.classic` reference in `pyspark.ml.stat` by Hyukjin Kwon · 8 weeks ago
  29. 0bf3945 [SPARK-48025][SQL][TESTS] Fix org.apache.spark.sql.execution.benchmark.DateTimeBenchmark by Kent Yao · 8 weeks ago
  30. 657d9d0 [SPARK-48024][PYTHON][CONNECT][TESTS] Enable `UDFParityTests.test_udf_timestamp_ntz` by Ruifeng Zheng · 8 weeks ago
  31. a80579b [SPARK-48002][PYTHON][SS] Add test for observed metrics in PySpark StreamingQueryListener by Wei Liu · 8 weeks ago
  32. 1f9e09c [SPARK-48021][ML][BUILD][FOLLOWUP] add `--add-modules=jdk.incubator.vector` to maven compile args by panbingkun · 8 weeks ago
  33. 8b8ea60 [SPARK-47927][SQL] Fix nullability attribute in UDF decoder by Emil Ejbyfeldt · 8 weeks ago
  34. 76ce6b0 [SPARK-48019] Fix incorrect behavior in ColumnVector/ColumnarArray with dictionary and nulls by Gene Pang · 8 weeks ago
  35. 64d3219 [SPARK-48021][ML][BUILD] Add `--add-modules=jdk.incubator.vector` to `JavaModuleOptions` by panbingkun · 8 weeks ago
  36. 356830ad [SPARK-48020][INFRA][PYTHON] Pin 'pandas==2.2.2' by Ruifeng Zheng · 8 weeks ago
  37. b623601 [SPARK-47408][SQL] Fix mathExpressions that use StringType by Mihailo Milosevic · 8 weeks ago
  38. d5712ce [SPARK-48018][SS] Fix null groupId causing missing param error when throwing KafkaException.couldNotReadOffsetRange by micheal-o · 8 weeks ago
  39. 2b2a33c [SPARK-48011][CORE] Store LogKey name as a value to avoid generating new string instances by Gengliang Wang · 8 weeks ago
  40. 6098bd9 [SPARK-48010][SQL] Avoid repeated calls to conf.resolver in resolveExpression by Nikhil Sheoran · 8 weeks ago
  41. 9cf6dc8 [SPARK-47963][CORE] Make the external Spark ecosystem can use structured logging mechanisms by panbingkun · 8 weeks ago
  42. 675f5f0b [SPARK-47818][CONNECT][FOLLOW-UP] Introduce plan cache in SparkConnectPlanner to improve performance of Analyze requests by Xi Lyu · 8 weeks ago
  43. 78b19d5 [SPARK-48005][PS][CONNECT][TESTS] Enable `DefaultIndexParityTests.test_index_distributed_sequence_cleanup` by Ruifeng Zheng · 8 weeks ago
  44. beda1a4 [SPARK-47440][SQL][FOLLOWUP] Reenable predicate pushdown for syntax with boolean comparison in MsSqlServer by Kent Yao · 8 weeks ago
  45. 733e53a [SPARK-47968][SQL] MsSQLServer: Map datatimeoffset to TimestampType by Kent Yao · 8 weeks ago
  46. 07b84dd [SPARK-47476][SQL] Support REPLACE function to work with collated strings by Milan Dankovic · 8 weeks ago
  47. 4ee528f [SPARK-48007][BUILD][TESTS] Upgrade `mssql.jdbc` to `12.6.1.jre11` by Kent Yao · 8 weeks ago
  48. afe310d [SPARK-47351][SQL] Add collation support for StringToMap & Mask string expressions by Uros Bojanic · 8 weeks ago
  49. 6c40214 [SPARK-47350][SQL] Add collation support for SplitPart string expression by Uros Bojanic · 8 weeks ago
  50. b8b6d17 [SPARK-48004][SQL] Add WriteFilesExecBase trait for v1 write by ulysses-you · 8 weeks ago
  51. 95d6c61 [SPARK-47355][SQL] Use wildcard imports in CollationTypeCasts by Uros Bojanic · 8 weeks ago
  52. e04ac56 [SPARK-45225][SQL][FOLLOW-UP] XML: Fix nested XSD file path resolution by Sandip Agarwala · 8 weeks ago
  53. 3451e66 [SPARK-47993][PYTHON] Drop Python 3.8 by Hyukjin Kwon · 8 weeks ago
  54. 2e5825f [SPARK-47858][PYTHON][FOLLOWUP] Excluding Python magic methods from error context target by Haejoon Lee · 8 weeks ago
  55. a715007 [SPARK-48001][CORE] Remove unused `private implicit def arrayToArrayWritable` from `SparkContext` by yangjie01 · 8 weeks ago
  56. 7d04d0f [SPARK-47986][CONNECT][PYTHON] Unable to create a new session when the default session is closed by the server by Niranjan Jayakar · 8 weeks ago
  57. b0e03a1 [SPARK-47999][SS] Improve logging around snapshot creation and adding/removing entries from state cache map in HDFS backed state store provider by Anish Shrigondekar · 8 weeks ago
  58. 033ca3e [SPARK-47922][SQL] Implement the try_parse_json expression by Harsh Motwani · 8 weeks ago
  59. ea4b7a2 [SPARK-47991][SQL][TEST] Arrange the test cases for window frames and window functions by beliefer · 8 weeks ago
  60. 79357c8 [SPARK-47933][CONNECT][PYTHON][FOLLOW-UP] Avoid referencing _to_seq in `pyspark-connect` by Hyukjin Kwon · 8 weeks ago
  61. d540786 [SPARK-47597][STREAMING] Streaming: Migrate logInfo with variables to structured logging framework by Daniel Tenedorio · 8 weeks ago
  62. 994775a [SPARK-47955][SQL] Improve `DeduplicateRelations` performance by Peter Toth · 8 weeks ago
  63. e1d0212 [SPARK-45425][DOCS][FOLLOWUP] Add a migration guide for TINYINT type mapping change by Kent Yao · 8 weeks ago
  64. 287d020 [SPARK-47989][SQL] MsSQLServer: Fix the scope of spark.sql.legacy.mssqlserver.numericMapping.enabled by Kent Yao · 8 weeks ago
  65. de5c512 [SPARK-47987][PYTHON][CONNECT][TESTS] Enable `ArrowParityTests.test_createDataFrame_empty_partition` by Ruifeng Zheng · 8 weeks ago
  66. 5810554 [SPARK-47990][BUILD] Upgrade `zstd-jni` to 1.5.6-3 by panbingkun · 8 weeks ago
  67. 24b0c75 [SPARK-47985][PYTHON] Simplify functions with `lit` by Ruifeng Zheng · 8 weeks ago
  68. a84cffd [SPARK-47982][BUILD] Update some code style's plugins to latest version by panbingkun · 8 weeks ago
  69. 5f730c8 [SPARK-47984][ML][SQL] Change `MetricsAggregate/V2Aggregator#serialize/deserialize` to call `SparkSerDeUtils#serialize/deserialize` by yangjie01 · 8 weeks ago
  70. 7090bc1 [SPARK-47981][BUILD] Upgrade `Arrow` to 16.0.0 by sychen · 8 weeks ago
  71. a066d0c [SPARK-47983][SQL] Demote spark.sql.pyspark.legacy.inferArrayTypeFromFirstElement.enabled to internal by Kent Yao · 8 weeks ago
  72. 775bc54 [SPARK-47580][SQL] SQL catalyst: eliminate unnamed variables in error logs by Gengliang Wang · 8 weeks ago
  73. 08caa56 [SPARK-47980][SQL][TESTS] Reactivate test 'Empty float/double array columns raise EOFException' by Kent Yao · 8 weeks ago
  74. dab4a04 [SPARK-47973][CORE] Log call site in SparkContext.stop() and later in SparkContext.assertNotStopped() by Bo Zhang · 8 weeks ago
  75. b4624bf [SPARK-47414][SQL] Lowercase collation support for regexp expressions by Uros Bojanic · 8 weeks ago
  76. c6aaa18 Revert "[SPARK-45302][PYTHON] Remove PID communication between Pythonworkers when no demon is used" by Hyukjin Kwon · 8 weeks ago
  77. ea37c86 [SPARK-47962][PYTHON][DOCS] PySpark Dataframe doc test improvement by Wei Liu · 8 weeks ago
  78. d233892 [SPARK-47967][SQL] Make `JdbcUtils.makeGetter` handle reading time type as NTZ correctly by Kent Yao · 8 weeks ago
  79. 0fcced6 [SPARK-47979][SQL][TESTS] Use Hive tables explicitly for Hive table capability tests by Dongjoon Hyun · 8 weeks ago
  80. 5a1559a [SPARK-47921][CONNECT] Fix ExecuteJobTag creation in ExecuteHolder by allisonwang-db · 8 weeks ago
  81. 62dd64a [SPARK-47583][CORE] SQL core: Migrate logError with variables to structured logging framework by Daniel Tenedorio · 8 weeks ago
  82. 03d4ea6 [SPARK-47974][BUILD] Remove `install_scala` from `build/mvn` by Cheng Pan · 8 weeks ago
  83. 09ed09c [SPARK-47958][TESTS] Change LocalSchedulerBackend to notify scheduler of executor on start by Davin Tjong · 8 weeks ago
  84. 0042b67 [SPARK-47965][CORE] Avoid orNull in TypedConfigBuilder and OptionalConfigEntry by Hyukjin Kwon · 8 weeks ago
  85. 0f1b644 [SPARK-47971][PYTHON][CONNECT][TESTS] Reenable `PandasUDFGroupedAggParityTests.test_grouped_with_empty_partition` by Ruifeng Zheng · 8 weeks ago
  86. c44493d [SPARK-47764][CORE][SQL] Cleanup shuffle dependencies based on ShuffleCleanupMode by Bo Zhang · 8 weeks ago
  87. 461ffa1 [SPARK-47692][SQL] Fix default StringType meaning in implicit casting by Mihailo Milosevic · 8 weeks ago
  88. 890f78d [SPARK-47418][SQL] Add hand-crafted implementations for lowercase unicode-aware contains, startsWith and endsWith and optimize UTF8_BINARY_LCASE by Vladimir Golubev · 8 weeks ago
  89. 6f01982 [SPARK-47964][PYTHON][CONNECT] Hide SQLContext and HiveContext in pyspark-connect by Hyukjin Kwon · 8 weeks ago
  90. fd695be [SPARK-47903][PYTHON][FOLLOW-UP] Removed changes relating to try_parse_json by Harsh Motwani · 8 weeks ago
  91. cb1e1f5 [SPARK-47969][PYTHON][TESTS] Make `test_creation_index` deterministic by Ruifeng Zheng · 8 weeks ago
  92. 3cf0c83 [SPARK-47771][PYTHON][DOCS][TESTS][FOLLOWUP] Make `max_by, min_by` doctests deterministic by Ruifeng Zheng · 8 weeks ago
  93. aa4a84b [SPARK-47933][PYTHON][CONNECT][FOLLOW-UP] Add a check of `__name__` at `_with_origin` by Hyukjin Kwon · 8 weeks ago
  94. b3c11ef Revert "Revert "[SPARK-45302][PYTHON] Remove PID communication between Python workers when no demon is used"" by Hyukjin Kwon · 8 weeks ago
  95. e8f529b Revert "[SPARK-45302][PYTHON] Remove PID communication between Python workers when no demon is used" by Hyukjin Kwon · 8 weeks ago
  96. 390fb74 [SPARK-47941][SS][CONNECT] Propagate ForeachBatch worker initialization errors to users for PySpark by Eric Marnadi · 8 weeks ago
  97. c88fabf [SPARK-47604][CORE] Resource managers: Migrate logInfo with variables to structured logging framework by panbingkun · 8 weeks ago
  98. e50737b [SPARK-47864][FOLLOWUP][PYTHON][DOCS] Fix minor typo: "MLLib" -> "MLlib" by Haejoon Lee · 8 weeks ago
  99. 66613ba [SPARK-47956][SQL] Sanity check for unresolved LCA reference by Wenchen Fan · 8 weeks ago
  100. 2b01755 [SPARK-47948][PYTHON] Upgrade the minimum `Pandas` version to 2.0.0 by Haejoon Lee · 8 weeks ago