1. 0e1101a [SPARK-57388][INFRA] Pin downstream actions/checkout to a single resolved SHA in maven_test.yml and python_hosted_runner_test.yml by Ruifeng Zheng · 7 hours ago master
  2. 5d3aa9a [SPARK-34679][SQL][DOC] Add inferTimestamp option to JSON data source options table by BRIJ RAJ KISHORE · 10 hours ago
  3. 3d4c8b0 [SPARK-56903][SQL][FOLLOWUP] Fix null join key shuffle config version by Chao Sun · 13 hours ago
  4. 913f105 [SPARK-57387][YARN] Make executor JVM options `-XX:OnOutOfMemoryError` configurable on YARN by Cheng Pan · 13 hours ago
  5. f64739d [SPARK-57381][PYTHON] Refactor SQL_WINDOW_AGG_PANDAS_UDF by Yicong Huang · 16 hours ago
  6. 3349526 [SPARK-57360][SQL] Block temporary variables in generated column expressions by Szehon Ho · 16 hours ago
  7. b88952c [SPARK-57393] Build: PySpark and SparkR source distributions are missing LICENSE and NOTICE files by Huaxin Gao · 17 hours ago
  8. bcdcd47 [SPARK-57133][SQL] Add BIN BY relation operator parsing and resolution by Nikolina Vraneš · 17 hours ago
  9. 3fce4cf [SPARK-57073][SS][PYTHON][TEST] Catch AnalysisException for test_parity_listener by Tian Gao · 17 hours ago
  10. a423d06 [SPARK-57327][INFRA] Move scheduled CIs for 4.1 to branch-4.1 by Tian Gao · 18 hours ago
  11. a1922b5 [SPARK-57369][SQL] Move main EXECUTE IMMEDIATE resolution logic to common code by Mihailo Aleksic · 19 hours ago
  12. 9357bc9 [SPARK-57295][SQL] Make database location validation consistent for w… by Anurag Kumar Dwivedi · 20 hours ago
  13. 8cced6f [SPARK-57020][PYTHON][TEST] Add ASV microbenchmark for SQL_TRANSFORM_WITH_STATE_PANDAS_UDF by Yicong Huang · 22 hours ago
  14. be299a1 [SPARK-57361][PYTHON] Refactor SQL_ARROW_UDTF by Yicong Huang · 22 hours ago
  15. 89dff6b [SPARK-57321][SQL] Infer CSV schema from tar archives by akshatshenoi-db · 22 hours ago
  16. 302ba67 [SPARK-56877][SQL][FOLLOWUP] Make PartitioningCollection invariant check O(1) per nesting level by Wenchen Fan · 22 hours ago
  17. e33017a [SPARK-57383][SQL][PYTHON] Honor configured Arrow zstd compression level when writing Arrow batches by Liang-Chi Hsieh · 24 hours ago
  18. 6693d43 [SPARK-57377][INFRA] Add CI check to prevent new entries in the config binding policy exceptions file by Wenchen Fan · 26 hours ago
  19. 90f6bab [SPARK-57368][PYTHON][ML][TEST] Fix assertTrue misuse in PySpark tests by Ruifeng Zheng · 27 hours ago
  20. 60acc8f [SPARK-57332][SQL][FOLLOWUP] Fix line length exceeding 100 characters in JDBCSuite and V2ExpressionSQLBuilder by Kousuke Saruta · 32 hours ago
  21. 9018d84 [SPARK-57263][SQL][FOLLOWUP] Fix Hive 4.2 getTablesByName compatibility by YangJie · 2 days ago
  22. f2d11a6 [SPARK-57313][SQL] Fix SampleExec numOutputRows metric when whole-stage codegen is disabled by Eric Yang · 2 days ago
  23. 79dcac9 [SPARK-53840][SQL] Add AS JSON output support for SHOW TABLES and SHOW TABLE EXTENDED by Ayush · 2 days ago
  24. 8e22e99 [SPARK-57359][DOC] Document the MERGE INTO statement in the SQL reference by Szehon Ho · 2 days ago
  25. f5eabcb [SPARK-57332][SQL] Fix MySQL backslash escaping in LIKE predicate pushdown via a dialect string-literal escaping hook by Wenchen Fan · 2 days ago
  26. da67157 [SPARK-56887][SQL] Add dedicated sort-merge physical operator for AS-OF join by Kousuke Saruta · 2 days ago
  27. 1175d40 [SPARK-57285][SQL] Route nanosecond timestamp cast-to-string through the Types Framework by Maxim Gekk · 2 days ago
  28. d2cbc7f [SPARK-57374][BUILD] Upgrade `netty-tcnative` to 2.0.78.Final by Dongjoon Hyun · 2 days ago
  29. 44984a6 [SPARK-56995][SQL][DML][TESTS][FOLLOWUP] Fix AutoCdcScd1FullRefreshSuite by returning live table from SharedTablesInMemoryRowLevelOperationTableCatalog by Kousuke Saruta · 2 days ago
  30. 9eb44b3 [SPARK-57355][PYTHON] Fix __module__ check in udf profiler by Tian Gao · 2 days ago
  31. 0e8a75f [SPARK-57207][SQL][FOLLOWUP] Fix StackOverflowError when setting timestampNanosTypes.enabled via SparkConf by Stevo Mitric · 2 days ago
  32. a852aa3 [SPARK-57325][CONNECT] Stop streaming queries registered while the Connect session is closing by DB Tsai · 2 days ago
  33. bd04636 [SPARK-57318][SQL] Refactor WorkerSession into a state-machine interface by Haiyang Sun · 2 days ago
  34. 0b05352 [MINOR][PYTHON][TEST] Use assertEqual instead of assertTrue in PySpark tests by Ruifeng Zheng · 2 days ago
  35. 19aec7a [SPARK-57315][SQL] Support HOUR, MINUTE and SECOND functions over nanosecond-precision timestamps by Maxim Gekk · 2 days ago
  36. cc88e6c [SPARK-57367][PYTHON][DOC] Improve See Also cross-references in pyspark.sql.functions by Ruifeng Zheng · 2 days ago
  37. 3ad5b64 [SPARK-57194][SQL] Add preOperatorOptimizationRules extension point to Optimizer by Shrirang Mhalgi · 2 days ago
  38. bbd7c46 [SPARK-57261][SQL] Allow to disable HashAggregateExec by config by Cheng Pan · 2 days ago
  39. 6d4b71e [SPARK-57344][INFRA] Ensure tests for `pipelines` module triggered when sql-related modules are modified by YangJie · 2 days ago
  40. e598b0c [SPARK-57348][PYTHON][TESTS] Replace sql_keywords doctest show() with columns check by Kousuke Saruta · 2 days ago
  41. 62ae4db [SPARK-57152][SDP] Implement SCD2 Batch Processor; Find Affected Aux/Target Table Rows by AnishMahto · 2 days ago
  42. 844f6f0 [SPARK-57338][SQL] Render external values in Row JSON via formatExternal by Maxim Gekk · 2 days ago
  43. 5001ba0 [MINOR][PYTHON][DOC] Fix broken See Also links in pyspark.sql.functions by Ruifeng Zheng · 3 days ago
  44. fc527bc [SPARK-57316][DOC] Document WITH SCHEMA EVOLUTION and BY NAME for SQL INSERT by Thang Long Vu · 3 days ago
  45. af39d95 Revert "[SPARK-57133][SQL] Add BIN BY relation operator parsing and resolution" by Dongjoon Hyun · 3 days ago
  46. 2bb8b20 [SPARK-57351][K8S][CORE] Enable `spark.kubernetes.executor.useDriverPodIP` by default by Dongjoon Hyun · 3 days ago
  47. 6c2325a [SPARK-56758][PYTHON] Refactor SQL_MAP_PANDAS_ITER_UDF by Yicong Huang · 3 days ago
  48. 5227368 [SPARK-57212][SQL][FOLLOWUP] Record AQE rule timing into the shared tracker via a lock instead of per-node trackers by Wenchen Fan · 3 days ago
  49. 05b4635 [SPARK-57148][SQL] Rename splitSemiColonWithIndex to splitSemiColon by Anupam Yadav · 3 days ago
  50. 029731c [SPARK-57349][CONNECT] Split udf protocol into message and grpc service. by Haiyang Sun · 3 days ago
  51. e94782c [SPARK-56538][CONNECT] Add per-RPC deadlines to Spark Connect client by pranavdev022 · 3 days ago
  52. 098057d [SPARK-57212][SQL] Track preparation and AQE rule timing in `QueryPlanningTracker` by Peter Toth · 3 days ago
  53. ac4457e [SPARK-57259][SQL][TEST] Add nanosecond timestamp types to DataTypeTestUtils type sets by Maxim Gekk · 3 days ago
  54. bf79473 [SPARK-57274][CONNECT] Support fetch/type accessors and getMoreResults for SparkConnectStatement by Jiwon Park · 3 days ago
  55. 391d65a [SPARK-57234][SS][DOCS] Add Real-time Mode documentation page to the Structured Streaming guide by Boyang Jerry Peng · 3 days ago
  56. 13ea0f5 [SPARK-57281][SQL][SS] Remove @Experimental annotation from Real-time mode by Boyang Jerry Peng · 3 days ago
  57. 3e7cae7 [SPARK-50520][PYTHON] Respect timeout in df.rdd.countApprox() by Rishav Sinha · 4 days ago
  58. b67073f [SPARK-57253][SQL] Add `jaro_winkler_similarity` built-in function by Kousuke Saruta · 4 days ago
  59. cc64f0a [SPARK-52719][SQL] Support using scalar UDFs in TVF arguments by Anupam Yadav · 4 days ago
  60. 3744250 [SPARK-54876][SQL] Fix splitSemiColon dropping statement ending with block comment by Anupam Yadav · 4 days ago
  61. ac0c117 [SPARK-57330][INFRA] Switch shared CI compile artifacts to zstd compression by Ruifeng Zheng · 4 days ago
  62. 0993d43 [SPARK-57314][PS][TEST] Add tests for Index.equals in pandas-on-Spark by tonghuaroot (童话) · 4 days ago
  63. 3c31d68 [SPARK-57287][SQL] Escape backslash in LIKE pattern for STARTS_WITH/ENDS_WITH/CONTAINS pushdown by Shrirang Mhalgi · 4 days ago
  64. 49908a2 [SPARK-57298][SQL] collect_set fails to dedupe float/double NaN/-0.0 by their semantics by Eric Yang · 4 days ago
  65. 0849776 [SPARK-57326][SQL][TEST] Honor DEFAULT_ARTIFACT_REPOSITORY in IsolatedClientLoaderIvySettingsSuite by Wenchen Fan · 4 days ago
  66. bee16dc [SPARK-57135][SQL] Support reading CSV files inside tar archives by akshatshenoi-db · 4 days ago
  67. 3ebf8d6 [SPARK-57320][BUILD] Upgrade Netty to 4.2.15.Final by Dongjoon Hyun · 4 days ago
  68. d9c50b2 [SPARK-57224][INFRA] Add input check for merge script by Tian Gao · 4 days ago
  69. 761afcb [SPARK-57133][SQL] Add BIN BY relation operator parsing and resolution by Nikolina Vraneš · 4 days ago
  70. 952a283 [SPARK-57254][INFRA] Put CI-unrelated files in a module so CI won't be triggered by Tian Gao · 4 days ago
  71. 99db069 [SPARK-57317][SQL] Fix Literal.create for external nanosecond timestamp values by Maxim Gekk · 4 days ago
  72. 7129ce0 [SPARK-37019][SQL] Add codegen support to array higher-order functions by Adam Binford · 4 days ago
  73. b55c2cc [SPARK-57282][SQL] Spread NULL left anti join keys across shuffle partitions by Chao Sun · 4 days ago
  74. 542ea3b [SPARK-56995][SQL][DML] Allow dataframe caching in the DSv2 Transaction API by Andreas Chatzistergiou · 4 days ago
  75. 1592ec2 [SPARK-56661] Addressing review comments from PR #55768 by Sven Weber · 4 days ago
  76. b098a58 [SPARK-57258][SQL] Reduce regexp_extract/regexp_extract_all generated code size via shared extract helpers by YangJie · 4 days ago
  77. 3e02257 [SPARK-57294][PS] Support DataFrame.combine in fallback mode by tonghuaroot (童话) · 4 days ago
  78. e8ca287 [SPARK-56830][INFRA] Share SBT compile artifact with python hosted runner CI jobs by Ruifeng Zheng · 4 days ago
  79. 4d8b715 [SPARK-57277][INFRA] Make CI cache keys OS-specific by Ruifeng Zheng · 4 days ago
  80. 9c1adaf [SPARK-57278][INFRA] Install zstd in CI container images to fix GitHub Actions cache by Ruifeng Zheng · 4 days ago
  81. 96b255f [SPARK-57262][SQL][WEBUI] Job description derived from a query should respect `spark.sql.redaction.string.regex` by Kousuke Saruta · 4 days ago
  82. 2660f4d [SPARK-57293][SQL] Cast between nanosecond-precision and microsecond-precision timestamp types by Maxim Gekk · 4 days ago
  83. 5077f7f [SPARK-57255][SQL] Simplify RegExpReplace codegen by extracting the match/replace loop into a shared helper by YangJie · 5 days ago
  84. a6ac0b8 [SPARK-57141][SS][RTM][STREAMINGSHUFFLE][PART3] Add StreamingShuffleManager and MultiShuffleManager by Boyang Jerry Peng · 5 days ago
  85. 4bc7196 [SPARK-57297][SQL][TESTS] Add a test that SQL execution description respects `spark.sql.redaction.string.regex` by Dongjoon Hyun · 5 days ago
  86. c9e7421 Revert "[SPARK-57262][SQL][WEBUI] Job description derived from a query should respect `spark.sql.redaction.string.regex`" by Dongjoon Hyun · 6 days ago
  87. 583e5bb [SPARK-57262][SQL][WEBUI] Job description derived from a query should respect `spark.sql.redaction.string.regex` by Kousuke Saruta · 6 days ago
  88. b66d392 [SPARK-57284][PYTHON][SQL] Add Scala/Python bindings for vector functions by Kousuke Saruta · 6 days ago
  89. c082f82 [SPARK-56645][CORE] Fix History Server serving stale UI after app completes by cxzl25 · 6 days ago
  90. ccffd01 [SPARK-57256][SQL] Cast nanosecond-precision timestamps to string by Maxim Gekk · 7 days ago
  91. f3f5677 [SPARK-57247][SQL][CONNECT] Support DataFrame.zip in Spark Connect by Ruifeng Zheng · 7 days ago
  92. e113afc [SPARK-57286][BUILD] Add `wildfly-openssl-macosx-aarch64` dependency to support Apple Silicon by Dongjoon Hyun · 7 days ago
  93. 060a617 [SPARK-57283][BUILD] Upgrade `wildfly-openssl` to 2.3.0.Final by Dongjoon Hyun · 7 days ago
  94. 637803e [SPARK-57257][SQL] Support nanosecond-precision timestamps in Hive results by Maxim Gekk · 7 days ago
  95. 042ad7d [SPARK-57176][SQL] Extend nested column pruning through array-returning functions by Chao Sun · 7 days ago
  96. 0536814 [SPARK-57273][BUILD] Upgrade jackson to 2.21.4 by Dongjoon Hyun · 7 days ago
  97. a32cda3 [SPARK-57260][SQL] Fix variable resolution in REPLACE WHERE clause of INSERT INTO by Joel Robin P · 7 days ago
  98. b2580fc [SPARK-57263][SQL] Support Hive 4.2 metastore by YangJie · 7 days ago
  99. 9e32a26 [SPARK-57250][SQL] Construct sub-microsecond timestamp typed literals with precision derived from fractional digits by Maxim Gekk · 7 days ago
  100. 4915340 [SPARK-57207][SQL] Support nanosecond timestamp types in the Types Framework by Maxim Gekk · 8 days ago