1. 7d8df99 [SPARK-56019][SQL] Close JDBC connection on task kill to unblock native socket reads by Ivan Sadikov · 6 hours ago master
  2. 6423cb0 [SPARK-56407][BUILD][TESTS] Remove pre-built class files and JARs used in artifact transfer tests by Kousuke Saruta · 8 hours ago
  3. 3851cb5 [SPARK-56302][CORE] Free task result memory eagerly during serialization on executor by Tengfei Huang · 12 hours ago
  4. 4018cc7 [SPARK-56338][INFRA][FOLLOWUP] Support MAVEN_MIRROR_URL in SBT launcher bootstrap by Wenchen Fan · 17 hours ago
  5. cf26c42 [SPARK-56397][BUILD] Upgrade `ICU4J` to 78.3 by Dongjoon Hyun · 18 hours ago
  6. 05f6f69b [SPARK-56332][SQL][TESTS] Use `sql.SparkSession` in `trait SQLTestData` by Ruifeng Zheng · 18 hours ago
  7. 528386c [SPARK-56377][PYTHON] Add type hint for shuffle.py by Tian Gao · 18 hours ago
  8. 21ada68 [SPARK-56253][PYTHON][CONNECT] Make spark.read.json accept DataFrame input by Yicong-Huang · 18 hours ago
  9. a7fb5bbe6 [SPARK-56330][CORE] Add TaskInterruptListener to TaskContext for interrupt notifications by Ivan Sadikov · 19 hours ago
  10. efa725e [SPARK-56343][SQL][TESTS] Add MERGE INTO test for type mismatch without schema evolution trigger condition by Szehon Ho · 20 hours ago
  11. 33c18ee [SPARK-56393][K8S][DOCS] Drop K8s v1.33 Support by Dongjoon Hyun · 25 hours ago
  12. 0eac893 [SPARK-56392][SQL] Make Sample.seed Optional to distinguish user-specified vs random seeds by Rahul Sharma · 26 hours ago
  13. a3930d3 [SPARK-56221][SQL][PYTHON][FOLLOWUP] Rename TablePartition and remove SHOW CACHED TABLES / listCachedTables by Wenchen Fan · 26 hours ago
  14. 361b9d6 [SPARK-55278][FOLLOWUP] Add shading rule for udf/worker protobuf as for core and connect by Haiyang Sun · 29 hours ago
  15. 426accf [SPARK-55109][SQL] Enhance RaiseError to generate valid SQL by Felipe Fujiy Pessoto · 30 hours ago
  16. a66bf7d [SPARK-56277][SQL] Add missing toString() to NamespaceChange and TableChange property classes by anshul_baliga7 · 30 hours ago
  17. e96b48b [SPARK-56328][SQL][FOLLOWUP] Handle SubqueryAlias-wrapped inline tables in INSERT VALUES collation fix by ilicmarkodb · 31 hours ago
  18. 590b0d5 [SPARK-56346][SQL] Use PartitionPredicate in DSV2 Metadata Only Delete by Szehon Ho · 2 days ago
  19. 163abe5 [SPARK-56092][SS][CONNECT] Fix NPE in StreamingQueryException.toString() when cause is null by Xiaoxuan Li · 2 days ago
  20. aa6154b [SPARK-55306] Add ability to run Kafka tests in Python by Jerry Peng · 2 days ago
  21. c869549 [SPARK-56372][INFRA] Add cmake to CI Docker images for R fs package compilation by Ruifeng Zheng · 2 days ago
  22. c7c78e0 [MINOR][CORE][TESTS] Fix flaky DAGSchedulerSuite test for push-based shuffle by DB Tsai · 2 days ago
  23. e8b5cb8 [SPARK-56216][SS] Integrate checkpoint V2 with auto-repair snapshot by ericm-db · 2 days ago
  24. b490770 [SPARK-56328][SQL] Fix inline table collation handling for INSERT VALUES and DEFAULT COLLATION by ilicmarkodb · 2 days ago
  25. 7737a94 [SPARK-55278] Introduce module and core abstraction for language-agnostic UDF worker by Haiyang Sun · 2 days ago
  26. 98cdaee [SPARK-55450][SS][PYTHON][DOCS] Document admission control in PySpark streaming data sources by Jitesh Soni · 3 days ago
  27. 9dbe381 [SPARK-56340][PYTHON] Move input_type schema to eval conf by Tian Gao · 3 days ago
  28. 491add8 [SPARK-56364][BUILD][TESTS] Generate Scala-based test JARs dynamically instead of storing pre-built binaries by Kousuke Saruta · 3 days ago
  29. af9c8b3 [SPARK-56273][SQL] Simplify extracting fields from DataSourceV2ScanRelation by Yan Yan · 3 days ago
  30. 975b299 [SPARK-56342][PYTHON] Tighten type hints for refactored eval type functions in worker.py by Yicong-Huang · 3 days ago
  31. 5bb6271 [SPARK-56207][SQL] Replace legacy error codes with named errors in DSv2 connector API by DB Tsai · 3 days ago
  32. f184c25 [SPARK-56361][SS] Provide better error with logging on NPE in stream-stream join by Jungtaek Lim · 3 days ago
  33. ae7f6e3 [SPARK-56248][PYTHON][SS] Optimize python stateful processor serialization to skip unnecessary list/dict/row construction by Jia Teoh · 3 days ago
  34. e42a561 [SPARK-56359][PYTHON] Remove unused ArrowStreamArrowUDFSerializer by Yicong-Huang · 3 days ago
  35. 5beaa5b [SPARK-56353][BUILD][TESTS] Generate Java-based test JARs dynamically instead of storing pre-built binaries by Kousuke Saruta · 3 days ago
  36. 732f30b [SPARK-56357][BUILD] Upgrade sbt to 1.12.8 by yangjie01 · 3 days ago
  37. 08e5436 [SPARK-55657][BUILD] Bump Hadoop 3.5.0 by Cheng Pan · 3 days ago
  38. 3fb32fd [SPARK-56363][INFRA] Add remotes in Spark release image by Hyukjin Kwon · 3 days ago
  39. 7167dad [SPARK-55330][INFRA] Add cmake to release Docker base image by Hyukjin Kwon · 4 days ago
  40. 734a86c [SPARK-56360][INFRA] Wait for base and RM Docker logs in release workflow; avoid hang on early failure by Hyukjin Kwon · 4 days ago
  41. 003855c [SPARK-55115][INFRA][FOLLOW-UP] Fix release workflow log tailing for base Docker image build by Hyukjin Kwon · 4 days ago
  42. b168254 [SPARK-56358][BUILD] Add gson version override to SBT build to align with Maven by yangjie01 · 4 days ago
  43. 6767941 [SPARK-55667][PYTHON][CONNECT][FOLLOW-UP] Remove arguments in check_dependencies by Tian Gao · 4 days ago
  44. 08b1390 [SPARK-56349][PYTHON] Remove unused ArrowStreamAggArrowUDFSerializer by Yicong-Huang · 4 days ago
  45. 561b7b9 [SPARK-56348][PYTHON] Remove unused ArrowBatchUDFSerializer by Yicong-Huang · 4 days ago
  46. c6a198e [MINOR][PYTHON] Fix PySparkException failing when messageParameters is omitted by Fangchen Li · 4 days ago
  47. c1dd15c2 [SPARK-49793][PYTHON][TESTS][FOLLOW-UP] Fix test_caching in connect mode by Tian Gao · 4 days ago
  48. 49e06b3 [SPARK-56317][SQL] GetJsonObjectEvaluator should reuse output buffer by Xianming Lei · 4 days ago
  49. ed01691 [MINOR][DOCS] Fix a typo of "KLL" initialism by donaldchai · 4 days ago
  50. 263c976 [SPARK-56209][BUILD][FOLLOWUP] Exclude Netty transitive dependencies from Vert.x in Kubernetes modules by yangjie01 · 4 days ago
  51. 1ea48e4 [SPARK-56007][CONNECT] Fix ArrowDeserializer to use positional binding for rows by Herman van Hövell · 5 days ago
  52. a20bfc7 [SPARK-56356][BUILD] Fix an issue in release build caused by error on fetching artifacts by Kousuke Saruta · 5 days ago
  53. 842eb7b [SPARK-54938][PYTHON][TEST][FOLLOW-UP] Fix inferred time unit for pandas >= 3 by Ruifeng Zheng · 5 days ago
  54. d9c8eda [SPARK-56322][CONNECT][PYTHON] Fix TypeError when self-joining observed DataFrames by Marcin Wojtyczka · 6 days ago
  55. 504060f [SPARK-56347][TEST] Fix TOCTOU race in DockerJDBCIntegrationSuite port allocation by Vladan Vasić · 6 days ago
  56. 5f5fc89 [SPARK-56329][PYTHON] Fix all E721 type comparison violations by Ruifeng Zheng · 6 days ago
  57. cba5705 [SPARK-56341][PYTHON][DOCS] Fix outdated PyArrow minimum version in arrow_pandas.rst by Yicong-Huang · 6 days ago
  58. 50f7179 [SPARK-56313][PYTHON][FOLLOWUP] Remove rddsampler from mypy exception list by Tian Gao · 6 days ago
  59. e130525 [SPARK-55902][PYTHON] Refactor SQL_ARROW_BATCHED_UDF by Yicong-Huang · 6 days ago
  60. 7cb54df [SPARK-56280][SS] normalize NaN and +/-0.0 in streaming dedupe node by Richard Chen · 7 days ago
  61. e2f15f6 [SPARK-56243][SS] Throw detailed error on malformed Kafka record timestamps by Kavpreet Grewal · 7 days ago
  62. ccd4206 [SPARK-56345][PYTHON][TESTS] Use `pd.Series.__name__` in Arrow UDF type-hint test by Takuya Ueshin · 7 days ago
  63. 650b0a6 [SPARK-56338][INFRA] Support Maven mirrors for build by Tian Gao · 7 days ago
  64. e9a348e [SPARK-56327][PYTHON][TESTS] Fix grouped map pandas tests for pandas 3 by Takuya Ueshin · 7 days ago
  65. 922de74 [SPARK-56323][SQL] Propagate ROW FORMAT / STORED AS to v2 catalog in CREATE TABLE LIKE by Liang-Chi Hsieh · 7 days ago
  66. 671c65f [SPARK-56321][SQL] Fix `AnalysisException` when scan reports transform-based ordering via `SupportsReportOrdering` by Peter Toth · 7 days ago
  67. 73a272d [SPARK-56189][PYTHON] Refactor SQL_WINDOW_AGG_ARROW_UDF by Yicong-Huang · 7 days ago
  68. b580b4f [SPARK-56251][SQL] Add default fetchSize for postgres to avoid loading all data in memory by Tengfei Huang · 7 days ago
  69. 3a22620 [SPARK-56313][PYTHON][FOLLOWUP] Use old way to label generic for a class by Tian Gao · 8 days ago
  70. 0d8e031 Revert "[SPARK-56189][PYTHON] Refactor SQL_WINDOW_AGG_ARROW_UDF" by Ruifeng Zheng · 8 days ago
  71. 0af9722 [SPARK-56260][INFRA] Pin third-party GitHub Actions to commit SHA by Yicong Huang · 8 days ago
  72. 25e85b2 [SPARK-56222][PYTHON] Create ArrowStreamGroupSerializer and ArrowStreamCoGroupSerializer by Yicong Huang · 8 days ago
  73. 3433c38 [SPARK-56189][PYTHON] Refactor SQL_WINDOW_AGG_ARROW_UDF by Yicong Huang · 8 days ago
  74. d498b06 [SPARK-56314][SQL][TESTS] Avoid uncessary RDD->DataFrame conversion in `SQLTestData` by Ruifeng Zheng · 8 days ago
  75. 989df5e [SPARK-56306][SQL] Fix collation-aware PIVOT by Akash Nayar · 8 days ago
  76. 00b9451 [SPARK-55981][SQL] Allow Geo Types with SRID's from the pre-built registry by Szehon Ho · 8 days ago
  77. 463a188 [SPARK-56190][SQL] Support nested partition columns for DSV2 PartitionPredicate by Szehon Ho · 8 days ago
  78. e0bc7aa [SPARK-56313][PYTHON][FOLLOWUP] Remove the generic for rddsampler methods by Tian Gao · 8 days ago
  79. e75d6fb [SPARK-56219][PS][FOLLOW-UP] Keep legacy groupby idxmax and idxmin skipna=False behavior for pandas 2 by Takuya Ueshin · 8 days ago
  80. fec2804 [SPARK-56310][PYTHON] Handle pandas 3 dtype in DataFrame.toPandas by Takuya Ueshin · 8 days ago
  81. 0f0c0e2 [SPARK-56296][SQL] Pivot createTableLike to pass full TableInfo including schema, partitioning, constraints, and owner by Liang-Chi Hsieh · 8 days ago
  82. 3e7e126 [SPARK-56001][SQL] Add INSERT INTO ... REPLACE ON/USING syntax by Thang Long VU · 8 days ago
  83. dd492dd [SPARK-56054][SQL] Fix aliased nested fields ignored for schema evolution in MERGE by Johan Lasperas · 8 days ago
  84. 9518075 [SPARK-54938][PYTHON][TEST][FOLLOW-UP] Fix `test_pyarrow_array_type_inference` for pandas >= 3 by Ruifeng Zheng · 8 days ago
  85. d1916e3 [SPARK-56313][PYTHON] Add type hint for rddsampler.py by Tian Gao · 9 days ago
  86. 1dd26f9 [SPARK-56271][PYTHON] Fix type hint and remove unused method for _globals.py by Tian Gao · 9 days ago
  87. 384f543 [SPARK-56311][PYTHON] Add type hints for daemon.py by Tian Gao · 9 days ago
  88. 89cb692 [SPARK-56123][PYTHON][FOLLOWUP] Avoid using concat_batches for old version of pyarrow by Tian Gao · 9 days ago
  89. d580b65 [SPARK-49543][SQL] Add SHOW COLLATIONS command by Liang-Chi Hsieh · 9 days ago
  90. a4719590 [SPARK-56307][BUILD] Upgrade `log4j` to 2.25.4 by Dongjoon Hyun · 9 days ago
  91. e6feb27 [SPARK-51988][SS] Do file checksum verification on read for RocksDB zip file by Gurpreet Nanda · 9 days ago
  92. e7eceb7 [SPARK-56235][CORE] Add reverse index in TaskSetManager to avoid O(N) scans in executorLost by DenineLu · 9 days ago
  93. 224f30f [SPARK-56241][SQL] Derive `outputOrdering` from `KeyedPartitioning` key expressions by Peter Toth · 9 days ago
  94. 28e9e69 [SPARK-56074][INFRA] Improve AGENTS.md with inline build/test commands, PR workflow, and dev notes by Wenchen Fan · 9 days ago
  95. a67f1eca [SPARK-56137][UI][TESTS] Add regression tests for SQL tab DataTables migration by Kent Yao · 9 days ago
  96. 640476d [SPARK-56303][K8S] Add Java-friendly factory methods to `JavaMainAppResource` by Dongjoon Hyun · 9 days ago
  97. a1de465 [SPARK-56300][K8S] Add Java-friendly factory method to `KubernetesDriverSpec` by Dongjoon Hyun · 9 days ago
  98. 292e1d5 [SPARK-56301][PYTHON] Fix typos in `error-conditions.json` by Ruifeng Zheng · 9 days ago
  99. 0419e68 [SPARK-56155][SQL] Collect_list/collect_set sql() function includes "RESPECT NULLS" by Helios He · 10 days ago
  100. 6e8c690 [SPARK-53209][YARN] Add ActiveProcessorCount JVM option to YARN executor and AM by John Zhuge · 10 days ago