1. 3298e68 [SPARK-55373][CONNECT] Improve noHandlerFoundForExtension error message by Alex Khakhlyuk · 5 hours ago master
  2. 1a93d55 [SPARK-55341][SQL] Add storage level flag for cached local relations by pranavdev022 · 6 hours ago
  3. ec90eb0 [SPARK-55356][SQL] Support alias for PIVOT clause by Filip Davidovic · 8 hours ago
  4. fd37671 [SPARK-55365][PYTHON] Generalize the utils for arrow array conversion by Ruifeng Zheng · 11 hours ago
  5. 43d332d [SPARK-55228][SPARK-55230][SQL][CONNECT] Implement Dataset.zipWithIndex in Scala API by Fangchen Li · 12 hours ago
  6. c28d7ad Revert "[SPARK-55175][PYTHON] Extract `to_pandas` transformer from serializers" by Ruifeng Zheng · 12 hours ago
  7. 7d3b322 [SPARK-55175][PYTHON] Extract `to_pandas` transformer from serializers by Yicong-Huang · 13 hours ago
  8. b58fdcd [SPARK-55364][PYTHON] Make SupportsIAdd and SupportsOrdering protocol more reasonable by Tian Gao · 13 hours ago
  9. 52b327f [SPARK-55360][BUILD] Upgrade sbt to `1.12.2` by Kousuke Saruta · 14 hours ago
  10. 7a4bb46 [SPARK-55359][CORE] Promote `TaskResourceRequest` to `Stable` by Dongjoon Hyun · 14 hours ago
  11. efe9b53 Revert "[SPARK-55313][PYTHON][FOLLOW-UP] Only add condabin to PATH for pip tests" by Tian Gao · 15 hours ago
  12. 214bf95 [SPARK-55153][SS][PYTHON][DOC] Add documentation for TwsTester by Dmytro Fedoriaka · 16 hours ago
  13. 612ade4 [SPARK-54805][SS][PYTHON] Implement TwsTester in PySpark by Dmytro Fedoriaka · 16 hours ago
  14. 481f986 [SPARK-55303][PYTHON][TESTS] Extract GoldenFileTestMixin for type coercion golden file tests by Yicong-Huang · 17 hours ago
  15. d9dc3c1 [SPARK-55335][PYTHON][TESTS] Use eventually instead of hard-coded wait for datasource test by Tian Gao · 17 hours ago
  16. 45879b7 [SPARK-55313][PYTHON][FOLLOW-UP] Only add condabin to PATH for pip tests by Tian Gao · 17 hours ago
  17. ebc24e0 [SPARK-55363][PS][TESTS] Make ops tests with "decimal_nan" columns ignore NaN vs. None by Takuya Ueshin · 17 hours ago
  18. 73c3513 [SPARK-55350][PYTHON][CONNECT] Fix row count loss when creating DataFrame from pandas with 0 columns by Yicong-Huang · 18 hours ago
  19. bb98d4d [SPARK-46165][PS] Add support for DataFrame.all axis=None by Devin Petersohn · 19 hours ago
  20. 508130f [SPARK-55086][PYTHON] Add DataSourceReader.pushFilters to Python Data Source API docs by yamayuki-hub · 19 hours ago
  21. 0437a93 [SPARK-55340][SQL] Add helper for name to data type by Leon Windheuser · 19 hours ago
  22. 620b2f6 [SPARK-55362][PYTHON][CONNECT] Don't wait for threadpool shutdown by Tian Gao · 20 hours ago
  23. f9b712a [MINOR] Remove python version requirements for scipy-stubs by Tian Gao · 20 hours ago
  24. fac11c4 [SPARK-55291][CONNECT] Pre-process metadata headers at client interceptor construction time by Yihong He · 28 hours ago
  25. 16f639f [SPARK-55354][CORE][DOCS] Fix `ExecutorAllocationClient` comment to include `Kubernetes` by Dongjoon Hyun · 32 hours ago
  26. 4c19158 Revert "[SPARK-55351][PYTHON][SQL] PythonArrowInput encapsulate resource allocation inside `newWriter`" by Ruifeng Zheng · 32 hours ago
  27. f8526b6 [MINOR][PYTHON][TESTS] Fix `test_time_zone_against_map_in_arrow` for tzdata on ubuntu 24 by Ruifeng Zheng · 34 hours ago
  28. b03c69c [SPARK-55346][INFRA][PYTHON] Upgrade pystack version to 1.6.0 and install it on all major images by Tian Gao · 2 days ago
  29. 663a6c4 [SPARK-55351][PYTHON][SQL] PythonArrowInput encapsulate resource allocation inside `newWriter` by Ruifeng Zheng · 2 days ago
  30. ca1e3e7 [MINOR][PYTHON][TESTS] Skip the doctest of toJSON by Ruifeng Zheng · 2 days ago
  31. 78d9eae [SPARK-54599][PYTHON] Refactor PythonException so it can take errorClass with sqlstate by Tian Gao · 2 days ago
  32. 6aafdc1 [SPARK-55309][BUILD][FOLLOW-UP] Bump container protobuf version by Tian Gao · 2 days ago
  33. 5802a78 [SPARK-55313][PYTHON][FOLLOW-UP] Do not auto-activate conda for CI by Tian Gao · 2 days ago
  34. b91d407 [SPARK-55336][PYTHON] Let createDF use create_batch logic for decoupling by Yicong-Huang · 2 days ago
  35. 455ea6c [SPARK-55040][PYTHON][FOLLOW-UP] Always pass secret for taskcontext by Tian Gao · 2 days ago
  36. 5648458 [SPARK-55342][K8S] Fix `ExecutorPodsLifecycleEventHandler` to `ExecutorPodsLifecycleManager` by Dongjoon Hyun · 2 days ago
  37. a263a5e [SPARK-55280][CONNECT] Add GetStatus proto to support execution status monitoring by Anastasiia Terenteva · 2 days ago
  38. f9cc3dd [SPARK-55106][SS] Add Repartition Integration test for TransformWithState Operators by zifeif2 · 2 days ago
  39. 9788c52 [SPARK-55258][DOCS] Document CLI parameters in declarative pipelines programming guide by Sandy Ryza · 2 days ago
  40. a7bc395 [SPARK-55295][GEO][CONNECT][PYTHON][SQL] Extend the ST_GeomFromWKB function to take an optional SRID value by Uros Bojanic · 2 days ago
  41. 7b673d6 [SPARK-55308][BUILD] Upgrade icu4j to 78.2 by yangjie01 · 2 days ago
  42. cc2ecef [SPARK-55320][SQL][CONNECT] Use raise_error instead of divide by zero in Observation tests by Yihong He · 2 days ago
  43. 3f5fd65 [SPARK-54969][PYTHON] Implement new arrow->pandas conversion by Ruifeng Zheng · 3 days ago
  44. 917baea [SPARK-55328][SQL][PYTHON] Reuse PythonArrowInput.codec in GroupedPythonArrowInput by Ruifeng Zheng · 3 days ago
  45. 2b9c1fb [SPARK-55327][K8S] Reduce Spark docker image sizes by Dongjoon Hyun · 3 days ago
  46. 14bc852 [SPARK-55313][PYTHON] Do not activate conda environment when installing conda by Tian Gao · 3 days ago
  47. 11d3fec [SPARK-55315][PYTHON][TESTS] Allow eventually to take custom exceptions by Tian Gao · 3 days ago
  48. 4235048 [SPARK-55323][PYTHON] Move UDF metadata to EvalConf to simplify worker protocol by Tian Gao · 3 days ago
  49. 7a8bad7 [SPARK-55319][PYTHON][INFRA] Add libjpeg-dev to pypy dockerfile by Tian Gao · 3 days ago
  50. 8ffd150 [SPARK-55293][PS][TESTS][FOLLOW-UP] Avoid more old offset aliases by Takuya Ueshin · 3 days ago
  51. 30ace9f [SPARK-55309][BUILD] Upgrade protobuf to 33.5 by yangjie01 · 3 days ago
  52. 7b242f2 [SPARK-55318] Performance Optimizations for vector_avg/vector_sum by zhidongqu-db · 3 days ago
  53. 60c8c3f [SPARK-55176][PYTHON][FOLLOW-UP] Fix `_input_type` and `_arrow_cast` not defined in `ArrowStreamPandasSerializer` by Yicong-Huang · 3 days ago
  54. 0c041c2 [SPARK-55224][PYTHON] Use Spark DataType as ground truth in Pandas-Arrow serialization by Yicong-Huang · 3 days ago
  55. 22094fe [SPARK-55161][PYTHON] Support profilers on python data source by Tian Gao · 3 days ago
  56. c94ce2c [SPARK-55302][SQL] Fix custom metrics in case of `KeyGroupedPartitioning` by Peter Toth · 3 days ago
  57. 15c6849 [SPARK-55285][SQL][PYTHON][FOLLOW-UP] Code clean up by Ruifeng Zheng · 3 days ago
  58. 7792122 [SPARK-55260][GEO][SQL] Implement Parquet write support for Geo types by Uros Bojanic · 3 days ago
  59. 3026926 [SPARK-55289][SQL] Fix flaky test in-set-operations.sql by disabling broadcast join by Kent Yao · 3 days ago
  60. 3fa07bb8 [SPARK-55305][SQL][TESTS] Use `ParquetFooterReader.readFooter` uniformly in test code to read the footer by yangjie01 · 3 days ago
  61. d3dc602 [SPARK-55307][K8S][INFRA] Update `setup-minikube` to v0.0.21 by Dongjoon Hyun · 4 days ago
  62. 36fbc71 [SPARK-55297][PYTHON][PS] Restore timedelta dtype based on the original dtype by Tian Gao · 4 days ago
  63. f74d7be [SPARK-55093][CORE] Handle TaskRunner construction failures in launchTask by Zequn Lin · 4 days ago
  64. b7c00ea [SPARK-55293][PS][TESTS] Avoid using old offset aliases by Takuya Ueshin · 4 days ago
  65. 0aeb6f9 [SPARK-55286][INFRA] Add test summary to GitHub Actions for better failure visibility by Kent Yao · 4 days ago
  66. 2d41e46 [SPARK-55283][PYTHON][PS][TESTS] Add a new argument ignore_null to assert_eq by Tian Gao · 4 days ago
  67. fbc3471 [SPARK-55285][SQL][PYTHON] Fix the initialization of `PythonArrowInput` by Ruifeng Zheng · 4 days ago
  68. 9b6393d [SPARK-55284][PYTHON][TEST] Move mypy-data related configs to the script by Tian Gao · 4 days ago
  69. a3e3da9 [SPARK-55176][PYTHON] Extract `arrow_to_pandas` converter into ArrowArrayToPandasConversion by Yicong-Huang · 4 days ago
  70. 545d9e7 [SPARK-55287][INFRA] Consolidate steps in `lint` by Ruifeng Zheng · 4 days ago
  71. 1ce1102 [SPARK-55105][SS] Add Integration Test for Join Operator by zifeif2 · 4 days ago
  72. c5cb243 [SPARK-55123][SS] Add SequentialUnionOffset for tracking sequential source processing by ericm-db · 4 days ago
  73. 2d94091 [SPARK-55193][CORE][BUILD] Use `CompressionHandler` as a replacement for the deprecated `GzipHandler` in `JettyUtils` by yangjie01 · 5 days ago
  74. 866a6e8 [SPARK-55290][NETWORK][TESTS] Fix testReloadMissingTrustStore cross-device link error with JDK 21 by Emilie Faracci · 6 days ago
  75. 65a6a55 [SPARK-55246][SS] Add Test for Pyspark TWS and TWSInPandas and Fix StatePartitionAllColumnFamiliesWriter Bug by zifeif2 · 6 days ago
  76. fbb4019 [SPARK-55279][SQL] Add `sketch_funcs` group for DataSketches SQL functions by Kent Yao · 6 days ago
  77. 76f6c78 [SPARK-55239][CONNECT][YARN] Allow to launch SparkConnectServer in YARN cluster mode by Kousuke Saruta · 6 days ago
  78. 04b821c [SPARK-55256][SQL] Support IGNORE NULLS / RESPECT NULLS for array_agg and collect_list by Kent Yao · 6 days ago
  79. 1da0e53 [SPARK-55133][CONNECT] Fix race condition in IsolatedSessionState lifecycle management by Wenchen Fan · 6 days ago
  80. 44db44c [SPARK-49110][SQL] Simplify SubqueryAlias.metadataOutput to always propagate metadata columns by Wenchen Fan · 6 days ago
  81. 4e05372 [SPARK-55277][SQL] Add `protobuf_funcs` group for Protobuf SQL functions by Kent Yao · 6 days ago
  82. 4a58b84 Revert "[SPARK-55277][SQL] Add `protobuf_funcs` group for Protobuf SQL functions" by Kent Yao · 7 days ago
  83. efdb492 [SPARK-55277][SQL] Add `protobuf_funcs` group for Protobuf SQL functions by Kent Yao · 7 days ago
  84. 8e65769 [SPARK-55236][CORE] Address unexpected exception in some CoarseGrainedExecutorBackendSuite test cases by ChuckLin2025 · 7 days ago
  85. 4729b99 [SPARK-55031][SQL] Add vector avg/sum aggregation function expressions by zhidongqu-db · 7 days ago
  86. 75cd9be [SPARK-55237][SQL] Suppress annoying messages when looking up nonexistent DBs by Cheng Pan · 7 days ago
  87. 4344f3fc [SPARK-55273][SQL] Replace `ParquetFileReader.open().getFooter()` with `readFooter()` to avoid unnecessary operations in `ParquetFooterReader` by yangjie01 · 7 days ago
  88. b3cbff3 [SPARK-55272][BUILD] Upgrade SBT to 1.12.1 by yangjie01 · 7 days ago
  89. 86f8b3f [SPARK-55276][BUILD] Upgrade `scala-maven-plugin` to 4.9.9 by Dongjoon Hyun · 7 days ago
  90. ea26cac [SPARK-55266][INFRA] Add pre-commit hooks for format/lint by Tian Gao · 7 days ago
  91. 6625591 [SPARK-55281][PYTHON] Add ipykernel and IPython to mypy optional package list by Tian Gao · 7 days ago
  92. 9254e89 [SPARK-55263][PYTHON][INFRA] Upgrade Python linter from 3.11 to 3.12 in CI by Yicong-Huang · 7 days ago
  93. 23afba2 [SPARK-55282][PYTHON][CONNECT] Avoid using worker_util in the Driver-side by Takuya Ueshin · 7 days ago
  94. a1577253 [SPARK-55011][DOCS] CURSORs docs by Serge Rielau · 7 days ago
  95. 1aadbc4 [SPARK-54887] Add previously removed legacy error class back in by Garland Zhang · 7 days ago
  96. fe9e5c0 [SPARK-55262][GEO][SQL] Block Geo types in all file based data sources except Parquet by Uros Bojanic · 7 days ago
  97. c32aee1 [SPARK-55243][CONNECT] Allow setting binary headers via the -bin suffix in the Scala Connect client by Robert Dillitz · 7 days ago
  98. 5c320f4 [SPARK-55259][GEO][SQL] Implement Parquet schema conversion for Geo types by Uros Bojanic · 7 days ago
  99. 6ffc45a [SPARK-55114][PYTHON][TESTS][FOLLOW-UP] Update the result format to be more friendly to markdown by Ruifeng Zheng · 7 days ago
  100. 7e4a040 [SPARK-55064][SQL][CORE] Support query level indeterminate shuffle retry by Tengfei Huang · 8 days ago