Sign in
apache
/
spark
/
HEAD
3298e68
[SPARK-55373][CONNECT] Improve noHandlerFoundForExtension error message
by Alex Khakhlyuk
· 5 hours ago
master
1a93d55
[SPARK-55341][SQL] Add storage level flag for cached local relations
by pranavdev022
· 6 hours ago
ec90eb0
[SPARK-55356][SQL] Support alias for PIVOT clause
by Filip Davidovic
· 8 hours ago
fd37671
[SPARK-55365][PYTHON] Generalize the utils for arrow array conversion
by Ruifeng Zheng
· 11 hours ago
43d332d
[SPARK-55228][SPARK-55230][SQL][CONNECT] Implement Dataset.zipWithIndex in Scala API
by Fangchen Li
· 12 hours ago
c28d7ad
Revert "[SPARK-55175][PYTHON] Extract `to_pandas` transformer from serializers"
by Ruifeng Zheng
· 12 hours ago
7d3b322
[SPARK-55175][PYTHON] Extract `to_pandas` transformer from serializers
by Yicong-Huang
· 13 hours ago
b58fdcd
[SPARK-55364][PYTHON] Make SupportsIAdd and SupportsOrdering protocol more reasonable
by Tian Gao
· 13 hours ago
52b327f
[SPARK-55360][BUILD] Upgrade sbt to `1.12.2`
by Kousuke Saruta
· 14 hours ago
7a4bb46
[SPARK-55359][CORE] Promote `TaskResourceRequest` to `Stable`
by Dongjoon Hyun
· 14 hours ago
efe9b53
Revert "[SPARK-55313][PYTHON][FOLLOW-UP] Only add condabin to PATH for pip tests"
by Tian Gao
· 15 hours ago
214bf95
[SPARK-55153][SS][PYTHON][DOC] Add documentation for TwsTester
by Dmytro Fedoriaka
· 16 hours ago
612ade4
[SPARK-54805][SS][PYTHON] Implement TwsTester in PySpark
by Dmytro Fedoriaka
· 16 hours ago
481f986
[SPARK-55303][PYTHON][TESTS] Extract GoldenFileTestMixin for type coercion golden file tests
by Yicong-Huang
· 17 hours ago
d9dc3c1
[SPARK-55335][PYTHON][TESTS] Use eventually instead of hard-coded wait for datasource test
by Tian Gao
· 17 hours ago
45879b7
[SPARK-55313][PYTHON][FOLLOW-UP] Only add condabin to PATH for pip tests
by Tian Gao
· 17 hours ago
ebc24e0
[SPARK-55363][PS][TESTS] Make ops tests with "decimal_nan" columns ignore NaN vs. None
by Takuya Ueshin
· 17 hours ago
73c3513
[SPARK-55350][PYTHON][CONNECT] Fix row count loss when creating DataFrame from pandas with 0 columns
by Yicong-Huang
· 18 hours ago
bb98d4d
[SPARK-46165][PS] Add support for DataFrame.all axis=None
by Devin Petersohn
· 19 hours ago
508130f
[SPARK-55086][PYTHON] Add DataSourceReader.pushFilters to Python Data Source API docs
by yamayuki-hub
· 19 hours ago
0437a93
[SPARK-55340][SQL] Add helper for name to data type
by Leon Windheuser
· 19 hours ago
620b2f6
[SPARK-55362][PYTHON][CONNECT] Don't wait for threadpool shutdown
by Tian Gao
· 20 hours ago
f9b712a
[MINOR] Remove python version requirements for scipy-stubs
by Tian Gao
· 20 hours ago
fac11c4
[SPARK-55291][CONNECT] Pre-process metadata headers at client interceptor construction time
by Yihong He
· 28 hours ago
16f639f
[SPARK-55354][CORE][DOCS] Fix `ExecutorAllocationClient` comment to include `Kubernetes`
by Dongjoon Hyun
· 32 hours ago
4c19158
Revert "[SPARK-55351][PYTHON][SQL] PythonArrowInput encapsulate resource allocation inside `newWriter`"
by Ruifeng Zheng
· 32 hours ago
f8526b6
[MINOR][PYTHON][TESTS] Fix `test_time_zone_against_map_in_arrow` for tzdata on ubuntu 24
by Ruifeng Zheng
· 34 hours ago
b03c69c
[SPARK-55346][INFRA][PYTHON] Upgrade pystack version to 1.6.0 and install it on all major images
by Tian Gao
· 2 days ago
663a6c4
[SPARK-55351][PYTHON][SQL] PythonArrowInput encapsulate resource allocation inside `newWriter`
by Ruifeng Zheng
· 2 days ago
ca1e3e7
[MINOR][PYTHON][TESTS] Skip the doctest of toJSON
by Ruifeng Zheng
· 2 days ago
78d9eae
[SPARK-54599][PYTHON] Refactor PythonException so it can take errorClass with sqlstate
by Tian Gao
· 2 days ago
6aafdc1
[SPARK-55309][BUILD][FOLLOW-UP] Bump container protobuf version
by Tian Gao
· 2 days ago
5802a78
[SPARK-55313][PYTHON][FOLLOW-UP] Do not auto-activate conda for CI
by Tian Gao
· 2 days ago
b91d407
[SPARK-55336][PYTHON] Let createDF use create_batch logic for decoupling
by Yicong-Huang
· 2 days ago
455ea6c
[SPARK-55040][PYTHON][FOLLOW-UP] Always pass secret for taskcontext
by Tian Gao
· 2 days ago
5648458
[SPARK-55342][K8S] Fix `ExecutorPodsLifecycleEventHandler` to `ExecutorPodsLifecycleManager`
by Dongjoon Hyun
· 2 days ago
a263a5e
[SPARK-55280][CONNECT] Add GetStatus proto to support execution status monitoring
by Anastasiia Terenteva
· 2 days ago
f9cc3dd
[SPARK-55106][SS] Add Repartition Integration test for TransformWithState Operators
by zifeif2
· 2 days ago
9788c52
[SPARK-55258][DOCS] Document CLI parameters in declarative pipelines programming guide
by Sandy Ryza
· 2 days ago
a7bc395
[SPARK-55295][GEO][CONNECT][PYTHON][SQL] Extend the ST_GeomFromWKB function to take an optional SRID value
by Uros Bojanic
· 2 days ago
7b673d6
[SPARK-55308][BUILD] Upgrade icu4j to 78.2
by yangjie01
· 2 days ago
cc2ecef
[SPARK-55320][SQL][CONNECT] Use raise_error instead of divide by zero in Observation tests
by Yihong He
· 2 days ago
3f5fd65
[SPARK-54969][PYTHON] Implement new arrow->pandas conversion
by Ruifeng Zheng
· 3 days ago
917baea
[SPARK-55328][SQL][PYTHON] Reuse PythonArrowInput.codec in GroupedPythonArrowInput
by Ruifeng Zheng
· 3 days ago
2b9c1fb
[SPARK-55327][K8S] Reduce Spark docker image sizes
by Dongjoon Hyun
· 3 days ago
14bc852
[SPARK-55313][PYTHON] Do not activate conda environment when installing conda
by Tian Gao
· 3 days ago
11d3fec
[SPARK-55315][PYTHON][TESTS] Allow eventually to take custom exceptions
by Tian Gao
· 3 days ago
4235048
[SPARK-55323][PYTHON] Move UDF metadata to EvalConf to simplify worker protocol
by Tian Gao
· 3 days ago
7a8bad7
[SPARK-55319][PYTHON][INFRA] Add libjpeg-dev to pypy dockerfile
by Tian Gao
· 3 days ago
8ffd150
[SPARK-55293][PS][TESTS][FOLLOW-UP] Avoid more old offset aliases
by Takuya Ueshin
· 3 days ago
30ace9f
[SPARK-55309][BUILD] Upgrade protobuf to 33.5
by yangjie01
· 3 days ago
7b242f2
[SPARK-55318] Performance Optimizations for vector_avg/vector_sum
by zhidongqu-db
· 3 days ago
60c8c3f
[SPARK-55176][PYTHON][FOLLOW-UP] Fix `_input_type` and `_arrow_cast` not defined in `ArrowStreamPandasSerializer`
by Yicong-Huang
· 3 days ago
0c041c2
[SPARK-55224][PYTHON] Use Spark DataType as ground truth in Pandas-Arrow serialization
by Yicong-Huang
· 3 days ago
22094fe
[SPARK-55161][PYTHON] Support profilers on python data source
by Tian Gao
· 3 days ago
c94ce2c
[SPARK-55302][SQL] Fix custom metrics in case of `KeyGroupedPartitioning`
by Peter Toth
· 3 days ago
15c6849
[SPARK-55285][SQL][PYTHON][FOLLOW-UP] Code clean up
by Ruifeng Zheng
· 3 days ago
7792122
[SPARK-55260][GEO][SQL] Implement Parquet write support for Geo types
by Uros Bojanic
· 3 days ago
3026926
[SPARK-55289][SQL] Fix flaky test in-set-operations.sql by disabling broadcast join
by Kent Yao
· 3 days ago
3fa07bb8
[SPARK-55305][SQL][TESTS] Use `ParquetFooterReader.readFooter` uniformly in test code to read the footer
by yangjie01
· 3 days ago
d3dc602
[SPARK-55307][K8S][INFRA] Update `setup-minikube` to v0.0.21
by Dongjoon Hyun
· 4 days ago
36fbc71
[SPARK-55297][PYTHON][PS] Restore timedelta dtype based on the original dtype
by Tian Gao
· 4 days ago
f74d7be
[SPARK-55093][CORE] Handle TaskRunner construction failures in launchTask
by Zequn Lin
· 4 days ago
b7c00ea
[SPARK-55293][PS][TESTS] Avoid using old offset aliases
by Takuya Ueshin
· 4 days ago
0aeb6f9
[SPARK-55286][INFRA] Add test summary to GitHub Actions for better failure visibility
by Kent Yao
· 4 days ago
2d41e46
[SPARK-55283][PYTHON][PS][TESTS] Add a new argument ignore_null to assert_eq
by Tian Gao
· 4 days ago
fbc3471
[SPARK-55285][SQL][PYTHON] Fix the initialization of `PythonArrowInput`
by Ruifeng Zheng
· 4 days ago
9b6393d
[SPARK-55284][PYTHON][TEST] Move mypy-data related configs to the script
by Tian Gao
· 4 days ago
a3e3da9
[SPARK-55176][PYTHON] Extract `arrow_to_pandas` converter into ArrowArrayToPandasConversion
by Yicong-Huang
· 4 days ago
545d9e7
[SPARK-55287][INFRA] Consolidate steps in `lint`
by Ruifeng Zheng
· 4 days ago
1ce1102
[SPARK-55105][SS] Add Integration Test for Join Operator
by zifeif2
· 4 days ago
c5cb243
[SPARK-55123][SS] Add SequentialUnionOffset for tracking sequential source processing
by ericm-db
· 4 days ago
2d94091
[SPARK-55193][CORE][BUILD] Use `CompressionHandler` as a replacement for the deprecated `GzipHandler` in `JettyUtils`
by yangjie01
· 5 days ago
866a6e8
[SPARK-55290][NETWORK][TESTS] Fix testReloadMissingTrustStore cross-device link error with JDK 21
by Emilie Faracci
· 6 days ago
65a6a55
[SPARK-55246][SS] Add Test for Pyspark TWS and TWSInPandas and Fix StatePartitionAllColumnFamiliesWriter Bug
by zifeif2
· 6 days ago
fbb4019
[SPARK-55279][SQL] Add `sketch_funcs` group for DataSketches SQL functions
by Kent Yao
· 6 days ago
76f6c78
[SPARK-55239][CONNECT][YARN] Allow to launch SparkConnectServer in YARN cluster mode
by Kousuke Saruta
· 6 days ago
04b821c
[SPARK-55256][SQL] Support IGNORE NULLS / RESPECT NULLS for array_agg and collect_list
by Kent Yao
· 6 days ago
1da0e53
[SPARK-55133][CONNECT] Fix race condition in IsolatedSessionState lifecycle management
by Wenchen Fan
· 6 days ago
44db44c
[SPARK-49110][SQL] Simplify SubqueryAlias.metadataOutput to always propagate metadata columns
by Wenchen Fan
· 6 days ago
4e05372
[SPARK-55277][SQL] Add `protobuf_funcs` group for Protobuf SQL functions
by Kent Yao
· 6 days ago
4a58b84
Revert "[SPARK-55277][SQL] Add `protobuf_funcs` group for Protobuf SQL functions"
by Kent Yao
· 7 days ago
efdb492
[SPARK-55277][SQL] Add `protobuf_funcs` group for Protobuf SQL functions
by Kent Yao
· 7 days ago
8e65769
[SPARK-55236][CORE] Address unexpected exception in some CoarseGrainedExecutorBackendSuite test cases
by ChuckLin2025
· 7 days ago
4729b99
[SPARK-55031][SQL] Add vector avg/sum aggregation function expressions
by zhidongqu-db
· 7 days ago
75cd9be
[SPARK-55237][SQL] Suppress annoying messages when looking up nonexistent DBs
by Cheng Pan
· 7 days ago
4344f3fc
[SPARK-55273][SQL] Replace `ParquetFileReader.open().getFooter()` with `readFooter()` to avoid unnecessary operations in `ParquetFooterReader`
by yangjie01
· 7 days ago
b3cbff3
[SPARK-55272][BUILD] Upgrade SBT to 1.12.1
by yangjie01
· 7 days ago
86f8b3f
[SPARK-55276][BUILD] Upgrade `scala-maven-plugin` to 4.9.9
by Dongjoon Hyun
· 7 days ago
ea26cac
[SPARK-55266][INFRA] Add pre-commit hooks for format/lint
by Tian Gao
· 7 days ago
6625591
[SPARK-55281][PYTHON] Add ipykernel and IPython to mypy optional package list
by Tian Gao
· 7 days ago
9254e89
[SPARK-55263][PYTHON][INFRA] Upgrade Python linter from 3.11 to 3.12 in CI
by Yicong-Huang
· 7 days ago
23afba2
[SPARK-55282][PYTHON][CONNECT] Avoid using worker_util in the Driver-side
by Takuya Ueshin
· 7 days ago
a1577253
[SPARK-55011][DOCS] CURSORs docs
by Serge Rielau
· 7 days ago
1aadbc4
[SPARK-54887] Add previously removed legacy error class back in
by Garland Zhang
· 7 days ago
fe9e5c0
[SPARK-55262][GEO][SQL] Block Geo types in all file based data sources except Parquet
by Uros Bojanic
· 7 days ago
c32aee1
[SPARK-55243][CONNECT] Allow setting binary headers via the -bin suffix in the Scala Connect client
by Robert Dillitz
· 7 days ago
5c320f4
[SPARK-55259][GEO][SQL] Implement Parquet schema conversion for Geo types
by Uros Bojanic
· 7 days ago
6ffc45a
[SPARK-55114][PYTHON][TESTS][FOLLOW-UP] Update the result format to be more friendly to markdown
by Ruifeng Zheng
· 7 days ago
7e4a040
[SPARK-55064][SQL][CORE] Support query level indeterminate shuffle retry
by Tengfei Huang
· 8 days ago
Next »