apache / spark / HEAD
7d8df99
[SPARK-56019][SQL] Close JDBC connection on task kill to unblock native socket reads
by Ivan Sadikov
· 6 hours ago
master
6423cb0
[SPARK-56407][BUILD][TESTS] Remove pre-built class files and JARs used in artifact transfer tests
by Kousuke Saruta
· 8 hours ago
3851cb5
[SPARK-56302][CORE] Free task result memory eagerly during serialization on executor
by Tengfei Huang
· 12 hours ago
4018cc7
[SPARK-56338][INFRA][FOLLOWUP] Support MAVEN_MIRROR_URL in SBT launcher bootstrap
by Wenchen Fan
· 17 hours ago
cf26c42
[SPARK-56397][BUILD] Upgrade `ICU4J` to 78.3
by Dongjoon Hyun
· 18 hours ago
05f6f69b
[SPARK-56332][SQL][TESTS] Use `sql.SparkSession` in `trait SQLTestData`
by Ruifeng Zheng
· 18 hours ago
528386c
[SPARK-56377][PYTHON] Add type hint for shuffle.py
by Tian Gao
· 18 hours ago
21ada68
[SPARK-56253][PYTHON][CONNECT] Make spark.read.json accept DataFrame input
by Yicong-Huang
· 18 hours ago
a7fb5bbe6
[SPARK-56330][CORE] Add TaskInterruptListener to TaskContext for interrupt notifications
by Ivan Sadikov
· 19 hours ago
efa725e
[SPARK-56343][SQL][TESTS] Add MERGE INTO test for type mismatch without schema evolution trigger condition
by Szehon Ho
· 20 hours ago
33c18ee
[SPARK-56393][K8S][DOCS] Drop K8s v1.33 Support
by Dongjoon Hyun
· 25 hours ago
0eac893
[SPARK-56392][SQL] Make Sample.seed Optional to distinguish user-specified vs random seeds
by Rahul Sharma
· 26 hours ago
a3930d3
[SPARK-56221][SQL][PYTHON][FOLLOWUP] Rename TablePartition and remove SHOW CACHED TABLES / listCachedTables
by Wenchen Fan
· 26 hours ago
361b9d6
[SPARK-55278][FOLLOWUP] Add shading rule for udf/worker protobuf as for core and connect
by Haiyang Sun
· 29 hours ago
426accf
[SPARK-55109][SQL] Enhance RaiseError to generate valid SQL
by Felipe Fujiy Pessoto
· 30 hours ago
a66bf7d
[SPARK-56277][SQL] Add missing toString() to NamespaceChange and TableChange property classes
by anshul_baliga7
· 30 hours ago
e96b48b
[SPARK-56328][SQL][FOLLOWUP] Handle SubqueryAlias-wrapped inline tables in INSERT VALUES collation fix
by ilicmarkodb
· 31 hours ago
590b0d5
[SPARK-56346][SQL] Use PartitionPredicate in DSV2 Metadata Only Delete
by Szehon Ho
· 2 days ago
163abe5
[SPARK-56092][SS][CONNECT] Fix NPE in StreamingQueryException.toString() when cause is null
by Xiaoxuan Li
· 2 days ago
aa6154b
[SPARK-55306] Add ability to run Kafka tests in Python
by Jerry Peng
· 2 days ago
c869549
[SPARK-56372][INFRA] Add cmake to CI Docker images for R fs package compilation
by Ruifeng Zheng
· 2 days ago
c7c78e0
[MINOR][CORE][TESTS] Fix flaky DAGSchedulerSuite test for push-based shuffle
by DB Tsai
· 2 days ago
e8b5cb8
[SPARK-56216][SS] Integrate checkpoint V2 with auto-repair snapshot
by ericm-db
· 2 days ago
b490770
[SPARK-56328][SQL] Fix inline table collation handling for INSERT VALUES and DEFAULT COLLATION
by ilicmarkodb
· 2 days ago
7737a94
[SPARK-55278] Introduce module and core abstraction for language-agnostic UDF worker
by Haiyang Sun
· 2 days ago
98cdaee
[SPARK-55450][SS][PYTHON][DOCS] Document admission control in PySpark streaming data sources
by Jitesh Soni
· 3 days ago
9dbe381
[SPARK-56340][PYTHON] Move input_type schema to eval conf
by Tian Gao
· 3 days ago
491add8
[SPARK-56364][BUILD][TESTS] Generate Scala-based test JARs dynamically instead of storing pre-built binaries
by Kousuke Saruta
· 3 days ago
af9c8b3
[SPARK-56273][SQL] Simplify extracting fields from DataSourceV2ScanRelation
by Yan Yan
· 3 days ago
975b299
[SPARK-56342][PYTHON] Tighten type hints for refactored eval type functions in worker.py
by Yicong-Huang
· 3 days ago
5bb6271
[SPARK-56207][SQL] Replace legacy error codes with named errors in DSv2 connector API
by DB Tsai
· 3 days ago
f184c25
[SPARK-56361][SS] Provide better error with logging on NPE in stream-stream join
by Jungtaek Lim
· 3 days ago
ae7f6e3
[SPARK-56248][PYTHON][SS] Optimize python stateful processor serialization to skip unnecessary list/dict/row construction
by Jia Teoh
· 3 days ago
e42a561
[SPARK-56359][PYTHON] Remove unused ArrowStreamArrowUDFSerializer
by Yicong-Huang
· 3 days ago
5beaa5b
[SPARK-56353][BUILD][TESTS] Generate Java-based test JARs dynamically instead of storing pre-built binaries
by Kousuke Saruta
· 3 days ago
732f30b
[SPARK-56357][BUILD] Upgrade sbt to 1.12.8
by yangjie01
· 3 days ago
08e5436
[SPARK-55657][BUILD] Bump Hadoop 3.5.0
by Cheng Pan
· 3 days ago
3fb32fd
[SPARK-56363][INFRA] Add remotes in Spark release image
by Hyukjin Kwon
· 3 days ago
7167dad
[SPARK-55330][INFRA] Add cmake to release Docker base image
by Hyukjin Kwon
· 4 days ago
734a86c
[SPARK-56360][INFRA] Wait for base and RM Docker logs in release workflow; avoid hang on early failure
by Hyukjin Kwon
· 4 days ago
003855c
[SPARK-55115][INFRA][FOLLOW-UP] Fix release workflow log tailing for base Docker image build
by Hyukjin Kwon
· 4 days ago
b168254
[SPARK-56358][BUILD] Add gson version override to SBT build to align with Maven
by yangjie01
· 4 days ago
6767941
[SPARK-55667][PYTHON][CONNECT][FOLLOW-UP] Remove arguments in check_dependencies
by Tian Gao
· 4 days ago
08b1390
[SPARK-56349][PYTHON] Remove unused ArrowStreamAggArrowUDFSerializer
by Yicong-Huang
· 4 days ago
561b7b9
[SPARK-56348][PYTHON] Remove unused ArrowBatchUDFSerializer
by Yicong-Huang
· 4 days ago
c6a198e
[MINOR][PYTHON] Fix PySparkException failing when messageParameters is omitted
by Fangchen Li
· 4 days ago
c1dd15c2
[SPARK-49793][PYTHON][TESTS][FOLLOW-UP] Fix test_caching in connect mode
by Tian Gao
· 4 days ago
49e06b3
[SPARK-56317][SQL] GetJsonObjectEvaluator should reuse output buffer
by Xianming Lei
· 4 days ago
ed01691
[MINOR][DOCS] Fix a typo of "KLL" initialism
by donaldchai
· 4 days ago
263c976
[SPARK-56209][BUILD][FOLLOWUP] Exclude Netty transitive dependencies from Vert.x in Kubernetes modules
by yangjie01
· 4 days ago
1ea48e4
[SPARK-56007][CONNECT] Fix ArrowDeserializer to use positional binding for rows
by Herman van Hövell
· 5 days ago
a20bfc7
[SPARK-56356][BUILD] Fix an issue in release build caused by error on fetching artifacts
by Kousuke Saruta
· 5 days ago
842eb7b
[SPARK-54938][PYTHON][TEST][FOLLOW-UP] Fix inferred time unit for pandas >= 3
by Ruifeng Zheng
· 5 days ago
d9c8eda
[SPARK-56322][CONNECT][PYTHON] Fix TypeError when self-joining observed DataFrames
by Marcin Wojtyczka
· 6 days ago
504060f
[SPARK-56347][TEST] Fix TOCTOU race in DockerJDBCIntegrationSuite port allocation
by Vladan Vasić
· 6 days ago
5f5fc89
[SPARK-56329][PYTHON] Fix all E721 type comparison violations
by Ruifeng Zheng
· 6 days ago
cba5705
[SPARK-56341][PYTHON][DOCS] Fix outdated PyArrow minimum version in arrow_pandas.rst
by Yicong-Huang
· 6 days ago
50f7179
[SPARK-56313][PYTHON][FOLLOWUP] Remove rddsampler from mypy exception list
by Tian Gao
· 6 days ago
e130525
[SPARK-55902][PYTHON] Refactor SQL_ARROW_BATCHED_UDF
by Yicong-Huang
· 6 days ago
7cb54df
[SPARK-56280][SS] normalize NaN and +/-0.0 in streaming dedupe node
by Richard Chen
· 7 days ago
e2f15f6
[SPARK-56243][SS] Throw detailed error on malformed Kafka record timestamps
by Kavpreet Grewal
· 7 days ago
ccd4206
[SPARK-56345][PYTHON][TESTS] Use `pd.Series.__name__` in Arrow UDF type-hint test
by Takuya Ueshin
· 7 days ago
650b0a6
[SPARK-56338][INFRA] Support Maven mirrors for build
by Tian Gao
· 7 days ago
e9a348e
[SPARK-56327][PYTHON][TESTS] Fix grouped map pandas tests for pandas 3
by Takuya Ueshin
· 7 days ago
922de74
[SPARK-56323][SQL] Propagate ROW FORMAT / STORED AS to v2 catalog in CREATE TABLE LIKE
by Liang-Chi Hsieh
· 7 days ago
671c65f
[SPARK-56321][SQL] Fix `AnalysisException` when scan reports transform-based ordering via `SupportsReportOrdering`
by Peter Toth
· 7 days ago
73a272d
[SPARK-56189][PYTHON] Refactor SQL_WINDOW_AGG_ARROW_UDF
by Yicong-Huang
· 7 days ago
b580b4f
[SPARK-56251][SQL] Add default fetchSize for postgres to avoid loading all data in memory
by Tengfei Huang
· 7 days ago
3a22620
[SPARK-56313][PYTHON][FOLLOWUP] Use old way to label generic for a class
by Tian Gao
· 8 days ago
0d8e031
Revert "[SPARK-56189][PYTHON] Refactor SQL_WINDOW_AGG_ARROW_UDF"
by Ruifeng Zheng
· 8 days ago
0af9722
[SPARK-56260][INFRA] Pin third-party GitHub Actions to commit SHA
by Yicong Huang
· 8 days ago
25e85b2
[SPARK-56222][PYTHON] Create ArrowStreamGroupSerializer and ArrowStreamCoGroupSerializer
by Yicong Huang
· 8 days ago
3433c38
[SPARK-56189][PYTHON] Refactor SQL_WINDOW_AGG_ARROW_UDF
by Yicong Huang
· 8 days ago
d498b06
[SPARK-56314][SQL][TESTS] Avoid uncessary RDD->DataFrame conversion in `SQLTestData`
by Ruifeng Zheng
· 8 days ago
989df5e
[SPARK-56306][SQL] Fix collation-aware PIVOT
by Akash Nayar
· 8 days ago
00b9451
[SPARK-55981][SQL] Allow Geo Types with SRID's from the pre-built registry
by Szehon Ho
· 8 days ago
463a188
[SPARK-56190][SQL] Support nested partition columns for DSV2 PartitionPredicate
by Szehon Ho
· 8 days ago
e0bc7aa
[SPARK-56313][PYTHON][FOLLOWUP] Remove the generic for rddsampler methods
by Tian Gao
· 8 days ago
e75d6fb
[SPARK-56219][PS][FOLLOW-UP] Keep legacy groupby idxmax and idxmin skipna=False behavior for pandas 2
by Takuya Ueshin
· 8 days ago
fec2804
[SPARK-56310][PYTHON] Handle pandas 3 dtype in DataFrame.toPandas
by Takuya Ueshin
· 8 days ago
0f0c0e2
[SPARK-56296][SQL] Pivot createTableLike to pass full TableInfo including schema, partitioning, constraints, and owner
by Liang-Chi Hsieh
· 8 days ago
3e7e126
[SPARK-56001][SQL] Add INSERT INTO ... REPLACE ON/USING syntax
by Thang Long VU
· 8 days ago
dd492dd
[SPARK-56054][SQL] Fix aliased nested fields ignored for schema evolution in MERGE
by Johan Lasperas
· 8 days ago
9518075
[SPARK-54938][PYTHON][TEST][FOLLOW-UP] Fix `test_pyarrow_array_type_inference` for pandas >= 3
by Ruifeng Zheng
· 8 days ago
d1916e3
[SPARK-56313][PYTHON] Add type hint for rddsampler.py
by Tian Gao
· 9 days ago
1dd26f9
[SPARK-56271][PYTHON] Fix type hint and remove unused method for _globals.py
by Tian Gao
· 9 days ago
384f543
[SPARK-56311][PYTHON] Add type hints for daemon.py
by Tian Gao
· 9 days ago
89cb692
[SPARK-56123][PYTHON][FOLLOWUP] Avoid using concat_batches for old version of pyarrow
by Tian Gao
· 9 days ago
d580b65
[SPARK-49543][SQL] Add SHOW COLLATIONS command
by Liang-Chi Hsieh
· 9 days ago
a4719590
[SPARK-56307][BUILD] Upgrade `log4j` to 2.25.4
by Dongjoon Hyun
· 9 days ago
e6feb27
[SPARK-51988][SS] Do file checksum verification on read for RocksDB zip file
by Gurpreet Nanda
· 9 days ago
e7eceb7
[SPARK-56235][CORE] Add reverse index in TaskSetManager to avoid O(N) scans in executorLost
by DenineLu
· 9 days ago
224f30f
[SPARK-56241][SQL] Derive `outputOrdering` from `KeyedPartitioning` key expressions
by Peter Toth
· 9 days ago
28e9e69
[SPARK-56074][INFRA] Improve AGENTS.md with inline build/test commands, PR workflow, and dev notes
by Wenchen Fan
· 9 days ago
a67f1eca
[SPARK-56137][UI][TESTS] Add regression tests for SQL tab DataTables migration
by Kent Yao
· 9 days ago
640476d
[SPARK-56303][K8S] Add Java-friendly factory methods to `JavaMainAppResource`
by Dongjoon Hyun
· 9 days ago
a1de465
[SPARK-56300][K8S] Add Java-friendly factory method to `KubernetesDriverSpec`
by Dongjoon Hyun
· 9 days ago
292e1d5
[SPARK-56301][PYTHON] Fix typos in `error-conditions.json`
by Ruifeng Zheng
· 9 days ago
0419e68
[SPARK-56155][SQL] Collect_list/collect_set sql() function includes "RESPECT NULLS"
by Helios He
· 10 days ago
6e8c690
[SPARK-53209][YARN] Add ActiveProcessorCount JVM option to YARN executor and AM
by John Zhuge
· 10 days ago