apache / spark / HEAD
83a0267
[SPARK-56390][SQL] Avoid throwing away resolved child when constructing V1 command variants of ShowColumns and DescribeRelation
by DhruvArya
· 6 hours ago
master
f9bb48c
[SPARK-54774][CORE] Submit failed should keep same exit code with app exit code in K8s mode
by Angerszhuuuu
· 6 hours ago
f00ea04
[SPARK-56381][PYTHON][TEST] Add ASV microbenchmark for SQL_COGROUPED_MAP_ARROW_UDF
by Yicong-Huang
· 6 hours ago
d97ac82
[SPARK-56428][PYTHON] Move eval type specific data to eval_conf for UDTF
by Tian Gao
· 6 hours ago
6243211
[SPARK-56403] Refactor kafka test so it's skipped when dependency is not available
by Tian Gao
· 6 hours ago
22522a6
[SPARK-56385][SQL] Track pushed filter expressions on DataSourceV2ScanRelation
by Yan Yan
· 6 hours ago
975f7b8
[SPARK-56427][PYTHON][INFRA] Remove six and py from requirements.txt
by Tian Gao
· 9 hours ago
2aff1f6
[SPARK-55689] Skip unsupported column changes during schema evolution
by Johan Lasperas
· 16 hours ago
7d8df99
[SPARK-56019][SQL] Close JDBC connection on task kill to unblock native socket reads
by Ivan Sadikov
· 20 hours ago
6423cb0
[SPARK-56407][BUILD][TESTS] Remove pre-built class files and JARs used in artifact transfer tests
by Kousuke Saruta
· 22 hours ago
3851cb5
[SPARK-56302][CORE] Free task result memory eagerly during serialization on executor
by Tengfei Huang
· 25 hours ago
4018cc7
[SPARK-56338][INFRA][FOLLOWUP] Support MAVEN_MIRROR_URL in SBT launcher bootstrap
by Wenchen Fan
· 31 hours ago
cf26c42
[SPARK-56397][BUILD] Upgrade `ICU4J` to 78.3
by Dongjoon Hyun
· 32 hours ago
05f6f69b
[SPARK-56332][SQL][TESTS] Use `sql.SparkSession` in `trait SQLTestData`
by Ruifeng Zheng
· 32 hours ago
528386c
[SPARK-56377][PYTHON] Add type hint for shuffle.py
by Tian Gao
· 32 hours ago
21ada68
[SPARK-56253][PYTHON][CONNECT] Make spark.read.json accept DataFrame input
by Yicong-Huang
· 32 hours ago
a7fb5bbe6
[SPARK-56330][CORE] Add TaskInterruptListener to TaskContext for interrupt notifications
by Ivan Sadikov
· 33 hours ago
efa725e
[SPARK-56343][SQL][TESTS] Add MERGE INTO test for type mismatch without schema evolution trigger condition
by Szehon Ho
· 34 hours ago
33c18ee
[SPARK-56393][K8S][DOCS] Drop K8s v1.33 Support
by Dongjoon Hyun
· 2 days ago
0eac893
[SPARK-56392][SQL] Make Sample.seed Optional to distinguish user-specified vs random seeds
by Rahul Sharma
· 2 days ago
a3930d3
[SPARK-56221][SQL][PYTHON][FOLLOWUP] Rename TablePartition and remove SHOW CACHED TABLES / listCachedTables
by Wenchen Fan
· 2 days ago
361b9d6
[SPARK-55278][FOLLOWUP] Add shading rule for udf/worker protobuf as for core and connect
by Haiyang Sun
· 2 days ago
426accf
[SPARK-55109][SQL] Enhance RaiseError to generate valid SQL
by Felipe Fujiy Pessoto
· 2 days ago
a66bf7d
[SPARK-56277][SQL] Add missing toString() to NamespaceChange and TableChange property classes
by anshul_baliga7
· 2 days ago
e96b48b
[SPARK-56328][SQL][FOLLOWUP] Handle SubqueryAlias-wrapped inline tables in INSERT VALUES collation fix
by ilicmarkodb
· 2 days ago
590b0d5
[SPARK-56346][SQL] Use PartitionPredicate in DSV2 Metadata Only Delete
by Szehon Ho
· 2 days ago
163abe5
[SPARK-56092][SS][CONNECT] Fix NPE in StreamingQueryException.toString() when cause is null
by Xiaoxuan Li
· 2 days ago
aa6154b
[SPARK-55306] Add ability to run Kafka tests in Python
by Jerry Peng
· 2 days ago
c869549
[SPARK-56372][INFRA] Add cmake to CI Docker images for R fs package compilation
by Ruifeng Zheng
· 2 days ago
c7c78e0
[MINOR][CORE][TESTS] Fix flaky DAGSchedulerSuite test for push-based shuffle
by DB Tsai
· 2 days ago
e8b5cb8
[SPARK-56216][SS] Integrate checkpoint V2 with auto-repair snapshot
by ericm-db
· 3 days ago
b490770
[SPARK-56328][SQL] Fix inline table collation handling for INSERT VALUES and DEFAULT COLLATION
by ilicmarkodb
· 3 days ago
7737a94
[SPARK-55278] Introduce module and core abstraction for language-agnostic UDF worker
by Haiyang Sun
· 3 days ago
98cdaee
[SPARK-55450][SS][PYTHON][DOCS] Document admission control in PySpark streaming data sources
by Jitesh Soni
· 3 days ago
9dbe381
[SPARK-56340][PYTHON] Move input_type schema to eval conf
by Tian Gao
· 3 days ago
491add8
[SPARK-56364][BUILD][TESTS] Generate Scala-based test JARs dynamically instead of storing pre-built binaries
by Kousuke Saruta
· 3 days ago
af9c8b3
[SPARK-56273][SQL] Simplify extracting fields from DataSourceV2ScanRelation
by Yan Yan
· 3 days ago
975b299
[SPARK-56342][PYTHON] Tighten type hints for refactored eval type functions in worker.py
by Yicong-Huang
· 3 days ago
5bb6271
[SPARK-56207][SQL] Replace legacy error codes with named errors in DSv2 connector API
by DB Tsai
· 3 days ago
f184c25
[SPARK-56361][SS] Provide better error with logging on NPE in stream-stream join
by Jungtaek Lim
· 3 days ago
ae7f6e3
[SPARK-56248][PYTHON][SS] Optimize python stateful processor serialization to skip unnecessary list/dict/row construction
by Jia Teoh
· 4 days ago
e42a561
[SPARK-56359][PYTHON] Remove unused ArrowStreamArrowUDFSerializer
by Yicong-Huang
· 4 days ago
5beaa5b
[SPARK-56353][BUILD][TESTS] Generate Java-based test JARs dynamically instead of storing pre-built binaries
by Kousuke Saruta
· 4 days ago
732f30b
[SPARK-56357][BUILD] Upgrade sbt to 1.12.8
by yangjie01
· 4 days ago
08e5436
[SPARK-55657][BUILD] Bump Hadoop 3.5.0
by Cheng Pan
· 4 days ago
3fb32fd
[SPARK-56363][INFRA] Add remotes in Spark release image
by Hyukjin Kwon
· 4 days ago
7167dad
[SPARK-55330][INFRA] Add cmake to release Docker base image
by Hyukjin Kwon
· 4 days ago
734a86c
[SPARK-56360][INFRA] Wait for base and RM Docker logs in release workflow; avoid hang on early failure
by Hyukjin Kwon
· 4 days ago
003855c
[SPARK-55115][INFRA][FOLLOW-UP] Fix release workflow log tailing for base Docker image build
by Hyukjin Kwon
· 4 days ago
b168254
[SPARK-56358][BUILD] Add gson version override to SBT build to align with Maven
by yangjie01
· 4 days ago
6767941
[SPARK-55667][PYTHON][CONNECT][FOLLOW-UP] Remove arguments in check_dependencies
by Tian Gao
· 4 days ago
08b1390
[SPARK-56349][PYTHON] Remove unused ArrowStreamAggArrowUDFSerializer
by Yicong-Huang
· 4 days ago
561b7b9
[SPARK-56348][PYTHON] Remove unused ArrowBatchUDFSerializer
by Yicong-Huang
· 4 days ago
c6a198e
[MINOR][PYTHON] Fix PySparkException failing when messageParameters is omitted
by Fangchen Li
· 4 days ago
c1dd15c2
[SPARK-49793][PYTHON][TESTS][FOLLOW-UP] Fix test_caching in connect mode
by Tian Gao
· 4 days ago
49e06b3
[SPARK-56317][SQL] GetJsonObjectEvaluator should reuse output buffer
by Xianming Lei
· 4 days ago
ed01691
[MINOR][DOCS] Fix a typo of "KLL" initialism
by donaldchai
· 4 days ago
263c976
[SPARK-56209][BUILD][FOLLOWUP] Exclude Netty transitive dependencies from Vert.x in Kubernetes modules
by yangjie01
· 5 days ago
1ea48e4
[SPARK-56007][CONNECT] Fix ArrowDeserializer to use positional binding for rows
by Herman van Hövell
· 5 days ago
a20bfc7
[SPARK-56356][BUILD] Fix an issue in release build caused by error on fetching artifacts
by Kousuke Saruta
· 5 days ago
842eb7b
[SPARK-54938][PYTHON][TEST][FOLLOW-UP] Fix inferred time unit for pandas >= 3
by Ruifeng Zheng
· 5 days ago
d9c8eda
[SPARK-56322][CONNECT][PYTHON] Fix TypeError when self-joining observed DataFrames
by Marcin Wojtyczka
· 7 days ago
504060f
[SPARK-56347][TEST] Fix TOCTOU race in DockerJDBCIntegrationSuite port allocation
by Vladan Vasić
· 7 days ago
5f5fc89
[SPARK-56329][PYTHON] Fix all E721 type comparison violations
by Ruifeng Zheng
· 7 days ago
cba5705
[SPARK-56341][PYTHON][DOCS] Fix outdated PyArrow minimum version in arrow_pandas.rst
by Yicong-Huang
· 7 days ago
50f7179
[SPARK-56313][PYTHON][FOLLOWUP] Remove rddsampler from mypy exception list
by Tian Gao
· 7 days ago
e130525
[SPARK-55902][PYTHON] Refactor SQL_ARROW_BATCHED_UDF
by Yicong-Huang
· 7 days ago
7cb54df
[SPARK-56280][SS] normalize NaN and +/-0.0 in streaming dedupe node
by Richard Chen
· 7 days ago
e2f15f6
[SPARK-56243][SS] Throw detailed error on malformed Kafka record timestamps
by Kavpreet Grewal
· 7 days ago
ccd4206
[SPARK-56345][PYTHON][TESTS] Use `pd.Series.__name__` in Arrow UDF type-hint test
by Takuya Ueshin
· 7 days ago
650b0a6
[SPARK-56338][INFRA] Support Maven mirrors for build
by Tian Gao
· 7 days ago
e9a348e
[SPARK-56327][PYTHON][TESTS] Fix grouped map pandas tests for pandas 3
by Takuya Ueshin
· 8 days ago
922de74
[SPARK-56323][SQL] Propagate ROW FORMAT / STORED AS to v2 catalog in CREATE TABLE LIKE
by Liang-Chi Hsieh
· 8 days ago
671c65f
[SPARK-56321][SQL] Fix `AnalysisException` when scan reports transform-based ordering via `SupportsReportOrdering`
by Peter Toth
· 8 days ago
73a272d
[SPARK-56189][PYTHON] Refactor SQL_WINDOW_AGG_ARROW_UDF
by Yicong-Huang
· 8 days ago
b580b4f
[SPARK-56251][SQL] Add default fetchSize for postgres to avoid loading all data in memory
by Tengfei Huang
· 8 days ago
3a22620
[SPARK-56313][PYTHON][FOLLOWUP] Use old way to label generic for a class
by Tian Gao
· 8 days ago
0d8e031
Revert "[SPARK-56189][PYTHON] Refactor SQL_WINDOW_AGG_ARROW_UDF"
by Ruifeng Zheng
· 8 days ago
0af9722
[SPARK-56260][INFRA] Pin third-party GitHub Actions to commit SHA
by Yicong Huang
· 8 days ago
25e85b2
[SPARK-56222][PYTHON] Create ArrowStreamGroupSerializer and ArrowStreamCoGroupSerializer
by Yicong Huang
· 8 days ago
3433c38
[SPARK-56189][PYTHON] Refactor SQL_WINDOW_AGG_ARROW_UDF
by Yicong Huang
· 8 days ago
d498b06
[SPARK-56314][SQL][TESTS] Avoid unnecessary RDD->DataFrame conversion in `SQLTestData`
by Ruifeng Zheng
· 8 days ago
989df5e
[SPARK-56306][SQL] Fix collation-aware PIVOT
by Akash Nayar
· 8 days ago
00b9451
[SPARK-55981][SQL] Allow Geo Types with SRID's from the pre-built registry
by Szehon Ho
· 8 days ago
463a188
[SPARK-56190][SQL] Support nested partition columns for DSV2 PartitionPredicate
by Szehon Ho
· 8 days ago
e0bc7aa
[SPARK-56313][PYTHON][FOLLOWUP] Remove the generic for rddsampler methods
by Tian Gao
· 8 days ago
e75d6fb
[SPARK-56219][PS][FOLLOW-UP] Keep legacy groupby idxmax and idxmin skipna=False behavior for pandas 2
by Takuya Ueshin
· 9 days ago
fec2804
[SPARK-56310][PYTHON] Handle pandas 3 dtype in DataFrame.toPandas
by Takuya Ueshin
· 9 days ago
0f0c0e2
[SPARK-56296][SQL] Pivot createTableLike to pass full TableInfo including schema, partitioning, constraints, and owner
by Liang-Chi Hsieh
· 9 days ago
3e7e126
[SPARK-56001][SQL] Add INSERT INTO ... REPLACE ON/USING syntax
by Thang Long VU
· 9 days ago
dd492dd
[SPARK-56054][SQL] Fix aliased nested fields ignored for schema evolution in MERGE
by Johan Lasperas
· 9 days ago
9518075
[SPARK-54938][PYTHON][TEST][FOLLOW-UP] Fix `test_pyarrow_array_type_inference` for pandas >= 3
by Ruifeng Zheng
· 9 days ago
d1916e3
[SPARK-56313][PYTHON] Add type hint for rddsampler.py
by Tian Gao
· 9 days ago
1dd26f9
[SPARK-56271][PYTHON] Fix type hint and remove unused method for _globals.py
by Tian Gao
· 9 days ago
384f543
[SPARK-56311][PYTHON] Add type hints for daemon.py
by Tian Gao
· 9 days ago
89cb692
[SPARK-56123][PYTHON][FOLLOWUP] Avoid using concat_batches for old version of pyarrow
by Tian Gao
· 9 days ago
d580b65
[SPARK-49543][SQL] Add SHOW COLLATIONS command
by Liang-Chi Hsieh
· 9 days ago
a4719590
[SPARK-56307][BUILD] Upgrade `log4j` to 2.25.4
by Dongjoon Hyun
· 10 days ago
e6feb27
[SPARK-51988][SS] Do file checksum verification on read for RocksDB zip file
by Gurpreet Nanda
· 10 days ago
e7eceb7
[SPARK-56235][CORE] Add reverse index in TaskSetManager to avoid O(N) scans in executorLost
by DenineLu
· 10 days ago