Sign in
apache
/
spark
/
HEAD
0e1101a
[SPARK-57388][INFRA] Pin downstream actions/checkout to a single resolved SHA in maven_test.yml and python_hosted_runner_test.yml
by Ruifeng Zheng
· 7 hours ago
master
5d3aa9a
[SPARK-34679][SQL][DOC] Add inferTimestamp option to JSON data source options table
by BRIJ RAJ KISHORE
· 10 hours ago
3d4c8b0
[SPARK-56903][SQL][FOLLOWUP] Fix null join key shuffle config version
by Chao Sun
· 13 hours ago
913f105
[SPARK-57387][YARN] Make executor JVM options `-XX:OnOutOfMemoryError` configurable on YARN
by Cheng Pan
· 13 hours ago
f64739d
[SPARK-57381][PYTHON] Refactor SQL_WINDOW_AGG_PANDAS_UDF
by Yicong Huang
· 16 hours ago
3349526
[SPARK-57360][SQL] Block temporary variables in generated column expressions
by Szehon Ho
· 16 hours ago
b88952c
[SPARK-57393] Build: PySpark and SparkR source distributions are missing LICENSE and NOTICE files
by Huaxin Gao
· 17 hours ago
bcdcd47
[SPARK-57133][SQL] Add BIN BY relation operator parsing and resolution
by Nikolina Vraneš
· 17 hours ago
3fce4cf
[SPARK-57073][SS][PYTHON][TEST] Catch AnalysisException for test_parity_listener
by Tian Gao
· 17 hours ago
a423d06
[SPARK-57327][INFRA] Move scheduled CIs for 4.1 to branch-4.1
by Tian Gao
· 18 hours ago
a1922b5
[SPARK-57369][SQL] Move main EXECUTE IMMEDIATE resolution logic to common code
by Mihailo Aleksic
· 19 hours ago
9357bc9
[SPARK-57295][SQL] Make database location validation consistent for w…
by Anurag Kumar Dwivedi
· 20 hours ago
8cced6f
[SPARK-57020][PYTHON][TEST] Add ASV microbenchmark for SQL_TRANSFORM_WITH_STATE_PANDAS_UDF
by Yicong Huang
· 22 hours ago
be299a1
[SPARK-57361][PYTHON] Refactor SQL_ARROW_UDTF
by Yicong Huang
· 22 hours ago
89dff6b
[SPARK-57321][SQL] Infer CSV schema from tar archives
by akshatshenoi-db
· 22 hours ago
302ba67
[SPARK-56877][SQL][FOLLOWUP] Make PartitioningCollection invariant check O(1) per nesting level
by Wenchen Fan
· 22 hours ago
e33017a
[SPARK-57383][SQL][PYTHON] Honor configured Arrow zstd compression level when writing Arrow batches
by Liang-Chi Hsieh
· 24 hours ago
6693d43
[SPARK-57377][INFRA] Add CI check to prevent new entries in the config binding policy exceptions file
by Wenchen Fan
· 26 hours ago
90f6bab
[SPARK-57368][PYTHON][ML][TEST] Fix assertTrue misuse in PySpark tests
by Ruifeng Zheng
· 27 hours ago
60acc8f
[SPARK-57332][SQL][FOLLOWUP] Fix line length exceeding 100 characters in JDBCSuite and V2ExpressionSQLBuilder
by Kousuke Saruta
· 32 hours ago
9018d84
[SPARK-57263][SQL][FOLLOWUP] Fix Hive 4.2 getTablesByName compatibility
by YangJie
· 2 days ago
f2d11a6
[SPARK-57313][SQL] Fix SampleExec numOutputRows metric when whole-stage codegen is disabled
by Eric Yang
· 2 days ago
79dcac9
[SPARK-53840][SQL] Add AS JSON output support for SHOW TABLES and SHOW TABLE EXTENDED
by Ayush
· 2 days ago
8e22e99
[SPARK-57359][DOC] Document the MERGE INTO statement in the SQL reference
by Szehon Ho
· 2 days ago
f5eabcb
[SPARK-57332][SQL] Fix MySQL backslash escaping in LIKE predicate pushdown via a dialect string-literal escaping hook
by Wenchen Fan
· 2 days ago
da67157
[SPARK-56887][SQL] Add dedicated sort-merge physical operator for AS-OF join
by Kousuke Saruta
· 2 days ago
1175d40
[SPARK-57285][SQL] Route nanosecond timestamp cast-to-string through the Types Framework
by Maxim Gekk
· 2 days ago
d2cbc7f
[SPARK-57374][BUILD] Upgrade `netty-tcnative` to 2.0.78.Final
by Dongjoon Hyun
· 2 days ago
44984a6
[SPARK-56995][SQL][DML][TESTS][FOLLOWUP] Fix AutoCdcScd1FullRefreshSuite by returning live table from SharedTablesInMemoryRowLevelOperationTableCatalog
by Kousuke Saruta
· 2 days ago
9eb44b3
[SPARK-57355][PYTHON] Fix __module__ check in udf profiler
by Tian Gao
· 2 days ago
0e8a75f
[SPARK-57207][SQL][FOLLOWUP] Fix StackOverflowError when setting timestampNanosTypes.enabled via SparkConf
by Stevo Mitric
· 2 days ago
a852aa3
[SPARK-57325][CONNECT] Stop streaming queries registered while the Connect session is closing
by DB Tsai
· 2 days ago
bd04636
[SPARK-57318][SQL] Refactor WorkerSession into a state-machine interface
by Haiyang Sun
· 2 days ago
0b05352
[MINOR][PYTHON][TEST] Use assertEqual instead of assertTrue in PySpark tests
by Ruifeng Zheng
· 2 days ago
19aec7a
[SPARK-57315][SQL] Support HOUR, MINUTE and SECOND functions over nanosecond-precision timestamps
by Maxim Gekk
· 2 days ago
cc88e6c
[SPARK-57367][PYTHON][DOC] Improve See Also cross-references in pyspark.sql.functions
by Ruifeng Zheng
· 2 days ago
3ad5b64
[SPARK-57194][SQL] Add preOperatorOptimizationRules extension point to Optimizer
by Shrirang Mhalgi
· 2 days ago
bbd7c46
[SPARK-57261][SQL] Allow to disable HashAggregateExec by config
by Cheng Pan
· 2 days ago
6d4b71e
[SPARK-57344][INFRA] Ensure tests for `pipelines` module triggered when sql-related modules are modified
by YangJie
· 2 days ago
e598b0c
[SPARK-57348][PYTHON][TESTS] Replace sql_keywords doctest show() with columns check
by Kousuke Saruta
· 2 days ago
62ae4db
[SPARK-57152][SDP] Implement SCD2 Batch Processor; Find Affected Aux/Target Table Rows
by AnishMahto
· 2 days ago
844f6f0
[SPARK-57338][SQL] Render external values in Row JSON via formatExternal
by Maxim Gekk
· 2 days ago
5001ba0
[MINOR][PYTHON][DOC] Fix broken See Also links in pyspark.sql.functions
by Ruifeng Zheng
· 3 days ago
fc527bc
[SPARK-57316][DOC] Document WITH SCHEMA EVOLUTION and BY NAME for SQL INSERT
by Thang Long Vu
· 3 days ago
af39d95
Revert "[SPARK-57133][SQL] Add BIN BY relation operator parsing and resolution"
by Dongjoon Hyun
· 3 days ago
2bb8b20
[SPARK-57351][K8S][CORE] Enable `spark.kubernetes.executor.useDriverPodIP` by default
by Dongjoon Hyun
· 3 days ago
6c2325a
[SPARK-56758][PYTHON] Refactor SQL_MAP_PANDAS_ITER_UDF
by Yicong Huang
· 3 days ago
5227368
[SPARK-57212][SQL][FOLLOWUP] Record AQE rule timing into the shared tracker via a lock instead of per-node trackers
by Wenchen Fan
· 3 days ago
05b4635
[SPARK-57148][SQL] Rename splitSemiColonWithIndex to splitSemiColon
by Anupam Yadav
· 3 days ago
029731c
[SPARK-57349][CONNECT] Split udf protocol into message and grpc service.
by Haiyang Sun
· 3 days ago
e94782c
[SPARK-56538][CONNECT] Add per-RPC deadlines to Spark Connect client
by pranavdev022
· 3 days ago
098057d
[SPARK-57212][SQL] Track preparation and AQE rule timing in `QueryPlanningTracker`
by Peter Toth
· 3 days ago
ac4457e
[SPARK-57259][SQL][TEST] Add nanosecond timestamp types to DataTypeTestUtils type sets
by Maxim Gekk
· 3 days ago
bf79473
[SPARK-57274][CONNECT] Support fetch/type accessors and getMoreResults for SparkConnectStatement
by Jiwon Park
· 3 days ago
391d65a
[SPARK-57234][SS][DOCS] Add Real-time Mode documentation page to the Structured Streaming guide
by Boyang Jerry Peng
· 3 days ago
13ea0f5
[SPARK-57281][SQL][SS] Remove @Experimental annotation from Real-time mode
by Boyang Jerry Peng
· 3 days ago
3e7cae7
[SPARK-50520][PYTHON] Respect timeout in df.rdd.countApprox()
by Rishav Sinha
· 4 days ago
b67073f
[SPARK-57253][SQL] Add `jaro_winkler_similarity` built-in function
by Kousuke Saruta
· 4 days ago
cc64f0a
[SPARK-52719][SQL] Support using scalar UDFs in TVF arguments
by Anupam Yadav
· 4 days ago
3744250
[SPARK-54876][SQL] Fix splitSemiColon dropping statement ending with block comment
by Anupam Yadav
· 4 days ago
ac0c117
[SPARK-57330][INFRA] Switch shared CI compile artifacts to zstd compression
by Ruifeng Zheng
· 4 days ago
0993d43
[SPARK-57314][PS][TEST] Add tests for Index.equals in pandas-on-Spark
by tonghuaroot (童话)
· 4 days ago
3c31d68
[SPARK-57287][SQL] Escape backslash in LIKE pattern for STARTS_WITH/ENDS_WITH/CONTAINS pushdown
by Shrirang Mhalgi
· 4 days ago
49908a2
[SPARK-57298][SQL] collect_set fails to dedupe float/double NaN/-0.0 by their semantics
by Eric Yang
· 4 days ago
0849776
[SPARK-57326][SQL][TEST] Honor DEFAULT_ARTIFACT_REPOSITORY in IsolatedClientLoaderIvySettingsSuite
by Wenchen Fan
· 4 days ago
bee16dc
[SPARK-57135][SQL] Support reading CSV files inside tar archives
by akshatshenoi-db
· 4 days ago
3ebf8d6
[SPARK-57320][BUILD] Upgrade Netty to 4.2.15.Final
by Dongjoon Hyun
· 4 days ago
d9c50b2
[SPARK-57224][INFRA] Add input check for merge script
by Tian Gao
· 4 days ago
761afcb
[SPARK-57133][SQL] Add BIN BY relation operator parsing and resolution
by Nikolina Vraneš
· 4 days ago
952a283
[SPARK-57254][INFRA] Put CI-unrelated files in a module so CI won't be triggered
by Tian Gao
· 4 days ago
99db069
[SPARK-57317][SQL] Fix Literal.create for external nanosecond timestamp values
by Maxim Gekk
· 4 days ago
7129ce0
[SPARK-37019][SQL] Add codegen support to array higher-order functions
by Adam Binford
· 4 days ago
b55c2cc
[SPARK-57282][SQL] Spread NULL left anti join keys across shuffle partitions
by Chao Sun
· 4 days ago
542ea3b
[SPARK-56995][SQL][DML] Allow dataframe caching in the DSv2 Transaction API
by Andreas Chatzistergiou
· 4 days ago
1592ec2
[SPARK-56661] Addressing review comments from PR #55768
by Sven Weber
· 4 days ago
b098a58
[SPARK-57258][SQL] Reduce regexp_extract/regexp_extract_all generated code size via shared extract helpers
by YangJie
· 4 days ago
3e02257
[SPARK-57294][PS] Support DataFrame.combine in fallback mode
by tonghuaroot (童话)
· 4 days ago
e8ca287
[SPARK-56830][INFRA] Share SBT compile artifact with python hosted runner CI jobs
by Ruifeng Zheng
· 4 days ago
4d8b715
[SPARK-57277][INFRA] Make CI cache keys OS-specific
by Ruifeng Zheng
· 4 days ago
9c1adaf
[SPARK-57278][INFRA] Install zstd in CI container images to fix GitHub Actions cache
by Ruifeng Zheng
· 4 days ago
96b255f
[SPARK-57262][SQL][WEBUI] Job description derived from a query should respect `spark.sql.redaction.string.regex`
by Kousuke Saruta
· 4 days ago
2660f4d
[SPARK-57293][SQL] Cast between nanosecond-precision and microsecond-precision timestamp types
by Maxim Gekk
· 4 days ago
5077f7f
[SPARK-57255][SQL] Simplify RegExpReplace codegen by extracting the match/replace loop into a shared helper
by YangJie
· 5 days ago
a6ac0b8
[SPARK-57141][SS][RTM][STREAMINGSHUFFLE][PART3] Add StreamingShuffleManager and MultiShuffleManager
by Boyang Jerry Peng
· 5 days ago
4bc7196
[SPARK-57297][SQL][TESTS] Add a test that SQL execution description respects `spark.sql.redaction.string.regex`
by Dongjoon Hyun
· 5 days ago
c9e7421
Revert "[SPARK-57262][SQL][WEBUI] Job description derived from a query should respect `spark.sql.redaction.string.regex`"
by Dongjoon Hyun
· 6 days ago
583e5bb
[SPARK-57262][SQL][WEBUI] Job description derived from a query should respect `spark.sql.redaction.string.regex`
by Kousuke Saruta
· 6 days ago
b66d392
[SPARK-57284][PYTHON][SQL] Add Scala/Python bindings for vector functions
by Kousuke Saruta
· 6 days ago
c082f82
[SPARK-56645][CORE] Fix History Server serving stale UI after app completes
by cxzl25
· 6 days ago
ccffd01
[SPARK-57256][SQL] Cast nanosecond-precision timestamps to string
by Maxim Gekk
· 7 days ago
f3f5677
[SPARK-57247][SQL][CONNECT] Support DataFrame.zip in Spark Connect
by Ruifeng Zheng
· 7 days ago
e113afc
[SPARK-57286][BUILD] Add `wildfly-openssl-macosx-aarch64` dependency to support Apple Silicon
by Dongjoon Hyun
· 7 days ago
060a617
[SPARK-57283][BUILD] Upgrade `wildfly-openssl` to 2.3.0.Final
by Dongjoon Hyun
· 7 days ago
637803e
[SPARK-57257][SQL] Support nanosecond-precision timestamps in Hive results
by Maxim Gekk
· 7 days ago
042ad7d
[SPARK-57176][SQL] Extend nested column pruning through array-returning functions
by Chao Sun
· 7 days ago
0536814
[SPARK-57273][BUILD] Upgrade jackson to 2.21.4
by Dongjoon Hyun
· 7 days ago
a32cda3
[SPARK-57260][SQL] Fix variable resolution in REPLACE WHERE clause of INSERT INTO
by Joel Robin P
· 7 days ago
b2580fc
[SPARK-57263][SQL] Support Hive 4.2 metastore
by YangJie
· 7 days ago
9e32a26
[SPARK-57250][SQL] Construct sub-microsecond timestamp typed literals with precision derived from fractional digits
by Maxim Gekk
· 7 days ago
4915340
[SPARK-57207][SQL] Support nanosecond timestamp types in the Types Framework
by Maxim Gekk
· 8 days ago
Next »