Sign in
apache
/
spark
/
HEAD
fcf0d4c
[SPARK-56011][PYTHON][INFRA] Add python/asv wrapper script for running benchmarks from repo root
by Yicong-Huang
· 4 hours ago
master
b634429
[SPARK-50284][PYTHON] Change docs for parseJson function
by judy
· 5 hours ago
016875b
[SPARK-53675][PYTHON] Add str support in withColumn and withColumns in PySpark
by Zoey Han
· 5 hours ago
1a8ffff
[SPARK-50111][PYTHON] Add subplots support for pie charts in Plotly backend
by RishbhaJain
· 5 hours ago
055cfb3
[SPARK-47997][PS] Add errors parameter to DataFrame.drop and Series.drop
by Devin Petersohn
· 5 hours ago
9683519
[SPARK-55948][SQL] Add DSv2 CDC connector API, analyzer resolution, and SQL CHANGES clause
by Gengliang Wang
· 5 hours ago
68cbd1c
[SPARK-56043][SQL] Wrap NullPointerException from Avro 1.12.x ParseContext.resolve() in SchemaParseException
by Jerry Peng
· 6 hours ago
81b6b05
[SPARK-55056][SQL][PYTHON][TEST] Add tests using Arrow to deserialize nested array with empty outer array
by Yicong Huang
· 6 hours ago
2e7d0c9
[SPARK-56031][SQL] Make Natural Join column matching respect case sensitivity conf
by Stefan Kandic
· 8 hours ago
ca60558
[SPARK-55645][SQL][FOLLOWUP] Move serdeName to last parameter and filter empty strings
by Wenchen Fan
· 9 hours ago
cd2b43d
[SPARK-55728][SS] Introduce conf for file checksum threadpool size and support disabling the threadpool
by Gurpreet Nanda
· 11 hours ago
50514c5
[SPARK-56045][SQL] Add flag for ignoring Parquet UNKNOWN type annotation and revert to old behavior
by Ziya Mukhtarov
· 15 hours ago
b06bcfc
[SPARK-55964] system catalog wins over user catalog for BUILTIN, and SESSION schemas
by Serge Rielau
· 15 hours ago
99044a8
[SPARK-44065][SQL] Optimize BroadcastHashJoin skew in OptimizeSkewedJoin
by wforget
· 15 hours ago
f322271
[SPARK-56042][SS] Fix swapped external/internal col family count metrics in RocksDBStateStoreProvider
by Jerry Peng
· 22 hours ago
a4dcd6d
[SPARK-55596][SQL] DSV2 Enhanced Partition Stats Filtering
by Szehon Ho
· 24 hours ago
d95f1da
[SPARK-55973][SS] LeftSemi optimization for stream-stream join
by Jungtaek Lim
· 27 hours ago
5fa8dc7
[SPARK-56021][SS] Increase AutoSnapshotRepair default maxChangeFileReplay threshold from 50 to 500
by micheal-o
· 28 hours ago
523921a
[SPARK-55963][K8S] Optimize snapshot traversal in ExecutorPodsAllocator
by DenineLu
· 28 hours ago
5a07d8d
[SPARK-56041][PS][TESTS] Normalize ndarray values in apply_batch typed result comparison for pandas 3
by Takuya Ueshin
· 29 hours ago
b80148a
[SPARK-53970][PYTHON] Remove incorrect 'optional' tag for messageName…
by holyvolcano
· 29 hours ago
d8bc686
[SPARK-52785][PYTHON] Simplifying super() syntax in PySpark
by Simola Nayak
· 30 hours ago
0390e4b
[MINOR][CORE][TESTS] Call System.gc() before running test java.lang.ArrayIndexOutOfBoundsException in TimSort
by Cheng Pan
· 30 hours ago
ed21be7
[SPARK-55851][PYTHON][FOLLOW-UP] Deal with NotImplementedError for partitions()
by Tian Gao
· 30 hours ago
6c5d707
[SPARK-56039][INFRA] Install `remotes` R package for `dev/infra/Dockerfile`
by Dongjoon Hyun
· 32 hours ago
d06d086
[SPARK-55983][SQL] New single-pass analyzer functionality and bugfixes
by Mihailo Timotic
· 34 hours ago
5acd8e6
[SPARK-56027][INFRA] Fix `NoEmptyContinuation` warning of `spark-test-image/lint/Dockerfile`
by Dongjoon Hyun
· 2 days ago
b64383c
[SPARK-48139][CONNECT][TESTS] Re-enable `SparkSessionE2ESuite.interrupt tag`
by Kousuke Saruta
· 2 days ago
2bc6f75
[SPARK-55998][SHS] Synchronize more places on accessing SHS listing.db
by Cheng Pan
· 2 days ago
f64a7df
[SPARK-55996][CORE] Remove default `jdk.reflect.useDirectMethodHandle=false`
by Cheng Pan
· 2 days ago
e3c947d
[SPARK-56000][BUILD] Upgrade `arrow-java` to 19.0.0
by yangjie01
· 2 days ago
54164e9
[SPARK-56025][INFRA] Install `remotes` R package for `spark-test-image/(lint|docs|sparkr)/Dockerfile`
by Dongjoon Hyun
· 2 days ago
6e05916
[SPARK-55964] Cache coherence: clear function registry on DROP DATABASE
by Serge Rielau
· 2 days ago
9adf791
[SPARK-55857][SQL] Support ignoreMissingFiles during schema inference…
by Yash Botadra
· 2 days ago
cca4a12
[SPARK-56002][UI] Make SQL plan visualization metrics table sortable
by Kent Yao
· 2 days ago
c6b4a86
[SPARK-55714][SQL][FOLLOWUP] Narrow overflow message canonicalization to only match JDK patterns
by Wenchen Fan
· 2 days ago
cee7f80
[SPARK-55995][SQL] Support TIMESTAMP WITH LOCAL TIME ZONE in SQL syntax
by Cheng Pan
· 2 days ago
40417a6
Revert "[SPARK-56013][BUILD] Upgrade `JAXB` to 4.0.6"
by Dongjoon Hyun
· 2 days ago
e51e707
[SPARK-56008][BUILD] Upgrade `tink` to 1.20.0
by Dongjoon Hyun
· 2 days ago
ed3cbe9
[SPARK-56013][BUILD] Upgrade `JAXB` to 4.0.6
by Dongjoon Hyun
· 2 days ago
abaa2f0
[SPARK-56012][BUILD] Upgrade `xz` to 1.12
by Dongjoon Hyun
· 2 days ago
51c3e37
[SPARK-56009][BUILD] Upgrade `netty-tcnative` to 2.0.75.Final
by Dongjoon Hyun
· 2 days ago
71f1edc
[SPARK-56016][PS] Preserve named Series columns in concat with ignore_index on pandas 3
by Takuya Ueshin
· 2 days ago
6f3ece6
[SPARK-56014][PS][TESTS] Fix to_numeric ignore test for pandas 3.0
by Takuya Ueshin
· 2 days ago
8f9a3de
[SPARK-56010][BUILD] Upgrade `snowflake-jdbc` to 4.0.2
by Dongjoon Hyun
· 2 days ago
951c624
[SPARK-56006][BUILD] Upgrade Maven to 3.9.14
by Dongjoon Hyun
· 2 days ago
124d0a9
[SPARK-55885][SQL] Optimize vectorized Parquet boolean reading with lookup-table expansion and batch buffer reads
by yangjie01
· 2 days ago
48940be
[SPARK-55877][UI] Side-by-side Initial vs Final plan comparison for AQE queries
by Kent Yao
· 2 days ago
7eef6f7
[SPARK-55997][SS] Set upper bound to prefixScan in RocksDB state store provider
by Jungtaek Lim
· 2 days ago
a936ccf
[SPARK-55887][CONNECT] Special handling for `CollectLimitExec/CollectTailExec` to avoid full table scans
by yangjie01
· 2 days ago
9b70fca
[SPARK-55992][SQL] Fix GroupPartitions textual representation
by Peter Toth
· 2 days ago
cbcee8c
[SPARK-55986][PYTHON] Upgrade black to 26.3.1
by yangjie01
· 2 days ago
73dd6ed
[SPARK-55690] Schema evolution in DSv2 AppendData, OverwriteByExpression, OverwritePartitionsDynamic
by Johan Lasperas
· 3 days ago
cbbbd41
[SPARK-55790][GEO][SQL] Build a complete SRS registry using PROJ 9.7.1 data
by Uros Bojanic
· 3 days ago
09979af
[SPARK-53339][CONNECT] Fix interrupt on pending operations by moving `postStarted()` and allowing Pending to Canceled/Failed transition
by Kousuke Saruta
· 3 days ago
ceba3da
[SPARK-55984][SQL][TESTS] Add metadata_column_resolution.sql golden file
by mihailoale-db
· 3 days ago
6730ccd
[SPARK-55453][SQL] Fix LIKE pattern matching for supplementary Unicode characters
by Xiaoxuan Li
· 3 days ago
6fda5fb
[SPARK-55357][PYTHON] Fix docstring for timestamp_add
by judy
· 3 days ago
fce6cce
[SPARK-55993][SS][TEST] Fix flaky RocksDBStateStoreIntegrationSuite bounded memory test
by Kent Yao
· 3 days ago
13e44f7
[SPARK-55988][PS][TESTS] Compare categorical index codes by values in tests
by Takuya Ueshin
· 4 days ago
47424a3
[SPARK-55989][PS] Preserve non-int64 index dtypes in `restore_index`
by Takuya Ueshin
· 4 days ago
322165b
[SPARK-55977][PS] Fix isin() to use strict type matching like pandas
by Devin Petersohn
· 4 days ago
4d79768
[SPARK-55991] Fix unicode related SQL text corruption with parameters
by Serge Rielau
· 4 days ago
6ab9428
[SPARK-55880][UI] Link SQL plan metric stage IDs to stage detail page
by Kent Yao
· 5 days ago
81f2172
[SPARK-55985][WEBUI] Remove `jquery.blockUI.min.js`
by Kousuke Saruta
· 5 days ago
a5ad1a7
[SPARK-55557][SQL] Hyperbolic functions should not overflow with large inputs
by Marco Gaido
· 5 days ago
ae20bb9
[SPARK-55987][SS] Fix time window join in stream-stream join state format V4
by Nicholas Chew
· 5 days ago
bac7ce1
[SPARK-55493][SS] Do not mkdirs in streaming checkpoint offset/commit log directory in StateDataSource
by Livia Zhu
· 5 days ago
bff9dcf
[SPARK-55945][SDP] Support structured identifiers for flows in SDP eager analysis protos
by Yuheng Chang
· 5 days ago
e8d8e6a
[SPARK-55971][UI] Add Jobs table to SQL execution detail page
by Kent Yao
· 6 days ago
e7bbd32
[SPARK-55975][SQL][TESTS] NaN comparison can cause false UT failures due to different NaNs
by Marco Gaido
· 6 days ago
9a9c714
[SPARK-55628][SS] Integrate stream-stream join state format V4
by Nicholas Chew
· 6 days ago
8efc4c6
[SPARK-55967][PYTHON] Unify column conversion for connect dataframe
by Tian Gao
· 6 days ago
cf2aadd
[SPARK-55980][PS] Always apply _cast_back_float in numeric arithmetic
by Devin Petersohn
· 6 days ago
0e8d39e
[MINOR][DOCS] Remove redundant backtick in docstrings
by Joon Ro
· 6 days ago
85b351b
[SPARK-55976][SQL] Use Set instead of Seq for write privileges
by Anton Okolnychyi
· 6 days ago
06a7b2d
[SPARK-55870][SQL] Add docs for Geo types
by Szehon Ho
· 6 days ago
c1f4d11e
[SPARK-55275] Add InvalidPlanInput sql states for sql/connect
by Garland Zhang
· 7 days ago
14d659e
[SPARK-55535][SQL][FOLLOW-UP] Fix `OrderedDistribution` handling and minor improvements to `EnsureRequirements`
by Peter Toth
· 7 days ago
2a5c0df
[SPARK-55726][PYTHON][TEST][FOLLOW-UP] Make SQL_GROUPED_MAP_PANDAS_UDF benchmark to two bench classes
by Yicong Huang
· 7 days ago
12aa167
[SPARK-55960][INFRA][DOCS][FOLLOW-UP] Document how to re-generate the protobuf files for python client
by Ruifeng Zheng
· 7 days ago
587dfa4
[SPARK-55947][PYTHON][TEST] Add ASV micro-benchmarks for SQL_GROUPED_MAP_ARROW_UDF and SQL_GROUPED_MAP_ARROW_ITER_UDF
by Yicong Huang
· 7 days ago
320ece1
[SPARK-55928][SQL] New linter for config effectiveness in views and UDFs
by Mihailo Timotic
· 7 days ago
58cb617
[SPARK-55961][UI] Make SQL plan viz side panel collapsible
by Kent Yao
· 7 days ago
4cdaae7
[SPARK-55962][SQL] Use `getShort` instead of `getInt` casting in `putShortsFromIntsLittleEndian` on Little Endian platforms
by yangjie01
· 7 days ago
8d64502
[SPARK-55667][PYTHON][CONNECT] Move check_dependencies to __init__
by Tian Gao
· 7 days ago
1b63af8
[SPARK-55851][PYTHON] Clarify types of datasource partition and read
by Tian Gao
· 7 days ago
24b9d2c
[SPARK-55965][PYTHON] Add warning when pandas >= 3.0.0 is used with PySpark
by Yicong Huang
· 7 days ago
fa87249
[SPARK-55891][SQL] Preserve the SQL scripting context inside EXECUTE IMMEDIATE
by ilicmarkodb
· 8 days ago
104e43b
[SPARK-55903][SQL] Simplify MERGE Schema Evolution and Check Write Privileges
by Szehon Ho
· 8 days ago
5d207b2
[SPARK-55960][INFRA][CONNECT][PYTHON][FOLLOW-UP] Fix build on linux
by Ruifeng Zheng
· 8 days ago
ccf44c0
[SPARK-55960][INFRA][CONNECT][PYTHON] Add a docker image for spark connect codegen
by Ruifeng Zheng
· 8 days ago
12dc89e
[SPARK-55909][SQL][TESTS] Introduce trait `SparkSessionProvider`
by Ruifeng Zheng
· 8 days ago
305cdc3
[SPARK-55957][SQL] Add 'DATA_SOURCE_NOT_FOUND' in Catalog.ERROR_HANDLING_RULES
by Hyukjin Kwon
· 8 days ago
a47e2d1
[SPARK-55907][SQL] Fix incorrect error positions for invalid data types in CREATE FUNCTION
by Gengliang Wang
· 8 days ago
d6aa376
[SPARK-55884][SQL] Add v1StatsToV2Stats to DataSourceV2Relation
by Xin Huang
· 8 days ago
c433283
[SPARK-55954][PYTHON] Remove the incorrect overload type hint for fillna
by Tian Gao
· 8 days ago
e5a900a
[SPARK-55955][PYTHON] Remove overload type hint for drop
by Tian Gao
· 8 days ago
69f4d00c
[SPARK-55958][BUILD][CONNECT] Remove unused `add-scala-test-sources` setting from `pom.xml` in `connect-common`
by Kousuke Saruta
· 8 days ago
f1acf94
[SPARK-55889][DOCS][FOLLOWUP] Update `building-spark.md` with Maven 3.9.13
by Dongjoon Hyun
· 8 days ago
Next »