1. fcf0d4c [SPARK-56011][PYTHON][INFRA] Add python/asv wrapper script for running benchmarks from repo root by Yicong-Huang · 4 hours ago master
  2. b634429 [SPARK-50284][PYTHON] Change docs for parseJson function by judy · 5 hours ago
  3. 016875b [SPARK-53675][PYTHON] Add str support in withColumn and withColumns in PySpark by Zoey Han · 5 hours ago
  4. 1a8ffff [SPARK-50111][PYTHON] Add subplots support for pie charts in Plotly backend by RishbhaJain · 5 hours ago
  5. 055cfb3 [SPARK-47997][PS] Add errors parameter to DataFrame.drop and Series.drop by Devin Petersohn · 5 hours ago
  6. 9683519 [SPARK-55948][SQL] Add DSv2 CDC connector API, analyzer resolution, and SQL CHANGES clause by Gengliang Wang · 5 hours ago
  7. 68cbd1c [SPARK-56043][SQL] Wrap NullPointerException from Avro 1.12.x ParseContext.resolve() in SchemaParseException by Jerry Peng · 6 hours ago
  8. 81b6b05 [SPARK-55056][SQL][PYTHON][TEST] Add tests using Arrow to deserialize nested array with empty outer array by Yicong Huang · 6 hours ago
  9. 2e7d0c9 [SPARK-56031][SQL] Make Natural Join column matching respect case sensitivity conf by Stefan Kandic · 8 hours ago
  10. ca60558 [SPARK-55645][SQL][FOLLOWUP] Move serdeName to last parameter and filter empty strings by Wenchen Fan · 9 hours ago
  11. cd2b43d [SPARK-55728][SS] Introduce conf for file checksum threadpool size and support disabling the threadpool by Gurpreet Nanda · 11 hours ago
  12. 50514c5 [SPARK-56045][SQL] Add flag for ignoring Parquet UNKNOWN type annotation and revert to old behavior by Ziya Mukhtarov · 15 hours ago
  13. b06bcfc [SPARK-55964] system catalog wins over user catalog for BUILTIN, and SESSION schemas by Serge Rielau · 15 hours ago
  14. 99044a8 [SPARK-44065][SQL] Optimize BroadcastHashJoin skew in OptimizeSkewedJoin by wforget · 15 hours ago
  15. f322271 [SPARK-56042][SS] Fix swapped external/internal col family count metrics in RocksDBStateStoreProvider by Jerry Peng · 22 hours ago
  16. a4dcd6d [SPARK-55596][SQL] DSV2 Enhanced Partition Stats Filtering by Szehon Ho · 24 hours ago
  17. d95f1da [SPARK-55973][SS] LeftSemi optimization for stream-stream join by Jungtaek Lim · 27 hours ago
  18. 5fa8dc7 [SPARK-56021][SS] Increase AutoSnapshotRepair default maxChangeFileReplay threshold from 50 to 500 by micheal-o · 28 hours ago
  19. 523921a [SPARK-55963][K8S] Optimize snapshot traversal in ExecutorPodsAllocator by DenineLu · 28 hours ago
  20. 5a07d8d [SPARK-56041][PS][TESTS] Normalize ndarray values in apply_batch typed result comparison for pandas 3 by Takuya Ueshin · 29 hours ago
  21. b80148a [SPARK-53970][PYTHON] Remove incorrect 'optional' tag for messageName… by holyvolcano · 29 hours ago
  22. d8bc686 [SPARK-52785][PYTHON] Simplifying super() syntax in PySpark by Simola Nayak · 30 hours ago
  23. 0390e4b [MINOR][CORE][TESTS] Call System.gc() before running test java.lang.ArrayIndexOutOfBoundsException in TimSort by Cheng Pan · 30 hours ago
  24. ed21be7 [SPARK-55851][PYTHON][FOLLOW-UP] Deal with NotImplementedError for partitions() by Tian Gao · 30 hours ago
  25. 6c5d707 [SPARK-56039][INFRA] Install `remotes` R package for `dev/infra/Dockerfile` by Dongjoon Hyun · 32 hours ago
  26. d06d086 [SPARK-55983][SQL] New single-pass analyzer functionality and bugfixes by Mihailo Timotic · 34 hours ago
  27. 5acd8e6 [SPARK-56027][INFRA] Fix `NoEmptyContinuation` warning of `spark-test-image/lint/Dockerfile` by Dongjoon Hyun · 2 days ago
  28. b64383c [SPARK-48139][CONNECT][TESTS] Re-enable `SparkSessionE2ESuite.interrupt tag` by Kousuke Saruta · 2 days ago
  29. 2bc6f75 [SPARK-55998][SHS] Synchronize more places on accessing SHS listing.db by Cheng Pan · 2 days ago
  30. f64a7df [SPARK-55996][CORE] Remove default `jdk.reflect.useDirectMethodHandle=false` by Cheng Pan · 2 days ago
  31. e3c947d [SPARK-56000][BUILD] Upgrade `arrow-java` to 19.0.0 by yangjie01 · 2 days ago
  32. 54164e9 [SPARK-56025][INFRA] Install `remotes` R package for `spark-test-image/(lint|docs|sparkr)/Dockerfile` by Dongjoon Hyun · 2 days ago
  33. 6e05916 [SPARK-55964] Cache coherence: clear function registry on DROP DATABASE by Serge Rielau · 2 days ago
  34. 9adf791 [SPARK-55857][SQL] Support ignoreMissingFiles during schema inference… by Yash Botadra · 2 days ago
  35. cca4a12 [SPARK-56002][UI] Make SQL plan visualization metrics table sortable by Kent Yao · 2 days ago
  36. c6b4a86 [SPARK-55714][SQL][FOLLOWUP] Narrow overflow message canonicalization to only match JDK patterns by Wenchen Fan · 2 days ago
  37. cee7f80 [SPARK-55995][SQL] Support TIMESTAMP WITH LOCAL TIME ZONE in SQL syntax by Cheng Pan · 2 days ago
  38. 40417a6 Revert "[SPARK-56013][BUILD] Upgrade `JAXB` to 4.0.6" by Dongjoon Hyun · 2 days ago
  39. e51e707 [SPARK-56008][BUILD] Upgrade `tink` to 1.20.0 by Dongjoon Hyun · 2 days ago
  40. ed3cbe9 [SPARK-56013][BUILD] Upgrade `JAXB` to 4.0.6 by Dongjoon Hyun · 2 days ago
  41. abaa2f0 [SPARK-56012][BUILD] Upgrade `xz` to 1.12 by Dongjoon Hyun · 2 days ago
  42. 51c3e37 [SPARK-56009][BUILD] Upgrade `netty-tcnative` to 2.0.75.Final by Dongjoon Hyun · 2 days ago
  43. 71f1edc [SPARK-56016][PS] Preserve named Series columns in concat with ignore_index on pandas 3 by Takuya Ueshin · 2 days ago
  44. 6f3ece6 [SPARK-56014][PS][TESTS] Fix to_numeric ignore test for pandas 3.0 by Takuya Ueshin · 2 days ago
  45. 8f9a3de [SPARK-56010][BUILD] Upgrade `snowflake-jdbc` to 4.0.2 by Dongjoon Hyun · 2 days ago
  46. 951c624 [SPARK-56006][BUILD] Upgrade Maven to 3.9.14 by Dongjoon Hyun · 2 days ago
  47. 124d0a9 [SPARK-55885][SQL] Optimize vectorized Parquet boolean reading with lookup-table expansion and batch buffer reads by yangjie01 · 2 days ago
  48. 48940be [SPARK-55877][UI] Side-by-side Initial vs Final plan comparison for AQE queries by Kent Yao · 2 days ago
  49. 7eef6f7 [SPARK-55997][SS] Set upper bound to prefixScan in RocksDB state store provider by Jungtaek Lim · 2 days ago
  50. a936ccf [SPARK-55887][CONNECT] Special handling for `CollectLimitExec/CollectTailExec` to avoid full table scans by yangjie01 · 2 days ago
  51. 9b70fca [SPARK-55992][SQL] Fix GroupPartitions textual representation by Peter Toth · 2 days ago
  52. cbcee8c [SPARK-55986][PYTHON] Upgrade black to 26.3.1 by yangjie01 · 2 days ago
  53. 73dd6ed [SPARK-55690] Schema evolution in DSv2 AppendData, OverwriteByExpression, OverwritePartitionsDynamic by Johan Lasperas · 3 days ago
  54. cbbbd41 [SPARK-55790][GEO][SQL] Build a complete SRS registry using PROJ 9.7.1 data by Uros Bojanic · 3 days ago
  55. 09979af [SPARK-53339][CONNECT] Fix interrupt on pending operations by moving `postStarted()` and allowing Pending to Canceled/Failed transition by Kousuke Saruta · 3 days ago
  56. ceba3da [SPARK-55984][SQL][TESTS] Add metadata_column_resolution.sql golden file by mihailoale-db · 3 days ago
  57. 6730ccd [SPARK-55453][SQL] Fix LIKE pattern matching for supplementary Unicode characters by Xiaoxuan Li · 3 days ago
  58. 6fda5fb [SPARK-55357][PYTHON] Fix docstring for timestamp_add by judy · 3 days ago
  59. fce6cce [SPARK-55993][SS][TEST] Fix flaky RocksDBStateStoreIntegrationSuite bounded memory test by Kent Yao · 3 days ago
  60. 13e44f7 [SPARK-55988][PS][TESTS] Compare categorical index codes by values in tests by Takuya Ueshin · 4 days ago
  61. 47424a3 [SPARK-55989][PS] Preserve non-int64 index dtypes in `restore_index` by Takuya Ueshin · 4 days ago
  62. 322165b [SPARK-55977][PS] Fix isin() to use strict type matching like pandas by Devin Petersohn · 4 days ago
  63. 4d79768 [SPARK-55991] Fix unicode related SQL text corruption with parameters by Serge Rielau · 4 days ago
  64. 6ab9428 [SPARK-55880][UI] Link SQL plan metric stage IDs to stage detail page by Kent Yao · 5 days ago
  65. 81f2172 [SPARK-55985][WEBUI] Remove `jquery.blockUI.min.js` by Kousuke Saruta · 5 days ago
  66. a5ad1a7 [SPARK-55557][SQL] Hyperbolic functions should not overflow with large inputs by Marco Gaido · 5 days ago
  67. ae20bb9 [SPARK-55987][SS] Fix time window join in stream-stream join state format V4 by Nicholas Chew · 5 days ago
  68. bac7ce1 [SPARK-55493][SS] Do not mkdirs in streaming checkpoint offset/commit log directory in StateDataSource by Livia Zhu · 5 days ago
  69. bff9dcf [SPARK-55945][SDP] Support structured identifiers for flows in SDP eager analysis protos by Yuheng Chang · 5 days ago
  70. e8d8e6a [SPARK-55971][UI] Add Jobs table to SQL execution detail page by Kent Yao · 6 days ago
  71. e7bbd32 [SPARK-55975][SQL][TESTS] NaN comparison can cause false UT failures due to different NaNs by Marco Gaido · 6 days ago
  72. 9a9c714 [SPARK-55628][SS] Integrate stream-stream join state format V4 by Nicholas Chew · 6 days ago
  73. 8efc4c6 [SPARK-55967][PYTHON] Unify column conversion for connect dataframe by Tian Gao · 6 days ago
  74. cf2aadd [SPARK-55980][PS] Always apply _cast_back_float in numeric arithmetic by Devin Petersohn · 6 days ago
  75. 0e8d39e [MINOR][DOCS] Remove redundant backtick in docstrings by Joon Ro · 6 days ago
  76. 85b351b [SPARK-55976][SQL] Use Set instead of Seq for write privileges by Anton Okolnychyi · 6 days ago
  77. 06a7b2d [SPARK-55870][SQL] Add docs for Geo types by Szehon Ho · 6 days ago
  78. c1f4d11e [SPARK-55275] Add InvalidPlanInput sql states for sql/connect by Garland Zhang · 7 days ago
  79. 14d659e [SPARK-55535][SQL][FOLLOW-UP] Fix `OrderedDistribution` handling and minor improvements to `EnsureRequirements` by Peter Toth · 7 days ago
  80. 2a5c0df [SPARK-55726][PYTHON][TEST][FOLLOW-UP] Make SQL_GROUPED_MAP_PANDAS_UDF benchmark to two bench classes by Yicong Huang · 7 days ago
  81. 12aa167 [SPARK-55960][INFRA][DOCS][FOLLOW-UP] Document how to re-generate the protobuf files for python client by Ruifeng Zheng · 7 days ago
  82. 587dfa4 [SPARK-55947][PYTHON][TEST] Add ASV micro-benchmarks for SQL_GROUPED_MAP_ARROW_UDF and SQL_GROUPED_MAP_ARROW_ITER_UDF by Yicong Huang · 7 days ago
  83. 320ece1 [SPARK-55928][SQL] New linter for config effectiveness in views and UDFs by Mihailo Timotic · 7 days ago
  84. 58cb617 [SPARK-55961][UI] Make SQL plan viz side panel collapsible by Kent Yao · 7 days ago
  85. 4cdaae7 [SPARK-55962][SQL] Use `getShort` instead of `getInt` casting in `putShortsFromIntsLittleEndian` on Little Endian platforms by yangjie01 · 7 days ago
  86. 8d64502 [SPARK-55667][PYTHON][CONNECT] Move check_dependencies to __init__ by Tian Gao · 7 days ago
  87. 1b63af8 [SPARK-55851][PYTHON] Clarify types of datasource partition and read by Tian Gao · 7 days ago
  88. 24b9d2c [SPARK-55965][PYTHON] Add warning when pandas >= 3.0.0 is used with PySpark by Yicong Huang · 7 days ago
  89. fa87249 [SPARK-55891][SQL] Preserve the SQL scripting context inside EXECUTE IMMEDIATE by ilicmarkodb · 8 days ago
  90. 104e43b [SPARK-55903][SQL] Simplify MERGE Schema Evolution and Check Write Privileges by Szehon Ho · 8 days ago
  91. 5d207b2 [SPARK-55960][INFRA][CONNECT][PYTHON][FOLLOW-UP] Fix build on linux by Ruifeng Zheng · 8 days ago
  92. ccf44c0 [SPARK-55960][INFRA][CONNECT][PYTHON] Add a docker image for spark connect codegen by Ruifeng Zheng · 8 days ago
  93. 12dc89e [SPARK-55909][SQL][TESTS] Introduce trait `SparkSessionProvider` by Ruifeng Zheng · 8 days ago
  94. 305cdc3 [SPARK-55957][SQL] Add 'DATA_SOURCE_NOT_FOUND' in Catalog.ERROR_HANDLING_RULES by Hyukjin Kwon · 8 days ago
  95. a47e2d1 [SPARK-55907][SQL] Fix incorrect error positions for invalid data types in CREATE FUNCTION by Gengliang Wang · 8 days ago
  96. d6aa376 [SPARK-55884][SQL] Add v1StatsToV2Stats to DataSourceV2Relation by Xin Huang · 8 days ago
  97. c433283 [SPARK-55954][PYTHON] Remove the incorrect overload type hint for fillna by Tian Gao · 8 days ago
  98. e5a900a [SPARK-55955][PYTHON] Remove overload type hint for drop by Tian Gao · 8 days ago
  99. 69f4d00c [SPARK-55958][BUILD][CONNECT] Remove unused `add-scala-test-sources` setting from `pom.xml` in `connect-common` by Kousuke Saruta · 8 days ago
  100. f1acf94 [SPARK-55889][DOCS][FOLLOWUP] Update `building-spark.md` with Maven 3.9.13 by Dongjoon Hyun · 8 days ago