- cf3c05d [CELEBORN-2068] TransportClientFactory should close channel explicitly to avoid resource leak for timeout or failure by SteNicholas · 3 days ago main
- 6a0e19c [CELEBORN-2067] Clean up deprecated Guava API usage by SteNicholas · 3 days ago
- 765265a [CELEBORN-2031] Interruption Aware Slot Selection by Aravind Patnam · 6 days ago
- cfb4438 [CELEBORN-2057] Bump ap-loader version from 3.0-9 to 4.0-10 by SteNicholas · 11 days ago
- 532cedb [CELEBORN-1844][FOLLOWUP] alway try to use memory storage if available by mingji · 11 days ago
- 0fa600a [CELEBORN-2055] Fix some typos by codenohup · 11 days ago
- cd5d9cd [CELEBORN-2052] Fix unexpected warning logs in Flink caused by duplicate BufferStreamEnd messages by codenohup · 14 days ago
- a649823 [INFRA] More contributors name mapping by Wang, Fei · 14 days ago
- 41b5154 [CELEBORN-2051] Support write MapPartition to DFS by daowu.hzy · 3 weeks ago
- d2474e0 [CELEBORN-894][FOLLOWUP] update commitMeta before update subPartition… by lijianfu03 · 3 weeks ago
- cde33d9 [CELEBORN-894] End to End Integrity Checks by Gaurav Mittal · 3 weeks ago
- 7a0eee3 [CELEBORN-2045] Add logger sinks to allow persist metrics data and avoid possible worker OOM by mingji · 3 weeks ago
- 0fc7827 [CELEBORN-2036] Fix NPE when TransportMessage has null payload by Jray · 4 weeks ago
- 3ee3a26 [CELEBORN-2046] Specify extractionDir of AsyncProfilerLoader with celeborn.worker.jvmProfiler.localDir by SteNicholas · 4 weeks ago
- 8ae9737 [CELEBORN-2044] Proactively cleanup stream state from ChunkStreamManager when the stream ends by Mridul Muralidharan · 4 weeks ago
- 582726f [CELEBORN-1721][FOLLOWUP] Return softsplit if there is no hardsplit for pushMergeData by Shuang · 4 weeks ago
- 676beca [CELEBORN-2043] Fix IndexOutOfBoundsException exception in getEvictedFileWriter by mingji · 4 weeks ago
- 4d4012e [CELEBORN-2040] Avoid throw FetchFailedException when GetReducerFileGroupResponse failed via broadcast by caohaotian · 4 weeks ago
- dac0f56 [CELEBORN-1056][FOLLOWUP] Support testing of dynamic configuration management cli by SteNicholas · 4 weeks ago
- 6a09794 [CELEBORN-2042] Fix FetchFailure handling when TaskSetManager is not found by gaoyajun02 · 5 weeks ago
- d44242e [MINOR] Batch few celeborn client logs by Sanskar Modi · 5 weeks ago
- 46c9980 [CELEBORN-1056][FOLLOWUP] Support upsert and delete of dynamic configuration management by SteNicholas · 5 weeks ago
- 3d614f8 [CELEBORN-1931][FOLLOWUP] Update config version for worker local flusher gather api by Wang, Fei · 5 weeks ago
- 2a2c6e4 [CELEBORN-2024] Publish commit files fail count metrics by Sanskar Modi · 5 weeks ago
- a0a4260 [CELEBORN-1817][FOLLOWUP] Correct the problematic metrics by Shuang · 5 weeks ago
- 6f1c105 [CELEBORN-1413][FOLLOWUP] Check JAVA_HOME variables for release by Wang, Fei · 5 weeks ago
- cc13c1e [CELEBORN-2011][FOLLOWUP][INFRA] Write sorted authors for release contributors by Wang, Fei · 5 weeks ago
- cfc3f1b [CELEBORN-1319][FOLLOWUP] Support celeborn optimize skew partitions patch for Spark v3.5.6 and v4.0.0 by SteNicholas · 5 weeks ago
- 03f97e6 [CELEBORN-1577][FOLLOWUP] Improve check quota message by Xianming Lei · 6 weeks ago
- 80bdb46 [CELEBORN-1892] Adding register with master fail count metric for worker by Sanskar Modi · 6 weeks ago
- bbd3bb4 [CELEBORN-2033] updateProduceBytes should be called even if updateProduceBytes throws exception by Xianming Lei · 6 weeks ago
- edeeb4b [CELEBORN-1719][FOLLOWUP] Rename throwsFetchFailure to stageRerunEnabled by Xianming Lei · 6 weeks ago
- 68f3230 [CELEBORN-1572][FOLLOWUP] Support to show Celeborn CLI version for sub command by Wang, Fei · 6 weeks ago
- 9a689b7 [CELEBORN-2028] Setup GA for grafana dashboard by Wang, Fei · 6 weeks ago
- 919ece8 [CELEBORN-2015][FOLLOWUP] Retry IOException failures for RPC requests by Sanskar Modi · 6 weeks ago
- 8be7d92 [CELEBORN-2030] Bump Spark from 3.5.5 to 3.5.6 by SteNicholas · 6 weeks ago
- feba7ba [CELEBORN-2029][FLINK] Some minor optimizations in the Flink integration by codenohup · 6 weeks ago
- da84bae [CELEBORN-2027] Allow CelebornShuffleReader to decompress data on demand by Shuang · 6 weeks ago
- 60fa6d0 [CELEBORN-1720][FOLLOWUP] Fix flakyTest - check if fetch failure task another attempt is running or successful by Wang, Fei · 6 weeks ago
- 5e305c3 [CELEBORN-1673][FOLLOWUP] Shouldn't ignore InterruptedException when client retry by lvshuang.xjs · 6 weeks ago
- ebfa1d8 [CELEBORN-2014] updateInterruptionNotice REST API by Aravind Patnam · 6 weeks ago
- 061cdc3 [CELEBORN-2003] Add retry mechanism when completing S3 multipart upload by nicolas.fraison@datadoghq.com · 6 weeks ago
- 2e41877 [CELEBORN-2017][HELM] Add namespace to the metadata by Yi Chen · 7 weeks ago
- 211046d [CELEBORN-2025] RpcFailure Scala 2.13 serialization is incompatible by sychen · 7 weeks ago
- a24164c [CELEBORN-2026] Skip build tez client by mingji · 7 weeks ago
- 73bf154 [CELEBORN-2020][FOLLOWUP] Add --auth-header option to usage of CLI commands by SteNicholas · 7 weeks ago
- c0512c6 [CELEBORN-2022] Spark4 Client should package commons-io by sychen · 7 weeks ago
- 7bde738 [CELEBORN-2021] Fix issues on regression HDFS and OSS before release 0.6 by mingji · 7 weeks ago
- 5a50686 [CELEBORN-2020][FOLLOWUP] Fix CLI master commands authentication testing by Wang, Fei · 7 weeks ago
- 7eee202 [CELEBORN-2023] Spark4 Client incompatible with isLocalMaster method by sychen · 7 weeks ago
- aceee64 [CELEBORN-2018] Support min number of workers selected for shuffle by Sanskar Modi · 7 weeks ago
- 5f58fb1 [CELEBORN-2020] Support http authentication for Celeborn CLI by Wang, Fei · 7 weeks ago
- 68a1db1 [CELEBORN-2005][FOLLOWUP] Introduce ShuffleMetricGroup for numBytesIn, numBytesOut, numRecordsOut, numBytesInPerSecond, numBytesOutPerSecond, numRecordsOutPerSecond metrics by SteNicholas · 7 weeks ago
- 0227a1a [CELEBORN-1627][FOLLOWUP] Fix the issue where the case of name affects the metrics dashboard by Shuang · 7 weeks ago
- 3fb6d5b [CELEBORN-1413][FOLLOWUP] Support dependencies of spark-4.0 profile by SteNicholas · 8 weeks ago
- b447307 [CELEBORN-1413][FOLLOWUP] Bump spark 4.0 version to 4.0.0 by Fei Wang · 8 weeks ago
- aeac31f [CELEBORN-2009] Commit files request failure should exclude worker in LifecycleManager by Sanskar Modi · 8 weeks ago
- c83d498 [CELEBORN-1528][HELM] Use volume claim template to support various storage backend by Yi Chen · 8 weeks ago
- 612464c [CELEBORN-2015] Retry IOException failures for RPC requests by Sanskar Modi · 8 weeks ago
- 14d7212 [MINOR][DOC] Correct configuration values in slotsallocation by sychen · 8 weeks ago
- 0dffcf6 [CELEBORN-2013] Upgrade scala binary version of spark-3.3, spark-3.4, spark-3.5 profile to 2.13.8 by SteNicholas · 8 weeks ago
- d65ff56 [CELEBORN-2012] Add license for http5 by Wang, Fei · 8 weeks ago
- 48fb71e [CELEBORN-2011][INFRA] Add a script to simplify the process of creating release notes by Fei Wang · 8 weeks ago
- 637c423 [CELEBORN-2010][FOLLOWUP] Fix svn staging dir by Fei Wang · 8 weeks ago
- 81c3d91 [CELEBORN-2010][INFRA] Add release guide by Fei Wang · 8 weeks ago
- 11ca1a7 [CELEBORN-2005] Introduce numBytesIn, numBytesOut, numBytesInPerSecond, numBytesOutPerSecond metrics for RemoteShuffleServiceFactory by SteNicholas · 8 weeks ago
- a554261 [CELEBORN-2006] LifecycleManager should avoid parsing shufflePartitionType every time by sychen · 8 weeks ago
- 634343e [CELEBORN-2007] Reduce PartitionLocation memory usage by sychen · 8 weeks ago
- d2befe0 [CELEBORN-2008] SlotsAllocator should select disks randomly in RoundRobin mode by Xianming Lei · 8 weeks ago
- cbf4a14 Bump 0.7.0-SNAPSHOT by Wang, Fei · 9 weeks ago
- f7be341 [CELEBORN-1902] Read client throws PartitionConnectionException by Jinqian Fan · 9 weeks ago
- 2a847ba [MINOR] Change some config version by Wang, Fei · 9 weeks ago
- 082f0dd [CELEBORN-1775][FOLLOWUP] Improve logging around commit files by Sanskar Modi · 9 weeks ago
- 45b94bf [CELEBORN-1996][HELM] Rename volumes.{master,worker} to {master,worker}.volumes and {master.worker}.volumeMounts by Yi Chen · 9 weeks ago
- 46d9d63 [CELEBORN-1916][FOLLOWUP] Improve Aliyun OSS support by SteNicholas · 9 weeks ago
- 0b5a09a [CELEBORN-1896] delete data from failed to fetch shuffles by CodingCat · 9 weeks ago
- a7e6387 [CELEBORN-2004] Filter empty partition before createIntputStream by Fei Wang · 9 weeks ago
- 90ece96 [CELEBORN-2002][MASTER] Audit shuffle lifecycle in separate log file by Wang, Fei · 9 weeks ago
- fd715b4 [CELEBORN-1993] CelebornConf introduces celeborn.<module>.io.threads to specify number of threads used in the client thread pool by SteNicholas · 9 weeks ago
- ec62d92 [CELEBORN-2000] Ignore the getReducerFileGroup timeout before shuffle stage end by Wang, Fei · 9 weeks ago
- d9984c9 [CELEBORN-1800] Introduce ApplicationTotalCount and ApplicationFallbackCount metric to record the total and fallback count of application by SteNicholas · 9 weeks ago
- 4205f83 [CELEBORN-1995] Optimize memory usage for push failed batches by mingji · 9 weeks ago
- 062db5b [CELEBORN-1921][FOLLOWUP] Log the GetReducerFileGroupResponse size to provide insights by Wang, Fei · 9 weeks ago
- e8ae23b [CELEBORN-1960] Fix PauseSpentTime only append the interval check time by zhengtao · 10 weeks ago
- 88124d7 [CELEBORN-1691][FOLLOWUP] Fix the issue that upstream tasks don't rerun and the current task still retry when failed to deserialize in flink by SteNicholas · 10 weeks ago
- a9ce411 [CELEBORN-1998] RemoteShuffleEnvironment should not register InputChannelMetrics repeatedly by SteNicholas · 10 weeks ago
- d03efcb [CELEBORN-1999] OpenStreamTime should use requestId to record cost time by Xianming Lei · 10 weeks ago
- 8e66ac8 [CELEBORN-1994] Introduce disruptor dependency to support asynchronous logging of log4j2 by SteNicholas · 10 weeks ago
- eb2449c [CELEBORN-1989][HELM] Split securityContext into master.podSecurityContext and worker.podSecurityContext by Yi Chen · 10 weeks ago
- 045411a [CELEBORN-1855] LifecycleManager return appshuffleId for non barrier stage when fetch fail has been reported by lijianfu03 · 10 weeks ago
- a547cda [CELEBORN-1974] ApplicationId as metrics label should be behind a config flag by Sanskar Modi · 10 weeks ago
- 9ba54b3 [CELEBORN-1968] Publish metric for unreleased partition location count when worker was gracefully shutdown by Sanskar Modi · 10 weeks ago
- 3896249 [CELEBORN-1978][CIP-14] Add code style checking for cppClient by HolyLow · 2 months ago
- 6ceadd3 [CELEBORN-1487][FOLLOWUP] Fix updateProduceBytes by caohaotian · 2 months ago
- c9ca90c [CELEBORN-1965] Rely on all default hadoop providers for S3 auth by Nicolas Fraison · 2 months ago
- 06bcc20 [CELEBORN-1988][HELM] Split hostNetwork into master.hostNetwork and worker.hostNetwork by Yi Chen · 2 months ago
- 7542adf [CELEBORN-1948] Fix the issue where replica may lose data when HARD_SPLIT occurs during handlePushMergeData by Xianming Lei · 2 months ago
- b2c62d4 [CELEBORN-1987][HELM] Split dnsPolicy into master.dnsPolicy and worker.dnsPolicy by Yi Chen · 2 months ago
- 74b41bb [CELEBORN-1319][CELEBORN-474][FOLLOWUP] PushState uses JavaUtils#newConcurrentHashMap to speed up ConcurrentHashMap#computeIfAbsent by SteNicholas · 2 months ago
- fff9725 [CELEBORN-1760][FOLLOWUP] Remove redundant release on data added in flushBuffer by xinyuwang1 · 2 months ago