1. cf3c05d [CELEBORN-2068] TransportClientFactory should close channel explicitly to avoid resource leak for timeout or failure by SteNicholas · 3 days ago main
  2. 6a0e19c [CELEBORN-2067] Clean up deprecated Guava API usage by SteNicholas · 3 days ago
  3. 765265a [CELEBORN-2031] Interruption Aware Slot Selection by Aravind Patnam · 6 days ago
  4. cfb4438 [CELEBORN-2057] Bump ap-loader version from 3.0-9 to 4.0-10 by SteNicholas · 11 days ago
  5. 532cedb [CELEBORN-1844][FOLLOWUP] alway try to use memory storage if available by mingji · 11 days ago
  6. 0fa600a [CELEBORN-2055] Fix some typos by codenohup · 11 days ago
  7. cd5d9cd [CELEBORN-2052] Fix unexpected warning logs in Flink caused by duplicate BufferStreamEnd messages by codenohup · 14 days ago
  8. a649823 [INFRA] More contributors name mapping by Wang, Fei · 14 days ago
  9. 41b5154 [CELEBORN-2051] Support write MapPartition to DFS by daowu.hzy · 3 weeks ago
  10. d2474e0 [CELEBORN-894][FOLLOWUP] update commitMeta before update subPartition… by lijianfu03 · 3 weeks ago
  11. cde33d9 [CELEBORN-894] End to End Integrity Checks by Gaurav Mittal · 3 weeks ago
  12. 7a0eee3 [CELEBORN-2045] Add logger sinks to allow persist metrics data and avoid possible worker OOM by mingji · 3 weeks ago
  13. 0fc7827 [CELEBORN-2036] Fix NPE when TransportMessage has null payload by Jray · 4 weeks ago
  14. 3ee3a26 [CELEBORN-2046] Specify extractionDir of AsyncProfilerLoader with celeborn.worker.jvmProfiler.localDir by SteNicholas · 4 weeks ago
  15. 8ae9737 [CELEBORN-2044] Proactively cleanup stream state from ChunkStreamManager when the stream ends by Mridul Muralidharan · 4 weeks ago
  16. 582726f [CELEBORN-1721][FOLLOWUP] Return softsplit if there is no hardsplit for pushMergeData by Shuang · 4 weeks ago
  17. 676beca [CELEBORN-2043] Fix IndexOutOfBoundsException exception in getEvictedFileWriter by mingji · 4 weeks ago
  18. 4d4012e [CELEBORN-2040] Avoid throw FetchFailedException when GetReducerFileGroupResponse failed via broadcast by caohaotian · 4 weeks ago
  19. dac0f56 [CELEBORN-1056][FOLLOWUP] Support testing of dynamic configuration management cli by SteNicholas · 4 weeks ago
  20. 6a09794 [CELEBORN-2042] Fix FetchFailure handling when TaskSetManager is not found by gaoyajun02 · 5 weeks ago
  21. d44242e [MINOR] Batch few celeborn client logs by Sanskar Modi · 5 weeks ago
  22. 46c9980 [CELEBORN-1056][FOLLOWUP] Support upsert and delete of dynamic configuration management by SteNicholas · 5 weeks ago
  23. 3d614f8 [CELEBORN-1931][FOLLOWUP] Update config version for worker local flusher gather api by Wang, Fei · 5 weeks ago
  24. 2a2c6e4 [CELEBORN-2024] Publish commit files fail count metrics by Sanskar Modi · 5 weeks ago
  25. a0a4260 [CELEBORN-1817][FOLLOWUP] Correct the problematic metrics by Shuang · 5 weeks ago
  26. 6f1c105 [CELEBORN-1413][FOLLOWUP] Check JAVA_HOME variables for release by Wang, Fei · 5 weeks ago
  27. cc13c1e [CELEBORN-2011][FOLLOWUP][INFRA] Write sorted authors for release contributors by Wang, Fei · 5 weeks ago
  28. cfc3f1b [CELEBORN-1319][FOLLOWUP] Support celeborn optimize skew partitions patch for Spark v3.5.6 and v4.0.0 by SteNicholas · 5 weeks ago
  29. 03f97e6 [CELEBORN-1577][FOLLOWUP] Improve check quota message by Xianming Lei · 6 weeks ago
  30. 80bdb46 [CELEBORN-1892] Adding register with master fail count metric for worker by Sanskar Modi · 6 weeks ago
  31. bbd3bb4 [CELEBORN-2033] updateProduceBytes should be called even if updateProduceBytes throws exception by Xianming Lei · 6 weeks ago
  32. edeeb4b [CELEBORN-1719][FOLLOWUP] Rename throwsFetchFailure to stageRerunEnabled by Xianming Lei · 6 weeks ago
  33. 68f3230 [CELEBORN-1572][FOLLOWUP] Support to show Celeborn CLI version for sub command by Wang, Fei · 6 weeks ago
  34. 9a689b7 [CELEBORN-2028] Setup GA for grafana dashboard by Wang, Fei · 6 weeks ago
  35. 919ece8 [CELEBORN-2015][FOLLOWUP] Retry IOException failures for RPC requests by Sanskar Modi · 6 weeks ago
  36. 8be7d92 [CELEBORN-2030] Bump Spark from 3.5.5 to 3.5.6 by SteNicholas · 6 weeks ago
  37. feba7ba [CELEBORN-2029][FLINK] Some minor optimizations in the Flink integration by codenohup · 6 weeks ago
  38. da84bae [CELEBORN-2027] Allow CelebornShuffleReader to decompress data on demand by Shuang · 6 weeks ago
  39. 60fa6d0 [CELEBORN-1720][FOLLOWUP] Fix flakyTest - check if fetch failure task another attempt is running or successful by Wang, Fei · 6 weeks ago
  40. 5e305c3 [CELEBORN-1673][FOLLOWUP] Shouldn't ignore InterruptedException when client retry by lvshuang.xjs · 6 weeks ago
  41. ebfa1d8 [CELEBORN-2014] updateInterruptionNotice REST API by Aravind Patnam · 6 weeks ago
  42. 061cdc3 [CELEBORN-2003] Add retry mechanism when completing S3 multipart upload by nicolas.fraison@datadoghq.com · 6 weeks ago
  43. 2e41877 [CELEBORN-2017][HELM] Add namespace to the metadata by Yi Chen · 7 weeks ago
  44. 211046d [CELEBORN-2025] RpcFailure Scala 2.13 serialization is incompatible by sychen · 7 weeks ago
  45. a24164c [CELEBORN-2026] Skip build tez client by mingji · 7 weeks ago
  46. 73bf154 [CELEBORN-2020][FOLLOWUP] Add --auth-header option to usage of CLI commands by SteNicholas · 7 weeks ago
  47. c0512c6 [CELEBORN-2022] Spark4 Client should package commons-io by sychen · 7 weeks ago
  48. 7bde738 [CELEBORN-2021] Fix issues on regression HDFS and OSS before release 0.6 by mingji · 7 weeks ago
  49. 5a50686 [CELEBORN-2020][FOLLOWUP] Fix CLI master commands authentication testing by Wang, Fei · 7 weeks ago
  50. 7eee202 [CELEBORN-2023] Spark4 Client incompatible with isLocalMaster method by sychen · 7 weeks ago
  51. aceee64 [CELEBORN-2018] Support min number of workers selected for shuffle by Sanskar Modi · 7 weeks ago
  52. 5f58fb1 [CELEBORN-2020] Support http authentication for Celeborn CLI by Wang, Fei · 7 weeks ago
  53. 68a1db1 [CELEBORN-2005][FOLLOWUP] Introduce ShuffleMetricGroup for numBytesIn, numBytesOut, numRecordsOut, numBytesInPerSecond, numBytesOutPerSecond, numRecordsOutPerSecond metrics by SteNicholas · 7 weeks ago
  54. 0227a1a [CELEBORN-1627][FOLLOWUP] Fix the issue where the case of name affects the metrics dashboard by Shuang · 7 weeks ago
  55. 3fb6d5b [CELEBORN-1413][FOLLOWUP] Support dependencies of spark-4.0 profile by SteNicholas · 8 weeks ago
  56. b447307 [CELEBORN-1413][FOLLOWUP] Bump spark 4.0 version to 4.0.0 by Fei Wang · 8 weeks ago
  57. aeac31f [CELEBORN-2009] Commit files request failure should exclude worker in LifecycleManager by Sanskar Modi · 8 weeks ago
  58. c83d498 [CELEBORN-1528][HELM] Use volume claim template to support various storage backend by Yi Chen · 8 weeks ago
  59. 612464c [CELEBORN-2015] Retry IOException failures for RPC requests by Sanskar Modi · 8 weeks ago
  60. 14d7212 [MINOR][DOC] Correct configuration values ​​in slotsallocation by sychen · 8 weeks ago
  61. 0dffcf6 [CELEBORN-2013] Upgrade scala binary version of spark-3.3, spark-3.4, spark-3.5 profile to 2.13.8 by SteNicholas · 8 weeks ago
  62. d65ff56 [CELEBORN-2012] Add license for http5 by Wang, Fei · 8 weeks ago
  63. 48fb71e [CELEBORN-2011][INFRA] Add a script to simplify the process of creating release notes by Fei Wang · 8 weeks ago
  64. 637c423 [CELEBORN-2010][FOLLOWUP] Fix svn staging dir by Fei Wang · 8 weeks ago
  65. 81c3d91 [CELEBORN-2010][INFRA] Add release guide by Fei Wang · 8 weeks ago
  66. 11ca1a7 [CELEBORN-2005] Introduce numBytesIn, numBytesOut, numBytesInPerSecond, numBytesOutPerSecond metrics for RemoteShuffleServiceFactory by SteNicholas · 8 weeks ago
  67. a554261 [CELEBORN-2006] LifecycleManager should avoid parsing shufflePartitionType every time by sychen · 8 weeks ago
  68. 634343e [CELEBORN-2007] Reduce PartitionLocation memory usage by sychen · 8 weeks ago
  69. d2befe0 [CELEBORN-2008] SlotsAllocator should select disks randomly in RoundRobin mode by Xianming Lei · 8 weeks ago
  70. cbf4a14 Bump 0.7.0-SNAPSHOT by Wang, Fei · 9 weeks ago
  71. f7be341 [CELEBORN-1902] Read client throws PartitionConnectionException by Jinqian Fan · 9 weeks ago
  72. 2a847ba [MINOR] Change some config version by Wang, Fei · 9 weeks ago
  73. 082f0dd [CELEBORN-1775][FOLLOWUP] Improve logging around commit files by Sanskar Modi · 9 weeks ago
  74. 45b94bf [CELEBORN-1996][HELM] Rename volumes.{master,worker} to {master,worker}.volumes and {master.worker}.volumeMounts by Yi Chen · 9 weeks ago
  75. 46d9d63 [CELEBORN-1916][FOLLOWUP] Improve Aliyun OSS support by SteNicholas · 9 weeks ago
  76. 0b5a09a [CELEBORN-1896] delete data from failed to fetch shuffles by CodingCat · 9 weeks ago
  77. a7e6387 [CELEBORN-2004] Filter empty partition before createIntputStream by Fei Wang · 9 weeks ago
  78. 90ece96 [CELEBORN-2002][MASTER] Audit shuffle lifecycle in separate log file by Wang, Fei · 9 weeks ago
  79. fd715b4 [CELEBORN-1993] CelebornConf introduces celeborn.<module>.io.threads to specify number of threads used in the client thread pool by SteNicholas · 9 weeks ago
  80. ec62d92 [CELEBORN-2000] Ignore the getReducerFileGroup timeout before shuffle stage end by Wang, Fei · 9 weeks ago
  81. d9984c9 [CELEBORN-1800] Introduce ApplicationTotalCount and ApplicationFallbackCount metric to record the total and fallback count of application by SteNicholas · 9 weeks ago
  82. 4205f83 [CELEBORN-1995] Optimize memory usage for push failed batches by mingji · 9 weeks ago
  83. 062db5b [CELEBORN-1921][FOLLOWUP] Log the GetReducerFileGroupResponse size to provide insights by Wang, Fei · 9 weeks ago
  84. e8ae23b [CELEBORN-1960] Fix PauseSpentTime only append the interval check time by zhengtao · 10 weeks ago
  85. 88124d7 [CELEBORN-1691][FOLLOWUP] Fix the issue that upstream tasks don't rerun and the current task still retry when failed to deserialize in flink by SteNicholas · 10 weeks ago
  86. a9ce411 [CELEBORN-1998] RemoteShuffleEnvironment should not register InputChannelMetrics repeatedly by SteNicholas · 10 weeks ago
  87. d03efcb [CELEBORN-1999] OpenStreamTime should use requestId to record cost time by Xianming Lei · 10 weeks ago
  88. 8e66ac8 [CELEBORN-1994] Introduce disruptor dependency to support asynchronous logging of log4j2 by SteNicholas · 10 weeks ago
  89. eb2449c [CELEBORN-1989][HELM] Split securityContext into master.podSecurityContext and worker.podSecurityContext by Yi Chen · 10 weeks ago
  90. 045411a [CELEBORN-1855] LifecycleManager return appshuffleId for non barrier stage when fetch fail has been reported by lijianfu03 · 10 weeks ago
  91. a547cda [CELEBORN-1974] ApplicationId as metrics label should be behind a config flag by Sanskar Modi · 10 weeks ago
  92. 9ba54b3 [CELEBORN-1968] Publish metric for unreleased partition location count when worker was gracefully shutdown by Sanskar Modi · 10 weeks ago
  93. 3896249 [CELEBORN-1978][CIP-14] Add code style checking for cppClient by HolyLow · 2 months ago
  94. 6ceadd3 [CELEBORN-1487][FOLLOWUP] Fix updateProduceBytes by caohaotian · 2 months ago
  95. c9ca90c [CELEBORN-1965] Rely on all default hadoop providers for S3 auth by Nicolas Fraison · 2 months ago
  96. 06bcc20 [CELEBORN-1988][HELM] Split hostNetwork into master.hostNetwork and worker.hostNetwork by Yi Chen · 2 months ago
  97. 7542adf [CELEBORN-1948] Fix the issue where replica may lose data when HARD_SPLIT occurs during handlePushMergeData by Xianming Lei · 2 months ago
  98. b2c62d4 [CELEBORN-1987][HELM] Split dnsPolicy into master.dnsPolicy and worker.dnsPolicy by Yi Chen · 2 months ago
  99. 74b41bb [CELEBORN-1319][CELEBORN-474][FOLLOWUP] PushState uses JavaUtils#newConcurrentHashMap to speed up ConcurrentHashMap#computeIfAbsent by SteNicholas · 2 months ago
  100. fff9725 [CELEBORN-1760][FOLLOWUP] Remove redundant release on data added in flushBuffer by xinyuwang1 · 2 months ago