blob: 0fe10dae4c1df2b37b23f52c3fd200ed577e3f4c [file] [log] [blame]
================================================================================================
Dataset Benchmark
================================================================================================
OpenJDK 64-Bit Server VM 17.0.1+12-LTS on Linux 5.11.0-1022-azure
Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
back-to-back map long: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
RDD 11596 11730 189 8.6 116.0 1.0X
DataFrame 1808 1920 159 55.3 18.1 6.4X
Dataset 2412 2746 473 41.5 24.1 4.8X
OpenJDK 64-Bit Server VM 17.0.1+12-LTS on Linux 5.11.0-1022-azure
Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
back-to-back map: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
RDD 13438 13640 286 7.4 134.4 1.0X
DataFrame 4277 4335 83 23.4 42.8 3.1X
Dataset 14248 14324 107 7.0 142.5 0.9X
OpenJDK 64-Bit Server VM 17.0.1+12-LTS on Linux 5.11.0-1022-azure
Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
back-to-back filter Long: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
RDD 2906 3036 183 34.4 29.1 1.0X
DataFrame 1074 1089 21 93.1 10.7 2.7X
Dataset 3045 3060 21 32.8 30.5 1.0X
OpenJDK 64-Bit Server VM 17.0.1+12-LTS on Linux 5.11.0-1022-azure
Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
back-to-back filter: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
RDD 3884 3996 158 25.7 38.8 1.0X
DataFrame 179 202 21 557.2 1.8 21.6X
Dataset 4582 4655 103 21.8 45.8 0.8X
OpenJDK 64-Bit Server VM 17.0.1+12-LTS on Linux 5.11.0-1022-azure
Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
aggregate: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
RDD sum 3385 3545 226 29.5 33.9 1.0X
DataFrame sum 66 95 15 1505.9 0.7 51.0X
Dataset sum using Aggregator 3162 3222 86 31.6 31.6 1.1X
Dataset complex Aggregator 9086 9116 43 11.0 90.9 0.4X