================================================================================================
Dataset Benchmark
================================================================================================
OpenJDK 64-Bit Server VM 17.0.16+8-LTS on Linux 6.11.0-1018-azure
AMD EPYC 7763 64-Core Processor
back-to-back map long:                    Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                5898           5930          46         17.0          59.0       1.0X
DataFrame                                          1234           1271          53         81.1          12.3       4.8X
Dataset                                            1338           1351          19         74.8          13.4       4.4X
OpenJDK 64-Bit Server VM 17.0.16+8-LTS on Linux 6.11.0-1018-azure
AMD EPYC 7763 64-Core Processor
back-to-back map:                         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                7320           7452         188         13.7          73.2       1.0X
DataFrame                                          2788           2803          21         35.9          27.9       2.6X
Dataset                                            7187           7220          46         13.9          71.9       1.0X
OpenJDK 64-Bit Server VM 17.0.16+8-LTS on Linux 6.11.0-1018-azure
AMD EPYC 7763 64-Core Processor
back-to-back filter Long:                 Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                4085           4191         150         24.5          40.8       1.0X
DataFrame                                           719            732          18        139.0           7.2       5.7X
Dataset                                            1592           1597           6         62.8          15.9       2.6X
OpenJDK 64-Bit Server VM 17.0.16+8-LTS on Linux 6.11.0-1018-azure
AMD EPYC 7763 64-Core Processor
back-to-back filter:                      Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                2012           2020          10         49.7          20.1       1.0X
DataFrame                                           119            133          12        837.5           1.2      16.9X
Dataset                                            2449           2452           4         40.8          24.5       0.8X
OpenJDK 64-Bit Server VM 17.0.16+8-LTS on Linux 6.11.0-1018-azure
AMD EPYC 7763 64-Core Processor
aggregate:                                Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD sum                                            1404           1418          20         71.2          14.0       1.0X
DataFrame sum                                        70             84          12       1437.3           0.7      20.2X
Dataset sum using Aggregator                       2046           2057          15         48.9          20.5       0.7X
Dataset complex Aggregator                         5197           5229          45         19.2          52.0       0.3X