blob: c22118f91b3fcde6cc8a7af9d43965e38c799381 [file] [log] [blame]
================================================================================================
Benchmark for performance of JSON parsing
================================================================================================
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
JSON schema inferring: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
No encoding 63839 64000 263 1.6 638.4 1.0X
UTF-8 is set 124633 124945 429 0.8 1246.3 0.5X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
count a short column: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
No encoding 51720 51901 157 1.9 517.2 1.0X
UTF-8 is set 91161 91190 25 1.1 911.6 0.6X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
count a wide column: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
No encoding 58486 59038 714 0.2 5848.6 1.0X
UTF-8 is set 103045 103350 358 0.1 10304.5 0.6X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
select wide row: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
No encoding 134909 135024 105 0.0 269818.6 1.0X
UTF-8 is set 154418 154593 155 0.0 308836.7 0.9X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Select a subset of 10 columns: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
Select 10 columns 19538 19620 70 0.5 1953.8 1.0X
Select 1 column 26142 26159 15 0.4 2614.2 0.7X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
creation of JSON parser per line: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
Short column without encoding 8103 8162 53 1.2 810.3 1.0X
Short column with UTF-8 13104 13150 58 0.8 1310.4 0.6X
Wide column without encoding 135280 135593 375 0.1 13528.0 0.1X
Wide column with UTF-8 175189 175483 278 0.1 17518.9 0.0X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
JSON functions: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
Text read 1225 1234 8 8.2 122.5 1.0X
from_json 22482 22552 95 0.4 2248.2 0.1X
json_tuple 30203 30338 146 0.3 3020.3 0.0X
get_json_object 22219 22245 26 0.5 2221.9 0.1X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Dataset of json strings: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
Text read 5897 5904 10 8.5 117.9 1.0X
schema inferring 30282 30340 50 1.7 605.6 0.2X
parsing 33304 33577 289 1.5 666.1 0.2X
Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Json files in the per-line mode: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
Text read 9710 9757 80 5.1 194.2 1.0X
Schema inferring 35929 35939 9 1.4 718.6 0.3X
Parsing without charset 39175 39227 87 1.3 783.5 0.2X
Parsing with UTF-8 59188 59294 109 0.8 1183.8 0.2X
OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Write dates and timestamps: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
Create a dataset of timestamps 1967 1977 9 5.1 196.7 1.0X
to_json(timestamp) 17086 17304 371 0.6 1708.6 0.1X
write timestamps to files 12691 12716 28 0.8 1269.1 0.2X
Create a dataset of dates 2192 2217 39 4.6 219.2 0.9X
to_json(date) 10541 10656 137 0.9 1054.1 0.2X
write dates to files 7259 7311 46 1.4 725.9 0.3X
OpenJDK 64-Bit Server VM 1.8.0_252-8u252-b09-1~18.04-b09 on Linux 4.15.0-1063-aws
Intel(R) Xeon(R) CPU E5-2670 v2 @ 2.50GHz
Read dates and timestamps: Best Time(ms) Avg Time(ms) Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
read timestamp text from files 2318 2326 13 4.3 231.8 1.0X
read timestamps from files 43345 43627 258 0.2 4334.5 0.1X
infer timestamps from files 89570 89621 59 0.1 8957.0 0.0X
read date text from files 2099 2107 9 4.8 209.9 1.1X
read date from files 18000 18065 98 0.6 1800.0 0.1X
timestamp strings 3937 3956 32 2.5 393.7 0.6X
parse timestamps from Dataset[String] 56001 56429 539 0.2 5600.1 0.0X
infer timestamps from Dataset[String] 109410 109963 559 0.1 10941.0 0.0X
date strings 4530 4540 9 2.2 453.0 0.5X
parse dates from Dataset[String] 29723 29767 72 0.3 2972.3 0.1X
from_json(timestamp) 74106 74619 728 0.1 7410.6 0.0X
from_json(date) 46599 46632 32 0.2 4659.9 0.0X