================================================================================================
Dataset Benchmark
================================================================================================

OpenJDK 64-Bit Server VM 11.0.18+10 on Linux 5.15.0-1031-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
back-to-back map long:                    Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                9473           9479           9         10.6          94.7       1.0X
DataFrame                                          1694           1695           2         59.0          16.9       5.6X
Dataset                                            2315           2328          19         43.2          23.1       4.1X

OpenJDK 64-Bit Server VM 11.0.18+10 on Linux 5.15.0-1031-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
back-to-back map:                         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                               10942          10963          30          9.1         109.4       1.0X
DataFrame                                          4275           4299          34         23.4          42.8       2.6X
Dataset                                           12671          12831         226          7.9         126.7       0.9X

OpenJDK 64-Bit Server VM 11.0.18+10 on Linux 5.15.0-1031-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
back-to-back filter Long:                 Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                2638           2707          97         37.9          26.4       1.0X
DataFrame                                           961            975          16        104.1           9.6       2.7X
Dataset                                            2581           2583           2         38.7          25.8       1.0X

OpenJDK 64-Bit Server VM 11.0.18+10 on Linux 5.15.0-1031-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
back-to-back filter:                      Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD                                                3298           3318          28         30.3          33.0       1.0X
DataFrame                                           153            194          32        653.6           1.5      21.6X
Dataset                                            4359           4418          83         22.9          43.6       0.8X

OpenJDK 64-Bit Server VM 11.0.18+10 on Linux 5.15.0-1031-azure
Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
aggregate:                                Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
RDD sum                                            3067           3079          18         32.6          30.7       1.0X
DataFrame sum                                        71            110          30       1416.7           0.7      43.4X
Dataset sum using Aggregator                       4577           4729         215         21.8          45.8       0.7X
Dataset complex Aggregator                         8639           8694          77         11.6          86.4       0.4X


