Continuing the comparison between Hive-Tez and Hive-MR3, here is the result of concurrent tests.
We submit the first 50 queries of TPC-DS (query 1 to query 50) at once. We measure the execution time of the longest running query, which is always query 23 (consisting of query 23-1 and query 23-2) for both Hive-Tez and Hive-MR3.
Hive-Tez:
2906.999 seconds (query 23-1: 2254.654 seconds + query 23-2: 652.345 seconds)
A total of 50 Tez DAGAppMasters, each with 4GB, are created.
Hive-MR3:
first batch:
792.962 seconds (608.746 + 184.216)
second batch:
828.303 seconds (639.835 + 188.468)
third batch:
809.656 seconds (620.914 + 188.742)
We allocate 64GB memory to MR3 DAGAppMaster.
We see that Hive-MR3 yields about 3.5 times higher throughput than Hive-Tez.
--- Sungwoo