error when run q20,25,q26,q28

223 views
Skip to first unread message

auge pang

unread,
Mar 25, 2017, 12:02:48 AM3/25/17
to Big Data Benchmark for BigBench
Hi,
I'm currently running BigBench and the queries seem to work just fine, except for the query number 20,25,26. 
The error message of query 20 I've got is the following: 

===============================================
Validating query : q20
-----------------------------------------------
benchmark phase: engine_validation_power_test
stream number  : 0
user parameter file: 
user settings file : 
log: /home/server501/tpcx-bb/logs/q20_hive_engine_validation_power_test_0.log
===============================================
Golden result set file /home/server501/tpcx-bb/engines/hive/queries/q20/results/q20-result not found
Auge Pang's information:Validation of /home/server501/tpcx-bb/engines/hive/queries/q20/results/q20-result and hadoop fs -cat /user/server501/benchmarks/bigbench/queryResults/q20_hive_engine_validation_power_test_0_result/*
17/03/24 20:45:28 WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
Files /home/server501/tpcx-bb/engines/hive/queries/q20/results/q20-result and /dev/fd/62 differ
Validation of /home/server501/tpcx-bb/engines/hive/queries/q20/results/q20-result failed: Query returned incorrect results
Validation failed: Query results are not OK
An error occured while running command:
==========
query_run_validate_method
==========
Please check the log files for details
cat: Unable to write to output stream.
Command: [bash, /home/server501/tpcx-bb/bin/bigBench, validateQuery, -q, 20, -p, engine_validation_power_test, -t, 0, -f, 1, -e, hive, -d, bigbench, -m, 8, -U]
got non-zero return value: 1
BigBench engine validation: Check all query results     stream          0, query 20 ( 1/ 1) failed. Time:          0h:00m:02s:012ms
BigBench engine validation: Check all query results     failed. Time:          0h:00m:02s:016ms
==============
Benchmark run terminated
Reason: An error occured while running a command in phase ENGINE_VALIDATION_RESULT_VALIDATION
==============
java.io.IOException: Error while running query 20. More information in logfile: /home/server501/tpcx-bb/logs/q20_hive_engine_validation_power_test_0.log
at io.bigdatabenchmark.v1.driver.BigBench$QueryThread.call(BigBench.java:1492)
at io.bigdatabenchmark.v1.driver.BigBench$QueryThread.call(BigBench.java:1)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)

************
TPCx-BB
Result
v1.1
************
NOK: Some required BigBench phases were skipped. No final result available.
17/03/24 20:45:30 INFO client.RMProxy: Connecting to ResourceManager at n1/192.168.237.136:8032
17/03/24 20:45:30 WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
[1] 20:45:31 [FAILURE] n2 Exited with error code 1
[2] 20:45:31 [FAILURE] n3 Exited with error code 1
[1] 20:45:31 [FAILURE] n2 Exited with error code 127
[2] 20:45:31 [FAILURE] n3 Exited with error code 127
[1] 20:45:32 [FAILURE] n2 Exited with error code 1
[2] 20:45:32 [FAILURE] n3 Exited with error code 1
q20_hive_engine_validation_power_test_0.log

Michael Frank

unread,
Mar 25, 2017, 8:31:22 AM3/25/17
to Big Data Benchmark for BigBench
Hi Auge Pang,

General descrption:
Your error occured during benchmark phase "engine_validation". The engine_validation phase is a precautious step, introduced to verifiy your cluster and hive/spark are at least capeable to run a Scale factor 1 benchmark (1 GB of data) without errors and produce the correct results, before proceeding to larger scale factors. Failing in this stage usually means, something is wrong with your cluster setup.
The results from this phase are compared against a "golden result set". Each query containes an corresponding "golden result set" stored in:
tpcx-bb/engines/hive/queries/q??/results/q??-result  where ?? in: [01,30]


From your q20_hive_engine_validation_power_test_0.log file is see that following file is missing or not accessible:
/home/server501/tpcx-bb/engines/hive/queries/q20/results/q20-result
 
see 11. line from the bottom in your log file:
Golden result set file /home/server501/tpcx-bb/engines/hive/queries/q20/results/q20-result
 
Missing file is that one: https://github.com/intel-hadoop/Big-Data-Benchmark-for-Big-Bench/blob/master/engines/hive/queries/q20/results/q20-result 

I also see some comments you apparently added yourself. Please make sure you did not manipulate or delete any important files.

Once that is resolved and validation still gives you trouble:
for example we know that due to implementation changes in the machine learning algorithms themselfs, different spark version may produce different sesults
you could skip the engine validation phases by changeing the following line in https://github.com/intel-hadoop/Big-Data-Benchmark-for-Big-Bench/blob/master/conf/bigBench.properties
workload=CLEAN_ALL,ENGINE_VALIDATION_DATA_GENERATION,ENGINE_VALIDATION_LOAD_TEST,ENGINE_VALIDATION_POWER_TEST,ENGINE_VALIDATION_RESULT_VALIDATION,CLEAN_DATA,DATA_GENERATION,BENCHMARK_START,LOAD_TEST,POWER_TEST,THROUGHPUT_TEST_1,BENCHMARK_STOP,VALIDATE_POWER_TEST,VALIDATE_THROUGHPUT_TEST_1
 To:
workload=CLEAN_ALL,CLEAN_DATA,DATA_GENERATION,BENCHMARK_START,LOAD_TEST,POWER_TEST,THROUGHPUT_TEST_1,BENCHMARK_STOP,VALIDATE_POWER_TEST,VALIDATE_THROUGHPUT_TEST_1
Or without manipulating the conf file using the -i option:
 ./bigBench runBenchmark -i "CLEAN_ALL,CLEAN_DATA,DATA_GENERATION,BENCHMARK_START,LOAD_TEST,POWER_TEST,BENCHMARK_STOP,VALIDATE_POWER_TEST" <..other options>

Cheers ,
Michael

auge pang

unread,
Mar 28, 2017, 10:31:50 PM3/28/17
to Big Data Benchmark for BigBench
Hi Michael,
      Thanks you for your help.  But the file /home/server501/tpcx-bb/engines/hive/queries/q20/results/q20-result already exists in the system and has access rights.
      I changed the conf file like following:
         workload=CLEAN_ALL,CLEAN_DATA,DATA_GENERATION,BENCHMARK_START,LOAD_TEST,,THROUGHPUT_TEST_1,BENCHMARK_STOP,VALIDATE_POWER_TEST,VALIDATE_THROUGHPUT_TEST_1
     query 20 also get the following error:

INFO: BigBench validation: Power test results                 stream          0, query 20 (20/30) log file: /home/server501/tpcx-bb/logs/q20_hive_power_test_0.log
Mar 28, 2017 6:20:29 AM io.bigdatabenchmark.v1.driver.BigBench runCmd
INFO: Command: [bash, /home/server501/tpcx-bb/bin/bigBench, validateQuery, -q, 20, -p, power_test, -t, 0, -f, 1, -m, 8, -s, 2, -U]
got non-zero return value: 1
Mar 28, 2017 6:20:29 AM io.bigdatabenchmark.v1.driver.BigBench$QueryThread call
INFO: BigBench validation: Power test results                 stream          0, query 20 (20/30) failed. Time:          0h:00m:04s:543ms
Mar 28, 2017 6:20:29 AM io.bigdatabenchmark.v1.driver.BigBench runQueries
INFO: BigBench validation: Power test results                 failed. Time:          0h:01m:59s:566ms
Mar 28, 2017 6:20:29 AM io.bigdatabenchmark.v1.driver.BigBench run
INFO: ==============
Benchmark run terminated
Reason: An error occured while running a command in phase VALIDATE_POWER_TEST
==============
Mar 28, 2017 6:20:29 AM io.bigdatabenchmark.v1.driver.BigBench computeResult
INFO: 
************
TPCx-BB
Result
v1.1
************
Mar 28, 2017 6:20:29 AM io.bigdatabenchmark.v1.driver.BigBench computeResult
INFO: T_LOAD = 718.722
Mar 28, 2017 6:20:29 AM io.bigdatabenchmark.v1.driver.BigBench computeResult
INFO: T_LD = 0.1 * T_LOAD: 71.8722
Mar 28, 2017 6:20:29 AM io.bigdatabenchmark.v1.driver.BigBench computeResult
INFO: T_PT = 7274.436825570289
Mar 28, 2017 6:20:29 AM io.bigdatabenchmark.v1.driver.BigBench computeResult
INFO: T_T_PUT = 12474.375
Mar 28, 2017 6:20:29 AM io.bigdatabenchmark.v1.driver.BigBench computeResult
INFO: T_TT = 6237.1875
Mar 28, 2017 6:20:29 AM io.bigdatabenchmark.v1.driver.BigBench computeResult
INFO: === Checking validity of the final result ===
Mar 28, 2017 6:20:29 AM io.bigdatabenchmark.v1.driver.BigBench computeResult
INFO: OK: All required BigBench phases were performed.
Mar 28, 2017 6:20:29 AM io.bigdatabenchmark.v1.driver.BigBench computeResult
INFO: OK: All 30 queries were running in the power test.
Mar 28, 2017 6:20:29 AM io.bigdatabenchmark.v1.driver.BigBench computeResult
INFO: OK: All 30 queries were running in the first throughput test.
Mar 28, 2017 6:20:29 AM io.bigdatabenchmark.v1.driver.BigBench computeResult
INFO: OK: Pretend mode was inactive. All commands were executed.
Mar 28, 2017 6:20:29 AM io.bigdatabenchmark.v1.driver.BigBench computeResult
INFO: === Final result ===
Mar 28, 2017 6:20:29 AM io.bigdatabenchmark.v1.driver.BigBench computeResult
INFO: VALID BBQpm@1 = 0.2644046061137735

    

在 2017年3月25日星期六 UTC+8下午8:31:22,Michael Frank写道:
Hi Auge Pang,
         

General descrption:
Your error occured during benchmark phase "engine_validation". The engine_validation phase is a precautious step, introduced to verifiy your cluster and hive/spark are at least capeable to run a Scale factor 1 benchmark (1 GB of data) without errors and produce the correct results, before proceeding to larger scale factors. Failing in this stage usually means, something is wrong with your cluster setup.
The results from this phase are compared against a "golden result set". Each query containes an corresponding "golden result set" stored in:
tpcx-bb/engines/hive/queries/q??/results/q??-result  where ?? in: [01,30]


From your q20_hive_engine_validation_power_test_0.log file is see that following file is missing or not accessible:
/home/server501/tpcx-bb/engines/hive/queries/q20/results/q20-result
 
see 11. line from the bottom in your log file:
Golden result set file /home/server501/tpcx-bb/engines/hive/queries/q20/results/q20-result
 
Missing file is that one: https://github.com/intel-hadoop/Big-Data-Benchmark-for-Big-Bench/blob/master/engines/hive/queries/q20/results/q20-result 

I also see some comments you apparently added yourself. Please make sure you did not manipulate or delete any important files.

Once that is resolved and validation still gives you trouble:
for example we know that due to implementation changes in the machine learning algorithms themselfs, different spark version may produce different sesults
you could skip the engine validation phases by changeing the following line in https://github.com/intel-hadoop/Big-Data-Benchmark-for-Big-Bench/blob/master/conf/bigBench.properties
workload=CLEAN_ALL,ENGINE_VALIDATION_DATA_GENERATION,ENGINE_VALIDATION_LOAD_TEST,ENGINE_VALIDATION_POWER_TEST,ENGINE_VALIDATION_RESULT_VALIDATION,CLEAN_DATA,DATA_GENERATION,BENCHMARK_START,LOAD_TEST,POWER_TEST,THROUGHPUT_TEST_1,BENCHMARK_STOP,VALIDATE_POWER_TEST,VALIDATE_THROUGHPUT_TEST_1
 To:
workload=CLEAN_ALL,CLEAN_DATA,DATA_GENERATION,BENCHMARK_START,LOAD_TEST,,THROUGHPUT_TEST_1,BENCHMARK_STOP,VALIDATE_POWER_TEST,VALIDATE_THROUGHPUT_TEST_1
Reply all
Reply to author
Forward
0 new messages