Failing on the first Map Task

107 views
Skip to first unread message

dwayne lessner

unread,
Jan 20, 2016, 4:52:13 PM1/20/16
to Big Data Benchmark for BigBench
I was trying to get past one error and have ran into another. Anytime my cluster starts to reduce it fails. Any thoughts?




Hive Settings I have changed


-- set mapreduce.input.fileinputformat.split.minsize=1048576;
-- set mapreduce.input.fileinputformat.split.maxsize=134217728;

-- ###########################
-- reducer settings
-- ###########################
-- Number of reducers used by HIVE
-- hives metric for estimating reducers is mostly controlled by the following settings. Node: Some Query functions like count(*) or Distinct will lead to hive always using only 1 reducer
-- 1GB default
-- set hive.exec.reducers.bytes.per.reducer=1000000



error below

Number of reduce tasks not specified. Estimated from input data size: 1
In order to change the average load for a reducer (in bytes):
  set hive.exec.reducers.bytes.per.reducer=<number>
In order to limit the maximum number of reducers:
  set hive.exec.reducers.max=<number>
In order to set a constant number of reducers:
  set mapreduce.job.reduces=<number>
Starting Job = job_1453316369292_0129, Tracking URL = http://cdh-mnode-3.tenanta.com:8088/proxy/application_1453316369292_0129/
Kill Command = /opt/cloudera/parcels/CDH-5.5.1-1.cdh5.5.1.p0.11/lib/hadoop/bin/hadoop job  -kill job_1453316369292_0129
Hadoop job information for Stage-2: number of mappers: 5; number of reducers: 1
2016-01-20 16:18:44,055 Stage-2 map = 0%,  reduce = 0%
Ended Job = job_1453316369292_0129 with errors
Error during job, obtaining debugging information...
FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.mr.MapRedTask
MapReduce Jobs Launched:
Stage-Stage-2: Map: 5  Reduce: 1   FAIL
Total MapReduce CPU Time Spent: -1 msec
WARN: The method class org.apache.commons.logging.impl.SLF4JLogFactory#release() was invoked.
WARN: Please see http://www.slf4j.org/codes.html#release for an explanation.
An error occured while running command:
==========
runEngineCmd -f /root/Big-Data-Benchmark-for-Big-Bench/engines/hive/queries/q01/q01.sql
==========
Please check the log files for details
======= q01_hive_power_test_0 time =======
Start timestamp: 2016/01/20:16:18:23 1453324703
Stop  timestamp: 2016/01/20:16:18:44 1453324724
Duration:  0h 0m 21s
q01_hive_power_test_0 FAILED exit code: 2
----- result -----
EMPTY  bytes: 0
to display: hadoop fs -cat /user/root/benchmarks/bigbench/queryResults/q01_hive_power_test_0_result/*
----- logs -----


Michael Frank

unread,
Jan 20, 2016, 5:08:52 PM1/20/16
to Big Data Benchmark for BigBench
Hi dwayne,

 
Execution Error, return code 2 from org.apache.hadoop.hive.ql.
exec.mr.MapRedTask

is a very unspecific hive error. You need to investigate the task specific logs of your job for further detail:

Starting Job = job_1453316369292_0129, Tracking URL = http://cdh-mnode-3.tenanta.com:8088/proxy/application_1453316369292_0129/
to search for the real error. Additionally you can check the hive log files of your nodes: /tmp/<user.name>/hive.log
You will probably find some sort of exception in the logs, which will tell you what went wrong.

Cheers,
Michael


 

dwayne lessner

unread,
Jan 21, 2016, 1:27:40 PM1/21/16
to Big Data Benchmark for BigBench
thanks for the advice. Much appreciated

-DL

Michael Frank

unread,
Jan 22, 2016, 9:07:15 AM1/22/16
to Big Data Benchmark for BigBench
Hi Dwanye,
can you share what exception was causing your 'FAILED: Execution Error, return code 2'  and which distribution and hive version you are running with?
This would be helpful for the other folks reading along this group and may running into the same issue.

Cheers,
Michael

dwayne lessner

unread,
Jan 22, 2016, 11:15:14 AM1/22/16
to Big Data Benchmark for BigBench
I was using hive-0.12.0+cdh5.1.5+413
Reply all
Reply to author
Forward
0 new messages