Spark jobs not being able read from the Spark history server

90 views
Skip to first unread message

nikhil bejugama

unread,
Feb 14, 2017, 1:51:56 PM2/14/17
to dr-elephant-users


Hi,

I have set the SPARK_HOME and SPARK_CONF_DIR but I couldn't Spark applications analysed by the Dr E.

02-14-2017 11:49:36 INFO  [Thread-6] com.linkedin.drelephant.ElephantRunner : Dr.elephant has started
02-14-2017 11:49:36 INFO  [Thread-6] com.linkedin.drelephant.security.HadoopSecurity : This cluster is Kerberos enabled.
02-14-2017 11:49:36 INFO  [Thread-6] com.linkedin.drelephant.security.HadoopSecurity : No login user. Creating login user
02-14-2017 11:49:36 INFO  [Thread-6] com.linkedin.drelephant.security.HadoopSecurity : Logging with xxxx/xxxx@xxxx and xxxx
02-14-2017 11:49:36 INFO  [Thread-6] com.linkedin.drelephant.security.HadoopSecurity : Logged in with user xxxx/xxxx@xxxx (auth:KERBEROS)
02-14-2017 11:49:36 INFO  [Thread-6] com.linkedin.drelephant.security.HadoopSecurity : Login is keytab based
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.analysis.HDFSContext : HDFS BLock size: 134217728
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.util.Utils : Loading configuration file AggregatorConf.xml
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.util.Utils : Configuation file loaded. File: AggregatorConf.xml
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.ElephantContext : Load Aggregator : com.linkedin.drelephant.mapreduce.MapReduceMetricsAggregator
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.ElephantContext : Load Aggregator : com.linkedin.drelephant.spark.SparkMetricsAggregator
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.util.Utils : Loading configuration file FetcherConf.xml
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.util.Utils : Configuation file loaded. File: FetcherConf.xml
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.mapreduce.fetchers.MapReduceFetcherHadoop2 : Connecting to the job history server at xxxx:19888...
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.mapreduce.fetchers.MapReduceFetcherHadoop2 : Connection success.
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.ElephantContext : Load Fetcher : com.linkedin.drelephant.mapreduce.fetchers.MapReduceFetcherHadoop2
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.ElephantContext : Load Fetcher : com.linkedin.drelephant.spark.fetchers.SparkFetcher
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.util.Utils : Loading configuration file HeuristicConf.xml
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.util.Utils : Configuation file loaded. File: HeuristicConf.xml
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.mapreduce.heuristics.GenericDataSkewHeuristic : Mapper Data Skew will use num_tasks_severity with the following threshold settings: [10.0, 50.0, 100.0, 200.0]
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.mapreduce.heuristics.GenericDataSkewHeuristic : Mapper Data Skew will use deviation_severity with the following threshold settings: [2.0, 4.0, 8.0, 16.0]
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.mapreduce.heuristics.GenericDataSkewHeuristic : Mapper Data Skew will use files_severity with the following threshold settings: [0.125, 0.25, 0.5, 1.0]
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.ElephantContext : Load Heuristic : com.linkedin.drelephant.mapreduce.heuristics.MapperDataSkewHeuristic
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.ElephantContext : Load View : views.html.help.mapreduce.helpMapperDataSkew
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.mapreduce.heuristics.GenericGCHeuristic : Mapper GC will use gc_ratio_severity with the following threshold settings: [0.01, 0.02, 0.03, 0.04]
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.mapreduce.heuristics.GenericGCHeuristic : Mapper GC will use runtime_severity_in_min with the following threshold settings: [5.0, 10.0, 12.0, 15.0]
02-14-2017 11:49:37 INFO  [Thread-6] com.linkedin.drelephant.ElephantContext : Load Heuristic : com.linkedin.drelephant.mapreduce.heuristics.MapperGCHeuristic


INFO  org.apache.spark.deploy.history.SparkFSFetcher$ : Looking for spark logs at logDir:xxxx
org.apache.spark.deploy.history.SparkFSFetcher$ : The event log limit of Spark application is set to 100.0 MB

I couldn't find the lines such as above in the dr_elephant.log. Could you please let me know if I'm missing anything.

Thanks


Fawze Abujaber

unread,
Mar 20, 2018, 5:09:45 PM3/20/18
to dr-elephant-users
Hi,

Were you able to get this work?
Reply all
Reply to author
Forward
0 new messages