2016-01-07 14:22:50,641 INFO com.amazon.ws.emr.hadoop.fs.EmrFileSystem (main): Consistency disabled, using com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem as filesystem implementation
2016-01-07 14:22:51,140 INFO amazon.emr.metrics.MetricsSaver (main): MetricsConfigRecord disabledInCluster: false instanceEngineCycleSec: 60 clusterEngineCycleSec: 60 disableClusterEngine: false
2016-01-07 14:22:51,142 INFO amazon.emr.metrics.MetricsSaver (main): Created MetricsSaver j-3CMF50FKOQQBH:i-85185508:RunJar:03217 period:60 /mnt/var/em/raw/i-85185508_20160107_null_00000_raw.bin
2016-01-07 14:22:52,636 INFO cascading.flow.hadoop.util.HadoopUtil (main): resolving application jar from found main method on: com.snowplowanalytics.snowplow.storage.hadoop.JobRunner$
2016-01-07 14:22:52,639 INFO cascading.flow.hadoop.planner.HadoopPlanner (main): using application jar: /mnt/var/lib/hadoop/steps/s-RY37C99HWM8L/hadoop-elasticsearch-sink-0.1.0.jar
2016-01-07 14:22:52,655 INFO cascading.property.AppProps (main): using
app.id: 2C9A9D9B92BC48F8B4A6522BB4E301F4
2016-01-07 14:22:53,259 INFO org.elasticsearch.hadoop.cascading.EsTap (main): Elasticsearch Hadoop v2.2.0.BUILD-SNAPSHOT [a51c2f7f94] initialized
2016-01-07 14:22:53,363 INFO org.apache.hadoop.conf.Configuration.deprecation (main): mapred.used.genericoptionsparser is deprecated. Instead, use mapreduce.client.genericoptionsparser.used
2016-01-07 14:22:53,465 INFO org.apache.hadoop.conf.Configuration.deprecation (main): mapred.output.dir is deprecated. Instead, use mapreduce.output.fileoutputformat.outputdir
2016-01-07 14:22:53,654 INFO cascading.util.Version (flow com.snowplowanalytics.snowplow.storage.hadoop.ElasticsearchJob): Concurrent, Inc - Cascading 2.6.0
2016-01-07 14:22:53,666 INFO cascading.flow.Flow (flow com.snowplowanalytics.snowplow.storage.hadoop.ElasticsearchJob): [com.snowplowanalytics....] starting
2016-01-07 14:22:53,667 INFO cascading.flow.Flow (flow com.snowplowanalytics.snowplow.storage.hadoop.ElasticsearchJob): [com.snowplowanalytics....] source: Hfs["TextLine[['offset', 'line']->[ALL]]"]["s3://ig-etl/out/enriched/bad/run=2016-01-06-22-04-03"]
2016-01-07 14:22:53,667 INFO cascading.flow.Flow (flow com.snowplowanalytics.snowplow.storage.hadoop.ElasticsearchJob): [com.snowplowanalytics....] sink: EsHadoopTap["EsHadoopScheme[[UNKNOWN]->['output']]"]["snowplow/bad_rows"]
2016-01-07 14:22:53,668 INFO cascading.flow.Flow (flow com.snowplowanalytics.snowplow.storage.hadoop.ElasticsearchJob): [com.snowplowanalytics....] parallel execution is enabled: true
2016-01-07 14:22:53,668 INFO cascading.flow.Flow (flow com.snowplowanalytics.snowplow.storage.hadoop.ElasticsearchJob): [com.snowplowanalytics....] starting jobs: 1
2016-01-07 14:22:53,668 INFO cascading.flow.Flow (flow com.snowplowanalytics.snowplow.storage.hadoop.ElasticsearchJob): [com.snowplowanalytics....] allocating threads: 1
2016-01-07 14:22:53,677 INFO cascading.flow.FlowStep (pool-4-thread-1): [com.snowplowanalytics....] starting step: (1/1) snowplow/bad_rows
2016-01-07 14:22:53,765 INFO org.apache.hadoop.yarn.client.RMProxy (pool-4-thread-1): Connecting to ResourceManager at /
172.31.13.22:90222016-01-07 14:22:54,021 INFO org.apache.hadoop.yarn.client.RMProxy (pool-4-thread-1): Connecting to ResourceManager at /
172.31.13.22:90222016-01-07 14:22:54,036 WARN org.elasticsearch.hadoop.mr.EsOutputFormat (pool-4-thread-1): Speculative execution enabled for reducer - consider disabling it to prevent data corruption
2016-01-07 14:22:55,013 INFO amazon.emr.metrics.MetricsSaver (DataStreamer for file /tmp/hadoop-yarn/staging/hadoop/.staging/job_1452176446003_0001/job.jar block BP-959782088-172.31.13.22-1452176412707:blk_1073741828_1004): 1 aggregated HDFSWriteDelay 418 raw values into 1 aggregated values, total 1
2016-01-07 14:22:55,715 INFO com.hadoop.compression.lzo.GPLNativeCodeLoader (pool-4-thread-1): Loaded native gpl library from the embedded binaries
2016-01-07 14:22:55,719 INFO com.hadoop.compression.lzo.LzoCodec (pool-4-thread-1): Successfully loaded & initialized native-lzo library [hadoop-lzo rev 77cfa96225d62546008ca339b7c2076a3da91578]
2016-01-07 14:22:55,780 INFO com.amazon.ws.emr.hadoop.fs.s3n.S3NativeFileSystem (pool-4-thread-1): listStatus s3://ig-etl/out/enriched/bad/run=2016-01-06-22-04-03 with recursive false
2016-01-07 14:22:56,267 INFO org.apache.hadoop.mapred.FileInputFormat (pool-4-thread-1): Total input paths to process : 30
2016-01-07 14:22:56,462 INFO org.apache.hadoop.mapreduce.JobSubmitter (pool-4-thread-1): number of splits:38
2016-01-07 14:22:57,024 INFO org.apache.hadoop.mapreduce.JobSubmitter (pool-4-thread-1): Submitting tokens for job: job_1452176446003_0001
2016-01-07 14:22:57,592 INFO org.apache.hadoop.yarn.client.api.impl.YarnClientImpl (pool-4-thread-1): Submitted application application_1452176446003_0001
2016-01-07 14:22:57,664 INFO cascading.flow.FlowStep (pool-4-thread-1): [com.snowplowanalytics....] submitted hadoop job: job_1452176446003_0001
2016-01-07 14:23:21,028 INFO amazon.emr.metrics.MetricsSaver (Thread-3): MetricsSaver j-3CMF50FKOQQBH:i-85185508:RunJar:03217 metricFile /mnt/var/em/raw/i-85185508_20160107_null_00000_raw.bin
2016-01-07 14:23:21,031 INFO amazon.emr.metrics.MetricsSaver (Thread-3): Saved 8:95 records to /mnt/var/em/raw/i-85185508_20160107_RunJar_03217_raw.bin
2016-01-07 14:23:23,861 INFO cascading.util.Update (UpdateRequestTimer): newer Cascading release available: 2.6.3
2016-01-07 14:23:51,028 INFO amazon.emr.metrics.MetricsSaver (Thread-3): Saved 8:15 records to /mnt/var/em/raw/i-85185508_20160107_RunJar_03217_raw.bin
2016-01-07 14:24:21,028 INFO amazon.emr.metrics.MetricsSaver (Thread-3): Saved 8:22 records to /mnt/var/em/raw/i-85185508_20160107_RunJar_03217_raw.bin
2016-01-07 14:24:51,029 INFO amazon.emr.metrics.MetricsSaver (Thread-3): Saved 8:22 records to /mnt/var/em/raw/i-85185508_20160107_RunJar_03217_raw.bin
2016-01-07 14:25:21,028 INFO amazon.emr.metrics.MetricsSaver (Thread-3): Saved 8:22 records to /mnt/var/em/raw/i-85185508_20160107_RunJar_03217_raw.bin
2016-01-07 14:25:51,028 INFO amazon.emr.metrics.MetricsSaver (Thread-3): Saved 8:22 records to /mnt/var/em/raw/i-85185508_20160107_RunJar_03217_raw.bin
2016-01-07 14:26:21,028 INFO amazon.emr.metrics.MetricsSaver (Thread-3): Saved 8:22 records to /mnt/var/em/raw/i-85185508_20160107_RunJar_03217_raw.bin
2016-01-07 14:26:51,032 INFO amazon.emr.metrics.MetricsSaver (Thread-3): Saved 8:22 records to /mnt/var/em/raw/i-85185508_20160107_RunJar_03217_raw.bin
2016-01-07 14:27:21,028 INFO amazon.emr.metrics.MetricsSaver (Thread-3): Saved 8:22 records to /mnt/var/em/raw/i-85185508_20160107_RunJar_03217_raw.bin
2016-01-07 14:27:43,416 WARN cascading.flow.FlowStep (pool-4-thread-1): [com.snowplowanalytics....] hadoop job job_1452176446003_0001 state at FAILED
2016-01-07 14:27:43,418 WARN cascading.flow.FlowStep (pool-4-thread-1): [com.snowplowanalytics....] failure info: Task failed task_1452176446003_0001_m_000027
Job failed as tasks failed. failedMaps:1 failedReduces:0
2016-01-07 14:27:43,468 WARN cascading.flow.FlowStep (pool-4-thread-1): [com.snowplowanalytics....] task completion events identify failed tasks
2016-01-07 14:27:43,468 WARN cascading.flow.FlowStep (pool-4-thread-1): [com.snowplowanalytics....] task completion events count: 10
2016-01-07 14:27:43,470 WARN cascading.flow.FlowStep (pool-4-thread-1): [com.snowplowanalytics....] event = Task Id : attempt_1452176446003_0001_m_000001_0, Status : SUCCEEDED
2016-01-07 14:27:43,470 WARN cascading.flow.FlowStep (pool-4-thread-1): [com.snowplowanalytics....] event = Task Id : attempt_1452176446003_0001_m_000003_0, Status : SUCCEEDED
2016-01-07 14:27:43,470 WARN cascading.flow.FlowStep (pool-4-thread-1): [com.snowplowanalytics....] event = Task Id : attempt_1452176446003_0001_m_000005_0, Status : SUCCEEDED
2016-01-07 14:27:43,470 WARN cascading.flow.FlowStep (pool-4-thread-1): [com.snowplowanalytics....] event = Task Id : attempt_1452176446003_0001_m_000007_0, Status : SUCCEEDED
2016-01-07 14:27:43,470 WARN cascading.flow.FlowStep (pool-4-thread-1): [com.snowplowanalytics....] event = Task Id : attempt_1452176446003_0001_m_000002_0, Status : SUCCEEDED
2016-01-07 14:27:43,471 WARN cascading.flow.FlowStep (pool-4-thread-1): [com.snowplowanalytics....] event = Task Id : attempt_1452176446003_0001_m_000008_0, Status : SUCCEEDED
2016-01-07 14:27:43,471 WARN cascading.flow.FlowStep (pool-4-thread-1): [com.snowplowanalytics....] event = Task Id : attempt_1452176446003_0001_m_000004_0, Status : SUCCEEDED
2016-01-07 14:27:43,471 WARN cascading.flow.FlowStep (pool-4-thread-1): [com.snowplowanalytics....] event = Task Id : attempt_1452176446003_0001_m_000009_0, Status : SUCCEEDED
2016-01-07 14:27:43,471 WARN cascading.flow.FlowStep (pool-4-thread-1): [com.snowplowanalytics....] event = Task Id : attempt_1452176446003_0001_m_000006_0, Status : SUCCEEDED
2016-01-07 14:27:43,471 WARN cascading.flow.FlowStep (pool-4-thread-1): [com.snowplowanalytics....] event = Task Id : attempt_1452176446003_0001_m_000000_0, Status : SUCCEEDED
2016-01-07 14:27:43,485 INFO cascading.flow.Flow (flow com.snowplowanalytics.snowplow.storage.hadoop.ElasticsearchJob): [com.snowplowanalytics....] stopping all jobs
2016-01-07 14:27:43,486 INFO cascading.flow.FlowStep (flow com.snowplowanalytics.snowplow.storage.hadoop.ElasticsearchJob): [com.snowplowanalytics....] stopping: (1/1) snowplow/bad_rows
2016-01-07 14:27:43,489 INFO cascading.flow.Flow (flow com.snowplowanalytics.snowplow.storage.hadoop.ElasticsearchJob): [com.snowplowanalytics....] stopped all jobs
2016-01-07 14:27:43,508 INFO amazon.emr.metrics.MetricsSaver (Thread-4): Saved 8:15 records to /mnt/var/em/raw/i-85185508_20160107_RunJar_03217_raw.bin