How to set YARN specific option (yarn.nodemanager.vmem-check-enabled) in EMR

980 views

Skip to first unread message

regata

unread,

Jan 3, 2015, 1:28:26 PM1/3/15

to mr...@googlegroups.com

Hey guys,

I'm seeing the following problem when trying to run a job on EMR using mrjob:

Container [pid=11274,containerID=container_1420305915489_0001_01_000003] is running beyond virtual memory limits. Current usage: 364.7 MB of 1 GB physical memory used; 38.2 GB of 5 GB virtual memory used. Killing container.

AMI version: 3.3.1

Hadoop distribution: Amazon 2.4.0

mrjob: 0.4.2

After googling it was suggested to adjust 2 configuration options:

mapreduce.map.memory.mb: 1024
yarn.nodemanager.vmem-check-enabled: false

I tried using mrjob.conf yaml file:

  emr:
    jobconf:
      yarn.nodemanager.vmem-check-enabled: false
      mapreduce.map.memory.mb: 1024

but only mapreduce.map.memory.mb is being updated.

How do I set yarn.nodemanager.vmem-check-enabled option?

Thanks!

regata

unread,

Jan 8, 2015, 2:08:54 PM1/8/15

to mr...@googlegroups.com

figured it out

  emr:
    bootstrap_actions:
     - >
      s3://elasticmapreduce/bootstrap-actions/configure-hadoop 
      -y yarn.nodemanager.vmem-check-enabled=false 
      -y yarn.nodemanager.vmem-pmem-ratio=50
      -y yarn.nodemanager.resource.memory-mb=12288
      -y yarn.scheduler.minimum-allocation-mb=3057
      -y yarn.app.mapreduce.am.resource.mb=6114

Reply all

Reply to author

Forward

0 new messages