Hi All,
I have been running Airflow DAGs on an Ubuntu 14.04.2 server for the past few days. Once in a while, tasks fail with the error "'getpwuid(): uid not found: 47035'". I searched online but could not figure out the cause. Has anyone faced a similar issue? The full stack traces from two failing tasks are copied below. Thanks in advance.
[2015-09-24 11:13:20,455] {models.py:959} ERROR - 'getpwuid(): uid not found: 47035'
Traceback (most recent call last):
File "/usr/local/lib/python2.7/dist-packages/airflow/models.py", line 928, in run
result = task_copy.execute(context=context)
File "/usr/local/lib/python2.7/dist-packages/airflow/operators/sensors.py", line 54, in execute
while not self.poke(context):
File "/usr/local/lib/python2.7/dist-packages/airflow/operators/sensors.py", line 224, in poke
sb = Client("XXXX", 8020, use_trash=False)
===========
[2015-09-24 11:13:09,496] {models.py:902} INFO - Executing <Task(PythonOperator): prepare-for-mr> on 2015-09-24 07:00:00
[2015-09-24 11:13:09,507] {models.py:959} ERROR - 'getpwuid(): uid not found: 47035'
Traceback (most recent call last):
File "/usr/local/lib/python2.7/dist-packages/airflow/models.py", line 928, in run
result = task_copy.execute(context=context)
File "/usr/local/lib/python2.7/dist-packages/airflow/operators/python_operator.py", line 51, in execute
return_value = self.python_callable(*self.op_args, **self.op_kwargs)
File "/x/home/vkavuluri/airflow/dags/inbound_pipeline.py", line 69, in prepare_for_mr
sb = hooks.HDFSHook("stampy").get_conn()
File "/usr/local/lib/python2.7/dist-packages/airflow/hooks/hdfs_hook.py", line 35, in get_conn
client = Client(connections[0].host, connections[0].port, use_trash=False)
File "/usr/local/lib/python2.7/dist-packages/snakebite-2.6.1-py2.7.egg/snakebite/client.py", line 109, in __init__
self.service = RpcService(self.service_stub_class, self.port, self.host, hadoop_version, effective_user, self.use_sasl)
File "/usr/local/lib/python2.7/dist-packages/snakebite-2.6.1-py2.7.egg/snakebite/service.py", line 28, in __init__
self.channel = SocketRpcChannel(host=self.host, port=self.port, version=hadoop_version, effective_user=effective_user,use_sasl=use_sasl)
File "/usr/local/lib/python2.7/dist-packages/snakebite-2.6.1-py2.7.egg/snakebite/channel.py", line 184, in __init__
self.effective_user = effective_user or get_current_username()
File "/usr/local/lib/python2.7/dist-packages/snakebite-2.6.1-py2.7.egg/snakebite/platformutils.py", line 11, in get_current_username
return pwd.getpwuid(os.getuid())[0]
KeyError: 'getpwuid(): uid not found: 47035'
[2015-09-24 11:13:09,549] {models.py:995} ERROR - 'getpwuid(): uid not found: 47035'
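
For anyone trying to reproduce this: the last frames of the traceback show snakebite calling pwd.getpwuid(os.getuid()), which raises KeyError when the uid of the running process has no entry in the passwd database. Below is a minimal sketch of a check that could be run as the same user that runs the Airflow tasks. The helper name resolve_username is hypothetical and not part of Airflow or snakebite. Since the failure is intermittent, running it repeatedly may be needed to catch a failed lookup; that the lookup itself is flaky (for example, with a network-backed user directory) is only an assumption here.

```python
import os
import pwd


def resolve_username(uid=None):
    """Return the username for uid, or None if the passwd lookup fails.

    Hypothetical helper: it performs the same lookup as the last frame
    of the traceback, but catches the KeyError instead of propagating it.
    """
    if uid is None:
        uid = os.getuid()
    try:
        return pwd.getpwuid(uid).pw_name
    except KeyError:
        # Same condition as "getpwuid(): uid not found: <uid>"
        return None


if __name__ == "__main__":
    uid = os.getuid()
    name = resolve_username(uid)
    if name is None:
        print("uid %d has no passwd entry" % uid)
    else:
        print("uid %d resolves to %s" % (uid, name))
```

The traceback also shows that snakebite only falls back to this lookup when no effective_user is given (channel.py line 184: effective_user or get_current_username()), so passing effective_user explicitly when constructing Client may sidestep the error. I have not verified that.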