unix:///edx/var/supervisor/supervisor.sock no such file !!

792 views
Skip to first unread message

shubham rawat

unread,
Jul 17, 2015, 2:11:27 AM7/17/15
to opene...@googlegroups.com
I am running open edx on an ubuntu 12.04 machine. Recently,I tried to update the my fork of the edx-platform repo with the upstream.However i faced some errors,while running ansible update command.Being on some kind of a deadline, i reverted my fork to its older version and ran ansible again. I faced an error with lxml versions, apparently updated version had installed a newer version of lxml and it was not the one required by the older version of the repo. I installed the correct version and the error was gone.

Now something strange has happened. On running 
sudo /edx/bin/supervisorctl -c /edx/etc/supervisord.conf status
I noticed that the edxapp_workers were STARTING, ran the command again to see they were RUNNING, and again to see that they were STARTING again.

The strangest thing is, when i ran the command again, i get this
unix:///edx/var/supervisor/supervisor.sock no such file

The instance seems to be working fine on the browser.
I have no idea why this happened and appreciate any help.

shubham rawat

unread,
Jul 17, 2015, 2:16:22 AM7/17/15
to opene...@googlegroups.com
update:
I ran: sudo status supervisor found the supervisor was not running,strange!
sudo start supervisor started it again.
But the the process are still fluctuating between STARTING ang RUNNING.I need help regarding that please.

Fred Smith

unread,
Jul 17, 2015, 3:47:44 PM7/17/15
to opene...@googlegroups.com
Hi,

Tail the output files in /edx/var/log/supervisor/:

sudo tail -f /edx/var/log/supervisor/*.log

If there's no interesting output there, look at /edx/var/log/lms/edx.log

Something in that output should explain why the workers are regularly crashing.  It might be an inability to connect to a database or RabbitMQ, or some other misconfiguration.  My bet is that the workers can't reach RabbitMQ.

-Fred


--
You received this message because you are subscribed to the Google Groups "Open edX operations" group.
To unsubscribe from this group and stop receiving emails from it, send an email to openedx-ops...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/openedx-ops/29a4e89a-e7b3-478a-a220-03c9ed79d61c%40googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

shubham rawat

unread,
Jul 19, 2015, 5:12:31 AM7/19/15
to opene...@googlegroups.com
Hi Fred, 
Thanks for replying, I could really use some help with this.
So, the I am pasting the part in /edx/var/log/lms/edx.log that is repeating continously.

Jul 19 14:37:55 baadalservervm [service_variant=lms][dd.dogapi][env:sandbox] INFO [baadalservervm  4708] [dog_stats_api.py:66] - Initializing dog api to use statsd: loca$Jul 19 14:37:56 baadalservervm [service_variant=lms][gunicorn.error][env:sandbox] INFO [baadalservervm  4708] [glogging.py:213] - Starting gunicorn 0.17.4
Jul 19 14:37:56 baadalservervm [service_variant=lms][gunicorn.error][env:sandbox] ERROR [baadalservervm  4708] [glogging.py:207] - Connection in use: ('127.0.0.1', 8000) Jul 19 14:37:56 baadalservervm [service_variant=lms][gunicorn.error][env:sandbox] ERROR [baadalservervm  4708] [glogging.py:207] - Retrying in 1 second.
Jul 19 14:37:57 baadalservervm [service_variant=lms][gunicorn.error][env:sandbox] ERROR [baadalservervm  4708] [glogging.py:207] - Connection in use: ('127.0.0.1', 8000) Jul 19 14:37:57 baadalservervm [service_variant=lms][gunicorn.error][env:sandbox] ERROR [baadalservervm  4708] [glogging.py:207] - Retrying in 1 second.
Jul 19 14:37:58 baadalservervm [service_variant=lms][gunicorn.error][env:sandbox] ERROR [baadalservervm  4708] [glogging.py:207] - Connection in use: ('127.0.0.1', 8000) Jul 19 14:37:58 baadalservervm [service_variant=lms][gunicorn.error][env:sandbox] ERROR [baadalservervm  4708] [glogging.py:207] - Retrying in 1 second.
Jul 19 14:37:59 baadalservervm [service_variant=lms][gunicorn.error][env:sandbox] ERROR [baadalservervm  4708] [glogging.py:207] - Connection in use: ('127.0.0.1', 8000) Jul 19 14:37:59 baadalservervm [service_variant=lms][gunicorn.error][env:sandbox] ERROR [baadalservervm  4708] [glogging.py:207] - Retrying in 1 second.
Jul 19 14:38:00 baadalservervm [service_variant=lms][gunicorn.error][env:sandbox] ERROR [baadalservervm  4708] [glogging.py:207] - Connection in use: ('127.0.0.1', 8000) Jul 19 14:38:00 baadalservervm [service_variant=lms][gunicorn.error][env:sandbox] ERROR [baadalservervm  4708] [glogging.py:207] - Retrying in 1 second.
Jul 19 14:38:01 baadalservervm [service_variant=lms][gunicorn.error][env:sandbox] ERROR [baadalservervm  4708] [glogging.py:207] - Can't connect to ('127.0.0.1', 8000)
Jul 19 14:38:03 baadalservervm [service_variant=lms][dd.dogapi][env:sandbox] INFO [baadalservervm  4782] [dog_stats_api.py:66] - Initializing dog api to use statsd: localhost, 8125

Any idea what should be done about this.

Naresh Makwana

unread,
May 16, 2016, 10:45:20 AM5/16/16
to Open edX operations
sudo rabbitmqctl list_users

Follow these steps if above command doesn't list the user named "celery"

sudo rabbitmqctl add_user celery celery
sudo rabbitmqctl set_permissions celery ".*" ".*" ".*"
sudo service rabbitmq-server restart

manikar...@gmail.com

unread,
May 26, 2017, 4:15:02 PM5/26/17
to Open edX operations

I got the same error. following these steps worked for me.

sudo touch /edx/var/supervisor/supervisor.sock
sudo chmod 777 /edx/var/supervisor/supervisor.sock
sudo service supervisor restart
Reply all
Reply to author
Forward
0 new messages