Capacity shows 0, 100% Utilized, Jobs not running.

185 views
Skip to first unread message

Dane DeValcourt

unread,
Dec 3, 2021, 1:27:19 PM12/3/21
to AWX Project
Seems like something is not working quite right for me with my installation.

Doesn't appear to be able to read or get cluster data to generate capacity numbers.

Screen Shot 2021-12-03 at 10.35.33 AM.png

Not sure if related to kubernetes service account or something not having access. Any advice on troubleshooting this would be appreciated.  It keeps anything from being able to run it would seem.

Dane DeValcourt

unread,
Dec 3, 2021, 1:39:15 PM12/3/21
to AWX Project
bash-4.4$ awx-manage list_instances
[tower capacity=0 policy=100%]
        awxstg-69b7d5c47-zqtsz capacity=0 node_type=hybrid version=19.5.0

[controlplane capacity=0 policy=100%]
        awxstg-69b7d5c47-zqtsz capacity=0 node_type=hybrid version=19.5.0

[default capacity=0 policy=100%]


Something is preventing it from being able to determine capacity. 

Just can't figure out what that is yet.

Alan Rominger

unread,
Dec 8, 2021, 3:49:44 PM12/8/21
to Dane DeValcourt, AWX Project
In recent releases, the instance model (which you can access directly at endpoint /api/v2/instances/) has a new "errors" field. If you can identify your instance in that list and see that field, it might have a descriptive reason why it failed its periodic health check. In your screenshot I see the last time the health check was run was 12/3 10:27am. A question you want to ask is if that is sufficiently recent. For the main cluster, health checks should repeat once every 20 seconds or so. If it doesn't, then I'd hope to see some more basic errors in the logs.

It's also surprising to see the node_type is hybrid. For the supported install method with the operator it should be "control". I'm curious if there's a problem with the awx-operator, or migrating from a prior release.

Alan
github: AlanCoding


--
You received this message because you are subscribed to the Google Groups "AWX Project" group.
To unsubscribe from this group and stop receiving emails from it, send an email to awx-project...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/awx-project/cdea57e8-ab27-4d72-9a9f-4c7be41c1b50n%40googlegroups.com.

august....@gmail.com

unread,
Jul 1, 2022, 8:50:44 AM7/1/22
to AWX Project
I am also into the similar issue. Can someone help, what will be the fix.
bash-4.4# awx-manage list_instances
[tower capacity=0 policy=100%]
        awx capacity=0 version=17.0.1


May be because of this, my redis container is also not able to start.
redis                "docker-entrypoint.s…"   5 hours ago         Restarting (1) 18 seconds ago

Capture.JPG

AWX Project

unread,
Jul 6, 2022, 4:08:42 PM7/6/22
to AWX Project
Hi,

What is your setup? are you using awx-operator or the development docker based environment? also which AWX version?

In the UI you can run a health check which should re-calculate the capacity for that instance. After running it, can you navigate to the /api/v2/instances/ endpoint and find the "errors" field for that instance? does it report anything there?

AWX Team
Reply all
Reply to author
Forward
0 new messages