td-agent-bit extreme CPU usage

144 views
Skip to first unread message

Guy Knights

unread,
Feb 22, 2021, 4:15:22 PM2/22/21
to Fluent Bit
I've recently been trying out Fluent Bit to send logs to a loki server, and for the most part it's working well. However, we do currently have an issue where the td-agent-bit process seems to intermittently get into a state where it consumes up to 99% CPU contintuously until it's shut down and started again.

We've had a few instances with Loki where it has crashed for an extended period so I thought the 2 issues might be related, but even then when Loki has been stopped for a while not all servers running Fluent Bit have this issue. It seems sort of random.

Are there any known problems with Fluent Bit that could cause this? I've unfortunately had little luck finding information about any such issues. For the record, we are running td-agent-bit v1.7.0.

Thanks,
Guy

Eduardo Silva

unread,
Feb 22, 2021, 4:27:56 PM2/22/21
to Guy Knights, Fluent Bit
hi Guy, 

would you mind sharing your full configuration and if possible, steps to reproduce the issue ?

best

--
You received this message because you are subscribed to the Google Groups "Fluent Bit" group.
To unsubscribe from this group and stop receiving emails from it, send an email to fluent-bit+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/fluent-bit/1ff291de-b298-4e25-b76c-fd4ea45e3893n%40googlegroups.com.


--

Guy Knights

unread,
Feb 22, 2021, 5:28:41 PM2/22/21
to Eduardo Silva, Fluent Bit
Hi Eduardo,

Here is a pastebin link to the config:


Unfortunately I really have no idea what is causing this and therefore I don't know how to trigger it. I did mention that, previously, it seems like Loki crashing for periods of time might have something to do with it but Loki has been up and running all day and I've already had to restart td-agent-bit on 2 separate server instances that were both consuming upwards of 90% CPU at different times.

I'll try upping the td-agent log level and see what I can get out of that.

Thanks,
Guy
--
Guy Knights • Senior Systems Engineer
c: 778-996-2687p: 778-379-5120
   

Eduardo Silva

unread,
Mar 4, 2021, 4:43:15 PM3/4/21
to Guy Knights, Fluent Bit
We have just shipped v1.7.2 but I am not confident any of the fixes are associated with this.

If you are able to reproduce the problem or reduce the number of inputs to detect a minimal case would be useful for troubleshooting
Reply all
Reply to author
Forward
0 new messages