Network connectivity issue with AWS Elasticsearch Plugin

253 views
Skip to first unread message

starship

unread,
Feb 23, 2017, 8:51:25 PM2/23/17
to Fluentd Google Group
Hi,

We are using fluentd + AWS Elasticsearch plugin for our cloud hosted software.

After few days AWS Elasticsearch plugin loses network connectivity to AWS Elasticsearch service.

I tried fluent-plugin-aws-elasticsearch-service-hotfix but I still face the same issue.



2017-02-23 00:30:25 -0800 [warn]: temporarily failed to flush the buffer. next_retry=2017-02-23 00:30:26 -0800 error_class="Elasticsearch::Transport::Transport::Error" error="Cannot get new connection from pool." plugin_id="object:3f86c9870854"
  2017-02-23 00:30:25 -0800 [warn]: /opt/td-agent/embedded/lib/ruby/gems/2.1.0/gems/elasticsearch-transport-1.0.18/lib/elasticsearch/transport/transport/base.rb:249:in `perform_request'
  2017-02-23 00:30:25 -0800 [warn]: /opt/td-agent/embedded/lib/ruby/gems/2.1.0/gems/elasticsearch-transport-1.0.18/lib/elasticsearch/transport/transport/http/faraday.rb:20:in `perform_request'
  2017-02-23 00:30:25 -0800 [warn]: /opt/td-agent/embedded/lib/ruby/gems/2.1.0/gems/elasticsearch-transport-1.0.18/lib/elasticsearch/transport/client.rb:128:in `perform_request'
  2017-02-23 00:30:25 -0800 [warn]: /opt/td-agent/embedded/lib/ruby/gems/2.1.0/gems/elasticsearch-api-1.0.18/lib/elasticsearch/api/actions/bulk.rb:90:in `bulk'
  2017-02-23 00:30:25 -0800 [warn]: /opt/td-agent/embedded/lib/ruby/gems/2.1.0/gems/fluent-plugin-elasticsearch-1.9.2/lib/fluent/plugin/out_elasticsearch.rb:353:in `send_bulk'
  2017-02-23 00:30:25 -0800 [warn]: /opt/td-agent/embedded/lib/ruby/gems/2.1.0/gems/fluent-plugin-elasticsearch-1.9.2/lib/fluent/plugin/out_elasticsearch.rb:339:in `write_objects'
  2017-02-23 00:30:25 -0800 [warn]: /opt/td-agent/embedded/lib/ruby/gems/2.1.0/gems/fluentd-0.12.31/lib/fluent/output.rb:490:in `write'
  2017-02-23 00:30:25 -0800 [warn]: /opt/td-agent/embedded/lib/ruby/gems/2.1.0/gems/fluentd-0.12.31/lib/fluent/buffer.rb:354:in `write_chunk'
  2017-02-23 00:30:25 -0800 [warn]: /opt/td-agent/embedded/lib/ruby/gems/2.1.0/gems/fluentd-0.12.31/lib/fluent/buffer.rb:333:in `pop'
  2017-02-23 00:30:25 -0800 [warn]: /opt/td-agent/embedded/lib/ruby/gems/2.1.0/gems/fluentd-0.12.31/lib/fluent/output.rb:342:in `try_flush'
  2017-02-23 00:30:25 -0800 [warn]: /opt/td-agent/embedded/lib/ruby/gems/2.1.0/gems/fluentd-0.12.31/lib/fluent/output.rb:149:in `run'

After this I see many instances of 

2017-02-23 00:39:17 -0800 [warn]: temporarily failed to flush the buffer. next_retry=2017-02-23 00:48:27 -0800 error_class="Elasticsearch::Transport::Transport::Error" error="Cannot get new connection from pool." plugin_id="object:3f86c9870854"
2017-02-23 00:39:17 -0800 [warn]: suppressed same stacktrace

Could anyone please provide a solution/workaround for this problem? We are in a large deployment and we are affected by this issue.
Even though restarting td-agent will solve this problem but it will come back after few days. I can write a cron job to restart td-agent everyday, 
but does docker's fluentd driver buffer the logs for the time td-agent is unavailable? I don't know the answer.

Another question on supportability

        If the owner is not responding or no longer interested in an important plugin like AWS elasticsearch, could treasuredata take over the plugin 
        and support it ?

regards,
Starship


David Wood

unread,
Feb 24, 2017, 12:46:57 AM2/24/17
to Fluentd Google Group

Mr. Fiber

unread,
Feb 24, 2017, 4:43:30 AM2/24/17
to Fluentd Google Group
Could anyone please provide a solution/workaround for this problem?

I want to know you set David refered two options.

If the owner is not responding or no longer interested in an important plugin like AWS elasticsearch,
> could treasuredata take over the plugin and support it ?

No. Fluentd is community based so I hope interesting person / organization take over such plugins.
There are some main maintainer changed plugins: bigquery, azure plugins, redis, etc...
I know fluent-plugin-aws-elasticsearch-service's author is not active.
If someone is interesting in the maintainer, should contact author about new maintainer.

BTW, If you need supportability, Fluentd enterprise is one option.


Masahiro

--
You received this message because you are subscribed to the Google Groups "Fluentd Google Group" group.
To unsubscribe from this group and stop receiving emails from it, send an email to fluentd+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Mr. Fiber

unread,
Feb 24, 2017, 4:46:03 AM2/24/17
to Fluentd Google Group
but does docker's fluentd driver buffer the logs for the time td-agent is unavailable? I don't know the answer.


On Fri, Feb 24, 2017 at 10:51 AM, starship <gow...@gmail.com> wrote:

starship

unread,
Feb 25, 2017, 11:33:27 AM2/25/17
to Fluentd Google Group
Thanks a lot Repeatedly/David.. 

I've added those two parameters to the config file now and will observe it for few days. 

regards,
Starship
To unsubscribe from this group and stop receiving emails from it, send an email to fluentd+u...@googlegroups.com.

starship

unread,
Mar 17, 2017, 11:46:42 PM3/17/17
to Fluentd Google Group
Thanks repeatedly and others. This issue seems to be fixed now. 

regards,
Startship
Reply all
Reply to author
Forward
0 new messages