Thinking about more modules -- what monitoring systems do you use?

800 views
Skip to first unread message

Michael DeHaan

unread,
May 25, 2013, 1:21:08 AM5/25/13
to ansible...@googlegroups.com
The push for notification modules yielded IRC, hipchat, Flowdock, and Campfire notification modules, plus mqtt --

On the monitoring side, we already have modules in place for nagios, airbrake, and newrelic.

I have it on very good authority (cough) that modules for AppDynamics, Pingdom, and Pagerduty are coming soon :)

Thinking ahead, do you have a monitoring system that has either outage window or deployment notification support that we should include?  (For those that aren't compatible with the nagios module, anyway?).  

I'd definitely like to expand this list.

Does it make sense to have modules for any of the newer systems?   Graphite/etc tend to be more about trending, so a playbook to set the up may be sufficient, I'm not sure if modules would be needed?


--
Michael DeHaan <mic...@ansibleworks.com>
CTO, AnsibleWorks, Inc.
http://www.ansibleworks.com/

Mark Mandel

unread,
May 25, 2013, 1:29:41 AM5/25/13
to ansible...@googlegroups.com

On Sat, May 25, 2013 at 3:21 PM, Michael DeHaan <mic...@ansibleworks.com> wrote:
Pingdom, and Pagerduty

Very excited for these, these are exactly what we use :)

Serge van Ginderachter

unread,
May 25, 2013, 5:16:35 AM5/25/13
to ansible...@googlegroups.com

On 25 May 2013 07:21, Michael DeHaan <mic...@ansibleworks.com> wrote:
Thinking ahead, do you have a monitoring system that has either outage window or deployment notification support that we should include?  (For those that aren't compatible with the nagios module, anyway?).  

I'm not working on monitoring directly myself at $work, but we ​have a Zabbix (http://www.zabbix.org) setup.
I think this could definitely use a module. Zabbix has an API, and there are a couple of python libraries (https://www.zabbix.org/wiki/Docs/api/libraries#Python).


  Serge



Lester Wade

unread,
May 25, 2013, 5:25:24 AM5/25/13
to ansible...@googlegroups.com


--
You received this message because you are subscribed to the Google Groups "Ansible Project" group.
To unsubscribe from this group and stop receiving emails from it, send an email to ansible-proje...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
 
 

Michael DeHaan

unread,
May 25, 2013, 8:47:59 AM5/25/13
to ansible...@googlegroups.com
Sounds good should someone want to add this, we'll take it!

Looks like Zabbix outage windows like Nagios would be workable.

If that happens though, let's make sure it uses a library that is available in pip, and try to approximately parallel the nagios module parameter conventions, if possible.




Edgars

unread,
May 25, 2013, 10:12:51 AM5/25/13
to ansible...@googlegroups.com
+1 Zabbix

TextEditor

unread,
May 25, 2013, 11:40:34 AM5/25/13
to ansible...@googlegroups.com
I also give a vote to Zabbix. Currently using that, Cacti and Icinga, but Icinga works via the Nagios module, because it's just another clone. But Zabbix gets my vote. 

Op zaterdag 25 mei 2013 07:21:08 UTC+2 schreef Michael DeHaan het volgende:

Romeo Theriault

unread,
May 25, 2013, 4:00:45 PM5/25/13
to ansible...@googlegroups.com
We also run Zabbix at $work. I've been using ansible's 'uri' module to interact with zabbix via it's json-rpc api, though a dedicated module would certainly be cleaner. Since using the uri module to do the work involves a multi-step process of storing login authid in register, etc...

While there are several un-official python zabbix api's around, if I was writing it I'd probably just use a generic json-rpc module and go from there since last I looked the un-official api's weren't maintained very well... (this may have changed).

Romeo


--
You received this message because you are subscribed to the Google Groups "Ansible Project" group.
To unsubscribe from this group and stop receiving emails from it, send an email to ansible-proje...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
 
 



--
Romeo

Romeo Theriault

unread,
May 25, 2013, 4:04:09 PM5/25/13
to ansible...@googlegroups.com
On Sat, May 25, 2013 at 10:00 AM, Romeo Theriault <romeo.t...@maine.edu> wrote:

While there are several un-official python zabbix api's around, if I was writing it I'd probably just use a generic json-rpc module

Even just doing it with the standard json and http libs would be possible. 

Michael DeHaan

unread,
May 25, 2013, 8:37:48 PM5/25/13
to ansible...@googlegroups.com
Yeah, let's do that...


--
You received this message because you are subscribed to the Google Groups "Ansible Project" group.
To unsubscribe from this group and stop receiving emails from it, send an email to ansible-proje...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
 
 



--

Nicolas G.

unread,
May 26, 2013, 2:46:20 PM5/26/13
to ansible...@googlegroups.com

+1 for Zabbix

aam...@vecora.com

unread,
May 27, 2013, 8:24:43 AM5/27/13
to ansible...@googlegroups.com
+1 for Zabbix here as well.

Brian Coca

unread,
May 28, 2013, 4:08:26 PM5/28/13
to ansible...@googlegroups.com
zenoss, argus, HP and Dell management suites are the only ones i've seen missing, most other ones not mentioned are nagios derivatives.

--
Brian Coca
Stultorum infinitus est numerus
0110000101110010011001010110111000100111011101000010000001111001011011110111010100100000011100110110110101100001011100100111010000100001
Pedo mellon a minno

Stoflet Darryl

unread,
May 28, 2013, 10:38:32 PM5/28/13
to ansible...@googlegroups.com
Anybody using monit? It has an unmonitor command that would be applicable for service specific outages. In general a monit module may be useful...

Ali Asad Lotia

unread,
May 29, 2013, 5:59:24 AM5/29/13
to ansible...@googlegroups.com
May be useful to have a sensu module.
--
Ali


On Wed, May 29, 2013 at 3:38 AM, Stoflet Darryl <sto...@gmail.com> wrote:
Anybody using monit? It has an unmonitor command that would be applicable for service specific outages. In general a monit module may be useful...

Michael DeHaan

unread,
May 29, 2013, 7:40:25 AM5/29/13
to ansible...@googlegroups.com
All great ideas!

We have about a week before code freeze on 1.2 if folks want to add some.

I already have pingdom and pagerduty in queue to merge!   If not, 1.3 will be exciting (it will be anyway) and we can always pull in more later, and it's likely those who want to sit on 1.2 a while can just copy those modules over.  (and of course, many people run from source!)


Steve Irvine

unread,
Jun 10, 2013, 3:47:34 PM6/10/13
to ansible...@googlegroups.com
Can I still vote for OMD/Check_MK? I know it's a Nagios clone but it's config files seem to be different to stock Nagios, it compiles Nagios config files dynamically from it's own groups and tags. 

I love the idea of having my ansible role match my check_mk roles.

Haven't had chance to look at it yet.

Michael DeHaan

unread,
Jun 10, 2013, 3:58:40 PM6/10/13
to ansible...@googlegroups.com
It wasn't so much a voting thread as a brainstorming thread :)

So far this thread has produced modules for pingdom, pagerduty, airbrake, newrelic, and monit in addition to the already existing Nagios!

More for 1.3 are welcome if anyone would like to add some for major apps!


Brice Burgess

unread,
Jun 11, 2013, 10:37:42 AM6/11/13
to ansible...@googlegroups.com

We use *Librato* metrics to aggregate and visualize all our metrics. It supports thresholds and notifies via *OpsGenie* and *PagerDuty* (which may expose an API that ansible can automate).

For metrics collection we use collectd and diamond. collectd supports thresholding; but I can't see how to automate these any better with a special module than with the current ones inside a playbook. Sensu and StatsD are also popular nagios alternatives.

Librato features integration with *StatusCake*,  a status check service similar to pingdom and monit. We use them because of the librato integration -- and it's actually a very nice service!

An ElasticSearch and Datomic module would also be nice. ElasticSearch search provides a REST API that makes it relatively easy to poll status and create/remove indexes (databases). I basically use the uri module with ignore_errors set to true to ensure an index exists.

Thanks,

~ Brice

Michael DeHaan

unread,
Jun 12, 2013, 6:46:17 AM6/12/13
to ansible...@googlegroups.com
Wanted to point out there's already a pagerduty module in 1.2 core, BTW :)




--
You received this message because you are subscribed to the Google Groups "Ansible Project" group.
To unsubscribe from this group and stop receiving emails from it, send an email to ansible-proje...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
 
 

Bas Meijer

unread,
May 20, 2014, 4:59:41 AM5/20/14
to ansible...@googlegroups.com
Yeah, please do the zabbix module.

Serge van Ginderachter

unread,
May 20, 2014, 7:34:01 AM5/20/14
to ansible...@googlegroups.com

On 20 May 2014 10:59, Bas Meijer <brme...@gmail.com> wrote:
Yeah, please do the zabbix module.

​There are several zabbix modules in the PR queue, which need additional testing.​

Michael DeHaan

unread,
May 21, 2014, 5:58:07 PM5/21/14
to ansible...@googlegroups.com
Yes, please +1 the things you like!



--
You received this message because you are subscribed to the Google Groups "Ansible Project" group.
To unsubscribe from this group and stop receiving emails from it, send an email to ansible-proje...@googlegroups.com.

Benjamin Nugent

unread,
May 22, 2014, 2:21:00 PM5/22/14
to ansible...@googlegroups.com
A Zenoss module has been on my todo list for ages. It has a pretty decent API. Not sure I have the bandwidth to get to it, but would be happy to see it, regardless.


Dmitry Makovey

unread,
May 22, 2014, 7:54:28 PM5/22/14
to ansible...@googlegroups.com
xymon is missing. you can feed it cli commands to add/remove hosts etc.

Michael DeHaan

unread,
May 22, 2014, 8:24:27 PM5/22/14
to ansible...@googlegroups.com
FYI, somehow an ancient zombie thread got resurrected.

Ideas are fine, but it's seldom frequent that someone will scratch an itch they don't personally have - take a look at existing ones and send in pull requests if you like.






On Thu, May 22, 2014 at 7:54 PM, Dmitry Makovey <droop...@gmail.com> wrote:
xymon is missing. you can feed it cli commands to add/remove hosts etc.

--
You received this message because you are subscribed to the Google Groups "Ansible Project" group.
To unsubscribe from this group and stop receiving emails from it, send an email to ansible-proje...@googlegroups.com.
To post to this group, send email to ansible...@googlegroups.com.

Serge van Ginderachter

unread,
May 23, 2014, 3:33:26 AM5/23/14
to ansible...@googlegroups.com

On 23 May 2014 02:24, Michael DeHaan <mic...@ansible.com> wrote:
FYI, somehow an ancient zombie thread got resurrected.

TIL zombies could still get resurrected.​

Michael DeHaan

unread,
May 23, 2014, 8:34:32 AM5/23/14
to ansible...@googlegroups.com
Only after you kill them.

(Always double tap)



--
You received this message because you are subscribed to the Google Groups "Ansible Project" group.
To unsubscribe from this group and stop receiving emails from it, send an email to ansible-proje...@googlegroups.com.
To post to this group, send email to ansible...@googlegroups.com.

Brian Coca

unread,
May 23, 2014, 8:38:04 AM5/23/14
to ansible...@googlegroups.com
head shot! It doesn't matter if we call em biters or walkers, its always a head shot.
Reply all
Reply to author
Forward
0 new messages