--
You received this message because you are subscribed to the Google Groups "Agile System Administration" group.
To post to this group, send email to agile-system-...@googlegroups.com.
To unsubscribe from this group, send email to agile-system-admini...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/agile-system-administration?hl=en.
I've been willing to answer this mail for a long time now...
Paul Nasrat wrote:
> 2009/11/7 Gildas Le Nadan <3ntr...@gmail.com>:
>
>> There was a discussion about splunk! during the "tools" session of
>> DevOpsDAys'09.
>>
>> I think Patrick is right to mention the price as the biggest
>> showstopper, as there was a consensus on the fact an OSS replacement for
>> Splunk! was one of the dearly missed tool.
>>
>> Lindsay Holmwood made mention of a prototype/QnD tool based on mysql he
>> wrote, but I don't know if he has posted/published it yet (and
>> functionaly it is probably quite far from splunk! anyway).
>
> So what is the core value of splunk that we'd want in an OSS tool (or
> set of tools)?
>
> Thinking about the problem I think it falls into several components:
>
> *) Host based Log Collection (live tailing, etc)
> *) Log Aggregation (getting it efficiently across the network)
I think the actual status quo on this is remote logging via syslog-ng
and writing logs on nfs volumes. Would you say this is ok?
> *) Log analysis
> *) Visualisation
> *) Search/Query
I would add trending. I find it really valuable to know what is the
"normal rate of errors" in application logs for instance. It allows you
to avoid red herrings when you have a real problem, hence a smaller Time
To Diagnose.
It must be said that the system probably need to allow access to non
technical/non admin people to the "live" dataset. And you want to avoid
DoS (think of stupid queries on live systems).
> I really like what Data Wrangling have done with wikipedia's squid
> logs plus hadoop:
>
> http://www.trendingtopics.org/
>
> It is open source and uses a combination of Hive/Hadoop Streaming
> (python) to do the analysis with a rails app and google visualisations
> and charts for the front end.
>
> Paul
Indeed the Data Wrangling solution is neat. It uses "off line" datas though.
It seems there is a new challenger:
http://www.roadtofailure.com/2010/01/25/logging-unsexy-important-and-now-usable/
(thanks @fs111 for the link).
The plus sides: it's shared nothing so it should scale linearly.
Gildas