Gmail Calendar Documents Reader Web more »
Recently Visited Groups | Help | Sign in
Google Groups Home
cascading loganalysis example
There are currently too many topics in this group that display first. To make this topic appear first, remove this option from another topic.
There was an error processing your request. Please try again.
flag
  4 messages - Collapse all  -  Translate all to Translated (View all originals)
The group you are posting to is a Usenet group. Messages posted to this group will make your email address visible to anyone on the Internet.
Your reply message has not been sent.
Your post was successful
 
From:
To:
Cc:
Followup To:
Add Cc | Add Followup-to | Edit Subject
Subject:
Validation:
For verification purposes please type the characters you see in the picture below or the numbers you hear by clicking the accessibility icon. Listen and type the numbers you hear
 
Robert Henry  
View profile  
 More options Nov 9, 7:49 pm
From: Robert Henry <rrh.he...@gmail.com>
Date: Mon, 9 Nov 2009 16:49:04 -0800
Local: Mon, Nov 9 2009 7:49 pm
Subject: cascading loganalysis example
The cascading loganalysis example seems to produce data sorted on the
timestamp, at least for small input files and a single shard of
output.  There seems to be some implicit typing going on under the
covers so that the timestamps are treated as longs and sorted as such;
where dos this happen coercion happen?

Is there a way to disable the sorting by timestamp, so that the
loganalysis benchmark is unconstrained as to the output order, and
thus presumably make it run faster?  This is to say, treat the log
data as a set of events, rather than a sequence of events.

Robert Henry


    Reply    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Chris K Wensel  
View profile  
 More options Nov 10, 12:07 pm
From: Chris K Wensel <ch...@wensel.net>
Date: Tue, 10 Nov 2009 09:07:34 -0800
Local: Tues, Nov 10 2009 12:07 pm
Subject: Re: cascading loganalysis example

In MapReduce sorting happens to support grouping on key values.

So the results are sorted on the fields that are grouped upon.

In this example, we are grouping on timestamps (minute and second  
intervals) in order to get the metrics for each.

cheers,
chris

On Nov 9, 2009, at 4:49 PM, Robert Henry wrote:

--
Chris K Wensel
ch...@concurrentinc.com
http://www.concurrentinc.com

    Reply    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Robert Henry  
View profile  
 More options Nov 10, 12:48 pm
From: Robert Henry <rrh.he...@gmail.com>
Date: Tue, 10 Nov 2009 09:48:07 -0800
Local: Tues, Nov 10 2009 12:48 pm
Subject: Re: cascading loganalysis example
How is the type of the key values determined?  The key values must be
Longs or Dates, somehow, to be sorted correctly.   Which of the
pipeline builders knows that it will be dealing with Longs or Dates?
Is there some magic involved with the use of the DateParser object, or
the field named "ts"?

Thanks.

On Tue, Nov 10, 2009 at 9:07 AM, Chris K Wensel <ch...@wensel.net> wrote:


    Reply    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Chris K Wensel  
View profile  
 More options Nov 10, 12:58 pm
From: Chris K Wensel <ch...@wensel.net>
Date: Tue, 10 Nov 2009 09:58:14 -0800
Local: Tues, Nov 10 2009 12:58 pm
Subject: Re: cascading loganalysis example
The operations that create ts and tm are responsible.

new DateParser( new Fields( "ts" ), "dd/MMM/yyyy:HH:mm:ss Z" );

new ExpressionFunction( new Fields( "tm" ), "ts - (ts % (60 * 1000))",  
long.class )

On Nov 10, 2009, at 9:48 AM, Robert Henry wrote:

--
Chris K Wensel
ch...@concurrentinc.com
http://www.concurrentinc.com

    Reply    Reply to author    Forward  
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
End of messages
« Back to Discussions « Newer topic     Older topic »

Create a group - Google Groups - Google Home - Terms of Service - Privacy Policy
©2009 Google