meeting tonight: hadoop

Dan Croak

unread,

Sep 9, 2010, 1:16:02 PM9/9/10

to nosql-sum...@googlegroups.com

Hey folks,

Late again but these meetings have been so good I think we should
still get together.

There isn't a paper listed on nosqlsummer.org specifically for Hadoop
but there's been some interest so let's focus on that tonight.

http://hadoop.apache.org/

The problem with Hadoop is it's not one technology but a set of
disparate tools. Since Zookeeper has come up a few times in past
discussions, I'd love to hear a little more about it tonight.

The two papers stated as inspiring Hadoop are the Google Filesystem
and Google MapReduce papers. We've already read the MapReduce paper so
that one should just be refresher!

http://nosqlsummer.org/paper/google-mapreduce
http://labs.google.com/papers/gfs-sosp2003.pdf

And, to whet your appetite, here's an awesome story of Hadoop in action:

"As an example The New York Times used 100 Amazon EC2 instances and a
Hadoop application to process 4TB of raw image TIFF data (stored in
S3) into 11 million finished PDFs in the space of 24 hours at a
computation cost of about $240 (not including bandwidth)"

Dan

Russell Hanson

unread,

Sep 9, 2010, 1:56:20 PM9/9/10

to NoSQL Summer Boston, dcr...@thoughtbot.com

I'm assuming this would be at the NERD Center tonight 9-9-10 from
7pm-8:30pm??

Russell

On Sep 9, 1:16 pm, Dan Croak <dcr...@thoughtbot.com> wrote:
> Hey folks,
>
> Late again but these meetings have been so good I think we should
> still get together.
>
> There isn't a paper listed on nosqlsummer.org specifically for Hadoop
> but there's been some interest so let's focus on that tonight.
>
> http://hadoop.apache.org/
>
> The problem with Hadoop is it's not one technology but a set of
> disparate tools. Since Zookeeper has come up a few times in past
> discussions, I'd love to hear a little more about it tonight.
>
> The two papers stated as inspiring Hadoop are the Google Filesystem
> and Google MapReduce papers. We've already read the MapReduce paper so
> that one should just be refresher!
>

> http://nosqlsummer.org/paper/google-mapreducehttp://labs.google.com/papers/gfs-sosp2003.pdf

Dan Croak

unread,

Sep 9, 2010, 2:03:37 PM9/9/10

to Russell Hanson, NoSQL Summer Boston

Yessir.

Nick

unread,

Sep 10, 2010, 12:06:14 PM9/10/10

to NoSQL Summer Boston

Gah! My google groups digest didn't come in until 8AM this morning.

On Sep 9, 2:03 pm, Dan Croak <dcr...@thoughtbot.com> wrote:
> Yessir.
>

> On Thu, Sep 9, 2010 at 1:56 PM, Russell Hanson <russellhan...@gmail.com> wrote:
> > I'm assuming this would be at the NERD Center tonight 9-9-10 from
> > 7pm-8:30pm??
>
> > Russell
>
> > On Sep 9, 1:16 pm, Dan Croak <dcr...@thoughtbot.com> wrote:
> >> Hey folks,
>
> >> Late again but these meetings have been so good I think we should
> >> still get together.
>
> >> There isn't a paper listed on nosqlsummer.org specifically for Hadoop
> >> but there's been some interest so let's focus on that tonight.
>
> >>http://hadoop.apache.org/
>
> >> The problem with Hadoop is it's not one technology but a set of
> >> disparate tools. Since Zookeeper has come up a few times in past
> >> discussions, I'd love to hear a little more about it tonight.
>
> >> The two papers stated as inspiring Hadoop are the Google Filesystem
> >> and Google MapReduce papers. We've already read the MapReduce paper so
> >> that one should just be refresher!
>

> >>http://nosqlsummer.org/paper/google-mapreducehttp://labs.google.com/p...

tedpe...@gmail.com

unread,

Sep 10, 2010, 12:23:59 PM9/10/10

to nosql-sum...@googlegroups.com, NoSQL Summer Boston

eventual consistency!

(couldn't resist)

Ted

Sent from my iPhone

Reply all

Reply to author

Forward