The problem with Hadoop is it's not one technology but a set of disparate tools. Since ZooKeeper has come up a few times in past discussions, I'd love to hear a little more about it tonight.
The two papers stated as inspiring Hadoop are the Google Filesystem and Google MapReduce papers. We've already read the MapReduce paper, so that one should just be a refresher!
And, to whet your appetite, here's an awesome story of Hadoop in action:
"As an example The New York Times used 100 Amazon EC2 instances and a Hadoop application to process 4TB of raw image TIFF data (stored in S3) into 11 million finished PDFs in the space of 24 hours at a computation cost of about $240 (not including bandwidth)"
On Thu, Sep 9, 2010 at 1:56 PM, Russell Hanson <russellhan...@gmail.com> wrote:
> I'm assuming this would be at the NERD Center tonight 9-9-10 from
> 7pm-8:30pm??
> Russell
> On Sep 9, 1:16 pm, Dan Croak <dcr...@thoughtbot.com> wrote:
>> Hey folks,
>> Late again but these meetings have been so good I think we should
>> still get together.
>> There isn't a paper listed on nosqlsummer.org specifically for Hadoop
>> but there's been some interest so let's focus on that tonight.
> Gah! My google groups digest didn't come in until 8AM this morning.
> On Sep 9, 2:03 pm, Dan Croak <dcr...@thoughtbot.com> wrote:
>> Yessir.