krati with hadoop hdfs backend

qmx

unread,

Dec 28, 2011, 2:00:03 PM12/28/11

to kr...@googlegroups.com

Hi!

What do you all think about it? Does it sound feasible?

Cheers!

-- qmx

Sam Tingleff

unread,

Dec 28, 2011, 9:06:22 PM12/28/11

to kr...@googlegroups.com

What are you trying to do?

Douglas Campos

unread,

Dec 29, 2011, 12:23:58 AM12/29/11

to kr...@googlegroups.com

Well, my boss asked me about it, then I started looking if it was possible to have the same cool api, having hdfs as the backend (we're on 3TB/day data volume here, and they want to mine the data later)

The main reason for not going straight to hbase / cassandra is that all the clustering / columns stuff adds significant overhead, and krati gets close to our desired throughput rate.

Am I crazy? :)

Sam Tingleff

unread,

Dec 29, 2011, 1:35:09 AM12/29/11

to kr...@googlegroups.com

It sounds crazy, yes :) HDFS has very different behavior than local disk. It is designed for reliability and throughput more than random seek performance.

There is ElephantDB [1] and I have been working on a thrift-based service to serve read-only exports from HDFS using Krati as local storage. In general these kind of systems (also see Voldemort) typically export out of HDFS to expose as a real time service.

[1] https://github.com/nathanmarz/elephantdb

[2] https://github.com/samtingleff/valkyriedb

Reply all

Reply to author

Forward