Krati with Hadoop HDFS backend

qmx

Dec 28, 2011, 2:00:03 PM
to kr...@googlegroups.com
Hi!

What do you all think about it? Does it sound feasible?


Cheers!
-- qmx

Sam Tingleff

Dec 28, 2011, 9:06:22 PM
to kr...@googlegroups.com
What are you trying to do?

Douglas Campos

Dec 29, 2011, 12:23:58 AM
to kr...@googlegroups.com
Well, my boss asked me about it, so I started looking into whether it was possible to keep the same cool API with HDFS as the backend (we're at 3 TB/day of data here, and they want to mine the data later).

The main reason for not going straight to HBase / Cassandra is that all the clustering / columns stuff adds significant overhead, and Krati gets close to our desired throughput rate.

Am I crazy? :)

Sam Tingleff

Dec 29, 2011, 1:35:09 AM
to kr...@googlegroups.com
It sounds crazy, yes :) HDFS has very different behavior than local disk. It is designed for reliability and throughput more than random seek performance.

There is ElephantDB [1], and I have been working on a Thrift-based service [2] to serve read-only exports from HDFS using Krati as local storage. In general, these kinds of systems (see also Voldemort) export data out of HDFS and expose it as a real-time service.

[1] https://github.com/nathanmarz/elephantdb

[2] https://github.com/samtingleff/valkyriedb
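
As a rough illustration of the export-then-serve pattern described above (this is not code from ElephantDB, Voldemort, or valkyriedb), here is a minimal Python sketch. It assumes the batch export is a tab-separated key/value file. A plain local file stands in for the HDFS export and a dict stands in for a local store such as Krati. The names `export_to_local`, `load_store`, and `demo` are hypothetical.

```python
import os
import shutil
import tempfile


def export_to_local(export_path, local_dir):
    """Bulk-copy one exported file to local disk in a single sequential pass.

    In a real deployment the source would live on HDFS; here it is just
    another local file (an assumption made to keep the sketch runnable).
    """
    os.makedirs(local_dir, exist_ok=True)
    local_path = os.path.join(local_dir, os.path.basename(export_path))
    shutil.copyfile(export_path, local_path)
    return local_path


def load_store(local_path):
    """Load tab-separated key/value lines into a dict.

    The dict is a stand-in for a local key-value store; point lookups are
    then served from local storage instead of seeking into the export.
    """
    store = {}
    with open(local_path, "r", encoding="utf-8") as f:
        for line in f:
            key, _, value = line.rstrip("\n").partition("\t")
            store[key] = value
    return store


def demo():
    workdir = tempfile.mkdtemp()
    try:
        # Stand-in for a read-only batch export produced upstream.
        export_path = os.path.join(workdir, "export.tsv")
        with open(export_path, "w", encoding="utf-8") as f:
            f.write("user1\talice\nuser2\tbob\n")

        local_path = export_to_local(export_path, os.path.join(workdir, "local"))
        return load_store(local_path)
    finally:
        shutil.rmtree(workdir)


if __name__ == "__main__":
    store = demo()
    print(store.get("user1"))
```

The point of the split is that the shared filesystem only sees large sequential reads during the copy, while the random lookups hit local storage.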
