Reading files from Hadoop HDFS to Accumulo

64 views
Skip to first unread message

Stuart Paton

unread,
Jan 12, 2016, 8:40:08 AM1/12/16
to Lumify
Hi,

I am trying to set up a data flow to Lumify from existing data which I have in a Hadoop HDFS file system. 

Does anyone know how I can get this data to be received in Accumulo, and indexed in ElasticSearch so that Lumify can see the information?

Thanks,

Stuart

Ryan Gimmy

unread,
Jan 14, 2016, 4:22:57 PM1/14/16
to Lumify
Stuart, 

What format is the data in?  I work on the Visallo(https://github.com/v5analytics/visallo) project, which is based on Lumify(http://visallo.org/blog/2015-05/visallo-open-sourced/), and we have a couple of tools that are available to help ingest your data so that project might be worth taking a look at.    What have you tried so far?  There are some MR jobs that might help that might be similar between the two projects: https://github.com/v5analytics/visallo/tree/master/datasets/wikipedia/mr/src/main/java/org/visallo/wikipedia/mapreduce.  Look at that might be a good place to get started on how one might ingest data in order to see it on the front end.

--R
Reply all
Reply to author
Forward
0 new messages