[Stormcrawler] URL content to HdfsBolt

11 views
Skip to first unread message

aiguz...@gmail.com

unread,
Jun 1, 2018, 10:52:10 AM6/1/18
to DigitalPebble
Hi Julien,

In the ES topology I would like to index urls in ElasticSearch and forward a tuple of (url, [title, content]) to an Hdfs storage. I found that Apache-storm has a proper Hdfs bolt which looks like a straight forward implementation. I would like to know where to look for this tuple in the ES crawling topology. Could you point which bolt has this data?

Regards, 
Artur

DigitalPebble

unread,
Jun 1, 2018, 11:46:33 AM6/1/18
to DigitalPebble
 Hi Artur,

Please use stack overflow so that more people get the answer. Thanks

Julien



--
You received this message because you are subscribed to the Google Groups "DigitalPebble" group.
To unsubscribe from this group and stop receiving emails from it, send an email to digitalpebbl...@googlegroups.com.
To post to this group, send email to digita...@googlegroups.com.
Visit this group at https://groups.google.com/group/digitalpebble.
For more options, visit https://groups.google.com/d/optout.
Reply all
Reply to author
Forward
0 new messages