
SenseiDB batch indexing performance


Zhou Lizhi

May 14, 2013, 2:27:41 AM
to sensei...@googlegroups.com
I tried batch indexing on a 7 GB LZO-compressed dataset. The job took more than 15 hours to run on a 10-machine Hadoop cluster and consumed a huge amount of I/O and CPU. Is this expected?