Splitting files for copy into DDFS

10 views
Skip to first unread message

Harihara Vinayakaram

unread,
Aug 19, 2010, 1:05:18 AM8/19/10
to disc...@googlegroups.com
Hi
  I have been able to use ddfs.push and copy files into the DDFS file system. I can see that the file is replicated in the other nodes. But the file is not split into smaller nodes. The file is copied as is (I copied a 500 MB file) . Is there an utility to split the file (an equivalent of the copyFromLocal in Hadoop)

Regards
Hari

Ville Tuulos

unread,
Aug 20, 2010, 5:35:11 AM8/20/10
to disc...@googlegroups.com
Hi Hari,

If you have simple line-based text files, you can use the standard
split command as in

http://discoproject.org/doc/start/tutorial.html#prepare-input-data

Next major release of Disco will include built-in chunking.

Ville

Reply all
Reply to author
Forward
0 new messages