On Mon, 30 Nov 2009 22:26:30 +0100
Klaas Bosteels <klaas.b...@gmail.com> wrote:
> dumbo cat /path/on/dfs/to/parts/dir -hadoop /path/to/hadoop should
> work
Doesn't work for me (Dumbo 0.21.21):
bin/dumbo cat /user/forsberg/test0 -hadoop /usr/lib/hadoop
java.io.IOException: Cannot open filename /user/forsberg/test0/_logs
    at org.apache.hadoop.dfs.DFSClient$DFSInputStream.openInfo(DFSClient.java:1181)
    at org.apache.hadoop.dfs.DFSClient$DFSInputStream.<init>(DFSClient.java:1172)
    at org.apache.hadoop.dfs.DFSClient.open(DFSClient.java:355)
    at org.apache.hadoop.dfs.DistributedFileSystem.open(DistributedFileSystem.java:163)
    at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:364)
    at org.apache.hadoop.streaming.AutoInputFormat.getRecordReader(AutoInputFormat.java:56)
    at org.apache.hadoop.streaming.DumpTypedBytes.dumpTypedBytes(DumpTypedBytes.java:101)
    at org.apache.hadoop.streaming.DumpTypedBytes.run(DumpTypedBytes.java:82)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)
    at org.apache.hadoop.streaming.HadoopStreaming.main(HadoopStreaming.java:43)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
    at java.lang.reflect.Method.invoke(Method.java:597)
    at org.apache.hadoop.util.RunJar.main(RunJar.java:155)
    at org.apache.hadoop.mapred.JobShell.run(JobShell.java:54)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)
    at org.apache.hadoop.mapred.JobShell.main(JobShell.java:68)
Adding a / to the path doesn't help:
bin/dumbo cat /user/forsberg/test0/ -hadoop /usr/lib/hadoop
java.io.IOException: Cannot open filename /user/forsberg/test0/_logs
    at org.apache.hadoop.dfs.DFSClient$DFSInputStream.openInfo(DFSClient.java:1181)
    at org.apache.hadoop.dfs.DFSClient$DFSInputStream.<init>(DFSClient.java:1172)
    at org.apache.hadoop.dfs.DFSClient.open(DFSClient.java:355)
    at org.apache.hadoop.dfs.DistributedFileSystem.open(DistributedFileSystem.java:163)
    at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:364)
    at org.apache.hadoop.streaming.AutoInputFormat.getRecordReader(AutoInputFormat.java:56)
    at org.apache.hadoop.streaming.DumpTypedBytes.dumpTypedBytes(DumpTypedBytes.java:101)
    at org.apache.hadoop.streaming.DumpTypedBytes.run(DumpTypedBytes.java:82)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)
    at org.apache.hadoop.streaming.HadoopStreaming.main(HadoopStreaming.java:43)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
    at java.lang.reflect.Method.invoke(Method.java:597)
    at org.apache.hadoop.util.RunJar.main(RunJar.java:155)
    at org.apache.hadoop.mapred.JobShell.run(JobShell.java:54)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)
    at org.apache.hadoop.mapred.JobShell.main(JobShell.java:68)
For reference, the contents of the DFS directory:
hadoop dfs -ls /user/forsberg/test0
Found 3 items
drwxr-xr-x   - forsberg supergroup          0 2009-11-30 10:34 /user/forsberg/test0/_logs
-rw-r--r--   1 forsberg supergroup   21164658 2009-11-30 10:35 /user/forsberg/test0/part-00000
-rw-r--r--   1 forsberg supergroup   21243935 2009-11-30 10:35 /user/forsberg/test0/part-00001
If I first do 'hadoop dfs -rmr /user/forsberg/test0/_logs', 'dumbo cat'
works as intended. Hmm... Googling for _logs gives me
http://dumbotics.com/2009/05/31/dumbo-on-clouderas-distribution/, so it
seems this is a known problem.
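
As a possible workaround that doesn't delete anything, here is a small Python sketch that picks the data files out of a 'hadoop dfs -ls' listing and skips entries like _logs. The function name data_paths and the rule "skip anything whose name starts with _ or ." are my own assumptions for illustration, not something Dumbo provides:

```python
import posixpath


def data_paths(ls_output):
    """Return the paths in `hadoop dfs -ls` output that look like data files.

    Hypothetical helper: entries whose basename starts with '_' or '.'
    (such as the _logs directory) are treated as non-data and skipped.
    """
    paths = []
    for line in ls_output.splitlines():
        fields = line.split()
        # Real entries have 8 columns and start with a permission string;
        # this skips the "Found N items" header and blank lines.
        if len(fields) < 8 or fields[0][0] not in "-d":
            continue
        path = fields[-1]
        if posixpath.basename(path).startswith(("_", ".")):
            continue
        paths.append(path)
    return paths
```

Fed the listing above, this should leave only the two part-0000* files. Whether 'dumbo cat' in this version accepts a single part file instead of a directory is something I haven't verified.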
I guess I can disable log creation, or move the logs somewhere else,
but still, I think this is a bug. Which reminds me that when I try to
join the dumbo space at assembla it just shows me a blank page :-(.
Thanks for your reply, it got me thinking! :-)
\EF