Change of hadoop namenode

31 views
Skip to first unread message

Joseph Lim

unread,
Mar 11, 2013, 10:54:24 AM3/11/13
to crawlzi...@googlegroups.com
Hi,

i did some changes to crawzilla's file to my own hadoop namenode and jobtracker (example 155.69.140.200)with the below 

core-site.xml
<configuration>
  <property>
    <name>fs.default.name</name>
    <value>hdfs://155.69.140.200:9000</value>
  </property>
  <property>
    <name>hadoop.tmp.dir</name>
    <value>/home/crawler/crawlzilla/workspace/nutch-crawler</value>
  </property>
</configuration>

mapred-site.xml
<property>
    <name>mapred.job.tracker</name>
    <value>155.69.140.200:9001</value>
  </property>

and when i crawl, i got this

error: hadoop dfs -mkdir /user/crawler/admin/testnode broken

able to guide me through?


reason: i already have my hadoop running with some other application and hope to integrate crawzilla in it too.

Thanks so much =)

Jazz Yao-Tsung Wang

unread,
Mar 19, 2013, 11:37:02 AM3/19/13
to crawlzi...@googlegroups.com
Hi Joseph,

I have to say that Crawlzilla is very sensitive to Hadoop version.
May I ask which version of your existing hadoop environment ?
And which linux distribution you use ? Fedora 12+ ?
PS. Crawlzilla 1.1.2 use hadoop 0.20.2.

- Jazz
> --
> You received this message because you are subscribed to the Google Groups
> "crawlzilla-user" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to crawlzilla-us...@googlegroups.com.
> For more options, visit https://groups.google.com/groups/opt_out.
>
>

Joseph Lim

unread,
Apr 5, 2013, 3:54:13 AM4/5/13
to crawlzi...@googlegroups.com, ja...@nchc.narl.org.tw
HI,

I can follow the Hadoop version but i am using Ubuntu LTS12.04
Reply all
Reply to author
Forward
0 new messages