NullPointerException in worker when the leader of master was switching.

2748...@qq.com

Aug 27, 2015, 1:46:05 AM
to Tachyon Users

Hi all,
   My environment: CentOS 6.6, Spark 1.4.0, Tachyon 0.6.4, Gluster 3.4.2
   Master: node62 -> node52
   The logs on the worker:
 2015-08-26 08:29:03,669 INFO WORKER_LOGGER (LeaderInquireClient.java:getMasterAddress) - Master addresses:[node62:19998, node52:19998]
2015-08-26 08:29:08,679 INFO WORKER_LOGGER (MasterClient.java:connect) - Tachyon client (version 0.6.4) is trying to connect master @ node52:19998
2015-08-26 08:29:08,683 ERROR WORKER_LOGGER (TachyonWorker.java:main) - Uncaught exception terminating worker
java.lang.NullPointerException
at tachyon.util.NetworkUtils.getFqdnHost(NetworkUtils.java:143)
at tachyon.master.MasterClient.connect(MasterClient.java:183)
at tachyon.master.MasterClient.worker_register(MasterClient.java:817)
at tachyon.worker.WorkerStorage.register(WorkerStorage.java:780)
at tachyon.worker.WorkerStorage.initialize(WorkerStorage.java:336)
at tachyon.worker.TachyonWorker.<init>(TachyonWorker.java:206)
at tachyon.worker.TachyonWorker.createWorker(TachyonWorker.java:90)
at tachyon.worker.TachyonWorker.main(TachyonWorker.java:126)
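
The top frame of the trace is NetworkUtils.getFqdnHost, called while the worker connects to node52:19998. One way an NPE of this shape can arise in Java is when a host name cannot be resolved: InetSocketAddress.getAddress() then returns null, and any call on the result fails. Whether that is what happened here is an assumption; it is not confirmed in this thread and has not been checked against the Tachyon 0.6.4 source. The sketch below only illustrates the mechanism and a defensive guard. The class FqdnHostSketch and the method safeFqdnHost are hypothetical names, not Tachyon APIs.

```java
import java.net.InetSocketAddress;

public class FqdnHostSketch {

    /**
     * Hypothetical helper (not part of Tachyon): returns the canonical host
     * name of the address, or the raw host string if the address could not
     * be resolved.
     */
    public static String safeFqdnHost(InetSocketAddress addr) {
        if (addr == null) {
            throw new IllegalArgumentException("address must not be null");
        }
        // getAddress() returns null for an unresolved socket address, so
        // calling getCanonicalHostName() on it directly would throw an NPE.
        if (addr.isUnresolved() || addr.getAddress() == null) {
            return addr.getHostString();
        }
        return addr.getAddress().getCanonicalHostName();
    }

    public static void main(String[] args) {
        // createUnresolved never performs a DNS lookup, so this reproduces
        // the "host name did not resolve" case without needing a network.
        InetSocketAddress master = InetSocketAddress.createUnresolved("node52", 19998);
        System.out.println(master.getAddress());   // prints "null"
        System.out.println(safeFqdnHost(master));  // prints "node52"
    }
}
```

If name resolution is the cause, checking that every master host name (here node52 and node62) resolves from each worker would be a reasonable first diagnostic step.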

Haoyuan Li

Sep 1, 2015, 7:33:34 AM
to 2748...@qq.com, Tachyon Users
Is the problem solved?

Best,

Haoyuan

--
You received this message because you are subscribed to the Google Groups "Tachyon Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to tachyon-user...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.



2748...@qq.com

Sep 1, 2015, 9:11:01 PM
to Tachyon Users, 2748...@qq.com
I haven't found the cause yet, but the worker recovers by itself when the master changes again.

Thanks,
Mingkai

On Tuesday, September 1, 2015 at 7:33:34 PM UTC+8, Haoyuan Li wrote:

Haoyuan Li

Sep 2, 2015, 11:13:57 AM
to 2748...@qq.com, Tachyon Users
Why would the master change? Are you running multiple masters, and did one of them fail?

Thanks.

Haoyuan
--
Haoyuan Li

野火燎原

Sep 6, 2015, 4:49:54 AM
to Haoyuan Li, Tachyon Users
Our fault-tolerant Tachyon cluster has two master nodes and five slave nodes. Our Spark cluster connects to it using a public IP for the multiple masters. The problem occurred while I was testing the Tachyon cluster.

Thanks.

Mingkai


------------------ Original Message ------------------
From: "Haoyuan Li";<haoyu...@gmail.com>;
Sent: Wednesday, September 2, 2015, 11:13 PM
To: "野火燎原"<2748...@qq.com>;
Cc: "Tachyon Users"<tachyo...@googlegroups.com>;
Subject: Re: NullPointerException in worker when the leader of master was switching.

Haoyuan Li

Sep 6, 2015, 11:34:50 PM
to 野火燎原, Tachyon Users
Are you running ZooKeeper?

Haoyuan

野火燎原

Sep 7, 2015, 12:48:15 AM
to Haoyuan Li, Tachyon Users
Yes, I have three ZooKeeper nodes.
My environment:
1. Tachyon masters: node52, node62
2. Tachyon slaves: node62, node60, node63, node104, node106
3. ZooKeeper: node62, node104, node106

Thanks,
Mingkai


------------------ Original Message ------------------
From: "Haoyuan Li";<haoyu...@gmail.com>;
Sent: Monday, September 7, 2015, 11:34 AM

Haoyuan Li

Nov 15, 2015, 1:52:35 PM
to 野火燎原, Tachyon Users
Mingkai,

Is this resolved? It would be great if you could give Tachyon 0.8.2 a try. 0.6.4 is very old. According to GitHub, it is more than 6000 commits behind 0.8.2. :)

Looking forward to hearing from you.

Best,

Haoyuan

野火燎原

Nov 19, 2015, 2:11:14 AM
to Haoyuan Li, Tachyon Users
Haoyuan,
    
    I will try it sometime later, because my testing environment is currently occupied by my colleagues. I want to upgrade Tachyon from 0.6.4 to 0.8.2 in my production environment because of some bugs in 0.6.4. Do you think that is a good idea?

Thanks,
Mingkai

------------------ Original Message ------------------
From: "Haoyuan Li";<haoyu...@gmail.com>;
Sent: Monday, November 16, 2015, 2:52 AM

Haoyuan Li

Nov 19, 2015, 2:17:34 AM
to 野火燎原, Tachyon Users
Definitely. 0.8.2 has many new features and is also much more stable than 0.6.4.

Cheers,

Haoyuan

2748...@qq.com

Dec 9, 2015, 2:33:51 AM
to Tachyon Users, 2748...@qq.com
Hi Haoyuan,
   
    I have tested 0.8.2, and the problem is solved. Thank you very much.
Thanks,
Mingkai

On Thursday, November 19, 2015 at 3:17:34 PM UTC+8, Haoyuan Li wrote:

Gene Pang

Dec 9, 2015, 10:09:56 AM
to Tachyon Users, 2748...@qq.com
Thanks for the confirmation!

-Gene

Haoyuan Li

Dec 9, 2015, 12:44:54 PM
to Gene Pang, Tachyon Users, 野火燎原
Great!
