This morning, I rebuilt BeeGFS and took a look at the storage configuration.
[centos@ip-10-0-0-10 ~]$ ssh 10.0.0.100 beegfs-ctl --listnodes --nodetype=storage --details
ip-10-0-0-30 [ID: 1]
Ports: UDP: 8003; TCP: 8003
Interfaces: eth0(TCP)
Number of nodes: 1
Connection to 10.0.0.100 closed.
It shows only one storage server (1 node), but I have two storage nodes
(the second storage server is 10.0.0.31). The BeeGFS logs on the second
storage server are:
(3) Jan20 14:12:40 Main [RegDGramLis] >> Listening for UDP datagrams: Port 8003
(1) Jan20 14:12:40 Main [App] >> Waiting for beegfs...@10.0.0.10:8008...
(2) Jan20 14:12:40 RegDGramLis [Heartbeat incoming] >> New node: beegfs-mgmtd ip-10-0-0-10 [ID: 1];
(3) Jan20 14:12:40 Main [NodeConn (acquire stream)] >> Connected: beegfs...@10.0.0.10:8008 (protocol: TCP)
(1) Jan20 14:12:40 Main [App] >> Version: 6.3
(2) Jan20 14:12:40 Main [App] >> LocalNode: beegfs-storage ip-10-0-0-31 [ID: 1]
(2) Jan20 14:12:40 Main [App] >> Usable NICs: eth0(TCP)
(2) Jan20 14:12:40 Main [App] >> Storage targets: 1
(3) Jan20 14:12:40 Main [RegDGramLis] >> Listening for UDP datagrams: Port 8003
(1) Jan20 14:12:42 Main [Register node] >> Node registration not successful. Management node offline? Will keep on trying...
I can ssh to the management node (10.0.0.10) and vice versa. All the
ports between 10.0.0.31 and 10.0.0.10 are open (particularly 8003).
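For reference, here is a minimal sketch of how TCP reachability between the two hosts could be double-checked. It assumes bash (for its /dev/tcp redirection) and the coreutils timeout command are installed; check_port is a hypothetical helper name, not a BeeGFS tool. It tests TCP only, so it says nothing about the UDP side of the same ports.

```shell
# check_port is a hypothetical helper (not part of BeeGFS).
# It tries to open a TCP connection to host:port and gives up after 3 seconds.
# TCP only: the UDP side of the same port is not tested.
check_port() {
    local host=$1 port=$2
    # bash's /dev/tcp/<host>/<port> redirection opens a TCP socket;
    # a refused or timed-out connection makes the command fail.
    if timeout 3 bash -c "exec 3<>/dev/tcp/${host}/${port}" 2>/dev/null; then
        echo "${host}:${port} open"
    else
        echo "${host}:${port} closed"
        return 1
    fi
}
```

Run on 10.0.0.31, this would be called as "check_port 10.0.0.10 8008" for the management port that appears in the log above, and as "check_port 10.0.0.10 8003" for the port mentioned here.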
I built BeeGFS with the following commands:
sudo /etc/init.d/beegfs-mgmtd start
sleep 2
#meta# - metadata servers
ssh 10.0.0.20 "sudo /opt/beegfs/sbin/beegfs-setup-meta -p /beegfs-meta0 -s 1 -m 10.0.0.10"
sleep 2
#data# - data servers
ssh 10.0.0.31 "sudo /opt/beegfs/sbin/beegfs-setup-storage -p /beegfs-data1 -s 1 -i 101 -m 10.0.0.10"
ssh 10.0.0.30 "sudo /opt/beegfs/sbin/beegfs-setup-storage -p /beegfs-data0 -s 0 -i 001 -m 10.0.0.10"
#= Start services
sleep 2
#=meta= metadata servers
ssh 10.0.0.20 "sudo /etc/init.d/beegfs-meta start"
sleep 2
#=data= data servers
ssh 10.0.0.31 "sudo /etc/init.d/beegfs-storage start"
ssh 10.0.0.30 "sudo /etc/init.d/beegfs-storage start"
ssh 10.0.0.100 "sudo /etc/init.d/beegfs-helperd start"
sleep 2
sudo /etc/init.d/beegfs-client start
#=client= clients
ssh 10.0.0.100 "sudo /etc/init.d/beegfs-client start"
Any suggestions on where to look for issues?
Thanks!
Jeff