Hello,
we setup a system with 3 data nodes + master + 2 rangeserver and current version is 0.9.7.5 (upgraded from 0.9.7.1).
we have problem about rs1. when system running rs1 RangeServer.log file grew with like this logs;
RangeServer Logs ->
1367998993 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/TableInfo.cc:259) Adding range 2/8[be358b80-b9e6-4d63-bc0b-ccd2b0bb0b2a..be77787b-ca3f-4978-af48-62908e545ced] to TableInfo
1367998993 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RangeServer.cc:4467) Successfully replay loaded range 2/8[be358b80-b9e6-4d63-bc0b-ccd2b0bb0b2a..be77787b-ca3f-4978-af48-62908e545ced]
1367999004 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RangeServer.cc:3113) Entering get_statistics()
1367999004 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RangeServer.cc:3363) Exiting get_statistics()
1367999030 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/AsyncComm/IOHandler.h:149) Event: type=CONNECTION_ESTABLISHED from=
172.16.200.59:628101367999030 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/Range.cc:256) Loading cellstores for '0/0[..0/0:��]'
1367999030 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/Range.cc:344) Finished loading cellstores for '0/0[..0/0:��]'
1367999030 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RangeServer.cc:1407) Successfully created scanner (id=0) on table '0/0', returning 2 k/v pairs, more=0
1367999030 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/Range.cc:256) Loading cellstores for '0/0[0/0:��..��]'
1367999030 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/AsyncComm/IOHandler.h:149) Event: type=CONNECTION_ESTABLISHED from=
172.16.200.58:616431367999030 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RangeServer.cc:1407) Successfully created scanner (id=0) on table '0/0', returning 2 k/v pairs, more=0
1367999030 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/Range.cc:315) Loading CellStore 0/0/default/qyoNKN5rd__dbHKv/cs1
1367999030 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/Range.cc:344) Finished loading cellstores for '0/0[0/0:��..��]'
1367999030 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RangeServer.cc:1407) Successfully created scanner (id=0) on table '0/0', returning 2 k/v pairs, more=0
1367999030 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RangeServer.cc:1407) Successfully created scanner (id=0) on table '0/0', returning 2 k/v pairs, more=0
1367999030 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RangeServer.cc:1407) Successfully created scanner (id=0) on table '0/0', returning 2 k/v pairs, more=0
1367999031 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RangeServer.cc:1407) Successfully created scanner (id=0) on table '0/0', returning 2 k/v pairs, more=0
1367999032 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RangeServer.cc:1407) Successfully created scanner (id=0) on table '0/0', returning 2 k/v pairs, more=0
1367999032 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RangeServer.cc:1407) Successfully created scanner (id=0) on table '0/0', returning 2 k/v pairs, more=0
1367999033 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RangeServer.cc:1407) Successfully created scanner (id=0) on table '0/0', returning 2 k/v pairs, more=0
1367999033 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RangeServer.cc:1407) Successfully created scanner (id=0) on table '0/0', returning 2 k/v pairs, more=0
1367999033 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RangeServer.cc:1407) Successfully created scanner (id=0) on table '0/0', returning 2 k/v pairs, more=0
1367999033 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RangeServer.cc:1407) Successfully created scanner (id=0) on table '0/0', returning 2 k/v pairs, more=0
i don't now why. Because system was running. after than rs1 down. we started again rs1 that it started. but rs1 RangeServer.log file grew again and rs1 down again.
when i cap stop/start hypertable said "rangeserver is waiting..." (rs1)
i can see rs2's datas in monitoring. But rs1's datas not there.
how can i resolve this issue with no data loss? this is logs;
Hypertable.Master.log
1368024541 INFO Hypertable.Master : (/root/src/hypertable/src/cc/Hypertable/Master/OperationRecoverRanges.cc:225) Leaving RecoverServerRanges rs1 plan_generation=1 type=3 state=PHANTOM_LOAD
1368024541 INFO Hypertable.Master : (/root/src/hypertable/src/cc/Hypertable/Master/OperationRecoveryBlocker.cc:44) Entering RecoveryBlocker-104089
1368024541 INFO Hypertable.Master : (/root/src/hypertable/src/cc/Hypertable/Master/OperationRecoveryBlocker.cc:53) total_servers=2 connected_servers=2 quorum=2
1368024541 INFO Hypertable.Master : (/root/src/hypertable/src/cc/Hypertable/Master/OperationRecoveryBlocker.cc:84) Leaving RecoveryBlocker-104089 state=COMPLETE
1368024541 INFO Hypertable.Master : (/root/src/hypertable/src/cc/Hypertable/Master/OperationRecoverRanges.cc:64) Entering RecoverServerRanges (0x1756000) rs1 type=3 plan_generation=1 state=PHANTOM_LOAD
1368024541 INFO Hypertable.Master : (/root/src/hypertable/src/cc/Hypertable/Master/OperationRecoverRanges.cc:328) Calling phantom_load(plan_generation=1, location=rs2) for 412 user ranges
1368024541 ERROR Hypertable.Master : phantom_load_ranges (/root/src/hypertable/src/cc/Hypertable/Master/OperationRecoverRanges.cc:335): Hypertable::Exception: RangeServer phantom_load() failure : Unable to verify schema for 3/4Problem getting attribute 'schema' of hyperspace file '/hypertable/tables/3/4' - HYPERSPACE file not found
at void Hypertable::RangeServerClient::phantom_load(const Hypertable::CommAddress&, const Hypertable::String&, int, const std::vector<unsigned int, std::allocator<unsigned int> >&, const std::vector<Hypertable::QualifiedRangeSpec, std::allocator<Hypertable::QualifiedRangeSpec> >&, const std::vector<Hypertable::RangeState, std::allocator<Hypertable::RangeState> >&) (/root/src/hypertable/src/cc/Hypertable/Lib/RangeServerClient.cc:678)
1368024541 ERROR Hypertable.Master : phantom_load_ranges (/root/src/hypertable/src/cc/Hypertable/Master/OperationRecoverRanges.cc:340): Failed to issue phantom_load calls
1368024541 INFO Hypertable.Master : (/root/src/hypertable/src/cc/Hypertable/Master/OperationRecoverRanges.cc:225) Leaving RecoverServerRanges rs1 plan_generation=1 type=3 state=PHANTOM_LOAD
Hypertable.Rangeserver (rs1) logs;
1368001942 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RangeServer.cc:3113) Entering get_statistics()
1368001942 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RangeServer.cc:3363) Exiting get_statistics()
1368001954 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RSStats.h:83) Maintenance stats scans=(0 0 0 0.000000) updates=(0 0 0 0.000000 0)
1368001954 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RangeServer.cc:4572) Memory Usage: 0 bytes
1368001974 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RSStats.h:83) Maintenance stats scans=(0 0 0 0.000000) updates=(0 0 0 0.000000 0)
1368001974 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RangeServer.cc:4572) Memory Usage: 0 bytes
1368001977 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/AsyncComm/IOHandler.h:149) Event: type=CONNECTION_ESTABLISHED from=
172.16.200.44:535121368001977 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/ConnectionHandler.cc:227) Event: type=DISCONNECT from=
172.16.200.44:535121368001979 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/AsyncComm/ConnectionManager.cc:384) Received event Event: type=DISCONNECT from=
172.16.200.57:380501368001979 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/AsyncComm/ConnectionManager.cc:424) Event: type=DISCONNECT from=
172.16.200.57:38050; Problem connecting to Master, will retry in 5000 milliseconds...
1368001979 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/ConnectionHandler.cc:227) Event: type=DISCONNECT from=
172.16.200.57:380501368001985 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/AsyncComm/IOHandlerData.cc:635) connect() completion error - Connection refused
1368001985 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/AsyncComm/ConnectionManager.cc:384) Received event Event: type=DISCONNECT from=
172.16.200.57:380501368001985 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/AsyncComm/ConnectionManager.cc:424) Event: type=DISCONNECT from=
172.16.200.57:38050; Problem connecting to Master, will retry in 5000 milliseconds...
1368001985 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/ConnectionHandler.cc:227) Event: type=DISCONNECT from=
172.16.200.57:380501368001986 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/AsyncComm/IOHandler.h:149) Event: type=CONNECTION_ESTABLISHED from=
172.16.200.58:520911368001986 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/ConnectionHandler.cc:156) Received shutdown command
1368001986 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/TimerHandler.cc:152) TimerHandler shutting down.
1368001986 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/GroupCommitTimerHandler.cc:61) CommitIntervalGroupCommitTimerHandler shutting down.
1368001986 ERROR Hypertable.RangeServer : (/root/src/hypertable/src/cc/AsyncComm/Comm.cc:309) No connection for
172.16.200.58:52091 - COMM not connected
1368001986 ERROR Hypertable.RangeServer : (/root/src/hypertable/src/cc/AsyncComm/Comm.cc:309) No connection for
172.16.200.58:52091 - COMM not connected
1368001986 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/AsyncComm/IOHandler.h:149) Event: type=CONNECTION_ESTABLISHED from=
172.16.200.58:607571368001988 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Hypertable/RangeServer/RequestHandlerShutdown.cc:43) Exiting RangeServer.
1368002202 INFO Hypertable.RangeServer : (/root/src/hypertable/src/cc/Common/Config.cc:577) Initializing Hypertable.RangeServer (Hypertable 0.9.7.5 (v0.9.7.5-0-g22129c7-dirty))...
ThrisBroker.log
1368001935 WARN ThriftBroker : (/root/src/hypertable/src/cc/AsyncComm/Comm.cc:268) No connection for rs1 - COMM invalid proxy
1368001935 WARN ThriftBroker : (/root/src/hypertable/src/cc/Hypertable/Lib/RangeServerClient.cc:728) Comm::send_request to rs1 failed - COMM invalid proxy
1368001938 INFO ThriftBroker : (/root/src/hypertable/src/cc/AsyncComm/ConnectionManager.cc:237) Connection attempt to RangeServer at rs1 failed - COMM invalid proxy. Will retry again in 3000 milliseconds...
1368001939 WARN ThriftBroker : (/root/src/hypertable/src/cc/AsyncComm/Comm.cc:268) No connection for rs1 - COMM invalid proxy
1368001939 WARN ThriftBroker : (/root/src/hypertable/src/cc/Hypertable/Lib/RangeServerClient.cc:728) Comm::send_request to rs1 failed - COMM invalid proxy
1368001941 INFO ThriftBroker : (/root/src/hypertable/src/cc/AsyncComm/ConnectionManager.cc:237) Connection attempt to RangeServer at rs1 failed - COMM invalid proxy. Will retry again in 3000 milliseconds...
1368001941 WARN ThriftBroker : (/root/src/hypertable/src/cc/AsyncComm/Comm.cc:268) No connection for rs1 - COMM invalid proxy
1368001941 WARN ThriftBroker : (/root/src/hypertable/src/cc/Hypertable/Lib/RangeServerClient.cc:728) Comm::send_request to rs1 failed - COMM invalid proxy
1368001945 INFO ThriftBroker : (/root/src/hypertable/src/cc/AsyncComm/ConnectionManager.cc:237) Connection attempt to RangeServer at rs1 failed - COMM invalid proxy. Will retry again in 3000 milliseconds...
1368001945 WARN ThriftBroker : (/root/src/hypertable/src/cc/AsyncComm/Comm.cc:268) No connection for rs1 - COMM invalid proxy
1368001945 WARN ThriftBroker : (/root/src/hypertable/src/cc/Hypertable/Lib/RangeServerClient.cc:728) Comm::send_request to rs1 failed - COMM invalid proxy
1368001947 WARN ThriftBroker : (/root/src/hypertable/src/cc/AsyncComm/Comm.cc:268) No connection for rs1 - COMM invalid proxy
1368001947 WARN ThriftBroker : (/root/src/hypertable/src/cc/Hypertable/Lib/RangeServerClient.cc:728) Comm::send_request to rs1 failed - COMM invalid proxy
1368001949 INFO ThriftBroker : (/root/src/hypertable/src/cc/AsyncComm/ConnectionManager.cc:237) Connection attempt to RangeServer at rs1 failed - COMM invalid proxy. Will retry again in 3000 milliseconds...
1368001951 WARN ThriftBroker : (/root/src/hypertable/src/cc/AsyncComm/Comm.cc:268) No connection for rs1 - COMM invalid proxy
1368001951 WARN ThriftBroker : (/root/src/hypertable/src/cc/Hypertable/Lib/RangeServerClient.cc:728) Comm::send_request to rs1 failed - COMM invalid proxy
1368001953 WARN ThriftBroker : (/root/src/hypertable/src/cc/AsyncComm/Comm.cc:268) No connection for rs1 - COMM invalid proxy
1368001953 WARN ThriftBroker : (/root/src/hypertable/src/cc/Hypertable/Lib/RangeServerClient.cc:728) Comm::send_request to rs1 failed - COMM invalid proxy
1368001953 INFO ThriftBroker : (/root/src/hypertable/src/cc/AsyncComm/ConnectionManager.cc:237) Connection attempt to RangeServer at rs1 failed - COMM invalid proxy. Will retry again in 3000 milliseconds...
1368001956 INFO ThriftBroker : (/root/src/hypertable/src/cc/AsyncComm/ConnectionManager.cc:237) Connection attempt to RangeServer at rs1 failed - COMM invalid proxy. Will retry again in 3000 milliseconds...
1368001957 WARN ThriftBroker : (/root/src/hypertable/src/cc/AsyncComm/Comm.cc:268) No connection for rs1 - COMM invalid proxy
1368001957 WARN ThriftBroker : (/root/src/hypertable/src/cc/Hypertable/Lib/RangeServerClient.cc:728) Comm::send_request to rs1 failed - COMM invalid proxy
1368001959 WARN ThriftBroker : (/root/src/hypertable/src/cc/AsyncComm/Comm.cc:268) No connection for rs1 - COMM invalid proxy
1368001959 WARN ThriftBroker : (/root/src/hypertable/src/cc/Hypertable/Lib/RangeServerClient.cc:728) Comm::send_request to rs1 failed - COMM invalid proxy
1368007317 INFO ThriftBroker : (/root/src/hypertable/src/cc/Common/Config.cc:577) Initializing ThriftBroker (Hypertable 0.9.7.5 (v0.9.7.5-0-g22129c7-dirty))...
CPU cores count=24
DfsBroker.Local.Root=fs/local
DfsBroker.Port=38030
HdfsBroker.Hadoop.ConfDir=/etc/hadoop/conf
Hyperspace.GracePeriod=200000
Hyperspace.KeepAlive.Interval=30000
Hyperspace.Lease.Interval=1000000
Hyperspace.Replica.Dir=hyperspace
Hyperspace.Replica.Host=[hadoop57]
Hyperspace.Replica.Port=38040
Hypertable.Failover.Quorum.Percentage=100
Hypertable.Master.Port=38050
Hypertable.RangeServer.Port=38060
Hypertable.Verbose=true
ThriftBroker.Port=38080
pidfile=/opt/hypertable/current/run/ThriftBroker.pid
port=38080
reactors=24
verbose=true
1368007317 INFO ThriftBroker : (/root/src/hypertable/src/cc/Hyperspace/Session.cc:63) Hyperspace session setup to reconnect
1368007317 INFO ThriftBroker : (/root/src/hypertable/src/cc/ThriftBroker/ThriftBroker.cc:2497) Starting the server...
tcmalloc: large alloc 1668055040 bytes == 0xe4a000 @
tcmalloc: large alloc 1668055040 bytes == 0xe4a000 @
tcmalloc: large alloc 1668055040 bytes == 0xe4a000 @
tcmalloc: large alloc 1668055040 bytes == 0xe4a000 @
1368022577 INFO ThriftBroker : (/root/src/hypertable/src/cc/Common/Config.cc:577) Initializing ThriftBroker (Hypertable 0.9.7.5 (v0.9.7.5-0-g22129c7-dirty))...
Dfsbroker.log
May 8, 2013 5:16:01 PM org.hypertable.DfsBroker.hadoop.HadoopBroker Create
INFO: Created file '/hypertable/servers/master/log/mml/21' handle = 2
May 8, 2013 5:16:13 PM org.hypertable.AsyncComm.IOHandlerData handle_message_body
WARNING: Received response for non-pending event (id=0,version=1,total_len=72)
May 8, 2013 5:16:15 PM org.hypertable.AsyncComm.IOHandlerData handle_message_body
WARNING: Received response for non-pending event (id=0,version=1,total_len=72)
May 8, 2013 5:21:01 PM org.hypertable.DfsBroker.hadoop.HadoopBroker Remove
INFO: Removing file '/hypertable/tables/0/0/default/qyoNKN5rd__dbHKv/cs8
May 8, 2013 5:21:01 PM org.hypertable.DfsBroker.hadoop.HadoopBroker Remove
INFO: Removing file '/hypertable/tables/0/0/default/qyoNKN5rd__dbHKv/cs9
May 8, 2013 5:21:01 PM org.hypertable.DfsBroker.hadoop.HadoopBroker Remove
INFO: Removing file '/hypertable/tables/0/1/range/qyoNKN5rd__dbHKv/cs8
May 8, 2013 5:21:01 PM org.hypertable.DfsBroker.hadoop.HadoopBroker Remove
INFO: Removing file '/hypertable/tables/0/1/range/qyoNKN5rd__dbHKv/cs9
May 8, 2013 5:49:01 PM org.hypertable.DfsBroker.hadoop.HadoopBroker Close
INFO: Closing output file /hypertable/servers/master/log/mml/21 handle 2
May 8, 2013 5:49:01 PM org.hypertable.DfsBroker.hadoop.ConnectionHandler handle
May 8, 2013 5:49:17 PM org.hypertable.AsyncComm.IOHandler DeliverEvent
Closed 0 input streams and 0 output streams
May 8, 2013 5:49:17 PM org.hypertable.DfsBroker.hadoop.ConnectionHandler handle
INFO: Removing file '/hypertable/tables/0/1/range/qyoNKN5rd__dbHKv/cs9
May 8, 2013 5:49:01 PM org.hypertable.DfsBroker.hadoop.HadoopBroker Close
INFO: Closing output file /hypertable/servers/master/log/mml/21 handle 2
May 8, 2013 5:49:01 PM org.hypertable.DfsBroker.hadoop.ConnectionHandler handle
May 8, 2013 5:49:17 PM org.hypertable.AsyncComm.IOHandler DeliverEvent
Closed 0 input streams and 0 output streams
May 8, 2013 5:49:17 PM org.hypertable.DfsBroker.hadoop.ConnectionHandler handle
Hyperspace.log
1368024548 INFO Hyperspace.Master : (/root/src/hypertable/src/cc/Hyperspace/Master.cc:487) Destroying handle 14
1368024550 INFO Hyperspace.Master : (/root/src/hypertable/src/cc/Hyperspace/RequestHandlerDestroySession.cc:42) Destroying session 4
1368024550 INFO Hyperspace.Master : (/root/src/hypertable/src/cc/Hyperspace/Master.cc:317) destroyed session 4(Hypertable.RangeServer)
1368024550 INFO Hyperspace.Master : (/root/src/hypertable/src/cc/Hyperspace/Master.cc:468) Expiring session 4 name=Hypertable.RangeServer
1368024550 INFO Hyperspace.Master : (/root/src/hypertable/src/cc/Hyperspace/Master.cc:487) Destroying handle 10
1368024550 INFO Hyperspace.Master : (/root/src/hypertable/src/cc/Hyperspace/Master.cc:487) Destroying handle 12
1368024550 INFO Hyperspace.Master : (/root/src/hypertable/src/cc/Hyperspace/Master.cc:1533) Persisting lock released notifications
1368024550 INFO Hyperspace.Master : (/root/src/hypertable/src/cc/Hyperspace/Master.cc:1544) Finished persisting lock released notifications
1368024550 INFO Hyperspace.Master : (/root/src/hypertable/src/cc/Hyperspace/RequestHandlerDestroySession.cc:42) Destroying session 5
1368024550 INFO Hyperspace.Master : (/root/src/hypertable/src/cc/Hyperspace/Master.cc:317) destroyed session 5(Hypertable.RangeServer)
best regards,
mali