1382498906 ERROR cdrimport_test : (/root/src/hypertable/src/cc/Hyperspace/ClientKeepaliveHandler.cc:173) Master session (97) error - HYPERSPACE expired session
1382498908 ERROR cdrimport_test : set_cells (/root/src/hypertable/src/cc/Hypertable/Lib/TableMutator.cc:116): Hypertable::Exception: Problem getting attribute 'Location' of hyperspace file 'UNKNOWN' - COMM broken connection
at void Hyperspace::Session::attr_get(uint64_t, const std::string&, Hypertable::DynamicBuffer&, Hypertable::Timer*) (/root/src/hypertable/src/cc/Hyperspace/Session.cc:546)
1382498908 ERROR cdrimport_test : Commit (/home/jack/mnt2/MCCloud_V2.0/foundation/../common/incl/DLHypertableClient.h:328): Hypertable::Exception: Problem getting attribute 'Location' of hyperspace file 'UNKNOWN' - COMM broken connection
at void Hyperspace::Session::attr_get(uint64_t, const std::string&, Hypertable::DynamicBuffer&, Hypertable::Timer*) (/root/src/hypertable/src/cc/Hyperspace/Session.cc:546)
2013-10-23 11:28:29 commitData success
2013-10-23 11:28:29 file size:67108860 use time:32
2013-10-23 11:28:29 total_filesize:[63M] size/s:[1M/s]
2013-10-23 11:28:29 HandleFile: //data02/cloudilbak/121/1352072004-1352329313-bssap-1-zh_cloud121.cdr
1382498909 INFO cdrimport_test : (/root/src/hypertable/src/cc/AsyncComm/IOHandlerData.cc:242) socket read(57, len=38) failure : Connection timed out
1382498909 INFO cdrimport_test : (/root/src/hypertable/src/cc/AsyncComm/ConnectionManager.cc:389) Received event Event: type=DISCONNECT from=
172.16.23.164:380601382498909 INFO cdrimport_test : (/root/src/hypertable/src/cc/AsyncComm/ConnectionManager.cc:429) Event: type=DISCONNECT from=
172.16.23.164:38060; Problem connecting to Root RangeServer, will retry in 3000 milliseconds...
2013-10-23 11:28:40 commitData
1382498933 INFO cdrimport_test : (/root/src/hypertable/src/cc/AsyncComm/IOHandlerData.cc:242) socket read(56, len=38) failure : Connection timed out
1382498933 INFO cdrimport_test : (/root/src/hypertable/src/cc/AsyncComm/ConnectionManager.cc:389) Received event Event: type=DISCONNECT from=
172.16.23.164:380601382498933 INFO cdrimport_test : (/root/src/hypertable/src/cc/AsyncComm/ConnectionManager.cc:429) Event: type=DISCONNECT from=
172.16.23.164:38060; Problem connecting to Root RangeServer, will retry in 3000 milliseconds...
1382498936 ERROR cdrimport_test : set_cells (/root/src/hypertable/src/cc/Hypertable/Lib/TableMutator.cc:116): Hypertable::Exception: Problem getting attribute 'Location' of hyperspace file 'UNKNOWN' - HYPERSPACE invalid handle
at void Hyperspace::Session::attr_get(uint64_t, const std::string&, Hypertable::DynamicBuffer&, Hypertable::Timer*) (/root/src/hypertable/src/cc/Hyperspace/Session.cc:546)
1382498936 ERROR cdrimport_test : Commit (/home/jack/mnt2/MCCloud_V2.0/foundation/../common/incl/DLHypertableClient.h:328): Hypertable::Exception: Problem getting attribute 'Location' of hyperspace file 'UNKNOWN' - HYPERSPACE invalid handle
at void Hyperspace::Session::attr_get(uint64_t, const std::string&, Hypertable::DynamicBuffer&, Hypertable::Timer*) (/root/src/hypertable/src/cc/Hyperspace/Session.cc:546)
I think some block maybe corrupted, but executing 'hadoop fsck /hypertable' denotes all blocks are healthy. At last, no others way, i restart the cluster. All errors disappear, it looks okay.
Although it's okay now, but i want to know the real reason. The cluster is running on CDH4.3.0+Hypertable0.9.7.12. The attachment is the log of these two days.