I'm having a real problem keeping a mount active.
I have 3 data centers, each with 1 MRC, 1 DIR, and 2 OSDs. I'm running WqRq replication with 3 replicas.
mount command:
# mount.xtreemfs -o allow_other --vivaldi-enable --vivaldi-enable-dir-updates <fqdn>/vol /path/to/mount
I'm running rsync -avP /source/path /path/to/mount/ and after a while it prints a stream of messages like the one below and exits with a failure:
rsync: recv_generator: mkdir "/path/to/mount/subdir/anotherdir/athirddir/whatever.file" failed: Transport endpoint is not connected (107)
xtfsutil on the client mount shows in part:
Owner                root
Group                root
Type                 volume
Available/Used Space 47.2383919 TB / 331 MB
Num. Files/Dirs      1542 / 1153
Access Control p.    POSIX (permissions & ACLs)
OSD Selection p.     1000,3002
Replica Selection p. default
Default Striping p.  STRIPING_POLICY_RAID0 / 1 / 128kB
Default Repl. p.     WqRq with 3 replicas
Snapshots enabled    no
The source data is about 14G across 311188 files.
At the default log level, none of the OSDs, MRCs, or DIRs report any errors, and the client logs show nothing of interest either.
The mount just dies and I'm forced to umount /path/to/mount and perform the above mount command again to get it moving once more.
I've checked the performance of all the systems, and none of them shows high CPU load, I/O wait, or any other obvious bottleneck.
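
For reference, the manual recovery I go through each time can be sketched as a small shell helper. This is only a rough sketch, not a fix: the function names mount_is_dead and remount and the MOUNT_POINT / VOLUME_URL variables are made up for illustration, the paths are the same placeholders used above, and it assumes a dead mount shows up as a stat failure carrying the same "Transport endpoint is not connected" text that rsync reports.

```shell
#!/bin/sh
# Sketch only. MOUNT_POINT and VOLUME_URL are hypothetical variables holding
# the placeholder values from the mount command in this post.
MOUNT_POINT="${MOUNT_POINT:-/path/to/mount}"
VOLUME_URL="${VOLUME_URL:-<fqdn>/vol}"

# mount_is_dead PATH
# Returns 0 if stat on PATH fails with ENOTCONN, 1 in every other case
# (including a healthy path or a path that simply does not exist).
mount_is_dead() {
    # LC_ALL=C keeps the error text in English so the match below works.
    # Only stderr is captured; stdout is discarded.
    err=$(LC_ALL=C stat "$1" 2>&1 >/dev/null) && return 1
    case "$err" in
        *"Transport endpoint is not connected"*) return 0 ;;
        *) return 1 ;;
    esac
}

# remount
# Repeats the manual steps: unmount, then run the same mount command again.
remount() {
    umount "$MOUNT_POINT"
    mount.xtreemfs -o allow_other --vivaldi-enable --vivaldi-enable-dir-updates \
        "$VOLUME_URL" "$MOUNT_POINT"
}
```

Running mount_is_dead "$MOUNT_POINT" && remount before retrying rsync reproduces what I currently do by hand. It only works around the symptom and says nothing about why the client drops the mount.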