On Wed, 2017-04-05 at 14:57:30 +0200, Steffen Grunewald wrote:
> Hi,
>
> I have run a beegfs-fsck on one of our filesystems, and ended up with the
> following:
>
> ...
> Step 3: Check for errors...
>
> * Duplicated inode IDs ...
> * Duplicated chunks ...
> >>> Found 17 errors. Detailed information can also be found in check.out.
>
> Found errors beegfs-fsck cannot fix. Please consult the log for more information.
>
>
> In the output file, I find entries like
>
> >>> Found duplicated Chunks for ID 5C-58DD2BB9-2
> * Found on target 6 in path uA98/58D3/F/4FA9-58D3F5C1-2
> * Found on target 6 in path uA98/58D3/F/4FA9-58D3F5C1-2
>
> (Yes, the "Found on target" lines are pairwise identical.)
I have found that all these chunks (hint: chunk/$path/$ID below mountpoint of storage
target) have been created within 3 minutes while there should have been no issues with
the cluster - but they also all reside on the same storage target, same server:
storage01: -rw-rw-rw- 1 user1 group2 66484 Apr 3 22:53 /mnt/storage3/chunks/uA98/58D3/F/4FA9-58D3F5C1-2/5C-58DD2BB9-2
storage01: -rw-rw-rw- 1 user1 group2 124215 Apr 3 22:53 /mnt/storage3/chunks/uA98/58D3/F/17C7-58D3F5C1-2/1E7-58DD2B7B-2
storage01: -rw-rw-rw- 1 user1 group2 39219 Apr 3 22:52 /mnt/storage3/chunks/uA98/58D3/F/16C6-58D3F84D-1/277-58DD2BB9-2
storage01: -rw-rw-rw- 1 user1 group2 28527 Apr 3 22:51 /mnt/storage3/chunks/uA98/58D3/F/4F56-58D3F5C1-2/306-58DD2B7B-2
storage01: -rw-rw-rw- 1 user1 group2 23345 Apr 3 22:51 /mnt/storage3/chunks/uA98/58D3/F/D7C3-58D3F598-1/376-58DD2B7B-2
storage01: -rw-rw-rw- 1 user1 group2 30704 Apr 3 22:51 /mnt/storage3/chunks/uA98/58D3/F/39CB-58D3F84D-1/445-58DD2BB9-2
storage01: -rw-rw-rw- 1 user1 group2 17908 Apr 3 22:52 /mnt/storage3/chunks/uA98/58D3/F/32F-58D3F876-2/46B-58DD2BB0-1
storage01: -rw-rw-rw- 1 user1 group2 22833 Apr 3 22:51 /mnt/storage3/chunks/uA98/58D3/F/3B0A-58D3F84D-1/480-58DD2BB0-1
storage01: -rw-rw-rw- 1 user1 group2 23407 Apr 3 22:51 /mnt/storage3/chunks/uA98/58D3/F/3BAA-58D3F84D-1/499-58DD2BB9-2
storage01: -rw-rw-rw- 1 user1 group2 21043 Apr 3 22:51 /mnt/storage3/chunks/uA98/58D3/F/8792-58D3F582-2/4A1-58DD2B71-1
storage01: -rw-rw-rw- 1 user1 group2 23407 Apr 3 22:51 /mnt/storage3/chunks/uA98/58D3/F/3BAA-58D3F84D-1/4DF-58DD2BB9-2
storage01: -rw-rw-rw- 1 user1 group2 124214 Apr 3 22:53 /mnt/storage3/chunks/uA98/58D3/F/17C4-58D3F5C1-2/4FC-58DD2B71-1
storage01: -rw-rw-rw- 1 user1 group2 56686 Apr 3 22:52 /mnt/storage3/chunks/uA98/58D3/F/2FAD-58D3F837-2/525-58DD2BB9-2
storage01: -rw-rw-rw- 1 user1 group2 17203 Apr 3 22:53 /mnt/storage3/chunks/uA98/58D3/F/19A6-58D3F5C1-2/542-58DD2B71-1
storage01: -rw-rw-rw- 1 user1 group2 27058 Apr 3 22:51 /mnt/storage3/chunks/uA98/58D3/F/3A91-58D3F84D-1/667-58DD2BB9-2
storage01: -rw-rw-rw- 1 user1 group2 13683 Apr 3 22:51 /mnt/storage3/chunks/uA98/58D3/F/3D4C-58D3F84D-1/70B-58DD2BB0-1
storage01: -rw-rw-rw- 1 user1 group2 13683 Apr 3 22:51 /mnt/storage3/chunks/uA98/58D3/F/3D4C-58D3F84D-1/727-58DD2BB0-1
I'm afraid there's an issue with the underlying XFS :( - migrate everything off it,
and clean it up?
> How can I find out which files have been affected, and how can I resolve this
> situation manually?
Also, "Duplicated chunks" usually isn't the last check to be performed. How do
I get beegfs-fsck to continue?
Would I be safe just to ignore the issues for now? Time for maintenance is running
out :(
- S