write error detected, all vm`s down

93 views
Skip to first unread message

Richard Lawrie

unread,
May 3, 2017, 6:55:22 AM5/3/17
to dedupfilesystem-sdfs-user-discuss
twice now

and then i get this for the garbage collection - using to export a datastore and all VM`s down again - not sure of the version - install is the .ovf from about 3 weeks ago


Garbage Collection
running
SSD500GB_DEDUPE
dedupe.choshinkai.com
SDFS Volume Cleanup Initiated for SSD500GB_DEDUPE
33%
8BB32123-3F64-60CD-DF07-A8373C0EF8E9
05/03/2017 11:29:14
12/31/1969 23:59:59

is not under heavy load both times this occurred

load average is going up to 7


top - 03:45:56 up 1 day, 17:57,  2 users,  load average: 7.00, 6.81, 4.98
Tasks: 167 total,   1 running, 166 sleeping,   0 stopped,   0 zombie
%Cpu(s):  0.2 us,  0.0 sy,  0.0 ni, 99.8 id,  0.0 wa,  0.0 hi,  0.0 si,  0.0 st
KiB Mem :  8175232 total,    75652 free,  2271496 used,  5828084 buff/cache
KiB Swap:  5145596 total,  5143168 free,     2428 used.  5816660 avail Mem

  PID USER      PR  NI    VIRT    RES    SHR S  %CPU %MEM     TIME+ COMMAND
 1225 root      20   0 7179388 3.203g 1.798g S   1.0 41.1 468:32.66 jsvc
 1091 root      20   0 5694316 659376   6556 S   0.3  8.1   9:39.30 java
    1 root      20   0   37700   5192   3428 S   0.0  0.1   0:01.79 systemd


root@dedupe:/var/log/sdfs# tail SSD500GB_DEDUPE-volume-cfg.xml.log
2017-05-03 03:29:36,300 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:36,350 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:36,400 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:36,427 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:36,458 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:36,468 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for -5058997057981721749=1 delob=1 bs                                                                                                                                                                                                                                                                                                                                  z=12058
2017-05-03 03:40:35,961 [sdfs] [org.opendedup.sdfs.io.SparseDedupFile] [383] [Thread-19]  - WriteCache has take over [120] seconds. There are still 3 in fl                                                                                                                                                                                                                                                                                                                                  ush
2017-05-03 03:42:43,769 [sdfs] [org.opendedup.sdfs.io.SparseDedupFile] [383] [Thread-19]  - WriteCache has take over [240] seconds. There are still 3 in fl                                                                                                                                                                                                                                                                                                                                  ush
2017-05-03 03:44:51,552 [sdfs] [org.opendedup.sdfs.io.SparseDedupFile] [383] [Thread-19]  - WriteCache has take over [360] seconds. There are still 3 in fl                                                                                                                                                                                                                                                                                                                                  ush
2017-05-03 03:46:59,367 [sdfs] [org.opendedup.sdfs.io.SparseDedupFile] [383] [Thread-19]  - WriteCache has take over [480] seconds. There are still 3 in fl                                                                                          



                                                                                                                                                                                                                                    
root@dedupe:/var/log/sdfs# tail -n 100 SSD500GB_DEDUPE-volume-cfg.xml.log
        at org.opendedup.sdfs.io.DedupFileChannel.writeFile(DedupFileChannel.java:334)
        at fuse.SDFS.SDFSFileSystem.write(SDFSFileSystem.java:790)
        at fuse.Filesystem3ToFuseFSAdapter.write(Filesystem3ToFuseFSAdapter.java:361)
Caused by: java.lang.NullPointerException
2017-05-03 03:29:32,352 [sdfs] [org.opendedup.sdfs.io.DedupFileChannel] [387] [Thread-118]  - error while writing to /opt/sdfs/metadata/SSD500GB_DEDUPE/files/nfs/WINDOWS 7/WINDOWS 7-flat.vmdk java.io.IOException: error while getting blocks 1 errors found
java.io.IOException: error while getting blocks 1 errors found
        at org.opendedup.sdfs.io.WritableCacheBuffer.initBuffer(WritableCacheBuffer.java:315)
        at org.opendedup.sdfs.io.WritableCacheBuffer.writeBlock(WritableCacheBuffer.java:388)
        at org.opendedup.sdfs.io.WritableCacheBuffer.write(WritableCacheBuffer.java:493)
        at org.opendedup.sdfs.io.DedupFileChannel.writeFile(DedupFileChannel.java:334)
        at fuse.SDFS.SDFSFileSystem.write(SDFSFileSystem.java:790)
        at fuse.Filesystem3ToFuseFSAdapter.write(Filesystem3ToFuseFSAdapter.java:361)
2017-05-03 03:29:32,352 [sdfs] [fuse.SDFS.SDFSFileSystem] [792] [Thread-118]  - unable to write to file/nfs/WINDOWS 7/WINDOWS 7-flat.vmdk
java.io.IOException: error while writing to /opt/sdfs/metadata/SSD500GB_DEDUPE/files/nfs/WINDOWS 7/WINDOWS 7-flat.vmdk java.io.IOException: error while getting blocks 1 errors found
        at org.opendedup.sdfs.io.DedupFileChannel.writeFile(DedupFileChannel.java:391)
        at fuse.SDFS.SDFSFileSystem.write(SDFSFileSystem.java:790)
        at fuse.Filesystem3ToFuseFSAdapter.write(Filesystem3ToFuseFSAdapter.java:361)
2017-05-03 03:29:32,369 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:32,466 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:32,487 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for -3846274934533402200=1 delob=1 bsz=12288
2017-05-03 03:29:32,615 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for 1645231692638055281=4 delob=4 bsz=108351
2017-05-03 03:29:33,214 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for -7351589483335662043=5 delob=5 bsz=201752
2017-05-03 03:29:33,351 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for -4896386094842461228=8 delob=8 bsz=61440
2017-05-03 03:29:33,433 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:33,495 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap] [945] [Thread-18]  - done initialize /opt/sdfs/hashdb/SSD500GB_DEDUPE/hashstore-sdfs-E2875ECE-7423-910A-C334-0303C5C4DBB9
2017-05-03 03:29:33,495 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap] [492] [Thread-18]  - opened hashtable /opt/sdfs/hashdb/SSD500GB_DEDUPE/hashstore-sdfs-E2875ECE-7423-910A-C334-0303C5C4DBB9 size = 0
2017-05-03 03:29:33,533 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:33,538 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for -6032152788059198278=2 delob=2 bsz=32798
2017-05-03 03:29:33,569 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for 6959058372475878492=4 delob=4 bsz=100407
2017-05-03 03:29:33,595 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for -9101138303672514963=1 delob=1 bsz=8192
2017-05-03 03:29:33,620 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for 7798466022790873607=3 delob=3 bsz=16384
2017-05-03 03:29:33,648 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for -4197761817507375240=3 delob=3 bsz=32768
2017-05-03 03:29:33,676 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for -984525360502919409=6 delob=6 bsz=49151
2017-05-03 03:29:33,701 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for -1336699566796732691=3 delob=3 bsz=74429
2017-05-03 03:29:33,726 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for -6035783326431830767=25 delob=25 bsz=285854
2017-05-03 03:29:33,780 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:33,821 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:33,833 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for 3131228515134663452=23 delob=23 bsz=359100
2017-05-03 03:29:33,866 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:33,912 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:33,966 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:33,983 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for 6537696491717846078=2 delob=2 bsz=46481
2017-05-03 03:29:34,011 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for 4085397519959990699=7 delob=7 bsz=143355
2017-05-03 03:29:34,054 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:34,113 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:34,125 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for -3521824669515422378=1 delob=1 bsz=20477
2017-05-03 03:29:34,150 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for -5085888832602931469=2 delob=2 bsz=73723
2017-05-03 03:29:34,178 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for -8554583122796552301=4 delob=4 bsz=16384
2017-05-03 03:29:34,207 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for 5860980072634053361=1 delob=1 bsz=58444
2017-05-03 03:29:34,232 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for 6841355554821600268=23 delob=23 bsz=305280
2017-05-03 03:29:34,301 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:34,312 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for 4477913883552004648=20 delob=20 bsz=6492139
2017-05-03 03:29:34,365 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:34,416 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:34,480 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:34,529 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:34,578 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:34,637 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:34,706 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:34,737 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:34,797 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:34,842 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:34,903 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:34,951 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:34,984 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:35,047 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:35,116 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:35,182 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:35,279 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:35,318 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:35,371 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:35,435 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:35,484 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:35,543 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:35,582 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:35,620 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:35,663 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:35,704 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:35,753 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:35,799 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:35,847 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:35,876 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:35,923 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:35,972 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:36,009 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:36,053 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:36,144 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:36,176 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:36,226 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:36,255 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:36,300 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:36,350 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:36,400 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:36,427 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:36,458 [sdfs] [org.opendedup.collections.ShardedFileByteArrayLongMap$Shard] [1259] [Thread-3]  - looped through everything
2017-05-03 03:29:36,468 [sdfs] [org.opendedup.sdfs.filestore.BatchFileChunkStore] [459] [Thread-3]  - remove requests for -5058997057981721749=1 delob=1 bsz=12058
2017-05-03 03:40:35,961 [sdfs] [org.opendedup.sdfs.io.SparseDedupFile] [383] [Thread-19]  - WriteCache has take over [120] seconds. There are still 3 in flush
2017-05-03 03:42:43,769 [sdfs] [org.opendedup.sdfs.io.SparseDedupFile] [383] [Thread-19]  - WriteCache has take over [240] seconds. There are still 3 in flush
2017-05-03 03:44:51,552 [sdfs] [org.opendedup.sdfs.io.SparseDedupFile] [383] [Thread-19]  - WriteCache has take over [360] seconds. There are still 3 in flush
2017-05-03 03:46:59,367 [sdfs] [org.opendedup.sdfs.io.SparseDedupFile] [383] [Thread-19]  - WriteCache has take over [480] seconds. There are still 3 in flush



Sam Silverberg

unread,
May 4, 2017, 8:03:52 AM5/4/17
to dedupfilesystem-sdfs-user-discuss
What version of opendedupe are you running on the appliance?

Richard Lawrie

unread,
May 4, 2017, 8:55:40 AM5/4/17
to dedupfilesystem-...@googlegroups.com
what ever is in the ovf
how do i find out?


On Thu, May 4, 2017 at 1:03 PM, Sam Silverberg <sam.sil...@gmail.com> wrote:
What version of opendedupe are you running on the appliance?

--
You received this message because you are subscribed to a topic in the Google Groups "dedupfilesystem-sdfs-user-discuss" group.
To unsubscribe from this topic, visit https://groups.google.com/d/topic/dedupfilesystem-sdfs-user-discuss/DvrIDf-H8KM/unsubscribe.
To unsubscribe from this group and all its topics, send an email to dedupfilesystem-sdfs-user-discuss+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Sam Silverberg

unread,
May 5, 2017, 10:41:25 AM5/5/17
to dedupfilesystem-...@googlegroups.com
Can you update to sdfs version 3.4 


--
You received this message because you are subscribed to the Google Groups "dedupfilesystem-sdfs-user-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dedupfilesystem-sdfs-user-discuss+unsubscribe@googlegroups.com.

Sam Silverberg

unread,
May 9, 2017, 12:49:26 AM5/9/17
to dedupfilesystem-...@googlegroups.com
Richard - I have a fix for this. It will be released in 3.4.1 within the next two days. Sorry for the delay.

On Fri, May 5, 2017 at 7:41 AM, Sam Silverberg <sam.sil...@gmail.com> wrote:
Can you update to sdfs version 3.4 

Reply all
Reply to author
Forward
0 new messages