Nexus 3 checksum-based storage not efficient - blob dublication

268 views
Skip to first unread message

Мартынов Илья

unread,
Jan 12, 2017, 1:13:17 AM1/12/17
to Nexus Users
Hello,

on Nexus 3, I've deployed 2 equal binaries as mvn artifacts, with different versions. in UI artifact looks like this:


then I've browsed the blob storage:

[root@srv-081ea775d57b mvn]# pwd
/opt/sonatype-work/nexus3/blobs/mvn

[root@srv-081ea775d57b mvn]# ll ./content/vol-07/chap-41/de94332f-e2d3-4451-9093-27d3b611408c.bytes ./content/vol-08/chap-10/98cc9528-4cc8-414a-8336-b872e3566e06.bytes
-rw-r--r-- 1 nexus nexus 3699719 Dec 15 19:20 ./content/vol-07/chap-41/de94332f-e2d3-4451-9093-27d3b611408c.bytes
-rw-r--r-- 1 nexus nexus 3699719 Dec 15 19:21 ./content/vol-08/chap-10/98cc9528-4cc8-414a-8336-b872e3566e06.bytes

[root@srv-081ea775d57b mvn]# sha1sum ./content/vol-07/chap-41/de94332f-e2d3-4451-9093-27d3b611408c.bytes ./content/vol-08/chap-10/98cc9528-4cc8-414a-8336-b872e3566e06.bytes
11027d4399c172a665a199ae2ff6f8347ce59adf  ./content/vol-07/chap-41/de94332f-e2d3-4451-9093-27d3b611408c.bytes
11027d4399c172a665a199ae2ff6f8347ce59adf  ./content/vol-08/chap-10/98cc9528-4cc8-414a-8336-b872e3566e06.bytes

[root@srv-081ea775d57b mvn]# stat ./content/vol-07/chap-41/de94332f-e2d3-4451-9093-27d3b611408c.bytes ./content/vol-08/chap-10/98cc9528-4cc8-414a-8336-b872e3566e06.bytes
  File: './content/vol-07/chap-41/de94332f-e2d3-4451-9093-27d3b611408c.bytes'
  Size: 3699719         Blocks: 7232       IO Block: 4096   regular file
Device: 8040b651h/2151724625d   Inode: 10092713    Links: 1
Access: (0644/-rw-r--r--)  Uid: ( 1000/   nexus)   Gid: ( 1000/   nexus)
Access: 2017-01-12 08:49:17.270295633 +0300
Modify: 2016-12-15 19:20:07.648035838 +0300
Change: 2016-12-15 19:20:07.658035726 +0300
 Birth: -
  File: './content/vol-08/chap-10/98cc9528-4cc8-414a-8336-b872e3566e06.bytes'
  Size: 3699719         Blocks: 7232       IO Block: 4096   regular file
Device: 8040b651h/2151724625d   Inode: 10092693    Links: 1
Access: (0644/-rw-r--r--)  Uid: ( 1000/   nexus)   Gid: ( 1000/   nexus)
Access: 2017-01-12 08:49:17.304295721 +0300
Modify: 2016-12-15 19:21:09.927347828 +0300
Change: 2016-12-15 19:21:09.934347751 +0300
 Birth: -


So, 2 equal blobs was not collapsed by storage to a single file with 2 references. Therefore we cannot benefit more effective disk utilization that checksum-based storage is possible to provide
E g on our production Nexus, for 1 unique artifact there is ~ 4 dublicates, so we would be able to store x5 more if this feature existed

Is it a known issue? Nexus version 3.1.0-04





Peter Lynch

unread,
Jan 16, 2017, 10:43:39 PM1/16/17
to Мартынов Илья, Nexus Users
This issue I think covers what you are asking about.



--
You received this message because you are subscribed to the Google Groups "Nexus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to nexus-users+unsubscribe@glists.sonatype.com.
To post to this group, send email to nexus...@glists.sonatype.com.
To view this discussion on the web visit https://groups.google.com/a/glists.sonatype.com/d/msgid/nexus-users/62da2d2b-3eef-4f9e-a59a-61276af0be80%40glists.sonatype.com.
For more options, visit https://groups.google.com/a/glists.sonatype.com/d/optout.

Мартынов Илья

unread,
Jan 18, 2017, 10:15:18 AM1/18/17
to Peter Lynch, Nexus Users
Reply all
Reply to author
Forward
0 new messages