RelStorage blobs: move from non-shared to shared directory


Maurits van Rees

Mar 31, 2015, 5:57:53 AM
to zo...@googlegroups.com
Hi,

I have a Plone 4.2 site with RelStorage 1.5.0, moving to RelStorage
1.6.0b3. Using Postgres.

It currently has the blobs stored in the relational database. We want
to move that out to a shared blob directory on NFS. Reasons:

- Postgres is running on a different server, so NFS may give a
performance improvement.
- Database size will be smaller, which seems a good idea.
- We have not dived into memcached yet, but I guess it helps for memory
usage if no blobs need to be loaded in both memcached and the zope instance.

Has anyone done that before? Any tips?

I can think of a few ways, and have experimented with two of them.


1. Use zodbconvert to convert from a RelStorage without shared blob
storage to a new RelStorage with shared blob storage. The config file
would be this:

<relstorage source>
  blob-dir /some/dir/var/blobcache
  shared-blob-dir false
  <postgresql>
    dsn dbname='client_plone4' user='client' host='localhost' port='5432' password='secret'
  </postgresql>
</relstorage>
<relstorage destination>
  blob-dir /some/dir/var/blobstorage
  shared-blob-dir true
  <postgresql>
    dsn dbname='client_plone4_blob_migration' user='client' host='localhost' port='5432' password='secret'
  </postgresql>
</relstorage>

This works. You can now use the new database with shared blob storage.
You can even use the old database with the new shared blob storage; the
database will just have a filled blob_chunk table and large objects that
are no longer used. That may be useful as a backup in case anything goes
wrong after all.
Using the old database has the advantage that you do not have to change
any connection strings, database user permissions, or backup scripts.
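
For reference, running the conversion is just the usual zodbconvert call
with that config file. A minimal sketch, assuming a buildout-generated
bin/zodbconvert (the config file name is just my example):

  $ bin/zodbconvert blobmigration.conf

If I remember correctly, zodbconvert also has a --dry-run option to check
the configuration before copying anything.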


2. Use a script to download the blobs to the new blobstorage.

I have created such a script, based on the zodbconvert script and
RelStorage.copyTransactionsFrom.
The script is here:
https://gist.github.com/mauritsvanrees/effc677c97f4d6fb633c
Config would be the same as for the zodbconvert above, except that we
only need the source, not the destination.

After you have run this, you switch the old database to use the new
shared blob directory.
Again, this leaves unused blob_chunks and large objects in the database.
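
To illustrate that switch: the RelStorage section for the existing
database would then look roughly like this, still using the old dsn but
pointing at the new shared directory (same example paths and dsn as
above, just a sketch):

<relstorage>
  blob-dir /some/dir/var/blobstorage
  shared-blob-dir true
  <postgresql>
    dsn dbname='client_plone4' user='client' host='localhost' port='5432' password='secret'
  </postgresql>
</relstorage>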


3. Iterate over the blob_chunk table, for each row download the blob as
above in idea number 2, then remove the row.

This should leave you with a clean database, without unused blob_chunks
and large objects lying about.
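
Roughly, the core of such a script could look like the sketch below. This
is not the actual script, just an illustration of the idea: I am assuming
the history-preserving PostgreSQL schema where blob_chunk has the columns
zoid, tid, chunk_num and chunk (with chunk holding the large object OID),
that each blob is stored as a single chunk on Postgres, and that the
bushy layout from ZODB.blob.FilesystemHelper matches what a shared blob
directory expects. Locking, batched commits and error handling are left out.

import os
import psycopg2
from ZODB.blob import FilesystemHelper
from ZODB.utils import p64

conn = psycopg2.connect(
    "dbname='client_plone4' user='client' host='localhost' "
    "port='5432' password='secret'")
fshelper = FilesystemHelper('/some/dir/var/blobstorage')
fshelper.create()  # creates the base directory and layout marker

cur = conn.cursor()
cur.execute("SELECT zoid, tid, chunk FROM blob_chunk WHERE chunk_num = 0")
for zoid, tid, loid in cur.fetchall():
    # Target filename in the standard ZODB blob layout for this oid/tid.
    filename = fshelper.getBlobFilename(p64(zoid), p64(tid))
    dirname = os.path.dirname(filename)
    if not os.path.exists(dirname):
        os.makedirs(dirname)
    # Export the large object to a client-side file.
    lob = conn.lobject(loid, 'rb')
    lob.export(filename)
    lob.close()

# Cleanup: unlink the large objects and empty the table.
cur.execute("SELECT lo_unlink(chunk) FROM blob_chunk")
cur.execute("TRUNCATE blob_chunk")
conn.commit()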


A question to check my understanding of the RelStorage code and tables.
- With shared blob storage the blobs are stored in a directory.
- With non-shared the blobs are stored in the blob_chunk table, which
uses large objects.
- When the non-shared blobs are downloaded to a shared blob storage as
above, the blob_chunk table can be safely emptied.
Right? Removing a blob_chunk row does not cascade to a removal of an
object_state row or something like that? No information is lost?

For ideas one and two, it could help to have a script to afterwards
remove the blobs from the database, as done in idea three.
Maybe that is just a one-liner in SQL, maybe some Python. I have not
tried this yet.
But I think I have read that a TRUNCATE on the blob_chunk table would
not remove the large objects, at least in Postgres. Am I correct?

Thoughts? Any existing scripts that I have missed that already do this?


--
Maurits van Rees: http://maurits.vanrees.org/
Zest Software: http://zestsoftware.nl

Maurits van Rees

Mar 31, 2015, 6:15:13 PM
to zo...@googlegroups.com
Maurits van Rees wrote on 31-03-15 at 11:57:
> 3. Iterate over the blob_chunk table, for each row download the blob as
> above in idea number 2, then remove the row.
>
> This should leave you with a clean database, without unused blob_chunks
> and large objects lying about.

Script is here:
https://gist.github.com/mauritsvanrees/7c754bfc8457ea329d0f

I am not used to writing code at this level, so there may very well be
stupid mistakes, especially with locking and committing, which I figured
out as best I could by looking at the current RelStorage code, mostly
the packing code. Only tried with Postgres. No tests... Use at your own
risk.

Comments are welcome.

It does seem to work though.

Before:

# select count(*) from blob_chunk;
count
-------
5724
(1 row)

# select count(*) from pg_largeobject;
count
---------
1515266
(1 row)


After:

# select count(*) from blob_chunk;
count
-------
0
(1 row)

# select count(*) from pg_largeobject;
count
-------
0
(1 row)

$ du -sh var/blobstorage
3,0G var/blobstorage

And no POSKeyErrors when using this as shared blob directory so far.

Jens W. Klein

Apr 1, 2015, 8:08:45 AM
to zo...@googlegroups.com
Hi Maurits!

On 2015-03-31 11:57, Maurits van Rees wrote:
> Hi,
>
> I have a Plone 4.2 site with RelStorage 1.5.0, moving to RelStorage
> 1.6.0b3. Using Postgres.
>
> It currently has the blobs stored in the relational database. We want
> to move that out to a shared blob directory on NFS. Reasons:
>
> - Postgres is running on a different server, so NFS may give a
> performance improvement.

This is also our experience. And x-sendfile/x-accel-redirect can reduce
some network load (if there is a lot of blob traffic).


> - Database size will be smaller, which seems a good idea.

Postgres backups are more fun without blobs, yes.
Keeping Postgres and the blobstorage in sync during backups is more
difficult, though.

> - We have not dived into memcached yet, but I guess it helps for memory
> usage if no blobs need to be loaded in both memcached and the zope
> instance.

From what I understand of the RelStorage code, only pickle loads are
cached in memcached:
https://github.com/zodb/relstorage/blob/master/relstorage/cache.py#L282
which is called using some dispatching magic:
https://github.com/zodb/relstorage/blob/master/relstorage/adapters/mover.py#L87
Blobs are not handled by the cache at all.

Unless I have overlooked something, this means that without a filesystem
blobstorage a blob is loaded from RelStorage on each access. So using
NFS should be much more efficient here anyway.
This is the way we converted it. It is stable but takes a while.


> 2. Use a script to download the blobs to the new blobstorage.
>
> I have created such a script, based on the zodbconvert script and
> RelStorage.copyTransactionsFrom.
> The script is here:
> https://gist.github.com/mauritsvanrees/effc677c97f4d6fb633c
> Config would be the same as for the zodbconvert above, except that we
> only need the source, not the destination.
>
> After you have run this, you switch the old database to use the new
> shared blob directory.
> Again, this leaves unused blob_chunks and large objects in the database.

How much faster is this way of conversion?

>
> 3. Iterate over the blob_chunk table, for each row download the blob as
> above in idea number 2, then remove the row.
>
> This should leave you with a clean database, without unused blob_chunks
> and large objects lying about.

Your script (in the other posting) for (3) should be the fastest way to
convert, right? I can't tell you whether the locking/committing and so
on is OK, but since it uses the RelStorage API it should be fine. I
think it is the most elegant way to get the blobs out of Postgres.

>
> A question to check my understanding of the RelStorage code and tables.
> - With shared blob storage the blobs are stored in a directory.
> - With non-shared the blobs are stored in the blob_chunk table, which
> uses large objects.
> - When the non-shared blobs are downloaded to a shared blob storage as
> above, the blob_chunk table can be safely emptied.
> Right? Removing a blob_chunk row does not cascade to a removal of an

I would say it can be removed completely.

> object_state row or something like that? No information is lost?

As far as I understand it, blob handling is decoupled in such a way that
the object state referring to a blob does not need to know anything
about the blob-storage implementation. The storage implementation is
responsible for delivering a blob by its id.

> For ideas one and two, it could help to have a script to afterwards
> remove the blobs from the database, as done in idea three.
> Maybe that is just a one-liner in SQL, maybe some Python. I have not
> tried this yet.
> But I think I have read that a TRUNCATE on the blob_chunk table would
> not remove the large objects, at least in Postgres. Am I correct?
>
> Thoughts? Any existing scripts that I have missed that already do this?

PostgreSQL provides a script to remove orphaned blobs:
http://www.postgresql.org/docs/9.4/static/vacuumlo.html
No idea if it helps here, but looks promising.

Jens
--
Klein & Partner KG, member of BlueDynamics Alliance

Maurits van Rees

Apr 2, 2015, 10:12:38 AM
to zo...@googlegroups.com
Hi Jens!

Thanks for sharing your thoughts.

Jens W. Klein wrote on 01-04-15 at 14:08:
> From what I understand of the RelStorage code, only pickle loads are
> cached in memcached:
> https://github.com/zodb/relstorage/blob/master/relstorage/cache.py#L282
> which is called using some dispatching magic:
> https://github.com/zodb/relstorage/blob/master/relstorage/adapters/mover.py#L87
>
> Blobs are not handled by the cache at all.
>
> Unless I have overlooked something, this means that without a filesystem
> blobstorage a blob is loaded from RelStorage on each access. So using
> NFS should be much more efficient here anyway.

Getting a PDF file of a few MB that is now stored on NFS seems very
snappy at least.

> > 3. Iterate over the blob_chunk table, for each row download the blob as
> > above in idea number 2, then remove the row.
> >
> > This should leave you with a clean database, without unused blob_chunks
> > and large objects lying about.
>
> Your script (in the other posting) for (3) should be the fastest way to
> convert, right? I can't tell you whether the locking/committing and so
> on is OK, but since it uses the RelStorage API it should be fine. I
> think it is the most elegant way to get the blobs out of Postgres.

Let me try it again and get some figures.
This is locally on my laptop, with an SSD disk, and the Postgres server
on that same laptop.

0. Fresh import of postgres dump, with blobs still in the database,
recently packed, about 26,000 transactions: 16 minutes.

1. Use zodbconvert to convert from non-shared blobcache to shared
blobstorage:
All 25961 transactions copied successfully in 27.8 minutes.

2. Use the blobdownload script to download the blobs of all
transactions, without changing anything in the database:
4577 blobs successfully downloaded from all 25961 transactions in
3.8 minutes.
BTW, subsequently it should be fine to truncate the blob_chunk and
pg_largeobject tables, if you have nothing else in your database that
may be using large objects. In my case this would have been fine.

3. Use blobchunkdownload to download using only the blob_chunk table,
and clean up this table and the large objects afterwards:
- Download phase: 1 minute, 8 seconds.
- Optional cleanup phase: an extra 5 minutes, 45 seconds.
- Total: 6.9 minutes

I compared the resulting blobstorage dirs of the three methods (diff -r)
and they were exactly the same.

Conclusions:

- zodbconvert is by far the slowest method.

- Downloading using the blob_chunk table (method 3) is by far the fastest.

- For the final cleanup, at least on Postgres, you may simply want to
truncate the blob_chunk and pg_largeobject tables, which should take
less than a second. Should be safe if you use this database (better:
this entire database server) only for this single RelStorage site.
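
For reference, that cleanup would be something like this (a sketch only;
truncating pg_largeobject touches a system catalog, so as far as I know
it needs superuser rights, and it throws away all large objects in the
database, not just the ones RelStorage created):

  TRUNCATE blob_chunk;
  TRUNCATE pg_largeobject;  -- on PostgreSQL 9.0+ probably pg_largeobject_metadata as well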

> > A question to check my understanding of the RelStorage code and tables.
> > - With shared blob storage the blobs are stored in a directory.
> > - With non-shared the blobs are stored in the blob_chunk table, which
> > uses large objects.
> > - When the non-shared blobs are downloaded to a shared blob storage as
> > above, the blob_chunk table can be safely emptied.
> > Right? Removing a blob_chunk row does not cascade to a removal of an
>
> I would say it can be removed completely.
>
> > object_state row or something like that? No information is lost?
>
> As far as I understand it, blob handling is decoupled in such a way that
> the object state referring to a blob does not need to know anything
> about the blob-storage implementation. The storage implementation is
> responsible for delivering a blob by its id.

That is my understanding too.

> > For ideas one and two, it could help to have a script to afterwards
> > remove the blobs from the database, as done in idea three.
> > Maybe that is just a one-liner in SQL, maybe some Python. I have not
> > tried this yet.
> > But I think I have read that a TRUNCATE on the blob_chunk table would
> > not remove the large objects, at least in Postgres. Am I correct?
> >
> > Thoughts? Any existing scripts that I have missed that already do this?
>
> PostgreSQL provides a script to remove orphaned blobs:
> http://www.postgresql.org/docs/9.4/static/vacuumlo.html
> No idea if it helps here, but looks promising.

I have not tried it, but it sounds like a TRUNCATE on the blob_chunk
table in combination with running this script would do the trick too.

... Well, I tried it after all. :-) Truncating blob_chunk and then
running vacuumlo on your specific database seems good to me. It does
still take almost 3 minutes in my case.

Note, for peace of mind (at least my mind), that vacuumlo only removes
large objects from one database:

sh-3.2$ time /opt/local/lib/postgresql93/bin/vacuumlo -n other_client_db
Connected to database "other_client_db"
Test run: no large objects will be removed!
Checking chunk in public.blob_chunk
Would remove 0 large objects from database "other_client_db".

real 0m0.040s
user 0m0.002s
sys 0m0.005s

Dry run with real db:

sh-3.2$ time /opt/local/lib/postgresql93/bin/vacuumlo -n
client_plone4_blob_migration
Connected to database "client_plone4_blob_migration"
Test run: no large objects will be removed!
Checking chunk in public.blob_chunk
Would remove 4577 large objects from database
"client_plone4_blob_migration".

real 0m0.040s
user 0m0.006s
sys 0m0.008s

Real run with real db:

sh-3.2$ time /opt/local/lib/postgresql93/bin/vacuumlo
client_plone4_blob_migration

real 2m50.050s
user 0m0.040s
sys 0m0.101s

Best,

Maurits van Rees

Apr 3, 2015, 3:55:04 PM
to zo...@googlegroups.com
Maurits van Rees wrote on 02-04-15 at 16:12:
>
> 3. Use blobchunkdownload to download using only the blob_chunk table,
> and cleanup this table and the large objects afterwards:
> - Download phase: 1 minute, 8 seconds.
> - Optional cleanup phase: an extra 5 minutes, 45 seconds.
> - Total: 6.9 minutes

Strangely, I tried it again today, and now the download phase takes
about 7 minutes. The cleanup phase is the same. The only difference
should be that I added a few other databases to my local cluster.
So: your mileage may vary.