Archivematica and Docker

311 views
Skip to first unread message

Gessy Junior

unread,
Feb 6, 2018, 1:58:58 PM2/6/18
to archivematica
Hi everyone!!!

Does anyone have any experience in Archivematica and Docker?

Thanks a lot!
Gessy

Geoffrey Brimhall

unread,
Feb 7, 2018, 11:46:25 AM2/7/18
to archivematica
Hi -

We've been working on one at 


It's at beta stage because we're still slowly working through different deployment use cases. The one that is good is the 1.6.1-beta1 tag, where everything is self-contained within a single docker image. The other betas are working on pulling services out of the container such as mysql, nginx.

Justin Simpson

unread,
Feb 7, 2018, 3:17:20 PM2/7/18
to archiv...@googlegroups.com
There are a couple of other repos where work on deploying Archivematica using Docker is being done:

https://github.com/artefactual-labs/am.git
This is beinig used as the default development environment by most Archivematica developers at the moment. It uses docker-compose and puts the different components of Archivematica into separate containers.  There are details in this readme: https://github.com/artefactual-labs/am/blob/master/compose/README.md

This repo works with the stable/1.7.x and qa/1.x branches of Archivematica.  An official 1.7.0 release has not been made, but is close, there is a v1.7.0-rc.2 tag available that is a good place to target.

https://github.com/JiscRDSS/rdss-archivematica
Jisc are a UK based organization that are running Archivematica in their Research Data Management Shared Service. This repo has a docker-compose based deploy method, and also has a couple of aws specific deployment methods, one based on docker-compose + docker-machine, the other using terraform. Both of the aws specific methods in that repo are untested, iirc. This repo adds other components to the default Archivematica stack, notably support for Shibboleth based integration and a NextCloud instance, and support for Arkivum.


This repo works with a fork of Archivematica that is being kept in sync with the stable/1.7.x branch.


It would be interesting to compare the U of Arizona example and these, to see how much in common they have.  Also interesting to hear if there are any other examples out there of Archivematica in containers.

Geoffrey Brimhall

unread,
Feb 22, 2018, 3:20:37 PM2/22/18
to archivematica
Does the https://github.com/artefactual-labs/am.git stable/1.7.x code base have support for shibboleth, 

Justin Simpson

unread,
Feb 23, 2018, 1:53:48 PM2/23/18
to archiv...@googlegroups.com
Hi Geoff,

tl;dr
The rdss-archivematica deployment scripts have explicit easy to use options to enable or disable shibboleth.  It could be done with the am.git version, but it would require a bit more work.

longer version
There are two components in Archivematica that have users - the dashboard and the storage service.  These are two separate Django applications.  The code required to make each of these Django apps work with shibboleth is included in the main code repositories (archivematica.git and archivematica-storage-service.git).  You can configure the Dashboard and the Storage Service to use shibboleth by setting the correct configuration parameters (described here and here ).

am.git and rdss-archivematica are both distribution repos ( a term we kind of made up recently).  They the code from archivematica.git and archivematica-storage-service.git as git submodules (in the case of am.git) or by running git clone commands from a script (in the case of rdss-archivematica.git).  You can choose which version of each repo is used - in this way these repos bring together a number of components, at specific versions, and deploy them together, with a particular set of configuration parameters.

In the case of rdss-archivematica (used in the Jisc Researd Data Shared Service project ) a few extra components are deployed, including NextCloud, an RDSS specific channel adapter, a different nginx container than the one used in am.git and a Shibboleth IdP container.  There is a diagram in this readme that shows all the possible containers available with rdss-archivematica.

rdss-archivematica has a concept of Service Sets - there are different docker-compose configuration files for different situations.  The 'qa' Service Set in rdss-archivematica does not include shibboleth, and is probably closest in functionality to what am.git does by default now.

The 'am-shib' Service Set uses a modified nginx container, that includes the Shibboleth FastCGI SP Application .

In theory, it would not be too difficult to bring the am-shib Service Set stuff over from rdss-archivematica into am.git, but that has not happened yet.

I'd be curious to hear any thoughts about this, does this approach make sense in other (i.e. non Jisc RDSS) environments?  


Justin Simpson
Director of Archivematica Technical Services
www.artefactual.com
604-527-2056

--
You received this message because you are subscribed to the Google Groups "archivematica" group.
To unsubscribe from this group and stop receiving emails from it, send an email to archivematica+unsubscribe@googlegroups.com.
To post to this group, send email to archiv...@googlegroups.com.
Visit this group at https://groups.google.com/group/archivematica.
For more options, visit https://groups.google.com/d/optout.

Reply all
Reply to author
Forward
0 new messages