2019-10-22 Dataverse Community Call
Agenda
* Community Questions
Attendees
* Julian Gautier (IQSS)
* Jim Myers (QDR, TDL)
* Jamie Jamison (UCLA)
* Phil Durbin (IQSS)
* Courtney Mumma (TDL)
* Laura Waugh (chair, TDR Steering Committee)
Notes
* (Phil) Dataverse 4.18 is the next planned release. The update includes a new Preview tab for files on the file landing page and APIs for token management. See the draft release notes at https://github.com/IQSS/dataverse/blob/eb21955f990aeedc1d4fc2d91b6108dbb59fb8b0/doc/release-notes/4.18-release-notes.md
* (Jim) Will all recent Make Data Count pull requests be included in 4.18? Such as https://github.com/IQSS/dataverse/pull/6265
* (Phil) I'm not sure but there's a very good chance. That pull request is in QA on our board: https://github.com/orgs/IQSS/projects/2#card-27565065
* Community Questions
* (Jim, QDR) Anyone else seeing occasional errors connecting to AWS S3? Can't access the bucket, can't get a file. Should we add a retry feature in Dataverse?
* (Phil) This is probably low priority for Harvard Dataverse because it hasn't come up, to my knowledge. As always, pull requests are welcome.
* (Courtney) FYI: Re: direct S3 upload/remote storage work at TDL. We've identified members who would like to store larger data on site at their premises, for example at the Texas Advanced Computing Center (TACC) for UT researchers. We've hired Jim to set up the “TRSA”-ish thing.
* (Jim) Let's be careful about the acronym TRSA. TDL uses an S3 bucket that's local to their servers. They'd like to point to TACC for large files. We use the redirect to avoid streaming the file through Glassfish. AWS has a concept of pre-signed URLs for upload. It may not make sense to put big files through ingest. I'll write something up.
* (Phil) Eskimos have seven words for snow. We could probably use some more terms for things that look and act a bit like Odum's TRSA implementation.
* (Phil) I have a dream of demo.dataverse.org being backed by dataverse-kubernetes so that we can play with the integrations that are available in the Dataverse ecosystem. Imagine spinning up your own demo environment that has the Data Capture Module (DCM) for rsync, Geoconnect/WorldMap, OSF, OJS, Whole Tale, Archivematica, etc. There's already an issue for DCM: https://github.com/IQSS/dataverse-kubernetes/issues/68
* (Courtney) I'm also curious about Amazon Linux 1 being unsupported after June 2020 and whether there are plans for that.
* (Phil) If you’re running on CentOS 7 or RedHat you’re in good shape. Dataverse has no active support for Amazon Linux 1 now. I've never heard of it.
* (Phil) With regard to the TRSA-ish thing, another thought is that reingest APIs are available if you decide to skip ingest when using the pre-signed AWS URLs for file upload: http://guides.dataverse.org/en/4.17/api/native-api.html#reingest-a-file
* (Jim) (After the call: for QDR, we added uningest/reingest options to the file edit menus - I think for admins/curators only. Haven’t gotten to create an issue/see if a PR would be useful for the community…)
* (Phil) Great idea! Please go ahead and create an issue. Hopefully that API endpoint is backed by a command so you can piggyback off whatever permissions it requires.
* (Phil) Phil’s Crazy Ideas, continued
* DataverseTV - a way to aggregate video of talks (keynotes, presentations, etc) from the community about Dataverse
* Phil will create a spreadsheet and Courtney will help populate it. She is planning another video series. The new spreadsheet is linked from https://groups.google.com/d/msg/dataverse-community/kJEVDqK9Yf0/mpPbVFxKCgAJ
* New map: https://github.com/IQSS/dataverse-installations
* If you'd like to help: https://twitter.com/philipdurbin/status/1186659282726469633
* Project boards from installations of Dataverse: https://github.com/orgs/IQSS/projects
* (Jamie) I'll make one for UCLA.
* (Phil) Here's how ADA made theirs: https://github.com/IQSS/dataverse/issues/6221
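
Follow-up sketch for Jim's question under Community Questions about occasional S3 connection errors and whether Dataverse should retry. This is only an illustration of a generic retry-with-exponential-backoff pattern, not Dataverse code and not the AWS SDK's own retry mechanism. Dataverse itself is written in Java; Python is used here for brevity. Every name below (with_retries, flaky_fetch, the default delays, the choice of which exceptions count as transient) is a hypothetical assumption.

```python
import random
import time


def with_retries(operation, attempts=4, base_delay=0.5,
                 retry_on=(ConnectionError, TimeoutError), sleep=time.sleep):
    """Run operation(), retrying transient failures with exponential backoff.

    All names here are hypothetical; nothing below is Dataverse or AWS SDK code.
    """
    for attempt in range(1, attempts + 1):
        try:
            return operation()
        except retry_on:
            if attempt == attempts:
                raise  # out of attempts: surface the last error to the caller
            # Wait base_delay, then 2x, then 4x, ... plus up to 50% random jitter.
            delay = base_delay * 2 ** (attempt - 1)
            sleep(delay + random.uniform(0, delay / 2))


if __name__ == "__main__":
    calls = {"count": 0}

    def flaky_fetch():
        # Stand-in for "get a file from the bucket": fails twice, then succeeds.
        calls["count"] += 1
        if calls["count"] < 3:
            raise ConnectionError("simulated S3 connection error")
        return b"file contents"

    print(with_retries(flaky_fetch, base_delay=0.01))
    print("attempts used:", calls["count"])
```

Only errors listed in retry_on are retried, so permission or "not found" failures would still surface immediately. The jitter is there so that many clients hitting the same transient outage do not all retry at the same instant.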