Doodle: Special Topic: SDB vs TDB

40 views
Skip to first unread message

Andrew Woods

unread,
Oct 22, 2019, 10:38:57 AM10/22/19
to VIVO Tech, VIVO Committers
Hello All,

This message is an invitation to join an upcoming discussion on the embedded triplestore technologies that (can) underlie VIVO, specifically Jena SDB and TDB.
https://jena.apache.org/documentation/sdb/
https://jena.apache.org/documentation/tdb/

Please complete the following Doodle poll before Saturday, Oct 26th:
https://doodle.com/poll/hktt3w7r4pgtghh3

In preparation for the discussion, you are encouraged to collect your experiences with both SDB and TDB, as well as facts that will help inform VIVO application defaults and community recommendations.

Regards,
Andrew

Andrew Woods

unread,
Oct 22, 2019, 11:04:49 AM10/22/19
to VIVO Tech, VIVO Committers
Update: The Doodle poll was set for U.S. Eastern Time (9am and 10am). Please convert to your local:
https://www.timeanddate.com/worldclock/meetingtime.html?iso=20191107&p1=179&p2=136&p3=37

Regards,
Andrew

________________________________________
From: Andrew Woods
Sent: Tuesday, October 22, 2019 10:38 AM
To: VIVO Tech
Cc: VIVO Committers
Subject: Doodle: Special Topic: SDB vs TDB

Andrew Woods

unread,
Oct 26, 2019, 10:34:22 AM10/26/19
to VIVO Tech, VIVO Committers, leadersh...@vivoweb.org
Hello All,
Please join the upcoming Special Topics call to discuss triplestores in the VIVO context. Specifically, we will be assessing SDB [1], TDB [2], and externalized triplestores with an eye towards community recommendations and VIVO's default configuration.

The call will be Friday, Dec 6th @9am Eastern Time.
https://www.timeanddate.com/worldclock/meetingtime.html?month=12&day=6&year=2019&p1=179&p2=136&p3=37&iv=0

Initial agenda and call-in details:
https://wiki.duraspace.org/display/VIVO/2019-12-06+-+Special+Topic+-+TDB+vs+SDB

Regards,
Andrew
[1] https://jena.apache.org/documentation/sdb/
[2] https://jena.apache.org/documentation/tdb/

Andrew Woods

unread,
Nov 27, 2019, 9:39:12 AM11/27/19
to VIVO Tech, VIVO Committers
Hello All,

In advance of the Dec 6th SDB/TDB special topics call, it would be helpful if everyone came to the meeting with facts that will help inform VIVO's default behavior and general recommendations to the community regarding the use of SDB and/or TDB.

The questions that we will be exploring include:
- Performance under specific usage scenarios (read, write: bulk load, many small updates, etc)
- Sustainability
- Durability (is corruption more of a risk with one over the other?)
- Scale
- Feature set

Any supporting evidence and facts that you can collect before the call would be very helpful.

To facilitate discussion, Mike has created the following table. Please feel free to update directly:
https://docs.google.com/document/d/1xUsxvcYo4N9U2nzN0OUmR44wZPU7A7DJESRA7H_Hfe0/edit?usp=sharing

Regards,
Andrew

________________________________________
From: Andrew Woods <andrew...@lyrasis.org>
Sent: Saturday, October 26, 2019 10:34 AM
To: VIVO Tech
Cc: VIVO Committers; leadersh...@vivoweb.org
Subject: Special Topic: SDB vs TDB

Andrew Woods

unread,
Dec 6, 2019, 2:34:04 PM12/6/19
to VIVO Tech, VIVO Committers
Hello All,

Thank you to all who participated in today's Special Topic call.

Although we have not yet resolved the top-level objective of determining both a community recommendation between the Jena triplestore options (SDB, TDB, and TDB2) and the related choice of VIVO's default triplestore, several pertinent points were surfaced:
- Jena community development is going in the direction of TDB2
- TDB has a slightly higher practical limit on number of triples/quads it has demonstrated supporting
- TDB is likely more stable / bullet-proof, as it sees the most usage in the broader Jena community
- Performance varies across triplestores based on many factors: usage patterns, client-side implementation, system configuration, etc
- The best way to "future-proof" VIVO is avoid coupling too closely with any component... best to implement against standards

There was also constructive conversation about the risk of corruption, and approaches/benefits related to SDB and TDB(2).

The notes are recording are available:
- https://wiki.lyrasis.org/display/VIVO/2019-12-06+-+Special+Topic+-+TDB+vs+SDB
- http://bit.ly/2019-12-vivo-sdb-tdb

Next steps will include:
- Reflecting on the points raised in this call, and collecting outstanding questions (from you all!)
- As appropriate, scheduling a follow-on meeting in early 2020.

Best regards,
Andrew

Andrew Woods

unread,
Dec 10, 2020, 10:49:42 AM12/10/20
to VIVO Tech, VIVO Committers
Hello All,

As a holiday treat, please join a VIVO Special Topic meeting next Wed, Dec 16th at 10am ET / 4pm CET to discuss the findings from a recent TIB/UQAM sprint focused on producing and consuming VIVO messages via Kafka. One of the objectives of the meeting is to determine if the opportunities surfaced with these initial investigates resonate with community needs and interests.

https://wiki.lyrasis.org/display/VIVO/2020-12-16+-+Special+Topic+-+VIVO+Kafka+Ingest

Regards,
Andrew

Andrew Woods

unread,
Dec 18, 2020, 1:08:27 PM12/18/20
to VIVO Tech, VIVO Committers
Hello All,

For those of you who may have missed the meeting or who simply want to relive the excitement, the VIVO / Kafka / Ingest / ORCID session recording is available:
- http://bit.ly/2020-12-vivo-kafka

It is also linked off of the meeting page:
- https://wiki.lyrasis.org/display/VIVO/2020-12-16+-+Special+Topic+-+VIVO+Kafka+Ingest

Regards,
Andrew

________________________________________
From: Andrew Woods <andrew...@lyrasis.org>
Sent: Thursday, December 10, 2020 10:49 AM
To: VIVO Tech
Cc: VIVO Committers
Subject: Special Topic: VIVO / Kafka / Ingest / ORCID
Reply all
Reply to author
Forward
0 new messages