Hi David,
Not sure if you are a member of the IIIF Slack (http://bit.ly/iiif-slack), but there was a discussion on this topic recently in the #cookbook and #accessibility channels:
To try to summarise, the question was whether an HTML `alt` attribute on an image is the appropriate place to put the text that appears in the image - where that image is a digitised page of a printed book with OCR text, or a manuscript page with a transcription, rather than (for example) a photograph of a street scene that happens to contain some text in signage. IIIF's most common use case appears to fall into a bit of a gap when you try to follow the usual guidance to describe the image for accessibility: the significant text is not a description of the image but (usually) the text in the image, and a description of the image itself ("text of page 137") is not, in this case, the main attraction.
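To make that gap concrete, here's a minimal sketch (the image URL and the wording are invented purely for illustration) of the two options the `alt` attribute seems to offer:

```html
<!-- Option A: alt describes the image, so the page's actual text is
     invisible to a screen reader user -->
<img src="https://example.org/iiif/book1/page137/full/max/0/default.jpg"
     alt="Text of page 137" />

<!-- Option B: alt carries the OCR text itself - but alt is designed for a
     short description, and a whole page of text in an attribute gives the
     user no structure and no easy way to navigate or re-read it -->
<img src="https://example.org/iiif/book1/page137/full/max/0/default.jpg"
     alt="[the full OCR text of page 137]" />
```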
I think there are two recipes here. The first is one of IIIF modelling - how are transcriptions conveyed? How are captions for AV conveyed? Both are works in progress as recipes, though the first - transcriptions - is well-established IIIF practice.
The second recipe is the HTML representation of these IIIF models - how should that OCR text be represented in HTML to make it most accessible to screen readers and other assistive technologies, bearing in mind that those technologies cannot be expected to know anything about IIIF?
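For illustration only, and very much not a recommendation, here is one sketch of the kind of pattern a viewer or a static page might try: keep the `alt` short and expose the OCR text as ordinary text in the document, associated with the image (the URL and ids are invented):

```html
<figure>
  <!-- Short alt identifies the image; the transcription lives in the page as
       real text, so assistive technologies can navigate, search and re-read it -->
  <img src="https://example.org/iiif/book1/page137/full/max/0/default.jpg"
       alt="Page 137"
       aria-describedby="page-137-text" />
  <div id="page-137-text">
    <p>[OCR text of page 137 as ordinary, selectable HTML text]</p>
  </div>
</figure>
```

Whether aria-describedby is even appropriate for text of that length, or whether the transcription should simply sit alongside or behind the image as normal content, is exactly the sort of thing a recipe could pin down.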
This second question does not have a clear answer, and it needs one! All the pieces are in place to make billions of words of digitised text accessible - text that is already modelled as IIIF resources - so what should happen at the last step, where the IIIF model is transformed into HTML? Both for simple HTML representations and for those created by the UV, Mirador and other viewers: what's the best thing they can offer up to assistive technologies for the text of all those books and manuscripts?
Tom