Good Morning folks,
So happens that Document uploading (also in our current reality) using other than just the famous and (in)famous PDF (and PDF/X, etc) is an actual use case/ need! In old times (my times) people used to upload documents in PDF/A to ensure archival preservation but there is right now a trend on letting all media conversion happen on the browser (thank you google docs for making not-available your code but making what you do the standard!) so, i was thinking
A) I'm gonna ask people what they want?
B) I'm gonna propose some options
C) I'm gonna code
So for A) what is that you want people? What are your use cases? Do you want to upload DOC/DOCX/PPTX/PPT etc and
Option 1: Allow users to see them as they are created (in their original form, or close to that.. remember many formats can be commercial/propieraty)
Option 2: Obscure the fact/format and unify into a single format (like HTML e.g, a favorite of the self-publishing-movement or PDF? Hated my many/loved by the other 50%)
Option 3: Other? Both? None?
for B) I propose what can be done without becoming ourselves google inc.
- 1. A Viewer/Formatter that can use Google.com and Office (MS) online viewers to display/render any MS Office propietary format inline (iframe..). tested and it works if the document in archipelago can publicly be accessible.
- 2. A more generic local viewer that requires you to upload the same MS documents in their respective Open Standard formats (Can be done in MS Word/Open office, etc) and can render them online: benefit: unified viewer experience, of course there could be edge cases and things we can not control (like password protected documents, wordstar or wordperfect documents from 1991, etc)
- 3. A post processor that unifies file formats/standards. This, given the fact it is a binary (and a super cool one named... wait for it... no, i won't share it here!..oh well, all right, its named pandoc) can read almost any format and output anyformat. From epub to emacs. Question would be unifies to which one? I like the idea of rendering directly HTML. What is even more portable than that? But i can be convicend otherwhise specially if people are very into the formatting / layouts than just the content in a readable way
C) i will do 1 and 2, and will bring 3 into Strawberry Runners as an option. Want to help? Please!!
All this said. I have an additional question
How would anyone would like Archipelago could/decide/which of this option applies to you when someone hits a Digital object page? Basically, when/how to use 1 or 2? Some automatic display? based on the files that are present? based on the rdf type ? Like if its of
schema.org type Document?
If any of you have time for a comment here, or some ideas (please reply to all, i get super nice replies that go directly into my inbox but sadly nobody else sees, we are all here to learn and share, there are no wrong answers or reasons to be shy, really) i would really appreciate that.
Thanks a lot
Diego Pino
Metro.org