I’m working with two pretty key interest groups who are bumping up against our current limitation of linking externally (online, out of the container) only to audio and video. We set those limitations because (1) we were conscious of the need for streaming audio and video (it being impractical to embed many audio and video files in the EPUB itself) and (2) we were (and are) concerned about exposing what should be a safe and self-contained EPUB to potentially unsecure external resources. (We also have limits on scripts for the latter reason especially.)
But here’s what I’m encountering:
MAGAZINE PUBLISHERS have “media resources” that they need or want to provide externally to the EPUB, and which include but are not limited to audio and video. There are two fundamental examples: (1) slideshows or image galleries that are “too big” for inclusion but which are image files, not audio and video; and (2) media resources that need to be dynamic, that change from time to time.
TEXTBOOK PUBLISHERS have the many of the same concerns as magazine publishers in this regard. But in addition, it is VERY common for a textbook to link to other ancillary resources, often through an LMS (Learning Management System) or CMS (Course Management System, not Content Management System). These can be many types of files, even things like PDFs, spreadsheets, etc. Of course in a print book effort is expected to get to those other resources, but in an electronic book a user expects to be able to link to them (understanding that you’d need to be online, not offline, to do so, just as with the streaming media).
And of course there are many other kinds of publishers who share these same issues as well (trade, scholarly, scientific—I could go on and on). They’re just coming to the fore for me with these two.
The fact that we currently don’t enable linking to these kinds of external resources is going to be a serious problem for these folks. I know it’s late in the game, but I really think we should reconsider that limitation to streaming audio and video only.
--Bill Kasdorf
On Wed, Jul 27, 2011 at 2:40 PM, Bill Kasdorf
<bkas...@apexcovantage.com> wrote:
> But in addition, it is VERY common for a textbook
> to link to other ancillary resources, often through an LMS (Learning
> Management System) or CMS (Course Management System, not Content Management
> System).
Would you clarify which requirements these publications have when
referencing these resources beyond what's
provided by a (lowly) link in the content itself?
<p>For more on ant colonies, see <a
href="http://learning-management-system/ant-colonies.m4v">this video
(external link)</a>.</p>
<p>To get a better sense of how boring these can be, see <a
href="http://course-management-system/q4results.ppt">this spreadsheet
(external link)</a>.</p>
Thanks,
Keith
The “third-party resources” aspect of the publication resource definition may be to blame here (I say, being the one who added that para). The idea was that the content in the container is first party as it’s required for direct rendering/referencing and you know it belongs to the current EPUB, and everything linked to externally can be lumped under the label of a third party resource since there’s no reliable way to make statements about ownership once you leave the container (and no need to as those resources aren’t required for rendering).
In hindsight, it’s probably needlessly confusing and I can see how it could get read as all my things I link to have to be in the container.
But an <a> tag that references a resource in the container using a relative IRI requires that resource to be listed in the manifest and included in the container, which also makes “outbound hyperlinks” a bit vague. Maybe it would be better to rewrite the paragraph in question along the lines of:
Examples of resources that are not Publication Resources include those
identified by the Package Document link element and those identified in outbound
hyperlinks that resolve outside the container (e.g., identified in [HTML5]
a
element href
attributes).
Thoughts welcome, though.
Matt
I also heard similar concerns, and the specific subject of using remote fonts as another use case for external links has been brought up by Makoto a while ago. The issue here (at least, one of them, as I see it) is that access to some of the resources is protected and requires a mechanism to identify where the request is coming from. Sometimes it is accomplished by referrer checking on a server side, other times it is done by a browser (e.g. same-origin restriction).
In general, for the external resources to be available and reliable, a request has to be provided with a proper HTTP Referrer header. I have no idea what this header would be if externally linked content is requested while viewing EPUB content (is it “file”, “localhost” or something else?), but if we want to allow this to happen, and I assume we do and should allow it – we also need to specify what the referrer header should be. I’ve spent some time thinking about it and the “EPUB Unique ID” seem to be a good candidate for it – it will uniquely identify the origin of a request, and would allow server-side access control to a protected external content. There may be other things we need to do to address all possible use cases, and I think the question raised by Bill is both timely (better late than never) and very appropriate.
Thank you,
Vladimir
Leonard,
While many of the reasons you mentioned have historically hampered the adoption of remote (or even embedded) fonts in various document formats, the most recent advancements in font technologies, publication of the OpenType font format as the ISO standard, the adoption of WOFF as a single interoperable webfont format, and wide-spread support for CSS3 @font-face in all major browsers have dramatically changed the overall landscape.
I believe that addressing the remote fonts today is both doable and appropriate – this is a problem that effectively has already been solved on the web, and will be IMO much easier to address in EPUB. The same font file that can be encapsulated in local EPUB content can be delivered online from a remote location. It’s 100% under publisher’s control, and the fonts can be uniquely identified by file name and location so there is no risk of getting a “wrong” font (although there is a risk of not getting the font when you need it). Virtually all existing web font services offer benefits that include CDN and subsetting, so that the fonts can be delivered fast, and from the source that is highly reliable. And, if a font is available as a webfont via a web font service, content authors need not be concerned with legal/licensing issues since those have been addressed by a web service provider.
Regards,
I just want to point out that my original issue had nothing to do with remote fonts. I agree with Garth that that was thoroughly discussed in the WG and we should leave it alone for now.
As for my original issue—that publishers such as magazine and textbook publishers need to be able to provide external links to things other than streaming audio and video—based on the following from the spec at http://idpf.org/epub/30/spec/epub30-publications.html#sec-resource-locations:
Publication Resource Locations
All Publication Resources must be located in the EPUB Container, with the following exceptions:
· › Audio resources may be located in the Container or remotely.
· › Video resources may be located in the Container or remotely.
Authors should prefer locating audio and video resources in the Container to allow the user access to the entire presentation regardless of connectivity status.
Perhaps we are just dealing with a semantic issue that some editing can clarify, prompted by the first two replies to my original e-mail, Keith’s and Matt’s. The problem may come down to clarifying what is meant by a “publication resource.” (We may be clear on that elsewhere and simply need to be clearer in the spot I just cited.) But I can say pretty confidently that the magazine folks I’m referring to will consider the slideshow or image gallery a “publication resource.” It is not just a related thing of interest; it is often created expressly and only for THAT publication, and the digital publication is not complete without it. That’s less clear for the textbook cases (per Keith’s examples)—those things he’s linking to can probably reasonably be considered related and supplementary but not really “part” of the publication.
FYI this is a huge debate in scholarly and scientific publishing right now. Those folks are dealing with what has in the past been considered “supplementary content”—spreadsheets, powerpoints, datasets, all kinds of stuff—that is increasingly (some of it, not all of it) becoming considered “part of the publication” (in those cases mostly journal articles).
--Bill Kasdorf
I just want to point out that my original issue had nothing to do with remote fonts. I agree with Garth that that was thoroughly discussed in the WG and we should leave it alone for now.
As for my original issue—that publishers such as magazine and textbook publishers need to be able to provide external links to things other than streaming audio and video—based on the following from the spec at http://idpf.org/epub/30/spec/epub30-publications.html#sec-resource-locations:
Publication Resource Locations
All Publication Resource <http://idpf.org/epub/30/spec/epub30-publications.html#gloss-publication-resource-cmt-or-foreign> s must be located in the EPUB Container <http://idpf.org/epub/30/spec/epub30-publications.html#gloss-container> , with the following exceptions:
· › <http://idpf.org/epub/30/spec/epub30-publications.html#sec-resource-locations-audio> Audio resources <http://idpf.org/epub/30/spec/epub30-publications.html#cmt-grp-audio> may be located in the Container or remotely.
· › <http://idpf.org/epub/30/spec/epub30-publications.html#sec-resource-locations-video> Video resources may be located in the Container or remotely.
As chair of the nextPub/PRISM-to-EPUB mapping committee and the liaison between our EPUB 3 WG and that IDEAlliance WG, I can say that this could drive magazine publishers away from EPUB as a standard deliverable. . . . (Probably not textbook publishers; what we have can probably work for them.) Photo galleries and slideshows are probably every bit as common and important to magazines as video and audio (often moreso). I will of course try to make the case with them that they _can_ put those resources into the EPUB, but that doesn’t address the fact that those kinds of things are often dynamic, changing over time. So maybe what this comes down to is that EPUB will work for them only if they can make those things static and of a size they consider practical to incorporate in an EPUB. The danger is that they will just say “well EPUB turns out not to work so well for us after all.”
I will defer to Dianne Kennedy, who is really central to those efforts, to provide examples of what the magazine folks really need. (She may have already corresponded with Markus on that; he’s on vacation right now.) It could be that a single specific addition to the list in the spec that I cited in my previous e-mail would do the trick for those folks. It’s really those non-video image collections that are mainly at issue here.
--Bill Kasdorf
Excellent suggestion. A good case can be made for this, at least as an interim solution until EPUB addresses the issue through a modular supplemental spec. It won’t be entirely appealing to magazine publishers—they want their users to just “see” these kinds of things with a minimum of indirection—but this is certainly a reasonable alternative to the even simpler “just put it in the container” answer.
--Bill
I agree that we already talked about remote fonts before and decided to postpone it. My intention was _not_ to bring it up again now - if you re-read my first post on this subject (http://groups.google.com/group/epub-working-group/msg/ae4e445e4d59bccc) you will see that I mentioned remote fonts only as an example of yet another use case for external links.
Disregard the very first sentence of my original email – the rest of the message was purely about external resources in general, and the need to have a well-formed request to make various types of content, including protected resources, available if needed. I believe this is something we need to resolve if you we want to see EPUB used by magazine publishers with external content that e.g. is only available on subscription basis.
I agree that more discussions on remote fonts are needed assuming that we will talk (and not shout) at each other.
As to why things different now comparing to what they used to be …
“I don't see how the standardization of OTF or WOFF has any bearing on the issue. Those are just formats – they change nothing.”
- OTF and WOFF standardization created the basis for interoperable solutions, clarified the IP positions and allowed all devices and applications to implement support for downloadable fonts;
“What is on the other side of the URL is NOT versioned, controlled, etc. If the person hosting the font data replaces v1 with v2, then the result on the documents that refer to it is UNKNOWN”
- CSS @font-face-src points to a particular file at a particular location. If I am a content owner who authored and hosted the content – I have *full* control of my resources, what was in the font file I hosted, and what goes into it if I decide to update it.
“You can't have a document rely on a service for rendering – too many possibilities for failure…”
- If this is true than Google Docs and other cloud-based solutions are just a myth(?)
Don’t get me wrong – I am not arguing against having fonts embedded in EPUB, but I do believe that using remote fonts should be _allowed_.
Thank you,
Vlad
Sent via BlackBerry from T-Mobile
As chair of the nextPub/PRISM-to-EPUB mapping committee and the liaison between our EPUB 3 WG and that IDEAlliance WG, I can say that this could drive magazine publishers away from EPUB as a standard deliverable. . . . (Probably not textbook publishers; what we have can probably work for them.) Photo galleries and slideshows are probably every bit as common and important to magazines as video and audio (often moreso). I will of course try to make the case with them that they_can_ put those resources into the EPUB, but that doesn’t address the fact that those kinds of things are often dynamic, changing over time. So maybe what this comes down to is that EPUB will work for them only if they can make those things static and of a size they consider practical to incorporate in an EPUB. The danger is that they will just say “well EPUB turns out not to work so well for us after all.”
Peter,Why do you see allowing image resources to be external to be a big burden on implementers and end users? Why is it any bigger burden than it is on the Internet now?BK
Sent via BlackBerry from T-Mobile
Thanks, Dianne, this is just what I was hoping for from you.
Thanks for the “we’re not going to abandon EPUB” message! Obviously, we’re going to forge ahead with our mapping, use it to the extent possible, and supplement it as necessary. That’s always been the plan.
I think what I should have been clearer about is that _a lot of magazine publishers_ may be less likely to use EPUB (certainly not EPUB 3 that exactly conforms to the 3.0 spec) if they can’t reference image collections remotely. And we all know how it undermines use of a standard if it is perceived as working only some of the time, for only some content.
For that reason, I still want to pursue the discussion of a possible solution in the context of EPUB 3, and I am getting the sense that there are others who would like to see this happen as well.
One important perspective was Peter Sorotokin’s, who clarified the problems he envisioned this might create for reading systems, layout, etc. That leads me to a really fundamental question for you:
--If we could require, as part of the spec, that any such image gallery or slideshow would be implemented to occupy a certain well defined area (just as a video would play in a certain area)—whether that definition were in absolute or relative terms—would that work for most magazine publishers? My hope, obviously, is that we can define this in such a way that it is more workable from the reading system point of view, per Peter’s issues. And this might even be viewed as a plus by magazine publishers, since controlling the layout is actually an important concern of theirs. They wouldn’t _want_ to do this in a way that would create unpredictable results. So maybe we can figure out how to make sure it won’t.
Thanks again for your thoughtful response!
--Bill Kasdorf
From: Dianne Kennedy [mailto:dken...@idealliance.org]
Sent: Friday, July 29, 2011 10:38 AM
To: Bill Kasdorf; epub-work...@googlegroups.com
Cc: 'peter...@timeinc.com'; David Steinhardt
Subject: RE: Links to external resources in EPUB 3
Sorry for the lag in my response time. I’m in the midst of releasing 3 specs in other IDEAlliance domains and just could not get back to you sooner.
Ever since the release of the first magazines on the iPad, magazine publishers have made extensive use of externally referenced content. Two issues seem to be driving this
1) Including all images, audio and video in the package means an unacceptable increase in download time and in the footprint of the publication on a device. This means not just audio and video but images as well. As Bill indicated, photo galleries and slide shows are as common, perhaps more common, than audio and video and so are often linked to externally in today’s digital magazine implementations.
2) Currency of the content is also an issue for us. Publishers want to be able to dynamically change/update content externally to the publication. And I believe this requirement will only increase over time.
I appreciate Peter’s point that “people do have an expectation of repeatable experience with the document [book],” but this really does not apply in the magazine world.
If you cannot include the ability to add images to the objects that can be linked to externally without holding up the spec, then that is what you have to do. But this will fall short of what magazine publishers really want and need. We need images, in addition to audio and video as external links.
I don’t agree with Bill that identifying areas where EPUB3 will not meet the needs of magazine publishers (such as not allowing for external images) will “drive magazine publishers away from EPUB.” . Our goal is, and will remain, recommending EPUB3 if possible. Where we find differences based on the differences between magazine publishing and book publishing, nextPub will develop a set of restrictions and extensions in much the same way IDPF has done with their recommendation for the use of HTML5 in EPBUB3.
Currently the IDEAlliance nextPub Working Groups are working to both create a mapping from nextPub to EPUB3 and to model the media functionality our business requires. When this work is complete in September we will be able to identify how EPUB-compliant magazines will work/function. My guess is that for some magazines, EPUB will work well. But, likely we will discover a number of areas where EPUB3 does not exactly meet the business requirements of other magazine publishers. In these cases, we will define a number of EPUB3 extensions and restrictions that will enable us to deliver magazine content to the broader, tablet publishing channel where browser-based display technologies are employed. Our goal is, and will remain, to stay as close to EPUB3 as possible. It is our hope that our work to define magazine specific extensions can be used by IDPF as input for future versions of EPUB so that the two specs stay as synchronized as is possible.
We will be having a working meeting in NYC at Time, Inc. on September 14 to review the work of each of our working groups and formulate Version 1.0 of our nextPub specifications. You are certainly invited to participate and contribute. Please contact me directly if you wish to attend.
Dianne Kennedy
VP of Emerging Technologies
IDEAlliance
Powering the TOTAL Media Chain
office: 630-941-8197
cell: 630-908-0770
fax: 630-941-8196
dken...@idealliance.org
www.idealliance.org
Bill and Leonard—
Thanks to both of you for these good clarifications.
I would point out that although this feature may not have been used in PDFs, that doesn’t mean it wouldn’t be _needed_ and used in EPUBs. And the fact that it may be deprecated in PDF 2.0 just underscores the difference between PDF and EPUB. PDF is ultimately all about page fidelity above all else; that is its raison d’être. That is not ultimately what EPUB is about; and in the environment we find ourselves now, what was unused in PDF may become essential in EPUB.
I can’t resist mentioning how much this discussion reminds me of a white paper I wrote many years ago, called “PDF and XML: Why We Need Both.” It led to a string of very enjoyable Seybold presentations over the years with none other than the aforementioned Jim King, one of my heroes! (I still use what I think of as one of his mantras all the time, and always credit him: it’s meaningless to talk in the abstract about XML, it is always necessary to say “XML _for what_?”) Please say hi to Jim for me.
--Bill Kasdorf
From: epub-work...@googlegroups.com [mailto:epub-work...@googlegroups.com] On Behalf Of Leonard Rosenthol
Sent: Thursday, July 28, 2011 8:41 PM
To: epub-work...@googlegroups.com
Cc: Dianne Kennedy
You just did—thanks! And again, just what I hoped you’d say.
--Bill
From: Dianne Kennedy [mailto:dken...@idealliance.org]
Sent: Friday, July 29, 2011 11:05 AM
To: Bill Kasdorf; epub-work...@googlegroups.com
Cc: 'peter...@timeinc.com'; David Steinhardt
Subject: RE: Links to external resources in EPUB 3
I think it is ok to say something to the effect of
External image references used in media objects such as gallery or slideshow are expected to display within the area defined by accompanying style sheet.
If this would make a difference to the ePUB3 folks, please communicate this to them.
Perhaps we should be investigating a specific "profile" of EPUB for magazines – much as there also seems to be a desire for a "lighter-weight" EPUB that would suffice for manga/comics, etc.
You just did—thanks! And again, just what I hoped you’d say.
--Bill
From: Dianne Kennedy [mailto:dken...@idealliance.org]
Sent: Friday, July 29, 2011 11:05 AM
To: Bill Kasdorf; epub-work...@googlegroups.com
Cc: 'peter...@timeinc.com'; David Steinhardt
Subject: RE: Links to external resources in EPUB 3
I think it is ok to say something to the effect of
External image references used in media objects such as gallery or slideshow are expected to display within the area defined by accompanying style sheet.
If this would make a difference to the ePUB3 folks, please communicate this to them.
Dianne Kennedy
VP of Emerging Technologies
IDEAlliance
Powering the TOTAL Media Chain
office: 630-941-8197
cell: 630-908-0770
fax: 630-941-8196
dken...@idealliance.org <mailto:dken...@idealliance.org>
www.idealliance.org <http://www.idealliance.org>
dken...@idealliance.org <mailto:dken...@idealliance.org>
www.idealliance.org <http://www.idealliance.org>
This looks promising, Peter—thanks. I will defer to Dianne Kennedy to answer more definitively.
--Bill Kasdorf
> End users: people do have an expectation of repeatable experience with the
> document. If images that are part of the book layout start to change on you,
> it�s not really reliable anymore � and how you, as a user, would know which
> documents are �stable� and which are not? Video and audio were not there in
> the paper world, so people intuitively put them in the different category of
> things. Slideshows were not on paper either, of course, but allowing any image
> to be remote is too broad for expressing those.
Discussion of end user expectations seems a bit rare in these discussions, and
I am very glad to see Mr. Sorotokin bring it up here. So, for what it's worth,
here are some of my expectations for external resources.
I want complete access to all significant portions of a publication when I'm
not "in the cloud" (which I try to stay out of as much as possible). When I
buy an e-book in Newark, I want to be able to enjoy it all the way to London.
When I'm on the beach in Costa Rica I want to be able to select from the
library of beach reads I loaded up at home.
Links to resources in "the cloud", which augment the publication I have
purchased are welcomed and desired. A "for more information see:" link in a
footnote is a great idea. When I connect to "the cloud" I frequently like to
explore this additional information.
Fancy ads with graphics and video are mostly just annoying, but I can tolerate
them so long as downloading them doesn't count against my cell phone
provider's data limits. The only thing worse than annoying ads is having to
pay for the "privilege" of being subjected to them.
So, the principles I derive from these expectations are that external links
should /always/ be permitted, and /never/ be required. Any resource that is
required to make full use of the publication /must/ be included in the ePub
package. Downloading supplemental resources, whatever their nature (whether
ads or videos of scientific experiments) /must/ be under the control of the
end user; the failure to access these external resources should /never/
substantially affect the reading experience.
In that extremely rare situation where internet connectivity and access to
external resources are absolutely required an ePublication /must/ be clearly
identified as requiring active internet connectivity for use.
These constraints are obviously not enforceable by an automated process; the
best the IDPF could do would be to clearly document them as best practices,
and rely on industry to comply. However, if the industry cannot be relied upon
to allow me to control the content on my own hardware I would advocate for
removing support for external links altogether.