Hi Scott, thanks for looking in to that.
The word joiner character is the semantically correct character we want
to use. See
<
https://www.unicode.org/L2/L2019/19114-line-break-design.pdf> and in
particular this quote:
> but because it is more commonly used as byte order mark, the use of
U+2060 WORD JOINER to indicate word joining is strongly preferred for
any new text.
If you add a ZWJ *after* the em dash, you risk ereaders treating the
first word, plus the em dash, plus the second word all as a single
non-breaking word, which would look really bad too. iBooks appears not
to do that, but all bets are off for the many other ereaders out there;
and it might even actually do that given a different example, too,
depending on what else is before and after the line.
So, we don't want to be enshrining incorrect semantics in our ebook
sources just because some ereaders do things differently. Unfortunately
the state of ereaders is that each one has its own quirks and there's no
single file that can appease them all perfectly. We occasionally have to
make due with weird errors like this one.
On 1/25/22 1:33 AM, Scott Ridley wrote:
> Hi all,
>
> I'm fairly new to Standard Ebooks- just a reader at the moment, although
> I've been collecting typos to attempt to put in a pull request for the
> first time.
>
> My question/comment is about when there's an em-dash that occurs near
> the end of a line. The word-joiner character keeps the em-dash attached
> to the proceeding word, however my reader (Apple Books) will then
> sometimes auto hyphenate the word before the em-dash to make it fit,
> which can be awkward, especially when it'd be better to break after the
> em-dash.
>
> e.g. in "The Mysterious Affaire at Styles" by Agatha Christie, a line is
> rendered:
> *My poor Emily. They're a lot of shark-*
> *s—all of them.*
>
> I'm not sure if this occurs in other reader software, but I've
> experimented with adding a zero-width space after the em-dash, and that
> makes it break much more naturally:
> *My poor Emily. They're a lot of sharks—*
> *all of them.*
>
> Can I suggest the style manual (and toolset) be altered to add a
> zero-width space where an em-dash sits between words (i.e. not at the
> end of a sentence)? Or does anyone else have a better solution to this
> issue?
>
> Thanks and regards!
> Scott.
>
> --
> You received this message because you are subscribed to the Google
> Groups "Standard Ebooks" group.
> To unsubscribe from this group and stop receiving emails from it, send
> an email to
standardebook...@googlegroups.com
> <mailto:
standardebook...@googlegroups.com>.
> To view this discussion on the web visit
>
https://groups.google.com/d/msgid/standardebooks/5c954862-e495-4c25-9c5e-61569a49b33bn%40googlegroups.com
> <
https://groups.google.com/d/msgid/standardebooks/5c954862-e495-4c25-9c5e-61569a49b33bn%40googlegroups.com?utm_medium=email&utm_source=footer>.