A perennial problem in today's ereading systems is hyphenation of words between two lines.
Some ereaders do it fairly well; Nook, despite being a generally terrible ereader, has had automatic hyphenation for quite some time. Ereading apps like FBReader also do hyphenation fairly well.
Kindle, the 800 pound gorilla of ereaders, didn't do it at all until very recently with their "enhanced typography" update. But unfortunately for us ebook producers, Kindle's enhanced typography isn't automatic; that is, Kindle doesn't use a built-in hyphenation dictionary to hyphenate ebooks on-the-fly. While this would be the obvious approach to things, resulting in all of Amazon's back catalog being instantly hyphen-compatible and making life easy for ebook developers, as we all know Amazon's more interested in milking the ebook cow than it is putting any kind of effort into the reading or development experiences.
The good news is that it's possible to take advantage of Kindle's new hyphenation support as an independent ebook developer. While Amazon, intent on keeping developers miserable, declined to explain how their new hyphenation engine works, it actually appears to just be support for the Unicode soft hyphen character (U+00AD). If Kindle encounters a soft hyphen near a line break, it considers it a hyphenation opportunity.
That means all ebook developers have to do is manually insert soft hyphens into all of their ebooks, by hand, at every possible word break point. How convenient. Thanks Amazon!
Lucky for us there's a Python library for hyphenation, Hyphenator, that we can take advantage of. I've put together a new script for the Standard Ebooks production tools,
hyphenate, that uses the Hyphenator library to place a soft hyphen at every syllable break for every word in an XHTML file. Do this for every XHTML file in an epub, and convert it to azw3 with Calibre's ebook-convert, and all of a sudden you have a hyphenated Kindle ebook. And it looks pretty good, too!
We've integrated hyphenate in to our build process and left soft hyphens in our compatible epub2 files, so now every Standard Ebook, including Kindle files, should support hyphenation whenever the reading system does. As part of this update, we've moved on from generating .mobi files to generating Amazon's newest .azw3 format. Our entire catalog has been updated with these features.
It's just another little detail that we think true book lovers deserve in their ebooks.