Tobie,
> A correct character encoding should be all you really need to handle
> such entities.
That's rather flip, don't you think? How does character encoding
choice solve " or ?
These entities are valid HTML, regardless of character encoding. There
are sometimes very good reasons for using them. It's fine to say
String#unescapeHTML won't handle these because it's too big a problem
and document that, but let's not just dismiss it like the person
asking the question is being dumb, which is how the above comes
across.
-- T.J.