specify "errors" argument for output encoding + best cleanup tool?

1 view
Skip to first unread message

Ksenia

unread,
Feb 7, 2009, 6:34:48 PM2/7/09
to beautifulsoup
Greetings,

1) Would it be a good idea to add possibility to specify "errors"
argument for encoding?
e.g.
soup.renderContents(encoding='latin-1', errors='xmlcharrefreplace')

It can be handy for documents with mixed up encoding...

2) Since upgrade to the latest BS version, I've experiences quite a
few problems with markup because of the new parser. What is the most
advisable way for markup clean up before giving it to BS? (btw, I AM
happy with the new parser, it solved several problems with rendering
that old parser did not handle properly)

Thanks
Ksenia
Reply all
Reply to author
Forward
0 new messages