What do you think is needed for csvfix to support UTF-8?
UTF-8 is carefully designed so that naïve programs that are modestly careful do not run into problems with it.
The only case I can think of where UTF-8 support would matter is if the field delimiter (normally comma), record delimiter (normally newline), or field quote (normally double quote) needed to be a multi-byte UTF-8 character rather than a single-byte character, as all the defaults are.
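To illustrate the point, here is a minimal Python sketch (not csvfix code; the function names and sample record are made up for the example, and quoting is deliberately ignored). It relies on the fact that every byte of a multi-byte UTF-8 sequence has its high bit set, so a single-byte ASCII delimiter such as a comma can never appear inside an encoded non-ASCII character, and splitting on raw bytes is safe.

```python
def split_fields_bytewise(record, delimiter=b","):
    """Split an encoded record on a single-byte delimiter without decoding it.

    Hypothetical helper for illustration only; it does not handle quoted fields.
    """
    return record.split(delimiter)


def multibyte_bytes_are_non_ascii(text):
    """Check that every byte of each multi-byte UTF-8 character is >= 0x80."""
    for ch in text:
        encoded = ch.encode("utf-8")
        if len(encoded) > 1 and not all(b >= 0x80 for b in encoded):
            return False
    return True


# Sample record (an assumption for the demo) mixing ASCII and multi-byte text.
text = "naïve,日本語,€5"
record = text.encode("utf-8")

# Split on the raw bytes, then decode each field separately.
fields = [f.decode("utf-8") for f in split_fields_bytewise(record)]
print(fields)
print(multibyte_bytes_are_non_ascii(text))
```

The fields decode cleanly even though the splitter never looked at character boundaries. A multi-byte delimiter would be a different matter, since a byte-at-a-time comparison against one delimiter byte could no longer work.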
And, for that to be a practical problem, I think you would need to provide convincing evidence that a major software system is routinely configured, in some locale, to use non-default values for the format. That might be the case somewhere in Asia; if so, you should document it carefully and explain what the default values are in that locale.
Absent such evidence, I don't see a compelling reason for csvfix to do anything different from what it does now.
--
Jonathan Leffler <jonathan...@gmail.com> #include <disclaimer.h>
Guardian of DBD::Informix - v2008.0513 - http://dbi.perl.org
"Blessed are we who can laugh at ourselves, for we shall never cease to be amused."