Right now the NumPy Unicode type (dtype "U") isn't supported, as there
isn't a wide-character type in HDF5. I am open to adding support for
this if a reasonable way can be found to represent UTF-32 strings
using the basic HDF5 types. A separate patch was contributed for
variable-length Unicode strings (e.g. the Python 2.X "unicode" type)
but it's not yet applied.

At the moment you will have to manually encode your data. Be careful
if you want to use UTF-8: the encoded byte string can contain more
elements (bytes) than your input Unicode string has characters,
whenever the input includes non-ASCII characters. Size any
fixed-length field by the encoded length, not the character count.
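
As a rough illustration of the manual encoding step, here is a minimal
sketch in plain Python. The helper names encode_utf8 and decode_utf8 are
made up for this example and are not part of h5py or NumPy. It only shows
how the encoded byte length can differ from the character count.

```python
# -*- coding: utf-8 -*-
# Hypothetical helpers for illustration only; not part of any library.

def encode_utf8(strings):
    """Encode each text string to UTF-8 bytes.

    Returns the list of byte strings and the length in bytes of the
    longest one, which is the width a fixed-length field would need.
    """
    encoded = [s.encode("utf-8") for s in strings]
    width = max(len(b) for b in encoded) if encoded else 0
    return encoded, width


def decode_utf8(byte_strings):
    """Reverse the encoding after reading the bytes back."""
    return [b.decode("utf-8") for b in byte_strings]


# Sample data: "cafe" with an acute accent, "naive" with a diaeresis,
# and three CJK characters.
words = [u"caf\u00e9", u"na\u00efve", u"\u65e5\u672c\u8a9e"]

encoded, width = encode_utf8(words)

char_counts = [len(w) for w in words]    # characters per string
byte_counts = [len(b) for b in encoded]  # bytes per encoded string
```

Here the character counts are 4, 5 and 3, while the encoded lengths are
5, 6 and 9 bytes, so a fixed-length byte-string type (for example NumPy
dtype "S9") would need 9 bytes per element for this data, assuming you
store the encoded strings that way.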
Andrew