> Likewise, the concept Function_Denotational in
> umbel_abstract_concepts.n3 has a whole load of ill-formed characters
> in the skos:definition.
>
I made sure that the procedures that create these files convert non-utf8
characters to utf8 (the default charset used for umbel is utf8).
Sorry about these encoding issues, but since much of the information comes
from different places, charsets quickly become mixed. The goal now, and in
the future, is to make sure everything is converted to utf8 first. It is
possible that such errors could be found elsewhere, so please report any
future issues so that I can quickly fix them (both the files and the
procedures that create them).
Thanks!
Take care,
Fred
[ontologies] $ iconv umbel_abstract_concepts.n3 -o /dev/null
iconv: illegal input sequence at position 630884
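To find the line number of that location, I used the following (wc -l
counts the newlines before that position, so the offending line is the
reported count plus one):
[ontologies] $ head -c 630884 umbel_abstract_concepts.n3 | wc -l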
Not sure if there's a version of iconv available for other platforms
(I would hope so, but a quick google for a cygwin version didn't
reveal an obvious answer).
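For platforms without iconv, here is a minimal Python sketch of the same
check. It assumes the file is small enough to read into memory, and the
function names (first_invalid_utf8, check_file) are made up for
illustration; they are not part of any umbel tooling.

```python
def first_invalid_utf8(data):
    """Return (byte_offset, line_number) of the first invalid UTF-8
    sequence in a bytes object, or None if the data decodes cleanly."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as err:
        # err.start is the 0-based byte offset of the first bad byte.
        # Lines are 1-based: count the newlines before it and add one.
        line_number = data.count(b"\n", 0, err.start) + 1
        return err.start, line_number
    return None


def check_file(path):
    """Read a file as raw bytes and report the first invalid sequence."""
    with open(path, "rb") as handle:
        return first_invalid_utf8(handle.read())
```

Calling check_file("umbel_abstract_concepts.n3") should give either None
or a (byte offset, line number) pair comparable to what iconv and
head/wc report above.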
Regards,
Ian
Thanks for the quick response, Fred! I think utf8 is also the standard
charset for n3/turtle files.
> Sorry about these encoding issues, but since much of the information comes
> from different places, charsets quickly become mixed.
No problem, I understand how it happens. And, as you've already done, now
that you know about it, it's not so hard to fix.
> Thanks!
You're welcome.
Ian
I'm sure Fred will add this to his bag of tricks. If we find any
others, we will post a notice here, too.
Thanks, Mike