Handling of special characters


Pasquale Di Donato

Sep 28, 2017, 6:49:37 AM
to ta...@googlegroups.com
Dear all,

this is really a nice tool and easy to use.
Just one issue on my side: how do I get special characters encoded correctly?

E.g. in my source.csv I have German and French strings such as Altbüron or Câbles. These are encoded as:

C槆les
Altb├╝ron

Any hint?

Thanks

Richard Cyganiak

Sep 28, 2017, 7:57:11 AM
to Pasquale Di Donato, ta...@googlegroups.com
Hi Pasquale,

Tarql attempts to guess the character encoding. This doesn’t always work.

You can specify the correct character encoding yourself with the --encoding argument. The most common values are utf-8, iso-8859-1, and the various windows-12XX encodings (see https://docs.oracle.com/javase/8/docs/technotes/guides/intl/encoding.doc.html for a list).

If none of the common encodings seem to work, then also consider the possibility that the characters are not correctly encoded in the original file. Does it open correctly in other tools such as Excel?

Richard
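
To make the advice above concrete, here is a minimal Python sketch of how this kind of garbling arises and how one might check which of the common encodings fits a file. It is not part of Tarql. The helper names `garble` and `matching_encodings` and the candidate list are assumptions made for illustration. The "Altb├╝ron" output in the question is consistent with UTF-8 bytes being read as code page 437, which the first call reproduces.

```python
# Illustrative sketch only; helper names and the candidate list are hypothetical.
CANDIDATES = ["utf-8", "iso-8859-1", "windows-1252"]


def garble(text, actual, assumed):
    """Encode text as `actual`, then decode the bytes as `assumed`."""
    return text.encode(actual).decode(assumed, errors="replace")


def matching_encodings(raw, expected, candidates=CANDIDATES):
    """Return the candidates under which `raw` decodes to text containing `expected`."""
    matches = []
    for name in candidates:
        try:
            decoded = raw.decode(name)
        except UnicodeDecodeError:
            # These bytes are not valid in this encoding at all.
            continue
        if expected in decoded:
            matches.append(name)
    return matches


# UTF-8 bytes misread as code page 437 give the garbling from the question.
print(garble("Altbüron", "utf-8", "cp437"))  # prints Altb├╝ron

# For a real file, read the bytes with open("source.csv", "rb").read()
# and pass a string you know should appear in it.
raw = "Altbüron;Câbles\n".encode("utf-8")
print(matching_encodings(raw, "Altbüron"))
```

An encoding that this check reports as matching is a reasonable value to try with the --encoding argument. Several candidates can match at once, since iso-8859-1 and windows-1252 agree on most accented Latin letters.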





Pasquale Di Donato

Sep 28, 2017, 8:01:41 AM
to Richard Cyganiak, ta...@googlegroups.com
Hi Richard,

many thanks for the quick answer. I'll give it a try. The file opens correctly in Excel and in a text editor.
Now I'm fighting with Virtuoso, which doesn't like something in the output. I'm trying to understand what it considers wrong with the triples.

Thanks again
Pasquale 
