Displaying Japanese characters in search results

22 views
Skip to first unread message

jdc

unread,
May 31, 2010, 8:46:41 PM5/31/10
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
I maintain a site that uses a Content Management System (Documentum)
that uses a GSA for search. Some of the sites that are contained in
the common collections are in english, but other are in foreign
languages (e.g., Japanese and Chinese). In looking at the Content
Management System, the foreign language is encoded as hexideciminal
(at least I think it is ....) so the source code looks something like
this:

らのお知らせ.

This displays fine in the browser, but when a search is run, the
results appear as gibberish.The characterset we are using in our
search template is utf-8.

We do not have the option of using separate collections for English
and non-English sites, so is there a setting in the GSA that will
permit the foreign characters to appear properly?

brianb

unread,
Jun 1, 2010, 2:01:36 AM6/1/10
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
As long as your ie and oe parameters are set properly to UTF8 in your
queries, it should work. What do you see in the cache page when you
one of these results? Are the characters displayed properly? When
exactly do you see the problem?

You may also want to contact support if cannot figure it out. The GSA
is sold quite a bit in Asia including Japan so this should not be an
issue.

Brian
Reply all
Reply to author
Forward
0 new messages