Including double-quotes in GSA feed meta data

3 views
Skip to first unread message

Mosh

unread,
Nov 17, 2009, 5:51:24 PM11/17/09
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
I have run into a problem when using the GSA that I was hoping someone
here might be able to help with. The GSA is setup to serve a search
engine for the contents of an online publication library. I've
populated the index by incrementally submitting feeds to the GSA that
include an individual publication's direct URL, meta-data including
the publication's title), and Base64 encoded contents. It's been
working as expected for a while.

Today I was informed of a problem in which the search result screen
shows either no or a partial title if the title includes a double-
quote in it. The displayed title shows only up to the first double-
quote (so if the title begins with a double-quote, no title is
shown). Checking in the result's HTML code, the rest of the title
after the double-quote is simply not there.

I've checked to make sure that the feed used for such publications is
encoding double-quotes and, so far as I can tell, it is working
properly. Here is a sample from one such feed:

<meta name="Title" content="The Social Worker as a &quot;Social
Warner&quot;"/>

I haven't been able to find any mention of what might cause such a
situation in the GSA documentation. Any help would be very much
appreciated.

Thanks,
Mosh

Steve King

unread,
Nov 17, 2009, 7:19:12 PM11/17/09
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
Hi mosh,

One solution could be to upgrade your GSA to 6.2. As I understand it,
you can base64 encode your Meta Strings!

Another idea might be to check the XSLT code on the MT output tags. It
could be breaking somewhere along there with the &quot (not sure thou
- since I've not seen this problem before).

Have you done any customisation to the XSLT? If so, try it on the
default frontend with Meta Tags turned on.

Regards,
Steve.

Pablo Solera

unread,
Nov 18, 2009, 3:12:36 AM11/18/09
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
Hi Mosh,

Try using &amp;#022, or &#022; I can´t remember which.
Have a look at this stylesheet:
http://code.google.com/p/advanced-internal-onebox/downloads/list
It has a custom template to encode special characters on meta-tags.

Hope this helps,

Pablo Solera

Mosh

unread,
Nov 19, 2009, 10:54:32 AM11/19/09
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
Thanks for the reply Steve.

I haven't yet upgraded the GSA to 6.2 but I would hope that versions
prior to 6.2 would be able to handle double quotes. I'll give your
suggestion a try and see how it goes.

Thanks.

Mosh

unread,
Nov 19, 2009, 10:56:20 AM11/19/09
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
Thanks Pablo.

I'll take a look at the style sheet and see what I can find.

Mosh

Mosh

unread,
Nov 19, 2009, 11:59:59 AM11/19/09
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
All:

Just as a follow-up, I wanted to let you know what the problem turned
out to be. After turning Meta Tags on as per Steve's suggestion, the
value of the Title meta tag was still coming up blank. So I figured
that it wasn't a problem with how the GSA was displaying the value but
how it was processing it from the feed.

Apparently, the GSA doesn't seem to like HTML entities other than
&amp;. In order to properly represent the double-quote in the feed
XML, I had to replace "&quot;" with "&amp;quot;" . A quick update of
the index and the titles are now being displayed properly.

Thank you again to everyone for their help.

Thanks,
Mosh
Reply all
Reply to author
Forward
0 new messages