Invalid Characters / Symbols in Feed

Chris

Mar 27, 2009, 5:38:47 AM
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini, pop....@gmail.com
Hi Group!

I'm using the 'Feeds' feature of our Google Search Appliance (feeding
XML to build our search index).

My feed is built from a CSV file that has a list of URLs, and
associated metadata.


There's a chance our CSV file contains weird characters or symbols.
What will the GSA do with these characters? Error? Skip them?

Also - is there a list anywhere of what characters / symbols the GSA
will not process?


Many thanks,
Please just ask if this requires any clarification.

Chris.

Thiru

Mar 27, 2009, 2:53:35 PM
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
Hi Chris,

The GSA will stop processing and discard the feed file if it
encounters control characters and the like. I highly recommend that
you run an XML parser such as xmllint on the feed XML files to make
sure that there are no unwanted control characters. You can get the
latest version of xmllint from http://xmlsoft.org/, or possibly from
other sources as well.
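
If the feed is generated by a script, the CSV values could also be
checked before the XML is written. The following is only a minimal
sketch, assuming the characters to avoid are the ones the XML 1.0
specification disallows (control characters other than tab, line feed
and carriage return, plus surrogates and U+FFFE/U+FFFF). Whether the
GSA rejects exactly this set is an assumption, and the function names
are hypothetical.

```python
import re

# Characters that XML 1.0 does not allow in a document:
# C0 controls except tab (0x09), LF (0x0A) and CR (0x0D),
# lone surrogates, and the noncharacters U+FFFE / U+FFFF.
# Assumption: this is the set that causes the feed to be discarded.
_INVALID_XML_CHARS = re.compile(
    r"[\x00-\x08\x0B\x0C\x0E-\x1F\uD800-\uDFFF\uFFFE\uFFFF]"
)


def find_invalid_chars(text):
    """Return (offset, code point) pairs for each disallowed character."""
    return [
        (match.start(), "U+{:04X}".format(ord(match.group())))
        for match in _INVALID_XML_CHARS.finditer(text)
    ]


def strip_invalid_chars(text):
    """Return text with every disallowed character removed."""
    return _INVALID_XML_CHARS.sub("", text)


if __name__ == "__main__":
    # Hypothetical metadata value taken from one CSV cell.
    value = "Annual\x00 report\x0b 2009"
    print(find_invalid_chars(value))
    print(strip_invalid_chars(value))
```

Characters such as &, < and > are a separate matter: they are valid
in XML but must be escaped, which an XML library normally does when
it writes the file. Running xmllint --noout on the finished feed file
is still a useful final check that it is well-formed.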

Cheers,
Thiru