Display all items in collection with empty search?

0 views
Skip to first unread message

Chris

unread,
Nov 26, 2009, 12:52:51 AM11/26/09
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
Is it possible to show all the items in a collection with some sort of
empty search on each collection?


Thanks

mrliamhennessy

unread,
Nov 26, 2009, 5:44:11 AM11/26/09
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
I have collections that are each defined using a single URL pattern,
so I can use inurl to match an entire collection.
If my collection is defined as matching http://www.example.com/products/
then I can match all items in a collection by searching for:
inurl:http://www.example.com/products/

If your collection is more complex, try searching for something that
will match all records, perhaps one of these:
inmeta:description
inurl:com
inurl:example.com

Chris

unread,
Nov 26, 2009, 7:28:30 AM11/26/09
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
My collections will have multiple sites with content so (example.ca,
example.com, example.net, example.org) is there a way to search for
multiple inurl terms. When I try combining them such as
inurl:.ca|.com|.net I am left with fewer results than just inurl:.com.
Any other suggestions would be much appreciated.

Thank you for your help

On Nov 26, 5:44 am, mrliamhennessy <liam.henne...@gmail.com> wrote:
> I have collections that are each defined using a single URL pattern,
> so I can use inurl to match an entire collection.
> If my collection is defined as matchinghttp://www.example.com/products/

mrliamhennessy

unread,
Nov 26, 2009, 8:51:48 AM11/26/09
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
If you are feeding all documents to the GSA, a solution would be to
add a metadata field to every record:
<meta name="allindex" content"ABCXYZ" />
Then this query would match all records:
inmeta:allindex

If you are not feeding but can maintain an up to date list of URLs
indexed, you can add metadata to every record after they are indexed:
http://code.google.com/apis/searchappliance/documentation/60/metadata.html

I don't know if this will help if you are crawling a typical fast-
changing website, but these two scenarios might give someone some
ideas.

brianb

unread,
Nov 26, 2009, 8:50:55 PM11/26/09
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
Why would you want to do this? Just displaying all results would
pretty much defeat the purpose of searching anyway, no?

But yes, the GSA requires at least something in the q parameter
(search box) so what mrliam says, if you have something common between
all those URLs, then you could just use a dummy search term or metatag
filter or whatever. But just displaying all the URLs that you defined
in a collection randomly would probably not be useful to the user.

Brian

Chris McCreery

unread,
Nov 26, 2009, 9:02:45 PM11/26/09
to google-search-...@googlegroups.com
I agree with you as well, that's the the whole point of the search box :). However I have been requested to make this possible. I thought I could accomplish this using inurl: ca | com | net | org however the inurl and allinurl don't take OR commands and unfortunately there is no common data between everything item in each collection.  Any other suggestions??

Thanks again.

--

You received this message because you are subscribed to the Google Groups "Google Search Appliance/Google Mini" group.
To post to this group, send email to Google-Search-...@googlegroups.com.
To unsubscribe from this group, send email to Google-Search-Applia...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/Google-Search-Appliance-Help?hl=en.





--
Chris McCreery

brianb

unread,
Nov 26, 2009, 11:24:28 PM11/26/09
to Google Search Appliance/Google Mini - Google Search Appliance/Google Mini
Hmmm... Yeah, it will kind of be like putting the square block into
the round hole but you could try using the site: special query term.
For example:

site:www.google.com/folder1/ OR site:www.google.com/folder2/

and just replace the above with your collection definitions. Not very
clean but it should achieve what you are trying to do.

Brian
> > Google-Search-Applia...@googlegroups.com<Google-Search-Appliance-Help%2Bunsu...@googlegroups.com>
> > .

Chris McCreery

unread,
Nov 26, 2009, 11:40:58 PM11/26/09
to google-search-...@googlegroups.com
Thanks Brian definitely a possibility but not moving forward if we need to keep adding different urls to our collections/crawls I'll need to manually update each. Great suggestion though. Thank you very much.

To unsubscribe from this group, send email to Google-Search-Applia...@googlegroups.com.



--
Chris McCreery

Joe D'Andrea

unread,
Nov 27, 2009, 9:41:51 AM11/27/09
to google-search-...@googlegroups.com
On Thu, Nov 26, 2009 at 9:02 PM, Chris McCreery
<chris.m...@gmail.com> wrote:

> I agree with you as well, that's the the whole point of the search box :).
> However I have been requested to make this possible.

I hear you. I've had this exact same discussion with _many_ an
enterprise client. Bottom line, it's important to keep in mind that
the GSA is _not_ a database retrieval system.

So … I guess my next question would be: What is the business case for
getting all of that info as a full set of _search_ results?

Perhaps they merely want assurance that the appropriate docs made it
onto the GSA.

For _that_, I would recommend exporting your collection's crawl
diagnostics instead (select list format first, then export to a file).
You can then load this into your spreadsheet app of choice and filter
on crawl status to your heart's content.

--
Joe D'Andrea | Liquid Joe LLC
Google Enterprise Partner | iPhone Application Developer
www.liquidjoe.biz | skype:joedandrea | +1 (908) 781-0323

Chris McCreery

unread,
Nov 27, 2009, 9:48:08 AM11/27/09
to google-search-...@googlegroups.com
Thanks Joe, they are looking to give more of a suggested browse function for those users new to the system to see what types of resources are available. I'm also noticing that the OR operator seems to be working strangely. When I search for site:www.example.com I get say 5000 records, if I then add the OR site:www.example2.ca I get less results. Should it return more items?

Thanks

--

You received this message because you are subscribed to the Google Groups "Google Search Appliance/Google Mini" group.
To post to this group, send email to Google-Search-...@googlegroups.com.
To unsubscribe from this group, send email to Google-Search-Applia...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/Google-Search-Appliance-Help?hl=en.





--
Chris McCreery

Joe D'Andrea

unread,
Nov 27, 2009, 10:30:04 AM11/27/09
to google-search-...@googlegroups.com
On Fri, Nov 27, 2009 at 9:48 AM, Chris McCreery
<chris.m...@gmail.com> wrote:

> Thanks Joe, they are looking to give more of a suggested browse function for
> those users new to the system to see what types of resources are available.

Aye. Browse … there's the rub. :)

That presumes some kind of categorization, which presumes some kind of
taxonomy, and so on.

Out-of-the-box, you could use Dynamic Result Clusters:
http://is.gd/54Ikk (code.google.com)

If that doesn't fly, parametric or faceted search might be worth a try
(with appropriate metadata):

http://code.google.com/p/parametric/
http://code.google.com/p/gsa-faceted-search/

Apart from all _that_, there are add-ons for the GSA that may also
help, for instance SmartLogic's Semaphore:

http://www.smartlogic.com/index.php/search-navigation-experience/gsaedition
http://code.google.com/enterprise/detail_Semaphore.html

> I'm also noticing that the OR operator seems to be working strangely. When I
> search for site:www.example.com I get say 5000 records, if I then add the OR
> site:www.example2.ca I get less results. Should it return more items?

Only one "site:" term can be submitted per search request. That's
probably the culprit.

Context-switching for a sec … back to my previous email about crawl
diagnostics, I just thought of something else, though it requires a
bit of heavy lifting.

With a bit of behind-the-scenes prep work (and perhaps a cron job or
two), you might be able to make nice use of one of these v6.0 admin
API calls:

http://code.google.com/apis/searchappliance/documentation/60/gdata/acapi_protocol.html#StatusAndReports
Reply all
Reply to author
Forward
0 new messages