solr chunking

135 views
Skip to first unread message

Richard Shrake

unread,
Jun 29, 2012, 2:30:07 PM6/29/12
to isla...@googlegroups.com
Can someone point me to a good resource on how to configure the way solr recognizes terms for facets? I've got facets like "and" showing up, which looks kind of silly.

Thanks!

David Wilcox

unread,
Jul 3, 2012, 8:47:50 AM7/3/12
to isla...@googlegroups.com
Hi Richard,

There's a great, freely available guide to Solr here:
http://www.lucidimagination.com/devzone/references/solr-guide. The
latest version of the guide has a section on facets here:
http://lucidworks.lucidimagination.com/display/solr/Faceting.

Does this give you the information you need? Or do you need more
Islandora-specific information?

Regards,

David
> --
> You received this message because you are subscribed to the Google Groups
> "islandora" group.
> To view this discussion on the web visit
> https://groups.google.com/d/msg/islandora/-/0iKfEPyqZmcJ.
> To post to this group, send email to isla...@googlegroups.com.
> To unsubscribe from this group, send email to
> islandora+...@googlegroups.com.
> For more options, visit this group at
> http://groups.google.com/group/islandora?hl=en.



--
David Wilcox, BA, MLIS
Islandora Training/Support Coordinator
Robertson Library
University of Prince Edward Island
dwi...@upei.ca
Skype Name: david.wilcox82
902.620.5167

Richard Shrake

unread,
Jul 3, 2012, 9:50:30 AM7/3/12
to isla...@googlegroups.com

Thanks, David. I'll take a look.

Sent from my mobile

Levy, Michael

unread,
Jul 3, 2012, 1:15:58 PM7/3/12
to isla...@googlegroups.com
If you see 'and' then it's an indication your faceting is being done on tokenized types. You can start your investigation by using 'string' type rather than 'text' type field values.

Richard Shrake

unread,
Jul 27, 2012, 1:23:21 PM7/27/12
to isla...@googlegroups.com
Thanks. In the Islandora Solr Client config settings, I changed the Facet Fields from dc.subject to dc.subject_s, but when I do that, I don't get any facet results. Is there another step to include the string values?

On Tue, Jul 3, 2012 at 1:15 PM, Levy, Michael <ml...@ushmm.org> wrote:
If you see 'and' then it's an indication your faceting is being done on tokenized types. You can start your investigation by using 'string' type rather than 'text' type field values.

--
You received this message because you are subscribed to the Google Groups "islandora" group.
To post to this group, send email to isla...@googlegroups.com.
To unsubscribe from this group, send email to islandora+...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/islandora?hl=en.



--
Richard Shrake
Associate Librarian for Technical Services
American Philosophical Society
105 South Fifth Street
Philadelphia, PA  19106-3386
Tel. (215) 440-3409
rsh...@amphilsoc.org
Main Library number: (215)440-3400
APS website:  www.amphilsoc.org

aaronC

unread,
Feb 19, 2013, 10:10:58 AM2/19/13
to isla...@googlegroups.com
Gravedigging this, I'm not clear myself on what I need to change to make sure I'm using strings and not text. I also get nothing when I add _s

I was under the impression the solution pack shipped with DC/MODS fields configured?

Richard Shrake

unread,
Feb 19, 2013, 10:39:49 AM2/19/13
to isla...@googlegroups.com
If you go to http://[yourserver]:8080/solr/admin/schema.jsp and examine some of the fields, are you showing results?

To unsubscribe from this group and stop receiving emails from it, send an email to islandora+...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
 
 

Aaron Collie

unread,
Feb 19, 2013, 11:13:40 AM2/19/13
to isla...@googlegroups.com
Yes, I see:

Field: dc.subject

Dynamically Created From Pattern: *

Field Type: TEXT_FGS

Schema: Indexed, Tokenized, Stored, Multivalued

Index: Indexed, Tokenized, Stored

Index Analyzer: org.apache.solr.analysis.TokenizerChain DETAILS

Query Analyzer: org.apache.solr.analysis.TokenizerChain DETAILS

Docs: 27

Distinct: 164


But under Field Types I see this bit:

Field Type: string

Fields: PID MANU_EXACT *_S

Tokenized: false

Class Name: org.apache.solr.schema.StrField

Index Analyzer: org.apache.solr.schema.FieldType$DefaultAnalyzer

Query Analyzer: org.apache.solr.schema.FieldType$DefaultAnalyzer

Aaron Collie

unread,
Feb 19, 2013, 11:29:40 AM2/19/13
to isla...@googlegroups.com
I'm not seeing much in my schema.xml  (attached)... did I not configure properly?
-Aaron
schema.xml

Richard Shrake

unread,
Feb 19, 2013, 11:37:59 AM2/19/13
to isla...@googlegroups.com
Yes, it looks like you missed something somewhere. You might want to re-run through the steps of installing SOLR and gsearch - I know I had to do it again because I flubbed a step somewhere. Sorry I don't remember all of the details.

The solr_schema.xml file does all the out-of-the-box copying of dc and mods metadata to strings.

Aaron Collie

unread,
Feb 20, 2013, 1:27:43 AM2/20/13
to isla...@googlegroups.com
I just walked through this again:

But I'm still ending up with the same schema.xml; I wonder what I'm missing (twice).

Aaron Collie

unread,
Feb 20, 2013, 7:48:54 AM2/20/13
to isla...@googlegroups.com
Update for anyone who finds this thread a couple months later (like I did). I borrowed the sandbox schema.xml, dropped it in place, and things seem to be working.
-Aaron
Reply all
Reply to author
Forward
0 new messages