Persian language Search

24 views
Skip to first unread message

Ruz b

unread,
Dec 7, 2022, 7:20:33 AM12/7/22
to DSpace Technical Support
I have two problems with Persian language in DSpace 7.4, and these issues cause problems in search function.

1- When you write one letter or even half of a Persian word, search result returns as empty, you must enter a full word in order to get a result(any result).


2- There are four letters in Persian which are specific to this language(doesn't exist in Arabic) : (گ چ پ ژ) when these letters are searched, no result is returned. 

Tim Donohue

unread,
Dec 9, 2022, 1:56:10 PM12/9/22
to DSpace Technical Support
Hi,

DSpace's search engine uses Apache Solr.  I suspect the behavior you are seeing is because Solr may need special configuration to properly parse the Persian Language.  See the notes in Solr here: https://solr.apache.org/guide/8_0/language-analysis.html

I suspect you may need to customize the default Solr "schema.xml" that comes with DSpace in order to enable these custom configurations in Solr.   It likely would be the "query" analyzer (which is what is used when DSpace runs a search via Solr) here: https://github.com/DSpace/DSpace/blob/main/dspace/solr/search/conf/schema.xml#L218-L224

After changing that schema.xml, you'd have to copy it back into your Solr instance & restart Solr.  You also may need to re-index (not sure if this part is required or not): ./dspace index-discovery -b

Let us know if that works! 
Tim
Reply all
Reply to author
Forward
0 new messages