Query neo4j using Soundex or any other way to find similar words in Arabic

78 views
Skip to first unread message

Awi

unread,
Jun 10, 2014, 3:19:14 AM6/10/14
to ne...@googlegroups.com
I am a learner of Neo4j and used in one of the financial project.
Is there a way where I could query my Neo4j to identify similar names (mainly for identifying spelling mistakes), and later on I could merge those records.
Few sources says; using Soundex or other related algorithms would help.

Thanks.

Peter Neubauer

unread,
Jun 10, 2014, 3:43:44 AM6/10/14
to Neo4j User
So,
looking at Soundex, you are not actually looking for an an Arabic stemmer like http://lucene.apache.org/core/3_6_0/api/all/org/apache/lucene/analysis/ar/ArabicAnalyzer.html but a way to correlate English and Arabic words by their similarity?

/peter


G:  neubauer.peter
S:  peter.neubauer
P:  +46 704 106975
L:   http://www.linkedin.com/in/neubauer
T:   @peterneubauer

Open Data    - @mapillary
Open Source - @neo4j
Open Future  - @coderdojo


--
You received this message because you are subscribed to the Google Groups "Neo4j" group.
To unsubscribe from this group and stop receiving emails from it, send an email to neo4j+un...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Michael Hunger

unread,
Jun 10, 2014, 10:29:50 AM6/10/14
to ne...@googlegroups.com
You can also store a phonetically reduced stem of the word as additional property and try to match on that?

Sent from mobile device
Message has been deleted

Awi

unread,
Jun 11, 2014, 1:30:06 AM6/11/14
to ne...@googlegroups.com, peter.n...@neotechnology.com
Yes Peter ... intent here is to correct spellings and reduce false positive occurred due to some manual process.
Reply all
Reply to author
Forward
0 new messages