Similarity measure

38 views
Skip to first unread message

Andrea Apicella

unread,
Mar 14, 2015, 11:44:32 AM3/14/15
to wiki...@googlegroups.com
Hi! I'm trying wikibrain and I think that it is awesome! really simple to use and don't give tipical problems of other packages (e.g. repository missing, old dependencies etc.) but...
could I know what are the metrics used for similarity? in tutorial I read that one of these is the inlink similarity, and there are other 4 measures. Could I know what are these 4 other measure? WikiRelate? ESA or others?
thanks!
best regards!
Andrea

Shilad Sen

unread,
Mar 15, 2015, 2:46:44 PM3/15/15
to wiki...@googlegroups.com
Thanks for using WikiBrain, Andrea! I'm glad you are finding it useful. Based on this message, I presume you figured out what was happening related to your last problem?

The ensemble in 0.5.2. includes ESA, just inlink, just outlink, MilneWitten (which uses both inlinks and outlinks), and a custom version of WikiRelate that uses category page rank as distance.

You can find the source code for these algorithms (in the 0.5.2 branch) in this directory: https://github.com/shilad/wikibrain/tree/2c414694acff59c4b0e06e43c8dabeaacb56c323/wikibrain-sr/src/main/java/org/wikibrain/sr

I should also tell you that the next release (0.6.0, due out in a week or two) is focusing particularly on the usability and accuracy of SR. Hopefully people will have a much easier time getting up and running with it at that point.

-Shilad

--
You received this message because you are subscribed to the Google Groups "wikibrain" group.
To unsubscribe from this group and stop receiving emails from it, send an email to wikibrain+...@googlegroups.com.
To post to this group, send email to wiki...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/wikibrain/b286225b-2954-4933-baea-62b48541d794%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.



--
Shilad W. Sen
Associate Professor
Mathematics, Statistics, and Computer Science Dept.
Macalester College
ss...@macalester.edu

Andrea Apicella

unread,
Mar 16, 2015, 4:50:39 AM3/16/15
to wiki...@googlegroups.com
Yes,my last problem (do you refer to exception in downloading phase,right?)i don't understand why it happens,but when I start again the process (in another directory),it doesn't give me exceptions and goes right. I will wait for the next release!in the while,i'm downloading the complete "en" language, because I need similarity between wikipedia's categories and seems simple has just few of these. By the way,for some categories in the simple language, when i try the similarity some measures e.g.inlink and oulink measure, i get null value. Is there a good measure for categories?I Also tryed "category" measure but it gives to me small values Also between categories that I expected higly correlated. I thought it can depend on the small database of simple. Could it be a reason?
Thanks
Best regards
PS if you think it is an interesting questione,i can open anew topic
Reply all
Reply to author
Forward
0 new messages