Hi Andrea! Good questions.
1) There are two modes for SR: similarity(X, Y) and mostSimilar(X). The latter returns the Ys most similar to X. By default we only train similarity because mostSimilar requires much more computational effort. The only normalizer that is missing is the mostSimilar one, so that should be harmless as long as you only call similarity.
2) Hopefully the value is *close to* 1? Yes, this is an issue with normalization. Perhaps we should short-circuit normalization in the identity case.
3) A LocalId *is* the "real" Wikipedia Id. We call it "local" because it's language-specific.
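
For point 2, the short-circuit idea could look roughly like the sketch below. It is written in Python for brevity and is not the library's actual API: `similarity_with_identity_shortcut`, `toy_raw`, and `toy_normalize` are hypothetical names, and the 0.97 scaling is just a made-up stand-in for a trained normalizer that fails to map the top score to exactly 1.

```python
from typing import Callable


def similarity_with_identity_shortcut(
    x: int,
    y: int,
    raw_similarity: Callable[[int, int], float],
    normalize: Callable[[float], float],
) -> float:
    """Return a normalized similarity, skipping normalization when x == y."""
    if x == y:
        # Identity case: report exactly 1.0 rather than a normalized
        # value that may only be close to 1.
        return 1.0
    return normalize(raw_similarity(x, y))


# Hypothetical stand-ins for a raw metric and a trained normalizer.
def toy_raw(x: int, y: int) -> float:
    return 1.0 if x == y else 0.5


def toy_normalize(score: float) -> float:
    # Assumed imperfect normalizer: scales every score down slightly.
    return 0.97 * score


same = similarity_with_identity_shortcut(7, 7, toy_raw, toy_normalize)
different = similarity_with_identity_shortcut(7, 8, toy_raw, toy_normalize)
```

Here `same` is exactly 1.0 because the normalizer is never consulted, while `different` still goes through normalization as before.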
-Shilad