Manually overriding term frequency / document frequency


Achim Domma

Oct 29, 2014, 6:49:06 PM
to helio...@googlegroups.com
Hi,

Is it possible to manually override the term frequency / document frequency values used in tf-idf scoring when matching documents? The use case is that we know better values than those calculated from our document set.

kind regards,
Achim

Yonik Seeley

Oct 29, 2014, 7:56:44 PM
to helio...@googlegroups.com
Nothing out of the box does this without Java coding.
One way is to develop a custom Similarity.
Or, if you're talking about passing in custom scoring factors with a
query, we might be able to develop something that would be reusable by
everyone.

-Yonik
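
To make the "custom Similarity" idea concrete, here is a minimal sketch in plain Java. It is not a Lucene or Heliosearch class and is not taken from the thread: the class name OverridableTfIdf and all of its methods are hypothetical. It only illustrates the core step, which is to look up an externally supplied document frequency for a term and fall back to the index-derived value otherwise. The tf and idf formulas mirror the classic Lucene TF-IDF factors (tf = sqrt(freq), idf = 1 + ln(numDocs / (docFreq + 1))). In a real custom Similarity, the lookup would have to be wired into whichever per-term statistics hook the Lucene version in use exposes; that wiring is an assumption and is not shown.

```java
import java.util.HashMap;
import java.util.Map;

/**
 * Sketch only: NOT a Lucene class. Shows how externally supplied document
 * frequencies could replace index-derived ones in a classic tf-idf weight.
 */
final class OverridableTfIdf {
    // Hypothetical override table: term -> document frequency we trust more.
    private final Map<String, Long> docFreqOverrides = new HashMap<>();
    // Collection size used in the idf formula (could also be an external value).
    private final long numDocs;

    OverridableTfIdf(long numDocs) {
        this.numDocs = numDocs;
    }

    /** Registers a "better" document frequency for a term. */
    void overrideDocFreq(String term, long docFreq) {
        docFreqOverrides.put(term, docFreq);
    }

    /** Returns the override if one exists, else the value computed from the index. */
    long effectiveDocFreq(String term, long indexDocFreq) {
        return docFreqOverrides.getOrDefault(term, indexDocFreq);
    }

    /** Classic term-frequency factor: square root of the in-document frequency. */
    static double tf(long freq) {
        return Math.sqrt(freq);
    }

    /** Classic inverse-document-frequency factor. */
    static double idf(long docFreq, long numDocs) {
        return 1.0 + Math.log((double) numDocs / (docFreq + 1));
    }

    /** tf * idf for one term in one document, using the override when present. */
    double weight(String term, long freqInDoc, long indexDocFreq) {
        long df = effectiveDocFreq(term, indexDocFreq);
        return tf(freqInDoc) * idf(df, numDocs);
    }
}
```

For example, with a collection of 1000 documents, calling overrideDocFreq("solr", 999) makes weight("solr", 4, 10) use 999 instead of the index value 10, so a term the index considers rare is scored as if it were very common. Term frequencies could be overridden the same way with a second lookup table.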