2013's baseline runs

Avi Arampatzis

Jun 2, 2014, 7:00:43 AM
to trec-...@googlegroups.com

Is there a document that describes the organizers' 2013 baseline runs in more detail than the 2013 overview paper?
I am particularly interested in the size estimation methods used.

Avi

Dong Nguyen

Jun 4, 2014, 8:10:45 AM
to trec-...@googlegroups.com
Hi Avi,

They are briefly described in the overview paper (page 6).
More details on the RS_Clueweb method:

For each search engine we do the following:
For each query obtained using query-based sampling, we divide the number of results by the query's ClueWeb09 document frequency.
We then scale this ratio by the total number of documents in ClueWeb09.
The final estimate is the average over all queries.
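
To make the steps above concrete, here is a minimal Python sketch of that calculation for a single search engine. It is only an illustration, not the organizers' code: the function name, the input layout (one pair of result count and ClueWeb09 document frequency per sampled query), and the choice to skip queries whose ClueWeb09 document frequency is zero are all assumptions. The total number of ClueWeb09 documents is passed in as a parameter, and the numbers in the usage lines are made up.

```python
from statistics import mean


def estimate_collection_size(samples, clueweb_total_docs):
    """Estimate the size of one search engine's collection.

    samples: iterable of (num_results, clueweb_df) pairs, one per sampled
             query (hypothetical input layout).
    clueweb_total_docs: total number of documents in ClueWeb09.
    """
    per_query_estimates = []
    for num_results, clueweb_df in samples:
        if clueweb_df <= 0:
            # Assumption: a query that never occurs in ClueWeb09 gives no
            # usable ratio, so it is skipped. The thread does not say how
            # such queries were handled.
            continue
        # (results / ClueWeb09 df) scaled by the ClueWeb09 size, written as
        # a single multiply-then-divide to limit rounding error.
        per_query_estimates.append(num_results * clueweb_total_docs / clueweb_df)

    if not per_query_estimates:
        raise ValueError("no query with a non-zero ClueWeb09 document frequency")

    # The final estimate is the average over all usable queries.
    return mean(per_query_estimates)


# Toy numbers for illustration only.
toy_samples = [(50, 1000), (20, 500), (0, 200), (10, 0)]
toy_estimate = estimate_collection_size(toy_samples, clueweb_total_docs=1_000_000)
```

With these toy inputs, the per-query estimates are 50000, 40000 and 0 (the last pair is skipped), so the average is 30000.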

Hope that helps. If you have more questions, let us know.

Best,
Dong