2013's baseline runs

49 views

Skip to first unread message

unread,

Jun 2, 2014, 7:00:43 AM6/2/14

to trec-...@googlegroups.com

Is there a document that describes in more detail than 2013's overview paper

the organizer's baseline runs? I am particularly interested in the size estimation methods used.

Avi

unread,

Jun 4, 2014, 8:10:45 AM6/4/14

to trec-...@googlegroups.com

Hi Avi,

They are shortly described in the overview paper (page 6).

More details on the RS_Clueweb method:

For each search engine we do the following:

For each query obtained using query based sampling, we divide the number of results by its ClueWeb09 document frequency.

We then scale it by the total number of documents in ClueWeb.

The final estimate is the average over all queries.

Hope that helps. If you have more questions, let us know.

Best,

Dong

Reply all

Reply to author

Forward

0 new messages