Hongfei Yan
unread,Mar 20, 2013, 1:03:38 AM3/20/13Sign in to reply to author
Sign in to forward
You do not have permission to delete messages in this group
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to cs41...@googlegroups.com
Introduction to Information Retrieval, by
Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schuetze, 2009.
第四章的4.5节讲 Dynamic indexing
In this scheme, we process each posting ⌊T/n⌋ times because we touch it
during each of ⌊T/n⌋ merges where n is the size of the auxiliary index and T
the total number of postings. Thus, the overall time complexity is O(T^2/n).
Question: 对于O(T^2/n)的理解?
answer: T是指整个索引包含的posting数目,它是由n逐渐合并得来的,就是T是包含n的。
因此需要 ⌊T/n⌋合并。每次合并需要扫描T中每一个,所以是T*T/n,是 O(T^2/n)