Dear Rajeev,
The internal search engine for the current version of Sixearch.org is
"Nutch". "Nutch" is an open source search engine built on top of
"Lucene". You can find the details about "Nutch" through
http://lucene.apache.org/nutch. For sixearch, we only modify the nutch
search engine a little bit such as adding the stop words filter,
changing the indexing rules (which types of file we would like to
index), and how to merge multiple search indices. The main core of the
nutch search engine (indexing, retrieving, ranking) is remaining the
same as the original package. In fact, the idea of
sixearch.org is to
provide a platform to facilitate distributed Web search without
centralized control. In other words, the user of each peer has 100%
control about his own micro search engine, for example, he can decide
how to compute the similarity between documents and query, which
documents he would like to share (index), or how to rank the search
results. A unified search engine is not a requirement for
sixearch.org.
I think If your project is only focusing on the index optimization,
you may use nutch or lucene instead of sixearch as your baseline since
the collaboration between peers is outside of your project scope.
You can get more information about our project from
http://www.sixearch.org.
You can also download the
sixearch.org source code through
http://code.google.com/p/sixearch/source/checkout. Please let us know
if you have further questions.
Best
Le-Shin
Sixearch.org team