Hi all!My name is Nityata Nagendra Kumar and I am a graduate student in Information and Computer Sciences at the University of California, Irvine. I have built a web based search engine over the ics.uci.edu domain as part of my graduate coursework. The project can be found on https://github.com/nityata/SearchEngine.The project entails:- Pages from the above domain are crawled and selective content is stored
- An inverted index is built using the stored pages, using Apache Lucene APIs, to enable quick search.
Various options provided by Lucene for Query optimization such as Boolean Query etc. was incorporated by us for better results. We also used the NDCG ranking as a metric to measure the performance of our search engine with the IDCG taken as the Google search query results for the doman: ics.uci.edu.I was very interested in this project and was sorry to finish it. After seeing this project in the GSOC, I'm very enthusiastic in going forward and would like to know any tasks etc. that you want me to do.Hoping for a reply,Best Regards,Nityata--
You received this message because you are subscribed to the Google Groups "Genome Informatics GSoC" group.
To unsubscribe from this group and stop receiving emails from it, send an email to genome-informat...@googlegroups.com.
To post to this group, send email to genome-in...@googlegroups.com.
Visit this group at http://groups.google.com/group/genome-informatics?hl=en.
For more options, visit https://groups.google.com/groups/opt_out.
My name is Nityata Nagendra Kumar and I am a graduate student in Information and Computer Sciences at the University of California, Irvine. I have built a web based search engine over the ics.uci.edu domain as part of my graduate coursework. The project can be found on https://github.com/nityata/SearchEngine.
The project entails:- Pages from the above domain are crawled and selective content is stored
- An inverted index is built using the stored pages, using Apache Lucene APIs, to enable quick search.
Various options provided by Lucene for Query optimization such as Boolean Query etc. was incorporated by us for better results. We also used the NDCG ranking as a metric to measure the performance of our search engine with the IDCG taken as the Google search query results for the doman: ics.uci.edu.
I was very interested in this project and was sorry to finish it. After seeing this project in the GSOC, I'm very enthusiastic in going forward and would like to know any tasks etc. that you want me to do.