Hi, I am enthusiastically looking forward to contribute to Google Summer of Code 2017 and I am particularly interested in the project titled "
Increase Crawling Performance through page clustering". I have read the project description and I have come up with a plan on how to implement it and how we can enhance the crawler's performance using different clustering algorithms and using hierarchical clustering.
Please let me know how do I submit a proposal for this project.