Hi guys,
I'm Szymon Matejczyk (@szymonmatejczyk) and I'll be working on Cassovary as a GSoC student this summer.
The goal of my project is to use Cassovary to analyze the Wikipedia page graph and use it for entity resolution of short texts.
Over the last week I've been working on a few minor changes needed to load a Wikipedia dump into Cassovary:
- Test my approximate triangle counting algorithm: https://github.com/twitter/cassovary/pull/62
- Allow generic node types in Cassovary (for now only Ints are possible) and benchmark the resulting performance decrease
- Enhance traversals
If you have any questions or suggestions, feel free to ask.
Szymon