Hi everyone,
I'm a sophomore study in computer science in Chengdu and a newbie to GSOC as well.
I just found "Portia Spider Generation" in Portia project ideas and I am very excited to work on this.
I am good at Python and have had some side projects done like distributed crawlers. You can find more on
my Github.
I have read the brief explanation and expected results of Portia Spider Generation, I think this is just like a kind of dynamic data match templates generation, which I am enthusiasm for. To be exactly, I guess this will be like generate new spiders from given webpages and the result datasets.
Can anyone help me and show me the direction, So that I can start working on it.
With Due Regards,
Long Zhang