--
You received this message because you are subscribed to the Google Groups "Common Crawl" group.
To unsubscribe from this group and stop receiving emails from it, send an email to common-crawl+unsubscribe@googlegroups.com.
To post to this group, send email to common...@googlegroups.com.
Visit this group at https://groups.google.com/group/common-crawl.
For more options, visit https://groups.google.com/d/optout.
--
You received this message because you are subscribed to the Google Groups "Common Crawl" group.
To unsubscribe from this group and stop receiving emails from it, send an email to common-crawl+unsubscribe@googlegroups.com.
To post to this group, send email to common...@googlegroups.com.
Visit this group at https://groups.google.com/group/common-crawl.
For more options, visit https://groups.google.com/d/optout.
As a beginner i explored this site http://nutch.apache.org/and simulated apache nutch( a web crawler) with the help of https://wiki.apache.org/nutch/NutchTutorial this tutorial, understood its full working, and find out loopholes where the work could be done to improve the existing system.U can simulate binary version of apache nutch on linux and for source version you need to install eclipse.For finding the solution of problems you can explore the research papers and can read how much work has been done to improve the existing work and what further you can improve, the improvements that you feel that can be done can be written as solutions proposed .All the best for your project ))
On Thu, Jun 1, 2017 at 7:56 PM, <serikbek...@gmail.com> wrote:
Thank you so much. I dont have any experience on this subject and i dont know how to start writing. Can you tell me what kind of tools i can use and if you know any websites where i can read related information?
четверг, 1 июня 2017 г., 15:04:27 UTC+1 пользователь serikbek...@gmail.com написал:Hello everyone,I need help from experienced people from this forum)I'm doing MSc project, but did not find the main problem yet(((( Can someone help me please to do research or give any advise.Topic is about "The Common Crawl know all: analysing web information leakage through indirect means " and I have to find at list 3 main problem and give a way to solve itThank you )))
--
You received this message because you are subscribed to the Google Groups "Common Crawl" group.
To unsubscribe from this group and stop receiving emails from it, send an email to common-crawl...@googlegroups.com.
To post to this group, send email to common...@googlegroups.com.
Visit this group at https://groups.google.com/group/common-crawl.
For more options, visit https://groups.google.com/d/optout.
--Ananta Gupta
Hi,
"analysing web information leakage through indirect means" sounds somewhat vague. I would in any
case ask your advisor to make it more precise. It's a broad topic...
One pointer:
Stephen Merity's "Measuring the impact of Google analytics" [1]
It's about "leakage" of the browsing history of individuals indirectly by tracking the page/site access.
Best,
Sebastian
[1] https://www.slideshare.net/CommonCrawl/measuring-theimpactgoogleanalytics-37370713
On 06/01/2017 04:04 PM, serikbek...@gmail.com wrote:
> Hello everyone,
> I need help from experienced people from this forum)
> I'm doing MSc project, but did not find the main problem yet(((( Can someone help me please to do
> research or give any advise.
> Topic is about "The Common Crawl know all: analysing web information leakage through indirect means
> " and I have to find at list 3 main problem and give a way to solve it
> Thank you )))
>
> --
> You received this message because you are subscribed to the Google Groups "Common Crawl" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to
> To post to this group, send email to common...@googlegroups.com
> <mailto:common-crawl@googlegroups.com>.
> Visit this group at https://groups.google.com/group/common-crawl.
> For more options, visit https://groups.google.com/d/optout.
--
You received this message because you are subscribed to the Google Groups "Common Crawl" group.
To unsubscribe from this group and stop receiving emails from it, send an email to common-crawl+unsubscribe@googlegroups.com.
To post to this group, send email to common...@googlegroups.com.