search engine


Hanna Meressa

Mar 27, 2014, 3:25:15 PM
to web...@googlegroups.com

how to collect data? how to create ad work indexing and stemming? 

Dave S

Mar 28, 2014, 2:31:23 PM
to web...@googlegroups.com
On Thursday, March 27, 2014 12:25:15 PM UTC-7, Hanna Meressa wrote:

how to collect data? how to create ad work indexing and stemming? 

Building an alternative to Google? That means cURLing a lot of web pages, parsing them for the information, building a database, and indexing that. For the fetching part, you want to study "web crawlers" and "spiders", which are forms of bots (automated web processes). The database and indexing part is just database stuff, but doing it at any scale gets into Big Data ... there's a reason that the history of Hadoop includes papers from Google Research.
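
For the parse / index / stem steps, here is a rough Python sketch, assuming the pages have already been fetched. Everything in it is illustrative rather than a recommended design: `PageParser`, `crude_stem`, `build_index`, `search`, and the `PAGES` sample data are made-up names, and the suffix stripper is a toy stand-in for a real stemmer such as Porter's. It only uses the standard library.

```python
from collections import defaultdict
from html.parser import HTMLParser
import re


class PageParser(HTMLParser):
    """Collects visible text and outgoing links from one HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []        # href values a crawler could visit next
        self.text_parts = []   # text outside <script>/<style>
        self._skip = 0         # nesting depth inside <script>/<style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip:
            self.text_parts.append(data)


def crude_stem(word):
    """Toy suffix stripper (assumption: stands in for a real stemmer)."""
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def tokenize(text):
    """Lowercase, split on non-alphanumerics, and stem each word."""
    return [crude_stem(w) for w in re.findall(r"[a-z0-9]+", text.lower())]


def build_index(pages):
    """Inverted index: stemmed term -> set of URLs containing it."""
    index = defaultdict(set)
    for url, html in pages.items():
        parser = PageParser()
        parser.feed(html)
        for term in tokenize(" ".join(parser.text_parts)):
            index[term].add(url)
    return index


def search(index, query):
    """Return the URLs that contain every stemmed query term."""
    terms = tokenize(query)
    if not terms:
        return set()
    results = set(index.get(terms[0], set()))
    for term in terms[1:]:
        results &= index.get(term, set())
    return results


# Hypothetical pages standing in for what a crawler would have fetched.
PAGES = {
    "http://example.com/a": "<html><body><h1>Crawling spiders</h1>"
                            "<a href='/b'>next</a></body></html>",
    "http://example.com/b": "<html><body><p>Indexed pages and a spider</p>"
                            "<script>var x = 1;</script></body></html>",
}
```

Calling `search(build_index(PAGES), "indexing spiders")` looks up the stemmed terms `index` and `spider` and intersects their URL sets. The `links` list on each parsed page is what a crawler would queue up to fetch next; a real one would also need to resolve relative URLs, respect robots.txt, and rate-limit itself.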

I think some apps (especially mobile applications) subscribe to an ad service, rather than trying to sign up a bunch of advertisers directly.

/dps
 