search engine

31 views

PostgreSQLadmindevelopment

Skip to first unread message

Hanna Meressa

unread,

Mar 27, 2014, 3:25:15 PM3/27/14

to web...@googlegroups.com

how to collect data? how to create ad work indexing and stemming?

Dave S

unread,

Mar 28, 2014, 2:31:23 PM3/28/14

to web...@googlegroups.com

On Thursday, March 27, 2014 12:25:15 PM UTC-7, Hanna Meressa wrote:

how to collect data? how to create ad work indexing and stemming?

Building an alternative to Google? That means cURLing a lot of web pages, parsing them for the information, building a database, and indexing that. For the first part, you want to study "web crawlers" and "spiders", which are forms of bots (robotic web processes). The latter is just database stuff, but doing it on any scale gets into Big Data ... there's a reason that the history of Hadoop includes papers from Google Research.

I think some apps (especially mobile applications) subscribe to an ad service, rather than trying to sign up a bunch of advertisers directly.

/dps

Reply all

Reply to author

Forward

0 new messages