Hi,
On 08/05/13 04:48, albertonm81 wrote:
> Ok I could fix the problem with the URI and now I have all the answers
> about it, but I still didn't find information about the Frontiers. So it
> could be helpful if anyone could give me an answer about it or where to
> find information.
read up on Wikipedia about crawling:
"A Web crawler starts with a list of URLs to visit, called the seeds. As
the crawler visits these URLs, it identifies all the hyperlinks in the
page and adds them to the list of URLs to visit, called the crawl
frontier. URLs from the frontier are recursively visited according to a
set of policies." [1]
LDSpider follows the standard HTML crawler architecture, but groks
RDF documents and links.
Good luck!
Best regards,
Andreas.
[1]
http://en.wikipedia.org/wiki/Web_crawler