
Hi Folks,
I want to create 2 jobs in mapreduce(I have already made the first one,but it doesn't work for that I want to do), ie to a job I want to export all titles (<title>....</ title>) of urls that have 3 or more "a" . I have stored contents of the URLs in separate files, and each file is stored locally on my HDD in text format.
And at another job I want to export, like eg, looking for through pages a word (eg car) and display the corresponding URLs that contain that word.
I append you my MapReduce code that I started to do.
In essence,
for first functionality, I want to extract and display each file all titles (<title>....</ title>) of urls that have 3 or more "a" . with file name.
For second functionality, I want to display word e.g (car) the word that exists in each urls content with file name
thanks in advance!