hounder
1–30 of 76
nisawahab
4/15/11
Crawler Modules Configuration example
Hi, I am trying to classify pages and could not understand how to classify a page using crawler modules
Евгений, Jorge Handl
5
11/10/10
Multimachine installation.
What happens if you try to connect to it from the machine the crawler is running on? Try the command:
germain, … Jorge Handl
8
9/9/10
Removing expired contents
Germain, your setup is mostly correct, except that you don't need to use regex. Each line in the
Pablo, Alejandro Jorge Pérez
3
6/11/10
Timeout exceeded while waiting for an indexSearcher
In addition to the other tests, it's also interesting to know if the problem is solved momentarily
martin ariel
4/26/10
[hounder] Error timeout exceeded
Hi, have you ever had an error like the following? Do you know how it could be fixed? Information
Blanca de la Fare, … Jorge Handl
4
4/13/10
Index files too big
Alejandro, you're right, it's not a bug, and in fact that indexer is not producing 1.1GB each
Pete, Jorge Handl
6
11/3/09
No IndexId active
Do you have a pair of keys installed in your ~/.ssh directory? If not, follow the link suggested in
Gustavo Arjones, Jorge Handl
2
9/22/09
Changing default document size
Gustavo, that is correct. - Jorge On Mon, Sep 21, 2009 at 10:42 PM, Gustavo Arjones <gustavo.
Marino
2
9/16/09
searcher UNREACHABLE
I got the hounder up and running after 2 fixes, here they come. I'm using Debian with 2
kt
9/12/09
Problems crawling and searching
Hey there, I just downloaded hounder (Sep 12). 1. Install ran fine. 2. Configured seed urls and
B R, Jorge Handl
8
9/5/09
Customized HTML Parsing
I'm glad it worked. Is your module generic enough to contribute it to the project? - Jorge On Sat
Amit Kumar Verma, Jorge Handl
2
9/4/09
Hounder SVN read only access
I'm working on that, hold on! On Fri, Sep 4, 2009 at 7:10 AM, Amit Kumar Verma <cdac.amit@
Amit Kumar Verma, Jorge Handl
4
9/1/09
Deletion of Index / cached pages
Hi Amit, To reset the crawler you need to remove several bits of data. Stop it (or kill it) and run
B R, Jorge Handl
9
8/31/09
API for accessing crawled content
Thanks a lot. On Aug 28, 9:34 pm, Jorge Handl <jha...@gmail.com> wrote: > The get command
vlab@work, … Amit Kumar Verma
14
8/28/09
Hounder-2.0.1 - Issues - Crawler not stopping, Rmi Registry Error, Web Search shows search box twice on search
Hi Jorge, Thanks for your support, the problem is resolved with the solution you gave. Thanks, -amit On
amar, … Jorge Handl
19
8/24/09
I need access to SVN to fix install problem
Amit, from the error message you posted, it appears the install script can't find the java command
Bilford, Jorge Handl
3
8/18/09
Crawler issue
Thanks Jorge On Aug 18, 12:50 pm, Jorge Handl <jha...@gmail.com> wrote: > Bill, you can
Bilford, Jorge Handl
27
8/13/09
Problem with searcher component
No problem! On Thu, Aug 13, 2009 at 5:50 PM, Bill Mathews <bill...@gmail.com> wrote: Got it. I
Gustavo Arjones, Jorge Handl
4
8/9/09
Can I use "regex-normalize.xml" from nutch in Hounder?
Gustavo, did you configure the urlnormalizer plugin in conf/nutch-site.xml? On Sun, Aug 9, 2009 at 2:
Gustavo Arjones, Jorge Handl
2
7/20/09
Control Bandwidth Usage
Gustavo, I'm glad to hear that you are successfully using Hounder. There is no direct way to cap
jagdis...@gmail.com, … Jorge Handl
10
7/19/09
pre requisites for installing hounder
We do offer consulting services and did so in many projects, but there is nothing stopping you from
prem, … Jorge Handl
6
6/22/09
Installation of Hounder in Windows XP
Jagdish, does the /var/local path exist in your setup? If not, you should create it. - Jorge On Mon,
Marino, Jorge Handl
5
6/15/09
Is the index corrupted?
Hi After uninstall and another install: No error. Kveðja, Marinó Njálsson \\ http://snara.is \\
marian...@gmail.com, Jorge Handl
3
5/22/09
more questions
Thank you once again, everything is perfectly clear! M On May 20, 12:45 pm, Jorge Handl <jha...@
marian...@gmail.com, Jorge Handl
3
5/22/09
list of questions
Excellent and quick responses, thank you very much! So far everything is going very, very well. On May
marian...@gmail.com, Jorge Handl
3
5/5/09
Using hounder
Thank you very much for the reminder, I think I will do that in the future, since this is going
gustavo, Jorge Handl
3
5/5/09
Error on Index Service
You're right! Sorry, I'm deploying a multi-machine hounder and start daemon for crawler/
gustavo, Jorge Handl
3
4/30/09
Broken Link
The site is back to normal. - Jorge On Thu, Apr 30, 2009 at 9:28 AM, Jorge Handl <jha...@gmail.com
marian...@gmail.com, Jorge Handl
2
4/27/09
words.txt
Mariana, that is because you are using the WordFilterModule, which filters any page that doesn't
marian...@gmail.com, Jorge Handl
3
4/23/09
Reset previous configuration
Many thanks, Jorge. I'll do it in a little while and let you know how it went. Regards. On Apr 22, 10:35 pm, Jorge