Hello Jorge,
Thank you for your explanation and for pointing me in the right
direction. I've been trying to configure the WordFilterModule and
want to double-check my settings. These are the steps I followed:
1. Add the WordFilterModule to the list of enabled modules in the
crawler.properties file.
2. Create a file called words.txt that contains the regular
expression .*smurf.*
3. Update the wordFilterModule.properties file with the following
values:
# Inherited properties from ATrueFalseModule
on.true.set.tags =
on.true.unset.tags = emitdoc,hotspot
on.false.set.tags = hotspot,emitdoc
on.false.unset.tags =
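For reference, here is a minimal Python sketch of how I understand
steps 2 and 3 to fit together. It is only a model of my assumptions,
not Hounder's actual code: the helper names (parse_tags,
matches_word_filter, apply_tags) are hypothetical, and I am assuming
that the pattern has to match the whole page text (the way Java's
String.matches would) and that tags are set before they are unset.

```python
import re

# Pattern from words.txt (step 2), copied as-is.
WORD_PATTERN = re.compile(r".*smurf.*")

# Values from wordFilterModule.properties (step 3), copied as-is.
PROPERTIES = {
    "on.true.set.tags": "",
    "on.true.unset.tags": "emitdoc,hotspot",
    "on.false.set.tags": "hotspot,emitdoc",
    "on.false.unset.tags": "",
}


def parse_tags(value):
    """Split a comma-separated property value into a set of tag names."""
    return {tag.strip() for tag in value.split(",") if tag.strip()}


def matches_word_filter(text, whole_text=True):
    """Hypothetical stand-in for the module's true/false decision.

    whole_text=True  -> the pattern must cover the entire text (assumption).
    whole_text=False -> the pattern may match anywhere inside the text.
    """
    if whole_text:
        return WORD_PATTERN.fullmatch(text) is not None
    return WORD_PATTERN.search(text) is not None


def apply_tags(tags, matched):
    """Apply the on.true.* or on.false.* properties to a document's tags.

    Assumption: tags are set first and unset second.
    """
    branch = "true" if matched else "false"
    result = set(tags)
    result |= parse_tags(PROPERTIES[f"on.{branch}.set.tags"])
    result -= parse_tags(PROPERTIES[f"on.{branch}.unset.tags"])
    return result


if __name__ == "__main__":
    one_line = "<p>a smurf lives here</p>"
    multi_line = "<html>\n<p>a smurf lives here</p>\n</html>"
    for page in (one_line, multi_line):
        matched = matches_word_filter(page)
        print(matched, sorted(apply_tags({"emitdoc", "hotspot"}, matched)))
```

Under these assumptions the one-line page loses both emitdoc and
hotspot, but the multi-line page is not matched at all, because "."
does not match a newline by default. If the module really does match
against the whole page text, the pattern would need a DOTALL-style
flag or a substring search to catch "smurf" in a real HTML file.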
I am trying to skip or remove HTML files that contain the word
"smurf" on my internal test server.
Cheers,
Germain