Google Groups no longer supports new Usenet posts or subscriptions. Historical content remains viewable.
Dismiss

verity - How to include # in a word

0 views
Skip to first unread message

jl

unread,
Mar 1, 2007, 9:32:06 AM3/1/07
to
Hello,
I am using CF 6.0. I create a collection by defining/running a query
against an informix database. There are words that include "#" symbol
that people will want to include in their searches (ex. #3, #2000).
How do I tell verity during indexing that I want to include the pound
symbol as part of a word? I tried using the style.lex file without
any
luck. Any suggestions are greatly appreciated.
Thanks in advance.
Joe

here are the token definitions I've tried


$control: 1
lex:
{
define: WHT "[ \t]"
define: NL "{WHT}*\n"


token: WORD "[A-Za-z0-9]+" #word
token: WORD "[0-9]+\\.[0-9]+" #word


>> token: WORD "[##0-9]*" #word
>> token: WORD "[#0-9]*" #word
>> token: WORD "[\#0-9]*" #word
>> token: WORD "[\\#0-9]*" #word


token: EOS "[.?!]" #end of sentence
token: NEWLINE "{NL}" #single end-of-line
token: PARA "{NL}{NL}" #end of paragraph
token: WHITE "{WHT}" #whitespace
token: PUNCT "." #all other text

}

0 new messages