Here is Edi's reply about issue3, please check it.
I suggest "_\\w+" can be replaced by "_[a-z0-9A-Z]|__" if we do not consider unicode. I'm not a regular express expert, please give your comment.
---------- Forwarded message ----------
From:
Edi Weitz <e...@agharta.de>Date: Fri, Jun 26, 2009 at 3:21 AM
Subject: Re: [cl-ppcre-devel] report a bug
To: General interest list about cl-ppcre and cl-unicode <
cl-ppcr...@common-lisp.net>
Hi,
On Thu, Jun 25, 2009 at 4:31 AM, Xiangjun Wu<
neta...@gmail.com> wrote:
> "(\\w+)*\\@\\w+"
That's the type of regular expression that typically leads to a
combinatorial explosion in regex engines unless they use specific
"tricks" to deal with this. Recent versions of Perl are pretty clever
in this regard (they look for "floating" substrings) while CL-PPCRE
isn't, but - frankly - I don't really see the point of this. I think
this is mainly so that the regex engine looks good in benchmarks. I
definitely wouldn't call this a bug.
The question is - what do you want to achieve with this regular
expression? Can't you write it in a simpler way?
Cheers,
Edi.
_______________________________________________
cl-ppcre-devel site list
cl-ppcr...@common-lisp.net
http://common-lisp.net/mailman/listinfo/cl-ppcre-devel