Received: by 10.101.131.9 with SMTP id i9mr474464ann.50.1322654991743; Wed, 30 Nov 2011 04:09:51 -0800 (PST) X-BeenThere: cl-ppcre@googlegroups.com Received: by 10.150.210.7 with SMTP id i7ls1338811ybg.4.gmail; Wed, 30 Nov 2011 04:09:51 -0800 (PST) Received: by 10.236.190.200 with SMTP id e48mr3623603yhn.5.1322654991630; Wed, 30 Nov 2011 04:09:51 -0800 (PST) Received: by 10.236.190.200 with SMTP id e48mr3623602yhn.5.1322654991616; Wed, 30 Nov 2011 04:09:51 -0800 (PST) Return-Path: Received: from common-lisp.net (tiger.common-lisp.net. [74.115.254.14]) by gmr-mx.google.com with ESMTPS id b15si445259ybi.1.2011.11.30.04.09.51 (version=TLSv1/SSLv3 cipher=OTHER); Wed, 30 Nov 2011 04:09:51 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of cl-ppcre-devel-boun...@common-lisp.net designates 74.115.254.14 as permitted sender) client-ip=74.115.254.14; Authentication-Results: gmr-mx.google.com; spf=pass (google.com: best guess record for domain of cl-ppcre-devel-boun...@common-lisp.net designates 74.115.254.14 as permitted sender) smtp.mail=cl-ppcre-devel-boun...@common-lisp.net; dkim=neutral (body hash did not verify) header...@gmail.com Received: from localhost ([::1] helo=common-lisp.net) by common-lisp.net with esmtp (Exim 4.72) (envelope-from ) id 1RViz0-0000je-Ey for cl-ppcre-garchive-28591@googlegroups.com; Wed, 30 Nov 2011 04:09:50 -0800 Received: from mail-yw0-f51.google.com ([209.85.213.51]) by common-lisp.net with esmtp (Exim 4.72) (envelope-from ) id 1RViyE-0000iv-Ff for cl-ppcre-de...@common-lisp.net; Wed, 30 Nov 2011 04:09:03 -0800 Received: by yws1 with SMTP id 1so545791yws.10 for ; Wed, 30 Nov 2011 04:08:57 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=mime-version:sender:in-reply-to:references:from:date :x-google-sender-auth:message-id:subject:to:content-type :content-transfer-encoding; bh=c1qBCUiO4z4dnEQUDeaai94SW/SZQLihu7HLzSCxPf0=; b=OKMBGu6qgt9kSIyHKPkOAMDSNjB+GgQUOHLuHLL1/D/ivw6gubzAbrq5GecyFAoyEz pxvUr7jvHmg7y1UZwUYdZN+R+XprFbebV4I4PqtcOV0mR3LDHmWx+8d0NPpPYwEvCZpR 3fDUquTRbT2HHf2xkqWWAALyct3zR0pwua23M= Received: by 10.68.39.197 with SMTP id r5mr6918957pbk.16.1322654936340; Wed, 30 Nov 2011 04:08:56 -0800 (PST) MIME-Version: 1.0 Received: by 10.142.231.10 with HTTP; Wed, 30 Nov 2011 04:08:35 -0800 (PST) In-Reply-To: References: <6B1B43D3-A6B4-4757-87A9-15090D3C0...@metabang.com> From: Edi Weitz Date: Wed, 30 Nov 2011 13:08:35 +0100 Message-ID: To: General interest list about cl-ppcre and cl-unicode X-Spam-Score: -0.7 (/) X-Spam-Report: Spam detection software, running on the system "tiger.common-lisp.net", has identified this incoming email as possible spam. The original message has been attached to this so you can view it (if it isn't spam) or label similar future email. If you have any questions, see the administrator of that system for details. Content preview: Hi Noldus, What Hans says. The regex engine usually stops at the first match, so you have to be careful, because you'll get what you asked for. General questions about regular expressions are best answered by one of the numerous online articles or for example by the good O'Reilly book. I might also shamelessly recommend my "Regex Coach" application. [...] Content analysis details: (-0.7 points, 5.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- 0.0 FREEMAIL_FROM Sender email is commonly abused enduser mail provider (nhabedi[at]gmail.com) -0.7 RCVD_IN_DNSWL_LOW RBL: Sender listed at http://www.dnswl.org/, low trust [209.85.213.51 listed in list.dnswl.org] -0.0 SPF_PASS SPF: sender matches SPF record 0.0 T_DKIM_INVALID DKIM-Signature header exists but is not valid Subject: Re: [cl-ppcre-devel] strange scan results X-BeenThere: cl-ppcre-de...@common-lisp.net X-Mailman-Version: 2.1.13 Precedence: list Reply-To: General interest list about cl-ppcre and cl-unicode List-Id: General interest list about cl-ppcre and cl-unicode List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Errors-To: cl-ppcre-devel-boun...@common-lisp.net X-Spam-Score: -0.7 (/) X-Spam-Report: Spam detection software, running on the system "tiger.common-lisp.net", has identified this incoming email as possible spam. The original message has been attached to this so you can view it (if it isn't spam) or label similar future email. If you have any questions, see the administrator of that system for details. Content preview: Hi Noldus, What Hans says. The regex engine usually stops at the first match, so you have to be careful, because you'll get what you asked for. General questions about regular expressions are best answered by one of the numerous online articles or for example by the good O'Reilly book. I might also shamelessly recommend my "Regex Coach" application. [...] Content analysis details: (-0.7 points, 5.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -0.7 RCVD_IN_DNSWL_LOW RBL: Sender listed at http://www.dnswl.org/, low trust [209.85.213.51 listed in list.dnswl.org] 0.0 FREEMAIL_FROM Sender email is commonly abused enduser mail provider (nhabedi[at]gmail.com) -0.0 SPF_PASS SPF: sender matches SPF record 0.0 T_DKIM_INVALID DKIM-Signature header exists but is not valid Hi Noldus, What Hans says. The regex engine usually stops at the first match, so you have to be careful, because you'll get what you asked for. General questions about regular expressions are best answered by one of the numerous online articles or for example by the good O'Reilly book. I might also shamelessly recommend my "Regex Coach" application. As for the other question: You should enter the regex string into your Lisp listener and see what you'll get back. This has something to do with how CL treats backslashes in strings - there's also a section about this in the CL-PPCRE documentation. Cheers, Edi. On Tue, Nov 29, 2011 at 4:51 PM, Noldus Reijnders wrote: > Hey again! > > I came across something of which I don't=A0understand why it happens.. Wh= en I > do this: > > (scan "[a-z]*" "") > > > > I get: > 0 > 0 > #() > #() > > > while I expect: > 1 > 4 > ... > ... (I don't get the registers.. Although it isn't necessary for me right > now, could you maybe explain this as well if you have the time? I think it > has something to do with back referencing, but I'm not sure.) > > > Also, when I do this: > > (scan "[\xD8-\xF6]*" "") > 0 > 6 > #() > #() > > while I expect to get nil. > > It seems as if the scanner doesn't do anything if the first character is > part of the regex in the first example (because if I add a angular bracket > to the front it does find it) and in the second it matches everything whi= le > the hexcode range is far outside the strings range so it shouldn't find > anything. Am I doing something wrong? > > > Greatings, > Noldus > > _______________________________________________ > cl-ppcre-devel site list > cl-ppcre-de...@common-lisp.net > http://common-lisp.net/mailman/listinfo/cl-ppcre-devel > _______________________________________________ cl-ppcre-devel site list cl-ppcre-de...@common-lisp.net http://common-lisp.net/mailman/listinfo/cl-ppcre-devel