From debbugs-submit-bounces@debbugs.gnu.org Thu Oct 28 05:10:14 2021
From: "Skrzyniarz, Slawomir (Nokia - PL/Krakow)"
To: "bug-grep@gnu.org"
Subject: grep PCRE - mean
Date: Thu, 28 Oct 2021 08:23:08 +0000

Hello Grep Team,

While updating grep from version 2.20 to 3.1, I noticed that grep with the -P option no longer matches the regular expression below:

 

cat SomeTestFile.cpp | sed -r -e 's:\/(\*([^*]|\*[^\/])*[*]\/|\/.*)::g' -e 's:\"[^"]*\"::g' |
grep -ozPLq '\A(?:\s*^(?:#\w+.*\s*|extern\s+.+)$)*+(?<namespace>\s*namespace(?:\s+ utTestNamespace \s*(?>(?<block>{(?:[^{}]*(?&block)*)*}))|(\s*[\w:]*\s*{)(?&namespace)\s*}))\s*\z'; echo "retcode $?"

 

Content of file SomeTestFile.cpp:

#include <memory>

#include <vector>

#include <gtest/gtest.h>

 

namespace utTestNamespace

{

using ::testing::NiceMock;

# some code here

}

//end of file

 

 

I checked the regular expression on the regex101.com webpage and noticed that it works for PCRE and PCRE2 there, but it stopped working in grep 3.1 and later versions (versions between 2.20 and 3.1 were not checked).

See link:

https://regex101.com/r/9NwluI/1/

 

Investigation shows that grep 3.1, and also the later versions 3.6 and 3.7, handle “^” and “$” differently with the “-P” option.

It looks like “^” does not match at every beginning of a line and “$” does not match at every end of a line.

It seems that “^” is treated as the beginning of the whole test string, not of each new line.

“$” appears to match only the end of the whole test string, not the end of each line.
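
The behaviour described above is what PCRE-style anchors do when multiline mode is off. The following is only a minimal sketch of that difference, using Python's `re` module as a stand-in (an assumption: it is neither grep nor PCRE itself, although its `^`, `$` and multiline option behave analogously for this case). The sample text and the simplified pattern are hypothetical and are not the full expression from this report.

```python
import re

# Hypothetical stand-in for the text that reaches grep -z as one string.
text = "#include <memory>\n#include <vector>\nnamespace utTestNamespace\n"

# Simplified illustrative pattern: a complete line, anchored at both ends.
pattern = r"^#include <vector>$"

# Multiline mode off: ^ matches only at the start of the whole string and
# $ only at its end, so the second line cannot be matched.
default_match = re.search(pattern, text)

# Multiline mode on: ^ and $ also match right after and right before
# each newline inside the string.
multiline_match = re.search(pattern, text, flags=re.MULTILINE)

# The same mode can be requested from inside the pattern with an
# inline option at its start.
inline_match = re.search(r"(?m)" + pattern, text)

print("default mode matched:", default_match is not None)
print("multiline flag matched:", multiline_match is not None)
print("inline (?m) matched:", inline_match is not None)
```

If the default of the matching engine changes from multiline to non-multiline between two versions, a pattern that relies on per-line `^` and `$` stops matching in exactly this way.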

 

I would like to ask whether this is intended behavior or an issue in grep.

 

Useful commands for testing:

cat SomeTestFile.cpp | sed -r -e 's:\/(\*([^*]|\*[^\/])*[*]\/|\/.*)::g' -e 's:\"[^"]*\"::g' | grep -zP '(?:\s*^(?:\#\w+.*\s*|extern\s+.+)$)*+'

cat SomeTestFile.cpp | sed -r -e 's:\/(\*([^*]|\*[^\/])*[*]\/|\/.*)::g' -e 's:\"[^"]*\"::g' | grep -zP '(?:\s*^(?:\#\w+.*\s*|extern\s+.+)\s*)*+'

 

 

Best Regards,

Sławek

 

From debbugs-submit-bounces@debbugs.gnu.org Mon Nov 08 03:53:49 2021
From: "Skrzyniarz, Slawomir (Nokia - PL/Krakow)"
To: "51458@debbugs.gnu.org" <51458@debbugs.gnu.org>
Subject: RE: grep PCRE - "^" and "$" are not recognized as begin and end of line for multiline strings
Date: Mon, 8 Nov 2021 07:38:33 +0000

Hello team,

I’ve changed the subject to better describe the issue: “grep PCRE - "^" and "$" are not recognized as begin and end of line for multiline strings”.

 

Best Regards,

Sławek

 


From debbugs-submit-bounces@debbugs.gnu.org Mon Nov 08 15:29:33 2021
Date: Mon, 8 Nov 2021 12:29:19 -0800
From: Carlo Marcelo Arenas Belón
To: slawomir.skrzyniarz@nokia.com
Cc: 51458@debbugs.gnu.org
Subject: bug#51458: grep PCRE - '^' and '$' are not recognized as begin and end of line for multiline strings

Older versions of the PCRE support in grep used multiline mode by default, which your expression seems to require in order to work, and which is also on by default on the regex site.

You can add it back using an internal option[1] from PCRE, as shown in the following modified version of the expression from your original example:

/\A(?m:\s*^(?:#\w+.*\s*|extern\s+.+)$)*+(?<namespace>\s*namespace(?:\s+utTestNamespace\s*(?>(?<block>{(?:[^{}]*(?&block)*)*}))|(\s*[\w:]*\s*{)(?&namespace)\s*}))\s*\z/

Carlo

[1] https://www.pcre.org/current/doc/html/pcre2pattern.html#internaloptions

From debbugs-submit-bounces@debbugs.gnu.org Tue Nov 09 07:06:08 2021
From: "Skrzyniarz, Slawomir (Nokia - PL/Krakow)"
To: "51458@debbugs.gnu.org" <51458@debbugs.gnu.org>
Subject: RE: bug#51458: grep PCRE - '^' and '$' are not recognized as begin and end of line for multiline strings
Date: Tue, 9 Nov 2021 06:48:56 +0000
=?iso-8859-2?Q?AeC2puWjnNmK89iBrRM1KtmabsVxU0dXIBPJA9MjccVn1qeX2tdSkH60CT?= =?iso-8859-2?Q?jzFhht5NVtvgOe56Ef3qeTObqweTFNOog38b2fF3YEoasC5y2yk5BM4flz?= =?iso-8859-2?Q?/GX8lyPOdrENb1C4mnLAam2X2NE3bWZFNc5T/wSn5I0z7qU7J073FP5Xtr?= =?iso-8859-2?Q?hfm3TWW4Xg=3D=3D?= Content-Type: text/plain; charset="iso-8859-2" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-OriginatorOrg: nokia.com X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-AuthSource: AM4PR07MB3475.eurprd07.prod.outlook.com X-MS-Exchange-CrossTenant-Network-Message-Id: a23fdd36-a4ff-4db5-e178-08d9a34d0066 X-MS-Exchange-CrossTenant-originalarrivaltime: 09 Nov 2021 06:48:56.2738 (UTC) X-MS-Exchange-CrossTenant-fromentityheader: Hosted X-MS-Exchange-CrossTenant-id: 5d471751-9675-428d-917b-70f44f9630b0 X-MS-Exchange-CrossTenant-mailboxtype: HOSTED X-MS-Exchange-CrossTenant-userprincipalname: 4S/FFabtB/77pIRgE7ELgzEwfdeib6Z+OGXhQW0VOpE6NV1ELASRpkNzjScn9YeN0GsnacSPlQp/mWCwbnnuOyIDkbQuE4swQG6mjOyDItA= X-MS-Exchange-Transport-CrossTenantHeadersStamped: AM0PR07MB6322 X-Spam-Score: -0.0 (/) X-Debbugs-Envelope-To: 51458 X-Mailman-Approved-At: Tue, 09 Nov 2021 07:06:03 -0500 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) Thank you Carlo. Replacing (?: -> (?m: Solve my issue. Thank you, S=B3awek -----Original Message----- From: Carlo Marcelo Arenas Bel=F3n =20 Sent: Monday, November 8, 2021 9:29 PM To: Skrzyniarz, Slawomir (Nokia - PL/Krakow) Cc: 51458@debbugs.gnu.org Subject: bug#51458: grep PCRE - '^' and '$' are not recognized as begin and= end of line for multiline strings older versions of PCRE support in grep used multiline mode by default, whic= h seems to be required by your expression to work and is also on by default i= n the regex site. 
you can add it back using an internal option[1] from PCRE, as shown in the following modified expression from your original example:

/\A(?m:\s*^(?:#\w+.*\s*|extern\s+.+)$)*+(?<namespace>\s*namespace(?:\s+utTestNamespace\s*(?>(?<block>{(?:[^{}]*(?&block)*)*}))|(\s*[\w:]*\s*{)(?&namespace)\s*}))\s*\z/

Carlo

[1] https://www.pcre.org/current/doc/html/pcre2pattern.html#internaloptions

From debbugs-submit-bounces@debbugs.gnu.org Tue Nov 09 13:05:41 2021
Message-ID: <4791ff36-8afe-5aad-f4b3-c02b5948acc1@cs.ucla.edu>
Date: Tue, 9 Nov 2021 10:05:31 -0800
Subject: Re: bug#51458: grep PCRE - '^' and '$' are not recognized as begin and end of line for multiline strings
To: "Skrzyniarz, Slawomir (Nokia - PL/Krakow)", "51458@debbugs.gnu.org" <51458-done@debbugs.gnu.org>
From: Paul Eggert
Organization: UCLA Computer Science Department

On 11/8/21 22:48, Skrzyniarz, Slawomir (Nokia - PL/Krakow) wrote:
> Solve my issue.

Thanks for letting us know; closing the bug report.

From unknown Wed Aug 20 06:04:22 2025
To: internal_control@debbugs.gnu.org
From: Debbugs Internal Request
Subject: Internal Control
Message-Id: bug archived.
Date: Wed, 08 Dec 2021 12:24:05 +0000
User-Agent: Fakemail v42.6.9

# This is a fake control message.
#
# The action:
# bug archived.
thanks
# This fakemail brought to you by your local debbugs
# administrator
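
Note on the fix discussed in this thread: the effect of scoping multiline mode with (?m:...) can be sketched outside grep. The snippet below is only an illustration and makes several assumptions: it uses Python's re module as a stand-in (it is not PCRE, although it accepts the same scoped inline-option syntax), the sample text is invented, and the pattern is a simplified fragment of the reporter's expression rather than the full one.

```python
import re

# Invented sample input; not taken from the bug report.
text = "#include <a.h>\nextern int x;\nnamespace n {}\n"

# Without multiline mode, ^ matches only at the start of the subject and
# $ only at its end (or just before a final newline), so no single line
# in the middle of the text can match.
plain = re.findall(r"^(?:#\w+.*|extern\s+.+)$", text)

# With the scoped inline option, multiline mode applies only inside
# (?m: ... ), so ^ and $ match at each line boundary within that group.
scoped = re.findall(r"(?m:^(?:#\w+.*|extern\s+.+)$)", text)

print(plain)
print(scoped)
```

Scoping the option to one group, as in the suggested expression, leaves anchors elsewhere in the pattern (such as \A and \z) unaffected.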