From unknown Mon Aug 18 00:05:27 2025 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-Mailer: MIME-tools 5.509 (Entity 5.509) Content-Type: text/plain; charset=utf-8 From: bug#18404 <18404@debbugs.gnu.org> To: bug#18404 <18404@debbugs.gnu.org> Subject: Status: AW: bug#18398: Probably found a bug in grep Reply-To: bug#18404 <18404@debbugs.gnu.org> Date: Mon, 18 Aug 2025 07:05:27 +0000 retitle 18404 AW: bug#18398: Probably found a bug in grep reassign 18404 grep submitter 18404 "Bergen, Andreas" severity 18404 normal thanks From debbugs-submit-bounces@debbugs.gnu.org Thu Sep 04 11:39:33 2014 Received: (at submit) by debbugs.gnu.org; 4 Sep 2014 15:39:33 +0000 Received: from localhost ([127.0.0.1]:58925 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1XPZ8G-00035l-86 for submit@debbugs.gnu.org; Thu, 04 Sep 2014 11:39:32 -0400 Received: from eggs.gnu.org ([208.118.235.92]:41759) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1XPZ8D-00035V-Gw for submit@debbugs.gnu.org; Thu, 04 Sep 2014 11:39:30 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1XPZ83-0002em-Ta for submit@debbugs.gnu.org; Thu, 04 Sep 2014 11:39:24 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=0.8 required=5.0 tests=BAYES_50 autolearn=disabled version=3.3.2 Received: from lists.gnu.org ([2001:4830:134:3::11]:54385) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1XPZ83-0002ee-P1 for submit@debbugs.gnu.org; Thu, 04 Sep 2014 11:39:19 -0400 Received: from eggs.gnu.org ([2001:4830:134:3::10]:43443) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1XPZ7z-0004yX-20 for bug-grep@gnu.org; Thu, 04 Sep 2014 11:39:19 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1XPZ7t-0002d7-59 for bug-grep@gnu.org; Thu, 04 Sep 2014 11:39:14 -0400 Received: from mailout03.rmx.de ([217.111.120.11]:53720) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1XPZ7s-0002cj-Sf for bug-grep@gnu.org; Thu, 04 Sep 2014 11:39:09 -0400 Received: from kdin02.retarus.de (kdin02.retarus.de [172.19.17.49] (may be forged)) by mailout03.rmx.de (8.14.4/8.14.5/retarus.custom) with ESMTP id s84Fd4Ke024452 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=OK); Thu, 4 Sep 2014 17:39:04 +0200 Received: from hub.all-for-one.net ([10.79.13.13]) by kdin02.retarus.de (8.14.5/8.14.5/retarus.custom) with ESMTP id s84Fd3dt024064 (version=TLSv1/SSLv3 cipher=AES128-SHA bits=128 verify=FAIL); Thu, 4 Sep 2014 17:39:03 +0200 Received: from A41EXC004.all-for-one.net ([169.254.2.136]) by A41EXC002.all-for-one.net ([10.2.206.52]) with mapi id 14.03.0181.006; Thu, 4 Sep 2014 17:39:03 +0200 From: "Bergen, Andreas" To: Johannes Meixner Subject: AW: bug#18398: Probably found a bug in grep Thread-Topic: bug#18398: Probably found a bug in grep Thread-Index: Ac/HqdPzy2ZmYBYTSOKlv7EkRpGOkwAX69UAABMqqoA= Date: Thu, 4 Sep 2014 15:39:02 +0000 Message-ID: References: In-Reply-To: Accept-Language: de-DE, en-US Content-Language: de-DE X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [10.2.206.73] Content-Type: text/plain; charset="Windows-1252" Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-RMX-ID: 20140904-173903-s84Fd3dt024064-0@kdin02 X-RMX-TRACE: 2014-09-04 17:39:04 RmxMSO@kdin02/mailcc06 [0.0s] 20140904-173903-s84Fd3dt024064-0@kdin02 0:00:01 X-RMX-TRACE: 2014-09-04 17:39:03 KdIn2013@kdin02/mailcc05 [0.3s] 20140904-173903-s84Fd3dt024064-0@kdin02 0:00:00 X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-detected-operating-system: by eggs.gnu.org: Error: Malformed IPv6 address (bad octet value). X-Received-From: 2001:4830:134:3::11 X-Spam-Score: -5.0 (-----) X-Debbugs-Envelope-To: submit Cc: "bug-grep@gnu.org" X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -5.0 (-----) Hello, thanks for the information. I've tested since on SLES11SP2 and SP3 and foun= d that the "bug" isn=92t there anymore, that is: grep works as expected. Best regards Andreas Bergen --- Andreas Bergen Solution Architect All for One Steeb AG Gottlieb-Manz-Stra=DFe 1 70794 Filderstadt T +49 711 78807-689 F +49 711 78807-92689 M +49 151 53824-689 Andreas.Bergen@all-for-one.com www.all-for-one.com -----Urspr=FCngliche Nachricht----- Von: Johannes Meixner [mailto:jsmeix@suse.de] Gesendet: Donnerstag, 4. September 2014 10:29 An: Bergen, Andreas Cc: bug-grep@gnu.org Betreff: Re: bug#18398: Probably found a bug in grep Hello, On Sep 3 19:11 Bergen, Andreas wrote (excerpt): > I've probably found a bug in "grep". ... > testfile: UTF-8 Unicode text > testfile2: ASCII text ... > Name : grep > Version : 2.5.1a > Vendor: SUSE LINUX Products GmbH, Nuernberg, Germany > Build Date: Tue Apr 22 03:47:13 2008 > Install Date: Mon Jul 6 16:21:37 2009 > Source RPM: grep-2.5.1a-20.17.src.rpm This grep version is very old. I found grep version 2.5.1a only in SUSE Linux Enterprise Server 10. openSUSE distributions with such an old grep are no longer available. I do not know if that old grep version was really meant to support UTF-8 character encoding (multibyte characters) actually well because I find almost nothing about "UTF" (ignore case) in the grep-2.5.1a sources. There is some multibyte character support in grep-2.5.1a but I wonder to what extent it actually works. In contrast in the grep-2.7 sources that we provide since SUSE Linux Enterprise Server 11 Service Pack 2 (SLES11-SP2) there is a lot more about "UTF" (ignore case). In the RPM changelog of our grep RPM package for SLES11-SP2 there is in particular: ------------------------------------------------------------------ Version upgrade to grep-2.7 and reset to full compliance with upstream ... version upgrade to grep-2.6.3, which brings among various compile fixes vast improvements for UTF-8 / multibyte handling. ------------------------------------------------------------------ In general: Any issues with various "traditional" Unix/Linux tools that depend on the locale are very often no real bugs. For users it is crucial to understand that any kind of behaviour can depend on the locale (from keyboard input via program behaviour to what is shown on the screen). For basic information see http://en.opensuse.org/SDB:Plain_Text_versus_Locale When programs process "plain text files", the user who runs the program must set up the locale environment to match the encoding of the "plain text file" before he runs the program. When you like to process your "plain text files" as you did "since ever" with various "traditional" Unix/Linux tools, you must use the POSIX locale, otherwise you will get weird results and unexpected side-effects. See also http://www.gnu.org/software/gawk/manual/html_node/Ranges-and-Locales.html Kind Regards Johannes Meixner -- SUSE LINUX Products GmbH -- Maxfeldstrasse 5 -- 90409 Nuernberg -- Germany HRB 16746 (AG Nuernberg) GF: Jeff Hawn, Jennifer Guild, Felix Imendoerffer ________________________________ All for One Steeb AG, Sitz der Gesellschaft: Filderstadt. Amtsgericht Stutt= gart: HRB 19 539, Vorstand: Lars Landwehrkamp (Sprecher), Stefan Land Vorsitzender des Aufsichtsrats: Peter Brogle Diese E-Mail (einschlie=DFlich aller Anh=E4nge) kann Betriebs- oder Gesch= =E4ftsgeheimnisse bzw. sonstige vertrauliche und/oder rechtlich gesch=FCtzt= e Informationen enthalten. Sollten Sie diese E-Mail irrt=FCmlich erhalten h= aben, ist Ihnen jede Kenntnisnahme des Inhalts, Nutzung, Vervielf=E4ltigung= , oder Weitergabe der E-Mail (einschlie=DFlich aller Anh=E4nge) ausdr=FCckl= ich untersagt. Bitte benachrichtigen Sie uns umgehend und vernichten Sie di= e empfangene E-Mail. Vielen Dank. This e-mail (including any attachments) may contain business or trade secre= ts or other confidential and / or legally protected information. If you hav= e received this e-mail in error, you are hereby notified that any review, u= se, copying, or distribution of it is strictly prohibited. Please inform us= immediately and destroy this e-mail. Thank you. From debbugs-submit-bounces@debbugs.gnu.org Wed Sep 10 17:15:36 2014 Received: (at control) by debbugs.gnu.org; 10 Sep 2014 21:15:36 +0000 Received: from localhost ([127.0.0.1]:37056 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1XRpEl-0000gE-O3 for submit@debbugs.gnu.org; Wed, 10 Sep 2014 17:15:36 -0400 Received: from smtp.cs.ucla.edu ([131.179.128.62]:60103) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1XRpEj-0000cs-4h for control@debbugs.gnu.org; Wed, 10 Sep 2014 17:15:34 -0400 Received: from localhost (localhost.localdomain [127.0.0.1]) by smtp.cs.ucla.edu (Postfix) with ESMTP id BA6C639E8011 for ; Wed, 10 Sep 2014 14:15:32 -0700 (PDT) X-Virus-Scanned: amavisd-new at smtp.cs.ucla.edu Received: from smtp.cs.ucla.edu ([127.0.0.1]) by localhost (smtp.cs.ucla.edu [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id f7r7xE1pBZC3 for ; Wed, 10 Sep 2014 14:15:30 -0700 (PDT) Received: from penguin.cs.ucla.edu (Penguin.CS.UCLA.EDU [131.179.64.200]) by smtp.cs.ucla.edu (Postfix) with ESMTPSA id 7812A39E801B for ; Wed, 10 Sep 2014 14:15:30 -0700 (PDT) Message-ID: <5410BF72.8090409@cs.ucla.edu> Date: Wed, 10 Sep 2014 14:15:30 -0700 From: Paul Eggert Organization: UCLA Computer Science Department User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.1.0 MIME-Version: 1.0 To: control@debbugs.gnu.org Subject: 18404 was already fixed Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Score: -4.8 (----) X-Debbugs-Envelope-To: control X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -4.8 (----) close 18404 From unknown Mon Aug 18 00:05:27 2025 Received: (at fakecontrol) by fakecontrolmessage; To: internal_control@debbugs.gnu.org From: Debbugs Internal Request Subject: Internal Control Message-Id: bug archived. Date: Thu, 09 Oct 2014 11:24:03 +0000 User-Agent: Fakemail v42.6.9 # This is a fake control message. # # The action: # bug archived. thanks # This fakemail brought to you by your local debbugs # administrator