From unknown Fri Sep 05 18:51:30 2025
MIME-Version: 1.0
X-Mailer: MIME-tools 5.509 (Entity 5.509)
Content-Type: text/plain; charset=utf-8
From: bug#18398 <18398@debbugs.gnu.org>
To: bug#18398 <18398@debbugs.gnu.org>
Subject: Status: Probably found a bug in grep
Reply-To: bug#18398 <18398@debbugs.gnu.org>
Date: Sat, 06 Sep 2025 01:51:30 +0000

retitle 18398 Probably found a bug in grep
reassign 18398 grep
submitter 18398 "Bergen, Andreas"
severity 18398 normal

thanks


From debbugs-submit-bounces@debbugs.gnu.org Wed Sep 03 15:14:37 2014
From: "Bergen, Andreas"
To: "'bug-grep@gnu.org'"
Subject: Probably found a bug in grep
Date: Wed, 3 Sep 2014 19:11:45 +0000
Accept-Language: de-DE, en-US
Content-Language: de-DE
Content-Type: multipart/alternative;
 boundary="_000_EDDED61450B3C3459CC4C2BE441AF03E014BD67CE7A41EXC004allf_"
MIME-Version: 1.0

--_000_EDDED61450B3C3459CC4C2BE441AF03E014BD67CE7A41EXC004allf_
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: 8bit

Hi all,

I've probably found a bug in "grep".
Here's how to reproduce it:

s53mgt:/test2 # cat testfile
A
Ä
s53mgt:/test2 # grep -F -eÄ -eA testfile
A
Ä
s53mgt:/test2 # grep -i -eÄ -eA testfile
A
Ä
s53mgt:/test2 # grep -iF -eÄ -eA testfile
A

As you can see, the last command does not print both A and Ä, but only A.

When I do the same with another test file that has no "Ä" (A-umlaut) in it, it works as expected:

s53mgt:/test2 # cat testfile2
A
B
s53mgt:/test2 # grep -F -eB -eA testfile2
A
B
s53mgt:/test2 # grep -i -eB -eA testfile2
A
B
s53mgt:/test2 # grep -iF -eB -eA testfile2
A
B

s53mgt:/test2 # file testfile testfile2
testfile:  UTF-8 Unicode text
testfile2: ASCII text

Here's some information on my version of "grep".

s53mgt:/test2 # rpm -qif /bin/grep
Name        : grep                         Relocations: (not relocatable)
Version     : 2.5.1a                       Vendor: SUSE LINUX Products GmbH, Nuernberg, Germany
Release     : 20.17                        Build Date: Tue Apr 22 03:47:13 2008
Install Date: Mon Jul  6 16:21:37 2009     Build Host: blacher.suse.de
Group       : Productivity/Text/Utilities  Source RPM: grep-2.5.1a-20.17.src.rpm
Size        : 461697                       License: GPL v2 or later
Signature   : DSA/SHA1, Tue Apr 22 03:49:23 2008, Key ID a84edae89c800aca
Packager    : http://bugs.opensuse.org
URL         : http://www.gnu.org/software/grep/
Summary     : Print lines matching a pattern
Description :
GNU grep, the "fastest grep in the west" (hopefully).

`grep' searches for lines matching a pattern.


Can you confirm this?

What can I do about it?

Regards
  Andreas

---
Andreas Bergen
Solution Architect

All for One Steeb AG
Gottlieb-Manz-Straße 1
70794 Filderstadt
T  +49 711 78807-689
F  +49 711 78807-92689
M +49 151 53824-689
Andreas.Bergen@all-for-one.com
www.all-for-one.com

________________________________

All for One Steeb AG, registered office: Filderstadt. Stuttgart Local Court: HRB 19 539,
Executive Board: Lars Landwehrkamp (Spokesman), Stefan Land
Chairman of the Supervisory Board: Peter Brogle

This e-mail (including any attachments) may contain business or trade secrets or other confidential and / or legally protected information. If you have received this e-mail in error, you are hereby notified that any review, use, copying, or distribution of it is strictly prohibited. Please inform us immediately and destroy this e-mail. Thank you.

--_000_EDDED61450B3C3459CC4C2BE441AF03E014BD67CE7A41EXC004allf_
Content-Type: text/html; charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
--_000_EDDED61450B3C3459CC4C2BE441AF03E014BD67CE7A41EXC004allf_--


From debbugs-submit-bounces@debbugs.gnu.org Wed Sep 03 16:15:39 2014
Message-ID: <540776D7.1070205@cs.ucla.edu>
Date: Wed, 03 Sep 2014 13:15:19 -0700
From: Paul Eggert
Organization: UCLA Computer Science Department
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.0
MIME-Version: 1.0
To: "Bergen, Andreas", 18398-done@debbugs.gnu.org
Subject: Re: bug#18398: Probably found a bug in grep
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 8bit

Bergen, Andreas wrote:
> Version     : 2.5.1a

Thanks for the report.  As shown below, I can't reproduce the bug with
grep 2.20 (the current version) in either the en_US.utf8 or the
de_DE.utf8 locales.  grep 2.5.1a is pretty old (dated 2004) and several
bugs have been fixed in this area in the last ten years, so I suggest
upgrading and I'm taking the liberty of marking this as done.

$ cat testfile
A
Ä
$ grep -F -eÄ -eA testfile
A
Ä
$ grep -i -eÄ -eA testfile
A
Ä
$ grep -iF -eÄ -eA testfile
A
Ä


From debbugs-submit-bounces@debbugs.gnu.org Thu Sep 04 04:28:31 2014
From: "Bergen, Andreas"
To: "'Paul Eggert'", "18398-done@debbugs.gnu.org" <18398-done@debbugs.gnu.org>
Subject: AW: bug#18398: Probably found a bug in grep
Date: Thu, 4 Sep 2014 08:28:07 +0000
References: <540776D7.1070205@cs.ucla.edu>
In-Reply-To: <540776D7.1070205@cs.ucla.edu>
Accept-Language: de-DE, en-US
Content-Language: de-DE
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: 8bit
MIME-Version: 1.0

Thanks a lot.
I've tried with the newest version available for Suse Linux (grep 2.7-5.7.1 in SLES 11 SP3) and the bug seems to be fixed there as well.

Best regards
  Andreas


From debbugs-submit-bounces@debbugs.gnu.org Thu Sep 04 04:29:27 2014
Date: Thu, 4 Sep 2014 10:29:00 +0200 (CEST)
From: Johannes Meixner
To: "Bergen, Andreas"
Cc: bug-grep@gnu.org
Subject: Re: bug#18398: Probably found a bug in grep
User-Agent: Alpine 2.00 (LNX 1167 2008-08-23)
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed

Hello,

On Sep 3 19:11 Bergen, Andreas wrote (excerpt):
> I've probably found a bug in "grep".
...
> testfile:  UTF-8 Unicode text
> testfile2: ASCII text
...
> Name        : grep
> Version     : 2.5.1a
> Vendor: SUSE LINUX Products GmbH, Nuernberg, Germany
> Build Date: Tue Apr 22 03:47:13 2008
> Install Date: Mon Jul  6 16:21:37 2009
> Source RPM: grep-2.5.1a-20.17.src.rpm

This grep version is very old.

I found grep version 2.5.1a only in
SUSE Linux Enterprise Server 10.

openSUSE distributions with such an old grep
are no longer available.

I do not know whether that old grep version was really meant
to support the UTF-8 character encoding (multibyte characters)
well, because I find almost nothing about "UTF"
(ignoring case) in the grep-2.5.1a sources.
There is some multibyte character support in grep-2.5.1a,
but I wonder to what extent it actually works.

In contrast, the grep-2.7 sources that we have provided
since SUSE Linux Enterprise Server 11 Service Pack 2 (SLES11-SP2)
contain a lot more about "UTF" (ignoring case).

The RPM changelog of our grep RPM package for SLES11-SP2
notes in particular:
------------------------------------------------------------------
Version upgrade to grep-2.7
and reset to full compliance with upstream
...
version upgrade to grep-2.6.3, which brings
among various compile fixes vast improvements
for UTF-8 / multibyte handling.
------------------------------------------------------------------

In general:

Issues with "traditional" Unix/Linux tools that depend
on the locale are very often not real bugs.

It is crucial for users to understand that any kind
of behaviour can depend on the locale (from keyboard input,
through program behaviour, to what is shown on the screen).

For basic information see
http://en.opensuse.org/SDB:Plain_Text_versus_Locale

When programs process "plain text files",
the user who runs the program must set up
the locale environment to match the encoding
of the "plain text file" before running the program.

If you want to process your "plain text files" the way
you always have with "traditional" Unix/Linux tools,
you must use the POSIX locale; otherwise you will get
weird results and unexpected side effects.

See also
http://www.gnu.org/software/gawk/manual/html_node/Ranges-and-Locales.html

Kind Regards
Johannes Meixner
-- 
SUSE LINUX Products GmbH -- Maxfeldstrasse 5 -- 90409 Nuernberg -- Germany
HRB 16746 (AG Nuernberg) GF: Jeff Hawn, Jennifer Guild, Felix Imendoerffer


From debbugs-submit-bounces@debbugs.gnu.org Thu Sep 04 09:01:38 2014
Date: Thu, 4 Sep 2014 15:01:06 +0200 (CEST)
From: Johannes Meixner
To: bug-grep@gnu.org
Subject: Re: bug#18398: AW: bug#18398: Probably found a bug in grep
References: <540776D7.1070205@cs.ucla.edu>
User-Agent: Alpine 2.00 (LNX 1167 2008-08-23)
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed

Hello,

On Sep 4 08:28 Bergen, Andreas wrote (excerpt):
> ... the newest version available
> for Suse Linux (grep 2.7-5.7.1 in SLES 11 SP3)

FWIW:
The newest available grep versions for openSUSE are
grep-2.14 for openSUSE:13.1
and
grep-2.20 for openSUSE:Factory

Kind Regards
Johannes Meixner
-- 
SUSE LINUX Products GmbH -- Maxfeldstrasse 5 -- 90409 Nuernberg -- Germany
HRB 16746 (AG Nuernberg) GF: Jeff Hawn, Jennifer Guild, Felix Imendoerffer


From unknown Fri Sep 05 18:51:30 2025
Received: (at fakecontrol) by fakecontrolmessage;
To: internal_control@debbugs.gnu.org
From: Debbugs Internal Request
Subject: Internal Control
Message-Id: bug archived.
Date: Fri, 03 Oct 2014 11:24:05 +0000
User-Agent: Fakemail v42.6.9

# This is a fake control message.
#
# The action:
# bug archived.
thanks
# This fakemail brought to you by your local debbugs
# administrator