From unknown Fri Aug 15 14:16:57 2025 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-Mailer: MIME-tools 5.509 (Entity 5.509) Content-Type: text/plain; charset=utf-8 From: bug#20678 <20678@debbugs.gnu.org> To: bug#20678 <20678@debbugs.gnu.org> Subject: Status: new bug that Paul "asked" for... grep -P aborts on non-utf8 input. Reply-To: bug#20678 <20678@debbugs.gnu.org> Date: Fri, 15 Aug 2025 21:16:57 +0000 retitle 20678 new bug that Paul "asked" for... grep -P aborts on non-utf8 i= nput. reassign 20678 coreutils submitter 20678 "L. A. Walsh" severity 20678 normal thanks From debbugs-submit-bounces@debbugs.gnu.org Wed May 27 17:41:41 2015 Received: (at submit) by debbugs.gnu.org; 27 May 2015 21:41:41 +0000 Received: from localhost ([127.0.0.1]:58687 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Yxj52-0003Fs-Sd for submit@debbugs.gnu.org; Wed, 27 May 2015 17:41:41 -0400 Received: from eggs.gnu.org ([208.118.235.92]:35602) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Yxj51-0003Fg-N7 for submit@debbugs.gnu.org; Wed, 27 May 2015 17:41:40 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Yxj4v-0005D8-I3 for submit@debbugs.gnu.org; Wed, 27 May 2015 17:41:34 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=0.8 required=5.0 tests=BAYES_50 autolearn=disabled version=3.3.2 Received: from lists.gnu.org ([2001:4830:134:3::11]:50433) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Yxj4v-0005D3-FC for submit@debbugs.gnu.org; Wed, 27 May 2015 17:41:33 -0400 Received: from eggs.gnu.org ([2001:4830:134:3::10]:49106) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Yxj4u-0004mF-0q for bug-coreutils@gnu.org; Wed, 27 May 2015 17:41:33 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Yxj4p-0005CY-Rj for bug-coreutils@gnu.org; Wed, 27 May 2015 17:41:31 -0400 Received: from ishtar.tlinx.org ([173.164.175.65]:52877 helo=Ishtar.hs.tlinx.org) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Yxj4p-0005AD-Gs for bug-coreutils@gnu.org; Wed, 27 May 2015 17:41:27 -0400 Received: from [192.168.4.12] (Athenae [192.168.4.12]) by Ishtar.hs.tlinx.org (8.14.7/8.14.4/SuSE Linux 0.8) with ESMTP id t4RLfCVQ031379; Wed, 27 May 2015 14:41:15 -0700 Message-ID: <556639F8.4080406@tlinx.org> Date: Wed, 27 May 2015 14:41:12 -0700 From: "L. A. Walsh" User-Agent: Thunderbird MIME-Version: 1.0 To: Paul Eggert Subject: new bug that Paul "asked" for... grep -P aborts on non-utf8 input. References: <556115A2.2020404@tlinx.org> <5561CB0B.9090409@redhat.com> <5562C5A3.7010301@tlinx.org> <55633D60.10907@cs.ucla.edu> In-Reply-To: <55633D60.10907@cs.ucla.edu> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: quoted-printable X-MIME-Autoconverted: from 8bit to quoted-printable by Ishtar.hs.tlinx.org id t4RLfCVQ031379 X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x (no timestamps) [generic] X-detected-operating-system: by eggs.gnu.org: Error: Malformed IPv6 address (bad octet value). X-Received-From: 2001:4830:134:3::11 X-Spam-Score: -5.0 (-----) X-Debbugs-Envelope-To: submit Cc: bug-coreutils@gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -5.0 (-----) (skip to end if you don't care to read how I found this mess)... Paul Eggert wrote: > Linda Walsh wrote: > >> I had one file that it bailed on >> saying it has an invalid UTF-8 encoding -- but the line was >> recursive starting from '.' -- and it didn't name the file > > That's pretty vague. Can you reproduce that problem? I don't observe=20 > it: ---- I'm not quite *sure* how to tell someone else to reproduce this, but I can pretty reliably now some output from a checker....: *** file =3D libvtkUtilitiesPythonInitializer-pv4.2.so.1 grep: invalid UTF-8 byte sequence in input ----- *** file =3D libvtkPVClientServerCoreCore-pv4.2.so.1 grep: invalid UTF-8 byte sequence in input ----- *** file =3D libsystemd.so.0 grep: invalid UTF-8 byte sequence in input ----- *** file =3D libvtkParallelCore-pv4.2.so.1 grep: invalid UTF-8 byte sequence in input ----- Now before you think I'm too daft, the code that produces those messages is in perl and is: for my $k (@sorted_missing) { P "*** file =3D %s", $k; open(my $gh, "grep -rP '/$k' /home/rpms/13.2|"); while (<$gh>) { print } P "-----"; } Those files are files that came up "missing" as pre-reqs. in /home/rpms/...., I have the *file listings* of each of the rpms, created in the same structure as in the distro, so a file under that dir /home/rpms/13.2.. This is why I had a problem finding it: Ishtar:rpms/13.2/repo/oss/suse> file -bi x86_64/*>/tmp/x86files.txt Ishtar:rpms/13.2/repo/oss/suse> sort time file -i x86_64/*>/tmp/fullx86files.txt =20 27.71sec 27.07usr 0.63sys (99.99% cpu) Then grep helps! Ishtar:rpms/13.2/repo/oss/suse> grep iso-88 /tmp/fullx86files.txt x86_64/aspell-is-0.51.10-46.1.2.x86_64.rpm:text/plain; charset=3Diso-8859= -1 x86_64/aspell-nb-0.50.10-46.1.2.x86_64.rpm:text/plain; charset=3Diso-8859= -1 --- Ishtar:rpms/13.2/repo/oss/suse> more=20 x86_64/aspell-is-0.51.10-46.1.2.x86_64.rpm =20 /usr/lib64/aspell-0.60/icelandic.alias /usr/lib64/aspell-0.60/is.dat /usr/lib64/aspell-0.60/is.multi /usr/lib64/aspell-0.60/is.rws /usr/lib64/aspell-0.60/is_phonet.dat /usr/lib64/aspell-0.60/355slenska.alias <<-- the 355 was in inverse color /usr/share/doc/packages/aspell-is /usr/share/doc/packages/aspell-is/COPYING /usr/share/doc/packages/aspell-is/Copyright /usr/share/doc/packages/aspell-is/README ---- Same w/the other file (had this 1 'violation': /usr/lib64/aspell-0.60/bokmal.alias /usr/lib64/aspell-0.60/bokm345l.alias <-3 So those are 'octal' code points (using a little calc prog): > pcalc pcalc V0.1.8: Type 'constants' to see constants (1)> 0355 =3D 237 (0x00ed) "=C3=AD"=20 (2)> 0345 =3D 229 (0x00e5) "=C3=A5" -------------------------------------------------------------------------= ------ So the 1st part of the bug is the message w/no filename. the 2nd part of the bug is this: (looking for '^nobody' in "/etc/passwd" works, as shown in 1st example: > grep -P '^nobody' /etc/passwd nobody:x:65534:65533:(group Nobody):/var/lib/nobody:/bin/nologin but the 'error' message aborts any further file searches: --- > grep -P '^nobody' x86_64/aspell-is-0.51.10-46.1.2.x86_64.rpm /etc/pass= wd=20 grep: invalid UTF-8 byte sequence in input ---------------------------------------------------------- This is why I objected to '\000' being treated as a binary file (and why I think it's bad grep can't look for that): If one works with windows, it's far more likely just to be in UTF-16 encoding. -l From debbugs-submit-bounces@debbugs.gnu.org Wed May 27 18:04:39 2015 Received: (at submit) by debbugs.gnu.org; 27 May 2015 22:04:39 +0000 Received: from localhost ([127.0.0.1]:58698 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1YxjRG-0003n2-Kq for submit@debbugs.gnu.org; Wed, 27 May 2015 18:04:38 -0400 Received: from eggs.gnu.org ([208.118.235.92]:39044) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1YxjRE-0003mp-CC for submit@debbugs.gnu.org; Wed, 27 May 2015 18:04:37 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1YxjR8-0004uw-7n for submit@debbugs.gnu.org; Wed, 27 May 2015 18:04:31 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=disabled version=3.3.2 Received: from lists.gnu.org ([2001:4830:134:3::11]:36515) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YxjR8-0004us-4H for submit@debbugs.gnu.org; Wed, 27 May 2015 18:04:30 -0400 Received: from eggs.gnu.org ([2001:4830:134:3::10]:52546) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YxjR7-0003Ej-5i for bug-coreutils@gnu.org; Wed, 27 May 2015 18:04:30 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1YxjR4-0004tk-Gd for bug-coreutils@gnu.org; Wed, 27 May 2015 18:04:29 -0400 Received: from smtp.cs.ucla.edu ([131.179.128.62]:50698) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YxjR4-0004tK-B4 for bug-coreutils@gnu.org; Wed, 27 May 2015 18:04:26 -0400 Received: from localhost (localhost.localdomain [127.0.0.1]) by smtp.cs.ucla.edu (Postfix) with ESMTP id 8AC01A60010; Wed, 27 May 2015 15:04:24 -0700 (PDT) X-Virus-Scanned: amavisd-new at smtp.cs.ucla.edu Received: from smtp.cs.ucla.edu ([127.0.0.1]) by localhost (smtp.cs.ucla.edu [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id FhypwES+KhL2; Wed, 27 May 2015 15:04:24 -0700 (PDT) Received: from Penguin.CS.UCLA.EDU (Penguin.CS.UCLA.EDU [131.179.64.200]) by smtp.cs.ucla.edu (Postfix) with ESMTPSA id B7156A60002; Wed, 27 May 2015 15:04:23 -0700 (PDT) Message-ID: <55663F62.3050005@cs.ucla.edu> Date: Wed, 27 May 2015 15:04:18 -0700 From: Paul Eggert Organization: UCLA Computer Science Department User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.7.0 MIME-Version: 1.0 To: "L. A. Walsh" Subject: Re: new bug that Paul "asked" for... grep -P aborts on non-utf8 input. References: <556115A2.2020404@tlinx.org> <5561CB0B.9090409@redhat.com> <5562C5A3.7010301@tlinx.org> <55633D60.10907@cs.ucla.edu> <556639F8.4080406@tlinx.org> In-Reply-To: <556639F8.4080406@tlinx.org> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-detected-operating-system: by eggs.gnu.org: Error: Malformed IPv6 address (bad octet value). X-Received-From: 2001:4830:134:3::11 X-Spam-Score: -4.0 (----) X-Debbugs-Envelope-To: submit Cc: bug-coreutils@gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -4.0 (----) On 05/27/2015 02:41 PM, L. A. Walsh wrote: > *** file = libvtkUtilitiesPythonInitializer-pv4.2.so.1 > grep: invalid UTF-8 byte sequence in input This looks like you're using an old version of libpcre, or of grep. I can't reproduce the problem with the latest stable versions of both (libpcre 8.37, grep-2.21). I can find similar problems if I use old libpcre. From debbugs-submit-bounces@debbugs.gnu.org Wed May 27 18:24:23 2015 Received: (at submit) by debbugs.gnu.org; 27 May 2015 22:24:24 +0000 Received: from localhost ([127.0.0.1]:58703 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1YxjkN-0004EJ-35 for submit@debbugs.gnu.org; Wed, 27 May 2015 18:24:23 -0400 Received: from eggs.gnu.org ([208.118.235.92]:42854) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1YxjkK-0004E6-Um for submit@debbugs.gnu.org; Wed, 27 May 2015 18:24:21 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1YxjkE-0005dE-Oh for submit@debbugs.gnu.org; Wed, 27 May 2015 18:24:15 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=0.8 required=5.0 tests=BAYES_50 autolearn=disabled version=3.3.2 Received: from lists.gnu.org ([2001:4830:134:3::11]:33210) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YxjkE-0005dA-MC for submit@debbugs.gnu.org; Wed, 27 May 2015 18:24:14 -0400 Received: from eggs.gnu.org ([2001:4830:134:3::10]:56358) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YxjkD-0006z3-MF for bug-coreutils@gnu.org; Wed, 27 May 2015 18:24:14 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Yxjk9-0005cA-30 for bug-coreutils@gnu.org; Wed, 27 May 2015 18:24:13 -0400 Received: from ishtar.tlinx.org ([173.164.175.65]:54270 helo=Ishtar.hs.tlinx.org) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Yxjk8-0005by-O7 for bug-coreutils@gnu.org; Wed, 27 May 2015 18:24:09 -0400 Received: from [192.168.4.12] (Athenae [192.168.4.12]) by Ishtar.hs.tlinx.org (8.14.7/8.14.4/SuSE Linux 0.8) with ESMTP id t4RMO301040895; Wed, 27 May 2015 15:24:06 -0700 Message-ID: <55664403.2070707@tlinx.org> Date: Wed, 27 May 2015 15:24:03 -0700 From: Linda Walsh User-Agent: Thunderbird MIME-Version: 1.0 To: Paul Eggert Subject: Re: bug#20678 new bug that Paul "asked" for... grep -P aborts on non-utf8 input. References: <556115A2.2020404@tlinx.org> <5561CB0B.9090409@redhat.com> <5562C5A3.7010301@tlinx.org> <55633D60.10907@cs.ucla.edu> <556639F8.4080406@tlinx.org> <55663F62.3050005@cs.ucla.edu> In-Reply-To: <55663F62.3050005@cs.ucla.edu> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x (no timestamps) [generic] X-detected-operating-system: by eggs.gnu.org: Error: Malformed IPv6 address (bad octet value). X-Received-From: 2001:4830:134:3::11 X-Spam-Score: -5.0 (-----) X-Debbugs-Envelope-To: submit Cc: bug-coreutils@gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -5.0 (-----) Paul Eggert wrote: > On 05/27/2015 02:41 PM, L. A. Walsh wrote: >> *** file = libvtkUtilitiesPythonInitializer-pv4.2.so.1 >> grep: invalid UTF-8 byte sequence in input > > This looks like you're using an old version of libpcre, or of grep. I > can't reproduce the problem with the latest stable versions of both > (libpcre 8.37, grep-2.21). I can find similar problems if I use old > libpcre. --- ok... ARG -- I just installed the new version of grep from my distro (suse13.2) -- grep-2.20-2.4.1.x86_64 I think they'll be out with a new distro release in about a year...(yes, I can probably build my own...like I have to with a growing body of Software) -- something that has gotten me in trouble with my distro at times when I've caught them locking different pieces of software to specific libraries (not >== xxx but "==")... grrr...I could acknowledge their point that most people wouldn't bother rebuilding all the perl modules if they upgraded perl... but that's not *everyone*!...sigh. coreutils isn't as stable as it used to be (not entirely the CU-devel team either: I've caught suse's hand in 1-2)... Just ran into problems in their new gvim & sudo -- I think the sudo prob is the sudo-dev team...but the gvim I filed a bug on in previous version... guess it didn't get fixed. Filing bugs more often than not is a big waste of time. *grump* *grump*... ;-) From debbugs-submit-bounces@debbugs.gnu.org Wed May 27 18:44:37 2015 Received: (at control) by debbugs.gnu.org; 27 May 2015 22:44:37 +0000 Received: from localhost ([127.0.0.1]:58725 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Yxk3w-0004iI-Ne for submit@debbugs.gnu.org; Wed, 27 May 2015 18:44:36 -0400 Received: from smtp.cs.ucla.edu ([131.179.128.62]:33555) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Yxk3v-0004i5-2b for control@debbugs.gnu.org; Wed, 27 May 2015 18:44:35 -0400 Received: from localhost (localhost.localdomain [127.0.0.1]) by smtp.cs.ucla.edu (Postfix) with ESMTP id 1072AA60010 for ; Wed, 27 May 2015 15:44:29 -0700 (PDT) X-Virus-Scanned: amavisd-new at smtp.cs.ucla.edu Received: from smtp.cs.ucla.edu ([127.0.0.1]) by localhost (smtp.cs.ucla.edu [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id iFQAjKkLL0di for ; Wed, 27 May 2015 15:44:28 -0700 (PDT) Received: from Penguin.CS.UCLA.EDU (Penguin.CS.UCLA.EDU [131.179.64.200]) by smtp.cs.ucla.edu (Postfix) with ESMTPSA id A3295A60002 for ; Wed, 27 May 2015 15:44:28 -0700 (PDT) Message-ID: <556648CC.5020504@cs.ucla.edu> Date: Wed, 27 May 2015 15:44:28 -0700 From: Paul Eggert Organization: UCLA Computer Science Department User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.7.0 MIME-Version: 1.0 To: control@debbugs.gnu.org Subject: closing 20678 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: control X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -2.3 (--) close 20678 From debbugs-submit-bounces@debbugs.gnu.org Thu May 28 03:17:56 2015 Received: (at 20678) by debbugs.gnu.org; 28 May 2015 07:17:56 +0000 Received: from localhost ([127.0.0.1]:58929 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Yxs4h-0002yr-7X for submit@debbugs.gnu.org; Thu, 28 May 2015 03:17:55 -0400 Received: from mout.kundenserver.de ([212.227.17.10]:58398) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Yxs4d-0002yY-W8 for 20678@debbugs.gnu.org; Thu, 28 May 2015 03:17:52 -0400 Received: from [10.0.4.15] ([62.153.148.194]) by mrelayeu.kundenserver.de (mreue104) with ESMTPSA (Nemesis) id 0MNtut-1Z4DrI0sgN-007Rwe; Thu, 28 May 2015 09:17:43 +0200 Message-ID: <5566C115.8030107@bernhard-voelker.de> Date: Thu, 28 May 2015 09:17:41 +0200 From: Bernhard Voelker User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.7.0 MIME-Version: 1.0 To: Linda Walsh , Paul Eggert Subject: Re: bug#20678: new bug that Paul "asked" for... grep -P aborts on non-utf8 input. References: <556115A2.2020404@tlinx.org> <5561CB0B.9090409@redhat.com> <5562C5A3.7010301@tlinx.org> <55633D60.10907@cs.ucla.edu> <556639F8.4080406@tlinx.org> <55663F62.3050005@cs.ucla.edu> <55664403.2070707@tlinx.org> In-Reply-To: <55664403.2070707@tlinx.org> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit X-Provags-ID: V03:K0:XTI1vnLuquMhfAgZVTwgBZmPC2fQmTV+EK2O9Co7ZAkCvZISBsN 1JvkXSL5lmHImLFrdZ9YJHK9e4JtfgVYg2fstYzv0m5QkqWojSzEIPVuUCmjVuwIqg90g5M NiZe1DOZEaB8h3RulA8DXJ6wUL+Wo9TJw6/C4/FLwW725nlcYWjVlS0xHWW1ondst0cDQi7 S9ubbNJMmwWJnvfJko0hw== X-UI-Out-Filterresults: notjunk:1;V01:K0:otplySjuTic=:EaosJrY9hhnbyG0vE4rbd8 mcDK2YgyvNAIiGKQKChrSK46ca5qyhnhmgveIBIvLylsI8Tp25cqi7nQ7FGbJMJ97T4Gzl/Kv ABbYmp5Wo1uCA6vy5x47qMiP+nuZW5dMoq9y8L+qjHYhyZzZ8xnG0w4X9W109q2wqy/to5OGz LId4WkzO4nPTh3+fe5KbwP455WiYZy+OGXp/CW4zymILwgHdURXjfcNCdHLDo9AX/LWd0CLKw acIjP7E1N7viBi3zp26jxB/su4o9sKKfNM2FdOz6yW3rnWXpS+sJten6DyRn+NKktybLe1FgQ A7bLjOrNW2TNlX/PJrpEGlZm2/lq+Z6mdrvGS8RaSTyPqYfm6foyPMKJtx9eZgBUMFDGuDVcG wEPJB7GIpjAneOYoq00VMPiVaZ4r38Yk5frZjCmq0udjwsN6Bsz0u6rBJYdVzFiqTmxOQAm/e KVWkPcJBOcYlT9Adl1VBlvldP8ZxRkNHKl96TVzI9YkThmw4Nhse75aUAm7Y6KoYoaUNYx8hQ OKQPXrdVU5c+TFfJ2A+J7GmyMNZIVWwzcGL12Ai0bAsNBzKKbutfbS/Ig3v7cTQDgmKJCDG1B a1en0+RwK1RXLvBc9q5XxX+Cu1nNOZ6spoedYbKdq7uUqOcrpFc5bs0qTJKIA/lcw841VC+Sq c8IGg+ZT/a9+AbkvxlHS1VuXqXWqv6PdXM+Q6re8l+CV/UQ== X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 20678 Cc: 20678@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 0.0 (/) On 05/28/2015 12:24 AM, Linda Walsh wrote: > ok... ARG -- I just installed the new version of grep from > my distro (suse13.2) -- grep-2.20-2.4.1.x86_64 > > I think they'll be out with a new distro release in about > a year...(yes, I can probably build my own...like I have > to with a growing body of Software) This is openSUSE specific. When you've built your own version with a patch for a problem, nothing prevents you from simply creating a submit request for that patch on OBS to "Base:System/grep", and maybe even creating a maintenance request for "openSUSE:13.2/grep". Get involved. Have a nice day, Berny From debbugs-submit-bounces@debbugs.gnu.org Thu May 28 09:17:53 2015 Received: (at 20678) by debbugs.gnu.org; 28 May 2015 13:17:53 +0000 Received: from localhost ([127.0.0.1]:59099 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Yxxgz-0004ej-H7 for submit@debbugs.gnu.org; Thu, 28 May 2015 09:17:52 -0400 Received: from ishtar.tlinx.org ([173.164.175.65]:49258 helo=Ishtar.hs.tlinx.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Yxxgs-0004eW-SN for 20678@debbugs.gnu.org; Thu, 28 May 2015 09:17:48 -0400 Received: from [192.168.4.12] (Athenae [192.168.4.12]) by Ishtar.hs.tlinx.org (8.14.7/8.14.4/SuSE Linux 0.8) with ESMTP id t4SDHWYF061101; Thu, 28 May 2015 06:17:34 -0700 Message-ID: <5567156C.3050207@tlinx.org> Date: Thu, 28 May 2015 06:17:32 -0700 From: Linda Walsh User-Agent: Thunderbird MIME-Version: 1.0 To: Bernhard Voelker Subject: Re: bug#20678: new bug that Paul "asked" for... grep -P aborts on non-utf8 input. References: <556115A2.2020404@tlinx.org> <5561CB0B.9090409@redhat.com> <5562C5A3.7010301@tlinx.org> <55633D60.10907@cs.ucla.edu> <556639F8.4080406@tlinx.org> <55663F62.3050005@cs.ucla.edu> <55664403.2070707@tlinx.org> <5566C115.8030107@bernhard-voelker.de> In-Reply-To: <5566C115.8030107@bernhard-voelker.de> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Score: -0.0 (/) X-Debbugs-Envelope-To: 20678 Cc: Paul Eggert , 20678@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.0 (/) Bernhard Voelker wrote: > On 05/28/2015 12:24 AM, Linda Walsh wrote: >> ok... ARG -- I just installed the new version of grep from >> my distro (suse13.2) -- grep-2.20-2.4.1.x86_64 >> >> I think they'll be out with a new distro release in about >> a year...(yes, I can probably build my own...like I have >> to with a growing body of Software) > > This is openSUSE specific. > When you've built your own version with a patch for a problem, > nothing prevents you from simply creating a submit request for > that patch on OBS to "Base:System/grep", and maybe even creating > a maintenance request for "openSUSE:13.2/grep". Get involved. ---- Main thing my patch is restoring functionality of 'rm' to allow "rm -fr .", I'm not daft enough to try to sneak that in as a default. Maybe in a different command, maybe as a non-default, but I'm anything but duplicitous (unfortunately). I _have_ always thought that a shorthand combination of rd and rm, might be nice -- maybe 'r'... Of course it would only work like rmdir on empty dirs unless they specify the "-r" flag so it could remove contents first. And of course it would pay attention to the posix rule about not trying to delete '.' after it finished its' depth first traversal... But no one else seems to really care that much, so I'm not sure how much effort I want to put into it to package something like that up. But it has entered my mind... Cheers, Lina From debbugs-submit-bounces@debbugs.gnu.org Thu May 28 09:29:18 2015 Received: (at 20678) by debbugs.gnu.org; 28 May 2015 13:29:18 +0000 Received: from localhost ([127.0.0.1]:59103 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Yxxs1-0004uB-5S for submit@debbugs.gnu.org; Thu, 28 May 2015 09:29:18 -0400 Received: from mout.kundenserver.de ([212.227.126.187]:57683) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Yxxrr-0004tX-4f for 20678@debbugs.gnu.org; Thu, 28 May 2015 09:29:08 -0400 Received: from [10.0.4.15] ([62.153.148.194]) by mrelayeu.kundenserver.de (mreue003) with ESMTPSA (Nemesis) id 0MY20C-1Yc3If2cam-00UuEM; Thu, 28 May 2015 15:28:49 +0200 Message-ID: <5567180F.70406@bernhard-voelker.de> Date: Thu, 28 May 2015 15:28:47 +0200 From: Bernhard Voelker User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.6.0 MIME-Version: 1.0 To: Linda Walsh Subject: Re: bug#20678: new bug that Paul "asked" for... grep -P aborts on non-utf8 input. References: <556115A2.2020404@tlinx.org> <5561CB0B.9090409@redhat.com> <5562C5A3.7010301@tlinx.org> <55633D60.10907@cs.ucla.edu> <556639F8.4080406@tlinx.org> <55663F62.3050005@cs.ucla.edu> <55664403.2070707@tlinx.org> <5566C115.8030107@bernhard-voelker.de> <5567156C.3050207@tlinx.org> In-Reply-To: <5567156C.3050207@tlinx.org> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit X-Provags-ID: V03:K0:MzXIeQCBs/nRev0j9Z4ySeMi1y02N3iYGVxmZWSKtXLuglxCWa7 wQ1l40E2z9Lfy23Eht5oWn8x+cXNabdpReTw1MJ/Gq8ZQydH+cP18EoolvpqY8Im/Ah94sM vFX/35nWju+mVM2SF7xEZo7y4XF60gWSXAgNdkPGLAvnYB/YcyfOz6YLR5nsZPQlqkgAvrp wbyr4V9DDFKRcY9PMqcyA== X-UI-Out-Filterresults: notjunk:1;V01:K0:zBV2r018PYM=:MrR8CkeaAXQ4TMaCpBsWDd htAs1B0e4+eDwjkZYkQAXLSXqqMyVot+2L9DRMKopGhdvMU0XONY/lLzj4dxkejS06JgJyzqf IP4NdG6mxraXNcKUIZ39PmxTf2Xidr7KDdp6yyK3AWqhzC0tHMuxh2HmwjduMn47y8miC/QPW dGfKbFDqzLu6OVcpffEYYkDjko2oxyzQA3rM6/JHqtCQu3suExfoH0iie4TjvlyCau0dEjw1L mvEYQKOYbTlKXCEl/FffFFCkOPDpVslonc4XpSAupN4GCGSY1WrwLDtl+IbwUueLF42AVshOr DIDC2gV4Wf2+Wtf8zfF0/6sEpOEjy5SyTHm+d8E6PIJ/IuykU8aE3xhWYu/2naEgutzY8PCnj TswPJiMiAceAUriflUnl/PHqOpEPqIZuN5kifJKw0yXuB9RiJoGGP4iVSDN9mrimjLoOZeATT rkdl99KUNMycU9l7khez+whOJ75M2EsSxdt3bo/YZNHyW1hYR18NJWA3SmgMBXzEs3wmT0/gK wsOSuVT9SpfSlNwxitJAwtPzb5vq4NNDwCb5hJALsTVbH6RjdGcAVA8IdNq5xTe0qqUqkopTV Ux6Uett4Ngwu7WSLj4BDhyVCVdH8rwWiWUlQN6gH13e/ozmIru1nVtAi6IzXyKQJYO/12BsOj Ad0bJUOlBqhZTNQVKO7Ji/cdA+gCbHcinlpQjuqh96gqdhA== X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 20678 Cc: Paul Eggert , 20678@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 0.0 (/) On 05/28/2015 03:17 PM, Linda Walsh wrote: > Bernhard Voelker wrote: >> On 05/28/2015 12:24 AM, Linda Walsh wrote: >>> ok... ARG -- I just installed the new version of grep from >>> my distro (suse13.2) -- grep-2.20-2.4.1.x86_64 >>> >>> I think they'll be out with a new distro release in about >>> a year...(yes, I can probably build my own...like I have >>> to with a growing body of Software) >> >> This is openSUSE specific. >> When you've built your own version with a patch for a problem, >> nothing prevents you from simply creating a submit request for >> that patch on OBS to "Base:System/grep", and maybe even creating >> a maintenance request for "openSUSE:13.2/grep". Get involved. > ---- > Main thing my patch is restoring functionality of 'rm' > to allow "rm -fr ." [...] stop, 'rm -rf .' is a completely different story - your bug report was about 'grep -P' (for which the bug report is OT on the coreutils mailing list btw.). Having your own (probably non-generally wanted) patches in your own OBS project is perfect. I was just talking about submitting the patch for the non-utf8 issue (for which I personally didn't check whether it is already included downstreams). Have a nice day, Berny From unknown Fri Aug 15 14:16:57 2025 Received: (at fakecontrol) by fakecontrolmessage; To: internal_control@debbugs.gnu.org From: Debbugs Internal Request Subject: Internal Control Message-Id: bug archived. Date: Fri, 26 Jun 2015 11:24:05 +0000 User-Agent: Fakemail v42.6.9 # This is a fake control message. # # The action: # bug archived. thanks # This fakemail brought to you by your local debbugs # administrator