GNU bug report logs - #16499
GNU grep-2.16-1.mga4 , grep-2.16 from sources and grep from git master HEAD get stuck during an LC_ALL=en_US.UTF-8 search inside a short binary file

Previous Next

Package: grep;

Reported by: Shlomi Fish <shlomif <at> shlomifish.org>

Date: Sun, 19 Jan 2014 18:47:03 UTC

Severity: normal

Done: Jim Meyering <jim <at> meyering.net>

Bug is archived. No further changes may be made.

Full log


View this message in rfc822 format

From: help-debbugs <at> gnu.org (GNU bug Tracking System)
To: Jim Meyering <jim <at> meyering.net>
Cc: tracker <at> debbugs.gnu.org
Subject: bug#16499: closed (GNU grep-2.16-1.mga4 , grep-2.16 from sources
 and grep from git master HEAD get stuck during an LC_ALL=en_US.UTF-8
 search inside a short binary file)
Date: Tue, 21 Jan 2014 23:57:02 +0000
[Message part 1 (text/plain, inline)]
Your message dated Tue, 21 Jan 2014 15:56:21 -0800
with message-id <CA+8g5KEW=jkzWj4BpwL-P_UEtzwXvDeOi4ywz-0d2mqvPiSKwA <at> mail.gmail.com>
and subject line Re: GNU grep-2.16-1.mga4 , grep-2.16 from sources and grep from git master HEAD get stuck during an LC_ALL=en_US.UTF-8 search inside a short binary file
has caused the debbugs.gnu.org bug report #16499,
regarding GNU grep-2.16-1.mga4 , grep-2.16 from sources and grep from git master HEAD get stuck during an LC_ALL=en_US.UTF-8 search inside a short binary file
to be marked as done.

(If you believe you have received this mail in error, please contact
help-debbugs <at> gnu.org.)


-- 
16499: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=16499
GNU Bug Tracking System
Contact help-debbugs <at> gnu.org with problems
[Message part 2 (message/rfc822, inline)]
From: Shlomi Fish <shlomif <at> shlomifish.org>
To: bug-grep <at> gnu.org
Subject: GNU grep-2.16-1.mga4 , grep-2.16 from sources and grep from git
 master HEAD get stuck during an LC_ALL=en_US.UTF-8 search inside a short
 binary file
Date: Sun, 19 Jan 2014 17:10:11 +0200
[Message part 3 (text/plain, inline)]
Hi all,

after I save the attached file as 1.dat , I see that grep -iP on '^Subject:'
or on '^S' gets stuck in the en_US.UTF-8 locale. It is fine in pcregrep and in
ack.

[SHELL]
shlomif <at> telaviv1:~$ time LC_ALL=en_US.UTF-8
~/apps/TEST-grep-from-git-TO-DEL/bin/grep -iP '^Subject:' < 1.dat ^C

real    0m4.199s
user    0m4.195s
sys     0m0.003s
shlomif <at> telaviv1:~$ time LC_ALL=en_US.UTF-8
~/apps/TEST-grep-from-git-TO-DEL/bin/grep -iP '^S' < 1.dat ^C

real    0m3.486s
user    0m3.485s
sys     0m0.001s
shlomif <at> telaviv1:~$ time LC_ALL=en_US.UTF-8
~/apps/TEST-grep-from-git-TO-DEL/bin/grep -iE '^S' < 1.dat

real    0m0.002s
user    0m0.002s
sys     0m0.000s
shlomif <at> telaviv1:~$ time LC_ALL=en_US.UTF-8
~/apps/TEST-grep-from-git-TO-DEL/bin/grep -P '^S' < 1.dat ^C

real    0m1.887s
user    0m1.885s
sys     0m0.000s
shlomif <at> telaviv1:~$ time LC_ALL=en_US.UTF-8
~/apps/TEST-grep-from-git-TO-DEL/bin/grep -P '^Subject:' < 1.dat

real    0m0.003s
user    0m0.000s
sys     0m0.002s
shlomif <at> telaviv1:~$ time LC_ALL=en_US.UTF-8
~/apps/TEST-grep-from-git-TO-DEL/bin/grep -P '^Subject:' < 1.dat time LC_ALL=C
~/apps/TEST-grep-from-git-TO-DEL/bin/grep -iP '^Subject:' < 1.dat

real    0m0.003s
user    0m0.001s
sys     0m0.001s
shlomif <at> telaviv1:~$ time LC_ALL=C pcregrep -i '^Subject:' < 1.dat

real    0m0.002s
user    0m0.001s
sys     0m0.000s
shlomif <at> telaviv1:~$ time LC_ALL=C ack -i '^Subject:' 1.dat

real    0m0.066s
user    0m0.059s
sys     0m0.007s
shlomif <at> telaviv1:~$ time LC_ALL=en_US.UTF-8 ack -i '^Subject:' 1.dat

real    0m0.070s
user    0m0.063s
sys     0m0.006s
[/SHELL]

The same thing happens with grep-2.16 built from the sources. I'm on Mageia
Linux x86-64 Cauldron (what will be Mageia 4). 

shlomif <at> telaviv1:~$ ldd ~/apps/TEST-grep-from-git-TO-DEL/bin/grep 
        linux-vdso.so.1 (0x00007fff2a7fe000)
        libpcre.so.1 => /lib64/libpcre.so.1 (0x00007f19ed302000)
        libc.so.6 => /lib64/libc.so.6 (0x00007f19ecf4d000)
        libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f19ecd30000)
        /lib64/ld-linux-x86-64.so.2 (0x00007f19ed568000)
shlomif <at> telaviv1:~$ rpm -qf /lib64/libpcre.so.1
lib64pcre1-8.33-2.mga4

Regards,

	Shlomi Fish

-- 
-----------------------------------------------------------------
Shlomi Fish       http://www.shlomifish.org/
Humanity - Parody of Modern Life - http://shlom.in/humanity

Linux — Because Software Problems Should not Cost Money.

Please reply to list if it's a mailing list post - http://shlom.in/reply .
[1.dat (application/octet-stream, attachment)]
[Message part 5 (message/rfc822, inline)]
From: Jim Meyering <jim <at> meyering.net>
To: Shlomi Fish <shlomif <at> shlomifish.org>
Cc: Paolo Bonzini <bonzini <at> gnu.org>, shlomif <shlomif <at> gmail.com>,
 16499-done <at> debbugs.gnu.org
Subject: Re: GNU grep-2.16-1.mga4 , grep-2.16 from sources and grep from git
 master HEAD get stuck during an LC_ALL=en_US.UTF-8 search inside a short
 binary file
Date: Tue, 21 Jan 2014 15:56:21 -0800
tags 16499 notabug
close 16499
thanks

Thank you for investigating.  I agree that it sure looks like the bug
is in libpcre, and not in grep itself.  If you haven't already
reported it to libpcre developers, would you please do that?


This bug report was last modified 11 years and 118 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.