GNU bug report logs - #24973
[regression] [d-f] no longer includes e with acute accent in single-byte locales

Previous Next

Package: grep;

Reported by: Stephane Chazelas <stephane.chazelas <at> gmail.com>

Date: Sun, 20 Nov 2016 21:15:01 UTC

Severity: normal

Done: Paul Eggert <eggert <at> cs.ucla.edu>

Bug is archived. No further changes may be made.

Full log


View this message in rfc822 format

From: help-debbugs <at> gnu.org (GNU bug Tracking System)
To: Paul Eggert <eggert <at> cs.ucla.edu>
Cc: tracker <at> debbugs.gnu.org
Subject: bug#24973: closed ([regression] [d-f] no longer includes e with
 acute accent in single-byte locales)
Date: Mon, 21 Nov 2016 04:35:02 +0000
[Message part 1 (text/plain, inline)]
Your message dated Sun, 20 Nov 2016 20:34:35 -0800
with message-id <48683d86-36dc-7a02-4024-56870014b294 <at> cs.ucla.edu>
and subject line Re: bug#24973: [regression] [d-f] no longer includes e with acute accent in single-byte locales
has caused the debbugs.gnu.org bug report #24973,
regarding [regression] [d-f] no longer includes e with acute accent in single-byte locales
to be marked as done.

(If you believe you have received this mail in error, please contact
help-debbugs <at> gnu.org.)


-- 
24973: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=24973
GNU Bug Tracking System
Contact help-debbugs <at> gnu.org with problems
[Message part 2 (message/rfc822, inline)]
From: Stephane Chazelas <stephane.chazelas <at> gmail.com>
To: bug-grep <at> gnu.org
Subject: [regression] [d-f] no longer includes e with acute accent in
 single-byte locales
Date: Sun, 20 Nov 2016 21:14:31 +0000
Hello,

In grep 2.26,

echo é | grep '[d-f]'

no longer matches in locales like fr_FR.iso885915 <at> euro or
en_GB.iso88591 where the character set is single-byte like
ISO-8859-1. It still works OK with UTF-8.

2.25 was OK. git bisect points to commit
2769d5331a38d623b67b1860ac46b39ff7e54aca

Reproduce with:

printf '\351\n' | LC_ALL=en_US.iso88591 ./src/grep '[d-f]' || echo fail

(assuming that locale is available on the system).

Tested on Ubuntu 16.04 amd64.

-- 
Stephane


[Message part 3 (message/rfc822, inline)]
From: Paul Eggert <eggert <at> cs.ucla.edu>
To: Stephane Chazelas <stephane.chazelas <at> gmail.com>, 24973-done <at> debbugs.gnu.org
Subject: Re: bug#24973: [regression] [d-f] no longer includes e with acute
 accent in single-byte locales
Date: Sun, 20 Nov 2016 20:34:35 -0800
[Message part 4 (text/plain, inline)]
Stephane Chazelas wrote:
> 2.25 was OK. git bisect points to commit
> 2769d5331a38d623b67b1860ac46b39ff7e54aca

Thanks for pinpointing the bug. It was my logic error in that commit. Fixed by 
altering Gnulib as follows:

http://lists.gnu.org/archive/html/bug-gnulib/2016-11/msg00086.html

and by installing the attached patches into grep.
[0001-build-update-gnulib-submodule-to-latest.patch (text/x-diff, attachment)]
[0002-tests-check-for-unibyte-French-range-bug.patch (text/x-diff, attachment)]

This bug report was last modified 8 years and 184 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.