GNU bug report logs - #24973
[regression] [d-f] no longer includes e with acute accent in single-byte locales

Previous Next

Package: grep;

Reported by: Stephane Chazelas <stephane.chazelas <at> gmail.com>

Date: Sun, 20 Nov 2016 21:15:01 UTC

Severity: normal

Done: Paul Eggert <eggert <at> cs.ucla.edu>

Bug is archived. No further changes may be made.

Full log


View this message in rfc822 format

From: help-debbugs <at> gnu.org (GNU bug Tracking System)
To: Stephane Chazelas <stephane.chazelas <at> gmail.com>
Subject: bug#24973: closed (Re: bug#24973: [regression] [d-f] no longer
 includes e with acute accent in single-byte locales)
Date: Mon, 21 Nov 2016 04:35:02 +0000
[Message part 1 (text/plain, inline)]
Your bug report

#24973: [regression] [d-f] no longer includes e with acute accent in single-byte locales

which was filed against the grep package, has been closed.

The explanation is attached below, along with your original report.
If you require more details, please reply to 24973 <at> debbugs.gnu.org.

-- 
24973: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=24973
GNU Bug Tracking System
Contact help-debbugs <at> gnu.org with problems
[Message part 2 (message/rfc822, inline)]
From: Paul Eggert <eggert <at> cs.ucla.edu>
To: Stephane Chazelas <stephane.chazelas <at> gmail.com>, 24973-done <at> debbugs.gnu.org
Subject: Re: bug#24973: [regression] [d-f] no longer includes e with acute
 accent in single-byte locales
Date: Sun, 20 Nov 2016 20:34:35 -0800
[Message part 3 (text/plain, inline)]
Stephane Chazelas wrote:
> 2.25 was OK. git bisect points to commit
> 2769d5331a38d623b67b1860ac46b39ff7e54aca

Thanks for pinpointing the bug. It was my logic error in that commit. Fixed by 
altering Gnulib as follows:

http://lists.gnu.org/archive/html/bug-gnulib/2016-11/msg00086.html

and by installing the attached patches into grep.
[0001-build-update-gnulib-submodule-to-latest.patch (text/x-diff, attachment)]
[0002-tests-check-for-unibyte-French-range-bug.patch (text/x-diff, attachment)]
[Message part 6 (message/rfc822, inline)]
From: Stephane Chazelas <stephane.chazelas <at> gmail.com>
To: bug-grep <at> gnu.org
Subject: [regression] [d-f] no longer includes e with acute accent in
 single-byte locales
Date: Sun, 20 Nov 2016 21:14:31 +0000
Hello,

In grep 2.26,

echo é | grep '[d-f]'

no longer matches in locales like fr_FR.iso885915 <at> euro or
en_GB.iso88591 where the character set is single-byte like
ISO-8859-1. It still works OK with UTF-8.

2.25 was OK. git bisect points to commit
2769d5331a38d623b67b1860ac46b39ff7e54aca

Reproduce with:

printf '\351\n' | LC_ALL=en_US.iso88591 ./src/grep '[d-f]' || echo fail

(assuming that locale is available on the system).

Tested on Ubuntu 16.04 amd64.

-- 
Stephane



This bug report was last modified 8 years and 184 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.