GNU bug report logs - #24973
[regression] [d-f] no longer includes e with acute accent in single-byte locales

Previous Next

Package: grep;

Reported by: Stephane Chazelas <stephane.chazelas <at> gmail.com>

Date: Sun, 20 Nov 2016 21:15:01 UTC

Severity: normal

Done: Paul Eggert <eggert <at> cs.ucla.edu>

Bug is archived. No further changes may be made.

Full log


View this message in rfc822 format

From: Stephane Chazelas <stephane.chazelas <at> gmail.com>
To: 24973 <at> debbugs.gnu.org
Subject: bug#24973: [regression] [d-f] no longer includes e with acute accent in single-byte locales
Date: Sun, 20 Nov 2016 21:23:16 +0000
2016-11-20 21:14:31 +0000, Stephane Chazelas:
[...]
> echo é | grep '[d-f]'
> 
> no longer matches in locales like fr_FR.iso885915 <at> euro or
> en_GB.iso88591 where the character set is single-byte like
> ISO-8859-1. It still works OK with UTF-8.
[...]

I also seems to still be OK with other multi-byte locales like
zh_HK.big5hkscs:

$ locale charmap
BIG5-HKSCS
$ printf '\ue9' | ./src/grep '[d-f]' | hd
00000000  88 6d 0a                                          |.m.|
00000003

Though:

$ printf '\ue9' | ./src/grep '.*m' | hd
00000000  88 6d 0a                                          |.m.|

However, that seems to be a separate issue as it also failed in
earlier versions. I'll raise that separately.

-- 
Stephane




This bug report was last modified 8 years and 184 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.