GNU bug report logs - #19738
How did [a-z] match é?

Previous Next

Package: grep;

Reported by: James Calfee <james <at> jcalfee.info>

Date: Sat, 31 Jan 2015 02:48:02 UTC

Severity: normal

Tags: notabug

Done: Eric Blake <eblake <at> redhat.com>

Bug is archived. No further changes may be made.

Full log


View this message in rfc822 format

From: help-debbugs <at> gnu.org (GNU bug Tracking System)
To: Eric Blake <eblake <at> redhat.com>
Cc: tracker <at> debbugs.gnu.org
Subject: bug#19738: closed (How did [a-z] match é?)
Date: Sat, 31 Jan 2015 22:26:03 +0000
[Message part 1 (text/plain, inline)]
Your message dated Sat, 31 Jan 2015 15:25:29 -0700
with message-id <54CD5659.4020504 <at> redhat.com>
and subject line Re: bug#19738: How did [a-z] match é?
has caused the debbugs.gnu.org bug report #19738,
regarding How did [a-z] match é?
to be marked as done.

(If you believe you have received this mail in error, please contact
help-debbugs <at> gnu.org.)


-- 
19738: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=19738
GNU Bug Tracking System
Contact help-debbugs <at> gnu.org with problems
[Message part 2 (message/rfc822, inline)]
From: James Calfee <james <at> jcalfee.info>
To: bug-grep <at> gnu.org
Subject: How did [a-z] match é?
Date: Fri, 30 Jan 2015 20:44:13 -0500
[Message part 3 (text/plain, inline)]
http://stackoverflow.com/questions/28247204/how-did-a-z-match-%C3%A9
[Message part 4 (text/html, inline)]
[Message part 5 (message/rfc822, inline)]
From: Eric Blake <eblake <at> redhat.com>
To: James Calfee <james <at> jcalfee.info>, 19738-done <at> debbugs.gnu.org
Subject: Re: bug#19738: How did [a-z] match é?
Date: Sat, 31 Jan 2015 15:25:29 -0700
[Message part 6 (text/plain, inline)]
tag 19738 notabug
thanks

On 01/30/2015 06:44 PM, James Calfee wrote:
> http://stackoverflow.com/questions/28247204/how-did-a-z-match-%C3%A9

Rather than making us chase a link, how about pasting the content of
your question directly in your mail.

At any rate, the answer to the question in your subject line is that
your current locale is set up to include letters outside of the 26 ascii
bytes.  Run with LC_ALL=C in the environment to get the well-defined
behavior you seem to want, and note that POSIX states that range
expressions in regular expressions are undefined in non-C locales,
precisely because some tools use all collation elements that 'sort'
would place in the range, while other tools use 'rational range
expression' and treat the range as if it were ASCII instead of the
current locale.

This is not a bug in coreutils, so I'm closing the tracker bug, but do
feel free to make further comments or questions.


-- 
Eric Blake   eblake redhat com    +1-919-301-3266
Libvirt virtualization library http://libvirt.org

[signature.asc (application/pgp-signature, attachment)]

This bug report was last modified 10 years and 117 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.