GNU bug report logs - #34492
rx: ASCII-raw byte ranges comprise all of Unicode

Previous Next

Package: emacs;

Reported by: Mattias Engdegård <mattiase <at> acm.org>

Date: Fri, 15 Feb 2019 18:25:02 UTC

Severity: normal

Done: Mattias Engdegård <mattiase <at> acm.org>

Bug is archived. No further changes may be made.

Full log


View this message in rfc822 format

From: help-debbugs <at> gnu.org (GNU bug Tracking System)
To: Mattias Engdegård <mattiase <at> acm.org>
Cc: tracker <at> debbugs.gnu.org
Subject: bug#34492: closed (rx: ASCII-raw byte ranges comprise all of Unicode)
Date: Sat, 16 Feb 2019 11:47:02 +0000
[Message part 1 (text/plain, inline)]
Your message dated Sat, 16 Feb 2019 12:46:16 +0100
with message-id <3FF91096-B760-4012-B207-512C173E254E <at> acm.org>
and subject line Re: bug#34492: Acknowledgement (rx: ASCII-raw byte ranges comprise all of Unicode)
has caused the debbugs.gnu.org bug report #34492,
regarding rx: ASCII-raw byte ranges comprise all of Unicode
to be marked as done.

(If you believe you have received this mail in error, please contact
help-debbugs <at> gnu.org.)


-- 
34492: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=34492
GNU Bug Tracking System
Contact help-debbugs <at> gnu.org with problems
[Message part 2 (message/rfc822, inline)]
From: Mattias Engdegård <mattiase <at> acm.org>
To: bug-gnu-emacs <at> gnu.org
Subject: rx: ASCII-raw byte ranges comprise all of Unicode
Date: Fri, 15 Feb 2019 19:23:56 +0100
`rx' incorrectly considers character ranges between ASCII and raw bytes to cover all codes in-between, which includes all non-ASCII Unicode chars.
This causes (any "\000-\377" ?Å) to be simplified to (any "\000-\377"), which is not at all the same thing: [\000-\377] really means [\000-\177\200-\377] -- the transformation is normally made by the Emacs regexp engine. The two ranges are not contiguous on the character code level.

It's a sleeper bug that was awakened by my fixing bug#33205, so I'm to blame for not checking this.



[Message part 3 (message/rfc822, inline)]
From: Mattias Engdegård <mattiase <at> acm.org>
To: Eli Zaretskii <eliz <at> gnu.org>
Cc: 34492-done <at> debbugs.gnu.org
Subject: Re: bug#34492: Acknowledgement (rx: ASCII-raw byte ranges comprise
 all of Unicode)
Date: Sat, 16 Feb 2019 12:46:16 +0100
16 feb. 2019 kl. 12.40 skrev Eli Zaretskii <eliz <at> gnu.org>:
> 
> This is OK, but we use quoting 'like this' in NEWS.

Thank you, pushed with that modification.



This bug report was last modified 6 years and 95 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.