GNU bug report logs - #34492
rx: ASCII-raw byte ranges comprise all of Unicode


Package: emacs

Reported by: Mattias Engdegård <mattiase <at> acm.org>

Date: Fri, 15 Feb 2019 18:25:02 UTC

Severity: normal

Done: Mattias Engdegård <mattiase <at> acm.org>

Bug is archived. No further changes may be made.


From: help-debbugs <at> gnu.org (GNU bug Tracking System)
To: Mattias Engdegård <mattiase <at> acm.org>
Subject: bug#34492: closed (Re: bug#34492: Acknowledgement (rx: ASCII-raw
 byte ranges comprise all of Unicode))
Date: Sat, 16 Feb 2019 11:47:02 +0000
[Message part 1 (text/plain, inline)]
Your bug report

#34492: rx: ASCII-raw byte ranges comprise all of Unicode

which was filed against the emacs package, has been closed.

The explanation is attached below, along with your original report.
If you require more details, please reply to 34492 <at> debbugs.gnu.org.

-- 
34492: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=34492
GNU Bug Tracking System
Contact help-debbugs <at> gnu.org with problems
[Message part 2 (message/rfc822, inline)]
From: Mattias Engdegård <mattiase <at> acm.org>
To: Eli Zaretskii <eliz <at> gnu.org>
Cc: 34492-done <at> debbugs.gnu.org
Subject: Re: bug#34492: Acknowledgement (rx: ASCII-raw byte ranges comprise
 all of Unicode)
Date: Sat, 16 Feb 2019 12:46:16 +0100
On 16 Feb 2019, at 12:40, Eli Zaretskii <eliz <at> gnu.org> wrote:
> 
> This is OK, but we use quoting 'like this' in NEWS.

Thank you, pushed with that modification.


[Message part 3 (message/rfc822, inline)]
From: Mattias Engdegård <mattiase <at> acm.org>
To: bug-gnu-emacs <at> gnu.org
Subject: rx: ASCII-raw byte ranges comprise all of Unicode
Date: Fri, 15 Feb 2019 19:23:56 +0100
`rx' incorrectly treats a character range that runs from an ASCII character to a raw byte as covering every code in between, which includes all non-ASCII Unicode characters.
As a result, (any "\000-\377" ?Å) is simplified to (any "\000-\377"), which is not the same thing at all: [\000-\377] really means [\000-\177\200-\377], where \200-\377 are raw bytes; that split is normally made by the Emacs regexp engine. The two ranges are not contiguous at the character-code level.
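
The gap between the two ranges can be modelled with a short sketch. It is written in Python rather than Emacs Lisp and is not the `rx' implementation; the function names are hypothetical. It assumes Emacs's internal convention that raw bytes 0x80..0xFF occupy character codes 0x3FFF80..0x3FFFFF, above the Unicode maximum of 0x10FFFF.

```python
# Illustrative model of Emacs character codes; not Emacs source code.
# Assumption: raw byte b (0x80..0xFF) is stored as code 0x3FFF00 + b.
MAX_UNICODE = 0x10FFFF
RAW_BYTE_BASE = 0x3FFF00


def raw_byte(b):
    """Return the character code that represents raw byte b."""
    if not 0x80 <= b <= 0xFF:
        raise ValueError("raw bytes are 0x80..0xFF")
    return RAW_BYTE_BASE + b


def naive_range(lo_ascii, hi_byte):
    """Buggy reading: one interval from an ASCII code to a raw-byte code."""
    return [(lo_ascii, raw_byte(hi_byte))]


def split_range(lo_ascii, hi_byte):
    """Intended reading: an ASCII part plus a separate raw-byte part."""
    return [(lo_ascii, 0x7F), (raw_byte(0x80), raw_byte(hi_byte))]


def covers(ranges, code):
    """True if code lies inside any (lo, hi) interval, bounds inclusive."""
    return any(lo <= code <= hi for lo, hi in ranges)


a_ring = ord("\u00c5")  # the Unicode character Å, code 0xC5

# The naive interval swallows every Unicode character above ASCII,
# so Å looks redundant next to it and would be dropped.
naive_covers_a_ring = covers(naive_range(0x00, 0xFF), a_ring)

# The split form leaves 0x80..0x3FFF7F uncovered, so Å must be kept.
split_covers_a_ring = covers(split_range(0x00, 0xFF), a_ring)
```

In this model the raw byte \305 and the character Å are different codes (0x3FFFC5 and 0xC5), which is why only the split form keeps them apart.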

It's a sleeper bug that was awakened by my fixing bug#33205, so I'm to blame for not checking this.




This bug report was last modified 6 years and 124 days ago.



GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.