GNU bug report logs -
#19878
24.4; Syntax class [:alpha:] wrongly matches the Indian digits ۱۲۳۴۵۶۷۸۹۰ as letter
Previous Next
Reported by: mohammad.mahmoudi <at> gmail.com
Date: Sun, 15 Feb 2015 19:25:02 UTC
Severity: normal
Found in version 24.4
Done: Eli Zaretskii <eliz <at> gnu.org>
Bug is archived. No further changes may be made.
Full log
Message #14 received at 19878 <at> debbugs.gnu.org (full text, mbox):
>>>>> Eli Zaretskii <eliz <at> gnu.org> writes:
[…]
> Also, does someone see any potential problem to make [:digit:] be a
> superset of the current ASCII-only set, to match UTS #18 as well?
> The comment in regex.c says it is "only used for single-byte
> characters", but it isn't clear to me whether this is a requirement,
> i. e. there's some code in Emacs that relies on that, or just a
> statement of facts.
Just for a random data point, my own preference was to always
use [0-9] when the intent is to discern a number for a later use
of number-to-string, etc. Frankly, I can’t even readily suggest
any reasonable examples where one’d want to use [:digit:] in the
first place.
[…]
--
FSF associate member #7257 http://boycottsystemd.org/ … 3013 B6A0 230E 334A
This bug report was last modified 10 years and 137 days ago.
Previous Next
GNU bug tracking system
Copyright (C) 1999 Darren O. Benham,
1997,2003 nCipher Corporation Ltd,
1994-97 Ian Jackson.