GNU bug report logs -
#47858
Document that FULLWIDTH DIGITs boggle coreutils brain currently
Previous Next
Full log
Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):
(info "(coreutils) Correct/Incorrect ordering and Expected/Unexpected results")
needs to "admit" or "at least mention" that "coreutils just hasn't thought about FULLWIDTH
DIGITs yet." I.e., Europeans are in luck. East Asians are out of luck.
$ seq 9 11|sort
10
11
9
$ seq 9 11|sort -n
9
10
11
$ seq 9 11|sort -V
9
10
11
$ echo -e 9\\n10\\n11|sort
10
11
9
$ echo -e 9\\n10\\n11|sort -V
10
11
9
$ echo -e 9\\n10\\n11|sort -n
10
11
9
$ unicode 9
U+FF19 FULLWIDTH DIGIT NINE
UTF-8: ef bc 99 UTF-16BE: ff19 Decimal: 9 Octal: \0177431
9
Category: Nd (Number, Decimal Digit); East Asian width: F (fullwidth)
Unicode block: FF00..FFEF; Halfwidth and Fullwidth Forms
Numeric value: 9
Digit value: 9
Bidi: EN (European Number)
Decomposition: <wide> 0039
Nope, LC_ALL=zh_TW.UTF-8 didn't help.
sort (GNU coreutils) 8.32
This bug report was last modified 4 years and 58 days ago.
Previous Next
GNU bug tracking system
Copyright (C) 1999 Darren O. Benham,
1997,2003 nCipher Corporation Ltd,
1994-97 Ian Jackson.