From unknown Sat Jun 14 18:43:56 2025 X-Loop: help-debbugs@gnu.org Subject: bug#47858: Document that FULLWIDTH DIGITs boggle coreutils brain currently Resent-From: =?UTF-8?Q?=E7=A9=8D=E4=B8=B9=E5=B0=BC?= Dan Jacobson Original-Sender: "Debbugs-submit" Resent-CC: bug-coreutils@gnu.org Resent-Date: Sun, 18 Apr 2021 01:13:04 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: report 47858 X-GNU-PR-Package: coreutils X-GNU-PR-Keywords: To: 47858@debbugs.gnu.org X-Debbugs-Original-To: bug-coreutils@gnu.org Received: via spool by submit@debbugs.gnu.org id=B.161870836418610 (code B ref -1); Sun, 18 Apr 2021 01:13:04 +0000 Received: (at submit) by debbugs.gnu.org; 18 Apr 2021 01:12:44 +0000 Received: from localhost ([127.0.0.1]:44964 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1lXvzQ-0004q1-7C for submit@debbugs.gnu.org; Sat, 17 Apr 2021 21:12:44 -0400 Received: from lists.gnu.org ([209.51.188.17]:57400) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1lXvzM-0004pc-FE for submit@debbugs.gnu.org; Sat, 17 Apr 2021 21:12:40 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]:47792) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1lXvzM-0003To-Aq for bug-coreutils@gnu.org; Sat, 17 Apr 2021 21:12:40 -0400 Received: from olivedrab.birch.relay.mailchannels.net ([23.83.209.135]:56801) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1lXvzK-0002hN-CX for bug-coreutils@gnu.org; Sat, 17 Apr 2021 21:12:40 -0400 X-Sender-Id: dreamhost|x-authsender|jidanni@jidanni.org Received: from relay.mailchannels.net (localhost [127.0.0.1]) by relay.mailchannels.net (Postfix) with ESMTP id F067B3611BF for ; Sun, 18 Apr 2021 01:12:30 +0000 (UTC) Received: from pdx1-sub0-mail-a9.g.dreamhost.com (100-96-11-98.trex.outbound.svc.cluster.local [100.96.11.98]) (Authenticated sender: dreamhost) by relay.mailchannels.net (Postfix) with ESMTPA id 8FE40360FA9 for ; Sun, 18 Apr 2021 01:12:30 +0000 (UTC) X-Sender-Id: dreamhost|x-authsender|jidanni@jidanni.org Received: from pdx1-sub0-mail-a9.g.dreamhost.com (pop.dreamhost.com [64.90.62.162]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384) by 100.96.11.98 (trex/6.1.1); Sun, 18 Apr 2021 01:12:30 +0000 X-MC-Relay: Neutral X-MailChannels-SenderId: dreamhost|x-authsender|jidanni@jidanni.org X-MailChannels-Auth-Id: dreamhost X-Decisive-Share: 39f4f8e70986da50_1618708350800_1772418879 X-MC-Loop-Signature: 1618708350800:1229770116 X-MC-Ingress-Time: 1618708350800 Received: from pdx1-sub0-mail-a9.g.dreamhost.com (localhost [127.0.0.1]) by pdx1-sub0-mail-a9.g.dreamhost.com (Postfix) with ESMTP id 57F6586747 for ; Sat, 17 Apr 2021 18:12:30 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha1; c=relaxed; d=jidanni.org; h=from:to :subject:date:message-id:mime-version:content-type :content-transfer-encoding; s=jidanni.org; bh=TARtNiNy7AbSZkxmtG vQ2DZF2EQ=; b=AtjO2BV5I0aKaLz2X9L2r4b+1SYkJPkvAKJ1TCN4lqJZcGsjIl 3gPrsajHcY0koIgZNfQK+ubYTnr9SMSqH1RuVQtoWFBnDw4wzNv5kCDSl3dds3xB 0K/ZWHkSJ83FrKm47tpbyOdBMk89N3WGBg0ZV8SYh3l15k1y6iPVO1Pdc= Received: from jidanni.org (114-41-19-154.dynamic-ip.hinet.net [114.41.19.154]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) (Authenticated sender: jidanni@jidanni.org) by pdx1-sub0-mail-a9.g.dreamhost.com (Postfix) with ESMTPSA id 1EB47861BE for ; Sat, 17 Apr 2021 18:12:30 -0700 (PDT) X-DH-BACKEND: pdx1-sub0-mail-a9 From: =?UTF-8?Q?=E7=A9=8D=E4=B8=B9=E5=B0=BC?= Dan Jacobson Date: Sun, 18 Apr 2021 08:09:24 +0800 Message-ID: <87eef84fnv.5.fsf@jidanni.org> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Received-SPF: pass client-ip=23.83.209.135; envelope-from=jidanni@jidanni.org; helo=olivedrab.birch.relay.mailchannels.net X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H2=-0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-Spam-Score: -1.4 (-) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -2.4 (--) (info "(coreutils) Correct/Incorrect ordering and Expected/Unexpected res= ults") needs to "admit" or "at least mention" that "coreutils just hasn't though= t about FULLWIDTH DIGITs yet." I.e., Europeans are in luck. East Asians are out of luck. $ seq 9 11|sort 10 11 9 $ seq 9 11|sort -n 9 10 11 $ seq 9 11|sort -V 9 10 11 $ echo -e =EF=BC=99\\n=EF=BC=91=EF=BC=90\\n=EF=BC=91=EF=BC=91|sort =EF=BC=91=EF=BC=90 =EF=BC=91=EF=BC=91 =EF=BC=99 $ echo -e =EF=BC=99\\n=EF=BC=91=EF=BC=90\\n=EF=BC=91=EF=BC=91|sort -V =EF=BC=91=EF=BC=90 =EF=BC=91=EF=BC=91 =EF=BC=99 $ echo -e =EF=BC=99\\n=EF=BC=91=EF=BC=90\\n=EF=BC=91=EF=BC=91|sort -n =EF=BC=91=EF=BC=90 =EF=BC=91=EF=BC=91 =EF=BC=99 $ unicode =EF=BC=99 U+FF19 FULLWIDTH DIGIT NINE UTF-8: ef bc 99 UTF-16BE: ff19 Decimal: 9 Octal: \0177431 =EF=BC=99 Category: Nd (Number, Decimal Digit); East Asian width: F (fullwidth) Unicode block: FF00..FFEF; Halfwidth and Fullwidth Forms Numeric value: 9 Digit value: 9 Bidi: EN (European Number) Decomposition: 0039 Nope, LC_ALL=3Dzh_TW.UTF-8 didn't help. sort (GNU coreutils) 8.32