From unknown Fri Aug 15 16:58:06 2025 X-Loop: help-debbugs@gnu.org Subject: bug#25455: uniq considers all the full-width punctuation and Japanese kana as the same under zh_CN.UTF-8 locale Resent-From: Icenowy Zheng Original-Sender: "Debbugs-submit" Resent-CC: bug-coreutils@gnu.org Resent-Date: Sun, 15 Jan 2017 23:10:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: report 25455 X-GNU-PR-Package: coreutils X-GNU-PR-Keywords: To: 25455@debbugs.gnu.org Cc: arthur2e5@aosc.xyz X-Debbugs-Original-To: bug-coreutils@gnu.org Received: via spool by submit@debbugs.gnu.org id=B.148452176722592 (code B ref -1); Sun, 15 Jan 2017 23:10:01 +0000 Received: (at submit) by debbugs.gnu.org; 15 Jan 2017 23:09:27 +0000 Received: from localhost ([127.0.0.1]:55103 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1cStvS-0005sJ-2K for submit@debbugs.gnu.org; Sun, 15 Jan 2017 18:09:27 -0500 Received: from eggs.gnu.org ([208.118.235.92]:33638) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1cSqzT-0001bg-J3 for submit@debbugs.gnu.org; Sun, 15 Jan 2017 15:01:25 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1cSqzN-0002qD-PY for submit@debbugs.gnu.org; Sun, 15 Jan 2017 15:01:18 -0500 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=0.8 required=5.0 tests=BAYES_50,T_DKIM_INVALID autolearn=disabled version=3.3.2 Received: from lists.gnu.org ([2001:4830:134:3::11]:59342) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_256_CBC_SHA1:32) (Exim 4.71) (envelope-from ) id 1cSqzN-0002q8-NJ for submit@debbugs.gnu.org; Sun, 15 Jan 2017 15:01:17 -0500 Received: from eggs.gnu.org ([2001:4830:134:3::10]:52817) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1cSqzM-0001fs-KQ for bug-coreutils@gnu.org; Sun, 15 Jan 2017 15:01:17 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1cSqzJ-0002nO-Jz for bug-coreutils@gnu.org; Sun, 15 Jan 2017 15:01:16 -0500 Received: from forward13p.cmail.yandex.net ([87.250.241.140]:56039) by eggs.gnu.org with esmtps (TLS1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.71) (envelope-from ) id 1cSqzI-0002mR-NS for bug-coreutils@gnu.org; Sun, 15 Jan 2017 15:01:13 -0500 Received: from mxback9m.mail.yandex.net (mxback9m.mail.yandex.net [IPv6:2a02:6b8:0:2519::112]) by forward13p.cmail.yandex.net (Yandex) with ESMTP id 79714218CD for ; Sun, 15 Jan 2017 23:01:06 +0300 (MSK) Received: from web19m.yandex.ru (web19m.yandex.ru [37.140.138.110]) by mxback9m.mail.yandex.net (nwsmtp/Yandex) with ESMTP id KryUjqYHhb-15KW5Yje; Sun, 15 Jan 2017 23:01:05 +0300 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=aosc.xyz; s=mail; t=1484510465; bh=7RBkGuvJAsheY3MVLfbIHrD8lVNtJ5Pj9jcAHACHyt0=; h=From:To:Cc:Subject:Message-Id:Date; b=Fkm3PQHmwHNJTjIaOk7ONKQNts/S04TmHLX2BdSiTwmyj4boDWwBByD2vn8bfk3kg E4HSxXkx35PM6RnZORIIPx1VVH6NfuVbbj6V7W8Fzcw34sQ9LOL1EpLQ8/45TKMYV5 OkOql0jaKXqlD3T5Gwky4Hp+Lxj17QuAk/c5YG5I= Authentication-Results: mxback9m.mail.yandex.net; dkim=pass header.i=@aosc.xyz Received: by web19m.yandex.ru with HTTP; Sun, 15 Jan 2017 23:01:05 +0300 From: Icenowy Zheng MIME-Version: 1.0 Message-Id: <5170141484510465@web19m.yandex.ru> X-Mailer: Yamail [ http://yandex.ru ] 5.0 Date: Mon, 16 Jan 2017 04:01:05 +0800 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] [fuzzy] X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-Received-From: 2001:4830:134:3::11 X-Spam-Score: -4.0 (----) X-Mailman-Approved-At: Sun, 15 Jan 2017 18:09:25 -0500 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -4.0 (----) Problem: When dealing lines with only a Chinese full-width punctuation or Japanese= kana and locale is zh_CN.UTF-8, uniq command will consider all the lines are t= he same, and wrongly removed different punctuations. Reproduce steps: Run the following command: ``` printf "%s\n" =EF=BC=8C =E3=80=82 =EF=BC=9A =EF=BF=A5 =E3=81=82 =E3=81=8B= =E3=82=A2 =E3=82=AB a b c , . : $ | LC_ALL=3Dzh_CN.UTF-8 uniq ``` Comments: The printf command prints out ``` =EF=BC=8C =E3=80=82 =EF=BC=9A =EF=BF=A5 =E3=81=82 =E3=81=8B =E3=82=A2 =E3=82=AB a b c , . : $ ``` Every line is different. However, after uniq command, it gives out ``` =EF=BC=8C a b c , . : $ ``` Under zh_TW.UTF-8 locale, the problems also happens; but under ja_JP.UTF-= 8 or C it do not happen. Version info: ``` $ uniq --version uniq (GNU coreutils) 8.26 ... ... $ /lib/libc.so.6=20 GNU C Library (2.24-2_AOSC_OS) stable release version 2.24, by Roland McG= rath et al. ... ... ``` Architecture: on x86_64 and armv7l architectures the test fails. From unknown Fri Aug 15 16:58:06 2025 X-Loop: help-debbugs@gnu.org Subject: bug#25455: uniq considers all the full-width punctuation and Japanese kana as the same under zh_CN.UTF-8 locale Resent-From: "Mingye Wang (Arthur2e5)" Original-Sender: "Debbugs-submit" Resent-CC: bug-coreutils@gnu.org Resent-Date: Tue, 17 Jan 2017 19:27:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 25455 X-GNU-PR-Package: coreutils X-GNU-PR-Keywords: To: icenowy@aosc.xyz, 25455@debbugs.gnu.org X-Debbugs-Original-To: Icenowy Zheng , "bug-coreutils@gnu.org" Received: via spool by submit@debbugs.gnu.org id=B.148468122027466 (code B ref -1); Tue, 17 Jan 2017 19:27:02 +0000 Received: (at submit) by debbugs.gnu.org; 17 Jan 2017 19:27:00 +0000 Received: from localhost ([127.0.0.1]:56695 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1cTZPH-00078v-FB for submit@debbugs.gnu.org; Tue, 17 Jan 2017 14:27:00 -0500 Received: from eggs.gnu.org ([208.118.235.92]:42702) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1cTYPV-0005cq-Ak for submit@debbugs.gnu.org; Tue, 17 Jan 2017 13:23:10 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1cTYPM-0005kd-IC for submit@debbugs.gnu.org; Tue, 17 Jan 2017 13:23:04 -0500 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=0.8 required=5.0 tests=BAYES_50,T_DKIM_INVALID autolearn=disabled version=3.3.2 Received: from lists.gnu.org ([2001:4830:134:3::11]:45984) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_256_CBC_SHA1:32) (Exim 4.71) (envelope-from ) id 1cTYPM-0005kT-FU for submit@debbugs.gnu.org; Tue, 17 Jan 2017 13:23:00 -0500 Received: from eggs.gnu.org ([2001:4830:134:3::10]:33623) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1cTYPL-0003i5-Il for bug-coreutils@gnu.org; Tue, 17 Jan 2017 13:23:00 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1cTYPG-0005iA-MH for bug-coreutils@gnu.org; Tue, 17 Jan 2017 13:22:59 -0500 Received: from forward16m.cmail.yandex.net ([5.255.216.147]:40946) by eggs.gnu.org with esmtps (TLS1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.71) (envelope-from ) id 1cTYPG-0005h0-8Q for bug-coreutils@gnu.org; Tue, 17 Jan 2017 13:22:54 -0500 Received: from mxback10g.mail.yandex.net (mxback10g.mail.yandex.net [IPv6:2a02:6b8:0:1472:2741:0:8b7:171]) by forward16m.cmail.yandex.net (Yandex) with ESMTP id CA51A21D70 for ; Tue, 17 Jan 2017 21:22:48 +0300 (MSK) Received: from web7g.yandex.ru (web7g.yandex.ru [95.108.252.107]) by mxback10g.mail.yandex.net (nwsmtp/Yandex) with ESMTP id HKKG5Sqz1S-Mm24SslW; Tue, 17 Jan 2017 21:22:48 +0300 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=aosc.xyz; s=mail; t=1484677368; bh=ZNQq220cDu4kRAZm3NulcU5AVtE+nF4AOwQgUnKZR9c=; h=From:To:In-Reply-To:References:Subject:Message-Id:Date; b=Gv79oYZzE81VIeWMJXfADxYK8SL4+lwz9ozDtED7ZDWIdNO/LFGHybewrRATft1+A aW0yQV1zNKBxjDrP57uG++s1gGUSVYu5a341GhkeSNiMkVQ+M2/FjSK91/17VdlSep sLl510D+1QX13C233irb6pOA1QX6Y33+vWSmwlf4= Authentication-Results: mxback10g.mail.yandex.net; dkim=pass header.i=@aosc.xyz Received: by web7g.yandex.ru with HTTP; Tue, 17 Jan 2017 21:22:48 +0300 From: "Mingye Wang (Arthur2e5)" In-Reply-To: <5170141484510465@web19m.yandex.ru> References: <5170141484510465@web19m.yandex.ru> MIME-Version: 1.0 Message-Id: <6413941484677368@web7g.yandex.ru> X-Mailer: Yamail [ http://yandex.ru ] 5.0 Date: Tue, 17 Jan 2017 18:22:48 +0000 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] [fuzzy] X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-Received-From: 2001:4830:134:3::11 X-Spam-Score: -4.0 (----) X-Mailman-Approved-At: Tue, 17 Jan 2017 14:26:58 -0500 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -4.0 (----) 15.01.2017, 20:01, "Icenowy Zheng" : > Problem: > When dealing lines with only a Chinese full-width punctuation or Japane= se kana > and locale is zh_CN.UTF-8, uniq command will consider all the lines are= the > same, and wrongly removed different punctuations. To narrow the scope down a bit, I should mention that LC_COLLATE is enoug= h to trigger the bug: printf '%s\n' =E3=80=82 =EF=BC=8C =EF=BC=9F =EF=BC=81 a b c | LC_COLLATE=3D= zh_CN.UTF-8 uniq --=20 Regards, Arthur2e5 From unknown Fri Aug 15 16:58:06 2025 X-Loop: help-debbugs@gnu.org Subject: bug#25455: uniq considers all the full-width punctuation and Japanese kana as the same under zh_CN.UTF-8 locale Resent-From: Mike Frysinger Original-Sender: "Debbugs-submit" Resent-CC: bug-coreutils@gnu.org Resent-Date: Sat, 21 Jan 2017 03:09:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 25455 X-GNU-PR-Package: coreutils X-GNU-PR-Keywords: To: Icenowy Zheng Cc: 25455@debbugs.gnu.org, arthur2e5@aosc.xyz Received: via spool by 25455-submit@debbugs.gnu.org id=B25455.148496812221125 (code B ref 25455); Sat, 21 Jan 2017 03:09:02 +0000 Received: (at 25455) by debbugs.gnu.org; 21 Jan 2017 03:08:42 +0000 Received: from localhost ([127.0.0.1]:36804 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1cUm2k-0005Uf-8n for submit@debbugs.gnu.org; Fri, 20 Jan 2017 22:08:42 -0500 Received: from smtp.gentoo.org ([140.211.166.183]:42672) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1cUm2i-0005UO-7o for 25455@debbugs.gnu.org; Fri, 20 Jan 2017 22:08:41 -0500 Received: from vapier (localhost [127.0.0.1]) by smtp.gentoo.org (Postfix) with SMTP id A607A34164E; Sat, 21 Jan 2017 03:08:33 +0000 (UTC) Date: Fri, 20 Jan 2017 22:08:33 -0500 From: Mike Frysinger Message-ID: <20170121030833.GO31632@vapier> Mail-Followup-To: Icenowy Zheng , 25455@debbugs.gnu.org, arthur2e5@aosc.xyz References: <5170141484510465@web19m.yandex.ru> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha256; protocol="application/pgp-signature"; boundary="fmvA4kSBHQVZhkR6" Content-Disposition: inline In-Reply-To: <5170141484510465@web19m.yandex.ru> X-Spam-Score: -8.2 (--------) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -8.2 (--------) --fmvA4kSBHQVZhkR6 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline On 16 Jan 2017 04:01, Icenowy Zheng wrote: > When dealing lines with only a Chinese full-width punctuation or Japanese kana > and locale is zh_CN.UTF-8, uniq command will consider all the lines are the > same, and wrongly removed different punctuations. this is a problem with glibc, not coreutils. you can follow the upstream bug: https://sourceware.org/bugzilla/show_bug.cgi?id=13063 -mike --fmvA4kSBHQVZhkR6 Content-Type: application/pgp-signature; name="signature.asc" Content-Description: Digital signature -----BEGIN PGP SIGNATURE----- iQIzBAEBCAAdFiEEuQK1JxMl+JKsJRrUQWM7n+g39YEFAliC0LAACgkQQWM7n+g3 9YGunQ//cXiN8ks9ZHZqWYT3kV7r0J0RKg2jKXu9jCn+VLVz5a1ff0I+5J5xoYUn Uo1cmp4GeH9Nxb/Q7md2YgF38FQ7lTBA3Z0kOPBSlEBIc0KdTSlIbkgoG3pXKTnE 2Mh6uKI/6fy6iMrYg/+pgBJjzzo11Gk7r3tgJgoCxg+Q/GfS8uhIW8p5M2FxyzY0 r8gaWiiuutWumjXFMUuGP9cxNDtd28+M+GBr2uGBpscnpQmm+OJJ8S07fNILTtzK oIqNLYfQ9uho8H444yErgc8JdhbZ7ilzubxmNw267qQiC3rtzFuQVZRV5huJazo6 3qUlzGCEBGsAawXxnDnC7wgIZNaKO++yU+02Gmajctis32LiBw7Rzk+Rtp5ACU6I tXfHh6Uu6HHxVeUN+nVadhmfi1PM+lGRdB12AGnJ8pCjqe0m/1rnouGVtAXj+QUM dIzumFOzwwZFX8ir8IpLP+qZs3jI7BY/I0GSgGXZW9x1wzd5qFA5L/a+wJxzUVso NcVFM29Wl8nLs34c3A0wn/LgU1bRlOSpMR7wI5TKhx8A4VNe3G0R2zNNgkkDmMAN cxZ8M29D42xKHR8dMmj5obWjmWLjfQYj55aUiejNGfSb8SA0n/rdiOXBzfu2iInO Fdox/JTgKQkt0XfmS6x8JMmyaTNshSIOP5EhVdDjmL7CD7/ryRo= =wyuG -----END PGP SIGNATURE----- --fmvA4kSBHQVZhkR6-- From unknown Fri Aug 15 16:58:06 2025 X-Loop: help-debbugs@gnu.org Subject: bug#25455: uniq considers all the full-width punctuation and Japanese kana as the same under zh_CN.UTF-8 locale Resent-From: Assaf Gordon Original-Sender: "Debbugs-submit" Resent-CC: bug-coreutils@gnu.org Resent-Date: Sun, 28 Oct 2018 07:53:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 25455 X-GNU-PR-Package: coreutils X-GNU-PR-Keywords: To: 25455@debbugs.gnu.org Received: via spool by 25455-submit@debbugs.gnu.org id=B25455.15407131451489 (code B ref 25455); Sun, 28 Oct 2018 07:53:01 +0000 Received: (at 25455) by debbugs.gnu.org; 28 Oct 2018 07:52:25 +0000 Received: from localhost ([127.0.0.1]:46532 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1gGfs1-0000No-6Q for submit@debbugs.gnu.org; Sun, 28 Oct 2018 03:52:25 -0400 Received: from mail-it1-f193.google.com ([209.85.166.193]:52898) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1gGfry-0000NU-Rn; Sun, 28 Oct 2018 03:52:23 -0400 Received: by mail-it1-f193.google.com with SMTP id r5-v6so4116898ith.2; Sun, 28 Oct 2018 00:52:22 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=subject:to:references:from:message-id:date:user-agent:mime-version :in-reply-to:content-language:content-transfer-encoding; bh=T3XHAJ8PMmxRZjBqUguxm/VbhEm4z048r++3CVjGw0s=; b=eGXhnutux7Hh/caU3Zss4F8hNWZ8CsCV3iFN9HDL/dTtd3bSxOxqHFymUIYpPU1AYD hmNrz1kdjcv/SqagYfZUwSCA0t+ttOMTOWL9kvY2oubCziNAQr9gV8wid186yAhswqrQ OpzJvk9iPvz9VVZSgfir7LCChL1SnVGwsIKPFQ99YhJKYra2UQG7BI5YARp5l6s014Vh Vb38ZWTiCWQOUpPMUCezkWtK21rRaMmggbbPuyAj5X4qCHrPVA4SzRLw1GFakR2qDiJY 3xk/HNb7Y1CNSEs9r+z4xWcA9dAz1XOmYrz8vsxC+SOQtA0iZnAyLVUtVeF9NZEBO8B3 e9Bw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-language :content-transfer-encoding; bh=T3XHAJ8PMmxRZjBqUguxm/VbhEm4z048r++3CVjGw0s=; b=onmuYtzYif+xtvjUkeAm0RLIZwuvwHJdP9M32FpBbfe9IIkxYGQhYLzbgkMRD2bsEh T0uLObMsIPsaUo+sj4evtxLtpp2jGjqOUgCSonK0/WVRkDrmR7O2Pre9dg2+wr8eghxY 78iQeELWs4n7Y73wSRRj8LlGY9d6VrYFHlrzy5E7fO4ae2xaV3xhlCL4UzcmZ1bKq2Y6 HP1YwqbPgkvfbGd1seXRocNpsIXHyRldYU+eD31lpYlTPIEpPZbY1X/O6qc47WAVL9A1 nXuazxCq/eBSE/5DThf6pNU+/7W4hk+/L/Y7R/0DaP2fDHO3Dyq4l/ItaG9cw5047hry tKiQ== X-Gm-Message-State: AGRZ1gK09m4ZM7DNI2JD3GlfTOPPJpdKy5X7o7NXC8PSQef7niQuK4Nr 8J6MECWnfXPrxwwCwI2PpLaAkmyX+Gc= X-Google-Smtp-Source: AJdET5clfyrJO0CyqmoUK7pdYcgFPjaf8pd3vMgvbc7f8zIBuVjnFn66bGIjY5635lUrJrjZvkdSng== X-Received: by 2002:a02:1649:: with SMTP id a70-v6mr6955586jaa.128.1540713136804; Sun, 28 Oct 2018 00:52:16 -0700 (PDT) Received: from tomato.housegordon.com (moose.housegordon.com. [184.68.105.38]) by smtp.googlemail.com with ESMTPSA id e78-v6sm5475376itc.4.2018.10.28.00.52.14 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Sun, 28 Oct 2018 00:52:15 -0700 (PDT) References: <5170141484510465@web19m.yandex.ru> <20170121030833.GO31632@vapier> From: Assaf Gordon Message-ID: <5e424526-6300-5165-fd97-4623541e73e8@gmail.com> Date: Sun, 28 Oct 2018 01:52:13 -0600 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.2.1 MIME-Version: 1.0 In-Reply-To: <20170121030833.GO31632@vapier> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit X-Spam-Score: -0.0 (/) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) tags 25455 notabug close 25455 stop (triaging old bugs) On 2017-01-20 8:08 p.m., Mike Frysinger wrote: > On 16 Jan 2017 04:01, Icenowy Zheng wrote: >> When dealing lines with only a Chinese full-width punctuation or Japanese kana >> and locale is zh_CN.UTF-8, uniq command will consider all the lines are the >> same, and wrongly removed different punctuations. > > this is a problem with glibc, not coreutils. you can follow the upstream bug: > https://sourceware.org/bugzilla/show_bug.cgi?id=13063 Given the above, and with no further comments in more than a year, I'm closing this bug. Discussion can continue by replying to this thread. -assaf