GNU bug report logs - #25455
uniq considers all the full-width punctuation and Japanese kana as the same under zh_CN.UTF-8 locale

Previous Next

Package: coreutils;

Reported by: Icenowy Zheng <icenowy <at> aosc.xyz>

Date: Sun, 15 Jan 2017 23:10:01 UTC

Severity: normal

Tags: notabug

Done: Assaf Gordon <assafgordon <at> gmail.com>

Bug is archived. No further changes may be made.

Full log


Message #16 received at control <at> debbugs.gnu.org (full text, mbox):

From: Assaf Gordon <assafgordon <at> gmail.com>
To: 25455 <at> debbugs.gnu.org
Subject: Re: bug#25455: uniq considers all the full-width punctuation and
 Japanese kana as the same under zh_CN.UTF-8 locale
Date: Sun, 28 Oct 2018 01:52:13 -0600
tags 25455 notabug
close 25455
stop

(triaging old bugs)

On 2017-01-20 8:08 p.m., Mike Frysinger wrote:
> On 16 Jan 2017 04:01, Icenowy Zheng wrote:
>> When dealing lines with only a Chinese full-width punctuation or Japanese kana
>> and locale is zh_CN.UTF-8, uniq command will consider all the lines are the
>> same, and wrongly removed different punctuations.
> 
> this is a problem with glibc, not coreutils.  you can follow the upstream bug:
> https://sourceware.org/bugzilla/show_bug.cgi?id=13063

Given the above, and with no further comments
in more than a year, I'm closing this bug.
Discussion can continue by replying to this thread.

-assaf





This bug report was last modified 6 years and 265 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.