GNU bug report logs - #8598
Bug in uniq?


Package: coreutils

Reported by: emijrp <emijrp <at> gmail.com>

Date: Sat, 30 Apr 2011 17:21:01 UTC

Severity: normal

Tags: notabug

Done: Eric Blake <eblake <at> redhat.com>

Bug is archived. No further changes may be made.



Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):

From: emijrp <emijrp <at> gmail.com>
To: bug-coreutils <at> gnu.org
Subject: Bug in uniq?
Date: Sat, 30 Apr 2011 14:03:22 +0200
Hi all,

I'm not sure if this is a bug.

If I download this file [1], unzip it, and run:

grep "<title>" wikiindexorg-20110409-history.xml | sort | uniq -D

It shows:

    <title>Felix Pleşoianu Wiki</title>
    <title>Felix Pleșoianu Wiki</title>
    <title>ᐧᐃᑭᐱᑎᔭ</title>
    <title>위키낱말사전</title>
    <title>ウィクショナリー</title>
    <title>언사이클로피디어</title>
    <title>ไทย Wikipedia</title>
    <title>한국어 Wikipedia</title>

But these are obviously all different lines. Why does uniq report them as duplicates?

Thanks,
emijrp

[1]
http://code.google.com/p/wikiteam/downloads/detail?name=wikiindexorg-20110409-history.xml.7z
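
A possible way to narrow this down (a sketch, not part of the original report): GNU sort and uniq compare lines using the collation rules of the current locale (LC_COLLATE), so in some UTF-8 locales distinct strings may compare as equal. Forcing the C locale makes the comparison byte-wise. The helper name count_dups_c_locale is hypothetical, and the two sample titles are copied from the output shown in the report.

```shell
#!/bin/sh
# Hypothetical helper: read lines on stdin and count how many of them
# uniq -D (a GNU extension) reports as duplicates when both sort and uniq
# are forced to compare byte-wise via LC_ALL=C.
count_dups_c_locale() {
  LC_ALL=C sort | LC_ALL=C uniq -D | wc -l
}

# The two titles differ in one character (s with cedilla vs. s with comma
# below), so a byte-wise comparison should not treat them as duplicates.
printf '%s\n' \
  '    <title>Felix Pleşoianu Wiki</title>' \
  '    <title>Felix Pleșoianu Wiki</title>' \
  | count_dups_c_locale
```

If the count is zero here but the original pipeline still prints these lines, that would point to the locale's collation rules rather than to the data. Under that assumption, the same check on the real file would be to prefix both sort and uniq in the original command with LC_ALL=C.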

This bug report was last modified 14 years and 82 days ago.


