GNU bug report logs - #12285
uniq on a UTF8 file with roman numerals

Previous Next

Package: coreutils;

Reported by: "P. Michaud" <pierrecmichaud <at> aol.com>

Date: Sun, 26 Aug 2012 19:04:03 UTC

Severity: normal

Tags: notabug

Done: Pádraig Brady <P <at> draigBrady.com>

Bug is archived. No further changes may be made.

Full log


View this message in rfc822 format

From: "P. Michaud" <pierrecmichaud <at> aol.com>
To: 12285 <at> debbugs.gnu.org
Subject: bug#12285: uniq on a UTF8 file with roman numerals
Date: Sun, 26 Aug 2012 13:49:12 -0400 (EDT)
[Message part 1 (text/plain, inline)]
Hello,

I used the command

"uniq -dc myfile.txt'

here are some lines of the output

      2 ☼ turvy
      2 ☼ with gay abandon
      2 ☼ with reckless abandon
     10 ☼ yyⅰ
      9 ☼ yyⅹⅲ
      2 ☼ yyⅺ
     12 ☼ zzⅰ


The three first lines above are correct and correspond to real duplicates lines in the file, but the numbers on the 4 last one are erroneous, each of them correspond to a single line in the file.

Yours faithfully.

Pierre Michaud



[Message part 2 (text/html, inline)]

This bug report was last modified 12 years and 272 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.