GNU bug report logs - #9569
multibyte: tr: TR operates on bytes, not characters

Previous Next

Package: coreutils;

Reported by: Milos Sramek <sramek.milos <at> gmail.com>

Date: Wed, 21 Sep 2011 19:59:01 UTC

Severity: wishlist

Merged with 9365, 10880, 12192, 13362

Full log


View this message in rfc822 format

From: Milos Sramek <sramek.milos <at> gmail.com>
To: 9569 <at> debbugs.gnu.org
Subject: bug#9569: :upper: and :lower: not working with tr and utf8 characters
Date: Wed, 21 Sep 2011 21:47:55 +0200
Hello,

there seems to be a bug in tr: utf8 characters are not converted using
:upper: and :lower:

For example:
$ echo lľsšcčtťzž | tr [:lower:] [:upper:]
LľSšCčTťZž

AWK does it correctly
$ echo lľsšcčtťzž | awk '{ print toupper($0) }'
LĽSŠCČTŤZŽ

Used system: Ubuntu 10.10, my locale settings are
$ env locale
LANG=sk_SK.utf8
LC_CTYPE="sk_SK.utf8"
LC_NUMERIC=en_US.utf8
LC_TIME="sk_SK.utf8"
LC_COLLATE="sk_SK.utf8"
LC_MONETARY="sk_SK.utf8"
LC_MESSAGES="sk_SK.utf8"
LC_PAPER="sk_SK.utf8"
LC_NAME="sk_SK.utf8"
LC_ADDRESS="sk_SK.utf8"
LC_TELEPHONE="sk_SK.utf8"
LC_MEASUREMENT="sk_SK.utf8"
LC_IDENTIFICATION="sk_SK.utf8"
LC_ALL=

Observed on other systems, too.

thank you
Milos

-- 
email & jabber: sramek.milos <at> gmail.com





This bug report was last modified 6 years and 247 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.