GNU bug report logs - #9569
multibyte: tr: TR operates on bytes, not characters

Previous Next

Package: coreutils;

Reported by: Milos Sramek <sramek.milos <at> gmail.com>

Date: Wed, 21 Sep 2011 19:59:01 UTC

Severity: wishlist

Merged with 9365, 10880, 12192, 13362

To reply to this bug, email your comments to 9569 AT debbugs.gnu.org.

Toggle the display of automated, internal messages from the tracker.

View this report as an mbox folder, status mbox, maintainer mbox


Report forwarded to bug-coreutils <at> gnu.org:
bug#9569; Package coreutils. (Wed, 21 Sep 2011 19:59:02 GMT) Full text and rfc822 format available.

Acknowledgement sent to Milos Sramek <sramek.milos <at> gmail.com>:
New bug report received and forwarded. Copy sent to bug-coreutils <at> gnu.org. (Wed, 21 Sep 2011 19:59:02 GMT) Full text and rfc822 format available.

Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):

From: Milos Sramek <sramek.milos <at> gmail.com>
To: bug-coreutils <at> gnu.org
Subject: :upper: and :lower: not working with tr and utf8 characters
Date: Wed, 21 Sep 2011 21:47:55 +0200
Hello,

there seems to be a bug in tr: utf8 characters are not converted using
:upper: and :lower:

For example:
$ echo lľsšcčtťzž | tr [:lower:] [:upper:]
LľSšCčTťZž

AWK does it correctly
$ echo lľsšcčtťzž | awk '{ print toupper($0) }'
LĽSŠCČTŤZŽ

Used system: Ubuntu 10.10, my locale settings are
$ env locale
LANG=sk_SK.utf8
LC_CTYPE="sk_SK.utf8"
LC_NUMERIC=en_US.utf8
LC_TIME="sk_SK.utf8"
LC_COLLATE="sk_SK.utf8"
LC_MONETARY="sk_SK.utf8"
LC_MESSAGES="sk_SK.utf8"
LC_PAPER="sk_SK.utf8"
LC_NAME="sk_SK.utf8"
LC_ADDRESS="sk_SK.utf8"
LC_TELEPHONE="sk_SK.utf8"
LC_MEASUREMENT="sk_SK.utf8"
LC_IDENTIFICATION="sk_SK.utf8"
LC_ALL=

Observed on other systems, too.

thank you
Milos

-- 
email & jabber: sramek.milos <at> gmail.com





Information forwarded to bug-coreutils <at> gnu.org:
bug#9569; Package coreutils. (Fri, 24 Feb 2012 17:31:03 GMT) Full text and rfc822 format available.

Message #8 received at 9569 <at> debbugs.gnu.org (full text, mbox):

From: "Marton Kadar" <marton.kadar <at> mail.com>
To: 9569 <at> debbugs.gnu.org
Subject: Duplicate of 9365
Date: Fri, 24 Feb 2012 10:08:15 -0500
This is the same error as 
http://debbugs.gnu.org/cgi/bugreport.cgi?bug=9365
although it gives some examples too.




Forcibly Merged 9365 9569. Request was from Paul Eggert <eggert <at> cs.ucla.edu> to control <at> debbugs.gnu.org. (Fri, 24 Feb 2012 18:29:02 GMT) Full text and rfc822 format available.

Forcibly Merged 9365 9569 10880. Request was from Paul Eggert <eggert <at> cs.ucla.edu> to control <at> debbugs.gnu.org. (Fri, 24 Feb 2012 18:33:02 GMT) Full text and rfc822 format available.

Forcibly Merged 9365 9569 10880 12192. Request was from Jim Meyering <jim <at> meyering.net> to control <at> debbugs.gnu.org. (Sat, 15 Sep 2012 10:30:04 GMT) Full text and rfc822 format available.

Forcibly Merged 9365 9569 10880 12192 13362. Request was from Pádraig Brady <P <at> draigBrady.com> to control <at> debbugs.gnu.org. (Sun, 06 Jan 2013 12:24:03 GMT) Full text and rfc822 format available.

Severity set to 'wishlist' from 'normal' Request was from Assaf Gordon <assafgordon <at> gmail.com> to control <at> debbugs.gnu.org. (Mon, 15 Oct 2018 14:07:02 GMT) Full text and rfc822 format available.

Changed bug title to 'multibyte: tr: TR operates on bytes, not characters' from ':upper: and :lower: not working with tr and utf8 characters' Request was from Assaf Gordon <assafgordon <at> gmail.com> to control <at> debbugs.gnu.org. (Mon, 15 Oct 2018 14:07:02 GMT) Full text and rfc822 format available.

This bug report was last modified 6 years and 247 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.