GNU bug report logs - #12192
multibyte: tr: TR operates on bytes, not characters

Previous Next

Package: coreutils;

Reported by: Michael Stummvoll <michael <at> stummi.org>

Date: Mon, 13 Aug 2012 13:02:02 UTC

Severity: wishlist

Merged with 9365, 9569, 10880, 13362

Full log


View this message in rfc822 format

From: Eric Blake <eblake <at> redhat.com>
To: Michael Stummvoll <michael <at> stummi.org>
Cc: 12192 <at> debbugs.gnu.org
Subject: bug#12192: tr - bytes vs characters
Date: Mon, 13 Aug 2012 07:54:02 -0600
[Message part 1 (text/plain, inline)]
On 08/13/2012 06:52 AM, Michael Stummvoll wrote:
> Hi gnu folks,
> 
> as already known, tr cannot handle multibyte-encodings like utf-8:
> 
>> mst <at> eddie:~$ echo "foo" | tr o ö
>> fÃÃ
> 
> i know, that multibyte encoding support is not needed for
> posix-compilance,

Actually, POSIX _does_ require multi-byte support; it's just that no one
has yet contributed code for this upstream that is easy enough to
maintain and without penalizing single-byte locales.  Patches are welcome.

-- 
Eric Blake   eblake <at> redhat.com    +1-919-301-3266
Libvirt virtualization library http://libvirt.org

[signature.asc (application/pgp-signature, attachment)]

This bug report was last modified 6 years and 249 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.