GNU bug report logs - #12192
multibyte: tr: TR operates on bytes, not characters

Previous Next

Package: coreutils;

Reported by: Michael Stummvoll <michael <at> stummi.org>

Date: Mon, 13 Aug 2012 13:02:02 UTC

Severity: wishlist

Merged with 9365, 9569, 10880, 13362

Full log


View this message in rfc822 format

From: Eric Blake <eblake <at> redhat.com>
To: Paul Eggert <eggert <at> cs.ucla.edu>
Cc: 12192 <at> debbugs.gnu.org, Michael Stummvoll <michael <at> stummi.org>
Subject: bug#12192: tr - bytes vs characters
Date: Mon, 13 Aug 2012 23:34:16 -0600
[Message part 1 (text/plain, inline)]
On 08/13/2012 08:45 PM, Paul Eggert wrote:
> On 08/13/2012 06:54 AM, Eric Blake wrote:
>> POSIX _does_ require multi-byte support
> 
> The last time I checked, POSIX did not require
> the implementation to provide any multibyte locales.
> Has this changed?

Fair enough - POSIX does not require the existence of a multibyte
locale; an embedded system that provides only single-byte encodings can
still be POSIX-compliant.  But POSIX _does_ require that tr be
locale-aware, and therefore if an implementation provides multibyte
locales (which most desktop glibc-based GNU/Linux systems do), then tr
should honor those locales, including multibyte character support.

> 
> But yes, the main thing is for someone to contribute
> correct, easy-to-maintain, and efficient code.

We're in violent agreement on this point :)

-- 
Eric Blake   eblake <at> redhat.com    +1-919-301-3266
Libvirt virtualization library http://libvirt.org

[signature.asc (application/pgp-signature, attachment)]

This bug report was last modified 6 years and 249 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.