GNU bug report logs - #21395
multibyte: cut and Spanish characters

Previous Next

Package: coreutils;

Reported by: Michael Lee <michaellee213 <at> yahoo.com>

Date: Wed, 2 Sep 2015 00:54:02 UTC

Severity: wishlist

Full log


Message #8 received at 21395 <at> debbugs.gnu.org (full text, mbox):

From: Pádraig Brady <P <at> draigBrady.com>
To: Michael Lee <michaellee213 <at> yahoo.com>, 21395 <at> debbugs.gnu.org
Subject: Re: bug#21395: Bug with cut and Spanish characters from text file
 with UTF-8 encoding
Date: Wed, 02 Sep 2015 12:03:10 +0100
On 02/09/15 01:41, Michael Lee wrote:
> When using cut as, "cut -c 1" with a text file with Spanish characters, it does not display those characters.
> For example, the character ã or á will not display if it is the first character and the file is trimmed using the cut command.

Debian/Ubuntu do not use the i18n patch used in Fedora/RHEL/Suse for example,
and so do not support multi-byte characters. Now that i18n patch is
problematic and incomplete, and there are plans to bring the
functionality upstream at some stage:

http://www.pixelbeat.org/docs/coreutils_i18n/

cheers,
Pádraig




This bug report was last modified 6 years and 297 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.