GNU bug report logs - #26806
multibyte: tr: Invalid byte sequence in tr command

Previous Next

Package: coreutils;

Reported by: maximiliam steffens <maxsteffens <at> gmail.com>

Date: Sat, 6 May 2017 16:43:02 UTC

Severity: wishlist

To reply to this bug, email your comments to 26806 AT debbugs.gnu.org.

Toggle the display of automated, internal messages from the tracker.

View this report as an mbox folder, status mbox, maintainer mbox


Report forwarded to bug-coreutils <at> gnu.org:
bug#26806; Package coreutils. (Sat, 06 May 2017 16:43:02 GMT) Full text and rfc822 format available.

Acknowledgement sent to maximiliam steffens <maxsteffens <at> gmail.com>:
New bug report received and forwarded. Copy sent to bug-coreutils <at> gnu.org. (Sat, 06 May 2017 16:43:02 GMT) Full text and rfc822 format available.

Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):

From: maximiliam steffens <maxsteffens <at> gmail.com>
To: bug-coreutils <at> gnu.org
Subject: Invalid byte sequence in tr command
Date: Sat, 6 May 2017 12:07:38 -0300
[Message part 1 (text/plain, inline)]
Trying to convert subtitles with characters (accented) possibly changed by
the opensubtitles.

Converting only the lowercase letters is ok

$cat legenda1.srt | tr 'AЯЖЗсржьзнЩКу' 'AÀÊÔãáéíóôúÇç' > teste.srt

With the capital letters of invalid code page
$cat legenda1.srt | tr '├┴┬╔' 'ÃÁÂÉ' > teste2.srt


subritle anexo
http://www.cdef.com.br/ <eu> <http://www.cdef.com.br/>
http://www.cdef.com.br/busca/ <eu> <http://www.cdef.com.br/busca/>
http://www.mamaeebebeabordo.com.br/<esposa>
http://www.lostinchicklit.com.br/ <filha n°1>
<http://www.lostinchicklit.com.br/>
http://msteffensillustrations.blogspot.com.br/ <filha n°2>
<http://msteffensillustrations.blogspot.com.br/>
[Message part 2 (text/html, inline)]
[legenda1.srt (application/x-subrip, attachment)]

Information forwarded to bug-coreutils <at> gnu.org:
bug#26806; Package coreutils. (Mon, 29 Oct 2018 03:15:01 GMT) Full text and rfc822 format available.

Message #8 received at 26806 <at> debbugs.gnu.org (full text, mbox):

From: Assaf Gordon <assafgordon <at> gmail.com>
To: maximiliam steffens <maxsteffens <at> gmail.com>, 26806 <at> debbugs.gnu.org
Subject: Re: bug#26806: Invalid byte sequence in tr command
Date: Sun, 28 Oct 2018 21:14:22 -0600
severity 26806 wishlist
retitle 26806 multibyte: tr: Invalid byte sequence in tr command
stop


(triaging old bugs)

On 2017-05-06 9:07 a.m., maximiliam steffens wrote:
> Trying to convert subtitles with characters (accented) possibly changed by
> the opensubtitles.
> 
> Converting only the lowercase letters is ok
> 
> $cat legenda1.srt | tr 'AЯЖЗсржьзнЩКу' 'AÀÊÔãáéíóôúÇç' > teste.srt
> 
> With the capital letters of invalid code page
> $cat legenda1.srt | tr '├┴┬╔' 'ÃÁÂÉ' > teste2.srt

It seems your message was lost and not answered to in a year.
Sorry about that.

Regarding 'tr': multibyte/utf-8 support is currently lacking,
but is being worked on.

I'll keep this bug open until it is resolved.

-assaf





Severity set to 'wishlist' from 'normal' Request was from Assaf Gordon <assafgordon <at> gmail.com> to control <at> debbugs.gnu.org. (Mon, 29 Oct 2018 03:15:02 GMT) Full text and rfc822 format available.

Changed bug title to 'multibyte: tr: Invalid byte sequence in tr command' from 'Invalid byte sequence in tr command' Request was from Assaf Gordon <assafgordon <at> gmail.com> to control <at> debbugs.gnu.org. (Mon, 29 Oct 2018 03:15:02 GMT) Full text and rfc822 format available.

This bug report was last modified 6 years and 320 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.