GNU bug report logs - #30574
Support different charsets

Previous Next

Package: diffutils;

Reported by: Victor Porton <porton <at> narod.ru>

Date: Thu, 22 Feb 2018 16:21:02 UTC

Severity: normal

To reply to this bug, email your comments to 30574 AT debbugs.gnu.org.

Toggle the display of automated, internal messages from the tracker.

View this report as an mbox folder, status mbox, maintainer mbox


Report forwarded to bug-diffutils <at> gnu.org:
bug#30574; Package diffutils. (Thu, 22 Feb 2018 16:21:02 GMT) Full text and rfc822 format available.

Acknowledgement sent to Victor Porton <porton <at> narod.ru>:
New bug report received and forwarded. Copy sent to bug-diffutils <at> gnu.org. (Thu, 22 Feb 2018 16:21:02 GMT) Full text and rfc822 format available.

Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):

From: Victor Porton <porton <at> narod.ru>
To: bug-diffutils <at> gnu.org
Subject: Support different charsets
Date: Thu, 22 Feb 2018 18:19:45 +0200
1. We should be able to do `diff` even for source and destination
having different encodings and/or charsets.

Add option for source encoding and destination encoding.

2. The conversion to common encoding should be even with the option to
not fail on wrong or unconvertible characters (like //IGNORE in GNU
iconv), that is replacing unknown characters with a placeholder
character. This is useful to compare wrongly encoded files.

3. More generally we can add filter for every compared file first to
pass through the filter. Item 1 can be implemented by passing `iconv`
or `recode` command as such a filter.




This bug report was last modified 7 years and 117 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.