GNU bug report logs - #30574
Support different charsets

Previous Next

Package: diffutils;

Reported by: Victor Porton <porton <at> narod.ru>

Date: Thu, 22 Feb 2018 16:21:02 UTC

Severity: normal

Full log


View this message in rfc822 format

From: Victor Porton <porton <at> narod.ru>
To: 30574 <at> debbugs.gnu.org
Subject: bug#30574: Support different charsets
Date: Thu, 22 Feb 2018 18:19:45 +0200
1. We should be able to do `diff` even for source and destination
having different encodings and/or charsets.

Add option for source encoding and destination encoding.

2. The conversion to common encoding should be even with the option to
not fail on wrong or unconvertible characters (like //IGNORE in GNU
iconv), that is replacing unknown characters with a placeholder
character. This is useful to compare wrongly encoded files.

3. More generally we can add filter for every compared file first to
pass through the filter. Item 1 can be implemented by passing `iconv`
or `recode` command as such a filter.




This bug report was last modified 7 years and 118 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.