GNU bug report logs - #19242
latest grep considers text files as binary

Previous Next

Package: grep;

Reported by: Thomas Wolff <towo <at> computer.org>

Date: Mon, 1 Dec 2014 18:02:01 UTC

Severity: normal

Done: Paul Eggert <eggert <at> cs.ucla.edu>

Bug is archived. No further changes may be made.

Full log


View this message in rfc822 format

From: help-debbugs <at> gnu.org (GNU bug Tracking System)
To: Thomas Wolff <towo <at> computer.org>
Subject: bug#19242: closed (Re: latest grep considers text files as binary)
Date: Mon, 01 Dec 2014 22:43:02 +0000
[Message part 1 (text/plain, inline)]
Your bug report

#19242: latest grep considers text files as binary

which was filed against the grep package, has been closed.

The explanation is attached below, along with your original report.
If you require more details, please reply to 19242 <at> debbugs.gnu.org.

-- 
19242: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=19242
GNU Bug Tracking System
Contact help-debbugs <at> gnu.org with problems
[Message part 2 (message/rfc822, inline)]
From: Paul Eggert <eggert <at> cs.ucla.edu>
To: 19242-done <at> debbugs.gnu.org
Cc: noritnk <at> kcn.ne.jp
Subject: Re: latest grep considers text files as binary
Date: Mon, 01 Dec 2014 14:41:53 -0800
Also marking Bug#19242 as done, since it's the same as Bug#19241.

[Message part 3 (message/rfc822, inline)]
From: Thomas Wolff <towo <at> computer.org>
To: bug-grep <at> gnu.org
Cc: meyering <at> fb.com, eggert <at> cs.ucla.edu, noritnk <at> kcn.ne.jp
Subject: latest grep considers text files as binary
Date: Mon, 01 Dec 2014 18:05:51 +0100
[Message part 4 (text/plain, inline)]
Since grep 2.21, grep fails to report matches in a UTF-8 file with a few
non-UTF-8 bytes interspersed. This is likely to be related to one of the
recent patches related to encoding or multi-byte issues I see in the 
change log.

I have a number of large UTF-8 source files with some non-UTF-8 characters
used as constants and it was quite useful that grep nonetheless would
simply report the requested matches. Now it claims just
"Binary file ... matches" even if the file contains only one single
non-UTF-8 byte which I consider quite inappropriate.
I would appreciate to get the previous behaviour restored, at least in a
UTF-8 locale, as the mentioned patches are apparently intended to fix
issues in non-UTF-8 locales.

Kind regards,
Thomas
[Message part 5 (text/html, inline)]

This bug report was last modified 10 years and 65 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.