GNU bug report logs - #19242
latest grep considers text files as binary

Previous Next

Package: grep;

Reported by: Thomas Wolff <towo <at> computer.org>

Date: Mon, 1 Dec 2014 18:02:01 UTC

Severity: normal

Done: Paul Eggert <eggert <at> cs.ucla.edu>

Bug is archived. No further changes may be made.

Full log


View this message in rfc822 format

From: help-debbugs <at> gnu.org (GNU bug Tracking System)
To: Paul Eggert <eggert <at> cs.ucla.edu>
Cc: tracker <at> debbugs.gnu.org
Subject: bug#19242: closed (latest grep considers text files as binary)
Date: Mon, 01 Dec 2014 22:43:01 +0000
[Message part 1 (text/plain, inline)]
Your message dated Mon, 01 Dec 2014 14:41:53 -0800
with message-id <547CEEB1.2070305 <at> cs.ucla.edu>
and subject line Re: latest grep considers text files as binary
has caused the debbugs.gnu.org bug report #19242,
regarding latest grep considers text files as binary
to be marked as done.

(If you believe you have received this mail in error, please contact
help-debbugs <at> gnu.org.)


-- 
19242: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=19242
GNU Bug Tracking System
Contact help-debbugs <at> gnu.org with problems
[Message part 2 (message/rfc822, inline)]
From: Thomas Wolff <towo <at> computer.org>
To: bug-grep <at> gnu.org
Cc: meyering <at> fb.com, eggert <at> cs.ucla.edu, noritnk <at> kcn.ne.jp
Subject: latest grep considers text files as binary
Date: Mon, 01 Dec 2014 18:05:51 +0100
[Message part 3 (text/plain, inline)]
Since grep 2.21, grep fails to report matches in a UTF-8 file with a few
non-UTF-8 bytes interspersed. This is likely to be related to one of the
recent patches related to encoding or multi-byte issues I see in the 
change log.

I have a number of large UTF-8 source files with some non-UTF-8 characters
used as constants and it was quite useful that grep nonetheless would
simply report the requested matches. Now it claims just
"Binary file ... matches" even if the file contains only one single
non-UTF-8 byte which I consider quite inappropriate.
I would appreciate to get the previous behaviour restored, at least in a
UTF-8 locale, as the mentioned patches are apparently intended to fix
issues in non-UTF-8 locales.

Kind regards,
Thomas
[Message part 4 (text/html, inline)]
[Message part 5 (message/rfc822, inline)]
From: Paul Eggert <eggert <at> cs.ucla.edu>
To: 19242-done <at> debbugs.gnu.org
Cc: noritnk <at> kcn.ne.jp
Subject: Re: latest grep considers text files as binary
Date: Mon, 01 Dec 2014 14:41:53 -0800
Also marking Bug#19242 as done, since it's the same as Bug#19241.


This bug report was last modified 10 years and 64 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.