From unknown Sun Jun 22 07:56:11 2025
X-Loop: help-debbugs@gnu.org
Subject: bug#19241: latest grep considers text files as binary
Resent-From: Paul Eggert
Original-Sender: "Debbugs-submit"
Resent-CC: bug-grep@gnu.org
Resent-Date: Mon, 01 Dec 2014 17:30:04 +0000
Resent-Sender: help-debbugs@gnu.org
X-GNU-PR-Message: report 19241
X-GNU-PR-Package: grep
To: towo@computer.org, 19241@debbugs.gnu.org
Cc: meyering@fb.com, noritnk@kcn.ne.jp
X-Debbugs-Original-To: Thomas Wolff , bug-grep@gnu.org
Message-ID: <547CA56B.4070002@cs.ucla.edu>
Date: Mon, 01 Dec 2014 09:29:15 -0800
From: Paul Eggert
Organization: UCLA Computer Science Department
MIME-Version: 1.0
References: <547C9FEF.6090809@computer.org>
In-Reply-To: <547C9FEF.6090809@computer.org>
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 7bit

On 12/01/2014 09:05 AM, Thomas Wolff wrote:
> the mentioned patches are apparently intended to fix
> issues in non-UTF-8 locales.

No, they're also needed for UTF-8 locales I'm afraid. There are some
security issues, not only having to do with grep's internals, but also for
the behavior of downstream programs that may be expecting UTF-8 text.

You can work around the problem with 'grep -a'.

From unknown Sun Jun 22 07:56:11 2025
MIME-Version: 1.0
X-Mailer: MIME-tools 5.503 (Entity 5.503)
X-Loop: help-debbugs@gnu.org
From: help-debbugs@gnu.org (GNU bug Tracking System)
To: Paul Eggert
Subject: bug#19241: closed (Re: bug#19241: latest grep considers text files as binary)
References: <547CA56B.4070002@cs.ucla.edu>
X-Gnu-PR-Message: they-closed 19241
X-Gnu-PR-Package: grep
Reply-To: 19241@debbugs.gnu.org
Date: Mon, 01 Dec 2014 17:54:02 +0000
Content-Type: multipart/mixed; boundary="----------=_1417456442-18720-1"

This is a multi-part message in MIME format...

------------=_1417456442-18720-1
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain; charset="utf-8"

Your bug report

#19241: latest grep considers text files as binary

which was filed against the grep package, has been closed.

The explanation is attached below, along with your original report.
If you require more details, please reply to 19241@debbugs.gnu.org.

--=20
19241: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=3D19241
GNU Bug Tracking System
Contact help-debbugs@gnu.org with problems

------------=_1417456442-18720-1
Content-Type: message/rfc822
Content-Disposition: inline
Content-Transfer-Encoding: 7bit

MIME-Version: 1.0
In-Reply-To: <547CA56B.4070002@cs.ucla.edu>
References: <547C9FEF.6090809@computer.org> <547CA56B.4070002@cs.ucla.edu>
From: Jim Meyering
Date: Mon, 1 Dec 2014 09:53:01 -0800
Subject: Re: bug#19241: latest grep considers text files as binary
To: Paul Eggert
Content-Type: text/plain; charset=ISO-8859-1
Cc: Jim Meyering , towo@computer.org, 19241-done@debbugs.gnu.org

On Mon, Dec 1, 2014 at 9:29 AM, Paul Eggert wrote:
> On 12/01/2014 09:05 AM, Thomas Wolff wrote:
>>
>> the mentioned patches are apparently intended to fix
>> issues in non-UTF-8 locales.
>
>
> No, they're also needed for UTF-8 locales I'm afraid. There are some
> security issues, not only having to do with grep's internals, but also for
> the behavior of downstream programs that may be expecting UTF-8 text.
>
> You can work around the problem with 'grep -a'.

tags 19241 notabug
close 19241
thanks

Thanks for the report, but as you've seen (thanks, Paul), this
is due to documented and desirable behavior.
The above marks this auto-created issue as a non-bug in our tracker.

------------=_1417456442-18720-1--