From unknown Sat Jun 21 03:14:09 2025
From: bug#19241 <19241@debbugs.gnu.org>
To: bug#19241 <19241@debbugs.gnu.org>
Subject: Status: latest grep considers text files as binary
Reply-To: bug#19241 <19241@debbugs.gnu.org>
Date: Sat, 21 Jun 2025 10:14:09 +0000

retitle 19241 latest grep considers text files as binary
reassign 19241 grep
submitter 19241 Paul Eggert
severity 19241 normal
thanks

From debbugs-submit-bounces@debbugs.gnu.org Mon Dec 01 12:29:54 2014
Message-ID: <547CA56B.4070002@cs.ucla.edu>
Date: Mon, 01 Dec 2014 09:29:15 -0800
From: Paul Eggert
Organization: UCLA Computer Science Department
To: Thomas Wolff, bug-grep@gnu.org
Cc: meyering@fb.com, noritnk@kcn.ne.jp
Subject: Re: latest grep considers text files as binary
References: <547C9FEF.6090809@computer.org>
In-Reply-To: <547C9FEF.6090809@computer.org>

On 12/01/2014 09:05 AM, Thomas Wolff wrote:
> the mentioned patches are apparently intended to fix
> issues in non-UTF-8 locales.

No, they're also needed for UTF-8 locales I'm afraid.  There are some
security issues, not only having to do with grep's internals, but also for
the behavior of downstream programs that may be expecting UTF-8 text.

You can work around the problem with 'grep -a'.

From debbugs-submit-bounces@debbugs.gnu.org Mon Dec 01 12:53:24 2014
From: Jim Meyering
Date: Mon, 1 Dec 2014 09:53:01 -0800
To: Paul Eggert
Cc: Jim Meyering, towo@computer.org, 19241-done@debbugs.gnu.org
Subject: Re: bug#19241: latest grep considers text files as binary
References: <547C9FEF.6090809@computer.org> <547CA56B.4070002@cs.ucla.edu>
In-Reply-To: <547CA56B.4070002@cs.ucla.edu>

On Mon, Dec 1, 2014 at 9:29 AM, Paul Eggert wrote:
> On 12/01/2014 09:05 AM, Thomas Wolff wrote:
>>
>> the mentioned patches are apparently intended to fix
>> issues in non-UTF-8 locales.
>
>
> No, they're also needed for UTF-8 locales I'm afraid.  There are some
> security issues, not only having to do with grep's internals, but also for
> the behavior of downstream programs that may be expecting UTF-8 text.
>
> You can work around the problem with 'grep -a'.

tags 19241 notabug
close 19241
thanks

Thanks for the report, but as you've seen (thanks, Paul), this
is due to documented and desirable behavior.
The above marks this auto-created issue as a non-bug in our tracker.

From unknown Sat Jun 21 03:14:09 2025
Received: (at fakecontrol) by fakecontrolmessage;
To: internal_control@debbugs.gnu.org
From: Debbugs Internal Request
Subject: Internal Control
Message-Id: bug archived.
Date: Tue, 30 Dec 2014 12:24:04 +0000
User-Agent: Fakemail v42.6.9

# This is a fake control message.
#
# The action:
# bug archived.
thanks
# This fakemail brought to you by your local debbugs
# administrator
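
The behavior discussed in this thread, and the 'grep -a' workaround, can be illustrated with a rough Python sketch. It is not grep's implementation: grep is written in C and decides based on the active locale, whereas the sketch assumes UTF-8 and checks only for NUL bytes and decoding errors. The names looks_binary and search, and the force_text parameter (standing in for -a), are hypothetical and exist only for this illustration.

```python
def looks_binary(data, encoding="utf-8"):
    """Approximate check: NUL bytes or bytes that do not decode cleanly.

    Assumption: the locale's encoding is UTF-8. Real grep consults the
    current locale instead of a fixed encoding.
    """
    if b"\x00" in data:
        return True
    try:
        data.decode(encoding)
    except UnicodeDecodeError:
        return True
    return False


def search(data, needle, force_text=False):
    """Hypothetical grep-like search over raw bytes.

    With force_text=False, data that looks binary yields only a notice
    when something matches. With force_text=True (the analogue of -a),
    the matching lines are returned as raw bytes.
    """
    matches = [line for line in data.splitlines() if needle in line]
    if matches and looks_binary(data) and not force_text:
        return [b"Binary file matches"]
    return matches


# A Latin-1 encoded file: the byte 0xE9 is not valid UTF-8 on its own.
latin1_text = "café au lait\nplain text\n".encode("latin-1")

print(search(latin1_text, b"lait"))                   # notice only
print(search(latin1_text, b"lait", force_text=True))  # raw matching line
```

Note that with force_text=True the matching line comes back with its invalid byte intact, so anything downstream that expects UTF-8 text still has to cope with it. That is the downstream concern raised earlier in the thread.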