From debbugs-submit-bounces@debbugs.gnu.org Thu Sep 11 22:20:26 2014 Received: (at submit) by debbugs.gnu.org; 12 Sep 2014 02:20:26 +0000 Received: from localhost ([127.0.0.1]:38706 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1XSGTI-0005hi-Ed for submit@debbugs.gnu.org; Thu, 11 Sep 2014 22:20:25 -0400 Received: from eggs.gnu.org ([208.118.235.92]:49315) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1XSFeT-0004Oc-GX for submit@debbugs.gnu.org; Thu, 11 Sep 2014 21:27:54 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1XSFeK-0006dd-ET for submit@debbugs.gnu.org; Thu, 11 Sep 2014 21:27:53 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=0.0 required=5.0 tests=BAYES_40,FREEMAIL_FROM, T_DKIM_INVALID autolearn=disabled version=3.3.2 Received: from lists.gnu.org ([208.118.235.17]:36066) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1XSFeK-0006dZ-CH for submit@debbugs.gnu.org; Thu, 11 Sep 2014 21:27:44 -0400 Received: from eggs.gnu.org ([2001:4830:134:3::10]:50992) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1XSFeB-0000NZ-4x for bug-grep@gnu.org; Thu, 11 Sep 2014 21:27:44 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1XSFe2-0006Yy-3Y for bug-grep@gnu.org; Thu, 11 Sep 2014 21:27:35 -0400 Received: from mail-ig0-x235.google.com ([2607:f8b0:4001:c05::235]:36809) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1XSFe1-0006Yn-UF for bug-grep@gnu.org; Thu, 11 Sep 2014 21:27:26 -0400 Received: by mail-ig0-f181.google.com with SMTP id h3so223699igd.8 for ; Thu, 11 Sep 2014 18:27:24 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=from:content-type:content-transfer-encoding:subject:message-id:date :to:mime-version; bh=DcxS9R8YaqkoJ6s7j1mWEa9w3xaEVAiMPjy4TEz6Yqg=; b=gmjDT5pa6ljairCK0sKtkPlm+6mHFrDKaslTMVJsjrISRitYPWFBr9QR8hcZQcBoY+ NtCEfdXolT+Ae0tQqS4/R6NQO0OX8ZQ1PnnURmwmBn+7r/Xz/0/1P8O/EsDXlsONS0H2 e6OIvsRu5hmx4wxPNZja4l6Aw8medkAxs7nTp/mOGxELpjkHcPgqWqsF8GHhf5fj3lLw fFoKexWVEFfjWaGpeB6WmyGdxVcF2EGU1L6rAcDTqAWg/RrhI6brcqFxdN2Ld+5AYy+I gePdOuSeAPitD7ITt15c0tdE40yM0icGXHrjXQukhyunvX8RKrf/qViWIMUE2nrjHHew dYiQ== X-Received: by 10.50.13.100 with SMTP id g4mr12781729igc.44.1410485244581; Thu, 11 Sep 2014 18:27:24 -0700 (PDT) Received: from [192.168.0.10] (CPE0011952cb9e9-CM00111ae2bb46.cpe.net.cable.rogers.com. [99.236.206.131]) by mx.google.com with ESMTPSA id vn5sm218012igb.1.2014.09.11.18.27.23 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Thu, 11 Sep 2014 18:27:24 -0700 (PDT) From: Mario Grgic Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable Subject: grep 2.20 perl-regexp: invalid UTF-8 byte sequence in input Message-Id: <7CF2AB4D-3BDF-452E-8B98-3A869CEA5431@gmail.com> Date: Thu, 11 Sep 2014 21:27:22 -0400 To: bug-grep@gnu.org Mime-Version: 1.0 (Mac OS X Mail 7.3 \(1878.6\)) X-Mailer: Apple Mail (2.1878.6) X-detected-operating-system: by eggs.gnu.org: Error: Malformed IPv6 address (bad octet value). X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-Received-From: 208.118.235.17 X-Spam-Score: -4.0 (----) X-Debbugs-Envelope-To: submit X-Mailman-Approved-At: Thu, 11 Sep 2014 22:20:22 -0400 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -4.0 (----) This happens with GNU grep version 2.20 and PCRE 8.35 on Mac OS X. The = following command reproduce the problem:=20 $ printf 'j\x82\nj\n' | grep -P j invalid UTF-8 byte sequence in input But I usually encounter this when recursively searching through files = and encountering a binary file which contains invalid UTF-8 sequence. = If binary file with invalid UTF-8 sequence is encountered first (without = any other matches), grep will abort the entire recursive search and not = even mention which file caused the error. This is somewhat confusing = when you first encounter it.=20 By the way, this works in GNU grep 2.18 without any errors (you get = messages like binary file x matches), and with PCRE 8.33 or 8.35 (I have = not tried any other combinations).=20 From debbugs-submit-bounces@debbugs.gnu.org Thu Sep 11 23:40:44 2014 Received: (at 18455) by debbugs.gnu.org; 12 Sep 2014 03:40:44 +0000 Received: from localhost ([127.0.0.1]:38743 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1XSHj2-0007je-1j for submit@debbugs.gnu.org; Thu, 11 Sep 2014 23:40:44 -0400 Received: from smtp.cs.ucla.edu ([131.179.128.62]:45974) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1XSHiz-0007jV-Hd for 18455@debbugs.gnu.org; Thu, 11 Sep 2014 23:40:42 -0400 Received: from localhost (localhost.localdomain [127.0.0.1]) by smtp.cs.ucla.edu (Postfix) with ESMTP id C54E3A6001D; Thu, 11 Sep 2014 20:40:40 -0700 (PDT) X-Virus-Scanned: amavisd-new at smtp.cs.ucla.edu Received: from smtp.cs.ucla.edu ([127.0.0.1]) by localhost (smtp.cs.ucla.edu [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id beIM38sjTNUk; Thu, 11 Sep 2014 20:40:36 -0700 (PDT) Received: from [192.168.1.9] (pool-71-177-17-123.lsanca.dsl-w.verizon.net [71.177.17.123]) by smtp.cs.ucla.edu (Postfix) with ESMTPSA id 08248A60001; Thu, 11 Sep 2014 20:40:36 -0700 (PDT) Message-ID: <54126B33.3060803@cs.ucla.edu> Date: Thu, 11 Sep 2014 20:40:35 -0700 From: Paul Eggert Organization: UCLA Computer Science Department User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.1.1 MIME-Version: 1.0 To: Mario Grgic Subject: grep 2.20 perl-regexp: invalid UTF-8 byte sequence in input Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Score: -4.8 (----) X-Debbugs-Envelope-To: 18455 Cc: 18455@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -4.8 (----) This appears to be the same as Bug#18266: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=18266 which means it's fixed in the master version and the fix should appear in the next release. From debbugs-submit-bounces@debbugs.gnu.org Thu Sep 11 23:42:03 2014 Received: (at control) by debbugs.gnu.org; 12 Sep 2014 03:42:03 +0000 Received: from localhost ([127.0.0.1]:38747 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1XSHkI-0007mG-Tw for submit@debbugs.gnu.org; Thu, 11 Sep 2014 23:42:03 -0400 Received: from smtp.cs.ucla.edu ([131.179.128.62]:46022) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1XSHkG-0007lq-Iq for control@debbugs.gnu.org; Thu, 11 Sep 2014 23:42:01 -0400 Received: from localhost (localhost.localdomain [127.0.0.1]) by smtp.cs.ucla.edu (Postfix) with ESMTP id 29DDCA6001E for ; Thu, 11 Sep 2014 20:42:00 -0700 (PDT) X-Virus-Scanned: amavisd-new at smtp.cs.ucla.edu Received: from smtp.cs.ucla.edu ([127.0.0.1]) by localhost (smtp.cs.ucla.edu [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id jFVPB3FK1C6n for ; Thu, 11 Sep 2014 20:41:51 -0700 (PDT) Received: from [192.168.1.9] (pool-71-177-17-123.lsanca.dsl-w.verizon.net [71.177.17.123]) by smtp.cs.ucla.edu (Postfix) with ESMTPSA id 9DA6EA60001 for ; Thu, 11 Sep 2014 20:41:51 -0700 (PDT) Message-ID: <54126B7F.9000402@cs.ucla.edu> Date: Thu, 11 Sep 2014 20:41:51 -0700 From: Paul Eggert Organization: UCLA Computer Science Department User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.1.1 MIME-Version: 1.0 To: control@debbugs.gnu.org Subject: 18455 is a duplicate of 18266 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Score: -4.8 (----) X-Debbugs-Envelope-To: control X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -4.8 (----) forcemerge 18266 18455 From unknown Sun Jun 22 07:45:56 2025 Received: (at fakecontrol) by fakecontrolmessage; To: internal_control@debbugs.gnu.org From: Debbugs Internal Request Subject: Internal Control Message-Id: bug archived. Date: Wed, 15 Oct 2014 11:24:03 +0000 User-Agent: Fakemail v42.6.9 # This is a fake control message. # # The action: # bug archived. thanks # This fakemail brought to you by your local debbugs # administrator