From unknown Sun Jun 22 03:56:11 2025 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-Mailer: MIME-tools 5.509 (Entity 5.509) Content-Type: text/plain; charset=utf-8 From: bug#22064 <22064@debbugs.gnu.org> To: bug#22064 <22064@debbugs.gnu.org> Subject: Status: expr: expr string : '.*' returns the number of matched bytes not characters Reply-To: bug#22064 <22064@debbugs.gnu.org> Date: Sun, 22 Jun 2025 10:56:11 +0000 retitle 22064 expr: expr string : '.*' returns the number of matched bytes = not characters reassign 22064 coreutils submitter 22064 Stephane Chazelas severity 22064 normal tag 22064 fixed thanks From debbugs-submit-bounces@debbugs.gnu.org Mon Nov 30 16:10:30 2015 Received: (at submit) by debbugs.gnu.org; 30 Nov 2015 21:10:31 +0000 Received: from localhost ([127.0.0.1]:33317 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1a3ViQ-00057M-Er for submit@debbugs.gnu.org; Mon, 30 Nov 2015 16:10:30 -0500 Received: from eggs.gnu.org ([208.118.235.92]:56517) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1a3ViO-00057F-NL for submit@debbugs.gnu.org; Mon, 30 Nov 2015 16:10:29 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1a3ViN-0004I7-UY for submit@debbugs.gnu.org; Mon, 30 Nov 2015 16:10:28 -0500 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=0.9 required=5.0 tests=BAYES_00,FREEMAIL_FROM, FSL_HELO_BARE_IP_2,RCVD_NUMERIC_HELO autolearn=disabled version=3.3.2 Received: from lists.gnu.org ([2001:4830:134:3::11]:56831) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1a3ViN-0004I2-RN for submit@debbugs.gnu.org; Mon, 30 Nov 2015 16:10:27 -0500 Received: from eggs.gnu.org ([2001:4830:134:3::10]:38290) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1a3ViM-000844-TT for bug-coreutils@gnu.org; Mon, 30 Nov 2015 16:10:27 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1a3ViI-0004G7-Sk for bug-coreutils@gnu.org; Mon, 30 Nov 2015 16:10:26 -0500 Received: from plane.gmane.org ([80.91.229.3]:40626) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1a3ViI-0004F6-MM for bug-coreutils@gnu.org; Mon, 30 Nov 2015 16:10:22 -0500 Received: from list by plane.gmane.org with local (Exim 4.69) (envelope-from ) id 1a3Vi3-0000UX-TY for bug-coreutils@gnu.org; Mon, 30 Nov 2015 22:10:07 +0100 Received: from 2.121.21.200 ([2.121.21.200]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 30 Nov 2015 22:10:07 +0100 Received: from stephane.chazelas by 2.121.21.200 with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 30 Nov 2015 22:10:07 +0100 X-Injected-Via-Gmane: http://gmane.org/ To: bug-coreutils@gnu.org From: Stephane Chazelas Subject: expr: expr string : '.*' returns the number of matched bytes not characters Date: Mon, 30 Nov 2015 21:09:45 +0000 Lines: 27 Message-ID: <20151130210944.GA7790@chaz.gmail.com> Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Complaints-To: usenet@ger.gmane.org X-Gmane-NNTP-Posting-Host: 2.121.21.200 Content-Disposition: inline User-Agent: Mutt/1.5.21 (2010-09-15) X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-detected-operating-system: by eggs.gnu.org: Error: Malformed IPv6 address (bad octet value). X-Received-From: 2001:4830:134:3::11 X-Spam-Score: -1.6 (-) X-Debbugs-Envelope-To: submit X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.2 (---) Hello, that's another multibyte issue, it may be known already but I can't see it being referenced on debbugs.gnu.org. $ locale charmap UTF-8 $ expr é : . 2 $ expr é That is, characters are correctly matched, but a number of bytes instead of characters is returned. Same problem with $ expr length é 2 $ expr index éx x 3 (with both 8.23 and git head on Debian testing). -- Stephane From debbugs-submit-bounces@debbugs.gnu.org Wed Oct 24 17:33:10 2018 Received: (at 22064) by debbugs.gnu.org; 24 Oct 2018 21:33:10 +0000 Received: from localhost ([127.0.0.1]:40796 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1gFQm5-0000Xg-SL for submit@debbugs.gnu.org; Wed, 24 Oct 2018 17:33:10 -0400 Received: from mail-pf1-f172.google.com ([209.85.210.172]:43393) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1gFQm4-0000T9-8h; Wed, 24 Oct 2018 17:33:08 -0400 Received: by mail-pf1-f172.google.com with SMTP id q1-v6so3067188pfh.10; Wed, 24 Oct 2018 14:33:08 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=subject:to:references:from:message-id:date:user-agent:mime-version :in-reply-to:content-language:content-transfer-encoding; bh=TK7yDNUR1tiXOBFwWKqNaZlkSRu9+KLHz8w+ftYHKNI=; b=JETbB3Me/mBp91WRD7sB9sFIr9Fw7yJRbCWKUV0YO7/YuL6PGhl1sdAtl+AO4rWScA 4LkBmPJp4OGNkXnC/clbfgfMRkkxceQXeXH4pnZr20raJ36WrKbV1kHrclHP1HE8D/RS XK6JgXxmOXgaCqPMm2w2ckr/EoypVjBOFJTLcpRK6IwofoxB6aEhMl97brJ79wnOWWRb 0T4SnYtt//+UZGA8TX0neBQbaJC0u1MS1CI/2g6BQS/aUlPedeObRr1i41kP52+og1l/ atl+HdjVEPGyShChWmAPk71saUwSR/DtYDfXDj/4BD5fKR3Cwa2+zfUtv/ii2LISQHew 3nVA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-language :content-transfer-encoding; bh=TK7yDNUR1tiXOBFwWKqNaZlkSRu9+KLHz8w+ftYHKNI=; b=MvQfgQjOPJvZGnK5XxHvNqBYEWJrvGDunnV+DCdgUmnau7zxziw/wPzF7VaYf5/8Jc CemVhVFYLiBX3SLqoJSQ+jHNceDfG6KmMvXHm8HxT40fIK180owl66HiYL3freltZdz1 ObaNDtEp2cqzk2UTYEZqDQHs8WB56abLCL2Hv6Uoz0asKe9wrH0KkSH4OGc+7ggFrlfL 4NqsmUj+TYlx7mAhRPGkfDbxpBdoVRMT/oG+BFmaGv0S+zHKoX+yb7WNDSTHyBAtJuSA ZfEpjTEOu9ZxoySCTiFr1cZ+tv5eMrSJs4+n9SxcTNZ4EA6A//W8DEJwyJwKsQ/ouHBB sbkA== X-Gm-Message-State: AGRZ1gJGJaNwTS2BAGMdG4y+dBjq/umy+kRPiD7mIQjG2I41+c68ycT+ F1r6wCdR0ynbt6wosBtzFGEh2E3yhBk= X-Google-Smtp-Source: AJdET5cdPVfmiXhAqUoBAa2iApfrfMFHh+LuTb+Gr+JSbzSJYdAkKZnV/yqS6607y7TzJRVSf7iT1Q== X-Received: by 2002:a62:b209:: with SMTP id x9-v6mr4325411pfe.148.1540416781647; Wed, 24 Oct 2018 14:33:01 -0700 (PDT) Received: from tomato.housegordon.com (moose.housegordon.com. [184.68.105.38]) by smtp.googlemail.com with ESMTPSA id c14-v6sm3326014pfi.156.2018.10.24.14.33.00 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Wed, 24 Oct 2018 14:33:00 -0700 (PDT) Subject: Re: bug#22064: expr: expr string : '.*' returns the number of matched bytes not characters To: 22064@debbugs.gnu.org References: <20151130210944.GA7790@chaz.gmail.com> From: Assaf Gordon Message-ID: Date: Wed, 24 Oct 2018 15:32:59 -0600 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.9.1 MIME-Version: 1.0 In-Reply-To: <20151130210944.GA7790@chaz.gmail.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 22064 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) tags 22064 fixed close 22064 stop (triaging old bugs) On 30/11/15 02:09 PM, Stephane Chazelas wrote: > > that's another multibyte issue, it may be known already but I > can't see it being referenced on debbugs.gnu.org. > This commit added multibyte support to expr(1): https://git.savannah.gnu.org/cgit/coreutils.git/commit/?id=a9f2be5bfec2bfe86c0851787312996467a653ee Available since coreutils 8.28 (released Sep 2017). Closing as "fixed". -assaf From unknown Sun Jun 22 03:56:11 2025 Received: (at fakecontrol) by fakecontrolmessage; To: internal_control@debbugs.gnu.org From: Debbugs Internal Request Subject: Internal Control Message-Id: bug archived. Date: Thu, 22 Nov 2018 12:24:09 +0000 User-Agent: Fakemail v42.6.9 # This is a fake control message. # # The action: # bug archived. thanks # This fakemail brought to you by your local debbugs # administrator