From unknown Thu Jun 19 14:04:15 2025
From: bug#36852 <36852@debbugs.gnu.org>
To: bug#36852 <36852@debbugs.gnu.org>
Subject: Status: 27.0.50; ietf-drums-parse-address doesn't handle non-ascii properly
Reply-To: bug#36852 <36852@debbugs.gnu.org>
Date: Thu, 19 Jun 2025 21:04:15 +0000

retitle 36852 27.0.50; ietf-drums-parse-address doesn't handle non-ascii properly
reassign 36852 emacs
submitter 36852 Štěpán Němec
severity 36852 normal
tag 36852 fixed
thanks

From debbugs-submit-bounces@debbugs.gnu.org Tue Jul 30 05:17:00 2019
From: Štěpán Němec
To: bug-gnu-emacs@gnu.org
Subject: 27.0.50; ietf-drums-parse-address doesn't handle non-ascii properly
Date: Tue, 30 Jul 2019 11:16:53 +0200
Message-ID: <871ry7g9ve.fsf@gmail.com>

ietf-drums-parse-address (AKA mail-header-parse-address) uses
ietf-drums-atext-token to parse display-name, but the regexp range only
contains ASCII characters, so e.g. as used in debbugs-gnu-show-reports,
the following happens:

  (mail-header-parse-address
   (decode-coding-string "Áaááá Ůůůůů <aaa@example.net>" 'utf-8))

  ;;=> ("aaa@example.net" . "aááá")

It actually only cares about the first char of a word:

  (let ((ietf-drums-atext-token "-ÁŮ^a-zA-Z0-9!#$%&'*+/=?_`{|}~"))
    (mail-header-parse-address
     (decode-coding-string "Áaááá Ůůůůů <aaa@example.net>" 'utf-8)))

  ;;=> ("aaa@example.net" . "Áaááá Ůůůůů")

I'm not quite sure what the proper fix is, as the ASCII-only thing seems
to be intentional. Maybe it's just not supposed to be used the way it is
used in debbugs-gnu.el?
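The "first char of a word" behaviour described above can be reproduced without the mail libraries. The following is only a minimal sketch: `my-atext' copies the ASCII-only character set quoted in the report, and `my-collect-words' is a hypothetical, simplified stand-in for the display-name loop (it assumes the parser tests a single character against the set and then takes the rest of the word). It is not the actual `ietf-drums-parse-address' code.

```elisp
;; Assumption: this mirrors the ASCII-only set quoted in the report.
(defvar my-atext "-^a-zA-Z0-9!#$%&'*+/=?_`{|}~"
  "Hypothetical copy of the ASCII-only atext character set.")

(defun my-collect-words (string)
  "Return the words of STRING that are picked up as display-name parts.
A word is picked up only when the character at point matches `my-atext';
after that, everything up to the next space or tab is taken unchecked."
  (with-temp-buffer
    (insert string)
    (goto-char (point-min))
    (let ((case-fold-search nil)
          words)
      (while (not (eobp))
        (if (looking-at (concat "[" my-atext "]"))
            ;; Only this one character was tested; the rest of the word
            ;; is collected regardless of what it contains.
            (push (buffer-substring
                   (point)
                   (progn (skip-chars-forward "^ \t") (point)))
                  words)
          ;; Character not in the set: skip it and try the next one.
          (forward-char 1)))
      (nreverse words))))

(my-collect-words "Áaááá Ůůůůů")
;; => ("aááá")

;; Adding the two leading capitals to the set, as in the report:
(let ((my-atext (concat "-ÁŮ" (substring my-atext 1))))
  (my-collect-words "Áaááá Ůůůůů"))
;; => ("Áaááá" "Ůůůůů")
```

In the first call the leading "Á" is skipped and the word is picked up from "a" onward, while no character of the second word matches at all. That is consistent with the "aááá" result shown in the report.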
From debbugs-submit-bounces@debbugs.gnu.org Tue Jul 30 05:53:48 2019
From: Robert Pluim
To: Štěpán Němec
Cc: 36852@debbugs.gnu.org
Subject: Re: bug#36852: 27.0.50; ietf-drums-parse-address doesn't handle non-ascii properly
References: <871ry7g9ve.fsf@gmail.com>
In-Reply-To: <871ry7g9ve.fsf@gmail.com> ("Štěpán Němec"'s message of "Tue, 30 Jul 2019 11:16:53 +0200")
Date: Tue, 30 Jul 2019 11:53:36 +0200

>>>>> On Tue, 30 Jul 2019 11:16:53 +0200, Štěpán Němec said:

    Štěpán> ietf-drums-parse-address (AKA mail-header-parse-address) uses
    Štěpán> ietf-drums-atext-token to parse display-name, but the regexp range only
    Štěpán> contains ASCII characters, so e.g. as used in debbugs-gnu-show-reports,
    Štěpán> the following happens:

    Štěpán>   (mail-header-parse-address
    Štěpán>    (decode-coding-string "Áaááá Ůůůůů <aaa@example.net>" 'utf-8))

    Štěpán>   ;;=> ("aaa@example.net" . "aááá")

    Štěpán> It actually only cares about the first char of a word:

    Štěpán>   (let ((ietf-drums-atext-token "-ÁŮ^a-zA-Z0-9!#$%&'*+/=?_`{|}~"))
    Štěpán>     (mail-header-parse-address
    Štěpán>      (decode-coding-string "Áaááá Ůůůůů <aaa@example.net>" 'utf-8)))

    Štěpán>   ;;=> ("aaa@example.net" . "Áaááá Ůůůůů")

    Štěpán> I'm not quite sure what the proper fix is, as the ASCII-only thing seems
    Štěpán> to be intentional. Maybe it's just not supposed to be used the way it is
    Štěpán> used in debbugs-gnu.el?

Mail headers are defined to be ascii-only, although as Iʼve just
discovered, gmail undoes Gnus' perfectly formatted RFC 2047 encoding
and replaces it with UTF-8 characters. Bad Google, bad.

Perhaps mail-header-parse-address could just discard the complete
display string if it finds a non-ascii char? That would at least
prevent it from propagating.
Robert

From debbugs-submit-bounces@debbugs.gnu.org Sun Sep 15 08:00:34 2019
From: Lars Ingebrigtsen
To: Štěpán Němec
Cc: 36852@debbugs.gnu.org
Subject: Re: bug#36852: 27.0.50; ietf-drums-parse-address doesn't handle non-ascii properly
References: <871ry7g9ve.fsf@gmail.com>
In-Reply-To: <871ry7g9ve.fsf@gmail.com> ("Štěpán Němec"'s message of "Tue, 30 Jul 2019 11:16:53 +0200")
Date: Sun, 15 Sep 2019 14:00:28 +0200
Message-ID: <874l1d6a77.fsf@gnus.org>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.0.50 (gnu/linux)

Štěpán Němec writes:

> ietf-drums-parse-address (AKA mail-header-parse-address) uses
> ietf-drums-atext-token to parse display-name, but the regexp range only
> contains ASCII characters, so e.g. as used in debbugs-gnu-show-reports,
> the following happens:
>
>   (mail-header-parse-address
>    (decode-coding-string "Áaááá Ůůůůů <aaa@example.net>" 'utf-8))
>
>   ;;=> ("aaa@example.net" . "aááá")

That's not a valid email address, so perhaps `ietf-drums-parse-address'
should return a blank string as the name here... On the other hand,
calling that function on something that's not an email address (which
debbugs-gnu does here) it should probably be free to return whatever.

> I'm not quite sure what the proper fix is, as the ASCII-only thing seems
> to be intentional. Maybe it's just not supposed to be used the way it is
> used in debbugs-gnu.el?

Indeed. I've now changed debbugs-gnu to split the "OCTETS " string
returned by the debbugs web server correctly.

-- 
(domestic pets only, the antidote for overdose, milk.)
   bloggy blog: http://lars.ingebrigtsen.no

From debbugs-submit-bounces@debbugs.gnu.org Sun Sep 15 08:00:39 2019
From: Lars Ingebrigtsen
To: control@debbugs.gnu.org
Subject: control message for bug #36852
Date: Sun, 15 Sep 2019 14:00:34 +0200
Message-Id: <8736gx6a71.fsf@gnus.org>

tags 36852 fixed
close 36852 27.1
quit

From unknown Thu Jun 19 14:04:15 2025
From: Debbugs Internal Request
To: internal_control@debbugs.gnu.org
Subject: Internal Control
Message-Id: bug archived.
Date: Mon, 14 Oct 2019 11:24:13 +0000
User-Agent: Fakemail v42.6.9

# This is a fake control message.
#
# The action:
# bug archived.
thanks
# This fakemail brought to you by your local debbugs
# administrator