#56824 - 29.0.50; mail-header-parse-address drops the 1st character from the name

GNU bug report logs - #56824
29.0.50; mail-header-parse-address drops the 1st character from the name

Package: emacs;

Reported by: sds <at> gnu.org

Date: Fri, 29 Jul 2022 13:32:02 UTC

Severity: normal

Found in version 29.0.50

Fixed in version 29.1

Done: Lars Ingebrigtsen <larsi <at> gnus.org>

Bug is archived. No further changes may be made.

View this message in rfc822 format

From: Sam Steingold <sds <at> gnu.org> To: Lars Ingebrigtsen <larsi <at> gnus.org> Cc: 56824 <at> debbugs.gnu.org Subject: bug#56824: 29.0.50; mail-header-parse-address drops the 1st character from the name Date: Fri, 29 Jul 2022 09:58:26 -0400

> * Lars Ingebrigtsen <ynefv <at> tahf.bet> [2022-07-29 15:36:20 +0200]: > > Sam Steingold <sds <at> gnu.org> writes: > >> https://debbugs.gnu.org/cgi/bugreport.cgi?bug=10406 >> https://debbugs.gnu.org/cgi/bugreport.cgi?bug=56422 >> >> Lars told me to use `mail-header-parse-address' instead of >> `mail-extract-address-components'. > > Well, not exactly. oops. sorry. ;-) >> Well, I tried, with a positive effect (thanks Lars!), >> but here is the 1st time the former is deficient: >> >> (mail-header-parse-addresses "Štěpán Němec <stepnem <at> gmail.com>") >> ==> (("stepnem <at> gmail.com" . "těpán Němec")) >> (mail-extract-address-components "Štěpán Němec <stepnem <at> gmail.com>") >> ==> ("Štěpán Němec" "stepnem <at> gmail.com") > > `mail-header-parse-addresses' is for parsing RFC822bis mail addresses. > "Štěpán Němec <stepnem <at> gmail.com>" is definitely not one of those. Hmmm, it _used to be_ --8<---------------cut here---------------start------------->8--- "=?utf-8?B?xaB0xJtww6FuIE7Em21lYw==?= <stepnem <at> gmail.com>" --8<---------------cut here---------------end--------------->8--- (is that what RFC822bis specifies?) but was passed through `mail-decode-encoded-word-string': --8<---------------cut here---------------start------------->8--- (mail-decode-encoded-word-string "=?utf-8?B?xaB0xJtww6FuIE7Em21lYw==?= <stepnem <at> gmail.com>") ==> "Štěpán Němec <stepnem <at> gmail.com>" --8<---------------cut here---------------end--------------->8--- and then passed on to `mail-header-parse-addresses'. What is TRT here? Should I call `mail-header-parse-addresses' _first_ and then `mail-decode-encoded-word-string'? Come to think of it, passing _email address_ through `mail-decode-encoded-word-string' seems unnecessary... --8<---------------cut here---------------start------------->8--- (mail-header-parse-addresses "=?utf-8?B?xaB0xJtww6FuIE7Em21lYw==?= <stepnem <at> gmail.com>") ==> (("stepnem <at> gmail.com" . "=?utf-8?B?xaB0xJtww6FuIE7Em21lYw==?=")) (mail-decode-encoded-word-string "=?utf-8?B?xaB0xJtww6FuIE7Em21lYw==?=") ==> "Štěpán Němec" --8<---------------cut here---------------end--------------->8--- > You're probably looking for `mail-header-parse-address-lax': > > (mail-header-parse-address-lax "Štěpán Němec <stepnem <at> gmail.com>") > -> ("stepnem <at> gmail.com" . "Štěpán Němec") ... Or maybe I should keep the current order of processing and use `mail-header-parse-address-lax'? Thank you for your speedy and enlightening reply! -- Sam Steingold (http://sds.podval.org/) on darwin Ns 10.3.2113 http://childpsy.net http://calmchildstories.com http://steingoldpsychology.com http://think-israel.org https://thereligionofpeace.com https://iris.org.il In the race between idiot-proof software and idiots, the idiots are winning.

This bug report was last modified 3 years and 27 days ago.

GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.

GNU bug report logs - #56824 29.0.50; mail-header-parse-address drops the 1st character from the name

GNU bug report logs - #56824
29.0.50; mail-header-parse-address drops the 1st character from the name