GNU bug report logs - #10314
24.0.92; ietf-drums-parse-address and non-ASCII names

Previous Next

Packages: emacs, gnus;

Reported by: Aaron Ecay <aaronecay <at> gmail.com>

Date: Fri, 16 Dec 2011 16:55:02 UTC

Severity: normal

Tags: notabug

Found in version 24.0.92

Done: Lars Ingebrigtsen <larsi <at> gnus.org>

Bug is archived. No further changes may be made.

To add a comment to this bug, you must first unarchive it, by sending
a message to control AT debbugs.gnu.org, with unarchive 10314 in the body.
You can then email your comments to 10314 AT debbugs.gnu.org in the normal way.

Toggle the display of automated, internal messages from the tracker.

View this report as an mbox folder, status mbox, maintainer mbox


Report forwarded to bug-gnu-emacs <at> gnu.org:
bug#10314; Package emacs. (Fri, 16 Dec 2011 16:55:05 GMT) Full text and rfc822 format available.

Acknowledgement sent to Aaron Ecay <aaronecay <at> gmail.com>:
New bug report received and forwarded. Copy sent to bug-gnu-emacs <at> gnu.org. (Fri, 16 Dec 2011 16:55:05 GMT) Full text and rfc822 format available.

Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):

From: Aaron Ecay <aaronecay <at> gmail.com>
To: bug-gnu-emacs <at> gnu.org
Subject: 24.0.92; ietf-drums-parse-address and non-ASCII names
Date: Fri, 16 Dec 2011 05:58:51 -0500
This bug report will be sent to the Bug-GNU-Emacs mailing list
and the GNU bug tracker at debbugs.gnu.org.  Please check that
the From: line contains a valid email address.  After a delay of up
to one day, you should receive an acknowledgement at that address.

Please write in English if possible, as the Emacs maintainers
usually do not have translators for other languages.

Please describe exactly what actions triggered the bug, and
the precise symptoms of the bug.  If you can, give a recipe
starting from `emacs -Q':

The function ietf-drums-parse-address cannot cope with (unquoted)
non-ASCII characters in email addresses.  An example is the following
From: line (from a message on the mailing list for the notmuch email
client, archives accessible here, real email address obfuscated:
http://notmuchmail.org/pipermail/notmuch/
)

 From: =?utf-8?q?Philipp_=C3=9Cberbacher?= <foo <at> bar.com>

(the quoted character is an uppercase U-umlaut: Ü)

In the process of parsing this line, the function emits a minibuffer
message, and then returns the name without the non-ASCII character:
“Philipp berbacher”  This is because of the ietf-drums-atext-token
regex, which includes only ASCII alphabet characters.

-----

If Emacs crashed, and you have the Emacs process in the gdb debugger,
please include the output from the following gdb commands:
    `bt full' and `xbacktrace'.
For information about debugging Emacs, please read the file
/Users/aecay/Applications/Emacs.app/Contents/Resources/etc/DEBUG.


In GNU Emacs 24.0.92.3 (i386-apple-darwin10.8.0, NS apple-appkit-1038.36)
 of 2011-12-12 on awe
Windowing system distributor `Apple', version 10.3.1038
configured using `configure  '--with-ns' '--without-gnutls''

Important settings:
  value of $LC_ALL: nil
  value of $LC_COLLATE: nil
  value of $LC_CTYPE: nil
  value of $LC_MESSAGES: nil
  value of $LC_MONETARY: nil
  value of $LC_NUMERIC: nil
  value of $LC_TIME: nil
  value of $LANG: nil
  value of $XMODIFIERS: nil
  locale-coding-system: nil
  default enable-multibyte-characters: t

-- 
Aaron Ecay




Information forwarded to bug-gnu-emacs <at> gnu.org, bugs <at> gnus.org:
bug#10314; Package emacs,gnus. (Sat, 07 Jan 2012 01:28:01 GMT) Full text and rfc822 format available.

Message #8 received at 10314 <at> debbugs.gnu.org (full text, mbox):

From: Lars Magne Ingebrigtsen <larsi <at> gnus.org>
To: Aaron Ecay <aaronecay <at> gmail.com>
Cc: 10314 <at> debbugs.gnu.org
Subject: Re: bug#10314: 24.0.92; ietf-drums-parse-address and non-ASCII names
Date: Sat, 07 Jan 2012 02:23:35 +0100
Aaron Ecay <aaronecay <at> gmail.com> writes:

> The function ietf-drums-parse-address cannot cope with (unquoted)
> non-ASCII characters in email addresses.  An example is the following
> From: line (from a message on the mailing list for the notmuch email
> client, archives accessible here, real email address obfuscated:
> http://notmuchmail.org/pipermail/notmuch/
> )
>
>  From: =?utf-8?q?Philipp_=C3=9Cberbacher?= <foo <at> bar.com>

I'm unable to reproduce this bug:

(ietf-drums-parse-address "=?utf-8?q?Philipp_=C3=9Cberbacher?= <foo <at> bar.com>")
=> ("foo <at> bar.com" . "=?utf-8?q?Philipp_=C3=9Cberbacher?=")

Do you have a test case?

-- 
(domestic pets only, the antidote for overdose, milk.)
  bloggy blog http://lars.ingebrigtsen.no/




Information forwarded to bug-gnu-emacs <at> gnu.org, bugs <at> gnus.org:
bug#10314; Package emacs,gnus. (Fri, 13 Jan 2012 08:31:02 GMT) Full text and rfc822 format available.

Message #11 received at 10314 <at> debbugs.gnu.org (full text, mbox):

From: Aaron Ecay <aaronecay <at> gmail.com>
To: Lars Magne Ingebrigtsen <larsi <at> gnus.org>
Cc: 10314 <at> debbugs.gnu.org
Subject: Re: bug#10314: 24.0.92; ietf-drums-parse-address and non-ASCII names
Date: Fri, 13 Jan 2012 03:29:08 -0500
On Sat, 07 Jan 2012 02:23:35 +0100, Lars Magne Ingebrigtsen <larsi <at> gnus.org> wrote:
> I'm unable to reproduce this bug:
> 
> (ietf-drums-parse-address "=?utf-8?q?Philipp_=C3=9Cberbacher?= <foo <at> bar.com>")
> => ("foo <at> bar.com" . "=?utf-8?q?Philipp_=C3=9Cberbacher?=")
> 
> Do you have a test case?

This would arise in the notmuch email reader, which would decode the
address to UTF-8 before calling this function.  So you can get the
behavior by doing:

(ietf-drums-parse-address "Philipp Überbacher <foo <at> bar.com>")
; -> ("foo <at> bar.com" . "Philipp berbacher")

When this happens, there is also a message displayed in the echo area:
“Unknown symbol: Ü”.

Thanks,

-- 
Aaron Ecay




Information forwarded to bug-gnu-emacs <at> gnu.org, bugs <at> gnus.org:
bug#10314; Package emacs,gnus. (Wed, 25 Jan 2012 23:28:01 GMT) Full text and rfc822 format available.

Message #14 received at 10314 <at> debbugs.gnu.org (full text, mbox):

From: Lars Ingebrigtsen <larsi <at> gnus.org>
To: Aaron Ecay <aaronecay <at> gmail.com>
Cc: 10314 <at> debbugs.gnu.org
Subject: Re: bug#10314: 24.0.92; ietf-drums-parse-address and non-ASCII names
Date: Thu, 26 Jan 2012 00:26:50 +0100
Aaron Ecay <aaronecay <at> gmail.com> writes:

> This would arise in the notmuch email reader, which would decode the
> address to UTF-8 before calling this function.  So you can get the
> behavior by doing:
>
> (ietf-drums-parse-address "Philipp Überbacher <foo <at> bar.com>")
> ; -> ("foo <at> bar.com" . "Philipp berbacher")
>
> When this happens, there is also a message displayed in the echo area:
> “Unknown symbol: Ü”.

That's not what that function is meant to do.  It parses a DRUMS
address, which is basically an RFC2047-encoded address.  It's undefined
what it does on addresses that are of a different format.

So this sounds like a bug in Notmuch, and you should take it up with
them.

-- 
(domestic pets only, the antidote for overdose, milk.)
  http://lars.ingebrigtsen.no  *  Sent from my Rome




Added tag(s) notabug. Request was from Lars Ingebrigtsen <larsi <at> gnus.org> to control <at> debbugs.gnu.org. (Wed, 25 Jan 2012 23:30:02 GMT) Full text and rfc822 format available.

bug closed, send any further explanations to 10314 <at> debbugs.gnu.org and Aaron Ecay <aaronecay <at> gmail.com> Request was from Lars Ingebrigtsen <larsi <at> gnus.org> to control <at> debbugs.gnu.org. (Wed, 25 Jan 2012 23:30:03 GMT) Full text and rfc822 format available.

bug archived. Request was from Debbugs Internal Request <help-debbugs <at> gnu.org> to internal_control <at> debbugs.gnu.org. (Thu, 23 Feb 2012 12:24:07 GMT) Full text and rfc822 format available.

This bug report was last modified 13 years and 204 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.