GNU bug report logs - #45925
27.1; *Summary* buffer vs. raw utf-8 headers

Previous Next

Packages: emacs, gnus;

Reported by: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>

Date: Sun, 17 Jan 2021 05:37:02 UTC

Severity: minor

Tags: fixed

Found in version 27.1

Fixed in version 28.1

Done: Lars Ingebrigtsen <larsi <at> gnus.org>

Bug is archived. No further changes may be made.

Full log


View this message in rfc822 format

From: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>
To: Lars Ingebrigtsen <larsi <at> gnus.org>
Cc: 45925 <at> debbugs.gnu.org
Subject: bug#45925: 27.1; *Summary* buffer vs. raw utf-8 headers
Date: Wed, 20 Jan 2021 13:58:08 +0800
>>>>> "LI" == Lars Ingebrigtsen <larsi <at> gnus.org> writes:
LI> 積丹尼 Dan Jacobson <jidanni <at> jidanni.org> writes:

>> Try this simple experiment:
>> $ echo Subject: 一二三|procmail
>> $ echo Subject: 一二三|iconv -t big5|procmail

LI> I don't have procmail installed, so I'm not sure what these do -- are
LI> you sending a mail (to yourself?) here?  Do you have a recipe to
LI> reproduce this problem without the use of procmail?

$ echo Subject: 一二三 > ~/Maildir/new/Z
$ file ~/Maildir/new/Z
~/Maildir/new/Z: UTF-8 Unicode text


>> $ emacs -f gnus
>> 
>> In the *Article* buffer, both look like
>> Subject: 一二三
>> In the *Summary* buffer so does the big5 version.
>> Alas, the utf-8 version looks like
>> c\x80\xd3....
>> 
>> (Yes, these are illegal raw headers. But Gnus is supposed to be
>> accommodating. And it does... but oddly not for the majority (UTF-8) case.)

LI> [...]

>> (Might be related to bug#45724.)

LI> Is this still with nnml?  If so, could you find the resulting lines in
LI> the .overview files in the nnml directory and post them here?  (Perhaps
LI> after gzipping them to avoid Emacs helpfully re-encoding the lines.)

Yes, nnml.

The headers get appended raw to .overview.

Thus .overview contains a mix of ASCII, big5, and UTF-8, all in the same file.

$ echo Subject: 一二三|iconv -t big5 > ~/Maildir/new/B5
$ echo Subject: 一二三 > ~/Maildir/new/UT
$ emacs -f gnus
$ tail -n 2 Mail/mail/misc/.overview|qprint -e
37397   =A4@=A4G=A4T    (nobody)                <87a6t4gnpx.5.fsf <at> totally-fudged-out-mess=
age-id>         0       0       Xref: jidanni5 mail.misc:37397=09
37398   =E4=B8=80=E4=BA=8C=E4=B8=89     (nobody)                <878s8ognpx.5.fsf <at> totally-=
fudged-out-message-id>          0       0       Xref: jidanni5 mail.misc:37398=09

Anyway: *Summary* oddly can only deal with raw big5, not raw UTF-8.
However *Article* can deal with both.




This bug report was last modified 4 years and 171 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.