GNU bug report logs - #69381
mumi does not correctly display (some?) non-ascii characters
Reported by: Tomas Volf <~@wolfsden.cz>
Date: Sun, 25 Feb 2024 13:27:03 UTC
Severity: normal
Tags: patch
Done: Maxim Cournoyer <maxim.cournoyer <at> gmail.com>
Bug is archived. No further changes may be made.
Maxim Cournoyer <maxim.cournoyer <at> gmail.com> writes:
> Hi Noé,
>
> Noé Lopez <noe <at> noé.eu> writes:
>
>> Small update,
>>
>> I’ve investigated the issue in fibers and I now blame the guile web
>> library for the issue. Apparently it sets the port to ISO-8859-1
>> encoding each time you call read-request, but it acts like « yeah don’t
>> worry just use utf-8 for your body » in the docs.
>>
>> That’s fine UNLESS you use chunked transfers (omitting content-length in
>> fibers), in which case it just decides to blow up :///// (it assumes one
>> character = one byte)
>>
>> In the end I’m pretty sure any of this could have been avoided by just
>> not replacing every character with question marks. Had it kept the
>> invalid bytes intact they would have translated back with no issue.
>
> Nice investigation! Did you create an issue at bug-guile <at> gnu.org? I
> don't see it on the tracker. Or perhaps this could be tackled from the
> angle of fibers? For example by adding a new failing test reproducing
> the problem to its test suite, and going from there.
>
I talked about this with Christopher Baines at FOSDEM and he seemed to
know much more about it than me, so maybe he can suggest a way forward?
Starting with a failing test seems like a good idea.
Have a nice day,
Noé
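
For anyone reading the thread later, the failure mode described in the quoted message can be sketched roughly as follows. This is a Python stand-in, not the Guile web or fibers code: the sample string and the variable names are made up for illustration, and the "replace with question marks" step only mimics the behaviour as the thread describes it.

```python
# Illustration only: a Python analogue of the behaviour described in the
# thread. Nothing here is taken from Guile's (web ...) modules or fibers.

text = "Noé ’quoted’"          # hypothetical body with non-ASCII characters
utf8 = text.encode("utf-8")    # the bytes that should go on the wire

# 1. Lossless path: ISO-8859-1 maps every byte to exactly one code point,
#    so UTF-8 bytes read through a Latin-1 port can be written back intact.
as_latin1 = utf8.decode("iso-8859-1")
roundtrip = as_latin1.encode("iso-8859-1")

# 2. Lossy path: encoding the real characters as ISO-8859-1 with
#    substitution. "é" exists in Latin-1, but "’" (U+2019) does not,
#    so it is replaced by "?" and cannot be recovered afterwards.
lossy = text.encode("iso-8859-1", errors="replace")

# 3. Chunk sizes: a chunked body must announce its length in bytes.
#    Assuming one character = one byte under-counts the UTF-8 payload.
char_len = len(text)
byte_len = len(utf8)

print(roundtrip == utf8)   # bytes survive the Latin-1 round trip
print(lossy)               # question marks where Latin-1 has no mapping
print(char_len, byte_len)  # character count vs. byte count
```

The third step is the part that matters for chunked transfers: a chunk header computed from the character count is shorter than the payload actually written whenever the body contains multi-byte characters.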
This bug report was last modified 101 days ago.