GNU bug report logs -
#4037
Characters garbled in self-insert-command
Previous Next
Reported by: Juri Linkov <juri <at> jurta.org>
Date: Tue, 4 Aug 2009 19:35:04 UTC
Severity: normal
Merged with 4240
Done: Eli Zaretskii <eliz <at> gnu.org>
Bug is archived. No further changes may be made.
Full log
View this message in rfc822 format
[Message part 1 (text/plain, inline)]
Your message dated Fri, 28 Aug 2009 11:52:21 +0300
with message-id <83fxbccoca.fsf <at> gnu.org>
and subject line Re: bug#4240: 23.1.50; C-u doesn't work with Swedish characters
has caused the Emacs bug report #4037,
regarding Characters garbled in self-insert-command
to be marked as done.
This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
bug report if necessary, and/or fix the problem forthwith.
(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact help-debbugs <at> gnu.org
immediately.)
--
4037: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=4037
Emacs Bug Tracking System
Contact help-debbugs <at> gnu.org with problems
[Message part 2 (message/rfc822, inline)]
I just noticed a regression against Emacs 22.
In GNU Emacs 23.1.50 (x86_64-pc-linux-gnu) typing
C-u 5 C-x 8 ' a
inserts into the current buffer
á\341\341\341á
whereas in GNU Emacs 22.1.1 typing the same correctly inserts
ááááá
The command `self-insert-command' in Emacs 23 inserts the first
and the last characters without any modifications, but applies
the following conversion for the remaining characters:
/* Add the offset to the character, for Finsert_char.
We pass internal_self_insert the unmodified character
because it itself does this offsetting. */
if (! NILP (current_buffer->enable_multibyte_characters))
modified_char = unibyte_char_to_multibyte (modified_char);
Commenting out the above 2 lines produces the correct result.
However, I'm not sure what is the right fix.
--
Juri Linkov
http://www.jurta.org/emacs/
[Message part 3 (message/rfc822, inline)]
> From: Kenichi Handa <handa <at> m17n.org>
> Cc: eliz <at> gnu.org, 4240 <at> emacsbugs.donarmstrong.com, deniz.a.m.dogan <at> gmail.com
> Date: Thu, 27 Aug 2009 15:23:25 +0900
>
> In article <jwvocq14zlk.fsf-monnier+emacsbugreports <at> gnu.org>, Stefan Monnier <monnier <at> iro.umontreal.ca> writes:
>
> >>> > Please see bug#4037:
> >>> > http://debbugs.gnu.org/cgi/bugreport.cgi?bug=4037
> >>> > I received no confirmation that my proposed fix is correct.
> >>> I think those two lines are not necessary anymore and should be
> >>> removed (together with the comments which explain their need). I
> >>> think they belong to the old pre-unicode days when raw eight-bit
> >>> characters needed such special treatment.
>
> > I believe you're right. Nowadays, the keyboard-decoding should always
> > take place before we get to that point.
>
> Sorry for the late responce on this matter. Yes, that
> unibyte->multibyte conversion is not necessary. I've just
> installed a fix.
Thanks. I'm closing the two related bug reports.
This bug report was last modified 15 years and 327 days ago.
Previous Next
GNU bug tracking system
Copyright (C) 1999 Darren O. Benham,
1997,2003 nCipher Corporation Ltd,
1994-97 Ian Jackson.