GNU bug report logs -
#29837
UTF-16 char display problems and the macOS "character palette"
Previous Next
Reported by: Alan Third <alan <at> idiocy.org>
Date: Sun, 24 Dec 2017 16:02:02 UTC
Severity: normal
Tags: fixed
Fixed in version 27.1
Done: Alan Third <alan <at> idiocy.org>
Bug is archived. No further changes may be made.
Full log
Message #29 received at 29837 <at> debbugs.gnu.org (full text, mbox):
On Mon, Dec 25, 2017 at 08:13:55PM +0000, Philipp Stephani wrote:
> IIUC Emacs receives the input as a single UTF-16 string (in
> insertText), then iterates over the UTF-16 code units, converting
> each into an Emacs event. That's wrong, no matter whether the input
> comes from the character palette or from the keyboard; normal
> keyboard layouts just happen to not contain non-BMP characters. The
> loop needs to account for surrogates.
I finally came to this conclusion myself. I now know a lot more about
UTF‐16 than I did yesterday. :)
Wish I’d looked at my email earlier, though.
> As a small optimization (which is warranted because the function is
> probably called on every keystroke), this should use [NSString
> getCharacters:range:] to copy all the UTF-16 code units to a buffer
> first, to avoid repeated calls to characterAtIndex.
Presumably the vast majority of input will consist of just one code
unit, though?
--
Alan Third
This bug report was last modified 7 years and 139 days ago.
Previous Next
GNU bug tracking system
Copyright (C) 1999 Darren O. Benham,
1997,2003 nCipher Corporation Ltd,
1994-97 Ian Jackson.