GNU bug report logs -
#11073
24.0.94; BIDI-related crash in redisplay with certain byte sequences
Previous Next
Reported by: Eli Zaretskii <eliz <at> gnu.org>
Date: Fri, 23 Mar 2012 11:27:02 UTC
Severity: normal
Found in version 24.0.94
Done: Glenn Morris <rgm <at> gnu.org>
Bug is archived. No further changes may be made.
Full log
View this message in rfc822 format
>> > Please note that not all characters in the code-space of a
>> > CJK charset are unified. For instance, Big5 has it's own
>> > PUA (private use area), and characters in PUA are not
>> > unified by default. So, if Emacs reads a Big5 file that
>> > contains PUA chars, those chars stay in high-area. Then,
>> > one can provide his own unification map that also maps PUA
>> > chars to some Unicode chars as this:
>> > (unify-charset 'big5 "MyBig5.map")
>> > After this, I thought that previously read PUA chars staying
>> > in the high-area should be treated as the corresponding
>> > Unicode chars (in displaying, search, etc).
> No. In the above scenario, PUA chars read before the call
> of unify-charset are not unified. The unification should
> take place after the call of unify-charset.
But isn't this (unify-charset 'big5 "MyBig5.map") performed in the
.emacs? Is it really important to support adding unification rules
after decoding took place? If so, why? And also, what about
removing unification rules after decoding?
Stefan
This bug report was last modified 12 years and 95 days ago.
Previous Next
GNU bug tracking system
Copyright (C) 1999 Darren O. Benham,
1997,2003 nCipher Corporation Ltd,
1994-97 Ian Jackson.