GNU bug report logs - #11073
24.0.94; BIDI-related crash in redisplay with certain byte sequences
Reported by: Eli Zaretskii <eliz <at> gnu.org>
Date: Fri, 23 Mar 2012 11:27:02 UTC
Severity: normal
Found in version 24.0.94
Done: Glenn Morris <rgm <at> gnu.org>
Bug is archived. No further changes may be made.
Message #32 received at 11073 <at> debbugs.gnu.org (full text, mbox):
>> I understand this part. The part I don't understand is why we do
>> unification when reading a char from the buffer's text. That is: why
>> unify chars in `int' (or Lisp_Object) form but not in the
>> internal-utf-8 representation?
>> I would expect the unification to happen during encoding/decoding.
> Usually, yes. But as long as there is a code space in the high
> area for a CJK charset, it is unavoidable to have a
> buffer/string that contains a character represented by a
> byte sequence in that high area, as in the test case of
> Bug#11073. And, as "unification" means treating such a
> character the same way as the unified character, I thought
> they both have the same character code.
Since there are two internal byte-sequence representations, I don't see
any good reason why we shouldn't have two internal int representations.
I.e. if unification failed for the byte-sequence (which might be the
result of a bug, for all I know), we may as well keep them non-unified
in the int representation.
Stefan
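
The two positions in this exchange can be sketched with a toy model. This is
only an illustration under stated assumptions, not Emacs's actual code: the
decoder uses ordinary UTF-8 bit-packing but lets 4-byte sequences run past
U+10FFFF, and the high-area code 0x150000 and the UNIFY table are
hypothetical stand-ins for a CJK charset's high code space and its
unification map.

```python
# Toy model (not Emacs's implementation): decode one UTF-8-style byte
# sequence into an int, allowing 4-byte values above the Unicode limit.

def decode_one(seq: bytes) -> int:
    """Decode a single 1- to 4-byte sequence into an integer code."""
    lead = seq[0]
    if lead < 0x80:
        trailing, code = 0, lead
    elif lead >> 5 == 0b110:
        trailing, code = 1, lead & 0x1F
    elif lead >> 4 == 0b1110:
        trailing, code = 2, lead & 0x0F
    elif lead >> 3 == 0b11110:
        trailing, code = 3, lead & 0x07   # up to 21 bits, i.e. 0x1FFFFF
    else:
        raise ValueError("unsupported lead byte")
    if len(seq) != trailing + 1:
        raise ValueError("wrong sequence length")
    for b in seq[1:]:
        if b >> 6 != 0b10:
            raise ValueError("bad continuation byte")
        code = (code << 6) | (b & 0x3F)
    return code

# Hypothetical unification map: high-area code -> unified Unicode code.
UNIFY = {0x150000: 0x4E00}

def read_char(seq: bytes, unify: bool) -> int:
    """Read a character; optionally unify the int form at read time."""
    code = decode_one(seq)
    return UNIFY.get(code, code) if unify else code

high_area = b"\xf5\x90\x80\x80"   # encodes 0x150000 in this toy layout
unified = b"\xe4\xb8\x80"         # standard UTF-8 for U+4E00

# Unify on read: two distinct byte sequences yield one int.
same = read_char(high_area, unify=True) == read_char(unified, unify=True)
# Stefan's suggestion: leave the int non-unified, matching the bytes.
distinct = read_char(high_area, unify=False) != read_char(unified, unify=False)
```

With unify=True the int form hides a distinction that the byte form still
carries; with unify=False the two forms stay in one-to-one correspondence,
which is the alternative suggested above.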
This bug report was last modified 12 years and 94 days ago.