GNU bug report logs - #36171
25.1; Emacs displays Hebrew text incorrectly

Previous Next

Package: emacs;

Reported by: Raoul Comninos <revrari <at> mweb.co.za>

Date: Tue, 11 Jun 2019 15:36:01 UTC

Severity: normal

Found in version 25.1

Done: Eli Zaretskii <eliz <at> gnu.org>

Bug is archived. No further changes may be made.

Full log


Message #14 received at 36171 <at> debbugs.gnu.org (full text, mbox):

From: Raoul Comninos <revrari <at> mweb.co.za>
To: Eli Zaretskii <eliz <at> gnu.org>
Cc: 36171 <at> debbugs.gnu.org
Subject: Re: bug#36171: 25.1; Emacs displays Hebrew text incorrectly
Date: Wed, 12 Jun 2019 20:19:52 +0200
I have libm17n-0 and libm17n-dev installed.

I changed to dejavu-sans-mono. (No change to the Hebrew font display.)

Below follows the information you asked for:

////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////

             position: 935 of 9244927 (0%), column: 7
            character: לּ‎ (displayed as לּ‎) (codepoint 64316, #o175474, 
#xfb3c)
              charset: unicode-bmp (Unicode Basic Multilingual Plane 
(U+0000..U+FFFF))
code point in charset: 0xFB3C
               script: hebrew
               syntax: w     which means: word
             category: .:Base, R:Right-to-left (strong)
             to input: type "C-x 8 RET fb3c" or "C-x 8 RET HEBREW 
LETTER LAMED WITH DAGESH"
          buffer code: #xEF #xAC #xBC
            file code: #xEF #xAC #xBC (encoded by coding system utf-8-unix)
              display: by this font (glyph code)
    xft:-unknown-DejaVu 
Sans-normal-normal-normal-*-29-*-*-*-*-0-iso10646-1 (#x13DB)

Character code properties: customize what to show
  name: HEBREW LETTER LAMED WITH DAGESH
  general-category: Lo (Letter, Other)
  decomposition: (1500 1468) ('ל' 'ּ')

There are text properties here:
  fontified

////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////

On 2019/06/12 18:23, Eli Zaretskii wrote:
> [Please use Reply to All to reply to the bug tracker as well, so this
> discussion gets recorded by the tracker.]
>
>> From: Raoul Comninos <revrari <at> mweb.co.za>
>> Date: Tue, 11 Jun 2019 20:59:24 +0200
>>
>> Thank you for your response. Yes, the problem still occurs in Emacs -Q.
>>
>> I did as you asked and here are the full contents of the help buffers of
>> _two_ instances:
>>
>>                position: 71 of 9244927 (0%), column: 7
>>               character: ְ (displayed as ְ) (codepoint 1456, #o2660, #x5b0)
>>       preferred charset: unicode (Unicode (ISO10646))
>> code point in charset: 0x05B0
>>                  script: hebrew
>>                  syntax: w     which means: word
>>                category: ^:Combining
>>                to input: type "C-x 8 RET 5b0" or "C-x 8 RET HEBREW POINT SHEVA"
>>             buffer code: #xD6 #xB0
>>               file code: #xD6 #xB0 (encoded by coding system utf-8-unix)
>>                 display: by this font (glyph code)
>>       xft:-unknown-Ezra SIL-normal-normal-normal-*-22-*-*-*-*-0-iso10646-1 (#xFE)
>>
>> Character code properties: customize what to show
>>     name: HEBREW POINT SHEVA
>>     general-category: Mn (Mark, Nonspacing)
>>     decomposition: (1456) ('ְ')
>>
>> There are text properties here:
>>     fontified            t
>>
>> --------------------------------------------------------------------------------------------------------------
>>
>>                position: 733239 of 9244928 (8%), column: 5
>>               character: ׁ (displayed as ׁ) (codepoint 1473, #o2701, #x5c1)
>>                 charset: unicode-bmp (Unicode Basic Multilingual Plane (U+0000..U+FFFF))
>> code point in charset: 0x05C1
>>                  script: hebrew
>>                  syntax: w     which means: word
>>                category: ^:Combining
>>                to input: type "C-x 8 RET 5c1" or "C-x 8 RET HEBREW POINT SHIN DOT"
>>             buffer code: #xD7 #x81
>>               file code: #xD7 #x81 (encoded by coding system utf-8-unix)
>>                 display: composed to form "ׁ" (see below)
>>
>> Composed using this font:
>>     xft:-unknown-Ezra SIL-normal-normal-normal-*-24-*-*-*-*-0-iso10646-1
>> by these glyphs:
>>     [2 2 9676 334 18 2 17 15 -1 nil]
>>     [2 2 1473 271 0 13 16 20 -18 nil]
>>
>> Character code properties: customize what to show
>>     name: HEBREW POINT SHIN DOT
>>     general-category: Mn (Mark, Nonspacing)
>>     decomposition: (1473) ('ׁ')
>>
>> There are text properties here:
>>     fontified            t
>>
>> -------------------------------------------------------------------------------------------------------
> These two characters are diacriticals.  I'd also ask to please do the
> same with cursor on the base character (consonant), where a problem
> happens.  For example, on the LAMED from the first screenshot.
>
> Also, can you try with a more standard font, like DejaVu Sans Mono?
>
> And finally, do you know what version of the libm17n-flt do you have
> installed?  This library is used for shaping text of scripts such as
> Arabic and Hebrew.
>
>> For your interest I also attach a few screenshots as examples.
> I attach them below, since you didn't CC the bug address.
>




This bug report was last modified 5 years and 335 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.