GNU bug report logs - #5226
Malayalam conjunct formation breaks down at the end of long words

Previous Next

Package: emacs;

Reported by: Praveen A <pravi.a <at> gmail.com>

Date: Wed, 16 Dec 2009 11:35:05 UTC

Severity: normal

Done: Praveen A <pravi.a <at> gmail.com>

Bug is archived. No further changes may be made.

To add a comment to this bug, you must first unarchive it, by sending
a message to control AT debbugs.gnu.org, with unarchive 5226 in the body.
You can then email your comments to 5226 AT debbugs.gnu.org in the normal way.

Toggle the display of automated, internal messages from the tracker.

View this report as an mbox folder, status mbox, maintainer mbox


Report forwarded to bug-submit-list <at> lists.donarmstrong.com, Emacs Bugs <bug-gnu-emacs <at> gnu.org>:
bug#5226; Package emacs. (Wed, 16 Dec 2009 11:35:05 GMT) Full text and rfc822 format available.

Acknowledgement sent to Praveen A <pravi.a <at> gmail.com>:
New bug report received and forwarded. Copy sent to Emacs Bugs <bug-gnu-emacs <at> gnu.org>. (Wed, 16 Dec 2009 11:35:05 GMT) Full text and rfc822 format available.

Message #5 received at submit <at> emacsbugs.donarmstrong.com (full text, mbox):

From: Praveen A <pravi.a <at> gmail.com>
To: bug-gnu-emacs <at> gnu.org
Cc: Parag Nemade <pnemade <at> redhat.com>, psatpute <at> redhat.com,
        suresh <surumafonts <at> gmail.com>
Subject: Malayalam conjunct formation breaks down at the end of long words
Date: Wed, 16 Dec 2009 16:58:16 +0530
Hi,

Many Malayalam conjuncts at the end of long words are broken down to
its basic forms.

Example 1: നക്ഷത്രാവശിഷ്ടങ്ങളും or
'\u0d28\u0d15\u0d4d\u0d37\u0d24\u0d4d\u0d30\u0d3e\u0d35\u0d36\u0d3f\u0d37\u0d4d\u0d1f\u0d19\u0d4d\u0d19\u0d33\u0d41\u0d02'
(ങ്ങ is broken into ങ ് ങ)
Example2: എനിയ്ക്കിഷ്ടായില്ല or
'\u0d0e\u0d28\u0d3f\u0d2f\u0d4d\u0d15\u0d4d\u0d15\u0d3f\u0d37\u0d4d\u0d1f\u0d3e\u0d2f\u0d3f\u0d32\u0d4d\u0d32'
(ല്ല is broken into ല ് ല)

To reproduce this bug: Copy the above example to a buffer and observe
the deference in rendering when a space is added in the middle of the
word.

This is tested with CVS version of emacs with most recent Indic rendering fixes.

- Praveen
-- 
പ്രവീണ്‍ അരിമ്പ്രത്തൊടിയില്‍
<GPLv2> I know my rights; I want my phone call!
<DRM> What use is a phone call, if you are unable to speak?
(as seen on /.)
Join The DRM Elimination Crew Now!
http://fci.wikia.com/wiki/Anti-DRM-Campaign

Information forwarded to bug-submit-list <at> lists.donarmstrong.com, Emacs Bugs <bug-gnu-emacs <at> gnu.org>:
bug#5226; Package emacs. (Wed, 16 Dec 2009 12:00:06 GMT) Full text and rfc822 format available.

Acknowledgement sent to Praveen A <pravi.a <at> gmail.com>:
Extra info received and forwarded to list. Copy sent to Emacs Bugs <bug-gnu-emacs <at> gnu.org>. (Wed, 16 Dec 2009 12:00:07 GMT) Full text and rfc822 format available.

Message #10 received at 5226 <at> emacsbugs.donarmstrong.com (full text, mbox):

From: Praveen A <pravi.a <at> gmail.com>
To: bug-gnu-emacs <at> gnu.org, 5226 <at> debbugs.gnu.org
Cc: Parag Nemade <pnemade <at> redhat.com>, psatpute <at> redhat.com,
        suresh <surumafonts <at> gmail.com>
Subject: Re: Malayalam conjunct formation breaks down at the end of long words
Date: Wed, 16 Dec 2009 17:22:32 +0530
2009/12/16 Praveen A <pravi.a <at> gmail.com>:
> Hi,
>
> Many Malayalam conjuncts at the end of long words are broken down to
> its basic forms.

Screen capture video of the bug
http://blip.tv/dashboard/episode/2999711

- Praveen
-- 
പ്രവീണ്‍ അരിമ്പ്രത്തൊടിയില്‍
<GPLv2> I know my rights; I want my phone call!
<DRM> What use is a phone call, if you are unable to speak?
(as seen on /.)
Join The DRM Elimination Crew Now!
http://fci.wikia.com/wiki/Anti-DRM-Campaign



Merged 5226 5227. Request was from Praveen A <pravi.a <at> gmail.com> to control <at> emacsbugs.donarmstrong.com. (Wed, 16 Dec 2009 12:20:04 GMT) Full text and rfc822 format available.

Information forwarded to owner <at> debbugs.gnu.org, bug-gnu-emacs <at> gnu.org:
bug#5226; Package emacs. (Fri, 18 Dec 2009 04:46:02 GMT) Full text and rfc822 format available.

Message #15 received at 5226 <at> debbugs.gnu.org (full text, mbox):

From: Praveen A <pravi.a <at> gmail.com>
To: Kenichi Handa <handa <at> m17n.org>
Cc: 5226 <at> debbugs.gnu.org, bug-gnu-emacs <at> gnu.org, pnemade <at> redhat.com,
	psatpute <at> redhat.com, surumafonts <at> gmail.com
Subject: Re: bug#5226: Malayalam conjunct formation breaks down at the end of 
	long words
Date: Thu, 17 Dec 2009 20:21:50 +0530
2009/12/17 Kenichi Handa <handa <at> m17n.org>:
> Thank you for them.  But the log shows that the Malayalam
> rendering is not using the latest Emacs, at least, it seems
> that the latest change for malayalam-composable-pattern is
> not working.
>
> Please show me the result of this:
>
> (aref composition-function-table #xD15)
>
> Mine is this:
>
> (["\\([ക-ഹ]‌?്‌?\\)*[ക-ഹ]\\(്[‍‌]?\\|[െ-ൈൊ-ൌ]?[ൢ-ൣ]?[ാ-ൄൗ]?[ം-ഃ]?\\)\\|[അ-ഔൠ-ൡ]\\(‌?്[ക-ഹ]\\)?[െ-ൈൊ-ൌ]?[ൢ-ൣ]?[ാ-ൄൗ]?[ം-ഃ]?\\|‌്[ക-ഹ]\\|[ഀ-ൿ]" 0 font-shape-gstring])

I copied (aref composition-function-table #xD15) to the Lisp
Interaction buffer and typed c-x c-e. Should I do it any differently?

This is what I get
(["[ഀ-ൿ‌‍]+" 0 font-shape-gstring])

- Praveen
-- 
പ്രവീണ്‍ അരിമ്പ്രത്തൊടിയില്‍
<GPLv2> I know my rights; I want my phone call!
<DRM> What use is a phone call, if you are unable to speak?
(as seen on /.)
Join The DRM Elimination Crew Now!
http://fci.wikia.com/wiki/Anti-DRM-Campaign

Information forwarded to owner <at> debbugs.gnu.org, bug-gnu-emacs <at> gnu.org:
bug#5226; Package emacs. (Fri, 18 Dec 2009 07:29:01 GMT) Full text and rfc822 format available.

Message #18 received at 5226 <at> debbugs.gnu.org (full text, mbox):

From: Kenichi Handa <handa <at> m17n.org>
To: Praveen A <pravi.a <at> gmail.com>
Cc: 5226 <at> debbugs.gnu.org, bug-gnu-emacs <at> gnu.org, pnemade <at> redhat.com,
	psatpute <at> redhat.com, surumafonts <at> gmail.com
Subject: Re: bug#5226: Malayalam conjunct formation breaks down at the end of 
	long words
Date: Fri, 18 Dec 2009 09:44:35 +0900
In article <3f2beab60912170651l6e87b7ddxeffb4ff4b08d58a4 <at> mail.gmail.com>, Praveen A <pravi.a <at> gmail.com> writes:

> I copied (aref composition-function-table #xD15) to the Lisp
> Interaction buffer and typed c-x c-e. Should I do it any differently?

No, that is the right way.

> This is what I get
> (["[ഀ-ൿ‌‍]+" 0 font-shape-gstring])

Then, you are not using the latest CVS code.  At least,
lisp/language/indian.el is not up-to-date or not
byte-compiled when you build your emacs.  Please check it.

---
Kenichi Handa
handa <at> m17n.org




Information forwarded to owner <at> debbugs.gnu.org, bug-gnu-emacs <at> gnu.org:
bug#5226; Package emacs. (Fri, 18 Dec 2009 07:29:02 GMT) Full text and rfc822 format available.

Message #21 received at 5226 <at> debbugs.gnu.org (full text, mbox):

From: Kenichi Handa <handa <at> m17n.org>
To: Praveen A <pravi.a <at> gmail.com>
Cc: 5226 <at> debbugs.gnu.org, bug-gnu-emacs <at> gnu.org, pnemade <at> redhat.com,
	psatpute <at> redhat.com, surumafonts <at> gmail.com
Subject: Re: bug#5226: Malayalam conjunct formation breaks down at the end of 
	long words
Date: Thu, 17 Dec 2009 21:56:47 +0900
In article <3f2beab60912162328lb9a4bb1t15f5cd828b67b257 <at> mail.gmail.com>, Praveen A <pravi.a <at> gmail.com> writes:

> > Please try this and show me the result of "log1" and "log2" files.
> >
> > % MDEBUG_FLT=3D3 emacs NO_MIDDLE_SPACE.txt >& log1
> > % MDEBUG_FLT=3D3 emacs WITH_MIDDLE_SPACE.txt >& log2

> log files are attached.

Thank you for them.  But the log shows that the Malayalam
rendering is not using the latest Emacs, at least, it seems
that the latest change for malayalam-composable-pattern is
not working.

Please show me the result of this:

(aref composition-function-table #xD15)

Mine is this:

(["\\([ക-ഹ]‌?്‌?\\)*[ക-ഹ]\\(്[‍‌]?\\|[െ-ൈൊ-ൌ]?[ൢ-ൣ]?[ാ-ൄൗ]?[ം-ഃ]?\\)\\|[അ-ഔൠ-ൡ]\\(‌?്[ക-ഹ]\\)?[െ-ൈൊ-ൌ]?[ൢ-ൣ]?[ാ-ൄൗ]?[ം-ഃ]?\\|‌്[ക-ഹ]\\|[ഀ-ൿ]" 0 font-shape-gstring])

---
Kenichi Handa
handa <at> m17n.org




Information forwarded to owner <at> debbugs.gnu.org, bug-gnu-emacs <at> gnu.org:
bug#5226; Package emacs. (Fri, 18 Dec 2009 07:29:02 GMT) Full text and rfc822 format available.

Message #24 received at 5226 <at> debbugs.gnu.org (full text, mbox):

From: Kenichi Handa <handa <at> m17n.org>
To: Praveen A <pravi.a <at> gmail.com>, 5226 <at> debbugs.gnu.org
Cc: bug-gnu-emacs <at> gnu.org, pnemade <at> redhat.com, psatpute <at> redhat.com,
	surumafonts <at> gmail.com
Subject: Re: bug#5226: Malayalam conjunct formation breaks down at the end of
	long words
Date: Thu, 17 Dec 2009 15:54:32 +0900
In article <3f2beab60912160328y1084addbw7555543aaf2fcf0c <at> mail.gmail.com>, Praveen A <pravi.a <at> gmail.com> writes:

> Many Malayalam conjuncts at the end of long words are broken down to
> its basic forms.

> Example 1: നക്ഷത്രാവശിഷ്ടങ്ങളും or
> '\u0d28\u0d15\u0d4d\u0d37\u0d24\u0d4d\u0d30\u0d3e\u0d35\u0d36\u0d3f\u0d37\u0d4d\u0d1f\u0d19\u0d4d\u0d19\u0d33\u0d41\u0d02'
> (ങ്ങ is broken into ങ ് ങ)
> Example2: എനിയ്ക്കിഷ്ടായില്ല or
> '\u0d0e\u0d28\u0d3f\u0d2f\u0d4d\u0d15\u0d4d\u0d15\u0d3f\u0d37\u0d4d\u0d1f\u0d3e\u0d2f\u0d3f\u0d32\u0d4d\u0d32'
> (ല്ല is broken into ല ് ല)

> To reproduce this bug: Copy the above example to a buffer and observe
> the deference in rendering when a space is added in the middle of the
> word.

> This is tested with CVS version of emacs with most recent
> Indic rendering fixes.

Again, I cant' reproduce it.  Sigh...

Which font is it?

Please try this and show me the result of "log1" and "log2" files.

% MDEBUG_FLT=3 emacs NO_MIDDLE_SPACE.txt >& log1
% MDEBUG_FLT=3 emacs WITH_MIDDLE_SPACE.txt >& log2

Here NO_MIDDLE_SPACE.txt is a file containing only this line:
എനിയ്ക്കിഷ്ടായില്ല
and WITH_MIDDLE_SPACE.txt is a file containing only this line:
എനിയ്ക്കിഷ്ടായി ല്ല

> Screen capture video of the bug
> http://blip.tv/dashboard/episode/2999711

When I visit that page, login is requested.  What username
and password to use?

> ---
Kenichi Handa
handa <at> m17n.org




Information forwarded to owner <at> debbugs.gnu.org, bug-gnu-emacs <at> gnu.org:
bug#5226; Package emacs. (Fri, 18 Dec 2009 07:54:01 GMT) Full text and rfc822 format available.

Message #27 received at 5226 <at> debbugs.gnu.org (full text, mbox):

From: Praveen A <pravi.a <at> gmail.com>
To: Kenichi Handa <handa <at> m17n.org>
Cc: 5226 <at> debbugs.gnu.org, bug-gnu-emacs <at> gnu.org, pnemade <at> redhat.com,
	psatpute <at> redhat.com, surumafonts <at> gmail.com
Subject: Re: bug#5226: Malayalam conjunct formation breaks down at the end of 
	long words
Date: Thu, 17 Dec 2009 12:58:19 +0530
[Message part 1 (text/plain, inline)]
2009/12/17 Kenichi Handa <handa <at> m17n.org>:
> Again, I cant' reproduce it.  Sigh...

:-(

> Which font is it?

This bug is present with all fonts (Meera, Rachana, Lohit Malayalam...)

> Please try this and show me the result of "log1" and "log2" files.
>
> % MDEBUG_FLT=3 emacs NO_MIDDLE_SPACE.txt >& log1
> % MDEBUG_FLT=3 emacs WITH_MIDDLE_SPACE.txt >& log2

log files are attached.

>> Screen capture video of the bug
>> http://blip.tv/dashboard/episode/2999711
>
> When I visit that page, login is requested.  What username
> and password to use?

Sorry about that, the link I had given was not public. The public link
is http://blip.tv/file/2980014

I have also uploaded another screencast which shows the rendering
change when text is selected.

http://blip.tv/file/2984156

-Praveen
-- 
പ്രവീണ്‍ അരിമ്പ്രത്തൊടിയില്‍
<GPLv2> I know my rights; I want my phone call!
<DRM> What use is a phone call, if you are unable to speak?
(as seen on /.)
Join The DRM Elimination Crew Now!
http://fci.wikia.com/wiki/Anti-DRM-Campaign
[log1 (application/octet-stream, attachment)]
[log2 (application/octet-stream, attachment)]

Information forwarded to owner <at> debbugs.gnu.org, bug-gnu-emacs <at> gnu.org:
bug#5226; Package emacs. (Fri, 18 Dec 2009 09:17:01 GMT) Full text and rfc822 format available.

Message #30 received at 5226 <at> debbugs.gnu.org (full text, mbox):

From: Praveen A <pravi.a <at> gmail.com>
To: Kenichi Handa <handa <at> m17n.org>
Cc: 5226 <at> debbugs.gnu.org, bug-gnu-emacs <at> gnu.org, pnemade <at> redhat.com,
	psatpute <at> redhat.com, surumafonts <at> gmail.com
Subject: Re: bug#5226: Malayalam conjunct formation breaks down at the end of 
	long words
Date: Fri, 18 Dec 2009 14:46:47 +0530
2009/12/18 Kenichi Handa <handa <at> m17n.org>:
> Then, you are not using the latest CVS code.  At least,
> lisp/language/indian.el is not up-to-date or not
> byte-compiled when you build your emacs.  Please check it.

I did a uninstall, distclean, cvs update and rebuilt emacs.
Also I removed the code needed for handling ZWJ from my .emacs file.

Now I'm getting this.
(["\\([ക-ഹ]‌?്‌?\\)*[ക-ഹ]\\(്[‍‌]?\\|[െ-ൈൊ-ൌ]?[ൢ-ൣ]?[ാ-ൄൗ]?[ം-ഃ]?\\)\\|[അ-ഔൠ-ൡ]\\(‌?്[ക-ഹ]\\)?[െ-ൈൊ-ൌ]?[ൢ-ൣ]?[ാ-ൄൗ]?[ം-ഃ]?\\|‌്[ക-ഹ]\\|[ഀ-ൿ]"
0 font-shape-gstring])

Now it is working beautifully. Thanks a lot.
Praveen
-- 
പ്രവീണ്‍ അരിമ്പ്രത്തൊടിയില്‍
<GPLv2> I know my rights; I want my phone call!
<DRM> What use is a phone call, if you are unable to speak?
(as seen on /.)
Join The DRM Elimination Crew Now!
http://fci.wikia.com/wiki/Anti-DRM-Campaign

bug closed, send any further explanations to Praveen A <pravi.a <at> gmail.com> Request was from Praveen A <pravi.a <at> gmail.com> to control <at> debbugs.gnu.org. (Fri, 18 Dec 2009 09:19:02 GMT) Full text and rfc822 format available.

bug archived. Request was from Debbugs Internal Request <bug-gnu-emacs <at> gnu.org> to internal_control <at> debbugs.gnu.org. (Fri, 15 Jan 2010 12:24:04 GMT) Full text and rfc822 format available.

bug unarchived. Request was from Glenn Morris <rgm <at> gnu.org> to control <at> debbugs.gnu.org. (Thu, 28 Jan 2010 00:14:02 GMT) Full text and rfc822 format available.

Disconnected #5227 from all other report(s). Request was from Glenn Morris <rgm <at> gnu.org> to control <at> debbugs.gnu.org. (Thu, 28 Jan 2010 00:14:02 GMT) Full text and rfc822 format available.

bug archived. Request was from Debbugs Internal Request <bug-gnu-emacs <at> gnu.org> to internal_control <at> debbugs.gnu.org. (Thu, 25 Feb 2010 12:24:04 GMT) Full text and rfc822 format available.

This bug report was last modified 15 years and 112 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.