GNU bug report logs - #5256
conjunct formation should follow input sequence when inserting text


Package: emacs

Reported by: Praveen A <pravi.a <at> gmail.com>

Date: Tue, 6 Oct 2009 18:55:07 UTC

Severity: normal

Done: Glenn Morris <rgm <at> gnu.org>

Bug is archived. No further changes may be made.

To add a comment to this bug, you must first unarchive it, by sending
a message to control AT debbugs.gnu.org, with unarchive 5256 in the body.
You can then email your comments to 5256 AT debbugs.gnu.org in the normal way.


Report forwarded to bug-submit-list <at> lists.donarmstrong.com, Emacs Bugs <bug-gnu-emacs <at> gnu.org>:
bug#5256; Package emacs. (Sun, 06 Sep 2009 17:00:03 GMT)

Message #3 received at submit <at> debbugs.gnu.org:

From: Praveen A <pravi.a <at> gmail.com>
To: bug-gnu-emacs <at> gnu.org
Cc: psatpute <at> redhat.com, Parag Nemade <pnemade <at> redhat.com>,
	suresh <surumafonts <at> gmail.com>
Subject: bug#5256: conjunct formation should follow input sequence when
	inserting text
Date: Mon, 21 Dec 2009 20:53:41 +0530
Example to illustrate this bug is അപ്‌ലോഡ് (upload).
>>> a=u'ലോഡ്'
>>> b=u'അപ്'
>>> c=u'അപ്‌ലോഡ്'
>>> print repr(c)
u'\u0d05\u0d2a\u0d4d\u200c\u0d32\u0d4b\u0d21\u0d4d'
>>> d=u'അപ്ലോ‌ഡ്'
>>> print repr(d)
u'\u0d05\u0d2a\u0d4d\u0d32\u0d4b\u200c\u0d21\u0d4d'
>>>

Here a ZWNJ (\u200c) is added to prevent formation of the conjunct 'pla' (\u0d2a\u0d4d\u0d32).

Enter ലോഡ് (\u0d32\u0d4b\u0d21\u0d4d) first, move the cursor to the
beginning of the word, then enter അപ് (\u0d05\u0d2a\u0d4d). A ZWNJ
entered now is placed not after \u0d4d but after the conjunct 'plo'
(\u0d2a\u0d4d\u0d32\u0d4b). gedit/pango behaves correctly:
the ZWNJ is inserted after \u0d4d, breaking the conjunct 'pla'
(\u0d2a\u0d4d\u0d32) as expected.
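
The two placements can be reproduced outside Emacs. What follows is a
minimal Python 3 sketch (the session above is Python 2). The names
ZWNJ, base, insert_at, expected and buggy are illustrative only and are
not taken from Emacs or pango code. It inserts the ZWNJ at the code
point offset right after \u0d4d and at the offset after 'plo', so the
results can be compared with the strings c and d shown above.

```python
# Code points of the word without any ZWNJ:
# a, pa, virama, la, vowel sign oo, dda, virama.
ZWNJ = "\u200c"
base = "\u0d05\u0d2a\u0d4d\u0d32\u0d4b\u0d21\u0d4d"


def insert_at(text, index, char):
    """Return text with char inserted before the code point at index."""
    return text[:index] + char + text[index:]


# Expected placement: directly after the virama (\u0d4d), offset 3,
# which breaks the conjunct 'pla'.
expected = insert_at(base, 3, ZWNJ)

# Placement reported for Emacs: after 'plo', offset 5.
buggy = insert_at(base, 5, ZWNJ)

print(ascii(expected))
print(ascii(buggy))
```

The first printed string should match repr(c) and the second repr(d),
apart from the Python 2 u prefix.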

GNU Emacs 23.1.90.1 (x86_64-unknown-linux-gnu, GTK+ Version 2.18.3)
 of 2009-12-18 on savannah
-- 
പ്രവീണ്‍ അരിമ്പ്രത്തൊടിയില്‍
<GPLv2> I know my rights; I want my phone call!
<DRM> What use is a phone call, if you are unable to speak?
(as seen on /.)
Join The DRM Elimination Crew Now!
http://fci.wikia.com/wiki/Anti-DRM-Campaign


Information forwarded to bug-submit-list <at> lists.donarmstrong.com, Emacs Bugs <bug-gnu-emacs <at> gnu.org>:
bug#5256; Package emacs. (Mon, 07 Sep 2009 21:35:04 GMT)

Message #6 received at 5256 <at> debbugs.gnu.org:

From: Kenichi Handa <handa <at> m17n.org>
To: Praveen A <pravi.a <at> gmail.com>, 5256 <at> debbugs.gnu.org
Cc: pravi.a <at> gmail.com, 5256 <at> debbugs.gnu.org
Subject: bug#5256: conjunct formation should follow input sequence when
	inserting text
Date: Tue, 22 Dec 2009 21:50:34 +0900
In article <3f2beab60912210723x4f424fa9l2b842824aff1a546 <at> mail.gmail.com>, Praveen A <pravi.a <at> gmail.com> writes:

> Example to illustrate this bug is അപ്‌ലോഡ് (upload).
>>>> a=u'ലോഡ്'
>>>> b=u'അപ്'
>>>> c=u'അപ്‌ലോഡ്'
>>>> print repr(c)
> u'\u0d05\u0d2a\u0d4d\u200c\u0d32\u0d4b\u0d21\u0d4d'
>>>> d=u'അപ്ലോ‌ഡ്'
>>>> print repr(d)
> u'\u0d05\u0d2a\u0d4d\u0d32\u0d4b\u200c\u0d21\u0d4d'
>>>> 

> Here ZWNJ is added to prevent formation of conjunct 'pla' (\u0d2a\u0d4d\u0d32).

> Enter ലോഡ് (\u0d32\u0d4b\u0d21\u0d4d) first, move cursor to the
> beginning of the word, now enter അപ് (\u0d05\u0d2a\u0d4d). Now ZWNJ
> entered will not be after 0d4d, but after the conjunct 'plo'
> (\u0d2a\u0d4d\u0d32\u0d4b). gedit/pango has the correct behavior,
> wherein the ZWNJ is inserted after 0d4d breaking the conjunct 'pla'
> (\u0d2a\u0d4d\u0d32) as expected.

Ah, I see your point.  It seems that we must set
disable-point-adjustment to t when a character/string is
inserted in a buffer.  I'll work on it.
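
The effect of point adjustment on this input can be modelled roughly
in Python 3. This is a toy model under stated assumptions, not the
Emacs implementation: snap_forward is a hypothetical stand-in for the
adjustment that moves point out of the middle of a composition, and
the cluster rule (a virama glues the following consonant, combining
marks attach to their base) is a simplification that ignores fonts and
ZWJ/ZWNJ handling.

```python
import unicodedata

VIRAMA = "\u0d4d"  # Malayalam sign virama
ZWNJ = "\u200c"    # zero width non-joiner


def inside_cluster(text, pos):
    """True if pos falls inside a cluster under the toy rule."""
    if pos <= 0 or pos >= len(text):
        return False
    # A combining mark always belongs to the preceding base character.
    if unicodedata.category(text[pos]) in ("Mn", "Mc", "Me"):
        return True
    # A virama glues the following character into a conjunct.
    return text[pos - 1] == VIRAMA


def snap_forward(text, pos):
    """Hypothetical point adjustment: move forward to a cluster boundary."""
    while inside_cluster(text, pos):
        pos += 1
    return pos


def type_prefix_then_zwnj(buffer, prefix, adjust_point):
    """Insert prefix at the start of buffer, then insert ZWNJ at point."""
    buffer = prefix + buffer
    point = len(prefix)  # point sits right after the inserted text
    if adjust_point:
        point = snap_forward(buffer, point)
    return buffer[:point] + ZWNJ + buffer[point:]


word = "\u0d32\u0d4b\u0d21\u0d4d"   # typed first
prefix = "\u0d05\u0d2a\u0d4d"       # typed at the beginning afterwards

with_adjustment = type_prefix_then_zwnj(word, prefix, adjust_point=True)
without_adjustment = type_prefix_then_zwnj(word, prefix, adjust_point=False)

print(ascii(with_adjustment))
print(ascii(without_adjustment))
```

In this model, adjusting point reproduces the reported sequence (d),
while leaving point where the insertion ended gives the expected
sequence (c).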

---
Kenichi Handa
handa <at> m17n.org


Message #9 received at 5256 <at> debbugs.gnu.org:

From: Kenichi Handa <handa <at> m17n.org>
To: Praveen A <pravi.a <at> gmail.com>, 5256 <at> debbugs.gnu.org
Cc: pravi.a <at> gmail.com, 5256 <at> debbugs.gnu.org
Subject: bug#5256: conjunct formation should follow input sequence when
	inserting text
Date: Fri, 25 Dec 2009 10:20:34 +0900
In article <3f2beab60912210723x4f424fa9l2b842824aff1a546 <at> mail.gmail.com>, Praveen A <pravi.a <at> gmail.com> writes:

> Example to illustrate this bug is അപ്‌ലോഡ് (upload).
>>>> a=u'ലോഡ്'
>>>> b=u'അപ്'
>>>> c=u'അപ്‌ലോഡ്'

I have one question.  Emacs tries hard to find a grapheme
boundary on which to put the cursor.  For instance, in string (c)
above, when you put the cursor on the second character പ
(U+0D2A) and type C-f, the cursor moves to ലോ (U+0D32 U+0D4B),
and the next C-f moves the cursor to ഡ (U+0D21).

But, in gedit, "പ്‌ലോ" is treated as one cluster and you
can't put cursor between "പ്‌" and "ലോ".

Which is more convenient for Malayalam users?
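
The cursor stops described for Emacs can be approximated in Python 3
with a deliberately naive rule: every character starts a new cluster
unless it is a combining mark (general category Mn, Mc or Me) or a
format character such as ZWNJ (Cf). This rule is an assumption made
for illustration, not the algorithm Emacs or pango uses, and
naive_clusters is a hypothetical helper.

```python
import unicodedata

# Categories that attach to the preceding character in this sketch.
ATTACHING = ("Mn", "Mc", "Me", "Cf")


def naive_clusters(text):
    """Split text into clusters using the naive rule described above."""
    clusters = []
    for ch in text:
        if clusters and unicodedata.category(ch) in ATTACHING:
            clusters[-1] += ch   # extend the current cluster
        else:
            clusters.append(ch)  # start a new cluster
    return clusters


# String (c) from the report, written with escapes so the ZWNJ is visible.
c = "\u0d05\u0d2a\u0d4d\u200c\u0d32\u0d4b\u0d21\u0d4d"
clusters = naive_clusters(c)

for cluster in clusters:
    print(ascii(cluster))
```

Under this rule the cluster starts fall before U+0D2A, U+0D32 and
U+0D21, which are the cursor positions listed for Emacs. The gedit
behaviour would correspond to merging the second and third clusters
into one.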

---
Kenichi Handa
handa <at> m17n.org


Message #12 received at 5256 <at> debbugs.gnu.org:

From: Kenichi Handa <handa <at> m17n.org>
To: 5256 <at> debbugs.gnu.org
Cc: pravi.a <at> gmail.com
Subject: bug#5256: conjunct formation should follow input sequence when
	inserting text
Date: Fri, 25 Dec 2009 11:48:13 +0900
In article <tl7fx73dvfp.fsf <at> m17n.org>, Kenichi Handa <handa <at> m17n.org> writes:

> In article <3f2beab60912210723x4f424fa9l2b842824aff1a546 <at> mail.gmail.com>, Praveen A <pravi.a <at> gmail.com> writes:
> > Example to illustrate this bug is അപ്‌ലോഡ് (upload).
>>>>> a=u'ലോഡ്'
>>>>> b=u'അപ്'
>>>>> c=u'അപ്‌ലോഡ്'
>>>>> print repr(c)
> > u'\u0d05\u0d2a\u0d4d\u200c\u0d32\u0d4b\u0d21\u0d4d'
>>>>> d=u'അപ്ലോ‌ഡ്'
>>>>> print repr(d)
> > u'\u0d05\u0d2a\u0d4d\u0d32\u0d4b\u200c\u0d21\u0d4d'
>>>>> 

> > Here ZWNJ is added to prevent formation of conjunct 'pla' (\u0d2a\u0d4d\u0d32).

> > Enter ലോഡ് (\u0d32\u0d4b\u0d21\u0d4d) first, move cursor to the
> > beginning of the word, now enter അപ് (\u0d05\u0d2a\u0d4d). Now ZWNJ
> > entered will not be after 0d4d, but after the conjunct 'plo'
> > (\u0d2a\u0d4d\u0d32\u0d4b). gedit/pango has the correct behavior,
> > wherein the ZWNJ is inserted after 0d4d breaking the conjunct 'pla'
> > (\u0d2a\u0d4d\u0d32) as expected.

> Ah, I see your point.  It seems that we must set
> disable-point-adjustment to t when a character/string is
> inserted in a buffer.  I'll work on it.

I fixed it.  Please try again with the latest code.

---
Kenichi Handa
handa <at> m17n.org


Message #15 received at 5256 <at> debbugs.gnu.org:

From: YAMAMOTO Mitsuharu <mituharu <at> math.s.chiba-u.ac.jp>
To: Kenichi Handa <handa <at> m17n.org>,
	5256 <at> debbugs.gnu.org
Subject: bug#5256: conjunct formation should follow input sequence
	when	inserting text
Date: Fri, 25 Dec 2009 13:00:44 +0900
>>>>> On Fri, 25 Dec 2009 11:48:13 +0900, Kenichi Handa <handa <at> m17n.org> said:

>> Ah, I see your point.  It seems that we must set
>> disable-point-adjustment to t when a character/string is inserted
>> in a buffer.  I'll work on it.

> I fixed it.  Please try again with the latest code.

It seems to crash on undo.

Steps to reproduce:

  1. emacs -Q
  2. C-x u

(gdb) bt
#0  0x9266be42 in __kill ()
#1  0x9266be34 in kill$UNIX2003 ()
#2  0x926de23a in raise ()
#3  0x926ea679 in abort ()
#4  0x0022bd07 in buf_charpos_to_bytepos (b=0xd01678, charpos=192) at .../emacs/src/marker.c:141
#5  0x003a5f27 in find_automatic_composition (pos=192, limit=-1, start=0xbffff414, end=0xbffff410, gstring=0xbffff40c, string=33554442) at .../emacs/src/composite.c:1310
#6  0x003a9ed1 in composition_adjust_point (last_pt=192, new_pt=192) at .../emacs/src/composite.c:1485
#7  0x001cbbaa in command_loop_1 () at .../emacs/src/keyboard.c:1981
#8  0x002c727e in internal_condition_case (bfun=0x1c713d <command_loop_1>, handlers=33588914, hfun=0x1c64f4 <cmd_error>) at .../emacs/src/eval.c:1490
#9  0x001c6bd6 in command_loop_2 () at .../emacs/src/keyboard.c:1360
#10 0x002c6b0e in internal_catch (tag=33586058, func=0x1c6ba1 <command_loop_2>, arg=33554442) at .../emacs/src/eval.c:1226
#11 0x001c6b5f in command_loop () at .../emacs/src/keyboard.c:1339
#12 0x001c5916 in recursive_edit_1 () at .../emacs/src/keyboard.c:954
#13 0x001c5fc2 in Frecursive_edit () at .../emacs/src/keyboard.c:1016
#14 0x001c3598 in main (argc=2, argv=0xbffff958) at .../emacs/src/emacs.c:1833

				     YAMAMOTO Mitsuharu
				mituharu <at> math.s.chiba-u.ac.jp

In GNU Emacs 23.1.90.1 (i386-apple-darwin9.8.0, X toolkit, Xaw3d scroll bars)
 of 2009-12-25 on yamamoto-mitsuharu-no-mac-mini.local
Windowing system distributor `The X.Org Foundation', version 11.0.10402000
configured using `configure  '--with-x' '--without-gif' '--without-jpeg' '--without-tiff' 'CFLAGS=-g -Wno-pointer-sign -DENABLE_CHECKING''

Important settings:
  value of $LC_ALL: nil
  value of $LC_COLLATE: nil
  value of $LC_CTYPE: nil
  value of $LC_MESSAGES: nil
  value of $LC_MONETARY: nil
  value of $LC_NUMERIC: nil
  value of $LC_TIME: nil
  value of $LANG: ja_JP.UTF-8
  value of $XMODIFIERS: nil
  locale-coding-system: utf-8-unix
  default enable-multibyte-characters: t

Major mode: Lisp Interaction

Minor modes in effect:
  tooltip-mode: t
  mouse-wheel-mode: t
  tool-bar-mode: t
  menu-bar-mode: t
  file-name-shadow-mode: t
  blink-cursor-mode: t
  global-auto-composition-mode: t
  auto-composition-mode: t
  auto-encryption-mode: t
  auto-compression-mode: t
  line-number-mode: t
  transient-mark-mode: t


Message #18 received at 5256 <at> debbugs.gnu.org:

From: Kenichi Handa <handa <at> m17n.org>
To: YAMAMOTO Mitsuharu <mituharu <at> math.s.chiba-u.ac.jp>
Cc: 5256 <at> debbugs.gnu.org
Subject: bug#5256: conjunct formation should follow input sequence
	when	inserting text
Date: Fri, 25 Dec 2009 14:27:40 +0900
In article <wl7hsbhfdf.wl%mituharu <at> math.s.chiba-u.ac.jp>, YAMAMOTO Mitsuharu <mituharu <at> math.s.chiba-u.ac.jp> writes:

>>>>>> On Fri, 25 Dec 2009 11:48:13 +0900, Kenichi Handa <handa <at> m17n.org> said:
>>> Ah, I see your point.  It seems that we must set
>>> disable-point-adjustment to t when a character/string is inserted
>>> in a buffer.  I'll work on it.

> > I fixed it.  Please try again with the latest code.

> It seems to crash on undo.

Sorry, I've just installed a fix.

---
Kenichi Handa
handa <at> m17n.org


Message #21 received at 5256 <at> debbugs.gnu.org:

From: Praveen A <pravi.a <at> gmail.com>
To: Kenichi Handa <handa <at> m17n.org>
Cc: 5256 <at> debbugs.gnu.org
Subject: bug#5256: conjunct formation should follow input sequence when
	inserting text
Date: Sat, 26 Dec 2009 23:22:48 +0530
2009/12/25 Kenichi Handa <handa <at> m17n.org>:
> I have one question.  Emacs tries hard to find a grapheme
> boundary on which to put the cursor.  For instance, in string (c)
> above, when you put the cursor on the second character പ
> (U+0D2A) and type C-f, the cursor moves to ലോ (U+0D32 U+0D4B),
> and the next C-f moves the cursor to ഡ (U+0D21).
>
> But, in gedit, "പ്‌ലോ" is treated as one cluster and you
> can't put cursor between "പ്‌" and "ലോ".
>
> Which is more convenient for Malayalam users?

The Emacs way is more convenient. I will test the latest code by Monday. Thanks again.

- Praveen


Reply sent to Glenn Morris <rgm <at> gnu.org>:
You have taken responsibility. (Wed, 13 Jan 2010 00:45:02 GMT)

Notification sent to Praveen A <pravi.a <at> gmail.com>:
bug acknowledged by developer. (Wed, 13 Jan 2010 00:45:02 GMT)

Message #26 received at 5256-done <at> debbugs.gnu.org:

From: Glenn Morris <rgm <at> gnu.org>
To: 5256-done <at> debbugs.gnu.org
Subject: Re: Bug#5256
Date: Tue, 12 Jan 2010 19:44:03 -0500
This bug was closed recently, but unfortunately the close message was
lost from the Emacs bug database. I am therefore resending the close
message. Sorry for the duplicate mail. The original close message was

http://lists.gnu.org/archive/html/bug-gnu-emacs/2009-12/msg00684.html

    From: Praveen A <pravi.a <at> gmail.com>
    Subject: Re: bug#5256: conjunct formation should follow input sequence when inserting text
    To: Kenichi Handa <handa <at> m17n.org>
    Cc: 5256-done <at> debbugs.gnu.org
    Date: Mon, 28 Dec 2009 15:08:20 +0530
    X-Sent: 2 weeks, 1 day, 14 hours, 56 minutes, 55 seconds ago
    
    2009/12/25 Kenichi Handa <handa <at> m17n.org>:
    > I fixed it.  Please try again with the latest code.
    
    Thanks. I tested it and it is working beautifully.
    
    - Praveen




bug archived. Request was from Debbugs Internal Request <bug-gnu-emacs <at> gnu.org> to internal_control <at> debbugs.gnu.org. (Wed, 10 Feb 2010 12:24:04 GMT)
