GNU bug report logs -
#540
23.0.60; Unicode search bug
Reported by: Juri Linkov <juri <at> jurta.org>
Date: Sun, 6 Jul 2008 18:55:05 UTC
Severity: normal
Done: Chong Yidong <cyd <at> stupidchicken.com>
Bug is archived. No further changes may be made.
[Message part 1 (text/plain, inline)]
Your message dated Wed, 27 Aug 2008 10:34:40 -0400
with message-id <87wsi2a5mn.fsf <at> cyd.mit.edu>
and subject line Re: bug#540: 23.0.60; Unicode search bug
has caused the Emacs bug report #540,
regarding 23.0.60; Unicode search bug
to be marked as done.
This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
bug report if necessary, and/or fix the problem forthwith.
(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact don <at> donarmstrong.com
immediately.)
--
540: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=540
Emacs Bug Tracking System
Contact help-debbugs <at> gnu.org with problems
[Message part 2 (message/rfc822, inline)]
There is a weird bug in searching Unicode text. The search function
fails on Cyrillic letters between codepoints #x0400 and #x041f, but
successfully finds a Cyrillic letter between #x0420 and #x042f.
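Editor's note: the two letters used in the test case below sit on either side of the boundary described above. The following is a minimal sketch in Python (not Emacs code) that only checks the codepoints against the ranges stated in the report. The helper name `classify` and the range constants are illustrative assumptions, and it says nothing about why the search fails.

```python
import unicodedata

# Inclusive ranges exactly as stated in the report.
FAILING = range(0x0400, 0x0420)  # #x0400..#x041f: search reported to fail
WORKING = range(0x0420, 0x0430)  # #x0420..#x042f: search reported to succeed


def classify(ch):
    """Say which of the reported ranges a character's codepoint falls in."""
    cp = ord(ch)
    if cp in FAILING:
        return "reported failing"
    if cp in WORKING:
        return "reported working"
    return "outside reported ranges"


# The two letters from the reproduction recipe.
for ch in "ПР":
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)}: {classify(ch)}")
```

The two letters are adjacent codepoints (U+041F and U+0420), so the recipe exercises the last failing and the first working character.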
I tried to debug this and found that the failing search calls
`boyer_moore', while the successful search calls `simple_search'.
I checked the Unicode properties, but everything seems correct.
This bug didn't exist before the Unicode merge.
The easiest way to reproduce it: run `emacs -Q',
put in the *scratch* buffer the following 4 lines
(note the leading space):
(search-forward " П" nil t)
(search-forward " Р" nil t)
 П
 Р
and type `C-x C-e' after each of the first two lines.
In GNU Emacs 23.0.60 (x86_64-pc-linux-gnu)
Important settings:
value of $LC_ALL: nil
value of $LC_COLLATE: nil
value of $LC_CTYPE: nil
value of $LC_MESSAGES: nil
value of $LC_MONETARY: nil
value of $LC_NUMERIC: nil
value of $LC_TIME: nil
value of $LANG: en_US.UTF-8
value of $XMODIFIERS: nil
locale-coding-system: utf-8-unix
default-enable-multibyte-characters: t
--
Juri Linkov
http://www.jurta.org/emacs/
[Message part 3 (message/rfc822, inline)]
Andreas Schwab <schwab <at> suse.de> writes:
> Should be fixed now.
Thanks!
This bug report was last modified 15 years and 246 days ago.