GNU bug report logs - #13041
24.2; diacritic-fold-search

Package: emacs

Reported by: perin <at> acm.org

Date: Fri, 30 Nov 2012 18:31:02 UTC

Severity: wishlist

Found in version 24.2

Fixed in version 25.1

Done: Michael Albinus <michael.albinus <at> gmx.de>

Bug is archived. No further changes may be made.


Message #164 received at 13041 <at> debbugs.gnu.org (full text, mbox):

From: "Drew Adams" <drew.adams <at> oracle.com>
To: "'martin rudalics'" <rudalics <at> gmx.at>
Cc: perin <at> acm.org, 13041 <at> debbugs.gnu.org, perin <at> panix.com
Subject: RE: bug#13041: 24.2; diacritic-fold-search
Date: Thu, 6 Dec 2012 07:59:59 -0800
>  > We are using compatibility normalization, not canonical 
>  > normalization.  So a search (or a string comparison test)
>  > for `f' will match the ligature `ffi'
>  > (whereas it would not match wrt canonical normalization).
> 
> If it can be done, searching for "f" should match ligatures like "ff"
> and "fi".

That's what I thought you were planning/preparing to do.

On the other hand, as the Unicode spec points out (for level 2), sometimes
someone wants to distinguish searching for f from searching for the ligature.
Ideally (we might never get there), that would be possible as an alternative
(choice).
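
To make the distinction concrete, here is a minimal sketch in Python (not Emacs code, and not how Emacs search is implemented) using the standard `unicodedata` module.  The helper names `fold` and `contains` are made up for illustration.  It only shows that compatibility decomposition (NFKD) expands the `ffi` ligature into plain letters, while canonical decomposition (NFD) leaves it alone:

```python
import unicodedata

LIGATURE_FFI = "\ufb03"  # LATIN SMALL LIGATURE FFI


def fold(text, form):
    """Normalize TEXT with the given Unicode normalization form."""
    return unicodedata.normalize(form, text)


def contains(needle, haystack, form):
    """Substring test after normalizing both strings the same way."""
    return fold(needle, form) in fold(haystack, form)


word = "o" + LIGATURE_FFI + "ce"  # "office" spelled with the ligature

# Compatibility normalization: the ligature becomes "f" "f" "i".
compat_hit = contains("f", word, "NFKD")

# Canonical normalization: the ligature stays a single character.
canon_hit = contains("f", word, "NFD")

print(compat_hit, canon_hit)
```

Choosing the normalization form per search is one way the "alternative (choice)" could be exposed to a user.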

The spec also points to hybrid situations regarding case conversion (see section
RL2.4) where, e.g., you might want to do full case matching on ß in a literal
name such as Strauß but simple case folding on ß when used in a character class,
such as [ß].  Dunno whether we would ever get there either.
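
A rough illustration of the difference, again as a Python sketch rather than anything Emacs does.  `str.casefold()` applies full case folding; Python has no built-in simple case folding, so `str.lower()` is used here only as an approximation of it (an assumption that happens to hold for ß, which has no single-character folding).  The function names are hypothetical:

```python
def full_fold_equal(a, b):
    """Compare using full case folding (ß folds to "ss")."""
    return a.casefold() == b.casefold()


def simple_fold_equal(a, b):
    """Compare using one-to-one lowercasing, standing in for simple folding."""
    return a.lower() == b.lower()


# Full folding lets the literal name match its all-caps spelling.
full_match = full_fold_equal("Strauß", "STRAUSS")

# The one-to-one mapping keeps ß as a single character, so no match.
simple_match = simple_fold_equal("Strauß", "STRAUSS")

print(full_match, simple_match)
```

A hybrid matcher in the sense of RL2.4 would use the first comparison for literal text and the second inside a character class.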

There seems to be a lot in the Unicode regexp spec
(http://www.unicode.org/reports/tr18/) that could be food for thought for Emacs.
I imagine that some Emacs Dev folks have already taken a close look and given it
some thought.





This bug report was last modified 8 years and 342 days ago.
