GNU bug report logs - #10494
24.0.92; Syntax table and non-ASCII character interaction
Reported by: Aaron Ecay <aaronecay <at> gmail.com>
Date: Fri, 13 Jan 2012 08:42:01 UTC
Severity: wishlist
Tags: confirmed, notabug
Merged with 13129, 30815
Found in versions 24.0.92, 24.1, 25.1, 26.0.91
Done: npostavs <at> users.sourceforge.net
Bug is archived. No further changes may be made.
Message #42 received at 10494 <at> debbugs.gnu.org (full text, mbox):
> Date: Sat, 13 Aug 2016 17:31:48 +0300
> From: Eli Zaretskii <eliz <at> gnu.org>
> Cc: 10494 <at> debbugs.gnu.org, aaronecay <at> gmail.com
>
> > From: npostavs <at> users.sourceforge.net
> > Cc: 10494 <at> debbugs.gnu.org, aaronecay <at> gmail.com
> > Date: Sat, 13 Aug 2016 10:19:34 -0400
> >
> > >> (modify-syntax-entry ?’ "w" text-mode-syntax-table)
> > >> (defconst my-text-char-script-table
> > >>   (let ((table (copy-sequence char-script-table)))
> > >>     (aset table ?’ 'latin)
> > >>     table))
> > >>
> > >> (defun my-text-mode-hook ()
> > >>   (set (make-local-variable 'char-script-table)
> > >>        my-text-char-script-table))
> > >> (add-hook 'text-mode-hook 'my-text-mode-hook)
> > >
> > > Are you sure nothing in text-mode will ever want to use \s_ in any
> > > regexp?
> >
> > Did you mean \> (word boundary) or \s. (punctuation)? \s_ doesn't match
> > ’ regardless because its syntax class is punctuation, not symbol.
>
> Sorry, I guess I was thinking of \cl. It will not match ’, although
> it might be expected.
>
> Anyway, my point is that these char-tables should really be treated as
> read-only by Lisp applications.
Btw, some believe that using ’ as an apostrophe is wrong. They say
U+02BC should be used instead; see, for example, this discussion:
http://www.unicode.org/mail-arch/unicode-ml/y2015-m06/0047.html
That character already is word-constituent.
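
The claims above about the two candidate characters can be checked from
Lisp without modifying any char-table. The following is only a minimal
sketch: `my-apostrophe-report' is a hypothetical helper name that does not
appear in this thread, and the values it reports depend on the Emacs
version and on the syntax table in effect, so no expected output is shown.

```elisp
;; Hypothetical helper (not from the thread): collect the syntax class and
;; script of each apostrophe candidate under text-mode's syntax table.
;; Both lookups are read-only; no char-table is modified.
(defun my-apostrophe-report ()
  "Return a list of (CHAR SYNTAX-CLASS SCRIPT) for U+2019 and U+02BC."
  (with-syntax-table text-mode-syntax-table
    (mapcar (lambda (ch)
              (list ch
                    (char-syntax ch)              ; class designator, e.g. ?w or ?.
                    (aref char-script-table ch))) ; script symbol for CH
            (list #x2019 #x02BC))))

;; Print one line per character in the echo area / *Messages* buffer.
(dolist (entry (my-apostrophe-report))
  (message "U+%04X: syntax class `%c', script `%s'"
           (nth 0 entry) (nth 1 entry) (nth 2 entry)))
```

A syntax class of `w' means the character is word-constituent, while `.'
means punctuation.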