GNU bug report logs -
#13084
boyer_moore crashes with certain characters in the case table
Previous Next
Reported by: Juri Linkov <juri <at> jurta.org>
Date: Wed, 5 Dec 2012 00:37:02 UTC
Severity: normal
Done: Juri Linkov <juri <at> jurta.org>
Bug is archived. No further changes may be made.
Full log
Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):
The minimal reproducible recipe for crashes in boyer_moore noticed in bug#13041:
1. emacs -Q
2. Eval in *scratch*:
(let ((table (standard-case-table)) canon)
(setq canon (copy-sequence table))
(aset canon #xff59 ?y)
(set-char-table-extra-slot table 1 canon)
(set-char-table-extra-slot table 2 nil)
(set-standard-case-table table))
3. Start an activity that includes a search, e.g. `C-x 8 RET TAB'
The crash in boyer_moore is caused by fullwidth characters like #xff59
whose Unicode properties are:
name: FULLWIDTH LATIN SMALL LETTER Y
decomposition: (wide 121) (wide 'y')
However, the crash doesn't occur when the same fullwidth characters are
set to their downcase counterparts in lisp/international/characters.el:
;; Fullwidth Latin
(setq c #xff21)
(while (<= c #xff3a)
(set-case-syntax-pair c (+ c #x20) tbl)
(modify-category-entry c ?l)
(modify-category-entry (+ c #x20) ?l)
(setq c (1+ c)))
This bug report was last modified 12 years and 164 days ago.
Previous Next
GNU bug tracking system
Copyright (C) 1999 Darren O. Benham,
1997,2003 nCipher Corporation Ltd,
1994-97 Ian Jackson.