GNU bug report logs -
#48477
28.0.50; Seemingly incorrect codegen with multiple string-matching pcase patterns
Reported by: Philipp Stephani <p.stephani2 <at> gmail.com>
Date: Mon, 17 May 2021 11:35:01 UTC
Severity: normal
Found in version 28.0.50
Done: Mattias Engdegård <mattiase <at> acm.org>
Bug is archived. No further changes may be made.
To add a comment to this bug, you must first unarchive it, by sending
a message to control AT debbugs.gnu.org, with unarchive 48477 in the body.
You can then email your comments to 48477 AT debbugs.gnu.org in the normal way.
Report forwarded to bug-gnu-emacs <at> gnu.org: bug#48477; Package emacs. (Mon, 17 May 2021 11:35:02 GMT)
Acknowledgement sent to Philipp Stephani <p.stephani2 <at> gmail.com>: New bug report received and forwarded. Copy sent to bug-gnu-emacs <at> gnu.org. (Mon, 17 May 2021 11:35:02 GMT)
Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):
Consider the following pcase form:

(require 'rx)
(pcase string
  ((rx bos (let prefix ?@) (* (not (any ?: ?/))) eos)
   (list 1 prefix))
  ((rx bos (let prefix (* (not (any ?:))) "/...:" eos))
   (list 2 prefix)))

The two branches should be disjoint; e.g. "@foo//...:" should match only
the second, not the first. Emacs 27.2 agrees and generates the
following code:

(cond
 ((string-match "\\`\\(?1:@\\)[^/:]*\\'" string)
  (let*
      ((#1=#:x457
        (match-string 1 string)))
    (let
        ((prefix #1#))
      (list 1 prefix))))
 ((string-match "\\`\\(?1:[^:]*/\\.\\.\\.:\\'\\)" string)
  (let*
      ((#2=#:x458
        (match-string 1 string)))
    (let
        ((prefix #2#))
      (list 2 prefix))))
 (t nil))

However, Emacs master prints the following warning:

Warning: pcase pattern (rx bos (let prefix (* (not (any 58))) "/...:" eos)) shadowed by previous pcase pattern

and generates this code:

(if
    (stringp string)
    (let*
        ((#1=#:x42
          (funcall
           #'(lambda
               (s)
               (and
                (string-match "\\`\\(?1:@\\)[^/:]*\\'" s)
                (match-string 1 s)))
           string)))
      (let
          ((prefix #1#))
        (list 1 prefix))))

which looks clearly wrong (and also needlessly complex).
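
As a sanity check that does not involve pcase at all, the two regexps from the Emacs 27.2 expansion above can be tried directly against the example string. This is only a sketch; `bug48477-check` is a hypothetical helper name, not something from the report or from Emacs.

```elisp
;; Hypothetical helper (name invented for illustration): try both
;; regexps from the Emacs 27.2 expansion against STRING.
(defun bug48477-check (string)
  "Return a list saying which of the two branch regexps match STRING.
An element is the branch number if that regexp matches, else nil."
  (let ((branch-1 "\\`\\(?1:@\\)[^/:]*\\'")
        (branch-2 "\\`\\(?1:[^:]*/\\.\\.\\.:\\'\\)"))
    (list (and (string-match-p branch-1 string) 1)
          (and (string-match-p branch-2 string) 2))))

(bug48477-check "@foo//...:")   ; => (nil 2), only the second regexp matches
(bug48477-check "@foo")         ; => (1 nil), only the first regexp matches
```

Since the regexps themselves are disjoint on these inputs, the "shadowed" warning must come from how the patterns are turned into pcase code, not from the regexps.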
In GNU Emacs 28.0.50 (build 104, x86_64-pc-linux-gnu, GTK+ Version 3.24.24, cairo version 1.16.0)
of 2021-05-17
Repository revision: 42950e9e4647c28f56c72cc27ef96edbafcbe5cd
Repository branch: master
Windowing system distributor 'The X.Org Foundation', version 11.0.12010000
System Description: Debian GNU/Linux rodete
Configured using:
'configure --enable-gcc-warnings=warn-only
--enable-gtk-deprecation-warnings --without-pop --with-mailutils
--enable-checking=all --enable-check-lisp-object-type --with-modules
'CFLAGS=-O0 -ggdb3''
Configured features:
CAIRO DBUS FREETYPE GIF GLIB GMP GNUTLS GSETTINGS HARFBUZZ JPEG JSON
LIBSELINUX LIBSYSTEMD MODULES NOTIFY INOTIFY PDUMPER PNG SECCOMP SOUND
THREADS TIFF TOOLKIT_SCROLL_BARS X11 XDBE XIM XPM GTK3 ZLIB
Important settings:
value of $LC_TIME: en_DK.utf8
value of $LANG: en_US.utf8
value of $XMODIFIERS: @im=ibus
locale-coding-system: utf-8-unix
Major mode: Lisp Interaction
Minor modes in effect:
tooltip-mode: t
global-eldoc-mode: t
eldoc-mode: t
electric-indent-mode: t
mouse-wheel-mode: t
tool-bar-mode: t
menu-bar-mode: t
file-name-shadow-mode: t
global-font-lock-mode: t
font-lock-mode: t
blink-cursor-mode: t
auto-composition-mode: t
auto-encryption-mode: t
auto-compression-mode: t
line-number-mode: t
transient-mark-mode: t
Load-path shadows:
None found.
Features:
(shadow sort mail-extr emacsbug message rmc dired dired-loaddefs rfc822
mml mml-sec epa epg epg-config gnus-util rmail rmail-loaddefs time-date
mm-decode mm-bodies mm-encode mail-parse rfc2231 mailabbrev gmm-utils
mailheader sendmail rfc2047 rfc2045 ietf-drums mm-util mail-prsvr
mail-utils phst skeleton derived edmacro kmacro pcase ffap thingatpt url
url-proxy url-privacy url-expand url-methods url-history url-cookie
url-domsuf url-util url-parse auth-source cl-seq eieio eieio-core
cl-macs eieio-loaddefs password-cache json map url-vars mailcap rx
gnutls puny dbus xml subr-x seq byte-opt gv bytecomp byte-compile cconv
compile text-property-search comint ansi-color ring cl-loaddefs cl-lib
iso-transl tooltip eldoc electric uniquify ediff-hook vc-hooks
lisp-float-type mwheel term/x-win x-win term/common-win x-dnd tool-bar
dnd fontset image regexp-opt fringe tabulated-list replace newcomment
text-mode elisp-mode lisp-mode prog-mode register page tab-bar menu-bar
rfn-eshadow isearch easymenu timer select scroll-bar mouse jit-lock
font-lock syntax font-core term/tty-colors frame minibuffer cl-generic
cham georgian utf-8-lang misc-lang vietnamese tibetan thai tai-viet lao
korean japanese eucjp-ms cp51932 hebrew greek romanian slovak czech
european ethiopic indian cyrillic chinese composite charscript charprop
case-table epa-hook jka-cmpr-hook help simple abbrev obarray
cl-preloaded nadvice button loaddefs faces cus-face macroexp files
window text-properties overlay sha1 md5 base64 format env code-pages
mule custom widget hashtable-print-readable backquote threads dbusbind
inotify dynamic-setting system-font-setting font-render-setting cairo
move-toolbar gtk x-toolkit x multi-tty make-network-process emacs)
Memory information:
((conses 16 69432 7725)
(symbols 48 8422 1)
(strings 32 24391 1618)
(string-bytes 1 789459)
(vectors 16 15075)
(vector-slots 8 195624 5994)
(floats 8 26 32)
(intervals 56 223 0)
(buffers 992 11))
Information forwarded to bug-gnu-emacs <at> gnu.org: bug#48477; Package emacs. (Tue, 18 May 2021 10:45:01 GMT)
Message #8 received at 48477 <at> debbugs.gnu.org (full text, mbox):
Serves me right for trying to be clever! Very sorry about that.
Matches would always succeed because the outcome of the regexp match was erroneously turned into a match against a plain pcase variable, and a plain variable pattern never fails. For example, the pattern

  (rx (let x "a"))

would expand to

  (and (pred stringp)
       (app (lambda (s) (and (string-match (rx (group-n 1 "a")) s)
                             (match-string 1 s)))
            x))

which cannot fail (as long as the input is a string): when the regexp does not match, the function returns nil and x is simply bound to nil. Patterns with two or more named submatches are not affected because they use a structural match, and patterns with zero submatches were treated specially anyway.
Please try the attached patch. It encodes non-matches as the number 0 (any non-nil non-string value would have done; 0 is cheap to create and test). The above pattern now expands to

  (and (pred stringp)
       (app (lambda (s) (if (string-match (rx (group-n 1 "a")) s)
                            (match-string 1 s)
                          0))
            (and x (pred (not numberp)))))

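
The difference between the two expansions can be reproduced with hand-written pcase patterns, without going through the `rx` pattern at all. The following is a minimal sketch under some assumptions: the function names `bug48477-old` and `bug48477-new` are invented for illustration, the regexp is written out as a string rather than with `rx`, and the `(pred (not numberp))` form requires Emacs 28 or later.

```elisp
(require 'pcase)

;; Old scheme (hypothetical name): the function returns nil when the
;; regexp fails, but the plain variable pattern X accepts any value,
;; so the clause matches every string and binds X to nil.
(defun bug48477-old (string)
  (pcase string
    ((and (pred stringp)
          (app (lambda (s) (and (string-match "\\(?1:a\\)" s)
                                (match-string 1 s)))
               x))
     (list 'matched x))))

;; New scheme (hypothetical name): a failed match is encoded as 0,
;; and the variable pattern is guarded so that a number is rejected.
(defun bug48477-new (string)
  (pcase string
    ((and (pred stringp)
          (app (lambda (s) (if (string-match "\\(?1:a\\)" s)
                               (match-string 1 s)
                             0))
               (and x (pred (not numberp)))))
     (list 'matched x))))

(bug48477-old "b")   ; => (matched nil), the clause wrongly succeeds
(bug48477-new "b")   ; => nil, the clause is correctly skipped
(bug48477-new "a")   ; => (matched "a")
```

The sentinel has to be something `match-string` can never return. A successful `match-string` yields a string, or nil for a group that did not participate in the match, so a number is safe and nil is not.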
[0001-Fix-pcase-rx-patterns-with-a-single-named-submatch-b.patch (application/octet-stream, attachment)]
Information forwarded to bug-gnu-emacs <at> gnu.org: bug#48477; Package emacs. (Tue, 18 May 2021 11:10:01 GMT)
Message #11 received at 48477 <at> debbugs.gnu.org (full text, mbox):
On Tue, 18 May 2021 at 12:44, Mattias Engdegård <mattiase <at> acm.org> wrote:
> Please try the attached patch.
Thanks, that fixes my use case.
Reply sent to Mattias Engdegård <mattiase <at> acm.org>: You have taken responsibility. (Tue, 18 May 2021 11:13:03 GMT)
Notification sent to Philipp Stephani <p.stephani2 <at> gmail.com>: bug acknowledged by developer. (Tue, 18 May 2021 11:13:03 GMT)
Message #16 received at 48477-done <at> debbugs.gnu.org (full text, mbox):
On 18 May 2021 at 13:09, Philipp Stephani <p.stephani2 <at> gmail.com> wrote:
> Thanks, that fixes my use case.
Thank you for testing! Pushed and closed.
Bug archived. Request was from Debbugs Internal Request <help-debbugs <at> gnu.org> to internal_control <at> debbugs.gnu.org. (Tue, 15 Jun 2021 11:24:06 GMT)