GNU bug report logs -
#73206
28.2; xml comment with blank lines to do not indent correctly, nxml-mode.el
Previous Next
Reported by: ciolfi <at> mathworks.com
Date: Thu, 12 Sep 2024 18:04:03 UTC
Severity: normal
Found in version 28.2
Done: Eli Zaretskii <eliz <at> gnu.org>
Bug is archived. No further changes may be made.
To add a comment to this bug, you must first unarchive it, by sending
a message to control AT debbugs.gnu.org, with unarchive 73206 in the body.
You can then email your comments to 73206 AT debbugs.gnu.org in the normal way.
Toggle the display of automated, internal messages from the tracker.
Report forwarded
to
bug-gnu-emacs <at> gnu.org
:
bug#73206
; Package
emacs
.
(Thu, 12 Sep 2024 18:04:03 GMT)
Full text and
rfc822 format available.
Acknowledgement sent
to
ciolfi <at> mathworks.com
:
New bug report received and forwarded. Copy sent to
bug-gnu-emacs <at> gnu.org
.
(Thu, 12 Sep 2024 18:04:03 GMT)
Full text and
rfc822 format available.
Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):
[Message part 1 (text/plain, inline)]
1. Create foo.xml containing:
<foo>
<!--
foo
bar
-->
</foo>
Notice that we have a blank line in the xml comment.
2. Put the point on 'bar' and press TAB.
Notice that the 'bar' line is not indented correctly.
Solution
--------
The problem is that in nxml-compute-indent-in-delimited-token, there
is a (forward-line -1) which assumes that the prior line is indented. In
this case the prior line is a blank line and thus doesn't have
indentation. After the (forward-line -1), we need to keep going back
until we see a non-blank line:
(forward-line -1)
+ (while (looking-at "^[:blank:]*$")
+ (forward-line -1))
Attached is a patch for nxml-mode.el that fixes the issue.
Thanks
John
In GNU Emacs 28.2 (build 1, x86_64-pc-linux-gnu, GTK+ Version 3.24.38, cairo version 1.16.0)
of 2024-06-25, modified by Debian built on x86-conova-01
Windowing system distributor 'The X.Org Foundation', version 11.0.12101006
System Description: Debian GNU/Linux 12 (bookworm)
Configured using:
'configure --build x86_64-linux-gnu --prefix=/usr
--sharedstatedir=/var/lib --libexecdir=/usr/libexec
--localstatedir=/var/lib --infodir=/usr/share/info
--mandir=/usr/share/man --with-libsystemd --with-pop=yes
--enable-locallisppath=/etc/emacs:/usr/local/share/emacs/28.2/site-lisp:/usr/local/share/emacs/site-lisp:/usr/share/emacs/28.2/site-lisp:/usr/share/emacs/site-lisp
--with-sound=alsa --without-gconf --with-mailutils
--with-native-compilation --build x86_64-linux-gnu --prefix=/usr
--sharedstatedir=/var/lib --libexecdir=/usr/libexec
--localstatedir=/var/lib --infodir=/usr/share/info
--mandir=/usr/share/man --with-libsystemd --with-pop=yes
--enable-locallisppath=/etc/emacs:/usr/local/share/emacs/28.2/site-lisp:/usr/local/share/emacs/site-lisp:/usr/share/emacs/28.2/site-lisp:/usr/share/emacs/site-lisp
--with-sound=alsa --without-gconf --with-mailutils
--with-native-compilation --with-cairo --with-x=yes
--with-x-toolkit=gtk3 --with-toolkit-scroll-bars 'CFLAGS=-g -O2
-ffile-prefix-map=/build/reproducible-path/emacs-28.2+1=. -fstack-protector-strong
-Wformat -Werror=format-security -Wall' 'CPPFLAGS=-Wdate-time
-D_FORTIFY_SOURCE=2' LDFLAGS=-Wl,-z,relro'
Configured features:
ACL CAIRO DBUS FREETYPE GIF GLIB GMP GNUTLS GPM GSETTINGS HARFBUZZ JPEG
JSON LCMS2 LIBOTF LIBSELINUX LIBSYSTEMD LIBXML2 M17N_FLT MODULES
NATIVE_COMP NOTIFY INOTIFY PDUMPER PNG RSVG SECCOMP SOUND THREADS TIFF
TOOLKIT_SCROLL_BARS X11 XDBE XIM XPM GTK3 ZLIB
Important settings:
value of $LANG: en_US.UTF-8
locale-coding-system: utf-8-unix
Major mode: Lisp Interaction
Minor modes in effect:
tooltip-mode: t
global-eldoc-mode: t
eldoc-mode: t
show-paren-mode: t
electric-indent-mode: t
mouse-wheel-mode: t
tool-bar-mode: t
menu-bar-mode: t
file-name-shadow-mode: t
global-font-lock-mode: t
font-lock-mode: t
blink-cursor-mode: t
auto-composition-mode: t
auto-encryption-mode: t
auto-compression-mode: t
line-number-mode: t
indent-tabs-mode: t
transient-mark-mode: t
Load-path shadows:
None found.
Features:
(shadow sort mail-extr emacsbug message rmc puny dired dired-loaddefs
rfc822 mml mml-sec epa derived epg rfc6068 epg-config gnus-util rmail
rmail-loaddefs auth-source cl-seq eieio eieio-core cl-macs
eieio-loaddefs password-cache json map text-property-search time-date
subr-x seq byte-opt gv bytecomp byte-compile cconv mm-decode mm-bodies
mm-encode mail-parse rfc2231 mailabbrev gmm-utils mailheader cl-loaddefs
cl-lib sendmail rfc2047 rfc2045 ietf-drums mm-util mail-prsvr mail-utils
iso-transl tooltip eldoc paren electric uniquify ediff-hook vc-hooks
lisp-float-type elisp-mode mwheel term/x-win x-win term/common-win x-dnd
tool-bar dnd fontset image regexp-opt fringe tabulated-list replace
newcomment text-mode lisp-mode prog-mode register page tab-bar menu-bar
rfn-eshadow isearch easymenu timer select scroll-bar mouse jit-lock
font-lock syntax font-core term/tty-colors frame minibuffer cl-generic
cham georgian utf-8-lang misc-lang vietnamese tibetan thai tai-viet lao
korean japanese eucjp-ms cp51932 hebrew greek romanian slovak czech
european ethiopic indian cyrillic chinese composite emoji-zwj charscript
charprop case-table epa-hook jka-cmpr-hook help simple abbrev obarray
cl-preloaded nadvice button loaddefs faces cus-face macroexp files
window text-properties overlay sha1 md5 base64 format env code-pages
mule custom widget hashtable-print-readable backquote threads dbusbind
inotify lcms2 dynamic-setting system-font-setting font-render-setting
cairo move-toolbar gtk x-toolkit x multi-tty make-network-process
native-compile emacs)
Memory information:
((conses 16 68059 5432)
(symbols 48 6639 0)
(strings 32 19490 1559)
(string-bytes 1 664189)
(vectors 16 13926)
(vector-slots 8 297636 11873)
(floats 8 22 31)
(intervals 56 258 0)
(buffers 992 10))
[nxml-mode.el.patch (text/x-diff, inline)]
--- ./orig/nxml-mode.el 2024-09-12 09:10:46.580451579 -0400
+++ nxml-mode.el 2024-09-12 09:04:48.473571128 -0400
@@ -1522,6 +1522,8 @@
((progn
(goto-char pos)
(forward-line -1)
+ (while (looking-at "^[:blank:]*$")
+ (forward-line -1))
(<= (point) xmltok-start))
(goto-char (+ xmltok-start (length open-delim)))
(when (and (string= open-delim "<!--")
Information forwarded
to
bug-gnu-emacs <at> gnu.org
:
bug#73206
; Package
emacs
.
(Fri, 13 Sep 2024 07:45:02 GMT)
Full text and
rfc822 format available.
Message #8 received at 73206 <at> debbugs.gnu.org (full text, mbox):
>>>>> On Thu, 12 Sep 2024 13:53:26 -0400, John Ciolfi via "Bug reports for GNU Emacs, the Swiss army knife of text editors" <bug-gnu-emacs <at> gnu.org> said:
John> 1. Create foo.xml containing:
John> <foo>
John> <!--
John> foo
John> bar
-->
John> </foo>
John> Notice that we have a blank line in the xml comment.
John> 2. Put the point on 'bar' and press TAB.
John> Notice that the 'bar' line is not indented correctly.
John> Solution
John> --------
John> The problem is that in nxml-compute-indent-in-delimited-token, there
John> is a (forward-line -1) which assumes that the prior line is indented. In
John> this case the prior line is a blank line and thus doesn't have
John> indentation. After the (forward-line -1), we need to keep going back
John> until we see a non-blank line:
John> (forward-line -1)
John> + (while (looking-at "^[:blank:]*$")
John> + (forward-line -1))
John> Attached is a patch for nxml-mode.el that fixes the issue.
Your patch does the right thing, but for the wrong reasons. Regexp
character classes only have meaning with a bracket expression, so what
youʼre matching here is actually any lines containing zero or more of
':', 'b', 'l', 'a', 'n', or 'k'. IOW, you should use:
(looking-at "^[[:blank:]]*$")
Robert
--
Information forwarded
to
bug-gnu-emacs <at> gnu.org
:
bug#73206
; Package
emacs
.
(Sun, 15 Sep 2024 12:09:02 GMT)
Full text and
rfc822 format available.
Message #11 received at 73206 <at> debbugs.gnu.org (full text, mbox):
[Message part 1 (text/plain, inline)]
Thanks for catching my typo and for moving the fix along.
________________________________
From: Robert Pluim <rpluim <at> gmail.com>
Sent: Friday, September 13, 2024 3:43 AM
To: 73206 <at> debbugs.gnu.org <73206 <at> debbugs.gnu.org>
Cc: John Ciolfi <ciolfi <at> mathworks.com>
Subject: Re: bug#73206: 28.2; xml comment with blank lines to do not indent correctly, nxml-mode.el
>>>>> On Thu, 12 Sep 2024 13:53:26 -0400, John Ciolfi via "Bug reports for GNU Emacs, the Swiss army knife of text editors" <bug-gnu-emacs <at> gnu.org> said:
John> 1. Create foo.xml containing:
John> <foo>
John> <!--
John> foo
John> bar
-->
John> </foo>
John> Notice that we have a blank line in the xml comment.
John> 2. Put the point on 'bar' and press TAB.
John> Notice that the 'bar' line is not indented correctly.
John> Solution
John> --------
John> The problem is that in nxml-compute-indent-in-delimited-token, there
John> is a (forward-line -1) which assumes that the prior line is indented. In
John> this case the prior line is a blank line and thus doesn't have
John> indentation. After the (forward-line -1), we need to keep going back
John> until we see a non-blank line:
John> (forward-line -1)
John> + (while (looking-at "^[:blank:]*$")
John> + (forward-line -1))
John> Attached is a patch for nxml-mode.el that fixes the issue.
Your patch does the right thing, but for the wrong reasons. Regexp
character classes only have meaning with a bracket expression, so what
you’re matching here is actually any lines containing zero or more of
':', 'b', 'l', 'a', 'n', or 'k'. IOW, you should use:
(looking-at "^[[:blank:]]*$")
Robert
--
[Message part 2 (text/html, inline)]
Information forwarded
to
bug-gnu-emacs <at> gnu.org
:
bug#73206
; Package
emacs
.
(Sat, 28 Sep 2024 08:47:01 GMT)
Full text and
rfc822 format available.
Message #14 received at 73206 <at> debbugs.gnu.org (full text, mbox):
Stefan, does the patch with the regexp fix look correct to you?
> Date: Sun, 15 Sep 2024 12:08:17 +0000
> msip_labels:
> From: John Ciolfi via "Bug reports for GNU Emacs,
> the Swiss army knife of text editors" <bug-gnu-emacs <at> gnu.org>
>
> Thanks for catching my typo and for moving the fix along.
> -----------------------------------------------------------------------------
> From: Robert Pluim <rpluim <at> gmail.com>
> Sent: Friday, September 13, 2024 3:43 AM
> To: 73206 <at> debbugs.gnu.org <73206 <at> debbugs.gnu.org>
> Cc: John Ciolfi <ciolfi <at> mathworks.com>
> Subject: Re: bug#73206: 28.2; xml comment with blank lines to do not indent correctly, nxml-mode.el
>
> >>>>> On Thu, 12 Sep 2024 13:53:26 -0400, John Ciolfi via "Bug reports for GNU Emacs, the Swiss army
> knife of text editors" <bug-gnu-emacs <at> gnu.org> said:
>
> John> 1. Create foo.xml containing:
>
> John> <foo>
> John> <!--
> John> foo
>
> John> bar
> -->
> John> </foo>
>
> John> Notice that we have a blank line in the xml comment.
>
> John> 2. Put the point on 'bar' and press TAB.
>
> John> Notice that the 'bar' line is not indented correctly.
>
> John> Solution
> John> --------
>
> John> The problem is that in nxml-compute-indent-in-delimited-token, there
> John> is a (forward-line -1) which assumes that the prior line is indented. In
> John> this case the prior line is a blank line and thus doesn't have
> John> indentation. After the (forward-line -1), we need to keep going back
> John> until we see a non-blank line:
>
> John> (forward-line -1)
> John> + (while (looking-at "^[:blank:]*$")
> John> + (forward-line -1))
>
> John> Attached is a patch for nxml-mode.el that fixes the issue.
>
> Your patch does the right thing, but for the wrong reasons. Regexp
> character classes only have meaning with a bracket expression, so what
> you’re matching here is actually any lines containing zero or more of
> ':', 'b', 'l', 'a', 'n', or 'k'. IOW, you should use:
>
> (looking-at "^[[:blank:]]*$")
>
> Robert
> --
Information forwarded
to
bug-gnu-emacs <at> gnu.org
:
bug#73206
; Package
emacs
.
(Sun, 29 Sep 2024 20:50:02 GMT)
Full text and
rfc822 format available.
Message #17 received at 73206 <at> debbugs.gnu.org (full text, mbox):
Eli Zaretskii <eliz <at> gnu.org> writes:
> Stefan, does the patch with the regexp fix look correct to you?
If we want to "keep going back until we see a non-blank line", surely
the fragment should read:
(while (looking-at "^$")
(forward-line -1))
Since
(looking-at "^[[:blank:]]*$")
will match both blank lines, and lines containing only blank space.
Which of the two do we want here?
I think it would also be good to add one or more tests here.
Information forwarded
to
bug-gnu-emacs <at> gnu.org
:
bug#73206
; Package
emacs
.
(Wed, 02 Oct 2024 12:00:03 GMT)
Full text and
rfc822 format available.
Message #20 received at 73206 <at> debbugs.gnu.org (full text, mbox):
[Message part 1 (text/plain, inline)]
Hi
Using "^$" or "^[[:blank:]]*$" is fine. The difference is very minor. Consider foo.xml where the visibly "empty" line contains a single space and foo starts on the 1st column:
<foo>
<!--
bar
foo
-->
</foo>
When using "^$" and you type tab on the foo line, you'll get the following where foo starts on the 2nd column:
<foo>
<!--
bar
foo
-->
</foo>
If you use "^[[:blank:]]*$, you'll get what I expected:
<foo>
<!--
bar
foo
-->
</foo>
However, if you select all and indent-region, C-M-\ on the original you'll get the expected result with either "^$" or "^[[:blank:]]*$ because nxml-mode will pad out the space line. If the "empty" line truly blank (no spaces or tabs), then the two regex's behave identical.
I suggest for test cases, two versions of foo.xml where one version of it has the empty line truly blank (no spaces or tabs) and the other version contains a space in the "empty" line.
You can use the attached nxml-mode-indent-fix.el which overrides the broken function to try things out on a stock Emacs, emacs -Q.
Thanks
John
________________________________
From: Stefan Kangas <stefankangas <at> gmail.com>
Sent: Sunday, September 29, 2024 4:47 PM
To: Eli Zaretskii <eliz <at> gnu.org>; John Ciolfi <ciolfi <at> mathworks.com>
Cc: rpluim <at> gmail.com <rpluim <at> gmail.com>; 73206 <at> debbugs.gnu.org <73206 <at> debbugs.gnu.org>
Subject: Re: bug#73206: 28.2; xml comment with blank lines to do not indent correctly, nxml-mode.el
Eli Zaretskii <eliz <at> gnu.org> writes:
> Stefan, does the patch with the regexp fix look correct to you?
If we want to "keep going back until we see a non-blank line", surely
the fragment should read:
(while (looking-at "^$")
(forward-line -1))
Since
(looking-at "^[[:blank:]]*$")
will match both blank lines, and lines containing only blank space.
Which of the two do we want here?
I think it would also be good to add one or more tests here.
[Message part 2 (text/html, inline)]
[nxml-mode-indent-fix.el (text/x-emacs-lisp, attachment)]
Information forwarded
to
bug-gnu-emacs <at> gnu.org
:
bug#73206
; Package
emacs
.
(Wed, 30 Apr 2025 10:56:01 GMT)
Full text and
rfc822 format available.
Message #23 received at 73206 <at> debbugs.gnu.org (full text, mbox):
[Message part 1 (text/plain, inline)]
Hi
I see this bug isn't fixed in Emacs 30. Is it possible to fix? The right solution is to update nxml-compute-indent-in-delimited-token to contain "(while (looking-at "^[[:blank:]]*$") (forward-line -1))" to obtain the correct indentation level for text in xml comments when there's blank lines.
(cond ((let ((end (+ pos (length close-delim))))
(and (<= end (point-max))
(string= (buffer-substring-no-properties pos end)
close-delim)))
(goto-char xmltok-start))
((progn
(goto-char pos)
(forward-line -1)
+ (while (looking-at "^[[:blank:]]*$")
+ (forward-line -1))
(<= (point) xmltok-start))
(goto-char (+ xmltok-start (length open-delim)))
(when (and (string= open-delim "<!--")
(looking-at " "))
(goto-char (1+ (point)))))
(t (back-to-indentation)))
(current-column))
For test cases, create foo.xml containing:
<foo>
<!--
bar
foo
-->
</foo>
An indent (TAB key) on the "foo" line should align it with the bar.
For test cases, create two versions of foo.xml, the first where there is a completely empty line between the bar and foo lines. The other version of foo.xml should have the line between bar and foo containing a few spaces.
Thanks
John
________________________________
From: John Ciolfi <ciolfi <at> mathworks.com>
Sent: Wednesday, October 2, 2024 7:59 AM
To: Stefan Kangas <stefankangas <at> gmail.com>; Eli Zaretskii <eliz <at> gnu.org>
Cc: rpluim <at> gmail.com <rpluim <at> gmail.com>; 73206 <at> debbugs.gnu.org <73206 <at> debbugs.gnu.org>
Subject: Re: bug#73206: 28.2; xml comment with blank lines to do not indent correctly, nxml-mode.el
Hi
Using "^$" or "^[[:blank:]]*$" is fine. The difference is very minor. Consider foo.xml where the visibly "empty" line contains a single space and foo starts on the 1st column:
<foo>
<!--
bar
foo
-->
</foo>
When using "^$" and you type tab on the foo line, you'll get the following where foo starts on the 2nd column:
<foo>
<!--
bar
foo
-->
</foo>
If you use "^[[:blank:]]*$, you'll get what I expected:
<foo>
<!--
bar
foo
-->
</foo>
However, if you select all and indent-region, C-M-\ on the original you'll get the expected result with either "^$" or "^[[:blank:]]*$ because nxml-mode will pad out the space line. If the "empty" line truly blank (no spaces or tabs), then the two regex's behave identical.
I suggest for test cases, two versions of foo.xml where one version of it has the empty line truly blank (no spaces or tabs) and the other version contains a space in the "empty" line.
You can use the attached nxml-mode-indent-fix.el which overrides the broken function to try things out on a stock Emacs, emacs -Q.
Thanks
John
________________________________
From: Stefan Kangas <stefankangas <at> gmail.com>
Sent: Sunday, September 29, 2024 4:47 PM
To: Eli Zaretskii <eliz <at> gnu.org>; John Ciolfi <ciolfi <at> mathworks.com>
Cc: rpluim <at> gmail.com <rpluim <at> gmail.com>; 73206 <at> debbugs.gnu.org <73206 <at> debbugs.gnu.org>
Subject: Re: bug#73206: 28.2; xml comment with blank lines to do not indent correctly, nxml-mode.el
Eli Zaretskii <eliz <at> gnu.org> writes:
> Stefan, does the patch with the regexp fix look correct to you?
If we want to "keep going back until we see a non-blank line", surely
the fragment should read:
(while (looking-at "^$")
(forward-line -1))
Since
(looking-at "^[[:blank:]]*$")
will match both blank lines, and lines containing only blank space.
Which of the two do we want here?
I think it would also be good to add one or more tests here.
[Message part 2 (text/html, inline)]
Reply sent
to
Eli Zaretskii <eliz <at> gnu.org>
:
You have taken responsibility.
(Sat, 10 May 2025 10:14:02 GMT)
Full text and
rfc822 format available.
Notification sent
to
ciolfi <at> mathworks.com
:
bug acknowledged by developer.
(Sat, 10 May 2025 10:14:02 GMT)
Full text and
rfc822 format available.
Message #28 received at 73206-done <at> debbugs.gnu.org (full text, mbox):
> From: John Ciolfi <ciolfi <at> mathworks.com>
> CC: "rpluim <at> gmail.com" <rpluim <at> gmail.com>, "73206 <at> debbugs.gnu.org"
> <73206 <at> debbugs.gnu.org>
> Date: Wed, 30 Apr 2025 10:55:07 +0000
>
> I see this bug isn't fixed in Emacs 30. Is it possible to fix? The right solution is to update
> nxml-compute-indent-in-delimited-token to contain "(while (looking-at "^[[:blank:]]*$") (forward-line -1))" to
> obtain the correct indentation level for text in xml comments when there's blank lines.
Thanks, I've now installed this on the emacs-30 branch, and I'm
closing this bug.
bug archived.
Request was from
Debbugs Internal Request <help-debbugs <at> gnu.org>
to
internal_control <at> debbugs.gnu.org
.
(Sat, 07 Jun 2025 11:24:06 GMT)
Full text and
rfc822 format available.
This bug report was last modified 11 days ago.
Previous Next
GNU bug tracking system
Copyright (C) 1999 Darren O. Benham,
1997,2003 nCipher Corporation Ltd,
1994-97 Ian Jackson.