GNU bug report logs - #41094
28.0.50; `xml-print' produces invalid XML

Previous Next

Package: emacs;

Reported by: Philipp <p.stephani2 <at> gmail.com>

Date: Tue, 5 May 2020 12:32:01 UTC

Severity: minor

Found in version 28.0.50

Done: Philipp Stephani <p.stephani2 <at> gmail.com>

Bug is archived. No further changes may be made.

Full log


View this message in rfc822 format

From: help-debbugs <at> gnu.org (GNU bug Tracking System)
To: Philipp Stephani <p.stephani2 <at> gmail.com>
Cc: tracker <at> debbugs.gnu.org
Subject: bug#41094: closed (28.0.50; `xml-print' produces invalid XML)
Date: Sat, 23 May 2020 12:04:01 +0000
[Message part 1 (text/plain, inline)]
Your message dated Sat, 23 May 2020 14:03:39 +0200
with message-id <CAArVCkR3FbWF3mtGvFd6_2QU=11tR0+9BB4gNf5oOuvD4jcojA <at> mail.gmail.com>
and subject line Re: bug#41094: 28.0.50; `xml-print' produces invalid XML
has caused the debbugs.gnu.org bug report #41094,
regarding 28.0.50; `xml-print' produces invalid XML
to be marked as done.

(If you believe you have received this mail in error, please contact
help-debbugs <at> gnu.org.)


-- 
41094: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=41094
GNU Bug Tracking System
Contact help-debbugs <at> gnu.org with problems
[Message part 2 (message/rfc822, inline)]
From: Philipp <p.stephani2 <at> gmail.com>
To: bug-gnu-emacs <at> gnu.org
Subject: 28.0.50; `xml-print' produces invalid XML
Date: Tue, 05 May 2020 14:31:36 +0200
For example:

emacs -Q -batch -l xml -eval '(with-temp-buffer (xml-print (quote ((foo nil "\0")))) (write-region nil nil "/tmp/test.xml"))' && xmllint /tmp/test.xml
/tmp/test.xml:1: parser error : Premature end of data in tag foo line 1
<foo>
     ^

`xml-print' here generates a NUL character, which is invalid in XML
(cf. https://www.w3.org/TR/xml/#charsets).  Probably `xml-escape-string'
should error out when encountering any invalid character (including
non-Unicode characters).


In GNU Emacs 28.0.50 (build 44, x86_64-apple-darwin19.3.0, NS appkit-1894.30 Version 10.15.3 (Build 19D76))
 of 2020-04-12
Repository revision: 900947fbe8b202ce2ae15e87ef377ca27da73ec9
Repository branch: master
Windowing system distributor 'Apple', version 10.3.1894
System Description:  Mac OS X 10.15.4

Recent messages:
For information about GNU Emacs and the GNU system, type C-h C-a.

Configured using:
 'configure --with-modules --without-xml2 --without-pop --with-mailutils
 --enable-gcc-warnings=warn-only --enable-checking=all
 --enable-check-lisp-object-type 'CFLAGS=-ggdb3 -O0''

Configured features:
NOTIFY KQUEUE ACL GNUTLS ZLIB TOOLKIT_SCROLL_BARS NS MODULES THREADS
JSON PDUMPER LCMS2 GMP

Important settings:
  value of $LANG: de_DE.UTF-8
  locale-coding-system: utf-8-unix

Major mode: Lisp Interaction

Minor modes in effect:
  tooltip-mode: t
  global-eldoc-mode: t
  eldoc-mode: t
  electric-indent-mode: t
  mouse-wheel-mode: t
  tool-bar-mode: t
  menu-bar-mode: t
  file-name-shadow-mode: t
  global-font-lock-mode: t
  font-lock-mode: t
  blink-cursor-mode: t
  auto-composition-mode: t
  auto-encryption-mode: t
  auto-compression-mode: t
  line-number-mode: t
  transient-mark-mode: t

Load-path shadows:
None found.

Features:
(shadow sort mail-extr emacsbug message rmc dired dired-loaddefs
format-spec rfc822 mml easymenu mml-sec epa epg epg-config gnus-util
rmail rmail-loaddefs text-property-search time-date mm-decode mm-bodies
mm-encode mail-parse rfc2231 mailabbrev gmm-utils mailheader sendmail
rfc2047 rfc2045 ietf-drums mm-util mail-prsvr mail-utils phst skeleton
derived edmacro kmacro pcase ffap thingatpt url url-proxy url-privacy
url-expand url-methods url-history url-cookie url-domsuf url-util
url-parse auth-source cl-seq eieio eieio-core cl-macs eieio-loaddefs
password-cache json map url-vars mailcap subr-x rx gnutls puny seq
byte-opt gv bytecomp byte-compile cconv dbus xml cl-loaddefs cl-lib
tooltip eldoc electric uniquify ediff-hook vc-hooks lisp-float-type
mwheel term/ns-win ns-win ucs-normalize mule-util term/common-win
tool-bar dnd fontset image regexp-opt fringe tabulated-list replace
newcomment text-mode elisp-mode lisp-mode prog-mode register page
tab-bar menu-bar rfn-eshadow isearch timer select scroll-bar mouse
jit-lock font-lock syntax facemenu font-core term/tty-colors frame
minibuffer cl-generic cham georgian utf-8-lang misc-lang vietnamese
tibetan thai tai-viet lao korean japanese eucjp-ms cp51932 hebrew greek
romanian slovak czech european ethiopic indian cyrillic chinese
composite charscript charprop case-table epa-hook jka-cmpr-hook help
simple abbrev obarray cl-preloaded nadvice loaddefs button faces
cus-face macroexp files text-properties overlay sha1 md5 base64 format
env code-pages mule custom widget hashtable-print-readable backquote
threads kqueue cocoa ns lcms2 multi-tty make-network-process emacs)

Memory information:
((conses 16 63759 4957)
 (symbols 48 8146 0)
 (strings 32 22012 1765)
 (string-bytes 1 705507)
 (vectors 16 13330)
 (vector-slots 8 171204 16342)
 (floats 8 25 19)
 (intervals 56 200 0)
 (buffers 992 10))


[Message part 3 (message/rfc822, inline)]
From: Philipp Stephani <p.stephani2 <at> gmail.com>
To: 41094-done <at> debbugs.gnu.org
Subject: Re: bug#41094: 28.0.50; `xml-print' produces invalid XML
Date: Sat, 23 May 2020 14:03:39 +0200
Am Di., 5. Mai 2020 um 14:32 Uhr schrieb Philipp <p.stephani2 <at> gmail.com>:
>
>
> For example:
>
> emacs -Q -batch -l xml -eval '(with-temp-buffer (xml-print (quote ((foo nil "\0")))) (write-region nil nil "/tmp/test.xml"))' && xmllint /tmp/test.xml
> /tmp/test.xml:1: parser error : Premature end of data in tag foo line 1
> <foo>
>      ^
>
> `xml-print' here generates a NUL character, which is invalid in XML
> (cf. https://www.w3.org/TR/xml/#charsets).  Probably `xml-escape-string'
> should error out when encountering any invalid character (including
> non-Unicode characters).

Fixed with commit f8581bcf6a.


This bug report was last modified 5 years and 84 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.