GNU bug report logs - #37397
26.2; less-than-character cuts off end of pre tag in eww


Package: emacs;

Reported by: Kevin Brubeck Unhammer <unhammer <at> fsfe.org>

Date: Fri, 13 Sep 2019 09:46:01 UTC

Severity: minor

Tags: fixed

Merged with 37009

Found in version 26.2

Fixed in version 27.1

Done: Lars Ingebrigtsen <larsi <at> gnus.org>

Bug is archived. No further changes may be made.

To add a comment to this bug, you must first unarchive it, by sending
a message to control AT debbugs.gnu.org, with unarchive 37397 in the body.
You can then email your comments to 37397 AT debbugs.gnu.org in the normal way.



Report forwarded to bug-gnu-emacs <at> gnu.org:
bug#37397; Package emacs. (Fri, 13 Sep 2019 09:46:01 GMT)

Acknowledgement sent to Kevin Brubeck Unhammer <unhammer <at> fsfe.org>:
New bug report received and forwarded. Copy sent to bug-gnu-emacs <at> gnu.org. (Fri, 13 Sep 2019 09:46:01 GMT)

Message #5 received at submit <at> debbugs.gnu.org:

From: Kevin Brubeck Unhammer <unhammer <at> fsfe.org>
To: bug-gnu-emacs <at> gnu.org
Subject: 26.2; less-than-character cuts off end of pre tag in eww
Date: Fri, 13 Sep 2019 11:44:55 +0200

Doing

M-x eww http://www.haskellforall.com/2012/09/the-functor-design-pattern.html RET

I see

    idB   = return
    (._B) = (<=
    ... and our map function obeyed the functor laws:

where it should show

    idB   = return
    (._B) = (<=<)

    ... and our map function obeyed the functor laws:

The html for this fragment is

    <pre>
    idB   = return
    (._B) = (<=<)
    </pre>
    ... and our <tt>map</tt> function obeyed the functor laws:




In GNU Emacs 26.2 (build 2, x86_64-pc-linux-gnu, GTK+ Version 3.24.8)
 of 2019-04-12 built on lcy01-amd64-026
Windowing system distributor 'The X.Org Foundation', version 11.0.12004000
System Description:	Ubuntu 19.04

Recent messages:
For information about GNU Emacs and the GNU system, type C-h C-a.
Contacting host: www.haskellforall.com:80
Mark saved where search started

Configured using:
 'configure --build=x86_64-linux-gnu --prefix=/usr
 '--includedir=${prefix}/include' '--mandir=${prefix}/share/man'
 '--infodir=${prefix}/share/info' --sysconfdir=/etc --localstatedir=/var
 --disable-silent-rules '--libdir=${prefix}/lib/x86_64-linux-gnu'
 '--libexecdir=${prefix}/lib/x86_64-linux-gnu' --disable-maintainer-mode
 --disable-dependency-tracking --prefix=/usr --sharedstatedir=/var/lib
 --program-suffix=26 --with-modules --with-file-notification=inotify
 --with-mailutils --with-x=yes --with-x-toolkit=gtk3 --with-xwidgets
 --with-lcms2 'CFLAGS=-g -O2
 -fdebug-prefix-map=/build/emacs26-YcBQyB/emacs26-26.2~1.gitfd1b34b=. -fstack-protector-strong
 -Wformat -Werror=format-security -no-pie' 'CPPFLAGS=-Wdate-time
 -D_FORTIFY_SOURCE=2' 'LDFLAGS=-Wl,-Bsymbolic-functions -Wl,-z,relro
 -no-pie''

Configured features:
XPM JPEG TIFF GIF PNG RSVG IMAGEMAGICK SOUND GPM DBUS GSETTINGS GLIB
NOTIFY LIBSELINUX GNUTLS LIBXML2 FREETYPE M17N_FLT LIBOTF XFT ZLIB
TOOLKIT_SCROLL_BARS GTK3 X11 XDBE XIM MODULES THREADS XWIDGETS
LIBSYSTEMD LCMS2

Important settings:
  value of $LANG: nn_NO.UTF-8
  locale-coding-system: utf-8-unix

Major mode: eww

Minor modes in effect:
  tooltip-mode: t
  global-eldoc-mode: t
  electric-indent-mode: t
  mouse-wheel-mode: t
  tool-bar-mode: t
  menu-bar-mode: t
  file-name-shadow-mode: t
  global-font-lock-mode: t
  font-lock-mode: t
  blink-cursor-mode: t
  auto-composition-mode: t
  auto-encryption-mode: t
  auto-compression-mode: t
  buffer-read-only: t
  line-number-mode: t
  transient-mark-mode: t

Load-path shadows:
None found.

Features:
(shadow sort mail-extr emacsbug message dired dired-loaddefs rfc822 mml
mml-sec epa derived epg epg-config mm-decode mm-bodies mm-encode
mailabbrev gmm-utils mailheader sendmail misearch multi-isearch cl-extra
help-mode network-stream starttls url-http tls gnutls mail-parse rfc2231
url-gw nsm rmc url-cache url-auth eww easymenu puny mm-url gnus nnheader
gnus-util rmail rmail-loaddefs rfc2047 rfc2045 ietf-drums mail-utils
wid-edit mm-util mail-prsvr url-queue url url-proxy url-privacy
url-expand url-methods url-history url-cookie url-domsuf url-util
url-parse auth-source cl-seq eieio eieio-core cl-macs eieio-loaddefs
password-cache url-vars mailcap shr svg xml seq byte-opt gv bytecomp
byte-compile cconv dom browse-url format-spec cl-loaddefs cl-lib
elec-pair time-date mule-util tooltip eldoc electric uniquify ediff-hook
vc-hooks lisp-float-type mwheel term/x-win x-win term/common-win x-dnd
tool-bar dnd fontset image regexp-opt fringe tabulated-list replace
newcomment text-mode elisp-mode lisp-mode prog-mode register page
menu-bar rfn-eshadow isearch timer select scroll-bar mouse jit-lock
font-lock syntax facemenu font-core term/tty-colors frame cl-generic
cham georgian utf-8-lang misc-lang vietnamese tibetan thai tai-viet lao
korean japanese eucjp-ms cp51932 hebrew greek romanian slovak czech
european ethiopic indian cyrillic chinese composite charscript charprop
case-table epa-hook jka-cmpr-hook help simple abbrev obarray minibuffer
cl-preloaded nadvice loaddefs button faces cus-face macroexp files
text-properties overlay sha1 md5 base64 format env code-pages mule
custom widget hashtable-print-readable backquote threads dbusbind
inotify lcms2 dynamic-setting system-font-setting font-render-setting
xwidget-internal move-toolbar gtk x-toolkit x multi-tty
make-network-process emacs)

Memory information:
((conses 16 195310 11775)
 (symbols 48 25414 1)
 (miscs 40 66 284)
 (strings 32 49980 1480)
 (string-bytes 1 1464447)
 (vectors 16 19116)
 (vector-slots 8 575203 10820)
 (floats 8 111 602)
 (intervals 56 11653 664)
 (buffers 992 12))




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#37397; Package emacs. (Fri, 13 Sep 2019 10:18:02 GMT)

Message #8 received at 37397 <at> debbugs.gnu.org:

From: Robert Pluim <rpluim <at> gmail.com>
To: Kevin Brubeck Unhammer <unhammer <at> fsfe.org>
Cc: 37397 <at> debbugs.gnu.org
Subject: Re: bug#37397: 26.2; less-than-character cuts off end of pre tag in
 eww
Date: Fri, 13 Sep 2019 12:17:17 +0200

>>>>> On Fri, 13 Sep 2019 11:44:55 +0200, Kevin Brubeck Unhammer <unhammer <at> fsfe.org> said:

    Kevin> Doing
    Kevin> M-x eww http://www.haskellforall.com/2012/09/the-functor-design-pattern.html RET

    Kevin> I see

    Kevin>     idB   = return
    Kevin>     (._B) = (<=
    Kevin>     ... and our map function obeyed the functor laws:

    Kevin> where it should show

    Kevin>     idB   = return
    Kevin>     (._B) = (<=<)

    Kevin>     ... and our map function obeyed the functor laws:

    Kevin> The html for this fragment is

    Kevin>     <pre>
    Kevin>     idB   = return
    Kevin>     (._B) = (<=<)
    Kevin>     </pre>
    Kevin>     ... and our <tt>map</tt> function obeyed the functor laws:

The source html is incorrect: it should have written those '<' as
'&lt;'. <pre> does not mean 'anything goes'.

Robert




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#37397; Package emacs. (Fri, 13 Sep 2019 11:26:02 GMT)

Message #11 received at 37397 <at> debbugs.gnu.org:

From: Kevin Brubeck Unhammer <unhammer <at> fsfe.org>
To: 37397 <at> debbugs.gnu.org
Subject: Re: bug#37397: 26.2;
 less-than-character cuts off end of pre tag in eww
Date: Fri, 13 Sep 2019 13:25:50 +0200

Robert Pluim <rpluim <at> gmail.com> writes:

> The source html is incorrect: it should have written those '<' as
> '&lt;'. <pre> does not mean 'anything goes'.

It does show up "correctly" in Firefox and Chromium though. With a
string such as "<)" (or perhaps "<" followed by anything other than
[a-zA-Z/]), it's obviously not a tag of any sort, so why not just print
it literally? I've noticed the problem on other pages as well – and even
MDC's example on <pre> usage has literal angle brackets:
https://developer.mozilla.org/en-US/docs/Web/HTML/Element/pre#Example_2
though eww shows those just fine.

Although looking at shr, it seems it calls to libxml-parse-html-region,
so I suppose it's not something easily fixed on the Emacs side anyway.




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#37397; Package emacs. (Fri, 13 Sep 2019 12:13:02 GMT)

Message #14 received at 37397 <at> debbugs.gnu.org:

From: Robert Pluim <rpluim <at> gmail.com>
To: Kevin Brubeck Unhammer <unhammer <at> fsfe.org>
Cc: 37397 <at> debbugs.gnu.org
Subject: Re: bug#37397: 26.2; less-than-character cuts off end of pre tag in
 eww
Date: Fri, 13 Sep 2019 14:12:20 +0200

>>>>> On Fri, 13 Sep 2019 13:25:50 +0200, Kevin Brubeck Unhammer <unhammer <at> fsfe.org> said:

    Kevin> Robert Pluim <rpluim <at> gmail.com> writes:
    >> The source html is incorrect: it should have written those '<' as
    >> '&lt;'. <pre> does not mean 'anything goes'.

    Kevin> It does show up "correctly" in Firefox and Chromium though. With a
    Kevin> string such as "<)" (or perhaps "<" followed by anything other than
    Kevin> [a-zA-Z/]), it's obviously not a tag of any sort, so why not just print
    Kevin> it literally? I've noticed the problem on other pages as well – and even
    Kevin> MDC's example on <pre> usage has literal angle brackets:
    Kevin> https://developer.mozilla.org/en-US/docs/Web/HTML/Element/pre#Example_2
    Kevin> though eww shows those just fine.

It does, but those angle brackets are balanced, which is not the case
for your example.

    Kevin> Although looking at shr, it seems it calls to libxml-parse-html-region,
    Kevin> so I suppose it's not something easily fixed on the Emacs side anyway.

Not really. Perhaps you could ask the libxml developers what they
think.

Robert




Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#37397; Package emacs. (Fri, 13 Sep 2019 12:37:01 GMT)

Message #17 received at 37397 <at> debbugs.gnu.org:

From: Noam Postavsky <npostavs <at> gmail.com>
To: Kevin Brubeck Unhammer <unhammer <at> fsfe.org>
Cc: 37397 <at> debbugs.gnu.org
Subject: Re: bug#37397: 26.2;
 less-than-character cuts off end of pre tag in eww
Date: Fri, 13 Sep 2019 08:36:01 -0400

forcemerge 37009 37397
quit

Robert Pluim <rpluim <at> gmail.com> writes:

>     Kevin> Although looking at shr, it seems it calls to libxml-parse-html-region,
>     Kevin> so I suppose it's not something easily fixed on the Emacs side anyway.
>
> Not really. Perhaps you could ask the libxml developers what they
> think.

Yes, though Lars suggested a preprocessing step might fix this problem
as well (see https://debbugs.gnu.org/37009#14).
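
As a rough illustration of the preprocessing idea mentioned above (not the
change that actually went into Emacs 27.1, which this log does not show),
here is a sketch in Python. It assumes the heuristic from earlier in the
thread: a "<" that is not followed by a letter or "/" cannot open a tag, so
it can be rewritten as "&lt;" before the text reaches the HTML parser. The
name escape_stray_lt is hypothetical, and also treating "!" and "?" as tag
starters (for comments, doctypes and processing instructions) is an added
assumption.

```python
import re

# Hypothetical pattern: a "<" that is NOT followed by a character which could
# begin a tag, closing tag, comment/doctype, or processing instruction.
_STRAY_LT = re.compile(r"<(?![a-zA-Z/!?])")


def escape_stray_lt(html: str) -> str:
    """Replace every stray "<" with the entity "&lt;", leaving real tags alone."""
    return _STRAY_LT.sub("&lt;", html)


# The fragment from the original report.
fragment = "<pre>\nidB   = return\n(._B) = (<=<)\n</pre>"
print(escape_stray_lt(fragment))
```

A heuristic like this would not help with a stray "<" that is directly
followed by a letter (for example "a<b"), since that still looks like the
start of a tag.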




Forcibly Merged 37009 37397. Request was from Noam Postavsky <npostavs <at> gmail.com> to control <at> debbugs.gnu.org. (Fri, 13 Sep 2019 12:37:02 GMT)

Information forwarded to bug-gnu-emacs <at> gnu.org:
bug#37397; Package emacs. (Fri, 13 Sep 2019 12:46:02 GMT)

Message #22 received at 37397 <at> debbugs.gnu.org:

From: Kevin Brubeck Unhammer <unhammer <at> fsfe.org>
To: Noam Postavsky <npostavs <at> gmail.com>
Cc: 37397 <at> debbugs.gnu.org
Subject: Re: bug#37397: 26.2;
 less-than-character cuts off end of pre tag in eww
Date: Fri, 13 Sep 2019 14:45:19 +0200

Noam Postavsky <npostavs <at> gmail.com> writes:

>> Not really. Perhaps you could ask the libxml developers what they
>> think.
>
> Yes, though Lars suggested a preprocessing step might fix this problem
> as well (see https://debbugs.gnu.org/37009#14).

https://gitlab.gnome.org/GNOME/libxml2/issues/101 





Added tag(s) fixed. Request was from Lars Ingebrigtsen <larsi <at> gnus.org> to control <at> debbugs.gnu.org. (Sat, 14 Sep 2019 14:09:02 GMT)

Bug marked as fixed in version 27.1; send any further explanations to 37009 <at> debbugs.gnu.org and Nick Daly <nick.m.daly <at> gmail.com>. Request was from Lars Ingebrigtsen <larsi <at> gnus.org> to control <at> debbugs.gnu.org. (Sat, 14 Sep 2019 14:09:02 GMT)

Bug archived. Request was from Debbugs Internal Request <help-debbugs <at> gnu.org> to internal_control <at> debbugs.gnu.org. (Sun, 13 Oct 2019 11:24:05 GMT)

This bug report was last modified 5 years and 253 days ago.



GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.