From debbugs-submit-bounces@debbugs.gnu.org Mon Jul 07 08:55:23 2014 Received: (at submit) by debbugs.gnu.org; 7 Jul 2014 12:55:23 +0000 Received: from localhost ([127.0.0.1]:45622 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1X48Rx-0002PK-T5 for submit@debbugs.gnu.org; Mon, 07 Jul 2014 08:55:22 -0400 Received: from eggs.gnu.org ([208.118.235.92]:57241) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1X48Rq-0002Os-Lf for submit@debbugs.gnu.org; Mon, 07 Jul 2014 08:55:15 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1X48Rb-0005sv-JS for submit@debbugs.gnu.org; Mon, 07 Jul 2014 08:55:05 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=0.0 required=5.0 tests=BAYES_40,UNPARSEABLE_RELAY autolearn=disabled version=3.3.2 Received: from lists.gnu.org ([2001:4830:134:3::11]:41769) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1X48Rb-0005sr-GJ for submit@debbugs.gnu.org; Mon, 07 Jul 2014 08:54:55 -0400 Received: from eggs.gnu.org ([2001:4830:134:3::10]:58882) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1X48RR-0005ye-Ex for bug-gnu-emacs@gnu.org; Mon, 07 Jul 2014 08:54:55 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1X48RH-0005q7-Jj for bug-gnu-emacs@gnu.org; Mon, 07 Jul 2014 08:54:45 -0400 Received: from korolev.univ-paris7.fr ([2001:660:3301:8000::1:2]:34707) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1X48RH-0005pj-9z for bug-gnu-emacs@gnu.org; Mon, 07 Jul 2014 08:54:35 -0400 Received: from mailhub.math.univ-paris-diderot.fr (mailhub.math.univ-paris-diderot.fr [81.194.30.253]) by korolev.univ-paris7.fr (8.14.4/8.14.4/relay1/46573) with ESMTP id s67CsWVe032710 for ; Mon, 7 Jul 2014 14:54:32 +0200 Received: from mailhub.math.univ-paris-diderot.fr (localhost [127.0.0.1]) by mailhub.math.univ-paris-diderot.fr (Postfix) with ESMTP id D05E72C1A19 for ; Mon, 7 Jul 2014 14:54:32 +0200 (CEST) X-Virus-Scanned: amavisd-new at math.univ-paris-diderot.fr Received: from mailhub.math.univ-paris-diderot.fr ([127.0.0.1]) by mailhub.math.univ-paris-diderot.fr (mailhub.math.univ-paris-diderot.fr [127.0.0.1]) (amavisd-new, port 10023) with ESMTP id pQass6QyMFrN for ; Mon, 7 Jul 2014 14:54:31 +0200 (CEST) Received: from ijon.pps.univ-paris-diderot.fr (unknown [78.194.40.74]) (Authenticated sender: jch) by mailhub.math.univ-paris-diderot.fr (Postfix) with ESMTPSA id E42472C1A18 for ; Mon, 7 Jul 2014 14:54:31 +0200 (CEST) Received: from jch (uid 1000) (envelope-from jch@pps.univ-paris-diderot.fr) id c002e8 by ijon.pps.univ-paris-diderot.fr (DragonFly Mail Agent v0.9); Mon, 07 Jul 2014 14:54:30 +0200 Date: Mon, 07 Jul 2014 14:54:30 +0200 Message-ID: <87fvidba7t.wl-jch@pps.univ-paris-diderot.fr> From: Juliusz Chroboczek To: bug-gnu-emacs@gnu.org Subject: 24.3; xml-print inserts spurious whitespace User-Agent: Wanderlust/2.15.9 MIME-Version: 1.0 (generated by SEMI-EPG 1.14.7 - "Harue") Content-Type: text/plain; charset=US-ASCII X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.2.7 (korolev.univ-paris7.fr [194.254.61.138]); Mon, 07 Jul 2014 14:54:32 +0200 (CEST) X-Miltered: at korolev with ID 53BA9888.001 by Joe's j-chkmail (http : // j-chkmail dot ensmp dot fr)! X-j-chkmail-Enveloppe: 53BA9888.001 from mailhub.math.univ-paris-diderot.fr/mailhub.math.univ-paris-diderot.fr/null/mailhub.math.univ-paris-diderot.fr/ X-j-chkmail-Score: MSGID : 53BA9888.001 on korolev.univ-paris7.fr : j-chkmail score : . : R=. U=. O=. B=0.000 -> S=0.000 X-j-chkmail-Status: Ham X-detected-operating-system: by eggs.gnu.org: Error: Malformed IPv6 address (bad octet value). X-detected-operating-system: by eggs.gnu.org: Error: Malformed IPv6 address (bad octet value). X-Received-From: 2001:4830:134:3::11 X-Spam-Score: -5.0 (-----) X-Debbugs-Envelope-To: submit X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -5.0 (-----) Hi. I'm trying to use Emacs' XML parser to parse Atom feeds, and I'm finding that xml-print inserts spurious whitespace in the XHTML content, which breaks formatting. Here's a pretty minimal example: (require 'xml) (defun xml-parse-from-string (s) (with-temp-buffer (insert s) (xml-parse-region (point-min) (point-max)))) (defun xml-print-to-string (x) (with-temp-buffer (xml-print x) (buffer-substring (point-min) (point-max)))) (xml-print-to-string (xml-parse-from-string "

[foo]

")) "

[ foo]

" From debbugs-submit-bounces@debbugs.gnu.org Tue Mar 01 09:14:39 2016 Received: (at control) by debbugs.gnu.org; 1 Mar 2016 14:14:39 +0000 Received: from localhost ([127.0.0.1]:54884 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84) (envelope-from ) id 1aal4Q-0000DT-Uh for submit@debbugs.gnu.org; Tue, 01 Mar 2016 09:14:39 -0500 Received: from hermes.netfonds.no ([80.91.224.195]:59628) by debbugs.gnu.org with esmtp (Exim 4.84) (envelope-from ) id 1aal4P-0000DE-43 for control@debbugs.gnu.org; Tue, 01 Mar 2016 09:14:37 -0500 Received: from 2.151.173.136.tmi.telenormobil.no ([2.151.173.136] helo=mouse) by hermes.netfonds.no with esmtpsa (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.72) (envelope-from ) id 1aal3j-0005qm-K8 for control@debbugs.gnu.org; Tue, 01 Mar 2016 15:13:56 +0100 From: Lars Ingebrigtsen To: control@debbugs.gnu.org Subject: control message for bug #17964 Message-ID: <87si0au3qd.fsf@totally-fudged-out-message-id> Date: Wed, 02 Mar 2016 01:13:52 +1100 User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/25.1.50 (gnu/linux) X-MailScanner-ID: 1aal3j-0005qm-K8 X-Netfonds-MailScanner: Found to be clean X-Netfonds-MailScanner-From: larsi@gnus.org MailScanner-NULL-Check: 1457446436.53475@7gR4k4xo/czpUstngM7jqA X-Spam-Status: No X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: control X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 0.0 (/) tags 17964 confirmed From debbugs-submit-bounces@debbugs.gnu.org Tue Mar 01 09:15:30 2016 Received: (at control) by debbugs.gnu.org; 1 Mar 2016 14:15:30 +0000 Received: from localhost ([127.0.0.1]:54901 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84) (envelope-from ) id 1aal5G-0000jU-F2 for submit@debbugs.gnu.org; Tue, 01 Mar 2016 09:15:30 -0500 Received: from hermes.netfonds.no ([80.91.224.195]:59663) by debbugs.gnu.org with esmtp (Exim 4.84) (envelope-from ) id 1aal5F-0000hr-AU for control@debbugs.gnu.org; Tue, 01 Mar 2016 09:15:29 -0500 Received: from 2.151.173.136.tmi.telenormobil.no ([2.151.173.136] helo=mouse) by hermes.netfonds.no with esmtpsa (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.72) (envelope-from ) id 1aal3s-0005rK-L5 for control@debbugs.gnu.org; Tue, 01 Mar 2016 15:14:05 +0100 From: Lars Ingebrigtsen To: control@debbugs.gnu.org Subject: control message for bug #17964 Message-ID: <87mvqiu3kx.fsf@totally-fudged-out-message-id> Date: Wed, 02 Mar 2016 01:14:01 +1100 User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/25.1.50 (gnu/linux) X-MailScanner-ID: 1aal3s-0005rK-L5 X-Netfonds-MailScanner: Found to be clean X-Netfonds-MailScanner-From: larsi@gnus.org MailScanner-NULL-Check: 1457446446.57957@u9JO39q7+7ShaphpmWar0w X-Spam-Status: No X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: control X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 0.0 (/) tags 17964 wontfix close 17964 From debbugs-submit-bounces@debbugs.gnu.org Tue Mar 01 09:16:05 2016 Received: (at 17964) by debbugs.gnu.org; 1 Mar 2016 14:16:05 +0000 Received: from localhost ([127.0.0.1]:54911 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84) (envelope-from ) id 1aal5p-0001Ee-0j for submit@debbugs.gnu.org; Tue, 01 Mar 2016 09:16:05 -0500 Received: from hermes.netfonds.no ([80.91.224.195]:59694) by debbugs.gnu.org with esmtp (Exim 4.84) (envelope-from ) id 1aal5l-00019q-Q9 for 17964@debbugs.gnu.org; Tue, 01 Mar 2016 09:16:03 -0500 Received: from 2.151.173.136.tmi.telenormobil.no ([2.151.173.136] helo=mouse) by hermes.netfonds.no with esmtpsa (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.72) (envelope-from ) id 1aal3o-0005r5-6u; Tue, 01 Mar 2016 15:14:00 +0100 From: Lars Ingebrigtsen To: Juliusz Chroboczek Subject: Re: bug#17964: 24.3; xml-print inserts spurious whitespace In-Reply-To: <87fvidba7t.wl-jch@pps.univ-paris-diderot.fr> (Juliusz Chroboczek's message of "Mon, 07 Jul 2014 14:54:30 +0200") Date: Tue, 01 Mar 2016 17:14:52 +1100 Message-ID: <87oaayu3kz.fsf@gnus.org> References: <87fvidba7t.wl-jch@pps.univ-paris-diderot.fr> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/25.1.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-MailScanner-ID: 1aal3o-0005r5-6u X-Netfonds-MailScanner: Found to be clean X-Netfonds-MailScanner-From: larsi@gnus.org MailScanner-NULL-Check: 1457446440.82735@BJFG56RviFrc8L/dxK1RAw X-Spam-Status: No X-Spam-Score: 1.1 (+) X-Spam-Report: Spam detection software, running on the system "debbugs.gnu.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see the administrator of that system for details. Content preview: Juliusz Chroboczek writes: > I'm trying to use Emacs' XML parser to parse Atom feeds, and I'm finding > that xml-print inserts spurious whitespace in the XHTML content, which > breaks formatting. Here's a pretty minimal example: > > (require 'xml) > > (defun xml-parse-from-string (s) > (with-temp-buffer > (insert s) > (xml-parse-region (point-min) (point-max)))) > > (defun xml-print-to-string (x) > (with-temp-buffer > (xml-print x) > (buffer-substring (point-min) (point-max)))) > > (xml-print-to-string (xml-parse-from-string "

[foo]

")) > "

[ > foo] >

" [...] Content analysis details: (1.1 points, 10.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -0.0 RCVD_IN_DNSWL_NONE RBL: Sender listed at http://www.dnswl.org/, no trust [80.91.224.195 listed in list.dnswl.org] 1.1 DATE_IN_PAST_06_12 Date: is 6 to 12 hours before Received: date X-Debbugs-Envelope-To: 17964 Cc: 17964@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 1.1 (+) X-Spam-Report: Spam detection software, running on the system "debbugs.gnu.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see the administrator of that system for details. Content preview: Juliusz Chroboczek writes: > I'm trying to use Emacs' XML parser to parse Atom feeds, and I'm finding > that xml-print inserts spurious whitespace in the XHTML content, which > breaks formatting. Here's a pretty minimal example: > > (require 'xml) > > (defun xml-parse-from-string (s) > (with-temp-buffer > (insert s) > (xml-parse-region (point-min) (point-max)))) > > (defun xml-print-to-string (x) > (with-temp-buffer > (xml-print x) > (buffer-substring (point-min) (point-max)))) > > (xml-print-to-string (xml-parse-from-string "

[foo]

")) > "

[ > foo] >

" [...] Content analysis details: (1.1 points, 10.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -0.0 RCVD_IN_DNSWL_NONE RBL: Sender listed at http://www.dnswl.org/, no trust [80.91.224.195 listed in list.dnswl.org] 1.1 DATE_IN_PAST_06_12 Date: is 6 to 12 hours before Received: date Juliusz Chroboczek writes: > I'm trying to use Emacs' XML parser to parse Atom feeds, and I'm finding > that xml-print inserts spurious whitespace in the XHTML content, which > breaks formatting. Here's a pretty minimal example: > > (require 'xml) > > (defun xml-parse-from-string (s) > (with-temp-buffer > (insert s) > (xml-parse-region (point-min) (point-max)))) > > (defun xml-print-to-string (x) > (with-temp-buffer > (xml-print x) > (buffer-substring (point-min) (point-max)))) > > (xml-print-to-string (xml-parse-from-string "

[foo]

")) > "

[ > foo] >

" I can confirm that this is still an issue on the Emacs trunk. However, our XML reader does not retain information about whitespace (or the lack of it). And

[ foo]

is perfectly valid XML. I don't really think there's a way to "fix" this, if it's considered a bug, which is kinda isn't. So I'm closing this as "wontfix". -- (domestic pets only, the antidote for overdose, milk.) bloggy blog: http://lars.ingebrigtsen.no From debbugs-submit-bounces@debbugs.gnu.org Tue Mar 01 12:17:07 2016 Received: (at 17964) by debbugs.gnu.org; 1 Mar 2016 17:17:07 +0000 Received: from localhost ([127.0.0.1]:56580 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84) (envelope-from ) id 1aanv0-00008h-Tp for submit@debbugs.gnu.org; Tue, 01 Mar 2016 12:17:07 -0500 Received: from mout.web.de ([212.227.17.12]:59661) by debbugs.gnu.org with esmtp (Exim 4.84) (envelope-from ) id 1aanuy-000080-CH for 17964@debbugs.gnu.org; Tue, 01 Mar 2016 12:17:05 -0500 Received: from p55m-ud2 ([217.226.70.234]) by smtp.web.de (mrweb101) with ESMTPSA (Nemesis) id 0LhNjo-1a5v5e0MHw-00mZK2; Tue, 01 Mar 2016 18:16:58 +0100 From: Ulf Jasper To: Lars Ingebrigtsen Subject: Re: bug#17964: 24.3; xml-print inserts spurious whitespace References: <87fvidba7t.wl-jch@pps.univ-paris-diderot.fr> <87oaayu3kz.fsf@gnus.org> Date: Tue, 01 Mar 2016 18:16:56 +0100 In-Reply-To: <87oaayu3kz.fsf@gnus.org> (Lars Ingebrigtsen's message of "Tue, 01 Mar 2016 17:14:52 +1100") Message-ID: <87pove15kn.fsf@web.de> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/25.0.90 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Provags-ID: V03:K0:Z5swecLySpxSl3EoOz4vXJpcrJn6OYP5m2Zd5cngt6nmosqjMdR 8H+6WJYHBQAuTSt3FJPgXQ5ZcriC6ItL0xnh5kz05vr6VY8HIEN7+VuDnvvl6rjxMrnJhCQ 1zNJ+GKpocn38xJcGyjU7YIU5dIKf2jI8xGCtgUgMf0ySkySykZAesllkJm2I55W969FIht fCZv/6loa0Ag6oi+wkRzw== X-UI-Out-Filterresults: notjunk:1;V01:K0:bS20SDpLG1Q=:/pnnHaCj/LKmbKIgGLfLV1 ZqF4+tV/mPcvheTIKkMw1cZ3glhqw3X5EYJs8AmKN3gR9BiRozmaTvdwG04hbIESyLe0zfpuw 1lxcaYHJgC5ECYHnZacLl7MKUOkrl8SpNBS2PQviFfz1h9Sylm131XRlhsIlYrx2XI+iKkcY/ tmmfqP1KrP11jnNESFv8KHXMyPm0+Lz3LdCq8UsmgHFtP8cnHl57q2nzS1pqQb1MkbVwLXNmb tjBVvfYvsBp3UH96TB+oszYS99RGvl1nBgWRXdUrG2xXQyGElxK78bGJqeC3ILoh7WIuQd30k Az+ll3lCxIF4wZ2zRcfQKjETnmmnyCBm3FzTxzytWhFaaF0iGxdNNSnyMTNwZSG0wrmaw7owz c/ihsAP99mdXecGHbh9xb+f+djEsfC92YWzYDarrZ3dpYHxbzaeEyuGYyHLvEQDm8ACy4QkO1 xvPTSicdoZbx8099JCRHDM9A3ByUU1lbMCsS4oOIpc92oikUVdEj0R0EUVjTX+8LHs7OdgQKW AtyewCuhimBrGCRVNNEj8V74629hXC5y33TMecokT96vmc9HiDvUqfoDBJAXWCiLqUbRCIsnN grPmBZiTo6cY/LRP01eTmw73XJr9bfz8giVvwj73CQT0WCYbfrSHio98z6yOnf5qBRa9Lrmlz Gv3jRiTxKbwGzOsgYs725Kjr4+tbJEf0uF+irGCK3u/SuoPUF+OFUG2u3EGjEv7QTcObHvGIU 6FHHiylwMP3ncjmBApBYoyQ3j39OihBfcKDlYg8vdoXf7v+bFOm96UfNOEHiipKAOQy20bD6k dNGoTRe X-Spam-Score: -0.7 (/) X-Debbugs-Envelope-To: 17964 Cc: 17964@debbugs.gnu.org, Juliusz Chroboczek X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.7 (/) Hi Juliusz, Am 01.03.2016 um 17:14 (+1100) schrieb Lars Ingebrigtsen: > Juliusz Chroboczek writes: > >> I'm trying to use Emacs' XML parser to parse Atom feeds, and I'm finding >> that xml-print inserts spurious whitespace in the XHTML content, which >> breaks formatting. [Not directly related to the bug report but maybe helpful:] 'newsticker.el' already does parsing of different Atom and RSS feeds. Maybe you can re-use some of that functionality. Please have a look at newst-backend.el. Best, Ulf From unknown Sun Jun 22 00:41:29 2025 Received: (at fakecontrol) by fakecontrolmessage; To: internal_control@debbugs.gnu.org From: Debbugs Internal Request Subject: Internal Control Message-Id: bug archived. Date: Wed, 30 Mar 2016 11:24:03 +0000 User-Agent: Fakemail v42.6.9 # This is a fake control message. # # The action: # bug archived. thanks # This fakemail brought to you by your local debbugs # administrator