From unknown Tue Jun 24 13:57:54 2025 X-Loop: help-debbugs@gnu.org Subject: bug#16344: 24.3; xml-parse-region stuck on specific input, uses lots of memory Resent-From: Christopher Wellons Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Sat, 04 Jan 2014 23:40:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: report 16344 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: To: 16344@debbugs.gnu.org X-Debbugs-Original-To: bug-gnu-emacs@gnu.org Received: via spool by submit@debbugs.gnu.org id=B.13888787971725 (code B ref -1); Sat, 04 Jan 2014 23:40:02 +0000 Received: (at submit) by debbugs.gnu.org; 4 Jan 2014 23:39:57 +0000 Received: from localhost ([127.0.0.1]:34649 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Vzaou-0000Ri-IC for submit@debbugs.gnu.org; Sat, 04 Jan 2014 18:39:56 -0500 Received: from eggs.gnu.org ([208.118.235.92]:52789) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Vzaor-0000RY-5d for submit@debbugs.gnu.org; Sat, 04 Jan 2014 18:39:53 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Vzaol-0005l1-Gz for submit@debbugs.gnu.org; Sat, 04 Jan 2014 18:39:52 -0500 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-0.5 required=5.0 tests=BAYES_05 autolearn=disabled version=3.3.2 Received: from lists.gnu.org ([2001:4830:134:3::11]:45265) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Vzaol-0005kx-E3 for submit@debbugs.gnu.org; Sat, 04 Jan 2014 18:39:47 -0500 Received: from eggs.gnu.org ([2001:4830:134:3::10]:54474) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Vzaog-00015X-C8 for bug-gnu-emacs@gnu.org; Sat, 04 Jan 2014 18:39:47 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Vzaob-0005j1-H6 for bug-gnu-emacs@gnu.org; Sat, 04 Jan 2014 18:39:42 -0500 Received: from mail.nullprogram.com ([192.241.191.137]:58884) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Vzaob-0005ip-Dg for bug-gnu-emacs@gnu.org; Sat, 04 Jan 2014 18:39:37 -0500 Received: from localhost ([127.0.0.1] helo=susie.zeus.nullprogram.com) by mail.nullprogram.com with esmtp (Exim 4.82) (envelope-from ) id 1VzaoV-00039X-1q; Sat, 04 Jan 2014 18:39:31 -0500 From: Christopher Wellons User-Agent: Notmuch/0.16+156~gdb5189a (http://notmuchmail.org) Emacs/24.3.1 (x86_64-pc-linux-gnu) Date: Sat, 04 Jan 2014 18:39:30 -0500 Message-ID: <87ha9jqpu5.fsf@susie.zeus.nullprogram.com> MIME-Version: 1.0 Content-Type: text/plain X-detected-operating-system: by eggs.gnu.org: GNU/Linux 3.x [generic] X-detected-operating-system: by eggs.gnu.org: Error: Malformed IPv6 address (bad octet value). X-Received-From: 2001:4830:134:3::11 X-Spam-Score: -5.0 (-----) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -5.0 (-----) The function `xml-parse-region' is never returning from parsing this specific input. It's saturing a CPU core while using huge quantities of memory. Fortunately it can be aborted with C-g. (with-temp-buffer (insert "< /x>") (xml-parse-region)) The trigger is a comment followed by at least one opening tag followed by a stray less-than character. This is not valid XML due to that extra space in the closing tag, but I'm expecting `xml-parse-region' to signal an error in the case of invalid input rather than lock up Emacs. Just for the sake of context, this issue was initially discovered here: https://github.com/skeeto/elfeed/issues/17 From unknown Tue Jun 24 13:57:54 2025 X-Loop: help-debbugs@gnu.org Subject: bug#16344: 24.3; xml-parse-region stuck on specific input, uses lots of memory In-Reply-To: <87ha9jqpu5.fsf@susie.zeus.nullprogram.com> Resent-From: Ulf Jasper Original-Sender: "Debbugs-submit" Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Wed, 02 Mar 2016 18:12:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 16344 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: To: Christopher Wellons Cc: 16344@debbugs.gnu.org, 16344-done@debbugs.gnu.org Received: via spool by 16344-submit@debbugs.gnu.org id=B16344.145694227913414 (code B ref 16344); Wed, 02 Mar 2016 18:12:02 +0000 Received: (at 16344) by debbugs.gnu.org; 2 Mar 2016 18:11:19 +0000 Received: from localhost ([127.0.0.1]:58071 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84) (envelope-from ) id 1abBF1-0003UE-2v for submit@debbugs.gnu.org; Wed, 02 Mar 2016 13:11:19 -0500 Received: from mout.web.de ([212.227.15.14]:60563) by debbugs.gnu.org with esmtp (Exim 4.84) (envelope-from ) id 1abBEz-0003Tp-37; Wed, 02 Mar 2016 13:11:17 -0500 Received: from p55m-ud2 ([217.226.95.17]) by smtp.web.de (mrweb001) with ESMTPSA (Nemesis) id 0LpwMZ-1ZvoL90frT-00ffBk; Wed, 02 Mar 2016 19:11:04 +0100 From: Ulf Jasper References: <87ha9jqpu5.fsf@susie.zeus.nullprogram.com> Date: Wed, 02 Mar 2016 19:11:03 +0100 Message-ID: <87egbsyclk.fsf@web.de> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/25.0.91 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Provags-ID: V03:K0:+Xhc6/KTewAj7roMAAUtKFY/eb+sr5pYEeh3oj8XJ8snGUbMgu3 gCjGXGspXlk9PnrLiOTKbZWFl3aGYFHzwB3p6BWe4bCrDrD2TKrep8SaRRlNnf58MfDsQRW SqXGHOIGOHMZwYuCk3l3AGnsAlgeLNbBvM6VEhrloXPN6AeKGrPL4VTz4djKIt3+zWrq4uL dLVKdivjdWbacNp9NWJRg== X-UI-Out-Filterresults: notjunk:1;V01:K0:pxlWDORDraY=:Tb5zCHW+y6WdLJrtwOsH2p r35rSZFnA2hoGRHJ5L2XQCdyPs8AID8s8CnTRjX6NDFtpTkeM+uB2dF4trBO06EfIVp07T+JL D5S1VwiHMvXlUCyV/VdFNF248M/9rD3TIXXlIdB8RKKvop4UKvjuBkwSrSu+LEt2hgAUxjPFb 58LqCVAAscJZgi3a71M8Ug2apRlJ/+0D5N6ByHErAweNUOLgIcAXBjgjEhOLmWZhreeaHEtuW Qg+GvXSHZoMS7dqUvg1T96xSK1L2mNjqJZEit1a4KuvDxF58rFwezbUggWoI3rP1rtSXM5Z62 n+WXF+eJpBx3tWPCQNkyS6vWHEaozeW8asPHnaKd2vhncT+e6j+LlXziI3AjkiY3JvHT9aStt MMlMkEQSwpScy6ag+GBCt0UU4Y/COHFo+NywPGCYWOZdoUJwR1vfjcJLrcnk5qQhkxzAtLFuH 0ZkSNFCYp3DkqrovI9j8Md+o5gCUVAPDKOe+8qBNZrsXeDKr8CNMovQZTaNA285aOMxGntXHr euTMb/gaMN2If9RRTvpwqTswNy7E5dtaG2WY6jXh9+/QzymNsIezKwF9mrADXn5hZ69KUeT2R t+Lg9xKs7ZVn8swElpVfmhhZfIc5VJJBslS3RjsVP1lJ5JAn+h5m2+KtlwxjNRt83K4z21SSg 7uS00EvV9varrHT78O+yNjkVpw50Z0NrNmjCho/hTZqSOGoXOs0hvpze1jotnwY/BBYhjRUCR 7kQvk6hB5Uazbka/q2cS1uJCHvFWJrPIle0ffqaHObrLYbxVbkNkRTT5AjpxKDpABLmFbZxWi qW9vmo2 X-Spam-Score: -0.7 (/) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.7 (/) Fixed in branch emacs-25. xml-parser will throw an error now. From unknown Tue Jun 24 13:57:54 2025 MIME-Version: 1.0 X-Mailer: MIME-tools 5.505 (Entity 5.505) X-Loop: help-debbugs@gnu.org From: help-debbugs@gnu.org (GNU bug Tracking System) To: Christopher Wellons Subject: bug#16344: closed (Re: bug#16344: 24.3; xml-parse-region stuck on specific input, uses lots of memory) Message-ID: References: <87egbsyclk.fsf@web.de> <87ha9jqpu5.fsf@susie.zeus.nullprogram.com> X-Gnu-PR-Message: they-closed 16344 X-Gnu-PR-Package: emacs Reply-To: 16344@debbugs.gnu.org Date: Wed, 02 Mar 2016 18:12:03 +0000 Content-Type: multipart/mixed; boundary="----------=_1456942323-13475-1" This is a multi-part message in MIME format... ------------=_1456942323-13475-1 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset="utf-8" Your bug report #16344: 24.3; xml-parse-region stuck on specific input, uses lots of memory which was filed against the emacs package, has been closed. The explanation is attached below, along with your original report. If you require more details, please reply to 16344@debbugs.gnu.org. --=20 16344: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=3D16344 GNU Bug Tracking System Contact help-debbugs@gnu.org with problems ------------=_1456942323-13475-1 Content-Type: message/rfc822 Content-Disposition: inline Content-Transfer-Encoding: 7bit Received: (at 16344-done) by debbugs.gnu.org; 2 Mar 2016 18:11:19 +0000 Received: from localhost ([127.0.0.1]:58069 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84) (envelope-from ) id 1abBF0-0003UA-Pj for submit@debbugs.gnu.org; Wed, 02 Mar 2016 13:11:18 -0500 Received: from mout.web.de ([212.227.15.14]:60563) by debbugs.gnu.org with esmtp (Exim 4.84) (envelope-from ) id 1abBEz-0003Tp-37; Wed, 02 Mar 2016 13:11:17 -0500 Received: from p55m-ud2 ([217.226.95.17]) by smtp.web.de (mrweb001) with ESMTPSA (Nemesis) id 0LpwMZ-1ZvoL90frT-00ffBk; Wed, 02 Mar 2016 19:11:04 +0100 From: Ulf Jasper To: Christopher Wellons Subject: Re: bug#16344: 24.3; xml-parse-region stuck on specific input, uses lots of memory References: <87ha9jqpu5.fsf@susie.zeus.nullprogram.com> Date: Wed, 02 Mar 2016 19:11:03 +0100 Message-ID: <87egbsyclk.fsf@web.de> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/25.0.91 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Provags-ID: V03:K0:+Xhc6/KTewAj7roMAAUtKFY/eb+sr5pYEeh3oj8XJ8snGUbMgu3 gCjGXGspXlk9PnrLiOTKbZWFl3aGYFHzwB3p6BWe4bCrDrD2TKrep8SaRRlNnf58MfDsQRW SqXGHOIGOHMZwYuCk3l3AGnsAlgeLNbBvM6VEhrloXPN6AeKGrPL4VTz4djKIt3+zWrq4uL dLVKdivjdWbacNp9NWJRg== X-UI-Out-Filterresults: notjunk:1;V01:K0:pxlWDORDraY=:Tb5zCHW+y6WdLJrtwOsH2p r35rSZFnA2hoGRHJ5L2XQCdyPs8AID8s8CnTRjX6NDFtpTkeM+uB2dF4trBO06EfIVp07T+JL D5S1VwiHMvXlUCyV/VdFNF248M/9rD3TIXXlIdB8RKKvop4UKvjuBkwSrSu+LEt2hgAUxjPFb 58LqCVAAscJZgi3a71M8Ug2apRlJ/+0D5N6ByHErAweNUOLgIcAXBjgjEhOLmWZhreeaHEtuW Qg+GvXSHZoMS7dqUvg1T96xSK1L2mNjqJZEit1a4KuvDxF58rFwezbUggWoI3rP1rtSXM5Z62 n+WXF+eJpBx3tWPCQNkyS6vWHEaozeW8asPHnaKd2vhncT+e6j+LlXziI3AjkiY3JvHT9aStt MMlMkEQSwpScy6ag+GBCt0UU4Y/COHFo+NywPGCYWOZdoUJwR1vfjcJLrcnk5qQhkxzAtLFuH 0ZkSNFCYp3DkqrovI9j8Md+o5gCUVAPDKOe+8qBNZrsXeDKr8CNMovQZTaNA285aOMxGntXHr euTMb/gaMN2If9RRTvpwqTswNy7E5dtaG2WY6jXh9+/QzymNsIezKwF9mrADXn5hZ69KUeT2R t+Lg9xKs7ZVn8swElpVfmhhZfIc5VJJBslS3RjsVP1lJ5JAn+h5m2+KtlwxjNRt83K4z21SSg 7uS00EvV9varrHT78O+yNjkVpw50Z0NrNmjCho/hTZqSOGoXOs0hvpze1jotnwY/BBYhjRUCR 7kQvk6hB5Uazbka/q2cS1uJCHvFWJrPIle0ffqaHObrLYbxVbkNkRTT5AjpxKDpABLmFbZxWi qW9vmo2 X-Spam-Score: -0.7 (/) X-Debbugs-Envelope-To: 16344-done Cc: 16344@debbugs.gnu.org, 16344-done@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.7 (/) Fixed in branch emacs-25. xml-parser will throw an error now. ------------=_1456942323-13475-1 Content-Type: message/rfc822 Content-Disposition: inline Content-Transfer-Encoding: 7bit Received: (at submit) by debbugs.gnu.org; 4 Jan 2014 23:39:57 +0000 Received: from localhost ([127.0.0.1]:34649 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Vzaou-0000Ri-IC for submit@debbugs.gnu.org; Sat, 04 Jan 2014 18:39:56 -0500 Received: from eggs.gnu.org ([208.118.235.92]:52789) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Vzaor-0000RY-5d for submit@debbugs.gnu.org; Sat, 04 Jan 2014 18:39:53 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Vzaol-0005l1-Gz for submit@debbugs.gnu.org; Sat, 04 Jan 2014 18:39:52 -0500 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-0.5 required=5.0 tests=BAYES_05 autolearn=disabled version=3.3.2 Received: from lists.gnu.org ([2001:4830:134:3::11]:45265) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Vzaol-0005kx-E3 for submit@debbugs.gnu.org; Sat, 04 Jan 2014 18:39:47 -0500 Received: from eggs.gnu.org ([2001:4830:134:3::10]:54474) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Vzaog-00015X-C8 for bug-gnu-emacs@gnu.org; Sat, 04 Jan 2014 18:39:47 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Vzaob-0005j1-H6 for bug-gnu-emacs@gnu.org; Sat, 04 Jan 2014 18:39:42 -0500 Received: from mail.nullprogram.com ([192.241.191.137]:58884) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Vzaob-0005ip-Dg for bug-gnu-emacs@gnu.org; Sat, 04 Jan 2014 18:39:37 -0500 Received: from localhost ([127.0.0.1] helo=susie.zeus.nullprogram.com) by mail.nullprogram.com with esmtp (Exim 4.82) (envelope-from ) id 1VzaoV-00039X-1q; Sat, 04 Jan 2014 18:39:31 -0500 From: Christopher Wellons To: bug-gnu-emacs@gnu.org Subject: 24.3; xml-parse-region stuck on specific input, uses lots of memory User-Agent: Notmuch/0.16+156~gdb5189a (http://notmuchmail.org) Emacs/24.3.1 (x86_64-pc-linux-gnu) Date: Sat, 04 Jan 2014 18:39:30 -0500 Message-ID: <87ha9jqpu5.fsf@susie.zeus.nullprogram.com> MIME-Version: 1.0 Content-Type: text/plain X-detected-operating-system: by eggs.gnu.org: GNU/Linux 3.x [generic] X-detected-operating-system: by eggs.gnu.org: Error: Malformed IPv6 address (bad octet value). X-Received-From: 2001:4830:134:3::11 X-Spam-Score: -5.0 (-----) X-Debbugs-Envelope-To: submit X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -5.0 (-----) The function `xml-parse-region' is never returning from parsing this specific input. It's saturing a CPU core while using huge quantities of memory. Fortunately it can be aborted with C-g. (with-temp-buffer (insert "< /x>") (xml-parse-region)) The trigger is a comment followed by at least one opening tag followed by a stray less-than character. This is not valid XML due to that extra space in the closing tag, but I'm expecting `xml-parse-region' to signal an error in the case of invalid input rather than lock up Emacs. Just for the sake of context, this issue was initially discovered here: https://github.com/skeeto/elfeed/issues/17 ------------=_1456942323-13475-1--