From debbugs-submit-bounces@debbugs.gnu.org Thu Apr 23 09:24:21 2020 Received: (at submit) by debbugs.gnu.org; 23 Apr 2020 13:24:21 +0000 Received: from localhost ([127.0.0.1]:54246 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1jRbq1-0005gI-Ak for submit@debbugs.gnu.org; Thu, 23 Apr 2020 09:24:21 -0400 Received: from lists.gnu.org ([209.51.188.17]:44262) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1jRbpz-0005gA-UJ for submit@debbugs.gnu.org; Thu, 23 Apr 2020 09:24:20 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]:42588) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1jRbpz-0001TL-IS for bug-gnu-emacs@gnu.org; Thu, 23 Apr 2020 09:24:19 -0400 X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00, FSL_HELO_NON_FQDN_1, SPF_PASS autolearn=unavailable autolearn_force=no version=3.4.2 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.90_1) (envelope-from ) id 1jRbpz-00019P-01 for bug-gnu-emacs@gnu.org; Thu, 23 Apr 2020 09:24:19 -0400 Received: from andalucia.tim-landscheidt.de ([116.203.78.250]:48564 helo=andalucia) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1jRbpy-00013q-Dd for bug-gnu-emacs@gnu.org; Thu, 23 Apr 2020 09:24:18 -0400 Received: from dslb-090-186-010-106.090.186.pools.vodafone-ip.de ([90.186.10.106]:52708 helo=passepartout.tim-landscheidt.de) by andalucia with esmtpsa (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.89) (envelope-from ) id 1jRbps-0003BM-Sg for bug-gnu-emacs@gnu.org; Thu, 23 Apr 2020 15:24:12 +0200 From: Tim Landscheidt To: bug-gnu-emacs@gnu.org Subject: 26.3; HTML entities ☆ and ★ (inter alia) are not parsed by libxml-parse-html-region Organization: http://www.tim-landscheidt.de/ Date: Thu, 23 Apr 2020 13:24:12 +0000 Message-ID: <87368uwd1f.fsf@passepartout.tim-landscheidt.de> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/26.3 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Received-SPF: pass client-ip=116.203.78.250; envelope-from=tim@tim-landscheidt.de; helo=andalucia X-detected-operating-system: by eggs.gnu.org: First seen = 2020/04/23 09:24:15 X-ACL-Warn: Detected OS = Linux 3.11 and newer X-Received-From: 116.203.78.250 X-Spam-Score: 0.2 (/) X-Debbugs-Envelope-To: submit X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.8 (/) (Prologue: This bug showed up in the "ALT" attribute of an "IMG" element of an HTML mail in Gnus. I am reasonably cer- tain that this stems from libxml-parse-html-region and should be fixed there, but there may be more prudent solu- tions.) With GNU Emacs 26.3 on Fedora: | ELISP> (with-temp-buffer | (insert " | | Title | |

Hello world

|

ä

|

|

| | ") | (libxml-parse-html-region (point-min) (point-max))) | (html | ((lang . "en")) | (head nil | (title nil "Title")) | (body nil "\n " | (p nil "Hello world") | "\n " | (p nil "=C3=A4") | "\n " | (p nil "☆") | "\n " | (p nil "★") | "\n")) | ELISP> These should instead yield "=C3=A4" (228), "=E2=98=86" (9734) and "=E2=98=85" (9733). lisp/leim/quail/sgml-input.el seems to contain the necessary data for ☆ and ★ that could probably be fed to libxml. From debbugs-submit-bounces@debbugs.gnu.org Wed Jul 29 01:26:27 2020 Received: (at 40794) by debbugs.gnu.org; 29 Jul 2020 05:26:27 +0000 Received: from localhost ([127.0.0.1]:59746 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1k0ebi-0003EW-Sp for submit@debbugs.gnu.org; Wed, 29 Jul 2020 01:26:27 -0400 Received: from quimby.gnus.org ([95.216.78.240]:41986) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1k0ebh-0003EK-DN for 40794@debbugs.gnu.org; Wed, 29 Jul 2020 01:26:25 -0400 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnus.org; s=20200322; h=Content-Transfer-Encoding:Content-Type:MIME-Version:Message-ID :In-Reply-To:Date:References:Subject:Cc:To:From:Sender:Reply-To:Content-ID: Content-Description:Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc :Resent-Message-ID:List-Id:List-Help:List-Unsubscribe:List-Subscribe: List-Post:List-Owner:List-Archive; bh=TR0tMONcdeULTYuKuyThijpClM1vOGipMcIzXYbGuz8=; b=FjmAhdcmf894ruyJWTJWO6/AN8 vjuHE+JaWrc6z0Yp56dghICpZ0WZNBpqyYWmr2lhGVQKsc+qElzwEtk9PXc+6GsjXSCFJqNrF2b9e h7lbGpjqPEhhi8BTaO+hMCfwchXbcgADU0CW17D9DHvXND7AVcz18bo12RkpWzPJtl1I=; Received: from cm-84.212.202.86.getinternet.no ([84.212.202.86] helo=xo) by quimby with esmtpsa (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1k0ebY-0002WC-Ck; Wed, 29 Jul 2020 07:26:19 +0200 From: Lars Ingebrigtsen To: Tim Landscheidt Subject: Re: bug#40794: 26.3; HTML entities ☆ and ★ (inter alia) are not parsed by libxml-parse-html-region References: <87368uwd1f.fsf@passepartout.tim-landscheidt.de> Date: Wed, 29 Jul 2020 07:26:15 +0200 In-Reply-To: <87368uwd1f.fsf@passepartout.tim-landscheidt.de> (Tim Landscheidt's message of "Thu, 23 Apr 2020 13:24:12 +0000") Message-ID: <878sf23n9k.fsf@gnus.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Spam-Report: Spam detection software, running on the system "quimby.gnus.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see @@CONTACT_ADDRESS@@ for details. Content preview: Tim Landscheidt writes: > (Prologue: This bug showed up in the "ALT" attribute of an > "IMG" element of an HTML mail in Gnus. I am reasonably cer- > tain that this stems from libxml-parse-html-region and > should be fixed th [...] Content analysis details: (-2.9 points, 5.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -1.0 ALL_TRUSTED Passed through trusted hosts only via SMTP -1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1% [score: 0.0000] X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 40794 Cc: 40794@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) Tim Landscheidt writes: > (Prologue: This bug showed up in the "ALT" attribute of an > "IMG" element of an HTML mail in Gnus. I am reasonably cer- > tain that this stems from libxml-parse-html-region and > should be fixed there, but there may be more prudent solu- > tions.) [...] > These should instead yield "=C3=A4" (228), "=E2=98=86" (9734) and > "=E2=98=85" (9733). > > lisp/leim/quail/sgml-input.el seems to contain the necessary > data for ☆ and ★ that could probably be fed to > libxml. As far as I can tell, libxml2 doesn't take a list of entities as an input when parsing HTML? I may have missed something... Hm, a bit of googling shows http://xmlsoft.org/html/libxml-entities.html and there is apparently a way to tell libxml2 about further entities? But I think this all sounds more like a libxml2 than an Emacs bug, really? --=20 (domestic pets only, the antidote for overdose, milk.) bloggy blog: http://lars.ingebrigtsen.no From debbugs-submit-bounces@debbugs.gnu.org Wed Jul 29 01:36:11 2020 Received: (at 40794) by debbugs.gnu.org; 29 Jul 2020 05:36:11 +0000 Received: from localhost ([127.0.0.1]:59783 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1k0el9-0005j8-HJ for submit@debbugs.gnu.org; Wed, 29 Jul 2020 01:36:11 -0400 Received: from quimby.gnus.org ([95.216.78.240]:42106) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1k0el7-0005it-7Q for 40794@debbugs.gnu.org; Wed, 29 Jul 2020 01:36:10 -0400 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnus.org; s=20200322; h=Content-Type:MIME-Version:Message-ID:In-Reply-To:Date: References:Subject:Cc:To:From:Sender:Reply-To:Content-Transfer-Encoding: Content-ID:Content-Description:Resent-Date:Resent-From:Resent-Sender: Resent-To:Resent-Cc:Resent-Message-ID:List-Id:List-Help:List-Unsubscribe: List-Subscribe:List-Post:List-Owner:List-Archive; bh=TzEbr2UfztG4hQgvGYVUht+u1I+VlmRESLAYE0wM2nY=; b=gA2I+Rcl4hvlHOzxaPhBSN4pHd ToF3Qd6MsfOztbz1eAdV5mD7IIjBo/G5nk2wv2mf1zWwTwG0C1ZZOgKfI9jsdUOPCtP6MVhz6c+Fv 2c0FfZiyIhimiC2FqYfp8s13cDiGMg8bwyI+OyRqI53CzmogeDOTrdU+7njozQbFvNCg=; Received: from cm-84.212.202.86.getinternet.no ([84.212.202.86] helo=xo) by quimby with esmtpsa (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1k0ekq-0002bY-W1; Wed, 29 Jul 2020 07:36:02 +0200 From: Lars Ingebrigtsen To: Tim Landscheidt Subject: Re: bug#40794: 26.3; HTML entities ☆ and ★ (inter alia) are not parsed by libxml-parse-html-region References: <87368uwd1f.fsf@passepartout.tim-landscheidt.de> <878sf23n9k.fsf@gnus.org> Date: Wed, 29 Jul 2020 07:35:51 +0200 In-Reply-To: <878sf23n9k.fsf@gnus.org> (Lars Ingebrigtsen's message of "Wed, 29 Jul 2020 07:26:15 +0200") Message-ID: <874kpq3mtk.fsf@gnus.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Spam-Report: Spam detection software, running on the system "quimby.gnus.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see @@CONTACT_ADDRESS@@ for details. Content preview: I had a look at the libxml2 sources. The logic isn't really explained, but apparently they include all the <255-value entities, and then a selected number of the other entities (about 160 of them). I have no idea what the logic behind this is... perhaps they've just forgotten to add the new ones? Which makes me think that this is really a libxml2 bug, and you should report it there instead. Content analysis details: (-2.9 points, 5.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -1.0 ALL_TRUSTED Passed through trusted hosts only via SMTP -1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1% [score: 0.0000] X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 40794 Cc: 40794@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) I had a look at the libxml2 sources. The logic isn't really explained, but apparently they include all the <255-value entities, and then a selected number of the other entities (about 160 of them). I have no idea what the logic behind this is... perhaps they've just forgotten to add the new ones? Which makes me think that this is really a libxml2 bug, and you should report it there instead. Excerpt: /************************************************************************ * * * The list of HTML predefined entities * * * ************************************************************************/ static const htmlEntityDesc html40EntitiesTable[] = { /* * the 4 absolute ones, plus apostrophe. */ { 34, "quot", "quotation mark = APL quote, U+0022 ISOnum" }, { 38, "amp", "ampersand, U+0026 ISOnum" }, { 39, "apos", "single quote" }, { 60, "lt", "less-than sign, U+003C ISOnum" }, { 62, "gt", "greater-than sign, U+003E ISOnum" }, /* * A bunch still in the 128-255 range * Replacing them depend really on the charset used. */ { 160, "nbsp", "no-break space = non-breaking space, U+00A0 ISOnum" }, { 161, "iexcl","inverted exclamation mark, U+00A1 ISOnum" }, { 162, "cent", "cent sign, U+00A2 ISOnum" }, [...] { 376, "Yuml", "latin capital letter Y with diaeresis, U+0178 ISOlat2" }, /* * Anything below should really be kept as entities references */ { 402, "fnof", "latin small f with hook = function = florin, U+0192 ISOtech" }, { 710, "circ", "modifier letter circumflex accent, U+02C6 ISOpub" }, { 732, "tilde","small tilde, U+02DC ISOdia" }, { 913, "Alpha","greek capital letter alpha, U+0391" }, { 914, "Beta", "greek capital letter beta, U+0392" }, { 915, "Gamma","greek capital letter gamma, U+0393 ISOgrk3" }, { 916, "Delta","greek capital letter delta, U+0394 ISOgrk3" }, [...] { 9824, "spades","black spade suit, U+2660 ISOpub" }, { 9827, "clubs","black club suit = shamrock, U+2663 ISOpub" }, { 9829, "hearts","black heart suit = valentine, U+2665 ISOpub" }, { 9830, "diams","black diamond suit, U+2666 ISOpub" }, -- (domestic pets only, the antidote for overdose, milk.) bloggy blog: http://lars.ingebrigtsen.no From debbugs-submit-bounces@debbugs.gnu.org Wed Sep 09 09:22:19 2020 Received: (at 40794) by debbugs.gnu.org; 9 Sep 2020 13:22:19 +0000 Received: from localhost ([127.0.0.1]:57875 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kG03H-0001BD-43 for submit@debbugs.gnu.org; Wed, 09 Sep 2020 09:22:19 -0400 Received: from mail-ej1-f48.google.com ([209.85.218.48]:33636) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kG03G-0001B1-07 for 40794@debbugs.gnu.org; Wed, 09 Sep 2020 09:22:18 -0400 Received: by mail-ej1-f48.google.com with SMTP id j11so3566276ejk.0 for <40794@debbugs.gnu.org>; Wed, 09 Sep 2020 06:22:17 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:in-reply-to:references:user-agent :mime-version:date:message-id:subject:to:cc; bh=e8AokCPQUlZwM8hvu+AgfQAvfWScWqC5IOSW1ApNrUE=; b=VXdJEUV9n3+wcSfRbCQvpz2uWCZJ4AEPMkeebPn5aq/IaKFd3PQhdpPrPpjTk99bRn qyKYTdcA1RahZvW2Cr6fj6DgldGy7Ny8ozyenllD2E60z8TlaOstZTtMSS4t3i89LaZj mNIEm6D0AhBgUiSgpCdg60iPoBlsRRHlQjXzvWcGpWSEyqliTdvmKUyE1MOetLKkHZ/I BjR8f+Q7FrkfEE4+SdaVOEuVqM1AthUmYZLU3fpOcsAWTb+EG+axDDrAWMdqGzAWzH5o irJ/l7WQ8r/bbtSDekY/Oy3E7ksZsEQ8Pasrm7BJ2DfQg2feRQAd+Zg+CAbbJgmcYKIp SpEQ== X-Gm-Message-State: AOAM530i5F00E3yb4JxxQl9C/1YyyZ/C3C2WfrZWVkyG3nLGMqtjh56g 14lu7nPfnbUnLMtPSFKStlLd+kk99Fviml7SIog= X-Google-Smtp-Source: ABdhPJz1aq4X9PnikieDpDNDC54MYM+LETmTsZswJa1RKOxNys/knFcf4ddKGyqiNiPsZ/3ZGTX9VQBTVzvoqEo+vIM= X-Received: by 2002:a17:906:16c8:: with SMTP id t8mr3432778ejd.272.1599657732174; Wed, 09 Sep 2020 06:22:12 -0700 (PDT) Received: from 753933720722 named unknown by gmailapi.google.com with HTTPREST; Wed, 9 Sep 2020 06:22:11 -0700 From: Stefan Kangas In-Reply-To: <874kpq3mtk.fsf@gnus.org> (Lars Ingebrigtsen's message of "Wed, 29 Jul 2020 07:35:51 +0200") References: <87368uwd1f.fsf@passepartout.tim-landscheidt.de> <878sf23n9k.fsf@gnus.org> <874kpq3mtk.fsf@gnus.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux) MIME-Version: 1.0 Date: Wed, 9 Sep 2020 06:22:11 -0700 Message-ID: Subject: Re: bug#40794: 26.3; HTML entities ☆ and ★ (inter alia) are not parsed by libxml-parse-html-region To: Lars Ingebrigtsen Content-Type: text/plain; charset="UTF-8" X-Spam-Score: 0.5 (/) X-Debbugs-Envelope-To: 40794 Cc: 40794@debbugs.gnu.org, Tim Landscheidt X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.5 (/) Lars Ingebrigtsen writes: > I had a look at the libxml2 sources. The logic isn't really explained, > but apparently they include all the <255-value entities, and then a > selected number of the other entities (about 160 of them). > > I have no idea what the logic behind this is... perhaps they've just > forgotten to add the new ones? Which makes me think that this is really > a libxml2 bug, and you should report it there instead. Agreed. Tim, could you please report this to the libxml2 developers? From debbugs-submit-bounces@debbugs.gnu.org Wed Nov 25 05:03:48 2020 Received: (at 40794) by debbugs.gnu.org; 25 Nov 2020 10:03:48 +0000 Received: from localhost ([127.0.0.1]:34469 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1khreN-0003Ac-Rh for submit@debbugs.gnu.org; Wed, 25 Nov 2020 05:03:48 -0500 Received: from mail-ed1-f51.google.com ([209.85.208.51]:33853) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1khreL-0003AI-In for 40794@debbugs.gnu.org; Wed, 25 Nov 2020 05:03:45 -0500 Received: by mail-ed1-f51.google.com with SMTP id a15so1924993edy.1 for <40794@debbugs.gnu.org>; Wed, 25 Nov 2020 02:03:45 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:in-reply-to:references:user-agent :mime-version:date:message-id:subject:to:cc; bh=W1HD5wdxun32q+wRPvMdWNrKtzgcOGTvhKjkBXoXTqE=; b=FpM05XseefmW/BNZiKNqVva/S4dJT7sGpIEkOy7f/Wmh9VF31rbJQRuGRtmiBOq8Co 9blsyvoPEjr7OpoYCQ21Xj6HRyHlRUxrK1DGhQugQRchGYFn08RGdgQgSyVFNGMDimY2 aFnxkwvXJi8hMbjWsQGyK8cGLZTE1HnZvrEcBbWwiSQVDJWrXQmS8UnzI6kfbImb2fq9 OAm8mo5aItFrlmNicjTum+fBjEoJt4aEx/flhvT06/hiHxqNlwST+9XQcNizFtvhWvEp s+JwbA1dAitzRoki4DwQGG2Q3xrWIl0G1/YXFAF+9pazjnFl2fPqIyhLqkTfazSuJka9 TTFA== X-Gm-Message-State: AOAM5334AZZO+cVmKwa9Ea5kN3N1rRa09E6wz9HxM0FF+FT/F1kSGLgK jDxxw7plN/akIGfyyomlRAENtfToqLL+I0XdK0s= X-Google-Smtp-Source: ABdhPJwQaf2txdnWfSI5QdbIzvUn4d0g5b/PcrmNtxDU4a7Kde56cQlKnvhzgfBGAnZIyiMt2SPvzdc/9GbY5CPU5oA= X-Received: by 2002:a05:6402:3089:: with SMTP id de9mr2740621edb.100.1606298619923; Wed, 25 Nov 2020 02:03:39 -0800 (PST) Received: from 753933720722 named unknown by gmailapi.google.com with HTTPREST; Wed, 25 Nov 2020 02:03:39 -0800 From: Stefan Kangas In-Reply-To: (Stefan Kangas's message of "Wed, 9 Sep 2020 06:22:11 -0700") References: <87368uwd1f.fsf@passepartout.tim-landscheidt.de> <878sf23n9k.fsf@gnus.org> <874kpq3mtk.fsf@gnus.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux) MIME-Version: 1.0 Date: Wed, 25 Nov 2020 02:03:39 -0800 Message-ID: Subject: Re: bug#40794: 26.3; HTML entities ☆ and ★ (inter alia) are not parsed by libxml-parse-html-region To: Lars Ingebrigtsen Content-Type: text/plain; charset="UTF-8" X-Spam-Score: 0.5 (/) X-Debbugs-Envelope-To: 40794 Cc: 40794@debbugs.gnu.org, Tim Landscheidt X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.5 (/) tags 40794 notabug close 40794 thanks Stefan Kangas writes: > Lars Ingebrigtsen writes: > >> I had a look at the libxml2 sources. The logic isn't really explained, >> but apparently they include all the <255-value entities, and then a >> selected number of the other entities (about 160 of them). >> >> I have no idea what the logic behind this is... perhaps they've just >> forgotten to add the new ones? Which makes me think that this is really >> a libxml2 bug, and you should report it there instead. > > Agreed. Tim, could you please report this to the libxml2 developers? That was 10 weeks ago, and we seem to agree that this is not a bug in Emacs. I'm therefore closing this bug report. Please report this issue to the libxml2 developers if it is still an issue. From unknown Sun Jun 22 11:37:15 2025 Received: (at fakecontrol) by fakecontrolmessage; To: internal_control@debbugs.gnu.org From: Debbugs Internal Request Subject: Internal Control Message-Id: bug archived. Date: Wed, 23 Dec 2020 12:24:11 +0000 User-Agent: Fakemail v42.6.9 # This is a fake control message. # # The action: # bug archived. thanks # This fakemail brought to you by your local debbugs # administrator