From debbugs-submit-bounces@debbugs.gnu.org Mon May 22 09:59:40 2023
From: Ikumi Keita
To: bug-gnu-emacs@gnu.org
Subject: 29.0.91; Coding system detection defect in html
Date: Mon, 22 May 2023 22:59:23 +0900
Message-ID: <82357.1684763963@localhost>

The function `sgml-html-meta-auto-coding-function' signals an error for
an HTML file with a legacy encoding specification.

0. Save the following file as /tmp/foo.html with the coding system `euc-jp':
----------------------------------------------------------------------
dummy
あいうえお
----------------------------------------------------------------------
1. emacs -Q
2. C-x C-f /tmp/foo.html RET
3. M-: (sgml-html-meta-auto-coding-function 1000) RET
4. Then emacs signals error with the following backtrace:
Debugger entered--Lisp error: (coding-system-error iso-2022)
  coding-system-plist(iso-2022)
  coding-system-equal(utf-8 iso-2022)
  sgml-html-meta-auto-coding-function(1000)
  eval((sgml-html-meta-auto-coding-function 1000) t)
  eval-expression((sgml-html-meta-auto-coding-function 1000) nil nil 127)
  funcall-interactively(eval-expression (sgml-html-meta-auto-coding-function 1000) nil nil 127)
  call-interactively(eval-expression nil nil)
  command-execute(eval-expression)

It seems that this error is due to a change in
`sgml-html-meta-auto-coding-function' introduced in Emacs 27.  When I use
the Emacs 26.1 definition of the function, it returns `euc-jp' as expected.

Regards,
Ikumi Keita
#StandWithUkraine #StopWarInUkraine


In GNU Emacs 29.0.91 (build 1, x86_64-unknown-freebsd13.2, GTK+ Version
 3.24.34, cairo version 1.17.4) of 2023-05-22 built on freebsd.vmware
Windowing system distributor 'The X.Org Foundation', version 11.0.12101007
System Description: 13.2-RELEASE

Configured features:
ACL CAIRO DBUS FREETYPE GIF GLIB GNUTLS GSETTINGS HARFBUZZ JPEG JSON
LCMS2 LIBXML2 MODULES NOTIFY KQUEUE PDUMPER PNG RSVG SOUND SQLITE3
THREADS TIFF TOOLKIT_SCROLL_BARS WEBP X11 XDBE XIM XINPUT2 XPM GTK3 ZLIB

Important settings:
  value of $EMACSLOADPATH: /home/keita/elisp:
  value of $LANG: ja_JP.UTF-8
  locale-coding-system: utf-8-unix

Major mode: HTML+

Minor modes in effect:
  tooltip-mode: t
  global-eldoc-mode: t
  show-paren-mode: t
  electric-indent-mode: t
  mouse-wheel-mode: t
  tool-bar-mode: t
  menu-bar-mode: t
  file-name-shadow-mode: t
  global-font-lock-mode: t
  font-lock-mode: t
  blink-cursor-mode: t
  line-number-mode: t
  indent-tabs-mode: t
  transient-mark-mode: t
  auto-composition-mode: t
  auto-encryption-mode: t
  auto-compression-mode: t

Load-path shadows:
/home/keita/elisp/reftex-parse hides /home/keita/scr/emacs-29.0.91/lisp/textmodes/reftex-parse

Features:
(shadow sort mail-extr emacsbug message dired dired-loaddefs rfc822 mml
mml-sec epa derived epg rfc6068 epg-config mm-decode mm-bodies mm-encode
mail-parse rfc2231 mailabbrev gmm-utils mailheader sendmail rfc2047
rfc2045 ietf-drums debug backtrace find-func cl-extra pp cl-print
help-fns radix-tree help-mode yank-media mhtml-mode css-mode smie eww
xdg url-queue thingatpt shr pixel-fill kinsoku url-file svg xml
browse-url url url-proxy url-privacy url-expand url-methods url-history
url-cookie generate-lisp-file url-domsuf url-util url-parse auth-source
eieio eieio-core cl-macs password-cache url-vars mailcap puny mm-url
gnus nnheader gnus-util text-property-search time-date mail-utils range
wid-edit mm-util mail-prsvr color js c-ts-common treesit cl-seq json
subr-x map byte-opt gv bytecomp byte-compile
imenu cc-mode cc-fonts cc-guess cc-menus cc-cmds cc-styles cc-align
cc-engine cc-vars cc-defs sgml-mode facemenu dom cl-loaddefs cl-lib
japan-util rmc iso-transl tooltip cconv eldoc paren electric uniquify
ediff-hook vc-hooks lisp-float-type elisp-mode mwheel term/x-win x-win
term/common-win x-dnd tool-bar dnd fontset image regexp-opt fringe
tabulated-list replace newcomment text-mode lisp-mode prog-mode register
page tab-bar menu-bar rfn-eshadow isearch easymenu timer select
scroll-bar mouse jit-lock font-lock syntax font-core term/tty-colors
frame minibuffer nadvice seq simple cl-generic indonesian philippine
cham georgian utf-8-lang misc-lang vietnamese tibetan thai tai-viet lao
korean japanese eucjp-ms cp51932 hebrew greek romanian slovak czech
european ethiopic indian cyrillic chinese composite emoji-zwj charscript
charprop case-table epa-hook jka-cmpr-hook help abbrev obarray oclosure
cl-preloaded button loaddefs theme-loaddefs faces cus-face macroexp
files window text-properties overlay sha1 md5 base64 format env
code-pages mule custom widget keymap hashtable-print-readable backquote
threads dbusbind kqueue lcms2 dynamic-setting system-font-setting
font-render-setting cairo move-toolbar gtk x-toolkit xinput2 x multi-tty
make-network-process emacs)

Memory information:
((conses 16 122445 10073)
 (symbols 48 12761 0)
 (strings 32 41719 1774)
 (string-bytes 1 1318626)
 (vectors 16 23622)
 (vector-slots 8 397515 14753)
 (floats 8 154 34)
 (intervals 56 350 0)
 (buffers 976 15))

From debbugs-submit-bounces@debbugs.gnu.org Mon May 22 12:03:55 2023
From: Eli Zaretskii
To: Ikumi Keita
Cc: 63644@debbugs.gnu.org
Subject: Re: bug#63644: 29.0.91; Coding system detection defect in html
Date: Mon, 22 May 2023 19:04:02 +0300
Message-Id: <83mt1wpd3h.fsf@gnu.org>
In-Reply-To: <82357.1684763963@localhost>

> From: Ikumi Keita
> Date: Mon, 22 May 2023 22:59:23 +0900
>
> 0. Save the following file as /tmp/foo.html with the coding system `euc-jp':
> ----------------------------------------------------------------------
>
>
>
>
> dummy
>
>
> あいうえお
>
> ----------------------------------------------------------------------
> 1. emacs -Q
> 2. C-x C-f /tmp/foo.html RET
> 3. M-: (sgml-html-meta-auto-coding-function 1000) RET
> 4. Then emacs signals error with the following backtrace:
> Debugger entered--Lisp error: (coding-system-error iso-2022)
>   coding-system-plist(iso-2022)
>   coding-system-equal(utf-8 iso-2022)
>   sgml-html-meta-auto-coding-function(1000)
>   eval((sgml-html-meta-auto-coding-function 1000) t)
>   eval-expression((sgml-html-meta-auto-coding-function 1000) nil nil 127)
>   funcall-interactively(eval-expression (sgml-html-meta-auto-coding-function 1000) nil nil 127)
>   call-interactively(eval-expression nil nil)
>   command-execute(eval-expression)

Thanks.  Does the patch below give good results?

diff --git a/lisp/international/mule.el b/lisp/international/mule.el
index 25b90b4..2b44a2e 100644
--- a/lisp/international/mule.el
+++ b/lisp/international/mule.el
@@ -2484,10 +2484,12 @@ sgml-xml-auto-coding-function
                         ;; called as part of visiting a file, as opposed
                         ;; to when saving a buffer to a file.
                         (if (and enable-multibyte-characters
-                                 ;; 'charset' will signal an error in
-                                 ;; coding-system-equal, since it isn't a
-                                 ;; coding-system.  So test that up front.
+                                 ;; 'charset' and 'iso-2022' will signal
+                                 ;; an error in coding-system-equal, since
+                                 ;; they aren't coding-systems.  So test
+                                 ;; that up front.
                                  (not (equal sym-type 'charset))
+                                 (not (equal sym-type 'iso-2022))
                                  (coding-system-equal 'utf-8 sym-type)
                                  (coding-system-equal 'utf-8 bfcs-type))
                             buffer-file-coding-system
@@ -2540,11 +2542,13 @@ sgml-html-meta-auto-coding-function
              (bfcs-type
               (coding-system-type buffer-file-coding-system)))
         (if (and enable-multibyte-characters
-                 ;; 'charset' will signal an error in
-                 ;; coding-system-equal, since it isn't a
-                 ;; coding-system.  So test that up front.
+                 ;; 'charset' and 'iso-2022' will signal an error
+                 ;; in coding-system-equal, since they aren't
+                 ;; coding-systems.  So test that up front.
                  (not (equal sym-type 'charset))
                  (not (equal bfcs-type 'charset))
+                 (not (equal sym-type 'iso-2022))
+                 (not (equal bfcs-type 'iso-2022))
                  (coding-system-equal 'utf-8 sym-type)
                  (coding-system-equal 'utf-8 bfcs-type))
             buffer-file-coding-system

From debbugs-submit-bounces@debbugs.gnu.org Mon May 22 12:42:50 2023
From: Ikumi Keita
To: Eli Zaretskii
Subject: Re: bug#63644: 29.0.91; Coding system detection defect in html
In-reply-to: <83mt1wpd3h.fsf@gnu.org>
Cc: 63644@debbugs.gnu.org
Date: Tue, 23 May 2023 01:42:39 +0900
Message-ID: <83656.1684773759@localhost>

>>>>> Eli Zaretskii writes:
> Thanks.  Does the patch below give good results?

Yes.  It returns `euc-jp' as expected.

Regards,
Ikumi Keita
#StandWithUkraine #StopWarInUkraine

From debbugs-submit-bounces@debbugs.gnu.org Mon May 22 14:25:08 2023
From: Eli Zaretskii
To: Ikumi Keita
Cc: 63644-done@debbugs.gnu.org
Subject: Re: bug#63644: 29.0.91; Coding system detection defect in html
Date: Mon, 22 May 2023 21:25:20 +0300
Message-Id: <83jzx0p6jz.fsf@gnu.org>
In-Reply-To: <83656.1684773759@localhost>

> From: Ikumi Keita
> cc: 63644@debbugs.gnu.org
> Comments: In-reply-to Eli Zaretskii
>     message dated "Mon, 22 May 2023 19:04:02 +0300."
> Date: Tue, 23 May 2023 01:42:39 +0900
>
> >>>>> Eli Zaretskii writes:
> > Thanks.  Does the patch below give good results?
>
> Yes.  It returns `euc-jp' as expected.

Thanks, installed on the emacs-29 branch, and closing the bug.

From unknown Sun Jun 22 03:57:11 2025
To: internal_control@debbugs.gnu.org
From: Debbugs Internal Request
Subject: Internal Control
Message-Id: bug archived.
Date: Tue, 20 Jun 2023 11:24:07 +0000

# This is a fake control message.
#
# The action:
# bug archived.
thanks
# This fakemail brought to you by your local debbugs
# administrator
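
For readers following the thread, the root cause can probably be
reproduced in isolation with a few forms evaluated in *scratch* or via
M-:.  The sketch below is illustrative only.  It assumes, as the
backtrace suggests, that `coding-system-type' returns the type symbol
`iso-2022' for `euc-jp', and that this symbol is not itself a coding
system.  The helper `example-utf-8-type-p' is a hypothetical name that
does not exist in Emacs; it shows one possible guard based on
`coding-system-p', which is not what the installed patch does (the
patch adds explicit `iso-2022' tests next to the `charset' ones).

```elisp
;;; Sketch only.  `example-utf-8-type-p' is a hypothetical helper and
;;; is not part of Emacs or of the patch installed for this bug.

;; `coding-system-type' returns a type symbol, and not every type
;; symbol is also the name of a coding system.
(coding-system-type 'euc-jp)        ; => iso-2022
(coding-system-p 'iso-2022)         ; => nil

;; Passing such a symbol to `coding-system-equal' signals the error
;; shown in the backtrace; here it is caught and its symbol returned.
(condition-case err
    (coding-system-equal 'utf-8 'iso-2022)
  (coding-system-error (car err)))  ; => coding-system-error

(defun example-utf-8-type-p (type)
  "Return non-nil if TYPE names a coding system equal to `utf-8'.
TYPE is a symbol as returned by `coding-system-type'.  Symbols that
are not coding systems, such as `charset' or `iso-2022', yield nil
instead of signaling an error."
  (and (coding-system-p type)
       (coding-system-equal 'utf-8 type)))

(example-utf-8-type-p 'iso-2022)    ; => nil
(example-utf-8-type-p 'charset)     ; => nil
(example-utf-8-type-p 'utf-8)       ; => t
```

A guard of this shape would also cover any other type symbol that is
not a coding system, at the cost of being less explicit than the
per-symbol tests in the patch above.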