From unknown Sun Jun 22 07:43:35 2025 X-Loop: help-debbugs@gnu.org Subject: bug#6971: 24.0.50.1: non-ascii chars appear as numbers Resent-From: Andreas =?UTF-8?Q?R=C3=B6hler?= Original-Sender: debbugs-submit-bounces@debbugs.gnu.org Resent-To: owner@debbugs.gnu.org Resent-CC: bug-gnu-emacs@gnu.org Resent-Date: Thu, 02 Sep 2010 10:15:05 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: report 6971 X-GNU-PR-Package: emacs X-GNU-PR-Keywords: To: 6971@debbugs.gnu.org X-Debbugs-Original-To: bug-gnu-emacs@gnu.org Received: via spool by submit@debbugs.gnu.org id=B.12834224795901 (code B ref -1); Thu, 02 Sep 2010 10:15:05 +0000 Received: (at submit) by debbugs.gnu.org; 2 Sep 2010 10:14:39 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Or6oZ-0001X8-7i for submit@debbugs.gnu.org; Thu, 02 Sep 2010 06:14:39 -0400 Received: from mail.gnu.org ([199.232.76.166] helo=mx10.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Or6oW-0001X1-TU for submit@debbugs.gnu.org; Thu, 02 Sep 2010 06:14:37 -0400 Received: from lists.gnu.org ([199.232.76.165]:57564) by monty-python.gnu.org with esmtps (TLS-1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.60) (envelope-from ) id 1Or6q9-0001xa-BV for submit@debbugs.gnu.org; Thu, 02 Sep 2010 06:16:17 -0400 Received: from [140.186.70.92] (port=58944 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Or6q7-0007ID-IY for bug-gnu-emacs@gnu.org; Thu, 02 Sep 2010 06:16:16 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,RCVD_IN_DNSWL_NONE autolearn=unavailable version=3.3.1 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.69) (envelope-from ) id 1Or6q2-00047Y-AY for bug-gnu-emacs@gnu.org; Thu, 02 Sep 2010 06:16:15 -0400 Received: from moutng.kundenserver.de ([212.227.17.8]:55635) by eggs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Or6q1-00046x-VR for bug-gnu-emacs@gnu.org; Thu, 02 Sep 2010 06:16:10 -0400 Received: from [192.168.178.27] (brln-4db9c3cd.pool.mediaWays.net [77.185.195.205]) by mrelayeu.kundenserver.de (node=mrbap2) with ESMTP (Nemesis) id 0M7VWZ-1P2ROL3aYZ-00wcke; Thu, 02 Sep 2010 12:16:07 +0200 Message-ID: <4C7F7946.9040502@easy-emacs.de> Date: Thu, 02 Sep 2010 12:15:34 +0200 From: Andreas =?UTF-8?Q?R=C3=B6hler?= User-Agent: Mozilla/5.0 (X11; U; Linux i686; de; rv:1.9.1.11) Gecko/20100711 Thunderbird/3.0.6 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Provags-ID: V02:K0:vpyZnrlOakevzmTVKWdAdsGEs3Vb4thKim0WUkN/fVS N94jSGPGyYZpNYbDixLpDnQCNpcdSIzefBiGyhfJhfG2vVCkQK iBnOT87OdD0ACpTkja/pFHGUpl5rLBeiWXd6LYvbwSLVB1mezF vLCQquaMeCql+CWSBdlKGrMIgt///tH9rjwsyP/FOucZWgw2dr o6QLMxP6gaSjxWqLMCPCjg/r/UvBGK/itslxUwm5nI= X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-detected-operating-system: by monty-python.gnu.org: GNU/Linux 2.6, seldom 2.4 (older, 4) X-Spam-Score: -4.1 (----) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -5.0 (-----) Hi, encounter an encoding bug, which I've seen years ago and reported already, but didn't occur with Emacs 23: when opening a file containing non-ascii chars, german umlauts etc., these aren't shown as glyphs but as numbers. (define-abbrev-table 'global-abbrev-table '(("Infinity" "∞" nil 0) ("alpha" "α" nil 2) ("beta" "β" nil 1) ("gamma" "γ" nil 1) ("theta" "θ" nil 0))) I see ("alpha" "\316\261" nil 2) for example. May send a screenshot if useful. Curious: if the chars-as-numbers code is pasted here in this mail, glyphs are displayed correctly. As the only thing I remember is editing the file with GNU Emacs 24.0.50.1 (i686-pc-linux-gnu, GTK+ Version 2.12.0) of 2010-08-28 assume the bug comes from there. Sorry, not being able to truck down further the issue. Andreas -- https://code.launchpad.net/~a-roehler/python-mode https://code.launchpad.net/s-x-emacs-werkstatt/ From unknown Sun Jun 22 07:43:35 2025 MIME-Version: 1.0 X-Mailer: MIME-tools 5.427 (Entity 5.427) X-Loop: help-debbugs@gnu.org From: help-debbugs@gnu.org (GNU bug Tracking System) To: Andreas =?UTF-8?Q?R=C3=B6hler?= Subject: bug#6971: closed (Re: bug#6974: Emacs doesn't like Swedish =?UTF-8?Q?=C3=A4?= (on w32)) Message-ID: References: <838w3hj53z.fsf@gnu.org> <4C7F7946.9040502@easy-emacs.de> X-Gnu-PR-Message: they-closed 6971 X-Gnu-PR-Package: emacs Reply-To: 6971@debbugs.gnu.org Date: Sat, 04 Sep 2010 08:13:02 +0000 Content-Type: multipart/mixed; boundary="----------=_1283587982-22423-1" This is a multi-part message in MIME format... ------------=_1283587982-22423-1 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset="utf-8" Your bug report #6971: 24.0.50.1: non-ascii chars appear as numbers which was filed against the emacs package, has been closed. The explanation is attached below, along with your original report. If you require more details, please reply to 6971@debbugs.gnu.org. --=20 6971: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=3D6971 GNU Bug Tracking System Contact help-debbugs@gnu.org with problems ------------=_1283587982-22423-1 Content-Type: message/rfc822 Content-Disposition: inline Content-Transfer-Encoding: 7bit Received: (at 6971-done) by debbugs.gnu.org; 4 Sep 2010 08:12:55 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Ornrr-0005pT-Ds for submit@debbugs.gnu.org; Sat, 04 Sep 2010 04:12:55 -0400 Received: from mtaout20.012.net.il ([80.179.55.166]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Ornrq-0005pO-1T for 6971-done@debbugs.gnu.org; Sat, 04 Sep 2010 04:12:54 -0400 Received: from conversion-daemon.a-mtaout20.012.net.il by a-mtaout20.012.net.il (HyperSendmail v2007.08) id <0L8700E00QTTRT00@a-mtaout20.012.net.il> for 6971-done@debbugs.gnu.org; Sat, 04 Sep 2010 11:14:30 +0300 (IDT) Received: from HOME-C4E4A596F7 ([77.126.202.93]) by a-mtaout20.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0L8700EF0QW5RW00@a-mtaout20.012.net.il>; Sat, 04 Sep 2010 11:14:30 +0300 (IDT) Date: Sat, 04 Sep 2010 11:16:32 +0300 From: Eli Zaretskii Subject: Re: bug#6974: Emacs doesn't like Swedish =?UTF-8?B?w6QgKG9uIHczMg==?= =?UTF-8?B?KQ==?= In-reply-to: <4C81F5AF.6010001@easy-emacs.de> To: =?UTF-8?B?QW5kcmVhcyBSw7ZobGVy?= Message-id: <838w3hj53z.fsf@gnu.org> MIME-version: 1.0 Content-type: text/plain; charset=utf-8 Content-transfer-encoding: QUOTED-PRINTABLE X-012-Sender: halo1@inter.net.il References: <4C80ABD4.8090106@easy-emacs.de> <83tym7i4jj.fsf@gnu.org> <4C80C3D8.5020308@easy-emacs.de> <83mxrzhr86.fsf@gnu.org> <4C812105.2020200@easy-emacs.de> <83iq2miu77.fsf@gnu.org> <4C814D80.3050703@easy-emacs.de> <83fwxqillu.fsf@gnu.org> <4C81E5BB.2050306@easy-emacs.de> <83eidahuuo.fsf@gnu.org> <4C81F5AF.6010001@easy-emacs.de> X-Spam-Score: -2.1 (--) X-Debbugs-Envelope-To: 6971-done Cc: 6971-done@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list Reply-To: Eli Zaretskii List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -2.1 (--) > Date: Sat, 04 Sep 2010 09:30:55 +0200 > From: Andreas R=C3=B6hler > CC: bug-gnu-emacs@gnu.org >=20 > > Please post the file as an attachment. > > >=20 > Attached. Thanks. Here's your culprit: > > \240 (autoload 'muse-mode "muse-mode" "" t) You have literal \240 characters in the file, which are invalid UTF-8 sequences. This file has also other similar problems, like this one: Du kannst es nat\365\202\211\205\365\200\210\246\357\275\357\275\27= 4rlich auch unter Linux ausprobieren, z.B.: I believe the 4th word should have been "nat=C5=B1rlich", and the inv= alid long byte sequence instead of =C5=B1 (which Emacs decodes into some Japanese Kanji character that cannot be encoded by UTF-8) is the result of multiple saving of this file with incorrect encoding. To fix all this corruption, I suggest the following steps: 1) C-x RET c utf-8 RET C-x C-f befehle.txt RET 2) M-: (unencodable-char-position (point) (point-max) 'utf-8) RET 3) Go to the position shown by the previous command, and edit the file to replace invalid bytes with valid characters. 4) Move point past the corrected portion. 5) Go back to 2. When unencodable-char-position returns nil, you are done; save the file. I'm closing bug #6971 with this message, since there's no Emacs bug here. ------------=_1283587982-22423-1 Content-Type: message/rfc822 Content-Disposition: inline Content-Transfer-Encoding: 7bit Received: (at submit) by debbugs.gnu.org; 2 Sep 2010 10:14:39 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Or6oZ-0001X8-7i for submit@debbugs.gnu.org; Thu, 02 Sep 2010 06:14:39 -0400 Received: from mail.gnu.org ([199.232.76.166] helo=mx10.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Or6oW-0001X1-TU for submit@debbugs.gnu.org; Thu, 02 Sep 2010 06:14:37 -0400 Received: from lists.gnu.org ([199.232.76.165]:57564) by monty-python.gnu.org with esmtps (TLS-1.0:DHE_RSA_AES_256_CBC_SHA1:32) (Exim 4.60) (envelope-from ) id 1Or6q9-0001xa-BV for submit@debbugs.gnu.org; Thu, 02 Sep 2010 06:16:17 -0400 Received: from [140.186.70.92] (port=58944 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Or6q7-0007ID-IY for bug-gnu-emacs@gnu.org; Thu, 02 Sep 2010 06:16:16 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,RCVD_IN_DNSWL_NONE autolearn=unavailable version=3.3.1 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.69) (envelope-from ) id 1Or6q2-00047Y-AY for bug-gnu-emacs@gnu.org; Thu, 02 Sep 2010 06:16:15 -0400 Received: from moutng.kundenserver.de ([212.227.17.8]:55635) by eggs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Or6q1-00046x-VR for bug-gnu-emacs@gnu.org; Thu, 02 Sep 2010 06:16:10 -0400 Received: from [192.168.178.27] (brln-4db9c3cd.pool.mediaWays.net [77.185.195.205]) by mrelayeu.kundenserver.de (node=mrbap2) with ESMTP (Nemesis) id 0M7VWZ-1P2ROL3aYZ-00wcke; Thu, 02 Sep 2010 12:16:07 +0200 Message-ID: <4C7F7946.9040502@easy-emacs.de> Date: Thu, 02 Sep 2010 12:15:34 +0200 From: =?UTF-8?B?QW5kcmVhcyBSw7ZobGVy?= User-Agent: Mozilla/5.0 (X11; U; Linux i686; de; rv:1.9.1.11) Gecko/20100711 Thunderbird/3.0.6 MIME-Version: 1.0 To: bug-gnu-emacs@gnu.org Subject: 24.0.50.1: non-ascii chars appear as numbers Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Provags-ID: V02:K0:vpyZnrlOakevzmTVKWdAdsGEs3Vb4thKim0WUkN/fVS N94jSGPGyYZpNYbDixLpDnQCNpcdSIzefBiGyhfJhfG2vVCkQK iBnOT87OdD0ACpTkja/pFHGUpl5rLBeiWXd6LYvbwSLVB1mezF vLCQquaMeCql+CWSBdlKGrMIgt///tH9rjwsyP/FOucZWgw2dr o6QLMxP6gaSjxWqLMCPCjg/r/UvBGK/itslxUwm5nI= X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-detected-operating-system: by monty-python.gnu.org: GNU/Linux 2.6, seldom 2.4 (older, 4) X-Spam-Score: -4.1 (----) X-Debbugs-Envelope-To: submit X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -5.0 (-----) Hi, encounter an encoding bug, which I've seen years ago and reported already, but didn't occur with Emacs 23: when opening a file containing non-ascii chars, german umlauts etc., these aren't shown as glyphs but as numbers. (define-abbrev-table 'global-abbrev-table '(("Infinity" "∞" nil 0) ("alpha" "α" nil 2) ("beta" "β" nil 1) ("gamma" "γ" nil 1) ("theta" "θ" nil 0))) I see ("alpha" "\316\261" nil 2) for example. May send a screenshot if useful. Curious: if the chars-as-numbers code is pasted here in this mail, glyphs are displayed correctly. As the only thing I remember is editing the file with GNU Emacs 24.0.50.1 (i686-pc-linux-gnu, GTK+ Version 2.12.0) of 2010-08-28 assume the bug comes from there. Sorry, not being able to truck down further the issue. Andreas -- https://code.launchpad.net/~a-roehler/python-mode https://code.launchpad.net/s-x-emacs-werkstatt/ ------------=_1283587982-22423-1--