From unknown Wed Jun 18 00:05:51 2025
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
From: bug#9608 <9608@debbugs.gnu.org>
To: bug#9608 <9608@debbugs.gnu.org>
Subject: Status: 24.0.50; Emacs lisp reader thinks no-break space is 0x08a0 (should be 0x00a0)
Reply-To: bug#9608 <9608@debbugs.gnu.org>
Date: Wed, 18 Jun 2025 07:05:51 +0000

retitle 9608 24.0.50; Emacs lisp reader thinks no-break space is 0x08a0 (should be 0x00a0)
reassign 9608 emacs
submitter 9608 "David M. Cooke"
severity 9608 normal

thanks

From debbugs-submit-bounces@debbugs.gnu.org Mon Sep 26 20:02:27 2011
From: "David M. Cooke"
To: bug-gnu-emacs@gnu.org
Subject: 24.0.50; Emacs lisp reader thinks no-break space is 0x08a0 (should be 0x00a0)
Date: Mon, 26 Sep 2011 17:00:34 -0700
Message-Id: <456C995A-EC64-43C6-A96E-FBF6004D9EDD@gmail.com>
Content-Type: text/plain; charset=us-ascii
X-Mailer: Apple Mail (2.1244.3)

After reading through lread.c (I was writing an emacs lisp lexer for
syntax-highlighting in pygments), I discovered it treats the unicode
character U+08A0 as whitespace (with the comment "NBSP"). I believe this
was meant to be U+00A0 (NO-BREAK SPACE), as the code point U+08A0 has no
character assigned to it yet (it lies between the Samaritan and the
Devanagari blocks).

Additionally, you can see this by running the following lisp code:

  (mapcar (lambda (sym) (string-as-unibyte (symbol-name sym)))
          (read "(a b c\u00a0d e\u08a0f g \u00a0 h i \u08a0 j)"))

This gives the result

  ("a" "b" "c\302\240d" "e" "f" "g" "\302\240" "h" "i" "j")

where we can see U+00A0 (utf-8: "\302\240") is being treated as a
symbol-constituent character, whereas U+08A0 is whitespace.

The changes to the whitespace handling were introduced in bzr revision
78902 (on 2007-07-30, which is a few weeks after a discussion about
handling NO-BREAK SPACE on the mailing list). I'm guessing using 0x8a0
was just a thinko.

cheers,
David M. Cooke

If Emacs crashed, and you have the Emacs process in the gdb debugger,
please include the output from the following gdb commands:
    `bt full' and `xbacktrace'.
For information about debugging Emacs, please read the file
/Applications/_Editors/Emacs.app/Contents/Resources/etc/DEBUG.

In GNU Emacs 24.0.50.2 (x86_64-apple-darwin10.7.0, NS apple-appkit-1038.35)
 of 2011-05-27 on mars.lan
Windowing system distributor `Apple', version 10.3.1138
configured using `configure '--with-ns''

Important settings:
  value of $LC_ALL: nil
  value of $LC_COLLATE: nil
  value of $LC_CTYPE: nil
  value of $LC_MESSAGES: nil
  value of $LC_MONETARY: nil
  value of $LC_NUMERIC: nil
  value of $LC_TIME: nil
  value of $LANG: en_CA.UTF-8
  value of $XMODIFIERS: nil
  locale-coding-system: utf-8-unix
  default enable-multibyte-characters: t

Major mode: Lisp Interaction

Minor modes in effect:
  tooltip-mode: t
  mouse-wheel-mode: t
  tool-bar-mode: t
  menu-bar-mode: t
  file-name-shadow-mode: t
  global-font-lock-mode: t
  font-lock-mode: t
  blink-cursor-mode: t
  auto-composition-mode: t
  auto-encryption-mode: t
  auto-compression-mode: t
  line-number-mode: t
  transient-mark-mode: t

Recent input:
( s m a p c a r SPC ' s y m b o l - n a m e SPC ( e v a l SPC " ( a SPC
b SPC c \ u 0 0 a 0 d SPC e \ u 0 8 a 0 d f ) " ) ) C-j q # C-j q ' C-e
C-j q " r e a d C-e C-j SPC g SPC \ u 0 0 a 0 SPC h SPC i SPC \ u 0 8 a 0
SPC j C-e C-j x r e m p o r p o r

Recent messages:
For information about GNU Emacs and the GNU system, type C-h C-a.
Entering debugger...
Back to top level.
Entering debugger...
Back to top level.
Entering debugger...
Back to top level.

Load-path shadows:
None found.

Features:
(shadow sort gnus-util time-date mail-extr message format-spec rfc822
mml mml-sec mm-decode mm-bodies mm-encode mail-parse rfc2231 rfc2047
rfc2045 ietf-drums mm-util mail-prsvr mailabbrev mail-utils gmm-utils
mailheader emacsbug help-mode easymenu view debug tooltip ediff-hook
vc-hooks lisp-float-type mwheel ns-win tool-bar dnd fontset image fringe
lisp-mode register page menu-bar rfn-eshadow timer select scroll-bar
mouse jit-lock font-lock syntax facemenu font-core frame cham georgian
utf-8-lang misc-lang vietnamese tibetan thai tai-viet lao korean
japanese hebrew greek romanian slovak czech european ethiopic indian
cyrillic chinese case-table epa-hook jka-cmpr-hook help simple abbrev
minibuffer loaddefs button faces cus-face files text-properties overlay
sha1 md5 base64 format env code-pages mule custom widget
hashtable-print-readable backquote make-network-process dbusbind ns
multi-tty emacs)

From debbugs-submit-bounces@debbugs.gnu.org Tue Sep 27 04:45:27 2011
From: Andreas Schwab
To: "David M. Cooke"
Cc: 9608-done@debbugs.gnu.org
Subject: Re: bug#9608: 24.0.50; Emacs lisp reader thinks no-break space is 0x08a0 (should be 0x00a0)
References: <456C995A-EC64-43C6-A96E-FBF6004D9EDD@gmail.com>
In-Reply-To: <456C995A-EC64-43C6-A96E-FBF6004D9EDD@gmail.com>
 (David M. Cooke's message of "Mon, 26 Sep 2011 17:00:34 -0700")
Date: Tue, 27 Sep 2011 10:44:26 +0200
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.3 (gnu/linux)
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii

"David M. Cooke" writes:

> The changes to the whitespace handling were introduced in bzr revision
> 78902 (on 2007-07-30, which is a few weeks after a discussion about
> handling NO-BREAK SPACE on the mailing list).

That was before the unicode merge.

> I'm guessing using 0x8a0 was just a thinko.

No, it was the correct number at that time, when Emacs used the mule
encoding internally.

Andreas.

-- 
Andreas Schwab, schwab@linux-m68k.org
GPG Key fingerprint = 58CA 54C7 6D53 942B 1756 01D3 44D5 214B 8276 4ED5
"And now for something completely different."

From unknown Wed Jun 18 00:05:51 2025
Received: (at fakecontrol) by fakecontrolmessage;
To: internal_control@debbugs.gnu.org
From: Debbugs Internal Request
Subject: Internal Control
Message-Id: bug archived.
Date: Tue, 25 Oct 2011 11:24:03 +0000
User-Agent: Fakemail v42.6.9

# This is a fake control message.
#
# The action:
# bug archived.
thanks
# This fakemail brought to you by your local debbugs
# administrator
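
The two numbers discussed in this thread can be checked with a little
arithmetic. The sketch below is only an illustration, not code from
Emacs. It assumes the pre-Unicode emacs-mule layout, in which a
character of a one-dimensional charset with id CS and position code P
had the internal code ((CS - 0x70) << 7) | (P & 0x7F), and it assumes
that latin-iso8859-1 had charset id 0x81. Neither constant is stated in
the thread; both are recalled from the old sources and should be
treated as assumptions. The names mule_char_code and octal_escapes are
hypothetical helpers made up for this example.

```python
# Assumed charset id of latin-iso8859-1 in the pre-Unicode emacs-mule
# encoding (an assumption, not taken from this thread).
LATIN_ISO8859_1 = 0x81


def mule_char_code(charset_id: int, position_code: int) -> int:
    """Internal character code under the assumed emacs-mule formula
    for one-dimensional charsets: ((id - 0x70) << 7) | (pos & 0x7F)."""
    return ((charset_id - 0x70) << 7) | (position_code & 0x7F)


def octal_escapes(data: bytes) -> str:
    """Render bytes the way Emacs prints unibyte strings, e.g. \\302\\240."""
    return "".join(f"\\{byte:03o}" for byte in data)


# NO-BREAK SPACE is byte 0xA0 in ISO 8859-1 and U+00A0 in Unicode.
nbsp_mule = mule_char_code(LATIN_ISO8859_1, 0xA0)
nbsp_utf8 = "\u00a0".encode("utf-8")

print(hex(nbsp_mule))            # 0x8a0
print(octal_escapes(nbsp_utf8))  # \302\240
```

Under those assumptions the old internal code for NO-BREAK SPACE comes
out as 0x8a0, which would match Andreas's remark that the constant was
correct before the unicode merge, while the UTF-8 bytes of U+00A0 match
the "\302\240" seen in the reported result.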