From unknown Tue Aug 19 08:37:12 2025 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-Mailer: MIME-tools 5.509 (Entity 5.509) Content-Type: text/plain; charset=utf-8 From: bug#8308 <8308@debbugs.gnu.org> To: bug#8308 <8308@debbugs.gnu.org> Subject: Status: 23.3; Use utf-8 for writing abbrev file Reply-To: bug#8308 <8308@debbugs.gnu.org> Date: Tue, 19 Aug 2025 15:37:12 +0000 retitle 8308 23.3; Use utf-8 for writing abbrev file reassign 8308 emacs submitter 8308 Leo severity 8308 minor thanks From debbugs-submit-bounces@debbugs.gnu.org Mon Mar 21 02:22:52 2011 Received: (at submit) by debbugs.gnu.org; 21 Mar 2011 06:22:53 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1YVv-0002qY-Q7 for submit@debbugs.gnu.org; Mon, 21 Mar 2011 02:22:52 -0400 Received: from eggs.gnu.org ([140.186.70.92]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1YVt-0002qM-JY for submit@debbugs.gnu.org; Mon, 21 Mar 2011 02:22:50 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1YVn-0000f4-Mq for submit@debbugs.gnu.org; Mon, 21 Mar 2011 02:22:44 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-2.6 required=5.0 tests=BAYES_00,FREEMAIL_FROM, RCVD_IN_DNSWL_LOW, T_DKIM_INVALID, T_TO_NO_BRKTS_FREEMAIL autolearn=unavailable version=3.3.1 Received: from lists.gnu.org ([199.232.76.165]:49087) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1YVn-0000f0-KU for submit@debbugs.gnu.org; Mon, 21 Mar 2011 02:22:43 -0400 Received: from [140.186.70.92] (port=42457 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Q1YVl-0005i0-4v for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 02:22:43 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1YVk-0000dk-26 for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 02:22:41 -0400 Received: from mail-iw0-f169.google.com ([209.85.214.169]:60929) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1YVj-0000dP-TW for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 02:22:40 -0400 Received: by iwl42 with SMTP id 42so7686420iwl.0 for ; Sun, 20 Mar 2011 23:22:38 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:from:to:subject:x-debbugs-cc:date:message-id :mime-version:content-type; bh=fvwcCt2IPtoqRqJAxaCOfS3bBv+RTDvgvbb7QQWYWhE=; b=krd3FCBtzBRQE8fHJMxRyZPki2MBbrAx4O9fwj1XGAjpIQ+UBzGdlsqzIJw+zLqtGq E3YR4+II5+1O4VevYWACifmDVVch12MrbJ609KELM589pxWI0imEAvbJmihvFxFPgYwX 1ULLKZCIQUcHbe5ZRo61iw+smyk5alHg9H/3E= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=from:to:subject:x-debbugs-cc:date:message-id:mime-version :content-type; b=V6yx8gL9wwhIzBmacIPaw1qUH2DQH2u0lzNaDst/ugywhRteHKXF3zhs5v/yCmnvro oxw6Avk26eLtgT6RRjcW4f6pbIigOfbNdbW6iTMhPMn9oIigctJyZDgy+RFWNeJVnH7b S1IsI4u02Vvem4hSTUDIgTH/uwDRyuPStqlH4= Received: by 10.42.38.71 with SMTP id b7mr6075796ice.438.1300688558221; Sun, 20 Mar 2011 23:22:38 -0700 (PDT) Received: from localhost ([114.249.16.245]) by mx.google.com with ESMTPS id d10sm3492343ibb.51.2011.03.20.23.22.31 (version=TLSv1/SSLv3 cipher=OTHER); Sun, 20 Mar 2011 23:22:34 -0700 (PDT) From: Leo To: bug-gnu-emacs@gnu.org Subject: 23.3; Use utf-8 for writing abbrev file X-Debbugs-CC: Stefan Monnier Date: Mon, 21 Mar 2011 14:22:24 +0800 Message-ID: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-Received-From: 199.232.76.165 X-Spam-Score: -5.9 (-----) X-Debbugs-Envelope-To: submit X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -5.9 (-----) Is it OK to change the encoding for abbrev file to utf-8? === modified file 'lisp/abbrev.el' --- a/lisp/abbrev.el 2011-03-21 05:49:12 +0000 +++ b/lisp/abbrev.el 2011-03-21 06:20:36 +0000 @@ -225,9 +225,9 @@ abbrev-file-name))) (or (and file (> (length file) 0)) (setq file abbrev-file-name)) - (let ((coding-system-for-write 'emacs-mule)) + (let ((coding-system-for-write 'utf-8)) (with-temp-file file - (insert ";;-*-coding: emacs-mule;-*-\n") + (insert ";;-*-coding: utf-8;-*-\n") (dolist (table ;; We sort the table in order to ease the automatic ;; merging of different versions of the user's abbrevs Leo From debbugs-submit-bounces@debbugs.gnu.org Mon Mar 21 05:00:59 2011 Received: (at 8308) by debbugs.gnu.org; 21 Mar 2011 09:00:59 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1ayw-00075S-FJ for submit@debbugs.gnu.org; Mon, 21 Mar 2011 05:00:58 -0400 Received: from fencepost.gnu.org ([140.186.70.10]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1ayu-00075F-El for 8308@debbugs.gnu.org; Mon, 21 Mar 2011 05:00:56 -0400 Received: from eliz by fencepost.gnu.org with local (Exim 4.71) (envelope-from ) id 1Q1ayp-0004r6-4i; Mon, 21 Mar 2011 05:00:51 -0400 Date: Mon, 21 Mar 2011 05:00:51 -0400 Message-Id: From: Eli Zaretskii To: Leo In-reply-to: (message from Leo on Mon, 21 Mar 2011 14:22:24 +0800) Subject: Re: bug#8308: 23.3; Use utf-8 for writing abbrev file References: X-Spam-Score: -6.5 (------) X-Debbugs-Envelope-To: 8308 Cc: 8308@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list Reply-To: Eli Zaretskii List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -6.5 (------) > From: Leo > Date: Mon, 21 Mar 2011 14:22:24 +0800 > Cc: > > Is it OK to change the encoding for abbrev file to utf-8? What will that do to characters that are not unified into the range of valid Unicode code points? Can you tell what is the purpose of this change? From debbugs-submit-bounces@debbugs.gnu.org Mon Mar 21 06:01:49 2011 Received: (at submit) by debbugs.gnu.org; 21 Mar 2011 10:01:49 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1bvo-0008P3-TU for submit@debbugs.gnu.org; Mon, 21 Mar 2011 06:01:49 -0400 Received: from eggs.gnu.org ([140.186.70.92]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1bvl-0008Oq-Pv for submit@debbugs.gnu.org; Mon, 21 Mar 2011 06:01:46 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1bvf-0005VP-GF for submit@debbugs.gnu.org; Mon, 21 Mar 2011 06:01:40 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-0.7 required=5.0 tests=BAYES_00,FREEMAIL_FROM, RCVD_NUMERIC_HELO, T_RP_MATCHES_RCVD, T_TO_NO_BRKTS_FREEMAIL autolearn=no version=3.3.1 Received: from lists.gnu.org ([199.232.76.165]:59608) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1bvf-0005VJ-Ag for submit@debbugs.gnu.org; Mon, 21 Mar 2011 06:01:39 -0400 Received: from [140.186.70.92] (port=58264 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Q1bvd-0003uR-JN for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 06:01:38 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1bvb-0005UU-OO for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 06:01:36 -0400 Received: from lo.gmane.org ([80.91.229.12]:40488) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1bvb-0005Tb-Dx for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 06:01:35 -0400 Received: from list by lo.gmane.org with local (Exim 4.69) (envelope-from ) id 1Q1bvZ-0001vv-53 for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 11:01:33 +0100 Received: from 114.249.16.245 ([114.249.16.245]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 21 Mar 2011 11:01:33 +0100 Received: from sdl.web by 114.249.16.245 with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 21 Mar 2011 11:01:33 +0100 X-Injected-Via-Gmane: http://gmane.org/ To: bug-gnu-emacs@gnu.org From: Leo Subject: Re: bug#8308: 23.3; Use utf-8 for writing abbrev file Date: Mon, 21 Mar 2011 18:01:17 +0800 Lines: 40 Message-ID: References: Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Complaints-To: usenet@dough.gmane.org X-Gmane-NNTP-Posting-Host: 114.249.16.245 Face: iVBORw0KGgoAAAANSUhEUgAAACgAAAAoBAMAAAB+0KVeAAAAGFBMVEUzRVhbQj4eZqO6SjnT eWpxnMetm5b6/PmidmqrAAAAAWJLR0QAiAUdSAAAAAlwSFlzAAALEwAACxMBAJqcGAAAAAd0SU1F B9cBBwMLBfKABCMAAAFoSURBVCjPtZI9a8MwEIaFoc7aYDdelQMna0Em3tsSr0XUeE2Q6a22a+v+ fk8fSSBkbDUI6dHpfe9OEvRgiD+ApqKPJgJeB6iUUXWESjUe/ig38AJrhqqvaU2nTIXbNvOQ40fe qdry4kyGoVWsfCQalXpHnJGM01wjWdYbMlXNFdsZDO69m9aqNqxEJqTEgbM5OF7wlEfIoll1Ked4 LbM5X2EdILLokEdmI8z7g5cKED0cuTC930TYhy7ZDekkXVGw/L60TguJePPxcJF48lpsSUWEA/Ju jGFNgJOXc4Hz7TmAdBeu5Ve4AEjOi2/2jfd3cAJZ+IbNrvdjgBZY01b+HTuG3cLws6BJZqVOj/pp T0OqVwx3rFq+QmJwx3loK5JSLEhDIt62+mtC2C+SrAUxEbV6C6v2BRbd6pILBKFpepKZJHgGgrKF sptSUUoczpwg2pQ7ZH1tgs0ou/917mzz6Cs2//C978cv5l07L02orIEAAAAASUVORK5CYII= User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.3 (Mac OS X 10.6.6) Cancel-Lock: sha1:0U0tbH4dq0ECDg2KGX8rZXqD+0s= X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 3) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-Received-From: 199.232.76.165 X-Spam-Score: -4.9 (----) X-Debbugs-Envelope-To: submit X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -4.7 (----) On 2011-03-21 17:00 +0800, Eli Zaretskii wrote: >> From: Leo >> Date: Mon, 21 Mar 2011 14:22:24 +0800 >> Cc: >> >> Is it OK to change the encoding for abbrev file to utf-8? > > What will that do to characters that are not unified into the range of > valid Unicode code points? That's a valid concern. But ,---- | M -- emacs-mule | | Emacs 21 internal format used in buffer and string. | Type: emacs-mule (Emacs 21 internal encoding) | EOL type: Automatic selection from: | [emacs-mule-unix emacs-mule-dos emacs-mule-mac] | This coding system can encode all emacs-mule charsets. | | [back] `---- ,----[ (info "(elisp)Text Representations") ] | (1) This internal representation is based on one of the encodings | defined by the Unicode Standard, called "UTF-8", for representing any | Unicode codepoint, but Emacs extends UTF-8 to represent the additional | codepoints it uses for raw 8-bit bytes and characters not unified with | Unicode. `---- Would you agree to use utf-8-emacs instead, which covers all characters. > > Can you tell what is the purpose of this change? Make abbrev file editable to other editors. Leo From debbugs-submit-bounces@debbugs.gnu.org Mon Mar 21 06:54:39 2011 Received: (at submit) by debbugs.gnu.org; 21 Mar 2011 10:54:39 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1ckw-0001A4-Jj for submit@debbugs.gnu.org; Mon, 21 Mar 2011 06:54:38 -0400 Received: from eggs.gnu.org ([140.186.70.92]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1cku-00019r-IA for submit@debbugs.gnu.org; Mon, 21 Mar 2011 06:54:37 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1cko-00061s-Ld for submit@debbugs.gnu.org; Mon, 21 Mar 2011 06:54:31 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-2.9 required=5.0 tests=ALL_TRUSTED,BAYES_00, T_RP_MATCHES_RCVD autolearn=unavailable version=3.3.1 Received: from lists.gnu.org ([199.232.76.165]:45421) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1cko-00061l-9r for submit@debbugs.gnu.org; Mon, 21 Mar 2011 06:54:30 -0400 Received: from [140.186.70.92] (port=53515 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Q1cke-0005c9-6Z for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 06:54:24 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1ckc-0005zK-J4 for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 06:54:19 -0400 Received: from fencepost.gnu.org ([140.186.70.10]:57312) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1ckc-0005zG-Hj for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 06:54:18 -0400 Received: from eliz by fencepost.gnu.org with local (Exim 4.71) (envelope-from ) id 1Q1ckc-00018i-5H; Mon, 21 Mar 2011 06:54:18 -0400 Date: Mon, 21 Mar 2011 06:54:18 -0400 Message-Id: From: Eli Zaretskii To: Leo In-reply-to: (message from Leo on Mon, 21 Mar 2011 18:01:17 +0800) Subject: Re: bug#8308: 23.3; Use utf-8 for writing abbrev file References: X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 3) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-Received-From: 199.232.76.165 X-Spam-Score: -6.5 (------) X-Debbugs-Envelope-To: submit Cc: bug-gnu-emacs@gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list Reply-To: Eli Zaretskii List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -6.5 (------) > From: Leo > Date: Mon, 21 Mar 2011 18:01:17 +0800 > Cc: > > Would you agree to use utf-8-emacs instead, which covers all characters. That's better, but the characters outside Unicode are still going to do bad things to any software except Emacs. AFAIK, emacs-mule is a superset of iso-2022 in the same way as utf-8-emacs is a superset of utf-8. > > Can you tell what is the purpose of this change? > > Make abbrev file editable to other editors. If we are really keen on making the abbrev files editable to other editors, we should make sure they are encoded in some encoding that these other editors will understand. That probably calls for using utf-8 for everything that's covered by Unicode, and using other appropriate encodings for characters outside Unicode. From debbugs-submit-bounces@debbugs.gnu.org Mon Mar 21 07:20:34 2011 Received: (at submit) by debbugs.gnu.org; 21 Mar 2011 11:20:34 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1dA0-0001jH-WA for submit@debbugs.gnu.org; Mon, 21 Mar 2011 07:20:33 -0400 Received: from eggs.gnu.org ([140.186.70.92]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1d9z-0001j3-3W for submit@debbugs.gnu.org; Mon, 21 Mar 2011 07:20:31 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1d9s-0002Cs-Mu for submit@debbugs.gnu.org; Mon, 21 Mar 2011 07:20:25 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,RCVD_IN_DNSWL_NONE autolearn=unavailable version=3.3.1 Received: from lists.gnu.org ([199.232.76.165]:38363) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1d9s-0002Cl-FE for submit@debbugs.gnu.org; Mon, 21 Mar 2011 07:20:24 -0400 Received: from [140.186.70.92] (port=35052 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Q1d9q-0000zJ-Nt for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 07:20:23 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1d9p-0002C2-7E for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 07:20:22 -0400 Received: from moutng.kundenserver.de ([212.227.126.187]:63044) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1d9o-0002Bi-La for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 07:20:21 -0400 Received: from [192.168.178.29] (brln-d9ba3060.pool.mediaWays.net [217.186.48.96]) by mrelayeu.kundenserver.de (node=mrbap4) with ESMTP (Nemesis) id 0MgNde-1QM1ot0uIQ-00NeyJ; Mon, 21 Mar 2011 12:20:17 +0100 Message-ID: <4D8735D7.10202@easy-emacs.de> Date: Mon, 21 Mar 2011 12:26:15 +0100 From: =?ISO-8859-15?Q?Andreas_R=F6hler?= User-Agent: Mozilla/5.0 (X11; U; Linux i686; de; rv:1.9.2.15) Gecko/20110303 Thunderbird/3.1.9 MIME-Version: 1.0 To: bug-gnu-emacs@gnu.org Subject: Re: bug#8308: 23.3; Use utf-8 for writing abbrev file References: In-Reply-To: Content-Type: text/plain; charset=ISO-8859-15; format=flowed Content-Transfer-Encoding: 7bit X-Provags-ID: V02:K0:a3tvepwrug+pXpg/l1XlL3S/d32ZaDn8NzTL6atSsPc BYsxGU9EKAsk4e57JD1yyHFBzaPhgexTIQ19AbNKv5uU4pRzjB bKH/VtX3OtDR1RZJIrhtYlQfRahaLYOAgkIFqj+NiZbI9A2tO5 9O24RLZwo8o926PU14h9fFSI/+GL38fH4J+xTDiFrkIIOKe6z/ 9t7S1gxxUBOBwea3lXVBI3RKHNaPrZAjnPrGMtUjoU= X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-Received-From: 199.232.76.165 X-Spam-Score: -6.6 (------) X-Debbugs-Envelope-To: submit X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -6.6 (------) Am 21.03.2011 11:54, schrieb Eli Zaretskii: >> From: Leo >> Date: Mon, 21 Mar 2011 18:01:17 +0800 >> Cc: >> >> Would you agree to use utf-8-emacs instead, which covers all characters. > > That's better, but the characters outside Unicode are still going to > do bad things to any software except Emacs. AFAIK, emacs-mule is a > superset of iso-2022 in the same way as utf-8-emacs is a superset of > utf-8. > >>> Can you tell what is the purpose of this change? >> >> Make abbrev file editable to other editors. > > If we are really keen on making the abbrev files editable to other > editors, we should make sure they are encoded in some encoding that > these other editors will understand. That probably calls for using > utf-8 for everything that's covered by Unicode, and using other > appropriate encodings for characters outside Unicode. > > > > Hi, sounds interesting for me, as not just other editors are at stake AFAIU, but auto-generated abbrevs produced by programms. These might be theme-specific, cover items of medicine, jura etc. Could offer modes with preloaded abbrevs resp. to matter of writing. Regards, Andreas From debbugs-submit-bounces@debbugs.gnu.org Mon Mar 21 10:50:30 2011 Received: (at 8308) by debbugs.gnu.org; 21 Mar 2011 14:50:31 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1gRC-0007zX-5A for submit@debbugs.gnu.org; Mon, 21 Mar 2011 10:50:30 -0400 Received: from ironport2-out.teksavvy.com ([206.248.154.181] helo=ironport2-out.pppoe.ca) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1gR9-0007zJ-UT; Mon, 21 Mar 2011 10:50:28 -0400 X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: AvsEADsCh01Ld/X5/2dsb2JhbAClcniITbd5hWMElWk X-IronPort-AV: E=Sophos;i="4.63,219,1299474000"; d="scan'208";a="97413709" Received: from 75-119-245-249.dsl.teksavvy.com (HELO pastel.home) ([75.119.245.249]) by ironport2-out.pppoe.ca with ESMTP/TLS/ADH-AES256-SHA; 21 Mar 2011 10:50:22 -0400 Received: by pastel.home (Postfix, from userid 20848) id BAD7958AF5; Mon, 21 Mar 2011 10:50:21 -0400 (EDT) From: Stefan Monnier To: Leo Subject: Re: bug#8308: 23.3; Use utf-8 for writing abbrev file Message-ID: References: Date: Mon, 21 Mar 2011 10:50:21 -0400 In-Reply-To: (Leo's message of "Mon, 21 Mar 2011 14:22:24 +0800") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.0.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Spam-Score: -2.0 (--) X-Debbugs-Envelope-To: 8308 Cc: 8308@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -2.0 (--) > Is it OK to change the encoding for abbrev file to utf-8? > === modified file 'lisp/abbrev.el' > --- a/lisp/abbrev.el 2011-03-21 05:49:12 +0000 > +++ b/lisp/abbrev.el 2011-03-21 06:20:36 +0000 > @@ -225,9 +225,9 @@ > abbrev-file-name))) > (or (and file (> (length file) 0)) > (setq file abbrev-file-name)) > - (let ((coding-system-for-write 'emacs-mule)) > + (let ((coding-system-for-write 'utf-8)) > (with-temp-file file > - (insert ";;-*-coding: emacs-mule;-*-\n") > + (insert ";;-*-coding: utf-8;-*-\n") > (dolist (table > ;; We sort the table in order to ease the automatic > ;; merging of different versions of the user's abbrevs Sounds good in general, but I'm wondering whether we should worry about the presence of abbrevs which include bytes (aka eight-bit-chars). Using `utf-8-emacs' should fix those issues, but would then bump into the problem that such abbrev files wouldn't be compatible with Emacs-22. Stefan From debbugs-submit-bounces@debbugs.gnu.org Mon Mar 21 11:38:08 2011 Received: (at submit) by debbugs.gnu.org; 21 Mar 2011 15:38:09 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1hBH-0002kl-W4 for submit@debbugs.gnu.org; Mon, 21 Mar 2011 11:38:08 -0400 Received: from eggs.gnu.org ([140.186.70.92]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1hBF-0002k0-O1 for submit@debbugs.gnu.org; Mon, 21 Mar 2011 11:38:06 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1hB9-000775-Sl for submit@debbugs.gnu.org; Mon, 21 Mar 2011 11:38:00 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,FREEMAIL_FROM, T_RP_MATCHES_RCVD, T_TO_NO_BRKTS_FREEMAIL autolearn=unavailable version=3.3.1 Received: from lists.gnu.org ([199.232.76.165]:55615) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1hB9-00076u-Mz for submit@debbugs.gnu.org; Mon, 21 Mar 2011 11:37:59 -0400 Received: from [140.186.70.92] (port=60905 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Q1hB7-0005HE-JO for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 11:37:59 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1hB6-00075f-9N for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 11:37:57 -0400 Received: from lo.gmane.org ([80.91.229.12]:39767) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1hB6-00075V-1s for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 11:37:56 -0400 Received: from list by lo.gmane.org with local (Exim 4.69) (envelope-from ) id 1Q1hB4-0006SU-9s for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 16:37:54 +0100 Received: from th041135.ip.tsinghua.edu.cn ([59.66.41.135]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 21 Mar 2011 16:37:54 +0100 Received: from sdl.web by th041135.ip.tsinghua.edu.cn with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 21 Mar 2011 16:37:54 +0100 X-Injected-Via-Gmane: http://gmane.org/ To: bug-gnu-emacs@gnu.org From: Leo Subject: Re: bug#8308: 23.3; Use utf-8 for writing abbrev file Date: Mon, 21 Mar 2011 23:37:41 +0800 Lines: 9 Message-ID: References: Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Complaints-To: usenet@dough.gmane.org X-Gmane-NNTP-Posting-Host: th041135.ip.tsinghua.edu.cn Face: iVBORw0KGgoAAAANSUhEUgAAACgAAAAoAgMAAADxkFD+AAAADFBMVEUvT09qWs3/pQD///+J kUVcAAAAAWJLR0QAiAUdSAAAAAlwSFlzAAALEwAACxMBAJqcGAAAAAd0SU1FB9cBBwMLOd3veKQA AACuSURBVBjTldE9CgIxEAXgB+lEyFUC2wo5ikdZ8DSypxhMY7H9VuIVwlqkGRgnm59VsHGafIQ3 CZlAtmKIRaHETgYa12lqvEsPYKf8wXHsPGfqPaUM0g9aJPKFXkmNQmSDqwzz4Fpgpz+6WAPY2z5o uPJJpu0uypcl4nyCibMLQ8lCiVjayLoQvw5LsVKQuHPRR958HZbOcVsKeepcLxpByjycGvnKmY+c MBvrtyjfe0vmuLvdq/kAAAAASUVORK5CYII= User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.3 (Mac OS X 10.6.6) Cancel-Lock: sha1:La848sBfd/Wzi0xVFsCW4zYxjDY= X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 3) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-Received-From: 199.232.76.165 X-Spam-Score: -4.5 (----) X-Debbugs-Envelope-To: submit X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -4.8 (----) On 2011-03-21 22:50 +0800, Stefan Monnier wrote: > Sounds good in general, but I'm wondering whether we should worry about > the presence of abbrevs which include bytes (aka eight-bit-chars). > Using `utf-8-emacs' should fix those issues, but would then bump into > the problem that such abbrev files wouldn't be compatible with Emacs-22. I think we should just use utf-8-emacs. What do other people think? Leo From debbugs-submit-bounces@debbugs.gnu.org Mon Mar 21 14:18:34 2011 Received: (at submit) by debbugs.gnu.org; 21 Mar 2011 18:18:34 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1jgX-0006ba-Mn for submit@debbugs.gnu.org; Mon, 21 Mar 2011 14:18:34 -0400 Received: from eggs.gnu.org ([140.186.70.92]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1jgV-0006bO-54 for submit@debbugs.gnu.org; Mon, 21 Mar 2011 14:18:31 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1jgP-0003MA-4i for submit@debbugs.gnu.org; Mon, 21 Mar 2011 14:18:26 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,RCVD_IN_DNSWL_NONE autolearn=unavailable version=3.3.1 Received: from lists.gnu.org ([199.232.76.165]:54975) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1jgP-0003M6-2i for submit@debbugs.gnu.org; Mon, 21 Mar 2011 14:18:25 -0400 Received: from [140.186.70.92] (port=42495 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Q1jgM-0002h0-Pk for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 14:18:25 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1jgK-0003Im-AI for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 14:18:22 -0400 Received: from moutng.kundenserver.de ([212.227.17.9]:59911) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1jgJ-0003Ho-Re for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 14:18:20 -0400 Received: from [192.168.178.29] (brln-d9ba30ef.pool.mediaWays.net [217.186.48.239]) by mrelayeu.kundenserver.de (node=mrbap2) with ESMTP (Nemesis) id 0Lmel1-1PRZiU0t6p-00a43I; Mon, 21 Mar 2011 19:18:17 +0100 Message-ID: <4D8797D0.3010109@easy-emacs.de> Date: Mon, 21 Mar 2011 19:24:16 +0100 From: =?ISO-8859-15?Q?Andreas_R=F6hler?= User-Agent: Mozilla/5.0 (X11; U; Linux i686; de; rv:1.9.2.15) Gecko/20110303 Thunderbird/3.1.9 MIME-Version: 1.0 To: bug-gnu-emacs@gnu.org Subject: Re: bug#8308: 23.3; Use utf-8 for writing abbrev file References: In-Reply-To: Content-Type: text/plain; charset=ISO-8859-15; format=flowed Content-Transfer-Encoding: 7bit X-Provags-ID: V02:K0:BgOPTRYCZIqEu+/sWTejxzKZrhgD3x5lGLjjqQNojag DlizBb98buPou16oP+lW1BYXlwTZ9eZluYXU9vZl4DFRsbZBoX 4yZmGJMadqB2PXGKRMqjBwU02M5ZgwDi2ujtxSXEEQCqCFO7ED M5ILQiuYtk+xMMMVzh5kgSq3S9dhCft90dNltQH9PJbh+dsqGb wFmfxhXrI2XM9k76Zh7YPRIFAXivwIEfaDZ2C2URt8= X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-Received-From: 199.232.76.165 X-Spam-Score: -6.6 (------) X-Debbugs-Envelope-To: submit X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -6.6 (------) Am 21.03.2011 15:50, schrieb Stefan Monnier: >> Is it OK to change the encoding for abbrev file to utf-8? >> === modified file 'lisp/abbrev.el' >> --- a/lisp/abbrev.el 2011-03-21 05:49:12 +0000 >> +++ b/lisp/abbrev.el 2011-03-21 06:20:36 +0000 >> @@ -225,9 +225,9 @@ >> abbrev-file-name))) >> (or (and file (> (length file) 0)) >> (setq file abbrev-file-name)) >> - (let ((coding-system-for-write 'emacs-mule)) >> + (let ((coding-system-for-write 'utf-8)) >> (with-temp-file file >> - (insert ";;-*-coding: emacs-mule;-*-\n") >> + (insert ";;-*-coding: utf-8;-*-\n") >> (dolist (table >> ;; We sort the table in order to ease the automatic >> ;; merging of different versions of the user's abbrevs > > Sounds good in general, but I'm wondering whether we should worry about > the presence of abbrevs which include bytes (aka eight-bit-chars). > Using `utf-8-emacs' should fix those issues, but would then bump into > the problem that such abbrev files wouldn't be compatible with Emacs-22. > > > Stefan > Hi, so maybe not hard-code it, rather have a variable? Andreas From debbugs-submit-bounces@debbugs.gnu.org Mon Mar 21 14:45:43 2011 Received: (at submit) by debbugs.gnu.org; 21 Mar 2011 18:45:44 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1k6p-0007Cf-Bl for submit@debbugs.gnu.org; Mon, 21 Mar 2011 14:45:43 -0400 Received: from eggs.gnu.org ([140.186.70.92]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1k6n-0007CU-G8 for submit@debbugs.gnu.org; Mon, 21 Mar 2011 14:45:41 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1k6h-0001eg-C5 for submit@debbugs.gnu.org; Mon, 21 Mar 2011 14:45:36 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,RCVD_IN_DNSWL_NONE autolearn=unavailable version=3.3.1 Received: from lists.gnu.org ([199.232.76.165]:48242) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1k6h-0001ec-AU for submit@debbugs.gnu.org; Mon, 21 Mar 2011 14:45:35 -0400 Received: from [140.186.70.92] (port=59554 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Q1k6g-00064e-DP for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 14:45:35 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1k6f-0001cj-6H for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 14:45:34 -0400 Received: from mtaout21.012.net.il ([80.179.55.169]:34239) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1k6e-0001cN-Sx for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 14:45:33 -0400 Received: from conversion-daemon.a-mtaout21.012.net.il by a-mtaout21.012.net.il (HyperSendmail v2007.08) id <0LIF00200804PB00@a-mtaout21.012.net.il> for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 20:45:27 +0200 (IST) Received: from HOME-C4E4A596F7 ([77.126.16.52]) by a-mtaout21.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0LIF002U383P2HC0@a-mtaout21.012.net.il>; Mon, 21 Mar 2011 20:45:27 +0200 (IST) Date: Mon, 21 Mar 2011 20:45:33 +0200 From: Eli Zaretskii Subject: Re: bug#8308: 23.3; Use utf-8 for writing abbrev file In-reply-to: X-012-Sender: halo1@inter.net.il To: Leo Message-id: <834o6w9usy.fsf@gnu.org> References: X-detected-operating-system: by eggs.gnu.org: Solaris 10 (beta) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-Received-From: 199.232.76.165 X-Spam-Score: -4.3 (----) X-Debbugs-Envelope-To: submit Cc: bug-gnu-emacs@gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list Reply-To: Eli Zaretskii List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -4.3 (----) > From: Leo > Date: Mon, 21 Mar 2011 23:37:41 +0800 > Cc: > > I think we should just use utf-8-emacs. Why do you think so? From debbugs-submit-bounces@debbugs.gnu.org Mon Mar 21 14:53:54 2011 Received: (at submit) by debbugs.gnu.org; 21 Mar 2011 18:53:55 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1kEk-0007NJ-JI for submit@debbugs.gnu.org; Mon, 21 Mar 2011 14:53:54 -0400 Received: from eggs.gnu.org ([140.186.70.92]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1kEi-0007N7-Sz for submit@debbugs.gnu.org; Mon, 21 Mar 2011 14:53:53 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1kEc-0003Wg-Qx for submit@debbugs.gnu.org; Mon, 21 Mar 2011 14:53:47 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=unavailable version=3.3.1 Received: from lists.gnu.org ([199.232.76.165]:54149) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1kEc-0003Wc-Oy for submit@debbugs.gnu.org; Mon, 21 Mar 2011 14:53:46 -0400 Received: from [140.186.70.92] (port=54521 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Q1kEb-0002R6-NF for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 14:53:46 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1kEa-0003WH-2h for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 14:53:45 -0400 Received: from mtaout20.012.net.il ([80.179.55.166]:33047) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1kEZ-0003Vx-N0 for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 14:53:44 -0400 Received: from conversion-daemon.a-mtaout20.012.net.il by a-mtaout20.012.net.il (HyperSendmail v2007.08) id <0LIF00C008AA3R00@a-mtaout20.012.net.il> for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 20:53:41 +0200 (IST) Received: from HOME-C4E4A596F7 ([77.126.16.52]) by a-mtaout20.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0LIF00AFF8H94TN0@a-mtaout20.012.net.il>; Mon, 21 Mar 2011 20:53:41 +0200 (IST) Date: Mon, 21 Mar 2011 20:53:41 +0200 From: Eli Zaretskii Subject: Re: bug#8308: 23.3; Use utf-8 for writing abbrev file In-reply-to: <4D8797D0.3010109@easy-emacs.de> To: Andreas =?iso-8859-15?Q?R=F6hler?= Message-id: <83zkoo8fuy.fsf@gnu.org> MIME-version: 1.0 Content-type: text/plain; charset=iso-8859-15 Content-transfer-encoding: QUOTED-PRINTABLE X-012-Sender: halo1@inter.net.il References: <4D8797D0.3010109@easy-emacs.de> X-detected-operating-system: by eggs.gnu.org: Solaris 10 (beta) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-Received-From: 199.232.76.165 X-Spam-Score: -4.4 (----) X-Debbugs-Envelope-To: submit Cc: bug-gnu-emacs@gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list Reply-To: Eli Zaretskii List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -4.4 (----) > Date: Mon, 21 Mar 2011 19:24:16 +0100 > From: Andreas R=F6hler > Cc:=20 >=20 > so maybe not hard-code it, rather have a variable? A constant encoding will never DTRT in all cases. From debbugs-submit-bounces@debbugs.gnu.org Mon Mar 21 21:01:11 2011 Received: (at submit) by debbugs.gnu.org; 22 Mar 2011 01:01:11 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1pyA-0008CG-Dj for submit@debbugs.gnu.org; Mon, 21 Mar 2011 21:01:10 -0400 Received: from eggs.gnu.org ([140.186.70.92]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1py8-0008C5-PR for submit@debbugs.gnu.org; Mon, 21 Mar 2011 21:01:09 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1py2-0004j6-MQ for submit@debbugs.gnu.org; Mon, 21 Mar 2011 21:01:03 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-2.6 required=5.0 tests=BAYES_00,FREEMAIL_FROM, RCVD_IN_DNSWL_LOW,T_DKIM_INVALID autolearn=unavailable version=3.3.1 Received: from lists.gnu.org ([199.232.76.165]:48452) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1py2-0004ix-KJ for submit@debbugs.gnu.org; Mon, 21 Mar 2011 21:01:02 -0400 Received: from [140.186.70.92] (port=51973 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Q1py1-0001Ff-Lq for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 21:01:02 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1py0-0004ib-P4 for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 21:01:01 -0400 Received: from mail-iy0-f169.google.com ([209.85.210.169]:39618) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1py0-0004iU-Lr; Mon, 21 Mar 2011 21:01:00 -0400 Received: by iyf13 with SMTP id 13so8864694iyf.0 for ; Mon, 21 Mar 2011 18:01:00 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:from:to:cc:subject:references:date:in-reply-to :message-id:user-agent:mime-version:content-type; bh=cOUOy4ViwwDugZ2ofBumAfy9xCLZqdJidmZB+WSt+aw=; b=qZcLrNzWb89tEFn1Ihl78YFUF7GZvthQJP1bDnlA2dde3208PG0BD+3tqOt/rLYN7S MxD5PBNLzZ3HTVcn+XEhb+38mphpXTJ+X19qE4bMMFBPmkFMIFV22avjGgpESdf41SEk MUMw24Z/CH0lwohD1qCUf780xCo/fGy0Qq4BQ= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=from:to:cc:subject:references:date:in-reply-to:message-id :user-agent:mime-version:content-type; b=jW50ztcs/zlwNB6+JaBoeT8HJsuSG6a6FPyiizmrde52tKohxC+Elv6xbFy4HmYsPq XiRh1RJiG1TRwznn8dCUx1OFMJ192Bp02cTuDaDf4tK73kGrg9TODcgEsjjW3sLTXCY/ WugsjdKuLXuUIAICkVCeq4DTFPMmlJBrxo9Gg= Received: by 10.231.195.212 with SMTP id ed20mr4793770ibb.112.1300755659930; Mon, 21 Mar 2011 18:00:59 -0700 (PDT) Received: from localhost (th041092.ip.tsinghua.edu.cn [59.66.41.92]) by mx.google.com with ESMTPS id 41sm4037687ibi.61.2011.03.21.18.00.55 (version=TLSv1/SSLv3 cipher=OTHER); Mon, 21 Mar 2011 18:00:57 -0700 (PDT) From: Leo To: Eli Zaretskii Subject: Re: bug#8308: 23.3; Use utf-8 for writing abbrev file References: <834o6w9usy.fsf@gnu.org> Date: Tue, 22 Mar 2011 09:00:51 +0800 In-Reply-To: <834o6w9usy.fsf@gnu.org> (Eli Zaretskii's message of "Mon, 21 Mar 2011 20:45:33 +0200") Message-ID: User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.3 (Mac OS X 10.6.6) MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-Received-From: 199.232.76.165 X-Spam-Score: -5.1 (-----) X-Debbugs-Envelope-To: submit Cc: bug-gnu-emacs@gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -5.2 (-----) On 2011-03-22 02:45 +0800, Eli Zaretskii wrote: >> I think we should just use utf-8-emacs. > > Why do you think so? By the time 24.1 is released, it will be 1-2 years from now and there will be two major stable releases that work with utf-8-emacs, which are backward-compatible enough. But I don't know so I'll forget about this bug and let the gurus figure it out. Leo From debbugs-submit-bounces@debbugs.gnu.org Mon Mar 21 22:48:54 2011 Received: (at submit) by debbugs.gnu.org; 22 Mar 2011 02:48:55 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1reQ-000278-0k for submit@debbugs.gnu.org; Mon, 21 Mar 2011 22:48:54 -0400 Received: from eggs.gnu.org ([140.186.70.92]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1reN-00026v-Gb for submit@debbugs.gnu.org; Mon, 21 Mar 2011 22:48:52 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1reH-0002QP-GI for submit@debbugs.gnu.org; Mon, 21 Mar 2011 22:48:46 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=unavailable version=3.3.1 Received: from lists.gnu.org ([199.232.76.165]:48429) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1reH-0002Q6-CE for submit@debbugs.gnu.org; Mon, 21 Mar 2011 22:48:45 -0400 Received: from [140.186.70.92] (port=57335 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Q1reG-0002aW-Fd for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 22:48:45 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1reE-0002Oz-T1 for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 22:48:44 -0400 Received: from ironport2-out.teksavvy.com ([206.248.154.181]:62361 helo=ironport2-out.pppoe.ca) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1reE-0002Of-QG; Mon, 21 Mar 2011 22:48:42 -0400 X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: AvsEAASrh01Ld/X5/2dsb2JhbAClQniITbpnhWMElWk X-IronPort-AV: E=Sophos;i="4.63,223,1299474000"; d="scan'208";a="97503848" Received: from 75-119-245-249.dsl.teksavvy.com (HELO ceviche.home) ([75.119.245.249]) by ironport2-out.pppoe.ca with ESMTP/TLS/ADH-AES256-SHA; 21 Mar 2011 22:48:41 -0400 Received: by ceviche.home (Postfix, from userid 20848) id C68EC660AF; Mon, 21 Mar 2011 22:48:40 -0400 (EDT) From: Stefan Monnier To: Leo Subject: Re: bug#8308: 23.3; Use utf-8 for writing abbrev file Message-ID: References: <834o6w9usy.fsf@gnu.org> Date: Mon, 21 Mar 2011 22:48:40 -0400 In-Reply-To: (Leo's message of "Tue, 22 Mar 2011 09:00:51 +0800") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.0.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-Received-From: 199.232.76.165 X-Spam-Score: -4.0 (----) X-Debbugs-Envelope-To: submit Cc: Eli Zaretskii , bug-gnu-emacs@gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -4.0 (----) >>> I think we should just use utf-8-emacs. >> Why do you think so? > By the time 24.1 is released, it will be 1-2 years from now and there > will be two major stable releases that work with utf-8-emacs, which are > backward-compatible enough. But I don't know so I'll forget about this > bug and let the gurus figure it out. I think it might be OK to do it for Emacs-25, but since Emacs-22 can't handle utf-8-emacs, I think it's a bit early to switch to it in Emacs-24. If utf-8 is sufficient, OTOH it's the best choice. So maybe we should check the buffer first to see if utf-8 is safe, and only fall back to emacs-mule if utf-8 is not safe. Stefan From debbugs-submit-bounces@debbugs.gnu.org Mon Mar 21 23:47:41 2011 Received: (at submit) by debbugs.gnu.org; 22 Mar 2011 03:47:41 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1sZI-0003qV-Kk for submit@debbugs.gnu.org; Mon, 21 Mar 2011 23:47:40 -0400 Received: from eggs.gnu.org ([140.186.70.92]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1sZG-0003qB-O3 for submit@debbugs.gnu.org; Mon, 21 Mar 2011 23:47:39 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1sZA-0001xh-Mc for submit@debbugs.gnu.org; Mon, 21 Mar 2011 23:47:33 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-2.6 required=5.0 tests=BAYES_00,FREEMAIL_FROM, RCVD_IN_DNSWL_LOW,T_DKIM_INVALID autolearn=unavailable version=3.3.1 Received: from lists.gnu.org ([199.232.76.165]:33596) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1sZA-0001xa-KJ for submit@debbugs.gnu.org; Mon, 21 Mar 2011 23:47:32 -0400 Received: from [140.186.70.92] (port=38260 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Q1sZ9-0001vS-KI for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 23:47:32 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1sZ8-0001xF-AJ for bug-gnu-emacs@gnu.org; Mon, 21 Mar 2011 23:47:31 -0400 Received: from mail-iy0-f169.google.com ([209.85.210.169]:59339) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1sZ8-0001x9-3J; Mon, 21 Mar 2011 23:47:30 -0400 Received: by iyf13 with SMTP id 13so9005651iyf.0 for ; Mon, 21 Mar 2011 20:47:29 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:from:to:cc:subject:references:date:in-reply-to :message-id:user-agent:mime-version:content-type; bh=ip85oOhi07+JLnNfjsbcq1KNOg48UlxKXUF2EwvuEIc=; b=Vl8emCxdb+dXhMB1HFBe6i76JmWobYvZmsgA1nUugR9DFPPuejMBah3dHd21/eAkBw 2ccaXQze7t2QHIZ9O4ZMMM8PE6E3u+S+X5AUvCxjGjCJ3BxFASb+bOCg8Wy1Z4r6RV3z k7MdXdeanauqxKUeVh5RuBkbv1jl4m2cv3lso= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=from:to:cc:subject:references:date:in-reply-to:message-id :user-agent:mime-version:content-type; b=vJmLW0mX+cijtbxFzalxVwAJtmpcb4pN2UNBobSBqN68hj36CW9PhmEAdQIYN5w02D Es86nUyqMVgSf/PfstYpj99fIhCquYGURcSp2scuglBo6l9iOa3Dlz7HyuBjRNT5yqJN mb67sVBVLzquzNzxG3JrW3d91SRaZft4xgEo8= Received: by 10.42.73.195 with SMTP id t3mr6000968icj.106.1300765649359; Mon, 21 Mar 2011 20:47:29 -0700 (PDT) Received: from localhost (th041092.ip.tsinghua.edu.cn [59.66.41.92]) by mx.google.com with ESMTPS id g16sm2453730ibb.3.2011.03.21.20.47.25 (version=TLSv1/SSLv3 cipher=OTHER); Mon, 21 Mar 2011 20:47:27 -0700 (PDT) From: Leo To: Stefan Monnier Subject: Re: bug#8308: 23.3; Use utf-8 for writing abbrev file References: <834o6w9usy.fsf@gnu.org> Date: Tue, 22 Mar 2011 11:47:21 +0800 In-Reply-To: (Stefan Monnier's message of "Mon, 21 Mar 2011 22:48:40 -0400") Message-ID: User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.3 (Mac OS X 10.6.6) MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-Received-From: 199.232.76.165 X-Spam-Score: -5.2 (-----) X-Debbugs-Envelope-To: submit Cc: Eli Zaretskii , bug-gnu-emacs@gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -5.3 (-----) On 2011-03-22 10:48 +0800, Stefan Monnier wrote: > I think it might be OK to do it for Emacs-25, but since Emacs-22 can't > handle utf-8-emacs, I think it's a bit early to switch to it in > Emacs-24. If utf-8 is sufficient, OTOH it's the best choice. So maybe > we should check the buffer first to see if utf-8 is safe, and only fall > back to emacs-mule if utf-8 is not safe. I think default to utf-8 is good, which is sufficient for most people. Any comments on the following patch? I don't know how to introduce a char unencodable with utf-8 to the abbrevs. So it is only partially tested. === modified file 'lisp/abbrev.el' --- lisp/abbrev.el 2011-01-25 04:08:28 +0000 +++ lisp/abbrev.el 2011-03-22 03:30:52 +0000 @@ -225,21 +225,29 @@ abbrev-file-name))) (or (and file (> (length file) 0)) (setq file abbrev-file-name)) - (let ((coding-system-for-write 'emacs-mule)) - (with-temp-file file - (insert ";;-*-coding: emacs-mule;-*-\n") + (let ((coding-system-for-write 'utf-8)) + (with-temp-buffer (dolist (table - ;; We sort the table in order to ease the automatic - ;; merging of different versions of the user's abbrevs - ;; file. This is useful, for example, for when the - ;; user keeps their home directory in a revision - ;; control system, and is therefore keeping multiple - ;; slightly-differing copies loosely synchronized. - (sort (copy-sequence abbrev-table-name-list) - (lambda (s1 s2) - (string< (symbol-name s1) - (symbol-name s2))))) - (insert-abbrev-table-description table nil))))) + ;; We sort the table in order to ease the automatic + ;; merging of different versions of the user's abbrevs + ;; file. This is useful, for example, for when the + ;; user keeps their home directory in a revision + ;; control system, and is therefore keeping multiple + ;; slightly-differing copies loosely synchronized. + (sort (copy-sequence abbrev-table-name-list) + (lambda (s1 s2) + (string< (symbol-name s1) + (symbol-name s2))))) + (insert-abbrev-table-description table nil)) + (when (unencodable-char-position (point-min) (point-max) 'utf-8) + (setq coding-system-for-write + (if (> emacs-major-version 24) + 'utf-8-emacs + ;; For compatibility with Emacs 22 + 'emacs-mule))) + (goto-char (point-min)) + (insert (format ";;-*-coding: %s;-*-\n" coding-system-for-write)) + (write-region nil nil file nil 0)))) (defun add-mode-abbrev (arg) "Define mode-specific abbrev for last word(s) before point. From debbugs-submit-bounces@debbugs.gnu.org Tue Mar 22 01:24:42 2011 Received: (at submit) by debbugs.gnu.org; 22 Mar 2011 05:24:43 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1u5B-00075G-RT for submit@debbugs.gnu.org; Tue, 22 Mar 2011 01:24:42 -0400 Received: from eggs.gnu.org ([140.186.70.92]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1u5A-000753-8S for submit@debbugs.gnu.org; Tue, 22 Mar 2011 01:24:40 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1u53-0006Zk-Vc for submit@debbugs.gnu.org; Tue, 22 Mar 2011 01:24:35 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=unavailable version=3.3.1 Received: from lists.gnu.org ([199.232.76.165]:56884) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1u53-0006ZU-Th for submit@debbugs.gnu.org; Tue, 22 Mar 2011 01:24:33 -0400 Received: from [140.186.70.92] (port=58839 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Q1u52-0003W7-OO for bug-gnu-emacs@gnu.org; Tue, 22 Mar 2011 01:24:33 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1u50-0006Z4-Gp for bug-gnu-emacs@gnu.org; Tue, 22 Mar 2011 01:24:32 -0400 Received: from ironport2-out.teksavvy.com ([206.248.154.183]:55558 helo=ironport2-out.pppoe.ca) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1u50-0006Ys-EF; Tue, 22 Mar 2011 01:24:30 -0400 X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: AvsEAE/Ph01Ld/X5/2dsb2JhbAClQ3iITboShWMElWk X-IronPort-AV: E=Sophos;i="4.63,224,1299474000"; d="scan'208";a="97511818" Received: from 75-119-245-249.dsl.teksavvy.com (HELO ceviche.home) ([75.119.245.249]) by ironport2-out.pppoe.ca with ESMTP/TLS/ADH-AES256-SHA; 22 Mar 2011 01:24:28 -0400 Received: by ceviche.home (Postfix, from userid 20848) id 270F5660AF; Tue, 22 Mar 2011 01:24:28 -0400 (EDT) From: Stefan Monnier To: Leo Subject: Re: bug#8308: 23.3; Use utf-8 for writing abbrev file Message-ID: References: <834o6w9usy.fsf@gnu.org> Date: Tue, 22 Mar 2011 01:24:28 -0400 In-Reply-To: (Leo's message of "Tue, 22 Mar 2011 11:47:21 +0800") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.0.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-Received-From: 199.232.76.165 X-Spam-Score: -4.0 (----) X-Debbugs-Envelope-To: submit Cc: Eli Zaretskii , bug-gnu-emacs@gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -4.1 (----) > I think default to utf-8 is good, which is sufficient for most people. > Any comments on the following patch? I don't know how to introduce a > char unencodable with utf-8 to the abbrevs. So it is only partially > tested. (unibyte-string 129) returns a string containing an unencodable char. So you can test with it. The patch looks good, Stefan From debbugs-submit-bounces@debbugs.gnu.org Tue Mar 22 06:42:13 2011 Received: (at submit) by debbugs.gnu.org; 22 Mar 2011 10:42:13 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1z2T-0006MD-4i for submit@debbugs.gnu.org; Tue, 22 Mar 2011 06:42:13 -0400 Received: from eggs.gnu.org ([140.186.70.92]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q1z2P-0006M0-R3 for submit@debbugs.gnu.org; Tue, 22 Mar 2011 06:42:10 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1z2J-0003xV-OT for submit@debbugs.gnu.org; Tue, 22 Mar 2011 06:42:04 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-0.7 required=5.0 tests=BAYES_00,FREEMAIL_FROM, RCVD_NUMERIC_HELO, T_RP_MATCHES_RCVD, T_TO_NO_BRKTS_FREEMAIL autolearn=no version=3.3.1 Received: from lists.gnu.org ([199.232.76.165]:43140) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1z2J-0003xL-MM for submit@debbugs.gnu.org; Tue, 22 Mar 2011 06:42:03 -0400 Received: from [140.186.70.92] (port=54200 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Q1z2I-0005ra-Qg for bug-gnu-emacs@gnu.org; Tue, 22 Mar 2011 06:42:03 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q1z2H-0003x7-Ua for bug-gnu-emacs@gnu.org; Tue, 22 Mar 2011 06:42:02 -0400 Received: from lo.gmane.org ([80.91.229.12]:59073) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q1z2H-0003wt-P5 for bug-gnu-emacs@gnu.org; Tue, 22 Mar 2011 06:42:01 -0400 Received: from list by lo.gmane.org with local (Exim 4.69) (envelope-from ) id 1Q1z2F-0000oN-C4 for bug-gnu-emacs@gnu.org; Tue, 22 Mar 2011 11:41:59 +0100 Received: from 114.249.16.245 ([114.249.16.245]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Tue, 22 Mar 2011 11:41:59 +0100 Received: from sdl.web by 114.249.16.245 with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Tue, 22 Mar 2011 11:41:59 +0100 X-Injected-Via-Gmane: http://gmane.org/ To: bug-gnu-emacs@gnu.org From: Leo Subject: Re: bug#8308: 23.3; Use utf-8 for writing abbrev file Date: Tue, 22 Mar 2011 18:41:39 +0800 Lines: 10 Message-ID: References: <834o6w9usy.fsf@gnu.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Complaints-To: usenet@dough.gmane.org X-Gmane-NNTP-Posting-Host: 114.249.16.245 Face: iVBORw0KGgoAAAANSUhEUgAAACgAAAAoAgMAAADxkFD+AAAADFBMVEUvT09qWs3/pQD///+J kUVcAAAAAWJLR0QAiAUdSAAAAAlwSFlzAAALEwAACxMBAJqcGAAAAAd0SU1FB9cBBwMLOd3veKQA AACuSURBVBjTldE9CgIxEAXgB+lEyFUC2wo5ikdZ8DSypxhMY7H9VuIVwlqkGRgnm59VsHGafIQ3 CZlAtmKIRaHETgYa12lqvEsPYKf8wXHsPGfqPaUM0g9aJPKFXkmNQmSDqwzz4Fpgpz+6WAPY2z5o uPJJpu0uypcl4nyCibMLQ8lCiVjayLoQvw5LsVKQuHPRR958HZbOcVsKeepcLxpByjycGvnKmY+c MBvrtyjfe0vmuLvdq/kAAAAASUVORK5CYII= User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.3 (Mac OS X 10.6.6) Cancel-Lock: sha1:4lxyoiO6CVEof8cGECnphxi05r8= X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 3) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-Received-From: 199.232.76.165 X-Spam-Score: -4.5 (----) X-Debbugs-Envelope-To: submit X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -4.4 (----) On 2011-03-22 13:24 +0800, Stefan Monnier wrote: > (unibyte-string 129) returns a string containing an unencodable char. > So you can test with it. I still cannot get any byte into the abbrevs. For example, (unibyte-string 129) returns byte \201 but when it is written to abbrev file by write-abbrev-file, it is changed to \ 2 0 1, so utf-8 appear sufficient even for bytes. Leo From debbugs-submit-bounces@debbugs.gnu.org Tue Mar 22 14:27:25 2011 Received: (at submit) by debbugs.gnu.org; 22 Mar 2011 18:27:27 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q26If-0001pI-Hs for submit@debbugs.gnu.org; Tue, 22 Mar 2011 14:27:25 -0400 Received: from eggs.gnu.org ([140.186.70.92]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q26Ic-0001p4-Ql for submit@debbugs.gnu.org; Tue, 22 Mar 2011 14:27:23 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q26IW-0005Rw-NP for submit@debbugs.gnu.org; Tue, 22 Mar 2011 14:27:17 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,T_RP_MATCHES_RCVD autolearn=unavailable version=3.3.1 Received: from lists.gnu.org ([199.232.76.165]:40554) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q26IW-0005Rr-LN for submit@debbugs.gnu.org; Tue, 22 Mar 2011 14:27:16 -0400 Received: from [140.186.70.92] (port=53183 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Q26IV-00004t-F6 for bug-gnu-emacs@gnu.org; Tue, 22 Mar 2011 14:27:16 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Q26IT-0005Qv-RF for bug-gnu-emacs@gnu.org; Tue, 22 Mar 2011 14:27:15 -0400 Received: from chene.dit.umontreal.ca ([132.204.246.20]:41480) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Q26IT-0005Pc-KN for bug-gnu-emacs@gnu.org; Tue, 22 Mar 2011 14:27:13 -0400 Received: from faina.iro.umontreal.ca (lechon.iro.umontreal.ca [132.204.27.242]) by chene.dit.umontreal.ca (8.14.1/8.14.1) with ESMTP id p2MISDrU000531; Tue, 22 Mar 2011 14:28:13 -0400 Received: by faina.iro.umontreal.ca (Postfix, from userid 20848) id 8DF9DB4269; Tue, 22 Mar 2011 14:27:05 -0400 (EDT) From: Stefan Monnier To: Leo Subject: Re: bug#8308: 23.3; Use utf-8 for writing abbrev file Message-ID: References: <834o6w9usy.fsf@gnu.org> Date: Tue, 22 Mar 2011 14:27:04 -0400 In-Reply-To: (Leo's message of "Tue, 22 Mar 2011 18:41:39 +0800") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.0.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-NAI-Spam-Flag: NO X-NAI-Spam-Threshold: 5 X-NAI-Spam-Score: 0 X-NAI-Spam-Rules: 1 Rules triggered RV3804=0 X-NAI-Spam-Version: 2.2.0.9286 : core <3804> : streams <611568> : uri <832846> X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-Received-From: 199.232.76.165 X-Spam-Score: -4.0 (----) X-Debbugs-Envelope-To: submit Cc: bug-gnu-emacs@gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -4.0 (----) >> (unibyte-string 129) returns a string containing an unencodable char. >> So you can test with it. > I still cannot get any byte into the abbrevs. For example, > (unibyte-string 129) returns byte \201 but when it is written to abbrev > file by write-abbrev-file, it is changed to \ 2 0 1, so utf-8 appear > sufficient even for bytes. Good. In any case your unencodable-foo test would trigger if there were eight-bit-chars in there, so it works correctly in this respect. Please install your patch. Stefan From debbugs-submit-bounces@debbugs.gnu.org Tue Mar 22 20:42:25 2011 Received: (at 8308-done) by debbugs.gnu.org; 23 Mar 2011 00:42:25 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q2C9Y-0002QO-KP for submit@debbugs.gnu.org; Tue, 22 Mar 2011 20:42:24 -0400 Received: from mail-iy0-f172.google.com ([209.85.210.172]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1Q2C9V-0002QA-Bs for 8308-done@debbugs.gnu.org; Tue, 22 Mar 2011 20:42:21 -0400 Received: by iyi12 with SMTP id 12so7675828iyi.3 for <8308-done@debbugs.gnu.org>; Tue, 22 Mar 2011 17:42:15 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:from:to:subject:references:date:in-reply-to :message-id:user-agent:mime-version:content-type; bh=zJrQMC2S8lADBz1JFsb4x/eQ9kWUzz5q3HJDi8aR/Cc=; b=i//oXGyRLDU810BM5sRMlr8sUBpVWYV/7P3SCUE4Gq3Qy2gYlzkmZgMicwz73ncfPb FAP/WqNiAVhos6mQRJhCQvBPkODtxT7XXx+4Y8Y+t7oRZaiG/zSbYhOm1cI6JASdCArU 1x060BkavSkG+KQVPqH1ueOVhWQ5azJOXDZ1k= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=from:to:subject:references:date:in-reply-to:message-id:user-agent :mime-version:content-type; b=MfeTk3z3UiLasS8XaddCppu7gTwT9/SnY8a1A/A2vWELwDzjfedcB6hpu+dpuuI1lQ POHfvkYSggDtBV7w5YuH2N5VwqjK67v2CI7H/+GLp1+6ZEpxkv93dx0tIXYjIw8R8x4B 42trIy2l8X61gDpFtByfYPncxqV4j/rML1WWI= Received: by 10.231.65.68 with SMTP id h4mr5107534ibi.36.1300840935664; Tue, 22 Mar 2011 17:42:15 -0700 (PDT) Received: from localhost (th041098.ip.tsinghua.edu.cn [59.66.41.98]) by mx.google.com with ESMTPS id 41sm4721134ibi.61.2011.03.22.17.42.11 (version=TLSv1/SSLv3 cipher=OTHER); Tue, 22 Mar 2011 17:42:13 -0700 (PDT) From: Leo To: 8308-done@debbugs.gnu.org Subject: Re: bug#8308: 23.3; Use utf-8 for writing abbrev file References: <834o6w9usy.fsf@gnu.org> Date: Wed, 23 Mar 2011 08:42:08 +0800 In-Reply-To: (Stefan Monnier's message of "Tue, 22 Mar 2011 14:27:04 -0400") Message-ID: User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.3 (Mac OS X 10.6.6) MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Spam-Score: -4.2 (----) X-Debbugs-Envelope-To: 8308-done X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -4.1 (----) Version: 24.1. From unknown Tue Aug 19 08:37:12 2025 Received: (at fakecontrol) by fakecontrolmessage; To: internal_control@debbugs.gnu.org From: Debbugs Internal Request Subject: Internal Control Message-Id: bug archived. Date: Wed, 20 Apr 2011 11:24:04 +0000 User-Agent: Fakemail v42.6.9 # This is a fake control message. # # The action: # bug archived. thanks # This fakemail brought to you by your local debbugs # administrator