From debbugs-submit-bounces@debbugs.gnu.org Fri Mar 28 18:23:11 2014 Received: (at submit) by debbugs.gnu.org; 28 Mar 2014 22:23:11 +0000 Received: from localhost ([127.0.0.1]:55186 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1WTfB9-00012V-3D for submit@debbugs.gnu.org; Fri, 28 Mar 2014 18:23:11 -0400 Received: from eggs.gnu.org ([208.118.235.92]:33024) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1WTfB5-00012K-8H for submit@debbugs.gnu.org; Fri, 28 Mar 2014 18:23:09 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1WTfAx-0002d7-1J for submit@debbugs.gnu.org; Fri, 28 Mar 2014 18:23:07 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-0.5 required=5.0 tests=BAYES_05,T_DKIM_INVALID autolearn=disabled version=3.3.2 Received: from lists.gnu.org ([2001:4830:134:3::11]:49292) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1WTfAw-0002d1-Tj for submit@debbugs.gnu.org; Fri, 28 Mar 2014 18:22:58 -0400 Received: from eggs.gnu.org ([2001:4830:134:3::10]:34712) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1WTfAp-0007Ot-S9 for bug-gnu-emacs@gnu.org; Fri, 28 Mar 2014 18:22:58 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1WTfAi-0002bi-4v for bug-gnu-emacs@gnu.org; Fri, 28 Mar 2014 18:22:51 -0400 Received: from gproxy3-pub.mail.unifiedlayer.com ([69.89.30.42]:40115) by eggs.gnu.org with smtp (Exim 4.71) (envelope-from ) id 1WTfAh-0002bG-RH for bug-gnu-emacs@gnu.org; Fri, 28 Mar 2014 18:22:44 -0400 Received: (qmail 12494 invoked by uid 0); 28 Mar 2014 22:22:36 -0000 Received: from unknown (HELO CMOut01) (10.0.90.82) by gproxy3.mail.unifiedlayer.com with SMTP; 28 Mar 2014 22:22:36 -0000 Received: from host393.hostmonster.com ([66.147.240.193]) by CMOut01 with id jANW1n00d4B3kjm01ANZZB; Fri, 28 Mar 2014 16:22:35 -0600 X-Authority-Analysis: v=2.1 cv=Re0DVTdv c=1 sm=1 tr=0 a=GZ6qK+eS4AuCRVUKGEKC+Q==:117 a=GZ6qK+eS4AuCRVUKGEKC+Q==:17 a=DsvgjBjRAAAA:8 a=f5113yIGAAAA:8 a=4GsTxW34auoA:10 a=6B7uFKZ1SAwA:10 a=lfvU_ReahkwA:10 a=IkcTkHD0fZMA:10 a=ngU5ixn2AAAA:8 a=fWyWhr6xdMwA:10 a=MI32vHeHC_XyZ8H8-CkA:9 a=QEXdDO2ut3YA:10 a=qaYDxA4u8PgA:10 a=GFDHnmBnslQA:10 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=nbtrap.com; s=default; h=Content-Transfer-Encoding:Content-Type:MIME-Version:Message-ID:Date:Subject:To:From; bh=88rFcdndaEWu6VI+5LhU1DfcXfGBdL+vO9akbfJ2tuw=; b=pasnMtfT2a/QBiu/WlXfXEBEVpJlsUB1CzeuA52X7GgDswmEA0sCp2WRjfjBcvrRAWHkg2P/P4woaglstU8Pgvi9JeAJsG6XpEA/XG5obq09jQDDv2Dh56lpoSfFbWLe; Received: from [50.90.253.209] (port=34716 helo=Nathan-GNU) by host393.hostmonster.com with esmtpsa (TLSv1.2:CAMELLIA128-SHA:128) (Exim 4.82) (envelope-from ) id 1WTfAV-00013S-1h for bug-gnu-emacs@gnu.org; Fri, 28 Mar 2014 16:22:31 -0600 From: Nathan Trapuzzano To: bug-gnu-emacs@gnu.org Subject: json-encode-string incorrectly encodes extra-BMP characters Date: Fri, 28 Mar 2014 18:22:25 -0400 Message-ID: <87wqfegdzy.fsf@nbtrap.com> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.4.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Identified-User: {1585:host393.hostmonster.com:nbtrapco:nbtrap.com} {sentby:smtp auth 50.90.253.209 authed with nbtrap@nbtrap.com} X-detected-operating-system: by eggs.gnu.org: GNU/Linux 3.x X-detected-operating-system: by eggs.gnu.org: Error: Malformed IPv6 address (bad octet value). X-Received-From: 2001:4830:134:3::11 X-Spam-Score: -4.3 (----) X-Debbugs-Envelope-To: submit X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -4.3 (----) M-: (princ (json-encode "\U0001d11e")) =3D=3D> "\u1d11e" ;; should be "\ud834\udd1e" or "=F0=9D=84=9E" >From ECMA-404: To escape a code point that is not in the Basic Multilingual Plane, the character is represented as a twelve-character sequence, encoding the UTF-16 surrogate pair. So for example, a string containing only the G clef character (U+1D11E) may be represented as "\uD834\uDD1E". From debbugs-submit-bounces@debbugs.gnu.org Sun Oct 04 11:55:27 2015 Received: (at 17133-done) by debbugs.gnu.org; 4 Oct 2015 15:55:28 +0000 Received: from localhost ([127.0.0.1]:54495 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1ZildH-0008Kh-GZ for submit@debbugs.gnu.org; Sun, 04 Oct 2015 11:55:27 -0400 Received: from mail-lb0-f171.google.com ([209.85.217.171]:32925) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1ZildF-0008KZ-Kp for 17133-done@debbugs.gnu.org; Sun, 04 Oct 2015 11:55:26 -0400 Received: by lbos8 with SMTP id s8so46563190lbo.0 for <17133-done@debbugs.gnu.org>; Sun, 04 Oct 2015 08:55:24 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=from:to:cc:subject:references:date:in-reply-to:message-id :user-agent:mime-version:content-type:content-transfer-encoding; bh=lhxbt91a8F1YsWyDSnoBZFopWa6PC2s6Fj5PFwxU6tk=; b=sjZFRHg+fPmpH/dLpSuC9x8NMZ1YkUPhtbGBEwvIOgU2lB2nblKf+XovI5vH1MgOQG KObvu2PCSQzcJt6Yw9/wDpWVMIlHVgsgoSyR06uexbAtjeJmnPGrmh67ZZ/JXP8Idfup 6VlubrGYXtuFrwE9prPo6U2JeUG1VOXYWCx9BSJEjeKW3QMpNWJdK5BzqE14DMNN2e2n KGzJ04M9RW+jvr1GwtbEL2OBvuc+UG8E98eaJQVUfwhqdoQpktOMUxRc2K9u3tytkT4Z OK1rDBCLVyjMWFymzkxXh6JJc8GNlbyeYiX8iK1J6kb25SPQ2S6+OYSacxOvqSpJHhbH Qujg== X-Received: by 10.112.158.38 with SMTP id wr6mr9851264lbb.25.1443974124484; Sun, 04 Oct 2015 08:55:24 -0700 (PDT) Received: from x240 (cm-84.210.143.4.getinternet.no. [84.210.143.4]) by smtp.gmail.com with ESMTPSA id jk6sm3318723lbc.36.2015.10.04.08.55.23 (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Sun, 04 Oct 2015 08:55:23 -0700 (PDT) From: =?utf-8?Q?Simen_Heggest=C3=B8yl?= To: Nathan Trapuzzano Subject: Re: bug#17133: json-encode-string incorrectly encodes extra-BMP characters References: <87wqfegdzy.fsf@nbtrap.com> Date: Sun, 04 Oct 2015 17:55:22 +0200 In-Reply-To: <87wqfegdzy.fsf@nbtrap.com> (Nathan Trapuzzano's message of "Fri, 28 Mar 2014 18:22:25 -0400") Message-ID: <87egha7hn9.fsf@gmail.com> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.5 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Spam-Score: -0.7 (/) X-Debbugs-Envelope-To: 17133-done Cc: 17133-done@debbugs.gnu.org, dgutov@yandex.ru X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.7 (/) Nathan Trapuzzano writes: > M-: (princ (json-encode "\U0001d11e")) > =3D=3D> "\u1d11e" ;; should be "\ud834\udd1e" or "=F0=9D=84=9E" > >>>From ECMA-404: > > To escape a code point that is not in the Basic Multilingual Plane, > the character is represented as a twelve-character sequence, encoding > the UTF-16 surrogate pair. So for example, a string containing only > the G clef character (U+1D11E) may be represented as "\uD834\uDD1E". This seems to be working as expected in master now; (json-encode "\U0001d11e") produces "=F0=9D=84=9E" as described. -- Simen From unknown Mon Aug 18 11:33:16 2025 Received: (at fakecontrol) by fakecontrolmessage; To: internal_control@debbugs.gnu.org From: Debbugs Internal Request Subject: Internal Control Message-Id: bug archived. Date: Mon, 02 Nov 2015 12:24:04 +0000 User-Agent: Fakemail v42.6.9 # This is a fake control message. # # The action: # bug archived. thanks # This fakemail brought to you by your local debbugs # administrator