From unknown Sun Aug 17 01:00:56 2025
From: Nathan Trapuzzano
To: 17133@debbugs.gnu.org
X-Debbugs-Original-To: bug-gnu-emacs@gnu.org
Subject: bug#17133: json-encode-string incorrectly encodes extra-BMP characters
Date: Fri, 28 Mar 2014 18:22:25 -0400
Message-ID: <87wqfegdzy.fsf@nbtrap.com>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.4.50 (gnu/linux)
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8

M-: (princ (json-encode "\U0001d11e"))
==> "\u1d11e" ;; should be "\ud834\udd1e" or "𝄞"

>From ECMA-404:

  To escape a code point that is not in the Basic Multilingual Plane,
  the character is represented as a twelve-character sequence, encoding
  the UTF-16 surrogate pair. So for example, a string containing only
  the G clef character (U+1D11E) may be represented as "\uD834\uDD1E".

From unknown Sun Aug 17 01:00:56 2025
From: help-debbugs@gnu.org (GNU bug Tracking System)
To: Nathan Trapuzzano
Subject: bug#17133: closed (Re: bug#17133: json-encode-string incorrectly encodes extra-BMP characters)
References: <87egha7hn9.fsf@gmail.com> <87wqfegdzy.fsf@nbtrap.com>
Reply-To: 17133@debbugs.gnu.org
Date: Sun, 04 Oct 2015 15:56:02 +0000
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary="----------=_1443974162-32081-1"

This is a multi-part message in MIME format...

------------=_1443974162-32081-1
Content-Disposition: inline
Content-Type: text/plain; charset="utf-8"

Your bug report

#17133: json-encode-string incorrectly encodes extra-BMP characters

which was filed against the emacs package, has been closed.

The explanation is attached below, along with your original report.
If you require more details, please reply to 17133@debbugs.gnu.org.

-- 
17133: http://debbugs.gnu.org/cgi/bugreport.cgi?bug=17133
GNU Bug Tracking System
Contact help-debbugs@gnu.org with problems

------------=_1443974162-32081-1
Content-Type: message/rfc822
Content-Disposition: inline

From: Simen Heggestøyl
To: Nathan Trapuzzano
Cc: 17133-done@debbugs.gnu.org, dgutov@yandex.ru
Subject: Re: bug#17133: json-encode-string incorrectly encodes extra-BMP characters
References: <87wqfegdzy.fsf@nbtrap.com>
In-Reply-To: <87wqfegdzy.fsf@nbtrap.com> (Nathan Trapuzzano's message of "Fri, 28 Mar 2014 18:22:25 -0400")
Date: Sun, 04 Oct 2015 17:55:22 +0200
Message-ID: <87egha7hn9.fsf@gmail.com>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.5 (gnu/linux)
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8

Nathan Trapuzzano writes:

> M-: (princ (json-encode "\U0001d11e"))
> ==> "\u1d11e" ;; should be "\ud834\udd1e" or "𝄞"
>
> From ECMA-404:
>
>   To escape a code point that is not in the Basic Multilingual Plane,
>   the character is represented as a twelve-character sequence, encoding
>   the UTF-16 surrogate pair. So for example, a string containing only
>   the G clef character (U+1D11E) may be represented as "\uD834\uDD1E".

This seems to be working as expected in master now;
(json-encode "\U0001d11e") produces "𝄞" as described.

-- 
Simen

------------=_1443974162-32081-1--
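
Note on the escape discussed in this report: the twelve-character form
quoted from ECMA-404 follows from the usual UTF-16 surrogate arithmetic.
A minimal Emacs Lisp sketch of that arithmetic is given below. The
function name `my-json-escape-astral' is hypothetical and is not part of
json.el, and this is not a description of how json.el encodes strings; it
only illustrates how U+1D11E maps to \ud834\udd1e.

```elisp
;; Hypothetical helper, for illustration only (not part of json.el).
(defun my-json-escape-astral (char)
  "Return the twelve-character JSON escape for CHAR, a code point above U+FFFF."
  (unless (and (>= char #x10000) (<= char #x10FFFF))
    (error "Not a supplementary-plane code point: %X" char))
  (let* ((offset (- char #x10000))                  ; 20-bit value
         (high (+ #xD800 (ash offset -10)))         ; top 10 bits
         (low (+ #xDC00 (logand offset #x3FF))))    ; bottom 10 bits
    (format "\\u%04x\\u%04x" high low)))

(princ (my-json-escape-astral #x1D11E))
;; prints \ud834\udd1e
```

Writing the character itself as UTF-8, which is what the reply above
reports for master, is also valid JSON; the escaped form should only be
needed when the output has to stay within ASCII.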