From debbugs-submit-bounces@debbugs.gnu.org Wed Apr 14 05:02:03 2021 Received: (at submit) by debbugs.gnu.org; 14 Apr 2021 09:02:03 +0000 Received: from localhost ([127.0.0.1]:33371 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1lWbPO-0006nw-QF for submit@debbugs.gnu.org; Wed, 14 Apr 2021 05:02:03 -0400 Received: from lists.gnu.org ([209.51.188.17]:58226) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1lWbPM-0006nV-63 for submit@debbugs.gnu.org; Wed, 14 Apr 2021 05:02:00 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]:33382) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1lWbPL-0000bU-Uv for bug-gnu-emacs@gnu.org; Wed, 14 Apr 2021 05:01:59 -0400 Received: from snd00006.auone-net.jp ([111.86.247.6]:53025 helo=dmta0003.auone-net.jp) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1lWbPI-00047A-2c for bug-gnu-emacs@gnu.org; Wed, 14 Apr 2021 05:01:58 -0400 Received: from kzhr.d1.dion.ne.jp by dmta0003.auone-net.jp with ESMTP id <20210414090148193.RDRP.44995.kzhr.d1.dion.ne.jp@dmta0003.auone-net.jp>; Wed, 14 Apr 2021 18:01:48 +0900 Date: Wed, 14 Apr 2021 18:01:46 +0900 Message-ID: <86tuo96xz9.wl--xmue@d1.dion.ne.jp> From: Kazuhiro Ito To: bug-gnu-emacs@gnu.org Subject: 28.0.50; japanese-hankaku unnatural or misconversion results X-Hashcash: 1:20:210414:bug-gnu-emacs@gnu.org::889ZYV13ODz6vFGQ:00000000000000000000000000000000000000005ri/ User-Agent: Wanderlust/2.15.9 (Almost Unreal) SEMI-EPG/1.14.7 (Harue) FLIM-LB/1.14.9 (=?ISO-8859-4?Q?Goj=F2?=) APEL-LB/10.8 EasyPG/1.0.0 Emacs/28.0.50 (x86_64-w64-mingw32) MULE/6.0 (HANACHIRUSATO) MIME-Version: 1.0 (generated by SEMI-EPG 1.14.7 - "Harue") Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Received-SPF: pass client-ip=111.86.247.6; envelope-from=kzhr@d1.dion.ne.jp; helo=dmta0003.auone-net.jp X-Spam_score_int: -18 X-Spam_score: -1.9 X-Spam_bar: - X-Spam_report: (-1.9 / 5.0 requ) BAYES_00=-1.9, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-Spam-Score: -1.3 (-) X-Debbugs-Envelope-To: submit X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -2.3 (--) 1. japanese-hankaku[-region] with ASCII-ONLY option converts Japanese panctuation characters and prolonged sound mark to ascii. (japanese-hankaku "=E3=82=B1=E3=83=BC=E3=82=AD=E3=80=81=E3=83=89=E3=83=BC= =E3=83=8A=E3=83=84=E3=80=82" t) -> "=E3=82=B1-=E3=82=AD,=E3=83=89-=E3=83=8A=E3=83=84." the result is very unnatural because "=E3=83=BC", "=E3=80=81" and "=E3=80= =82" are normally used among Japanese characters which are not converted to hankaku in this case. I hope they are kept as is (the result should be "=E3=82=B1=E3=83=BC=E3=82=AD=E3=80=81=E3=83=89=E3=83=BC=E3=83=8A=E3=83=84= =E3=80=82"). 2. japanese-hankaku[-region] without ASCII-ONLY option and japanese-zenkaku[-region] fails to convert Latin punctuations. (japanese-zenkaku "A, B, C.") -> "=EF=BC=A1=E3=80=81=E3=80=80=EF=BC=A2=E3=80=81=E3=80=80=EF=BC=A3=E3=80= =82" (japanese-hankaku "=EF=BC=A1=EF=BC=8C=EF=BC=A2=EF=BC=8C=EF=BC=A3=EF=BC=8E") -> "A=EF=BD=A4B=EF=BD=A4C=EF=BD=A1" They should be "=EF=BC=A1=EF=BC=8C=E3=80=80=EF=BC=A2=EF=BC=8C=E3=80=80=EF= =BC=A3=EF=BC=8E" and "A,B,C." respectively. Below patch fixes problems. diff --git a/lisp/language/japan-util.el b/lisp/language/japan-util.el index 3f1fb2b749..8b80599d99 100644 --- a/lisp/language/japan-util.el +++ b/lisp/language/japan-util.el @@ -96,9 +96,9 @@ japanese-kana-table (put-char-code-property jisx0201 'jisx0208 katakana))))) =20 (defconst japanese-symbol-table - '((?\=E3=80=80 ?\ ) (?=EF=BC=8C ?, ?=EF=BD=A4) (?=EF=BC=8E ?. ?=EF=BD=A1= ) (?=E3=80=81 ?, ?=EF=BD=A4) (?=E3=80=82 ?. ?=EF=BD=A1) (?=E3=83=BB nil ?= =EF=BD=A5) + '((?\=E3=80=80 ?\ ) (?=EF=BC=8C ?,) (?=EF=BC=8E ?.) (?=E3=80=81 nil ?=EF= =BD=A4) (?=E3=80=82 nil ?=EF=BD=A1) (?=E3=83=BB nil ?=EF=BD=A5) (?=EF=BC=9A ?:) (?=EF=BC=9B ?\;) (?=EF=BC=9F ??) (?=EF=BC=81 ?!) (?=E3= =82=9B nil ?=EF=BE=9E) (?=E3=82=9C nil ?=EF=BE=9F) - (?=C2=B4 ?') (?=EF=BD=80 ?`) (?=EF=BC=BE ?^) (?=EF=BC=BF ?_) (?=E3=83= =BC ?- ?=EF=BD=B0) (?=E2=80=94 ?-) (?=E2=80=90 ?-) + (?=C2=B4 ?') (?=EF=BD=80 ?`) (?=EF=BC=BE ?^) (?=EF=BC=BF ?_) (?=E3=83= =BC nil ?=EF=BD=B0) (?=E2=80=94 ?-) (?=E2=80=90 ?-) (?=EF=BC=8F ?/) (?=EF=BC=BC ?\\) (?=E3=80=9C ?~) (?=EF=BD=9C ?|) (?= =E2=80=98 ?`) (?=E2=80=99 ?') (?=E2=80=9C ?\") (?=E2=80=9D ?\") (?\=EF=BC=88 ?\() (?\=EF=BC=89 ?\)) (?\=EF=BC=BB ?\[) (?\=EF=BC=BD ?\]= ) (?\=EF=BD=9B ?{) (?\=EF=BD=9D ?}) (?=E3=80=88 ?<) (?=E3=80=89 ?>) (?\=E3=80=8C nil ?\=EF=BD=A2) (?\=E3= =80=8D nil ?\=EF=BD=A3) --=20 Kazuhiro Ito From debbugs-submit-bounces@debbugs.gnu.org Tue Apr 20 20:02:35 2021 Received: (at control) by debbugs.gnu.org; 21 Apr 2021 00:02:35 +0000 Received: from localhost ([127.0.0.1]:56392 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1lZ0KB-0000rg-J3 for submit@debbugs.gnu.org; Tue, 20 Apr 2021 20:02:35 -0400 Received: from mail-pg1-f176.google.com ([209.85.215.176]:39581) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1lZ0K9-0000rO-8X for control@debbugs.gnu.org; Tue, 20 Apr 2021 20:02:33 -0400 Received: by mail-pg1-f176.google.com with SMTP id s22so6905189pgk.6 for ; Tue, 20 Apr 2021 17:02:33 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:mime-version:date:message-id:subject:to; bh=0MHG5kz8M32vwWFz9aC8sCeykUvBn1B28fe3IxbCZ/Y=; b=thJMlVk/dxO7lE+V4j7i4iDTEC5VLw+lobykn2J3V5SfMEjyOqEnRPAJ+tkVsZq2tI ANS695XME53yFz3X+r7MWtux5NRmbs+XXpC3U7RNegl4HkjTrmpnQBOiXtic/SNxOZxm XSW1/DtSz3dJG/06um7BE+2ejBY7/k03SYmGbWe1M2tSdoeJQaOUdzbxTITyiSZ0Owmo GaWpAHay7NoZlPNHBJvqmk45GGDW5KVAdY/2Ep8ni3ajf90cKe/ghrUGr/Wir8or/V8Z 47S+8HajrTqeXwQ+HE4DszGCErB6FK8tFLDBXWOiU0/qcoxeO8mcQa937lC/eZhmTGZQ u+NA== X-Gm-Message-State: AOAM533IQpj2pCQxT3wTgV94hNbfWH+24ig+2NoqMe4ARZyAJ0/NNvx2 KO8fYeG9ymz2naOQxs5aDd/2nLlpS6kgXTzwPMuQg3lm X-Google-Smtp-Source: ABdhPJwvDw+JfhKOSDRx/ro9CmLE7PHsKHl77CTeVpcpJHmNp7gUacP1tgoq4tqi9A3GcAja+TFH+bqlqfO8KfM1Tnc= X-Received: by 2002:a62:5bc1:0:b029:20d:69a5:189 with SMTP id p184-20020a625bc10000b029020d69a50189mr27280546pfb.57.1618963347289; Tue, 20 Apr 2021 17:02:27 -0700 (PDT) Received: from 753933720722 named unknown by gmailapi.google.com with HTTPREST; Tue, 20 Apr 2021 19:02:23 -0500 From: Stefan Kangas MIME-Version: 1.0 Date: Tue, 20 Apr 2021 19:02:23 -0500 Message-ID: Subject: To: control@debbugs.gnu.org Content-Type: text/plain; charset="UTF-8" X-Spam-Score: 2.5 (++) X-Spam-Report: Spam detection software, running on the system "debbugs.gnu.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see the administrator of that system for details. Content preview: tags 47767 + patch thanks Content analysis details: (2.5 points, 10.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- 0.0 SPF_HELO_NONE SPF: HELO does not publish an SPF Record 0.0 FREEMAIL_FROM Sender email is commonly abused enduser mail provider (stefankangas[at]gmail.com) -0.0 SPF_PASS SPF: sender matches SPF record 0.2 HEADER_FROM_DIFFERENT_DOMAINS From and EnvelopeFrom 2nd level mail domains are different -0.0 RCVD_IN_DNSWL_NONE RBL: Sender listed at https://www.dnswl.org/, no trust [209.85.215.176 listed in list.dnswl.org] -0.0 RCVD_IN_MSPIKE_H2 RBL: Average reputation (+2) [209.85.215.176 listed in wl.mailspike.net] 2.0 BLANK_SUBJECT Subject is present but empty 0.0 UNPARSEABLE_RELAY Informational: message has unparseable relay lines 0.2 FREEMAIL_FORGED_FROMDOMAIN 2nd level domains in From and EnvelopeFrom freemail headers are different X-Debbugs-Envelope-To: control X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 1.5 (+) X-Spam-Report: Spam detection software, running on the system "debbugs.gnu.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see the administrator of that system for details. Content preview: tags 47767 + patch thanks Content analysis details: (1.5 points, 10.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -0.0 RCVD_IN_DNSWL_NONE RBL: Sender listed at https://www.dnswl.org/, no trust [209.85.215.176 listed in list.dnswl.org] -0.0 RCVD_IN_MSPIKE_H2 RBL: Average reputation (+2) [209.85.215.176 listed in wl.mailspike.net] 0.0 SPF_HELO_NONE SPF: HELO does not publish an SPF Record 0.0 FREEMAIL_FROM Sender email is commonly abused enduser mail provider (stefankangas[at]gmail.com) -0.0 SPF_PASS SPF: sender matches SPF record 0.2 HEADER_FROM_DIFFERENT_DOMAINS From and EnvelopeFrom 2nd level mail domains are different 2.0 BLANK_SUBJECT Subject is present but empty 0.0 UNPARSEABLE_RELAY Informational: message has unparseable relay lines 0.2 FREEMAIL_FORGED_FROMDOMAIN 2nd level domains in From and EnvelopeFrom freemail headers are different -1.0 MAILING_LIST_MULTI Multiple indicators imply a widely-seen list manager tags 47767 + patch thanks From debbugs-submit-bounces@debbugs.gnu.org Wed May 05 11:10:43 2021 Received: (at 47767) by debbugs.gnu.org; 5 May 2021 15:10:43 +0000 Received: from localhost ([127.0.0.1]:60859 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1leJAg-0004vI-Qn for submit@debbugs.gnu.org; Wed, 05 May 2021 11:10:43 -0400 Received: from quimby.gnus.org ([95.216.78.240]:37926) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1leJAf-0004v7-FU for 47767@debbugs.gnu.org; Wed, 05 May 2021 11:10:41 -0400 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnus.org; s=20200322; h=Content-Transfer-Encoding:Content-Type:MIME-Version:Message-ID :In-Reply-To:Date:References:Subject:Cc:To:From:Sender:Reply-To:Content-ID: Content-Description:Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc :Resent-Message-ID:List-Id:List-Help:List-Unsubscribe:List-Subscribe: List-Post:List-Owner:List-Archive; bh=hvHU7D3Kr2PI/+Boz9vn3LZfI84XzmuSVj0WjsGEDZg=; b=anykMsCEIrNjwHVjOp6/Bm+o17 X8UzZYMX0f7V6hS8JBov7ftrXHIdW4XwSJYfNDH2C5m34oo/k/jF/lYCgGXLMn8q8x+0ryuzheEUw jbUjkh9KWjIu/dq6JIRchjO35V3bQxolWltQ7wbKThFWpPSDopLzZ4aX052lxAWMwahY=; Received: from cm-84.212.220.105.getinternet.no ([84.212.220.105] helo=xo) by quimby.gnus.org with esmtpsa (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1leJAV-0002hM-UQ; Wed, 05 May 2021 17:10:34 +0200 From: Lars Ingebrigtsen To: Kazuhiro Ito Subject: Re: bug#47767: 28.0.50; japanese-hankaku unnatural or misconversion results References: <86tuo96xz9.wl--xmue@d1.dion.ne.jp> X-Now-Playing: Chrome Hoof's _Pre-Emptive False Rapture_: "Spokes of Uridium" Date: Wed, 05 May 2021 17:10:31 +0200 In-Reply-To: <86tuo96xz9.wl--xmue@d1.dion.ne.jp> (Kazuhiro Ito's message of "Wed, 14 Apr 2021 18:01:46 +0900") Message-ID: <87pmy543o8.fsf@gnus.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Spam-Report: Spam detection software, running on the system "quimby.gnus.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see @@CONTACT_ADDRESS@@ for details. Content preview: Kazuhiro Ito writes: > 1. japanese-hankaku[-region] with ASCII-ONLY option converts Japanese > panctuation characters and prolonged sound mark to ascii. > > (japanese-hankaku "ケーキ、ドーナツ。" t) > -> "ケ-キ [...] Content analysis details: (-2.9 points, 5.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -1.0 ALL_TRUSTED Passed through trusted hosts only via SMTP -1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1% [score: 0.0000] X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 47767 Cc: 47767@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) Kazuhiro Ito writes: > 1. japanese-hankaku[-region] with ASCII-ONLY option converts Japanese > panctuation characters and prolonged sound mark to ascii. > > (japanese-hankaku "=E3=82=B1=E3=83=BC=E3=82=AD=E3=80=81=E3=83=89=E3=83=BC= =E3=83=8A=E3=83=84=E3=80=82" t) > -> "=E3=82=B1-=E3=82=AD,=E3=83=89-=E3=83=8A=E3=83=84." > > the result is very unnatural because "=E3=83=BC", "=E3=80=81" and "=E3=80= =82" are normally > used among Japanese characters which are not converted to hankaku in > this case. I hope they are kept as is (the result should be > "=E3=82=B1=E3=83=BC=E3=82=AD=E3=80=81=E3=83=89=E3=83=BC=E3=83=8A=E3=83=84= =E3=80=82"). > > 2. japanese-hankaku[-region] without ASCII-ONLY option and > japanese-zenkaku[-region] fails to convert Latin punctuations. > > (japanese-zenkaku "A, B, C.") > -> "=EF=BC=A1=E3=80=81=E3=80=80=EF=BC=A2=E3=80=81=E3=80=80=EF=BC=A3=E3=80= =82" > > (japanese-hankaku "=EF=BC=A1=EF=BC=8C=EF=BC=A2=EF=BC=8C=EF=BC=A3=EF=BC=8E= ") > -> "A=EF=BD=A4B=EF=BD=A4C=EF=BD=A1" > > They should be "=EF=BC=A1=EF=BC=8C=E3=80=80=EF=BC=A2=EF=BC=8C=E3=80=80=EF= =BC=A3=EF=BC=8E" and "A,B,C." respectively. > > Below patch fixes problems. Thanks; applied to Emacs 28. I don't know Japanese, but since nobody else has piped up about this patch in three weeks, I'm applying it. If other Japanese-speaking people disagree with this patch, we can revert it. --=20 (domestic pets only, the antidote for overdose, milk.) bloggy blog: http://lars.ingebrigtsen.no From debbugs-submit-bounces@debbugs.gnu.org Wed May 05 11:10:47 2021 Received: (at control) by debbugs.gnu.org; 5 May 2021 15:10:47 +0000 Received: from localhost ([127.0.0.1]:60862 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1leJAl-0004vV-23 for submit@debbugs.gnu.org; Wed, 05 May 2021 11:10:47 -0400 Received: from quimby.gnus.org ([95.216.78.240]:37940) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1leJAj-0004vB-Hs for control@debbugs.gnu.org; Wed, 05 May 2021 11:10:45 -0400 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnus.org; s=20200322; h=Subject:From:To:Message-Id:Date:Sender:Reply-To:Cc: MIME-Version:Content-Type:Content-Transfer-Encoding:Content-ID: Content-Description:Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc :Resent-Message-ID:In-Reply-To:References:List-Id:List-Help:List-Unsubscribe: List-Subscribe:List-Post:List-Owner:List-Archive; bh=heJnoBkPTqAEPTMtazSMD3PrzGrGvcnRrS3C+h6Q0n0=; b=VyJL18LXuGaaiBv1OWmPP+jTMW K59sugIE9J7REaHyayhEY4uq3mY2CEVgKsTAq1le18IXabZSmsi1E+f2qql/im1OVwxYR2vJLc48c QDbBDt6E/oNkz3KVgJDobwQRpgMWdCPxoigKWacKTTO1td4O9FJ3hlfKkb0hd08F9Xes=; Received: from cm-84.212.220.105.getinternet.no ([84.212.220.105] helo=xo) by quimby.gnus.org with esmtpsa (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1leJAc-0002hU-4b for control@debbugs.gnu.org; Wed, 05 May 2021 17:10:40 +0200 Date: Wed, 05 May 2021 17:10:37 +0200 Message-Id: <87o8dp43o2.fsf@gnus.org> To: control@debbugs.gnu.org From: Lars Ingebrigtsen Subject: control message for bug #47767 X-Spam-Report: Spam detection software, running on the system "quimby.gnus.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see @@CONTACT_ADDRESS@@ for details. Content preview: tags 47767 fixed close 47767 28.1 quit Content analysis details: (-2.9 points, 5.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -1.0 ALL_TRUSTED Passed through trusted hosts only via SMTP -1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1% [score: 0.0000] X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: control X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) tags 47767 fixed close 47767 28.1 quit From unknown Thu Jun 19 16:22:19 2025 Received: (at fakecontrol) by fakecontrolmessage; To: internal_control@debbugs.gnu.org From: Debbugs Internal Request Subject: Internal Control Message-Id: bug archived. Date: Thu, 03 Jun 2021 11:24:09 +0000 User-Agent: Fakemail v42.6.9 # This is a fake control message. # # The action: # bug archived. thanks # This fakemail brought to you by your local debbugs # administrator