From unknown Fri Jun 20 18:00:51 2025 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-Mailer: MIME-tools 5.509 (Entity 5.509) Content-Type: text/plain; charset=utf-8 From: bug#19653 <19653@debbugs.gnu.org> To: bug#19653 <19653@debbugs.gnu.org> Subject: Status: ispell misalignment with hunspell when Unicode apostrophe is used Reply-To: bug#19653 <19653@debbugs.gnu.org> Date: Sat, 21 Jun 2025 01:00:51 +0000 retitle 19653 ispell misalignment with hunspell when Unicode apostrophe is = used reassign 19653 emacs submitter 19653 Tobias Getzner severity 19653 normal tag 19653 moreinfo thanks From debbugs-submit-bounces@debbugs.gnu.org Thu Jan 22 09:40:28 2015 Received: (at submit) by debbugs.gnu.org; 22 Jan 2015 14:40:28 +0000 Received: from localhost ([127.0.0.1]:52499 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1YEIvr-0006rj-7M for submit@debbugs.gnu.org; Thu, 22 Jan 2015 09:40:27 -0500 Received: from eggs.gnu.org ([208.118.235.92]:41510) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1YEIvl-0006rR-3P for submit@debbugs.gnu.org; Thu, 22 Jan 2015 09:40:21 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1YEIve-0006oZ-Gf for submit@debbugs.gnu.org; Thu, 22 Jan 2015 09:40:15 -0500 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=0.0 required=5.0 tests=BAYES_20,FREEMAIL_FROM autolearn=disabled version=3.3.2 Received: from lists.gnu.org ([2001:4830:134:3::11]:35659) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YEIve-0006oV-Du for submit@debbugs.gnu.org; Thu, 22 Jan 2015 09:40:14 -0500 Received: from eggs.gnu.org ([2001:4830:134:3::10]:43220) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YEIvd-0000Be-AK for bug-gnu-emacs@gnu.org; Thu, 22 Jan 2015 09:40:14 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1YEIvY-0006mh-Kk for bug-gnu-emacs@gnu.org; Thu, 22 Jan 2015 09:40:13 -0500 Received: from mout.gmx.net ([212.227.15.15]:60334) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1YEIvY-0006hA-Bd for bug-gnu-emacs@gnu.org; Thu, 22 Jan 2015 09:40:08 -0500 Received: from glenalbyn.linguistics.ruhr-uni-bochum.de ([134.147.14.84]) by mail.gmx.com (mrgmx002) with ESMTPSA (Nemesis) id 0MVN0w-1YDrsx3EMA-00Yi26 for ; Thu, 22 Jan 2015 15:40:05 +0100 Message-ID: <1421937605.31090.19.camel@gmx.de> Subject: ispell misalignment with hunspell when Unicode apostrophe is used From: Tobias Getzner To: bug-gnu-emacs@gnu.org Date: Thu, 22 Jan 2015 15:40:05 +0100 Content-Type: text/plain; charset="UTF-8" X-Mailer: Evolution 3.12.10 Mime-Version: 1.0 Content-Transfer-Encoding: quoted-printable X-Provags-ID: V03:K0:Yy/V3V+KNSOiaK41l7FRBW8+Ubvt/C7UL4FpgmZPGexfaBYZR82 EoWgR8xaIKLXDtQDNGvwPz9udNEcFMdeDL+rS4rUdqd6OVeATkoxgbWcRMYe+g3/8hfRixL JwsglfsWJGhHb97OBEp/9VNqB1ryfNREFmUMqaoTOSJU6LWgiZprdgFIrCjNpUjyNqN82mC nXKNEIBMtY7DIj72uqUoQ== X-UI-Out-Filterresults: notjunk:1; X-detected-operating-system: by eggs.gnu.org: GNU/Linux 3.x [generic] X-detected-operating-system: by eggs.gnu.org: Error: Malformed IPv6 address (bad octet value). X-Received-From: 2001:4830:134:3::11 X-Spam-Score: -4.1 (----) X-Debbugs-Envelope-To: submit X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -4.1 (----) Hello, I=E2=80=99ve noticed that when ispell.el (Emacs 24.4.1) is using hunspell (= v. 1.3.3) to spell-check a buffer containing the typographically correct apostrophe (=C2=AB=E2=80=99=C2=BB; U+2019), ispell will error out with the = message =C2=ABispell misalignment=C2=BB. The problem can be reproduced by setting ispell-program-name to =C2=ABhunspell=C2=BB, and spell-checking a buffer containing the string =C2= =ABabc=E2=80=99s zzz=C2=BB. This yields the following error: > ispell-process-line: Ispell misalignment: word `zzz' point 9; probably in= compatible versions This seems to be a regression from 24.3, where hunspell support was working (with the caveat that the apostrophe had to be manually added to the dictionary=E2=80=99s =C2=ABOTHERCHARS=C2=BB). Best regards, Tobias From debbugs-submit-bounces@debbugs.gnu.org Thu Jan 22 12:42:09 2015 Received: (at 19653) by debbugs.gnu.org; 22 Jan 2015 17:42:09 +0000 Received: from localhost ([127.0.0.1]:53189 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1YELlh-00054B-1K for submit@debbugs.gnu.org; Thu, 22 Jan 2015 12:42:09 -0500 Received: from mtaout23.012.net.il ([80.179.55.175]:35862) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1YELla-00053a-EI for 19653@debbugs.gnu.org; Thu, 22 Jan 2015 12:42:04 -0500 Received: from conversion-daemon.a-mtaout23.012.net.il by a-mtaout23.012.net.il (HyperSendmail v2007.08) id <0NIL007009Y9IQ00@a-mtaout23.012.net.il> for 19653@debbugs.gnu.org; Thu, 22 Jan 2015 19:41:55 +0200 (IST) Received: from HOME-C4E4A596F7 ([87.69.4.28]) by a-mtaout23.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0NIL007AHAHVEM90@a-mtaout23.012.net.il>; Thu, 22 Jan 2015 19:41:55 +0200 (IST) Date: Thu, 22 Jan 2015 19:41:54 +0200 From: Eli Zaretskii Subject: Re: bug#19653: ispell misalignment with hunspell when Unicode apostrophe is used In-reply-to: <1421937605.31090.19.camel@gmx.de> X-012-Sender: halo1@inter.net.il To: Tobias Getzner Message-id: <83egqmvhjx.fsf@gnu.org> MIME-version: 1.0 Content-type: text/plain; charset=utf-8 Content-transfer-encoding: 8BIT References: <1421937605.31090.19.camel@gmx.de> X-Spam-Score: 1.0 (+) X-Debbugs-Envelope-To: 19653 Cc: 19653@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list Reply-To: Eli Zaretskii List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 1.0 (+) > From: Tobias Getzner > Date: Thu, 22 Jan 2015 15:40:05 +0100 > > I’ve noticed that when ispell.el (Emacs 24.4.1) is using hunspell (v. > 1.3.3) to spell-check a buffer containing the typographically correct > apostrophe («’»; U+2019), ispell will error out with the message > «ispell misalignment». > > The problem can be reproduced by setting ispell-program-name to > «hunspell», and spell-checking a buffer containing the string «abc’s > zzz». This yields the following error: > > > ispell-process-line: Ispell misalignment: word `zzz' point 9; probably incompatible versions I cannot reproduce this with Emacs 24.4 and Hunspell 1.3.2 (heavily patched to fix known problems in Hunspell). You didn't provide enough information for me to be sure I did the same as you, so here are the possible explanations for the different experience: . I use a different version of Hunspell, and yours has a bug. Hunspell is known to have a problem with reporting mis-spelled words with byte offsets, whereas Emacs expects character offsets, so dictionaries encoded in UTF-8 cause symptoms similar to those you report. My Hunspell is patched to avoid this problem. . I didn't change OTHERCHARS. Frankly, I think doing this asks for trouble, since the speller still uses the characters recorded in the .aff file. . You didn't tell which dictionary you used. I tried en_US and de_DE, and none of them produced these problems. Maybe this is specific to some dictionary you used. In particular, the encoding of that dictionary is important vs the encoding you tell ispell.el to use (if you customized that part). > This seems to be a regression from 24.3, where hunspell support was > working (with the caveat that the apostrophe had to be manually added > to the dictionary’s «OTHERCHARS»). Are you saying that the same version of Hunspell with the same dictionary worked in Emacs 24.3, where Emacs 24.4 doesn't? If so, please try to eliminate or at least minimize your ispell-related customizations, and try again. If the problem persists, please show the minimal set of customizations to reproduce the problem. From debbugs-submit-bounces@debbugs.gnu.org Sat Dec 26 11:53:55 2015 Received: (at 19653) by debbugs.gnu.org; 26 Dec 2015 16:53:55 +0000 Received: from localhost ([127.0.0.1]:41874 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84) (envelope-from ) id 1aCs6M-0000iT-Tu for submit@debbugs.gnu.org; Sat, 26 Dec 2015 11:53:55 -0500 Received: from hermes.netfonds.no ([80.91.224.195]:35592) by debbugs.gnu.org with esmtp (Exim 4.84) (envelope-from ) id 1aCs6K-0000i9-QR for 19653@debbugs.gnu.org; Sat, 26 Dec 2015 11:53:53 -0500 Received: from 2.150.58.24.tmi.telenormobil.no ([2.150.58.24] helo=mouse) by hermes.netfonds.no with esmtpsa (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.72) (envelope-from ) id 1aCs5x-0002HY-Rw; Sat, 26 Dec 2015 17:53:30 +0100 From: Lars Ingebrigtsen To: Tobias Getzner Subject: Re: bug#19653: ispell misalignment with hunspell when Unicode apostrophe is used References: <1421937605.31090.19.camel@gmx.de> Date: Sat, 26 Dec 2015 17:53:28 +0100 In-Reply-To: <1421937605.31090.19.camel@gmx.de> (Tobias Getzner's message of "Thu, 22 Jan 2015 15:40:05 +0100") Message-ID: <87h9j5m9rr.fsf@gnus.org> User-Agent: Gnus/5.130014 (Ma Gnus v0.14) Emacs/25.1.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable X-MailScanner-ID: 1aCs5x-0002HY-Rw X-Netfonds-MailScanner: Found to be clean X-Netfonds-MailScanner-From: larsi@gnus.org MailScanner-NULL-Check: 1451753610.12428@8ZacIxB9UvtDevnGeTS4Ww X-Spam-Status: No X-Spam-Score: -0.7 (/) X-Debbugs-Envelope-To: 19653 Cc: 19653@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.7 (/) Tobias Getzner writes: > I=92ve noticed that when ispell.el (Emacs 24.4.1) is using hunspell (v. > 1.3.3) to spell-check a buffer containing the typographically correct > apostrophe (=AB=92=BB; U+2019), ispell will error out with the message > =ABispell misalignment=BB. There was an earlier similar report where the conclusion was that hunspell was buggy, but a new version of hunspell fixed the problem... --=20 (domestic pets only, the antidote for overdose, milk.) bloggy blog: http://lars.ingebrigtsen.no From debbugs-submit-bounces@debbugs.gnu.org Sat Dec 26 11:54:03 2015 Received: (at control) by debbugs.gnu.org; 26 Dec 2015 16:54:03 +0000 Received: from localhost ([127.0.0.1]:41886 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84) (envelope-from ) id 1aCs6U-0000jM-OY for submit@debbugs.gnu.org; Sat, 26 Dec 2015 11:54:02 -0500 Received: from hermes.netfonds.no ([80.91.224.195]:35606) by debbugs.gnu.org with esmtp (Exim 4.84) (envelope-from ) id 1aCs6S-0000iq-Vr for control@debbugs.gnu.org; Sat, 26 Dec 2015 11:54:01 -0500 Received: from 2.150.58.24.tmi.telenormobil.no ([2.150.58.24] helo=mouse) by hermes.netfonds.no with esmtpsa (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.72) (envelope-from ) id 1aCs62-0002Hy-Qf for control@debbugs.gnu.org; Sat, 26 Dec 2015 17:53:35 +0100 Date: Sat, 26 Dec 2015 17:53:33 +0100 Message-Id: <87fuypm9rm.fsf@gnus.org> To: control@debbugs.gnu.org From: Lars Ingebrigtsen Subject: control message for bug #19653 X-MailScanner-ID: 1aCs62-0002Hy-Qf X-Netfonds-MailScanner: Found to be clean X-Netfonds-MailScanner-From: larsi@gnus.org MailScanner-NULL-Check: 1451753615.81802@MbgSTcJeFXy2JsFS4nXTpw X-Spam-Status: No X-Spam-Score: -0.7 (/) X-Debbugs-Envelope-To: control X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.7 (/) close 19653 From unknown Fri Jun 20 18:00:51 2025 Received: (at fakecontrol) by fakecontrolmessage; To: internal_control@debbugs.gnu.org From: Debbugs Internal Request Subject: Internal Control Message-Id: bug archived. Date: Sun, 24 Jan 2016 12:24:08 +0000 User-Agent: Fakemail v42.6.9 # This is a fake control message. # # The action: # bug archived. thanks # This fakemail brought to you by your local debbugs # administrator From debbugs-submit-bounces@debbugs.gnu.org Fri Oct 21 01:03:48 2016 Received: (at control) by debbugs.gnu.org; 21 Oct 2016 05:03:48 +0000 Received: from localhost ([127.0.0.1]:43004 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1bxRzg-00019g-Ib for submit@debbugs.gnu.org; Fri, 21 Oct 2016 01:03:48 -0400 Received: from mail-qk0-f170.google.com ([209.85.220.170]:34521) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1bxRzf-00019T-6Y for control@debbugs.gnu.org; Fri, 21 Oct 2016 01:03:47 -0400 Received: by mail-qk0-f170.google.com with SMTP id f128so125253823qkb.1 for ; Thu, 20 Oct 2016 22:03:47 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ftfl.ca; s=google; h=from:to:date:message-id:user-agent:mime-version; bh=eXOAgPMWpAgLi0v9g0v3QTwANQPDTpXoSEMCNVX6Qbs=; b=USP5czywkmjlX3jTBsQaMgyHm2a8Und8TknvlFyAd/1d/PJuulq86MQpx+AASgNBNu V+pE/RhZIpNfovVS34xZDx9jB0CfEDu8N9m12cLn8yWzjrE7N46708QS+ZgHdW8uOiOp IZK1qFlrX/RFEwfwmEDsianUYiSNvRQHm+Yrs= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:from:to:date:message-id:user-agent:mime-version; bh=eXOAgPMWpAgLi0v9g0v3QTwANQPDTpXoSEMCNVX6Qbs=; b=UMbACGzXc6EykCBv8dIXmXONq5C00lNmYKWJGMxgTCggdgwHlOBzJX/YdQH9rOrBQe Ks9flYHXu5O8+Sp1M/Pr1jpQ3cQwnFIfkYNLM/NqGlwmeU4fbpA4K298gCc+h/BaJED5 fYths/4DUfZ3fb1R8p1pEwCFUlcXYCb9HRBUMG24NyF9zq6hdYQXqtqKnoqdHDnb7Aww XV75077XVZ2dgAdMjl946w4M/xAJ/AyWs9WATgSoOvZDNrBzbJQar417xOsCL6kmaAQ1 BDKaxj0lVDenUoiduXW3XRfsfmBwWpp+S7RHfpNLTq0OUGaYxcY18DxXlB57PlQbidIu 03/g== X-Gm-Message-State: ABUngvdQwX/I+NidKdOio9+xZSbFm4wAT/CKJXH51a4cwyGu1hK7t3r2r/vaNHGs5Om2QA== X-Received: by 10.233.220.129 with SMTP id q123mr4390112qkf.290.1477026221523; Thu, 20 Oct 2016 22:03:41 -0700 (PDT) Received: from phe.ftfl.ca.ftfl.ca (hlfxns017vw-142068082201.dhcp-dynamic.FibreOp.ns.bellaliant.net. [142.68.82.201]) by smtp.gmail.com with ESMTPSA id c38sm487283qte.17.2016.10.20.22.03.39 for (version=TLS1_2 cipher=ECDHE-RSA-CHACHA20-POLY1305 bits=256/256); Thu, 20 Oct 2016 22:03:40 -0700 (PDT) From: Joseph Mingrone To: control@debbugs.gnu.org Date: Fri, 21 Oct 2016 02:03:37 -0300 Message-ID: <861sza1efq.fsf@phe.ftfl.ca> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/25.1 (berkeley-unix) MIME-Version: 1.0 Content-Type: multipart/signed; boundary="=-=-="; micalg=pgp-sha512; protocol="application/pgp-signature" X-Spam-Score: 1.8 (+) X-Spam-Report: Spam detection software, running on the system "debbugs.gnu.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see the administrator of that system for details. Content preview: unarchive 19653 [...] Content analysis details: (1.8 points, 10.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -0.0 SPF_PASS SPF: sender matches SPF record -0.7 RCVD_IN_DNSWL_LOW RBL: Sender listed at http://www.dnswl.org/, low trust [209.85.220.170 listed in list.dnswl.org] -0.0 RCVD_IN_MSPIKE_H3 RBL: Good reputation (+3) [209.85.220.170 listed in wl.mailspike.net] 0.5 RCVD_IN_SORBS_SPAM RBL: SORBS: sender is a spam source [209.85.220.170 listed in dnsbl.sorbs.net] 0.0 T_DKIM_INVALID DKIM-Signature header exists but is not valid 1.8 MISSING_SUBJECT Missing Subject: header 0.2 NO_SUBJECT Extra score for no subject -0.0 RCVD_IN_MSPIKE_WL Mailspike good senders 0.0 TVD_SPACE_RATIO No description available. X-Debbugs-Envelope-To: control X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 1.8 (+) X-Spam-Report: Spam detection software, running on the system "debbugs.gnu.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see the administrator of that system for details. Content preview: unarchive 19653 [...] Content analysis details: (1.8 points, 10.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -0.7 RCVD_IN_DNSWL_LOW RBL: Sender listed at http://www.dnswl.org/, low trust [209.85.220.170 listed in list.dnswl.org] -0.0 RCVD_IN_MSPIKE_H3 RBL: Good reputation (+3) [209.85.220.170 listed in wl.mailspike.net] 0.5 RCVD_IN_SORBS_SPAM RBL: SORBS: sender is a spam source [209.85.220.170 listed in dnsbl.sorbs.net] -0.0 SPF_PASS SPF: sender matches SPF record 0.0 T_DKIM_INVALID DKIM-Signature header exists but is not valid 1.8 MISSING_SUBJECT Missing Subject: header 0.2 NO_SUBJECT Extra score for no subject -0.0 RCVD_IN_MSPIKE_WL Mailspike good senders 0.0 TVD_SPACE_RATIO No description available. --=-=-= Content-Type: text/plain unarchive 19653 --=-=-= Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iQJ8BAEBCgBmBQJYCaGrXxSAAAAAAC4AKGlzc3Vlci1mcHJAbm90YXRpb25zLm9w ZW5wZ3AuZmlmdGhob3JzZW1hbi5uZXQ1NUIwOTNBNzI2QzM4ODU1NzEyMkJBRDUz NkE0MEM4M0IwRDZFRjlFAAoJEDakDIOw1u+evQcQAI8CH7rxjoVuoAdkvoY/M9FG ftVdonVa3VYBA+UKarSJfDcgCpAIWua6ODG+0ovSDsH2jb8k+Bz+QWUoD0QpVkCr ncelwP5k1XkBX0poIEQaRBJc8WxS0Mth7YOPGFOOumXeLIzcbot3u06oRSQVbbup EaLHrqJrPDuD6i8LEKUA9OnI7moFRlLAOEi0lM12ZnlMyfo50RzgEY1xE8FiWgCI k2A51aE34L4OcFbHHYV1V+cwJqQrJqeVGJNISwlwSFTA2MeZghh//h22coD0gF1d Xlvx+BK93JdJwSFjFV6T266oRadZjk+M4JZGfB/YH7dIVkyLJXceOw90iIFa3sxz 3NJ+TR8ebDesFmJR4gziboC91DhbkDJORNG809uoqPgJuAen2ugQ3XDmPaYXRLIT RjI+O4T95aISO8KNOyoj2GQriPhObxdZwmxT3l76OXlieYjXMzqJZHFH3kNSfx79 ulw/5GDUjviHpNAY3H5dodGVgTENT24MvJBIcJpRtW1HVX+hUDeqowuS46vSVUoc ptfVXhRpJ4StzhsPU9jyjvCK86PxZ9xRVdGbEL2uVXpbe2Bp8gYa5uw92L9P8rnN V91ir9nSQX0aclio2xawtgTvDwXkv4B3Xazt043bu7hzDdwTBWXGMYv9RUfrVgKR 3ycv9Pp/KgOmThdB5YRb =ZNda -----END PGP SIGNATURE----- --=-=-=-- From debbugs-submit-bounces@debbugs.gnu.org Fri Oct 21 01:05:07 2016 Received: (at 19653) by debbugs.gnu.org; 21 Oct 2016 05:05:07 +0000 Received: from localhost ([127.0.0.1]:43009 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1bxS0w-0001CJ-V6 for submit@debbugs.gnu.org; Fri, 21 Oct 2016 01:05:07 -0400 Received: from mail-qk0-f170.google.com ([209.85.220.170]:34058) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1bxS0v-0001Bi-87 for 19653@debbugs.gnu.org; Fri, 21 Oct 2016 01:05:05 -0400 Received: by mail-qk0-f170.google.com with SMTP id f128so125278936qkb.1 for <19653@debbugs.gnu.org>; Thu, 20 Oct 2016 22:05:05 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ftfl.ca; s=google; h=from:to:subject:references:date:in-reply-to:message-id:user-agent :mime-version; bh=2iA5BDQLXn5u0iyrRi4qG5gwX/8Dgxg3y3yNk7Xb9OU=; b=Q49lpWHcFmedljAwTcdkSVzwJa2rr4HOX2hN8D6wnn0ANcpXvMhHQweEgct2qg1hSt XrNdA4UtZaDlOqQ5DOX/n98eIvQMSCI/W1B/zg2W9f5nr2eGHkJIWUUzX9/LBW7QgqZK tjX27rk1TTTtm2ckBeGlJBSZR0DaVTNSJLXd4= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:from:to:subject:references:date:in-reply-to :message-id:user-agent:mime-version; bh=2iA5BDQLXn5u0iyrRi4qG5gwX/8Dgxg3y3yNk7Xb9OU=; b=HZdQQMM/dNTHqLa4yJbpta21FgJElewZ1tpYnZYHkCOdpmmg3jWRQWMjo5J2oKzhiM jGy3PXU042zg7LGuIjq06MG91Gtnl+arBFjwtj7+An3eA4czdC0+hlbKd2SC/yjK+7UU BW7Z0GLAiMkEFse/arSc5Pcbywlz2RYIvYy7lNV4vZbOO86ChHPmg0/jCbYtIEq4/LF2 gTLF/4Q7QV2MlnWish39/ShZoA/2GpJ3mTLr6gAcB1OWxjVt509JpQGrM2FyvwVUPtBz gfAALb+SQfMEpeY6L8wGgj+K7twzA5+02e8sQTiNuA/tHVXLTZDIBOEKZ9ABkCMtH14e RmxA== X-Gm-Message-State: ABUngveKMh47NUnnsKIlCJ9MVpdXQIcAtwviZtTZiwJ92z3KveNeD9ADbPHNWweTMmRqBQ== X-Received: by 10.55.56.11 with SMTP id f11mr3593862qka.255.1477026299818; Thu, 20 Oct 2016 22:04:59 -0700 (PDT) Received: from phe.ftfl.ca.ftfl.ca (hlfxns017vw-142068082201.dhcp-dynamic.FibreOp.ns.bellaliant.net. [142.68.82.201]) by smtp.gmail.com with ESMTPSA id n128sm454556qka.49.2016.10.20.22.04.59 for <19653@debbugs.gnu.org> (version=TLS1_2 cipher=ECDHE-RSA-CHACHA20-POLY1305 bits=256/256); Thu, 20 Oct 2016 22:04:59 -0700 (PDT) From: Joseph Mingrone To: 19653@debbugs.gnu.org Subject: Re: bug#19653: ispell misalignment with hunspell when Unicode apostrophe is used References: <8660om1en7.fsf@phe.ftfl.ca> Date: Fri, 21 Oct 2016 02:04:58 -0300 In-Reply-To: <8660om1en7.fsf@phe.ftfl.ca> (Joseph Mingrone's message of "Fri, 21 Oct 2016 01:59:08 -0300") Message-ID: <86wph2z405.fsf@phe.ftfl.ca> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/25.1 (berkeley-unix) MIME-Version: 1.0 Content-Type: multipart/signed; boundary="=-=-="; micalg=pgp-sha512; protocol="application/pgp-signature" X-Spam-Score: -0.2 (/) X-Debbugs-Envelope-To: 19653 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.2 (/) --=-=-= Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable This still seems to be a problem with hunspell version 1.3.3. The problem can be reproduced by spell checking a file with this one line. alsdk =E2=9C=85 sdfkjdsf sldksdfkjsfd During spell checking, the process list shows: ispell run -- -- /usr/local/bin/hunspell -a -d en_CA -i UTF-8 The error Emacs (version 25.1.1) reports is: ispell-process-line: Ispell misalignment: word =E2=80=98sdfkjdsf=E2=80=99 p= oint 11; probably incompatible versions Hunspell skips over the special character when it is run at a terminal prom= pt. This is the initial output. ### begin hunspell output ### alsdk File: test.txt alsdk \~E sdfkjdsf sldksdfkjsfd 0: Alaska [SPACE] R)epl A)ccept I)nsert U)ncap S)tem Q)uit e(X)it or ? for help ### end hunspell output ### --=-=-= Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iQJ8BAEBCgBmBQJYCaH6XxSAAAAAAC4AKGlzc3Vlci1mcHJAbm90YXRpb25zLm9w ZW5wZ3AuZmlmdGhob3JzZW1hbi5uZXQ1NUIwOTNBNzI2QzM4ODU1NzEyMkJBRDUz NkE0MEM4M0IwRDZFRjlFAAoJEDakDIOw1u+eGLQP/3gbdBTbnhQ6okFWXrbN30Tx mU2ZvZA/Er+Rj2gvW7717UH8BHFgCp1F6zspz6nIdDyqco6xutF52zWBCf/rs7VJ m1P04GFJtZAtpBfj57mWVtDaYkdAL7FPNje2hxW5GoVfj3eyVEQFZCfNekpVYJ5u KIpLdNJ0Kz/3WV2uafxHP4z7p6Gx3AFn5POQ0ZYuHH81VN9yNcBLisl7ujMOIAib hCASF816S1bJx1DsmRnoB3yXZ7yHr3o5B26lvenWbyPfm19bSTYqiVr+pcx3WqsV cxjK+BEN42bAGrXVJryGSj/BGmHyMzGPqyCndrDLHf1q+LkTAt+QiyTYPerDxbDj clAmpFzJEmKE1iMWScBthgxtsEI9FXofJfwKcYmp92kvr3CZq7DVA9XBJssyb0Hf 1fr55EXs85bTvOfoVCGDhkqVOyaNjmDl6cqRJ/LaGISKAgyFDxWdxaJXLIb9TPcc PK7ABQLurUFwEBtIEB7uaKuQhvP584J2m+14efjfjvTHkR51B6e6TpgZxyWSYlk0 Qr31XQjIxXAT0dCJpxFmz9HWz/D6Ps9Qs4qrgJRU1abMkRVn/5RJXSrHiQXxWbjp 9zIfOViNexULZ5JCeTAgNF8tAjeXb+zcdo/cSPVNt1cb7dPLDn5y5AQXVzczlQeg kBokbu0VS0s7KDaM59fF =14SO -----END PGP SIGNATURE----- --=-=-=-- From debbugs-submit-bounces@debbugs.gnu.org Fri Oct 21 03:33:32 2016 Received: (at 19653) by debbugs.gnu.org; 21 Oct 2016 07:33:32 +0000 Received: from localhost ([127.0.0.1]:43418 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1bxUKa-0006jz-7M for submit@debbugs.gnu.org; Fri, 21 Oct 2016 03:33:32 -0400 Received: from eggs.gnu.org ([208.118.235.92]:59324) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1bxUKY-0006jk-PP for 19653@debbugs.gnu.org; Fri, 21 Oct 2016 03:33:30 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1bxUKP-0004gZ-Ho for 19653@debbugs.gnu.org; Fri, 21 Oct 2016 03:33:25 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=0.5 required=5.0 tests=BAYES_50,RP_MATCHES_RCVD autolearn=disabled version=3.3.2 Received: from fencepost.gnu.org ([2001:4830:134:3::e]:32777) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1bxUKP-0004gL-F2; Fri, 21 Oct 2016 03:33:21 -0400 Received: from 84.94.185.246.cable.012.net.il ([84.94.185.246]:2657 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_128_CBC_SHA1:128) (Exim 4.82) (envelope-from ) id 1bxUKO-0006Jg-0h; Fri, 21 Oct 2016 03:33:20 -0400 Date: Fri, 21 Oct 2016 10:33:10 +0300 Message-Id: <83mvhyrwax.fsf@gnu.org> From: Eli Zaretskii To: Joseph Mingrone In-reply-to: <86wph2z405.fsf@phe.ftfl.ca> (message from Joseph Mingrone on Fri, 21 Oct 2016 02:04:58 -0300) Subject: Re: bug#19653: ispell misalignment with hunspell when Unicode apostrophe is used References: <8660om1en7.fsf@phe.ftfl.ca> <86wph2z405.fsf@phe.ftfl.ca> MIME-version: 1.0 Content-type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 2001:4830:134:3::e X-Spam-Score: -5.3 (-----) X-Debbugs-Envelope-To: 19653 Cc: 19653@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: Eli Zaretskii Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -5.3 (-----) > From: Joseph Mingrone > Date: Fri, 21 Oct 2016 02:04:58 -0300 > > This still seems to be a problem with hunspell version 1.3.3. > > The problem can be reproduced by spell checking a file with this one line. > > alsdk ✅ sdfkjdsf sldksdfkjsfd > > During spell checking, the process list shows: > > ispell run -- -- /usr/local/bin/hunspell -a -d en_CA -i UTF-8 > > The error Emacs (version 25.1.1) reports is: > > ispell-process-line: Ispell misalignment: word ‘sdfkjdsf’ point 11; probably incompatible versions Did Hunspell ever fix the problem whereby it reported byte offsets of the misspelled words, as opposed to character offsets? If not, that is your problem, and Hunspell should finally get its act together. To see whether this is the problem, invoke Hunspell like this: /usr/local/bin/hunspell -a -d en_CA -i UTF-8 < test.txt and see what Hunspell emits. It should emit something like this (the below is taken from my system, and I don't have the en_CA dictionary, so your output might be slightly different): @(#) International Ispell Version 3.2.06 (but really Hunspell 1.3.2) & alsdk 3 0: Alaska, elastic, Alston & sdfkjdsf 2 8: artefact's, postfix & sldksdfkjsfd 2 17: justification, staphylococcus The second number after each misspelled word is the offset of that word's beginning, measured in characters, from the start of the line. Hunspell used to report this in bytes instead of characters; if it still does, you will have to patch it to fix that bug. AFAIR, the Hunspell issue tracker includes several patches for this bug. Or maybe the latest Hunspell 1.4.1 already fixes this, in which case please upgrade. From debbugs-submit-bounces@debbugs.gnu.org Fri Oct 21 09:00:09 2016 Received: (at 19653) by debbugs.gnu.org; 21 Oct 2016 13:00:09 +0000 Received: from localhost ([127.0.0.1]:44265 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1bxZQd-0002sy-ML for submit@debbugs.gnu.org; Fri, 21 Oct 2016 09:00:09 -0400 Received: from mail-qt0-f178.google.com ([209.85.216.178]:34416) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1bxZQb-0002r6-Mn for 19653@debbugs.gnu.org; Fri, 21 Oct 2016 09:00:06 -0400 Received: by mail-qt0-f178.google.com with SMTP id q7so85379717qtq.1 for <19653@debbugs.gnu.org>; Fri, 21 Oct 2016 06:00:05 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ftfl.ca; s=google; h=from:to:cc:subject:references:date:in-reply-to:message-id :user-agent:mime-version; bh=jLyNz510VhgVqSzrmaBcs197K6gZ/rpVjW+oPah6FA8=; b=gYnyOVVX+nN5BdJgiNJc/DJvI0KYN1kK25E4J9NZKARMowXOMzFbVqVUg59CvtCVTm AadX/Bvd4LX/CRDcD2tYyh+JCgZ6XW51gPzLfwK1jNR5pe47F2ufGvd726kH/DBpFYzO xeEvJNULGCNgvJ0inH4l7zSZPVotkqg4XjVr4= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:from:to:cc:subject:references:date:in-reply-to :message-id:user-agent:mime-version; bh=jLyNz510VhgVqSzrmaBcs197K6gZ/rpVjW+oPah6FA8=; b=iKPpwRjRAeRYlW9p91NmQOE1JnnJYsn+hBA9u125yYLQaHl56Jj49CcD4htZdckD6H 1/M8I4eJPlk63gQ6a43ePRdWsMNgR/iBKyp24gWRRfLR2NuwJG5T7bwHtvMFIa4Cxe6L uP6qXoenyoF5gficZ7u1X2ohX/4reJz6koRaKWY4Z++3aPfYrAqCzwv3rBNhhLD1RUr2 fewDsc63HK6hJQ/BivzU6xCjixxqUr50bW65AcjSqjZ60KQTN70Y41Di42ZdoraJY9+5 xQa9rSiyJNte/tlnkmJl5qrCDUoBONzkNNgYf2BXA6nkj7/GHeOKRamtVxI/zX3MDpHC w6CA== X-Gm-Message-State: ABUngvc0ShdEZXE0XGRhmTvoo0cjIjoqMhRxSvVC/Jtkgr7gBRkIuXpLuENdaeZe01jTcg== X-Received: by 10.237.41.196 with SMTP id o62mr887466qtd.122.1477054800053; Fri, 21 Oct 2016 06:00:00 -0700 (PDT) Received: from phe.ftfl.ca.ftfl.ca (hlfxns017vw-142068082201.dhcp-dynamic.FibreOp.ns.bellaliant.net. [142.68.82.201]) by smtp.gmail.com with ESMTPSA id h6sm1256561qkd.38.2016.10.21.05.59.59 (version=TLS1_2 cipher=ECDHE-RSA-CHACHA20-POLY1305 bits=256/256); Fri, 21 Oct 2016 05:59:59 -0700 (PDT) From: Joseph Mingrone To: Eli Zaretskii Subject: Re: bug#19653: ispell misalignment with hunspell when Unicode apostrophe is used References: <8660om1en7.fsf@phe.ftfl.ca> <86wph2z405.fsf@phe.ftfl.ca> <83mvhyrwax.fsf@gnu.org> Date: Fri, 21 Oct 2016 09:59:57 -0300 In-Reply-To: <83mvhyrwax.fsf@gnu.org> (Eli Zaretskii's message of "Fri, 21 Oct 2016 10:33:10 +0300") Message-ID: <86shrpzwky.fsf@phe.ftfl.ca> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/25.1 (berkeley-unix) MIME-Version: 1.0 Content-Type: multipart/signed; boundary="=-=-="; micalg=pgp-sha512; protocol="application/pgp-signature" X-Spam-Score: -0.2 (/) X-Debbugs-Envelope-To: 19653 Cc: 19653@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.2 (/) --=-=-= Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Eli Zaretskii writes: >> From: Joseph Mingrone >> Date: Fri, 21 Oct 2016 02:04:58 -0300 >> This still seems to be a problem with hunspell version 1.3.3. >> The problem can be reproduced by spell checking a file with this one lin= e. >> alsdk =E2=9C=85 sdfkjdsf sldksdfkjsfd >> During spell checking, the process list shows: >> ispell run -- -- /usr/local/bin/hunspell -a -d en_CA -i UTF-8 >> The error Emacs (version 25.1.1) reports is: >> ispell-process-line: Ispell misalignment: word =E2=80=98sdfkjdsf=E2=80= =99 point 11; probably incompatible versions > Did Hunspell ever fix the problem whereby it reported byte offsets of > the misspelled words, as opposed to character offsets? If not, that > is your problem, and Hunspell should finally get its act together. > To see whether this is the problem, invoke Hunspell like this: > /usr/local/bin/hunspell -a -d en_CA -i UTF-8 < test.txt > and see what Hunspell emits. It should emit something like this (the > below is taken from my system, and I don't have the en_CA dictionary, > so your output might be slightly different): > @(#) International Ispell Version 3.2.06 (but really Hunspell 1.3.2) > & alsdk 3 0: Alaska, elastic, Alston > & sdfkjdsf 2 8: artefact's, postfix > & sldksdfkjsfd 2 17: justification, staphylococcus > The second number after each misspelled word is the offset of that > word's beginning, measured in characters, from the start of the line. > Hunspell used to report this in bytes instead of characters; if it > still does, you will have to patch it to fix that bug. AFAIR, the > Hunspell issue tracker includes several patches for this bug. Or > maybe the latest Hunspell 1.4.1 already fixes this, in which case > please upgrade. It's still a problem with hunspell. % echo "=C3=A9 startingCharTwo" | hunspell -a -d en_CA -i UTF-8 @(#) International Ispell Version 3.2.06 (but really Hunspell 1.3.3) & =C3=A9 15 0: e, s, i, a, n, r, t, o, l, c, d, u, g, m, p & startingCharTwo 1 3: nonparticipating https://github.com/hunspell/hunspell/issues/418 --=-=-= Content-Type: application/pgp-signature; name="signature.asc" -----BEGIN PGP SIGNATURE----- iQJ8BAEBCgBmBQJYChFNXxSAAAAAAC4AKGlzc3Vlci1mcHJAbm90YXRpb25zLm9w ZW5wZ3AuZmlmdGhob3JzZW1hbi5uZXQ1NUIwOTNBNzI2QzM4ODU1NzEyMkJBRDUz NkE0MEM4M0IwRDZFRjlFAAoJEDakDIOw1u+eQmAP/3nNLN/6v7lbh3dz9onnYmRl W3RZnc8hOVwNcy5ja7crpw/c3FldB3I+zfa5Vo7Go3SDJc2V1NJ55pcpHsUtMw3H 94Zqe2WsEyeqlCiAyQTXb1JghEyBWvd7pccnQYn42qxHFy6zFC0lybadzJKXLCYy WpXqewPja8mOQqOBTzmespYooufez2aV2Y7Frk+LGVGltb/KylkNBpBwqGod90H/ ufFUpVR3CQaTmd27b94234KomFR4EFQHOCLfI1ENE9RiEdKKf+e5E1KM5oUr9JUP KxuOeF3eD5uhljRjtOYMQZT/lkCSP6UEfb0la+v683J0RPABvoAg6t32+f3Ol0BW RNOl8H7SWZHnnj/VcXfHh5m58WZ9dwRbxxEuyPe3f/D1QX/JlRe5ss8mtWYeyk3Q /fFD0B10vguCSPRGpgY7TlsG/2xKxXSGRAYsZ0tgE9helYsKasFkO96qcsb86VUX 96MaRgmVrUorflBM5aD6nbEwevjhLN1F7MLIV1BTbGZR/lqrlSVzTqupk5lS2K0t lnzvhUTH565a4SbAVv/yuNIwGJuwfbsLvxqyyMDudIhyc2+HS4R46eQDS8UYOF8J b1pWPdE3edRsqE5/+duWJaPRecoP8cIDjI4+2HrSyL0M8mem4nWWRzUT5wvbFbfj AkzK2M/1Hw4voqxDOp9K =wyZO -----END PGP SIGNATURE----- --=-=-=-- From debbugs-submit-bounces@debbugs.gnu.org Fri Oct 21 10:52:28 2016 Received: (at 19653) by debbugs.gnu.org; 21 Oct 2016 14:52:28 +0000 Received: from localhost ([127.0.0.1]:45017 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1bxbBM-0005iE-9K for submit@debbugs.gnu.org; Fri, 21 Oct 2016 10:52:28 -0400 Received: from eggs.gnu.org ([208.118.235.92]:49806) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1bxbBK-0005hv-Ma for 19653@debbugs.gnu.org; Fri, 21 Oct 2016 10:52:26 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1bxbBC-0003tD-Di for 19653@debbugs.gnu.org; Fri, 21 Oct 2016 10:52:21 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=0.5 required=5.0 tests=BAYES_50,RP_MATCHES_RCVD autolearn=disabled version=3.3.2 Received: from fencepost.gnu.org ([2001:4830:134:3::e]:51115) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1bxbBC-0003t9-AO; Fri, 21 Oct 2016 10:52:18 -0400 Received: from 84.94.185.246.cable.012.net.il ([84.94.185.246]:3263 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_128_CBC_SHA1:128) (Exim 4.82) (envelope-from ) id 1bxbBB-0000Gz-Ie; Fri, 21 Oct 2016 10:52:17 -0400 Date: Fri, 21 Oct 2016 17:52:09 +0300 Message-Id: <838tthsqjq.fsf@gnu.org> From: Eli Zaretskii To: Joseph Mingrone In-reply-to: <86shrpzwky.fsf@phe.ftfl.ca> (message from Joseph Mingrone on Fri, 21 Oct 2016 09:59:57 -0300) Subject: Re: bug#19653: ispell misalignment with hunspell when Unicode apostrophe is used References: <8660om1en7.fsf@phe.ftfl.ca> <86wph2z405.fsf@phe.ftfl.ca> <83mvhyrwax.fsf@gnu.org> <86shrpzwky.fsf@phe.ftfl.ca> MIME-version: 1.0 Content-type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 2001:4830:134:3::e X-Spam-Score: -5.3 (-----) X-Debbugs-Envelope-To: 19653 Cc: 19653@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: Eli Zaretskii Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -5.3 (-----) > From: Joseph Mingrone > Cc: 19653@debbugs.gnu.org > Date: Fri, 21 Oct 2016 09:59:57 -0300 > > > @(#) International Ispell Version 3.2.06 (but really Hunspell 1.3.2) > > & alsdk 3 0: Alaska, elastic, Alston > > & sdfkjdsf 2 8: artefact's, postfix > > & sldksdfkjsfd 2 17: justification, staphylococcus > > > The second number after each misspelled word is the offset of that > > word's beginning, measured in characters, from the start of the line. > > Hunspell used to report this in bytes instead of characters; if it > > still does, you will have to patch it to fix that bug. AFAIR, the > > Hunspell issue tracker includes several patches for this bug. Or > > maybe the latest Hunspell 1.4.1 already fixes this, in which case > > please upgrade. > > It's still a problem with hunspell. > > % echo "é startingCharTwo" | hunspell -a -d en_CA -i UTF-8 > @(#) International Ispell Version 3.2.06 (but really Hunspell 1.3.3) > & é 15 0: e, s, i, a, n, r, t, o, l, c, d, u, g, m, p > & startingCharTwo 1 3: nonparticipating > > https://github.com/hunspell/hunspell/issues/418 Thanks for checking. From unknown Fri Jun 20 18:00:51 2025 Received: (at fakecontrol) by fakecontrolmessage; To: internal_control@debbugs.gnu.org From: Debbugs Internal Request Subject: Internal Control Message-Id: bug archived. Date: Sat, 19 Nov 2016 12:24:03 +0000 User-Agent: Fakemail v42.6.9 # This is a fake control message. # # The action: # bug archived. thanks # This fakemail brought to you by your local debbugs # administrator