From unknown Fri Aug 15 03:57:10 2025 Content-Disposition: inline Content-Transfer-Encoding: quoted-printable MIME-Version: 1.0 X-Mailer: MIME-tools 5.509 (Entity 5.509) Content-Type: text/plain; charset=utf-8 From: bug#7781 <7781@debbugs.gnu.org> To: bug#7781 <7781@debbugs.gnu.org> Subject: Status: 23.2.91; ispell problem with hunspell and UTF-8 file Reply-To: bug#7781 <7781@debbugs.gnu.org> Date: Fri, 15 Aug 2025 10:57:10 +0000 retitle 7781 23.2.91; ispell problem with hunspell and UTF-8 file reassign 7781 emacs submitter 7781 Reuben Thomas severity 7781 normal tag 7781 notabug thanks From debbugs-submit-bounces@debbugs.gnu.org Mon Jan 03 18:07:51 2011 Received: (at submit) by debbugs.gnu.org; 3 Jan 2011 23:07:51 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1PZtVG-0008Dg-MD for submit@debbugs.gnu.org; Mon, 03 Jan 2011 18:07:51 -0500 Received: from eggs.gnu.org ([140.186.70.92]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1PZtVE-0008DU-LM for submit@debbugs.gnu.org; Mon, 03 Jan 2011 18:07:50 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1PZtc6-0003Lt-PI for submit@debbugs.gnu.org; Mon, 03 Jan 2011 18:14:56 -0500 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-4.2 required=5.0 tests=BAYES_00,RCVD_IN_DNSWL_MED autolearn=unavailable version=3.3.1 Received: from lists.gnu.org ([199.232.76.165]:48785) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1PZtc6-0003Lo-L3 for submit@debbugs.gnu.org; Mon, 03 Jan 2011 18:14:54 -0500 Received: from [140.186.70.92] (port=42219 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PZtc4-0008Es-Rm for bug-gnu-emacs@gnu.org; Mon, 03 Jan 2011 18:14:54 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1PZtc2-0003KU-77 for bug-gnu-emacs@gnu.org; Mon, 03 Jan 2011 18:14:52 -0500 Received: from exprod7og117.obsmtp.com ([64.18.2.6]:46914) by eggs.gnu.org with smtp (Exim 4.71) (envelope-from ) id 1PZtc1-0003J0-Ox for bug-gnu-emacs@gnu.org; Mon, 03 Jan 2011 18:14:50 -0500 Received: from source ([74.125.82.180]) by exprod7ob117.postini.com ([64.18.6.12]) with SMTP ID DSNKTSJYZSwGu9KUs92jjoN/sixmT0CvmwgC@postini.com; Mon, 03 Jan 2011 15:14:49 PST Received: by wyb28 with SMTP id 28so13620920wyb.39 for ; Mon, 03 Jan 2011 15:14:45 -0800 (PST) Received: by 10.216.170.213 with SMTP id p63mr24498657wel.6.1294096484771; Mon, 03 Jan 2011 15:14:44 -0800 (PST) Received: from mord (87-194-87-241.bethere.co.uk [87.194.87.241]) by mx.google.com with ESMTPS id m50sm10180679wek.32.2011.01.03.15.14.43 (version=TLSv1/SSLv3 cipher=RC4-MD5); Mon, 03 Jan 2011 15:14:43 -0800 (PST) From: Reuben Thomas To: bug-gnu-emacs@gnu.org Subject: 23.2.91; ispell problem with hunspell and UTF-8 file Date: Mon, 03 Jan 2011 23:14:41 +0000 Message-ID: <87sjx9fula.fsf@sc3d.org> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6, seldom 2.4 (older, 4) X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6 (newer, 2) X-Spam-Score: -6.0 (------) X-Debbugs-Envelope-To: submit X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -6.0 (------) With the following text, and using emacs -Q, I get the errors you can see in the messages log below when using hunspell to spell-check a UTF-8 buffer with some extended characters in it. I did test this with emacs -Q, but the current session, in which I reproduced the problem and am now composing this bug report, was not started with -Q (this is so submitting the bug report works properly!). I am running a freshly bzr-pulled build of the emacs-23 branch. Text follows ----cut here---- --- title: Kindle 3 is a good first attempt tags: computing, books format: markdown date: Mon, 03 Jan 2011 20:53:13 +0000 post-id: 2585181001 --- Giving my girlfriend a Kindle for Christmas was the carrot in a multi-prong= ed strategy to avoid needing more bookshelves (the stick being =E2=80=9CI w= ill start giving away your books=E2=80=9D and my contribution being to arch= ive books I=E2=80=99ve read (or return the many that aren=E2=80=99t even mi= ne). This therefore required that I stocked it with books before she got he= r hands on it, which in turn was all the excuse I needed to play with the t= hing. My lazy solution was simply to download all of [Feedbooks](http://www.feedb= ooks.com); I [wrote some scripts](http://rrt.sc3d.org/Software/Kindle/) to = make this actually lazy, rather than brain-numbingly dull. In the process I= found that while the Kindle is nice to hold and great to read, it struggle= s to cope with a large collection of books (even though the nearly 3,000 vo= lumes of Feedbooks only half-filled its 4Gb memory), and is woeful as a res= earch tool. And, of course, Amazon=E2=80=99s first-mover-evil surfaced earl= y. Here are the problems I had: 1. Amazon=E2=80=99s own store doesn=E2=80=99t seem to contain free books. I= think it=E2=80=99s poor form not to give people a straightforward choice o= f free editions of out-of-copyright works. The Kindle may be a loss leader,= but at =C2=A3109 it=E2=80=99s still not cheap. Feedbooks, rather than inte= grating easily into the Kindle, like, say, a 3rd-party software provider in= to Ubuntu=E2=80=99s Software Center, provide a catalogue which itself is in= the form of a book, doesn=E2=80=99t automatically update, and offers a lis= t ordered only by title. In other words, it=E2=80=99s useless; one is bette= r off using the built-in web browser to search the online catalogue=E2=80=A6 2. =E2=80=A6or better, another browser, since the Kindle=E2=80=99s is woefu= lly slow (and I don=E2=80=99t just mean the screen update). It=E2=80=99s ju= st about usable, and hence useful in an emergency, but is no good as, for e= xample, an online research tool to use in parallel with the books you have = downloaded, although=E2=80=A6 3. =E2=80=A6offline search is awful too. With just the few ebooks that come= loaded on the device, it was slow; with the thousands of books I loaded, i= t simply locked up the device, even when trying to search in the manual, pr= esumably already indexed. The Kindle seems to index its contents in the bac= kground, but even now, over a week later, search doesn=E2=80=99t work. The = only effective navigation is by a book=E2=80=99s table of contents, and, to= choose which books to read, the user-definable collections, though=E2=80=A6 4. =E2=80=A6collections are a pain to set up for many books, as you have to= select each book manually; there is no way I have found to select a range.= (Fortunately, I was able to define collections programmatically, but this = will be beyond most users.) In summary, it=E2=80=99s a lovely device, but the software is rather toytow= n. Amazon could improve it (and indeed, the 3.0.3 firmware update, at the e= xperimental stage when I checked, claims, vaguely, =E2=80=9Cperformance imp= rovements=E2=80=9D), but given that their main interest is in selling books= and Kindles, I=E2=80=99m not hopeful that it will happen before the next h= ardware iteration; whether it happens at all depends on competition, and th= ere should be plenty of that, to go by the number of other ebook readers. ----cut here---- In GNU Emacs 23.2.91.3 (i686-pc-linux-gnu, GTK+ Version 2.22.0) of 2011-01-03 on mord Windowing system distributor `The X.Org Foundation', version 11.0.10900000 Important settings: value of $LC_ALL: nil value of $LC_COLLATE: nil value of $LC_CTYPE: nil value of $LC_MESSAGES: nil value of $LC_MONETARY: nil value of $LC_NUMERIC: nil value of $LC_TIME: nil value of $LANG: en_GB.UTF-8 value of $XMODIFIERS: nil locale-coding-system: utf-8-unix default enable-multibyte-characters: t Major mode: Text Minor modes in effect: longlines-mode: t buffer-face-mode: t flyspell-mode: t show-paren-mode: t savehist-mode: t minibuffer-electric-default-mode: t iswitchb-mode: t icomplete-mode: t global-auto-revert-mode: t desktop-save-mode: t smart-quotes-mode: t mouse-wheel-mode: t use-hard-newlines: t file-name-shadow-mode: t global-font-lock-mode: t font-lock-mode: t blink-cursor-mode: t auto-encryption-mode: t auto-compression-mode: t column-number-mode: t line-number-mode: t transient-mark-mode: t Recent input: M-x r e p o r t - e m h u n s p e l=20 l SPC i s p e l l SPC w i t h SPC h u=20 n s l e s p e =20 p e p e l l SPC=20 f a i l s C-g =20 =20 M-x i s p e l l=20 SPC SPC SPC M-x i s p e =20 Recent messages: Scanning for "hard" Perl constructions... done Applying style hooks... done Scanning for "hard" Perl constructions... done Scanning for "hard" Perl constructions... done Scanning for "hard" Perl constructions... done Scanning for "hard" Perl constructions... done Lazy desktop load complete Quit Spell-checking Kindle 3 is a good first attempt using hunspell with british= +accs dictionary... Spell-checking region using hunspell with british+accs dictionary...done ispell-process-line: Ispell misalignment: word `Feedbooks' point 1363; prob= ably incompatible versions Load-path shadows: /usr/local/share/emacs/23.2.91/site-lisp/auctex/tex-style hides /usr/share/= emacs/site-lisp/auctex/tex-style /usr/local/share/emacs/23.2.91/site-lisp/auctex/tex-buf hides /usr/share/em= acs/site-lisp/auctex/tex-buf /usr/local/share/emacs/23.2.91/site-lisp/auctex/context hides /usr/share/em= acs/site-lisp/auctex/context /usr/local/share/emacs/23.2.91/site-lisp/auctex/bib-cite hides /usr/share/e= macs/site-lisp/auctex/bib-cite /usr/local/share/emacs/23.2.91/site-lisp/auctex/tex-fold hides /usr/share/e= macs/site-lisp/auctex/tex-fold /usr/local/share/emacs/23.2.91/site-lisp/auctex/tex-jp hides /usr/share/ema= cs/site-lisp/auctex/tex-jp /usr/local/share/emacs/23.2.91/site-lisp/auctex/context-nl hides /usr/share= /emacs/site-lisp/auctex/context-nl /usr/local/share/emacs/23.2.91/site-lisp/auctex/toolbar-x hides /usr/share/= emacs/site-lisp/auctex/toolbar-x /usr/local/share/emacs/23.2.91/site-lisp/auctex/tex-mik hides /usr/share/em= acs/site-lisp/auctex/tex-mik /usr/local/share/emacs/23.2.91/site-lisp/auctex/context-en hides /usr/share= /emacs/site-lisp/auctex/context-en /usr/local/share/emacs/23.2.91/site-lisp/auctex/texmathp hides /usr/share/e= macs/site-lisp/auctex/texmathp /usr/local/share/emacs/23.2.91/site-lisp/auctex/tex-info hides /usr/share/e= macs/site-lisp/auctex/tex-info /usr/local/share/emacs/23.2.91/site-lisp/auctex/tex-fptex hides /usr/share/= emacs/site-lisp/auctex/tex-fptex /usr/local/share/emacs/23.2.91/site-lisp/auctex/tex-font hides /usr/share/e= macs/site-lisp/auctex/tex-font /usr/local/share/emacs/23.2.91/site-lisp/auctex/latex hides /usr/share/emac= s/site-lisp/auctex/latex /usr/local/share/emacs/23.2.91/site-lisp/auctex/font-latex hides /usr/share= /emacs/site-lisp/auctex/font-latex /usr/local/share/emacs/23.2.91/site-lisp/auctex/tex-bar hides /usr/share/em= acs/site-lisp/auctex/tex-bar /usr/local/share/emacs/23.2.91/site-lisp/auctex/multi-prompt hides /usr/sha= re/emacs/site-lisp/auctex/multi-prompt /usr/local/share/emacs/23.2.91/site-lisp/auctex/tex hides /usr/share/emacs/= site-lisp/auctex/tex Features: (shadow sort mail-extr message sendmail ecomplete rfc822 mml mml-sec password-cache mm-decode mm-bodies mm-encode mailcap mail-parse rfc2231 rfc2047 rfc2045 qp ietf-drums mailabbrev nnheader gnus-util netrc time-date mm-util mail-prsvr gmm-utils wid-edit mailheader canlock sha1 hex-util hashcash mail-utils emacsbug preview prv-emacs byte-opt warnings tex-buf noutline outline font-latex bytecomp byte-compile latex tex-style tex nxml-uchnm rng-xsd xsd-regexp rng-cmpct rng-nxml rng-valid rng-loc rng-uri rng-parse nxml-parse rng-match rng-dt rng-util rng-pttrn nxml-ns nxml-mode nxml-outln nxml-rap nxml-util nxml-glyph nxml-enc xmltok sgml-mode conf-mode newcomment make-mode vc-git cperl-mode longlines face-remap filladapt flyspell auto-dictionary-autoloads dictionary-autoloads js2-mode-autoloads package reporter completing-help ff-paths uniquify paren savehist minibuf-eldef iswitchb icomplete autorevert time cus-start cus-load desktop server change-mode advice help-fns advice-preload php-mode derived etags cc-langs cl cl-19 cc-mode cc-fonts cc-menus cc-cmds cc-styles cc-align cc-engine cc-vars cc-defs speedbar sb-image ezimage dframe easymenu assoc lua-mode regexp-opt comint ring whitespace etags-update smart-quotes edmacro kmacro ispell ffap muse-autoloads emacs-goodies-el emacs-goodies-custom emacs-goodies-loaddefs easy-mmode devhelp preview-latex tex-site auto-loads tooltip ediff-hook vc-hooks lisp-float-type mwheel x-win x-dnd font-setting tool-bar dnd fontset image fringe lisp-mode register page menu-bar rfn-eshadow timer select scroll-bar mldrag mouse jit-lock font-lock syntax facemenu font-core frame cham georgian utf-8-lang misc-lang vietnamese tibetan thai tai-viet lao korean japanese hebrew greek romanian slovak czech european ethiopic indian cyrillic chinese case-table epa-hook jka-cmpr-hook help simple abbrev loaddefs button minibuffer faces cus-face files text-properties overlay md5 base64 format env code-pages mule custom widget hashtable-print-readable backquote make-network-process dbusbind system-font-setting font-render-setting gtk x-toolkit x multi-tty emacs) --=20 http://rrt.sc3d.org/ From debbugs-submit-bounces@debbugs.gnu.org Fri Jan 07 08:06:56 2011 Received: (at 7781) by debbugs.gnu.org; 7 Jan 2011 13:06:56 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1PbC1o-0000vl-Cc for submit@debbugs.gnu.org; Fri, 07 Jan 2011 08:06:56 -0500 Received: from mail-iy0-f172.google.com ([209.85.210.172]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1PbC1n-0000vZ-1F for 7781@debbugs.gnu.org; Fri, 07 Jan 2011 08:06:47 -0500 Received: by iyi42 with SMTP id 42so16260690iyi.3 for <7781@debbugs.gnu.org>; Fri, 07 Jan 2011 05:14:03 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:sender:received :in-reply-to:references:date:x-google-sender-auth:message-id:subject :from:to:cc:content-type:content-transfer-encoding; bh=i+hijFMoKnysf7sqYkv/nuN1AWSEdhbVx7E30B96dlU=; b=Jb21LTJ8D4gPyd5HLDIWkbB6IYupoB+UxGuDr1YJfgLmmWCmLt288xINUpGfLvaO5Y 8duCRY8tX9YLyi3jRhoaNCOSaUATKcJfIs0bQncF2V1NverX1tbOaFKzLukvpjZv5UBG NchJph6FnFsqWadgcLNmuwFVwtLpyVXGO+qMY= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:cc:content-type :content-transfer-encoding; b=oVLBXBh0oqQJVOq2xoCsnKr5qww8OP6OXWLwMbjgsj46z8wTWx6P8pnLlyUYUokNjJ pqemvjYb6khSS7oMUt2aPh4Dm6qNInZtCWvDFMozrstIofBEU5MwK8o8iws1yjQKUh7p WJqi8odxQmvaOAgy1hGzeAoHfCSzjsm0p7IdI= MIME-Version: 1.0 Received: by 10.231.36.68 with SMTP id s4mr26798060ibd.178.1294406043666; Fri, 07 Jan 2011 05:14:03 -0800 (PST) Received: by 10.231.14.13 with HTTP; Fri, 7 Jan 2011 05:14:03 -0800 (PST) In-Reply-To: <87sjx9fula.fsf@sc3d.org> References: <87sjx9fula.fsf@sc3d.org> Date: Fri, 7 Jan 2011 14:14:03 +0100 X-Google-Sender-Auth: kH0hKpNGqXzxuPe5k1Z6JCXeuVc Message-ID: Subject: Re: bug#7781: 23.2.91; ispell problem with hunspell and UTF-8 file From: Agustin Martin To: Reuben Thomas Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable X-Spam-Score: -1.9 (-) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -2.2 (--) 2011/1/4 Reuben Thomas : > With the following text, and using emacs -Q, I get the errors you can > see in the messages log below when using hunspell to spell-check a UTF-8 > buffer with some extended characters in it. > > I did test this with emacs -Q, but the current session, in which I > reproduced the problem and am now composing this bug report, was not > started with -Q (this is so submitting the bug report works properly!). > > I am running a freshly bzr-pulled build of the emacs-23 branch. Hi, Reuben, I can also reproduce this with emacs23.2. I could locate problems in two lines, after splititng original lines, -- Cut here -- 8< ----- minimal.txt: utf-8 of out-of-copyright works. The Kindle may be a loss leader, but at =A3109 it=92s still not cheap. Feedbooks, rather than integrating easily into -- Cut here -- 8< ----- End of minimal.txt In first line, currency seems to give some conversion errors when iso-8859-1 is used, when that should have ignored by hunspell. I get tons of UTF-8 encoding error. Missing continuation byte in 0. character position: for that line when using $ cat minimal.txt | hunspell -d en_US -a -i iso-8859-1 In second line unusual apostrophe seems to cause some confusion to hunspell when utf8 is used. Comparing what aspell and hunspell give in similar text I get $ cat minimal.txt | aspell --encoding=3Dutf-8 -d en_US -a & Feedbooks 6 22: Feed books, Feed-books, Feedback's, Feedbags, ... $ cat minimal.txt | hunspell -d en_US -i utf-8 -a & Feedbooks 8 24: Feed books, Feed-books, Feedback, Feedbags, ... Do not worry about first number, is the number of suggestions. However position in second number differ. Seems that hunspell is not considering that apostrophe as a single (multibyte) char when counting, but as three components Looks to me an hunspell bug. I found no reference to this problem in hunspell sf site, but noticed that Hunspell 1.2.14 was released yesterday. Need to check if that has some related new. --=20 Agustin From debbugs-submit-bounces@debbugs.gnu.org Fri Jan 07 09:23:24 2011 Received: (at 7781) by debbugs.gnu.org; 7 Jan 2011 14:23:24 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1PbDDw-0002XI-83 for submit@debbugs.gnu.org; Fri, 07 Jan 2011 09:23:24 -0500 Received: from exprod7og120.obsmtp.com ([64.18.2.18]) by debbugs.gnu.org with smtp (Exim 4.69) (envelope-from ) id 1PbDDt-0002X5-Uz for 7781@debbugs.gnu.org; Fri, 07 Jan 2011 09:23:22 -0500 Received: from source ([209.85.218.50]) by exprod7ob120.postini.com ([64.18.6.12]) with SMTP ID DSNKTScjjjV9UVXLP6RJnarpuYa1/Efsp1pH@postini.com; Fri, 07 Jan 2011 06:30:44 PST Received: by mail-yi0-f50.google.com with SMTP id 1so4852165yic.23 for <7781@debbugs.gnu.org>; Fri, 07 Jan 2011 06:30:38 -0800 (PST) MIME-Version: 1.0 Received: by 10.150.230.21 with SMTP id c21mr24869384ybh.130.1294410637924; Fri, 07 Jan 2011 06:30:37 -0800 (PST) Received: by 10.150.186.15 with HTTP; Fri, 7 Jan 2011 06:30:37 -0800 (PST) In-Reply-To: References: <87sjx9fula.fsf@sc3d.org> Date: Fri, 7 Jan 2011 14:30:37 +0000 Message-ID: Subject: Re: bug#7781: 23.2.91; ispell problem with hunspell and UTF-8 file From: Reuben Thomas To: Agustin Martin Content-Type: text/plain; charset=ISO-8859-1 X-Spam-Score: -6.1 (------) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -6.1 (------) Thanks very much for your investigation, Agustin. I tried hunspell 1.2.14 and got exactly the same error. From debbugs-submit-bounces@debbugs.gnu.org Fri Feb 11 11:52:09 2011 Received: (at 7781) by debbugs.gnu.org; 11 Feb 2011 16:52:09 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1PnwE4-0003EA-QB for submit@debbugs.gnu.org; Fri, 11 Feb 2011 11:52:09 -0500 Received: from mail-iy0-f172.google.com ([209.85.210.172]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1PnwE0-0003DU-3r for 7781@debbugs.gnu.org; Fri, 11 Feb 2011 11:52:05 -0500 Received: by iym1 with SMTP id 1so2753755iym.3 for <7781@debbugs.gnu.org>; Fri, 11 Feb 2011 09:00:53 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:cc:content-type; bh=VxY9e1mkWCIS0yNUkVCSf82ea2WjAt8DnGBltfcA9eY=; b=irRhxVqdUqYJB0Ms9GcfrB+tHvBYf/tjlVZdHYWSYRR5uawNTrAd9a4zqFevzLZFnv NeXlUIwp8ysvK8lOh6ZNjmffo93e7X++ppyZM5p48y5tov7AVSI/InsGLQM33dECqose rHXMGyNRClhExRH0FmsKdt7vHXcWt4o+UpLK4= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:sender:in-reply-to:references:date :x-google-sender-auth:message-id:subject:from:to:cc:content-type; b=W7Qua77Zq1BxXGN2I+SBEhUdKyaH9W/RS/sIQr+qnJmcVxuofgAVNkaJoB/fiGcavM YCj79+MkfBaKilF85uLFllFHc2xyeFYBe8CXVwqHcpiA/vMANkkpqgbLnXVnafUtVdNh UjBPzJRmL4dEpN9uIhZbxNnlj+mMxsgBO03pU= MIME-Version: 1.0 Received: by 10.231.16.137 with SMTP id o9mr550825iba.158.1297443653426; Fri, 11 Feb 2011 09:00:53 -0800 (PST) Received: by 10.231.34.3 with HTTP; Fri, 11 Feb 2011 09:00:53 -0800 (PST) In-Reply-To: References: <87sjx9fula.fsf@sc3d.org> Date: Fri, 11 Feb 2011 18:00:53 +0100 X-Google-Sender-Auth: bEbROuPIhPl6I_AwkJJmxjRcsXU Message-ID: Subject: Re: bug#7781: 23.2.91; ispell problem with hunspell and UTF-8 file From: Agustin Martin To: Reuben Thomas Content-Type: text/plain; charset=ISO-8859-1 X-Spam-Score: -2.9 (--) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -3.1 (---) forwarded 7781 https://sourceforge.net/tracker/?func=detail&aid=3178449&group_id=143754&atid=756395 thanks 2011/1/7 Agustin Martin : > 2011/1/4 Reuben Thomas : >> With the following text, and using emacs -Q, I get the errors you can >> see in the messages log below when using hunspell to spell-check a UTF-8 >> buffer with some extended characters in it. > Do not worry about first number, is the number of suggestions. However > position in second number differ. Seems that hunspell is not > considering that apostrophe as a single (multibyte) char when > counting, but as three components > > Looks to me an hunspell bug. I found no reference to this problem in > hunspell sf site, but noticed that Hunspell 1.2.14 was released > yesterday. Need to check if that has some related new. Opened an hunspell bug report for bad count problem https://sourceforge.net/tracker/?func=detail&aid=3178449&group_id=143754&atid=756395 Seems I no longer see the other problem. Cheers, -- Agustin From debbugs-submit-bounces@debbugs.gnu.org Sun Jan 01 16:45:01 2012 Received: (at 7781) by debbugs.gnu.org; 1 Jan 2012 21:45:01 +0000 Received: from localhost ([127.0.0.1] helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1RhTDA-0005sU-Pc for submit@debbugs.gnu.org; Sun, 01 Jan 2012 16:45:01 -0500 Received: from mtaout03-winn.ispmail.ntl.com ([81.103.221.49]) by debbugs.gnu.org with esmtp (Exim 4.69) (envelope-from ) id 1RhTD6-0005sK-Fe for 7781@debbugs.gnu.org; Sun, 01 Jan 2012 16:44:57 -0500 Received: from aamtaout02-winn.ispmail.ntl.com ([81.103.221.35]) by mtaout03-winn.ispmail.ntl.com (InterMail vM.7.08.04.00 201-2186-134-20080326) with ESMTP id <20120101214145.MXEF21018.mtaout03-winn.ispmail.ntl.com@aamtaout02-winn.ispmail.ntl.com> for <7781@debbugs.gnu.org>; Sun, 1 Jan 2012 21:41:45 +0000 Received: from JRWUBU2 ([82.16.217.33]) by aamtaout02-winn.ispmail.ntl.com (InterMail vG.3.00.04.00 201-2196-133-20080908) with ESMTP id <20120101214145.IIUT5924.aamtaout02-winn.ispmail.ntl.com@JRWUBU2> for <7781@debbugs.gnu.org>; Sun, 1 Jan 2012 21:41:45 +0000 Date: Sun, 1 Jan 2012 21:42:30 +0000 From: Richard Wordingham To: 7781@debbugs.gnu.org Subject: ispell problem with hunspell and UTF-8 file (and other, related hunspell problems) Message-ID: <20120101214230.25591b8c@JRWUBU2> X-Mailer: Claws Mail 3.7.4 (GTK+ 2.20.1; i486-pc-linux-gnu) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Cloudmark-Analysis: v=1.1 cv=JvdXmxIgLJv2/GthKqHpGJEEHukvLcvELVXUanXFreg= c=1 sm=0 a=TKAxtaOfLdMA:10 a=uObrxnre4hsA:10 a=Fj0c77JwVQQA:10 a=kj9zAlcOel0A:10 a=NLZqzBF-AAAA:8 a=FP58Ms26AAAA:8 a=danhDmx_AAAA:8 a=4hT6gdF-KsBtrh6LI0gA:9 a=NSw-8ghqa48zkDX2PMYA:7 a=CjuIK1q_8ugA:10 a=HpAAvcLHHh0Zw7uRqdWCyQ==:117 X-Spam-Score: -4.3 (----) X-Debbugs-Envelope-To: 7781 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.11 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -3.7 (---) Those who want to compile a bug fix in Hunspell for themselves can find fixes (based on Hunspell 1.2.8 and Emacs V23) to spell check word-separated Thai in UTF-8 from Emacs at http://homepage.ntlworld.com/richard.wordingham/thai/hunspell-1.2.8-jrw1.1.zip - the byte v. character count problem was just one of those met and resolved. The full list is: On Hunspell: Bad UTF-8 char count in pipe mode - ID: 3178449 No Encoding of Word for Suggestions in Piped Mode (https://sourceforge.net/tracker/?func=detail&aid=3468022&group_id=143754&atid=756395) Multidictionary guesses dictionary for suggestions (https://sourceforge.net/tracker/?func=detail&aid=3468039&group_id=143754&atid=756395) Hunspell 1.2.8 Groups Thai TIS-620 Chars in Lower/Upper Case Pairs (https://bugs.launchpad.net/ubuntu/+source/hunspell/+bug/910452) (fixed in Release 1.2.14) On the Thai dictionary: th_TH Affix File Inadequate for Hunspell (https://bugs.launchpad.net/ubuntu/+source/openoffice.org-dictionaries/+bug/910447) There is also a problem with the size of the window holding correction in Thai (probably depending on the choice of font); the addition of (fit-window-to-buffer) at the appropriate point in ispell.el (as in the zip file) fixes that. Richard. From debbugs-submit-bounces@debbugs.gnu.org Sat Apr 13 19:45:27 2013 Received: (at 7781) by debbugs.gnu.org; 13 Apr 2013 23:45:27 +0000 Received: from localhost ([127.0.0.1]:49303 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1URA8M-0000W2-6i for submit@debbugs.gnu.org; Sat, 13 Apr 2013 19:45:27 -0400 Received: from forward7.mail.yandex.net ([77.88.61.37]:41689) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1UR5x2-0008L1-1Z for 7781@debbugs.gnu.org; Sat, 13 Apr 2013 15:17:29 -0400 Received: from smtp7.mail.yandex.net (smtp7.mail.yandex.net [77.88.61.55]) by forward7.mail.yandex.net (Yandex) with ESMTP id 596311C0390 for <7781@debbugs.gnu.org>; Sat, 13 Apr 2013 23:12:39 +0400 (MSK) Received: from smtp7.mail.yandex.net (localhost [127.0.0.1]) by smtp7.mail.yandex.net (Yandex) with ESMTP id 3516C1580524 for <7781@debbugs.gnu.org>; Sat, 13 Apr 2013 23:12:39 +0400 (MSK) Received: from ppp95-165-99-165.pppoe.spdop.ru (ppp95-165-99-165.pppoe.spdop.ru [95.165.99.165]) by smtp7.mail.yandex.net (nwsmtp/Yandex) with ESMTP id l8xntAtQc2-CchWL8FN; Sat, 13 Apr 2013 23:12:38 +0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yandex.ru; s=mail; t=1365880359; bh=+2WCRn5ZJIZ/wrEPqM+iHUFjD7+PSpKqVRnBDyQGXTs=; h=Message-ID:Date:From:User-Agent:MIME-Version:To:Subject: Content-Type; b=v99LkM8ni70h4iQc8K2P9k612zgyHDLk5KXEE+zJ3IzYxUc8E4MCfzrx/BHd1Xvwd iFr+4ZYx3kyhP7bZgFh8aZfGPNlL7i3WeocgpSCFrsxXqqIWZOvVZ6blzrhayKffGt /em4PR+9iZDf76CGLzW+CDD/+PSn4wSDcQ23n3Mg= Authentication-Results: smtp7.mail.yandex.net; dkim=pass header.i=@yandex.ru Message-ID: <5169AE26.1000403@yandex.ru> Date: Sat, 13 Apr 2013 23:12:38 +0400 From: =?UTF-8?B?0J3QuNC60L7Qu9Cw0Lkg0KHRg9GJ0LXQvdC60L4=?= User-Agent: Mozilla/5.0 (X11; Linux i686; rv:17.0) Gecko/20130308 Thunderbird/17.0.4 MIME-Version: 1.0 To: 7781@debbugs.gnu.org Subject: [PATCH] Fix ispell problem with hunspell and UTF-8 file Content-Type: multipart/mixed; boundary="------------060404020300020106050208" X-Spam-Score: 0.1 (/) X-Debbugs-Envelope-To: 7781 X-Mailman-Approved-At: Sat, 13 Apr 2013 19:45:19 -0400 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -2.6 (--) This is a multi-part message in MIME format. --------------060404020300020106050208 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit As soon as I can see, the hunspell team haven't fixed the bug in more then 2 years. Maybe for them it is not a bug but a feature. The problem is that hunspell reports byte-position instead of char-position with multi-byte character input, while Emacs waits for char-position. With the patch attached I propose to make conversation in the ispell-parse-output function. Thanks, Nikolay Suschenko --------------060404020300020106050208 Content-Type: text/plain; charset=UTF-8; name="ispell.el.patch" Content-Transfer-Encoding: base64 Content-Disposition: attachment; filename="ispell.el.patch" LS0tIGEvbGlzcC90ZXh0bW9kZXMvaXNwZWxsLmVsCTIwMTMtMDItMjIgMTM6MDk6MjYuMDAw MDAwMDAwICswNDAwCisrKyBiL2xpc3AvdGV4dG1vZGVzL2lzcGVsbC5lbAkyMDEzLTA0LTEz IDIyOjQ2OjM1LjIwOTMyMzU4NCArMDQwMApAQCAtMjU5Nyw3ICsyNTk3LDExIEBACiAJICAo c2V0cSBjb3VudCAwKQkJOyBubyBtaXNzZXMgZm9yIHR5cGUgIwogCShzZXRxIGNvdW50IChz dHJpbmctdG8tbnVtYmVyIG91dHB1dCkgOyBnZXQgbnVtYmVyIG9mIG1pc3Nlcy4KIAkgICAg ICBvdXRwdXQgKHN1YnN0cmluZyBvdXRwdXQgKDErIChzdHJpbmctbWF0Y2ggIiAiIG91dHB1 dCAxKSkpKSkKLSAgICAgIChzZXRxIG9mZnNldCAoc3RyaW5nLXRvLW51bWJlciBvdXRwdXQp KQorICAgICAgOyBodW5zcGVsbCByZXBvcnRzIGJ5dGUtcG9zaXRpb24gaW5zdGVhZCBvZiBj aGFyLXBvc2l0aW9uCisgICAgICA7IHdpdGggbXVsdGktYnl0ZSBjaGFyYWN0ZXIgaW5wdXQK KyAgICAgIChzZXRxIG9mZnNldCAoaWYgaXNwZWxsLXJlYWxseS1odW5zcGVsbAorCQkgICAg ICAgKGJ5dGUtdG8tcG9zaXRpb24gKHN0cmluZy10by1udW1iZXIgb3V0cHV0KSkKKwkJICAg ICAoc3RyaW5nLXRvLW51bWJlciBvdXRwdXQpKSkKICAgICAgIChzZXRxIG91dHB1dCAoaWYg KGVxIHR5cGUgPyMpICAgICA7IE5vIG1pc3Mgb3IgZ3Vlc3MgbGlzdC4KICAgICAgICAgICAg ICAgICAgICAgICAgbmlsCiAgICAgICAgICAgICAgICAgICAgICAoc3Vic3RyaW5nIG91dHB1 dCAoMSsgKHN0cmluZy1tYXRjaCAiICIgb3V0cHV0IDEpKSkpKQo= --------------060404020300020106050208-- From debbugs-submit-bounces@debbugs.gnu.org Sun Apr 14 01:46:13 2013 Received: (at 7781) by debbugs.gnu.org; 14 Apr 2013 05:46:13 +0000 Received: from localhost ([127.0.0.1]:49592 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1URFlV-0007qY-Kq for submit@debbugs.gnu.org; Sun, 14 Apr 2013 01:46:13 -0400 Received: from mtaout23.012.net.il ([80.179.55.175]:61072) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1URFlS-0007qP-LF for 7781@debbugs.gnu.org; Sun, 14 Apr 2013 01:46:12 -0400 Received: from conversion-daemon.a-mtaout23.012.net.il by a-mtaout23.012.net.il (HyperSendmail v2007.08) id <0ML800400D2W0X00@a-mtaout23.012.net.il> for 7781@debbugs.gnu.org; Sun, 14 Apr 2013 08:42:05 +0300 (IDT) Received: from HOME-C4E4A596F7 ([87.69.4.28]) by a-mtaout23.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0ML8003L0D64TUB0@a-mtaout23.012.net.il>; Sun, 14 Apr 2013 08:42:05 +0300 (IDT) Date: Sun, 14 Apr 2013 08:42:11 +0300 From: Eli Zaretskii Subject: Re: bug#7781: [PATCH] Fix ispell problem with hunspell and UTF-8 file In-reply-to: <5169AE26.1000403@yandex.ru> To: =?utf-8?B?0J3QuNC60L7Qu9Cw0Lkg0KHRg9GJ0LXQvdC60L4=?= Message-id: <83wqs57jnw.fsf@gnu.org> MIME-version: 1.0 Content-type: text/plain; charset=utf-8 Content-transfer-encoding: QUOTED-PRINTABLE X-012-Sender: halo1@inter.net.il References: <87sjx9fula.fsf@sc3d.org> <5169AE26.1000403@yandex.ru> X-Spam-Score: 0.7 (/) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list Reply-To: Eli Zaretskii List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: 0.2 (/) > Date: Sat, 13 Apr 2013 23:12:38 +0400 > From: =D0=9D=D0=B8=D0=BA=D0=BE=D0=BB=D0=B0=D0=B9 =D0=A1=D1=83=D1= =89=D0=B5=D0=BD=D0=BA=D0=BE >=20 > As soon as I can see, the hunspell team haven't fixed the bug in mo= re=20 > then 2 years. Maybe for them it is not a bug but a feature. Hunspell bug resolution process could use some speedup. > The problem is that hunspell reports byte-position instead of=20 > char-position with multi-byte character input, while Emacs waits fo= r=20 > char-position. With the patch attached I propose to make conversati= on in=20 > the ispell-parse-output function. Sorry, no. I tried that initially, but this work-around has problems (don't remember the details, though). It is much better to rebuild Hunspell with this bug fixed. I can giv= e you a patch for that if you need it (I think there's a patch in the bug database as well). I fixed my hunspell long ago, and never looke= d back. Or ask your distribution's maintainers to release a fixed hunspell distro. Thanks. From debbugs-submit-bounces@debbugs.gnu.org Sun Apr 14 02:38:43 2013 Received: (at 7781) by debbugs.gnu.org; 14 Apr 2013 06:38:43 +0000 Received: from localhost ([127.0.0.1]:49636 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1URGaI-0001x6-I2 for submit@debbugs.gnu.org; Sun, 14 Apr 2013 02:38:42 -0400 Received: from forward1.mail.yandex.net ([77.88.46.6]:35507) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1URGaE-0001ws-Ju for 7781@debbugs.gnu.org; Sun, 14 Apr 2013 02:38:41 -0400 Received: from smtp4.mail.yandex.net (smtp4.mail.yandex.net [77.88.46.104]) by forward1.mail.yandex.net (Yandex) with ESMTP id 56F481240172; Sun, 14 Apr 2013 10:33:40 +0400 (MSK) Received: from smtp4.mail.yandex.net (localhost [127.0.0.1]) by smtp4.mail.yandex.net (Yandex) with ESMTP id 248335C0835; Sun, 14 Apr 2013 10:33:40 +0400 (MSK) Received: from ppp95-165-99-165.pppoe.spdop.ru (ppp95-165-99-165.pppoe.spdop.ru [95.165.99.165]) by smtp4.mail.yandex.net (nwsmtp/Yandex) with ESMTP id YvlKSGRoCD-Xd2imfab; Sun, 14 Apr 2013 10:33:39 +0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yandex.ru; s=mail; t=1365921220; bh=0h8HPve+mnDwbGcXUjKDCURd+P67kp85lbuRem7tq+4=; h=Message-ID:Date:From:User-Agent:MIME-Version:To:CC:Subject: References:In-Reply-To:Content-Type:Content-Transfer-Encoding; b=qjhVBiCStnVraH6DbMJOV7xagwl5mlSmZOp3Wt0OrnZ0I953yaUaf78KUcROjlF0V fDEeImTwJnvmtYcii0QSpboPRek9wb5IvD+KoPLyyqlH+ruBzXKNXBs+phcXlHbdVG Uvoq5fmDpfiKxn89UD41J14a2RrvJuNaUuNe700g= Authentication-Results: smtp4.mail.yandex.net; dkim=pass header.i=@yandex.ru Message-ID: <516A4DC3.90205@yandex.ru> Date: Sun, 14 Apr 2013 10:33:39 +0400 From: =?UTF-8?B?0J3QuNC60L7Qu9Cw0Lkg0KHRg9GJ0LXQvdC60L4=?= User-Agent: Mozilla/5.0 (X11; Linux i686; rv:17.0) Gecko/20130308 Thunderbird/17.0.4 MIME-Version: 1.0 To: Eli Zaretskii Subject: Re: bug#7781: [PATCH] Fix ispell problem with hunspell and UTF-8 file References: <87sjx9fula.fsf@sc3d.org> <5169AE26.1000403@yandex.ru> <83wqs57jnw.fsf@gnu.org> In-Reply-To: <83wqs57jnw.fsf@gnu.org> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Score: 0.1 (/) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -2.6 (--) Hi, Eli Please send me this patch, I'll ask the hunspell developers to include it. Could you also recall which concrete problems produces this workaround? For me it works fine, but I haven't tested it in different languages and encodings. If it is some problems, I could try to fix it, but as for now, Emacs don't work with hunspell+utf-8 at all, at the minimum in Slackware and Arch. From debbugs-submit-bounces@debbugs.gnu.org Sun Apr 14 03:12:52 2013 Received: (at 7781) by debbugs.gnu.org; 14 Apr 2013 07:12:52 +0000 Received: from localhost ([127.0.0.1]:49675 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1URH7L-0003bm-Jg for submit@debbugs.gnu.org; Sun, 14 Apr 2013 03:12:52 -0400 Received: from mtaout22.012.net.il ([80.179.55.172]:33453) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1URH7I-0003bI-0J for 7781@debbugs.gnu.org; Sun, 14 Apr 2013 03:12:49 -0400 Received: from conversion-daemon.a-mtaout22.012.net.il by a-mtaout22.012.net.il (HyperSendmail v2007.08) id <0ML800200H3LPP00@a-mtaout22.012.net.il> for 7781@debbugs.gnu.org; Sun, 14 Apr 2013 10:08:36 +0300 (IDT) Received: from HOME-C4E4A596F7 ([87.69.4.28]) by a-mtaout22.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0ML8002ECH6B5X70@a-mtaout22.012.net.il>; Sun, 14 Apr 2013 10:08:36 +0300 (IDT) Date: Sun, 14 Apr 2013 10:08:42 +0300 From: Eli Zaretskii Subject: Re: bug#7781: [PATCH] Fix ispell problem with hunspell and UTF-8 file In-reply-to: <516A4DC3.90205@yandex.ru> To: =?utf-8?B?0J3QuNC60L7Qu9Cw0Lkg0KHRg9GJ0LXQvdC60L4=?= Message-id: <83ppxx7fnp.fsf@gnu.org> MIME-version: 1.0 Content-type: text/plain; charset=utf-8 Content-transfer-encoding: QUOTED-PRINTABLE X-012-Sender: halo1@inter.net.il References: <87sjx9fula.fsf@sc3d.org> <5169AE26.1000403@yandex.ru> <83wqs57jnw.fsf@gnu.org> <516A4DC3.90205@yandex.ru> X-Spam-Score: 0.2 (/) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list Reply-To: Eli Zaretskii List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -1.2 (-) > Date: Sun, 14 Apr 2013 10:33:39 +0400 > From: =D0=9D=D0=B8=D0=BA=D0=BE=D0=BB=D0=B0=D0=B9 =D0=A1=D1=83=D1= =89=D0=B5=D0=BD=D0=BA=D0=BE > > CC: 7781@debbugs.gnu.org >=20 > Please send me this patch, I'll ask the hunspell developers to incl= ude it. Attached. This is a small part of a much larger patch, most of it fo= r Windows-specific problems. If you have problems compiling the patche= d hunspell, let me know: it could be that I omitted some hunk that is needed for this part. > Could you also recall which concrete problems produces this workaro= und?=20 > For me it works fine, but I haven't tested it in different language= s and=20 > encodings. One problem is that you assume the encoding of the communications wit= h hunspell is UTF-8, and thus matches the internal representation of text in Emacs buffers and strings (only then will byte-to-position give correct results). But that assumption is false: hunspell supports any encoding that it can convert to/from UTF-8 (it uses libiconv internally). The "usual" choice of the encoding is the one used by the dictionary. Not every dictionary out there is in UTF-8. > If it is some problems, I could try to fix it I don't think you can fix this on the Emacs side, because Emacs canno= t easily and/or quickly convert between bytes and characters in an arbitrary multibyte encoding. When I discovered this problem, I also tried fixing it on the Emacs side first, but then I realized that this kind of solution has too many problems, and instead fixed it in hunspell. --- src/tools/hunspell.cxx~0=092011-01-21 19:01:29.000000000 +0200 +++ src/tools/hunspell.cxx=092013-02-07 10:11:54.443610900 +0200 @@ -710,13 +748,22 @@ if (pos >=3D 0) { =09=09=09fflush(stdout); =09=09} else { =09=09=09char ** wlst =3D NULL; -=09=09=09int ns =3D pMS[d]->suggest(&wlst, token); +=09=09=09int byte_offset =3D parser->get_tokenpos() + pos; +=09=09=09int char_offset =3D 0; +=09=09=09if (strcmp(io_enc, "UTF-8") =3D=3D 0) { +=09=09=09=09for (int i =3D 0; i < byte_offset; i++) { +=09=09=09=09=09if ((buf[i] & 0xc0) !=3D 0x80) +=09=09=09=09=09=09char_offset++; +=09=09=09=09} +=09=09=09} else { +=09=09=09=09char_offset =3D byte_offset; +=09=09=09} +=09=09=09int ns =3D pMS[d]->suggest(&wlst, chenc(token, io_enc, dic_= enc[d])); =09=09=09if (ns =3D=3D 0) { -=09=09 =09=09fprintf(stdout,"# %s %d", token, -=09=09 =09=09 parser->get_tokenpos() + pos); +=09=09 =09=09fprintf(stdout,"# %s %d", token, char_offset); =09=09=09} else { =09=09=09=09fprintf(stdout,"& %s %d %d: ", token, ns, -=09=09=09=09 parser->get_tokenpos() + pos); +=09=09=09=09=09char_offset); =09=09=09=09fprintf(stdout,"%s", chenc(wlst[0], dic_enc[d], io_enc))= ; =09=09=09} =09=09=09for (int j =3D 1; j < ns; j++) { @@ -745,13 +792,23 @@ if (pos >=3D 0) { =09=09=09if (root) free(root); =09=09} else { =09=09=09char ** wlst =3D NULL; +=09=09=09int byte_offset =3D parser->get_tokenpos() + pos; +=09=09=09int char_offset =3D 0; +=09=09=09if (strcmp(io_enc, "UTF-8") =3D=3D 0) { +=09=09=09=09for (int i =3D 0; i < byte_offset; i++) { +=09=09=09=09=09if ((buf[i] & 0xc0) !=3D 0x80) +=09=09=09=09=09=09char_offset++; +=09=09=09=09} +=09=09=09} else { +=09=09=09=09char_offset =3D byte_offset; +=09=09=09} =09=09=09int ns =3D pMS[d]->suggest(&wlst, chenc(token, io_enc, dic_= enc[d])); =09=09=09if (ns =3D=3D 0) { =09=09 =09=09fprintf(stdout,"# %s %d", chenc(token, io_enc, ui_en= c), -=09=09 =09=09 parser->get_tokenpos() + pos); +=09=09 =09=09 char_offset); =09=09=09} else { =09=09=09=09fprintf(stdout,"& %s %d %d: ", chenc(token, io_enc, ui_e= nc), ns, -=09=09=09=09 parser->get_tokenpos() + pos); +=09=09=09=09 char_offset); =09=09=09=09fprintf(stdout,"%s", chenc(wlst[0], dic_enc[d], ui_enc))= ; =09=09=09} =09=09=09for (int j =3D 1; j < ns; j++) { From debbugs-submit-bounces@debbugs.gnu.org Sat Apr 20 14:48:11 2013 Received: (at 7781) by debbugs.gnu.org; 20 Apr 2013 18:48:11 +0000 Received: from localhost ([127.0.0.1]:33867 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1UTcpW-00020U-OG for submit@debbugs.gnu.org; Sat, 20 Apr 2013 14:48:11 -0400 Received: from forward6.mail.yandex.net ([77.88.60.125]:47065) by debbugs.gnu.org with esmtp (Exim 4.72) (envelope-from ) id 1UTcpS-00020F-QU for 7781@debbugs.gnu.org; Sat, 20 Apr 2013 14:48:09 -0400 Received: from smtp8.mail.yandex.net (smtp8.mail.yandex.net [77.88.61.54]) by forward6.mail.yandex.net (Yandex) with ESMTP id 8D8E31120883; Sat, 20 Apr 2013 22:43:20 +0400 (MSK) Received: from smtp8.mail.yandex.net (localhost [127.0.0.1]) by smtp8.mail.yandex.net (Yandex) with ESMTP id 569C91B6001D; Sat, 20 Apr 2013 22:43:20 +0400 (MSK) Received: from ppp95-165-126-4.pppoe.spdop.ru (ppp95-165-126-4.pppoe.spdop.ru [95.165.126.4]) by smtp8.mail.yandex.net (nwsmtp/Yandex) with ESMTP id d6cD8vPIl6-hJtS9FXM; Sat, 20 Apr 2013 22:43:19 +0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yandex.ru; s=mail; t=1366483400; bh=DOLE775lXKdax0ll9MUGeuirnrT/zD7cjrRUnZhy9xo=; h=Message-ID:Date:From:User-Agent:MIME-Version:To:CC:Subject: References:In-Reply-To:Content-Type:Content-Transfer-Encoding; b=ukzu0fZo+Shu5SepzrQCjK+0jOzkS5TgtqFO2p1O67LtZLWcUgc3bIyI66gNlDkSt vV2Z94hi6KyKnIwsGm+AMQOBWIp+FS4MVjMpkx5YPPJIVxtTABuGgErpRYh7sikU8l ioZJDcI7EY9GCCuAyoaqgQ42+xH9oyaofZVkPpw0= Authentication-Results: smtp8.mail.yandex.net; dkim=pass header.i=@yandex.ru Message-ID: <5172E1C7.1070008@yandex.ru> Date: Sat, 20 Apr 2013 22:43:19 +0400 From: =?UTF-8?B?0J3QuNC60L7Qu9Cw0Lkg0KHRg9GJ0LXQvdC60L4=?= User-Agent: Mozilla/5.0 (X11; Linux i686; rv:17.0) Gecko/20130308 Thunderbird/17.0.4 MIME-Version: 1.0 To: Eli Zaretskii Subject: Re: bug#7781: [PATCH] Fix ispell problem with hunspell and UTF-8 file References: <87sjx9fula.fsf@sc3d.org> <5169AE26.1000403@yandex.ru> <83wqs57jnw.fsf@gnu.org> <516A4DC3.90205@yandex.ru> <83ppxx7fnp.fsf@gnu.org> In-Reply-To: <83ppxx7fnp.fsf@gnu.org> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Score: 0.1 (/) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.13 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: debbugs-submit-bounces@debbugs.gnu.org Errors-To: debbugs-submit-bounces@debbugs.gnu.org X-Spam-Score: -2.6 (--) Thank you, for me this patch worked well. However, somebody have already proposed another patch: https://sourceforge.net/tracker/?func=detail&aid=3610147&group_id=143754&atid=756397 From debbugs-submit-bounces@debbugs.gnu.org Sun Apr 27 17:30:36 2014 Received: (at 7781) by debbugs.gnu.org; 27 Apr 2014 21:30:36 +0000 Received: from localhost ([127.0.0.1]:43295 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1WeWeh-0002cH-QP for submit@debbugs.gnu.org; Sun, 27 Apr 2014 17:30:36 -0400 Received: from smtp12.smtpout.orange.fr ([80.12.242.134]:44066 helo=smtp.smtpout.orange.fr) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1WeWee-0002c2-6W for 7781@debbugs.gnu.org; Sun, 27 Apr 2014 17:30:32 -0400 Received: from micropit.couberia.selfip.net ([2.2.212.209]) by mwinf5d35 with ME id v9WR1n00B4Xdq4q039WR74; Sun, 27 Apr 2014 23:30:26 +0200 X-ME-Helo: micropit.couberia.selfip.net X-ME-Auth: ZHJwbUB3YW5hZG9vLmZy X-ME-Date: Sun, 27 Apr 2014 23:30:26 +0200 X-ME-IP: 2.2.212.209 Received: by micropit.couberia.selfip.net (Postfix, from userid 1000) id 50088A60C6C; Sun, 27 Apr 2014 23:30:25 +0200 (CEST) From: =?utf-8?Q?Peter_M=C3=BCnster?= To: 7781@debbugs.gnu.org Subject: hunspell and latex-mode Date: Sun, 27 Apr 2014 23:30:25 +0200 Message-ID: <87y4yqh34e.fsf@micropit.couberia.selfip.net> User-Agent: Gnus/5.13001 (=?utf-8?Q?=E7=9C=9F?= Gnus v0.10) Emacs/24.4.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 7781 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 0.0 (/) Hi, I'm using a patched hunspell (http://sourceforge.net/p/hunspell/patches/57/) and it works well with text-mode and message-mode. But unfortunately it does not work with context-mode or latex-mode. Example: --8<---------------cut here---------------start------------->8--- \documentclass{article} \begin{document} bla \end{document} --8<---------------cut here---------------end--------------->8--- Running ispell fails with this error: ispell-process-line: Ispell misalignment: word `bla' point 41; probably incompatible versions Do you know a solution? I'm using bzr emacs and git auctex. TIA for any help, -- Peter From debbugs-submit-bounces@debbugs.gnu.org Mon Apr 28 11:37:06 2014 Received: (at 7781) by debbugs.gnu.org; 28 Apr 2014 15:37:06 +0000 Received: from localhost ([127.0.0.1]:44028 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wenc9-0007P0-JZ for submit@debbugs.gnu.org; Mon, 28 Apr 2014 11:37:05 -0400 Received: from mtaout28.012.net.il ([80.179.55.184]:44201) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wenc6-0007OH-IQ for 7781@debbugs.gnu.org; Mon, 28 Apr 2014 11:37:03 -0400 Received: from conversion-daemon.mtaout28.012.net.il by mtaout28.012.net.il (HyperSendmail v2007.08) id <0N4Q00100YZY0Y00@mtaout28.012.net.il> for 7781@debbugs.gnu.org; Mon, 28 Apr 2014 18:35:40 +0300 (IDT) Received: from HOME-C4E4A596F7 ([87.69.4.28]) by mtaout28.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0N4Q001W2ZBGVI00@mtaout28.012.net.il>; Mon, 28 Apr 2014 18:35:40 +0300 (IDT) Date: Mon, 28 Apr 2014 18:37:01 +0300 From: Eli Zaretskii Subject: Re: bug#7781: hunspell and latex-mode In-reply-to: <87y4yqh34e.fsf@micropit.couberia.selfip.net> X-012-Sender: halo1@inter.net.il To: Peter =?iso-8859-1?Q?M=FCnster?= Message-id: <83eh0ho482.fsf@gnu.org> MIME-version: 1.0 Content-type: text/plain; charset=iso-8859-1 Content-transfer-encoding: 8BIT References: <87sjx9fula.fsf@sc3d.org> <87y4yqh34e.fsf@micropit.couberia.selfip.net> X-Spam-Score: 1.0 (+) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list Reply-To: Eli Zaretskii List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 1.0 (+) > From: Peter Münster > Date: Sun, 27 Apr 2014 23:30:25 +0200 > > I'm using a patched hunspell > (http://sourceforge.net/p/hunspell/patches/57/) and it works well with > text-mode and message-mode. But unfortunately it does not work with > context-mode or latex-mode. > > Example: > > --8<---------------cut here---------------start------------->8--- > \documentclass{article} > \begin{document} > bla > \end{document} > --8<---------------cut here---------------end--------------->8--- > > Running ispell fails with this error: > > ispell-process-line: Ispell misalignment: word `bla' point 41; probably incompatible versions I cannot reproduce this. If I start "emacs -Q" and try spell-checking your example (with Hunspell being the speller), it works just fine for me: I get suggestions to replace "bla". Same thing if I load AUCTeX into "emacs -Q" (does AUCTeX even change anything about spell-checking?). Does this work for you in "emacs -Q"? If so, I suggest to review your customizations to look for those which somehow cause this. If "emacs -Q" doesn't work either, please provide a detailed reproduction recipe starting from "emacs -Q". From debbugs-submit-bounces@debbugs.gnu.org Mon Apr 28 12:18:23 2014 Received: (at 7781) by debbugs.gnu.org; 28 Apr 2014 16:18:23 +0000 Received: from localhost ([127.0.0.1]:44046 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1WeoG5-00006p-HC for submit@debbugs.gnu.org; Mon, 28 Apr 2014 12:18:22 -0400 Received: from smtp05.smtpout.orange.fr ([80.12.242.127]:19659 helo=smtp.smtpout.orange.fr) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1WeoG1-00006U-Sd for 7781@debbugs.gnu.org; Mon, 28 Apr 2014 12:18:18 -0400 Received: from micropit.couberia.selfip.net ([2.2.212.209]) by mwinf5d10 with ME id vUJA1n00Z4Xdq4q03UJBHD; Mon, 28 Apr 2014 18:18:11 +0200 X-ME-Helo: micropit.couberia.selfip.net X-ME-Auth: ZHJwbUB3YW5hZG9vLmZy X-ME-Date: Mon, 28 Apr 2014 18:18:11 +0200 X-ME-IP: 2.2.212.209 Received: by micropit.couberia.selfip.net (Postfix, from userid 1000) id 7814FA60C6C; Mon, 28 Apr 2014 18:18:10 +0200 (CEST) From: =?utf-8?Q?Peter_M=C3=BCnster?= To: Eli Zaretskii Subject: Re: bug#7781: hunspell and latex-mode References: <87sjx9fula.fsf@sc3d.org> <87y4yqh34e.fsf@micropit.couberia.selfip.net> <83eh0ho482.fsf@gnu.org> Date: Mon, 28 Apr 2014 18:18:10 +0200 Message-ID: <87mwf5fmwt.fsf@micropit.couberia.selfip.net> User-Agent: Gnus/5.13001 (=?utf-8?Q?=E7=9C=9F?= Gnus v0.10) Emacs/24.4.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 0.0 (/) On Mon, Apr 28 2014, Eli Zaretskii wrote: > I cannot reproduce this. If I start "emacs -Q" and try spell-checking > your example (with Hunspell being the speller), it works just fine for > me: I get suggestions to replace "bla". Same thing if I load AUCTeX > into "emacs -Q" (does AUCTeX even change anything about > spell-checking?). Hi Eli, It's not AUCTeX, I've just tested with normal latex-mode. > If "emacs -Q" doesn't work either, please provide a detailed > reproduction recipe starting from "emacs -Q". Here a reproduction recipe: - create minimal latex file /tmp/test.tex - start emacs: LANG=C emacs -Q --eval '(setq ispell-program-name "hunspell")' /tmp/test.tex - M-x ispell Here are more details about my system: In GNU Emacs 24.4.50.2 (x86_64-suse-linux-gnu, GTK+ Version 3.10.4) of 2014-04-20 on micropit Repository revision: 116996 dancol@dancol.org-20140420144613-8e4t4swlxauwl4w7 Windowing system distributor `The X.Org Foundation', version 11.0.11403901 System Description: openSUSE 13.1 (Bottle) (x86_64) Configured using: `configure --without-toolkit-scroll-bars' Configured features: XPM JPEG TIFF GIF PNG RSVG IMAGEMAGICK SOUND GPM DBUS GCONF GSETTINGS NOTIFY LIBSELINUX LIBXML2 FREETYPE M17N_FLT LIBOTF XFT ZLIB Important settings: value of $LANG: C value of $XMODIFIERS: @im=ibus locale-coding-system: nil Major mode: LaTeX Minor modes in effect: shell-dirtrack-mode: t tooltip-mode: t electric-indent-mode: t mouse-wheel-mode: t tool-bar-mode: t menu-bar-mode: t file-name-shadow-mode: t global-font-lock-mode: t font-lock-mode: t blink-cursor-mode: t auto-composition-mode: t auto-encryption-mode: t auto-compression-mode: t line-number-mode: t transient-mark-mode: t Recent input: M-x i s p M-x r e p o r t - e m Recent messages: For information about GNU Emacs and the GNU system, type C-h C-a. Starting new Ispell process hunspell with default dictionary... Spell-checking test.tex using hunspell with default dictionary...done ispell-process-line: Ispell misalignment: word `bla' point 41; probably incompatible versions Load-path shadows: None found. Features: (shadow sort gnus-util mail-extr emacsbug message dired format-spec rfc822 mml easymenu mml-sec mm-decode mm-bodies mm-encode mail-parse rfc2231 mailabbrev gmm-utils mailheader sendmail rfc2047 rfc2045 ietf-drums mm-util help-fns mail-prsvr mail-utils ispell tex-mode compile shell pcomplete comint ansi-color ring latexenc time-date tooltip electric uniquify ediff-hook vc-hooks lisp-float-type mwheel x-win x-dnd tool-bar dnd fontset image regexp-opt fringe tabulated-list newcomment lisp-mode prog-mode register page menu-bar rfn-eshadow timer select scroll-bar mouse jit-lock font-lock syntax facemenu font-core frame cham georgian utf-8-lang misc-lang vietnamese tibetan thai tai-viet lao korean japanese hebrew greek romanian slovak czech european ethiopic indian cyrillic chinese case-table epa-hook jka-cmpr-hook help simple abbrev minibuffer nadvice loaddefs button faces cus-face macroexp files text-properties overlay sha1 md5 base64 format env code-pages mule custom widget hashtable-print-readable backquote make-network-process dbusbind gfilenotify dynamic-setting system-font-setting font-render-setting move-toolbar gtk x-toolkit x multi-tty emacs) Memory information: ((conses 16 87695 6922) (symbols 48 19137 0) (miscs 40 44 125) (strings 32 14709 4542) (string-bytes 1 418678) (vectors 16 10601) (vector-slots 8 389507 5806) (floats 8 67 64) (intervals 56 250 165) (buffers 960 13) (heap 1024 42866 735)) -- Peter From debbugs-submit-bounces@debbugs.gnu.org Mon Apr 28 12:48:04 2014 Received: (at 7781) by debbugs.gnu.org; 28 Apr 2014 16:48:04 +0000 Received: from localhost ([127.0.0.1]:44059 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Weoip-000130-Rw for submit@debbugs.gnu.org; Mon, 28 Apr 2014 12:48:04 -0400 Received: from mtaout20.012.net.il ([80.179.55.166]:59145) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Weoim-00012N-Uv for 7781@debbugs.gnu.org; Mon, 28 Apr 2014 12:48:01 -0400 Received: from conversion-daemon.a-mtaout20.012.net.il by a-mtaout20.012.net.il (HyperSendmail v2007.08) id <0N4R00H002GN9700@a-mtaout20.012.net.il> for 7781@debbugs.gnu.org; Mon, 28 Apr 2014 19:47:54 +0300 (IDT) Received: from HOME-C4E4A596F7 ([87.69.4.28]) by a-mtaout20.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0N4R00G2W2NUVI70@a-mtaout20.012.net.il>; Mon, 28 Apr 2014 19:47:54 +0300 (IDT) Date: Mon, 28 Apr 2014 19:48:00 +0300 From: Eli Zaretskii Subject: Re: bug#7781: hunspell and latex-mode In-reply-to: <87mwf5fmwt.fsf@micropit.couberia.selfip.net> X-012-Sender: halo1@inter.net.il To: Peter =?iso-8859-1?Q?M=FCnster?= Message-id: <838uqpo0xr.fsf@gnu.org> MIME-version: 1.0 Content-type: text/plain; charset=iso-8859-1 Content-transfer-encoding: 8BIT References: <87sjx9fula.fsf@sc3d.org> <87y4yqh34e.fsf@micropit.couberia.selfip.net> <83eh0ho482.fsf@gnu.org> <87mwf5fmwt.fsf@micropit.couberia.selfip.net> X-Spam-Score: 1.0 (+) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list Reply-To: Eli Zaretskii List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 1.0 (+) > From: Peter Münster > Cc: 7781@debbugs.gnu.org > Date: Mon, 28 Apr 2014 18:18:10 +0200 > > - create minimal latex file /tmp/test.tex > - start emacs: > LANG=C emacs -Q --eval '(setq ispell-program-name "hunspell")' /tmp/test.tex > - M-x ispell Works fine for me, sorry. Maybe your Hunspell is not patched enough. Mine has much more patches than the one you mentioned. Most of them are Windows-specific or related to encoding/decoding non-ASCII characters, something that doesn't sound relevant for your use case. But who knows? you might take a look at the file DIFFS in this archive, where you will find all the changes I made to Hunspell: http://sourceforge.net/projects/ezwinports/files/hunspell-1.3.2-3-w32-src.zip/download Or maybe wait for someone on Unix to try reproducing your recipe. One other idea is to try spell-checking your sample file outside of Emacs, maybe you will see something that will give some ideas. Finally, are you sure the 'hunspell' executable Emacs finds on PATH is indeed the one you intend? (Try putting a full absolute file name into ispell-program-name.) From debbugs-submit-bounces@debbugs.gnu.org Mon Apr 28 13:17:47 2014 Received: (at 7781) by debbugs.gnu.org; 28 Apr 2014 17:17:47 +0000 Received: from localhost ([127.0.0.1]:44072 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1WepBa-0001xz-I0 for submit@debbugs.gnu.org; Mon, 28 Apr 2014 13:17:47 -0400 Received: from smtp05.smtpout.orange.fr ([80.12.242.127]:17991 helo=smtp.smtpout.orange.fr) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1WepBX-0001xf-Op for 7781@debbugs.gnu.org; Mon, 28 Apr 2014 13:17:44 -0400 Received: from micropit.couberia.selfip.net ([2.2.212.209]) by mwinf5d10 with ME id vVHc1n00J4Xdq4q03VHdoR; Mon, 28 Apr 2014 19:17:37 +0200 X-ME-Helo: micropit.couberia.selfip.net X-ME-Auth: ZHJwbUB3YW5hZG9vLmZy X-ME-Date: Mon, 28 Apr 2014 19:17:37 +0200 X-ME-IP: 2.2.212.209 Received: by micropit.couberia.selfip.net (Postfix, from userid 1000) id 9126AA60C6C; Mon, 28 Apr 2014 19:17:36 +0200 (CEST) From: =?utf-8?Q?Peter_M=C3=BCnster?= To: Eli Zaretskii Subject: Re: bug#7781: hunspell and latex-mode References: <87sjx9fula.fsf@sc3d.org> <87y4yqh34e.fsf@micropit.couberia.selfip.net> <83eh0ho482.fsf@gnu.org> <87mwf5fmwt.fsf@micropit.couberia.selfip.net> <838uqpo0xr.fsf@gnu.org> Date: Mon, 28 Apr 2014 19:17:36 +0200 Message-ID: <877g69fk5r.fsf@micropit.couberia.selfip.net> User-Agent: Gnus/5.13001 (=?utf-8?Q?=E7=9C=9F?= Gnus v0.10) Emacs/24.4.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 0.0 (/) On Mon, Apr 28 2014, Eli Zaretskii wrote: > Maybe your Hunspell is not patched enough. Perhaps. > Mine has much more patches than the one you mentioned. Most of them > are Windows-specific or related to encoding/decoding non-ASCII > characters, something that doesn't sound relevant for your use case. > But who knows? you might take a look at the file DIFFS in this > archive, where you will find all the changes I made to Hunspell: > > http://sourceforge.net/projects/ezwinports/files/hunspell-1.3.2-3-w32-src.zip/download Indeed. I'll take a look when I have some more time. > Or maybe wait for someone on Unix to try reproducing your recipe. Yes, let's see. > One other idea is to try spell-checking your sample file outside of > Emacs, maybe you will see something that will give some ideas. No. Here is the result: --8<---------------cut here---------------start------------->8--- $ hunspell -a -d en_US -i UTF-8 /tmp/test.tex @(#) International Ispell Version 3.2.06 (but really Hunspell 1.3.2) & documentclass 8 1: document class, document-class, documentations, documentation, documents, documentary, underclassmen, underclassman * * * & bla 15 0: alb, bl, la, blat, bola, blag, blah, blab, lab, baa, bra, boa, Ila, Ala, Ola * * --8<---------------cut here---------------end--------------->8--- > Finally, are you sure the 'hunspell' executable Emacs finds on PATH is > indeed the one you intend? Yes. And after switching to "M-x text-mode", there is no more problem. -- Peter From debbugs-submit-bounces@debbugs.gnu.org Mon Apr 28 13:32:18 2014 Received: (at 7781) by debbugs.gnu.org; 28 Apr 2014 17:32:18 +0000 Received: from localhost ([127.0.0.1]:44080 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1WepPd-0002Qo-UK for submit@debbugs.gnu.org; Mon, 28 Apr 2014 13:32:18 -0400 Received: from mtaout22.012.net.il ([80.179.55.172]:43797) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1WepPY-0002QV-TD for 7781@debbugs.gnu.org; Mon, 28 Apr 2014 13:32:14 -0400 Received: from conversion-daemon.a-mtaout22.012.net.il by a-mtaout22.012.net.il (HyperSendmail v2007.08) id <0N4R008004M36M00@a-mtaout22.012.net.il> for 7781@debbugs.gnu.org; Mon, 28 Apr 2014 20:32:06 +0300 (IDT) Received: from HOME-C4E4A596F7 ([87.69.4.28]) by a-mtaout22.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0N4R007S34PHUC70@a-mtaout22.012.net.il>; Mon, 28 Apr 2014 20:32:06 +0300 (IDT) Date: Mon, 28 Apr 2014 20:32:11 +0300 From: Eli Zaretskii Subject: Re: bug#7781: hunspell and latex-mode In-reply-to: <877g69fk5r.fsf@micropit.couberia.selfip.net> X-012-Sender: halo1@inter.net.il To: Peter =?iso-8859-1?Q?M=FCnster?= Message-id: <837g69nyw4.fsf@gnu.org> MIME-version: 1.0 Content-type: text/plain; charset=iso-8859-1 Content-transfer-encoding: 8BIT References: <87sjx9fula.fsf@sc3d.org> <87y4yqh34e.fsf@micropit.couberia.selfip.net> <83eh0ho482.fsf@gnu.org> <87mwf5fmwt.fsf@micropit.couberia.selfip.net> <838uqpo0xr.fsf@gnu.org> <877g69fk5r.fsf@micropit.couberia.selfip.net> X-Spam-Score: 1.0 (+) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list Reply-To: Eli Zaretskii List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 1.0 (+) > From: Peter Münster > Cc: 7781@debbugs.gnu.org > Date: Mon, 28 Apr 2014 19:17:36 +0200 > > after switching to "M-x text-mode", there is no more problem. Maybe you should activate the debugging code in ispell.el and see what is being submitted to hunspell and what it returns. From debbugs-submit-bounces@debbugs.gnu.org Mon Apr 28 14:27:42 2014 Received: (at 7781) by debbugs.gnu.org; 28 Apr 2014 18:27:42 +0000 Received: from localhost ([127.0.0.1]:44101 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1WeqHF-000492-OW for submit@debbugs.gnu.org; Mon, 28 Apr 2014 14:27:42 -0400 Received: from smtp05.smtpout.orange.fr ([80.12.242.127]:41804 helo=smtp.smtpout.orange.fr) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1WeqHD-00048h-Am for 7781@debbugs.gnu.org; Mon, 28 Apr 2014 14:27:40 -0400 Received: from micropit.couberia.selfip.net ([2.2.212.209]) by mwinf5d09 with ME id vWTY1n00N4Xdq4q03WTYYq; Mon, 28 Apr 2014 20:27:33 +0200 X-ME-Helo: micropit.couberia.selfip.net X-ME-Auth: ZHJwbUB3YW5hZG9vLmZy X-ME-Date: Mon, 28 Apr 2014 20:27:33 +0200 X-ME-IP: 2.2.212.209 Received: by micropit.couberia.selfip.net (Postfix, from userid 1000) id D2E0FA60C6C; Mon, 28 Apr 2014 20:27:31 +0200 (CEST) From: =?utf-8?Q?Peter_M=C3=BCnster?= To: Eli Zaretskii Subject: Re: bug#7781: hunspell and latex-mode References: <87sjx9fula.fsf@sc3d.org> <87y4yqh34e.fsf@micropit.couberia.selfip.net> <83eh0ho482.fsf@gnu.org> <87mwf5fmwt.fsf@micropit.couberia.selfip.net> <838uqpo0xr.fsf@gnu.org> <877g69fk5r.fsf@micropit.couberia.selfip.net> <837g69nyw4.fsf@gnu.org> Date: Mon, 28 Apr 2014 20:27:31 +0200 Message-ID: <87vbtte2cs.fsf@micropit.couberia.selfip.net> User-Agent: Gnus/5.13001 (=?utf-8?Q?=E7=9C=9F?= Gnus v0.10) Emacs/24.4.50 (gnu/linux) MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="=-=-=" X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 0.0 (/) --=-=-= Content-Type: text/plain On Mon, Apr 28 2014, Eli Zaretskii wrote: >> after switching to "M-x text-mode", there is no more problem. > > Maybe you should activate the debugging code in ispell.el and see what > is being submitted to hunspell and what it returns. Please find attached 2 debug-outputs, one with latex-mode and one with text-mode. Both are created with `ispell-buffer-with-debug'. Do you see, what is going on there? -- Peter --=-=-= Content-Type: text/plain Content-Disposition: attachment; filename=ispell-debug-latex.txt -- ispell-fhd: Skipping entry: SEARCH PATH: -- ispell-fhd: Skipping entry: .::/usr/share/hunspell:/usr/share/myspell:/usr/share/myspell/dicts:/Library/Spelling:/home/peter/.openoffice.org/3/user/wordbook:.openoffice.org2/user/wordbook:.openoffice.org2.0/user/wordbook:Library/Spelling:/opt/openoffice.org/basis3.0/share/dict/ooo:/usr/lib/openoffice.org/basis3.0/share/dict/ooo:/opt/openoffice.org2.4/share/dict/ooo:/usr/lib/openoffice.org2.4/share/dict/ooo:/opt/openoffice.org2.3/share/dict/ooo:/usr/lib/openoffice.org2.3/share/dict/ooo:/opt/openoffice.org2.2/share/dict/ooo:/usr/lib/openoffice.org2.2/share/dict/ooo:/opt/openoffice.org2.1/share/dict/ooo:/usr/lib/openoffice.org2.1/share/dict/ooo:/opt/openoffice.org2.0/share/dict/ooo:/usr/lib/openoffice.org2.0/share/dict/ooo -- ispell-fhd: Skipping entry: AVAILABLE DICTIONARIES (path is not mandatory for -d option): ++ ispell-fhd: dict-entry:/usr/share/myspell/fr_FR name:fr_FR basename:fr_FR affix-file:/usr/share/myspell/fr_FR.aff ++ ispell-fhd: dict-entry:/usr/share/myspell/en_US name:en_US basename:en_US affix-file:/usr/share/myspell/en_US.aff ++ ispell-fhd: dict-entry:/usr/share/myspell/es_ES name:es_ES basename:es_ES affix-file:/usr/share/myspell/es_ES.aff ++ ispell-fhd: dict-entry:/usr/share/myspell/en_GB name:en_GB basename:en_GB affix-file:/usr/share/myspell/en_GB.aff ++ ispell-fhd: dict-entry:/usr/share/myspell/de_DE name:de_DE basename:de_DE affix-file:/usr/share/myspell/de_DE.aff -- ispell-fhd: Skipping entry: LOADED DICTIONARY: -- ispell-fhd: Skipping entry: /usr/share/myspell/en_US.dic -- ispell-fhd: Skipping entry: Hunspell 1.3.2 ++ ispell-fhd: Adding alias german8 -> /usr/share/myspell/de_DE.aff. ++ ispell-fhd: Adding alias german -> /usr/share/myspell/de_DE.aff. ++ ispell-fhd: Adding alias francais -> /usr/share/myspell/fr_FR.aff. ++ ispell-fhd: Adding alias english -> /usr/share/myspell/en_US.aff. ++ ispell-fhd: Adding alias deutsch8 -> /usr/share/myspell/de_DE.aff. ++ ispell-fhd: Adding alias deutsch -> /usr/share/myspell/de_DE.aff. ++ ispell-fhd: Adding alias castellano8 -> /usr/share/myspell/es_ES.aff. ++ ispell-fhd: Adding alias castellano -> /usr/share/myspell/es_ES.aff. ++ ispell-fhd: Adding alias british -> /usr/share/myspell/en_GB.aff. ++ ispell-fhd: Adding alias american -> /usr/share/myspell/en_US.aff. ispell-region: (ispell-skip-region-list): ((\\addcontentsline ispell-tex-arg-end 2) (\\add\(tocontents\|vspace\) ispell-tex-arg-end) (\\\([aA]lph\|arabic\) ispell-tex-arg-end) (\\bibliographystyle ispell-tex-arg-end) (\\makebox ispell-tex-arg-end 0) (\\e?psfig ispell-tex-arg-end) (\\document\(class\|style\) . \\begin[ ]*{[ ]*document[ ]*}) (\(figure\|table\)\*? ispell-tex-arg-end 0) (list ispell-tex-arg-end 2) (program . \\end[ ]*{[ ]*program[ ]*}) (verbatim\*? . \\end[ ]*{[ ]*verbatim\*?[ ]*}) (ispell-words-keyword forward-line) (ispell-dictionary-keyword forward-line) (ispell-pdict-keyword forward-line) (ispell-parsing-keyword forward-line) (^---*BEGIN PGP [A-Z ]*--* . ^---*END PGP [A-Z ]*--*) (^begin [0-9][0-9][0-9] [^ ]+$ . end ) (^%!PS-Adobe-[123].0 . %%EOF ) (^---* \(Start of \)?[Ff]orwarded [Mm]essage . ^---* End of [Ff]orwarded [Mm]essage) (\(--+\|_+\|\(/\w\|\(\(\w\|[-_]\)+[.:@]\)\)\(\w\|[-_]\)*\([.:/@]+\(\w\|[-_~=?&]\)+\)+\))) ispell-region: (ispell-begin-skip-region-regexp): \\addcontentsline\|\\add\(tocontents\|vspace\)\|\\\([aA]lph\|arabic\)\|\\bibliographystyle\|\\makebox\|\\e?psfig\|\\document\(class\|style\)\|\\begin[ ]*{[ ]*\(figure\|table\)\*?[ ]*}\|\\begin[ ]*{[ ]*list[ ]*}\|\\begin[ ]*{[ ]*program[ ]*}\|\\begin[ ]*{[ ]*verbatim\*?[ ]*}\|LocalWords: \|Local IspellDict: \|Local IspellPersDict: \|Local IspellParsing: \|^---*BEGIN PGP [A-Z ]*--*\|^begin [0-9][0-9][0-9] [^ ]+$\|^%!PS-Adobe-[123].0\|^---* \(Start of \)?[Ff]orwarded [Mm]essage\|\(--+\|_+\|\(/\w\|\(\(\w\|[-_]\)+[.:@]\)\)\(\w\|[-_]\)*\([.:/@]+\(\w\|[-_~=?&]\)+\)+\) ispell-region: Search for first region to skip after (ispell-begin-skip-region-regexp) ispell-region: First skip: \documentclass at (pos,line,column): (1,1,0). ispell-region: Continue spell-checking with hunspell and default dictionary... ispell-region: string pos (41->41), eol: 45, [in-comment]: [nil], [add-comment]: [nil], [string]: [nil] ispell-region: string pos (42->45), eol: 45, [in-comment]: [nil], [add-comment]: [nil], [string]: [^bla ] ispell-process-line: Ispell misalignment error: [Word from ispell pipe]: [bla], actual (point,line,column): (41,2,16) --=-=-= Content-Type: text/plain Content-Disposition: attachment; filename=ispell-debug-text.txt -- ispell-fhd: Skipping entry: SEARCH PATH: -- ispell-fhd: Skipping entry: .::/usr/share/hunspell:/usr/share/myspell:/usr/share/myspell/dicts:/Library/Spelling:/home/peter/.openoffice.org/3/user/wordbook:.openoffice.org2/user/wordbook:.openoffice.org2.0/user/wordbook:Library/Spelling:/opt/openoffice.org/basis3.0/share/dict/ooo:/usr/lib/openoffice.org/basis3.0/share/dict/ooo:/opt/openoffice.org2.4/share/dict/ooo:/usr/lib/openoffice.org2.4/share/dict/ooo:/opt/openoffice.org2.3/share/dict/ooo:/usr/lib/openoffice.org2.3/share/dict/ooo:/opt/openoffice.org2.2/share/dict/ooo:/usr/lib/openoffice.org2.2/share/dict/ooo:/opt/openoffice.org2.1/share/dict/ooo:/usr/lib/openoffice.org2.1/share/dict/ooo:/opt/openoffice.org2.0/share/dict/ooo:/usr/lib/openoffice.org2.0/share/dict/ooo -- ispell-fhd: Skipping entry: AVAILABLE DICTIONARIES (path is not mandatory for -d option): ++ ispell-fhd: dict-entry:/usr/share/myspell/fr_FR name:fr_FR basename:fr_FR affix-file:/usr/share/myspell/fr_FR.aff ++ ispell-fhd: dict-entry:/usr/share/myspell/en_US name:en_US basename:en_US affix-file:/usr/share/myspell/en_US.aff ++ ispell-fhd: dict-entry:/usr/share/myspell/es_ES name:es_ES basename:es_ES affix-file:/usr/share/myspell/es_ES.aff ++ ispell-fhd: dict-entry:/usr/share/myspell/en_GB name:en_GB basename:en_GB affix-file:/usr/share/myspell/en_GB.aff ++ ispell-fhd: dict-entry:/usr/share/myspell/de_DE name:de_DE basename:de_DE affix-file:/usr/share/myspell/de_DE.aff -- ispell-fhd: Skipping entry: LOADED DICTIONARY: -- ispell-fhd: Skipping entry: /usr/share/myspell/en_US.dic -- ispell-fhd: Skipping entry: Hunspell 1.3.2 ++ ispell-fhd: Adding alias german8 -> /usr/share/myspell/de_DE.aff. ++ ispell-fhd: Adding alias german -> /usr/share/myspell/de_DE.aff. ++ ispell-fhd: Adding alias francais -> /usr/share/myspell/fr_FR.aff. ++ ispell-fhd: Adding alias english -> /usr/share/myspell/en_US.aff. ++ ispell-fhd: Adding alias deutsch8 -> /usr/share/myspell/de_DE.aff. ++ ispell-fhd: Adding alias deutsch -> /usr/share/myspell/de_DE.aff. ++ ispell-fhd: Adding alias castellano8 -> /usr/share/myspell/es_ES.aff. ++ ispell-fhd: Adding alias castellano -> /usr/share/myspell/es_ES.aff. ++ ispell-fhd: Adding alias british -> /usr/share/myspell/en_GB.aff. ++ ispell-fhd: Adding alias american -> /usr/share/myspell/en_US.aff. ispell-region: (ispell-skip-region-list): ((ispell-words-keyword forward-line) (ispell-dictionary-keyword forward-line) (ispell-pdict-keyword forward-line) (ispell-parsing-keyword forward-line) (^---*BEGIN PGP [A-Z ]*--* . ^---*END PGP [A-Z ]*--*) (^begin [0-9][0-9][0-9] [^ ]+$ . end ) (^%!PS-Adobe-[123].0 . %%EOF ) (^---* \(Start of \)?[Ff]orwarded [Mm]essage . ^---* End of [Ff]orwarded [Mm]essage) (\(--+\|_+\|\(/\w\|\(\(\w\|[-_]\)+[.:@]\)\)\(\w\|[-_]\)*\([.:/@]+\(\w\|[-_~=?&]\)+\)+\))) ispell-region: (ispell-begin-skip-region-regexp): LocalWords: \|Local IspellDict: \|Local IspellPersDict: \|Local IspellParsing: \|^---*BEGIN PGP [A-Z ]*--*\|^begin [0-9][0-9][0-9] [^ ]+$\|^%!PS-Adobe-[123].0\|^---* \(Start of \)?[Ff]orwarded [Mm]essage\|\(--+\|_+\|\(/\w\|\(\(\w\|[-_]\)+[.:@]\)\)\(\w\|[-_]\)*\([.:/@]+\(\w\|[-_~=?&]\)+\)+\) ispell-region: Search for first region to skip after (ispell-begin-skip-region-regexp) ispell-region: Continue spell-checking with hunspell and default dictionary... ispell-region: string pos (1->24), eol: 24, [in-comment]: [nil], [add-comment]: [nil], [string]: [^\documentclass{article} ] ispell-region: string pos (24->24), eol: 41, [in-comment]: [nil], [add-comment]: [nil], [string]: [nil] ispell-region: string pos (25->41), eol: 41, [in-comment]: [nil], [add-comment]: [nil], [string]: [^\begin{document} ] ispell-region: string pos (41->41), eol: 45, [in-comment]: [nil], [add-comment]: [nil], [string]: [nil] ispell-region: string pos (42->45), eol: 45, [in-comment]: [nil], [add-comment]: [nil], [string]: [^bla ] ispell-region: string pos (45->45), eol: 60, [in-comment]: [nil], [add-comment]: [nil], [string]: [nil] ispell-region: string pos (46->60), eol: 60, [in-comment]: [nil], [add-comment]: [nil], [string]: [^\end{document} ] ispell-region: string pos (60->60), eol: 61, [in-comment]: [nil], [add-comment]: [nil], [string]: [nil] --=-=-=-- From debbugs-submit-bounces@debbugs.gnu.org Tue Apr 29 06:03:36 2014 Received: (at 7781) by debbugs.gnu.org; 29 Apr 2014 10:03:36 +0000 Received: from localhost ([127.0.0.1]:44462 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wf4sx-00081u-Ke for submit@debbugs.gnu.org; Tue, 29 Apr 2014 06:03:35 -0400 Received: from edison.ccupm.upm.es ([138.100.198.71]:43771) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wf4su-00081d-Da for 7781@debbugs.gnu.org; Tue, 29 Apr 2014 06:03:33 -0400 Received: from agmartin.aq.upm.es (Agmartin.aq.upm.es [138.100.41.131]) by smtp.upm.es (8.14.3/8.14.3/edison-001) with ESMTP id s3TA3Pa3016747; Tue, 29 Apr 2014 12:03:25 +0200 Received: by agmartin.aq.upm.es (Postfix, from userid 1000) id B2B1740EAA; Tue, 29 Apr 2014 12:03:25 +0200 (CEST) Date: Tue, 29 Apr 2014 12:03:25 +0200 From: Agustin Martin To: Peter =?utf-8?Q?M=C3=BCnster?= , 7781@debbugs.gnu.org Subject: Re: bug#7781: hunspell and latex-mode Message-ID: <20140429100325.GA13604@agmartin.aq.upm.es> References: <87sjx9fula.fsf@sc3d.org> <87y4yqh34e.fsf@micropit.couberia.selfip.net> <83eh0ho482.fsf@gnu.org> <87mwf5fmwt.fsf@micropit.couberia.selfip.net> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <87mwf5fmwt.fsf@micropit.couberia.selfip.net> User-Agent: Mutt/1.5.23 (2014-03-12) X-Spam-Score: -3.0 (---) X-Debbugs-Envelope-To: 7781 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.0 (---) On Mon, Apr 28, 2014 at 06:18:10PM +0200, Peter Münster wrote: > On Mon, Apr 28 2014, Eli Zaretskii wrote: > > > I cannot reproduce this. If I start "emacs -Q" and try spell-checking > > your example (with Hunspell being the speller), it works just fine for > > me: I get suggestions to replace "bla". Same thing if I load AUCTeX > > into "emacs -Q" (does AUCTeX even change anything about > > spell-checking?). > > Hi Eli, > > It's not AUCTeX, I've just tested with normal latex-mode. > > > > If "emacs -Q" doesn't work either, please provide a detailed > > reproduction recipe starting from "emacs -Q". > > Here a reproduction recipe: > > - create minimal latex file /tmp/test.tex > - start emacs: > LANG=C emacs -Q --eval '(setq ispell-program-name "hunspell")' /tmp/test.tex > - M-x ispell > > Here are more details about my system: > > In GNU Emacs 24.4.50.2 (x86_64-suse-linux-gnu, GTK+ Version 3.10.4) > of 2014-04-20 on micropit Cannot reproduce it here with emacs-snapshot 24.3.50.1 in Debian. What does 'ps -aux' show for hunspell call when run in xterm? -- Agustin From debbugs-submit-bounces@debbugs.gnu.org Tue Apr 29 06:13:15 2014 Received: (at 7781) by debbugs.gnu.org; 29 Apr 2014 10:13:15 +0000 Received: from localhost ([127.0.0.1]:44466 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wf52I-0008Jw-Pc for submit@debbugs.gnu.org; Tue, 29 Apr 2014 06:13:15 -0400 Received: from smtp07.smtpout.orange.fr ([80.12.242.129]:21856 helo=smtp.smtpout.orange.fr) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wf52G-0008Je-4x for 7781@debbugs.gnu.org; Tue, 29 Apr 2014 06:13:13 -0400 Received: from micropit.roche-blanche.homenet.org ([2.14.79.144]) by mwinf5d14 with ME id vmD51n00436onnE03mD5CL; Tue, 29 Apr 2014 12:13:05 +0200 X-ME-Helo: micropit.roche-blanche.homenet.org X-ME-Auth: ZHJwbUB3YW5hZG9vLmZy X-ME-Date: Tue, 29 Apr 2014 12:13:05 +0200 X-ME-IP: 2.14.79.144 Received: by micropit.roche-blanche.homenet.org (Postfix, from userid 1000) id D3921A60CF0; Tue, 29 Apr 2014 12:13:04 +0200 (CEST) From: =?utf-8?Q?Peter_M=C3=BCnster?= To: Agustin Martin Subject: Re: bug#7781: hunspell and latex-mode References: <87sjx9fula.fsf@sc3d.org> <87y4yqh34e.fsf@micropit.couberia.selfip.net> <83eh0ho482.fsf@gnu.org> <87mwf5fmwt.fsf@micropit.couberia.selfip.net> <20140429100325.GA13604@agmartin.aq.upm.es> Date: Tue, 29 Apr 2014 12:13:04 +0200 Message-ID: <87y4yoa1fz.fsf@micropit.roche-blanche.homenet.org> User-Agent: Gnus/5.13001 (=?utf-8?Q?=E7=9C=9F?= Gnus v0.10) Emacs/24.4.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 0.0 (/) On Tue, Apr 29 2014, Agustin Martin wrote: > Cannot reproduce it here with emacs-snapshot 24.3.50.1 in Debian. What does > 'ps -aux' show for hunspell call when run in xterm? hunspell -a -d en_US -i UTF-8 -- Peter From debbugs-submit-bounces@debbugs.gnu.org Tue Apr 29 06:20:59 2014 Received: (at 7781) by debbugs.gnu.org; 29 Apr 2014 10:20:59 +0000 Received: from localhost ([127.0.0.1]:44470 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wf59m-00006r-QD for submit@debbugs.gnu.org; Tue, 29 Apr 2014 06:20:59 -0400 Received: from smtp07.smtpout.orange.fr ([80.12.242.129]:39935 helo=smtp.smtpout.orange.fr) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wf59k-00006c-Cz for 7781@debbugs.gnu.org; Tue, 29 Apr 2014 06:20:57 -0400 Received: from micropit.roche-blanche.homenet.org ([2.14.79.144]) by mwinf5d14 with ME id vmLq1n00736onnE03mLq6D; Tue, 29 Apr 2014 12:20:50 +0200 X-ME-Helo: micropit.roche-blanche.homenet.org X-ME-Auth: ZHJwbUB3YW5hZG9vLmZy X-ME-Date: Tue, 29 Apr 2014 12:20:50 +0200 X-ME-IP: 2.14.79.144 Received: by micropit.roche-blanche.homenet.org (Postfix, from userid 1000) id 28B4BA60CF0; Tue, 29 Apr 2014 12:20:50 +0200 (CEST) From: =?utf-8?Q?Peter_M=C3=BCnster?= To: Agustin Martin Subject: Re: bug#7781: hunspell and latex-mode References: <87sjx9fula.fsf@sc3d.org> <87y4yqh34e.fsf@micropit.couberia.selfip.net> <83eh0ho482.fsf@gnu.org> <87mwf5fmwt.fsf@micropit.couberia.selfip.net> <20140429100325.GA13604@agmartin.aq.upm.es> Date: Tue, 29 Apr 2014 12:20:50 +0200 In-Reply-To: <20140429100325.GA13604@agmartin.aq.upm.es> (Agustin Martin's message of "Tue, 29 Apr 2014 12:03:25 +0200") Message-ID: <87r44ga131.fsf@micropit.roche-blanche.homenet.org> User-Agent: Gnus/5.13001 (=?utf-8?Q?=E7=9C=9F?= Gnus v0.10) Emacs/24.4.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 0.0 (/) On Tue, Apr 29 2014, Agustin Martin wrote: > Cannot reproduce it here with emacs-snapshot 24.3.50.1 in Debian. Could you please send the ispell-debug buffer, created with `ispell-buffer-with-debug'? Then we could compare it with mine. There are perhaps differences. -- Peter From debbugs-submit-bounces@debbugs.gnu.org Tue Apr 29 06:22:07 2014 Received: (at 7781) by debbugs.gnu.org; 29 Apr 2014 10:22:07 +0000 Received: from localhost ([127.0.0.1]:44474 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wf5As-00009e-Ld for submit@debbugs.gnu.org; Tue, 29 Apr 2014 06:22:07 -0400 Received: from fibonacci.ccupm.upm.es ([138.100.198.70]:56301) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wf5Aq-000093-De for 7781@debbugs.gnu.org; Tue, 29 Apr 2014 06:22:05 -0400 Received: from agmartin.aq.upm.es (Agmartin.aq.upm.es [138.100.41.131]) by smtp.upm.es (8.14.3/8.14.3/fibonacci-001) with ESMTP id s3TALvta023436; Tue, 29 Apr 2014 12:21:57 +0200 Received: by agmartin.aq.upm.es (Postfix, from userid 1000) id 7BDAD4082D; Tue, 29 Apr 2014 12:21:57 +0200 (CEST) Date: Tue, 29 Apr 2014 12:21:57 +0200 From: Agustin Martin To: Peter =?utf-8?Q?M=C3=BCnster?= , 7781@debbugs.gnu.org Subject: Re: bug#7781: hunspell and latex-mode Message-ID: <20140429102157.GA15334@agmartin.aq.upm.es> References: <87sjx9fula.fsf@sc3d.org> <87y4yqh34e.fsf@micropit.couberia.selfip.net> <83eh0ho482.fsf@gnu.org> <87mwf5fmwt.fsf@micropit.couberia.selfip.net> <20140429100325.GA13604@agmartin.aq.upm.es> <87y4yoa1fz.fsf@micropit.roche-blanche.homenet.org> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <87y4yoa1fz.fsf@micropit.roche-blanche.homenet.org> User-Agent: Mutt/1.5.23 (2014-03-12) X-Spam-Score: -3.0 (---) X-Debbugs-Envelope-To: 7781 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.0 (---) On Tue, Apr 29, 2014 at 12:13:04PM +0200, Peter Münster wrote: > On Tue, Apr 29 2014, Agustin Martin wrote: > > > Cannot reproduce it here with emacs-snapshot 24.3.50.1 in Debian. What does > > 'ps -aux' show for hunspell call when run in xterm? > > hunspell -a -d en_US -i UTF-8 That is what is expected. I am clueless about this. -- Agustin From debbugs-submit-bounces@debbugs.gnu.org Tue Apr 29 06:40:08 2014 Received: (at 7781) by debbugs.gnu.org; 29 Apr 2014 10:40:08 +0000 Received: from localhost ([127.0.0.1]:44482 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wf5SI-000128-JN for submit@debbugs.gnu.org; Tue, 29 Apr 2014 06:40:07 -0400 Received: from edison.ccupm.upm.es ([138.100.198.71]:42382) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wf5S8-00011H-7d for 7781@debbugs.gnu.org; Tue, 29 Apr 2014 06:39:59 -0400 Received: from agmartin.aq.upm.es (Agmartin.aq.upm.es [138.100.41.131]) by smtp.upm.es (8.14.3/8.14.3/edison-001) with ESMTP id s3TAdneR026114; Tue, 29 Apr 2014 12:39:49 +0200 Received: by agmartin.aq.upm.es (Postfix, from userid 1000) id E0CF44082D; Tue, 29 Apr 2014 12:39:49 +0200 (CEST) Date: Tue, 29 Apr 2014 12:39:49 +0200 From: Agustin Martin To: Peter =?utf-8?Q?M=C3=BCnster?= , 7781@debbugs.gnu.org Subject: Re: bug#7781: hunspell and latex-mode Message-ID: <20140429103949.GA28561@agmartin.aq.upm.es> References: <87sjx9fula.fsf@sc3d.org> <87y4yqh34e.fsf@micropit.couberia.selfip.net> <83eh0ho482.fsf@gnu.org> <87mwf5fmwt.fsf@micropit.couberia.selfip.net> <20140429100325.GA13604@agmartin.aq.upm.es> <87r44ga131.fsf@micropit.roche-blanche.homenet.org> MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="3MwIy2ne0vdjdPXF" Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <87r44ga131.fsf@micropit.roche-blanche.homenet.org> User-Agent: Mutt/1.5.23 (2014-03-12) X-Spam-Score: -3.0 (---) X-Debbugs-Envelope-To: 7781 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.0 (---) --3MwIy2ne0vdjdPXF Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit On Tue, Apr 29, 2014 at 12:20:50PM +0200, Peter Münster wrote: > On Tue, Apr 29 2014, Agustin Martin wrote: > > > Cannot reproduce it here with emacs-snapshot 24.3.50.1 in Debian. > > Could you please send the ispell-debug buffer, created with > `ispell-buffer-with-debug'? Then we could compare it with mine. There > are perhaps differences. Please find it attached. Apart from the misalignment problem the only difference seems to be that I have lots of dicts installed and the ~/.openoffice.org/ path. -- Agustin --3MwIy2ne0vdjdPXF Content-Type: text/plain; charset=us-ascii Content-Disposition: attachment; filename="ispell-debug-buffer-amd-7781.txt" -- ispell-fhd: Skipping entry: SEARCH PATH: -- ispell-fhd: Skipping entry: .::/usr/share/hunspell:/usr/share/myspell:/usr/share/myspell/dicts:/Library/Spelling:/home/amd/.openoffice.org/3/user/wordbook:.openoffice.org2/user/wordbook:.openoffice.org2.0/user/wordbook:Library/Spelling:/opt/openoffice.org/basis3.0/share/dict/ooo:/usr/lib/openoffice.org/basis3.0/share/dict/ooo:/opt/openoffice.org2.4/share/dict/ooo:/usr/lib/openoffice.org2.4/share/dict/ooo:/opt/openoffice.org2.3/share/dict/ooo:/usr/lib/openoffice.org2.3/share/dict/ooo:/opt/openoffice.org2.2/share/dict/ooo:/usr/lib/openoffice.org2.2/share/dict/ooo:/opt/openoffice.org2.1/share/dict/ooo:/usr/lib/openoffice.org2.1/share/dict/ooo:/opt/openoffice.org2.0/share/dict/ooo:/usr/lib/openoffice.org2.0/share/dict/ooo -- ispell-fhd: Skipping entry: AVAILABLE DICTIONARIES (path is not mandatory for -d option): ++ ispell-fhd: dict-entry:/usr/share/hunspell/es_NI name:es_NI basename:es_NI affix-file:/usr/share/hunspell/es_NI.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/es_PA name:es_PA basename:es_PA affix-file:/usr/share/hunspell/es_PA.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ar_EG name:ar_EG basename:ar_EG affix-file:/usr/share/hunspell/ar_EG.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/eu_ES name:eu_ES basename:eu_ES affix-file:/usr/share/hunspell/eu_ES.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/cs name:cs basename:cs affix-file:/usr/share/hunspell/cs.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ca_ES-valencia name:ca_ES-valencia basename:ca_ES-valencia affix-file:/usr/share/hunspell/ca_ES-valencia.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/sk_SK name:sk_SK basename:sk_SK affix-file:/usr/share/hunspell/sk_SK.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ar_TN name:ar_TN basename:ar_TN affix-file:/usr/share/hunspell/ar_TN.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ar_YE name:ar_YE basename:ar_YE affix-file:/usr/share/hunspell/ar_YE.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/es_BO name:es_BO basename:es_BO affix-file:/usr/share/hunspell/es_BO.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/en_US name:en_US basename:en_US affix-file:/usr/share/hunspell/en_US.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/es_GT name:es_GT basename:es_GT affix-file:/usr/share/hunspell/es_GT.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/eo name:eo basename:eo affix-file:/usr/share/hunspell/eo.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/es_AR name:es_AR basename:es_AR affix-file:/usr/share/hunspell/es_AR.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ar_LB name:ar_LB basename:ar_LB affix-file:/usr/share/hunspell/ar_LB.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/sl name:sl basename:sl affix-file:/usr/share/hunspell/sl.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ar_BH name:ar_BH basename:ar_BH affix-file:/usr/share/hunspell/ar_BH.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/pt_PT name:pt_PT basename:pt_PT affix-file:/usr/share/hunspell/pt_PT.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/hr_HR name:hr_HR basename:hr_HR affix-file:/usr/share/hunspell/hr_HR.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/lt name:lt basename:lt affix-file:/usr/share/hunspell/lt.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/hu_HU_u8 name:hu_HU_u8 basename:hu_HU_u8 affix-file:/usr/share/hunspell/hu_HU_u8.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/es_CR name:es_CR basename:es_CR affix-file:/usr/share/hunspell/es_CR.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/sv name:sv basename:sv affix-file:/usr/share/hunspell/sv.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ar_DZ name:ar_DZ basename:ar_DZ affix-file:/usr/share/hunspell/ar_DZ.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/de_LI name:de_LI basename:de_LI affix-file:/usr/share/hunspell/de_LI.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ar_OM name:ar_OM basename:ar_OM affix-file:/usr/share/hunspell/ar_OM.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ar_JO name:ar_JO basename:ar_JO affix-file:/usr/share/hunspell/ar_JO.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ca_ES name:ca_ES basename:ca_ES affix-file:/usr/share/hunspell/ca_ES.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/es_HN name:es_HN basename:es_HN affix-file:/usr/share/hunspell/es_HN.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/et_EE name:et_EE basename:et_EE affix-file:/usr/share/hunspell/et_EE.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/es_ES name:es_ES basename:es_ES affix-file:/usr/share/hunspell/es_ES.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/de_DE name:de_DE basename:de_DE affix-file:/usr/share/hunspell/de_DE.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/gl_ES name:gl_ES basename:gl_ES affix-file:/usr/share/hunspell/gl_ES.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/es_PR name:es_PR basename:es_PR affix-file:/usr/share/hunspell/es_PR.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ar_SA name:ar_SA basename:ar_SA affix-file:/usr/share/hunspell/ar_SA.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/es_VE name:es_VE basename:es_VE affix-file:/usr/share/hunspell/es_VE.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/de_AT name:de_AT basename:de_AT affix-file:/usr/share/hunspell/de_AT.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/cs_CZ name:cs_CZ basename:cs_CZ affix-file:/usr/share/hunspell/cs_CZ.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/castellano name:castellano basename:castellano affix-file:/usr/share/hunspell/castellano.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ru_RU name:ru_RU basename:ru_RU affix-file:/usr/share/hunspell/ru_RU.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/en_GB name:en_GB basename:en_GB affix-file:/usr/share/hunspell/en_GB.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/es_PY name:es_PY basename:es_PY affix-file:/usr/share/hunspell/es_PY.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/es_UY name:es_UY basename:es_UY affix-file:/usr/share/hunspell/es_UY.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/es_CL name:es_CL basename:es_CL affix-file:/usr/share/hunspell/es_CL.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/eo_XX name:eo_XX basename:eo_XX affix-file:/usr/share/hunspell/eo_XX.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/lv_LV name:lv_LV basename:lv_LV affix-file:/usr/share/hunspell/lv_LV.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/hr name:hr basename:hr affix-file:/usr/share/hunspell/hr.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ar_LY name:ar_LY basename:ar_LY affix-file:/usr/share/hunspell/ar_LY.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/es_DO name:es_DO basename:es_DO affix-file:/usr/share/hunspell/es_DO.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/de_LU name:de_LU basename:de_LU affix-file:/usr/share/hunspell/de_LU.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ar_SY name:ar_SY basename:ar_SY affix-file:/usr/share/hunspell/ar_SY.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/fo name:fo basename:fo affix-file:/usr/share/hunspell/fo.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/lt_LT name:lt_LT basename:lt_LT affix-file:/usr/share/hunspell/lt_LT.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ar_IQ name:ar_IQ basename:ar_IQ affix-file:/usr/share/hunspell/ar_IQ.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/tl name:tl basename:tl affix-file:/usr/share/hunspell/tl.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/sl_SI name:sl_SI basename:sl_SI affix-file:/usr/share/hunspell/sl_SI.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/pl_PL name:pl_PL basename:pl_PL affix-file:/usr/share/hunspell/pl_PL.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ar name:ar basename:ar affix-file:/usr/share/hunspell/ar.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/tl_PH name:tl_PH basename:tl_PH affix-file:/usr/share/hunspell/tl_PH.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/pl name:pl basename:pl affix-file:/usr/share/hunspell/pl.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ar_IN name:ar_IN basename:ar_IN affix-file:/usr/share/hunspell/ar_IN.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ar_MA name:ar_MA basename:ar_MA affix-file:/usr/share/hunspell/ar_MA.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ar_AE name:ar_AE basename:ar_AE affix-file:/usr/share/hunspell/ar_AE.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ar_QA name:ar_QA basename:ar_QA affix-file:/usr/share/hunspell/ar_QA.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/es_CO name:es_CO basename:es_CO affix-file:/usr/share/hunspell/es_CO.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/es_PE name:es_PE basename:es_PE affix-file:/usr/share/hunspell/es_PE.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/uz_UZ name:uz_UZ basename:uz_UZ affix-file:/usr/share/hunspell/uz_UZ.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/de_CH name:de_CH basename:de_CH affix-file:/usr/share/hunspell/de_CH.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/fo_FO name:fo_FO basename:fo_FO affix-file:/usr/share/hunspell/fo_FO.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/es_EC name:es_EC basename:es_EC affix-file:/usr/share/hunspell/es_EC.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/sv_SE name:sv_SE basename:sv_SE affix-file:/usr/share/hunspell/sv_SE.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/pt name:pt basename:pt affix-file:/usr/share/hunspell/pt.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/es_MX name:es_MX basename:es_MX affix-file:/usr/share/hunspell/es_MX.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/en_AU name:en_AU basename:en_AU affix-file:/usr/share/hunspell/en_AU.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/pt_BR name:pt_BR basename:pt_BR affix-file:/usr/share/hunspell/pt_BR.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/de_BE name:de_BE basename:de_BE affix-file:/usr/share/hunspell/de_BE.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/es name:es basename:es affix-file:/usr/share/hunspell/es.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ar_KW name:ar_KW basename:ar_KW affix-file:/usr/share/hunspell/ar_KW.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/sk name:sk basename:sk affix-file:/usr/share/hunspell/sk.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/es_CU name:es_CU basename:es_CU affix-file:/usr/share/hunspell/es_CU.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/es_SV name:es_SV basename:es_SV affix-file:/usr/share/hunspell/es_SV.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/ar_SD name:ar_SD basename:ar_SD affix-file:/usr/share/hunspell/ar_SD.aff ++ ispell-fhd: dict-entry:/usr/share/hunspell/hu_HU name:hu_HU basename:hu_HU affix-file:/usr/share/hunspell/hu_HU.aff -- ispell-fhd: Skipping entry: /usr/share/myspell/dicts/cs ++ ispell-fhd: dict-entry:/usr/share/myspell/dicts/de-BE name:de-BE basename:de-BE affix-file:/usr/share/myspell/dicts/de-BE.aff ++ ispell-fhd: dict-entry:/usr/share/myspell/dicts/de-LI name:de-LI basename:de-LI affix-file:/usr/share/myspell/dicts/de-LI.aff -- ispell-fhd: Skipping entry: /usr/share/myspell/dicts/lt -- ispell-fhd: Skipping entry: /usr/share/myspell/dicts/de_LI ++ ispell-fhd: dict-entry:/usr/share/myspell/dicts/de-DE name:de-DE basename:de-DE affix-file:/usr/share/myspell/dicts/de-DE.aff -- ispell-fhd: Skipping entry: /usr/share/myspell/dicts/de_DE -- ispell-fhd: Skipping entry: /usr/share/myspell/dicts/de_AT ++ ispell-fhd: dict-entry:/usr/share/myspell/dicts/en-GB name:en-GB basename:en-GB affix-file:/usr/share/myspell/dicts/en-GB.aff -- ispell-fhd: Skipping entry: /usr/share/myspell/dicts/cs_CZ -- ispell-fhd: Skipping entry: /usr/share/myspell/dicts/en_GB ++ ispell-fhd: dict-entry:/usr/share/myspell/dicts/de-CH name:de-CH basename:de-CH affix-file:/usr/share/myspell/dicts/de-CH.aff -- ispell-fhd: Skipping entry: /usr/share/myspell/dicts/lv_LV -- ispell-fhd: Skipping entry: /usr/share/myspell/dicts/de_LU -- ispell-fhd: Skipping entry: /usr/share/myspell/dicts/lt_LT ++ ispell-fhd: dict-entry:/usr/share/myspell/dicts/lv-LV name:lv-LV basename:lv-LV affix-file:/usr/share/myspell/dicts/lv-LV.aff -- ispell-fhd: Skipping entry: /usr/share/myspell/dicts/ar -- ispell-fhd: Skipping entry: /usr/share/myspell/dicts/hyph_lv_LV -- ispell-fhd: Skipping entry: /usr/share/myspell/dicts/hyph_lt_LT -- ispell-fhd: Skipping entry: /usr/share/myspell/dicts/de_CH -- ispell-fhd: Skipping entry: /usr/share/myspell/dicts/de_BE ++ ispell-fhd: dict-entry:/usr/share/myspell/dicts/de-LU name:de-LU basename:de-LU affix-file:/usr/share/myspell/dicts/de-LU.aff ++ ispell-fhd: dict-entry:/usr/share/myspell/dicts/de-AT name:de-AT basename:de-AT affix-file:/usr/share/myspell/dicts/de-AT.aff -- ispell-fhd: Skipping entry: LOADED DICTIONARY: -- ispell-fhd: Skipping entry: /usr/share/hunspell/en_US.dic -- ispell-fhd: Skipping entry: Hunspell 1.3.2 -- ispell-fhd: Excluding castellano alias. Standalone dict found. ++ ispell-fhd: Adding alias svenska -> /usr/share/hunspell/sv_SE.aff. ++ ispell-fhd: Adding alias slovenian -> /usr/share/hunspell/sl_SI.aff. ++ ispell-fhd: Adding alias slovak -> /usr/share/hunspell/sk_SK.aff. ++ ispell-fhd: Adding alias russianw -> /usr/share/hunspell/ru_RU.aff. ++ ispell-fhd: Adding alias russian -> /usr/share/hunspell/ru_RU.aff. ++ ispell-fhd: Adding alias portugues -> /usr/share/hunspell/pt_PT.aff. ++ ispell-fhd: Adding alias polish -> /usr/share/hunspell/pl_PL.aff. ++ ispell-fhd: Adding alias german8 -> /usr/share/hunspell/de_DE.aff. ++ ispell-fhd: Adding alias german -> /usr/share/hunspell/de_DE.aff. ++ ispell-fhd: Adding alias esperanto -> /usr/share/hunspell/eo.aff. ++ ispell-fhd: Adding alias english -> /usr/share/hunspell/en_US.aff. ++ ispell-fhd: Adding alias deutsch8 -> /usr/share/hunspell/de_DE.aff. ++ ispell-fhd: Adding alias deutsch -> /usr/share/hunspell/de_DE.aff. ++ ispell-fhd: Adding alias czech -> /usr/share/hunspell/cs_CZ.aff. ++ ispell-fhd: Adding alias castellano8 -> /usr/share/hunspell/es_ES.aff. ++ ispell-fhd: Adding alias british -> /usr/share/hunspell/en_GB.aff. ++ ispell-fhd: Adding alias brasileiro -> /usr/share/hunspell/pt_BR.aff. ++ ispell-fhd: Adding alias american -> /usr/share/hunspell/en_US.aff. ispell-region: (ispell-skip-region-list): ((\\addcontentsline ispell-tex-arg-end 2) (\\add\(tocontents\|vspace\) ispell-tex-arg-end) (\\\([aA]lph\|arabic\) ispell-tex-arg-end) (\\bibliographystyle ispell-tex-arg-end) (\\makebox ispell-tex-arg-end 0) (\\e?psfig ispell-tex-arg-end) (\\document\(class\|style\) . \\begin[ ]*{[ ]*document[ ]*}) (\(figure\|table\)\*? ispell-tex-arg-end 0) (list ispell-tex-arg-end 2) (program . \\end[ ]*{[ ]*program[ ]*}) (verbatim\*? . \\end[ ]*{[ ]*verbatim\*?[ ]*}) (ispell-words-keyword forward-line) (ispell-dictionary-keyword forward-line) (ispell-pdict-keyword forward-line) (ispell-parsing-keyword forward-line) (^---*BEGIN PGP [A-Z ]*--* . ^---*END PGP [A-Z ]*--*) (^begin [0-9][0-9][0-9] [^ ]+$ . end ) (^%!PS-Adobe-[123].0 . %%EOF ) (^---* \(Start of \)?[Ff]orwarded [Mm]essage . ^---* End of [Ff]orwarded [Mm]essage) (\(--+\|_+\|\(/\w\|\(\(\w\|[-_]\)+[.:@]\)\)\(\w\|[-_]\)*\([.:/@]+\(\w\|[-_~=?&]\)+\)+\))) ispell-region: (ispell-begin-skip-region-regexp): \\addcontentsline\|\\add\(tocontents\|vspace\)\|\\\([aA]lph\|arabic\)\|\\bibliographystyle\|\\makebox\|\\e?psfig\|\\document\(class\|style\)\|\\begin[ ]*{[ ]*\(figure\|table\)\*?[ ]*}\|\\begin[ ]*{[ ]*list[ ]*}\|\\begin[ ]*{[ ]*program[ ]*}\|\\begin[ ]*{[ ]*verbatim\*?[ ]*}\|LocalWords: \|Local IspellDict: \|Local IspellPersDict: \|Local IspellParsing: \|^---*BEGIN PGP [A-Z ]*--*\|^begin [0-9][0-9][0-9] [^ ]+$\|^%!PS-Adobe-[123].0\|^---* \(Start of \)?[Ff]orwarded [Mm]essage\|\(--+\|_+\|\(/\w\|\(\(\w\|[-_]\)+[.:@]\)\)\(\w\|[-_]\)*\([.:/@]+\(\w\|[-_~=?&]\)+\)+\) ispell-region: Search for first region to skip after (ispell-begin-skip-region-regexp) ispell-region: First skip: \documentclass at (pos,line,column): (1,1,0). ispell-region: Continue spell-checking with hunspell and default dictionary... ispell-region: string pos (41->41), eol: 45, [in-comment]: [nil], [add-comment]: [nil], [string]: [nil] ispell-region: string pos (42->45), eol: 45, [in-comment]: [nil], [add-comment]: [nil], [string]: [^bla ] ispell-region: string pos (45->45), eol: 60, [in-comment]: [nil], [add-comment]: [nil], [string]: [nil] ispell-region: string pos (46->60), eol: 60, [in-comment]: [nil], [add-comment]: [nil], [string]: [^\end{document} ] ispell-region: string pos (60->60), eol: 61, [in-comment]: [nil], [add-comment]: [nil], [string]: [nil] --3MwIy2ne0vdjdPXF-- From debbugs-submit-bounces@debbugs.gnu.org Tue Apr 29 07:55:05 2014 Received: (at 7781) by debbugs.gnu.org; 29 Apr 2014 11:55:05 +0000 Received: from localhost ([127.0.0.1]:44519 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wf6cq-0004dJ-Di for submit@debbugs.gnu.org; Tue, 29 Apr 2014 07:55:05 -0400 Received: from smtp07.smtpout.orange.fr ([80.12.242.129]:56571 helo=smtp.smtpout.orange.fr) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wf6cn-0004cb-9s for 7781@debbugs.gnu.org; Tue, 29 Apr 2014 07:55:02 -0400 Received: from micropit.roche-blanche.homenet.org ([2.14.79.144]) by mwinf5d66 with ME id vnuu1n00536onnE03nuuyR; Tue, 29 Apr 2014 13:54:55 +0200 X-ME-Helo: micropit.roche-blanche.homenet.org X-ME-Auth: ZHJwbUB3YW5hZG9vLmZy X-ME-Date: Tue, 29 Apr 2014 13:54:55 +0200 X-ME-IP: 2.14.79.144 Received: by micropit.roche-blanche.homenet.org (Postfix, from userid 1000) id 140E7A60CF0; Tue, 29 Apr 2014 13:54:54 +0200 (CEST) From: =?utf-8?Q?Peter_M=C3=BCnster?= To: Agustin Martin Subject: Re: bug#7781: hunspell and latex-mode References: <87sjx9fula.fsf@sc3d.org> <87y4yqh34e.fsf@micropit.couberia.selfip.net> <83eh0ho482.fsf@gnu.org> <87mwf5fmwt.fsf@micropit.couberia.selfip.net> <20140429100325.GA13604@agmartin.aq.upm.es> <87r44ga131.fsf@micropit.roche-blanche.homenet.org> <20140429103949.GA28561@agmartin.aq.upm.es> Date: Tue, 29 Apr 2014 13:54:54 +0200 Message-ID: <87fvkw9wq9.fsf@micropit.roche-blanche.homenet.org> User-Agent: Gnus/5.13001 (=?utf-8?Q?=E7=9C=9F?= Gnus v0.10) Emacs/24.4.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 0.0 (/) On Tue, Apr 29 2014, Agustin Martin wrote: > Please find it attached. Apart from the misalignment problem the only > difference seems to be that I have lots of dicts installed and the > ~/.openoffice.org/ path. There is probably not enough information in the debug buffer. Could you please try this: mv /usr/bin/hunspell /usr/bin/hunspell-orig And create the file /usr/bin/hunspell with the following content: --8<---------------cut here---------------start------------->8--- #!/bin/bash tee /tmp/hunspell-input | hunspell-orig "$@" | tee /tmp/hunspell-output --8<---------------cut here---------------end--------------->8--- This is what I get: input: --8<---------------cut here---------------start------------->8--- ! + ^bla --8<---------------cut here---------------end--------------->8--- output: --8<---------------cut here---------------start------------->8--- @(#) International Ispell Version 3.2.06 (but really Hunspell 1.3.2) & bla 15 0: alb, bl, la, blat, bola, blag, blah, blab, lab, baa, bra, boa, Ila, Ala, Ola --8<---------------cut here---------------end--------------->8--- I guess, that you get "bla 15 1", because of the "^" before the "bla". That would mean, that my hunspell would need another patch. Which one please? Thanks for your efforts, -- Peter From debbugs-submit-bounces@debbugs.gnu.org Tue Apr 29 08:48:54 2014 Received: (at 7781) by debbugs.gnu.org; 29 Apr 2014 12:48:55 +0000 Received: from localhost ([127.0.0.1]:44634 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wf7Sw-0006H9-91 for submit@debbugs.gnu.org; Tue, 29 Apr 2014 08:48:54 -0400 Received: from smtp07.smtpout.orange.fr ([80.12.242.129]:43117 helo=smtp.smtpout.orange.fr) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wf7Ss-0006Gl-TU for 7781@debbugs.gnu.org; Tue, 29 Apr 2014 08:48:52 -0400 Received: from micropit.roche-blanche.homenet.org ([2.14.79.144]) by mwinf5d66 with ME id vook1n00836onnE03ookDF; Tue, 29 Apr 2014 14:48:44 +0200 X-ME-Helo: micropit.roche-blanche.homenet.org X-ME-Auth: ZHJwbUB3YW5hZG9vLmZy X-ME-Date: Tue, 29 Apr 2014 14:48:44 +0200 X-ME-IP: 2.14.79.144 Received: by micropit.roche-blanche.homenet.org (Postfix, from userid 1000) id CB0BCA60DD4; Tue, 29 Apr 2014 14:48:43 +0200 (CEST) From: =?utf-8?Q?Peter_M=C3=BCnster?= To: Agustin Martin Subject: Re: bug#7781: hunspell and latex-mode References: <87sjx9fula.fsf@sc3d.org> <87y4yqh34e.fsf@micropit.couberia.selfip.net> <83eh0ho482.fsf@gnu.org> <87mwf5fmwt.fsf@micropit.couberia.selfip.net> <20140429100325.GA13604@agmartin.aq.upm.es> <87r44ga131.fsf@micropit.roche-blanche.homenet.org> <20140429103949.GA28561@agmartin.aq.upm.es> <87fvkw9wq9.fsf@micropit.roche-blanche.homenet.org> Date: Tue, 29 Apr 2014 14:48:43 +0200 In-Reply-To: <87fvkw9wq9.fsf@micropit.roche-blanche.homenet.org> ("Peter =?utf-8?Q?M=C3=BCnster=22's?= message of "Tue, 29 Apr 2014 13:54:54 +0200") Message-ID: <87bnvk9u8k.fsf@micropit.roche-blanche.homenet.org> User-Agent: Gnus/5.13001 (=?utf-8?Q?=E7=9C=9F?= Gnus v0.10) Emacs/24.4.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 0.0 (/) I've just tried unpatched hunspell: no problem with TeX-mode. It's the patch on sf.net that breaks the TeX-mode, the character position is always 0: https://sourceforge.net/p/hunspell/patches/57/#d425 I'll build hunspell with Eli's patch now. Sorry for the noise... -- Peter From debbugs-submit-bounces@debbugs.gnu.org Tue Apr 29 09:57:40 2014 Received: (at 7781) by debbugs.gnu.org; 29 Apr 2014 13:57:40 +0000 Received: from localhost ([127.0.0.1]:45083 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wf8XU-0008JQ-45 for submit@debbugs.gnu.org; Tue, 29 Apr 2014 09:57:40 -0400 Received: from mtaout22.012.net.il ([80.179.55.172]:45282) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wf8XO-0008JD-VZ for 7781@debbugs.gnu.org; Tue, 29 Apr 2014 09:57:36 -0400 Received: from conversion-daemon.a-mtaout22.012.net.il by a-mtaout22.012.net.il (HyperSendmail v2007.08) id <0N4S00K00P6NT000@a-mtaout22.012.net.il> for 7781@debbugs.gnu.org; Tue, 29 Apr 2014 16:57:28 +0300 (IDT) Received: from HOME-C4E4A596F7 ([87.69.4.28]) by a-mtaout22.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0N4S00K3PPFRAZC0@a-mtaout22.012.net.il>; Tue, 29 Apr 2014 16:57:27 +0300 (IDT) Date: Tue, 29 Apr 2014 16:57:36 +0300 From: Eli Zaretskii Subject: Re: bug#7781: hunspell and latex-mode In-reply-to: <87bnvk9u8k.fsf@micropit.roche-blanche.homenet.org> X-012-Sender: halo1@inter.net.il To: Peter =?iso-8859-1?Q?M=FCnster?= Message-id: <83tx9cme5r.fsf@gnu.org> MIME-version: 1.0 Content-type: text/plain; charset=iso-8859-1 Content-transfer-encoding: 8BIT References: <87sjx9fula.fsf@sc3d.org> <87y4yqh34e.fsf@micropit.couberia.selfip.net> <83eh0ho482.fsf@gnu.org> <87mwf5fmwt.fsf@micropit.couberia.selfip.net> <20140429100325.GA13604@agmartin.aq.upm.es> <87r44ga131.fsf@micropit.roche-blanche.homenet.org> <20140429103949.GA28561@agmartin.aq.upm.es> <87fvkw9wq9.fsf@micropit.roche-blanche.homenet.org> <87bnvk9u8k.fsf@micropit.roche-blanche.homenet.org> X-Spam-Score: 1.0 (+) X-Debbugs-Envelope-To: 7781 Cc: agustin.martin@hispalinux.es, 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list Reply-To: Eli Zaretskii List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 1.0 (+) > From: Peter Münster > Date: Tue, 29 Apr 2014 14:48:43 +0200 > Cc: 7781@debbugs.gnu.org > > I've just tried unpatched hunspell: no problem with TeX-mode. > It's the patch on sf.net that breaks the TeX-mode, the character > position is always 0: > https://sourceforge.net/p/hunspell/patches/57/#d425 That's what I thought. If I invoke Hunspell like ispell.el does for a LaTeX buffer, i.e. hunspell -a -d en_US -i UTF-8 and then type "^bla RET" into Hunspell, I get this as output: & bla 15 1: alb, bl, la, bola, blah, blab, lab, baa, ala, bra, boa, Ila, Ala, Ola, Ula As you see, I get "15 1". If you get 0 instead of 1, then that's the cause of the problem, because the part of your debug output marked below: ispell-process-line: Ispell misalignment error: [Word from ispell pipe]: [bla], actual (point,line,column): (41,2,16) ^^^^^^^ clearly shows that ispell.el is confused about where the word "bla" begins in the buffer; the correct data is 42,3,0. Also note that just before reading Hunspell's output, ispell.el correctly identified both the word and its location: ispell-region: string pos (42->45), eol: 45, [in-comment]: [nil], [add-comment]: [nil], [string]: [^bla ] > I'll build hunspell with Eli's patch now. I think that will solve the problem. (I have no idea why visiting the same file in Text mode avoids the problem. The only difference is that in Text mode, ispell.el does not skip the first 2 lines, but instead submits them to Hunspell. Why this makes the difference, I don't know, but probably the lone "^bla" somehow triggers the bug in the patch you installed, whatever that bug is.) From debbugs-submit-bounces@debbugs.gnu.org Tue Apr 29 10:30:17 2014 Received: (at 7781) by debbugs.gnu.org; 29 Apr 2014 14:30:17 +0000 Received: from localhost ([127.0.0.1]:45156 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wf932-000233-Q1 for submit@debbugs.gnu.org; Tue, 29 Apr 2014 10:30:17 -0400 Received: from smtp07.smtpout.orange.fr ([80.12.242.129]:22223 helo=smtp.smtpout.orange.fr) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wf930-000221-GS for 7781@debbugs.gnu.org; Tue, 29 Apr 2014 10:30:15 -0400 Received: from micropit.roche-blanche.homenet.org ([2.14.79.144]) by mwinf5d66 with ME id vqW71n00H36onnE03qW7ev; Tue, 29 Apr 2014 16:30:08 +0200 X-ME-Helo: micropit.roche-blanche.homenet.org X-ME-Auth: ZHJwbUB3YW5hZG9vLmZy X-ME-Date: Tue, 29 Apr 2014 16:30:08 +0200 X-ME-IP: 2.14.79.144 Received: by micropit.roche-blanche.homenet.org (Postfix, from userid 1000) id 6959EA60DD4; Tue, 29 Apr 2014 16:30:07 +0200 (CEST) From: =?utf-8?Q?Peter_M=C3=BCnster?= To: Eli Zaretskii Subject: Re: bug#7781: hunspell and latex-mode References: <87sjx9fula.fsf@sc3d.org> <87y4yqh34e.fsf@micropit.couberia.selfip.net> <83eh0ho482.fsf@gnu.org> <87mwf5fmwt.fsf@micropit.couberia.selfip.net> <20140429100325.GA13604@agmartin.aq.upm.es> <87r44ga131.fsf@micropit.roche-blanche.homenet.org> <20140429103949.GA28561@agmartin.aq.upm.es> <87fvkw9wq9.fsf@micropit.roche-blanche.homenet.org> <87bnvk9u8k.fsf@micropit.roche-blanche.homenet.org> <83tx9cme5r.fsf@gnu.org> Date: Tue, 29 Apr 2014 16:30:07 +0200 Message-ID: <878uqo439s.fsf@micropit.roche-blanche.homenet.org> User-Agent: Gnus/5.13001 (=?utf-8?Q?=E7=9C=9F?= Gnus v0.10) Emacs/24.4.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 7781 Cc: agustin.martin@hispalinux.es, 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 0.0 (/) On Tue, Apr 29 2014, Eli Zaretskii wrote: > (I have no idea why visiting the same file in Text mode avoids the > problem. The only difference is that in Text mode, ispell.el does not > skip the first 2 lines, but instead submits them to Hunspell. No. In latex-mode, emacs switches hunspell into TeX-mode with the "+". > Why this makes the difference, I don't know, but probably the lone > "^bla" somehow triggers the bug in the patch you installed, whatever > that bug is.) No. In normal mode, the "^bla" works fine. The patch on sf.net just breaks the TeX-mode: every position becomes 0. Your patch works nicely, thanks! I should have tested hunspell on the command line, before reporting the problem. Now I know, how to do that. -- Peter From debbugs-submit-bounces@debbugs.gnu.org Tue Apr 29 11:25:33 2014 Received: (at 7781) by debbugs.gnu.org; 29 Apr 2014 15:25:33 +0000 Received: from localhost ([127.0.0.1]:45220 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wf9uW-0004on-O1 for submit@debbugs.gnu.org; Tue, 29 Apr 2014 11:25:33 -0400 Received: from mtaout29.012.net.il ([80.179.55.185]:51678) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Wf9uT-0004oT-MD for 7781@debbugs.gnu.org; Tue, 29 Apr 2014 11:25:31 -0400 Received: from conversion-daemon.mtaout29.012.net.il by mtaout29.012.net.il (HyperSendmail v2007.08) id <0N4S00000THEQF00@mtaout29.012.net.il> for 7781@debbugs.gnu.org; Tue, 29 Apr 2014 18:26:42 +0300 (IDT) Received: from HOME-C4E4A596F7 ([87.69.4.28]) by mtaout29.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0N4S00KBITKI5460@mtaout29.012.net.il>; Tue, 29 Apr 2014 18:26:42 +0300 (IDT) Date: Tue, 29 Apr 2014 18:25:31 +0300 From: Eli Zaretskii Subject: Re: bug#7781: hunspell and latex-mode In-reply-to: <878uqo439s.fsf@micropit.roche-blanche.homenet.org> X-012-Sender: halo1@inter.net.il To: Peter =?iso-8859-1?Q?M=FCnster?= Message-id: <83d2g0ma38.fsf@gnu.org> MIME-version: 1.0 Content-type: text/plain; charset=iso-8859-1 Content-transfer-encoding: 8BIT References: <87sjx9fula.fsf@sc3d.org> <87y4yqh34e.fsf@micropit.couberia.selfip.net> <83eh0ho482.fsf@gnu.org> <87mwf5fmwt.fsf@micropit.couberia.selfip.net> <20140429100325.GA13604@agmartin.aq.upm.es> <87r44ga131.fsf@micropit.roche-blanche.homenet.org> <20140429103949.GA28561@agmartin.aq.upm.es> <87fvkw9wq9.fsf@micropit.roche-blanche.homenet.org> <87bnvk9u8k.fsf@micropit.roche-blanche.homenet.org> <83tx9cme5r.fsf@gnu.org> <878uqo439s.fsf@micropit.roche-blanche.homenet.org> X-Spam-Score: 1.0 (+) X-Debbugs-Envelope-To: 7781 Cc: agustin.martin@hispalinux.es, 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list Reply-To: Eli Zaretskii List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 1.0 (+) > From: Peter Münster > Cc: agustin.martin@hispalinux.es, 7781@debbugs.gnu.org > Date: Tue, 29 Apr 2014 16:30:07 +0200 > > On Tue, Apr 29 2014, Eli Zaretskii wrote: > > > (I have no idea why visiting the same file in Text mode avoids the > > problem. The only difference is that in Text mode, ispell.el does not > > skip the first 2 lines, but instead submits them to Hunspell. > > No. In latex-mode, emacs switches hunspell into TeX-mode with the "+". It does both, evidently. Compare this part of your debug output (in LaTeX buffer): ispell-region: First skip: \documentclass at (pos,line,column): (1,1,0). ispell-region: Continue spell-checking with hunspell and default dictionary... ispell-region: string pos (41->41), eol: 45, [in-comment]: [nil], [add-comment]: [nil], [string]: [nil] ispell-region: string pos (42->45), eol: 45, [in-comment]: [nil], [add-comment]: [nil], [string]: [^bla ] with this (in Text buffer): ispell-region: string pos (1->24), eol: 24, [in-comment]: [nil], [add-comment]: [nil], [string]: [^\documentclass{article} ] ispell-region: string pos (24->24), eol: 41, [in-comment]: [nil], [add-comment]: [nil], [string]: [nil] ispell-region: string pos (25->41), eol: 41, [in-comment]: [nil], [add-comment]: [nil], [string]: [^\begin{document} ] ispell-region: string pos (41->41), eol: 45, [in-comment]: [nil], [add-comment]: [nil], [string]: [nil] ispell-region: string pos (42->45), eol: 45, [in-comment]: [nil], [add-comment]: [nil], [string]: [^bla ] ispell-region: string pos (45->45), eol: 60, [in-comment]: [nil], [add-comment]: [nil], [string]: [nil] ispell-region: string pos (46->60), eol: 60, [in-comment]: [nil], [add-comment]: [nil], [string]: [^\end{document} ] As you see, in the second case, the TeX directives are also sent to Hunspell for checking, while in the first case they are not. From debbugs-submit-bounces@debbugs.gnu.org Tue Apr 29 12:34:18 2014 Received: (at 7781) by debbugs.gnu.org; 29 Apr 2014 16:34:18 +0000 Received: from localhost ([127.0.0.1]:45293 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1WfAz3-0000sR-In for submit@debbugs.gnu.org; Tue, 29 Apr 2014 12:34:18 -0400 Received: from smtp08.smtpout.orange.fr ([80.12.242.130]:48372 helo=smtp.smtpout.orange.fr) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1WfAz0-0000s6-Vr for 7781@debbugs.gnu.org; Tue, 29 Apr 2014 12:34:15 -0400 Received: from micropit.roche-blanche.homenet.org ([2.14.79.144]) by mwinf5d31 with ME id vsa71n00M36onnE03sa7a9; Tue, 29 Apr 2014 18:34:08 +0200 X-ME-Helo: micropit.roche-blanche.homenet.org X-ME-Auth: ZHJwbUB3YW5hZG9vLmZy X-ME-Date: Tue, 29 Apr 2014 18:34:08 +0200 X-ME-IP: 2.14.79.144 Received: by micropit.roche-blanche.homenet.org (Postfix, from userid 1000) id 9181BA60DD4; Tue, 29 Apr 2014 18:34:07 +0200 (CEST) From: =?utf-8?Q?Peter_M=C3=BCnster?= To: Eli Zaretskii Subject: Re: bug#7781: hunspell and latex-mode References: <87sjx9fula.fsf@sc3d.org> <87y4yqh34e.fsf@micropit.couberia.selfip.net> <83eh0ho482.fsf@gnu.org> <87mwf5fmwt.fsf@micropit.couberia.selfip.net> <20140429100325.GA13604@agmartin.aq.upm.es> <87r44ga131.fsf@micropit.roche-blanche.homenet.org> <20140429103949.GA28561@agmartin.aq.upm.es> <87fvkw9wq9.fsf@micropit.roche-blanche.homenet.org> <87bnvk9u8k.fsf@micropit.roche-blanche.homenet.org> <83tx9cme5r.fsf@gnu.org> <878uqo439s.fsf@micropit.roche-blanche.homenet.org> <83d2g0ma38.fsf@gnu.org> Date: Tue, 29 Apr 2014 18:34:07 +0200 Message-ID: <87siow2iyo.fsf@micropit.roche-blanche.homenet.org> User-Agent: Gnus/5.13001 (=?utf-8?Q?=E7=9C=9F?= Gnus v0.10) Emacs/24.4.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 7781 Cc: agustin.martin@hispalinux.es, 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 0.0 (/) On Tue, Apr 29 2014, Eli Zaretskii wrote: >> > The only difference is that in Text mode, ispell.el does not skip >> > the first 2 lines, but instead submits them to Hunspell. >> >> No. In latex-mode, emacs switches hunspell into TeX-mode with the "+". > > It does both, evidently. Compare this part of your debug output (in > LaTeX buffer): Sorry. I just wanted to say: "No, it's not the *only* difference." ... ;) -- Peter From debbugs-submit-bounces@debbugs.gnu.org Thu Sep 25 05:54:10 2014 Received: (at 7781) by debbugs.gnu.org; 25 Sep 2014 09:54:10 +0000 Received: from localhost ([127.0.0.1]:51634 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1XX5kY-00009Z-3J for submit@debbugs.gnu.org; Thu, 25 Sep 2014 05:54:10 -0400 Received: from mail-la0-f49.google.com ([209.85.215.49]:43367) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1XX5kV-00009N-3O for 7781@debbugs.gnu.org; Thu, 25 Sep 2014 05:54:07 -0400 Received: by mail-la0-f49.google.com with SMTP id pn19so11986962lab.8 for <7781@debbugs.gnu.org>; Thu, 25 Sep 2014 02:54:05 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=sc3d.org; s=google; h=mime-version:date:message-id:subject:from:to:content-type; bh=Jo/PxEr+Kqhbh8yXRl9HFn77n/LAFTtPBsIPzROaQEk=; b=QG0qBxERRqrPpkwtU7ypJnWNyV7P9FUQhJStThCuu+ACjeKURsumoWM+wc4d9qs53Q +z6PbC7P6l06P/ZkZVUes3z0kwzzio+QozePtJ3zUlAgV4mVxyPN3q46fVwn3PhwBbjE yFgAMCI4UY3w7salIScaRz5JODkwnH5VA3p8M= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:date:message-id:subject:from:to :content-type; bh=Jo/PxEr+Kqhbh8yXRl9HFn77n/LAFTtPBsIPzROaQEk=; b=FoxZ0b0UoJbsVuMCa4CzLmSBzbEl1dqojzO3xbhFzDcY3fAdiNwTwIK+EldJ3Pq25z iOOCdvtRoyuInSJSt9fmoJ/iVtB68IoJdbVL+C1oDJvsUAu0RkQxfz7O5iWG8k+TlFAI t/3DvM5SfT/qBBR3eapI9fFgFOXSiWd0RBw4b3gi9ngU4fCSRTwcWkot1mJ3jiSKuHdY nQpAcVFvRxQFNwd2dhMn/UGlLYBTYzY8s3SVe8Ep4X6JOohmeCum0Itpawa3CD9238M2 UcWBJv1HUn4Vvf2ggIxFDqjYdcR9AxZmtRSxFkAkGRz9CkERyRHJBwvB8ExndwVj4cIp zL0g== X-Gm-Message-State: ALoCoQkxeSVj09l8oYgQo/or1HqHqS2resk+lePxfcpP+1xbZSIU7ih+Olt/F26nEbJxdsHLgi6U MIME-Version: 1.0 X-Received: by 10.112.125.132 with SMTP id mq4mr1649206lbb.103.1411638845728; Thu, 25 Sep 2014 02:54:05 -0700 (PDT) Received: by 10.152.36.201 with HTTP; Thu, 25 Sep 2014 02:54:05 -0700 (PDT) Date: Thu, 25 Sep 2014 10:54:05 +0100 Message-ID: Subject: Bug still present in hunspell 1.3.3; Eli's patch still works From: Reuben Thomas To: 7781@debbugs.gnu.org Content-Type: multipart/alternative; boundary=089e0122aecef381580503e0c76e X-Spam-Score: -0.7 (/) X-Debbugs-Envelope-To: 7781 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.7 (/) --089e0122aecef381580503e0c76e Content-Type: text/plain; charset=UTF-8 I have sent a message to the upstream maintainer informing him of the situation and asking for the patch to be included in the next release. -- http://rrt.sc3d.org --089e0122aecef381580503e0c76e Content-Type: text/html; charset=UTF-8
I have sent a message to the upstream maintainer informing him of the situation and asking for the patch to be included in the next release.
--089e0122aecef381580503e0c76e-- From debbugs-submit-bounces@debbugs.gnu.org Thu Oct 16 09:37:29 2014 Received: (at 7781) by debbugs.gnu.org; 16 Oct 2014 13:37:29 +0000 Received: from localhost ([127.0.0.1]:45410 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1XelFB-0005Wb-0O for submit@debbugs.gnu.org; Thu, 16 Oct 2014 09:37:29 -0400 Received: from neon-v1.ccupm.upm.es ([138.100.198.71]:48701) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1XelF8-0005WQ-0A for 7781@debbugs.gnu.org; Thu, 16 Oct 2014 09:37:27 -0400 Received: from agmartin.aq.upm.es (Agmartin.aq.upm.es [138.100.41.131]) by smtp.upm.es (8.14.4/8.14.4/neon-v1-002) with ESMTP id s9GDbOoW019076; Thu, 16 Oct 2014 15:37:24 +0200 Received: by agmartin.aq.upm.es (Postfix, from userid 1000) id 75AA540FDA; Thu, 16 Oct 2014 15:37:24 +0200 (CEST) Date: Thu, 16 Oct 2014 15:37:24 +0200 From: Agustin Martin To: Reuben Thomas , 7781@debbugs.gnu.org Subject: Re: bug#7781: 23.2.91; ispell problem with hunspell and UTF-8 file Message-ID: <20141016133724.GA21044@agmartin.aq.upm.es> References: <87sjx9fula.fsf@sc3d.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.23 (2014-03-12) X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: 7781 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -2.3 (--) Control: tag 7781 + upstream fixed-upstream On Fri, Feb 11, 2011 at 06:00:53PM +0100, Agustin Martin wrote: > forwarded 7781 https://sourceforge.net/tracker/?func=detail&aid=3178449&group_id=143754&atid=756395 > thanks > > 2011/1/7 Agustin Martin : > > 2011/1/4 Reuben Thomas : > >> With the following text, and using emacs -Q, I get the errors you can > >> see in the messages log below when using hunspell to spell-check a UTF-8 > >> buffer with some extended characters in it. > > > Do not worry about first number, is the number of suggestions. However > > position in second number differ. Seems that hunspell is not > > considering that apostrophe as a single (multibyte) char when > > counting, but as three components > > > > Looks to me an hunspell bug. I found no reference to this problem in > > hunspell sf site, but noticed that Hunspell 1.2.14 was released > > yesterday. Need to check if that has some related new. > > Opened an hunspell bug report for bad count problem > > https://sourceforge.net/tracker/?func=detail&aid=3178449&group_id=143754&atid=756395 Reuben Thomas wrote: > I have sent a message to the upstream maintainer informing him of the > situation and asking for the patch to be included in the next release. Proposed patch has been integrated in hunspell upstream by caolan mcnamara. Regards, PS: My old hispalinux.es address is failing silently and I do not if I will ever be able to get it fixed. Please use current gmail address for replies. -- Agustin From debbugs-submit-bounces@debbugs.gnu.org Thu Oct 16 09:54:19 2014 Received: (at 7781) by debbugs.gnu.org; 16 Oct 2014 13:54:19 +0000 Received: from localhost ([127.0.0.1]:45430 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1XelVS-0005wn-D8 for submit@debbugs.gnu.org; Thu, 16 Oct 2014 09:54:18 -0400 Received: from mtaout28.012.net.il ([80.179.55.184]:53925) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1XelVQ-0005we-1V for 7781@debbugs.gnu.org; Thu, 16 Oct 2014 09:54:16 -0400 Received: from conversion-daemon.mtaout28.012.net.il by mtaout28.012.net.il (HyperSendmail v2007.08) id <0NDJ00A00HWZQW00@mtaout28.012.net.il> for 7781@debbugs.gnu.org; Thu, 16 Oct 2014 16:52:29 +0300 (IDT) Received: from HOME-C4E4A596F7 ([87.69.4.28]) by mtaout28.012.net.il (HyperSendmail v2007.08) with ESMTPA id <0NDJ00AC5IJG8R20@mtaout28.012.net.il>; Thu, 16 Oct 2014 16:52:29 +0300 (IDT) Date: Thu, 16 Oct 2014 16:54:16 +0300 From: Eli Zaretskii Subject: Re: bug#7781: 23.2.91; ispell problem with hunspell and UTF-8 file In-reply-to: <20141016133724.GA21044@agmartin.aq.upm.es> X-012-Sender: halo1@inter.net.il To: Agustin Martin Message-id: <834mv43yt3.fsf@gnu.org> References: <87sjx9fula.fsf@sc3d.org> <20141016133724.GA21044@agmartin.aq.upm.es> X-Spam-Score: 1.0 (+) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org, rrt@sc3d.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list Reply-To: Eli Zaretskii List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 1.0 (+) > Date: Thu, 16 Oct 2014 15:37:24 +0200 > From: Agustin Martin > > > Opened an hunspell bug report for bad count problem > > > > https://sourceforge.net/tracker/?func=detail&aid=3178449&group_id=143754&atid=756395 > > Reuben Thomas wrote: > > I have sent a message to the upstream maintainer informing him of the > > situation and asking for the patch to be included in the next release. > > Proposed patch has been integrated in hunspell upstream by caolan mcnamara. Do you mean there's now an official release of Hunspell with this bug fixed? If so, where can one find it? From debbugs-submit-bounces@debbugs.gnu.org Thu Oct 16 10:09:04 2014 Received: (at 7781) by debbugs.gnu.org; 16 Oct 2014 14:09:04 +0000 Received: from localhost ([127.0.0.1]:46030 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Xeljj-0006Qo-Oz for submit@debbugs.gnu.org; Thu, 16 Oct 2014 10:09:04 -0400 Received: from neon-v2.ccupm.upm.es ([138.100.198.70]:46411) by debbugs.gnu.org with esmtp (Exim 4.80) (envelope-from ) id 1Xeljh-0006QP-Ns for 7781@debbugs.gnu.org; Thu, 16 Oct 2014 10:09:02 -0400 Received: from agmartin.aq.upm.es (Agmartin.aq.upm.es [138.100.41.131]) by smtp.upm.es (8.14.4/8.14.4/neon-v2-001) with ESMTP id s9GE8x8T010698; Thu, 16 Oct 2014 16:08:59 +0200 Received: by agmartin.aq.upm.es (Postfix, from userid 1000) id 6824440FDA; Thu, 16 Oct 2014 16:08:59 +0200 (CEST) Date: Thu, 16 Oct 2014 16:08:59 +0200 From: Agustin Martin To: rrt@sc3d.org, 7781@debbugs.gnu.org Subject: Re: bug#7781: 23.2.91; ispell problem with hunspell and UTF-8 file Message-ID: <20141016140859.GA21898@agmartin.aq.upm.es> References: <87sjx9fula.fsf@sc3d.org> <20141016133724.GA21044@agmartin.aq.upm.es> <834mv43yt3.fsf@gnu.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <834mv43yt3.fsf@gnu.org> User-Agent: Mutt/1.5.23 (2014-03-12) X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: 7781 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -2.3 (--) On Thu, Oct 16, 2014 at 04:54:16PM +0300, Eli Zaretskii wrote: > > Date: Thu, 16 Oct 2014 15:37:24 +0200 > > From: Agustin Martin > > > > > Opened an hunspell bug report for bad count problem > > > > > > https://sourceforge.net/tracker/?func=detail&aid=3178449&group_id=143754&atid=756395 > > > > Reuben Thomas wrote: > > > I have sent a message to the upstream maintainer informing him of the > > > situation and asking for the patch to be included in the next release. > > > > Proposed patch has been integrated in hunspell upstream by caolan mcnamara. > > Do you mean there's now an official release of Hunspell with this bug > fixed? If so, where can one find it? I am afraid it only means that fix has been pushed to upstream VCS. http://hunspell.cvs.sourceforge.net/viewvc/hunspell/hunspell/src/tools/hunspell.cxx?r1=1.60&r2=1.61 Another good new is that this is not the only bug just handled, http://sourceforge.net/p/hunspell/bugs/228/ [hunspell:bugs] #228 Some problems with Emacs and init string in pipe mode has been changed to closed-accepted and pushed to the repo (r1.62). Regards, -- Agustin From debbugs-submit-bounces@debbugs.gnu.org Fri Aug 28 08:00:20 2020 Received: (at 7781) by debbugs.gnu.org; 28 Aug 2020 12:00:20 +0000 Received: from localhost ([127.0.0.1]:45283 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kBd3M-00038X-Dk for submit@debbugs.gnu.org; Fri, 28 Aug 2020 08:00:20 -0400 Received: from mail-yb1-f171.google.com ([209.85.219.171]:39687) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kBd3J-00038H-UY for 7781@debbugs.gnu.org; Fri, 28 Aug 2020 08:00:19 -0400 Received: by mail-yb1-f171.google.com with SMTP id q16so388155ybk.6 for <7781@debbugs.gnu.org>; Fri, 28 Aug 2020 05:00:17 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:in-reply-to:references:user-agent :mime-version:date:message-id:subject:to:cc; bh=YyMzdn7dxmOyVBGHqaArSDmlIgEPEFl4n7hxLLNB/14=; b=c4ib70XXT97E1uRP6/OFMGyxRTFVJ8OMYWBHFkh4sysAPpuzL+aD1ir6mroW6IyVLw t3Ah+9kkNUQxAFUtEwwzXbfqPQ+IAkg7o8Wp4cZkC2q+D2+xnQuh9AqCrJxgY1YGJWJr t7xGnYYeXXLG0RE3iZeTCWzetATkLx/M44F7ZG7u943Gp0DMUPKsB1efBVp6eX+xivxU lfdgTTsO4GWm8RHobU6A1HEInhvBb1OnF3lGjmmGqJrTdVCarbQv1tbwDarNGCN3RxEt t2ydmZZuAveu/Gu1t3t0Yho6IaVIAsBUgkNhqW2PrxN35G/iFSQSYyVzq1HRbKcmgYxA ++XQ== X-Gm-Message-State: AOAM533T9qC9vf0LYs2Rr2CSMzIHSKJQZiCG+YRUDHHT4WRwmWfbnj9+ qvBCJFG1w6NBRAeTqxwzePwLTcDaeY8LbbCm8fs= X-Google-Smtp-Source: ABdhPJyavcXLeSnNwhg4sw3Dt8aVHPY9r8fgeoNBcw5BAqvBvipavytKi09hf2K2V1LPess3VWq00aNU06AYJPUQ1PY= X-Received: by 2002:a5b:508:: with SMTP id o8mr2152323ybp.43.1598616012162; Fri, 28 Aug 2020 05:00:12 -0700 (PDT) Received: from 753933720722 named unknown by gmailapi.google.com with HTTPREST; Fri, 28 Aug 2020 05:00:11 -0700 From: Stefan Kangas In-Reply-To: <87sjx9fula.fsf@sc3d.org> (Reuben Thomas's message of "Mon, 03 Jan 2011 23:14:41 +0000") References: <87sjx9fula.fsf@sc3d.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux) MIME-Version: 1.0 Date: Fri, 28 Aug 2020 05:00:11 -0700 Message-ID: Subject: Re: bug#7781: 23.2.91; ispell problem with hunspell and UTF-8 file To: Reuben Thomas Content-Type: text/plain; charset="UTF-8" X-Spam-Score: 0.5 (/) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.5 (/) Reuben Thomas writes: > With the following text, and using emacs -Q, I get the errors you can > see in the messages log below when using hunspell to spell-check a UTF-8 > buffer with some extended characters in it. > > I did test this with emacs -Q, but the current session, in which I > reproduced the problem and am now composing this bug report, was not > started with -Q (this is so submitting the bug report works properly!). > > I am running a freshly bzr-pulled build of the emacs-23 branch. > > Text follows I tried this but couldn't reproduce the bug using current master and Hunspell 1.7.0. Having read the bug report, IIUC, this was a bug in Hunspell and not in Emacs? Are you still able to reproduce this using a recent Emacs and Hunspell? If I don't hear back from you within a couple of weeks, I'll just close this bug as unreproducible. Best regards, Stefan Kangas From debbugs-submit-bounces@debbugs.gnu.org Fri Aug 28 08:36:19 2020 Received: (at 7781) by debbugs.gnu.org; 28 Aug 2020 12:36:19 +0000 Received: from localhost ([127.0.0.1]:45445 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kBdcA-00048w-Vz for submit@debbugs.gnu.org; Fri, 28 Aug 2020 08:36:19 -0400 Received: from eggs.gnu.org ([209.51.188.92]:55136) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kBdcA-00048l-0b for 7781@debbugs.gnu.org; Fri, 28 Aug 2020 08:36:18 -0400 Received: from fencepost.gnu.org ([2001:470:142:3::e]:56851) by eggs.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1kBdc4-0004ow-IR; Fri, 28 Aug 2020 08:36:12 -0400 Received: from [176.228.60.248] (port=2536 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1kBdc2-0002Bu-Ml; Fri, 28 Aug 2020 08:36:12 -0400 Date: Fri, 28 Aug 2020 15:36:01 +0300 Message-Id: <83k0xjymku.fsf@gnu.org> From: Eli Zaretskii To: Stefan Kangas In-Reply-To: (message from Stefan Kangas on Fri, 28 Aug 2020 05:00:11 -0700) Subject: Re: bug#7781: 23.2.91; ispell problem with hunspell and UTF-8 file References: <87sjx9fula.fsf@sc3d.org> X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org, rrt@sc3d.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) > From: Stefan Kangas > Date: Fri, 28 Aug 2020 05:00:11 -0700 > Cc: 7781@debbugs.gnu.org > > Reuben Thomas writes: > > > With the following text, and using emacs -Q, I get the errors you can > > see in the messages log below when using hunspell to spell-check a UTF-8 > > buffer with some extended characters in it. > > > > I did test this with emacs -Q, but the current session, in which I > > reproduced the problem and am now composing this bug report, was not > > started with -Q (this is so submitting the bug report works properly!). > > > > I am running a freshly bzr-pulled build of the emacs-23 branch. > > > > Text follows > > I tried this but couldn't reproduce the bug using current master and > Hunspell 1.7.0. Having read the bug report, IIUC, this was a bug in > Hunspell and not in Emacs? > > Are you still able to reproduce this using a recent Emacs and Hunspell? Some (old) versions of Hunspell had a bug, whereby the mis-spelled words were reported with offsets in bytes, not in characters. When this happens, ispell.el reports "misalignment" errors. I don't remember when (or even if) Hunspell fixed that problem (in the version I use I fixed it myself), but if 1.7.0 has that problem fixed, you will not see the problem. From debbugs-submit-bounces@debbugs.gnu.org Fri Aug 28 08:56:23 2020 Received: (at 7781) by debbugs.gnu.org; 28 Aug 2020 12:56:23 +0000 Received: from localhost ([127.0.0.1]:45476 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kBdvb-0004dC-7e for submit@debbugs.gnu.org; Fri, 28 Aug 2020 08:56:23 -0400 Received: from mail-yb1-f176.google.com ([209.85.219.176]:43197) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kBdvU-0004cY-5y for 7781@debbugs.gnu.org; Fri, 28 Aug 2020 08:56:18 -0400 Received: by mail-yb1-f176.google.com with SMTP id m200so468618ybf.10 for <7781@debbugs.gnu.org>; Fri, 28 Aug 2020 05:56:16 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:in-reply-to:references:mime-version:date :message-id:subject:to:cc; bh=izk6wR/UaIQPinwhJ1kl7ROlEL/rdaNX5+kdgArUbI8=; b=sW69EU+ltGwjf4efomW5H6ha9oDUpnPWw/FMnow0ANBwQTBo7Cze/HIITLtOqOB8od AP9COjPYoily3A+c/do5OucEyCUI5yqdogNrX7eJyRSpaWqgF+0eZ//i7+J3aqIXOLLB xUwkRavkG5VedTanJJjZGhclQY7ZOJCqTQG3QYZeYJOBkQeILyiicXKmgwvPDwSOCp7b kWynLX1GpOLznh+1zKswboMdQiZ4z2SGtrPmTL60XAOeP0epN27+O5/jwAlIFxI+2zYb lAi5rdzxrCKlLk0pOOg70qa8Ls17uB5RphA0xvDYhM3UBhtsMyUC0eq1WpOEvx/koORo qqjg== X-Gm-Message-State: AOAM531O3owwpeG/1qYaFwyZvVoVnUgChSwjbJcfAA3QOyf3SDzM2hIv 3Uh/yY4tx6Stj77KcnbSfFBfqLPzISH7R5OAaW8= X-Google-Smtp-Source: ABdhPJwBP5lhZRCaxnQ72jByCy9sTWBN+S0Fidt395RcEzUJEEb9Ly7VTcmJ0aPoO8qwrSf+enCO7n314WiWZcU1JSE= X-Received: by 2002:a5b:508:: with SMTP id o8mr2498346ybp.43.1598619370502; Fri, 28 Aug 2020 05:56:10 -0700 (PDT) Received: from 753933720722 named unknown by gmailapi.google.com with HTTPREST; Fri, 28 Aug 2020 05:56:10 -0700 From: Stefan Kangas In-Reply-To: <83k0xjymku.fsf@gnu.org> References: <87sjx9fula.fsf@sc3d.org> <83k0xjymku.fsf@gnu.org> MIME-Version: 1.0 Date: Fri, 28 Aug 2020 05:56:09 -0700 Message-ID: Subject: Re: bug#7781: 23.2.91; ispell problem with hunspell and UTF-8 file To: Eli Zaretskii Content-Type: text/plain; charset="UTF-8" X-Spam-Score: 0.5 (/) X-Debbugs-Envelope-To: 7781 Cc: 7781@debbugs.gnu.org, rrt@sc3d.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.5 (/) tags 7781 + notabug close 7781 thanks Eli Zaretskii writes: > Some (old) versions of Hunspell had a bug, whereby the mis-spelled > words were reported with offsets in bytes, not in characters. When > this happens, ispell.el reports "misalignment" errors. > > I don't remember when (or even if) Hunspell fixed that problem (in the > version I use I fixed it myself), but if 1.7.0 has that problem fixed, > you will not see the problem. Thanks, so this is not a bug in Emacs. I'm therefore closing this bug report. If this conclusion is incorrect, please reopen the bug report. Best regards, Stefan Kangas From unknown Fri Aug 15 03:57:10 2025 Received: (at fakecontrol) by fakecontrolmessage; To: internal_control@debbugs.gnu.org From: Debbugs Internal Request Subject: Internal Control Message-Id: bug archived. Date: Sat, 26 Sep 2020 11:24:07 +0000 User-Agent: Fakemail v42.6.9 # This is a fake control message. # # The action: # bug archived. thanks # This fakemail brought to you by your local debbugs # administrator