From debbugs-submit-bounces@debbugs.gnu.org Wed Mar 23 14:11:56 2016 Received: (at submit) by debbugs.gnu.org; 23 Mar 2016 18:11:56 +0000 Received: from localhost ([127.0.0.1]:34699 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1ainG7-0007nU-RM for submit@debbugs.gnu.org; Wed, 23 Mar 2016 14:11:56 -0400 Received: from eggs.gnu.org ([208.118.235.92]:38004) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1ainG6-0007nE-UD for submit@debbugs.gnu.org; Wed, 23 Mar 2016 14:11:55 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ainG0-0002pQ-JY for submit@debbugs.gnu.org; Wed, 23 Mar 2016 14:11:49 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=0.8 required=5.0 tests=BAYES_50,FREEMAIL_FROM, T_DKIM_INVALID autolearn=disabled version=3.3.2 Received: from lists.gnu.org ([2001:4830:134:3::11]:37858) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ainG0-0002pM-G2 for submit@debbugs.gnu.org; Wed, 23 Mar 2016 14:11:48 -0400 Received: from eggs.gnu.org ([2001:4830:134:3::10]:57191) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ainFz-0000YA-LE for bug-gnu-emacs@gnu.org; Wed, 23 Mar 2016 14:11:48 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ainFu-0002oD-LY for bug-gnu-emacs@gnu.org; Wed, 23 Mar 2016 14:11:47 -0400 Received: from mail-lf0-x235.google.com ([2a00:1450:4010:c07::235]:33571) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ainFu-0002mr-DI for bug-gnu-emacs@gnu.org; Wed, 23 Mar 2016 14:11:42 -0400 Received: by mail-lf0-x235.google.com with SMTP id o73so17604652lfe.0 for ; Wed, 23 Mar 2016 11:11:42 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=from:to:subject:message-id:date:user-agent:mime-version :content-transfer-encoding; bh=ajVq6aHx8lCtKBcjXkdGy2e4QuQfeZnFmos3Yw1TcUc=; b=hBqOlX07WYHRf748f+L+RztJC0yGvy6gxEYvabM0XszAVcnCpjWve6kwgGx/hs00VJ wRgpyv71ipJsNEzxNOjnTeo8EkfTmMN2b5w4aren+LFBYt/8cLDY5s2DGp4fw0q76otr BAVRBU/fUBmEszd5QnazpMh0w/m9gc/H3rKIDRCZdoZTHpxoC4RH/hWylGC8LnHowels 64QiYxbaCfqDN+Z4mCN9iXRE9fvnA3i7B06kvpmtM5Kob6vukqPDryhBzkqpj9upXL4J 5nO0fp5Fcz8JPH0TS8XxuhtK1Qeq9o0MVTwkNi09U5EGXEZw0JRBHIFJKJxiYFQOTSYe V2iQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:from:to:subject:message-id:date:user-agent :mime-version:content-transfer-encoding; bh=ajVq6aHx8lCtKBcjXkdGy2e4QuQfeZnFmos3Yw1TcUc=; b=Oeh6JjRS7OyDx0D2ndBTnexcYyB2fEdyzMZzFGXPKnBc0MW0QmpK65PIXtfdw+9QgS iSPPMNz3BVCs/4mEjQo+ZSIb+LdrgY3+eEDbY7f5LSVGST6FJ7PysG19khZngpqEeqJW 4QTWVjOJSfYa1w9DlREd3dE41ew6lxloJqBrcgI+K1MapCssyeqVcznq6NDLg8loVAHY DgWh/usNqNF+hyaL9jMuayLrl3zHLK8tcb5VZbtu98ef2JKIfu/EBxZEZVAB96LwZvQh CukJpgDTHS0Ygb4gZsSwXZQcoN2OQcl8Qr4rCei5LwPtVjIpzU8bXa3UJ5I65I4/TOSv SSmg== X-Gm-Message-State: AD7BkJJ4ePGvZeJ/lcerXG96+9qNmwrUGPRQX79fvrik8Z8fI/tnxSydZocyRVJir6r2kw== X-Received: by 10.25.22.214 with SMTP id 83mr1559343lfw.60.1458756701423; Wed, 23 Mar 2016 11:11:41 -0700 (PDT) Received: from [192.168.199.2] (broadband-95-84-209-126.nationalcablenetworks.ru. [95.84.209.126]) by smtp.gmail.com with ESMTPSA id c14sm588190lfc.9.2016.03.23.11.11.40 for (version=TLSv1/SSLv3 cipher=OTHER); Wed, 23 Mar 2016 11:11:40 -0700 (PDT) From: Nikolay Kudryavtsev X-Google-Original-From: Nikolay Kudryavtsev To: bug-gnu-emacs@gnu.org Subject: 24.5; ispell.el: lines with both CASECHARS and NOT-CASECHARS get sent to the spell checker Message-ID: <56F2DC47.2090600@gmail.com> Date: Wed, 23 Mar 2016 21:11:19 +0300 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:38.0) Gecko/20100101 Thunderbird/38.6.0 MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-Received-From: 2001:4830:134:3::11 X-Spam-Score: -4.0 (----) X-Debbugs-Envelope-To: submit X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -4.0 (----) Each entry ispell-dictionary-alist has elements called CASECHARS and NOT-CASECHARS. They are used for defining what gets sent to the spell checker and what does not. One use case for them is that, if you have two dictionaries for languages with totally different alphabets, you can spellcheck a file where both languages are mixed together. In theory. Here's what happens in practice: If line contains only CASECHARS, it gets sent to the spell checker. If line contains only NOT-CASECHARS, it does not get sent to the spell checker. If line contains both CASECHARS and NOT-CASECHARS, the whole line gets sent to the spell checker. Sending the whole line makes NOT-CASECHARS pretty useless. I think the reasonable behavior in this case would be sending the line word by word. Here's how to repeat this with aspell. 1. Starting from emacs -Q eval this: (setq ispell-program-name "aspell") (defun ispell-set-my-dictionaries() (setq ispell-dictionary-alist (delq (assoc "english" ispell-dictionary-alist) ispell-dictionary-alist)) (add-to-list 'ispell-dictionary-alist '("english" "[kcat]" "[dogh]" "[']" nil ("-B") nil iso-8859-1))) (advice-add 'ispell-set-spellchecker-params :after #'ispell-set-my-dictionaries) 2. ispell-change-dictionary to english. 3. ispell-buffer a buffer containing this: kat doh kat doh "Kat" at the first line would get sent to aspell, since it passes CASECHARS. This is fine. "Doh" at the second line would be ignored, since it's not in CASECHARS. This is fine too. At the line with both words, not only "kat" would get sent, but also "doh" and that's what we don't want to happen. -- Best Regards, Nikolay Kudryavtsev From debbugs-submit-bounces@debbugs.gnu.org Wed Mar 23 14:23:08 2016 Received: (at 23097) by debbugs.gnu.org; 23 Mar 2016 18:23:08 +0000 Received: from localhost ([127.0.0.1]:34724 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1ainQy-00087J-MT for submit@debbugs.gnu.org; Wed, 23 Mar 2016 14:23:08 -0400 Received: from eggs.gnu.org ([208.118.235.92]:40292) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1ainQy-000872-0k for 23097@debbugs.gnu.org; Wed, 23 Mar 2016 14:23:08 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ainQp-0005KT-0p for 23097@debbugs.gnu.org; Wed, 23 Mar 2016 14:23:02 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=0.8 required=5.0 tests=BAYES_50,T_RP_MATCHES_RCVD autolearn=disabled version=3.3.2 Received: from fencepost.gnu.org ([2001:4830:134:3::e]:56074) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ainQo-0005KO-Tq; Wed, 23 Mar 2016 14:22:58 -0400 Received: from 84.94.185.246.cable.012.net.il ([84.94.185.246]:2115 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_128_CBC_SHA1:128) (Exim 4.82) (envelope-from ) id 1ainQo-0001sc-1x; Wed, 23 Mar 2016 14:22:58 -0400 Date: Wed, 23 Mar 2016 20:22:42 +0200 Message-Id: <83fuvh2gwd.fsf@gnu.org> From: Eli Zaretskii To: Nikolay Kudryavtsev In-reply-to: <56F2DC47.2090600@gmail.com> (message from Nikolay Kudryavtsev on Wed, 23 Mar 2016 21:11:19 +0300) Subject: Re: bug#23097: 24.5; ispell.el: lines with both CASECHARS and NOT-CASECHARS get sent to the spell checker References: <56F2DC47.2090600@gmail.com> X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 2001:4830:134:3::e X-Spam-Score: -5.0 (-----) X-Debbugs-Envelope-To: 23097 Cc: 23097@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: Eli Zaretskii Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -5.0 (-----) > From: Nikolay Kudryavtsev > Date: Wed, 23 Mar 2016 21:11:19 +0300 > > Each entry ispell-dictionary-alist has elements called CASECHARS and > NOT-CASECHARS. They are used for defining what gets sent to the spell > checker and what does not. > > One use case for them is that, if you have two dictionaries for > languages with totally different alphabets, you can spellcheck a file > where both languages are mixed together. In theory. Don't you need to restart the spell-checker each time you switch the dictionaries? AFAIK, only Hunspell supports such mixed spell-checking, and with Hunspell you don't need to break the line into separate words in that case. With any other spell-checker, you need to restart it whenever you switch languages. From debbugs-submit-bounces@debbugs.gnu.org Wed Mar 23 16:13:12 2016 Received: (at 23097) by debbugs.gnu.org; 23 Mar 2016 20:13:12 +0000 Received: from localhost ([127.0.0.1]:34795 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1aip9U-0004B0-3g for submit@debbugs.gnu.org; Wed, 23 Mar 2016 16:13:12 -0400 Received: from mail-lb0-f173.google.com ([209.85.217.173]:35136) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1aip9T-0004Am-3X for 23097@debbugs.gnu.org; Wed, 23 Mar 2016 16:13:11 -0400 Received: by mail-lb0-f173.google.com with SMTP id bc4so17528838lbc.2 for <23097@debbugs.gnu.org>; Wed, 23 Mar 2016 13:13:11 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=from:subject:to:references:cc:message-id:date:user-agent :mime-version:in-reply-to; bh=cQEYhqVlTDe1Dgr4gKBHS0Dri73d2/HtaLfHPHDssQQ=; b=R+s1l9i+bOWJFhs8cgbHfSnIK/N2gm82Pb/ECudsqooxurw0d1qjBqIcA+CsA5yUi3 TvuOse6o/AsszgcGwyBG5+rSdlpzrVNwSfg83Gff4All/FW/KTIKLwD8S6/Sm7nhS4ai Wl2n8ls+OlRy2c0SQw6UCGy6T7WMPOQY7KfEQBA/x4tTLU2oDyeKqA5QQrCDU+Oif2uA 2X9dLTJQhwyfmnEa9jOR1YSbHt0dA/p2VMTcsDkUb1+c/ttDqHTad60UVfGaXxnUGbVd /iQGjBKrymr6d5IKeYupjUVaiowuGhc125pj5Sn4xgkxoB2pWEpLRRwRC4GnijPzZrv2 iZjA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:from:subject:to:references:cc:message-id:date :user-agent:mime-version:in-reply-to; bh=cQEYhqVlTDe1Dgr4gKBHS0Dri73d2/HtaLfHPHDssQQ=; b=QNilW18jI9fL+dlkWnqEp/WH0M5uSmTKTjKdxLbtDlaghJKvvf2O8SU3xW+lEQz216 tNq4WLCaIc+MpajuUEB2KlL7BsSuyWWy9TgDFefPDQmlhrhm+4SFn0E0JoXBENF3YjLa e9oj3OA3httsgMA8U/Gw+xb2UUElZHeA4h2c5DDU4rDTW3Dha5+tFX8Eou6pMhdrKvxW UQqQKRwtGKWai4EW2PihzrO1tCNIOP6vwezTC6XRv4waIFpuzGRI4wywmIhUcx/p3eTn laBBXs6ENEn2/4Vg+/iW88NdLbck7E6qSpOjXnjjc00GgEaPyykEzjcqVWIsc2DdYLHd KXRg== X-Gm-Message-State: AD7BkJKb/qFITGAvpglb1O00m5ZLSuDXvz1G3jGlfrhvta8b4QLJ6ZZifx2o02fFiCUD3Q== X-Received: by 10.112.201.202 with SMTP id kc10mr2040780lbc.33.1458763985374; Wed, 23 Mar 2016 13:13:05 -0700 (PDT) Received: from [192.168.199.2] (broadband-95-84-209-126.nationalcablenetworks.ru. [95.84.209.126]) by smtp.gmail.com with ESMTPSA id nv8sm633690lbb.7.2016.03.23.13.13.04 (version=TLSv1/SSLv3 cipher=OTHER); Wed, 23 Mar 2016 13:13:04 -0700 (PDT) From: Nikolay Kudryavtsev X-Google-Original-From: Nikolay Kudryavtsev Subject: Re: bug#23097: 24.5; ispell.el: lines with both CASECHARS and NOT-CASECHARS get sent to the spell checker To: Eli Zaretskii References: <56F2DC47.2090600@gmail.com> <83fuvh2gwd.fsf@gnu.org> Message-ID: <56F2F8BC.5020000@gmail.com> Date: Wed, 23 Mar 2016 23:12:44 +0300 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:38.0) Gecko/20100101 Thunderbird/38.6.0 MIME-Version: 1.0 In-Reply-To: <83fuvh2gwd.fsf@gnu.org> Content-Type: multipart/alternative; boundary="------------050207020505010703030303" X-Spam-Score: -0.7 (/) X-Debbugs-Envelope-To: 23097 Cc: 23097@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.7 (/) This is a multi-part message in MIME format. --------------050207020505010703030303 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Yes, you do need to restart the spell checker when you switch dictionaries, but it's not too inconvenient in practice. As you know, I've ran into issues with hunspell, which I described in this thread . -- Best Regards, Nikolay Kudryavtsev --------------050207020505010703030303 Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: 7bit Yes, you do need to restart the spell checker when you switch dictionaries, but it's not too inconvenient in practice.

As you know, I've ran into issues with hunspell, which I described in this thread.
-- 
Best Regards,
Nikolay Kudryavtsev
--------------050207020505010703030303-- From debbugs-submit-bounces@debbugs.gnu.org Sat Aug 15 00:22:34 2020 Received: (at 23097) by debbugs.gnu.org; 15 Aug 2020 04:22:34 +0000 Received: from localhost ([127.0.0.1]:54160 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1k6niE-0003Ub-5a for submit@debbugs.gnu.org; Sat, 15 Aug 2020 00:22:34 -0400 Received: from mail-yb1-f174.google.com ([209.85.219.174]:39211) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1k6niA-0003U8-KA for 23097@debbugs.gnu.org; Sat, 15 Aug 2020 00:22:31 -0400 Received: by mail-yb1-f174.google.com with SMTP id q16so6306382ybk.6 for <23097@debbugs.gnu.org>; Fri, 14 Aug 2020 21:22:30 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:in-reply-to:references:user-agent :mime-version:date:message-id:subject:to:cc; bh=/dsI6Gv8cuAfW377s8EMNLOO2CPjiZjvMmbSCWQy0Qg=; b=uEpO1X+Ezs1SajjkcIjyZ8JMRLnpbVpOL/UrZYvHyHUZt+B+0IySQJqJwAz8f6jh28 lc/koWxLLMDR2R3M4Aj5v0N+03cMrNF2dNN1RsVGCBnGdC0yzWjVWU8N+cCmAq+Jb3a0 bhBXSRZnCOKQKksG2tQzMpivCUOXiWt71O9I0UQzpcfEuaRMrsdjSg7zaULqHmVORJCZ MiygsP/U+UP2WE7yjvcLtYS+XWp+mfNAkipunWCWrHG4FdSfaEmdnAwPbDCCryC4zGd1 RClcxCTQO9/WeWV/3UCbBfKClJD9e0Z/oEEEjl8CAQR5Q92+cq8dOE8SajL1Vpmk2/UO Cl3A== X-Gm-Message-State: AOAM531R7G27xrH9ZbtMjl5GA/MaZgwz7sazGTqbBgVgEHffX7n3f+As 7TP10z0m7v3ONCR7IOVjLJbb5eA5nXaq+FST/xU= X-Google-Smtp-Source: ABdhPJzffEspw9Nb5s7eNN87PNL2w6IQelHIzHn7KzX7t0wNbe2vOohzVCDePG6dIjdLH0wG6DQgqIyy7YMm69aop+s= X-Received: by 2002:a5b:508:: with SMTP id o8mr8653716ybp.43.1597465345208; Fri, 14 Aug 2020 21:22:25 -0700 (PDT) Received: from 753933720722 named unknown by gmailapi.google.com with HTTPREST; Fri, 14 Aug 2020 21:22:24 -0700 From: Stefan Kangas In-Reply-To: <83fuvh2gwd.fsf@gnu.org> (Eli Zaretskii's message of "Wed, 23 Mar 2016 20:22:42 +0200") References: <56F2DC47.2090600@gmail.com> <83fuvh2gwd.fsf@gnu.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux) MIME-Version: 1.0 Date: Fri, 14 Aug 2020 21:22:24 -0700 Message-ID: Subject: Re: bug#23097: 24.5; ispell.el: lines with both CASECHARS and NOT-CASECHARS get sent to the spell checker To: Eli Zaretskii Content-Type: text/plain; charset="UTF-8" X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 23097 Cc: 23097@debbugs.gnu.org, Nikolay Kudryavtsev X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) Eli Zaretskii writes: >> From: Nikolay Kudryavtsev >> Date: Wed, 23 Mar 2016 21:11:19 +0300 >> >> Each entry ispell-dictionary-alist has elements called CASECHARS and >> NOT-CASECHARS. They are used for defining what gets sent to the spell >> checker and what does not. >> >> One use case for them is that, if you have two dictionaries for >> languages with totally different alphabets, you can spellcheck a file >> where both languages are mixed together. In theory. > > Don't you need to restart the spell-checker each time you switch the > dictionaries? AFAIK, only Hunspell supports such mixed > spell-checking, and with Hunspell you don't need to break the line > into separate words in that case. With any other spell-checker, you > need to restart it whenever you switch languages. It seems like this is a limitation of external software then, and not in Emacs? Should this therefore be closed, or is there anything more to do here? Best regards, Stefan Kangas From debbugs-submit-bounces@debbugs.gnu.org Sat Aug 15 00:22:30 2020 Received: (at control) by debbugs.gnu.org; 15 Aug 2020 04:22:30 +0000 Received: from localhost ([127.0.0.1]:54157 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1k6niA-0003UJ-0f for submit@debbugs.gnu.org; Sat, 15 Aug 2020 00:22:30 -0400 Received: from mail-yb1-f181.google.com ([209.85.219.181]:33011) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1k6ni8-0003Ty-8N for control@debbugs.gnu.org; Sat, 15 Aug 2020 00:22:28 -0400 Received: by mail-yb1-f181.google.com with SMTP id p191so6332394ybg.0 for ; Fri, 14 Aug 2020 21:22:28 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:mime-version:date:message-id:subject:to; bh=8lWoAiiE2EvDeiEAmLsmPtFIzNO4DUTcJuOSCG93rrY=; b=bAcfkspcX0u4QI+W39mbgnOU7asGdwdwHVpB08kXy3n+q21fG55tLHrTRDOZ/Fr1Ip Hw8L8vTAOabdgpsasxHuVUcn0i7/v6gevpzXhVYCrRfdRspS8doEjsKCtqhTOlAbb2ct VlhwCYXb2M2gKd0AGTEd+80vVa/BcMfDxJ8yV9wO+x1lXpLtJt6IuR04M8DywZFiI1fZ McLljWCLQNkfUP7umcDYAnN3EfEz7BO+aibzdvw4xvhw3dwSMHmXvvarPsfeuWZYsHTy yLoEoQdtAJFNdzpbDWsA0bcGcrGnPLt7SRhJz1oa8j7Lz5VXugVmz4MQh/1eW0GydCiD ntRw== X-Gm-Message-State: AOAM531pGynCpQ0rXRXOqPnZYR/aC5T346iKyXa/HaWEJ3NnLHZ1M1PD 5HCLvWkKoM1hFz9fXdNN7hbqv7U83Wfd2FGzCxFq1/pD61iheA== X-Google-Smtp-Source: ABdhPJyuY3cp9Kfn5xPu+WMvnTv+f9Tdw7Q+JBPysMF5NWR26EcZ1eKuuDnpPH58b83+LQjeiVM5Hp2qHwBDqwE6Vys= X-Received: by 2002:a25:4609:: with SMTP id t9mr7827796yba.231.1597465342787; Fri, 14 Aug 2020 21:22:22 -0700 (PDT) Received: from 753933720722 named unknown by gmailapi.google.com with HTTPREST; Fri, 14 Aug 2020 21:22:22 -0700 From: Stefan Kangas MIME-Version: 1.0 Date: Fri, 14 Aug 2020 21:22:22 -0700 Message-ID: Subject: To: control@debbugs.gnu.org Content-Type: text/plain; charset="UTF-8" X-Spam-Score: 2.0 (++) X-Spam-Report: Spam detection software, running on the system "debbugs.gnu.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see the administrator of that system for details. Content preview: tags 23097 + moreinfo thanks Content analysis details: (2.0 points, 10.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- 0.0 SPF_HELO_NONE SPF: HELO does not publish an SPF Record 0.0 HEADER_FROM_DIFFERENT_DOMAINS From and EnvelopeFrom 2nd level mail domains are different 0.0 FREEMAIL_FROM Sender email is commonly abused enduser mail provider (stefankangas[at]gmail.com) -0.0 SPF_PASS SPF: sender matches SPF record -0.0 RCVD_IN_DNSWL_NONE RBL: Sender listed at https://www.dnswl.org/, no trust [209.85.219.181 listed in list.dnswl.org] -0.0 RCVD_IN_MSPIKE_H2 RBL: Average reputation (+2) [209.85.219.181 listed in wl.mailspike.net] 0.0 UNPARSEABLE_RELAY Informational: message has unparseable relay lines 2.0 BLANK_SUBJECT Subject is present but empty 0.0 FREEMAIL_FORGED_FROMDOMAIN 2nd level domains in From and EnvelopeFrom freemail headers are different X-Debbugs-Envelope-To: control X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 1.0 (+) tags 23097 + moreinfo thanks From debbugs-submit-bounces@debbugs.gnu.org Sat Aug 15 12:15:36 2020 Received: (at 23097) by debbugs.gnu.org; 15 Aug 2020 16:15:36 +0000 Received: from localhost ([127.0.0.1]:56327 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1k6yqF-00024m-Qq for submit@debbugs.gnu.org; Sat, 15 Aug 2020 12:15:36 -0400 Received: from eggs.gnu.org ([209.51.188.92]:48336) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1k6yqD-00024W-D7 for 23097@debbugs.gnu.org; Sat, 15 Aug 2020 12:15:33 -0400 Received: from fencepost.gnu.org ([2001:470:142:3::e]:51175) by eggs.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1k6yq7-0003JI-Uh; Sat, 15 Aug 2020 12:15:27 -0400 Received: from [176.228.60.248] (port=3195 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1k6yq6-0000F4-Vo; Sat, 15 Aug 2020 12:15:27 -0400 Date: Sat, 15 Aug 2020 19:15:14 +0300 Message-Id: <83bljbkhrh.fsf@gnu.org> From: Eli Zaretskii To: Stefan Kangas In-Reply-To: (message from Stefan Kangas on Fri, 14 Aug 2020 21:22:24 -0700) Subject: Re: bug#23097: 24.5; ispell.el: lines with both CASECHARS and NOT-CASECHARS get sent to the spell checker References: <56F2DC47.2090600@gmail.com> <83fuvh2gwd.fsf@gnu.org> X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: 23097 Cc: 23097@debbugs.gnu.org, nikolay.kudryavtsev@gmail.com X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) > From: Stefan Kangas > Date: Fri, 14 Aug 2020 21:22:24 -0700 > Cc: Nikolay Kudryavtsev , 23097@debbugs.gnu.org > > Eli Zaretskii writes: > > >> From: Nikolay Kudryavtsev > >> Date: Wed, 23 Mar 2016 21:11:19 +0300 > >> > >> Each entry ispell-dictionary-alist has elements called CASECHARS and > >> NOT-CASECHARS. They are used for defining what gets sent to the spell > >> checker and what does not. > >> > >> One use case for them is that, if you have two dictionaries for > >> languages with totally different alphabets, you can spellcheck a file > >> where both languages are mixed together. In theory. > > > > Don't you need to restart the spell-checker each time you switch the > > dictionaries? AFAIK, only Hunspell supports such mixed > > spell-checking, and with Hunspell you don't need to break the line > > into separate words in that case. With any other spell-checker, you > > need to restart it whenever you switch languages. > > It seems like this is a limitation of external software then, and not in > Emacs? Should this therefore be closed, or is there anything more to do > here? Yes, I think we should close this issue. From debbugs-submit-bounces@debbugs.gnu.org Sat Aug 15 12:40:46 2020 Received: (at 23097) by debbugs.gnu.org; 15 Aug 2020 16:40:46 +0000 Received: from localhost ([127.0.0.1]:56350 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1k6zEb-0002iq-Uj for submit@debbugs.gnu.org; Sat, 15 Aug 2020 12:40:46 -0400 Received: from mail-yb1-f182.google.com ([209.85.219.182]:46000) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1k6zEZ-0002iX-SW for 23097@debbugs.gnu.org; Sat, 15 Aug 2020 12:40:44 -0400 Received: by mail-yb1-f182.google.com with SMTP id x2so6933767ybf.12 for <23097@debbugs.gnu.org>; Sat, 15 Aug 2020 09:40:43 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:in-reply-to:references:mime-version:date :message-id:subject:to:cc; bh=AWs6Bmq1urc21IdKFVOKIWXp/JPyH4/kBlMe2Dp26Qs=; b=XVXjRzbSNwCY+UGcbXq/ottTd6m/bb46uGxOe7csT11MU9DwXlzrGhk0OqkcBBQyQ/ UuogeZyBaiHlYOxUhIPbX+NQCW/LGTzUst7xzc3wAk7UTH4wV/LY6NhbATob3VPiGjjH vIMbYUblNpQOcxkCIhyH1yKehjRyGmuoC3BB8Ophh5JYEf2/J/noz/HecNQlZGFDztev PfWhq87NbNPQ6dxHTG3LcK0N05fdQnyy4LZ+lpUsb7RCYlMSsbj8ePpGUzIbzFAl0ke9 dHXVH7ILPVIqHEGWWYal6Bm+BFrYtNhF7A+URzLo0EYerObZlJZNBH35rw/WhE8+Na8o bd4w== X-Gm-Message-State: AOAM531NfJJTBb8nV5pxgxMHkIpWtQhymsriTiZeyORDRGuKge96/sZw k7qu9p8bVzj0u3JBZN/4ScRjfoVZWSPMPvFNJ/4= X-Google-Smtp-Source: ABdhPJxPryE9hnnlEDbeceNlzqOE9EVoJbnP1kNklZaoMUuUidgGOaN6uM68Za0P20TTTPm96B4Si1QXO6/VWNqzNk8= X-Received: by 2002:a5b:508:: with SMTP id o8mr11734022ybp.43.1597509638330; Sat, 15 Aug 2020 09:40:38 -0700 (PDT) Received: from 753933720722 named unknown by gmailapi.google.com with HTTPREST; Sat, 15 Aug 2020 09:40:37 -0700 From: Stefan Kangas In-Reply-To: <83bljbkhrh.fsf@gnu.org> References: <56F2DC47.2090600@gmail.com> <83fuvh2gwd.fsf@gnu.org> <83bljbkhrh.fsf@gnu.org> MIME-Version: 1.0 Date: Sat, 15 Aug 2020 09:40:37 -0700 Message-ID: Subject: Re: bug#23097: 24.5; ispell.el: lines with both CASECHARS and NOT-CASECHARS get sent to the spell checker To: Eli Zaretskii Content-Type: text/plain; charset="UTF-8" X-Spam-Score: 0.1 (/) X-Debbugs-Envelope-To: 23097 Cc: 23097@debbugs.gnu.org, nikolay.kudryavtsev@gmail.com X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.9 (/) tags 23097 + notabug close 23097 thanks Eli Zaretskii writes: > Yes, I think we should close this issue. Thanks, done. From debbugs-submit-bounces@debbugs.gnu.org Mon Aug 17 05:20:24 2020 Received: (at 23097) by debbugs.gnu.org; 17 Aug 2020 09:20:24 +0000 Received: from localhost ([127.0.0.1]:59247 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1k7bJY-0003Jw-7j for submit@debbugs.gnu.org; Mon, 17 Aug 2020 05:20:24 -0400 Received: from mail-lj1-f180.google.com ([209.85.208.180]:41098) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1k7bJR-0003Jc-LE for 23097@debbugs.gnu.org; Mon, 17 Aug 2020 05:20:22 -0400 Received: by mail-lj1-f180.google.com with SMTP id f26so16648582ljc.8 for <23097@debbugs.gnu.org>; Mon, 17 Aug 2020 02:20:17 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:subject:to:cc:references:message-id:date:user-agent :mime-version:in-reply-to:content-transfer-encoding:content-language; bh=pFBW+5jdWEbaPCXBvV5mzYdXVkCLhaqR/33GjQ0KoCM=; b=CeNTKX8/YON2NUfApbalZSBMzrjDS9HzR23aJIxe4RD0YJgGtIJoJ0E+XpgIofrdd1 JK8Rq1BSasuC3X0kbrAdWSzPVmHP5b0wHdDD+pTCFqaFMUHXUEizO78L42zQ1n8jhUxY c0RYNCYzhIb6Z72rG5NDeqa1ovvgZEBjh43XTgZcez1QgvdlHvazzbIlaueBYaYM7JPP eUGC7z3NeiRFpAeSOrJ47xvyBQVX+pGb3tkwuFY7zX/qAd50uEjXDjC8Cml0rhIXddfg FgW8bJKeeNWGqid/eM1CuS7NFPlmRdjSb0IbrCVuHg8m5+aFCNesaLhb4OajuqQiBYx6 p8ow== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:subject:to:cc:references:message-id:date :user-agent:mime-version:in-reply-to:content-transfer-encoding :content-language; bh=pFBW+5jdWEbaPCXBvV5mzYdXVkCLhaqR/33GjQ0KoCM=; b=mwAgYR0eWPZrOE52B7eyMhTNK9fsKcQcmhLiCjs4oSsYsK6QZc2a06YVu7UKQsLEt+ NMQVNr1JFVkRb0Lrd18FfrBhit+su623DFxtIUxzIPdId/1HoHIQG6xq49g0vRq+D8yk mUHfmouUoMjciQXZpYy9K/1VftrUcycWn+4iOvlJqxa+5kYXkixDvm5Wedti0UrDwAPE sgZ+OQFqMjhEkmyCW6b2rfM7WBsgraGx/8FetZiAaZTZYNL8qOKiGSxleNt6Kr5/ub4Y P1pxeuZQaiIAn8JkytO0MH00+IH1nKMV0MB9WCDr1NsHcCJlNWh2lq/QIBWlTkKKdh40 yVTQ== X-Gm-Message-State: AOAM533l9b4peH/mNYtDTe1H4QSiFZyBsua5LX9+nkUqkjpsO308wr47 216IHsK/Yx8yAg6A4VoUAi55YvPyP/c= X-Google-Smtp-Source: ABdhPJxchgBEE074m25XMu1WRCxlqAMAyqChEWZxWXXEX1b4q4mAMO7jwlyhyaWaWtWSEpMgo9ipnQ== X-Received: by 2002:a2e:6808:: with SMTP id c8mr7512618lja.13.1597656011155; Mon, 17 Aug 2020 02:20:11 -0700 (PDT) Received: from ?IPv6:2a02:2168:b010:9800:6dc0:5c5f:9c37:8471? ([2a02:2168:b010:9800:6dc0:5c5f:9c37:8471]) by smtp.gmail.com with ESMTPSA id z25sm4892234lji.33.2020.08.17.02.20.10 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Mon, 17 Aug 2020 02:20:10 -0700 (PDT) From: Nikolay Kudryavtsev X-Google-Original-From: Nikolay Kudryavtsev Subject: Re: bug#23097: 24.5; ispell.el: lines with both CASECHARS and NOT-CASECHARS get sent to the spell checker To: Stefan Kangas , Eli Zaretskii References: <56F2DC47.2090600@gmail.com> <83fuvh2gwd.fsf@gnu.org> <83bljbkhrh.fsf@gnu.org> Message-ID: <5fe8e18f-efb4-4f9b-fd85-0cb4eccc58b4@gmail.com> Date: Mon, 17 Aug 2020 12:20:08 +0300 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:68.0) Gecko/20100101 Thunderbird/68.11.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Content-Language: ru X-Spam-Score: -1.0 (-) X-Debbugs-Envelope-To: 23097 Cc: 23097@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -2.0 (--) This is not an external software bug, but very much an Emacs bug. I'm not sure what was the initial design idea for CASECHARS and NOT-CASECHARS, but whatever it was, it would not work effectively due to feeding the entire line. The most obvious practical use for them(being able to spellcheck languages with completely different alphabets without the spellchecker misfiring on either pass) would not work either. The ideal pratical fix for this should spellcheck such lines word by word. -- Best Regards, Nikolay Kudryavtsev From debbugs-submit-bounces@debbugs.gnu.org Mon Aug 17 08:48:53 2020 Received: (at 23097) by debbugs.gnu.org; 17 Aug 2020 12:48:54 +0000 Received: from localhost ([127.0.0.1]:59564 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1k7eZJ-0004Py-Lc for submit@debbugs.gnu.org; Mon, 17 Aug 2020 08:48:53 -0400 Received: from mail-yb1-f170.google.com ([209.85.219.170]:41442) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1k7eZD-0004PY-CW for 23097@debbugs.gnu.org; Mon, 17 Aug 2020 08:48:51 -0400 Received: by mail-yb1-f170.google.com with SMTP id s195so9345288ybc.8 for <23097@debbugs.gnu.org>; Mon, 17 Aug 2020 05:48:47 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:in-reply-to:references:mime-version:date :message-id:subject:to:cc; bh=ACME72fPK2EKcesGbczXN1zjq/EexXwABJxk24+A0GA=; b=obnfZmz/+eV5DIitnq1CRLz157NHO6xwyU9EK9+7I7uu3Zidk2gfgfA4TSAcWa4HqX x8knon0SJu463glUdTUxOIkpjfVSpa24E4m+Bu4Kkd4boQM7DwCOxOW7mw58YrJuEl4i EmXijva3CwFcfrnvXlxzXzrEg+rQygOZxxlkNGK9aHRxoPeCNtcBlQduu2ERGCAF/Ukf PEliclltOFohe/ksWQ0wuvJbCuKD5sWbKroCJGVi+/ff5rIPhC43aha8tgAUVqdYR7gA o5BKhbxmvYjua58E/6vUE/95OS/t/1IpIruCT9aGi5vbq9foPgDTnePDHUbme/lygDwX tZRA== X-Gm-Message-State: AOAM532W0YQkzgoXSESt6OnEFPEV4UaA6IV73uQ+SsmjJzaEUroDEHpC tN87NdLcyasUY3w7dHdE7SLIOeGDFRQfMfX+bGk= X-Google-Smtp-Source: ABdhPJyIpb5W0/O4RSWX9VsUEs8sWGfB775vtqSZOU8jEA5t1HtRfYRvgeyogOUFw0A/+OAWyV03Gy7InBgbNT+cfdo= X-Received: by 2002:a25:7007:: with SMTP id l7mr19811152ybc.85.1597668521838; Mon, 17 Aug 2020 05:48:41 -0700 (PDT) Received: from 753933720722 named unknown by gmailapi.google.com with HTTPREST; Mon, 17 Aug 2020 12:48:41 +0000 From: Stefan Kangas In-Reply-To: <5fe8e18f-efb4-4f9b-fd85-0cb4eccc58b4@gmail.com> References: <56F2DC47.2090600@gmail.com> <83fuvh2gwd.fsf@gnu.org> <83bljbkhrh.fsf@gnu.org> <5fe8e18f-efb4-4f9b-fd85-0cb4eccc58b4@gmail.com> MIME-Version: 1.0 Date: Mon, 17 Aug 2020 12:48:41 +0000 Message-ID: Subject: Re: bug#23097: 24.5; ispell.el: lines with both CASECHARS and NOT-CASECHARS get sent to the spell checker To: Nikolay Kudryavtsev , Eli Zaretskii Content-Type: text/plain; charset="UTF-8" X-Spam-Score: 0.5 (/) X-Debbugs-Envelope-To: 23097 Cc: 23097@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.5 (/) Nikolay Kudryavtsev writes: > This is not an external software bug, but very much an Emacs bug. > > I'm not sure what was the initial design idea for CASECHARS and > NOT-CASECHARS, but whatever it was, it would not work effectively due to > feeding the entire line. The most obvious practical use for them(being > able to spellcheck languages with completely different alphabets without > the spellchecker misfiring on either pass) would not work either. > > The ideal pratical fix for this should spellcheck such lines word by word. Okay, but that's not a documented use-case, so I'm not sure that it's a bug. The thing you suggest may be possible, but would require developing a new feature, for example to run two instances of the same spell checker at once. AFAIU, the best solution is to use an external spell checker that has support for using two languages at once. Why not use that? Best regards, Stefan Kangas From debbugs-submit-bounces@debbugs.gnu.org Mon Aug 17 12:41:22 2020 Received: (at 23097) by debbugs.gnu.org; 17 Aug 2020 16:41:22 +0000 Received: from localhost ([127.0.0.1]:33068 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1k7iCI-0000ES-9d for submit@debbugs.gnu.org; Mon, 17 Aug 2020 12:41:22 -0400 Received: from eggs.gnu.org ([209.51.188.92]:38358) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1k7iCG-0000EE-3F for 23097@debbugs.gnu.org; Mon, 17 Aug 2020 12:41:20 -0400 Received: from fencepost.gnu.org ([2001:470:142:3::e]:60371) by eggs.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1k7iCA-00087N-OH; Mon, 17 Aug 2020 12:41:14 -0400 Received: from [176.228.60.248] (port=2323 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1k7iCA-0000NX-1v; Mon, 17 Aug 2020 12:41:14 -0400 Date: Mon, 17 Aug 2020 19:40:58 +0300 Message-Id: <83eeo5gr8l.fsf@gnu.org> From: Eli Zaretskii To: Nikolay Kudryavtsev In-Reply-To: <5fe8e18f-efb4-4f9b-fd85-0cb4eccc58b4@gmail.com> (message from Nikolay Kudryavtsev on Mon, 17 Aug 2020 12:20:08 +0300) Subject: Re: bug#23097: 24.5; ispell.el: lines with both CASECHARS and NOT-CASECHARS get sent to the spell checker References: <56F2DC47.2090600@gmail.com> <83fuvh2gwd.fsf@gnu.org> <83bljbkhrh.fsf@gnu.org> <5fe8e18f-efb4-4f9b-fd85-0cb4eccc58b4@gmail.com> MIME-version: 1.0 Content-type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: 23097 Cc: 23097@debbugs.gnu.org, stefan@marxist.se X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) > From: Nikolay Kudryavtsev > Cc: 23097@debbugs.gnu.org > Date: Mon, 17 Aug 2020 12:20:08 +0300 > > This is not an external software bug, but very much an Emacs bug. > > I'm not sure what was the initial design idea for CASECHARS and > NOT-CASECHARS, but whatever it was, it would not work effectively due to > feeding the entire line. The most obvious practical use for them(being > able to spellcheck languages with completely different alphabets without > the spellchecker misfiring on either pass) would not work either. The original design was that a spell-checker supports a single language, and any text in other languages is a spelling mistake. This is still true for Ispell and for Aspell; only Hunspell (and Enchant, when it uses Hunspell as its back-end) supports multiple languages. With Hunspell, ispell.el effectively ignores CASECHARS and NOT-CASECHARS, and instead uses the character set specified by the dictionary file itself. This is the only multi-dictionary spell-checking configuration that ispell.el currently supports. Which is why, when you first reported this, I asked you why you couldn't use Hunspell; your answer, which described some kind of failure related to encoding, I couldn't understand then and I don't understand now (primarily because that feature works for me). Instead, you seem to insist on using Aspell in a way that to me sounds like a kludge: spell-check the region with one dictionary, then restart ispell.el with another dictionary and spell-check the same region again. AFAIU, you'd like ispell.el to support this kind of workaround OOTB. Is that correct, or did I miss something? If my understanding is correct, then, apart of being a kludgey solution for a problem that has a much cleaner one, I don't think I understand how this could work well in general. Suppose you have in your buffer a mis-spelled word such as this: fooЫbar with the Cyrillic letter being there by accident: perhaps you unintentionally pressed a key when you shouldn't have. Or imagine the following typo: fooбар which could happen if you forgot to switch the input method. With your proposed mode of operation, the spell-checker will check partial words and decide that in both cases there's no spelling mistakes here, because each partial word is spelled correctly in its language. But clearly these are typos that need to be flagged. Thus, just using 2 sets of characters is not enough to handle these typos intelligently, as you'd get a lot of false negatives. So even if we consider your report as a feature request, it is not entirely clear to me how to implement such a feature. And frankly, since at least one spell-checker exists which supports multiple dictionaries, it is not clear to me why we should try so hard forcing Aspell look as if it did, too. > The ideal pratical fix for this should spellcheck such lines word by word. I think I show above why such simplistic strategy will backfire by leaving some typos undetected. From unknown Sun Jun 22 17:12:01 2025 Received: (at fakecontrol) by fakecontrolmessage; To: internal_control@debbugs.gnu.org From: Debbugs Internal Request Subject: Internal Control Message-Id: bug archived. Date: Tue, 15 Sep 2020 11:24:08 +0000 User-Agent: Fakemail v42.6.9 # This is a fake control message. # # The action: # bug archived. thanks # This fakemail brought to you by your local debbugs # administrator From debbugs-submit-bounces@debbugs.gnu.org Tue Oct 13 12:51:59 2020 Received: (at control) by debbugs.gnu.org; 13 Oct 2020 16:51:59 +0000 Received: from localhost ([127.0.0.1]:48637 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kSNWp-0003D3-C6 for submit@debbugs.gnu.org; Tue, 13 Oct 2020 12:51:59 -0400 Received: from eggs.gnu.org ([209.51.188.92]:50606) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kSNWm-0003Cm-Qd; Tue, 13 Oct 2020 12:51:57 -0400 Received: from fencepost.gnu.org ([2001:470:142:3::e]:38931) by eggs.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1kSNWh-0005qP-55; Tue, 13 Oct 2020 12:51:51 -0400 Received: from [176.228.60.248] (port=2848 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1kSNWg-00038o-Kq; Tue, 13 Oct 2020 12:51:51 -0400 Date: Tue, 13 Oct 2020 19:51:59 +0300 Message-Id: <83mu0q132o.fsf@gnu.org> From: Eli Zaretskii To: Nikolay Kudryavtsev In-Reply-To: <08f3ac29-761c-ced7-1e2f-0f643512b986@gmail.com> (message from Nikolay Kudryavtsev on Tue, 13 Oct 2020 16:19:10 +0300) Subject: Re: bug#23097: 24.5; ispell.el: lines with both CASECHARS and NOT-CASECHARS get sent to the spell checker References: <56F2DC47.2090600@gmail.com> <83fuvh2gwd.fsf@gnu.org> <83bljbkhrh.fsf@gnu.org> <5fe8e18f-efb4-4f9b-fd85-0cb4eccc58b4@gmail.com> <83eeo5gr8l.fsf@gnu.org> <08f3ac29-761c-ced7-1e2f-0f643512b986@gmail.com> X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: control Cc: 23097@debbugs.gnu.org, stefan@marxist.se X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) unarchive 23097 thanks > From: Nikolay Kudryavtsev > Cc: stefan@marxist.se, 23097@debbugs.gnu.org > Date: Tue, 13 Oct 2020 16:19:10 +0300 > > Anyway, Hunspell IMHO is sort of besides the point for this discussion. > This bug is about ispell.el not performing in a way a user would > realistically expect from its public facing API. Which expectations from the public API are being violated here? From debbugs-submit-bounces@debbugs.gnu.org Tue Oct 13 13:00:31 2020 Received: (at 23097) by debbugs.gnu.org; 13 Oct 2020 17:00:31 +0000 Received: from localhost ([127.0.0.1]:48663 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kSNf4-0003vK-Mj for submit@debbugs.gnu.org; Tue, 13 Oct 2020 13:00:30 -0400 Received: from eggs.gnu.org ([209.51.188.92]:52498) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kSNf2-0003nt-G4 for 23097@debbugs.gnu.org; Tue, 13 Oct 2020 13:00:28 -0400 Received: from fencepost.gnu.org ([2001:470:142:3::e]:39129) by eggs.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1kSNew-0006wK-Oq; Tue, 13 Oct 2020 13:00:23 -0400 Received: from [176.228.60.248] (port=3372 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1kSNew-0003zv-1T; Tue, 13 Oct 2020 13:00:22 -0400 Date: Tue, 13 Oct 2020 20:00:31 +0300 Message-Id: <83k0vu12og.fsf@gnu.org> From: Eli Zaretskii To: Nikolay Kudryavtsev In-Reply-To: <08f3ac29-761c-ced7-1e2f-0f643512b986@gmail.com> (message from Nikolay Kudryavtsev on Tue, 13 Oct 2020 16:19:10 +0300) Subject: Re: bug#23097: 24.5; ispell.el: lines with both CASECHARS and NOT-CASECHARS get sent to the spell checker References: <56F2DC47.2090600@gmail.com> <83fuvh2gwd.fsf@gnu.org> <83bljbkhrh.fsf@gnu.org> <5fe8e18f-efb4-4f9b-fd85-0cb4eccc58b4@gmail.com> <83eeo5gr8l.fsf@gnu.org> <08f3ac29-761c-ced7-1e2f-0f643512b986@gmail.com> X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: 23097 Cc: 23097@debbugs.gnu.org, stefan@marxist.se X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) > From: Nikolay Kudryavtsev > Cc: stefan@marxist.se, 23097@debbugs.gnu.org > Date: Tue, 13 Oct 2020 16:19:10 +0300 > > Anyway, Hunspell IMHO is sort of besides the point for this discussion. > This bug is about ispell.el not performing in a way a user would > realistically expect from its public facing API. Which expectations from what public API are being violated here? From debbugs-submit-bounces@debbugs.gnu.org Wed Oct 14 15:20:18 2020 Received: (at 23097) by debbugs.gnu.org; 14 Oct 2020 19:20:18 +0000 Received: from localhost ([127.0.0.1]:53018 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kSmJu-0003oE-DA for submit@debbugs.gnu.org; Wed, 14 Oct 2020 15:20:18 -0400 Received: from mail-lj1-f172.google.com ([209.85.208.172]:45605) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kSmJt-0003o1-5P for 23097@debbugs.gnu.org; Wed, 14 Oct 2020 15:20:17 -0400 Received: by mail-lj1-f172.google.com with SMTP id a4so579523lji.12 for <23097@debbugs.gnu.org>; Wed, 14 Oct 2020 12:20:17 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:subject:to:cc:references:message-id:date:user-agent :mime-version:in-reply-to:content-transfer-encoding:content-language; bh=UNMJE3pqezUDdlqlMllO4ke0xor7x7DDHEFCktkfvlE=; b=tRk3JOkjrH9VhjkIdgdG2oL6haO4/J5+wgWRHyjNFAxmaXsc8HorwsB9KK7SnJWcwf PV/kZzMA7J3zTLQH2LLxkoee2lcrZAgsh2VP4+ihnLc8zLdUo5x32UUABQWSzwBEtCRn uJoDYRgvUUUr6JL04PgOXjkhcp/vodTtbP31RgdR6oMb6xrD6kv5YEUbP8Brjw+/mALP P9LMndC/j3NPiI2cb1RIq1PgWxaAN1ylDTaN9ljhhtzUOJZ2bV2kialG4fZwVW7Cael3 53vi4LvbvOM+GxRdZ5aoilBow/HnZgemwnM4Ii3gTJ/JpvRQ1FaoF+qZ2iv1TP1uXaox 65tQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:subject:to:cc:references:message-id:date :user-agent:mime-version:in-reply-to:content-transfer-encoding :content-language; bh=UNMJE3pqezUDdlqlMllO4ke0xor7x7DDHEFCktkfvlE=; b=Pc0FaH5n5ZuJPH5Mk95J58gjtpb5r9PeTWKMcNM42xY2+dUR1KmyCK4xGbr3kh0FfL lWeApmp0wp4Oy+EpV+cM/hRP8/Hpw6Oua/t04L37ckFn06G38lWKWL448bWWbpLM0eVA GSn4wXTOJaWiUndD2IG4PqMSo9RBYfDsvb9VkF87Jra+gYLMK2u5cWiy0XRYF+KrnzUB PHRh2tXNNN31+Q/n8mSV8emnBaZIFwXs0kcaUQlxYw79Asz+WQzM64hWX0DMHgJZQ2UD Eh3msjBR9lmnOUHMgTckgkwi+XGgH4Nq9AfynQUGFZqNfW+69tYn7oXROiVF0yInkJR3 7Jdw== X-Gm-Message-State: AOAM532Hq90yL+CRfvc0ZWjA1TyDY5lvZrGPk5nLhJg9luO76i2aiMva UvsPIXj18ypadJY2EID2nXS4H+v0VgQOqA== X-Google-Smtp-Source: ABdhPJy5YQBnylpOTI3Cxl6f90D3EsetvDYJ3M6+RzO2T9x1nE63BRR/MrbKYYkjkY26XjTw5dah1w== X-Received: by 2002:a2e:2e1a:: with SMTP id u26mr56263lju.295.1602703210438; Wed, 14 Oct 2020 12:20:10 -0700 (PDT) Received: from [192.168.1.72] (128-69-156-0.broadband.corbina.ru. [128.69.156.0]) by smtp.gmail.com with ESMTPSA id i124sm72160lfd.236.2020.10.14.12.20.09 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Wed, 14 Oct 2020 12:20:09 -0700 (PDT) From: Nikolay Kudryavtsev X-Google-Original-From: Nikolay Kudryavtsev Subject: Re: bug#23097: 24.5; ispell.el: lines with both CASECHARS and NOT-CASECHARS get sent to the spell checker To: Eli Zaretskii References: <56F2DC47.2090600@gmail.com> <83fuvh2gwd.fsf@gnu.org> <83bljbkhrh.fsf@gnu.org> <5fe8e18f-efb4-4f9b-fd85-0cb4eccc58b4@gmail.com> <83eeo5gr8l.fsf@gnu.org> <08f3ac29-761c-ced7-1e2f-0f643512b986@gmail.com> <83mu0q132o.fsf@gnu.org> Message-ID: <32dfc219-ac9c-8d9a-3140-d23977c8836c@gmail.com> Date: Wed, 14 Oct 2020 22:20:08 +0300 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:68.0) Gecko/20100101 Thunderbird/68.12.1 MIME-Version: 1.0 In-Reply-To: <83mu0q132o.fsf@gnu.org> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Content-Language: en-US X-Spam-Score: -0.0 (/) X-Debbugs-Envelope-To: 23097 Cc: 23097@debbugs.gnu.org, stefan@marxist.se X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) The whole ispell-dictionary-alist structure implies that matching would be done word by word. And looking into the dictionary setup is the first thing ispell.el user would do. Apart from NOT-CASECHARS it also has this element: > OTHERCHARS is a regexp of characters in the NOT-CASECHARS set but > which can be > used to construct words in some special way.  If OTHERCHARS characters > follow > and precede characters from CASECHARS, they are parsed as part of a word, > otherwise they become word-breaks... Basically presence of both NOT-CASECHARS and OTHERCHARS implies that ispell.el does strict word by word matching. If we're just sending any line that contains a CASECHARS match, we don't really need either of them, since we can just match by CASECHARS alone and then send the line. Oh, and there's another thing. Ispell.el actually does word by word search, but only on resume. Try my recipe again, just make the last line of spellchecked buffer to look like "doh kat". Then suspend the spellcheck after the first line and resume it with C-u M-$. You'd see that it skips the last line "doh" fine in this scenario. But then it suffers from the word mix problem described by Eli: spellchecking dohkat" and "katdoh" results in kat alone being sent. Thinking a bit more about this word mix problem, seems like it's not as simple to fix it as I thought in my previous letter, since we need some list of legitimate word separators for each language. -- Best Regards, Nikolay Kudryavtsev From unknown Sun Jun 22 17:12:01 2025 Received: (at fakecontrol) by fakecontrolmessage; To: internal_control@debbugs.gnu.org From: Debbugs Internal Request Subject: Internal Control Message-Id: bug archived. Date: Thu, 12 Nov 2020 12:24:09 +0000 User-Agent: Fakemail v42.6.9 # This is a fake control message. # # The action: # bug archived. thanks # This fakemail brought to you by your local debbugs # administrator