From debbugs-submit-bounces@debbugs.gnu.org Sat Jul 30 14:06:26 2022
From: André A. Gomes
To: bug-gnu-emacs@gnu.org
Subject: [PATCH] Refactor repunctuate-sentences to accommodate corner case.
Date: Sat, 30 Jul 2022 21:06:16 +0300
Message-ID: <87r122eaqv.fsf@gmail.com>
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary="=-=-="

--=-=-=
Content-Type: text/plain

Tags: patch

Hi Emacs,

Please find the patch below.

In GNU Emacs 29.0.50 (build 1, x86_64-pc-linux-gnu, GTK+ Version 3.24.30, cairo version 1.16.0)
Windowing system distributor 'The X.Org Foundation', version 11.0.12101004
System Description: Guix System

Configured using:
 'configure
 CONFIG_SHELL=/gnu/store/4y5m9lb8k3qkb1y9m02sw9w9a6hacd16-bash-minimal-5.1.8/bin/bash
 SHELL=/gnu/store/4y5m9lb8k3qkb1y9m02sw9w9a6hacd16-bash-minimal-5.1.8/bin/bash
 --prefix=/gnu/store/7a6fnkqrxb0chmvj63f7ddr6wg3pq9g5-emacs-next-29.0.50-1.0a5477b
 --enable-fast-install --with-modules --with-cairo
 --disable-build-details'

--=-=-=
Content-Type: text/patch
Content-Disposition: attachment; filename=0001-Refactor-repunctuate-sentences-to-accommodate-corner.patch

>From c57f51b7bfec3e3e5c9c2f680d7936c3e546bb28 Mon Sep 17 00:00:00 2001
From: André A. Gomes
Date: Sat, 30 Jul 2022 21:01:38 +0300
Subject: [PATCH] Refactor repunctuate-sentences to accommodate corner case.

It now gracefully handles the case when abbreviations such as e.g. or
i.e. are used in sentences.
---
 lisp/textmodes/paragraphs.el            | 32 +++++++++++++------------
 test/lisp/textmodes/paragraphs-tests.el |  5 ++--
 2 files changed, 20 insertions(+), 17 deletions(-)

diff --git a/lisp/textmodes/paragraphs.el b/lisp/textmodes/paragraphs.el
index cd726ad4776..89624b66318 100644
--- a/lisp/textmodes/paragraphs.el
+++ b/lisp/textmodes/paragraphs.el
@@ -506,29 +506,31 @@ It is advised to use `add-function' on this to add more filters,
 for example, `(looking-back (rx (or \"e.g.\" \"i.e.\") \" \") 5)'
 with a set of predefined abbreviations to skip from adding two spaces.")
 
-(defun repunctuate-sentences (&optional no-query start end)
-  "Put two spaces at the end of sentences from point to the end of buffer.
-It works using `query-replace-regexp'.  In Transient Mark mode,
-if the mark is active, operate on the contents of the region.
-Second and third arg START and END specify the region to operate on.
-If optional argument NO-QUERY is non-nil, make changes without asking
-for confirmation.  You can use `repunctuate-sentences-filter' to add
-filters to skip occurrences of spaces that don't need to be replaced."
-  (interactive (list nil
-                     (if (use-region-p) (region-beginning))
-                     (if (use-region-p) (region-end))))
-  (let ((regexp "\\([]\"')]?\\)\\([.?!]\\)\\([]\"')]?\\) +")
-        (to-string "\\1\\2\\3  "))
+(defun repunctuate-sentences (&optional no-query)
+  "Put two spaces at the end of sentences.
+
+In Transient Mark mode, if the mark is active, operate on the
+contents of the region.  If optional argument NO-QUERY is
+non-nil, make changes without asking for confirmation.
+
+Use `repunctuate-sentences-filter' to add filters to skip
+occurrences of spaces that don't need to be replaced."
+  (interactive "P")
+  (let ((beg (if (use-region-p) (region-beginning) (point-min)))
+        (end (if (use-region-p) (region-end) (point-max)))
+        (case-fold-search nil)
+        (regexp "\\([]\"')]?\\)\\([.?!]\\)\\([]\"')]?\\) +\\([\"')[:upper:]]\\)")
+        (to-string "\\1\\2\\3  \\4"))
     (if no-query
         (progn
-          (when start (goto-char start))
+          (goto-char beg)
           (while (re-search-forward regexp end t)
             (replace-match to-string)))
       (unwind-protect
           (progn
             (add-function :after-while isearch-filter-predicate
                           repunctuate-sentences-filter)
-            (query-replace-regexp regexp to-string nil start end))
+            (query-replace-regexp regexp to-string nil beg end))
         (remove-function isearch-filter-predicate
                          repunctuate-sentences-filter)))))
 
diff --git a/test/lisp/textmodes/paragraphs-tests.el b/test/lisp/textmodes/paragraphs-tests.el
index e54b459b20e..53735b4bf4b 100644
--- a/test/lisp/textmodes/paragraphs-tests.el
+++ b/test/lisp/textmodes/paragraphs-tests.el
@@ -101,10 +101,11 @@
 
 (ert-deftest paragraphs-tests-repunctuate-sentences ()
   (with-temp-buffer
-    (insert "Just. Some. Sentences.")
+    (insert "Just. Some. Sentences. Yet another, e.g. this one.")
     (goto-char (point-min))
     (repunctuate-sentences t)
-    (should (equal (buffer-string) "Just.  Some.  Sentences."))))
+    (should (equal (buffer-string)
+                   "Just.  Some.  Sentences.  Yet another, e.g. this one."))))
 
 (ert-deftest paragraphs-tests-backward-sentence ()
   (with-temp-buffer
-- 
2.37.1

--=-=-=
Content-Type: text/plain; charset=utf-8

-- 
André A. Gomes
"You cannot even find the ruins..."
--=-=-=--

From debbugs-submit-bounces@debbugs.gnu.org Sun Jul 31 04:34:20 2022
From: Lars Ingebrigtsen
To: André A. Gomes
Cc: 56844@debbugs.gnu.org
Subject: Re: bug#56844: [PATCH] Refactor repunctuate-sentences to accommodate corner case.
In-Reply-To: <87r122eaqv.fsf@gmail.com> ("André A. Gomes"'s message of "Sat, 30 Jul 2022 21:06:16 +0300")
References: <87r122eaqv.fsf@gmail.com>
Date: Sun, 31 Jul 2022 10:34:09 +0200
Message-ID: <87a68p4r5q.fsf@gnus.org>

André A. Gomes writes:

> It now gracefully handles the case when abbreviations such as e.g. or
> i.e. are used in sentences.

[...]

> +        (regexp "\\([]\"')]?\\)\\([.?!]\\)\\([]\"')]?\\) +\\([\"')[:upper:]]\\)")

I'm not quite sure I understand this patch. Are you changing this to
only consider punctuation that's followed by an upper-case character to
be sentence-end punctuation?
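
The difference the question is about can be seen by running both regexps over the same text. The following is only a rough sketch: `my/repunctuate`, `my/old-regexp` and `my/new-regexp` are hypothetical names introduced for illustration, and the helper merely imitates the NO-QUERY loop from the patch rather than calling `repunctuate-sentences` itself.

```elisp
;; Illustrative sketch only: `my/repunctuate' is a hypothetical helper,
;; not part of Emacs.  It imitates the NO-QUERY loop from the patch.
(defun my/repunctuate (string regexp to-string)
  "Return STRING with each match of REGEXP replaced by TO-STRING."
  (with-temp-buffer
    (insert string)
    (goto-char (point-min))
    ;; As in the patch: with `case-fold-search' nil, [:upper:] matches
    ;; upper-case characters only.
    (let ((case-fold-search nil))
      (while (re-search-forward regexp nil t)
        (replace-match to-string)))
    (buffer-string)))

;; Regexp before the patch: punctuation followed by spaces.
(defconst my/old-regexp
  "\\([]\"')]?\\)\\([.?!]\\)\\([]\"')]?\\) +")

;; Regexp from the patch: additionally requires a quote, a closing
;; parenthesis or an upper-case character after the spaces.
(defconst my/new-regexp
  "\\([]\"')]?\\)\\([.?!]\\)\\([]\"')]?\\) +\\([\"')[:upper:]]\\)")

(my/repunctuate "Some. Text, e.g. this." my/old-regexp "\\1\\2\\3  ")
;; => "Some.  Text, e.g.  this."

(my/repunctuate "Some. Text, e.g. this." my/new-regexp "\\1\\2\\3  \\4")
;; => "Some.  Text, e.g. this."
```

With the old regexp the space after "e.g." is doubled as well; with the new one only punctuation followed by an upper-case character (or a quote or closing parenthesis) is treated as a sentence end.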
From debbugs-submit-bounces@debbugs.gnu.org Sun Jul 31 16:11:57 2022
From: Juri Linkov
To: Lars Ingebrigtsen
Cc: André A. Gomes, 56844@debbugs.gnu.org
Subject: Re: bug#56844: [PATCH] Refactor repunctuate-sentences to accommodate corner case.
In-Reply-To: <87a68p4r5q.fsf@gnus.org> (Lars Ingebrigtsen's message of "Sun, 31 Jul 2022 10:34:09 +0200")
References: <87r122eaqv.fsf@gmail.com> <87a68p4r5q.fsf@gnus.org>
Date: Sun, 31 Jul 2022 22:49:33 +0300
Message-ID: <865yjd5ax6.fsf@mail.linkov.net>

>> It now gracefully handles the case when abbreviations such as e.g. or
>> i.e. are used in sentences.
>
> [...]
>
>> +        (regexp "\\([]\"')]?\\)\\([.?!]\\)\\([]\"')]?\\) +\\([\"')[:upper:]]\\)")
>
> I'm not quite sure I understand this patch. Are you changing this to
> only consider punctuation that's followed by an upper-case character to
> be sentence-end punctuation?

It would be better to add such heuristics to repunctuate-sentences-filter,
so anyone could customize it.

From debbugs-submit-bounces@debbugs.gnu.org Tue Aug 02 06:54:58 2022
From: Lars Ingebrigtsen
To: control@debbugs.gnu.org
Subject: control message for bug #56844
Date: Tue, 02 Aug 2022 12:54:48 +0200
Message-Id: <874jyuzzif.fsf@gnus.org>

tags 56844 + moreinfo
quit

From debbugs-submit-bounces@debbugs.gnu.org Tue Aug 02 07:41:47 2022
From: André A. Gomes
To: Lars Ingebrigtsen
Cc: 56844@debbugs.gnu.org
Subject: Re: bug#56844: [PATCH] Refactor repunctuate-sentences to accommodate corner case.
References: <87r122eaqv.fsf@gmail.com> <87a68p4r5q.fsf@gnus.org>
Date: Tue, 02 Aug 2022 14:41:32 +0300
In-Reply-To: <87a68p4r5q.fsf@gnus.org> (Lars Ingebrigtsen's message of "Sun, 31 Jul 2022 10:34:09 +0200")
Message-ID: <877d3qzxcj.fsf@gmail.com>

Lars Ingebrigtsen writes:

>> +        (regexp "\\([]\"')]?\\)\\([.?!]\\)\\([]\"')]?\\) +\\([\"')[:upper:]]\\)")
>
> I'm not quite sure I understand this patch. Are you changing this to
> only consider punctuation that's followed by an upper-case character to
> be sentence-end punctuation?

Yes. The patch section relative to testing is illustrative:

--8<---------------cut here---------------start------------->8---
 (ert-deftest paragraphs-tests-repunctuate-sentences ()
   (with-temp-buffer
-    (insert "Just. Some. Sentences.")
+    (insert "Just. Some. Sentences. Yet another, e.g. this one.")
     (goto-char (point-min))
     (repunctuate-sentences t)
-    (should (equal (buffer-string) "Just.  Some.  Sentences."))))
+    (should (equal (buffer-string)
+                   "Just.  Some.  Sentences.  Yet another, e.g. this one."))))
--8<---------------cut here---------------end--------------->8---

Thanks.

-- 
André A. Gomes
"You cannot even find the ruins..."

From debbugs-submit-bounces@debbugs.gnu.org Tue Aug 02 07:43:51 2022
From: André A. Gomes
To: Juri Linkov
Cc: Lars Ingebrigtsen, 56844@debbugs.gnu.org
Subject: Re: bug#56844: [PATCH] Refactor repunctuate-sentences to accommodate corner case.
References: <87r122eaqv.fsf@gmail.com> <87a68p4r5q.fsf@gnus.org> <865yjd5ax6.fsf@mail.linkov.net>
Date: Tue, 02 Aug 2022 14:43:42 +0300
In-Reply-To: <865yjd5ax6.fsf@mail.linkov.net> (Juri Linkov's message of "Sun, 31 Jul 2022 22:49:33 +0300")
Message-ID: <8735eezx8x.fsf@gmail.com>

Juri Linkov writes:

>>> It now gracefully handles the case when abbreviations such as e.g. or
>>> i.e. are used in sentences.
>>
>> [...]
>>
>>> +        (regexp "\\([]\"')]?\\)\\([.?!]\\)\\([]\"')]?\\) +\\([\"')[:upper:]]\\)")
>>
>> I'm not quite sure I understand this patch. Are you changing this to
>> only consider punctuation that's followed by an upper-case character to
>> be sentence-end punctuation?
>
> It would be better to add such heuristics to repunctuate-sentences-filter,
> so anyone could customize it.

In general I'd agree with you, but this patch is actually fixing a bug,
not introducing a personal preference. That's how I see it at least.

-- 
André A. Gomes
"You cannot even find the ruins..."

From debbugs-submit-bounces@debbugs.gnu.org Tue Aug 02 07:45:20 2022
From: Lars Ingebrigtsen
To: André A. Gomes
Cc: 56844@debbugs.gnu.org
Subject: Re: bug#56844: [PATCH] Refactor repunctuate-sentences to accommodate corner case.
In-Reply-To: <877d3qzxcj.fsf@gmail.com> ("André A. Gomes"'s message of "Tue, 02 Aug 2022 14:41:32 +0300")
References: <87r122eaqv.fsf@gmail.com> <87a68p4r5q.fsf@gnus.org> <877d3qzxcj.fsf@gmail.com>
Date: Tue, 02 Aug 2022 13:45:05 +0200
Message-ID: <87zggmvpha.fsf@gnus.org>

André A. Gomes writes:

>> I'm not quite sure I understand this patch. Are you changing this to
>> only consider punctuation that's followed by an upper-case character to
>> be sentence-end punctuation?
>
> Yes.

I don't think that'll be a generally welcome change -- some people write
using non-standard orthography. If this change is to be made, it has to
be optional.
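
One way the upper-case requirement could be made optional is sketched below. This is an assumption-laden illustration, not anything proposed in the thread: the user option `my/repunctuate-require-capital` and the function `my/repunctuate-buffer` are hypothetical names, and the loop again only imitates the NO-QUERY branch of the patch.

```elisp
;; Hypothetical user option; nothing with this name exists in Emacs.
(defcustom my/repunctuate-require-capital nil
  "Non-nil means a sentence end must be followed by an upper-case character.
When nil, any punctuation followed by spaces counts as a sentence end,
as with the regexp used before the patch."
  :type 'boolean
  :group 'paragraphs)

;; Hypothetical helper imitating the NO-QUERY branch of the patch.
(defun my/repunctuate-buffer ()
  "Put two spaces after sentence ends in the current buffer, without querying."
  (let* ((case-fold-search nil)   ; make [:upper:] match upper case only
         (base "\\([]\"')]?\\)\\([.?!]\\)\\([]\"')]?\\) +")
         (regexp (if my/repunctuate-require-capital
                     (concat base "\\([\"')[:upper:]]\\)")
                   base))
         (to-string (if my/repunctuate-require-capital
                        "\\1\\2\\3  \\4"
                      "\\1\\2\\3  ")))
    (save-excursion
      (goto-char (point-min))
      (while (re-search-forward regexp nil t)
        (replace-match to-string)))))
```

With the option left at nil, text such as "one. two. Three." has both sentence breaks widened; with it set to t, only the break before "Three" is, which is the behaviour writers of non-capitalized sentences would presumably want to opt out of.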
From debbugs-submit-bounces@debbugs.gnu.org Tue Aug 02 08:10:43 2022 Received: (at 56844) by debbugs.gnu.org; 2 Aug 2022 12:10:44 +0000 Received: from localhost ([127.0.0.1]:42731 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1oIqjT-0008Cg-Iy for submit@debbugs.gnu.org; Tue, 02 Aug 2022 08:10:43 -0400 Received: from mail-wr1-f50.google.com ([209.85.221.50]:38650) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1oIqjS-0008CR-BT for 56844@debbugs.gnu.org; Tue, 02 Aug 2022 08:10:42 -0400 Received: by mail-wr1-f50.google.com with SMTP id bv3so3739342wrb.5 for <56844@debbugs.gnu.org>; Tue, 02 Aug 2022 05:10:42 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=content-transfer-encoding:mime-version:message-id:date:references :in-reply-to:subject:cc:to:from:from:to:cc; bh=m0/ey4iC3zoLrgrUoBI1knd+52D+qFnTEmHuWn50UUY=; b=f+cfb74jAcWzG6qOPAT+zzKht6eaZEjsrYkgMwtOxiyShsBgZrTCLvjjPTPWfFUgxB wfD4ZGWVrg+lrBJFzaLN2LCIWT3CgULysJ/Z7SCxNDazpuvcdfjFakMU1CosqOsyyP5/ I+1gkEj9Bm2Dr8Hyo1K0f18FY7877gT9LIxEO0EZ5Wobo8MjBTKlny/EL39eeHtXBVCt k6CPj5kSypGn2p92gs4Nnl5pTbuCoiTaYn7C1la9Y/mbO4yDLbmzMbdMiLJxKY7JUqsH jkXDGUfr69nwjx9BG+9Ig2Dms20CQYw1RjODSbMCkfKiCUKWCfZhv6eup31bsvjwBmUn Ozmw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=content-transfer-encoding:mime-version:message-id:date:references :in-reply-to:subject:cc:to:from:x-gm-message-state:from:to:cc; bh=m0/ey4iC3zoLrgrUoBI1knd+52D+qFnTEmHuWn50UUY=; b=1AaglD9XXbfFAhJdsBoV8SOCcOBPnq0vcDI0wUa9Qus/FjA8Yu+OnTGTpxf2YK18dl gkFQwy3n03qUTe//E89wsmdfT0Y7qs6Z3j33HltqYsUvCYdHY7xw+xuIb3AN7eQOfRuY wT7+Kok0/ydn+S2MY1oavgO7xUCHUJGuiHA69F9D/yzyfFRIiQtSPegv38H5BO1iP97N zBkPMdKjVPdUHAldXT+EFWUE+nGQOP9MCUJAeurBGxXMg0YjG7gDnbGzx5Kn8SM5lHn9 mZ+DxDDEKPrxwOWwv1vQxJIILwz1eiaKG4luZNdYR/OkCvJEaye3JCdIF9fcwGko7xPI QCrg== X-Gm-Message-State: ACgBeo1E4Kvj/dnsqIP4Q3tzNBWcrymQeRcUxNz4MjjuoHEJ9ZG9gn/V 
OIXk6GoLEs4ayIdZNZIEet9t5jH5tfs= X-Google-Smtp-Source: AA6agR51kL59rzz4nZrrVK36YMqog1hjlAzWgBileseoQaRGmQttrApSuSr1d+jfK5Ur81YDffnnHQ== X-Received: by 2002:a5d:498e:0:b0:21f:b9a:6608 with SMTP id r14-20020a5d498e000000b0021f0b9a6608mr13324347wrq.498.1659442236026; Tue, 02 Aug 2022 05:10:36 -0700 (PDT) Received: from rltb ([2a01:e0a:3f3:fb50:6d96:2ef3:e3b8:ccb2]) by smtp.gmail.com with ESMTPSA id t13-20020adfe44d000000b0021e2fccea97sm15045334wrm.64.2022.08.02.05.10.35 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 02 Aug 2022 05:10:35 -0700 (PDT) From: Robert Pluim To: Lars Ingebrigtsen Subject: Re: bug#56844: [PATCH] Refactor repunctuate-sentences to accommodate corner case. In-Reply-To: <87zggmvpha.fsf@gnus.org> (Lars Ingebrigtsen's message of "Tue, 02 Aug 2022 13:45:05 +0200") References: <87r122eaqv.fsf@gmail.com> <87a68p4r5q.fsf@gnus.org> <877d3qzxcj.fsf@gmail.com> <87zggmvpha.fsf@gnus.org> Date: Tue, 02 Aug 2022 14:10:35 +0200 Message-ID: <877d3qj16s.fsf@gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 56844 Cc: "=?utf-8?Q?Andr=C3=A9?= A. Gomes" , 56844@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) >>>>> On Tue, 02 Aug 2022 13:45:05 +0200, Lars Ingebrigtsen said: Lars> Andr=C3=A9 A. Gomes writes: >>> I'm not quite sure I understand this patch. Are you changing this = to >>> only consider punctuation that's followed by an upper-case characte= r to >>> be sentence-end punctuation? >>=20 >> Yes. Lars> I don't think that'll be a generally welcome change -- some peopl= e write Lars> using non-standard orthography. If this change is to be made, it= has to Lars> be optional. 

It doesnʼt even have to be that non-standard. Consider.

    De deur sloeg open. de Valk stond in het licht.

Thatʼs pedantically incorrect, but there are many people (myself
included) who think that certain grammarians should keep quiet 😀

Robert

From debbugs-submit-bounces@debbugs.gnu.org Tue Aug 02 08:35:43 2022
From: Stefan Kangas
To: André A. Gomes, Lars Ingebrigtsen
Cc: 56844@debbugs.gnu.org
Subject: Re: bug#56844: [PATCH] Refactor repunctuate-sentences to accommodate corner case.
In-Reply-To: <877d3qzxcj.fsf@gmail.com>
References: <87r122eaqv.fsf@gmail.com> <87a68p4r5q.fsf@gnus.org> <877d3qzxcj.fsf@gmail.com>
Date: Tue, 2 Aug 2022 12:35:35 +0000

André A. Gomes writes:

>> I'm not quite sure I understand this patch.  Are you changing this to
>> only consider punctuation that's followed by an upper-case character to
>> be sentence-end punctuation?
>
> Yes.

FWIW, I would rather want to specify a list of ignored abbreviations
that I'd like to not consider ending a sentence.  This could include
standard US ones like "e.g.", "i.e.", etc. by default, but should be
customizable so I can add any localized equivalents.
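
Stefan's suggestion of a customizable list of ignored abbreviations can be sketched roughly as follows. This is a minimal Python illustration of the idea only: it is not Emacs Lisp and not how `repunctuate-sentences` is implemented, and the names `IGNORED_ABBREVIATIONS` and `repunctuate`, as well as the German "z.B." in the usage note, are assumptions made for the example.

```python
import re

# Hypothetical default list; the thread itself only names "e.g." and "i.e.".
IGNORED_ABBREVIATIONS = frozenset({"e.g.", "i.e."})

# Sentence-end punctuation followed by one or more spaces.
_PUNCT_THEN_SPACES = re.compile(r"[.?!] +")


def repunctuate(text, ignored=IGNORED_ABBREVIATIONS):
    """Put two spaces after sentence-end punctuation.

    Punctuation that closes one of the IGNORED abbreviations is left
    untouched.  Letter case is never inspected, so the rule behaves the
    same for caseless scripts and for lower-case sentence starts.
    """

    def fix(match):
        # Everything up to and including the punctuation character.
        before = text[: match.start() + 1]
        if any(before.endswith(abbrev) for abbrev in ignored):
            return match.group(0)  # keep the original spacing
        # Normalize whatever run of spaces followed to exactly two.
        return match.group(0)[0] + "  "

    return _PUNCT_THEN_SPACES.sub(fix, text)
```

With the default list, `repunctuate("Use a filter, e.g. a list. It works.")` keeps the single space after "e.g." and widens only the space after "list."; passing `ignored={"z.B."}` shows how a localized abbreviation could be added. Because the check never looks at letter case, a sentence that starts in lower case, as in Robert's Dutch example, is still treated as a new sentence.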

From debbugs-submit-bounces@debbugs.gnu.org Tue Aug 02 08:48:30 2022
From: Visuwesh
To: André A. Gomes
Cc: Lars Ingebrigtsen, 56844@debbugs.gnu.org, Juri Linkov
Subject: Re: bug#56844: [PATCH] Refactor repunctuate-sentences to accommodate corner case.
In-Reply-To: <8735eezx8x.fsf@gmail.com> (André A. Gomes's message of "Tue, 02 Aug 2022 14:43:42 +0300")
References: <87r122eaqv.fsf@gmail.com> <87a68p4r5q.fsf@gnus.org> <865yjd5ax6.fsf@mail.linkov.net> <8735eezx8x.fsf@gmail.com>
Date: Tue, 02 Aug 2022 18:18:18 +0530
Message-ID: <87czdi3j71.fsf@gmail.com>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/29.0.50 (gnu/linux)

[Tuesday August 02, 2022] André A. Gomes wrote:

> Juri Linkov <juri@linkov.net> writes:
>
>>>> It now gracefully handles the case when abbreviations such as e.g. or
>>>> i.e. are used in sentences.
>>>
>>> [...]
>>>
>>>> +        (regexp "\\([]\"')]?\\)\\([.?!]\\)\\([]\"')]?\\) +\\([\"')[:upper:]]\\)")
>>>
>>> I'm not quite sure I understand this patch.  Are you changing this to
>>> only consider punctuation that's followed by an upper-case character to
>>> be sentence-end punctuation?
>>
>> It would be better to add such heuristics to repunctuate-sentences-filter,
>> so anyone could customize it.
>
> In general I'd agree with you, but this patch is actually fixing a bug,
> not introducing a personal preference.  That's how I see it at least.

This breaks repunctuate-sentences for languages that don't have the
concept of upper and lower case characters.  Try repunctuate-sentences
with and without your patch for the following text,

தொழிற்சாலை யந்திரங்கள் தேவையான மட்டும் அந்தத் தொழிலாளர்களது சக்தியை உறிஞ்சித்
தீர்த்துவிடுவதோடு அந்த நாள் விழுங்கப்பட்டுவிடும். எந்தவிதமான எச்சமிச்சங்களும் இல்லாமல்
அன்றையப் பொழுது அழிந்து கழியும்; மனிதனும் தனது சவக்குழியை நோக்கி ஓரடி
முன்னேறிவிடுவான். ஆனால் இப்போதோ ஒய்வின்

From debbugs-submit-bounces@debbugs.gnu.org Tue Aug 02 16:00:54 2022
From: Juri Linkov
To: Stefan Kangas
Cc: André A. Gomes, Lars Ingebrigtsen, 56844@debbugs.gnu.org
Subject: Re: bug#56844: [PATCH] Refactor repunctuate-sentences to accommodate corner case.
In-Reply-To: (Stefan Kangas's message of "Tue, 2 Aug 2022 12:35:35 +0000")
Organization: LINKOV.NET
References: <87r122eaqv.fsf@gmail.com> <87a68p4r5q.fsf@gnus.org> <877d3qzxcj.fsf@gmail.com>
Date: Tue, 02 Aug 2022 22:59:21 +0300
Message-ID: <86pmhizaau.fsf@mail.linkov.net>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/29.0.50 (x86_64-pc-linux-gnu)

> FWIW, I would rather want to specify a list of ignored abbreviations
> that I'd like to not consider ending a sentence.  This could include
> standard US ones like "e.g.", "i.e.", etc. by default, but should be
> customizable so I can add any localized equivalents.

Please see an example in the docstring of the variable
'repunctuate-sentences-filter'.

From debbugs-submit-bounces@debbugs.gnu.org Thu Aug 04 09:58:00 2022
From: Stefan Kangas
To: control@debbugs.gnu.org
Subject: control message for bug #56844
Date: Thu, 4 Aug 2022 13:57:51 +0000

severity 56844 wishlist
quit

From debbugs-submit-bounces@debbugs.gnu.org Fri Sep 02 06:47:41 2022
From: Lars Ingebrigtsen
To: Juri Linkov
Subject: Re: bug#56844: [PATCH] Refactor repunctuate-sentences to accommodate corner case.
In-Reply-To: <86pmhizaau.fsf@mail.linkov.net> (Juri Linkov's message of "Tue, 02 Aug 2022 22:59:21 +0300")
References: <87r122eaqv.fsf@gmail.com> <87a68p4r5q.fsf@gnus.org> <877d3qzxcj.fsf@gmail.com> <86pmhizaau.fsf@mail.linkov.net>
Date: Fri, 02 Sep 2022 12:47:29 +0200
Message-ID: <87czcexdby.fsf@gnus.org>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/29.0.50 (gnu/linux)
Cc: André A. Gomes, 56844@debbugs.gnu.org, Stefan Kangas

Juri Linkov writes:

>> FWIW, I would rather want to specify a list of ignored abbreviations
>> that I'd like to not consider ending a sentence.  This could include
>> standard US ones like "e.g.", "i.e.", etc. by default, but should be
>> customizable so I can add any localized equivalents.
>
> Please see an example in the docstring of the variable
> 'repunctuate-sentences-filter'.

I think the conclusion here is that we don't want to change how
repunctuate-sentences work here, so I'm closing this bug report.

From debbugs-submit-bounces@debbugs.gnu.org Fri Sep 02 06:47:45 2022
From: Lars Ingebrigtsen
To: control@debbugs.gnu.org
Subject: control message for bug #56844
Date: Fri, 02 Sep 2022 12:47:35 +0200
Message-Id: <87bkryxdbs.fsf@gnus.org>

close 56844
quit

From unknown Sat Jun 21 12:30:37 2025
From: Debbugs Internal Request
To: internal_control@debbugs.gnu.org
Subject: Internal Control
Date: Fri, 30 Sep 2022 11:24:08 +0000
User-Agent: Fakemail v42.6.9

# This is a fake control message.
#
# The action:
# bug archived.
thanks
# This fakemail brought to you by your local debbugs
# administrator