From derick.eddington@gmail.com Mon Jan 12 12:38:30 2009 Received: (at submit) by emacsbugs.donarmstrong.com; 12 Jan 2009 20:38:30 +0000 X-Spam-Checker-Version: SpamAssassin 3.2.5-bugs.debian.org_2005_01_02 (2008-06-10) on rzlab.ucr.edu X-Spam-Level: X-Spam-Bayes: score:0.5 Bayes not run. spammytokens:Tokens not available. hammytokens:Tokens not available. X-Spam-Status: No, score=0.0 required=4.0 tests=none autolearn=ham version=3.2.5-bugs.debian.org_2005_01_02 Received: from lists.gnu.org (lists.gnu.org [199.232.76.165]) by rzlab.ucr.edu (8.13.8/8.13.8/Debian-3) with ESMTP id n0CKcR7L005066 for ; Mon, 12 Jan 2009 12:38:28 -0800 Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1LMTYI-00020w-Ey for bug-gnu-emacs@gnu.org; Mon, 12 Jan 2009 15:38:26 -0500 Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1LMTYE-0001zl-Lh for bug-gnu-emacs@gnu.org; Mon, 12 Jan 2009 15:38:24 -0500 Received: from [199.232.76.173] (port=48856 helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1LMTYE-0001zc-1i for bug-gnu-emacs@gnu.org; Mon, 12 Jan 2009 15:38:22 -0500 Received: from rv-out-0708.google.com ([209.85.198.242]:12632) by monty-python.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1LMTYD-0002Cm-HX for bug-gnu-emacs@gnu.org; Mon, 12 Jan 2009 15:38:21 -0500 Received: by rv-out-0708.google.com with SMTP id k29so13038755rvb.6 for ; Mon, 12 Jan 2009 12:38:19 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:subject:from:to:content-type :date:message-id:mime-version:x-mailer:content-transfer-encoding; bh=O0iZ+h5Pz6ufGrCNgD87H6KRlGuoyapQ/89yBt37cG4=; b=qS5xWeGfQBfyi9mFIFw69ZGnzp97xSjE2ny2D9sjF0u74lvM8D/hLVa30SzAk2z2Ci mgfB6OlRcgzGFyVEYgWHj3zh6+bjR10zPv/+667iDBY/Q47TlhDY0cUBgh5yuDXjJiK0 c5HSnax7UXnW09mLcY7wJpjD1vjIRKAadHb0k= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=subject:from:to:content-type:date:message-id:mime-version:x-mailer :content-transfer-encoding; b=a22iK9MEk4alGtvXXvt4HtzgHS/iy6HlajjuPqkwEcxUNFVIX3Ek/EDJnguMrZA1Ut 43Li9vlV4zxbHxQjXhNxyIEWqiTtTdwil8vRM1hUNMDtwvD/ExU79Kd1ndmgvheZeujT ByXvixW5B4b1BDD4WmI7+ElkCr19UJRJsomkY= Received: by 10.114.147.7 with SMTP id u7mr19737822wad.138.1231792699779; Mon, 12 Jan 2009 12:38:19 -0800 (PST) Received: from ?192.168.1.2? (pool-173-51-86-88.lsanca.fios.verizon.net [173.51.86.88]) by mx.google.com with ESMTPS id y25sm47669915pod.10.2009.01.12.12.38.17 (version=SSLv3 cipher=RC4-MD5); Mon, 12 Jan 2009 12:38:18 -0800 (PST) Subject: Request: Regular expressions that can match Unicode general categories From: Derick Eddington To: bug-gnu-emacs@gnu.org Content-Type: text/plain Date: Mon, 12 Jan 2009 12:38:12 -0800 Message-Id: <1231792692.22467.115.camel@eep> Mime-Version: 1.0 X-Mailer: Evolution 2.24.2 Content-Transfer-Encoding: 7bit X-detected-operating-system: by monty-python.gnu.org: GNU/Linux 2.6 (newer, 2) A new Scheme major mode I've made [1] requires regular expressions that can match characters by their Unicode general categories. It seems Emacs regular expressions do not provide a way to do that directly (I'm using GNU Emacs 23.0.60.1) (I couldn't find anything about it in the Emacs documentation, emacswiki.org, or by asking on help-gnu-emacs@gnu.org or in that list's archives). So currently I pre-compute character sets for the needed general categories (using `get-char-code-property') and place these in their positions in the larger regular expressions. However, including character sets for every general category I need makes the regular expressions too large for Emacs and it errors trying to use them (some of them are pretty big); so currently I'm not supporting all of them that are required. Another issue is these character sets are duplicated in different regular expressions and since they're so large this causes code size bloat. Another issue is I suspect matching character sets this large is not the most time-efficient. If Emacs regular expressions had some construct, similar to the existing `\cC' one, that matched a character by its general category, I think that would solve all the above issues nicely. PLT Scheme regular expressions have this ability [2]. [1] https://code.launchpad.net/~derick-eddington/scheme-mode/derick-.emacs.d [2] http://docs.plt-scheme.org/reference/regexp.html Thank you for your work on Emacs and for your time, -- : Derick ---------------------------------------------------------------- From rgm@gnu.org Mon Jan 12 14:07:38 2009 Received: (at control) by emacsbugs.donarmstrong.com; 12 Jan 2009 22:07:39 +0000 X-Spam-Checker-Version: SpamAssassin 3.2.5-bugs.debian.org_2005_01_02 (2008-06-10) on rzlab.ucr.edu X-Spam-Level: X-Spam-Bayes: score:0.5 Bayes not run. spammytokens:Tokens not available. hammytokens:Tokens not available. X-Spam-Status: No, score=0.8 required=4.0 tests=MISSING_SUBJECT,NOSUBJECT, PHONENUMBER,VALID_BTS_CONTROL,X_DEBBUGS_NO_ACK autolearn=no version=3.2.5-bugs.debian.org_2005_01_02 Received: from fencepost.gnu.org (fencepost.gnu.org [140.186.70.10]) by rzlab.ucr.edu (8.13.8/8.13.8/Debian-3) with ESMTP id n0CM7W1a028558 for ; Mon, 12 Jan 2009 14:07:33 -0800 Received: from rgm by fencepost.gnu.org with local (Exim 4.67) (envelope-from ) id 1LMUvE-0000s5-3F; Mon, 12 Jan 2009 17:06:12 -0500 MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Message-ID: <18795.48851.908021.594560@fencepost.gnu.org> Date: Mon, 12 Jan 2009 17:06:11 -0500 From: Glenn Morris To: control X-Debbugs-No-Ack: yes severity 1858 minor reassign 1861 spam severity 1863 wishlist reassign 1865 spam reassign 1866 spam reassign 1869 spam reassign 1870 spam reassign 1871 spam reassign 1872 spam reassign 1873 spam reassign 1874 spam reassign 1875 spam severity 1876 wishlist severity 1877 wishlist reassign 1878 emacs,w32 reassign 1879 emacs,w32 forcemerge 876 1878 1879 reassign 1880 emacs,org-mode severity 1880 minor From debbugs-submit-bounces@debbugs.gnu.org Mon Sep 30 03:45:21 2019 Received: (at 1877) by debbugs.gnu.org; 30 Sep 2019 07:45:21 +0000 Received: from localhost ([127.0.0.1]:56803 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1iEqMy-0004Ac-Nr for submit@debbugs.gnu.org; Mon, 30 Sep 2019 03:45:21 -0400 Received: from quimby.gnus.org ([80.91.231.51]:46652) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1iEqMw-00048G-Ve for 1877@debbugs.gnu.org; Mon, 30 Sep 2019 03:45:19 -0400 Received: from cm-84.212.202.86.getinternet.no ([84.212.202.86] helo=marnie) by quimby.gnus.org with esmtpsa (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.89) (envelope-from ) id 1iEqMt-0004sw-Gf; Mon, 30 Sep 2019 09:45:17 +0200 From: Lars Ingebrigtsen To: Derick Eddington Subject: Re: bug#1877: Request: Regular expressions that can match Unicode general categories References: <1231792692.22467.115.camel@eep> Date: Mon, 30 Sep 2019 09:45:15 +0200 In-Reply-To: <1231792692.22467.115.camel@eep> (Derick Eddington's message of "Mon, 12 Jan 2009 12:38:12 -0800") Message-ID: <87zhimfcs4.fsf@gnus.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.0.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Spam-Report: Spam detection software, running on the system "quimby.gnus.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see @@CONTACT_ADDRESS@@ for details. Content preview: Derick Eddington writes: > A new Scheme major mode I've made [1] requires regular expressions that > can match characters by their Unicode general categories. It seems > Emacs regular expressions do not provide a way to do th [...] Content analysis details: (-2.9 points, 5.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -1.0 ALL_TRUSTED Passed through trusted hosts only via SMTP -1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1% [score: 0.0000] X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 1877 Cc: 1877@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) Derick Eddington writes: > A new Scheme major mode I've made [1] requires regular expressions that > can match characters by their Unicode general categories. It seems > Emacs regular expressions do not provide a way to do that directly (I'm > using GNU Emacs 23.0.60.1) (I'm going through old bug reports that unfortunately didn't get any response at the time.) I'm not quite sure what Unicode general categories you're referring to, but the Emacs regexp matcher has gained a bunch of categories in the ten years since you made the request. Are the categories below what you were thinking of? =E2=80=98[:print:]=E2=80=99 This matches any printing character=E2=80=94either whitespace, or a gr= aphic character matched by =E2=80=98[:graph:]=E2=80=99. =E2=80=98[:punct:]=E2=80=99 This matches any punctuation character. (At present, for multibyte characters, it matches anything that has non-word syntax.) =E2=80=98[:space:]=E2=80=99 This matches any character that has whitespace syntax (*note Syntax Class Table::). =E2=80=98[:upper:]=E2=80=99 This matches any upper-case letter, as determined by the current case table (*note Case Tables::). If =E2=80=98case-fold-search=E2=80= =99 is non-=E2=80=98nil=E2=80=99, this also matches any lower-case letter. =E2=80=98[:word:]=E2=80=99 This matches any character that has word syntax (*note Syntax Class Table::). (etc) --=20 (domestic pets only, the antidote for overdose, milk.) bloggy blog: http://lars.ingebrigtsen.no From debbugs-submit-bounces@debbugs.gnu.org Mon Sep 30 03:45:25 2019 Received: (at control) by debbugs.gnu.org; 30 Sep 2019 07:45:25 +0000 Received: from localhost ([127.0.0.1]:56806 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1iEqN3-0004Fh-2z for submit@debbugs.gnu.org; Mon, 30 Sep 2019 03:45:25 -0400 Received: from quimby.gnus.org ([80.91.231.51]:46668) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1iEqN2-0004EM-8c for control@debbugs.gnu.org; Mon, 30 Sep 2019 03:45:24 -0400 Received: from cm-84.212.202.86.getinternet.no ([84.212.202.86] helo=marnie) by quimby.gnus.org with esmtpsa (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.89) (envelope-from ) id 1iEqMz-0004t4-GA for control@debbugs.gnu.org; Mon, 30 Sep 2019 09:45:23 +0200 Date: Mon, 30 Sep 2019 09:45:21 +0200 Message-Id: <87y2y6fcry.fsf@gnus.org> To: control@debbugs.gnu.org From: Lars Ingebrigtsen Subject: control message for bug #1877 X-Spam-Report: Spam detection software, running on the system "quimby.gnus.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see @@CONTACT_ADDRESS@@ for details. Content preview: tags 1877 + moreinfo quit Content analysis details: (-2.9 points, 5.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -1.0 ALL_TRUSTED Passed through trusted hosts only via SMTP -1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1% [score: 0.0000] X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: control X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) tags 1877 + moreinfo quit From debbugs-submit-bounces@debbugs.gnu.org Mon Sep 30 04:45:30 2019 Received: (at 1877) by debbugs.gnu.org; 30 Sep 2019 08:45:30 +0000 Received: from localhost ([127.0.0.1]:56913 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1iErJC-0005Gy-B7 for submit@debbugs.gnu.org; Mon, 30 Sep 2019 04:45:30 -0400 Received: from eggs.gnu.org ([209.51.188.92]:44451) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1iErJ9-0005Gk-Vy for 1877@debbugs.gnu.org; Mon, 30 Sep 2019 04:45:28 -0400 Received: from fencepost.gnu.org ([2001:470:142:3::e]:35580) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1iErJ0-00043Z-7I; Mon, 30 Sep 2019 04:45:20 -0400 Received: from [176.228.60.248] (port=4816 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1iErIz-0000qO-JQ; Mon, 30 Sep 2019 04:45:18 -0400 Date: Mon, 30 Sep 2019 11:45:14 +0300 Message-Id: <83r23ycgv9.fsf@gnu.org> From: Eli Zaretskii To: Lars Ingebrigtsen In-reply-to: <87zhimfcs4.fsf@gnus.org> (message from Lars Ingebrigtsen on Mon, 30 Sep 2019 09:45:15 +0200) Subject: Re: bug#1877: Request: Regular expressions that can match Unicode general categories References: <1231792692.22467.115.camel@eep> <87zhimfcs4.fsf@gnus.org> MIME-version: 1.0 Content-type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: 1877 Cc: derick.eddington@gmail.com, 1877@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) > From: Lars Ingebrigtsen > Date: Mon, 30 Sep 2019 09:45:15 +0200 > Cc: 1877@debbugs.gnu.org > > Derick Eddington writes: > > > A new Scheme major mode I've made [1] requires regular expressions that > > can match characters by their Unicode general categories. It seems > > Emacs regular expressions do not provide a way to do that directly (I'm > > using GNU Emacs 23.0.60.1) > > (I'm going through old bug reports that unfortunately didn't get any > response at the time.) > > I'm not quite sure what Unicode general categories you're referring to, > but the Emacs regexp matcher has gained a bunch of categories in the ten > years since you made the request. > > Are the categories below what you were thinking of? > > ‘[:print:]’ > This matches any printing character—either whitespace, or a graphic > character matched by ‘[:graph:]’. > ‘[:punct:]’ > This matches any punctuation character. (At present, for multibyte > characters, it matches anything that has non-word syntax.) > ‘[:space:]’ > This matches any character that has whitespace syntax (*note Syntax > Class Table::). > ‘[:upper:]’ > This matches any upper-case letter, as determined by the current > case table (*note Case Tables::). If ‘case-fold-search’ is > non-‘nil’, this also matches any lower-case letter. > ‘[:word:]’ > This matches any character that has word syntax (*note Syntax Class > Table::). No, he means the categories described in the node "Character Properties" of the ELisp manual. We don't yet have full support for the Unicode Regular Expressions, as specified in UTS#18. In particular, see http://unicode.org/reports/tr18/#General_Category_Property for General Category regexp specs. It is not clear to me which categories are of interest here. Some of them are nowadays definitely available indirectly via the classes mentioned above (they weren't available in Emacs 23 when the bug was filed). Maybe the OP could provide an explicit list of categories needed for this Scheme mode, together with their required usage in this mode. Looking at R6RS sec 4.2.1, all I see is "whitespace" (which we provide via [:blank:]), "letter" (provided by [:alpha:]), "digit" (provided by [:alnum:]), and "intraline whitespace" (provided by [:blank:]). If this is all, then we have all the required support now. From debbugs-submit-bounces@debbugs.gnu.org Thu Jan 16 09:08:25 2020 Received: (at control) by debbugs.gnu.org; 16 Jan 2020 14:08:25 +0000 Received: from localhost ([127.0.0.1]:36900 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1is5or-00022N-AO for submit@debbugs.gnu.org; Thu, 16 Jan 2020 09:08:25 -0500 Received: from ted.gofardesign.uk ([67.225.143.91]:47278) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1is5ol-00021v-T4 for control@debbugs.gnu.org; Thu, 16 Jan 2020 09:08:19 -0500 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=marxist.se; s=default; h=Content-Type:MIME-Version:Message-ID:Date:References: In-Reply-To:Subject:To:From:Sender:Reply-To:Cc:Content-Transfer-Encoding: Content-ID:Content-Description:Resent-Date:Resent-From:Resent-Sender: Resent-To:Resent-Cc:Resent-Message-ID:List-Id:List-Help:List-Unsubscribe: List-Subscribe:List-Post:List-Owner:List-Archive; bh=3VU1En1XNSYObIlS1jWN2zo1F6ZhSZbWKptMCcTmD6s=; b=j9n7+n80gn728yYLBdtFEMzNPr KcyVAQLnKbnmh8DLsDQ7tMkekf2gJPFDC0QmZlf70LjD8KzdzAFPFDNGy8RASQfCnVmHyA8lT3mKK 2bDnqwRmmlxO4N3ajE3xwP4t2fSZp9TMbUu5yk5DkEFMyDF00kcj5gQcBy+kGsSpXJxLY34LsPXqv lw8zIRq475ZA2Y6SsZc0GtwRw+zTH2UXGzVuwmtn3gpW/skjHwrFZrXfYsDPYI4WA2ddXbOm6urBi wO/zRmh9hJdS2+WdwSVPR3eygZicycvAgqzMGM/G66uOudVtMcy7P3UpdNn647m7l7NqWQbSjoK1J kVb3xUrw==; Received: from h-70-69.a785.priv.bahnhof.se ([155.4.70.69]:59958 helo=localhost) by ted.gofardesign.uk with esmtpsa (TLSv1.2:ECDHE-RSA-AES256-GCM-SHA384:256) (Exim 4.92) (envelope-from ) id 1is5oe-000JKH-9J for control@debbugs.gnu.org; Thu, 16 Jan 2020 09:08:08 -0500 From: Stefan Kangas To: control Subject: Re: control message for bug #1877 In-Reply-To: <87y2y6fcry.fsf@gnus.org> (Lars Ingebrigtsen's message of "Mon, 30 Sep 2019 09:45:21 +0200") References: <87y2y6fcry.fsf@gnus.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux) Date: Thu, 16 Jan 2020 15:08:06 +0100 Message-ID: <87blr3xznd.fsf@marxist.se> MIME-Version: 1.0 Content-Type: text/plain X-AntiAbuse: This header was added to track abuse, please include it with any abuse report X-AntiAbuse: Primary Hostname - ted.gofardesign.uk X-AntiAbuse: Original Domain - debbugs.gnu.org X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12] X-AntiAbuse: Sender Address Domain - marxist.se X-Get-Message-Sender-Via: ted.gofardesign.uk: authenticated_id: stefan@marxist.se X-Authenticated-Sender: ted.gofardesign.uk: stefan@marxist.se X-Source: X-Source-Args: X-Source-Dir: X-Spam-Score: 3.0 (+++) X-Spam-Report: Spam detection software, running on the system "debbugs.gnu.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see the administrator of that system for details. Content preview: tags 1877 - moreinfo quit Content analysis details: (3.0 points, 10.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- 0.0 URIBL_BLOCKED ADMINISTRATOR NOTICE: The query to URIBL was blocked. See http://wiki.apache.org/spamassassin/DnsBlocklists#dnsbl-block for more information. [URIs: marxist.se] 0.0 SPF_HELO_NONE SPF: HELO does not publish an SPF Record -0.0 SPF_PASS SPF: sender matches SPF record 1.0 PDS_TONAME_EQ_TOLOCAL_VSHORT Very short body and From looks like 2 different emails 2.0 PDS_TONAME_EQ_TOLOCAL_SHORT Short body with To: name matches everything in local email X-Debbugs-Envelope-To: control X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: 2.0 (++) X-Spam-Report: Spam detection software, running on the system "debbugs.gnu.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see the administrator of that system for details. Content preview: tags 1877 - moreinfo quit Content analysis details: (2.0 points, 10.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- 0.0 URIBL_BLOCKED ADMINISTRATOR NOTICE: The query to URIBL was blocked. See http://wiki.apache.org/spamassassin/DnsBlocklists#dnsbl-block for more information. [URIs: marxist.se] 0.0 SPF_HELO_NONE SPF: HELO does not publish an SPF Record -0.0 SPF_PASS SPF: sender matches SPF record 1.0 PDS_TONAME_EQ_TOLOCAL_VSHORT Very short body and From looks like 2 different emails 2.0 PDS_TONAME_EQ_TOLOCAL_SHORT Short body with To: name matches everything in local email -1.0 MAILING_LIST_MULTI Multiple indicators imply a widely-seen list manager tags 1877 - moreinfo quit From debbugs-submit-bounces@debbugs.gnu.org Sun Nov 14 01:28:18 2021 Received: (at 1877) by debbugs.gnu.org; 14 Nov 2021 06:28:18 +0000 Received: from localhost ([127.0.0.1]:48875 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mm8zy-00062b-DC for submit@debbugs.gnu.org; Sun, 14 Nov 2021 01:28:18 -0500 Received: from quimby.gnus.org ([95.216.78.240]:56880) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mm8zw-00062L-VS for 1877@debbugs.gnu.org; Sun, 14 Nov 2021 01:28:17 -0500 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnus.org; s=20200322; h=Content-Type:MIME-Version:Message-ID:In-Reply-To:Date: References:Subject:Cc:To:From:Sender:Reply-To:Content-Transfer-Encoding: Content-ID:Content-Description:Resent-Date:Resent-From:Resent-Sender: Resent-To:Resent-Cc:Resent-Message-ID:List-Id:List-Help:List-Unsubscribe: List-Subscribe:List-Post:List-Owner:List-Archive; bh=htJEtKLYwU8Hb2P5y33LjTPrvERHPxc/ADB7WGDZt8U=; b=FMgPLYBQEJ5SaqL1aGNTEzuRHh 2QUfabAfsVK/3ihGJ7sUV88ExRZc3vIcb+vmok9vNGef5VhcEcuXqS/VdePr/qaRJG2/58nyOumto LtYXfMd4pEerYz878E6QpqFGIRC5x9nv8YneYCQGeogISB0Dm7gCDeX9NFqoxMFeoNmI=; Received: from [84.212.220.105] (helo=xo) by quimby.gnus.org with esmtpsa (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1mm8zn-0001Je-Bl; Sun, 14 Nov 2021 07:28:10 +0100 From: Lars Ingebrigtsen To: Eli Zaretskii Subject: Re: bug#1877: Request: Regular expressions that can match Unicode general categories References: <1231792692.22467.115.camel@eep> <87zhimfcs4.fsf@gnus.org> <83r23ycgv9.fsf@gnu.org> X-Now-Playing: Charles Manier's _Two Synths, A Guitar (And) A Drum Machine_: "Sift Through Art Collecting People" Date: Sun, 14 Nov 2021 07:28:06 +0100 In-Reply-To: <83r23ycgv9.fsf@gnu.org> (Eli Zaretskii's message of "Mon, 30 Sep 2019 11:45:14 +0300") Message-ID: <877ddbb6a1.fsf@gnus.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/29.0.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Spam-Report: Spam detection software, running on the system "quimby.gnus.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see @@CONTACT_ADDRESS@@ for details. Content preview: Eli Zaretskii writes: > It is not clear to me which categories are of interest here. Some of > them are nowadays definitely available indirectly via the classes > mentioned above (they weren't available in Emacs 23 when th [...] Content analysis details: (-2.9 points, 5.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -1.0 ALL_TRUSTED Passed through trusted hosts only via SMTP -1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1% [score: 0.0000] X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: 1877 Cc: derick.eddington@gmail.com, 1877@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) Eli Zaretskii writes: > It is not clear to me which categories are of interest here. Some of > them are nowadays definitely available indirectly via the classes > mentioned above (they weren't available in Emacs 23 when the bug was > filed). Maybe the OP could provide an explicit list of categories > needed for this Scheme mode, together with their required usage in > this mode. Looking at R6RS sec 4.2.1, all I see is "whitespace" > (which we provide via [:blank:]), "letter" (provided by [:alpha:]), > "digit" (provided by [:alnum:]), and "intraline whitespace" (provided > by [:blank:]). If this is all, then we have all the required support > now. There was no response here (in two years), so I'm guessing that we have the categories required, and I'm closing this bug report. If there are any further categories that would be useful to have added, please respond to the debbugs address and we'll reopen. -- (domestic pets only, the antidote for overdose, milk.) bloggy blog: http://lars.ingebrigtsen.no From debbugs-submit-bounces@debbugs.gnu.org Sun Nov 14 01:28:22 2021 Received: (at control) by debbugs.gnu.org; 14 Nov 2021 06:28:22 +0000 Received: from localhost ([127.0.0.1]:48878 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mm902-00062x-L1 for submit@debbugs.gnu.org; Sun, 14 Nov 2021 01:28:22 -0500 Received: from quimby.gnus.org ([95.216.78.240]:56896) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mm900-00062R-Vf for control@debbugs.gnu.org; Sun, 14 Nov 2021 01:28:21 -0500 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnus.org; s=20200322; h=Subject:From:To:Message-Id:Date:Sender:Reply-To:Cc: MIME-Version:Content-Type:Content-Transfer-Encoding:Content-ID: Content-Description:Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc :Resent-Message-ID:In-Reply-To:References:List-Id:List-Help:List-Unsubscribe: List-Subscribe:List-Post:List-Owner:List-Archive; bh=TIPBkV93dn4cAsICI+Pd16syeTW5frEvLupFs5cjwhE=; b=nhS1XYAy+t9OJA6f0oPp5aNfS/ krnSjrqzktV5K9caSusyasKjbm65dMseZmaa8Fogdi6LNHCBbShU7K/F2kQ2IpIS6ICB13SAMnZXO QNgzvT0dnyekVlPqQUZyUhQoTC6qNFPwwcpvj0W2KGMEtKkEv3Vf/EgFiyQ7MBzVYnV4=; Received: from [84.212.220.105] (helo=xo) by quimby.gnus.org with esmtpsa (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1mm8zt-0001Jn-8W for control@debbugs.gnu.org; Sun, 14 Nov 2021 07:28:15 +0100 Date: Sun, 14 Nov 2021 07:28:12 +0100 Message-Id: <875ysvb69v.fsf@gnus.org> To: control@debbugs.gnu.org From: Lars Ingebrigtsen Subject: control message for bug #1877 X-Spam-Report: Spam detection software, running on the system "quimby.gnus.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see @@CONTACT_ADDRESS@@ for details. Content preview: close 1877 quit Content analysis details: (-2.9 points, 5.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -1.0 ALL_TRUSTED Passed through trusted hosts only via SMTP -1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1% [score: 0.0000] X-Spam-Score: -2.3 (--) X-Debbugs-Envelope-To: control X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) close 1877 quit From unknown Sun Jun 22 15:24:59 2025 Received: (at fakecontrol) by fakecontrolmessage; To: internal_control@debbugs.gnu.org From: Debbugs Internal Request Subject: Internal Control Message-Id: bug archived. Date: Sun, 12 Dec 2021 12:24:10 +0000 User-Agent: Fakemail v42.6.9 # This is a fake control message. # # The action: # bug archived. thanks # This fakemail brought to you by your local debbugs # administrator