From debbugs-submit-bounces@debbugs.gnu.org Sun Dec 27 16:30:48 2020 Received: (at submit) by debbugs.gnu.org; 27 Dec 2020 21:30:48 +0000 Received: from localhost ([127.0.0.1]:33684 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1ktdcl-0006aL-S4 for submit@debbugs.gnu.org; Sun, 27 Dec 2020 16:30:48 -0500 Received: from lists.gnu.org ([209.51.188.17]:42578) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1ktdci-0006aC-W5 for submit@debbugs.gnu.org; Sun, 27 Dec 2020 16:30:46 -0500 Received: from eggs.gnu.org ([2001:470:142:3::10]:54518) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1ktdci-00036q-PY for bug-gnu-emacs@gnu.org; Sun, 27 Dec 2020 16:30:44 -0500 Received: from andalucia.tim-landscheidt.de ([2a01:4f8:1c1c:d4d0::1]:44568) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1ktdcf-0006j6-NP for bug-gnu-emacs@gnu.org; Sun, 27 Dec 2020 16:30:44 -0500 Received: from dslb-090-186-126-124.090.186.pools.vodafone-ip.de ([90.186.126.124]:44220 helo=passepartout.tim-landscheidt.de) by andalucia.tim-landscheidt.de with esmtpsa (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.89) (envelope-from ) id 1ktdcb-0005NN-6d for bug-gnu-emacs@gnu.org; Sun, 27 Dec 2020 22:30:37 +0100 From: Tim Landscheidt To: bug-gnu-emacs@gnu.org Subject: 27.1; RFE: Make full RSS fragments available for nnrss servers Organization: http://www.tim-landscheidt.de/ X-Debbugs-Package: emacs,gnus Date: Sun, 27 Dec 2020 21:30:36 +0000 Message-ID: <874kk7os1v.fsf@passepartout.tim-landscheidt.de> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.1 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain Received-SPF: pass client-ip=2a01:4f8:1c1c:d4d0::1; envelope-from=tim@tim-landscheidt.de; helo=andalucia.tim-landscheidt.de X-Spam_score_int: -6 X-Spam_score: -0.7 X-Spam_bar: / X-Spam_report: (-0.7 / 5.0 requ) BAYES_00=-1.9, LONG_HEX_URI=1.194, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=no autolearn_force=no X-Spam_action: no action X-Spam-Score: -0.2 (/) X-Debbugs-Envelope-To: submit X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.2 (-) Severity: wishlist Some RSS feeds provide additional data in extra tags; for example, http://feeds.feedburner.com/DougLovesMovies (and others) includes information for/from iTunes: | | Jimmy Pardo, Matt Belknap, Eliot Hochberg and Garon Cockrell guest | In a special holiday Doug Loves Movies-Never Not Funny cross-over event, Doug welcomes Jimmy Pardo, Matt Belknap, Eliot Hochberg and Garon Cockrell to the show.

]]>
| Jimmy Pardo, Matt Belknap, Eliot Hochberg and Garon Cockrell guest | full | 1286 | In a special holiday Doug Loves Movies-Never Not Funny cross-over event, Doug welcomes Jimmy Pardo, Matt Belknap, Eliot Hochberg and Garon Cockrell to the show. | In a special holiday Doug Loves Movies-Never Not Funny cross-over event, Doug welcomes Jimmy Pardo, Matt Belknap, Eliot Hochberg and Garon Cockrell to the show.

]]>
| gid://art19-episode-locator/V0/R8l82ylk4BaF_BypOVQuG89EyfRunLiF485aNFQW_mA | Thu, 24 Dec 2020 08:00:00 -0000 | yes | | DLM | 00:48:13 | | https://rss.art19.com/episodes/7d50103f-d685-4f0f-814e-b38fd2d643d8.mp3 |
AFAICT, all information except title, date, description and enclosure gets thrown away by nnrss-check-group. This makes it impossible to process this information when displaying an article. It would be very useful to have this information available. A very simplistic solution would be to add item to the tuple that gets pushed to nnrss-group-data by nnrss-check-group so that it can be accessed via: | (nth 9 (alist-get | (gnus-summary-article-number) | nnrss-group-data)) (This method, with 9 replaced by 2 or 6, already allows ac- cess to "pure" representations of title, URL & Co.) However it might be prudent to have a more stable inter- face :-). From debbugs-submit-bounces@debbugs.gnu.org Sun Dec 27 19:06:02 2020 Received: (at 45477) by debbugs.gnu.org; 28 Dec 2020 00:06:02 +0000 Received: from localhost ([127.0.0.1]:33874 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1ktg30-00044F-12 for submit@debbugs.gnu.org; Sun, 27 Dec 2020 19:06:02 -0500 Received: from quimby.gnus.org ([95.216.78.240]:37088) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1ktg2x-00043w-Lu for 45477@debbugs.gnu.org; Sun, 27 Dec 2020 19:06:00 -0500 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnus.org; s=20200322; h=Content-Type:MIME-Version:Message-ID:In-Reply-To:Date: References:Subject:Cc:To:From:Sender:Reply-To:Content-Transfer-Encoding: Content-ID:Content-Description:Resent-Date:Resent-From:Resent-Sender: Resent-To:Resent-Cc:Resent-Message-ID:List-Id:List-Help:List-Unsubscribe: List-Subscribe:List-Post:List-Owner:List-Archive; bh=eqKIs9+a67PzokiQi6mDGHLYKIu8qEe1lb6rjd24a6s=; b=UNoWgICZJvCcawXCBtBFpF5QwI fjlSkRy7TeS6fY6xASFXhmvnc58PfS9cKGdXni5HCFNOOAd2TCAEXUJzTqMyCdKkDB9akyS62Q7rX t1EW2ZUvtevLP/15BiDRwBXJ6psAjEvWPbLnSygnhH3lJYilBKQdqYzC6yvHXXBoSvHs=; Received: from cm-84.212.202.86.getinternet.no ([84.212.202.86] helo=xo) by quimby.gnus.org with esmtpsa (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1ktg2l-0005wk-GP; Mon, 28 Dec 2020 01:05:53 +0100 From: Lars Ingebrigtsen To: Tim Landscheidt Subject: Re: bug#45477: 27.1; RFE: Make full RSS fragments available for nnrss servers References: <874kk7os1v.fsf@passepartout.tim-landscheidt.de> X-Now-Playing: Marianne Faithfull's _Broken English_: "Why D'Ya Do It" Date: Mon, 28 Dec 2020 01:05:45 +0100 In-Reply-To: <874kk7os1v.fsf@passepartout.tim-landscheidt.de> (Tim Landscheidt's message of "Sun, 27 Dec 2020 21:30:36 +0000") Message-ID: <87lfdi6bhi.fsf@gnus.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Spam-Report: Spam detection software, running on the system "quimby.gnus.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see @@CONTACT_ADDRESS@@ for details. Content preview: Tim Landscheidt writes: > Some RSS feeds provide additional data in extra tags; for > example, http://feeds.feedburner.com/DougLovesMovies (and > others) includes information for/from iTunes: Content analysis details: (-2.9 points, 5.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -1.0 ALL_TRUSTED Passed through trusted hosts only via SMTP -1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1% [score: 0.0000] X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 45477 Cc: 45477@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) Tim Landscheidt writes: > Some RSS feeds provide additional data in extra tags; for > example, http://feeds.feedburner.com/DougLovesMovies (and > others) includes information for/from iTunes: [...] > | Jimmy Pardo, Matt Belknap, Eliot Hochberg and > | Garon Cockrell guest > | full > | 1286 > | In a special holiday Doug Loves Movies-Never > | Not Funny cross-over event, Doug welcomes Jimmy Pardo, Matt Belknap, > | Eliot Hochberg and Garon Cockrell to the show. This information? > | In a special holiday Doug Loves > | Movies-Never Not Funny cross-over event, Doug welcomes Jimmy Pardo, > | Matt Belknap, Eliot Hochberg and Garon Cockrell to the > | show.

]]>
[...] > | | url="http://feedproxy.google.com/~r/DougLovesMovies/~5/4ig02kQVqkQ/7d50103f-d685-4f0f-814e-b38fd2d643d8.mp3" > | length="46292323" type="audio/mpeg"/> I'm not very familiar with nnrss, but it looks like the itunes: info mostly replicates the info in the other fields? > It would be very useful to have this information available. > A very simplistic solution would be to add item to the tuple > that gets pushed to nnrss-group-data by nnrss-check-group so > that it can be accessed via: > > | (nth 9 (alist-get > | (gnus-summary-article-number) > | nnrss-group-data)) > > (This method, with 9 replaced by 2 or 6, already allows ac- > cess to "pure" representations of title, URL & Co.) > > However it might be prudent to have a more stable inter- > face :-). Sure, I guess stashing it there would make sense, but it would require people that want to use the info to write a bit of code, right? Just stashing all the info there seems a bit... odd to me somehow. I don't think any other backends do that? So I'm wondering whether this could be fixed in some other way, that would be useful to everybody without writing further code to use the data. So would it make sense just to include the data from the extra fields here in the message body? -- (domestic pets only, the antidote for overdose, milk.) bloggy blog: http://lars.ingebrigtsen.no From debbugs-submit-bounces@debbugs.gnu.org Tue Dec 29 11:50:13 2020 Received: (at 45477) by debbugs.gnu.org; 29 Dec 2020 16:50:13 +0000 Received: from localhost ([127.0.0.1]:40055 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kuICK-0001lZ-Re for submit@debbugs.gnu.org; Tue, 29 Dec 2020 11:50:13 -0500 Received: from andalucia.tim-landscheidt.de ([116.203.78.250]:34536) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kuICH-0001lJ-Qx for 45477@debbugs.gnu.org; Tue, 29 Dec 2020 11:50:10 -0500 Received: from dslb-090-186-126-124.090.186.pools.vodafone-ip.de ([90.186.126.124]:57254 helo=passepartout.tim-landscheidt.de) by andalucia.tim-landscheidt.de with esmtpsa (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.89) (envelope-from ) id 1kuICF-000311-Lq; Tue, 29 Dec 2020 17:50:07 +0100 From: Tim Landscheidt To: Lars Ingebrigtsen Subject: Re: bug#45477: 27.1; RFE: Make full RSS fragments available for nnrss servers Organization: http://www.tim-landscheidt.de/ References: <874kk7os1v.fsf@passepartout.tim-landscheidt.de> <87lfdi6bhi.fsf@gnus.org> Date: Tue, 29 Dec 2020 16:50:07 +0000 In-Reply-To: <87lfdi6bhi.fsf@gnus.org> (Lars Ingebrigtsen's message of "Mon, 28 Dec 2020 01:05:45 +0100") Message-ID: <87wnx0mu9s.fsf@passepartout.tim-landscheidt.de> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.1 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 45477 Cc: 45477@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) Lars Ingebrigtsen wrote: >> Some RSS feeds provide additional data in extra tags; for >> example, http://feeds.feedburner.com/DougLovesMovies (and >> others) includes information for/from iTunes: > [...] >> | Jimmy Pardo, Matt Belknap, Eliot Hochberg and >> | Garon Cockrell guest >> | full >> | 1286 >> | In a special holiday Doug Loves Movies-Never >> | Not Funny cross-over event, Doug welcomes Jimmy Pardo, Matt Belknap, >> | Eliot Hochberg and Garon Cockrell to the show. > This information? >> | In a special holiday Doug Loves >> | Movies-Never Not Funny cross-over event, Doug welcomes Jimmy Pardo, >> | Matt Belknap, Eliot Hochberg and Garon Cockrell to the >> | show.

]]>
> [...] >> | > | url="http://feedproxy.google.com/~r/DougLovesMovies/~5/4ig02kQVqkQ/7d50103f-d685-4f0f-814e-b38fd2d643d8.mp3" >> | length="46292323" type="audio/mpeg"/> > I'm not very familiar with nnrss, but it looks like the itunes: info > mostly replicates the info in the other fields? No; for example and especially in my use case at hand, the content of itunes:episode is not contained anywhere else. >> It would be very useful to have this information available. >> A very simplistic solution would be to add item to the tuple >> that gets pushed to nnrss-group-data by nnrss-check-group so >> that it can be accessed via: >> | (nth 9 (alist-get >> | (gnus-summary-article-number) >> | nnrss-group-data)) >> (This method, with 9 replaced by 2 or 6, already allows ac- >> cess to "pure" representations of title, URL & Co.) >> However it might be prudent to have a more stable inter- >> face :-). > Sure, I guess stashing it there would make sense, but it would require > people that want to use the info to write a bit of code, right? Just > stashing all the info there seems a bit... odd to me somehow. I don't > think any other backends do that? > So I'm wondering whether this could be fixed in some other way, that > would be useful to everybody without writing further code to use the > data. So would it make sense just to include the data from the extra > fields here in the message body? To draw a bigger picture: My use case (and existing workflow with newsticker) is, when displaying an episode's Gnus arti- cle, to provide the user (me) with a command to import the episode's data (feed title, episode number, episode title, episode URL) into my database for further processing. Therefore, I need the data, and I need it in a format that can be processed further, and there will be a need for a custom user function to process the data because each use case will be different. So just unconditionally mangling and dumping the data into the message body will help neither me nor the users who just want to use nnrss "normally". I thought about including the raw XML fragments as either a Base64-encoded X-Gnus-nnrss-Entry-XML header or a multipart/alternative MIME part. However, regardless of the solution, Gnus would need to provide a function that returns the DOM for the current/an article, and with the current de- sign, adding an element to nnrss-group-data is probably the easiest path of the three. (If there was a major overhaul of nnrss, it could be inter- esting to forego the intermediate nnrss-group-data saved in ~/News/rss/* and either store the feeds as pure XML files, re-parsed on demand and available for further processing, or write out all the articles as mbox files after parsing the feeds, with the entries' fragments as MIME parts.) From debbugs-submit-bounces@debbugs.gnu.org Tue Dec 29 22:03:13 2020 Received: (at 45477) by debbugs.gnu.org; 30 Dec 2020 03:03:13 +0000 Received: from localhost ([127.0.0.1]:40634 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kuRlY-00019x-WE for submit@debbugs.gnu.org; Tue, 29 Dec 2020 22:03:13 -0500 Received: from quimby.gnus.org ([95.216.78.240]:60530) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kuRlX-00019j-Ba for 45477@debbugs.gnu.org; Tue, 29 Dec 2020 22:03:11 -0500 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnus.org; s=20200322; h=Content-Type:MIME-Version:Message-ID:In-Reply-To:Date: References:Subject:Cc:To:From:Sender:Reply-To:Content-Transfer-Encoding: Content-ID:Content-Description:Resent-Date:Resent-From:Resent-Sender: Resent-To:Resent-Cc:Resent-Message-ID:List-Id:List-Help:List-Unsubscribe: List-Subscribe:List-Post:List-Owner:List-Archive; bh=nPsNa1/MnCEUoxuK62S23j1KA+OhRjIrb2Xo42dXHwU=; b=bNIJX/64hDEG+TkoF3uuMExkxi l6nlysl2RzkrD5Lg3XpKnvQ8L54V2ODWCLFE+pUp/1mxWdpYlE+SG+BZLpqgTs0ISP6ozyp6oNaZ7 k9o7xgGeEdz9q6hNOurmYaQhbtxEyGsZrHxsAYLtyQiFq5vc54TzDpr5Msmm9YjePoKA=; Received: from cm-84.212.202.86.getinternet.no ([84.212.202.86] helo=xo) by quimby.gnus.org with esmtpsa (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1kuRlI-0001AT-Vq; Wed, 30 Dec 2020 04:03:05 +0100 From: Lars Ingebrigtsen To: Tim Landscheidt Subject: Re: bug#45477: 27.1; RFE: Make full RSS fragments available for nnrss servers References: <874kk7os1v.fsf@passepartout.tim-landscheidt.de> <87lfdi6bhi.fsf@gnus.org> <87wnx0mu9s.fsf@passepartout.tim-landscheidt.de> Face: iVBORw0KGgoAAAANSUhEUgAAADAAAAAwBAMAAAClLOS0AAAABGdBTUEAALGPC/xhBQAAACBj SFJNAAB6JgAAgIQAAPoAAACA6AAAdTAAAOpgAAA6mAAAF3CculE8AAAAElBMVEVeMjCKSzkXDRSj Z1ezhGj///+MK9mlAAAAAWJLR0QF+G/pxwAAAAd0SU1FB+QMHgIkBbYx/94AAAGVSURBVDjLrZTh leMgDISF2AJQcAEyuQLMygXcLvRf00lAHHD+Hi8vifnsGTEIA1B6D5ZrAEBK+kn6xwVXwUZsAFNC 6MPVE57ILC8w5oHD1n47SMugt8cK/AXCCvb/B64lwuM9aZXfAWt1uxBOUjgUQtokY3qugGPJ+yny 8YQTyb4Y4BVonD/bDTQLkVoUqIdfAOp8UYtPALFUvdBy4yoF6CRnmCxe68DkDvt+LoCRPcSDOc1A Q+TgSGKWAOjfAAxQNJAJ5/ZRR9toBXp1A05OBVqpm0FCBTUbyH4BPUN1keMOOAqReAl3ABQz6Owj TqAlomWF5DPOwCIBMwEKC+hS+kgOkOT+BFmPJ2C5ebALdggeo6iSO2AFRKQRdgvdMuoAWY1Nq4Gz BFLSgLkrGhtef+OQgpZvfhjIW4hwasNYuKGlNdpQBYPIAGhp5VZ4/N7rXzvP8K06TW90N/kCVtgL TCfn+Kq1rxzG3S3PXaAW3ZsPEI+vQiMruLQQWIvNWym94a63gAPULYDSVz5p/bFXz7FnZPoHnpKa sWn/yd8AAAAldEVYdGRhdGU6Y3JlYXRlADIwMjAtMTItMzBUMDI6MzY6MDUrMDA6MDDoTYxrAAAA JXRFWHRkYXRlOm1vZGlmeQAyMDIwLTEyLTMwVDAyOjM2OjA1KzAwOjAwmRA01wAAAABJRU5ErkJg gg== X-Now-Playing: David Bowie's _ChangesTwoBowie_: "Sound And Vision" Date: Wed, 30 Dec 2020 04:02:55 +0100 In-Reply-To: <87wnx0mu9s.fsf@passepartout.tim-landscheidt.de> (Tim Landscheidt's message of "Tue, 29 Dec 2020 16:50:07 +0000") Message-ID: <87wnx0q9ls.fsf@gnus.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Spam-Report: Spam detection software, running on the system "quimby.gnus.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see @@CONTACT_ADDRESS@@ for details. Content preview: Tim Landscheidt writes: > Therefore, I need the data, and I need it in a format that > can be processed further, and there will be a need for a > custom user function to process the data because each use > case will be diffe [...] Content analysis details: (-2.9 points, 5.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -1.0 ALL_TRUSTED Passed through trusted hosts only via SMTP -1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1% [score: 0.0000] X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 45477 Cc: 45477@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) Tim Landscheidt writes: > Therefore, I need the data, and I need it in a format that > can be processed further, and there will be a need for a > custom user function to process the data because each use > case will be different. Sure, that sounds reasonable. However: > (If there was a major overhaul of nnrss, it could be inter- > esting to forego the intermediate nnrss-group-data saved in > ~/News/rss/* and either store the feeds as pure XML files, > re-parsed on demand and available for further processing, or > write out all the articles as mbox files after parsing the > feeds, with the entries' fragments as MIME parts.) I've not used nnrss myself, but reading the code, it seems like it's storing all the data needed for Gnus to read an nnrss group in `nnrss-group-data', so storing all the XML data in case somebody is going to use it would require orders of magnitude more storage? I think a way to implement this would be to add an nnrss variable that says what "extra" XML fields to store -- like (nnrss-extra-fields '(itunes:episodeType ...)). -- (domestic pets only, the antidote for overdose, milk.) bloggy blog: http://lars.ingebrigtsen.no From debbugs-submit-bounces@debbugs.gnu.org Wed Dec 30 03:22:23 2020 Received: (at 45477) by debbugs.gnu.org; 30 Dec 2020 08:22:23 +0000 Received: from localhost ([127.0.0.1]:40818 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kuWkR-0004Xh-7V for submit@debbugs.gnu.org; Wed, 30 Dec 2020 03:22:23 -0500 Received: from andalucia.tim-landscheidt.de ([116.203.78.250]:34556) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kuWkM-0004XW-4R for 45477@debbugs.gnu.org; Wed, 30 Dec 2020 03:22:22 -0500 Received: from dslb-090-186-126-124.090.186.pools.vodafone-ip.de ([90.186.126.124]:42182 helo=passepartout.tim-landscheidt.de) by andalucia.tim-landscheidt.de with esmtpsa (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.89) (envelope-from ) id 1kuWkK-0000WC-D4; Wed, 30 Dec 2020 09:22:16 +0100 From: Tim Landscheidt To: Lars Ingebrigtsen Subject: Re: bug#45477: 27.1; RFE: Make full RSS fragments available for nnrss servers Organization: http://www.tim-landscheidt.de/ References: <874kk7os1v.fsf@passepartout.tim-landscheidt.de> <87lfdi6bhi.fsf@gnus.org> <87wnx0mu9s.fsf@passepartout.tim-landscheidt.de> <87wnx0q9ls.fsf@gnus.org> Date: Wed, 30 Dec 2020 08:22:15 +0000 In-Reply-To: <87wnx0q9ls.fsf@gnus.org> (Lars Ingebrigtsen's message of "Wed, 30 Dec 2020 04:02:55 +0100") Message-ID: <87czyrn1oo.fsf@passepartout.tim-landscheidt.de> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.1 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 45477 Cc: 45477@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) Lars Ingebrigtsen wrote: >> Therefore, I need the data, and I need it in a format that >> can be processed further, and there will be a need for a >> custom user function to process the data because each use >> case will be different. > Sure, that sounds reasonable. However: >> (If there was a major overhaul of nnrss, it could be inter- >> esting to forego the intermediate nnrss-group-data saved in >> ~/News/rss/* and either store the feeds as pure XML files, >> re-parsed on demand and available for further processing, or >> write out all the articles as mbox files after parsing the >> feeds, with the entries' fragments as MIME parts.) > I've not used nnrss myself, but reading the code, it seems like it's > storing all the data needed for Gnus to read an nnrss group in > `nnrss-group-data', so storing all the XML data in case somebody is > going to use it would require orders of magnitude more storage? In my practice (so far), not even one magnitude. Random on- disk sample: | -rw-r--r--. 1 root root 132680 Dec 30 06:56 Conan O=E2=80=99Brien Need= s A Friend.el | -rw-r--r--. 1 root root 471799 Dec 30 07:28 Conan O=E2=80=99Brien Need= s A Friend.xml | -rw-r--r--. 1 root root 72249 Dec 30 06:56 Doug Loves Movies.el | -rw-r--r--. 1 root root 245312 Dec 30 07:01 Doug Loves Movies.xml | -rw-r--r--. 1 root root 630495 Dec 30 06:56 ID10T with Chris Hardwick.= el | -rw-r--r--. 1 root root 2100500 Dec 27 21:30 ID10T with Chris Hardwick.= xml | -rw-r--r--. 1 root root 21754 Dec 30 06:56 Sprechen wir =C3=BCber Mor= d?! Der SWR2 True Crime Podcast.el | -rw-r--r--. 1 root root 47741 Dec 30 06:36 Sprechen wir =C3=BCber Mor= d?! Der SWR2 True Crime Podcast.xml | -rw-r--r--. 1 root root 93927 Dec 30 06:56 Stone Clearing With Richar= d Herring.el | -rw-r--r--. 1 root root 221040 Dec 30 07:25 Stone Clearing With Richar= d Herring.xml | -rw-r--r--. 1 root root 17002 Dec 30 06:56 Taskmaster The Podcast.el | -rw-r--r--. 1 root root 53080 Dec 30 07:28 Taskmaster The Podcast.xml | -rw-r--r--. 1 root root 265970 Dec 30 06:56 You Made It Weird with Pet= e Holmes.el | -rw-r--r--. 1 root root 650710 Dec 30 04:07 You Made It Weird with Pet= e Holmes.xml Even if the XML gets bloated when saved in nnrss-group-data (it holds one feed at most), IMHO almost all feeds will be small enough to be negligible in a typical Emacs/Gnus setup (the largest feed above holds data from February 2010 till now; usually feeds only contain the most recent x entries). > I think a way to implement this would be to add an nnrss variable that > says what "extra" XML fields to store -- like (nnrss-extra-fields > '(itunes:episodeType ...)). That would allow my use case. (In a major overhaul, another way to approach this could be a hook/function variable (con- figurable per group) that gets called in addition/in lieu of nnrss-request-article with the raw XML data and then has free rein to format the Gnus article as it wishes to.) From debbugs-submit-bounces@debbugs.gnu.org Wed Dec 30 23:33:12 2020 Received: (at 45477) by debbugs.gnu.org; 31 Dec 2020 04:33:12 +0000 Received: from localhost ([127.0.0.1]:51655 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kupeB-0005Gu-SS for submit@debbugs.gnu.org; Wed, 30 Dec 2020 23:33:12 -0500 Received: from quimby.gnus.org ([95.216.78.240]:44234) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kupe9-0005Gg-Fr for 45477@debbugs.gnu.org; Wed, 30 Dec 2020 23:33:10 -0500 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnus.org; s=20200322; h=Content-Type:MIME-Version:Message-ID:In-Reply-To:Date: References:Subject:Cc:To:From:Sender:Reply-To:Content-Transfer-Encoding: Content-ID:Content-Description:Resent-Date:Resent-From:Resent-Sender: Resent-To:Resent-Cc:Resent-Message-ID:List-Id:List-Help:List-Unsubscribe: List-Subscribe:List-Post:List-Owner:List-Archive; bh=cSeIIQZRQDSfUGNfrNlgIn0ye+4aaHsJqzlisduFYtw=; b=Xah8ZxyDikudyeIeSd66fmycOa GxEBrfL+Kkb0A/Tp026PbojRI7z81E6kxW/KtgpExqrlwTDfIVfh49vZicqOv4QyDZCEpOzjFMiMK eWO3X1cfu3OKKGUoZDgf13oilsQhLkcc41Upza/Nfe65EIGX60c9fmZ8NvZ4cdfCmbO0=; Received: from cm-84.212.202.86.getinternet.no ([84.212.202.86] helo=xo) by quimby.gnus.org with esmtpsa (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1kupe1-00027N-FV; Thu, 31 Dec 2020 05:33:03 +0100 From: Lars Ingebrigtsen To: Tim Landscheidt Subject: Re: bug#45477: 27.1; RFE: Make full RSS fragments available for nnrss servers References: <874kk7os1v.fsf@passepartout.tim-landscheidt.de> <87lfdi6bhi.fsf@gnus.org> <87wnx0mu9s.fsf@passepartout.tim-landscheidt.de> <87wnx0q9ls.fsf@gnus.org> <87czyrn1oo.fsf@passepartout.tim-landscheidt.de> Face: iVBORw0KGgoAAAANSUhEUgAAADAAAAAwBAMAAAClLOS0AAAABGdBTUEAALGPC/xhBQAAACBj SFJNAAB6JgAAgIQAAPoAAACA6AAAdTAAAOpgAAA6mAAAF3CculE8AAAAGFBMVEUEAQIvFRxjJjBu OEeVKzm6SVTUja3///9ttdmYAAAAAWJLR0QHFmGI6wAAAAd0SU1FB+QMHwQdC+M8RLQAAAGwSURB VDjLvdNdcpswEAdwyekBWJIDgCDvDSIHsLzKe1utboDuf4TufyG20xnPtC9dj7HNj/2QwM79zyDv ibp/gb+Oh8kP6xLiIdyS/PVrQIw2bmfX/QnejhRGfwfT0vl+CovzfVzoClOMifo+Rh5OwYCODOZL mPTISx85jscSDDjMOKaehQfnJojCrKdWPc+XPguP5KPBSKjFFpPCG/n3MQC6nnPZIWbJZ/I1TAPA z3JAUkh0qkt8A9AVLlkkjTdAT+as71W2RE81ymKldsB7zfKTnmusi0uBDqgoluWHwnuNCqELWZvk K7xUadGx7mp4Fcm1YgIx2NovhTnNOkxurVjGWaEB0oyfBoX1ddYerW0Kdl0tClLKyt+p38H2Q1qp m34CPKABBCAZkCYe7kCktg0js97KjugTmkHBuLrd7oAZUABYfRicp73UC/O2Axauj5CnyeAbVnsP p2k1OCnkIgcohXkHD9gU+DNmbgCHUl9gzTtgrE3qDRiwOR0LIHeAFsVp94w68jWBAWx17FFFfCBB weOagl1iuwUATXfozgD76wQDlHXoHj4O0KcVgLzf7ojGH+OStcgAAAAldEVYdGRhdGU6Y3JlYXRl ADIwMjAtMTItMzFUMDQ6Mjk6MTArMDA6MDBI12lTAAAAJXRFWHRkYXRlOm1vZGlmeQAyMDIwLTEy LTMxVDA0OjI5OjEwKzAwOjAwOYrR7wAAAABJRU5ErkJggg== X-Now-Playing: The Cure's _Pornography (1)_: "Siamese Twins" Date: Thu, 31 Dec 2020 05:33:00 +0100 In-Reply-To: <87czyrn1oo.fsf@passepartout.tim-landscheidt.de> (Tim Landscheidt's message of "Wed, 30 Dec 2020 08:22:15 +0000") Message-ID: <87mtxuoarn.fsf@gnus.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain X-Spam-Report: Spam detection software, running on the system "quimby.gnus.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see @@CONTACT_ADDRESS@@ for details. Content preview: Tim Landscheidt writes: > Even if the XML gets bloated when saved in nnrss-group-data > (it holds one feed at most), IMHO almost all feeds will be > small enough to be negligible in a typical Emacs/Gnus setup > (the largest [...] Content analysis details: (-2.9 points, 5.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -1.0 ALL_TRUSTED Passed through trusted hosts only via SMTP -1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1% [score: 0.0000] X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 45477 Cc: 45477@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) Tim Landscheidt writes: > Even if the XML gets bloated when saved in nnrss-group-data > (it holds one feed at most), IMHO almost all feeds will be > small enough to be negligible in a typical Emacs/Gnus setup > (the largest feed above holds data from February 2010 till > now; usually feeds only contain the most recent x entries). Doesn't nnrss-group-data store older entries, though? I just skimmed the nnrss code, and I didn't see any pruning... >> I think a way to implement this would be to add an nnrss variable that >> says what "extra" XML fields to store -- like (nnrss-extra-fields >> '(itunes:episodeType ...)). > > That would allow my use case. Patches welcome. :-) -- (domestic pets only, the antidote for overdose, milk.) bloggy blog: http://lars.ingebrigtsen.no From debbugs-submit-bounces@debbugs.gnu.org Fri Jan 01 12:09:07 2021 Received: (at 45477) by debbugs.gnu.org; 1 Jan 2021 17:09:07 +0000 Received: from localhost ([127.0.0.1]:34676 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kvNvH-0007sU-Ba for submit@debbugs.gnu.org; Fri, 01 Jan 2021 12:09:07 -0500 Received: from andalucia.tim-landscheidt.de ([116.203.78.250]:34600) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kvNvF-0007sL-5a for 45477@debbugs.gnu.org; Fri, 01 Jan 2021 12:09:05 -0500 Received: from dslb-090-186-126-124.090.186.pools.vodafone-ip.de ([90.186.126.124]:58136 helo=passepartout.tim-landscheidt.de) by andalucia.tim-landscheidt.de with esmtpsa (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.89) (envelope-from ) id 1kvNvC-0000kq-HX; Fri, 01 Jan 2021 18:09:02 +0100 From: Tim Landscheidt To: Lars Ingebrigtsen Subject: Re: bug#45477: 27.1; RFE: Make full RSS fragments available for nnrss servers Organization: http://www.tim-landscheidt.de/ References: <874kk7os1v.fsf@passepartout.tim-landscheidt.de> <87lfdi6bhi.fsf@gnus.org> <87wnx0mu9s.fsf@passepartout.tim-landscheidt.de> <87wnx0q9ls.fsf@gnus.org> <87czyrn1oo.fsf@passepartout.tim-landscheidt.de> <87mtxuoarn.fsf@gnus.org> Date: Fri, 01 Jan 2021 17:09:02 +0000 In-Reply-To: <87mtxuoarn.fsf@gnus.org> (Lars Ingebrigtsen's message of "Thu, 31 Dec 2020 05:33:00 +0100") Message-ID: <87k0swinyp.fsf@passepartout.tim-landscheidt.de> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.1 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 45477 Cc: 45477@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) Lars Ingebrigtsen wrote: >> Even if the XML gets bloated when saved in nnrss-group-data >> (it holds one feed at most), IMHO almost all feeds will be >> small enough to be negligible in a typical Emacs/Gnus setup >> (the largest feed above holds data from February 2010 till >> now; usually feeds only contain the most recent x entries). > Doesn't nnrss-group-data store older entries, though? I just skimmed > the nnrss code, and I didn't see any pruning... I assumed that was done by the normal expire process, but I didn't look deeper into that. >>> I think a way to implement this would be to add an nnrss variable that >>> says what "extra" XML fields to store -- like (nnrss-extra-fields >>> '(itunes:episodeType ...)). >> That would allow my use case. > Patches welcome. :-) Well, in that case I rather work on a new, clean, shiny backend that accepts Atom and RSS feeds and does everything The Right Way=E2=84=A2 :-). From debbugs-submit-bounces@debbugs.gnu.org Sat Jan 02 00:52:30 2021 Received: (at 45477) by debbugs.gnu.org; 2 Jan 2021 05:52:30 +0000 Received: from localhost ([127.0.0.1]:38888 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kvZq2-0000qh-7V for submit@debbugs.gnu.org; Sat, 02 Jan 2021 00:52:30 -0500 Received: from quimby.gnus.org ([95.216.78.240]:35756) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1kvZq0-0000qR-6L for 45477@debbugs.gnu.org; Sat, 02 Jan 2021 00:52:28 -0500 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnus.org; s=20200322; h=Content-Transfer-Encoding:Content-Type:MIME-Version:Message-ID :In-Reply-To:Date:References:Subject:Cc:To:From:Sender:Reply-To:Content-ID: Content-Description:Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc :Resent-Message-ID:List-Id:List-Help:List-Unsubscribe:List-Subscribe: List-Post:List-Owner:List-Archive; bh=JJZZegjWhTWkEIZbUXzrsnMKPMsYRhWVUV3GaA77USM=; b=fFP6VXDJ7QbIPBITKFIY0zB6ip ALJ4sPrd0ZULgnpGSAQvInNMBIQAeeW/Bc2ZgAKrRzBHNj+oM2VatcalQiBPoDR2Uq8H6dRl+0lPj 4nz40bszOK4GpY4ZtN0BSQ/wK5itDl8otZnqWHTVdwJHkf29M+xTTuBxw6p0huzjYmW8=; Received: from cm-84.212.202.86.getinternet.no ([84.212.202.86] helo=xo) by quimby.gnus.org with esmtpsa (TLS1.3:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.92) (envelope-from ) id 1kvZpr-0006n0-QU; Sat, 02 Jan 2021 06:52:22 +0100 From: Lars Ingebrigtsen To: Tim Landscheidt Subject: Re: bug#45477: 27.1; RFE: Make full RSS fragments available for nnrss servers References: <874kk7os1v.fsf@passepartout.tim-landscheidt.de> <87lfdi6bhi.fsf@gnus.org> <87wnx0mu9s.fsf@passepartout.tim-landscheidt.de> <87wnx0q9ls.fsf@gnus.org> <87czyrn1oo.fsf@passepartout.tim-landscheidt.de> <87mtxuoarn.fsf@gnus.org> <87k0swinyp.fsf@passepartout.tim-landscheidt.de> Face: iVBORw0KGgoAAAANSUhEUgAAADAAAAAwBAMAAAClLOS0AAAABGdBTUEAALGPC/xhBQAAACBj SFJNAAB6JgAAgIQAAPoAAACA6AAAdTAAAOpgAAA6mAAAF3CculE8AAAAElBMVEXFmHT7+/vX1NS3 raOHW1v////nY4s6AAAAAWJLR0QF+G/pxwAAAAd0SU1FB+UBAgUgDIfNa8gAAAGlSURBVDjLrZRZ tuQgCIZTqWwAdAENZgHlsIA47H9NDZpUmTs89Dn9H1/CF0ABXZZ/1uPxWB/hLRY51UJ3QRciLvCL BkBmZGIgCXR6DWBCjNFHjCE652eP6ELwESK74IIYSAD6HspRjH1LLKHENsBXGQVQsLbaZnmjoQJB +yJCQwIQTGson60dfWVAK8AZgC3zXnBvy6YLgHcJZRDAEhnZ0LEssgqAZ92V7NSSnLuDRUGhfg5u 3motwgWweQWQUrYE6MoFTE0KTLbFyv/M5gS22qygJQGufHJY3bHkqFvbpUVvj5qOVy9J5T2Nlo5z 1xLTSB5faS5IfcbU+2Hdn9SDbFkKLhGe4dmBqROostZXHh1M2wWwlgL1aGdrTXp7PH2GWukEiHaA cmzHUQpeAPgEOax19YU/YF/OHDlKjhlcyWPxpvJ7rqTyx0huvGk8A1Ci5zAyCTegRD2adIY+I6r/ tGPLageeQB9GKZ/av3ko6SYzT/sYWZo+rhvF5x3j+40SIPOP1Bt5A6gW7GNHd49fLudPgL5Jg8o9 X9ewdk2vQ38Z/p/+Ahx1dyBmyh8EAAAAJXRFWHRkYXRlOmNyZWF0ZQAyMDIxLTAxLTAyVDA1OjMy OjEyKzAwOjAwkPuzkQAAACV0RVh0ZGF0ZTptb2RpZnkAMjAyMS0wMS0wMlQwNTozMjoxMiswMDow MOGmCy0AAAAASUVORK5CYII= X-Now-Playing: Simple Minds's _Sparkle In The Rain_: "The Kick Inside Of Me" Date: Sat, 02 Jan 2021 06:52:18 +0100 In-Reply-To: <87k0swinyp.fsf@passepartout.tim-landscheidt.de> (Tim Landscheidt's message of "Fri, 01 Jan 2021 17:09:02 +0000") Message-ID: <878s9bansd.fsf@gnus.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Spam-Report: Spam detection software, running on the system "quimby.gnus.org", has NOT identified this incoming email as spam. The original message has been attached to this so you can view it or label similar future email. If you have any questions, see @@CONTACT_ADDRESS@@ for details. Content preview: Tim Landscheidt writes: > Well, in that case I rather work on a new, clean, shiny > backend that accepts Atom and RSS feeds and does everything > The Right Way™ :-). That sounds more fun. :-) But nnrss users would probably prefer that nnrss gets improved instead of getting a new backend that does kinda sorta the same thing, though. Content analysis details: (-2.9 points, 5.0 required) pts rule name description ---- ---------------------- -------------------------------------------------- -1.0 ALL_TRUSTED Passed through trusted hosts only via SMTP -1.9 BAYES_00 BODY: Bayes spam probability is 0 to 1% [score: 0.0000] X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 45477 Cc: 45477@debbugs.gnu.org X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) Tim Landscheidt writes: > Well, in that case I rather work on a new, clean, shiny > backend that accepts Atom and RSS feeds and does everything > The Right Way=E2=84=A2 :-). That sounds more fun. :-) But nnrss users would probably prefer that nnrss gets improved instead of getting a new backend that does kinda sorta the same thing, though. --=20 (domestic pets only, the antidote for overdose, milk.) bloggy blog: http://lars.ingebrigtsen.no