From unknown Fri Jun 13 10:53:09 2025 X-Loop: help-debbugs@gnu.org Subject: bug#24496: offloading should fall back to local build after n tries Resent-From: ng0 Original-Sender: "Debbugs-submit" Resent-CC: bug-guix@gnu.org Resent-Date: Wed, 21 Sep 2016 15:41:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: report 24496 X-GNU-PR-Package: guix X-GNU-PR-Keywords: To: 24496@debbugs.gnu.org X-Debbugs-Original-To: bug-guix@gnu.org Received: via spool by submit@debbugs.gnu.org id=B.147447241810547 (code B ref -1); Wed, 21 Sep 2016 15:41:02 +0000 Received: (at submit) by debbugs.gnu.org; 21 Sep 2016 15:40:18 +0000 Received: from localhost ([127.0.0.1]:59729 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1bmjdB-0002k2-Ur for submit@debbugs.gnu.org; Wed, 21 Sep 2016 11:40:18 -0400 Received: from eggs.gnu.org ([208.118.235.92]:51847) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1bme17-0007LF-H9 for submit@debbugs.gnu.org; Wed, 21 Sep 2016 05:40:37 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1bme11-0001BM-Bi for submit@debbugs.gnu.org; Wed, 21 Sep 2016 05:40:32 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=0.8 required=5.0 tests=BAYES_50,FREEMAIL_FROM autolearn=disabled version=3.3.2 Received: from lists.gnu.org ([2001:4830:134:3::11]:36936) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1bme11-0001BG-9F for submit@debbugs.gnu.org; Wed, 21 Sep 2016 05:40:31 -0400 Received: from eggs.gnu.org ([2001:4830:134:3::10]:42811) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1bme10-0002RY-07 for bug-guix@gnu.org; Wed, 21 Sep 2016 05:40:30 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1bme0v-00019e-VW for bug-guix@gnu.org; Wed, 21 Sep 2016 05:40:29 -0400 Received: from aibo.runbox.com ([91.220.196.211]:35375) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1bme0v-000184-PB for bug-guix@gnu.org; Wed, 21 Sep 2016 05:40:25 -0400 Received: from [10.9.9.210] (helo=mailfront10.runbox.com) by bars.runbox.com with esmtp (Exim 4.71) (envelope-from ) id 1bme0p-0005Ny-Rz for bug-guix@gnu.org; Wed, 21 Sep 2016 11:40:19 +0200 Received: from xd9bb8cb8.dyn.telefonica.de ([217.187.140.184] helo=localhost) by mailfront10.runbox.com with esmtpsa (uid:892961 ) (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) id 1bme0L-0006RN-3W for bug-guix@gnu.org; Wed, 21 Sep 2016 11:39:49 +0200 From: ng0 Date: Wed, 21 Sep 2016 09:39:48 +0000 Message-ID: <8760ppr3q3.fsf@we.make.ritual.n0.is> MIME-Version: 1.0 Content-Type: text/plain X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x X-Received-From: 2001:4830:134:3::11 X-Spam-Score: -5.0 (-----) X-Mailman-Approved-At: Wed, 21 Sep 2016 11:40:16 -0400 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -5.0 (-----) When I forgot that my build machine is offline and I did not pass --no-build-hook, the offloading keeps trying forever until I had to cancel the build, boot the build-machine and started the build again. A solution could be a config option or default behavior which after failing to offload for n times gives up and uses the local builder. Is this desired at all? Setups like hydra could get problems, but for small setups with the same architecture there could be a solution beyond --no-build-hook? -- ng0 From unknown Fri Jun 13 10:53:09 2025 X-Loop: help-debbugs@gnu.org Subject: bug#24496: offloading should fall back to local build after n tries Resent-From: ludo@gnu.org (Ludovic =?UTF-8?Q?Court=C3=A8s?=) Original-Sender: "Debbugs-submit" Resent-CC: bug-guix@gnu.org Resent-Date: Mon, 26 Sep 2016 15:51:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 24496 X-GNU-PR-Package: guix X-GNU-PR-Keywords: To: ng0 Cc: 24496@debbugs.gnu.org Received: via spool by 24496-submit@debbugs.gnu.org id=B24496.147490501710912 (code B ref 24496); Mon, 26 Sep 2016 15:51:01 +0000 Received: (at 24496) by debbugs.gnu.org; 26 Sep 2016 15:50:17 +0000 Received: from localhost ([127.0.0.1]:36572 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1boYAb-0002pw-5a for submit@debbugs.gnu.org; Mon, 26 Sep 2016 11:50:17 -0400 Received: from mail2-relais-roc.national.inria.fr ([192.134.164.83]:27149) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1boYAY-0002ph-8w for 24496@debbugs.gnu.org; Mon, 26 Sep 2016 11:50:15 -0400 X-IronPort-AV: E=Sophos;i="5.30,400,1470693600"; d="scan'208";a="238321904" Received: from smb-adpcdg1-06.hotspot.hub-one.net (HELO pluto) ([213.174.99.134]) by mail2-relais-roc.national.inria.fr with ESMTP/TLS/AES256-GCM-SHA384; 26 Sep 2016 17:50:07 +0200 From: ludo@gnu.org (Ludovic =?UTF-8?Q?Court=C3=A8s?=) In-Reply-To: <8760ppr3q3.fsf@we.make.ritual.n0.is> (ng0's message of "Wed, 21 Sep 2016 09:39:48 +0000") Date: Mon, 26 Sep 2016 18:20:51 +0900 Message-ID: <87r387nhjg.fsf@gnu.org> References: <8760ppr3q3.fsf@we.make.ritual.n0.is> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/25.1 (gnu/linux) X-URL: http://www.fdn.fr/~lcourtes/ X-Revolutionary-Date: 5 =?UTF-8?Q?Vend=C3=A9miaire?= an 225 de la =?UTF-8?Q?R=C3=A9volution?= X-PGP-Key-ID: 0x090B11993D9AEBB5 X-PGP-Key: http://www.fdn.fr/~lcourtes/ludovic.asc X-PGP-Fingerprint: 3CE4 6455 8A84 FDC6 9DB4 0CFB 090B 1199 3D9A EBB5 X-OS: x86_64-unknown-linux-gnu MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Spam-Score: -2.9 (--) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -2.9 (--) Hello! ng0 skribis: > When I forgot that my build machine is offline and I did not pass > --no-build-hook, the offloading keeps trying forever until I had to > cancel the build, boot the build-machine and started the build again. > > A solution could be a config option or default behavior which after > failing to offload for n times gives up and uses the local builder. > > Is this desired at all? Setups like hydra could get problems, but for > small setups with the same architecture there could be a solution beyond > --no-build-hook? Like you say, on Hydra-style setup this could be a problem: the front-end machine may have --max-jobs=3D0, meaning that it cannot perform builds on its own. So I guess we would need a command-line option to select a different behavior. I=E2=80=99m not sure how to do that because =E2=80=98guix offloa= d=E2=80=99 is =E2=80=9Chidden=E2=80=9D behind =E2=80=98guix-daemon=E2=80=99, so there=E2= =80=99s no obvious place for such an option. In the meantime, you could also hack up your machines.scm: it would return a list where unreachable machines have been filtered out. Ludo=E2=80=99. From unknown Fri Jun 13 10:53:09 2025 X-Loop: help-debbugs@gnu.org Subject: bug#24496: offloading should fall back to local build after n tries Resent-From: ng0 Original-Sender: "Debbugs-submit" Resent-CC: bug-guix@gnu.org Resent-Date: Tue, 04 Oct 2016 17:10:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 24496 X-GNU-PR-Package: guix X-GNU-PR-Keywords: To: Ludovic =?UTF-8?Q?Court=C3=A8s?= Cc: 24496@debbugs.gnu.org Received: via spool by 24496-submit@debbugs.gnu.org id=B24496.147560094915455 (code B ref 24496); Tue, 04 Oct 2016 17:10:01 +0000 Received: (at 24496) by debbugs.gnu.org; 4 Oct 2016 17:09:09 +0000 Received: from localhost ([127.0.0.1]:44808 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1brTDI-00041C-LP for submit@debbugs.gnu.org; Tue, 04 Oct 2016 13:09:08 -0400 Received: from aibo.runbox.com ([91.220.196.211]:44400) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1brTDH-000413-E2 for 24496@debbugs.gnu.org; Tue, 04 Oct 2016 13:09:08 -0400 Received: from [10.9.9.212] (helo=mailfront12.runbox.com) by bars.runbox.com with esmtp (Exim 4.71) (envelope-from ) id 1brTDF-0001CG-Qg; Tue, 04 Oct 2016 19:09:05 +0200 Received: from x5d83ef73.dyn.telefonica.de ([93.131.239.115] helo=localhost) by mailfront12.runbox.com with esmtpsa (uid:892961 ) (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) id 1brTD9-0004tx-IW; Tue, 04 Oct 2016 19:08:59 +0200 From: ng0 In-Reply-To: <87r387nhjg.fsf@gnu.org> References: <8760ppr3q3.fsf@we.make.ritual.n0.is> <87r387nhjg.fsf@gnu.org> Date: Tue, 04 Oct 2016 17:08:58 +0000 Message-ID: <87vax8nis5.fsf@we.make.ritual.n0.is> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-Spam-Score: -0.7 (/) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.7 (/) Ludovic Courtès writes: > Hello! > > ng0 skribis: > >> When I forgot that my build machine is offline and I did not pass >> --no-build-hook, the offloading keeps trying forever until I had to >> cancel the build, boot the build-machine and started the build again. >> >> A solution could be a config option or default behavior which after >> failing to offload for n times gives up and uses the local builder. >> >> Is this desired at all? Setups like hydra could get problems, but for >> small setups with the same architecture there could be a solution beyond >> --no-build-hook? > > Like you say, on Hydra-style setup this could be a problem: the > front-end machine may have --max-jobs=0, meaning that it cannot perform > builds on its own. > > So I guess we would need a command-line option to select a different > behavior. I’m not sure how to do that because ‘guix offload’ is > “hidden” behind ‘guix-daemon’, so there’s no obvious place for such an > option. Could the daemon run with --enable-hydra-style or --disable-hydra-style and --disable-hydra-style would allow falling back to local build if after a defined time - keeping slow connections in mind - the machine did not reply. > In the meantime, you could also hack up your machines.scm: it would > return a list where unreachable machines have been filtered out. How can I achieve this? And to append to this bug: it seems to me that offloading requires 1 lsh-key for each build-machine. (https://lists.gnu.org/archive/html/help-guix/2016-10/msg00007.html) and that you can not directly address them (say I want to create some system where I want to build on machine 1 AND machine 2. Having 2 x86_64 in machines.scm only selects one of them (if 2 were working, see linked thread) and builds on the one which is accessible first. If however the first machine is somehow blocked and it fails, therefore terminates lsh connection, the build does not happen at all. Leaving out the problems, what I want to do in short: How could I build on both systems at the same time when I desire to do so? > Ludo’. > -- From unknown Fri Jun 13 10:53:09 2025 X-Loop: help-debbugs@gnu.org Subject: bug#24496: offloading should fall back to local build after n tries Resent-From: ludo@gnu.org (Ludovic =?UTF-8?Q?Court=C3=A8s?=) Original-Sender: "Debbugs-submit" Resent-CC: bug-guix@gnu.org Resent-Date: Wed, 05 Oct 2016 11:37:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 24496 X-GNU-PR-Package: guix X-GNU-PR-Keywords: To: ng0 Cc: 24496@debbugs.gnu.org Received: via spool by 24496-submit@debbugs.gnu.org id=B24496.147566739424451 (code B ref 24496); Wed, 05 Oct 2016 11:37:01 +0000 Received: (at 24496) by debbugs.gnu.org; 5 Oct 2016 11:36:34 +0000 Received: from localhost ([127.0.0.1]:45221 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1brkV0-0006MI-0E for submit@debbugs.gnu.org; Wed, 05 Oct 2016 07:36:34 -0400 Received: from eggs.gnu.org ([208.118.235.92]:52119) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1brkUx-0006M3-UN for 24496@debbugs.gnu.org; Wed, 05 Oct 2016 07:36:32 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1brkUp-0005gy-Jz for 24496@debbugs.gnu.org; Wed, 05 Oct 2016 07:36:26 -0400 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-1.8 required=5.0 tests=BAYES_50,RP_MATCHES_RCVD autolearn=disabled version=3.3.2 Received: from fencepost.gnu.org ([2001:4830:134:3::e]:34659) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1brkUp-0005gp-Gw; Wed, 05 Oct 2016 07:36:23 -0400 Received: from reverse-83.fdn.fr ([80.67.176.83]:48446 helo=pluto) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1brkUo-00030Q-OY; Wed, 05 Oct 2016 07:36:23 -0400 From: ludo@gnu.org (Ludovic =?UTF-8?Q?Court=C3=A8s?=) References: <8760ppr3q3.fsf@we.make.ritual.n0.is> <87r387nhjg.fsf@gnu.org> <87vax8nis5.fsf@we.make.ritual.n0.is> X-URL: http://www.fdn.fr/~lcourtes/ X-Revolutionary-Date: 14 =?UTF-8?Q?Vend=C3=A9miaire?= an 225 de la =?UTF-8?Q?R=C3=A9volution?= X-PGP-Key-ID: 0x090B11993D9AEBB5 X-PGP-Key: http://www.fdn.fr/~lcourtes/ludovic.asc X-PGP-Fingerprint: 3CE4 6455 8A84 FDC6 9DB4 0CFB 090B 1199 3D9A EBB5 X-OS: x86_64-unknown-linux-gnu Date: Wed, 05 Oct 2016 13:36:20 +0200 In-Reply-To: <87vax8nis5.fsf@we.make.ritual.n0.is> (ng0's message of "Tue, 04 Oct 2016 17:08:58 +0000") Message-ID: <87a8ej81u3.fsf@gnu.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/25.1 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 2001:4830:134:3::e X-Spam-Score: -7.7 (-------) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -7.7 (-------) ng0 skribis: > Ludovic Court=C3=A8s writes: [...] >> Like you say, on Hydra-style setup this could be a problem: the >> front-end machine may have --max-jobs=3D0, meaning that it cannot perform >> builds on its own. >> >> So I guess we would need a command-line option to select a different >> behavior. I=E2=80=99m not sure how to do that because =E2=80=98guix off= load=E2=80=99 is >> =E2=80=9Chidden=E2=80=9D behind =E2=80=98guix-daemon=E2=80=99, so there= =E2=80=99s no obvious place for such an >> option. > > Could the daemon run with --enable-hydra-style or --disable-hydra-style > and --disable-hydra-style would allow falling back to local build if > after a defined time - keeping slow connections in mind - the machine > did not reply. That would be too ad-hoc IMO, and the problem mentioned above remains. >> In the meantime, you could also hack up your machines.scm: it would >> return a list where unreachable machines have been filtered out. > > How can I achieve this? Something like: (define the-machine (build-machine =E2=80=A6)) (if (managed-to-connect-timely the-machine) (list the-machine) '()) =E2=80=A6 where =E2=80=98managed-to-connect-timely=E2=80=99 would try to co= nnect to the machine with a timeout. > And to append to this bug: it seems to me that offloading requires 1 > lsh-key for each > build-machine. The main machine needs to be able to connect to each build machine over SSH, so indeed, that requires proper SSH key registration (host keys and authorized user keys). > (https://lists.gnu.org/archive/html/help-guix/2016-10/msg00007.html) > and that you can not directly address them (say I want to create some > system where I want to build on machine 1 AND machine 2. Having 2 > x86_64 in machines.scm only selects one of them (if 2 were working, > see linked thread) and builds on the one which is accessible first. If > however the first machine is somehow blocked and it fails, therefore > terminates lsh connection, the build does not happen at all. The code that selects machines is in (guix scripts offload), specifically =E2=80=98choose-build-machine=E2=80=99. It tries to choose th= e =E2=80=9Cbest=E2=80=9D machine, which means, roughly, the fastest and least loaded one. HTH, Ludo=E2=80=99. From unknown Fri Jun 13 10:53:09 2025 X-Loop: help-debbugs@gnu.org Subject: bug#24496: offloading should fall back to local build after n tries Resent-From: zimoun Original-Sender: "Debbugs-submit" Resent-CC: bug-guix@gnu.org Resent-Date: Thu, 16 Dec 2021 13:02:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 24496 X-GNU-PR-Package: guix X-GNU-PR-Keywords: To: ludo@gnu.org (Ludovic =?UTF-8?Q?Court=C3=A8s?=) Cc: 24496@debbugs.gnu.org, ng0 Received: via spool by 24496-submit@debbugs.gnu.org id=B24496.163965971824811 (code B ref 24496); Thu, 16 Dec 2021 13:02:01 +0000 Received: (at 24496) by debbugs.gnu.org; 16 Dec 2021 13:01:58 +0000 Received: from localhost ([127.0.0.1]:34928 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mxqOT-0006S6-IZ for submit@debbugs.gnu.org; Thu, 16 Dec 2021 08:01:58 -0500 Received: from mail-wm1-f53.google.com ([209.85.128.53]:50824) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mxqOR-0006Rt-6y for 24496@debbugs.gnu.org; Thu, 16 Dec 2021 08:01:55 -0500 Received: by mail-wm1-f53.google.com with SMTP id b73so3407629wmd.0 for <24496@debbugs.gnu.org>; Thu, 16 Dec 2021 05:01:55 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=from:to:cc:subject:references:date:in-reply-to:message-id :user-agent:mime-version:content-transfer-encoding; bh=0de+Rd5rou7k59jgL47noiSwLyfEtWE+eXI/Zwhj3ww=; b=H4qYdqj/wDvWB+k47xkzPGS5dDPej6Qx9MmimEGBZjyjLNR2tnl+02X7uf9CGmL2fz usurrAnKHo/3TPmSYgD1H7tbvH4r476U2NEiiInCIKEYXtD+MrvRAFRYgHN2HGuL2GCe n+xhVOCOX1qrVvLwCh6V8gkJQTZNNFO74Zip1eW4L5DsFdFsZmuSVKrfRk+TSTNpLGAp ulaT5p7HnnRkyaboSkn0lZGOX2MMupmHuBhSP2/fpsgxc0QzpAuutoOL3kBNHRwX/vn4 ZQF3N3ibfoHvvE5oAawo63F6E9hjoXfF4Fo8xqakQBa9I3MMzzql0SGFE2ZMv2dvFHS6 6oPg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:from:to:cc:subject:references:date:in-reply-to :message-id:user-agent:mime-version:content-transfer-encoding; bh=0de+Rd5rou7k59jgL47noiSwLyfEtWE+eXI/Zwhj3ww=; b=uaCr2v/AKS2qTP9xsJZGJIR74ALCbcEF5Ll3rYHav3AB2+zG5jNbyZLS/wbcu1TDPa cVnSbn2huc+PK4fK7yqB8/Z7fgEU4P4k2Ovj3t0cXWph6tRfIA8FTbZ5FJTMXOaaFaS5 mapP6hHvBAXOlbhQ7XWa8Edjfx9zer4CMz1k8Dwy0IBnuOLSna5Oo4iqTiL0wPCwN+bG iE+XGsLmPXvZDuPW8FsJDvey+D1kK/ZpamJg0T1BdLP3+6kP1B3l2+urDAQmfZ7xNQH+ 8dc14kypayl/JBqXnTk5uBVu2xZj76IVHnSGPNJ2CuSnxAA7+uv6YX8wbsGxZZxS4C6o JLDw== X-Gm-Message-State: AOAM532UUSnqvkc/Ww/H94ZpPZJ10P+2ql9PuUVtgD5Fx7HDdpAkpUJK +zxQakJ2SIinaTJ5c4RpXFyeUBX8AGg= X-Google-Smtp-Source: ABdhPJymijOG7fUiCS7nl3dC5U9QRZTUs/su2HuHFwyX9yWmOj3iSIxiF+ryOa9PqiRQqjsKiz6XEw== X-Received: by 2002:a05:600c:b42:: with SMTP id k2mr4868501wmr.6.1639659709373; Thu, 16 Dec 2021 05:01:49 -0800 (PST) Received: from lili ([2a01:e0a:59b:9120:65d2:2476:f637:db1e]) by smtp.gmail.com with ESMTPSA id z17sm1272537wmi.22.2021.12.16.05.01.48 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 16 Dec 2021 05:01:49 -0800 (PST) From: zimoun References: <8760ppr3q3.fsf@we.make.ritual.n0.is> <87r387nhjg.fsf@gnu.org> Date: Thu, 16 Dec 2021 13:52:14 +0100 In-Reply-To: <87r387nhjg.fsf@gnu.org> ("Ludovic =?UTF-8?Q?Court=C3=A8s?="'s message of "Mon, 26 Sep 2016 18:20:51 +0900") Message-ID: <868rwkiuf5.fsf@gmail.com> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.1 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Spam-Score: 0.0 (/) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) Hi, I am just hitting this old bug#24496 [1]. On Mon, 26 Sep 2016 at 18:20, ludo@gnu.org (Ludovic Court=C3=A8s) wrote: > ng0 skribis: > >> When I forgot that my build machine is offline and I did not pass >> --no-build-hook, the offloading keeps trying forever until I had to >> cancel the build, boot the build-machine and started the build again. [...] > Like you say, on Hydra-style setup this could be a problem: the > front-end machine may have --max-jobs=3D0, meaning that it cannot perform > builds on its own. > > So I guess we would need a command-line option to select a different > behavior. I=E2=80=99m not sure how to do that because =E2=80=98guix offl= oad=E2=80=99 is > =E2=80=9Chidden=E2=80=9D behind =E2=80=98guix-daemon=E2=80=99, so there= =E2=80=99s no obvious place for such an > option. When the build machine used to offload is offline and the master daemon is --max-jobs=3D0, I expect X tries (leading to timeout) and then just fails with a hint, where X is defined by user. WDYT? > In the meantime, you could also hack up your machines.scm: it would > return a list where unreachable machines have been filtered out. Maybe, this could be done by =E2=80=9Cguix offload=E2=80=9D. Cheers, simon 1: From unknown Fri Jun 13 10:53:09 2025 X-Loop: help-debbugs@gnu.org Subject: bug#24496: offloading should fall back to local build after n tries Resent-From: Ludovic =?UTF-8?Q?Court=C3=A8s?= Original-Sender: "Debbugs-submit" Resent-CC: bug-guix@gnu.org Resent-Date: Fri, 17 Dec 2021 15:34:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 24496 X-GNU-PR-Package: guix X-GNU-PR-Keywords: To: zimoun Cc: 24496@debbugs.gnu.org, Maxim Cournoyer , ng0 Received: via spool by 24496-submit@debbugs.gnu.org id=B24496.16397552395550 (code B ref 24496); Fri, 17 Dec 2021 15:34:02 +0000 Received: (at 24496) by debbugs.gnu.org; 17 Dec 2021 15:33:59 +0000 Received: from localhost ([127.0.0.1]:40447 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1myFF8-0001RS-SD for submit@debbugs.gnu.org; Fri, 17 Dec 2021 10:33:59 -0500 Received: from eggs.gnu.org ([209.51.188.92]:56642) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1myFF7-0001RE-3I for 24496@debbugs.gnu.org; Fri, 17 Dec 2021 10:33:57 -0500 Received: from [2001:470:142:3::e] (port=50734 helo=fencepost.gnu.org) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1myFF0-00051r-B2; Fri, 17 Dec 2021 10:33:51 -0500 DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnu.org; s=fencepost-gnu-org; h=MIME-Version:In-Reply-To:Date:References:Subject:To: From; bh=DKgqktBBZ1nlyknqH8Xnc7cvP1OEaieT3zWjem+gaXI=; b=l0go/6mPKb25TgJk2LQ3 Fi4sWOw0b9f5xLAKzv96FdUPRZ86KKCew93xhB1/AKGWMGlsqkXAvDICOX5c67EHLJt+SZEK94nMy sZ1nUTlrnerztjsOshLHpMpk0hDfKL1JQM+5KtUzk4Pk3i+cziLoW3n+9bBVpOGnL8842hNakU5CL Y4UsrO+UJDe7SJAmo39t0ND/EoWAwYnw6gc1bVCBNkap2fRew7lix5hHLIuPFGWg85O65jJy4tw2a CvSKVnbDihZvePzgUhzGW6HLykv/Hnwq7ydv8RE9bG7NcFwWLx0FfaDuz9tlYywtyPvyOne+Uk8iL ofAJRZ8P19aTpg==; Received: from [193.50.110.211] (port=43476 helo=ribbon) by fencepost.gnu.org with esmtpsa (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1myFEz-0001pX-2T; Fri, 17 Dec 2021 10:33:49 -0500 From: Ludovic =?UTF-8?Q?Court=C3=A8s?= References: <8760ppr3q3.fsf@we.make.ritual.n0.is> <87r387nhjg.fsf@gnu.org> <868rwkiuf5.fsf@gmail.com> X-URL: http://www.fdn.fr/~lcourtes/ X-Revolutionary-Date: 27 Frimaire an 230 de la =?UTF-8?Q?R=C3=A9volution?= X-PGP-Key-ID: 0x090B11993D9AEBB5 X-PGP-Key: http://www.fdn.fr/~lcourtes/ludovic.asc X-PGP-Fingerprint: 3CE4 6455 8A84 FDC6 9DB4 0CFB 090B 1199 3D9A EBB5 X-OS: x86_64-pc-linux-gnu Date: Fri, 17 Dec 2021 16:33:46 +0100 In-Reply-To: <868rwkiuf5.fsf@gmail.com> (zimoun's message of "Thu, 16 Dec 2021 13:52:14 +0100") Message-ID: <878rwjqm91.fsf@gnu.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.2 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Spam-Score: -2.3 (--) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -3.3 (---) Hi! zimoun skribis: > I am just hitting this old bug#24496 [1]. > > On Mon, 26 Sep 2016 at 18:20, ludo@gnu.org (Ludovic Court=C3=A8s) wrote: >> ng0 skribis: >> >>> When I forgot that my build machine is offline and I did not pass >>> --no-build-hook, the offloading keeps trying forever until I had to >>> cancel the build, boot the build-machine and started the build again. > > [...] > >> Like you say, on Hydra-style setup this could be a problem: the >> front-end machine may have --max-jobs=3D0, meaning that it cannot perform >> builds on its own. >> >> So I guess we would need a command-line option to select a different >> behavior. I=E2=80=99m not sure how to do that because =E2=80=98guix off= load=E2=80=99 is >> =E2=80=9Chidden=E2=80=9D behind =E2=80=98guix-daemon=E2=80=99, so there= =E2=80=99s no obvious place for such an >> option. > > When the build machine used to offload is offline and the master daemon > is --max-jobs=3D0, I expect X tries (leading to timeout) and then just > fails with a hint, where X is defined by user. WDYT? > > >> In the meantime, you could also hack up your machines.scm: it would >> return a list where unreachable machines have been filtered out. > > Maybe, this could be done by =E2=80=9Cguix offload=E2=80=9D. Prior to commit efbf5fdd01817ea75de369e3dd2761a85f8f7dd5, this was the case: an unreachable machine would have =E2=80=98machine-load=E2=80=99 retu= rn +inf.0, and so it would be discarded from the list of candidates. However, I think this behavior was unintentionally lost in efbf5fdd01817ea75de369e3dd2761a85f8f7dd5. Maxim, WDYT? Thanks, Ludo=E2=80=99. From unknown Fri Jun 13 10:53:09 2025 X-Loop: help-debbugs@gnu.org Subject: bug#24496: offloading should fall back to local build after n tries Resent-From: Maxim Cournoyer Original-Sender: "Debbugs-submit" Resent-CC: bug-guix@gnu.org Resent-Date: Fri, 17 Dec 2021 21:58:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 24496 X-GNU-PR-Package: guix X-GNU-PR-Keywords: To: Ludovic =?UTF-8?Q?Court=C3=A8s?= Cc: ng0 , 24496@debbugs.gnu.org, zimoun Received: via spool by 24496-submit@debbugs.gnu.org id=B24496.16397782737177 (code B ref 24496); Fri, 17 Dec 2021 21:58:02 +0000 Received: (at 24496) by debbugs.gnu.org; 17 Dec 2021 21:57:53 +0000 Received: from localhost ([127.0.0.1]:40780 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1myLEf-0001rh-Dl for submit@debbugs.gnu.org; Fri, 17 Dec 2021 16:57:53 -0500 Received: from mail-qt1-f180.google.com ([209.85.160.180]:35817) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1myLEV-0001rI-5r for 24496@debbugs.gnu.org; Fri, 17 Dec 2021 16:57:52 -0500 Received: by mail-qt1-f180.google.com with SMTP id j17so4111434qtx.2 for <24496@debbugs.gnu.org>; Fri, 17 Dec 2021 13:57:43 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=from:to:cc:subject:references:date:in-reply-to:message-id :user-agent:mime-version:content-transfer-encoding; bh=LJiS26n7xCyDa5YM9nlQg43/t1qpTk5omRWW8++c61M=; b=hfVNBDcJnxrjz2qTuz157Ldad7bf+aJUzvMAqK93PKhVAVxXRMgxi7niAXDpzpF72b uIrvcxh/0vY+KYm/9J2dH6dBPKZxPiR4YzK10PTWDEjxqSMsZy8KMoTUgd/mhYbeOw6Y in2NSfMqxtloLoHjiRK5zigirC9LDewIawuX1wy/3x96AcaQo4eQr9P4nywAG+YY+0il 82j+JvTThrDJfzJdV0yZJXcSfyjBESkYTPIHmJTvzFbIx/4hcumJ2Uehfvy1HQoJJLV0 Oy0ln+2LP2ilUFLQIiOfA0CpJXQgv/8aT/ptiCqWpaYt+fI22T2ipZIImB3SWTwZ0DN4 HG9A== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:from:to:cc:subject:references:date:in-reply-to :message-id:user-agent:mime-version:content-transfer-encoding; bh=LJiS26n7xCyDa5YM9nlQg43/t1qpTk5omRWW8++c61M=; b=mIdHVPSx9G3Htzx1peOlBcbTiAjAJ7VEIwJBgFr9r3ufNYWsHHojEARNgj8DKEaCSk ByJAM/0RmgaaZR2yCC85803BDnHsGL9k9847C0OZQjFFnWcsYyo5azkGrDCnm2OSdacD LSqPNC3Q7asYQZcZssNadMNxm1NAJ3Vy9pLyj13WeKQLknr9l/f5+1cmzVNZHtf61qef QcQGLBJvFkKfPqItNcCXOV1YLFaylQygz2qnI9xh8fAtK0hTTOHkgnILsltwXvn2HMga xVPyiHq0Ja7+KvRFQJDGR3S4jl4S7Wlmw5Bd/TP+VKdVvS+6GdIyOOUU/IlGhiHwzP+H WFOA== X-Gm-Message-State: AOAM533/sQfqc7CsFzUFxkOoKgYnvCV1JY7GxzVC4swDZh+YfBH/qzHz etVHYxWNQfavTv5gRWspLqnLkGRbCJI= X-Google-Smtp-Source: ABdhPJyysJhVA36fWj5MKa6g5h9MRMKcTsFiKrP03ifEvRwyEOgdGHq0j/zWMf4rZPpBFdi/5Ub4Tg== X-Received: by 2002:a05:622a:1c7:: with SMTP id t7mr4218153qtw.441.1639778257349; Fri, 17 Dec 2021 13:57:37 -0800 (PST) Received: from hurd ([207.35.95.41]) by smtp.gmail.com with ESMTPSA id az14sm5658074qkb.97.2021.12.17.13.57.34 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 17 Dec 2021 13:57:34 -0800 (PST) From: Maxim Cournoyer References: <8760ppr3q3.fsf@we.make.ritual.n0.is> <87r387nhjg.fsf@gnu.org> <868rwkiuf5.fsf@gmail.com> <878rwjqm91.fsf@gnu.org> Date: Fri, 17 Dec 2021 16:57:33 -0500 In-Reply-To: <878rwjqm91.fsf@gnu.org> ("Ludovic =?UTF-8?Q?Court=C3=A8s?="'s message of "Fri, 17 Dec 2021 16:33:46 +0100") Message-ID: <87lf0i6gj6.fsf@gmail.com> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.2 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Spam-Score: -0.0 (/) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) Hello Ludovic, Ludovic Court=C3=A8s writes: > Hi! > > zimoun skribis: > >> I am just hitting this old bug#24496 [1]. >> >> On Mon, 26 Sep 2016 at 18:20, ludo@gnu.org (Ludovic Court=C3=A8s) wrote: >>> ng0 skribis: >>> >>>> When I forgot that my build machine is offline and I did not pass >>>> --no-build-hook, the offloading keeps trying forever until I had to >>>> cancel the build, boot the build-machine and started the build again. >> >> [...] >> >>> Like you say, on Hydra-style setup this could be a problem: the >>> front-end machine may have --max-jobs=3D0, meaning that it cannot perfo= rm >>> builds on its own. >>> >>> So I guess we would need a command-line option to select a different >>> behavior. I=E2=80=99m not sure how to do that because =E2=80=98guix of= fload=E2=80=99 is >>> =E2=80=9Chidden=E2=80=9D behind =E2=80=98guix-daemon=E2=80=99, so there= =E2=80=99s no obvious place for such an >>> option. >> >> When the build machine used to offload is offline and the master daemon >> is --max-jobs=3D0, I expect X tries (leading to timeout) and then just >> fails with a hint, where X is defined by user. WDYT? >> >> >>> In the meantime, you could also hack up your machines.scm: it would >>> return a list where unreachable machines have been filtered out. >> >> Maybe, this could be done by =E2=80=9Cguix offload=E2=80=9D. > > Prior to commit efbf5fdd01817ea75de369e3dd2761a85f8f7dd5, this was the > case: an unreachable machine would have =E2=80=98machine-load=E2=80=99 re= turn +inf.0, > and so it would be discarded from the list of candidates. > > However, I think this behavior was unintentionally lost in > efbf5fdd01817ea75de369e3dd2761a85f8f7dd5. Maxim, WDYT? I just reviewed this commit, and don't see anywhere where the behavior would have changed. The discarding happens here: --8<---------------cut here---------------start------------->8--- - (if (and node (< load 2.) (>=3D space %minimum-disk-space)) + (if (and node + (or (not threshold) (< load threshold)) + (>=3D space %minimum-disk-space)) --8<---------------cut here---------------end--------------->8--- previously load could be set to +inf.0. Now it is a float between 0.0 and 1.0, with threshold defaulting to 0.6. As far as I remember, this has always been a problem for me (busy offload machines being forever retried with no fallback to the local machine). Thanks, Maxim From unknown Fri Jun 13 10:53:09 2025 X-Loop: help-debbugs@gnu.org Subject: bug#24496: offloading should fall back to local build after n tries Resent-From: zimoun Original-Sender: "Debbugs-submit" Resent-CC: bug-guix@gnu.org Resent-Date: Sat, 18 Dec 2021 00:12:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 24496 X-GNU-PR-Package: guix X-GNU-PR-Keywords: To: Maxim Cournoyer , Ludovic =?UTF-8?Q?Court=C3=A8s?= Cc: 24496@debbugs.gnu.org, ng0 Received: via spool by 24496-submit@debbugs.gnu.org id=B24496.163978626821221 (code B ref 24496); Sat, 18 Dec 2021 00:12:01 +0000 Received: (at 24496) by debbugs.gnu.org; 18 Dec 2021 00:11:08 +0000 Received: from localhost ([127.0.0.1]:40901 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1myNJc-0005WD-C5 for submit@debbugs.gnu.org; Fri, 17 Dec 2021 19:11:08 -0500 Received: from mail-wr1-f43.google.com ([209.85.221.43]:44890) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1myNJa-0005VT-KQ for 24496@debbugs.gnu.org; Fri, 17 Dec 2021 19:11:07 -0500 Received: by mail-wr1-f43.google.com with SMTP id t18so6909827wrg.11 for <24496@debbugs.gnu.org>; Fri, 17 Dec 2021 16:11:06 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=from:to:cc:subject:in-reply-to:references:date:message-id :mime-version:content-transfer-encoding; bh=tsaZh84wsHq6MMH3Nu4Uvg48VLQZAeLNIOsPsY8NQWE=; b=F2/oyswl08GS9Z30LtHHIwzVWzGXVsFQeeP+WfRShe9+o04yZebcA55fK+vNml9JsI SQtluV/yk5r0ZA7Y1gIHuxglJJCPgj6pqxXx2aOVNP4uZXZuaFJTmYYsxNJv9stU9jrx a9qGCRE+id28LgUMUkGmAZWP3V8CjM83QPNOU2O+p5W9ewco2rX7Zvsjk+xxCx9EY+pH f5/YZa1MZGz/x/v/O4LPZbxH+9N1KXPzjRzJkBH8Yag+nV2e81AH7XOcurFP0vQtTfdh lK8HFILgYO7O3zE8ze/1WnFSrtdqGMPvjP3dLAOGT7Cifws0M+F0L2o54Cfz75pZGBGL VONw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:from:to:cc:subject:in-reply-to:references:date :message-id:mime-version:content-transfer-encoding; bh=tsaZh84wsHq6MMH3Nu4Uvg48VLQZAeLNIOsPsY8NQWE=; b=f40IJQ+RJzFlT9lXjsvt22ISapihrdHaaiWEdhLud9NHjj2a75fpo8PNro9iKucpMA hqsp13SMnQsoyxtd0Kaky1GmHEZC/0vvFTazVLZ3Lzt7au19RLQwA80s5AxyosYIth5l czaOPUjVl2T3iHVcxpOJhGuR7YxJeflw8XJ1GFoA5VYhL8WSVsKi7gyTkyqCzuFXNmOD vkO4j5+BDqb4YPRcqsI/fJzXWblwpZrTM4VF4Wiq85UOW9RnNrp8x81KOUurmFnsoaib oVbVfOFU6cOgDu5SVvNop0muD/Fhnxl7EKB4dKrE7RXTap1UVGKjcMxLU8wuUKDryIMy dFNw== X-Gm-Message-State: AOAM532OPfNlX9TbejJVYhn7lsba172x+XNuXr/dJwn8qVtOWfUPjfRO yxw2D/eJctqx5TxVlgAFTeQSbK7oAL0= X-Google-Smtp-Source: ABdhPJwGM6jXg/NCwmVNOxm09K7pXoFugrAyjFza87tmTNCCoDTAi9jfY+vtwcjWrBcoTbqGtRvG5A== X-Received: by 2002:a05:6000:1842:: with SMTP id c2mr4426935wri.301.1639786260694; Fri, 17 Dec 2021 16:11:00 -0800 (PST) Received: from lili ([2a01:e0a:59b:9120:65d2:2476:f637:db1e]) by smtp.gmail.com with ESMTPSA id q123sm11478528wma.30.2021.12.17.16.10.59 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 17 Dec 2021 16:11:00 -0800 (PST) From: zimoun In-Reply-To: <87lf0i6gj6.fsf@gmail.com> References: <8760ppr3q3.fsf@we.make.ritual.n0.is> <87r387nhjg.fsf@gnu.org> <868rwkiuf5.fsf@gmail.com> <878rwjqm91.fsf@gnu.org> <87lf0i6gj6.fsf@gmail.com> Date: Sat, 18 Dec 2021 01:10:49 +0100 Message-ID: <86tuf6rcvq.fsf@gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Spam-Score: 0.0 (/) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.0 (-) Hi, I have not checked all the details, since the code of =E2=80=9Cguix offload= =E2=80=9D is run by root, IIUC and so it is not as friendly as usual to debug. :-) On Fri, 17 Dec 2021 at 16:57, Maxim Cournoyer w= rote: >> However, I think this behavior was unintentionally lost in >> efbf5fdd01817ea75de369e3dd2761a85f8f7dd5. Maxim, WDYT? > > I just reviewed this commit, and don't see anywhere where the behavior > would have changed. The discarding happens here: [...] > previously load could be set to +inf.0. Now it is a float between 0.0 > and 1.0, with threshold defaulting to 0.6. My /etc/guix/machines.scm contains only one machine and --max-jobs=3D0. Because the machine is unreachable, IIUC, =E2=80=99node=E2=80=99 is (or sho= uld be) false and =E2=80=99load=E2=80=99 is thus not involved, I guess. Indeed, =E2=80= =99report-load=E2=80=99 displays nothing, and instead I get: --8<---------------cut here---------------start------------->8--- The following derivation will be built: /gnu/store/c1qicg17ygn1a0biq0q4mkprzy4p2x74-hello-2.10.drv process 75621 acquired build slot '/var/guix/offload/x.x.x.x:22/0' guix offload: error: failed to connect to 'x.x.x.x': Timeout connecting to = x.x.x.x waiting for locks or build slots... process 75621 acquired build slot '/var/guix/offload/x.x.x.x:22/0' guix offload: error: failed to connect to 'x.x.x.x': Timeout connecting to = x.x.x.x process 75621 acquired build slot '/var/guix/offload/x.x.x.x:22/0' guix offload: error: failed to connect to 'x.x.x.x': Timeout connecting to = x.x.x.x process 75621 acquired build slot '/var/guix/offload/x.x.x.x:22/0' guix offload: error: failed to connect to 'x.x.x.x': Timeout connecting to = x.x.x.x process 75621 acquired build slot '/var/guix/offload/x.x.x.x:22/0' C-c C-c --8<---------------cut here---------------end--------------->8--- Well, if the machine is not reachable, then =E2=80=99session=E2=80=99 is fa= lse, right? --8<---------------cut here---------------start------------->8--- @@ -472,11 +480,15 @@ (define (machine-faster? m1 m2) (let* ((session (false-if-exception (open-ssh-session best %short-timeou= t))) (node (and session (remote-inferior session))) - (load (and node (normalized-load best (node-load node)))) + (load (and node (node-load node))) + (threshold (build-machine-overload-threshold best)) (space (and node (node-free-disk-space node)))) + (when load (report-load best load)) (when node (close-inferior node)) (when session (disconnect! session)) - (if (and node (< load 2.) (>=3D space %minimum-disk-space)) + (if (and node + (or (not threshold) (< load threshold)) + (>=3D space %minimum-disk-space)) [...] (begin ;; BEST is unsuitable, so try the next one. (when (and space (< space %minimum-disk-space)) (format (current-error-port) "skipping machine '~a' because it is low \ on disk space (~,2f MiB free)~%" (build-machine-name best) (/ space (expt 2 20) 1.))) (release-build-slot slot) (loop others))))) --8<---------------cut here---------------end--------------->8--- Therefore, the =E2=80=99else=E2=80=99 branch goes and so the codes does =E2= =80=99(loop others)=E2=80=99. However, I miss why =E2=80=99others=E2=80=99 is not empty (only one machine= in /etc/guix/machines.scm). Well, the message =C2=ABwaiting for locks or build slots...=C2=BB suggests that something is restarted and it is not that =E2= =80=99loop=E2=80=99 we are observing but another one. On daemon side, I do not know what this =E2=80=99waitingForAWhile=E2=80=99 = and =E2=80=99lastWokenUp=E2=80=99 mean. --8<---------------cut here---------------start------------->8--- /* If we are polling goals that are waiting for a lock, then wake up after a few seconds at most. */ if (!waitingForAWhile.empty()) { useTimeout =3D true; if (lastWokenUp =3D=3D 0) printMsg(lvlError, "waiting for locks or build slots..."); if (lastWokenUp =3D=3D 0 || lastWokenUp > before) lastWokenUp =3D b= efore; timeout.tv_sec =3D std::max((time_t) 1, (time_t) (lastWokenUp + set= tings.pollInterval - before)); } else lastWokenUp =3D 0; --8<---------------cut here---------------end--------------->8--- Bah it requires more investigations and I agree with Maxim that efbf5fdd01817ea75de369e3dd2761a85f8f7dd5 is probably not the issue there. Cheers, simon From unknown Fri Jun 13 10:53:09 2025 X-Loop: help-debbugs@gnu.org Subject: bug#24496: offloading should fall back to local build after n tries Resent-From: Ludovic =?UTF-8?Q?Court=C3=A8s?= Original-Sender: "Debbugs-submit" Resent-CC: bug-guix@gnu.org Resent-Date: Tue, 21 Dec 2021 14:29:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 24496 X-GNU-PR-Package: guix X-GNU-PR-Keywords: To: Maxim Cournoyer Cc: ng0 , 24496@debbugs.gnu.org, zimoun Received: via spool by 24496-submit@debbugs.gnu.org id=B24496.164009692722191 (code B ref 24496); Tue, 21 Dec 2021 14:29:02 +0000 Received: (at 24496) by debbugs.gnu.org; 21 Dec 2021 14:28:47 +0000 Received: from localhost ([127.0.0.1]:52815 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mzg8F-0005lq-Jy for submit@debbugs.gnu.org; Tue, 21 Dec 2021 09:28:47 -0500 Received: from hera.aquilenet.fr ([185.233.100.1]:37656) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1mzg8A-0005lZ-UB for 24496@debbugs.gnu.org; Tue, 21 Dec 2021 09:28:46 -0500 Received: from localhost (localhost [127.0.0.1]) by hera.aquilenet.fr (Postfix) with ESMTP id 19ADD3FC; Tue, 21 Dec 2021 15:28:37 +0100 (CET) X-Virus-Scanned: Debian amavisd-new at aquilenet.fr Received: from hera.aquilenet.fr ([127.0.0.1]) by localhost (hera.aquilenet.fr [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id fh_orBe2bXcD; Tue, 21 Dec 2021 15:28:36 +0100 (CET) Received: from ribbon (unknown [193.50.110.211]) by hera.aquilenet.fr (Postfix) with ESMTPSA id 5CF8035C; Tue, 21 Dec 2021 15:28:36 +0100 (CET) From: Ludovic =?UTF-8?Q?Court=C3=A8s?= References: <8760ppr3q3.fsf@we.make.ritual.n0.is> <87r387nhjg.fsf@gnu.org> <868rwkiuf5.fsf@gmail.com> <878rwjqm91.fsf@gnu.org> <87lf0i6gj6.fsf@gmail.com> X-URL: http://www.fdn.fr/~lcourtes/ X-Revolutionary-Date: 1 =?UTF-8?Q?Niv=C3=B4se?= an 230 de la =?UTF-8?Q?R=C3=A9volution?= X-PGP-Key-ID: 0x090B11993D9AEBB5 X-PGP-Key: http://www.fdn.fr/~lcourtes/ludovic.asc X-PGP-Fingerprint: 3CE4 6455 8A84 FDC6 9DB4 0CFB 090B 1199 3D9A EBB5 X-OS: x86_64-pc-linux-gnu Date: Tue, 21 Dec 2021 15:28:35 +0100 In-Reply-To: <87lf0i6gj6.fsf@gmail.com> (Maxim Cournoyer's message of "Fri, 17 Dec 2021 16:57:33 -0500") Message-ID: <878rwe6nho.fsf@gnu.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.2 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Spamd-Bar: + X-Spam-Level: * X-Rspamd-Server: hera Authentication-Results: hera.aquilenet.fr; none X-Rspamd-Queue-Id: 19ADD3FC X-Spamd-Result: default: False [1.40 / 15.00]; ARC_NA(0.00)[]; RCVD_VIA_SMTP_AUTH(0.00)[]; FREEMAIL_CC(0.00)[gmail.com,runbox.com,debbugs.gnu.org]; FROM_HAS_DN(0.00)[]; RCPT_COUNT_THREE(0.00)[4]; TO_MATCH_ENVRCPT_ALL(0.00)[]; FREEMAIL_ENVRCPT(0.00)[gmail.com,runbox.com]; TAGGED_RCPT(0.00)[]; MIME_GOOD(-0.10)[text/plain]; TO_DN_SOME(0.00)[]; FREEMAIL_TO(0.00)[gmail.com]; FROM_EQ_ENVFROM(0.00)[]; MIME_TRACE(0.00)[0:+]; RCVD_COUNT_TWO(0.00)[2]; RCVD_TLS_ALL(0.00)[]; MID_RHS_MATCH_FROM(0.00)[]; SUSPICIOUS_RECIPS(1.50)[] X-Spam-Score: 1.0 (+) X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.0 (/) Hi, Maxim Cournoyer skribis: > I just reviewed this commit, and don't see anywhere where the behavior > would have changed. The discarding happens here: > > - (if (and node (< load 2.) (>=3D space %minimum-disk-space)) > + (if (and node > + (or (not threshold) (< load threshold)) > + (>=3D space %minimum-disk-space)) > > previously load could be set to +inf.0. Now it is a float between 0.0 > and 1.0, with threshold defaulting to 0.6. Ah alright, so we=E2=80=99re fine. > As far as I remember, this has always been a problem for me (busy > offload machines being forever retried with no fallback to the local > machine). OK, I guess I=E2=80=99m overlooking something. Thanks, Ludo=E2=80=99.