From debbugs-submit-bounces@debbugs.gnu.org Fri Nov 02 06:57:31 2018
From: ludo@gnu.org (Ludovic Courtès)
To: bug-guix@gnu.org
Subject: 'guix offload' regularly hangs in 'channel-get-exit-status' call
Date: Fri, 02 Nov 2018 11:57:06 +0100
Message-ID: <87k1lvrblp.fsf@gnu.org>

Hello,

The ‘guix offload’ processes on berlin regularly hang while calling
‘channel-get-exit-status’:

--8<---------------cut here---------------start------------->8---
(gdb) bt
#0  0x00007f299fb330f1 in __GI___poll (fds=0x1dd58c0, nfds=1, timeout=-1) at ../sysdeps/unix/sysv/linux/poll.c:29
#1  0x00007f2994287577 in ssh_poll_ctx_dopoll () from target:/gnu/store/wmpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4
#2  0x00007f29942884d9 in ssh_handle_packets () from target:/gnu/store/wmpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4
#3  0x00007f29942885ad in ssh_handle_packets_termination () from target:/gnu/store/wmpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4
#4  0x00007f2994275080 in ssh_channel_get_exit_status () from target:/gnu/store/wmpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4
#5  0x00007f29946dd11a in guile_ssh_channel_get_exit_status () from target:/gnu/store/i3nfl17wfx7sryq6w15r9wxl7ilmq4rb-guile-ssh-0.11.3/lib/libguile-ssh.so.11
#6  0x00007f29a1765965 in vm_regular_engine (thread=0x1dd58c0, vp=0x1d4df30, registers=0xffffffff, resume=-1615646479) at vm-engine.c:786
#7  0x00007f29a1768fba in scm_call_n (proc=#, argv=argv@entry=0x7ffc76b1ece8, nargs=nargs@entry=1) at vm.c:1257
#8  0x00007f29a16ecff7 in scm_primitive_eval (
    exp=exp@entry=((@ (ice-9 control) %) (begin ((@@ (ice-9 command-line) load/lang) "/gnu/store/zz3b7j4iv6v143v7cqyr77k83zc5n3zw-guix-0.15.0-6.f9a8fce/bin/.guix-real") (main (command-line)) (quit)))) at eval.c:662
#9  0x00007f29a16ed053 in scm_eval (
    exp=((@ (ice-9 control) %) (begin ((@@ (ice-9 command-line) load/lang) "/gnu/store/zz3b7j4iv6v143v7cqyr77k83zc5n3zw-guix-0.15.0-6.f9a8fce/bin/.guix-real") (main (command-line)) (quit))), module_or_state=module_or_state@entry="#" = {...}) at eval.c:696
#10 0x00007f29a1738220 in scm_shell (argc=11, argv=0x1dd5280) at script.c:454
(gdb) frame 0
#0  0x00007f299fb330f1 in __GI___poll (fds=0x1dd58c0, nfds=1, timeout=-1) at ../sysdeps/unix/sysv/linux/poll.c:29
29      in ../sysdeps/unix/sysv/linux/poll.c
(gdb) p *fds
$1 = {fd = 14, events = 1, revents = 0}
(gdb) shell ls -l /proc/12605/fd
total 0
lr-x------ 1 root root 64 Nov 2 11:20 0 -> 'pipe:[44413497]'
l-wx------ 1 root root 64 Nov 2 11:33 1 -> 'pipe:[44413496]'
lr-x------ 1 root root 64 Nov 2 11:33 10 -> 'pipe:[44459532]'
l-wx------ 1 root root 64 Nov 2 11:33 11 -> 'pipe:[44459532]'
lr-x------ 1 root root 64 Nov 2 11:33 12 -> 'pipe:[44429590]'
l-wx------ 1 root root 64 Nov 2 11:33 13 -> 'pipe:[44429590]'
lrwx------ 1 root root 64 Nov 2 11:33 14 -> 'socket:[44444783]'
lrwx------ 1 root root 64 Nov 2 11:33 15 -> 'socket:[44444784]'
l-wx------ 1 root root 64 Nov 2 11:33 16 -> /var/guix/offload/141.80.167.140/0
l-wx------ 1 root root 64 Nov 2 11:33 2 -> 'pipe:[44413496]'
lr-x------ 1 root root 64 Nov 2 11:33 3 -> 'pipe:[44459528]'
lr-x------ 1 root root 64 Nov 2 11:33 33 -> /dev/urandom
l-wx------ 1 root root 64 Nov 2 11:33 4 -> 'pipe:[44413498]'
l-wx------ 1 root root 64 Nov 2 11:33 5 -> 'pipe:[44459528]'
lr-x------ 1 root root 64 Nov 2 11:33 6 -> 'pipe:[44459531]'
l-wx------ 1 root root 64 Nov 2 11:33 7 -> 'pipe:[44459531]'
lr-x------ 1 root root 64 Nov 2 11:33 8 -> 'pipe:[44453928]'
l-wx------ 1 root root 64 Nov 2 11:33 9 -> 'pipe:[44453928]'
--8<---------------cut here---------------end--------------->8---

I believe this is because in (guix ssh) we don’t ensure the remote
process is dead by the time we call ‘channel-get-exit-status’, as in
this example:

--8<---------------cut here---------------start------------->8---
scheme@(guix ssh)> (define s (open-ssh-session "localhost" #:user "ludo" #:port 22))
scheme@(guix ssh)> (define c (open-remote-pipe* s OPEN_BOTH "sleep 1000"))
scheme@(guix ssh)> (channel-send-eof c)
$4 = #
scheme@(guix ssh)> (channel-get-exit-status c)
;; hangs
--8<---------------cut here---------------end--------------->8---

Problem is that calling ‘channel-get-exit-status’ on a closed port
doesn’t work, so forcing a port close isn’t really an option:

--8<---------------cut here---------------start------------->8---
scheme@(guix ssh)> (define c (open-remote-pipe* s OPEN_BOTH "sleep 100"))
scheme@(guix ssh)> (close-port c)
$4 = #t
scheme@(guix ssh)> (channel-get-exit-status c)
ERROR: In procedure channel-get-exit-status:
In procedure channel-get-exit-status: Wrong type argument in position 1 (expecting open channel): #
--8<---------------cut here---------------end--------------->8---

To be continued…

Ludo’.
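
Editorial illustration, not part of the original message: the backtrace in the report shows ‘poll’ called with nfds=1, timeout=-1 and events = 1 (POLLIN on Linux), so the call can only return once the peer sends data or closes the socket. The sketch below reproduces that waiting pattern in Python. It is not Guile-SSH or libssh code; the local socket pair, the helper name `wait_readable` and the 200 ms timeout are assumptions chosen for the demonstration.

```python
import select
import socket


def wait_readable(sock, timeout_ms):
    """Poll SOCK for input; return True if it became readable in time."""
    poller = select.poll()
    poller.register(sock, select.POLLIN)  # same event mask as 'events = 1'
    # A timeout of -1 (or None) would block until the peer acts,
    # which is the situation the hung process is in.
    return bool(poller.poll(timeout_ms))


local, remote = socket.socketpair()

# The peer stays silent, so a finite timeout lets the caller regain control.
silent = wait_readable(local, 200)

# Once the peer sends something, the same call reports readiness.
remote.send(b"exit-status")
ready = wait_readable(local, 200)

remote.close()
local.close()
print(silent, ready)
```

With a finite timeout the caller at least gets a chance to notice that nothing arrived; whether a timeout is the right remedy for ‘guix offload’ is a separate question that this sketch does not answer.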
From debbugs-submit-bounces@debbugs.gnu.org Fri Nov 02 09:41:02 2018
From: swedebugia
To: Ludovic Courtès, 33239@debbugs.gnu.org
Subject: Re: bug#33239: 'guix offload' regularly hangs in 'channel-get-exit-status' call
Date: Fri, 2 Nov 2018 14:46:30 +0100
In-Reply-To: <87k1lvrblp.fsf@gnu.org>

Hi :)

On 2018-11-02 11:57, Ludovic Courtès wrote:
> (gdb) shell ls -l /proc/12605/fd

How did you come up with this file descriptor (did not appear above)

What does this give you in the debugging?

--
Cheers
Swedebugia

From debbugs-submit-bounces@debbugs.gnu.org Sat Nov 03 10:10:07 2018
From: ludo@gnu.org (Ludovic Courtès)
To: swedebugia
Cc: 33239@debbugs.gnu.org
Subject: Re: bug#33239: 'guix offload' regularly hangs in 'channel-get-exit-status' call
Date: Sat, 03 Nov 2018 15:09:55 +0100
Message-ID: <87sh0ip80c.fsf@gnu.org>
In-Reply-To: (swedebugia's message of "Fri, 2 Nov 2018 14:46:30 +0100")
References: <87k1lvrblp.fsf@gnu.org>

Hello,

swedebugia skribis:

> On 2018-11-02 11:57, Ludovic Courtès wrote:
>> (gdb) shell ls -l /proc/12605/fd
> How did you come up with this file descriptor (did not appear above)

It showed up in the pollfd structure passed to ‘poll’.

> What does this give you in the debugging?

It shows that the file descriptor is indeed open and corresponds to a
socket (which is not much, I admit.)

Ludo’.
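
Editorial illustration, not part of the original message: the check described in the reply (take the descriptor number from the pollfd structure, then look it up under /proc/PID/fd) can also be scripted. The sketch below assumes a Linux system with /proc mounted; the helper name `describe_fd` is hypothetical, and it inspects a socket of its own process rather than the hung ‘guix offload’ process.

```python
import os
import socket


def describe_fd(pid, fd):
    """Return what /proc/PID/fd/FD points at, e.g. 'socket:[44444783]'."""
    return os.readlink(f"/proc/{pid}/fd/{fd}")


# Stand-in for the SSH connection: any open socket in this process.
a, b = socket.socketpair()
target = describe_fd(os.getpid(), a.fileno())
print(target)

a.close()
b.close()
```

For another process, pass its PID and the descriptor number reported by the debugger (12605 and 14 in the report); reading another user's /proc entries normally requires the same privileges as the ‘ls -l’ call in the report.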
From debbugs-submit-bounces@debbugs.gnu.org Sun Nov 11 11:15:01 2018 Received: (at control) by debbugs.gnu.org; 11 Nov 2018 16:15:01 +0000 Received: from localhost ([127.0.0.1]:46604 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1gLsO5-0001zY-3a for submit@debbugs.gnu.org; Sun, 11 Nov 2018 11:15:01 -0500 Received: from eggs.gnu.org ([208.118.235.92]:42784) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1gLsO3-0001zB-LB for control@debbugs.gnu.org; Sun, 11 Nov 2018 11:15:00 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1gLsNw-0008Qs-EL for control@debbugs.gnu.org; Sun, 11 Nov 2018 11:14:54 -0500 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=disabled version=3.3.2 Received: from fencepost.gnu.org ([2001:4830:134:3::e]:49693) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1gLsNw-0008Qk-9x for control@debbugs.gnu.org; Sun, 11 Nov 2018 11:14:52 -0500 Received: from [2a01:e0a:1d:7270:af76:b9b:ca24:c465] (port=56160 helo=ribbon) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1gLsNu-0001wd-TR for control@debbugs.gnu.org; Sun, 11 Nov 2018 11:14:51 -0500 Date: Sun, 11 Nov 2018 17:14:49 +0100 Message-Id: <87efbrsi9y.fsf@gnu.org> To: control@debbugs.gnu.org From: ludo@gnu.org (Ludovic =?utf-8?Q?Court=C3=A8s?=) Subject: control message for bug #33239 MIME-version: 1.0 Content-type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 2001:4830:134:3::e X-Spam-Score: -5.0 (-----) X-Debbugs-Envelope-To: control X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: 
debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -6.0 (------) severity 33239 important From debbugs-submit-bounces@debbugs.gnu.org Sat Nov 17 14:09:10 2018 Received: (at 33239) by debbugs.gnu.org; 17 Nov 2018 19:09:10 +0000 Received: from localhost ([127.0.0.1]:58143 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1gO5xu-0003FV-KX for submit@debbugs.gnu.org; Sat, 17 Nov 2018 14:09:10 -0500 Received: from mx1.riseup.net ([198.252.153.129]:43681) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1gO5xs-0003FL-EQ for 33239@debbugs.gnu.org; Sat, 17 Nov 2018 14:09:09 -0500 Received: from piha.riseup.net (piha-pn.riseup.net [10.0.1.163]) (using TLSv1 with cipher ECDHE-RSA-AES256-SHA (256/256 bits)) (Client CN "*.riseup.net", Issuer "COMODO RSA Domain Validation Secure Server CA" (verified OK)) by mx1.riseup.net (Postfix) with ESMTPS id B77781A01E7; Sat, 17 Nov 2018 11:09:07 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=riseup.net; s=squak; t=1542481747; bh=zgdr9GwxcjrIkkEI2ELr3c1AtZM9mhcaRaKbSRMx5I4=; h=Subject:To:References:From:Date:In-Reply-To:From; b=R7Lc/X5iLrU99iW7PIvXihBEPu/uUnvWJONMCxCQ/EbQfABN9P8Mj0Klg6SR9oaPR PgafR8NP5dAK2oksYyKH6zAZnrN8hnzsKzrKomfwzIQmtO1O9tkzInzJdjHuz36Y4U V+gcwTgIEIhNpd2eE6EgO1FOvr+aiobIJdgPiU4Q= X-Riseup-User-ID: 1775A1C8F0E51E820465DEE066876CEFD3C2539CB43785E075B782E9FEF39291 Received: from [127.0.0.1] (localhost [127.0.0.1]) by piha.riseup.net with ESMTPSA id D4E861E041A; Sat, 17 Nov 2018 11:09:06 -0800 (PST) Subject: Re: bug#33239: 'guix offload' regularly hangs in 'channel-get-exit-status' call To: =?UTF-8?Q?Ludovic_Court=c3=a8s?= , 33239@debbugs.gnu.org References: <87k1lvrblp.fsf@gnu.org> From: swedebugia Message-ID: Date: Sat, 17 Nov 2018 20:14:52 +0100 MIME-Version: 1.0 In-Reply-To: <87k1lvrblp.fsf@gnu.org> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: sv-FI Content-Transfer-Encoding: 8bit X-Spam-Score: 
-0.7 (/) X-Debbugs-Envelope-To: 33239 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -1.7 (-) On 2018-11-02 11:57, Ludovic Courtès wrote: snip > To be continued… I found this which might be related and point you to a solution: https://github.com/paramiko/paramiko/issues/448 found here: https://duckduckgo.com/?q=channel-get-exit-status+ssh -- Cheers Swedebugia From debbugs-submit-bounces@debbugs.gnu.org Fri Nov 23 12:25:35 2018 Received: (at 33239) by debbugs.gnu.org; 23 Nov 2018 17:25:35 +0000 Received: from localhost ([127.0.0.1]:44482 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1gQFCx-0001Pl-8M for submit@debbugs.gnu.org; Fri, 23 Nov 2018 12:25:35 -0500 Received: from eggs.gnu.org ([208.118.235.92]:35144) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1gQFCv-0001PV-5J for 33239@debbugs.gnu.org; Fri, 23 Nov 2018 12:25:33 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1gQFCn-00071g-D9 for 33239@debbugs.gnu.org; Fri, 23 Nov 2018 12:25:28 -0500 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=disabled version=3.3.2 Received: from fencepost.gnu.org ([2001:4830:134:3::e]:34676) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1gQFCm-0006zr-Vp for 33239@debbugs.gnu.org; Fri, 23 Nov 2018 12:25:25 -0500 Received: from [2001:660:6102:320:e120:2c8f:8909:cdfe] (port=51896 helo=ribbon) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1gQFCm-0003S8-DZ for 33239@debbugs.gnu.org; Fri, 23 Nov 2018 12:25:24 -0500 From: ludo@gnu.org (Ludovic =?utf-8?Q?Court=C3=A8s?=) To: 33239@debbugs.gnu.org Subject: 
Re: bug#33239: 'guix offload' regularly hangs in 'channel-get-exit-status' call References: <87k1lvrblp.fsf@gnu.org> Date: Fri, 23 Nov 2018 18:25:21 +0100 In-Reply-To: <87k1lvrblp.fsf@gnu.org> ("Ludovic \=\?utf-8\?Q\?Court\=C3\=A8s\=22'\?\= \=\?utf-8\?Q\?s\?\= message of "Fri, 02 Nov 2018 11:57:06 +0100") Message-ID: <87wop33dvi.fsf@gnu.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/26.1 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 2001:4830:134:3::e X-Spam-Score: -5.0 (-----) X-Debbugs-Envelope-To: 33239 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -6.0 (------) ludo@gnu.org (Ludovic Court=C3=A8s) skribis: > (gdb) bt > #0 0x00007f299fb330f1 in __GI___poll (fds=3D0x1dd58c0, nfds=3D1, timeout= =3D-1) at ../sysdeps/unix/sysv/linux/poll.c:29 > #1 0x00007f2994287577 in ssh_poll_ctx_dopoll () from target:/gnu/store/w= mpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4 > #2 0x00007f29942884d9 in ssh_handle_packets () from target:/gnu/store/wm= pg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4 > #3 0x00007f29942885ad in ssh_handle_packets_termination () from target:/= gnu/store/wmpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4 > #4 0x00007f2994275080 in ssh_channel_get_exit_status () from target:/gnu= /store/wmpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4 > #5 0x00007f29946dd11a in guile_ssh_channel_get_exit_status () from targe= t:/gnu/store/i3nfl17wfx7sryq6w15r9wxl7ilmq4rb-guile-ssh-0.11.3/lib/libguile= -ssh.so.11 > #6 0x00007f29a1765965 in vm_regular_engine (thread=3D0x1dd58c0, vp=3D0x1= d4df30, registers=3D0xffffffff, resume=3D-1615646479) at 
vm-engine.c:786 > #7 0x00007f29a1768fba in scm_call_n (proc=3D#, arg= v=3Dargv@entry=3D0x7ffc76b1ece8, nargs=3Dnargs@entry=3D1) at vm.c:1257 > #8 0x00007f29a16ecff7 in scm_primitive_eval ( > exp=3Dexp@entry=3D((@ (ice-9 control) %) (begin ((@@ (ice-9 command-l= ine) load/lang) "/gnu/store/zz3b7j4iv6v143v7cqyr77k83zc5n3zw-guix-0.15.0-6.= f9a8fce/bin/.guix-real") (main (command-line)) (quit)))) at eval.c:662 > #9 0x00007f29a16ed053 in scm_eval ( > exp=3D((@ (ice-9 control) %) (begin ((@@ (ice-9 command-line) load/la= ng) "/gnu/store/zz3b7j4iv6v143v7cqyr77k83zc5n3zw-guix-0.15.0-6.f9a8fce/bin/= .guix-real") (main (command-line)) (quit))), module_or_state=3Dmodule_or_st= ate@entry=3D"#" =3D {...}) at eval.c:696 > #10 0x00007f29a1738220 in scm_shell (argc=3D11, argv=3D0x1dd5280) at scri= pt.c:454 > > (gdb) frame 0 > #0 0x00007f299fb330f1 in __GI___poll (fds=3D0x1dd58c0, nfds=3D1, timeout= =3D-1) at ../sysdeps/unix/sysv/linux/poll.c:29 > 29 in ../sysdeps/unix/sysv/linux/poll.c > (gdb) p *fds > $1 =3D {fd =3D 14, events =3D 1, revents =3D 0} > (gdb) shell ls -l /proc/12605/fd > total 0 > lr-x------ 1 root root 64 Nov 2 11:20 0 -> 'pipe:[44413497]' > l-wx------ 1 root root 64 Nov 2 11:33 1 -> 'pipe:[44413496]' > lr-x------ 1 root root 64 Nov 2 11:33 10 -> 'pipe:[44459532]' > l-wx------ 1 root root 64 Nov 2 11:33 11 -> 'pipe:[44459532]' > lr-x------ 1 root root 64 Nov 2 11:33 12 -> 'pipe:[44429590]' > l-wx------ 1 root root 64 Nov 2 11:33 13 -> 'pipe:[44429590]' > lrwx------ 1 root root 64 Nov 2 11:33 14 -> 'socket:[44444783]' > lrwx------ 1 root root 64 Nov 2 11:33 15 -> 'socket:[44444784]' > l-wx------ 1 root root 64 Nov 2 11:33 16 -> /var/guix/offload/141.80.167= .140/0 When that happens, the guile process on the remote node that runs the =E2=80=98redirect=E2=80=99 code of =E2=80=98remote-daemon-channel=E2=80=99 = is stuck in select(2) with infinite timeout. 
Note on berlin the build nodes are still running Guile 2.2.2, vulnerable to the =E2=80=98select=E2=80=99 bug , which we = =E2=80=98redirect=E2=80=99 supposedly works around. Ludo=E2=80=99. From debbugs-submit-bounces@debbugs.gnu.org Sun Nov 25 11:17:26 2018 Received: (at 33239) by debbugs.gnu.org; 25 Nov 2018 16:17:26 +0000 Received: from localhost ([127.0.0.1]:47530 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1gQx66-0001G3-2D for submit@debbugs.gnu.org; Sun, 25 Nov 2018 11:17:26 -0500 Received: from eggs.gnu.org ([208.118.235.92]:42332) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1gQx65-0001C2-1o for 33239@debbugs.gnu.org; Sun, 25 Nov 2018 11:17:25 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1gQx5z-0003jH-2j for 33239@debbugs.gnu.org; Sun, 25 Nov 2018 11:17:19 -0500 X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=disabled version=3.3.2 Received: from fencepost.gnu.org ([2001:4830:134:3::e]:54377) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1gQx5y-0003jB-WA for 33239@debbugs.gnu.org; Sun, 25 Nov 2018 11:17:19 -0500 Received: from [2a01:e0a:1d:7270:af76:b9b:ca24:c465] (port=54332 helo=ribbon) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from ) id 1gQx5y-0004jC-Ou for 33239@debbugs.gnu.org; Sun, 25 Nov 2018 11:17:18 -0500 From: ludo@gnu.org (Ludovic =?utf-8?Q?Court=C3=A8s?=) To: 33239@debbugs.gnu.org Subject: Re: bug#33239: 'guix offload' regularly hangs in 'channel-get-exit-status' call References: <87k1lvrblp.fsf@gnu.org> Date: Sun, 25 Nov 2018 17:17:17 +0100 In-Reply-To: <87k1lvrblp.fsf@gnu.org> ("Ludovic \=\?utf-8\?Q\?Court\=C3\=A8s\=22'\?\= \=\?utf-8\?Q\?s\?\= message of "Fri, 02 Nov 2018 11:57:06 +0100") Message-ID: <87sgzpuo6q.fsf@gnu.org> User-Agent: Gnus/5.13 (Gnus v5.13) 
Emacs/26.1 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Received-From: 2001:4830:134:3::e X-Spam-Score: -5.0 (-----) X-Debbugs-Envelope-To: 33239 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -6.0 (------) Hello, ludo@gnu.org (Ludovic Court=C3=A8s) skribis: > The =E2=80=98guix offload=E2=80=99 processes on berlin regularly hang whi= le calling > =E2=80=98channel-get-exit-status=E2=80=99: The bug still shows up periodically on berlin but I haven=E2=80=99t found a= way to reproduce it in a controlled environment. Commit 63fd9f084a5e345d2edaeaf5e8f435a3130f9edc should make it less likely, we=E2=80=99ll see=E2=80=A6 Ludo=E2=80=99. From debbugs-submit-bounces@debbugs.gnu.org Sat Dec 22 11:49:54 2018 Received: (at 33239) by debbugs.gnu.org; 22 Dec 2018 16:49:54 +0000 Received: from localhost ([127.0.0.1]:60187 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1gakTJ-0003uW-Pg for submit@debbugs.gnu.org; Sat, 22 Dec 2018 11:49:53 -0500 Received: from hera.aquilenet.fr ([185.233.100.1]:46054) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1gakTI-0003uO-39 for 33239@debbugs.gnu.org; Sat, 22 Dec 2018 11:49:52 -0500 Received: from localhost (localhost [127.0.0.1]) by hera.aquilenet.fr (Postfix) with ESMTP id 2A9F91006 for <33239@debbugs.gnu.org>; Sat, 22 Dec 2018 17:49:51 +0100 (CET) X-Virus-Scanned: Debian amavisd-new at aquilenet.fr Received: from hera.aquilenet.fr ([127.0.0.1]) by localhost (hera.aquilenet.fr [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id AhjwcYBB_LlU for <33239@debbugs.gnu.org>; Sat, 22 Dec 2018 17:49:50 +0100 (CET) Received: from ribbon 
(unknown [IPv6:2a01:e0a:1d:7270:af76:b9b:ca24:c465]) by hera.aquilenet.fr (Postfix) with ESMTPSA id 2C401FCE for <33239@debbugs.gnu.org>; Sat, 22 Dec 2018 17:49:50 +0100 (CET) From: =?utf-8?Q?Ludovic_Court=C3=A8s?= To: 33239@debbugs.gnu.org Subject: Re: bug#33239: 'guix offload' regularly hangs in 'channel-get-exit-status' call References: <87k1lvrblp.fsf@gnu.org> Date: Sat, 22 Dec 2018 17:49:49 +0100 In-Reply-To: <87k1lvrblp.fsf@gnu.org> ("Ludovic \=\?utf-8\?Q\?Court\=C3\=A8s\=22'\?\= \=\?utf-8\?Q\?s\?\= message of "Fri, 02 Nov 2018 11:57:06 +0100") Message-ID: <874lb5ld5e.fsf@gnu.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/26.1 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Spam-Score: 1.0 (+) X-Debbugs-Envelope-To: 33239 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.0 (/) ludo@gnu.org (Ludovic Court=C3=A8s) skribis: > The =E2=80=98guix offload=E2=80=99 processes on berlin regularly hang whi= le calling > =E2=80=98channel-get-exit-status=E2=80=99: > > (gdb) bt > #0 0x00007f299fb330f1 in __GI___poll (fds=3D0x1dd58c0, nfds=3D1, timeout= =3D-1) at ../sysdeps/unix/sysv/linux/poll.c:29 > #1 0x00007f2994287577 in ssh_poll_ctx_dopoll () from target:/gnu/store/w= mpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4 > #2 0x00007f29942884d9 in ssh_handle_packets () from target:/gnu/store/wm= pg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4 > #3 0x00007f29942885ad in ssh_handle_packets_termination () from target:/= gnu/store/wmpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4 > #4 0x00007f2994275080 in ssh_channel_get_exit_status () from target:/gnu= /store/wmpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4 > #5 0x00007f29946dd11a in 
guile_ssh_channel_get_exit_status () from targe= t:/gnu/store/i3nfl17wfx7sryq6w15r9wxl7ilmq4rb-guile-ssh-0.11.3/lib/libguile= -ssh.so.11 > #6 0x00007f29a1765965 in vm_regular_engine (thread=3D0x1dd58c0, vp=3D0x1= d4df30, registers=3D0xffffffff, resume=3D-1615646479) at vm-engine.c:786 I was able to come up with a reduced test case for Guile-SSH: https://github.com/artyom-poptsov/guile-ssh/issues/11 Ludo=E2=80=99. From debbugs-submit-bounces@debbugs.gnu.org Tue Dec 25 11:49:03 2018 Received: (at 33239) by debbugs.gnu.org; 25 Dec 2018 16:49:03 +0000 Received: from localhost ([127.0.0.1]:36388 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1gbpt9-00054p-I1 for submit@debbugs.gnu.org; Tue, 25 Dec 2018 11:49:03 -0500 Received: from hera.aquilenet.fr ([185.233.100.1]:37318) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1gbpt8-00054g-7E for 33239@debbugs.gnu.org; Tue, 25 Dec 2018 11:49:02 -0500 Received: from localhost (localhost [127.0.0.1]) by hera.aquilenet.fr (Postfix) with ESMTP id B122B1315 for <33239@debbugs.gnu.org>; Tue, 25 Dec 2018 17:49:01 +0100 (CET) X-Virus-Scanned: Debian amavisd-new at aquilenet.fr Received: from hera.aquilenet.fr ([127.0.0.1]) by localhost (hera.aquilenet.fr [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id IIZ1E8_Eazms for <33239@debbugs.gnu.org>; Tue, 25 Dec 2018 17:49:01 +0100 (CET) Received: from ribbon (unknown [IPv6:2a01:e0a:1d:7270:af76:b9b:ca24:c465]) by hera.aquilenet.fr (Postfix) with ESMTPSA id DD7BC12CF for <33239@debbugs.gnu.org>; Tue, 25 Dec 2018 17:49:00 +0100 (CET) From: =?utf-8?Q?Ludovic_Court=C3=A8s?= To: 33239@debbugs.gnu.org Subject: Re: bug#33239: 'guix offload' regularly hangs in 'channel-get-exit-status' call References: <87k1lvrblp.fsf@gnu.org> <874lb5ld5e.fsf@gnu.org> Date: Tue, 25 Dec 2018 17:49:00 +0100 In-Reply-To: <874lb5ld5e.fsf@gnu.org> ("Ludovic \=\?utf-8\?Q\?Court\=C3\=A8s\=22'\?\= \=\?utf-8\?Q\?s\?\= message of "Sat, 22 Dec 2018 17:49:49 +0100") 
Message-ID: <87zhsteemb.fsf@gnu.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/26.1 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Spam-Score: 1.0 (+) X-Debbugs-Envelope-To: 33239 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: debbugs-submit-bounces@debbugs.gnu.org Sender: "Debbugs-submit" X-Spam-Score: -0.0 (/) Hello! Ludovic Court=C3=A8s skribis: > ludo@gnu.org (Ludovic Court=C3=A8s) skribis: > >> The =E2=80=98guix offload=E2=80=99 processes on berlin regularly hang wh= ile calling >> =E2=80=98channel-get-exit-status=E2=80=99: >> >> (gdb) bt >> #0 0x00007f299fb330f1 in __GI___poll (fds=3D0x1dd58c0, nfds=3D1, timeou= t=3D-1) at ../sysdeps/unix/sysv/linux/poll.c:29 >> #1 0x00007f2994287577 in ssh_poll_ctx_dopoll () from target:/gnu/store/= wmpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4 >> #2 0x00007f29942884d9 in ssh_handle_packets () from target:/gnu/store/w= mpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4 >> #3 0x00007f29942885ad in ssh_handle_packets_termination () from target:= /gnu/store/wmpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4 >> #4 0x00007f2994275080 in ssh_channel_get_exit_status () from target:/gn= u/store/wmpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4 >> #5 0x00007f29946dd11a in guile_ssh_channel_get_exit_status () from targ= et:/gnu/store/i3nfl17wfx7sryq6w15r9wxl7ilmq4rb-guile-ssh-0.11.3/lib/libguil= e-ssh.so.11 >> #6 0x00007f29a1765965 in vm_regular_engine (thread=3D0x1dd58c0, vp=3D0x= 1d4df30, registers=3D0xffffffff, resume=3D-1615646479) at vm-engine.c:786 > > I was able to come up with a reduced test case for Guile-SSH: > > https://github.com/artyom-poptsov/guile-ssh/issues/11 It turned out that the code to start a REPL server in (ssh dist node) would currently hang, as I wrote in the bug 
report above.

After investigation, I decided that inferiors are more appropriate than
Guile-SSH’s node to address this use case, after all.  Commit
ed7b44370f71126087eb953f36aad8dc4c44109f changes ‘guix offload’ to
inferiors.

As a result, build machines must now run Guix > 0.15.0, which provides
‘guix repl’.  That in turn simplifies setup of build machines: no need
to fiddle with GUILE_LOAD_PATH.

On berlin, build machines were running an older Guix so I copied a
recently pulled Guix on each of them and installed it in
~/.config/guix/current.  They’re now operational, except for the ARMv7
one which is still pulling.

So far it seems to be working well but we’ll have to keep an eye on it.

Thanks,
Ludo’.

From debbugs-submit-bounces@debbugs.gnu.org Wed Jan 09 15:37:15 2019
Received: (at 33239-done) by debbugs.gnu.org; 9 Jan 2019 20:37:15 +0000
Received: from localhost ([127.0.0.1]:52581 helo=debbugs.gnu.org)
 by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from )
 id 1ghKbC-0007EG-UM for submit@debbugs.gnu.org; Wed, 09 Jan 2019 15:37:15 -0500
Received: from hera.aquilenet.fr ([185.233.100.1]:50148)
 by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from )
 id 1ghKb8-0007E3-Ek for 33239-done@debbugs.gnu.org; Wed, 09 Jan 2019 15:37:10 -0500
Received: from localhost (localhost [127.0.0.1])
 by hera.aquilenet.fr (Postfix) with ESMTP id D34931588
 for <33239-done@debbugs.gnu.org>; Wed, 9 Jan 2019 21:37:09 +0100 (CET)
X-Virus-Scanned: Debian amavisd-new at aquilenet.fr
Received: from hera.aquilenet.fr ([127.0.0.1])
 by localhost (hera.aquilenet.fr [127.0.0.1]) (amavisd-new, port 10024)
 with ESMTP id EuFKQDXSy5Qw for <33239-done@debbugs.gnu.org>;
 Wed, 9 Jan 2019 21:37:09 +0100 (CET)
Received: from ribbon (unknown [IPv6:2a01:e0a:1d:7270:af76:b9b:ca24:c465])
 by hera.aquilenet.fr (Postfix) with ESMTPSA id 1465C104E
 for <33239-done@debbugs.gnu.org>; Wed, 9 Jan 2019 21:37:08 +0100 (CET)
From:
 Ludovic Courtès
To: 33239-done@debbugs.gnu.org
Subject: Re: bug#33239: 'guix offload' regularly hangs in 'channel-get-exit-status' call
References: <87k1lvrblp.fsf@gnu.org> <874lb5ld5e.fsf@gnu.org> <87zhsteemb.fsf@gnu.org>
Date: Wed, 09 Jan 2019 21:37:08 +0100
In-Reply-To: <87zhsteemb.fsf@gnu.org> ("Ludovic Courtès"'s message of "Tue, 25 Dec 2018 17:49:00 +0100")
Message-ID: <87ef9l36vf.fsf@gnu.org>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/26.1 (gnu/linux)
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
X-Debbugs-Envelope-To: 33239-done

Ludovic Courtès skribis:

> Ludovic Courtès skribis:
>
>> ludo@gnu.org (Ludovic Courtès) skribis:
>>
>>> The ‘guix offload’ processes on berlin regularly hang while calling
>>> ‘channel-get-exit-status’:
>>>
>>> (gdb) bt
>>> #0 0x00007f299fb330f1 in __GI___poll (fds=0x1dd58c0, nfds=1, timeout=-1) at ../sysdeps/unix/sysv/linux/poll.c:29
>>> #1 0x00007f2994287577 in ssh_poll_ctx_dopoll () from target:/gnu/store/wmpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4
>>> #2 0x00007f29942884d9 in ssh_handle_packets () from target:/gnu/store/wmpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4
>>> #3 0x00007f29942885ad in ssh_handle_packets_termination () from target:/gnu/store/wmpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4
>>> #4 0x00007f2994275080 in ssh_channel_get_exit_status () from target:/gnu/store/wmpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4
>>> #5 0x00007f29946dd11a in guile_ssh_channel_get_exit_status () from target:/gnu/store/i3nfl17wfx7sryq6w15r9wxl7ilmq4rb-guile-ssh-0.11.3/lib/libguile-ssh.so.11
>>> #6 0x00007f29a1765965 in vm_regular_engine (thread=0x1dd58c0, vp=0x1d4df30, registers=0xffffffff, resume=-1615646479) at vm-engine.c:786
>>
>> I was able to come up with a reduced test case for Guile-SSH:
>>
>> https://github.com/artyom-poptsov/guile-ssh/issues/11
>
> It turned out that the code to start a REPL server in (ssh dist node)
> would currently hang, as I wrote in the bug report above.
>
> After investigation, I decided that inferiors are more appropriate than
> Guile-SSH’s node to address this use case, after all.  Commit
> ed7b44370f71126087eb953f36aad8dc4c44109f changes ‘guix offload’ to
> inferiors.

It looks like this commit fixed the bug above, so I’m closing it.

There are still occasional hangs in ‘ssh_handle_packets_termination’
though while reading from a channel but AFAICS that’s a different issue.

Ludo’.

From unknown Fri Jun 13 10:11:27 2025
Received: (at fakecontrol) by fakecontrolmessage;
To: internal_control@debbugs.gnu.org
From: Debbugs Internal Request
Subject: Internal Control
Message-Id: bug archived.
Date: Thu, 07 Feb 2019 12:24:08 +0000
User-Agent: Fakemail v42.6.9

# This is a fake control message.
#
# The action:
# bug archived.
thanks
# This fakemail brought to you by your local debbugs
# administrator