GNU bug report logs - #40981
shepherd 0.8.0 race condition can lead to stopping itself

Previous Next

Package: guix;

Reported by: Mathieu Othacehe <m.othacehe <at> gmail.com>

Date: Thu, 30 Apr 2020 11:52:02 UTC

Severity: important

Merged with 41429

Done: Mathieu Othacehe <mathieu <at> meru.i-did-not-set--mail-host-address--so-tickle-me>

Bug is archived. No further changes may be made.

Full log


View this message in rfc822 format

From: Mathieu Othacehe <m.othacehe <at> gmail.com>
To: 40981 <at> debbugs.gnu.org
Subject: bug#40981: Graphical installer tests sometimes hang.
Date: Mon, 04 May 2020 14:50:43 +0200
> I'll keep looking!

Ok, getting closer. Here's a suspect part of Shepherd strace log:

--8<---------------cut here---------------start------------->8---
[pid     1] stat("/etc/localtime", {st_mode=S_IFREG|0444, st_size=2298, ...}) = 0
[pid     1] write(9, "shepherd[1]: changing HTTP/HTTPS"..., 86) = 86
[pid     1] getpgid(194)                = 194
[pid     1] kill(-194, SIGTERM)         = 0
--8<---------------cut here---------------end--------------->8---

I think the problem is introduced by commit
1e7a91d21f1cc5d02697680e19e3878ff8565710 in Shepherd.

"(getpgid <guix-daemon-pid>") returns 0, and calling "(kill 0 SIGTERM)"
kills all processes.

Now, I really don't get how guix-daemon pgid could be zero. Man page of
setpgid(2) says:

--8<---------------cut here---------------start------------->8---
   A child created via fork(2) inherits its parent's process group ID.  The PGID  is
   preserved across an execve(2).
--8<---------------cut here---------------end--------------->8---

WDYT?

Thanks,

Mathieu




This bug report was last modified 4 years and 341 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.