[Buildbot-commits] [Buildbot] #2454: SiGHUP doesn't always work
Buildbot trac
trac at buildbot.net
Sat May 4 22:16:13 UTC 2013
#2454: SiGHUP doesn't always work
--------------------+------------------------
Reporter: virgilg | Owner:
Type: defect | Status: new
Priority: major | Milestone: undecided
Version: 0.8.7p1 | Resolution:
Keywords: |
--------------------+------------------------
Old description:
> Every now and then on 0.8.5 and more often on 0.8.7p1 we see at
> reconfigure time:
> sending SIGHUP to process 41208
> Never saw reconfiguration finish.
>
> The fix is generally to restart the master, but the problem with this
> approach is it's going to stop everybody else's builds from happening (we
> get slaves lost due to the time it takes to reconfigure - way less than
> the 10 minutes timeout, but we lose them still).
>
> Doing $ kill -SIGHUP 41208 doesn't produce anything in twistd.log, so it
> appears to be indeed stuck.
>
> How can we make this rock-solid?
New description:
Every now and then on 0.8.5 and more often on 0.8.7p1 we see at
reconfigure time:
sending SIGHUP to process 41208
Never saw reconfiguration finish.
The fix is generally to restart the master, but the problem with this
approach is it's going to stop everybody else's builds from happening (we
get slaves lost due to the time it takes to reconfigure - way less than
the 10 minutes timeout, but we lose them still).
Doing $ kill -SIGHUP 41208 doesn't produce anything in twistd.log, so it
appears to be indeed stuck.
How can we make this rock-solid?
--
Comment (by dustin):
Did you try adding the debugging code in comment 3? That would help to
narrow this down.
--
Ticket URL: <http://trac.buildbot.net/ticket/2454#comment:6>
Buildbot <http://buildbot.net/>
Buildbot: build/test automation
More information about the Commits
mailing list