[Buildbot-devel] lost remote - slave cannot keep connection with master

Aaron Maxwell amax at snaplogic.org
Mon Feb 25 18:51:12 UTC 2008


Hi all,
I'm suddenly having a problem today with my buildbot installation.  We have a slave node, "linbot1", which is central for our QA setup.  Currently it's nonfunctional:  linbot1 is unable to keep its connection to the master for more than a few seconds.  This is repeated in the logs:
{{{
2008/02/25 12:43 -0700 [Broker,client] message from master: attached
2008/02/25 12:43 -0700 [Broker,client] SlaveBuilder.remote_print(slave_builder): message from master: attached
2008/02/25 12:43 -0700 [Broker,client] sending application-level keepalives every 600 seconds
2008/02/25 12:43 -0700 [Broker,client] lost remote
2008/02/25 12:43 -0700 [Broker,client] <twisted.internet.tcp.Connector instance at 0xb792318c> will retry in 2 seconds
2008/02/25 12:43 -0700 [Broker,client] Stopping factory <buildbot.slave.bot.BotFactory instance at 0xb791ec4c>
2008/02/25 12:43 -0700 [-] Starting factory <buildbot.slave.bot.BotFactory instance at 0xb791ec4c>
2008/02/25 12:43 -0700 [Broker,client] message from master: attached
}}}
I'm having trouble just debugging this and figuring out what is going on.  Any suggestions, advice, etc. is appreciated.  Thanks.
PS This happened after I deployed a significant number of changes from my development QA-system sandbox to the live production version.  The dev and production versions are pretty tightly controlled, and (unless I miss a spot) essentially identical to each other.  This connection-dropping does not exist on the development version, just production.
--
Aaron Maxwell .:. amax at snaplogic.org .:. http://snaplogic.orgSnapLogic, Inc. - Data Integration for the Last Mile
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://buildbot.net/pipermail/devel/attachments/20080225/13ad018b/attachment.html>


More information about the devel mailing list