<div dir="ltr">Oh, I did not realize the very old twisted version. you can try to downgrade on the worker indeed.<div><br><div>I see no reason not to upgrade twisted on master, though</div></div><div><br></div><div>Pierre</div></div><br><div class="gmail_quote"><div dir="ltr">On Tue, Jun 20, 2017 at 2:45 PM Colin Chargy <<a href="mailto:Colin.Chargy@bentley.com">Colin.Chargy@bentley.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div lang="EN-US" link="blue" vlink="purple">
<div class="m_-5725504791902163159WordSection1">
<p class="MsoNormal">Hi Pierre,<u></u><u></u></p>
<p class="MsoNormal">I tested what you suggested :<u></u><u></u></p>
<p class="MsoNormal">$ buildslave --version<u></u><u></u></p>
<p class="MsoNormal">Buildslave version: 0.8.8<u></u><u></u></p>
<p class="MsoNormal">Twisted version: 17.5.0<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">This does not change the behavior. Should I test with another twisted version ?<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Regards,<u></u><u></u></p>
<p class="MsoNormal">Colin<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal"></p></div></div><div lang="EN-US" link="blue" vlink="purple"><div class="m_-5725504791902163159WordSection1"><p class="MsoNormal"><b>From:</b> Pierre Tardy [mailto:<a href="mailto:tardyp@gmail.com" target="_blank">tardyp@gmail.com</a>] <br>
</p></div></div><div lang="EN-US" link="blue" vlink="purple"><div class="m_-5725504791902163159WordSection1"><p class="MsoNormal"><b>Sent:</b> Tuesday, June 20, 2017 14:15</p></div></div><div lang="EN-US" link="blue" vlink="purple"><div class="m_-5725504791902163159WordSection1"><p class="MsoNormal"><br>
<b>To:</b> Colin Chargy <<a href="mailto:Colin.Chargy@bentley.com" target="_blank">Colin.Chargy@bentley.com</a>>; <a href="mailto:users@buildbot.net" target="_blank">users@buildbot.net</a><br>
<b>Subject:</b> Re: [<a href="mailto:users@bb.net" target="_blank">users@bb.net</a>] FW: Issue connecting slave to master<u></u><u></u></p></div></div><div lang="EN-US" link="blue" vlink="purple"><div class="m_-5725504791902163159WordSection1">
<p class="MsoNormal"><u></u> <u></u></p>
<div>
<p class="MsoNormal">Colin,<u></u><u></u></p>
<div>
<p class="MsoNormal">Its a bit harder to me to efficiently help you as 0.8.8 is quite an old version. I imagine upgrading is not an option..<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">it might be an incompatibility of the slave version string. We usually try to maintain compatibility for new master version to old slave version, but we might not always take care of supporting running new slaves with older master.<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Did you try downgrading your slave version to 0.8.8?<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
<div>
<p class="MsoNormal">Pierre<u></u><u></u></p>
</div>
</div>
<p class="MsoNormal"><u></u> <u></u></p>
<div>
<div>
<p class="MsoNormal">On Tue, Jun 20, 2017 at 11:53 AM Colin Chargy <<a href="mailto:Colin.Chargy@bentley.com" target="_blank">Colin.Chargy@bentley.com</a>> wrote:<u></u><u></u></p>
</div>
<blockquote style="border:none;border-left:solid #cccccc 1.0pt;padding:0in 0in 0in 6.0pt;margin-left:4.8pt;margin-right:0in">
<div>
<div>
<p class="MsoNormal">Hi Pierre,<u></u><u></u></p>
<p class="MsoNormal">Thanks for your reply.<u></u><u></u></p>
<p class="MsoNormal">Indeed, I’ve seen in the failedToGetPerspective doc that it could fail with a wrong login password. However, the slave name and password seems correct (ie the same on the slave
.toc file and on the server config). We also tested multiple login/password couple to see if that changes anything (with no luck). The TCP dump seems to show that the last things which are sent are the host name and slave info which are the default one (I
tried modify them with no luck). What happen after/inside failedToGetPerspective ? Does the connection changes port/connection/setting or anything else at this point ?<u></u><u></u></p>
<p class="MsoNormal"> <u></u><u></u></p>
<p class="MsoNormal">I should probably add about info our set up : the server runs 2 buildbot masters and the slave computer also 2 buildbot slave (one for each master). We do have other computer that
work that way without any problem. Of course, we checked that each slave is connecting to the correct master. Only one of the slave/master couple fails (and as already said, only on this computer).<u></u><u></u></p>
<p class="MsoNormal"> <u></u><u></u></p>
<p class="MsoNormal">Best regards,<u></u><u></u></p>
<p class="MsoNormal">Colin Chargy<u></u><u></u></p>
<p class="MsoNormal"> <u></u><u></u></p>
<p class="MsoNormal"><b>From:</b> Pierre Tardy [mailto:<a href="mailto:tardyp@gmail.com" target="_blank">tardyp@gmail.com</a>]
<br>
<b>Sent:</b> Tuesday, June 20, 2017 11:41<br>
<b>To:</b> Colin Chargy <<a href="mailto:Colin.Chargy@bentley.com" target="_blank">Colin.Chargy@bentley.com</a>>;
<a href="mailto:users@buildbot.net" target="_blank">users@buildbot.net</a><br>
<b>Subject:</b> Re: [<a href="mailto:users@bb.net" target="_blank">users@bb.net</a>] FW: Issue connecting slave to master<u></u><u></u></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal"> <u></u><u></u></p>
<div>
<div>
<p class="MsoNormal">Hi Colin<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal">Could that be a problem with your slave password?<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"> <u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"> <u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"> def failedToGetPerspective(self, why):<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"> """The login process failed, most likely because of an authorization<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"> failure (bad password), but it is also possible that we lost the new<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"> connection before we managed to send our credentials.<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"> """<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"> log.msg("ReconnectingPBClientFactory.failedToGetPerspective")<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"> if why.check(pb.PBConnectionLost):<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"> log.msg("we lost the brand-new connection")<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"> # retrying might help here, let clientConnectionLost decide<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"> return<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"> # probably authorization<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"> self.stopTrying() # logging in harder won't help<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"> log.err(why)<u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"> <u></u><u></u></p>
</div>
</div>
<p class="MsoNormal"> <u></u><u></u></p>
<div>
<div>
<p class="MsoNormal">On Tue, Jun 20, 2017 at 9:18 AM Colin Chargy <<a href="mailto:Colin.Chargy@bentley.com" target="_blank">Colin.Chargy@bentley.com</a>> wrote:<u></u><u></u></p>
</div>
<blockquote style="border:none;border-left:solid #cccccc 1.0pt;padding:0in 0in 0in 6.0pt;margin-left:4.8pt;margin-top:5.0pt;margin-right:0in;margin-bottom:5.0pt">
<p class="MsoNormal">Hi everyone,<br>
Before I start describing my issue, let me say to we have dozen of slaves (Win, Mac and Linux platform perfectly working right now), only one is problematic :<br>
We are facing an issue with slave connection to master. Here is the log on the slave side (see enclosed twisted.log for complete log) :<br>
[Broker,client] message from master: attached [Broker,client] ReconnectingPBClientFactory.failedToGetPerspective<br>
[Broker,client] we lost the brand-new connection [Broker,client] Lost connection to
<a href="http://192.168.0.1:9989" target="_blank">192.168.0.1:9989</a> [Broker,client] <twisted.internet.tcp.Connector instance at 0x03471918> will retry in 3 seconds<br>
<br>
And it starts it again.<br>
On the server side, the following log is produced :<br>
2017-06-19 16:11:27+0200 [Broker,9423,192.168.0.254] slave 'lrttestauto-test' attaching from IPv4Address(TCP, '192.168.0.254', 35524)<br>
2017-06-19 16:11:27+0200 [Broker,9423,192.168.0.254] Starting buildslave keepalive timer for 'lrttestauto-test'<br>
2017-06-19 16:11:27+0200 [Broker,9423,192.168.0.254] Peer will receive following PB traceback:<br>
2017-06-19 16:11:27+0200 [Broker,9423,192.168.0.254] Unhandled Error<br>
Traceback (most recent call last):<br>
Failure: twisted.spread.pb.PBConnectionLost: [Failure instance: Traceback (failure with no frames): <class 'twisted.internet.error.ConnectionLost'>: Connection to the other side was lost in a non-clean fashion.<br>
]<br>
<br>
I've checked that the login and password are correct and Buildbot version are the following :<br>
On the server-side (which is a Debian):<br>
Buildbot version: 0.8.8<br>
Twisted version: 12.3.0<br>
<br>
On the slave side (which is a Windows 10, buildslave installed via pip):<br>
Buildslave version: 0.8.14<br>
Twisted version: 17.5.0<br>
<br>
I've enclosed the slave log, the slave tac file and a tcpdump showing data transfer between slave and server (I've tried to debug it with Wireshark with no luck).<br>
<br>
What can I do to debug or to solve this issue ?<br>
<br>
Best regards,<br>
Colin Chargy<br>
_______________________________________________<br>
users mailing list<br>
<a href="mailto:users@buildbot.net" target="_blank">users@buildbot.net</a><br>
<a href="https://lists.buildbot.net/mailman/listinfo/users" target="_blank">https://lists.buildbot.net/mailman/listinfo/users</a><u></u><u></u></p>
</blockquote>
</div>
</div>
</div>
</blockquote>
</div>
</div></div></blockquote></div>