[Buildbot] #2833: probably deadlock situations with MasterLock

Buildbot trac trac at buildbot.net
Thu Jul 3 08:18:12 UTC 2014


#2833: probably deadlock situations with MasterLock
----------------------+-----------------------
Reporter:  lunochod   |      Owner:
    Type:  undecided  |     Status:  new
Priority:  major      |  Milestone:  undecided
 Version:  0.8.9      |   Keywords:
----------------------+-----------------------
 Hi,

 I have the following constellation:

 Many builders build on two slaves which share a common hard disk.
 A SlaveLock exists that allows 2 resp. 4 parallel builds on each slave.

 The builders write (make install) and read (make, make test) libraries
 from a common directory.
 Further, builders trigger other builders that use the provided libraries.
 Sometimes builds fail because a library is linked when it is just written.

 Therefore I introduced a MasterLock 'harddisk_lock'. The 'make' and 'make
 test' ShellCommands have counting access, whereas the maximum number of
 accessing commands is 1000 which should be more than enough.
 The 'make install' commands have exclusive access.

 Now the strange behaviour:
 Sometimes there are only 'make' commands waiting for the harddisk lock and
 all builds stand still.
 The last two lines in twisted.log say:
 "aquireLocks ..."
 "step ... waiting for lock"
 After some minutes some lines in the logging say:
 "automatically retrying query after OperationalError (1.0s sleep)"
 followed by a
 "releaseLocks ..."

 From then on the builders are building as expected until the next "hang".
 These "hangs" are rather seldom but occure persistently.

 Sorry for this vague problem report. I don't know where to start to track
 this down further.

--
Ticket URL: <http://trac.buildbot.net/ticket/2833>
Buildbot <http://buildbot.net/>
Buildbot: build/test automation


More information about the bugs mailing list