Bugzilla – Bug 4331
Jobs disappear from state machine under heavy loads.
Last modified: 2006-04-06 09:53:32
Jobs seem to be disappearing from the state machine before they can reach Done
or Failed when a large number of staging jobs are requested.
The remove() Destroy operation callback wasn't making sure that the external
state of the job was Done or Failed (just the internal state) before removing
the resource. Since the state machine sets the internal state before it
actually processes that state, the job could disappear before the terminal
states could be processed or a terminal state notification could be sent. A fix
for the race condition is in the globus_4_0_branch.
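
The race described above can be illustrated with a minimal sketch. This is not the code from globus_4_0_branch; the names Job, begin_transition, finish_transition, can_remove_buggy and can_remove_fixed are hypothetical, and the model assumes only what the comment states: the internal state is set first, and the external state and notification follow once the state has been processed.

```python
from enum import Enum


class State(Enum):
    ACTIVE = "Active"
    DONE = "Done"
    FAILED = "Failed"


TERMINAL = {State.DONE, State.FAILED}


class Job:
    """Hypothetical stand-in for a job resource with two state fields."""

    def __init__(self):
        self.internal_state = State.ACTIVE
        self.external_state = State.ACTIVE
        self.notifications = []

    def begin_transition(self, new_state):
        # The state machine records the internal state before processing it.
        self.internal_state = new_state

    def finish_transition(self):
        # Processing is complete: publish the state and notify subscribers.
        self.external_state = self.internal_state
        self.notifications.append(self.external_state)


def can_remove_buggy(job):
    # Original behaviour as described: only the internal state is checked.
    return job.internal_state in TERMINAL


def can_remove_fixed(job):
    # Assumed shape of the fix: the external state must be terminal as well.
    return job.internal_state in TERMINAL and job.external_state in TERMINAL


job = Job()
job.begin_transition(State.DONE)

# A Destroy request arriving in this window hits the race.
mid_buggy = can_remove_buggy(job)
mid_fixed = can_remove_fixed(job)

job.finish_transition()
after_fixed = can_remove_fixed(job)
```

In the window between begin_transition and finish_transition the buggy check already allows removal, so the resource can vanish before the Done notification is sent. The stricter check refuses until the external state has caught up.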