Bugzilla – Bug 4331
Jobs disappear from state machine under heavy loads.
Last modified: 2006-04-06 09:53:32
Jobs seem to be disappearing from the state machine before they can reach Done or Failed when a large number of staging jobs are requested.
The remove() Destroy operation callback checked only the internal state of the job, not the external state, for Done or Failed before removing the resource. Because the state machine sets the internal state before it actually processes that state, the job could disappear before the terminal state was processed or a terminal state notification was sent. A fix for the race condition is in the globus_4_0_branch.
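For illustration only, here is a minimal Java sketch of the kind of check described above. It is not the actual GRAM code: the names JobState, JobResourceSketch, setInternalState, publishState, canRemoveBuggy and canRemoveFixed are all hypothetical, and the two-step transition is an assumption based on the description in this comment.

```java
// Hypothetical sketch of the race; none of these names come from the GRAM source.
enum JobState { PENDING, ACTIVE, DONE, FAILED }

final class JobResourceSketch {
    // State the state machine has chosen; set BEFORE that state is processed.
    private JobState internalState = JobState.PENDING;
    // State that has been fully processed and announced to clients.
    private JobState externalState = JobState.PENDING;

    private static boolean isTerminal(JobState s) {
        return s == JobState.DONE || s == JobState.FAILED;
    }

    // Step 1 of a transition: record the next state.
    synchronized void setInternalState(JobState next) {
        internalState = next;
    }

    // Step 2 of a transition: processing is finished and notifications are sent.
    synchronized void publishState() {
        externalState = internalState;
    }

    // Check as described in the bug: only the internal state is consulted,
    // so removal is allowed between step 1 and step 2.
    synchronized boolean canRemoveBuggy() {
        return isTerminal(internalState);
    }

    // Check as described in the fix: the external state must be terminal too.
    synchronized boolean canRemoveFixed() {
        return isTerminal(internalState) && isTerminal(externalState);
    }
}
```

In this sketch, a destroy request that arrives after setInternalState(JobState.DONE) but before publishState() passes the first check and fails the second, which is the window in which the job could vanish.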