| Summary: | error deleting a directory | ||
|---|---|---|---|
| Product: | RFT | Reporter: | Stuart Martin <smartin@mcs.anl.gov> |
| Component: | RFT | Assignee: | Ravi Madduri <madduri@mcs.anl.gov> |
| Status: | RESOLVED FIXED | ||
| Severity: | normal | CC: | allcock@mcs.anl.gov, childers@mcs.anl.gov, lane@mcs.anl.gov, madduri@mcs.anl.gov, rynge@isi.edu, smartin@mcs.anl.gov |
| Priority: | P3 | ||
| Version: | 4.0.1 | ||
| Target Milestone: | 4.1 | ||
| Hardware: | Macintosh | ||
| OS: | All | ||
RFT is complaining that the server couldn't delete the sleep.sh file specifically. The server wouldn't report any particular file in the error message if the problem were with the RDEL, so perhaps RFT is attempting to delete sleep.sh separately? I would expect RFT to send only the RDEL on the parent directory, which would handle everything.
What version of the GridFTP server is the test invoking for deletion? RFT will send RDEL only if the server supports RDEL. Otherwise it sends MLST, makes a list of files, and calls delete on each one of them.
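The choice described in this comment can be sketched as a small planning function. This is only an illustration of the stated behavior, not RFT's actual code: `plan_directory_delete`, the `server_supports_rdel` flag, and the `list_files` callback are hypothetical names, and how RFT really detects RDEL support or lists the directory is not shown in this thread. Removing the directory itself in the fallback case is left out because the thread does not describe it.

```python
from typing import Callable, List, Tuple

Command = Tuple[str, str]  # (verb, path); hypothetical representation


def plan_directory_delete(
    directory: str,
    server_supports_rdel: bool,
    list_files: Callable[[str], List[str]],
) -> List[Command]:
    """Return the commands a client would send to delete `directory`.

    Assumption: `list_files` stands in for whatever listing step the
    client performs (the comment above mentions MLST).
    """
    if server_supports_rdel:
        # A single recursive delete on the parent directory covers everything.
        return [("RDEL", directory)]
    # Fallback: enumerate the files, then delete each one individually.
    return [("DELE", path) for path in list_files(directory)]
```

With a server that supports RDEL, no per-file DELE should ever appear in the plan, which is why a DELE on sleep.sh against a 4.0.1 server looks out of place.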
From one of the gram debug container log files,
>>>>>
fileStageIn destinationUrl after: gsiftp://skynet-7.isi.edu:2811/scratch/rynge/jobs/5864c970-9ecf-11da-8c83-ab803b611239/sleep.sh
<<<<<
it looks like the GT container gridftp server is "skynet-7.isi.edu"
[macdaddy:~] smartin% telnet skynet-7.isi.edu 2811
Trying 128.9.233.17...
Connected to skynet-7.isi.edu.
Escape character is '^]'.
220 skynet-7.isi.edu GridFTP Server 2.1 (gcc32, 1122653280-63) ready.
OK, that's a 4.0.1 server, and Ravi says RDEL should have been used. Please send logs: start the GridFTP server with '-d all -l <logfile>' and the env var 'GLOBUS_GRIDFTP_SERVER_FILE_DEBUG=ALL,<debuglogfile>'. An RFT log wouldn't hurt either.
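For anyone scripting the debug setup requested here, a minimal sketch follows. It only assembles the command line and environment from the flags and variable quoted in the comment. The helper name `gridftp_debug_launch`, the default binary name `globus-gridftp-server`, and the log paths used in the usage note are assumptions; any other options a real deployment needs are not covered.

```python
import os
from typing import Dict, List, Tuple


def gridftp_debug_launch(
    logfile: str,
    debug_logfile: str,
    server_binary: str = "globus-gridftp-server",  # assumed binary name
) -> Tuple[List[str], Dict[str, str]]:
    """Build argv and environment for a GridFTP server with full logging."""
    # Flags exactly as quoted in the comment: -d all -l <logfile>
    argv = [server_binary, "-d", "all", "-l", logfile]
    # Copy the current environment and add the debug variable.
    env = dict(os.environ)
    env["GLOBUS_GRIDFTP_SERVER_FILE_DEBUG"] = f"ALL,{debug_logfile}"
    return argv, env
```

The returned pair can be handed to `subprocess.Popen(argv, env=env)`, for example with the hypothetical paths `/tmp/gridftp.log` and `/tmp/gridftp-debug.log`.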
Now I guess I need to see the GridFTP logs, if they are available.
There is a possibility that this is related to bug 3840. Look at transfer 10:
[Thread-68,getDeleteClient:382] [Request 5, Transfer 10] deleting gsiftp://viz-1.isi.edu:2811/scratch/rynge/jobs/e5e50980-4633-11da-8b4c-d7d91db7639c/sleep.sh
Note that sleep.sh was not explicitly mentioned for deletion in the rsl, and
that it looks like RFT (and/or GRAM) is confusing a stagein request for a
deletion request.
Test with deletion failure and gridftp logs: http://skynet-login.isi.edu/gram-testing/test-details.php?uuid=2dabb058-7735-4de2-a85f-fba8fb7e3c1b
The gridftp log agrees with the 3840 theory. For the particular work dir with the DELE X/sleep.sh failure, the order of operations involving it is:
mkdir X
dele X/sleep.sh (fail)
stor X/.ignoreme
rdel X
One way or another, that DELE makes no sense on a freshly created (within seconds) dir.
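The pattern in that sequence (a DELE inside a directory that was just created, for a file that was never stored there) can be checked mechanically. The sketch below is an assumption-laden illustration: it takes operations as `(verb, path)` tuples using the shorthand from the comment above, not the real GridFTP log format, and `suspicious_deletes` is a hypothetical helper name.

```python
from typing import List, Tuple


def suspicious_deletes(ops: List[Tuple[str, str]]) -> List[str]:
    """Return paths deleted inside a directory created earlier in `ops`
    when no earlier `stor` of that same path was seen."""
    created_dirs = set()
    stored = set()
    flagged = []
    for verb, path in ops:
        verb = verb.lower()
        if verb == "mkdir":
            created_dirs.add(path.rstrip("/"))
        elif verb == "stor":
            stored.add(path)
        elif verb == "dele":
            # Parent directory is everything before the last slash.
            parent = path.rsplit("/", 1)[0] if "/" in path else ""
            if parent in created_dirs and path not in stored:
                flagged.append(path)
    return flagged
```

Feeding it the four operations listed above flags `X/sleep.sh`, while a sequence that stores the file before deleting it flags nothing.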
Mats, would it be possible for you to repeat this test with RFT logging turned on? http://skynet-login.isi.edu/gram-testing/test-details.php?uuid=2dabb058-7735-4de2-a85f-fba8fb7e3c1b
Looks like test 70 has the same problem occurring, and RFT logging is enabled for that one.
My fix to trunk seems to fix this bug. I merged the fix to the release branch and am waiting for results.
Looks like this is fixed now in release and head. Closing the bug.