Weird TS3500 problem: redux

Well, after yesterday’s episode with our tape library today continued to be a taxing day. After restarting a few exports that were hanging yesterday due to our library problems, something similar returned. TSM was unable to locate a few (two to be exact) tapes in the library.

Yet the library reported the tapes were still inventoried. *shrug* Here we are again, looking completely baffled. After a short while trying to figure out what to do, we went through the Data Cartridge inventory again. As it turns out, through putting the library in “Pause”-Mode and restarting TSM multiple times, TSM apparently completely forgot that it had these tapes put into drives.

After manually moving the tapes back to their home slot via the management interface of the TS3500 and setting the volume access mode back to read-write, everything is fine now I could finish my pending exports!

Weird TS3500 problem

Well, today we had a rather weird problem with our TS3500. TSM running on AIX basically went bonko and spit out weird media sense errors, all stating that there is a hardware or media error of unknown nature:

After restarting the TSM server (as in the service, not the whole box) five times, which didn’t resolve squat we decided to take a look at the TS3500 itself. We opened up the Management interface and tried moving a tape into a drive. That didn’t work. Hrmmmmm.

We tried the manual move from the LCD display mounted on the front of the TS3500 base frame, that didn’t work either. So we figured the gripper was stuck and placed a call with our trustworthy support provider.

After a few minutes, they called us back and told us: “Try the following: Place the library in “Pause”-Mode and open it up, maybe a tape fell down …“.

We did exactly that, the gripper moved back to it’s pause position (which is in the base frame), and we started looking inside after opening up the base frame and an expansion frame. Nothing …

So we closed it back up, and let the base frame resume it’s normal duties … guess what: After resuming normal operations, it worked again *shrug*