NIRSPEC Reliability Improvement 
Progress report:  12  Mar  2004

Overview:

During this report period, significant progress has been made on a number of fronts.

Ia.  Speeding recovery from server crashes:

The recovery script was finished about six weeks ago.      Time lost to weather has been high for some time now.   We may still be in the regime of small number statistics in terms of evaluating our success.     Compared to last summer, when we first started working on (what became) this task, time lost has decreased by about a factor of two or three.    Since the last meeting, the recovery script has been run three times at night.     On the first two occasions, the script correctly diagnosed the problem as not being a server crash and took the appropriate action.    No time on sky was lost as a result.    On the third occasion, the script failed because the iBoot device addressing the computer room black box seemed to be off-line.

IIa.  Upgrade instrument host:

This task (as befitting a background task) has not seen much progress.      Work has started though on replacing explicit references to waimea with an environment variable that will allow future hot swaps.

IIb.  Correlation research:

This task continues as a pure background task.

IIc.  Crash free periods:

It appears we have exhausted this avenue of research and we now consider this task complete.

IId.  Characterize communications chain:

This task is now finished.      No glaring deficiency was found in the communications chain hardware which would account for the frequency of server crashes witnessed over the past year.

IIe.  Examine power supplies:

Prompted by NIRSPEC's sensitivity to power glitches, this task got underway during this report period.

IIIa.  Reduce communications traffic:

This task accounted for most of our effort during this report period.     Going into our most recent engineering night,  the new keyword server and rotator server codes worked in stationary mode but not PA mode.

Issues and Concerns: