To provide a backup for the main on-line science archive at ROE, and to facilitate several of the advanced processing and reprocessing stages needed, we propose to store the science output and master calibration data of the basic pipeline on-line in Cambridge. Costs of suitably sized RAID arrays (e.g. initially 10 Tbytes capacity) are certain to fall sufficiently by 2004 to make this a cost-effective and viable solution.
With on-line storage in place at two sites we do not foresee the need for an
(expensive) near-line ``tape'' storage system, provided that a high density
transfer medium is used from Hawaii, such that the 100 Gbytes of data
expected per night are stored on a single tape. With this one-to-one
correspondence between nights and tapes, reprocessing from the raw data
would be feasible if required, even with the tapes held in on-the-shelf
storage.
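
As a rough check on these figures, assume decimal units (1 Tbyte = 1000 Gbytes) and that the volume held on-line per night is comparable to the 100 Gbytes of raw data; the latter is an assumption made here for illustration, since the size of the processed products is not stated. With $N_{\rm nights}$ (a symbol introduced only for this estimate) denoting the number of nights that fit on the initial array:
\[
N_{\rm nights} \simeq \frac{10\ {\rm Tbytes}}{100\ {\rm Gbytes\ per\ night}}
             = \frac{10\,000\ {\rm Gbytes}}{100\ {\rm Gbytes\ per\ night}} = 100 .
\]
Under these assumptions the initial capacity corresponds to roughly 100 nights of data, which is consistent with describing 10 Tbytes as an initial figure only.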
We expect to have sufficient computing power in place, as part of the Cambridge Data Processing Centre activities, to handle the data flow from WFCAM. Our current pipeline processing activities will provide the basis for the development of a UKIRT WFCAM data pipeline. We intend to appoint a dedicated UKIRT WFCAM data processing manager, who will be responsible for overseeing the operation of the UKIRT pipeline and for organising appropriate software effort as necessary.