next up previous contents
Next: This document Up: INTRODUCTION Previous: Background   Contents

The need for science archives.

The question naturally arises as to how science exploitation of such large datasets will be undertaken. Data volumes will simply be too large for users to download and keep their own copies. Raw data processing is likely to be complicated, while calibration procedures will evolve as cameras are better characterised and more calibration data are obtained. Reprocessing of substantial amounts of pixel data may be necessary in the light of improved algorithms or for specific `non-standard' science goals. Once data are reduced using standardised pipeline procedures, the establishment of a centralised `science archive' offers the greatest potential for full science exploitation (see the paper presented by Lawrence et al. at the 2002 SPIE meeting in Kona, Hawaii; available online at http://www.roe.ac.uk/~nch/wfcam/misc). Again, calibration procedures can be more easily developed and applied in a controlled manner to data in a central repository - it makes sense to solve data-specific reduction and calibration problems once, yielding an optimal solution. Early community access to well calibrated data will facilitate timely science exploitation. A well constructed science archive will enhance greatly the scope of research that can be done with the survey data; in fact, many science applications will only be feasible via a sophisticated science archive. For example, much of the science that will be done with the UKIDSS LAS will rely on complementary data from the SDSS and other non-IR wavelength surveys. Given the volume of all of these datasets, some thought needs to go into the design of the archive to enable full exploitation.

In outline, what should the science archive provide? As an absolute bare minimum, a simple web interface to pixels and object catalogues along the lines of WFAU's SuperCOSMOS Sky Survey (SSS; see http://www-wfau.roe.ac.uk/sss) provides community access to the data. However, full science exploitation requires much more. A well-designed archive should:

Additionally, we expect to follow new developments in web server protocols (e.g. VOTable - see http://vizier.u-strasbg.fr/doc/VOTable - a proposed protocol for exchange of astronomical data embedded in XML). Clearly, such functionality adds considerable value to the survey data by creating the potential for advanced science exploitation. Furthermore, in the era of `e-science' and `virtual observatories' we envision yet more advanced archive functionality:

In short, the WFCAM/UKIDSS science programme comprises surveys of unprecedented scope with data volumes that are hugely increased over previous experience in the UK. Moreover, user expectation as to the kinds of science that should be possible with these surveys is growing. Archive developments for this major new project will benefit all UK astronomers over the next few decades and will continue the traditional leading role of the UK in survey astronomy which underpins much of the research done here. The science archive is so much more than a simple repository of data - it is the means by which users will exploit the huge WFCAM data volumes.


next up previous contents
Next: This document Up: INTRODUCTION Previous: Background   Contents
Nigel Hambly 2002-10-08