SDSS Data Products

1. Introduction

This CD-ROM contains a small sample of the first imaging data from the Sloan Digital Sky Survey. We emphasize that these are engineering data; they were observed and reduced before the SDSS instrumentation and software had reached their final level of performance. Nevertheless, the data are good enough to support some preliminary science. This sampler both shows you a sample of the images and tables and presents the data in a form close to that in which the SDSS data products will be released.

This writeup describes:

Imaging operations with the SDSS;
Data processing through the imaging software pipelines, including calibrations;
Full frame data outputs from the Photometric Pipeline (Photo) which are included in directory data on this CD;
Tabular data outputs from the Photometric Pipeline (Photo) which are also included in directory data on this CD;
References.
There are three additional documents that accompany this writeup:

Source code is provided in the src directory on this CD-ROM to aid in reading the atlas and mask data files. Instructions for compiling and running the code are given in the file readme in that directory. The data directory hierarchy is organized by run number and camcol number. All outputs on this CD-ROM are in the data/77/5 subdirectory.



2. The Sloan Digital Sky Survey: Imaging

The goal of the SDSS is to image 1/4 of the sky, roughly the area of the North Galactic Cap, in five bands spanning the optical wavelength range to a depth (in the sensitive bands) of ~23m. Spectra of objects selected from the imaging data will be taken; the SDSS strategy is to take spectra of a magnitude limited sample of some 1,000,000 galaxies and 100,000 quasars, as well as calibration objects and other targets of opportunity.

The SDSS imaging camera is a mosaic of 54 CCD detectors in the focal plane, 30 large devices arranged in 6 columns of 5 each and 24 smaller devices around the periphery. The SDSS imaging data are taken in time-delay-and-integrate (TDI) mode at the sidereal rate almost simultaneously in five bands: the sky tracks through 5 CCD detectors in succession, each located behind a different filter, as shown below:

Camera scanning the sky

The filters and their effective wavelengths are: u' (3540 Å), g' (4760 Å), r' (6280 Å), i' (7690 Å) and z' (9130 Å). The imaging CCDs are 2048 x 2048 pixel devices which image an area of about 13' x 13' on the sky with 0.4'' pixels. The filter responses and SDSS photometric calibration are described by Fukugita et al. (1996) and in the SDSS Project "Black Book" (1996). The camera is described in detail by Gunn et al. (1998) and the entire SDSS is described in several places, including the Black Book, Gunn and Weinberg (1995) and Knapp (1997).

Because it operates in TDI mode, the camera must scan great circles. The first data, including those on this CD-ROM, were taken by drift-scanning at the celestial equator. These data were taken when the telescope collimation was not yet finalized, and, due to residual astigmatism, the images sizes range from 1.2'' to 1.5'' full width at half maximum (FWHM). The data also contain some horizontal streaking that is an artifact of a problem in the data acquisition readout electronics, which has subsequently been fixed. During more recent observing runs, image sizes < 1'' FWHM have been achieved.



3. Data Reduction

Since they are taken in TDI mode, the data are in the form of a continuous tapestry (in this case along declination zero). The data are reduced automatically by a set of interoperating pipelines, which correct the data for defects, calculate and apply astrometric and photometric calibrations, calculate sky, apply bias and flat field vectors and find, measure and catalogue objects. Since the last step is by far the most time consuming, the data stream is cut in a series of fields, each of which is processed independently.

Some SDSS nomenclature will be useful:

A scanline is the data from a single set of CCDs which sweep the same area of sky. Each set of 5 CCDs is housed in a single dewar: there are 6 sets of CCDs separated by about 80% of the CCD width. The area of sky swept by the 6 CCD columns is called a strip. A given area of sky is imaged by performing two successive scans, offset by almost a CCD width, to fill in a stripe. The scanlines in a stripe overlap about 10% on each side with the adjacent scanlines.

The data stream from a single CCD in a scanline is cut into a series of frames which measure 2048 x 1489 pixels and overlap 10% with the adjacent frame. The frames in the 5 filters for the same part of the sky are called a field.

A run is the set of data collected from one continuous pass of the 2.5 m telescope across the sky, and thus covers one strip. Typically, a run lasts for a few hours. Each scanline is labelled by the camcol number (1-6, increasing from S to N) of the camera column of CCDs covering that scanline.

Astrometric calibrations are performed by the astrometric pipeline (Astrom). The astrometric solutions for the frames on this CD-ROM were obtained by comparing the pixel centroids of stars on the frames with the catalog Right Ascension and Declination of matching stars found in Stone's (1997) equatorial astrometric catalog. The matching stars' centroids and coordinates were then fit by a low-order least squares polynomial. The residuals from this fit were used to generate spline functions which were then applied to the polynomial fit to reduce the residuals. The astrometric accuracy of these fits is about 0.1 arcsec (rms, per coordinate).

The photometric calibrations are produced by the monitor telescope pipeline (MT). The calibration procedures, absolute standards and color conversions to the SDSS filters are described by Fukugita et al. (1996). A network of about 100 primary standard stars is being set up using a 20" Monitor Telescope located at APO and a 40" telescope at the US Naval Observatory station in Flagstaff, AZ. The zero points are tied to stars with good absolute spectrophotometric calibrations. Because the primary stars are too bright to be observed by the 2.5 m telescope, secondary sky patches that overlap the 2.5 m scans are observed with the Monitor Telescope. These secondary patches are used to tie the MT photometric system to the 2.5 m observations.

The imaging data reduction is performed by the photometric pipeline (Photo). Photo consists of three successive pipelines. The last of these, the Frames pipeline, operates on an individual set of five frames covering a field. It corrects the frames for artifacts (bad columns, cosmic rays, bleed trails); bias-subtracts and flattens them; finds objects (as linked pixels above the sky) in two stages (bright and faint), as well searching for lower surface brightness objects in a binned image; combines the objects in the five filters; measures the objects (centroid, size, shape, counts); measures the radial profiles; extracts the PSF filtered counts; fits to three simple models (PSF, de Vaucouleurs profile, exponential disk); classifies the objects; deblends compound objects; cuts atlas images (including both parent and child images for blended objects) and writes out a series of outputs: tables of measured parameters; corrected frames; 4x4 pixel binned sky frames (with objects subtracted); atlas images; mask frames (location of saturated pixels, interpolated pixels etc); and summary statistics for each frame.

To perform these calculations, Frames needs to know the flat field, bias, sky, and PSF. These are calculated for the entire run by the Postage Stamp Pipeline, (PSP), which calculates these quantities for the whole run and interpolates them to the center of each frame. The PSP uses cutout images of bright (but unsaturated) stars, at present those used by the ASTROM pipeline; rejects bad stars (double etc) and calculates the parameters of a simple PSF fit. The cutouts are made by the Serial Stamp Collecting Pipeline (SSC) which also aligns the frames in a field.

Several caveats need to be kept in mind when interpreting the data:

  1. The point spread function (PSF) varies by a small (5%) but significant amount across a single CCD due to optics and within a single field due seeing fluctuations that occur on a time scale faster than the 54 seconds of observation time used for each frame. Corrections for these effects will be incorporated into Photo in the future, butare NOT in the reduction used for the data on this CD.
  2. Some other imaging effects (diffraction spikes and ghosts) are also not yet treated.
  3. The object deblender is known to need further refinement.
  4. Star/galaxy separation is still crude.
  5. Most of the outputs of Photo have had limited sanity checks of their correctness.
  6. The SDSS primary standard star system is not yet finalized, and the current calibrations are good to about 3-5% in g'r'i'z' and 5 - 8% in u'. To indicate the preliminary character of the photometric data, magnitudes are labelled with the designations "u*, g*, r*, i*, z*". In the data files themselves, filters are generally designated by a single letter without the trailing prime or asterisk.

More detailed information on data processing is given in chapter DATASYS of the Black Book. A major technical paper describing the photometric pipelines and their outputs is in preparation by Lupton et al. (1998).



4. Contents of the Directory "data": Images

The data directory contains a set of outputs for 10 fields from one scanline for one run in the following area (J2000):

RA = 16h 07m 36s to 16h 13m 12s
DEC = +00o 33' 00" to +00o 46' 12"

These data were collected during the night of June 27, 1998. In our nomenclature, these data are for run 77, camcol 5, fields 311-320.

The data directory hierarchy is organized by run number and camcol number. All outputs on this CD-ROM are thus in the data/77/5 subdirectory.

The imaging outputs are of the following types:




5. Contents of the Directory "data": Tables

The output tables come in four flavors:

Code to read FITS images and FITS binary tables can be obtained in C or FORTRAN from NASA:

http://legacy.gsfc.nasa.gov/fitsio

See also the FITS primer at this server.

Code for FV, a FITS binary table browser, and other FITS tools can be found at:

http://legacy.gsfc.nasa.gov/ftools



6. References

Fukugita, M., Ichikawa, T., Gunn, J.E., Doi, M., Shimasaku, K., & Schneider, D.P. 1996, AJ, 111, 1748.

Gunn, J.E., Carr, M.A., Rockosi, C.M., Sekiguchi, M., et al. 1998, AJ (in press, December 1998).

Gunn, J.E., & Weinberg, D.H. 1995, in ``Wide Field Spectroscopy and the Distant Universe'', ed. S.J. Maddox and A. Aragon-Salamanca (Singapore: World Scientific), p 3.

Knapp, G.R. 1997, Sky and Telescope, August, p 40.

Lupton, R.H., Ivezic, Z., Gunn, J.E., et al. 1998, in preparation.

Oke, J.B., & Gunn, J.E. 1983, ApJ, 266, 713.

SDSS Collaboration, 1996 ``The SDSS Black Book'' Black Book. See also http://www.sdss.org.

Stone, R.C. 1997, AJ, 114, 2811.