Data Model

The 3-D data cubes that constitute the final MaNGA data products are constructed from a few tens to a few thousands of individual spectra that have been combined onto a regular grid. This page will explain the structure of the raw data (spectra dispersed onto individual CCDs), intermediate data processed through the 2-D (per-exposure) stage of the data reduction pipeline, and final data products processed through the 3-D (per plate) stage of the reduction pipeline. Additionally, we describe the preimaging data drawn from the NSA reprocessing of the original SDSS imaging survey; this preimaging data is used for the astrometric alignment of the MaNGA spectral imaging.

The default storage format for MaNGA data (images and cubes) is multi-extension FITS (gzipped to save space). The zeroth extension of such files is blank except for the global header. All extensions of such files are labelled with the EXTNAME header keyword so that they can be read by extension name instead of extension number.

Raw Data

The raw data (identical between MaNGA and eBOSS) is obtained for each of the four cameras individually (b1,b2,r1,r2) with files of the form:

sdR-[CAMERA]-[EXPNUM].fit.gz

(see details here)

Aside from some new keywords, the format is the same as for BOSS raw data. These keywords include:

  • Dither position (MGDPOS, values of N,S,E,C). C means no dither; N,S,E are the 3 allowed dither values for the MaNGA dither triangle. (For more details see the MaNGA Observing Strategy paper, Law et al. 2015).
  • Dither location (MGDRA, MGDDEC), i.e. offset of the pointing from the nominal center in arcsec of RA and DEC. These are the values actually used by the DRP in determining the astrometric solution.
  • PLATETYP='MANGA' or 'APOGEE-2&MANGA', as appropriate (i.e., whether the plate contains only MaNGA holes, or both MaNGA and APOGEE holes for co-observing)
  • SRVYMODE='MaNGA stare', 'MaNGA dither', or 'APOGEE-lead' as appropriate. Most galaxy plates are 'MaNGA dither', all-sky plates are 'MaNGA star', and bright-time stellar library plates are 'APOGEE-lead'.

2-D Reduction Pipeline Output

Each exposure through the MaNGA instrument is processed separately through the 2-D Data Reduction Pipeline (DRP) up to and including flux calibration and combination of spectra from individual cameras across the dichroic break.

The structure of these files is loosely based on the BOSS spFrame type files.
That is, they are row-stacked spectra (RSS), two dimensional arrays in which each row corresponds to an individual one-dimensional spectrum.
The following files can (and should) be read using the routine ml_mgframeread.pro.

Note that NFIBER=709 for spectrograph 1 (cameras b1 & r1), and 714 for spectrograph 2 (cameras b2 and r2). CCDROW refers to the number of rows on a given CCD after processing through sdssproc.pro to remove the overscan regions from the raw data; this differs between cameras.

List of files:

mgArc-[CAMERA]-[EXPNUM].fits.gz This is the extracted arc frame (see details here). It is almost exactly the same format as BOSS spArc files, with the exception of a blank extension 0 and extension names vs numbers.

mgFlat-[CAMERA]-[EXPNUM].fits.gz This is the extracted flatfield frame (see details here). It is almost exactly the same format as BOSS spFlat files, with the exception of a blank extension 0 and extension names vs numbers.

mgFrame-[CAMERA]-[EXPNUM].fits.gz The mgFrame files are the extracted fiber spectra for each camera for the science exposures (see details here).

mgSFrame-[CAMERA]-[EXPNUM].fits.gz The mgSFrame files are the per-camera science frames after extraction and sky-subtraction has been applied (see details here). Note that the 'S' in the name means Sky-Subtracted.

fluxcal-[SPEC]-[EXPNUM].fits.gz The fluxcal files store intermediate products derived during the flux calibration procedure. It is written by mdrp_fluxcal.pro for each spectrograph and each exposure.

mgFFrame-[CAMERA]-[EXPNUM].fits.gzThe mgFFrame files are the per-camera science frame after extraction, sky subtraction, and flux calibration (see details here). Note that the 'F' in the name means Flux-calibrated.

mgCFrame-[EXPNUM]-LOG.fits.gz The mgCFrame files are per-exposure combinations of the mgFFrame files by stitching the blue and red channels together across the dichroic break and adding rows for fibers from spectrograph 2 atop those from spectrograph 1 (i.e., in order of fiberid). All spectra in this file have been resampled to a common wavelength grid for ALL MaNGA observations using a cubic b-spline. See details here. The MaNGA common wavelength solution is set by ml_setwcalib.pro. There are two versions of this file; the first uses a logarithmic wavelength sampling from log10(lambda/Angstroms)=3.5589 to 4.0151 (NWAVE=4563 spectral elements). Note that 'C' in the name means Calibrated and Camera Combined on a Common wavelength grid).

mgCFrame-[EXPNUM]-LIN.fits.gz As for the LOG format file, but on a linear wavelength solution running from 3622.0 to 10353.0 Angstroms (NWAVE=6732 spectral elements).

3-D Reduction Pipeline Output

Once a given plate is complete (ie. all of the required sets of dithers are collected), the 3D stage of the DRP extracts the relevant rows for each IFU from the mgCFrame files, computes the astrometric solution of each, and combines the exposures into row-stacked spectra and data cubes.

These output files follow the naming convention [plate]-[ifudesign], which uniquely identifies a given galaxy observation. Note, however, that if a galaxy were to be reobserved on a different plate it would have a different 'plateifu' identifier. 'plateifu' thus uniquely identifies a set of observations, while 'mangaid' uniquely identifies an astronomical target.

In this section NFIBER is taken to mean the number of fibers in a given IFU (e.g., 19, 37, 61, 91 or 127), and NEXP is the number of exposures.

There are two kinds of output files: row-stacked spectra (RSS) that contain each individual spectrum stacked atop each other into into a 2d format, and a data cube that combines individual spectra together into a rectified 3d data cube. Note that RSS files and cubes are provided for all MaNGA dark time targets, and thus include both galaxies and spectrophotometric standard stars.

HDUCLASS

MaNGA adopts the HDUCLASS FITS header extension keyword structure (see https://heasarc.gsfc.nasa.gov/docs/heasarc/ofwg/docs/ofwg_recomm/r8.html) to indicate the type of information contained in the science, error, and mask (i.e., data quality, or DQ) extensions (and the relationship between those extensions). We define HDUCLASS SDSS.

The science extension has HDUCLAS1=IMAGE (for RSS files) or CUBE (for data cubes) and HDUCLAS2=DATA. The ERRDATA and QUALDATA keywords in this extension header point to the error and DQ extensions.

The error extension has HDUCLAS1=IMAGE or CUBE and HDUCLAS2=ERROR. Valid HDUCLAS3 entries are MSE, RMSE, INVMSE, INVRMSE. MaNGA uses INVMSE (i.e., we provide inverse variance). The SCIDATA and QUALDATA keywords in this header point to the science and DQ extensions.

The DQ extension has HDUCLAS1=IMAGE or CUBE and HDUCLAS2=QUALITY. MaNGA uses HDUCLAS3=FLAG64BIT, indicating that the DQ extension should be interpreted as a bitmask with up to 64 independent bits available. The SCIDATA and ERRDATA keywords in this header point to the science and error extensions.

In all cases, the dimensionality of the ERROR and QUALITY extensions matches that of the DATA extension.

RSS Files

Identifier:

manga-[PLATE]-[IFUDESIGN]-LOGRSS.fits.gz or manga-[PLATE]-[IFUDESIGN]-LINRSS.fits.gz

These are the row-stacked, flux-calibrated fiber spectra for a given galaxy across all exposures. The "LOGRSS" file has logarithmic wavelength sampling from log10(lambda/Angstroms)=3.5589 to 4.0151 (NWAVE=4563 spectral elements). The "LINRSS" file has linear wavelength sampling from 3622.0 to 10353.0 Angstroms (NWAVE=6732 spectral elements). Both files contain one row for each fiber, for a total of NFIBER*NEXP rows.

The detailed data models for the LOGRSS and LINRSS files can be found here. In brief, they contain extensions for the flux (in units of 10-17 erg/s/cm2/Angstrom/fiber), the inverse variance, the pixel mask, the spectral line spread function for each fiber, the wavelength vector, the median spectral resolution as a function of wavelength for the fibers in this IFU, the standard deviation of spectral resolution as a function of wavelength for the fibers in this IFU, a binary table describing the individual exposures that make up the file, and arrays of the effective X and Y positions in arcsec of each fiber (as a function of wavelength) relative to the IFU center.

Cube Files

Identifier:

manga-[PLATE]-[IFUDESIGN]-LOGCUBE.fits.gz or manga-[PLATE]-[IFUDESIGN]-LINCUBE.fits.gz

These are the final 3d data cubes for a given galaxy that combine all fiber spectra across all exposures as described here.

The LOGCUBE data cube has logarithmic wavelength sampling from log10(lambda/Angstroms)=3.5589 to 4.0151 (NWAVE=4563 spectral elements), and 0.5 arcsec spatial pixels (spaxels) for a total size of NX x NY x NWAVE pixels; LINCUBE is the same, except it has linear wavelength sampling from from 3622.0 to 10353.0 Angstroms (NWAVE=6732 spectral elements).

The detailed data models for the LOGCUBE and LINCUBE files can be found here). Similar to the RSS files, they contain extensions for the flux (now in units of 10-17 erg/s/cm2/Angstrom/spaxel), inverse variance, pixel mask, wavelength vector, median spectral resolution and standard deviation thereof, and a binary table describing the individual exposures that make up the file. Additionally, they also included reconstructed broadband 'griz' images created from the spectral data cube, and estimates of the reconstructed point source profiles in each of the 'griz' bands.

SDSS Legacy Imaging

Identifier:

preimage-[mangaid].fits.gz

MaNGA requires that broadband imaging data for target galaxies be available for the DRP to use to register astrometric solutions against and to estimate achieved S/N depth. We use the NASA Sloan Atlas (NSA) re-reduction of the SDSS imaging data, and we provide FITS postage stamps of this re-reduction for each MaNGA galaxy released as part of DR13.

The detailed data model for the preimaging data can be found here. In brief, they are multi-extensions FITS files giving the flux, inverse variance, and psf in each of the four 'griz' bands.