9. An EPIC Data Processing and Analysis Primer (Timing Mode, GUI)

So, you've received an XMM-Newton EPIC data set. What are you going to do with it? After checking what the observation consists of (see § 3.2), you should note when the observation was taken. If it is a recent observation, it was likely processed with the most recent calibrations and SAS, and you can immediately start to analyze the Pipeline Processed data. However, if it is more than a year old, it was probably processed with older versions of CCF and SAS prior to archiving, and the pipeline should be rerun to generate event files with the latest calibrations.

As noted in Chapter 4, a variety of analysis packages can be used for the following steps. However, as the SAS was designed for the basic reduction and analysis of XMM-Newton data (extraction of spatial, spectral, and temporal data), it will be used here for demonstration purposes. SAS will be required at any rate for the production of detector response files (RMFs and ARFs) and other observatory-specific requirements, although for the simple case of on-axis point sources the canned response files provided by the SOC can be used.

It is strongly recommended that you keep all reprocessed data in its own directory! SAS writes output files to whichever directory you are in when a task is called. Throughout this primer, it is assumed that the Pipeline Processed data are in the PPS directory, the ODF data (with upper case file names, and uncompressed) are in the directory ODF, the analysis is taking place in the PROC directory, and the CCF data are in the directory CCF.

If your data are recent, you need only to gunzip the files and prepare the data for processing (see §5). Feel free to skip the section on repipelining and proceed to the later discussions. In any case, for simplicity, it is recommended that you change the name of the unzipped event file to something easy to type. For example, a PN event list:

: cp PPS/PiiiiiijjkkPNSlllTIEVLI0000.FTZ PROC/pn.fits

where

: iiiiiijjkk - observation number
lll - exposure number within the observation

Various analysis procedures are demonstrated using the Cen X-3 dataset, ObsID 0400550201. The following procedures are applicable to all XMM-Newton datasets, so it is not required that you use this particular dataset; any observation should be sufficient.

For detailed descriptions of PP data nomenclature, file contents, and which tasks can be used to view them, see Tables 3.2 and 3.3. For detailed descriptions of ODF data nomenclature and file contents, see Table 3.1.

9.1 Rerun the Pipeline

We assume that the data was prepared and environment variables were set according to §5, the GUI has been invoked (see §5.3), and we are in our working directory, ``PROC''.

From the upper window of the GUI, select epproc. Epproc will automatically detect the science mode of the data, so just click "Run". (It is safe to ignore the warnings.)

By default, none of the intermediate files that are generated are kept. Epproc designates its output event file with ``*TimingEvts.ds''. You may want to name the new file something easy to type:

: mv 1206_0403530301_EPN_S003_TimingEvts.ds pn.fits

9.2 Create and Display an Image

Many popular data products, such as images and light curves, are made with the task xmmselect. The following sections detail how to make them. For an introduction to xmmselect and a discussion on how to load an event file in it, please see §7.2.

Once the event file has been loaded,

1): Check the square boxes to the left of the ``RAWX'' and ``RAWY'' entries.
2): Click the ``Image'' button near the bottom of the page. This brings up the evselect GUI (see Figure 7.2).
3): In the ``General'' tab, in the imageset box, enter the name of the output file, in this case, image.fits.
4): In the ``Image'' tab, toggle Binning to binSize, and confirm that ximagebinsize and yimagebinsize are set to 1.
5): Click the ``Run'' button on the lower left corner of the evselect GUI.

The output file image.fits can be viewed by using a standard FITS display such as ds9 (see Fig. 9.1).
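The same image can also be made without the GUI. The following is a minimal sketch of the equivalent evselect call, assuming the event file is named pn.fits as above and that SAS has been initialized in the current shell. The variable name cmd is only for illustration. If evselect is not on the path, the command is printed rather than run.

```shell
# Command-line sketch of the GUI steps above:
# a RAWX vs. RAWY image with 1-pixel bins.
cmd="evselect table=pn.fits withimageset=yes imageset=image.fits \
xcolumn=RAWX ycolumn=RAWY imagebinning=binSize ximagebinsize=1 yimagebinsize=1"

# Run only if SAS is initialized; otherwise just show the command.
if command -v evselect >/dev/null 2>&1; then
  eval "$cmd"
else
  echo "evselect not found; would run: $cmd"
fi
```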

**Figure 9.1:** An image of PN data taken in Timing mode, displayed in *ds9*.
$\includegraphics[scale=0.5]{CenX-3_0400550201_timing_image_ds9.eps}$

9.3 Applying Standard Filters to the Data

The filtering expression for PN in Timing Mode is:

: (PATTERN <= 4)&&(PI in [200:15000])&&#XMMEA_EP

The first expression will select good events with PATTERN between 0 and 4. The PATTERN value is similar to the GRADE selection for ASCA data, and is related to the number and pattern of the CCD pixels triggered for a given event. The PATTERN assignments are: single pixel events: PATTERN == 0, double pixel events: PATTERN in [1:4], triple and quadruple events: PATTERN in [5:12].

The second keyword in the expression, PI, selects the preferred pulse height of the event. Since we are working with PN data, this should be between 200 and 15000 eV. This should clean up the image significantly; most of the remaining obvious contamination is due to low pulse height events. Setting the lower PI channel limit somewhat higher (e.g., to 300 eV) will eliminate much of the rest.

Finally, the #XMMEA_EP filter provides a canned screening set of FLAG values for the event. (The FLAG value provides a bit encoding of various event conditions, e.g., near hot pixels or outside of the field of view.) Setting FLAG == 0 in the selection expression provides the most conservative screening criteria and should always be used when serious spectral analysis is to be done on PN data.

To filter the data using xmmselect,

1): Enter the filtering criteria in the ``Selection Expression'' area at the top of the xmmselect window: (PATTERN <= 4)&&(PI in [200:15000])&&#XMMEA_EP
2): Click on the ``Filtered Table'' box at the lower left of the xmmselect GUI.
3): Change the evselect filteredset parameter, the output file name, to something useful, e.g., pn_filt.fits
4): Click ``Run''.
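For reference, the filtering can also be done from the command line. This is a sketch under the assumption that the unfiltered event file is pn.fits and SAS is initialized; the variable names expr and cmd are illustrative, and the exposure-related switches are optional here.

```shell
# Standard PN Timing mode filter, as given in the text.
expr='(PATTERN <= 4)&&(PI in [200:15000])&&#XMMEA_EP'

# Single quotes around the expression keep the shell from
# interpreting &&, <, and # when the command is run.
cmd="evselect table=pn.fits withfilteredset=yes filteredset=pn_filt.fits \
filtertype=expression expression='$expr' keepfilteroutput=yes \
updateexposure=yes filterexposure=yes"

# Run only if SAS is initialized; otherwise just show the command.
if command -v evselect >/dev/null 2>&1; then
  eval "$cmd"
else
  echo "evselect not found; would run: $cmd"
fi
```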

9.4 Create and Display a Light Curve

Sometimes, it is necessary to use filters on time in addition to those mentioned above. This is because of soft proton background flaring, which can have count rates of 100 counts/sec or higher across the entire bandpass. To determine if our observation is affected by background flaring, we can make a light curve with xmmselect. For the time binning, we will set it to something reasonable (usually between 10 and 100 s).

Load the filtered event file, and then

1): Check the round box to the left of the ``Time'' entry.
2): Click on the ``OGIP Rate Curve'' button near the bottom of the page. This brings up the evselect GUI (see Figure 7.2).
3): Click on the ``Lightcurve'' tab and change the ``timebinsize'' to a reasonable amount, e.g., 50. In the ``rateset'' textbox, enter the name of the output file, pn_ltcrv.fits.
4): Click on the ``Run'' button at the lower left corner of the evselect GUI.
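The steps above correspond roughly to a single evselect call. The sketch below assumes the filtered event file pn_filt.fits and a 50 s bin; the variable names bin and cmd are illustrative, and the command is only printed if SAS is not initialized.

```shell
bin=50   # time bin size in seconds

# Light curve of the filtered events, binned on the TIME column,
# with a RATE column written to the output.
cmd="evselect table=pn_filt.fits withrateset=yes rateset=pn_ltcrv.fits \
maketimecolumn=yes timecolumn=TIME timebinsize=$bin makeratecolumn=yes"

if command -v evselect >/dev/null 2>&1; then
  eval "$cmd"
else
  echo "evselect not found; would run: $cmd"
fi
```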

The output file pn_ltcrv.fits can be viewed by using fv:

: fv pn_ltcrv.fits &

In the fv pop-up window, the RATE extension will be available in the second row (index 1, as numbering begins with 0). Select ``PLOT'' from this row, and select the column name and axis on which to plot it. The light curve is shown in Fig. 9.2. No flares are evident, so we will continue to the next section. However, if a dataset does contain flares, they should be removed in the same way as shown for EPIC Imaging mode data in §7.6.

**Figure 9.2:** The light curve of our example PN Timing mode data, displayed in fv.
$\includegraphics[scale=0.5]{CenX-3_0400550201_ltcrv_fv.eps}$

9.5 Extract the Source and Background Spectra

The first step in extracting a spectrum from PN Timing data is to make an image of the event file over the energy range we are interested in; for this example, we'll say 0.5-15 keV. And since this is the PN, we need to remember to set (FLAG==0) to get a high-quality spectrum. Thus, the filtering expression would be set to (FLAG==0) && (PI in [500:15000]). So, with pn_filt.fits loaded in xmmselect,

1): Enter the filtering criteria in the "Selection Expression" area at the top of the xmmselect window: (FLAG == 0)&&(PI in [500:15000])
2): Make an image by checking the square boxes to the left of the "RAWX" and "RAWY" entries to indicate which is on the X and Y axis. Click the "Image" button near the bottom of the page. This brings up the evselect GUI.
3): In the "Image" tab, enter the name of the output file in the imageset box; we will use pn_image.fits. Toggle Binning to binSize, and confirm that ximagebinsize and yimagebinsize are set to 1.
4): Click "Run". The image will be displayed automatically in a ds9 window.

As can be seen in Figure 9.3 (top), the source is centered on RAWX=37. We will extract this and the 10 pixels on either side of it.

1): Enter the filtering criteria in the ``Selection Expression'' area at the top of the xmmselect window: (FLAG==0) && (PI in [500:15000]) && (RAWX in [27:47]).
2): Click the round button next to the PI column on the xmmselect GUI.
3): Click on ``OGIP Spectrum''.
4): In the ``General'' tab, check keepfilteroutput and withfilteredset. In the filteredset box, enter the name of the event file output, in this case, pn_filt_source_WithBore.fits.
5): In the ``Spectrum'' tab, set the file name and binning parameters for the spectrum. Confirm that withspectrumset is checked. Set spectrumset to the desired output name, source_pi_WithBore.fits. Confirm that withspecranges is checked. Set specchannelmin to 0 and specchannelmax to 20479.
6): Click ``Run''.

For the background, the extraction area should be as far from the source as possible. However, sources with 200 ct/s (like our example!) are so bright that they dominate the entire CCD area, and there is no source-free region from which to extract a background. (It goes without saying that this is highly energy-dependent.) In such a case, it may be best not to subtract a background. Users are referred to Ng et al. (2010, A&A, 522, 96) for an in-depth discussion. While this observation is too bright to have a good background extraction region, the process is shown below nonetheless for the sake of demonstration:

1): Enter the filtering criteria in the ``Selection Expression'' area at the top of the xmmselect window: (FLAG==0) && (PI in [500:15000]) && (RAWX in [3:5]).
2): Click the round button next to the PI column on the xmmselect GUI.
3): Click on ``OGIP Spectrum''.
4): In the ``General'' tab, check keepfilteroutput and withfilteredset. In the filteredset box, enter the name of the event file output, in this case, pn_filt_bkg.fits.
5): In the ``Spectrum'' tab, set the file name and binning parameters for the spectrum. Confirm that withspectrumset is checked. Set spectrumset to the desired output name, in this case, bkg_pi.fits. Confirm that withspecranges is checked. Set specchannelmin to 0 and specchannelmax to 20479.
6): Click ``Run''.
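The source and background extractions differ only in the RAWX range and the output names, so they can be sketched with one small helper. The function spec_cmd and the variable names are hypothetical, and spectralbinsize=5 is an assumed binning choice that the GUI steps above leave open. The commands are run only if SAS is initialized; otherwise they are printed.

```shell
# Hypothetical helper: print the evselect call that extracts a spectrum and
# the matching filtered event file from one range of RAWX columns.
# Usage: spec_cmd LO HI EVENTS_OUT SPECTRUM_OUT
spec_cmd() {
  sel="(FLAG==0) && (PI in [500:15000]) && (RAWX in [$1:$2])"
  printf "evselect table=pn_filt.fits energycolumn=PI expression='%s' " "$sel"
  printf "withfilteredset=yes filteredset=%s keepfilteroutput=yes " "$3"
  printf "withspectrumset=yes spectrumset=%s spectralbinsize=5 " "$4"
  printf "withspecranges=yes specchannelmin=0 specchannelmax=20479\n"
}

center=37   # RAWX column of the source, read off the image
half=10     # columns kept on either side of it

src_cmd=$(spec_cmd $((center - half)) $((center + half)) \
          pn_filt_source_WithBore.fits source_pi_WithBore.fits)
bkg_cmd=$(spec_cmd 3 5 pn_filt_bkg.fits bkg_pi.fits)

for c in "$src_cmd" "$bkg_cmd"; do
  if command -v evselect >/dev/null 2>&1; then
    eval "$c"
  else
    echo "evselect not found; would run: $c"
  fi
done
```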

**Figure 9.3:** TOP: The Cen X-3 Timing Mode image, from 0.5-15 keV. The green line is a projection cut. BOTTOM: The average counts in the cut across the CCD.
$\includegraphics[scale=0.5]{CenX-3_0400550201_image_with_cut.eps}$ $\includegraphics[scale=0.5]{CenX-3_0400550201_projection.eps}$

9.6 Check for Pile Up

Depending on how bright the source is and what modes the EPIC detectors are in, event pile up may be a problem. Pile up occurs when a source is so bright that incoming X-rays strike two neighboring pixels or the same pixel in the CCD more than once in a read-out cycle. In such cases the energies of the two events are in effect added together to form one event. If this happens sufficiently often, 1) the spectrum will appear to be harder than it actually is, and 2) the count rate will be underestimated, since multiple events will be undercounted. Briefly, we deal with it in PN Timing data in essentially the same way as in Imaging data: by using only single pixel events and/or removing the regions with very high count rates, checking the amount of pile up, and repeating until it is no longer a problem. We recommend always checking for it.

Note that this procedure requires as input the event files created when the spectrum was made, not the usual time-filtered event file.

To check for pile up,

1): Invoke epatplot in the SAS GUI.
2): In the ``0'' tab, in the set text area, enter the name of the event file that was made when the spectrum was extracted, pn_filt_source_WithBore.fits. If you want to change the output file name to something other than the default, click the useplotfile box and enter the name in the plotfile text area. For this example, we will use pn_epat.ps.
3): In the ``1'' tab, set withbackgroundset to yes. In the backgroundset text area, enter the name of the background event file output when the background spectrum was extracted, pn_filt_bkg.fits.
4): Click ``Run''.
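A command-line sketch of the same check follows. It assumes the source event file written in §9.5 (pn_filt_source_WithBore.fits) and the background event file pn_filt_bkg.fits; the variable name cmd is illustrative, and the command is only printed if SAS is not initialized.

```shell
# Pattern-distribution plot for the source events, using the
# background event file from the background extraction.
cmd="epatplot set=pn_filt_source_WithBore.fits useplotfile=yes plotfile=pn_epat.ps \
withbackgroundset=yes backgroundset=pn_filt_bkg.fits"

if command -v epatplot >/dev/null 2>&1; then
  eval "$cmd"
else
  echo "epatplot not found; would run: $cmd"
fi
```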

The output of epatplot is a postscript file, pn_epat.ps, which may be viewed with viewers such as gv, containing two graphs describing the distribution of counts as a function of PI channel; see Figure 9.4.

A few words about interpreting the plots are in order. The top is the distribution of counts versus PI channel for each pattern class (single, double, triple, quadruple), and the bottom is the expected pattern distribution (smooth lines) plotted over the observed distribution (histogram). The lower plot shows the model distributions for single and double events and the observed distributions. It also gives the ratio of observed-to-modeled events with 1$\sigma$ uncertainties for single and double pattern events over a given energy range. (The default is 0.5-2.0 keV; this can be changed with the pileupnumberenergyrange parameter.) If the data are not piled up, there will be good agreement between the modeled and observed single and double event pattern distributions. Also, the observed-to-modeled fractions for both singles and doubles in the 0.5-2.0 keV range will be unity, within errors. In contrast, if the data are piled up, there will be clear divergence between the modeled and observed pattern distributions, and the observed-to-modeled fraction for singles will be less than 1.0, and for doubles, it will be greater than 1.0.

Finally, when examining the plots, it should be noted that the observed-to-modeled fractions can be inaccurate. Therefore, the agreement between the modeled and observed single and double event pattern distributions should be the main factor in determining if an observation is affected by pile up or not.

Examining the plots, we see that there is a large difference between the modeled and observed single and double pattern events, and that the observed-to-model fraction for doubles is over 1.0, indicating that the observation is piled up.

**Figure 9.4:** The output of *epatplot*.
$\includegraphics[scale=0.5]{CenX-3_0400550201_epat_piled.eps}$

9.7 My Data is Piled Up! Now What?

There are a couple of ways to deal with pile up. First, you can use event file filtering procedures to include only single pixel events (PATTERN==0), as these events are less sensitive to pile up than other patterns.

You can also excise areas of high count rates, i.e., the boresight column and several columns to either side of it. (This is analogous to removing the inner-most regions of a source in Imaging data.) The spectrum can then be re-extracted and you can continue your analysis on the excised event file. As with Imaging data, it is recommended that you take an iterative approach: remove an inner region, extract a spectrum, check with epatplot, and repeat, each time removing a slightly larger region, until the model and observed pattern distributions agree.

To extract only the columns to either side of the boresight,

1): Enter the filtering criteria in the "Selection Expression" area at the top of the xmmselect window: (FLAG==0) && (PI in [500:15000]) && (RAWX in [27:47]) && !(RAWX in [29:45])
2): Click the round button next to the PI column on the xmmselect GUI.
3): Click on "OGIP Spectrum".
4): In the "General" tab, check keepfilteroutput and withfilteredset. In the filteredset box, enter the name of the event file output. We will use pn_filt_source_NoBore.fits.
5): In the "Spectrum" tab, set the file name and binning parameters for the spectrum. Confirm that withspectrumset is checked. Set spectrumset to the desired output name. We will use source_pi_NoBore.fits. Confirm that withspecranges is checked. Set specchannelmin to 0 and specchannelmax to 20479.
6): Click "Run".
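Because the excision is iterative, it can help to generate the selection expression from the boresight column and the two half-widths instead of retyping it. The helper below is a hypothetical sketch (excise_expr is not a SAS task); it only builds the expression string, which can then be pasted into xmmselect or passed to evselect.

```shell
# Hypothetical helper: build the selection expression that keeps the columns
# within OUTER of CENTER but drops those within INNER of it.
# Usage: excise_expr CENTER OUTER INNER
excise_expr() {
  printf '(FLAG==0) && (PI in [500:15000]) && (RAWX in [%d:%d]) && !(RAWX in [%d:%d])\n' \
    $(($1 - $2)) $(($1 + $2)) $(($1 - $3)) $(($1 + $3))
}

# Boresight at RAWX=37, extraction half-width 10, excised half-width 8.
expr=$(excise_expr 37 10 8)
echo "$expr"
```

If epatplot still shows pile up, the third argument can be increased by a column or so and the spectrum re-extracted.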

Be aware that if you do this and are using SAS v. 13.x or older, you will need to use a non-standard way to make the ancillary files (ARFs) for your spectrum! This is discussed further in a later section.

9.8 Determine the Spectrum Extraction Areas

Now that we are confident that our spectrum is not piled up, we can continue by finding the source and background region areas. (This process is identical to that used for Imaging data.) This is done with the task backscale, which takes into account any bad pixels or chip gaps, and writes the result into the BACKSCAL keyword of the spectrum table. To find the source and background extraction areas, call backscale and then

1): In the "Main" tab, enter the name of the spectrum, source_pi_NoBore.fits.
2): In the "Effects" tab, confirm that withbadpixcorr is checked, and enter the name of the event file, pn_filt.fits, in badpixlocation.
3): Click "Run".

If you extracted a background spectrum, follow the same steps to find its extraction area, changing the input spectrum file to bkg_pi.fits.
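On the command line, this amounts to one backscale call per spectrum. The loop below is a sketch assuming the file names used in this chapter and an initialized SAS; the variable names spec and cmd are illustrative, and the commands are only printed if backscale is not found.

```shell
# Write the BACKSCAL keyword into the source spectrum and, if one was
# extracted, the background spectrum.
for spec in source_pi_NoBore.fits bkg_pi.fits; do
  cmd="backscale spectrumset=$spec withbadpixcorr=yes badpixlocation=pn_filt.fits"
  if command -v backscale >/dev/null 2>&1; then
    eval "$cmd"
  else
    echo "backscale not found; would run: $cmd"
  fi
done
```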

9.9 Create the Photon Redistribution Matrix (RMF) and Ancillary File (ARF)

If you are using SAS v. 14 or higher, making the RMF and ARF for PN data in TIMING mode is exactly the same as in IMAGING mode, even if you had to excise piled up areas. This is a change from earlier SAS versions; if you are working with an older SAS, you will need to use the special recipe below to generate the ARF (the method for making an RMF is the same as shown here).

To make the RMF, call rmfgen from the GUI, and then

1): In the "main" tab, set rmfset to the output file name. We will use source_rmf_NoBore.fits. Set the spectrumset parameter to the spectrum, source_pi_NoBore.fits.
2): Click "Run".

To make the ARF, call arfgen from the GUI, and then

1): In the "main" tab, set arfset to the output file name. We will use source_arf_NoBore.fits. Set the spectrumset parameter to the spectrum, source_pi_NoBore.fits.
2): In the "detector map" tab, use the pull-down menu to set the map type to psf.
3): In the "calibration" tab, set withrmfset to yes and rmfset to source_rmf_NoBore.fits.
4): In the "effects" tab, verify that withbadpixcorr is checked, and set badpixlocation to the event file, pn_filt.fits.
5): Click "Run".
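The two GUI recipes above can be sketched as command-line calls, run in order because arfgen reads the RMF. This assumes the file names used in this section and an initialized SAS; rmf_cmd, arf_cmd, and task are illustrative variable names, and the commands are only printed if the tasks are not found.

```shell
# RMF for the excised-source spectrum.
rmf_cmd="rmfgen rmfset=source_rmf_NoBore.fits spectrumset=source_pi_NoBore.fits"

# ARF using the RMF above, a PSF detector map, and bad pixel correction.
arf_cmd="arfgen arfset=source_arf_NoBore.fits spectrumset=source_pi_NoBore.fits \
withrmfset=yes rmfset=source_rmf_NoBore.fits detmaptype=psf \
withbadpixcorr=yes badpixlocation=pn_filt.fits"

for c in "$rmf_cmd" "$arf_cmd"; do
  task=${c%% *}   # first word of the command, i.e. the SAS task name
  if command -v "$task" >/dev/null 2>&1; then
    eval "$c"
  else
    echo "$task not found; would run: $c"
  fi
done
```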

If you excised regions to make a spectrum and are using SAS v. 13.x or older, you will need to make an ARF for the full extraction area, another one for the piled up area, and then subtract the two to find the ARF for the non-piled regions. To get those, we will need the spectra of the full extraction area and the excised area. We already have it for the full extraction area, so for the excised area, use the same procedure as in §9.5, but change the "Selection Expression" to (FLAG==0) && (PI in [500:15000]) && (RAWX in [29:45]), set the filteredset parameter to pn_filt_source_Excised.fits, and spectrumset to source_pi_Excised.fits.

Now we can use the spectra to make the ARFs. Invoke arfgen from the SAS GUI, and then

1): In the "main" tab, set arfset to the output file name. We will use source_arf_WithBore.fits. Set the spectrumset parameter to the spectrum, source_pi_WithBore.fits.
2): In the "detector map" tab, use the pull-down menu to set the map type to psf.
3): Click "Run".

Use the same procedure to make the ARF for the excised region, changing the arfset parameter to source_arf_Excised.fits and the spectrumset parameter to source_pi_Excised.fits.
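For SAS v. 13.x or older, the two ARFs can be sketched as a short loop over the region labels. This assumes the spectra source_pi_WithBore.fits and source_pi_Excised.fits already exist; the variable names region and cmd are illustrative, and the commands are only printed if arfgen is not found.

```shell
# One ARF for the full extraction area, one for the excised (piled up) columns.
for region in WithBore Excised; do
  cmd="arfgen arfset=source_arf_${region}.fits spectrumset=source_pi_${region}.fits detmaptype=psf"
  if command -v arfgen >/dev/null 2>&1; then
    eval "$cmd"
  else
    echo "arfgen not found; would run: $cmd"
  fi
done
```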

We can now subtract them. This is easiest to do on the command line:

: addarf "source_arf_WithBore.fits source_arf_Excised.fits" "1.0 -1.0" source_arf_NoBore.fits

At this point, the spectrum is ready to be analyzed, so skip ahead to prepare the spectrum for fitting (§13).

The timing data can also be examined in Xronos; this is discussed in §15.


Lynne Valencic 2023-06-29