1.
Introduction
Recent developments in large interferometers (e.g., ALMA and NOEMA) have made it possible to spatially resolve the brightness distribution of many more astronomical objects. These observations have enabled us to obtain a three-dimensional (right ascension, declination, and the line-of-sight velocity) structure of the gas emission and two-dimensional images of the continuum within galaxies with high spatial resolution and sensitivity. As a result, the data allow for detailed image analysis; e.g., investigating spectral features of spatially resolved regions, characterizing faint and extended structures, performing Fourier analysis of the image, etc. However, the spatial correlation of the noise in interferometric images makes it difficult to evaluate the uncertainty of the results. There has been a lack of quantitative understanding of the spatial correlation of noise and methods to evaluate the statistical uncertainty of measured quantities and the significance of scientific results, such as signal detection and image analysis.
To estimate the statistical uncertainty of integrated fluxes or spectra under correlated noise, both variance and covariance of the pixel pairs in the integrated region need to be taken into account for the uncertainty propagation. Sun et al.1 proposed a method based on an approximation that the covariance of noise between pixels is proportional to the synthesized beam. However, the covariance is actually proportional to the autocorrelation of the synthesized beam.2 Also, more importantly, the method approximates the synthesized beam with a single Gaussian. In contrast, the true synthesized beam has a complex structure with a main lobe inducing short-range strong noise correlation and side lobes inducing long-range weak noise correlation. Such an oversimplified assumption can lead to underestimation of the uncertainty. A more widely used method to estimate the statistical uncertainty of integrated fluxes is based on an intuitive interpretation that the noise can be regarded as independent across beam-sized regions as described by Alatalo et al.3 The noise variance in the integrated values is evaluated by scaling the noise variance of individual pixels by the number of the beam area in the integrating aperture. This method also implicitly assumes the Gaussian beam to estimate the beam area of the synthesized beam. To evaluate the statistical uncertainty of the best-fitting parameters in a model fitting to the interferometric data with the correlated noise, Davis et al.4 proposed a method to construct a covariance matrix from the synthesized beam, which describes the covariance of the noise and uses it to compute the value. Again, they also assumed a simple Gaussian function for the synthesized beam.
Several studies have used workarounds to avoid the need to characterize the correlated noise using Monte Carlo methods. Harikane et al.5 measured the statistical uncertainty of the integrated flux by randomly placing identical apertures to the emission-free region and adoptting the root mean square (rms) of the summed values. Boizelle et al.6 estimated the statistical uncertainty of the fitting parameters by Monte Carlo resampling of the best-fitting parameters: adding noise extracted from the emission-free regions to the original data and refitting models. Another common technique is to fit the model in the visibility plane to measure the source size and shape where the noise in the visibility measurements is independent. This method is particularly beneficial if the source is small compared to the resolution of the interferometer since the model needs to be simple and axisymmetric for computational efficiency. However, in many cases, analysis in the image plane is necessary (e.g., complex structures, such as spiral arms, bar, and clumpy structures).7
There have yet to be any attempts to evaluate the statistical uncertainty for the general measurements using interferometric images (e.g., integrated flux, spectra, and fitting) by fully characterizing the detailed noise correlation. Refregier and Brown2 proposed to use the noise autocorrelation function (ACF) to characterize the correlated noise of the very large array FIRST radio survey data. They used the noise ACF to explore the effect of the spatially correlated noise in the signal of the ellipticity correlation function, which encodes the imprint of the weak lensing signal by the large-scale structure of the universe. The noise ACF fully characterizes the noise correlation properties of interferometric images and provides the covariance of noise between different pixels, which allows us to measure statistical uncertainty under the noise correlation.
In this paper, we present a method and associate Python code to characterize the spatial correlation of noise in interferometric images by measuring the noise ACF and evaluating its effect on the measured quantities and the analysis results.
This paper is organized as follows. In Sec. 2, we present the noise correlation properties of ALMA data characterized by the ACF and show that the noise correlation originates from the synthesized beam (dirty beam) structures, which remain even in the CLEAN image and cannot be removed by any deconvolution algorithm. In Sec. 3, we introduce methods for (1) estimating the statistical uncertainties associated with spatially integrated flux or spectra; (2) generating simulated noise maps from the measured noise ACF, which are useful to estimate the statistical significance of the result obtained by any image analysis; and (3) constructing the covariance matrix from the noise ACF which can be used in the formalism of the model fitting to the observed image, with example applications to real scientific data from Ref. 8.
Throughout this paper, we use the noise map from emission line cube and continuum image data taken by ALMA band 7 (2017100394.S; PI: González López, Jorge) as an example, but the method proposed by this paper can be applied to other interferometric images. A Python package for easy application of the methods described in this paper, Evaluating Statistical Significance under Noise Correlation (ESSENCE), is publicly available at https://github.com/takafumi291/ESSENCE.
2.
Noise Characterization of Interferometric Image
2.1.
Characterization of Spatially Correlated Noise
First, we consider a two-dimensional noise map , where denotes the position of the pixels. Pixel regions with signal from the object of interest are excluded. The statistical properties of the noise are assumed to be uniform in the noise image, which appears to be valid in the interferometric image (We discuss this in Sec. 2.2.) In most of the literature, the noise in the radio interferometric image is quantified and reported with the rms of the noise map :
Eq. (1)
Fig. 1
(a) Example ALMA band 7 noise map. The source emission region is eliminated with the 4 sigma clipping; see text. (b) The histogram of the pixel values of the noise map. The red dashed line indicates the best-fit Gaussian with the mean and the standard deviation ().
When noise can be assumed to be Gaussian, the statistical and correlation properties of noise are fully quantified by the noise ACF:2
Eq. (2)
Eq. (3)
Eq. (4)
Eq. (5)
Figure 2 shows the results of the noise ACF [Eq. (2)] computed for the noise map shown in Fig. 1, and the synthesized beam of the observation, both of which are normalized so that the central value is one. The noise ACF shows a pattern similar to that of the synthesized beam, with a correlation signal near the center and a correlation signal away from the center corresponding to the main lobe and side lobe of the synthesized beam, respectively. This suggests that most of the correlation of the noise originates from the discrete Fourier transform involved in the interferometric imaging, which is illustrated in the following section.
Fig. 2
(a) The noise ACF computed for the ALMA band 7 noise map and (b) a similar pattern in the synthesized beam of the observation.
Note that the noise ACF is measured for the noise maps of the band 7 continuum image (shown in Fig. 2), [Cii] line (velocity integrated over the velocity range of to , where the velocity is computed with respect to the redshifted [Cii] line frequency with the galaxy’s redshift of 4.40749) moment 0 map, and the [Cii] line cubes (each velocity channel map). These maps are primary beam uncorrected and cleaned images produced by the CLEAN algorithm in CASA (see details in Ref. 8).
2.2.
Origin of the Noise Correlation
In interferometric observations, measurable quantities are visibility (Fourier amplitude and phase) of the astronomical image at the given spatial frequencies , which are related to the antenna baseline vector (separation vector of pairs of antennas) projected onto the plane of the sky and the observed wavelength . The image is then computed by the Fourier transform of the measured visibility.
To explore the origin of the noise correlation in the image, seen in (Fig. 2), we start with the ideal case in which the observation measures visibilities at all spatial frequencies . The visibility of the source of interest is , which is the Fourier transform of the true flux distribution of the source in the image, , where FT denotes the Fourier transform. A measurement of usually involves uncorrelated random noise, which we describe with the random variable with zero mean. We assume that the statistical property of the random variable is uniform as a function of and , that is, the system noise temperature is the same for all antennas. The image obtained is the Fourier transform of the measurement
Eq. (6)
In practice, visibilities are measured only at the limited spatial frequencies ( coverage). The spatial transfer function is used to describe the spatial frequencies , in which we measure the visibility. This function is nonzero if the visibility at is actually measured, which can be expressed as
Eq. (7)
Eq. (8)
For convenience, by replacing the sky position of with the pixel position , the noise map of the image in Eq. (8) is written as
Eq. (9)
Eq. (10)
Eq. (11)
Equation (10) implies that the noise ACF is related to the ACF of the synthesized beam with a constant multiplicative factor, which is the variance of the noise. In Fig. 3, we compare the ACF of noise and that of the synthesized beam, along with the residuals (noise ACF minus beam ACF). Although the noise ACF and beam ACF show a common characteristic pattern, they do not completely coincide. The difference of the two ACF shows an extended weak positive correlation and a relatively large negative around the main beam in the residual. This disagreement is likely due to not only (1) the remaining contamination by the emission from the sources, but also (2) the process involved in the imaging of the visibility measurements. These are discussed in detail in Appendix A comparing Fig. 3 obtained from the actual data with the one (Fig. 13) obtained from the simulated data with a similar observational setup and realistic noise in the visibilities but without emission in the sky. Note that our interest is on the statistical property of the noise in the image plane, which we characterize by the noise ACF including effects of contamination from the source and the imaging process.
Fig. 3
(a) The same noise ACF shown in Fig. 2, (b) the ACF of the synthesized beam, and (c) the residual of the noise ACF minus the ACF of the synthesized beam.
Due to the limited spatial frequency coverage, the synthesized beam has a complex structure with sidelobes that extend from the center to a large radius. The flux from the source is spread out by the side lobes to the distant pixels in the image. The CLEAN algorithm, which is most commonly used in radio imaging, deconvolves the beam pattern for signals with high ) and replaces it with a CLEAN beam without side lobes (a Gaussian that approximates the mainlobe of the synthesized beam). The CLEAN algorithm successfully suppresses the influence of the sidelobe and produces a high-fidelity image, but cannot remove the spatial correlations that exist in stochastic noise . Therefore, it is important to evaluate their effects on image analysis and signal detection, which we will describe in Sec. 3.
3.
Example Application to Scientific Data
3.1.
Contribution of the Correlated Noise to the Statistical Uncertainty in the Measured Flux
The most fundamental measurement of astronomy is the total flux distributing over some sky region in the images, which are measured by summing the pixel values over the region of interest (i.e., aperture photometry in optical astronomy). In particular, at the submillimeter band of ALMA, the flux of the continuum emission arising primarily from thermal dust, and line emission and absorption by the various atomic and molecular gases are used to estimate the physical properties of the interstellar medium (e.g., dust mass, gas mass, and the energy source of the ionization or excitation). It is important to estimate the uncertainty of the measured quantities. As shown in Fig. 2, the noise in interferometric images correlates significantly between pixels, making the estimation of the noise in the integrated flux difficult. In the previous literature, statistical uncertainties of integrated fluxes were estimated by one of the two methods: (1) randomly placing identical apertures in the noise region of the image, measuring the sum within each aperture and then adopting the rms as the noise in the sum of pixels in the aperture5 and (2) assuming that the regions in the image separated with a beam size do not correlate and adopting , where is the number of beams (independent regions) in the aperture.3 is estimated as , where and are the aperture area and the CLEAN beam area in pixels, respectively. For convenience, we call methods (1) and (2) "random aperture method" and "independent beam method," respectively, in this paper.
In the "independent beam method" (), the factor is the standard deviation of the sum of noise in individual pixels within a beam assuming that the noise perfectly correlates within a beam. Then the standard deviation of the sum of the noise of each independent beam area in the aperture is computed by scaling by the square root of the number of independent beams within the aperture. The terms and in the denote just the number of data points to be summed. Therefore, we caution readers that has the same unit as . Most interferometric maps and measured are in brightness units, e.g., or . So we need to divide to compare with the integrated flux or spectral flux density, e.g., or Jy. is a factor of different from described by Alatalo et al.3 due to the unit difference where they assume the quantity in the unit of flux.
This section introduces how to derive the statistical uncertainty associated with the spatially integrated flux directly from the computed noise ACF. We consider adding all the pixel values at pixel positions within the sky region of interest . The random noise in the map is characterized by the noise ACF . The statistical uncertainty associated with the summed value within the pixel region can be estimated as
Eq. (12)
As an illustration, we used the spatially resolved [Cii] moment 0 map of BRI 1335-0417 taken by ALMA, shown in Fig. 4, where the emission spreads over multiple pixels. We calculated the noise in the integrated flux measured using a variety of apertures with different sizes. The largest aperture is a dotted line shown in Fig. 4. Figure 5 shows the computed noise from the measured noise ACF compared to the previously used "random aperture" and "independent beam" methods. The noise calculated from the ACF is in excellent agreement with the random aperture method, whereas the independent beam method tends to overestimate at smaller apertures (considering the limiting case that the aperture is a single pixel, the obtained value by "independent beam method", ) is clearly different from the fiducial value ) and underestimate at large apertures, showing that the assumption of "independent beam" is oversimplified. When the field of view is small (the field of view becomes smaller at higher frequency bands in ALMA) or the aperture area becomes larger, the random aperture method cannot place apertures randomly in the limited area of the emission-free region, and thus the standard error of the estimate increases, as shown in the blue shaded region in Fig. 5. The proposed method of calculating , however, can provide the best estimate by exploiting all available data to estimate the noise ACF. The total [Cii] flux measured with the aperture shown in Fig. 4 is 29.51 ( statistical uncertainty calculated from the noise ACF).
Fig. 4
[Cii] velocity integrated intensity map of BRI 1335-0417.8 The white contour is shown every from to . The white elipse shown in bottom-left corner indicates the FWHM of the main lobe of the synthesized beam. The dotted line circle shows the aperture corresponding to the point with the largest number of pixels shown in Fig. 5. The [Cii] spectrum within the aperture is also shown in Fig. 8.
Fig. 5
(a) The noise rms in the integrated flux for apertures with different sizes estimated by various methods: the one computed from the noise ACF (black points), the one estimated from the random aperture methods (blue dashed line) with the standard error (blue shade), and the one estimated from the independent beam method (black solid line). (b) The fractional difference between the one computed from the noise ACF and the ones from the random aperture method and independent beam method.
To demonstrate the significant effect of spatially correlated noise, Fig. 6 shows the noise variance in the integrated flux calculated from the ACF along with the contributions of the noise variance of individual pixels in the aperture [the first term in Eq. (12), this is the value we obtain if we are unaware of the noise correlation] and the interpixel correlation in the aperture [the second term in Eq. (12)]. In the case of an aperture with two pixels, it is not mathematically allowed for the first term to exceed the second term. However, as the number of pixels in the aperture increases, the contribution of the second term dominates over the first term because the number of the first terms is , whereas the number of the second terms is . Ignoring the noise correlation will lead to a significant underestimation of the integrated flux uncertainty.
Fig. 6
The measured noise variance estimated from the noise ACF for different apertures (blue points and solid line), with the contribution from the noise variance of the individual pixels in the aperture [the first term of the last line in Eq. (12), green points and dotted line] and the contribution from the covariance due to the interpixel correlation (the second term of the last line in Eq. (12), orange points and dashed line).
Figure 7 further divides the variance due to noise correlation into the effects of the mainlobe (correlation due to the mainlobe of the synthesized beam) and the sidelobe (long-range pixel correlation due to the sidelobe of the synthesized beam). To illustrate the significance of the long-range correlation conservatively, we define short-range/long-range correlation components of the noise ACF inside/outside the ellipse with the beam FWHM in radius, respectively. After exceeds 200, the effect of the sidelobe becomes significant. This explains the deviation of the estimate by the independent beam method from the true value (computed by the noise ACF or the random aperture method; see Fig. 5); the number of independent beams is estimated by the area of the CLEAN beam, and the long-range correlation due to the sidelobe is not properly taken into account.
Fig. 7
The covariance term due to the interpixel correlation shown in Fig. 6. [the second term of the last line in Eq. (12), orange points and solid line], which are further decomposed into the component of the long-range correlation due to the sidelobe of the synthesized beam (purple points and dotted line) and short-range correlation due to the mainlobe of the beam (red points and dashed line).
Another important measurement in astronomy is the shape of the spectrum integrated over a certain region of interest. Similar to deriving the noise in the integrated flux, we can derive the underlying noise in the spatially integrated spectrum using the noise ACFs computed for every velocity channel of the data cube. Figure 8 shows the spatially integrated [Cii] spectrum enclosed by the aperture shown in Fig. 4 with and noise levels. The [Cii] spectrum of BRI 1335-0417 is well described by a single Gaussian without deviations from the Gaussian above . Because emission lines from interesting astronomical phenomena, such as outflows and tidal tails, are faint, it is important to accurately estimate the noise, otherwise, it will lead to false detections.
Fig. 8
(a) The spatially integrated flux within the aperture shown in Fig. 4 (solid black line) and the best-fit Gaussian (solid blue line). (b) The residual (measured spectrum minus the best-fit Gaussian). The gray shade and dashed line in both panels show the statistical uncertainty of and , respectively, calculated from the noise ACF. This figure is reproduced with permission from Ref. 8 where the underlying noise is recalculated by the noise ACF of each velocity channel.
3.2.
Simulating the Noise Maps
In image analysis, generating random noise based on the statistical properties of the noise is useful to assess the significance of the results. In this section, we describe how to generate random noise based on the noise ACF, which fully characterizes the correlated Gaussian noise. Using an example of simulated noise, we demonstrate how ignoring correlated noise leads to misinterpretation of results.
Once the noise ACF is measured, the noise at the positions in images with a size of pixels can be generated randomly by the joint probability distribution, the probability that takes the value in small intervals () given by10
Eq. (13)
Eq. (14)
Figure 9 shows the comparison of the noise of the observed data, the noise randomly generated from the measured noise ACF using the joint Gaussian probability distribution [Eq. (13); we used the MULTIVARIATE_NORMAL function from the scipy package11], and the spatially uncorrelated Gaussian noise, all of which have the same standard deviation . The observed noise and the randomly generated noise using the noise ACF are qualitatively similar, whereas the spatially correlated noise and the spatially uncorrelated noise look completely different, illustrating how dangerous it is to assume naively that noise is uncorrelated in image analysis.
Fig. 9
(a) Noise map in the observed data, (b) noise map generated from the measured noise ACF, and (c) uncorrelated noise map. All of which have the same standard deviation .
In the literature, emission-free regions have been extracted and used as realistic noise maps to estimate the statistical uncertainty associated with the parameters derived by the model fitting.6 The field of view of the interferometric observation is generally small, preventing us from obtaining a sufficient number of independent noise maps or cubes to conduct Monte Carlo experiments. (As the is not strongly variable across a spectral window, we may use the line-free channels with cautions of the channels affected by emission lines in the atmosphere and the usually negligible variation of spatial frequencies with spectral frequency.) The proposed method can generate random noise repeatedly from the measured statistical properties of the noise (the noise ACF) in the interferometric image with the best precision limited by the available area of the emission-free region.
There are many potential applications of generating noise. We illustrate the significance of the correlated noise using one example: the image analysis done in Ref. 8, which expands the image with series of logarithmic spirals and identifies the dominant spiral structure in the image. In Fig. 10, we show the Fourier spectrum of the logarithmic spirals in the [Cii] intensity map of the BRI 1335-0417 shown in Fig. 4 (black solid line) and the underlying noise spectra estimated from the simulated noise maps from noise ACF (blue shaded region) and the simulated noise maps with no spatial correlation but the same standard deviation (orange shaded region). The spectrum shows the amplitude of the logarithmic spiral with arms (-fold symmetry) as a function of a variable , which is related to the pitch angle of . To calculate the spectrum, the image (Fig. 4) was first deprojected to be viewed face-on with an inclination angle of 37.8 deg and a position angle of 4.5 deg, where the package SCIKIT-IMAGE performed rotation and stretching for the deprojection (which induces an additional noise correlation in the image). Then the amplitude of each Fourier component of the logarithmic spiral and is calculated (see Equation S2 in Ref. 8). The noise spectrum was measured by generating 300 noise maps, calculating the amplitude of each Fourier component in the same way as the image, and taking the 84th percentiles. The noise spectrum computed by assuming the uncorrelated Gaussian noise map significantly underestimates the true noise spectrum, which could lead to the false detection of statistically insignificant structures, such as the second peak in and multiple peaks in . The estimated noise spectra are higher than those reported in the original paper because the image stretching and rotation were not applied to the noise maps in the original paper, resulting in an underestimation of the noise level. However, the result of the original article does not change except for the updated statistical significance of each peak in being , , , and , respectively. The peak with corresponds to the fact that the northern arm is longer than the southern arm.
Fig. 10
Fourier spectra of logarithmic spiral models (solid black lines) and the underlying noise due to the noise in the images computed from the simulated noise maps by the noise ACF (blue shaded region) and the uncorrelated Gaussian noise maps with the same standard deviation (orange shaded region). The peak in indicates that the dominant component of the image is a 2 armed spiral structure with a pitch angle of . This figure is adopted with permission from Ref. 8 and the underlying noise spectrum is recalculated using the noise maps simulated by the noise ACFs.
3.3.
Fitting a Model to an Interferometric Image Under Spatially Correlated Noise
Fitting an analytic model of the intensity distribution to the 2D observed image is one of the most fundamental tasks in astronomy, allowing us to extract information to characterize astronomical sources from images efficiently. For a simple example, fitting 2D Gaussian to the intensity image provides essential information regarding the source’s position, size, and brightness. This section introduces the construction of a covariance matrix from the measured noise ACF, which is required to calculate , to properly estimate the statistical uncertainty on the parameters derived by fitting the model to the observed interferometric image.
Consider fitting a 2D model to an observed image with pixels, where denotes a set of model parameters. We denote the flattened model image and the observed image by and , respectively, with the flat indices that correspond to the coordinate indices in the images. The formal expression to calculate is
Eq. (15)
Eq. (16)
We can construct the covariance matrix in Eq. (15) from measured noise ACF by
Eq. (17)
Now, the question becomes; what effect does the noise correlation have on the estimated statistical uncertainties of the model parameters derived from the fitting process? To explore this, we fit the 2D Gaussian model to a noiseless 2D Gaussian image and sampled the posterior distribution of the model parameters: total flux of the Gaussian distribution , center of the Gaussian , major axis and minor axis of the Gaussian , and position angle (P.A.) from the north to the major axis (counterclockwise). The noise effect perturbing the best-fitting parameters is effectively taken into account by the covariance matrix in the likelihood [Eq. (15)]. We compared the posterior distributions that resulted in two cases where (1) the noise has no spatial correlation [using Eq. (16)] and (2) the noise has spatial correlation characterized by the noise ACF shown in Fig. 2 [using Eq. (15), and the covariance matrix constructed from the noise ACF]. Note that in both cases, the noise variance in individual pixels is the same, but the latter case has spatial correlation. We used emcee12 to sample posterior distributions with logarithmic likelihood and a uniform prior. Figure 11 shows the posterior distributions of the parameters of the Gaussian model in two cases. Accounting for the spatial correlation of the noise results in a significantly larger confidence interval than that obtained assuming that there is no spatial correlation of the noise. The model parameters would appear to be constrained too well if we ignored the noise correlation in the fitting process.
Fig. 11
The posterior distributions that we obtain when fitting the Gaussian model to the image with different assumptions about noise. Black: the posterior distribution sampled using the covariance matrix constructed from the noise ACF shown in Fig. 2. Blue: posterior distribution sampled assuming that there is no spatial correlation in the noise but with the same . The black distribution corresponds to the expected posterior distribution that we obtain if we properly take into account the noise correlation by measuring the noise ACF, whereas the blue shows that we obtain if we ignore the noise correlation and just naively use Eq. (16) with the sky noise level . True values are shown with solid cyan lines. (a) A noiseless image to be fitted, (b) an example image with the correlated noise added, and (c) an example image with the uncorrelated noise added. The comparisons with (b) and (c) illustrate that the image data with uncorrelated noise can retain more information on the intrinsic Gaussian distribution than the image with the correlated noise with the same . The white contours show emission levels of , , and in both images.
Figure 12 compares the posterior distribution sampled with the covariance matrix (the same as Fig. 11) with the distribution estimated by Monte Carlo resampling. The resampling is performed by repeatedly obtaining bestfitting parameter values after adding the randomly generated correlated noise to the noiseless image (the method described in Sec. 3.2). This confirms that the posterior distributions estimated by the two approaches agree well.
Fig. 12
Black: same as in Fig. 11; green: posterior distribution estimated by Monte Carlo resampling: repeating the fitting the Gaussian model to the image added with randomly generated noise map from the noise ACF (the method described in Sec. 3.2), showing the agreement between the black and green distributions. (a), (b) Same as in Fig. 11.
In addition to the 2D fitting problem when investigating emission distributions, many recent studies fit 3D model cubes to observe 3D data cubes (R.A., Dec., and velocity) thanks to the improvement of interferometric imaging in terms of spatial resolution and sensitivity. Radio interferometer has sufficiently high-frequency resolution ( or at 110 GHz for ALMA), which is usually further binned to increase the signal to noise. Therefore, the noise can be regarded as independent across the frequency (or velocity) channels, and the above method described for the 2D case can be applied to compute for the 3D case. One of the main examples of 3D cube fitting is the kinematical modeling of emission lines.
There are several publicly available modeling codes for this purpose, which assume the disk geometry of the emission line, parametrize the rotation curve, velocity, dispersion, and 3D geometry of the disk (position angle, inclination), and produce 3D model cubes that can be directly compared to observations taking into account the resolution and pixel binning (e.g., TIRIFIC,13 KINMS,14 3D-BAROLO,15 GALPAK 3D,16 and QUBEFIT17). Under dynamical equilibrium, rotation curves are powerful tracers for the mass distribution of galaxies that can be further decomposed into black hole mass, stellar bulge, exponential disk, etc., depending on the data quality. These codes have been widely used to measure galaxy mass distributions and BH masses from distant to nearby galaxies (e.g., Refs. 8 and 14). However, these codes do not consider the correlation of the noise. As the number of data becomes large and the effect of noise correlation becomes significant, estimates of the statistical uncertainty on best-fitting parameters are too small.
Davis et al.4 estimate a covariance matrix from a point spread function and use it to calculate the values. They approximate the point spread function by a single Gaussian function ignoring the side-lobe, which is an oversimplification as seen in Fig. 7. We find that the side-lobe effect can dominate as an image region becomes bigger. As a result, the estimates of the statistical uncertainty of parameters can be underestimated. In addition, as seen in Sec. 2.2, the point spread function does not coincide with the actual noise correlation pattern in the image. Therefore, we recommend constructing the covariance matrix using Eq. (17) from the noise ACF, which fully characterizes the actual noise correlation pattern rather than from the point spread function.
Boizelle et al.6 proposed to estimate the formal by block averaging the data and model cube to form roughly beam-sized cells to avoid the need to calculate the covariance matrix mentioned above. They stated that the block averaging method does not fully mitigate the correlation between neighboring pixels with the presence of the long-range correlation as shown in Fig. 7. They estimated the final statistical uncertainty on the model parameters by Monte Carlo realisation using line-free channels.
The example calculation for the 2D fitting confirms that the two approaches, Monte Carlo resampling (in 2D or 3D),6 as well as the use of the covariance matrix during fitting are equivalent and should provide the correct estimation under spatially correlated noise.
4.
Conclusions
Understanding the spatial correlation of noise in interferometric images is important to correctly evaluate the statistical significance of the result. We have shown that the noise ACF of an ALMA noise image has a pattern similar to that of the synthesized beam (dirty beam) and that the spatial correlation of the noise originates from the limited coverage. To correctly evaluate the statistical uncertainty of the measured quantities, we propose first measuring the noise ACF in the interferometric image, which can provide the best estimate of the full statistical properties of the noise (correlation properties) with all emission-free regions available. Once the noise ACF is measured, we can directly (1) evaluate the statistical uncertainty associated with a spatially integrated flux or spectrum, (2) randomly generate noise maps with the same correlation property, and (3) construct the covariance matrix and determine a value when fitting a 2D model to an image. The method to deal with the spatially correlated noise in the interferometric image has not been documented in the astronomical literature, even for the basic spatially integrated flux measurements. We demonstrated example applications of our methods to scientific data showing that ignoring noise correlation leads to significant underestimation of statistical uncertainty of the results and false detections. A Python package for easy application of the method described in this paper, ESSENCE, is publicly available at https://github.com/takafumi291/ESSENCE.
Drizzling has become a common technique in producing optical-IR observational images. The drizzling method resamples raw under-sampled images and corrects for geometric distortion by shifting, rotating, and interpolation, and coadd these corrected images to have a common Cartesian grid. This process induces significant pixel correlation (e.g., Refs. 18 and 19). Measuring the noise ACF does not require any assumption about the probability distribution of the noise (e.g., Gaussian). Therefore, our method has potential applications to a range of astronomical images not only of interferometers but also of optical-IR observations.
5.
Appendix A: The Difference between the Noise ACF and the Synthesized Beam ACF
Equation (10) implies that the noise ACF is identical to the ACF of the synthesized beam with a constant multiplicative factor. However, they do not completely coincide, showing an extended weak positive correlation and a relatively large negative around the main beam in the residual (noise ACF-ACF of the synthesized beam, Fig. 3). To investigate the origin of the feature, we simulate the observation with a similar setup: hour angle, sky position, antenna configuration, and realistic atomospheric noise, but without emission in the sky, using SIMALMA in CASA.20 (The actual data are taken with observation, with 1 h of integration on a source intermittently separated by calibrator observations and other overheads. In comparison, the simulated observation is a continuous 1 h of integration on the source.) The visibility obtained is imaged with the same imaging parameters as those used for the actual data. Figure 13 shows the noise ACF, the ACF of the synthesized beam, and their residual (noise ACF-ACF of the synthesized beam) for the simulated observation without sky emisssion. The residual does not show the extended positive correlation pattern seen in Fig. 3, suggesting that the positive correlation pattern may be due to the sky emission of the sources and undetected background sources. The emissions from the astronomical sources are spread in the noise region by long-range sidelobes of the synthesized beam, which are generally removed for bright emissions down to the noise level of 1.5 to 3 sigma by CLEAN. Weaker emissions and its leakage into the surrounding pixels remain in the noise region. These uncleaned components and weak background sources hidden in the noise map may produce the faint positive correlation pattern in the residual Fig. 3. However, the residual obtained for the simulated observation without sky emission (Fig. 13) still shows a similar negative around the main beam as seen in Fig. 3, suggesting that the large negative cannot be attributed to emissions from the sky and may be due to the process involved in the imaging. In practice, the Fourier transform in the imaging is performed by the discrete Fourier transform, in which the visibility data are evaluated discretely using a rectangular grid. The visibility data represented as ( in Eq. (8) are interpolated (more precisely, the visibility data are convolved by a function to produce a continuous distribution. The function is chosen to minimize the image aliasing)21 to estimate the visibility value in the center of the grid. Each grid is further weighted depending on the number of data points in the grid cell or the rms error of the data in the cell.22 The negative feature in the north–south direction around the main beam may correspond to the fact that the sampling of is relatively denser in the north–south direction (see Fig. 14), which is more affected by the averaging process, than in the east–west direction.
Fig. 13
(a)–(c) Same as in Fig. 3 but for the noise image produced from the simulated data with a similar observational setup and without sky emission. The simulated data were produced by SIMALMA in CASA.
Fig. 14
The spatial transfer function of the simulated observation . The relative density is estimated by the kernel density estimate method in the seaborn plotting package and shown in blue solid lines.