KKCorrelation: Scalar-scalar correlations
- class treecorr.KKCorrelation(config=None, *, logger=None, **kwargs)[source]
Bases:
Corr2
This class handles the calculation and storage of a 2-point scalar-scalar correlation function.
Note
While we use the term kappa (\(\kappa\)) here and the letter K in various places, in fact any scalar field will work here. For example, you can use this to compute correlations of the CMB temperature fluctuations, where “kappa” would really be \(\Delta T\).
Ojects of this class holds the following attributes:
- Attributes:
nbins – The number of bins in logr
bin_size – The size of the bins in logr
min_sep – The minimum separation being considered
max_sep – The maximum separation being considered
In addition, the following attributes are numpy arrays of length (nbins):
- Attributes:
logr – The nominal center of the bin in log(r) (the natural logarithm of r).
rnom – The nominal center of the bin converted to regular distance. i.e. r = exp(logr).
meanr – The (weighted) mean value of r for the pairs in each bin. If there are no pairs in a bin, then exp(logr) will be used instead.
meanlogr – The (weighted) mean value of log(r) for the pairs in each bin. If there are no pairs in a bin, then logr will be used instead.
xi – The correlation function, \(\xi(r)\)
varxi – An estimate of the variance of \(\xi\)
weight – The total weight in each bin.
npairs – The number of pairs going into each bin (including pairs where one or both objects have w=0).
cov – An estimate of the full covariance matrix.
Note
The default method for estimating the variance and covariance attributes (
varxi
, andcov
) is ‘shot’, which only includes the shot noise propagated into the final correlation. This does not include sample variance, so it is always an underestimate of the actual variance. To get better estimates, you need to setvar_method
to something else and use patches in the input catalog(s). cf. Covariance Estimates.If
sep_units
are given (either in the config dict or as a named kwarg) then the distances will all be in these units.Note
If you separate out the steps of the
Corr2.process
command and useprocess_auto
and/orCorr2.process_cross
, then the units will not be applied tomeanr
ormeanlogr
until thefinalize
function is called.The typical usage pattern is as follows:
>>> kk = treecorr.KKCorrelation(config) >>> kk.process(cat) # For auto-correlation. >>> kk.process(cat1,cat2) # For cross-correlation. >>> kk.write(file_name) # Write out to a file. >>> xi = kk.xi # Or access the correlation function directly.
- Parameters:
config (dict) – A configuration dict that can be used to pass in kwargs if desired. This dict is allowed to have addition entries besides those listed in
Corr2
, which are ignored here. (default: None)logger – If desired, a logger object for logging. (default: None, in which case one will be built according to the config dict’s verbose level.)
- Keyword Arguments:
**kwargs – See the documentation for
Corr2
for the list of allowed keyword arguments, which may be passed either directly or in the config dict.
- __init__(config=None, *, logger=None, **kwargs)[source]
Initialize
KKCorrelation
. See class doc for details.
- finalize(vark1, vark2)[source]
Finalize the calculation of the correlation function.
The
process_auto
andCorr2.process_cross
commands accumulate values in each bin, so they can be called multiple times if appropriate. Afterwards, this command finishes the calculation by dividing each column by the total weight.- Parameters:
vark1 (float) – The variance of the first scalar field.
vark2 (float) – The variance of the second scalar field.
- process_auto(cat, *, metric=None, num_threads=None)[source]
Process a single catalog, accumulating the auto-correlation.
This accumulates the weighted sums into the bins, but does not finalize the calculation by dividing by the total weight at the end. After calling this function as often as desired, the
finalize
command will finish the calculation.- Parameters:
cat (Catalog) – The catalog to process
metric (str) – Which metric to use. See Metrics for details. (default: ‘Euclidean’; this value can also be given in the constructor in the config dict.)
num_threads (int) – How many OpenMP threads to use during the calculation. (default: use the number of cpu cores; this value can also be given in the constructor in the config dict.)
- write(file_name, *, file_type=None, precision=None, write_patch_results=False, write_cov=False)[source]
Write the correlation function to the file, file_name.
The output file will include the following columns:
Column
Description
r_nom
The nominal center of the bin in r
meanr
The mean value \(\langle r \rangle\) of pairs that fell into each bin
meanlogr
The mean value \(\langle \log(r) \rangle\) of pairs that fell into each bin
xi
The estimate of the correlation function xi(r)
sigma_xi
The sqrt of the variance estimate of xi(r)
weight
The total weight contributing to each bin
npairs
The total number of pairs in each bin
If
sep_units
was given at construction, then the distances will all be in these units. Otherwise, they will be in either the same units as x,y,z (for flat or 3d coordinates) or radians (for spherical coordinates).- Parameters:
file_name (str) – The name of the file to write to.
file_type (str) – The type of file to write (‘ASCII’ or ‘FITS’). (default: determine the type automatically from the extension of file_name.)
precision (int) – For ASCII output catalogs, the desired precision. (default: 4; this value can also be given in the constructor in the config dict.)
write_patch_results (bool) – Whether to write the patch-based results as well. (default: False)
write_cov (bool) – Whether to write the covariance matrix as well. (default: False)