Ongoing galaxy surveys map the Universe’s large-scale structure with unprecedented fidelity across unprecedented volumes. They promise to significantly deepen our understanding of the Universe, though this potential hinges on our ability to tackle systematic uncertainties. Until recently, this was beyond reach in implicit cosmological inference frameworks. In this talk, I present the first framework to uncover and correct model misspecification in field-based, implicit likelihood cosmological inference. Our approach proceeds in two steps. First, we infer the initial matter power spectrum using the SELFI algorithm, and use it to conduct a thorough simulation-based analysis of a range of systematic effects. This investigation relies on a single, joint suite of N-body simulations. Second, we perform implicit likelihood inference of cosmological parameters using a realistic forward model of large-scale spectroscopic galaxy survey that incorporates non-linear gravitational evolution with N-body. I show how the SELFI posterior can be used to assess the impact of misspecified galaxy bias, selection functions, survey masks, and redshift errors on the reconstructed initial power spectrum. I further show that percent-order misspecification may shift constraints by as much as 2σ in the (Ωₘ, σ₈) plane—biases our framework can now expose and avoid. This marks a critical step towards robust cosmological inference from full forward models of galaxy surveys such as DESI, Euclid, and LSST.
The next generation of galaxy surveys has the potential to significantly deepen our understanding of the Universe, though this potential hinges on our ability to rigorously address systematic uncertainties. In this talk, I present the first framework to uncover and correct model misspecification in field-based, implicit likelihood cosmological inference. Our approach is built upon a two-step framework. First, we employ the SELFI algorithm to infer the initial matter power spectrum, which we utilise to comprehensively investigate the impact of systematic effects. This investigation relies on a single, joint suite of N-body simulations. Second, we perform implicit likelihood inference of cosmological parameters using a realistic forward model of large-scale spectroscopic galaxy survey that incorporates non-linear gravitational evolution with N-body. I show how the SELFI posterior can be used to assess the impact of misspecified galaxy bias, selection functions, survey masks, and redshift errors on the reconstructed initial power spectrum. I further show that percent-order misspecification can lead to a bias greater than 2σ in the (Ωₘ, σ₈) plane, which we are able to detect and avoid using SELFI prior to inferring the cosmological parameters. This framework has the potential to significantly enhance the robustness of physical information extraction from full-forward models of large-scale galaxy surveys such as DESI, Euclid, and LSST.
Ongoing galaxy surveys map the Universe’s large-scale structure with unprecedented fidelity across immense cosmological volumes. Cosmological inference at the field level demands thousands of N-body simulations. To fully exploit Stage-IV data, simulators must therefore produce fast, high-precision realisations spanning vast cosmological volumes and reaching deep into the non-linear regime. The COmoving Lagrangian Acceleration (COLA) algorithm accelerates large-scale cosmological simulations by decoupling the temporal evolution of large and small scales: large scales are evolved analytically using Lagrangian Perturbation Theory (LPT), while small scales are integrated numerically. Contrary to a common misconception, COLA does not inherently sacrifice small-scale accuracy for speed: the LPT change of frame of reference can be done with any force calculation technique. I show that accurate small-scale dynamics can be obtained at a tractable computational cost by employing Particle–Particle–Particle–Mesh (P3M) force evaluations within an LPT frame of reference, achieving the precision of tree-based codes down to scales of just a few particle lengths. This result is a significant advance towards fully harnessing the cosmological potential of Stage-IV galaxy surveys.
The next generation of galaxy surveys has the potential to significantly deepen our understanding of the Universe, though this potential hinges on our ability to rigorously address systematic uncertainties. This was previously beyond reach in field-based implicit likelihood cosmological inference frameworks. We aim at inferring the initial matter power spectrum after recombination to diagnose a variety of systematic effects in galaxy surveys prior to inferring the cosmological parameters. Our approach is built upon a two-step framework. First, we employ the SELFI algorithm to infer the initial matter power spectrum, which we utilise to comprehensively investigate and disentangle how systematic effects influence the power spectrum reconstruction, using a single set of N-body simulations. Second, we obtain posterior cosmological parameters via implicit likelihood inference, recycling the simulations from the first step for data compression. We rely on a model of large-scale spectroscopic galaxy surveys that incorporates fully non-linear gravitational evolution and simulates multiple systematic effects typically encountered in astrophysical surveys. We demonstrate along with a practical guide how the SELFI posterior can be utilised to thoroughly assess the impact of misspecified linear galaxy bias parameters, selection functions, survey masks and inaccurate redshifts on the initial power spectrum after recombination. We show that a subtly misspecified model can lead to a bias greater than 2σ in the (Ωm, σ8) plane, which we are able to detect and avoid using SELFI prior to inferring the cosmological parameters. This framework has the potential to significantly enhance the robustness of physical information extraction from full-forward models of large-scale galaxy surveys such as DESI, Euclid, and LSST.
We present methodological advances to perform implicit likelihood inference of cosmology from arbitrarily complex models of galaxy surveys, while efficiently checking for systematics. This novel approach makes it possible to fully utilise our prior theoretical understanding of the initial matter power spectrum, in order to investigate the effects of known sources of systematics at play in the complex data generating process. It is currently being used for Additional Galaxy Clustering probes in preparation for the first Euclid data release.
We present methodological advances to perform implicit likelihood inference of cosmology from any forward model of galaxy surveys, while efficiently checking for systematics. The approach is based on a two-steps framework, and does not require any inner knowledge of the forward data model. First, we use SELFI (Simulator expansion for likelihood-free inference) to infer the initial matter power spectrum from any probe, and we use it to check whether all systematics are correctly accounted for based on qualitative and quantitative criteria. Second, cosmological parameters are inferred using implicit likelihood inference. Simulations used in the first step are recycled for optimal data compression, which is required for the second step. We show that mis-modelled systematic effects that would result in a biased posterior are unambiguously detected before performing the inference of cosmological parameters. The method is currently being used for Additional Galaxy Clustering probes in preparation for the first Euclid data release.
Next-generation large-scale optical surveys, such as Euclid and LSST, promise to unveil the nature of dark energy, elucidate the processes driving cosmic inflation, and constrain neutrino masses with unprecedented precision. Despite the wealth of data these surveys will provide, they will be dominated by systematic rather than statistical uncertainty, challenging standard Bayesian inference methods that heavily rely on the accuracy of the forward models involved. Notably, even slightly misspecified models can significantly bias or overconcentrate the credible posteriors on cosmological parameters, risking false discoveries. Furthermore, even if systematic effects arising from survey strategies were fully understood and controlled, theoretical challenges like galaxy biasing and non-linear structure growth at late times would persist, reducing the detectability of cosmological signatures and therefore calling for novel inference techniques. In this talk, I will present recent methodological advances to perform implicit likelihood inference of cosmology using arbitrarily complex black-box models of cosmological surveys. These advances enable efficient and thorough checks for systematic effects by leveraging our theoretical understanding of the primordial matter power spectrum after inflation, or any other latent variable in the forward model of cosmological observables.
We present methodological advances to perform implicit likelihood inference in cosmology from arbitrarily complex probabilistic forward models of cosmological surveys.
We present methodological advances to perform implicit likelihood inference of cosmology from arbitrarily complex models of cosmological surveys, while efficiently and extensively checking for systematics. This novel approach makes it possible to fully utilise our prior theoretical understanding of the initial matter power spectrum after inflation, in order to investigate the effects of known sources of systematics at play in the data generating process.