Revealing The Local Cosmic Web From Galaxies by Deep Learning

The Astrophysical Journal, 913:76 (14pp), 2021 May 20 https://doi.org/10.
3847/1538-4357/abf040
© 2021. The American Astronomical Society. All rights reserved.
Revealing the Local Cosmic Web from Galaxies by Deep Learning

Sungwook E. Hong (홍성욱)1,2 , Donghui Jeong3 , Ho Seong Hwang2,4 , and Juhan Kim5
1
Natural Science Research Institute, University of Seoul, 163 Seoulsiripdaero, Dongdaemun-gu, Seoul 02504, Republic of Korea; djeong@psu.edu
2
Korea Astronomy and Space Science Institute, 776 Daedeokdae-ro, Yuseong-gu, Daejeon 34055, Republic of Korea
3
Department of Astronomy and Astrophysics, and Institute for Gravitation and the Cosmos, The Pennsylvania State University, University Park, PA 16802, USA
4
Astronomy Program, Department of Physics and Astronomy, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul 08826, Republic of Korea
5
Center for Advanced Computation, Korea Institute for Advanced Study, 85 Heogiro, Dongdaemun-gu, Seoul 02455, Republic of Korea
Received 2020 August 3; revised 2021 March 15; accepted 2021 March 17; published 2021 May 26
Abstract
A total of 80% of the matter in the universe is in the form of dark matter that composes the skeleton of the large-
scale structure called the cosmic web. As the cosmic web dictates the motion of all matter in galaxies and
intergalactic media through gravity, knowing the distribution of dark matter is essential for studying the large-scale
structure. However, the cosmic web’s detailed structure is unknown because it is dominated by dark matter and
warm−hot intergalactic media, both of which are hard to trace. Here we show that we can reconstruct the cosmic
web from the galaxy distribution using the convolutional-neural-network-based deep-learning algorithm. We find
the mapping between the position and velocity of galaxies and the cosmic web using the results of the state-of-the-
art cosmological galaxy simulations of Illustris-TNG. We confirm the mapping by applying it to the EAGLE
simulation. Finally, using the local galaxy sample from Cosmicflows-3, we find the dark matter map in the local
universe. We anticipate that the local dark matter map will illuminate the studies of the nature of dark matter and
the formation and evolution of the Local Group. High-resolution simulations and precise distance measurements to
local galaxies will improve the accuracy of the dark matter map.
Unified Astronomy Thesaurus concepts: Dark matter distribution (356); Cosmology (343); Large-scale structure of
the universe (902); Local Group (929)
1. Introduction signals from the extragalactic sources by cross-correlating

the high-energy cosmic rays with the distribution of galaxies
Since Fritz Zwicky inferred its existence from the large
velocity dispersion of the Coma Cluster (Zwicky 1933) and (Fornasa et al. 2016; Fang et al. 2020) and dark matter
Vera Rubin confirmed it with the flat rotation curve of galaxies traced by weak gravitational lensing (Tröster et al. 2017;
(Rubin et al. 1970), astronomers have been only strengthening Ammazzalorso et al. 2020). All searches for the dark matter
the necessity of the nonbaryonic matter providing excess particles thus far, however, have not concluded with a firm
gravity. We call that dark matter. The most substantial pieces of detection. They have been only narrowing down the possible
evidence include an excessive mass-to-light ratio in the dwarf dark matter masses and the interaction strengths among dark
galaxies (Aaronson 1983), the mismatch between the X-ray matter particles, as well as between dark matter and atoms
map (gas distribution) and the weak gravitational lensing map (Akerib et al. 2017; Arcadi et al. 2018). For these efforts of
(mass distribution; Clowe et al. 2006), and the disparity searching for the nature of dark matter, the most basic
between the heights of even- and odd-acoustic peaks in the information currently lacking is the distribution of the dark
temperature power spectrum of the cosmic microwave back- matter, or cosmic web, in the local large-scale structure beyond
ground (CMB; Larson et al. 2011). Dark matter is also an the Milky Way halo. Of course, we have a good reason to
indispensable component of the concordance cosmological believe that dark matter halos surround each galaxy in the
model. Accounting for the measured expansion rate of the universe. It is, however, also well known that the galaxies are
universe (Planck Collaboration et al. 2020) requires the matter biased, rather than faithful, tracers of the large-scale structure
component whose energy density is over five times larger than (Desjacques et al. 2018).
that of atoms for which the robust upper limit comes from big In this article, we shall present a novel method of unveiling
bang nucleosynthesis (Cooke et al. 2014). The observed large- the cosmic web in the local universe. As dark matter is dark, of
scale distribution of galaxies (Anderson et al. 2014) and the course, we cannot observe it directly from the telescope. The
map of weak gravitational lensing potential (Abbott et al. 2018) only guaranteed way of searching for the dark matter is the
also require the dark matter providing the skeleton of the large- same method for their discovery, through their gravitational
scale structure within which clouds of atoms collapse to form influence on visible objects. On the intergalactic scales, dark
stars and galaxies (Davis et al. 1985). matter dominates the gravitational interaction and determines
With the essential role that dark matter plays in modern the cosmic velocity flow. We can, therefore, infer the
astronomy and cosmology, in the past few decades there have distribution of dark matter by carefully studying the distribu-
been continuous efforts to search for the nature of dark matter tion and motion of galaxies. Taking the observed distribution of
particles in the particle accelerators (ATLAS Collaboration galaxies and their peculiar velocity flow, in what follows we
et al. 2019; Vannerom 2019), cosmic rays (Giesen et al. 2015), shall decipher the dark matter distribution, or cosmic web,
gamma rays (Ackermann et al. 2015), and high-energy within the local ∼20 Mpc h−1.
neutrinos (Aartsen et al. 2018). Beyond the Milky Way halo, When reconstructing the local dark matter distribution
there have also been recent studies focusing on the dark matter directly from observed galaxy distributions, we face the
1
The Astrophysical Journal, 913:76 (14pp), 2021 May 20 Hong et al.
following challenges. First, the local galaxy distribution at the sample over the given region, we make the volume-limited
low Galactic latitudes is hidden behind the intense radiation subsample of the CF3 as follows. First, since the number
from the Galactic disk and contaminated by the interstellar gas density of the CF3 galaxies close to the Galactic plane
and dust, which makes it hard to obtain the complete map of (Galactic latitude |b| < 10°) is lower than average, we only use
the galaxy distribution. Second, even if we had the complete the galaxies at |b| > 10°. Also, we use the B-band absolute
map of galaxies, they are biased tracers of the large-scale magnitude (MB) compiled from the Lyon Extragalactic
structure, that is, the distribution of galaxies does not Database (LEDA; Paturel et al. 2003) as a proxy of the stellar
necessarily reflect the distribution of dark matter. mass (Må; Wilman & Erwin 2012). We set the B-band
Previous attempts (Gottloeber et al. 2010; Libeskind et al. magnitude as −15 for the selection criterion, which is sufficient
2010; Carrick et al. 2015; Carlesi et al. 2016; Lavaux & for covering the 20 and 40 Mpc h−1 cubic volume around the
Jasche 2016) of making the local dark matter map, therefore, Milky Way. We have also tested the cases with MB < −16 and
have relied on the cosmological simulations constrained by the −17 and found no noticeable difference of the predictions from
smoothed density field at high Galactic latitudes. Typically, a the fiducial choice (see Section 4). Note that we have not used
smoothing scale of a few megaparsecs is employed when the KS-band absolute magnitude, one of the best-known tracers
matching the simulation output to the observation. However, of the stellar mass (Bell et al. 2003), because that information is
this observational constraint for the fully evolved galaxy missing for about 30% of the galaxies in our sample (Lavaux &
distribution is nontrivial to implement because the simulation Hudson 2011; Huchra et al. 2012).
needs the density distribution at the initial time. Alternatively, We calculate the radial peculiar velocity by subtracting the
the Bayesian Origin Reconstruction from Galaxies (BORG; Hubble flow from the velocity in the Galactic standard of rest
see, e.g., Jasche & Wandelt 2013; Jasche et al. 2015) approach (VGSR; Kourkchi et al. 2020). Note that we do not use the
uses the multiple Gaussian processes to draw the probability velocity in the CMB standard of rest (VCMB) to reduce any bias
distribution of the initial density perturbation from a given that might be introduced in the conversion. Instead, when
galaxy distribution. As based on the dark matter density field generating training and test samples from simulation data, we
evolution by second-order Lagrangian perturbation theory include the peculiar motion of the Milky Way corresponding
(2LPT) and the linear galaxy bias model, the method is also galaxy in each simulation. There exists a difference on the
limited to, again, the scale larger than a few megaparsecs where Hubble constant between recent CMB observations
the 2LPT and linear bias models are accurate. (H0 = 67.77 km s−1 Mpc−1; Planck Collaboration et al. 2020)
Here we overcome the challenges by taking a novel approach and the best fit from the CF3 (H0 = 75 km s−1 Mpc−1; Tully
based on deep learning (DL). DL, as well as a conventional et al. 2016). In this study, we have tested both values and find
machine-learning technique, has been introduced to measure the that the effect from the different Hubble constants stays within
dark matter distribution from weak gravitational lensing or spatial the uncertainty of the dark matter map (see Section 4).
distribution of dark matter halos (e.g., Modi et al. 2018; Shirasaki
et al. 2019; Jeffrey et al. 2020). On the contrary, our DL approach
2.2. Simulation Data: Illustris-TNG and EAGLE
aims to reconstruct the local dark matter map down to a
megaparsec scale by incorporating all information in the observed We use TNG100-1, a simulation with a comoving volume
galaxy data: the spatial distribution and the radial peculiar velocity V = (75 Mpc h-1)3 and 18203 dark matter and gas particles
of galaxies. We use the DL algorithm based on the convolutional from the Illustris-TNG simulation suite (Marinacci et al. 2018;
neural network (CNN) to find the mapping between the local dark Naiman et al. 2018; Nelson et al. 2018, 2019; Pillepich et al.
matter distribution and the observed positions and the radial 2018; Springel et al. 2018), as our high-resolution simulation
peculiar velocities of local galaxies. data (TNG100 hereafter). To mimic the observation from the
The structure of this paper is as follows. In Section 2, we Milky Way, we select 988 galaxies with stellar mass
describe the simulation and observational data used for DL 4 × 1010Me < Må < 1011Me (center galaxies hereafter) by
training and prediction, respectively. In Section 3, we will adopting that the Galactic stellar mass is about 5.2 × 1010Me
briefly describe our DL architecture and the evaluation of our (Licquia & Newman 2015). Around each center galaxy, we
DL model. In Section 4, we will show the reconstructed local make a subcube with 20 Mpc h−1 box size and calculate the
dark matter map and its statistical robustness. We will dark matter density field within the 643 uniform grid. We also
summarize our result in Section 5. calculate the relative position of galaxies with MB < −15
Throughout the paper, we assume a standard ΛCDM cosmology (target galaxies hereafter) and the difference of peculiar
in concordance with the Planck 2018 analysis (Planck Collabora- velocity between the target galaxy and center galaxy.
tion et al. 2020): (W0m , WL0 , h ) = (0.31, 0.69, 0.6777). It is For the low-resolution dark matter map with V =
similar to the standard cosmologies adopted in Illustris-TNG and (40 Mpc h-1)3, we use the TNG300-1 from the Illustris-TNG
EAGLE simulations: (W0m , WL0 , h ) = (0.3089, 0.6911, 0.6774) simulations, whose volume and number of particles are
and (0.307, 0.693, 0.6777), respectively (Springel et al. 2018; V = (205 Mpc h-1)3 and 25003, respectively (TNG300 here-
Schaye et al. 2015). after). Note that the amplitude of the luminosity function of
TNG300 is lower than the observation and TNG100, mainly
due to the lower spatial resolution of the simulation (Pillepich
2. Data et al. 2018). Therefore, we also apply the resolution correction
to find the center and target galaxies using the number density
2.1. Observational Data: Cosmicflows-3
obtained from TNG100 rather than directly using the face
We use the Cosmicflows-3 galaxy catalog (Tully et al. 2016; values of Må or MB. We also use TNG300-1-Dark, a dark-
CF3 hereafter), one of the most comprehensive galaxy catalogs matter-only counterpart of TNG300, to test how baryonic
that provide distance, radial peculiar velocity, and luminosity physics affects our result. We select the center and target
of 17,647 galaxies up to 200 Mpc. To produce a fair galaxy galaxies by finding the mass cut of dark matter halos with the
2
Figure 1. CNN architecture used for TNG300. We denote the layer size by the quadruple where the spatial dimension (2n, 2n, 2n) follows the number of channels. The
size (except the number of filters) of each layer for TNG100 is half that of TNG300.
same number density. The result from TNG300-1-Dark is can extract different physical features in the data. Specifically,
similar to or slightly worse than that from TNG300 (see we use a CNN architecture similar to the U-Net (Ronneberger
Section 4). et al. 2015) or V-Net (Milletari et al. 2016) to predict the dark
Also, we use RefL0100N1504, a reference simulation with matter density field from the galaxy position and radial peculiar
V = (67.77 Mpc h-1)3 and 15043 dark matter and gas particles velocity (see Figure 1). Our CNN architecture consists of the
from the EAGLE simulation suite (Schaye et al. 2015; Crain et al. following two stages: the encoding stage (Input—ConvNs),
2015, EAGLE hereafter), to check the fidelity of our result. For the with increasing number of filters and decreasing the size of
center galaxies, we use the same selection criterion as TNG100 hidden layers, and the decoding stage (UpConvNs—Output),
and find 478 center galaxies. For the target galaxies, however, we with decreasing number of filters and increasing the size of
do not directly use MB. This is because the luminosity function of hidden layers. Here, Ns denotes the spatial size of hidden
EAGLE is reliable only for bright galaxies (MB  − 18) since the
layers. To retain the small-scale spatial resolution, we also
EAGLE simulations calculate the luminosity only to massive
attach the hidden layers in the equivalent (with the same layer
galaxies (Må 108.5Me; Camps et al. 2018). Instead, similar to
TNG300, we use the galaxy number density obtained from size) encoding stage as additional channels to the decoding
TNG100 to find the stellar mass cut of target galaxies. layer, doubling the number of channels. We refer to this
process as concatenation.
The encoding stage consists of a series of ConvNs layers.
3. Methods Let us define the input of a given ConvNs,0 as  ℓ; i, j, k , where
i, j, k Î [1, Ns,0] are the spatial coordinates and ℓ Î [1, Nch,0] is
3.1. Deep-learning Architecture the channel index, with Nch,0 being the total number of
We construct the DL architecture using CNN that highlights channels. To accommodate the convolution at the edge, we
features in the data by a series of convolutions, resulting in so- have added the buffer around the input array (padding process).
called hidden layers. By varying the convolution filters, one As we use a 5 × 5 × 5 convolution filter, it suffices to add
3
Np = 2 padding pixels at both edges of each dimension. We fill In addition to the usual steps described above, the final
the padding pixels by reflecting the inner 2 pixels next to the Output layer requires following two special treatments so that
edge pixels. the output layer represents the single dark matter density
After the padding, we apply a three-dimensional convolution proportional to log10 (r r0 ), which can be both positive and
with a multichannel filter wℓ, ℓ ¢ ; i ¢ , j ¢ , k ¢ and bias bℓ, with indices negative. First, instead of a gradual decrease of the number of
i¢ , j ¢ , k ¢ Î [1, Nk = 5], ℓ ä [1, Nch,1], ℓ ¢ Î [1, Nch,0], to obtain output channels by a factor of 2, we set the number of output
the output  as channels for Output as 1. Second, instead of the ReLU
activation function, whose output range is [0, +inf), we use the
 ℓ; i, j, k = bℓ + å ℓ ¢ ; s (i ¢ ; i ), s ( j ¢ ; j ), s (k ¢ ; k ) wℓ, ℓ ¢ ; i ¢ , j ¢ , k ¢, (1 ) hyperbolic tangent function (tanh) so that its output range
ℓ ¢,i¢,j¢,k ¢
becomes finite ([ −1, +1] in this case).
where ℓ ¢ ; s (i ¢ ; i), s ( j ¢ ; j), s (k ¢ ; k ) is the input array after the padding. We have adopted different spatial sizes of the hidden layer
We sample the convolution sparsely, s (i¢ ; i ) = i ´ Nst + i¢, for TNG100 and TNG300 to accommodate the difference in
their spatial resolution. For TNG100, the encoding stage starts
and reduce the spatial dimension by a factor of 23 at each step
from two channels of 643-grid input layers and ends with the
by choosing the spatial interval Nst = 2 (strides hereafter). 2048 channels of the 23-grid layer (Conv2), and for TNG300,
Accompanying the reduction of spatial dimension, we increase the encoding stage starts from two channels of 1283-grid input
the number of channels Nch by a factor of 2 at each step of the layers and ends with the 2048 channels of the 43-grid layer
convolution, from 128 (Conv64) to 2048 (Conv4). Note that (Conv2). The final output layers are 643 and 1283 for TNG100
the convolution filter wℓ, ℓ ¢ ; i ¢ , j ¢ , k ¢ and bias bℓ are trainable and TNG300, respectively. We have also tested other CNN
parameters that we adjust for the training. architectures with various channel sizes and confirmed that the
The padding and convolution processes are linear operations, CNN architecture that we use here (shown in Figure 1)
so any combinations of these operations simplify to a single performs the best among the tested cases.
linear algebra operation. In order to fully utilize the multiple
hidden layers of DL, we apply the rectified linear unit (ReLU; 3.2. Training
Hahnloser et al. 2000; Glorot et al. 2011),
We divide the training and validation samples from
 ℓ; i, j, k = max ( ℓ; i, j, k , 0) , (2 ) TNG100 so that all subcubes from the validation sample do
not overlap with those from the training sample. As a result, we
as a nonlinear activation function for each hidden layer. only use 525 subcubes—432 for training and 93 for validation.
Finally, we apply the batch normalization (Ioffe & Szegedy For each subcube, we make two 643 uniform grids as a two-
2015) channel input layer; each channel stores the number of target
 ℓ; i, j, k - mℓ; i, j, k galaxies (Ngal) and the averaged radial peculiar velocity (Vpec)
ℓ; i, j, k = gℓ; i, j, k + b ℓ; i , j , k , (3 ) in units of km s−1. For the input layer, we apply the same
s 2ℓ; i, j, k +  Galactic latitude mask as the CF3 data (masking out |b| < 10°).
For the output layer, we normalize the logarithm of dark matter
to obtain an output ConvNs,1 layer, ℓ; i, j, k (i, j, k ä [1, Ns,1 = density to be
Ns,0/2], ℓ ä [1, Nch,1]). Here, μℓ;i,j,k and σℓ;i,j,k are the mean and
standard deviation of  ℓ; i, j, k over samples in the same mini- y=
1
log10 (r r 0) , (5 )
batch, and ò = 10−3 is a small value for the numerical stability. 4.5
Note that the mini-batch refers to the bundle of input−output
where ρ0 is the mean dark matter density of the universe so that
pairs that we have used for updating the trainable parameters.
all values in the output layer would be between −1 and +1.
The normalization factor γℓ;i,j,k and bias factor βℓ;i,j,k are other
For data augmentation, we allow swapping the (x, y, z)-axes
trainable parameters. The batch normalization introduces an of each subcube, which increases the number of samples by a
extra level of nonlinearity, ensuring that the trainable factor of three. We further increase the sample size by flipping
parameters introduced at earlier hidden layers still affect the the axis direction, with which the number of samples increases
output. eight times. Note that, unlike U-Net or V-Net, we do not split a
The decoding stage consists of a series of UpConvNs layers, single cube into multiple smaller cubes for data augmentation
which are constructed in a parallel manner. In contrast to the because that would change the Galactic latitude mask and the
ConvNs, where we decreases the spatial dimension by sparsely radial peculiar velocity. In the end, we obtain samples of
sampling the convolved array, we increase the spatial 10,368 and 2232, respectively, in training and validation sets.
dimension of each UpConvNs layer, We implement our CNN architecture in Keras (Chollet et al.
ℓ; i, j, k =  ℓ; u (i ), u ( j ), u (k ), (4 ) 2015) with the Tensorflow back end (Abadi et al. 2015) and
perform the training with an NVIDIA Tesla V100 graphic
by duplicating the input array  ℓ; i, j, k . Here u(x) = ⌈x/Nu⌉, and processing unit (GPU) with 16 GB memory. We choose the
we set the upsampling factor Nu = 2 in order to increase the mean squared error (MSE) as the loss function that the DL
spatial size of ℓ; i, j, k by a factor of 8. After the upsampling, we minimizes during the training:
concatenate the ConvNs layer (the same size) and apply batch 1 n
normalization. We then apply a three-dimensional convolution  TNG100 =
n
å ( yi,pred - yi,truth )2 (6 )
i=1
with (Nk, Nst) = (3, 1), after the reflective padding of the edge
arrays with Np = 1. We decrease the number of output channels n 2
1 1
of each UpConvNs from 1024 to 128 by a factor of 2. After the =
n
å ⎡ 4.5 log10 (ri,pred ri,truth) ⎤ , (7 )
convolution, we apply the ReLU activation function. i=1 ⎣ ⎦
4
where the subscripts (i, pred) and (i, truth) are, respectively, the
prediction and truth values of the y (defined in Equation (5)) at
the ith grid.
Initially, we set the trainable parameters in the convolution
filters (θ; parameter vector hereafter) randomly. The training
process for minimizing the loss function is done with 200
epochs, a unit process that updates the parameter vector from a
subset of the train set and applies the updated parameter vector
to a subset of the validation set. The parameter vector update
process at each epoch consists of 1728 mini-batches. We set the
mini-batch size as six, mainly due to the GPU memory limit.
For each mini-batch we numerically calculate the gradient of
the loss function (q ) and update the parameter vector by the
Adam optimizer (Kingma & Ba 2014),
mt (1 - b 1t )
qt = qt - 1 - a (8 )
vt (1 - b 2t ) + 
Figure 2. Evolution of loss function ( ) as a function of learning rate of the Adam
mt = b1mt - 1 + (1 - b1) q t (qt - 1) (9 ) optimizer (α) from an additional test training for TNG300. A too low learning rate
(α  10−8) gives a too slow update of the parameter vector, which is presented as
a flat slope of  (a). On the other hand, a too high learning rate (α  10−5)
vt = b 2 vt - 1 + (1 - b 2)[q t (qt - 1)]2 . (10) prevents finding a solution, which is presented as a noisy increment of  (a).
Here t is a mini-batch step number starting from zero, mt and vt

are the first- and second-moment vectors with initial values
m0 = v0 = 0, β1 = 0.9 and β2 = 0.999 are exponential decay
rates for moment estimates, and ò = 10−7 is a small value for
the numerical stability. α is the learning rate that determines
how fast one updates the parameter vector, and we set it as
10−3. As a result, the training process for TNG100 takes about
73 hr for a single run.
We perform a similar training for the TNG300 outcome,
except for the following differences. First, we have 10,629
training subcubes and 1256 validation subcubes, with each
subcube having 1283 grids. Unlike TNG100, we do not apply
further data augmentation, mainly due to the expensive
computational cost from large CNN architecture size. Second,
since the dynamic range of dark matter density of TNG300 is
wider than TNG100, we use
1
y= log10 (r r 0) (11)
5 Figure 3. Evolution of loss functions from train (blue) and validation (orange)
sets as a function of epoch for TNG300.
for the output layer instead. As a result, the MSE loss function
becomes
vector update is too slow, the loss function as a function of
n 2 learning rate ( (a)) has a flat slope. On the other hand, if the
1 1
 TNG300 =
n
å ⎡ 5 log10 (ri,pred ri,truth) ⎤ . (12) learning rate is too high, i.e., if an interval of parameter vector
i=1 ⎣ ⎦ update is too large to find a solution,  (a) presents a noisy
increment. We found that 3 × 10−8 < α < 4 × 10−5 is a suitable
Third, instead of using a fixed learning rate, we apply a
range of the learning rate and set (αL, αU) = (3 × 10−8, 4 × 10−5)
triangular cyclic learning rate (Smith 2015),
for the triangular cyclic learning rate accordingly. Finally, due to
a - aL the large CNN architecture size, we use four NVIDIA Tesla V100
at = a L + U ´ min {(t mod T ) , T - (t mod T )},
T 2 GPUs with 32 GB of memory each, with a mini-batch size of 8.
(13) For each training, we run 400 epochs by using only 157 mini-
batches per epoch, and it takes about 90 hr for a single run.
to avoid the training being stuck in local minima. Here T is the Figure 3 shows the evolution of the MSE loss functions from
number of mini-batches that consists of a single learning rate both train and validation sets as a function of epoch in
cycle, and we set it as 8; αL and αU are the minimum and TNG300. Both train and validation losses similarly decrease
maximum values of the learning rates, respectively. To find a over epoch until the validation loss reaches its minimum
suitable range of learning rates, we have performed an additional around 8 × 10−3 at ∼140 epochs, while the train loss continues
test training with a few epochs by varying learning rates (see decreasing at all epochs. Similar minimum values of validation
Figure 2). If the learning rate is too low, i.e., if the parameter losses have been found during our test training, and we expect
5
Table 1 statistics of the 2pCFs between truth and prediction at a given

Summary of TNG300 and Its Comparison Models Used in This Paper scale,
Model Description
KS(x pred, xtruth) = maxx¢∣P˜pred (x ¢) - P˜truth (x ¢)∣. (14)
TNG300 Simulation: TNG300-1 hydrodynamic simulation
Center galaxies: 4 × 1010Me < Må < 1011Me after resolution
correction
Here, P˜ (x ¢) = N (x < x ¢) Nsample is the empirical distribution
Target galaxies: MB < −15 after resolution correction function, where Nsample and N (x < x ¢) are, respectively, the
Input layer: two-channel (Ngal and Vpec) number of whole samples and the number of those satisfying
Hubble parameter: 67.77 km s−1 Mpc−1 x < x ¢. The smaller KS(ξpred, ξtruth) indicates that the predicted
16mag Target galaxies: MB < − 16 after resolution correction probability distribution of the 2pCF is closer to the true
17mag Target galaxies: MB < − 17 after resolution correction distribution, so we use that as a metric to compare the
noVpec Input layer: one-channel (Ngal) performance of models. For both TNG100 and TNG300, the
stellarMass Input layer: two-channel (log10 (M M) (logarithm of the total models at the minimum training loss provide the closest
stellar mass) and Vpec) distribution of the 2pCF predictions to their truth, and we adopt
DMhalo Simulation: TNG300-1-Dark dark-matter-only simulation
Center & target galaxies: applying halo mass cut that matches
them as our optimal models.
the same galaxy number density to TNG300 Table 2 and Figures 4 and 5 show a visual inspection and the
diffH0 Hubble parameter: 75 km s−1 Mpc−1 statistics of the TNG300 validation samples, which show a
good agreement with their true values. Interestingly, the
Note. Each comparison model is the same as TNG300 except those mentioned predicted dark matter distribution shows small-scale filamen-
in its “Description.” tary structures, which are not apparently shown in Ngal alone.
This is the first indication of the importance of the (radial)
peculiar velocity field for reconstructing the small-scale
that the above value is close to the global minimum of the filamentary structures; that is, the recovered dark matter map
validation loss function in our current CNN setup. If the shows much more detailed structure than simply connecting the
validation loss greatly increases over epoch after reaching its galaxy positions, since the peculiar velocity could provide
minimum while the train loss keeps decreasing, it may infer information about the underlying gravitational potential.
that the learning process starts overfitting the data—the Simply put, we use the galaxies as test particles for recovering
learning process tries to memorize the data without finding the local gravitational field. Note that, however, there is a slight
any global feature. In our runs, however, the validation loss difference in the detailed distribution of filamentary structures
does not increase more than 1.1 times its minimum until the last between truth and prediction. Also, note that there exists a
epoch, which suggests that our result would not suffer from an sharp lower cut in the predicted density min rpred ~ 10-2r0 .
overfitting problem significantly. From each run, we select The above two issues could be overcome by using higher-
three models from three different epochs for the following resolution hydrodynamic simulations and observational data
performance test: at the minimum validation loss, at the with more low-brightness galaxies. Also, fine-tuned choices of
minimum training loss, and the last epoch. loss function might help manage an issue about a slight
For TNG300, we perform six additional alternative trainings difference of filamentary structures.
with different configurations of the input layer (comparison models After choosing the optimal models, we perform the
hereafter) to understand how such difference affects our prediction convergence test between models with different simulation
(see Table 1). 16mag and 17mag use the alternative absolute resolutions and setups. First, we compare the local dark matter
B-band magnitude cutoffs MB < −16 and −17, respectively. density field predictions from TNG100 and TNG300 within
stellarMass uses the logarithm of the total stellar mass rather than the radius r = 10 Mpc h−1. We find that they show similar
the simple galaxy number as an input layer, while noVpec does distribution up to r ∼ 4 Mpc h−1, while the dark matter map
not use the radial peculiar velocity. Finally, DMhalo uses the dark from TNG100 shows finer small-scale structures than
matter halos in the dark-matter-only simulation TNG300-1-Dark TNG300 (see Section 4.3). Also, we apply the CNN model
instead of galaxies in the TNG300-1.6 from TNG100 to the test sample of EAGLE (EAGLE-
TNG100 in Table 2).
Note that we do not apply the CNN model from TNG300 to
4. Results EAGLE because the volume of EAGLE is not sufficiently
4.1. Performance Test larger than the volume of TNG300 subcubes. We find that its
performance test result is similar to the TNG100 validation
To test the model parameters tuned with TNG100 and sample, except that EAGLE-TNG100 tends to slightly over-
TNG300 training sets, we apply the model to the validation estimate the dark matter density (see Figure 5 and Table 2).
samples to compare the resulting dark matter density cube with We also test the performance of various comparison models
the ground truth. Specifically, we use the following four of TNG300 (see Table 1 for definitions). Most comparison
methods for the performance test: visual comparison, joint models show similar overall performance to TNG300, while
probability distribution, histogram, and two-point correlation those from the dark-matter-only simulation (DMhalo) have
function (2pCF) ξ(r) = 〈δ(x)δ(x + r)〉x. To examine the perfor- slightly more offset in the distribution of 2pCFs. Those without
mance of the each model, we use the Kolmogorov–Smirnov using the radial peculiar velocity as inputs (noVpec), however,
6
do not reproduce any small-scale filamentary structure shown
Note that Modi et al. (2018) performed a similar study to reconstruct the in the true dark matter distribution (see the right panel of
(initial) density perturbation from the dark matter halo distributions by DL, but
they focused more on large scales such as baryon acoustic oscillation (BAO) Figure 4). From its visual inspection, one could interpret the
rather than relatively small scales such as ours. output of noVpec as a smoothing of the galaxy number
6
Figure 4. Three-way projections of a single TNG300 validation sample with 5 Mpc h−1 thickness. From left to right: galaxy number (Ngal), radial peculiar velocity
(Vpec), truth dark matter density (ρtruth), reconstructed dark matter density (ρTNG300), and another reconstruction from the CNN architecture without using the radial
peculiar velocity (noVpec; ρnoVpec). TNG300 can well reconstruct the filamentary structure of a few megaparsec scales in the true dark matter distribution, while
noVpec does not show such structure.
Table 2
Summary of the Performance Test Done by Validation Samples of TNG100, TNG300, and Their Comparison Models
Model log10 (rpred rtruth ) KS(ξpred, ξtruth)

−1
0 − 1 Mpc h 1 − 3 Mpc h−1 3 − 10 Mpc h−1
TNG100 −0.014 ± 0.543 0.263 ± 0.035 0.175 ± 0.087 0.130 ± 0.042
EAGLE-TNG100 +0.129 ± 0.491 0.171 ± 0.055 0.152 ± 0.047 0.149 ± 0.040
TNG300 −0.020 ± 0.451 0.153 ± 0.035 0.134 ± 0.040 0.163 ± 0.017

16mag −0.008 ± 0.468 0.109 ± 0.010 0.161 ± 0.033 0.254 ± 0.016
17mag +0.017 ± 0.481 0.143 ± 0.037 0.168 ± 0.018 0.251 ± 0.019
noVpec +0.016 ± 0.481 0.367 ± 0.115 0.407 ± 0.061 0.170 ± 0.036
stellarMass −0.050 ± 0.471 0.186 ± 0.056 0.218 ± 0.016 0.269 ± 0.021
DMhalo +0.002 ± 0.481 0.264 ± 0.029 0.243 ± 0.030 0.263 ± 0.034
Note. KS(ξpred, ξtruth) is the Kolmogorov–Smirnov statistics of the two-point correlation functions of dark matter distribution between truth and prediction. EAGLE-
TNG100 is the application of the TNG100 model to the EAGLE samples. diffH0 is identical to TNG300 since Hubble flow estimation is not considered in this test.
distribution—the only available input of the given DL model— SGY, and SGZ), extended to the full cube with the side length of
with a few megaparsec scale. As a result, the 2pCFs of noVpec 40 Mpc h−1. Figure 6 clearly shows known local objects that we
show a significant deviation from their truth in small scales designated by their common name. The figure also recovers
with r  3 Mpc h−1 (see Table 2). From the comparison to known local large-scale structures. For example, we find a
TNG300 and its other comparison models, it is apparent that 10 Mpc h−1 spread along + SGY-direction in the SGZ−SGY (top
the (radial) peculiar velocity plays a significant role in left panel) and SGY−SGX (bottom right panel) planes. This
reconstructing the small-scale filamentary structure. structure is known as the Local Sheet, which connects the Local
Group and Virgo Cluster and contains M81, NGC 5194, Canes II,
and Coma I groups (Tully et al. 2008; Courtois et al. 2013). We
4.2. Three-dimensional View of the Local Cosmic Web
also find that, around the Local Group, the Local Sheet is
Figure 6 shows a sliced view of the reconstructed cosmic web connected to the Fornax Wall (Fairall et al. 1994), which is a
integrated over 4 Mpc h−1 thickness. Each panel shows the cosmic 20 Mpc h−1 sized spread along the (−SGY, −SGZ) direction,
web on the plane of the Supergalactic Cartesian coordinates (SGX, containing the Fornax Cluster, Eridanus Cluster, and Dorado
7
Figure 5. Result of the performance tests for the DL result using the three-dimensional dark matter density field of simulations. Top panel: statistical comparison
between the ground truth and the predicted dark matter density from the entire TNG300 validation sample. From left to right: joint probability distribution (colors)
with 1σ, 2σ, 3σ certainty level contours (lines), median (lines) and 1σ deviation (shades) of histograms, and median (lines) and 1σ deviation (shades) of the two-point
correlation functions. Bottom panel: similar to the top panel, but by applying the TNG100 training to the entire EAGLE test sample.
Group as members (top left panel). At the opposite direction to the Furthermore, to estimate the uncertainties of the dark matter
Fornax Wall on the SGZ−SGY plane, the Local Void (Tully & map, we perform a stress test on our CNN models by incorporating
Fisher 1987) is also apparent (also shown on the SGZ−SGX distance measurement uncertainties in the CF3. We use the one
plane), which might extend beyond the boundary of our local standard deviation uncertainty in distance modulus (òμ) in the CF3,
universe sample. In Figure 6, we also present the velocity flow
1
lines derived from the reconstructed gravitational potential gradient m º . (15)
with arrows and black lines. The velocity flow shows the motion åi 1  i2
of material from the Local Void to nearby filamentary structures
and clusters such as the Local Sheet, Fornax Wall, and Virgo Here òi includes the one standard deviation uncertainty
Cluster. Note that we cannot reproduce the velocity flow from the determined from a recalibration of galaxy magnitude with H I
Virgo Cluster to the Great Attractor (+SGX-direction), because of line width (Tully & Courtois 2012), distance measurement of
the limited extension of the volume that we analyze here. the tip of the red giant branch from the Hubble Space
However, we would like to emphasize that the recovered dark Telescope, Type Ia supernovae from various samples (Tully
matter map provides us detailed density and velocity fields around et al. 2013), Tully–Fisher relation using Spitzer [3.6] photo-
these known local large-scale structures. metry, and the fundamental plane relation from the Six Degree
The recovered cosmic web also shows a hint of new Field Galaxy Survey (6dFGS; Tully et al. 2016). We then
structures that require further investigation. For example, the generate 1000 sets of random distance moduli that follow the
direction of the Local Sheet is similar to the direction of the so- normal distribution,
called vast polar structure (VPOS), which consists of satellite
galaxies, globular clusters, and stellar streams around the Milky 1 ⎡ Dm2 ⎤
P (Dm) = exp ⎢ - 2 ⎥. (16)
Way (Pawlowski et al. 2012). As shown in Figure 6, the Local  m 2p ⎣ 2 m ⎦
Sheet, being the strongest filamentary structure around the
Local Group, is a source of velocity flow; that might cause a Then, we recalculate the radial peculiar velocity by subtracting
connection between the two. Also, a couple of small filaments the Hubble flow corresponding to the random distances from the
are visible in our maps, which could be good targets for VGSR. Since the distance measurement error exists only along the
systematic examination with deep imaging surveys. radial direction, we have generated the two-dimensional column
8
Figure 6. Three-dimensional density maps of the local dark matter with 40 Mpc h−1 box size and 4 Mpc h−1 thickness. Cross at the center: Milky Way. Dots: galaxies with
MB < − 15. Texts: galaxy groups, clusters, and local structures. Arrows: estimated directions of motion derived from the gradient of the reconstructed gravitational potential.
density map of the dark matter that is less affected by the error where θ, r, ρ(θ, r) are the two-dimensional sky coordinates,
than the three-dimensional dark matter density field (see Figure 9). distance from the observer, and the dark matter density at the given
Also, we find that the dark matter column density map driven (θ, r), respectively. We use the HEALPix (Górski et al. 2005;
from TNG300 shows significantly less deviation than that of Zonca et al. 2019) package to reconstruct the two-dimensional sky
TNG100, which suffers from some spurious structure consistently map from the three-dimensional data cube. We set the resolution
appearing near the Galactic plane. parameter Nside = 128, which roughly corresponds to the angular
resolution of 27¢. This figure also shows the locations and radial
4.3. Sky Map of the Local Cosmic Web peculiar velocities of galaxies that we use for the reconstruction
The left panels of Figure 7 (labeled as TNG300) show the (color-coded dots), as well as the locations of some well-known
recovered local dark matter map on the sky (gray map), galaxy groups and clusters (large dots).
The map in Figure 7 uses the radial distance and radial peculiar
S (q ) º ò d r r (q , r ) , (17) velocities reported in the CF3 catalog (Tully et al. 2016). We have
9
Figure 7. Two-dimensional full-sky map of the local dark matter column density with 4 Mpc h−1 widths. Left panels: predictions from TNG300 training, from the
nearest to the farthest radial bin. Right panels: comparison predictions from TNG100 training (TNG100), training with dark matter halos from the dark-matter-only
simulation (DMhalo), and training without using the radial peculiar velocity (noVpec). Small dots: positions and peculiar velocity (color) of known local galaxies.
Large dots: galaxy groups and clusters with their names.
10
the density contrast. As a result, the signal-to-noise ratio

S N º ∣ álog10 Sñ ∣ D log10 S scales almost linearly as the
density contrast, reaching up to S/N ; 10 for the density
peaks. On average, the signal-to-noise ratios for dark matter
distribution per pixel at higher Galactic latitudes (|b| > 10°) are
4.25, 3.76, 3.94, 4.19, and 4.52, respectively, from the nearest
to the farthest radial bin.
Note that, in addition to the distance measurement uncertainty,
there are systematic uncertainties in DL mapping itself into the
error budget. For example, the galaxy simulations with different
resolutions or different subgrid prescriptions can lead to different
DL mapping. We check such a systematic effect by comparing
TNG300 with various comparison models, including those
already introduced in Table 1. To do this, we calculate the on-
sky average of the systematics
∣log10 S - log10 STNG300 ∣

Dsys º , (20)
Figure 8. Angular covariance function C(δθ) as a function of angular distance D log10 STNG300
(δθ) in the sky. Each angular covariance function roughly follows
C (dq ) µ exp (-dq dq0 ), where δθ0 is a proxy of angular resolution of a where Σ and ΣTNG300 are the local dark matter column
degraded map each of whose pixel is statistically independent.
densities from a given comparison model and TNG300, both
by adopting the reported values of galaxy locations and
mitigated the 10%–30% uncertainties of distance measurement in
the catalog by adopting the radial binning of Δr = 4 Mpc h−1. We peculiar velocities. First, we check the systematic effect of the
further analyze the statistical uncertainties of the recovered dark resolution by comparing the local dark matter map estimated
matter map by generating 1000 realizations incorporating the from TNG100 and TNG300. The top right panel of Figure 7
uncertainties of the distance measurement (see Section 4.2). From shows the r < 4 Mpc h−1 bin dark matter map driven from the
the high angular resolution map (Nside = 128), we define the high-resolution result (TNG100). TNG100 systematically
angular covariance function, underestimates the density contrast by Δsys = 2.3 on average
(see Table 3).
ádS (q ) dS (q ¢)ñN, q, q¢
C (dq ) º , (18) Second, to estimate the systematic effect from different
S20 subgrid prescriptions, we have repeated the DL procedure by
using the dark matter halo samples from the dark-matter-only
where Σ0 = ρ0Δr is the mean dark matter column density, simulation TNG300-1-Dark by matching the galaxy/halo
á¼ñN, q, q¢ is the average over N = 1000 realizations and sky number density (DMhalo). The right panels of Figure 9 show
coordinates θ and q¢ that satisfy ∣q - q ¢∣ = dq , and δΣ(θ) ≡ the difference between the two dark matter maps in units of
Σ(θ) − 〈Σ(θ〉N. We found that the angular covariance function standard deviation at each pixel. Even with this extreme
follows an exponential decay over δθ, comparison between full hydrodynamic simulation and pure N-
body simulation, we find that systematic effects lead to
dq ⎞ Δsys = 1.7, 1.4, 1.2, 1.1, and 1.0 on average from the top
C (dq ) » C0 exp ⎛ -⎜ , ⎟ (19)
⎝ dq 0⎠ (nearest) to the bottom (farthest) maps.
We further test the systematic effect due to different Hubble
and the values of the angular scale that shows a strong pixel-to- parameters (H0 = 75 km s−1 Mpc−1; diffH0) and find only
pixel correlation are δθ0 = 20°. 7, 9°. 71, 6°. 53, 5°. 04, and 4°. 24, Δsys ; 0.15. Different B-band magnitude cuts (MB < − 16
respectively, from the nearest (r < 4 Mpc h−1) to the farthest and −17; 16mag and 17mag, respectively) and using total
(16 Mpc h−1 < r < 20 Mpc h−1) radial bins (see Figure 8). δθ0 stellar mass instead of galaxy number (stellarMass) lead to
at different radial bins correspond to the linear scales δℓ = 0.26, Δsys ; 1. Most importantly, none of the systematic maps show
0.68, 0.92, 1.06, and 1.18 Mpc h−1, respectively. δℓ at the a significant correlation with the derived cosmic web structure,
nearest radial bin well represent the spatial resolution of the ensuring the robustness of the derived dark matter distribution,
three-dimensional grid (0.3215 Mpc h−1). On the other hand, δℓ or the cosmic web (see the right panel of Figure 9).
The most striking feature that we have recovered in this
at the farthest radial bin may mean a typical scale of the
study is the filamentary cosmic web that is apparent in
filamentary structure at a given radial bin width and galaxy Figures 7 and 9. First of all, we find that the radial peculiar
number density. For the statistical analysis, we degrade the velocity information is vital to reconstructing the cosmic web,
angular resolution of each map to δθ0–Nside = 4, 8, 8, 16, and without which the same DL algorithm cannot reproduce the
16 from the nearest to the farthest radial bins—and assume that cosmic web structure at all. For example, the right panels of
each pixel in the degraded map is statistically independent. Figure 7, indicated by noVpec, show the DL result only using
Figure 9 shows the mean (álog10 Sñ; left panel) and the galaxy distributions. Note the absence of the filamentary
standard deviation (D log10 S; middle panel) of the logarithm of structure in those maps. We note that the noVpec maps
the local dark matter column density over 1000 realizations resemble the smoothed version of the galaxy distribution. The
incorporating the uncertainties of the distance measurement. DL algorithm with stellar-mass-weighted galaxy distribution,
We find that the standard deviation per pixel stays in the range without peculiar velocity information, leads to the similarly
of D log10 S S0  0.1 - 0.4, with only a mild dependence on poor quality map.
11
Figure 9. Same as Figure 7, but showing statistical maps. Left panels: mean of the logarithm of dark matter column density estimated from 1000 random realizations
incorporating the uncertainties in distance estimate to the local galaxies. Middle panels: standard deviation from 1000 random realizations (Nside = 4, 8, 8, 16, 16
from top to bottom). Right panels: systematic bias from different simulation input for the DL (TNG300 vs. DMhalo).
Another interesting feature in the map is the dark matter to the farthest radial bin. However, we anticipate that the
distribution at lower Galactic latitudes (|b| < 10°), where we do theoretical uncertainties for the DL mapping would be most
not have any input galaxy data. To our surprise, we find that the substantial for this region. For example, from the aforemen-
averaged signal-to-noise ratios per pixel for this region are tioned studies on systematic uncertainties, we find that, on
4.18, 4.73, 5.31, 5.80, and 6.21, respectively, from the nearest average, a lower Galactic latitude (|b| < 10°) map suffers about
12
Table 3
On-sky Average (Median and 1σ Certainty Level in Parentheses) of the Systematics Dsys º ∣log10 S - log10 STNG300,face ∣ D log10 STNG300 over High Galactic Latitude
|b| > 10° with Different Radial Bins
Comparison Model 0.7–4 Mpc h−1 4–8 Mpc h−1 8–12 Mpc h−1 12–16 Mpc h−1 16–20 Mpc h−1
+1.993 +1.414
TNG100 2.281 (1.837- 1.104 ) 1.474 (1.196-0.842 ) L L L
+0.223 +0.148 +0.161 +0.153 +0.151
diffH0 0.212 (0.171- 0.115 ) 0.162 (0.133- 0.092 ) 0.154 (0.116- 0.083) 0.152 (0.117- 0.082 ) 0.160 (0.128- 0.092 )
+0.748 +1.089 +0.729 +0.751 +0.790
16mag 1.032 (0.949- 0.647 ) 1.093 (0.868- 0.611 ) 0.862 (0.716- 0.508 ) 0.785 (0.641-0.455 ) 0.804 (0.631-0.443 )
+1.081 +1.026 +0.947 +0.862 +0.833
17mag 1.178 (0.901- 0.572 ) 1.105 (0.889- 0.621 ) 1.001 (0.815- 0.575 ) 0.887 (0.726- 0.502 ) 0.898 (0.734- 0.506 )
+1.919 +1.120 +0.890 +0.751 +0.742
noVpec 1.935 (1.715-1.359 ) 1.105 (0.834- 0.631 ) 0.943 (0.701-0.524 ) 0.828 (0.672- 0.470 ) 0.750 (0.626-0.440 )
+1.435 +1.156 +0.909 +0.837 +0.899
stellarMass 1.544 (1.256- 0.843 ) 1.175 (0.946- 0.684 ) 0.925 (0.734- 0.521 ) 0.877 (0.692- 0.485 ) 0.907 (0.713- 0.490 )
+2.253 +1.414 +1.097 +1.029 +0.889
DMhalo 1.737 (1.154- 0.863 ) 1.445 (1.127-0.816 ) 1.176 (0.913-0.610 ) 1.057 (0.846- 0.595 ) 0.957 (0.796- 0.574 )
Note. See Table 1 for the definition of each comparison model except TNG100.
δΔsys ; 0.5 more systematical shifts than a higher Galactic the in-depth study of the nature of dark matter by cross-correlating
latitude (|b| > 10°) map. This is indicated in the top two panels the reconstructed dark matter map with the full-sky diffuse
of Figure 7 and the systematic shifts shown in the right panels emission maps constructed from the radio-to-gamma-ray electro-
of Figure 9. magnetic spectra, as well as the full-sky map of gravitational wave
binaries. The latter can test the models where black holes in
binaries have formed out of dark matter (Shandera et al. 2018).
5. Discussion Finally, as we have introduced a novel CNN-based DL
In this paper, we present a novel CNN-based DL method of method to reconstruct the local cosmic web, the quantitative
reconstructing the local dark matter distribution map and study comparing the prediction power of the DL method
discover the local cosmic web structure traced by the positions presented here with preexisting methods such as BORG may be
and radial peculiar velocities of Cosmicflow-3 galaxies. We in order. Note that, however, many previous studies reconstruct
find that including the radial peculiar velocity field is the key to the dark matter distribution on sales much larger than the size
recovering the dark matter distribution in the cosmic web. of our local cosmic web (e.g.,  3 − 5 Mpc h−1 in Jasche &
Incorporating the observational uncertainties in the galaxy Wandelt 2013; Jasche et al. 2015), which complicates the direct
distance measurements, the average detection significance of comparison between the two methods. Nevertheless, an
the dark matter map exceeds 4.1σ for each HEALPix pixel at additional study that applies the existing methods to similar
higher Galactic latitudes (|b| > 10°). The quoted statistical observational and simulation data to ours and compares them to
significance, however, does not include the uncertainties in the our DL method would be beneficial, and we leave it for the
galaxy-to-dark-matter mapping itself. We have tested that the future.
DL results stay robust for three different simulations,
TNG100-1 and TNG300-1 from the Illustris-TNG simulation The authors acknowledge Christophe Pichon, Changbom
and RefL0100N1504 from the EAGLE simulation, but future Park, Sungryong Hong, Inkyu Park, Dongsu Bak, Graziano
studies must quantify the theoretical uncertainties by applying Rossi, and Yung-Kyun Noh for discussion. The authors also
the same method to the large-scale structure simulations with acknowledge an anonymous referee for suggestions to improve
different baryonic prescriptions. The comparison of the DL this article. The list of nearby galaxy groups and clusters is
results between TNG300-1 and N-body simulations, however, derived from www.atlasoftheuniverse.com. The authors
indicates that the filamentary cosmic web structure may not acknowledge the Korea Institute for Advanced Study for
suffer from the systematic effects. providing computing resources (KIAS Center for Advanced
The main statistical uncertainty in the galaxy data comes Computation Linux Cluster System). Computational data were
from the uncertainty in the distance measurement. As the transferred through a high-speed network provided by the
observed shift in the galaxy spectra constrains the sum of the Korea Research Environment Open NETwork (KREONET).
distance (Hubble flow) and the radial peculiar velocity, the S.E.H. was partly supported by Basic Science Research
uncertainty affects both the galaxy distribution and the radial Program through the National Research Foundation of Korea
peculiar velocity field. Therefore, to obtain a dark matter map funded by the Ministry of Education (2018R1A6A1A06024977).
with higher significance, it is necessary to explore the ways to S.E.H. was also partly supported by the project 우주거대구조를
reduce the uncertainties of the current distance estimators such 이용한암흑우주연구 (“Understanding Dark Universe Using
as the tip of the red giant branch, Type Ia supernovae, and the Large Scale Structure of the Universe”), funded by the Ministry
fundamental plane through continuous cross-calibration (Tully of Science. D.J. was supported at Pennsylvania State University
et al. 2016) and to increase the number of galaxies with by NSF grant (AST-1517363) and NASA ATP program
measured distances through systematic surveys (e.g., 6dFGS, (80NSSC18K1103). J.K. was supported by a KIAS Individual
Springob et al. 2014; James Webb Space Telescope, Gardner Grant (KG039603) via the Center for Advanced Computation at
et al. 2006). Korea Institute for Advanced Study.
We anticipate that the reconstructed three-dimensional dark Software: HEALPix (Górski et al. 2005), Healpy (Zonca et al.
matter map and peculiar velocity field will open an entirely new 2019), astropy (Astropy Collaboration et al. 2013, 2018), NumPy
chapter of cosmological study. For example, the dark matter map (van der Walt et al. 2011; Harris et al. 2020), Scipy (Jones et al.
can make it possible to run the cosmological galaxy simulations 2001; Virtanen et al. 2020), matplotlib (Hunter 2007), pandas
with the precise initial condition of the Local Group for studying (Wes McKinney 2010), Keras (Chollet et al. 2015), Tensorflow
the past and future of our cosmic neighborhood. It will also allow back end (Abadi et al. 2015).
13
ORCID iDs Hahnloser, R. H. R., Sarpeshkar, R., Mahowald, M. A., Douglas, R. J., &
Seung, H. S. 2000, Natur, 405, 947
Sungwook E. Hong (홍성욱) https://orcid.org/0000-0003- Harris, C. R., Millman, K. J., van der Walt, S. J., et al. 2020, Natur, 585, 357
4923-8485 Huchra, J. P., Macri, L. M., Masters, K. L., et al. 2012, ApJS, 199, 26
Donghui Jeong https://orcid.org/0000-0002-8434-979X Hunter, J. D. 2007, CSE, 9, 90
Ioffe, S., & Szegedy, C. 2015, arXiv:1502.03167
Ho Seong Hwang https://orcid.org/0000-0003-3428-7612 Jasche, J., Leclercq, F., & Wandelt, B. D. 2015, JCAP, 2015, 036
Juhan Kim https://orcid.org/0000-0002-4391-2275 Jasche, J., & Wandelt, B. D. 2013, MNRAS, 432, 894
Jeffrey, N., Lanusse, F., Lahav, O., & Starck, J.-L. 2020, MNRAS, 492, 5023
Jones, E., Oliphant, T., Peterson, P., et al. 2001, SciPy: Open Source Scientific
References Tools for Python, http://www.scipy.org
Kingma, D. P., & Ba, J. 2014, arXiv:1412.6980
Aaronson, M. 1983, ApJL, 266, L11 Kourkchi, E., Courtois, H. M., Graziani, R., et al. 2020, AJ, 159, 67
Aartsen, M. G., Ackermann, M., Adams, J., et al. 2018, EPJC, 78, 831 Larson, D., Dunkley, J., Hinshaw, G., et al. 2011, ApJS, 192, 16
Abadi, M., Agarwal, A., Barham, P., et al. 2015, TensorFlow: Large-Scale Lavaux, G., & Hudson, M. J. 2011, MNRAS, 416, 2840
Machine Learning on Heterogeneous Systems, https://www.tensorflow. Lavaux, G., & Jasche, J. 2016, MNRAS, 455, 3169
org/ Libeskind, N. I., Yepes, G., Knebe, A., et al. 2010, MNRAS, 401, 1889
Abbott, T. M. C., Abdalla, F. B., Alarcon, A., et al. 2018, PhRvD, 98, 043526 Licquia, T. C., & Newman, J. A. 2015, ApJ, 806, 96
Ackermann, M., Albert, A., Anderson, B., et al. 2015, PhRvL, 115, 231301 Marinacci, F., Vogelsberger, M., Pakmor, R., et al. 2018, MNRAS, 480, 5113
Akerib, D. S., Alsum, S., Araújo, H. M., et al. 2017, PhRvL, 118, 021303 Milletari, F., Navab, N., & Ahmadi, S.-A. 2016, arXiv:1606.04797
Ammazzalorso, S., Gruen, D., Regis, M., et al. 2020, PhRvL, 124, 101102 McKinney, W. 2010, in Proc. 9th Python in Science Conf., ed.
Anderson, L., Aubourg, É., Bailey, S., et al. 2014, MNRAS, 441, 24 S. van der Walt & J. Millman (Austin, TX: SciPy), 56
Arcadi, G., Dutra, M., Ghosh, P., et al. 2018, EPJC, 78, 203 Modi, C., Feng, Y., & Seljak, U. 2018, JCAP, 2018, 028
Astropy Collaboration, Robitaille, T. P., Tollerud, E. J., et al. 2013, A&A, Naiman, J. P., Pillepich, A., Springel, V., et al. 2018, MNRAS, 477, 1206
558, A33 Nelson, D., Pillepich, A., Springel, V., et al. 2018, MNRAS, 475, 624
Astropy Collaboration, Price-Whelan, A. M., Sipőcz, B. M., et al. 2018, AJ, Nelson, D., Springel, V., Pillepich, A., et al. 2019, ComAC, 6, 2
156, 123 Paturel, G., Petit, C., Prugniel, P., et al. 2003, A&A, 412, 45
ATLAS Collaboration, Aaboud, M., Aad, G., et al. 2019, JHEP, 2019, 142 Pawlowski, M. S., Pflamm-Altenburg, J., & Kroupa, P. 2012, MNRAS,
Bell, E. F., McIntosh, D. H., Katz, N., & Weinberg, M. D. 2003, ApJS, 423, 1109
149, 289 Pillepich, A., Nelson, D., Hernquist, L., et al. 2018, MNRAS, 475, 648
Camps, P., Trčka, A., Trayford, J., et al. 2018, ApJS, 234, 20 Planck Collaboration, Aghanim, N., Akrami, Y., et al. 2020, A&A, 641, A6
Carlesi, E., Sorce, J. G., Hoffman, Y., et al. 2016, MNRAS, 458, 900 Ronneberger, O., Fischer, P., & Brox, T. 2015, arXiv:1505.04597
Carrick, J., Turnbull, S. J., Lavaux, G., & Hudson, M. J. 2015, MNRAS, Rubin, V. C., Ford, W., & Kent, J. 1970, ApJ, 159, 379
450, 317 Schaye, J., Crain, R. A., Bower, R. G., et al. 2015, MNRAS, 446, 521
Chollet, F., et al. 2015, Keras, https://keras.io Shandera, S., Jeong, D., & Grasshorn Gebhardt, H. S. 2018, PhRvL, 120,
Clowe, D., Bradač, M., Gonzalez, A. H., et al. 2006, ApJL, 648, L109 241102
Cooke, R. J., Pettini, M., Jorgenson, R. A., Murphy, M. T., & Steidel, C. C. Shirasaki, M., Yoshida, N., & Ikeda, S. 2019, PhRvD, 100, 043527
2014, ApJ, 781, 31 Smith, L. N. 2015, arXiv:1506.01186
Courtois, H. M., Pomarède, D., Tully, R. B., Hoffman, Y., & Courtois, D. Springel, V., Pakmor, R., Pillepich, A., et al. 2018, MNRAS, 475, 676
2013, AJ, 146, 69 Springob, C. M., Magoulas, C., Colless, M., et al. 2014, MNRAS, 445, 2677
Crain, R. A., Schaye, J., Bower, R. G., et al. 2015, MNRAS, 450, 1937 Tröster, T., Camera, S., Fornasa, M., et al. 2017, MNRAS, 467, 2706
Davis, M., Efstathiou, G., Frenk, C. S., & White, S. D. M. 1985, ApJ, 292, 371 Tully, R. B., & Courtois, H. M. 2012, ApJ, 749, 78
Desjacques, V., Jeong, D., & Schmidt, F. 2018, PhR, 733, 1 Tully, R. B., Courtois, H. M., Dolphin, A. E., et al. 2013, AJ, 146, 86
Fairall, A. P., Paverd, W. R., & Ashley, R. P. 1994, in ASP Conf. Ser. 67, Tully, R. B., Courtois, H. M., & Sorce, J. G. 2016, AJ, 152, 50
Unveiling Large-Scale Structures Behind the Milky Way, ed. Tully, R. B., & Fisher, J. R. 1987, Atlas of Nearby Galaxies (Cambridge:
C. Balkowski & R. C. Kraan-Korteweg (San Francisco, CA: ASP), 21 Cambridge Univ. Press)
Fang, K., Banerjee, A., Charles, E., & Omori, Y. 2020, ApJ, 894, 112 Tully, R. B., Shaya, E. J., Karachentsev, I. D., et al. 2008, ApJ, 676, 184
Fornasa, M., Cuoco, A., Zavala, J., et al. 2016, PhRvD, 94, 123005 van der Walt, S., Colbert, S. C., & Varoquaux, G. 2011, CSE, 13, 22
Gardner, J. P., Mather, J. C., Clampin, M., et al. 2006, SSRv, 123, 485 Vannerom, D. 2019, in Proc. of Science 352, XXVII Int. Workshop on Deep-
Giesen, G., Boudaud, M., Génolini, Y., et al. 2015, JCAP, 2015, 023 Inelastic Scattering and Related Subjects (DIS2019) (Trieste: Sissa
Glorot, X., Bordes, A., & Bengio, Y. 2011, in Proc. Machine Learning Medialab), 111
Research 15, Fourteenth Int. Conf. on Artificial Intelligence and Statistics , Virtanen, P., Gommers, R., Oliphant, T. E., et al. 2020, NatMe, 17, 261
ed. G. Gordon et al. (Fort Lauderdale, FL: JMLR), 315 Wilman, D. J., & Erwin, P. 2012, ApJ, 746, 160
Górski, K. M., Hivon, E., Banday, A. J., et al. 2005, ApJ, 622, 759 Zonca, A., Singer, L., Lenz, D., et al. 2019, JOSS, 4, 1298
Gottloeber, S., Hoffman, Y., & Yepes, G. 2010, arXiv:1005.2687 Zwicky, F. 1933, AcHPh, 6, 110
14

Revealing The Local Cosmic Web From Galaxies by Deep Learning

Uploaded by

Copyright:

Available Formats

Revealing The Local Cosmic Web From Galaxies by Deep Learning

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Revealing The Local Cosmic Web From Galaxies by Deep Learning

Uploaded by

Copyright:

Available Formats

The Astrophysical Journal, 913:76 (14pp), 2021 May 20 https://doi.org/10.

Revealing the Local Cosmic Web from Galaxies by Deep Learning

1. Introduction signals from the extragalactic sources by cross-correlating

Here t is a mini-batch step number starting from zero, mt and vt

Table 1 statistics of the 2pCFs between truth and prediction at a given

Model log10 (rpred rtruth ) KS(ξpred, ξtruth)

TNG300 −0.020 ± 0.451 0.153 ± 0.035 0.134 ± 0.040 0.163 ± 0.017

the density contrast. As a result, the signal-to-noise ratio

∣log10 S - log10 STNG300 ∣

You might also like