Abstract
Stellar streams form through the tidal disruption of satellite galaxies or globular clusters orbiting a host galaxy. Globular cluster streams are exciting since they are thin (dynamically cold) and therefore sensitive to perturbations from low-mass subhalos. Since the subhalo mass function differs depending on the dark matter composition, these gaps can provide unique constraints on dark matter models. However, current samples are limited to the Milky Way. With its large field of view, deep imaging sensitivity, and high angular resolution, the upcoming Nancy Grace Roman Space Telescope (Roman) presents a unique opportunity to increase the number of observed streams and gaps significantly. This paper presents a first exploration of the prospects for detecting gaps in streams in M31 and other nearby galaxies with resolved stars. We simulate the formation of gaps in a Palomar 5–like stream and generate mock observations of these gaps with background stars in M31 and foreground Milky Way stellar fields. We assess Roman's ability to detect gaps out to 10 Mpc through visual inspection and with the gap-finding tool FindTheGap. We conclude that gaps of ≈1.5 kpc in streams that are created from subhalos of masses ≥5 × 106M⊙ are detectable within a 2–3 Mpc volume in exposure times of 1000 s to 1 hr. This volume contains ≈150 galaxies, including ≈eight galaxies with luminosities >109L⊙. Large samples of stream gaps in external galaxies will open up a new era of statistical analyses of gap characteristics in stellar streams and help constrain dark matter models.
Export citation and abstract BibTeX RIS
Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.
1. Introduction
Large-scale cosmological simulations with cold dark matter (i.e., the Lambda cold dark matter (ΛCDM) model) predict hierarchical formation of dark matter halos and the existence of substructure at all scales (White & Rees 1978; Blumenthal et al. 1984; Bullock et al. 2001; Springel et al. 2008; Fiacconi et al. 2016). To test ΛCDM predictions at small scales, previous studies have uncovered satellite galaxies around the Milky Way and dwarf galaxies in the Local Group with stellar masses down to 103 M⊙ (Willman et al. 2005; Simon & Geha 2007; Martin et al. 2008; Koposov et al. 2009; Willman et al. 2011; McConnachie 2012; Bechtol et al. 2015; Drlica-Wagner et al. 2015; Geha et al. 2017; Mao et al. 2021). However, in ΛCDM models, galaxies with halos of masses ≲108 M⊙ are faint and more dominated by dark matter compared to higher-mass galaxies, which makes their detection difficult (Efstathiou 1992; Bullock et al. 2000; Okamoto et al. 2008; Sawala et al. 2016). Other dark matter models differ from ΛCDM in their predictions for the masses and number densities of dark matter subhalos (subhalo mass functions). For instance, warm dark matter (WDM) models (Bode et al. 2001) predict a similar hierarchical collapse at large scales, but this collapse is strongly suppressed at lower masses (≲109 M⊙, depending on particle mass), resulting in a smaller fraction of low-mass subhalos (Bose et al. 2017). Similarly, some fuzzy CDM models (Hu et al. 2000; Hui et al. 2017), predict a sharp cutoff at low masses (≤107 M⊙). Self-interacting dark matter (SIDM) models produce halos with pronounced cores with different tidal evolution, masses, and densities compared to CDM halos (Spergel & Steinhardt 2000; Rocha et al. 2013; Tulin & Yu 2018; Forouhar Moreno et al. 2022; Glennon et al. 2022). Even in ΛCDM simulations, the survival and the properties of low-mass subhalos within a larger halo are poorly understood. The tidal field of the central galaxy, preexisting substructure in the halo, and deviations from a smooth spherical halo density profile can all affect the tidal evolution of accreted subhalos (Garrison-Kimmel et al. 2017). On the other hand, analytical calculations, N-body simulations, and high-resolution hydrodynamical simulations show that the central cores of subhalos are likely to survive for long periods, or even indefinitely (van den Bosch & Ogiya 2018; van den Bosch et al. 2018; Errani & Peñarrubia 2020).
All of these differences between different dark matter models can, in principle, be tested by statistical surveys of nearby low-mass subhalos. The critical challenge is detecting these invisible dark subhalos. Strong gravitational lensing offers an opportunity to validate predictions of dark matter models (Dalal & Kochanek 2002; Amara et al. 2006; Nierenberg et al. 2014; Hezaveh et al. 2016; Nierenberg et al. 2017; Gilman et al. 2019). However, this technique probes all subhalos along the line of sight up to the lensed luminous source, complicating the inference of dark matter properties.
Globular cluster (GC) streams provide a complementary approach for detecting and measuring the spectrum of low-mass subhalos in the Local Volume (Johnston et al. 2002; Yoon et al. 2011; Bovy 2016; Bovy et al. 2017). As GCs orbit the host galaxy, internal evolution and tidal stripping lead to the escape of stars from the central cluster, forming thin, elongated stellar streams which persist for billions of years (Johnston 1998; Helmi & White 1999). A free-floating dark matter subhalo can subsequently perturb these streams, creating a gap-like feature inside the stream (Yoon et al. 2011). Numerical and analytical calculations predict the morphology and frequency of these features in various types of GC streams (Yoon et al. 2011; Carlberg 2012; Erkal et al. 2016; Sanderson et al. 2016; Koppelman & Helmi 2021).
Photometric and spectroscopic surveys have identified and characterized ≈100 stellar streams in the Milky Way, with the majority being GC streams (Odenkirchen et al. 2001; Newberg et al. 2002; Majewski et al. 2003; Newberg et al. 2009; Odenkirchen et al. 2009; Grillmair & Carlin 2016; Mateu et al. 2018; Shipp et al. 2018; Ibata et al. 2019; Li et al. 2022; Martin et al. 2022; Mateu 2023). A few of these GC streams show evidence of gap-like features that are predicted in numerical simulations of dark matter subhalo encounters (de Boer et al. 2018; Bonaca et al. 2020; de Boer et al. 2020; Tavangar et al. 2022). In particular, Price-Whelan & Bonaca (2018) identified a spur and a gap in GD-1, which Bonaca et al. (2019) attributed to a likely encounter with a 106–107 dark matter subhalo ∼8 Gyr ago, after ruling out other types of perturbers.
Gaps in GC streams can be created through other processes, however. Previous studies have shown that baryonic matter perturbers (galactic bars, molecular clouds, black holes, and spiral arms) can produce similar features in GC streams (Amorisco et al. 2016; Hattori et al. 2016; Price-Whelan et al. 2016; Erkal et al. 2017; Pearson et al. 2017; Banik & Bovy 2019; Bonaca et al. 2020), which makes gaps challenging to decipher, even when they are detected. Moreover, since these streams have intrinsically low surface brightnesses, the detection of gaps has been limited to the Milky Way, which has resulted in relatively small samples.
Detecting gaps in GC streams in external galaxies offers a new window into testing dark matter models by increasing the number and diversity of stream gaps. Previous studies have observed streams in external galaxies arising from tidally disrupted satellites (Martínez-Delgado et al. 2010; Martinez-Delgado et al. 2023). While several candidate GC streams have been proposed in M31 (Pearson et al. 2022), these candidate detections require deeper imaging and spectroscopy to be confirmed, and to map both the streams and gap structures eventually.
The upcoming Nancy Grace Roman Space Telescope (Roman; Spergel et al. 2015) will have a large field of view, high angular resolution, and deep-imaging sensitivity. Pearson et al. (2019, 2022) demonstrated how this combination allows for the detection of very low surface brightness GC streams out to a couple of megaparsecs. In this work, we examine the plausibility of detecting gaps in Palomar 5–like GCs formed from interactions with dark matter subhalos by extending the predictions of Pearson et al. (2019). The paper is arranged as follows: Section 2 describes our methodology for simulating isolated, evolving streams with gaps. Section 3 discusses our simulations of mock observations with Roman including foreground and background star fields, and the feasibility of visual inspection to confirm gaps in simulated star count data. Section 4 discusses the application of an automated gap-finding pipeline, FindTheGap (Contardo et al. 2022), to find gaps in simulated data. Section 5 discusses the implications and limitations of this work. We summarize our findings in Section 6.
2. Simulating GC Streams with Gaps
Our goal is to simulate observations of gaps in GC streams, starting with the Palomar 5 stream (hereafter Pal 5; Odenkirchen et al. 2001, 2003) as a test case, defined to have a present-day mass of 104 M⊙ (Ibata et al. 2017). We conducted all numerical calculations with the gala package (Price-Whelan 2017), which implements numerical integration techniques to model the orbits of stars in a prespecified static potential. We briefly describe our simulation procedure in this section. Additional details on the simulation parameters are given in Appendix A.
2.1. Stream Progenitor Coordinates
As no currently known GC streams exist in M31, we used a Pal 5–like stream as an example. In the Milky Way, the Cartesian Galactocentric coordinates of Pal 5 are (X, Y, Z) = (6.1 kpc, 0.2 kpc, 14.7 kpc) and (VX , VY , VZ ) = (−49.7 km s−1, −119.4 km s−1, −11.4 km s−1) (Price-Whelan et al. 2019; Vasiliev 2019), 10 and we used identical position and velocities to simulate a Pal 5–like stream in a M31 potential. Our goal is to test the observability of gaps in streams located at various locations in galactic halos; hence, we also simulated streams at 35 and 55 kpc as the tidal field and the orbit of the stream changes with Galactocentric radius, affecting the stream's physical width. Our choice for the initial coordinates of the stream's progenitor was designed to capture this effect. For simplicity, to simulate equivalent streams at 35 and 55 kpc, we used positions that produce streams at approximately the desired Galactocentric radii, and we assumed that the velocities of the stream at 35 and 55 kpc were the same as the velocity at 15 kpc. We used the same present-day velocities (VX , VY , VZ ) for all streams, but we note that this process results in different orbits compared to the stream at 15 kpc. The initial coordinates and additional parameters of our simulations are summarized in Tables 1 and 2 and in Appendix A.
Table 1. Summary of the Simulation Parameters for the Stream and the Subhalo
Parameter | Description | Range of Values | |
---|---|---|---|
Stream | mp | progenitor mass | 5 ×104 M⊙ |
⋯ | number of particles | ≈80,000 | |
Subhalo | Mh | mass | 2×106 M⊙–107 M⊙ |
rs | scale radius | 0.14–0.32 kpc | |
⋯ | potential | Hernquist a | |
Galaxy | ⋯ | potential | Hernquist bulge + |
Miyamoto–Nagai disk + | |||
Navarro–Frenk–White (NFW) halo b |
Notes.
a Hernquist (1990). b Profiles based on Miyamoto & Nagai (1975) and Navarro et al. (1996) with parameters based on Milky Way measurements by McMillan (2017).Download table as: ASCIITypeset image
2.2. Generating a Gap in the Stream
We generated model streams using the "particle-spray" method described by Fardal et al. (2015) and implemented in gala. We assumed a uniform mass loss history and a progenitor mass (mp ) of 5 × 104 M⊙ based on Bonaca et al. (2020). We subsequently simulated the direct impact of a dark matter subhalo on a stream. Throughout these calculations, individual stream stars were treated as noninteracting massless particles, and we did not include the stream's progenitor potential.
To ensure a direct impact between stream and subhalo, we first needed to determine the initial coordinates of both components, given their positions and velocities at the moment of collision. We backward integrated the present-day coordinates of the stream progenitor to time t1, initiated a stream at these coordinates, and then forward integrated by Δt2. At this point, the collision position was assumed to be at a position Δx away from the progenitor position. We adjusted the coordinates of the subhalo to achieve a fixed relative velocity (∣ V rel∣) between the stream stars and the subhalo perpendicular to the impact location. After determining the subhalo's position and velocity at the collision point, we backward integrated its orbit by Δt2 again to set the subhalo's initial conditions.
With the initial positions and velocities of the stream and subhalo determined, we forward integrated the system for Δt2, computing the stream particle–subhalo interactions using the gala DirectNBody routine, with an additional Δt3 time to allow the subhalo to pass completely through the stream. At this point, we removed the subhalo from the simulation to avoid potential multiple interactions, allowing for a more direct analysis of the observability of well-defined stream gaps. We then forward integrated the stream stars for the remaining t1 − (Δt2 + Δt3) to observe the gap's growth over time.
These timescales (t1, Δt2, and Δt3) were chosen to allow the stream to have similar lengths as that of Pal 5 in M31 (7–12 kpc at RGC = 15–55 kpc, based on estimates by Pearson et al. 2019). Additionally, after the subhalo encounter, we continued releasing stars into the mock stream to ensure that there was not a gap at the location of the progenitor. Lastly, we resampled the final stream to match the number of stars observed in Pal 5 in the Milky Way based on Bonaca et al. (2020). Table 2 and Appendix A summarize all parameters for streams at 15 kpc, 35 kpc, and 55 kpc.
Table 2. Summary of the Stream, Subhalo Coordinates, and Resulting Gap Sizes
Simulation Step | Parameter and Description | RGC = 15 kpc | RGC = 35 kpc | RGC = 55 kpc |
---|---|---|---|---|
initial | progenitor position at t1 ( x 1, kpc) | (6.1, 0.2, 14.7) | (6.1, 0.2, 34.7) | (6.1, 31.7, 44.7) |
progenitor velocity at t1 ( v 1, km s−1) | (−49.7, −119.4, −11.4) | (−49.7, −119.4, −11.4) | (−49.7, −119.4, −11.4) | |
total integration time (t1, Gyr) | 2 | 3 | 3 | |
collision | ∣ V rel∣, km s−1 | 50 | 70 | 50 |
time before collision (Δt2, Gyr) | 0.7 | 1.7 | 1.5 | |
time during collision (Δt3, Gyr) | 0.5 | 0.5 | 0.1 | |
distance of impact from center (Δx, kpc) | 0.5 | 0.7 | 0.8 | |
result | size of the gap (kpc) | 1.4 | 1.8 | 1.8 |
Download table as: ASCIITypeset image
2.3. Quantifying the Size of the Simulated Gap
To estimate the size of the simulated gap, we fit a Gaussian near the visually identifiable gap. To account for density variations along the stream and the decrease in density near the wings of the stream, we measured both the density ratio and the density difference between the perturbed stream (with a gap) and an equivalent unperturbed stream. By averaging the FWHMs of the Gaussian fits to both the density ratios and density difference, we obtained gap sizes of 1.4 kpc, 1.8 kpc, and 1.8 kpc at RGC = 15 kpc, 35 kpc, and 55 kpc, respectively for subhalo masses of 5 × 106 M⊙. We note that in our stream integration procedure, the orbits of the stream and the subhalo, the total integration times, and the impact velocities were chosen to achieve the desired lengths (7–12 kpc) of the stream and to obtain approximately the same gap sizes at all Galactocentric values. Obtaining gaps of comparable sizes at each Galactocentric radius allows us to compare their detection limits systematically. Generally, it is possible to create or smaller larger gaps at these Galactocentric radii by changing the impact parameters of the stream–subhalo encounter (Sanders et al. 2016). We report our parameters in Appendix A.
Figure 1 depicts three simulated Pal 5–like streams at RGC = 15 kpc with gaps induced by dark matter subhalos with masses of 2 × 106 M⊙, 5 × 106 M⊙, and 107 M⊙. The gap's size increases with the subhalo's mass, as previously shown by analytical and numerical simulations (Yoon et al. 2011; Erkal & Belokurov 2015). Our results are consistent with the numerical simulations of Yoon et al. (2011), who found that gaps induced by 105–107.5 M⊙ subhalos can be visually identified in Pal 5–like streams, although they used a higher relative impact velocity (>100 km s−1), a single Galactocentric radius (RGC ≈ 25 kpc), and a longer integration times after the impact (≈4.34 Gyr). Their simulations found that subhalo masses ≥ 106 M⊙ induce gaps with physical sizes of ≈1 kpc (visually), comparable to the observed gaps in our simulations. However, we note that even when using similar impact parameters and integration times, centrally concentrated halo profiles (e.g., NFW profiles) will result in larger gaps (Sanders et al. 2016).
3. Generating Mock Observations of Streams with Gaps in M31 and Other External Galaxies
To model the observability of both streams and gaps, we need to generate mock observations of our streams in external galaxies as they will appear to Roman, by taking into account sensitivity, resolution, and contamination from the Milky Way foreground and the host galaxy halo background stellar populations. To address contaminant populations, we followed a method similar to Pearson et al. (2019) as applied to observations of the halo of M31. However, our approach in simulating gaps and mock observations differs from the method by Pearson et al. (2019) in two main ways: (i) we are now modeling the stream formation process (and the interaction with the dark matter subhalo) and performing this modeling in a more realistic M31 potential; and (ii) we incorporate a model of the Galactic stellar density profile to simulate the Milky Way foreground stars and the stellar halo background stars in M31 and external galaxies. Just like in Pearson et al. (2019), we assumed all stars are resolved down to the magnitude limits.
In Sections 3.1.1 and 3.1.2, we describe our process for generating Milky Way foreground stars and M31 background stellar halo stars; in Section 3.1.3, we estimate the number of stars in a Pal 5–like stream at various galaxy distances; and in Section 3.2, we investigate whether these gaps are visible by eye. We summarize our simulation parameters for the Milky Way and M31 stellar populations in Appendix B.
3.1. Simulating Mock Observations with Roman
We obtained M31 data from the Pan-Andromeda Archaeological Survey (PAndAS; McConnachie et al. 2009; Martin et al. 2016; McConnachie et al. 2018; Ibata et al., private communication). PAndAS provides wide-field imaging data for the Milky Way, M31, and other nearby galaxies over a total area of 300 deg2, with the 3.6 m Canada–France–Hawaii Telescope (CFHT) MegaPrime/MegaCam camera in the optical and infrared u, g, r, i, and z filters. We used extinction-corrected CFHT AB magnitudes (denoted by g0 and i0) based on the corrections by Ibata et al. (2014). We selected three patches with projected areas of 10 kpc × 10 kpc at the distance of M31 at radial separations of 15, 35, and 55 kpc from its center, corresponding to regions of ≈0.5 deg2 on the sky. To generate mock Roman observations, we assumed a total field of view of 0.28 deg2 (not simulating the shape of the detector) and limiting Vega magnitudes of Z(F087) = 27.15 for 1000 s exposures and Z(F087) = 28.69 for 1 hr exposures. 11 As in Pearson et al. (2019), we limited our analysis to the R(F062) and Z(F087) bands. We did not attempt to simulate any substructures along the line of sight (satellites, streams, or other overdensities; e.g., the PAndAS-MW stream; Martin et al. 2014).
3.1.1. Simulating Milky Way Foregrounds
We simulated foregrounds along the line of sight of M31 assuming a central coordinate of R.A. = 0.57° and decl. = 431 based on the central coordinates of the M31 PAndAS field. We used a Kroupa power-law initial mass function (IMF; Kroupa 2001) for stellar masses between 0.1 M⊙ and 120 M⊙, isochrones from the PAdova and tRieste Stellar Evolution Code (PARSEC; Bressan et al. 2012) spanning ages of 4 Myr to 13 Gyr, and metallicities of −2.0 ≤ [Fe/H] ≤ 0.2 that realistically encompass the Milky Way thin and thick disks and halo populations. We sampled 106 stars with masses from the IMF, and assigned ages and metallicities based on uniform distributions. We computed the CFHT g0 and i0 and Roman R and Z absolute magnitudes by interpolating in initial mass–absolute magnitude space for every combination of metallicity and age.
We assigned distances drawn from a Galactic density model composed of a thin disk, a thick disk, and a halo based on Jurić et al. (2008; see the parameters in Appendix B). We drew distances from a probability distribution function P(d) = d2 × ρ(R, z) out to 100 kpc. After we estimated the distance distribution of Milky Way stars, we computed their observable apparent magnitudes. To model the magnitude uncertainties, we fit the magnitude dependence of the uncertainty (δmag) for the CFHT g0 and i0 filters based on the McConnachie et al. (2018) point sources. 12 We then assigned apparent g0 and i0 magnitudes for the simulated population by drawing from a normal distribution with a scatter equal to the standard deviation of the estimated dependence. For all Roman magnitudes we assumed a constant uncertainty of 0.1 mag; but the true uncertainties will likely vary with magnitude and exposure time.
Finally, we determined the total number of stars that are observable by Roman at a given magnitude limit by scaling the simulated Milky Way foreground distribution to the observed PAndAS data within the region of the color–magnitude diagram (CMD) bound by 2 < g0 − i0 < 3 and 19.5 < i0 < 21. This region in the CMD is predominantly covered by Milky Way foreground isochrones, which makes it ideal for scaling our total number of foreground stars. We then applied the magnitude limit cut corresponding to 1 hr and 1000 s exposures. While this scaling does not consider exact selection effects, it provided a first-order estimate for the number of stars that can be observed by Roman. We obtained agreement between our simulations and both the CMDs and the final g0-band luminosity function from the PAndAS data in Figure 2; we further discuss limitations in our foreground and background simulations in Section 5.2.
Download figure:
Standard image High-resolution image3.1.2. Simulating Background Stars in M31 and Other External Galaxies
We selected PARSEC isochrone tracks that span ages of 5–13 Gyr and metallicities of −2.5 ≤ [Fe/H] ≤ +0.5 to cover the approximate range of ages and metallicities of stars in the halo of M31 (Brown et al. 2003; Ibata et al. 2014). Similar to the Milky Way simulation, we assumed a Kroupa IMF for stellar masses between 0.1 M⊙ and 120 M⊙ and a uniform age distribution. We simulated three distinct patches with an area of 10 kpc × 10 kpc at galactocentric radii (RGC) of 15, 35, and 55 kpc in the halo of M31. PAndAS and other previous studies (e.g., Escala et al. 2020) have characterized the metallicity and abundance distributions of small regions of M31's stellar halo in detail. To assign metallicities to our simulated M31 stars for each galactocentric radius for a given age, we sampled metallicities from the PAndAS photometric metallicity distribution for M31 stars. We then estimated the absolute magnitude distribution in a given filter using a 2D linear interpolation in log-mass versus metallicity space. Figure 2 compares our simulated and PAndAS metallicities. While the PAndAS metallicities extend below −2.5, we were limited by the availability of stellar isochrones below this metallicity. Throughout, we assumed the distance to M31 to be 770 kpc (distance modulus of 24.4; Ibata et al. 2014). To assign distances to stars in the halo of M31, we modeled the stellar density as a flattened spheroid profile based on Ibata et al. (2014; the parameters are discussed in Appendix B). We drew distances assuming that the halo of M31 extends to ≈100 kpc (Chapman et al. 2006) and we assigned projected distances to M31 halo stars as , where is the randomly drawn cylindrical galactocentric height for simplicity. Table 3 summarizes other population simulation parameters.
Table 3. Summary of the Simulation Parameters for Resolved Stellar Populations
Population | Quantity | Distribution | Reference |
---|---|---|---|
Foregrounds & backgrounds | IMF | Kroupa a | Kroupa (2001) |
Milky Way thin disk | age | Uniform (0, 8) Gyr | Jurić et al. (2008) |
[Fe/H] | Uniform (−1, 0.5) | Mackereth et al. (2019) | |
spatial density | Exponential b (H = 350 pc, L = 2600 pc) | Jurić et al. (2008) | |
Milky Way thick disk | age | Uniform (8, 10) Gyr | Kilic et al. (2017) |
[Fe/H] | Uniform (−1, 0.5) | Hawkins et al. (2015) | |
spatial density | Exponential (H = 900 pc, L = 3600 pc) | Jurić et al. (2008) | |
Milky Way halo | age | Uniform (10, 13) Gyr | Jofré & Weiss (2011) |
[Fe/H] | Uniform (−2.5, −1) | Mackereth et al. (2019) | |
spatial density | Spheroid c (n = 0.64, q = 2.77) | Jurić et al. (2008) | |
M31 halo | age | Uniform (5, 13) Gyr | Ibata et al. (2014) |
[Fe/H] | Based on PAndAS data | Ibata et al. (2014) | |
spatial density | Spheroid d (n = 1.11, q = 3) | Ibata et al. (2014) | |
Pal 5 | age | 11.5 Gyr | Ibata et al. (2017) |
[Fe/H] | −1.3 | Ibata et al. (2017) | |
IMF | Grillmair & Smith (2001) |
Notes.
a , with α = 1.3 for masses between 0.08 M⊙ and 0.5 M⊙, α = 2.3 for masses > 0.5 M⊙. b Exponential profile defined in Equation B2. c Spheroidal profile defined in Equation B3. d Profile given by Equation B4.Download table as: ASCIITypeset image
Finally, we assigned apparent magnitudes and magnitude uncertainties similarly as for Milky Way foreground stars. To obtain the correct normalization for the number of stars, we scaled the total number of stars to the observed number between 0.5 < g0 − i0 < 2 and 21.5 < i0 < 23.5 in the PAndAS data as there is a significant drop-off in the PAndAS magnitude completeness to below ≈70% for i0, g0 > 23.5 (Martin et al. 2016). As a final check, we examined the simulated luminosity function (number of stars as a function of magnitude) in the CFHT g band based on our CMD-based scaling, luminosity function inferred from PAndAS data.
Figure 2 shows the combined CHFT i-band luminosity function of both components (M31 population and the Milky Way foregrounds) compared to the observed luminosity function from the PAndAS data, and it shows the simulated Roman CMD. This figure illustrates that sources with Z < 25 dominate Milky Way foreground stellar populations, while M31 includes stars with Z > 20. In real Roman observations, it will be possible to separate most Milky Way foreground stars from M31 stars based on their positions on the CMD (R − Z versus Z space, see Figure 2). We note that low-mass stars and brown dwarfs are lacking in our simulated foregrounds; hence they may introduce an additional source of contamination in the real data. Nevertheless, we found general agreement between the simulated and the observed luminosity functions in the CFHT bands. We further compare our simulations and the PAndAS data in Appendix D. While the region corresponding to the Milky Way disk is reasonably well matched, we could not reproduce all of the structures in the CFHT CMDs, perhaps due to an underestimation of the halo and thick disk fraction or our lack of modeling of other density substructures along this line of sight.
To validate further our methodology for simulating the stellar populations of both the Milky Way foregrounds and the M31 stellar halo fields, we compared our surface densities to the CFHT data in the 10 kpc × 10 kpc patches and to the predictions of Pearson et al. (2019). For a 1 hr exposure with Roman, we obtained stellar densities of 8.2 × 104 stars degrees–2 at RGC = 55 kpc, 8.2 × 104 stars degrees–2 at RGC = 35 kpc and 6.8 × 105 stars degrees–2 at RGC = 15 kpc for the halo of M31. These densities are ≈2–6 times higher than the densities obtained by Pearson et al. (2019) at the same radial distances and a similar Roman magnitude cut, but closer to the stellar density profiles fitted to Hubble Space Telescope (HST) results from Brown et al. (2009) to which Pearson et al. (2019) compare. We note that our methods for estimating the foregrounds deviate from the original methodology by Pearson et al. (2019), by incorporating a stellar density model and by scaling the stellar density to the brighter regions of the CMD where PAndAS is most complete. Within the magnitude limits of the PAndAS data (18 < g0 and i0 < 26), our simulated densities of 1.0 × 104 stars degrees–2, 1.0 × 104 stars degrees–2, and 5 × 104 stars degrees–2 at RGC = 55 kpc, 35 kpc, and 15 kpc, respectively, are slightly lower (within a factor of 1.4 at 15 kpc and ≈3 at larger RGC values) than the observed densities in PAndAS at the same galactocentric radii. As we scaled the number of stars to a prespecified color range, comparing the stellar density in the PAndAS fields and our simulations provide an additional validation. We further discuss the limitations of our simulations in Section 5.
3.1.3. Simulating Observed Stars in Pal 5
We simulated the stream population similarly to the backgrounds, but scaling to the observed properties of Pal 5 in this case. We generated a sample of 106 stars assuming a power-law stellar mass function (). As argued by the analysis of Ibata et al. (2017), this mass function for Pal 5 can be possibly attributed to the loss of low-mass stars in the cluster before the formation of the stream. We then assigned CFHT g and Roman R and Z absolute magnitudes by interpolating the PARSEC isochrones for an age of 11.5 Gyr and [Fe/H] = −1.3. We applied a distance modulus corresponding to Pal 5 ( = 16.85; Pearson et al. 2019), and then determined a population normalization factor by comparing the distribution of simulated CFHT g magnitudes to the 3000 stars with 20 ≤ g ≤ 23 that are known members of the Pal 5 stream (Bonaca et al. 2019). With this normalization factor, we computed the number of stream stars detectable in a given host galaxy based on the corresponding distance modulus and Roman magnitude limit. Our number count predictions for Pal 5 match the predictions of Pearson et al. (2019).
To generate a final simulated Roman image, we resampled the simulated streams described in Section 2.2, drawing only the expected number of detectable stars for our Roman Z-band limits as a function of distance. We also drew foreground and background stellar fields at the same Z-band limits, the latter sampling the three galactocentric radii from M31 PAndAS data and different host galaxy distances. For simplicity, the positions of background and foreground stars were assumed to be uniformly distributed for a given 10 kpc by 10 kpc patch. As most background halo stars are of similar stellar populations as the stellar stream stars, detecting these streams and their gaps depends mostly on density contrasts. We, therefore, focus our analysis on star count maps of these fields, rather than simulated images that incorporate brightness and instrumental point-spread function effects. Additionally, in our analysis, we assume that the stellar stream can be identified using other methodologies, and we focus on the possibility of detecting gaps in the streams. Figure 3 shows examples of our simulations of streams in M31. Henceforth, we will use "mock observations" or "density maps" to describe the results of our simulations.
Download figure:
Standard image High-resolution image3.2. Gap Identification by Visual Inspection of Density Maps
We now examine gaps and quantify their detection with distance and exposure times. Pearson et al. (2019), Pearson et al. (2022) developed methods for finding Pal 5–like streams in Roman observations. Our primary goal is to investigate the detection of gaps, assuming the stream has already been identified. We present the visual inspection of gaps from simulated streams in M31 (Figure 3) and other external galaxies (Figure 4), centering the density maps on the gap region. For simplicity, we only considered gaps from interactions with subhalos of 5 × 106 M⊙ as this mass is in the appropriate range for testing different dark matter models (Bullock & Boylan-Kolchin 2017), and gaps that resulted from these interactions are visible in mock streams (Figure 1). In all of our simulated observations, we applied a photometric metallicity constraint of [Fe/H] < −1, as GC streams are typically metal poor (e.g., Martin et al. 2022), allowing us to reduce the number of background stars. In real Roman images, selecting low-metallicity stars will require fitting the foreground populations of a given galaxy to synthetic isochrones. In Appendix E, we discuss the feasibility of detecting these gaps without requiring this metallicity cut.
Download figure:
Standard image High-resolution imageThe angular lengths of Pal 5 streams for galaxies within a ≈1.2 Mpc volume are larger than the field of view of the Roman telescope; hence, only a portion of the stream will fit inside a Roman field for these distances. For an M31 distance of 770 kpc ( = 24.4; Ibata et al. 2014), the projected angular distance is 13.4 kpc degree–1, making the angular size of Pal 5–like streams in M31 (lengths of 7.8–12 kpc; Pearson et al. 2019) equal or larger than the expected 052 × 052 Roman field of view. Visually, the density contrast between the stream and background stars increases with galactocentric radius and with exposure time. Pearson et al. (2019) estimated that the width of a Pal 5–like stream in M31 would vary between 0.053 and 0.127 kpc at a galactocentric radius (RGC) of 15–55 kpc, and the length would vary between 7.8 and 12 kpc. We could best identify the gaps in the density maps for a 1 hr exposure; otherwise, visual identification of the stream and gaps is difficult (see Figure 3).
To simulate streams in other external galaxies with distances spanning 0.5–10 Mpc (assuming a similar stellar composition and tidal field as M31), we offset the M31 background population in the Roman CMD space to the appropriate distance modulus, retaining the same Milky Way foreground populations. The number of stars in Pal 5 was resampled to match the precomputed number of stars at the new galactic distance. We applied the same magnitude cuts as our simulated foreground and background models.
Figure 4 compares the simulated streams, all placed at a fixed galactocentric distance of 15 kpc. We display a fixed area in physical units of 7 kpc × 4 kpc in the figure (7 kpc is ≈half the length of the stream), which corresponds to smaller angular scales in the total Roman field of view at more considerable distances. We show the streams at RGC = 15 kpc because the combination of the density contrast and the thickness of the stream makes it easier to detect the gap in these mock observations visually compared to images at 35 kpc and 55 kpc. As the distance to the host galaxy increases, the density of background stars and the stream decreases. We caution that in actual observations, other external galaxies will have different sizes, and their halos will have a different composition compared to M31. All these caveats are further discussed in Section 5.
Through a visual inspection of streams with gaps in host galaxies spanning a distance of 0.5–10 Mpc, we find that the gap is visible by eye in external galaxies at distances out to ≈1.5 Mpc (see Figure 4).
4. Automating the Detection of a Gap
Visual confirmation alone can result in biased assessments of stream and gap detections; hence, we now turn to quantify detection using an automated tool. In Sections 4.1 and 4.2 we lay out methods for defining the gap and the stream region, and in Sections 4.3, 4.4, and 4.5 we outline a procedure for quantifying the detection of each gap. We provide a detection limit as a function of distance using a large sample of simulated mock streams.
4.1. Density Estimation and Detecting Gaps
Previous studies have developed algorithms to find and characterize stellar streams in the Milky Way (e.g., Mateu et al. 2017; Malhan & Ibata 2018; Shih et al. 2022, 2023) and in external galaxies (e.g., Hendel et al. 2019; Pearson et al. 2022). Once such algorithms have determined a stream's presence, location, extent, and orientation, we can then search for gaps. We used the gap-finding tool of Contardo et al. (2022), FindTheGap, designed to evaluate underdensities in multidimensional data. Gaps, just like streams, can be detected by eye. This tool provides an automated approach and serves as an additional method for confirmation or rejection in conjunction with visual detection.
FindTheGap uses the projection of the second derivatives (Hessian, H ) of the density estimate onto the orthogonal subspace of the density gradient vector (g), denoted as Π H Π, where Π is a projection matrix defined as:
The maximum eigenvalue of Π H Π can then be used as a statistic to estimate if a point in the data space is "in a gap." Conversely, the minimum eigenvalue of Π H Π can be used to highlight ridges and overdensities. The density estimation depends on a free parameter, the bandwidth, which relates the estimated density to the spacing between data points. In addition to the bandwidth, the stability of the gap detection also depends on the number of data points.
To apply this tool to simulated observations, we started with simulated density maps (Figures 3 and 4), making a cutout centered on the visually identified gap. We did not use the entire Roman field of view as the angular size of the stream becomes progressively smaller at larger galaxy distances, making it more difficult to identify gaps. We then created a grid of 50 by 20 points along each cutout with uniform spacing, covering an area of 5 kpc × 2 kpc that includes the main track of the stream and surrounding foreground and background stars.
The accuracy of this method relies on the choice of bandwidth. Large bandwidths tended to smooth over structures in the data, including the gap, but small bandwidths introduced gaps and other small-scale structures that were not necessarily present in the underlying true density. Additionally, the density estimation in FindTheGap assigns lower densities to regions near the edge of the simulated mock observations. To avoid these edge effects, we first estimated the stellar density and Π H Π values on a slightly larger data set, incorporating stars beyond the specified grid. Specifically, we required the data bounds to be larger than the grid bounds by a factor of twice the bandwidth. For example, we used a 9 kpc by 6 kpc region for a bandwidth of 1 kpc, given our fixed grid size of 5 kpc by 2 kpc. After we fit the density estimator to the data, we predicted the values of density and Π H Π on the smaller 5 kpc by 2 kpc grid (see Figure 5). To ensure the fidelity of each gap detection and to remove spurious gaps, we ran this estimation five times for every simulation, choosing the same number of randomly selected stars for each estimation (bootstrap resampling). In each iteration, the density, and the minimum and maximum eigenvalues of Π H Π were scaled to span values of 0 and 1, respectively, to maintain a consistent range across bootstrap samples. We then computed the final Π H Π map by taking the median over all bootstraps. Figure 5 shows the result of applying this tool to simulated observations at a distance of 0.8 Mpc. The map of Π H Π eigenvalues reliably locates the gap and the stream. We further discuss our determination of the optimal bandwidth in Section 4.4.
Download figure:
Standard image High-resolution image4.2. Further Outlining the Stream and Gap Regions with Indicator Points
In principle, it is possible to determine the stream path from the minimum eigenvalues of the Π H Π matrix as the stream represents an overdensity. Nevertheless, our focus was solely on detecting gaps within a stream. We constrained the stream region inside the density maps by fitting a second-degree polynomial to the predetermined positions of the injected stream. To define the stream track, we fixed the size of the stream to be 0.2 kpc at RGC = 15 kpc, 0.3 kpc at RGC = 35 kpc, and 0.5 kpc at RGC = 55 kpc. These widths are chosen to encompass the majority of the stars inside the stream visually but exceed the predicted stream width based on the analytical calculations of Pearson et al. (2019) by a factor of 3–5. This process ensures that there are enough grid points to cover the full stream region. This step also allows us to measure the density of stars inside the stream, later discussed in Section 4.5. We note again that we assume that the stream has been observed, and the approximate stream region is therefore known.
To outline the gap region, we selected points on the grid falling within the top 95th percentile of the distribution of minimum eigenvalues of Π H Π. We will refer to the regions on the grid that match this criterion as "gap indicator points." As shown in Figure 5, this criterion provided a first-order estimation of the location of the gap. We then further constrained the gap region to be centered around the median position of the gap indicator points, with a width equal to the size of the stream region and a length equal to the gap size (see Section 2.3). These definitions of the gap region and indicator points were later used to develop metrics for distinguishing successful detections from noise.
Figure 5 illustrates the gap-detection procedure. As all the mock observations were converted to physical projected coordinates, this assumption yielded consistent gap identification with galaxy distance. We further discuss our quantification of the breakdown of the gap identification procedure in the upcoming sections.
4.3. Defining Metrics for Gap Detections
We now discuss applying the gap-detection method on a sample of mock streams in an automated fashion. We created an automatic pipeline using the methods described in Section 4.1 to search for gaps in simulated mock streams, and two metrics to quantify the detections. First, we computed Π H Π maps and their eigenvalues for mock observations spanning distances between 0.5 and 10 Mpc and using bandwidths between 0.1 and 2 kpc for all three RGC values. We restricted the bandwidth range to 2 kpc as the metrics discussed below did not improve beyond this range. In the end, bandwidths between 0.5 and 0.9 kpc were optimal in finding the gap region. For each step, we repeated the stream generation, the generation of background populations, and the gap-detection process to account for the scatter in the detection metrics at low stellar densities. This process resulted in 538,650 independent mock observations.
We used three metrics to quantify the significance of each detection. To quantify the uncertainty in the gap detection, we defined the spread of all the gap indicator points () as the range of their x-positions (maximum to minimum). As a reminder, "gap indicator points" were defined as points on the grid in the top 95th percentile of maximum eigenvalues of Π H Π. We anticipate that a robust gap detection has low values, as these gap points would be concentrated around one point near the stream (see the blue markers in the lower right panel of Figure 5).
We then computed the median value of the absolute difference between the x-positions of gap indicator points to the center of the density maps, which measures the precision for locating the gap in kiloparsecs, denoted by Δ, computed for each stream separately. As we designed each simulated observation to be centered around the gap, we expect an optimal detection to have a small value for Δ. Nevertheless, in Appendix F.3, we show that our pipeline can also identify gaps away from the center. After we defined and Δ per stream, we used these metrics to determine which bandwidths were optimal for detecting gaps.
4.4. Determining the Optimal Bandwidth
The effects of bandwidth choice on the gap-detection metrics as a function of distance are shown in Figure 6. We found that bandwidths between 0.5 and 1 kpc resulted in the lowest values for the spread of gap indicator points () and the deviation of the location of the gap from the center of the density maps (Δ). For large bandwidths, the estimated density on the grid is centrally concentrated and features are washed out. For small bandwidths, the stellar density was fragmented into small groups of spurious gaps, resulting in large values for and Δ. We illustrate these effects in Appendix F.2. We inferred a middle value of ≈0.8 kpc as our optimal bandwidth.
Download figure:
Standard image High-resolution imageTo evaluate the gap-detection tool's performance further and ensure that our pipeline was robust, we also applied the gap-finding tool to mock observations with an intact stream (without a gap from an interaction with a subhalo) using our described methodology. As shown in Appendix F.3, we could identify the drop-off in density toward the edge of the stream, but we could not identify any gaps inside the track of the unperturbed stream for a bandwidth range of 0.5–0.9 kpc, further validating our method. When applying this tool to real Roman images, the optimal bandwidth may depend on the scale of underdensities in the background. A positive detection would require further characterization. Our goal in this study is to provide an additional methodology for detecting gaps in conjunction with visual inspections. While our pipeline could lead to false positives, it is unlikely to miss any real gaps in the data.
4.5. Establishing Tentative Distance Limits for Gap Detections
To determine a tentative detection limit, we used both the density of stars inside the gap region and the location of the gap region as a reference. We estimated the stellar density inside the stream and the gap by counting the number of stars inside each region and dividing this number by the physical area (in units of kpc2; see Section 4.4 for the definition of the gap and stream points/regions). To determine the area of the stream region, we multiplied the total area of the grid (10 kpc2) by the fraction of grid points that fell within each respective region. Because the stream track did not follow a simple straight line, this procedure allowed us to obtain a more accurate measurement of each region's area. For the gap region and the background region, we approximated the area as a rectangle. For the gap, we used a width equal to the width of the stream, and the precomputed length; and for the background, we used a width of 0.5 kpc and a length of 5 kpc.
Figure 7 shows the surface densities (units of number kpc–2) inside the stream region, the gap, and the background for bandwidths between 0.5 and 0.9 kpc. As expected, there is a monotonic decrease in the surface density inside the stream, the gap, and the background with increasing galaxy distance. The stream density is generally higher than the gap density and the background. Additionally, streams at smaller RGC values are denser and thinner than streams at larger RGC values. However, it was difficult to determine the detection limits from these densities alone.
Download figure:
Standard image High-resolution imageTo establish a tentative detection limit for our pipeline, we examined the evolution of the gap location with distance. We plot the median value of the location of gap points (Δ) for 10 random streams for each distance step in the bottom panels in Figure 7. We expect the central gap location to remain stable across several iterations for robust detections of gaps. While there was a systematic offset between the center of the stream from the true center, we observed this "flaring" for the value of Δ at larger distances. While the stellar densities for the background stars are low at RGC = 55 kpc, the stream is wide. In contrast, at RGC = 15 kpc the stream is thin, which still makes the detection of gaps difficult in both scenarios. At RGC = 35 kpc, however, the width of the stream and the stellar background densities are optimal for detecting gaps with our pipeline. By examining the evolution of the gap-detection precision (Δ) with galaxy distance for both 1000 s and 1 hr exposure times, this translates to distance limits of 2–3 Mpc. In Appendix F.1, we discuss examples of gap detections in mock streams at RGC = 35 kpc, which show that the location of the gap inside the stream becomes progressively uncertain beyond these distance limits. Additionally, in Appendix E, we demonstrate that gaps are undetectable at RGC ≥ 55 kpc for exposure times ≤1 hr with our method if a metallicity cut is not applied to filter out background and foreground stars.
To summarize, we used the tool FindTheGap developed by Contardo et al. (2022) to evaluate the detection of gaps beyond a simple visual inspection and to quantify the distance limit with exposure time. We applied this tool to a set of >500,000 mock observations for galaxy distances between 0.5 and 10 Mpc with M31-like stellar populations as background stars. For each mock observation, we defined "gap indicator points" based on gap statistic provided by FindTheGap. The gap-detection method relies on the bandwidth as an additional parameter to compute the density of stars on a predefined grid by changing this parameter uniformly between 0.1 and 2 kpc; based on this, we determined that the optimal bandwidth for detecting gaps was between 0.5 and 0.9 kpc. We then evaluated the effectiveness of each detection by estimating the central location of gap indicator points with galaxy distance. The results from this procedure pipeline suggest that gaps from subhalos of 5 × 106 M⊙ in the halo of M31-like galaxies will be detectable to 2–3 Mpc for exposure times between 1000 s and 1 hr.
5. Discussion
In this section, we revisit assumptions in our numerical simulations (Section 5.1), the observational limitations (Section 5.2), their implications, and how they affect our results. We also discuss prospects of using extragalactic streams for dark matter science (Section 5.3).
5.1. Limitations in Our Simulation of a Gap
In our simulations, we have assumed that the galactic potential is smooth and static. However, previous studies of the Milky Way have shown that inhomogeneities in the global potential, including giant molecular clouds, the Galactic bar, streams, other GCs, and spiral arms can perturb GC streams (Amorisco et al. 2016; Hattori et al. 2016; Price-Whelan et al. 2016; Erkal et al. 2017; Pearson et al. 2017; Banik & Bovy 2019; Doke & Hattori 2022). To mitigate this effect, we can search for streams located at large galactocentric radii in external galaxies, where the contrast between the stellar stream and background is more dramatic (see Figure 3). In addition, at large galactocentric radii, bars, spirals, and molecular clouds are less likely to cause dynamical perturbations in streams. Furthermore, with larger sample sizes, we can determine the frequencies, sizes, and locations of underdensities in streams, allowing us to evaluate signatures of perturbations from dark matter subhalos statistically.
Global potentials in galaxies are also deformed by mergers and satellite interactions (Weinberg 1998; Garavito-Camargo et al. 2019), and observations suggest that M31, in particular, has been largely shaped by a possibly recent minor (or major) merger (D'Souza & Bell 2018; Escala et al. 2021; Bhattacharya et al. 2023; Dey et al. 2023). Merger events and interactions with satellites can distort present streams (Erkal et al. 2019; Shipp et al. 2021; Lilleengen et al. 2023) and contribute to the accretion of new GCs that will eventually form streams (Carlberg 2020; Kruijssen et al. 2020; Qian et al. 2022). The details of the formation and disruption of GC streams have not been extensively explored in large cosmological simulations.
Throughout this work, we have only considered one encounter between a stream and a subhalo. Old GC streams can undergo multiple collisions with subhalos, creating multiple observable density fluctuations and perturbations to the stream morphologies (Yoon et al. 2011). In fact, multiple underdensities have been observed in several Milky Way streams such as GD-1 (Bonaca & Hogg 2018) and Pal 5 (Erkal et al. 2017). We do not further explore the effects of multiple encounters here. Still, based on our analysis of the detectability of a gap from a single subhalo encounter, we expect that streams, which have undergone multiple interactions with subhalos to develop multiple observable gaps, can be detected to similar distance limits using our methodology. Using realistic galaxy simulations that include baryonic physics, Barry et al. (2023) predict that Pal 5–like streams in the Milky Way could undergo 2–3 interactions per gigayear with subhalos of masses >106 M⊙ before dissolution.
In our analysis, we limited our investigation of the observability of gaps with Roman to subhalo encounters between GC streams and dark matter subhalos with Hernquist profiles (Hernquist 1990). Cuspier profiles for the dark matter subhalo can result in larger gaps for the same encounter properties (Sanders et al. 2016). Additionally, as gaps grow with time, the initial size and the growth of gaps will depend on the collision parameters, such as the mass and scale radius of the subhalo, the relative velocities of the subhalos to the stream, the impact parameter, the stream orbit, the time of the collision, the impact position, and others. These parameters have been extensively explored in numerical and analytical works (Yoon et al. 2011; Erkal & Belokurov 2015; Sanders et al. 2016; Koppelman & Helmi 2021). These effects will be difficult to disentangle in external galaxies given the lack of kinematic information. However, large statistical sample sizes of observed gaps will allow for rigorous comparisons to predictions of gaps in streams evolved within various dark matter frameworks (e.g., warm, fuzzy, or self-interacting).
5.2. Limitations of Our Simulations of Stellar Background and Foregrounds
In this work, we have generated mock Roman observations to mimic stellar halos of external galaxies at various distances without accounting for observational biases due to crowding, extinction, or star–galaxy separation. Pearson et al. (2019) discussed several limitations to consider for such mock observations. In particular, they concluded that crowding effects would not affect the pure detection of thin GC streams in external galaxies with Roman. We can further minimize crowding effects by observing external galaxies with sightlines pointing away from the Milky Way's Galactic plane. Our method relies on estimating the underlying density of stars, as measuring the density contrast between stream stars and background stars is the determining factor in the success of stream detections and of gap detections. Additionally, the effect of dust extinction will be minimal for the halo of M31 at infrared wavelengths (Dalcanton et al. 2015).
Pearson et al. (2019) evaluated the effect of star–galaxy separation on the detection limits of GC streams with Roman. They used the Space Telescope Image Product Simulator to inject known galaxy catalogs into simulated fields and applied quality cuts based on source shape. They concluded that including background galaxies would limit the detection of Pal 5 in M31-like galaxies to 1.1 Mpc (poor star–galaxy separation; no photometric metallicity cut) and 2.6 Mpc ("perfect" star–galaxy separation; applying a metallicity cut of [Fe/H]- < −1) for an exposure time of 1 hr. There are ≈136 galaxies in this volume, including seven galaxies with luminosities > 109 L⊙ (Karachentsev et al. 2013; Karachentsev & Kaisina 2019), and based on the results presented here gaps will be detectable to these distances. Investigating GC streams 5–10 times more massive than Pal 5, Pearson et al. (2019) estimated that thin GC streams could be detected in host galaxies out to 6.2–7.8 Mpc with a 1 hr Roman exposure for perfect star–galaxy separation and without a metallicity cut (including a metallicity cut they preject the upper limit to be 9.3 Mpc). This volume contains ≈660 galaxies where the vast majority within the 7.8 Mpc limit are dwarf galaxies, but 68 galaxies are more luminous than >109 L⊙ and 25 galaxies are more luminous than the Milky Way. While we did not include such an analysis in this paper, because background galaxies will be randomly spread out throughout the image, we expect the feasibility of star–galaxy separation to have a similar effect on the detection of gaps as in the predictions by Pearson et al. (2019) on the detection of streams. We note that the formation of GC streams in dwarf galaxies has been explored in simulations (e.g., Peñarrubia et al. 2009), but more work is needed to estimate their observability.
Our distance limit of <3 Mpc contains ≈150 galaxies, including ≈eight galaxies with luminosities > 109 L⊙ (NGC 55, M32, M31, NGC 300, NGC 404, M33, UGCA 86, and the LMC). Our estimate for the distance limits for the detection of gaps depends on our implicit assumptions that extragalactic halos have similar stellar density fields and metallicity distributions as M31. Additionally, it relies on the presence of a low-metallicity ([Fe/H] < −1) stream population. Dwarf galaxies and the stellar halos of massive galaxies in the Local Group are relatively metal poor (Kirby et al. 2013; Ho et al. 2015), hence a metallicity cut would be appropriate for reducing contamination from Milky Way disk foregrounds, enhancing the stream and gap detection. Additionally, the 3D stellar densities profiles of stellar halos (<50 kpc) for relatively massive (>109 L⊙) galaxies are consistent with our assumptions (ρ ∝ r−n , n between −2 and −3; Bullock & Johnston 2005; Johnston et al. 2008; Harmsen et al. 2017; Monachesi et al. 2019; Font et al. 2020). While the results presented in Figure 7 include a M31-like metallicity distribution for the background stellar halo, and a metallicity cut, we have also considered the case of a uniform metallicity distribution for the stellar halo, and present results without a metallicity cut in Appendix E. An M31-like metallicity distribution for stars in the background stellar halo results in a significant fraction of the halo being removed when a metallicity cut is applied. If we instead assume a uniform metallicity distribution for the stellar halo, the modeled M31 stellar halo contains a larger fraction of low-metallicity ([Fe/H] < −1) halo stars, leading to higher background densities after applying the metallicity cut. Nevertheless, using the same requirements for gap detection, we find we can recover gaps out to distances of ∼2 Mpc.
With these considerations, if a Pal 5–like GC stream (i.e., [Fe/H] < −1 with an initial cluster mass ≈ 50,000 M⊙) has formed in the halo of a nearby relatively massive galaxy, the assumptions in our simulations and our applied selection procedure are likely robust.
Finally, future large observing programs dedicated to searching for gaps in streams in external galaxies can extend to longer exposure times, allowing for larger sample sizes and the potential detection of GC streams in dwarf galaxies. Optimistically, it is likely that a full program that is dedicated to observing these gaps with Roman would extend over several hours of observing time, allowing the stacking of images from multiple visits to reach depths beyond our estimates.
5.3. Inference of Dark Matter Properties and Expected Sample Sizes
Our focus throughout this paper has been on the detectability of underdensities in extragalactic GC streams. Previous studies have explored pathways to isolate dark matter effects from baryonic effects and infer dark matter properties (e.g., particle mass) from observations of stream gaps. Using linear perturbation techniques, Bovy et al. (2017) constrained the number of dark matter subhalos of masses between 106.5 and 109 M⊙ within 20 kpc of the Milky Way's Galactic center by modeling Pal 5 data (see also Banik et al. 2021). Banik & Bovy (2019) provided a powerful method for disentangling underdensities caused by dark matter subhalos from baryonic perturbers (e.g., bars, molecular clouds, and spiral arms) in Pal 5 by computing the various perturbers' contributions to the stream's density power spectrum. They concluded that the contribution from spiral structure to Pal 5's substructure is low, but that giant molecular clouds can create small-scale underdensities comparable to those from dark matter subhalos (see also Amorisco et al. 2016). Recently, Hermans et al. (2021) showed that simulation-based inference techniques with machine learning that map observed densities in streams to simulations can help constrain dark matter structure. They found that GD-1 stream data can constrain WDM particle masses and distinguish between CDM and WDM models. These techniques will be enhanced by the development of fast and accurate stream simulation tools (e.g., Alvey et al. 2023). Lovell et al. (2021) confirmed that the structure in GD-1 and Pal 5 can place limits on the fraction of WDM versus CDM subhalos within a 40 kpc distance from the Galactic center (see also the discussion by Pearson et al. 2019, 2022). Searching for streams in the halos of external galaxies far from the bar and star-forming regions will increase the likelihood of finding gaps induced by gravitational perturbations from dark matter substructure, which can be compared to expectations from various dark matter candidates.
Even though we have shown that Roman will not be able to detect gaps in GC streams in external galaxies further than 2–3 Mpc away for 1 hr exposures, in M31 alone there are ≈450 GCs (Galleti et al. 2006, 2007; Huxor et al. 2008, 2014; Caldwell & Romanowsky 2016; Mackey et al. 2019), which is a factor of three more than the number of known GCs in the Milky Way (Harris 1996, 2010). It is not unreasonable to assume that there is also a factor of three more, yet to be detected, GC streams in M31 than the ≈100 GC streams observed in the Milky Way (Malhan et al. 2018; Martin et al. 2022; Mateu 2023). We know that GCs are also prevalent in other galaxies (Harris et al. 2013). Thus, M31 and other galaxies could provide a diverse set of GC streams with gaps that can be used to constrain substructure within various frameworks of dark matter (Bovy et al. 2017).
While the full survey parameters of Roman are yet to be determined, the proposed High Latitude Survey (HLS) is expected to image high-latitude fields (Akeson et al. 2019). The WFI instrument can reach depths of ≈28 mag (AB) in the R, Z, and Y bands for exposure times of 1 hr. Furthermore, the instrument has slitless spectroscopic capabilities that cover (0.6–1.8 μm), which will be beneficial in identifying resolved stellar populations, albeit at a much shallower 1 hr sensitivity. Compared to previous M31 surveys with HST (e.g., the Panchromatic Hubble Andromeda Treasury; Dalcanton et al. 2012), Roman will offer an opportunity to observe M31 at higher efficiency and sensitivity.
Finally, in addition to the Roman, other imaging and astrometric surveys, such as the Vera C. Rubin Observatory, will also help detect new gaps from dark matter subhalos down to ≈106 M⊙ in dozens of streams in the Milky Way (Drlica-Wagner et al. 2019). These detections will offer the possibility to constrain CDM models at a 99% confidence level, opening up an exciting era for using both Galactic and extragalactic streams to constrain dark matter models.
6. Summary
We aimed to quantify the detection prospects of gaps in GC streams in external galaxies with the Roman. To do this, we simulated mock Roman observations of gaps in extragalactic Pal 5–like streams produced by their interaction with dark matter subhalos. We generated mock streams and simulated direct encounters with dark matter subhalos with masses between 2 × 106 M⊙ and 107 M⊙. Additionally, we simulated realistic mock observations of background of stars in the halo of M31 at galactocentric radii of 15 kpc, 35 kpc, and 55 kpc, taking into account contamination from Milky Way foregrounds. To mimic observations of galaxies at distances that are further than M31, we moved the simulated M31 population to distances of 0.5–10 Mpc, retaining foreground Milky Way populations. To search for gaps in the streams, we first visually inspected mock observations, used the gap-detection tool of Contardo et al. (2022), deriving several metrics to quantify the reliability of our detections with galaxy distance.
We summarize our findings as follows:
- 1.We find that gaps formed by 5 × 106 M⊙ subhalos gaps will be visually obvious in 1000 s and 1 hr photometric exposures in the halo of M31.
- 2.Mock observations of the same stream at various distances from the Milky Way indicate that gaps can be seen at distances of ≈1.5 Mpc by visual inspection.
- 3.With the automated detection tool, we confirmed that gaps formed from 5 × 106 M⊙ subhalos can be identified to distances of 2–3 Mpc, a volume which includes ≈eight galaxies with luminosities > 109 L⊙.
While our analysis was limited to gaps from in Pal 5–like streams embedded in M31-like halos, it points to the potential of Roman to build a large and diverse set of GC stream gaps in multiple galaxies, which will contribute to constraining various dark matter models.
Acknowledgments
We thank the CCA dynamics group, Robyn Sanderson, Ivanna Escala, Nora Shipp, and the UCSD Coolstars group for insightful discussions. C.A. thanks Christopher Theissen for providing additional computational resources. This work was developed at the Big Apple Dynamics School in 2021 at the Flatiron Institute. We thank them for their generous support. The Flatiron Institute is supported by the Simons Foundation. This work was made possible through the Preparing for Astrophysics with LSST Program, supported by the Heising-Simons Foundation and managed by Las Cumbres Observatory. Support for S.P. was provided by NASA through the NASA Hubble Fellowship grant #HST-HF2-51466.001-A awarded by the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, Incorporated, under NASA contract NAS5-26555. T.S. was supported by a CIERA Postdoctoral Fellowship.
Software: Astropy (Price-Whelan et al. 2018), Scipy (Virtanen & Gommers et al. 2020), Matplotlib (Hunter 2007), Seaborn (Waskom et al. 2014), Numpy (Harris et al. 2020), Pandas (McKinney 2010), Scikit-learn (Pedregosa et al. 2011), and FindTheGap (Contardo et al. 2022).
Appendix A: Gravitational Potentials and Subhalo Collision Parameters
In our analysis, we used M31 as a model for the external host galaxy. Many groups have estimated the total mass and the potential of M31 by modeling the kinematics of satellites (Watkins et al. 2010); constraining the rotation curve (Chemin et al. 2009); modeling the velocity distributions using tracer particles such as stars, GCs, and planetary nebulae (Kafle et al. 2018); dynamical modeling of the giant southern stream in M31 (Fardal et al. 2013); and the Local Group timing argument (González et al. 2014; Chamberlain et al. 2023). Additional references and trade-offs from these techniques are summarized by Fardal et al. (2013) and Kafle et al. (2018). To set up an M31-like potential, we used the model of Kafle et al. (2018), which is composed of a central bulge, a disk, and a halo. The bulge potential follows a Hernquist profile (Hernquist 1990) given by:
with a scale length (q) of 0.7 kpc and a bulge mass(Mb ) of 3.4 × 1010 M⊙. The disk potential follows a Miyamoto–Nagai density profile (Miyamoto & Nagai 1975) given by:
with a scale length (a) of 6.5 kpc, a scale height (b) of 0.26 kpc, and a total disk mass (Md ) of 6.9 × 1010 M⊙. These parameters for the disk and the bulge were adopted from a compilation of literature values (Bekki et al. 2001; Font et al. 2006; Geehan et al. 2006; Seigar et al. 2008; Chemin et al. 2009; Corbelli et al. 2010; Tamm et al. 2012). We assumed an NFW profile (Navarro et al. 1996) for the halo given by:
where Mvir is the virial mass, rvir is the virial radius, c is the concentration parameter, Δ is the virial overdensity parameter, and ρc is the critical density of the Universe. As many of these parameters are interrelated, we used best-fit values for the halo virial mass at Δ = 200 of M200 = 0.7 × 1012 M⊙ and based on the inferred posterior distribution of Kafle et al. (2018). We note that the concentration parameter was poorly constrained in this work. Additionally, while their inferred mass is lower compared to recent literature values (1–2 × 1012 M⊙, as summarized by Bhattacharya et al. 2023), we keep this assumption for self-consistency. A recent measurement by Dey et al. (2023) reports a mass of 0.6 × 1012 M⊙, consistent with our assumption. We also assumed H0 = 67.7 km s–1 Mpc–1 based on Planck results (Planck Collaboration et al. 2020).
For the dark matter subhalo, we again assumed a Hernquist density profile with masses (Mh ) and radii (rh ) determined by the scaling relation from Erkal et al. (2016):
Tables 1 and 2 summarize our simulation parameters for the creation of a stream with a gap inside a galaxy potential.
Appendix B: Milky Way and M31 Population Simulation Parameters
We modeled the Milky Way density as a three-component model based on Jurić et al. (2008) given by:
where f0 and f1 are the relative fraction of thick disk and halo stars to the thin disk population at the position of the Sun, set to 0.12 and 0.005, respectively. Stellar densities for the disk were assumed to follow exponential profiles parameterized by a scale height (H) and scale length (L):
For the thin disk, we assumed H = 300 pc and L = 2600 pc. For the thick disk, we assumed H = 900 pc and L = 3600 pc. We also assumed R⊙ = 8.3 kpc and Z⊙ = 0.027 kpc (Jurić et al. 2008). While the scale height of a population varies with its main-sequence lifetime (Bovy 2017), and dynamical evolution leads to asymmetries in the density profile (Reylé et al. 2009; Liu et al. 2017; Nitschai et al. 2021), these simple assumptions provide a reasonable first-order estimate of the broad stellar densities of present-day Milky Way stellar populations. For the halo stellar density, we used a flattened spheroid profile:
with q = 0.64 and n = 2.77.
The 3D stellar density for the halo of M31 is given by:
(see Equation (B3)), where and are the cylindrical radius and height starting from the center of M31, in the plane and perpendicular to its disk, respectively, with q = 1.11, and n = −3.
Appendix C: Simulated Streams at 35 and 55 kpc
In Figure 8, we show the simulated streams at larger galactocentric radii, with gaps from perturbations due to subhalos of masses between 2 and 10 × 106 M⊙. The stream and gap sizes for all our simulations are provided in Table 2. Streams at larger galactocentric radii are thicker than the stream at 15 kpc, which affects the feasibility of detecting gaps (see Section 4.5).
Download figure:
Standard image High-resolution imageAppendix D: Comparing Simulated Populations to PAndAS data
In Figure 9, we show simulated CMDs in the CFHT g0 and i0 bands compared to the reddening-corrected PAndAS data, which reproduces a significant portion of the range of colors and magnitudes covered by the data. We also show the regions of the CMDs that were used to scale the simulation to the data. While our simulations are a reasonable match to the data, we did not reproduce the overdensities at g0 − i0 ≈ 1 and i0 > 22, which were labeled as Milky Way halo stars by Ibata et al. (2014), pointing to perhaps an underestimation of the fraction of Milky Way disk to halo stars in our simulations. Additionally, our simulations assume magnitude completeness down to the magnitude limits, which is not valid for real data. As reported by Martin et al. (2016), the completeness of PAndAS drops below 70% for i0 > 23. Nevertheless, as discussed in the main text, this scaling provided a robust estimation of the CFHT g-band luminosity function and the total stellar density within the PAndAS fields.
Download figure:
Standard image High-resolution imageAppendix E: Detection Limits Without a Metallicity Cut
We also report detection limits without applying an additional metallicity cut. To generate mock observations, we followed the same procedure as in our simulations with a metallicity cut, and we applied the FindTheGap tool to these mock observations with the optimal bandwidths of 0.5 to 0.9 kpc. Figure 10 shows the resulting stellar densities and our gap-detection metric. Gaps at 35 kpc show the clearest distinction between detections (i.e., where the density inside the gap is above the background and the location of the gap is stable) and nondetections. By visually inspecting the simulated stellar density maps in Figure 10, it is unlikely that the gaps at galactocentric radii of 55 kpc are detectable using our method, but gaps at smaller galactocentric radii will be detected to ≈1 Mpc. While the stellar backgrounds are less dense at larger galactocentric radii, the streams are wider, which makes it hard to detect if the background levels are high.
Download figure:
Standard image High-resolution imageAppendix F: Additional Checks of the Gap Detection Pipeline
F.1. Visual Inspections of Gap Detections with Distance
Figure 11 shows additional examples of the density of stars in mock observations as a function of distance and the identification of gaps. Here, we only show streams simulated for a 1000 s exposure time. The localizations of the gaps are shown by arrows, which are stable <2 Mpc for RGC = 35 kpc and 55 kpc and <5 Mpc for RGC = 15 kpc, but become scattered at larger distances.
Download figure:
Standard image High-resolution imageF.2. Examples of the Effect of Bandwidth on the Detections of Gaps
We demonstrate where the detection of gap breaks down as a function for mock observations RGC = 15 kpc by showing three cases in Figure 12: (a) the case where the gap was detectable by eye, but the bandwidth was much smaller than our optimal bandwidth; (b) the case where the gap was detectable by eye and the bandwidth was much larger than our optimal bandwidths; and (c) the case where the bandwidth was optimal, but the stellar density in the stream was extremely low. For the first and second cases, the spread in the location of gap points () was large. In the last case, the deviation of gap points (Δ) from the center was large.
Download figure:
Standard image High-resolution imageF.3. Comparing Streams with Gaps to Intact Streams and Backgrounds
We compare the performance of the gap-detection tool to simulations of an intact stream with no perturbation from the dark matter subhalo for bandwidths of 0.5 and 0.8 kpc, and a stream with an off-centered gap with a bandwidth of 0.8 kpc in Figure 13. We followed the methodology highlighted in the main text to generate mock observations. Spurious gaps were persistent in the backgrounds, but careful visual inspection can rule out these detections. Additionally, the tool can detect off-centered gaps.
Download figure:
Standard image High-resolution imageFootnotes
- 10
These coordinates are obtained by assuming that our GC progenitor lies at the present-day heliocentric equatorial coordinates of Pal 5, (α, δ) = (229022, −0112) at a distance of 20.6 kpc; and has a proper motion vector (, μδ ) = (−2.736 mas yr−1, −2.646 mas yr−1) and radial velocity of −58.60 km s−1. We assumed a Galactocentric coordinate system with the local standard of rest velocity V lsr=(8.4 km s−1, 251.8 km s−1, 8.4 km s−1) and the Sun radial distance of 8.275 kpc from the Galactic center based on Schönrich et al. (2010), Bovy et al. (2012), and GRAVITY Collaboration et al. (2019).
- 11
- 12
We selected point sources from the McConnachie et al. (2018) catalog by restricting the morphology flags in the g and i bands to −1. The catalog can be accessed at https://www.cadc-ccda.hia-iha.nrc-cnrc.gc.ca/en/community/pandas/query.html.