Abstract
The baryonic Tully–Fisher relation (BTFR) has applications in galaxy evolution as a test bed for the galaxy–halo connection and in observational cosmology as a redshift-independent secondary distance indicator. This analysis leverages the 31,000+ galaxy Arecibo Legacy Fast ALFA (Arecibo L-band Feed Array) Survey (ALFALFA) sample—which provides redshifts, velocity widths, and H i content for a large number of gas-bearing galaxies in the local universe—to fit and test an extensive local universe BTFR. The fiducial relation is fit using a 3000-galaxy subsample of ALFALFA, and is shown to be consistent with the full sample. This BTFR is designed to be as inclusive of ALFALFA and comparable samples as possible. Velocity widths measured via an automated method and Mb proxies extracted from survey data can be uniformly and efficiently measured for other samples, giving this analysis broad applicability. We also investigate the role of sample demographics in determining the best-fit relation. We find that the best-fit relations are changed significantly by changes to the sample mass range and to second order by changes to mass sampling, gas fraction, different stellar mass and velocity width measurements. We use a subset of ALFALFA with demographics that reflect the full sample to measure a robust BTFR slope of 3.30 ± 0.06. We apply this relation and estimate source distances, finding general agreement with flow-model distances as well as average distance uncertainties of ∼0.17 dex for the full ALFALFA sample. We demonstrate the utility of these distance estimates by applying them to a sample of sources in the Virgo vicinity, recovering signatures of infall consistent with previous work.
Export citation and abstract BibTeX RIS
1. Introduction
The baryonic Tully–Fisher relation (BTFR) is the tight empirical relation between the rotational velocity and baryonic mass of disk galaxies, extending the Tully–Fisher relation (TFR) between stellar luminosity (tracing stellar mass) and rotational velocity (Tully & Fisher 1977) to lower-mass galaxies with higher gas fractions (McGaugh et al. 2000). Fundamentally, the BTFR traces the relationship between dynamical and luminous mass in galaxies: in Lambda cold dark matter (ΛCDM), this is the relationship between dark matter halos and the baryons which occupy them.
The BTFR is thus a test bed for cosmology and galaxy evolution, as its slope, scatter, and normalization are set by this connection. Simple schematic arguments which assume that the mass of a halo is defined by a density contrast threshold and constant baryon fraction predict a power law M ∝ V3. More robust treatments in ΛCDM can accommodate larger slopes (∼3.4; Zaritsky et al. 2014), and semi-analytic models which incorporate abundance matching predict curvature in the BTFR as a consequence of the nonuniform baryon fraction (Trujillo-Gomez et al. 2011). Some studies suggest that the BTFR may pose a challenge to ΛCDM, as the highest reported slopes are more consistent with modified Newtonian dynamics' prediction of M ∝ V4 (Lelli et al. 2016), and the very small intrinsic scatter of the observed BTFR is perhaps smaller than expected given variations in halo spin parameter, concentration, and baryon mass fraction (McGaugh 2012). However, the cosmological significance of any best-fit BTFR is muddled by the relationship between observed and fundamental properties, observational uncertainties, and sample-selection effects, making it difficult to fully assess the significance of these interpretations (Desmond 2017) and whether or not the scatter is consistent with expectations from ΛCDM (Desmond 2012).
Aside from its cosmological significance, the observed low intrinsic scatter of the BTFR is valuable for estimating low-uncertainty, redshift-independent distances to galaxies, although it is reliant on reliable primary distance estimators used to calibrate the template relation. Recent works have used this property of the TFR and BTFR to make local constraints on H0 (Schombert et al. 2020; Kourkchi et al. 2022), and to measure large-scale structure via peculiar velocities (e.g., Springob et al. 2016). The TFR has been used extensively to measure peculiar velocities and large-scale structure in the local universe (e.g., COSMICFLOWS; Kourkchi et al. 2020; Springob et al. 2007). Despite this, the curvature of the TFR at low mass indicates the utility of the BTFR for determining distances to a larger fraction of the (preferentially low-mass) sources in the local universe.
One application of BTFR distance estimates and a key focus of this paper is to measure the mass density of dark matter in large-scale structure. Eisenstein et al. (1997) use the redshift dispersion of galaxies along the Pisces–Perseus supercluster (PPS), fortuitously aligned in the plane of the sky, to estimate the mass-to-light ratio of the supercluster within a factor of 2. Today, simulations suggest that infall onto filamentary structure can be detected using a sample of ∼a few thousand galaxies with typical BTFR-derived distance uncertainties of ∼0.2 dex (Odekon et al. 2022). Measuring distances to this accuracy for such a large sample requires the use of a carefully calibrated BTFR.
Aside from the cosmological considerations above, the slope, normalization, and observed scatter of the BTFR vary with the use of different velocity definitions (e.g., Verheijen 2001; Lelli et al. 2019), selected galaxy populations (Bradford et al. 2016), methods for line fitting (Weiner et al. 2006), mass-to-light ratios used (McGaugh & Schombert 2014; Ponomareva et al. 2018), the size of the galaxy sample used (Sorce & Guo 2016), and more. As with the Malmquist biases observed to increase the slope of the traditional TFR (Strauss & Willick 1995), flux-limited samples may be biased by sensitivity-based selection effects (Kourkchi et al. 2022). Though there is general agreement that the slope of the BTFR is somewhere between ∼3–4, the heterogenous samples, types, and treatments of data used to calibrate literature BTFRs are partially responsible for the plethora of observed slopes.
Given its rich, homogenous, digital data set, the Arecibo Legacy Fast ALFA (Arecibo L-band Feed Array) Survey (ALFALFA; Giovanelli et al. 2005; Haynes et al. 2018) presents fertile ground for an inclusive BTFR calibrated using uniform survey data. Here we use the ALFALFA extragalactic catalog (Haynes et al. 2018) to derive a template relation that can be used to estimate secondary distances. Over the course of this work, we compare this fiducial template relation to others calibrated using different subsamples and velocity and mass definitions to investigate the influence of these choices on the presented relation.
In Section 2, we review the sources of data and fitting methods used to find line widths and BTFRs. In Section 3, we discuss the methods used to fit our template relation, including considerations for sample selection and line-fitting methods for determining a template relation. We present a template relation using a high-quality, representative subsample of sources in Section 4, identifying common classes of fit-identified outliers, comparing distance measurements to those from flow models, and reporting typical distance uncertainties. In Section 5, we compare this relation to others in the literature, finding that these relations become more consistent when demographics, velocity width measurements, and stellar mass estimating methods are matched. Section 6 presents an initial application of our derived distances, measuring infall onto Virgo. Finally, we present our conclusions, including recommended guidelines for applications of this relation, in Section 7.
2. Data Compilation and Spectral Profile Fitting
2.1. Neutral Hydrogen (H i) Data and Gas Mass
The ALFALFA survey is a drift-scan, sensitivity-limited survey using the seven-beam Arecibo L-band Feed Array (ALFA) receiver on the Arecibo 305 m telescope. This survey detected more than 31,000 extragalactic sources (Haynes et al. 2018), with the goal of sampling the H i mass function to ≃100 Mpc. The typical sensitivity of the ALFALFA survey (5σ detection of 0.75 Jy km s−1 for W50 = 200 km s−1) allows for the detection of low-mass (log(MH I/M⊙) < 8) sources out to ∼35 Mpc and higher-mass sources to ∼260 Mpc (heliocentric velocity = 17,912 km s−1). When calibrating the template relation for this study, we use the distance estimates from the ALFALFA catalog, which are a combination of flow-model distances and distances from cluster catalogs (Haynes et al. 2018). The sample used in this analysis is drawn from the ALFALFA extragalactic catalog with the goal of being representative of the same population. The H i masses and velocity widths used in this analysis are calculated using the α.100 catalog and ALFALFA spectra, respectively. The α.100 catalog also contains measures of profile velocity widths including W50,P (defined as the width of the profile at 50% of the peak flux), which is calculated with some manual intervention. The new widths presented here are calculated fully automatically and thus can also be homogeneously calculated for samples where there is interest in applying this analysis.
Assuming optically thin emission, H i mass is computed using ALFALFA catalog flux values and the following standard formula:
where DH i is the distance to the source taken from the ALFALFA catalog (as described above) in megaparsecs, and S21 is the integrated H i line flux in Jansky kilometer per second. Previous works find that the inclination-dependent self-absorption correction is likely only ∼10% of the H i mass in most ALFALFA galaxies, and reaches ∼35% only in the most edge-on sources (Jones et al. 2018), so no correction for self-absorption is made in the fiducial relation. To account for other contributions to the gas mass, the fractions of metals and molecular gas are incorporated via the flat conversion Mgas = 1.33MH I, which is commonly used in BTFR studies.
2.2. Stellar Mass
The stellar masses used in the fiducial relation are based on Sloan Digital Sky Survey (SDSS; Blanton et al. 2017) optical photometry and are tabulated using Equation (7) of Taylor et al. (2011) and additional considerations discussed in Durbala et al. (2020). These masses are recommended among those discussed in Durbala et al. (2020) based on their agreement with rigorous spectral-energy-distribution-based mass estimates and the availability of data for a large fraction of sources in the full ALFALFA catalog. Additionally, near-infrared (NIR) masses tabulated using unWISE (Lang et al. 2016) photometry are subject to shredding effects, leading to underestimated masses in the most extended ALFALFA sources; SDSS-based stellar masses are not subject to these effects. Section 5.2.1 presents a BTFR using unWISE NIR photometry-based stellar masses, which may be used to extend this work beyond sources with available optical observations.
2.3. Velocity Definition
The Doppler shift of the 21 cm line is a useful tracer of the motion of galaxies' baryons, and thus the rotational velocity at the extreme extent of galaxies. When resolved rotation curves are unavailable, a galaxy's rotational velocity can be approximated by measuring the width of the spectral-line profile. To ensure uniformity, specific definitions of velocity width are used across the literature. Commonly used velocity width definitions measure the width across the global H i line profile at some percentage of the peak flux (e.g., W50,P ), or similarly of the mean flux (e.g., W50,M ). Some velocity definitions are, in general, larger and in better agreement with the flat velocities at the extreme edge of a galaxy's rotation curve, e.g., W20,P (Lelli et al. 2019) and W50,M (Courtois et al. 2009); others may be more robust at low signal-to-noise (S/N), e.g., W50,P (Springob et al. 2005). Additional velocity width measurements come from parametric fits to the full shape of a line profile (e.g., busy function; Westmeier et al. 2014), or fitting line widths using physical source modeling (Peng et al. 2022).
The velocity width used in this analysis is based on the curve-of-growth (COG) width-finding method proposed in Yu et al. (2020). As discussed there, this algorithm is straightforward to implement automatically. Automatically measuring profile line widths allows for efficient and uniform reduction of large data sets, which is our aim in this study. The program used here fits a piecewise function to a pair of lines; the cost surface of this problem is generally simple, and relevant parameters (the pivot channel, the slope of the line on either side of the pivot) are straightforward to initialize with rigorous guesses. 7 Because of this, this method is computationally faster than a nonlinear least-squares fitting algorithm that includes more parameters (e.g., the busy function; Westmeier et al. 2014; Stewart et al. 2014), and is currently more reliable than recent attempts using trained neural networks (e.g., Hallenbeck et al. 2019). Additionally, its dependence on the integrated flux of a line profile rather than its peaks and boundaries means its performance at low S/N is significantly more reliable than attempts at automating W50,P and W20,P .
The implementation of a COG width-finding algorithm presented here is slightly different from the original presentation in Yu et al. (2020). Three examples of the fitting procedure used in this analysis are presented graphically in Figure 1 for galaxies with different profile shapes and typical S/N values, and a general sketch of the procedure is as follows:
- 1.Find the heliocentric velocity of a line profile using the hermite polynomial matched filtering described in Saintonge (2007). This determines the velocity channel where the two black curves of growth meet and are centered in the figure.
- 2.Moving outward from the central channel, add the flux in subsequent channels to both the left and right in order to create a COG—the integrated flux of the profile as a function of channel from profile center. The result of this integration is the black curve in Figure 1. Errors for this curve are determined from the sum of the per-channel rms.
- 3.Fit a piecewise function consisting of two straight lines to the resulting COG, allowing for a coarse determination of the integrated flux of the profile; we use the pivot of the piecewise as this integrated flux. This best-fit piecewise is plotted on each profile in Figure 1 using a dashed–dotted line, and the pivot of the piecewise intersects the dotted red line, indicating this integrated flux. The red region in each figure is centered on the best-fit integrated flux value, with the region extending to ±1× the uncertainty on that integrated flux value.
- 4.To measure the COG velocity width, which is defined as the width of the profile at some percent of the integrated flux of the profile, the COG is renormalized, and the profile width is determined by the velocity width where the COG summed from the data intersects the desired percentage of the total flux. In Figure 1, this occurs when the black COG intersects the dotted horizontal black line at 0.75 of the integrated flux, with the profile width indicated using solid black vertical lines on the figure. As discussed below, this threshold of 0.75× the integrated flux is chosen to balance the increased uncertainty near the edge of the integrated curve with the desire to reach as large of a velocity width as possible, in attempt to define a velocity proxy more likely to describe the flat part of a galaxy's rotation curve. In this, V75 is designated to balance two desires: first, defining a velocity width that can be reliably measured automatically, even in low-S/N profiles; and, second, defining a velocity which is a rough proxy for the rotational velocity on the edge of a galaxy's baryons.
In theory, the summing of the COG and subsequent piecewise fitting could be done separately for each half of the profile in order to more flexibly measure the widths of very asymmetric profiles. Testing this procedure on a subsample of sources finds that the widths are more reliable (i.e., follow the same general trends between V75 versus W50,P confirmed with a high-S/N sample and shown in Figure 2) when both sides are summed simultaneously; in the typical S/N range of this sample, the increased S/N of the summed COG is more important than carefully treating the profile shape.
Download figure:
Standard image High-resolution imageOccasionally, this width-fitting program outputs widths that are clearly discrepant from ALFALFA width measures (where the fit V75 is more than twice, or less than half, the rotation velocity measured using W50,P /2). The most robust indicator that the width fitting has failed is if the output Vhelio measurement greatly differs from that of the ALFALFA catalog; we use ΔVhel = Vhelio,ALFALFA − Vhelio,WF, where Vhelio,ALFALFA is the catalog measurement and Vhelio,WF is the measurement made by the width-fitting program. When ΔVhel is large, the matched filtering step finds some other noise peak, or uses a single peak of a strongly double-horned profile as its center. We find that taking a cutoff of ∣ΔVhel∣ < 50 km s−1 removes the majority of such sources from the sample, as illustrated in the first panel of Figure 2. In this panel, the blue points are the sources selected with ∣ΔVhel∣ > 50 km s−1, superimposed on the remaining sources which have ∣ΔVhel∣ ≤ 50 km s−1 criteria (gray). As demonstrated in the following panels, two additional cuts further reduce the number of outliers created by the width-fitting program: (i) unphysical negative uncertainties, which are indicative of an algorithmic issue normalizing the COG, and (ii) a long-term slope of the COG above an empirically determined value, which indicates that the piecewise curve fit did not find the flat part of the COG successfully. These three exclusion criteria remove many of the most clearly incorrect V75 values for the α.100 sample, minimizing the number of good fits that are removed as we control for the performance of the width-fitting program. In total, these criteria flag 2539/31,502 (∼8%) ALFALFA sources, reducing the number of sources with V75/W50,P ratios vastly different from the average range of 0.5 <2V75/W50,P < 2 (bottom-right panel, Figure 2).
In order to have confidence that the reported uncertainties represent the width-finding program's success in finding the true width of a line profile, we test the program on a sample of 66 real profiles with S/N > 300 by degrading them to lower S/N values. Comparing the high-S/N reliable values of the profile widths with those measured in the noisy profiles, ≳68% of these noisy profile widths agree with the "base truth" value within 1σ for S/N > 7. The number within 2 and 3σ is nearly consistent with Gaussian distributions above this S/N threshold, as well, although for some classes of profiles there is an outlier fraction of ∼a few percent. This suggests that, although the reported uncertainties are reasonably good at describing the difference between true and noisy profile widths, ∼2% of widths for profiles with S/N > 9 may be inconsistent with expectations because of poor performance of the width-finding program. The outlier fraction increases in the lower-S/N regime of the sample, which have 4.5, 6, and 10% sources with ΔV > 3σ for S/Ns of 9, 7, and 5, respectively. Given the distribution of profile S/N values for the full ALFALFA sample and the sample used to fit the template relation (5.8, 8.4, 17, and 6.8, 10.2, 21 for the 16th, 50th, 84th percentiles, respectively), we expect ≲6.6% of sources to have widths significantly distorted by noise in an ALFALFA BTFR and ≲4% of the sample used to fit the template relation (α.R) to have distorted widths attributable to the performance of the width-finding program. We run similar tests using the approximate profile shapes used as the matched filtering template bank, finding comparable results above S/N = 5.
An additional factor that makes it difficult to reliably measure the width of low-S/N profiles is variation in the baseline of the spectrum. To determine how the width-fitting program is affected by baseline artefacts, we simulate noise and simple baseline variations over noiseless profiles simulated from input model rotation curves using the schema discussed in Obreschkow et al. (2009). Again, we find similar consistency between the true widths and degraded widths as in the ALFALFA data experiment, as the piecewise fit in the width-fitting program is appropriately insensitive to artefacts beyond the bulk of the line (as can also be seen in Figure 1). In the case of higher-order baselines (sloped, curved baselines), the method for calculating the uncertainty of the pivot flux takes these variations into account in determining appropriate uncertainties for measured widths.
Though V75 is a smaller width than W50,P , the two have a relatively constant proportionality for most double-horned profiles, with V75/W50,P increasing at low widths where profiles are more Gaussian (see Figure 2). This suggests that V75 weights the shape of the line profile differently in determining the line profile width than W50,P , as one would expect given that one looks at the integrated flux of the profile and the other looks at the extrema. Though in general smaller velocity measurements represent less extreme velocities in a galaxy's rotation curve, a straightforward flat multiplier correction (V75 = 0.41W50,p ; see Figure 2) could be applied to recover W50,P from V75 across the majority of the ALFALFA mass range (for sources with log (W50,P ) > 1.8, true for ∼26,000/31,000 ALFALFA galaxies). In fact, the increase in V75/W50,P at low mass suggests that V75 may in fact be more effective at tracing the extreme velocities at the edges of dwarf galaxies' slow-rising rotation curves (e.g., McQuinn et al. 2022). Given the difficulty tying the width of a profile to the internal dynamics of a source, the differences between this velocity width and others does not preclude the use of V75 in BTFR studies; rather, it provides a new avenue for automatically fitting widths for use in large data sets.
2.4. Inclination
The observed Doppler-broadened line width measures only the radial component of the galaxy's rotation and must be corrected for disk inclination. We estimate the inclination of these sources using their observed axial ratios from SDSS photometry and the relation
where is the observed axial ratio of the source, and q0 is the intrinsic ratio (Hubble 1926). These inclinations allow us to correct the sources' rotational velocities for projection effects:
Vrot,75 is thus a measure of source rotational velocity as it includes corrections for inclination effects and is converted to a rest-frame velocity (via multiplication by a 1/(1 + z) factor). We use q0 = 0.15; most studies use values between q0 ∼ 0.1–0.2 for blue spirals, which we expect to dominate the ALFALFA sample (e.g., Karachentsev et al. 2017, their Figure 2; Yuan & Zhu 2004). To investigate the reliability of these inclination estimates, we compare them with estimates tabulated in the HYPERLEDA catalog, 8 which derives morphology-dependent inclination values using
where r0 is a dimensionless parameter that varies with morphological Hubble type and r25 is the axial ratio of the B-band 25 mag arcsec−2 isophotes for the source (Paturel et al. 2003). We plot these inclination estimates against one another in Figure 3 and find general agreement, consistent with Bradford et al. (2016)'s finding that a morphology-dependent inclination does not appear to significantly affect the best-fit BTFR.
Download figure:
Standard image High-resolution imageAs illustrated by the dashed lines around the 1:1 ratio in Figure 3, there is an average spread of ∼10° difference in the inclinations measured via these different methods for the sample of crossmatched HYPERLEDA-ALFALFA sources. This is incorporated as a systematic term in the uncertainty of the inclination correction. The statistical term of this uncertainty comes from the propagation of the axial ratio uncertainty in the photometry of each source, and is in most cases much smaller than the systematic term.
As can be seen in Figure 3, there are a number of sources with inclinations >10° disparate between measurement methods; the axial ratios for these sources are different between these data sets and thus have different inclinations and inclination-corrected velocity measurements. We find consistent relations fit using SDSS and HYPERLEDA inclinations when using an outlier-downweighting fitting method. Sources with δ i > 10° between estimation methods are often outliers from only one of the two BTFRs, which we take as evidence that these inclinations are not reliable. Of the 31/842 sources which are only outliers in the SDSS sample, 28/31 sources have δ i > 10°. Similarly, 31 of the 35 outliers from the HYPERLEDA relation have δ i > 10°. Corrections based on an over/underestimated inclination will move points away from the central relation, making them more likely to be outliers from the BTFR. When we find outliers where the inclination is overestimated, this overestimated inclination will lead to smaller sin(i) uncertainties, causing these unreliable inclinations to have disproportionately large influence when using a simple fitting likelihood that does not account for an outlier fraction as described in Section 3.1. These sources are dealt with through the downweighting of outliers using a Gaussian mixture model and, in the sample of sources used to fit the template relation, imposing an inclination restriction designed to reduce the fraction of sources with unreliable SDSS inclinations in the HYPERLEDA-ALFALFA crossmatch sample (i > 60°, red vertical line in Figure 3). In the sample with i >60°, 5/353 sources are identified as SDSS-only outliers, and these sources all have HYPERLEDA i ≪ SDSS i. We note that sources with understimated inclinations from SDSS axial ratios have previously been noted as outliers in other studies of the BTFR (Catinella et al. 2012).
Note that the fiducial relation does not include a turbulence correction for the rotational velocities as is occasionally done in BTFR studies, although we do discuss a BTFR with such a correction applied in Section 5.2.2. As this is the first attempt to use and interpret a V75-based BTFR, applying a uniform correction to a velocity measure likely to be more sensitive to V/σ than other width measures (see, for example, the different relation between shape and width demonstrated by the V75/W50,P relation) adds unnecessary complication for our specific application of these velocities.
3. Fitting the Template Baryonic Tully–Fisher Relation
3.1. Fitting Algorithms
We fit the relations discussed here using two likelihood functions. The first is a simple linear model in log–log space, much like the ones used in Lelli et al. (2019) and Papastergis et al. (2016). As these observations have significant uncertainties in both the x and y variables, using straightforward linear regression is known to introduce a bias to the slope of a fit relation (Willick et al. 1995). Generative likelihood models rigorously handle this while maintaining model simplicity. This model is an intrinsic Gaussian distribution perpendicular to a best-fit linear relation in log–log space. This model assumes that the uncertainties, here approximated as Gaussian, describe the probability of an observation given a true underlying point within this distribution (see Appendix appendix for more details.) The other model used to produce the presented fiducial relation, which we call the mixture model, fits for a second "outlier" component parallel to this best-fit relation. As discussed in Hogg et al. (2010), this class of model can be used to reduce the influence of outlier sources on a population's characteristics. Sources have an intrinsic binary flag placing them in the "outlier" or "bulk" components of the model. The best-fit likelihood includes the probability of each source belonging to each component, meaning that sources that have a low probability of belonging to the "bulk" component have less influence over the best-fit parameters for that component. The likelihood for this model is discussed further in Appendix appendix, where there is also discussion of additional tests of more complicated outlier models.
We can identify sources as confident members of the outlier component by looking at the posterior probability for each data point. These sources are difficult to exclude from the sample using the parametric cuts described in Section 3.2. They may include sources with underestimated uncertainties, confused sources in the α.100 sample (Jones et al. 2015) or sources with unreliable or misidentified optical photometric objects (Durbala et al. 2020), or sources with intrinsic properties which make them distinctly different from the regular rotators most adherent to the BTFR (e.g., blended sources, mergers, tidally disturbed galaxies, ultra-diffuse galaxies; Mancera Piña et al. 2019). Each of these types of sources appears multiple times in the most discrepant sample of outliers, discussed in more detail in Section 4.2.
3.2. Restrictions to the ALFALFA Sample
Restricted samples of high-quality, targeted observations are important for use in the BTFR in cosmological analyses, as they reduce observational uncertainties and probe the fundamental relation while minimizing competing effects. As discussed in Bradford et al. (2016), the properties of a sample used to define a template are important for determining where that template is applicable. If restrictive cuts are not appropriately accounted for in the fitting process, they can limit the applicability of, and in some cases bias, the resulting relation (see demonstrations of this in Section 5.2).
To define the template sample here, we take a different approach: we make minimal cuts to preserve the properties of the underlying α.100 sample of sources. The sample α.R (R for "restricted"), which defines the fiducial template relation, includes some cuts which make it more selective than the full ALFALFA catalog. However, these cuts are agnostic to the demographics of the galaxy population (e.g., gas fraction, stellar mass, morphology), and are instead motivated by a desire to reduce the number of outliers caused by improper fitting, low-quality data, or sources with disturbed morphologies/that unlikely to be regularly rotating. We also run fits which relax the α.R-defining restrictions, showing that broadly the full α.100 sample is consistent with this template (see Section 5.1). The criteria used to define the α.R sample are as follows:
- 1.Sources recommended for use by the output of the width-fitting program. As described in Section 2.3, we determine a number of criteria that prune the most severe failures of the width-finding program.
- 2.Sources that have stellar masses derived from SDSS photometry in the Durbala et al. ( 2020 ) catalog.
- 3.Sources far from large overdensities. One intended application of this relation is to measure signatures of infall onto the PPS. We want to fit a template that is unlikely to contain infall signatures, necessitating the removal of certain regions of the sky from the sample. To this end, we also exclude Virgo, the other large overdensity in the ALFALFA survey range. Within the ALFALFA footprint, we exclude all sources that are at 22h < R.A. <3h to exclude PPS and all sources with R.As. 12h08m−12h56m, decls. 2°–18°, and velocities 350–3000 km s−1, to exclude Virgo from the sample.
- 4.Sources with inclinations i > 60°. As discussed in Section 2.4, we use an inclination cut of SDSS i > 60°, as shown in Figure 3, in order to reduce the number of sources with unreliable SDSS inclinations. This does not remove all of the outliers with SDSS i > HYPERLEDA i but instead minimizes the fraction of sources with misidentified inclinations while maximizing the size of the remaining, nonrestricted sample.
- 5.Sources with uncertainties <20% profile widths. This restriction is essentially a width measurement S/N threshold. We test a number of uncertainty thresholds and find, as expected, that the slope of the relation increases as we impose a stricter uncertainty threshold and the noisy scatter of the distribution of widths is reduced. Above this imposed 20% uncertainty threshold the slope of the fit BTFR stays relatively constant. This threshold therefore appropriately limits the additional scatter and uncertainty caused by low-S/N widths and is the least restrictive cut we can make to this effect.
- 6.Sources which do not appear highly asymmetric to the width-fitting program. Sources may appear highly asymmetric for a number of reasons. First, this selects for many of the remaining failures of the width-fitting program; if the program finds an inaccurate Vhelio, it will center the curve on a noise spike or on one horn of a double-horned profile, causing different "width" measurements for the left and right sides of the profiles when measured separately. Second, interacting sources may also have highly asymmetric line profiles (Espada et al. 2011; Bok et al. 2019). As these sources are disturbed, they are more likely to be offset from the BTFR defined by regularly rotating sources. Confusion (e.g., Jones et al. 2015) may also contaminate the sample; this emission may appear lopsided if the two sources in the beam are at different distances from the H i centroid.
The number of sources in the α.100 sample which satisfy each selection criteria are listed in Table 1. The imposition of all six of these criteria produce a reliable sample of ∼4500 sources broadly representative of the α.100 population, which we use to define a template relation that can be used as a distance indicator with minimal uncertainty and contamination.
4. Results
4.1. Presentation of Relation
The fiducial "inclusive ALFALFA relation" presented here is based on the α.R sample discussed in Section 3.2. The fiducial relation, as well as a best-fit relation fit without an outlier component, is plotted over both the α.R and α.100 samples in Figure 4.
Download figure:
Standard image High-resolution imageTable 1. Table of the Number of Sources which Fit Each of the Exclusion Criteria Described in Section 3.2
Sample | Number of Sources |
---|---|
α.100 | 31,502 |
(1) Recommended by width fitter | 28,963 |
(2) SDSS stellar masses available | 28,267 |
(3) Far from overdensities | 17,772 |
(4) i > 60° | 7455 |
(5) Width uncertainties <20% | 8821 |
(6) Not asymmetric profiles | 11,630 |
(1–6) α.R | 4525 |
Download table as: ASCIITypeset image
The template α.R relation,
(where Vrot,75 is V75 corrected for inclination and redshift broadening) has best-fit slope (hereafter m), intercept (b), and intrinsic perpendicular scatter (σ⊥) values of m = 3.30 ± 0.06, b = 9.9 ± 0.014, and σ⊥ = 0.025 ± 0.004, respectively. This model includes an additional component that describes the distribution of outliers—a Gaussian distribution which runs parallel to the best-fit relation (Equation (5)) with a perpendicular offset (−0.07 ± 0.01, σ = 0.13 ± 0.01), and a weighting of (0.2 ± 0.03) relative to the main component. This relation is fit using 3000 galaxies randomly drawn from the 4525 galaxies in α.R to leave a significant sample of α.R sources available for validation. We note that this relation is stable within the uncertainties regardless of which random sample of 3000 sources is drawn. This template relation, along with a relation fit without a Gaussian outlier component, is presented in Figure 4 and the first row (Ia) of Table 2, which summarizes all of the fits run in this investigation. To the eye, the fiducial relation may appear steeper than the distribution of data; this is an expected result of rigorously fitting a relation to data with significant uncertainties in the independent variable: fits by eye or using methods which do not appropriately treat x uncertainties will tend to fit relations that are inappropriately shallow.
Table 2. Summary of Fits Discussed in Section 5
ID | Sample Characteristics | Sample Size | Velocity Width | Simple Slope | Simple Intercept | Simple Scatter | Mix Slope | Mix Intercept | Mix Scatter |
---|---|---|---|---|---|---|---|---|---|
Ia | Fiducial | 4525 | V75 | 3.36 ± 0.04 | 9.94 ± 0.01 | 0.065 ± 0.0025 | 3.30 ± 0.03 | 9.9 ± 0.01 | 0.024 ± 0.008 |
Ib | Restrictions w/o 3 (No PPS/Virgo removal) | 6811 | V75 | 3.4 ± 0.04 | 9.96 ± 0.01 | 0.07 ± 0.002 | 3.34 ± 0.03 | 9.9 ± 0.01 | 0.026 ± 0.004 |
Ic | Restrictions w/o 4 (No inclination restriction) | 6970 | V75 | 3.43 ± 0.04 | 9.94 ± 0.006 | 0.072 ± 0.002 | 3.34 ± 0.03 | 9.89 ± 0.01 | 0.027 ± 0.003 |
Id | Restrictions w/o 5 (No uncertainty restriction) | 5160 | V75 | 3.32 ± 0.03 | 9.96 ± 0.005 | 0.0677 ± 0.015 | 3.28 ± 0.03 | 9.91 ± 0.07 | 0.026 ± 0.005 |
Ie | Restrictions w/o 6 (No asymmetry restriction) | 5818 | V75 | 3.33 ± 0.03 | 9.96 ± 0.005 | 0.071 ± 0.002 | 3.29 ± 0.03 | 9.9 ± 0.01 | 0.025 ± 0.0045 |
If | Forced same mass sampling | 2931 | V75 | 3.37 ± 0.035 | 9.949 ± 0.005 | 0.064 ± 0.001 | 3.326 ± 0.03 | 9.91 ± 0.01 | 0.029 ± 0.003 |
IIa | Fiducial, McGaugh mass, | 4253 | V75 | 3.48 ± 0.03 | 9.99 ± 0.01 | 0.004 ± 0.003 | 9.96 ± 0.01 | 0.004 ± 0.003 | |
IIb | Fiducial, Ponomareva mass, | 4253 | V75 | 3.34 ± 0.03 | 9.77 ± 0.01 | 9.69 ± 0.01 | 0.000 ± 0.00 | ||
IIc | Fiducial, recovered Taylor mass, | 4253 | V75 | 3.54 ± 0.04 | 9.93 ± 0.01 | 0.069 ± 0.002 | 9.91 ± 0.01 | 0.03 ± 0.01 | |
IIIa | Fiducial | 4525 | W50,P | 3.35 ± 0.04 | 9.72 ± 0.01 | 0.08 ± 0.01 | 3.34 ± 0.03 | 9.66 ± 0.01 | 0.045 ± 0.002 |
IIIb | Fiducial, turbulence corrected V75 | 4525 | V75,TC | 9.95 ± 0.01 | 0.07 ± 0.002 | 3.19 ± 0.03 | 9.90 ± 0.01 | 0.026 ± 0.003 | |
IIIc | Fiducial/Springob crossmatch | 705 | V75 | 3.33 ± 0.07 | 9.89 ± 0.01 | 0.032 ± 0.003 | 9.87 ± 0.02 | 0.01 ± 0.01 | |
IIId | Fiducial/Springob crossmatch | 705 | W50,M | 9.52 ± 0.02 | 0.026 ± 0.004 | 3.44 ± 0.06 | 9.52 ± 0.02 | ||
IIIe | Fiducial/Springob crossmatch | 705 | W20,P | 9.41 ± 0.02 | 0.033 ± 0.003 | 3.56 ± 0.07 | 9.41 ± 0.02 | ||
IIIf | Fiducial/Springob crossmatch | 705 | WC | 3.35 ± 0.07 | 9.69 ± 0.02 | 0.033 ± 0.003 | 3.37 ± 0.06 | 9.68 ± 0.02 | 0.007 ± 0.005 |
IVa | Fiducial/Springob crossmatch, McGaugh masses | 705 | WC | 9.78 ± 0.02 | 0.029 ± 0.004 | 3.68 ± 0.07 | 9.78 ± 0.02 | 0.01 ± 0.01 | |
IVb | Fiducial/Springob crossmatch, Ponomareva masses | 705 | WC | 3.45 ± 0.07 | 9.73 ± 0.02 | 0.035 ± 0.003 | 3.45 ± 0.07 | 9.72 ± 0.02 | 0.02 ± 0.01 |
IVc | Gas fraction > 2 | 1836 | W50,P | 3.80 ± 0.08 | 9.81 ± 0.02 | 0.089 ± 0.002 | 9.77 ± 0.02 | 0.05 ± 0.005 | |
IVd | 4096 | W50,P | 3.37 ± 0.04 | 9.74 ± 0.01 | 0.079 ± 0.002 | 3.39 ± 0.03 | 9.68 ± 0.01 | 0.046 ± 0.001 | |
IVe | b/a < 0.25 | 1178 | W50,P | 9.71 ± 0.01 | 0.066 ± 0.002 | 3.30 ± 0.05 | 9.70 ± 0.01 | 0.042 ± 0.006 | |
Va | 2666 | V75 | 2.82 ± 0.06 | 10.02 ± 0.02 | 0.059 ± 0.002 | 9.97 ± 0.01 | |||
Vb | 2666 | V75,TC | 10.02 ± 0.01 | 0.060 ± 0.02 | 2.85 ± 0.04 | 9.97 ± 0.01 | |||
Vc | 2666 | W50,P | 2.83 ± 0.07 | 9.83 ± 0.02 | 0.071 ± 0.002 | 9.75 ± 0.02 | 0.036 ± 0.003 | ||
Vd | 1859 | V75 | 10.01 ± 0.02 | 0.066 ± 0.002 | 10.00 ± 0.01 | 0.036 ± 0.004 | |||
Ve | 1859 | V75,TC | 3.68 ± 0.10 | 10.00 ± 0.01 | 0.073 ± 0.002 | 9.98 ± 0.01 | 0.040 ± 0.004 | ||
Vf | 1859 | W50,P | 9.73 ± 0.01 | 0.084 ± 0.002 | 9.69 ± 0.01 | 0.055 ± 0.002 | |||
Vg | DALF ≤ 40 Mpc | 299 | V75 | 9.92 ± 0.04 | 0.063 ± 0.007 | 9.88 ± 0.04 | |||
Vh | DALF ≤ 70 Mpc | 786 | V75 | 9.94 ± 0.02 | 0.072 ± 0.005 | 3.67 ± 0.08 | 9.89 ± 0.01 | 0.03 ± 0.01 | |
Vi | DALF ≤ 100 Mpc | 1663 | V75 | 3.48 ± 0.05 | 9.92 ± 0.01 | 0.063 ± 0.003 | 9.88 ± 0.01 | 0.025 ± 0.05 | |
VIa | Fiducial, scatter = 0 | 4525 | V75 | 3.15 ± 0.02 | 9.958 ± 0.003 | 0 | 9.90 ± 0.01 | 0 | |
VIb | Fiducial, scatter = 0 | 4525 | W50,P | 2.70 ± 0.01 | 9.81 ± 0.002 | 0 | 9.66 ± 0.01 | 0 |
Notes. Horizontal bars demarcate different portions of the analysis; the first section demonstrates that relaxing the criteria which define α.R results in a relatively stable fiducial relation generally applicable to the full α.100 sample. Note that we do not present results for a relation without relaxing restriction 2, which is that the sources must have stellar masses in our catalog, as it is not possible to place sources on our relation without a usable stellar mass estimate. The second region illustrates the differences between fits to the full fiducial sample using different stellar mass estimates, the third illustrates the difference induced by using different velocity definitions, the fourth provides relations with data methods/velocity definitions/sample criteria more closely matched to literature relations, and the fifth demonstrates the slope discrepancy between the high- and low-mass ends of the sample.
Download table as: ASCIITypeset image
The uncertainties reported here include both the statistical uncertainties from the fitting procedure (reported in Table 2) and systematic uncertainties (0.05 in m and 0.01 in b), estimated using the spread in best-fit relations when we relax the imposed restriction criteria in Section 3.2. Importantly, the slope and intercept of this template relation (Equation (5)) are consistent with these relations (see Section 5.1). Though defining a restricted sample is useful for ensuring that the relation is fit with a high-confidence sample, the consistency with fits to less restricted samples suggests that this α.R relation can be used to estimate distances to the broader α.100 sample, including sources which do not fit all listed selection criteria. As discussed later in this section, the corresponding uncertainties and outlier fractions will correspondingly be larger in this case.
4.2. Outlier Classes from Fiducial Relation
This fiducial relation is drawn very generally from the full α.100 catalog—itself complicated by issues of varying coverage, the impact of radio frequency interference, confusion, and uncertainties in the identification of optical counterparts—meaning that there is a possibility of distorted data which may affect the relation and be difficult to remove parametrically. This motivates use of the mixture model to de facto downweight the most anomalous data. We investigate the 80 sources with the lowest probability of membership in the bulk relation model, 9 finding the following common classes:
- 1.Sources with close companions. Thirty sources (∼38% of the 80 most extreme outliers) have nearby companions, identified in SDSS imaging by small sky (≲6', ≲120 kpc at 70 Mpc) and velocity (∼few hundreds of kilometers per second) separations. Seven of these sources are deblended from nearby sources in the ALFALFA data-reduction process. Nearby companions may impact sources' positions on the BTFR, as confusion with gas-rich sources within the ALFALFA beam or distortion of H i disks via interaction add significant nonrotational velocity to these source profile widths. Other studies find evidence that line profiles are affected by the presence of a close companion, including twice as many kinematic anomalies in pairs versus field galaxies (Kannappan & Barton 2004), significantly more asymmetry in H i line profiles for sources in close pairs versus isolated sources (Bok et al. 2019), and more flat-topped or widened H i profiles for sources in close pairs (Zuo et al. 2022). The outlying sources with close companions sit both above and below the bulk relation in Figure 5, demonstrating that sources with close companions add scatter to the bulk relation but do not preferentially displace these sources to larger velocity widths.
- 2.Sources with misidentified inclinations. Eight (10%) of the 80 extreme outliers can be confidently identified as nearly face-on in SDSS imaging. All eight sources have external evidence suggesting SDSS inclinations may be unreliable, including HYPERLEDA inclinations >10° different for 7/8 and detailed modeling in Meert et al. (2015) and Galaxy Zoo classification as "not edge-on" (Willett et al. 2013) for the eighth. All eight of these sources sit to the left of the bulk relation in Figure 5, suggesting that their inclination-corrected velocity is underestimated, consistent with the interpretation that the 1/sin(i) factor is too small for these sources. Though the inclination selection criteria in Section 3.2 are designed to account for most outliers due to misattributed inclinations, a small fraction of overestimated SDSS-based inclinations in the above HYPERLEDA-ALFALFA comparison persists even with this selection criteria. The persistence of a (albeit small, ∼0.25% of the total sample) fraction of misattributed inclination outliers is unsurprising.
- 3.Sources with inconsistent stellar mass estimates. Nine (∼11%) additional outliers have stellar masses that are likely to be unreliable. Four such sources have bright foreground stars in the same field as the galaxy in SDSS imaging. These sources all sit significantly above (large, positive ) the general relation, suggesting that their masses are overestimated, potentially appearing brighter than expected because of leaking flux from the foreground star. The remaining five sources without foreground stars have inconsistent >5× SDSS and unWISE stellar masses. These sources all have an SDSS photometric object identifier (PhotoObjID) which is offset from the center of the source and instead centered on a compact region of brightness ≪the size of the galaxy, meaning that a misassigned PhotoObjID may contribute to these sources' separation from the bulk relation.
- 4.Sources with inconsistent velocity measures. Several sources may be outliers because of unreliable V75 measurements. Twenty-one sources identified as outliers when fitting the V75 relation are not identified as outliers when fitting the W50,P relation. Ten of these sources overlap with the above categories; 9/21 have close companions, 1/21 has an overestimated inclination. The remaining 11 sources represent a fraction of the α.R 3000-source sample (0.3%) significantly smaller than the estimated percentage of ≲4% incorrect widths described in Section 2.3. Even if the majority of these 21 sources are primarily outliers because their V75 values are incorrect, this still indicates an outlier percentage 0.7% ≪ 4%. This closely analyzed sample of 80 outliers represents only the most discrepant sources from the fiducial relation; this underrepresentation of width-based outliers in this analysis may indicate that inconsistent widths increase the scatter of the relation, but it is not significant enough to dramatically increase the number of outliers in the ALFALFA BTFR.
Download figure:
Standard image High-resolution imageThe above four classes account for 58/80 of the sources identified as outliers by their low probability of membership in the bulk component of the Gaussian mixture model. The remaining ∼quarter (22/80) of outliers are somewhat more complicated to interpret. Three are additional blue sources with irregular morphologies, three are the remaining highly inclined sources which may be affected by unaccounted for extinction/absorption corrections, but most (16/22) are too faint in SDSS imaging to interpret and are not particularly anomalous in other one- and two-parameter descriptions of the full ALFALFA data set (e.g., gas fraction versus mass, V75 versus W50,P , S/N, etc.). Regardless, we are able to account for the majority of outliers from the fiducial relation, finding tractable reasons for their offsets.
4.3. Residuals and Expected Accuracy of Distance Estimates
We use all ALFALFA extragalactic sources with recommended widths and Durbala et al. (2020) SDSS-based stellar masses to estimate the average uncertainty of a BTFR-derived distance measurement. We propagate these errors using the observational uncertainties on photometry, width measurement, inclination, and mass-to-light ratio, plus the uncertainties of the best-fit relation and intrinsic scatter. For this full ALFALFA sample, we find a median uncertainty on log(distance) estimates of 0.17 dex, with 16th and 84th percentile uncertainty estimates of 0.1 and 0.28 dex. The cumulative distribution of uncertainties for BTFR-derived distances is shown as the gray curve in Figure 6, with vertical bars at each of these reported percentiles. The bump in the α.R curve at 0.2 dex is caused by the systematic uncertainty on ALFALFA H i flux measurements, which becomes the dominant uncertainty on log(Mb ) measurements at high S/N.
Download figure:
Standard image High-resolution imageWhen we compare the predicted distance measurements to ALFALFA distances, which are calculated using flow models as described in Haynes et al. (2018), we find that the distribution has a significantly larger standard deviation than would be expected if the departures were pure Gaussian noise (e.g., μ = 0, σ = 1). The gray distribution in Figure 7 is the difference between the ALFALFA distances and the BTFR-based distances for the full ALFALFA sample. The distribution is more sharply peaked than the best-fit Gaussian, indicating the heavy tails of this distribution and the significant number of sources with discrepancies between these distance measurements: ∼6% of these distances are >3σ different. This is larger than expected if the differences are solely attributable to Gaussian noise; however, given the nonzero outlier fraction from Gaussian mixture modeling, the numerous causes of additional noise around the relation discussed above, and the corresponding increase in noise from looking at a less restricted sample, we expect a larger-than-Gaussian number of outliers. This distribution is also asymmetric, with more sources at positive values of DBTFR − DALFALFA than negative values. When plotted using their ALFALFA distance estimates, such sources sit above the template BTFR; these sources are more likely to belong to the outlier component of the best-fit likelihood than the bulk component, and thus may be a byproduct of the average data quality of the sample (in fact, a number of the outlier classes identified in Section 4.2 preferentially set sources above the best-fit relation, including overestimated inclinations and overestimated stellar masses either from foreground objects or off-center photometry). Indeed, when we consider this distribution for the more restricted, higher-quality α.R sample, we see a smaller (albeit still present) degree of asymmetry in the distribution of distance residuals, with the skewness reduced from −4.2 to −1.7.
Download figure:
Standard image High-resolution imageUsing the more restricted α.R sample of sources, the median, 16th, and 84th percentile uncertainties decrease to 0.092, 0.07, and 0.12, respectively (shown as the green curve in Figure 7, which reaches larger cumulative fractions at a given uncertainty than the gray curve). Overall, these distances are in better agreement with the ALFALFA distances, demonstrating, as expected, that pruning for data quality can reduce the average uncertainty and outlier occurrence in an application sample. Despite this reduction, there are still about ∼5% sources with distance disagreements larger than 3σ, and the heavy tails demonstrate the presence of outliers even in the restricted sample.
Additionally, comparing the residuals from the template relation with various sample properties allows us to search for systematics that may be important when using this relation to measure source distances. We find that sources are generally distributed symmetrically about the relation, as shown in the top-left panel of Figure 8, although there are hints of a falloff at the lowest masses/velocities, suggesting galaxies have larger velocities at a given mass. A similar trend appears in the residuals of the turbulence-corrected relation, suggesting that if the fractionally larger contribution of turbulence to velocity widths at low velocity causes this departure, the single turbulence correction as a function of velocity proposed in Yu et al. (2020) may not fully account for the variation of this correction across the galaxy population. The strongest correlation we find is between rotational velocity and perpendicular distance (left panel, second row of Figure 8). This moderate correlation is likely attributable to the distribution of data points rather than to a slope mismatch for a number of reasons. First, a large number of sources with the largest positive offsets do not sit at the lowest masses and rather at log(Mb ) > 9.5, suggesting that these sources have large perpendicular offsets specifically because they have low rotational velocities for sources of their mass. We further test this by varying the slope of the relation and recalculating the correlation coefficients between the residuals and velocity/mass, finding that as the magnitude of the Vrot,75 coefficient decreases, the magnitude of the log(Mb ) coefficient increases—if the correlation was due to a mismatch in the slope, both coefficients would decrease as we reached the correct value. In addition to this distribution-based correlation, we find additional, weak correlations between the residuals and the heliocentric velocity, a selection effect due to the flux-limited strategy of the ALFALFA survey (Kourkchi et al. 2022), and with gas fraction, consistent with results from (Ponomareva et al. 2018).
Download figure:
Standard image High-resolution image4.4. Impact of Fitting and Data-selection Methodologies on Results
To explore structure in this relation beyond a simple power-law fit, we include constrained B-spline quantile fits (implemented in R as COBS; Ng & Maechler 2015) overlaid in Figure 4. The overplotted gold lines are spline fits to the central 20, 50, and 80% percentiles of the log(Mbary) values for the α.R sample, where the blue lines are the same for a more inclusive subsample of α.100. In this subsample, sources with velocity uncertainties of >0.2 dex have been removed, as COBS accounts for dependent variable uncertainty and not independent variable uncertainty. When we fit to the full α.100 sample, including sources with still larger uncertainties, the relation appears shallower still, exactly as expected given the impact of these uncertainties. We note that this likely indicates that the "true" relation is steeper than the fit splines. Beyond this, these fits highlight the additional structure in the sample beyond a simple power law, as both lines curve away from power-law fits in the high- and low-mass range. We note that the fit to the less restricted data set has increased spread and more significant deviation, demonstrating that some part of this deviation is related to data quality. These effects persist in the spline fits to the α.R sample, suggesting that some portion of this structure may be intrinsic. Previous studies have discussed the expectation of curvature in the BTFR as predicted by semi-analytic ΛCDM modeling (Trujillo-Gomez et al. 2011), as well as the observational difficulties of conclusively finding such deviations (Iorio et al. 2017). Generally, the expected curvature from semi-analytic models curves to faster rotation for lower galaxy mass, leading to higher slopes at lower galaxy masses and lower slopes at higher galaxy masses. Though we see some downturn in the shape of the relation at high rotational velocity, the relation curves upward at high velocity; further dedicated study is necessary to determine the origin and interpretation of this additional apparent structure.
5. Discussion
The tests in this section, compiled in Table 2 and discussed below, demonstrate that (i) the best-fit relation is not impacted by the restrictions used to define α.R, and (ii) the best-fit relation changes when demographic-shifting restrictions or changes in observational proxy are introduced. The range of slopes we recover is consistent with the range of slopes, from 2.8 (Ponomareva et al. 2018) to 4.1 (Papastergis et al. 2016), measured in studies with heterogeneous sample selections and properties across the literature. The benefit of running this analysis on subsamples of α.100 is that it allows a self-consistent comparison as observables and demographic ranges are modified one-by-one, demonstrating the sensitivity of our relation to each property separately. The comparisons presented in this section can be used to guide decision-making about whether or not it is appropriate to use this relation for a specific application beyond the clearly necessary matches in velocity definition, mass definition, etc.
5.1. Consistency of Results
We confirm the consistency of the α.R fit with the broader ALFALFA sample by running fits relaxing each of the criteria in Section 3.2 in sequence, collecting the best-fit parameters for these relations in section I of Table 2. As demonstrated in Figure 9, the lines fit to less restricted versions of α.100 are consistent with the fiducial relation. In all cases, the relations fit with an outlier component have higher intercepts than those fit without such a component, consistent with the fact that most of the outliers observed sit above and to the left of the bulk distribution of sources (see Figure 5).
Download figure:
Standard image High-resolution image5.2. Reconciling Differences with Literature Results
5.2.1. Stellar Mass Measurement Method
Stellar mass estimates can be computed using a variety of methods, from the optical photometry-based method used for the fiducial relation (Durbala et al. 2020) to NIR estimates calculated using a variety of mass-to-light ratios, e.g., ϒ3.6 ≃ 0.5 (McGaugh & Schombert 2014) and ϒ3.6 = 0.36 (Ponomareva et al. 2018). Fits IIa,b demonstrate that the slope of the relation deviates from the reported value of 3.30 when either of these conventions is used. In some cases, optical photometry may be available for most—but not all—sources within a sample. Using Durbala et al.'s (2020) relations between NIR and optical stellar mass estimates, we "recover" optical mass estimates from NIR photometry. Though the dispersion of these sources around the fiducial line is comparable to the optical mass-derived relation, fitting a relation using these masses results in a steeper slope (fit IIc), potentially attributable to structure in the relationship between NIR-calculated stellar masses and optical masses (e.g., the three-part piecewise discussed in Kourkchi et al. 2022).
5.2.2. Velocity Definition
The BTFR presented here uses a unique measure for rotational velocity. Fits IIIa–IIIf of Table 2 are fits to subsamples of α.R using different velocity definitions: W50,P measurements from the ALFALFA catalog as well as W20,P (width at 20% profile peak), W50,M (width at 50% mean profile flux), and WC (W50,P corrected for broadening due to turbulence and instrumental effects) measurements for 705 sources in Springob et al.'s (2005) catalog of H i line widths.
Comparing the slopes of these relations can be useful for placing the new proposed velocity tracer (V75) into context. Generally, BTFRs fit with unresolved velocities and velocities measured using tracers which do not extend far into the galactic halo have shallower slopes than tracers which reliably reach the extremes of the rotation curve (e.g., Verheijen 2001; Brook et al. 2016; Ponomareva et al. 2017). Given that the V75 relation for the Springob et al. (2005) example is marginally shallower than the corresponding W20,P and W50,M relations (∼1 and 2σ, respectively), these widths reach larger velocities at low mass and lower velocities at high mass than V75. This is consistent with the interpretation that these velocities may probe somewhat larger radii in the galaxy rotation curves (Lelli et al. 2019). Though V75 is an efficient velocity width to measure for large samples of galaxies, studies intended to probe the flat part of galaxies' rotation curves may prefer other width measures.
5.2.3. Mass Cutoff
The mass range covered by a BTFR template may affect the slope of an output relation. The decrease in baryon fraction at high and low mass cause semi-analytic models to predict curvature in the BTFR (Trujillo-Gomez et al. 2011), although studies of the BTFR at low mass suggest its linear trend remains consistent to M* ∼ 106 M⊙ (Iorio et al. 2017). Fits Va–Vf show significantly (∼9–10σ) different slopes for samples of higher () and lower () mass subsamples of α.R, indicating that flat mass cuts dramatically impact the best-fit relation. As ALFALFA is a sensitivity-limited sample, imposing distance cuts also changes the mass distribution of a sample as proportionally more low-mass galaxies are observed at lower distances. We run three fits with hard upper distance cuts at 40, 70, and 100 Mpc to determine the magnitude of this effect and find, as expected, that the volume-limited sample with a distance cut of 40 Mpc has a significantly (∼3.7σ, fit Vg) higher slope than both the fiducial relation and the samples with 70 and 100 Mpc distance limits (fits Vh and Vi, respectively). We also run fits which force quasi-uniform sampling, drawing nine galaxies per 0.25 dex mass bin between 7.5 and 11.5 log(Mb ). With only 146 galaxies per fit, these relations are sensitive to the sample drawn (consistent with Sorce & Guo 2016), however they give on average fits that have a somewhat steeper slope (∼3.5 versus 3.3) than the fiducial relation.
Some of this difference may be caused by statistical bias due to flat cuts in parameter space. As these cuts in ) are not perpendicular to the best-fit line, flat cuts will change the distribution of points above and below the best-fit line. For sources with , this bias decreases the slope of the best-fit relation. Despite this, there is additional evidence for structure in the relation presented here. For instance, fits run with hard distance cuts (Vg–Vi) change the mass distributions of the sample without cutting data at an angle from the best-fit relation. Additionally, the constrained B-splines deviate from a best-fit linear relation at both high and low velocity (see Figure 4), demonstrating that structure beyond a power-law fit is present even in the noncensored data set. This reinforces the interpretation that some of this slope discrepancy is due to intrinsic structure in the distribution of data.
5.2.4. Gas Fraction
One of the most discrepant slopes in the literature (4.15 ± 0.23; Papastergis et al. 2016) also uses a subsample of ALFALFA to fit their relation, demonstrating the importance of demographic differences in addition to observational proxies for template-application agreement. Their sample differs from α.100 by imposing the following criteria: (i) a stringent quality threshold, including both a S/N cut and a manual inspection of all candidates above this threshold; (ii) a larger axial ratio cut of b/a < 0.25 (corresponding to an inclination cut i > 78°); and (iii) a stringent cut to include only gas-rich sources in the sample, with MH I/M* > 2. Fits to subsamples of α.R using these cuts as well as an additional subsample with an upper mass limit of Mb < 1010.75 M⊙, which accounts for the fact that low-mass galaxies are more likely to be gas-rich, are collected as fits IVc–e in Table 2. The best-fit slope for the sample with a gas fraction restriction imposed is , significantly higher than the other slopes for α.R subsamples. This is consistent with other work finding that preferentially weighting gas-rich galaxies marginally increases BTFR slope (Lelli et al. 2016), and that, at a given mass, galaxies with a higher spin parameter (and thus lower rotational velocity) have a higher gas fraction (Huang et al. 2012). We also note that the small number of sources in the final Papastergis et al. (2016) sample (97 galaxies) may contribute to some of the difference in slope, as small samples from an otherwise uniform data set may result in significantly differing slopes (Sorce & Guo 2016).
5.2.5. Fitting Routine
Astrophysical problems with errors in both the dependent and independent variables require rigorous treatments of model fitting (Hogg et al. 2010), with previous studies demonstrating this effect's relevance for best-fit estimates of the slope (Weiner et al. 2006; Bradford et al. 2016) and intrinsic scatter (Stone et al. 2021) of TFRs. To ensure consistency in our results, we run all literature comparison samples through our fitting algorithm, and also run tests setting intrinsic scatter to 0. We find, consistent with Papastergis et al. (2016), that relations without an intrinsic scatter component have shallower slopes (see, for example, the differences between Ia versus VIa, and IIIa versus VIb in Table 2), and that this effect roughly scales with the size of the best-fit scatter (which is clearly visible in the smaller discrepancy between the best-fit mixture models with/without scatter). These results are in agreement with previous work by Weiner et al. (2006) on the TFR, further demonstrating the role of the fitting routine in determining the presented relation.
6. Application: Recovering Virgo Infall
We use the BTFR presented here to find distances to sources near Virgo, recovering signatures of infall onto the nearby (∼17 Mpc) cluster. Previous studies using large samples of galaxies with relatively precise (5%–25% uncertainties) distance measurements (Karachentsev & Nasonova 2010) and high-precision distances measured using the tip of the red giant branch in resolved color–magnitude diagrams (Karachentsev et al. 2018) find clear signatures of infall attributable to a mass overdensity of ∼8.6 × 1014 M⊙.
Within 15° of M87 (R.A. 12h30m4942, decl. +12d23m28043; here considered the center of Virgo), there are 517 sources in the ALFALFA catalog with Vhel < 3300 km s−1. Of these sources, 154 are in the subsample of ALFALFA including all α.R selection criteria but #3 ("Sources far from large overdensities"); we test best-fit infall using this sample (called "all Virgo vicinity, α.R" in Table 3), as well as a subsample of sources in the Virgo vicinity unlikely to be members of the virialized cluster core by removing Extended Virgo Cluster Catalog members (Kim et al. 2014), leaving 80 sources in the "pure infall"/"No EVCC members, α.R" 15° sample.
Table 3. Infall Fits to Sources in the Vicinity of Virgo
Sample | r0 | H0 | Doff | VC,off | σV |
---|---|---|---|---|---|
(Mpc) | (km s−1 Mpc−1) | (Mpc) | (km s−1) | (km s−1) | |
All Virgo vicinity, α.R | 7.58 ± 0.45 | 72.0 ± 4.0 | 0.25 ± 0.23 | 95.3 ± 41.9 | 351.7 ± 15.6 |
No EVCC members a , α.R | 7.8 ± 0.7 | 83.7 ± 5.8 | 0.05 ± 0.25 | 145.8 ± 60.3 | 290.6 ± 18.2 |
Non-Virgo, α.R | 4.3 ± 1.9 | 29.5 ± 3.0 | 0.01 ± 0.00 | 505.4 ± 37.3 | 453.8 ± 11.8 |
a Cluster members have been excised from this sample based on their inclusion in the Extended Virgo Cluster Catalog (Kim et al. 2014)
Download table as: ASCIITypeset image
In a Virgocentric reference frame, we expect source infall velocity on a single attractor to follow the expression discussed in Karachentsev et al. (2018) and Karachentsev & Nasonova (2010):
where rVirgo is the distance of the source to Virgo:
The infall onto Virgo can thus be described by the parameters r0, the zero-velocity radius of the cluster, H0, the Hubble parameter, which may depart significantly from 70 km s−1 in this model, VC,off, the offset of the cluster velocity from fiducial 984 km s−1 in the Local Group (LG)-centric frame, DC,off, the offset of cluster center from fiducial 17 Mpc distance, and Verr, the scatter in velocities due to random motion. Note that Θ is the angular separation of the galaxy from the cluster center (M87) and that DGal is the distance to the galaxy from the observer.
These distances can be measured using the fiducial BTFR (Equation (5)), V75 measurements, H i fluxes, and SDSS source magnitudes. We find good agreement (mean difference = 0, dispersion 1σ; see Figure 7) within the reported uncertainties for 109 sources with independent distance measurements (Kashibadze et al. 2020), underscoring the importance of properly handling uncertainties when modeling infall. With these distances, we plot binned medians of the Virgocentric infall of these galaxies in Figure 10, along with the best-fit model (Equation (6)). Though these curves appear consistent with previous studies (Karachentsev & Nasonova 2010) and with expectation (near 0 within the virial radius of the cluster, decreasing below Hubble flow outside of the virial radius, increasing toward Hubble flow at larger distances), the relatively small magnitude of this velocity difference (a few hundred kilometers per second) is difficult to see in the raw data, and likely to have smaller magnitudes in binned averages given the significant distance uncertainties in the data.
Download figure:
Standard image High-resolution imageModel fitting provides another avenue for describing this signature of infall. To handle the significant distance uncertainties, the model used here simultaneously fits for the true distance to each galaxy within the sample as well as parameters describing the infall onto the cluster. This large number of free parameters necessitates the use of an efficient method for sampling this likelihood function. We use pySTAN, a Python implementation of the Stan Hamiltonian Monte Carlo statistical analysis program 10 to evaluate this likelihood.
We assume a prior on the distance to the galaxies, allowing the sources to preferentially be further than their measured distances. Such a prior corrects for the Malmquist bias of a flux-limited sample, which is roughly applicable for our sample. We use Gaussian priors for the remaining parameters: μr0, σr0 = 7.3, 3.7 Mpc; μH0, σH0 = 73, 50 km s−1 Mpc−1; Mpc; km s−1; μVerr, σVerr = 100, 25 km s−1. These priors are also truncated to avoid the unphysical situations of negative dispersion, r0, and H0 values. The priors on DC,off and VC,off are slightly more constraining in order to avoid one common poor fit, wherein the S-shape (see the blue and black curves in Figure 10) can move to a poorly sampled region of distance–velocity space which is unlikely to be the true position/velocity of the cluster.
A summary of these fits is collected in Table 3. In the case of the fits run on Virgo-vicinity samples, the highly nonzero values of r0, sensible values of H0, and the position and velocity of the cluster suggest that we are able to recover the signature of infall from this data. In order to provide a "null result" comparison, we also run this fit on a sample of sources selected to be Θ ≫ 15° from Virgo ("Non-Virgo, α.R" in Table 3). As is seen in the third row of this table, this best-fit model finds a smaller, less significantly nonzero r0 (we expect →0 in the zero infall case) as well as a low H0 and large Vc,off value. We note that the best-fit H0 for the "null" sample is significantly lower than the true value, which is not consistent with the expectation that this relation should converge to Hubble flow in the presence of zero infall. Instead, this result moves the sharpest features of the relation to a poorly constrained portion of distance–velocity space (above and in front of simulated "Virgo," Vhel > 1000 km s−1, Dgal < 15 Mpc), and the preferentially larger uncertainties and distance priors allow this to occur by placing distances further "behind" the simulated cluster, decreasing the best-fit H0. This demonstrates that priors which broadly reflect the expected cluster position and velocity are important in this fitting process.
7. Conclusions
Using a velocity definition previously unused in BTFR applications (V75, where the integrated flux of a line profile reaches 75% of the "flat, fully integrated flux"), we present a BTFR representative of the full ALFALFA sample:
where Vrot,75 is V75 corrected for inclination and redshift. We fit this fiducial relation with a highly restricted portion of the full α.100 sample (α.R, a sample of ∼4500 galaxies or ∼10% of α.100) and demonstrate that the full ALFALFA sample agrees with this relation, albeit with an expected increase in uncertainty and outlier fraction. This relation is fit using a mixture model to simultaneously find the best-fit BTFR while controlling for the fraction of unreliable but difficult to excise data (a common issue for large survey data sets). We demonstrate that the majority of the sources downweighted by this process are justifiably separate from the reliable measurements and regularly rotating spiral galaxies that define the bulk of the α.R relation. An overwhelming majority of these sources have reasons for being offset from the relation, including a close companion, an overestimated inclination with underestimated uncertainties, a stellar mass that may be unreliable, or a velocity width that may be unreliable. Altogether, these account for ∼70% of the most discrepant outliers, motivating our use of a mixture model to downweight these sources which are difficult to identify automatically.
Using subsets of the fiducial (∼4500 galaxy) sample, we demonstrate the effects of mass sampling, stellar mass estimation methods, velocity width measurement methods, and various selection effects on the slope of the best-fit BTFR, collecting these in Table 2. Comparing the fiducial relation with these fits motivates us to make recommendations for circumstances where the relation presented here should/should not be applied.
The following is a summary of recommendations for use of the BTFRs presented in this paper:
- 1.Application samples should use the same velocity width definition used in the fitting sample. As noted in other detailed studies of the BTFR, we find marginal differences in slope when using different velocity definitions to determine Vrot, even though all of the definitions used here are measures derived from the unresolved line profile.
- 2.Samples which have significantly different mass ranges or which have drastically different mass samplings should not be used with this relation. Forcing high- or low-mass samples created the largest discrepancies of the tested demographic cuts, indicating the presence of additional nonlinear structure independent of the velocity definition used to calibrate this relation. If this relation is to be applied to volume-limited samples, the mass range of those samples will determine to first order their agreement with the presented relation. Forcing uniform sampling has a marginal effect on the relation (∼2σ slope difference), consistent with the expectation that a sensitivity-limited sample oversamples the most massive, brightest galaxies.
- 3.Samples should use consistent stellar mass estimation methods—in this case, optical photometry-based estimates of galaxy stellar masses. For small subsamples where no such data are available, corrected NIR estimates can be used; however, using these estimates for the full sample results in a significantly different slope to the best-fit relation.
- 4.Samples should use matched flat gas fraction cuts. We find that imposing a flat gas fraction cut preferentially includes galaxies at lower Vrot at log(Mb ) > 9, increasing the slope of the relation.
Within the range 3–4, an ALFALFA-based BTFR will have a slope largely dictated by the properties of the sample used to calibrate the relation. Through cherry-picked subsamples of α.R and the use of different width and stellar mass definitions, we found best-fit slopes across the range 2.8 ∼ 4.1. This has implications for the use of the BTFR as a distance indicator, as the sample the template is applied to should have similar properties (mass range, mass sampling, gas fraction, velocity definition, stellar mass definition) to the sample used to calibrate the relation. As demonstrated in Section 5.2, differences in both parameter range and sampling may cause systematic differences in the distribution of points along the fiducial BTFR (leading to different best-fit slopes for subsample BTFRs); to use this relation as a distance indicator for a sample of sources, the properties of the sample should be well matched to the properties of the template in order to produce distance estimates that are, on average, accurate.
Additionally, this demonstrates the challenge of observationally determining the slope, scatter, and thus cosmological significance of the "fundamental" BTFR. Interest exists in tracking the evolution of the fundamental BTFR beyond the local universe, given its implications for the relationship between halo mass and baryon mass over cosmic time (Glowacki et al. 2021). Given the low redshift limit (z < 0.06) of the ALFALFA data set, the search for such evolution is beyond the scope of the work presented here. Dedicated studies find little evidence for the evolution of the BTFR within the ALFALFA redshift range (Ponomareva et al. 2021), meaning this BTFR can be used with the expectation of no significant bias due to redshift evolution within ALFALFA volume so long as the other sample properties (particularly mass range and sampling) are well matched. For future studies, this work emphasizes the importance of ensuring that the evolving low-mass (<107 M⊙ at z ≃ 0) end of the BTFR is sampled adequately in these searches (or is robustly treated below detection limits; Pan et al. 2021) in order to avoid the introduction of slope differences caused by mass sampling effects.
The relation presented here has application to H i-selected surveys, and is particularly optimized for large samples that may include spurious data points. The COG-based velocity definition used here, first presented in Yu et al. (2020), is similarly ideal for use in large surveys as it can be calculated completely automatically (as has been done with related velocity width V85 for the full ALFALFA sample; Yu et al. 2022). Through multiple tests, we demonstrate that the returned widths and their uncertainties are generally successful descriptions of known velocity widths, even when profiles have been degraded to S/Ns comparable with average sources in the ALFALFA sample and when they include simulated baseline effects. We estimate <7% of sources in a sample with an ALFALFA-like S/N distribution have underestimated uncertainties, based on the success of the width-fitting program at recovering these known velocities within 3σ.
In conclusion, this work demonstrates that large surveys that can be efficiently and uniformly reduced using automated methods provide a test for how different sample properties or selection effects may bias a resulting BTFR. Future large-scale H i surveys including CRAFTS, undertaken by FAST (Zhang et al. 2021), and the recent MIGHTEE-H i survey, undertaken with MeerKAT (Ponomareva et al. 2021), will continue to increase our sensitivity to H i-rich galaxies at lower masses and larger distance and redshifts than before, meaning that the BTFR will continue to be a compelling tool for measuring distances and galaxy evolution over cosmic time.
In memory of Riccardo Giovanelli, without whom this work would not exist, and Christopher Springob, whose research was foundational to much of our work. We thank our referees for their careful reading and constructive suggestions, which significantly improved this work. C.J.B. and M.P.H. acknowledge support from NSF/AST-1714828 and grants from the Brinson Foundation. We thank all members of the Undergraduate ALFALFA Team for their contributions, especially Mary Crone-Odekon, Anhad Gande, and Rose Finn, as well as NSF grant Nos. AST-1211005, AST-1637339, and AST-2045369 for support. We acknowledge the usage of the HyperLeda database (http://leda.univ-lyon1.fr). This research made use of Astropy, 11 a community-developed core Python package for Astronomy (Astropy Collaboration et al. 2013,2018) and matplotlib (Hunter 2007).
Acknowledgments
From 2011 to 2018, the Arecibo Observatory was operated by SRI International under a cooperative agreement with the National Science Foundation (AST-1100968) and in alliance with Ana G. Méndez-Universidad Metropolitana, and the Universities Space Research Association. Currently, the Arecibo Observatory is a facility of the National Science Foundation operated under cooperative agreement (AST-1744119) by the University of Central Florida in alliance with Universidad Ana G. Méndez (UAGM) and Yang Enterprises (YEI), Inc.
Facility: Arecibo. -
Software: AstroPy (Astropy Collaboration et al. 2013, 2018), emcee (Foreman-Mackey et al. 2013), Matplotlib (Hunter 2007), NumPy (Harris et al. 2020), pySTAN.
Appendix A: Baryonic Tully–Fisher Relation Fitting
As discussed elsewhere, the method used to fit a relation with uncertainty in both the x and y variables can affect the output BTFR (e.g., Bradford et al. 2016). For reproducibility, we provide the derivation of the likelihood used here. As used in Lelli et al. (2019) and Papastergis et al. (2016), and as discussed in Hogg et al. (2010), optimizing this likelihood minimizes the scaled orthogonal distance of data points from the central relation. As is regularly done in studies of the BTFR, we use a relation that simultaneously fits for the slope, intercept, and intrinsic scatter of the relation—in the case of this likelihood, the intrinsic scatter is modeled as a Gaussian perpendicular to the best-fit line. We use three parameters to describe the best-fit relation: m, the relation's slope, b, the relation's vertical offset, and σ⊥, the intrinsic perpendicular scatter of the relation. The following transformations, illustrated in Figure 11, are useful for fitting this likelihood: , , , and . Rewriting
and gives
Again, our model assumes that the data are distributed about the best-fit relation with an intrinsic Gaussian scatter σ⊥. The probability of a set of data parameters = log(Mbary/M⊙) and given model parameters θ, b, and σ⊥ is thus
which when rewritten, replacing θ with m (slope) and b (intercept):
However, we do not observe ; instead, we observe and . In theory, we want to marginalize over all possible values of the true values:
If we assume that the errors in all three variables are Gaussian distributed, then we can use the fact that a convolution of two Gaussians is itself a Gaussian to marginalize as desired, giving the likelihood:
where .
Download figure:
Standard image High-resolution imageIn reality, likelihoods can be built to more correctly address the asymmetries of these errors due to their true asymmetry/non-Gaussianity. However, depending on the expression of the errors chosen, these integrals do not marginalize out and the computational requirements for evaluating the likelihood function become severe. Using pySTAN, we test the effect of relaxing some of these criteria, and do not find that the improvements in accuracy of this fit scale with the increasing complexity of these models, especially given selection criteria which select for the largest uncertainties and thus the data points where these effects will be most significant.
The mixture model used to find the fiducial relation contains two components, one identical to Equation (A6), and a second component offset perpendicularly from this component. This model includes a binary flag, which is = 1 if a source truly belongs to the source component and which is = 0 if the source belongs to the outlier component. These flags can be marginalized over such that the probability of belonging to each component does not need to be fit for each source individually. Instead, choosing a prior with some parameter P describing the average probability of belonging to the outlier component, marginalizing over the binary flag parameter results in the following likelihood for the outlier model with three additional parameters, the offset (o), intrinsic scatter (σol ), and average probability of belonging to the outlier component (Pol ):
We test additional likelihoods that include more complicated outlier parameterizations, and find that additional parameters are either unconstrained or result in equivalent likelihoods to the ones best fit by the mixture and single-component models.
Footnotes
- 7
Such guesses are important as this is a nonlinear least-squares problem, meaning the cost surface can be complicated and a simple optimization routine may find a local minimum which is not the global best fit.
- 8
- 9
See dfm.io/posts/mixture-models for more details.
- 10
Stan Development Team (2021). Stan Modeling Language Users Guide and Reference Manual, v2.28, available online: https://mc-stan.org.
- 11