Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

WISE2MBH: A scaling-based algorithm for probing supermassive black hole masses through WISE catalogues

J. Hernández-Yévenes [Uncaptioned image],1 N. Nagar [Uncaptioned image],1 V. Arratia [Uncaptioned image]1 and T. H. Jarrett [Uncaptioned image]2,3,4
1Departamento de Astronomía, Universidad de Concepción, Av. Esteban Iturra s/n, Casilla 160-C, Concepción, Chile
2Department of Astronomy, University of Cape Town, Rondebosch, 7700, South Africa
3Centre for Astrophysics and Supercomputing, Swinburne University of Technology, John Street, Hawthorn, 3122, Australia
4Institute for Astronomy, University of Hawaii at Hilo, 640 N Aohoku Pl 209, Hilo, HI 96720, USA
Contact e-mail: jheryev@gmail.com
(Accepted 2024 May 15. Received 2024 May 14; in original form 2023 September 27)
Abstract

Supermassive Black Holes (SMBHs) are commonly found at the centers of massive galaxies. Estimating their masses (MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT) is crucial for understanding galaxy-SMBH co-evolution. We present WISE2MBH, an efficient algorithm that uses cataloged Wide-field Infrared Survey Explorer (WISE) magnitudes to estimate total stellar mass (Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT) and scale this to bulge mass (MBulgesubscript𝑀BulgeM_{\text{Bulge}}italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT), and MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT, estimating the morphological type (TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT) and bulge fraction (B/T𝐵𝑇B/Titalic_B / italic_T) in the process. WISE2MBH uses scaling relations from the literature or developed in this work, providing a streamlined approach to derive these parameters. It also distinguishes QSOs from galaxies and estimates the galaxy TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT using WISE colors with a relation trained with galaxies from the 2MASS Redshift Survey. WISE2MBH performs well up to z0.5similar-to𝑧0.5z\sim 0.5italic_z ∼ 0.5 thanks to K-corrections in magnitudes and colors. WISE2MBH MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates agree very well with those of a selected sample of local galaxies with MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT measurements or reliable estimates: a Spearman score of similar-to\sim0.8 and a RMSE of similar-to\sim0.63 were obtained. When applied to the ETHER sample at z0.5𝑧0.5z\leq 0.5italic_z ≤ 0.5, WISE2MBH provides similar-to\sim1.9 million MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates (78.5% new) and similar-to\sim100 thousand upper limits. The derived local black hole mass function (BHMF) is in good agreement with existing literature BHMFs. Galaxy demographic projects, including target selection for the Event Horizon Telescope, can benefit from WISE2MBH for up-to-date galaxy parameters and MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates. The WISE2MBH algorithm is publicly available on GitHub.

keywords:
galaxies: general – infrared: general – quasars: supermassive black holes - methods: data analysis
pubyear: 2024pagerange: WISE2MBH: A scaling-based algorithm for probing supermassive black hole masses through WISE cataloguesC

1 Introduction

Supermassive black holes (SMBH) are characterized by having masses (MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT) ranging from similar-to\sim 105 to 1010Msuperscript105 to superscript1010subscript𝑀direct-product10^{5}\text{ to }10^{10}M_{\odot}10 start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT to 10 start_POSTSUPERSCRIPT 10 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT and are believed to be located at the centers of all galaxies with a bulge, including the Milky Way (e.g., Ferrarese & Ford, 2005; Graham, 2016). The presence of SMBH is inferred from observations of stellar and gas motions in galactic nuclei (e.g., Genzel et al., 2010; Saglia et al., 2016), as well as powerful nuclear X-ray to radio emission (e.g., Broderick et al., 2015; Liu et al., 2022). The Event Horizon Telescope (EHT) has provided the most direct evidence of the existence of SMBH (Event Horizon Telescope Collaboration et al., 2019, 2022). SMBH play a crucial role in shaping the evolution and structure of galaxies, as they can affect the surrounding stars and gas through ‘feedback’, and they are expected to co-evolve with their host galaxies (Kormendy & Ho, 2013).

Refer to caption
Figure 1: WISE2MBH: A simple algorithm that makes use of WISE cataloged data and a spectroscopic redshift to estimate the stellar mass, morphological type, bulge fraction, and MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT of an extragalactic source. Solid (dotted) lines represent the main path to estimate a value (or upper limit), or to reject an object from the algorithm. Orange (blue) boxes show the input (derived) quantities; boxes with both colors can either be provided to or are estimated by the algorithm. Inputs in dashed boxes are optional. WISE magnitudes with their respective mean photometric errors (\upsigmaWX¯subscript\upsigma¯𝑊𝑋\upsigma_{\overline{WX}}start_POSTSUBSCRIPT over¯ start_ARG italic_W italic_X end_ARG end_POSTSUBSCRIPT) are used to generate random normal samples of size 104superscript10410^{4}10 start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT for a Monte Carlo approach to error propagation.

Active galactic nuclei (AGN) are a manifestation of SMBH that are powered by a luminous accretion process at the centers of numerous galaxies. AGN emission can cover the entire electromagnetic spectrum (Padovani et al., 2017). At infrared (IR) wavelengths, the emission is primarily attributed to a toroidal arrangement of dust, which absorbs the radiation emitted by the central accretion disk and re-radiates it at IR (Netzer, 2015; Hickox & Alexander, 2018).

Quasars (QSO) are a type of extremely luminous AGN (with bolometric luminosity, logLbolsubscript𝐿bol\log L_{\text{bol}}roman_log italic_L start_POSTSUBSCRIPT bol end_POSTSUBSCRIPT, in the range of 44 to 48 erg s144 to 48superscript erg s144\text{ to }48\text{ erg s}^{-1}44 to 48 erg s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT) that are powered by high accretion rates onto SMBH (Lbol/LEdd102.9similar-tosubscript𝐿bolsubscript𝐿Eddsuperscript102.9L_{\text{bol}}/L_{\text{Edd}}\sim 10^{-2.9}italic_L start_POSTSUBSCRIPT bol end_POSTSUBSCRIPT / italic_L start_POSTSUBSCRIPT Edd end_POSTSUBSCRIPT ∼ 10 start_POSTSUPERSCRIPT - 2.9 end_POSTSUPERSCRIPT to 101.8superscript101.810^{1.8}10 start_POSTSUPERSCRIPT 1.8 end_POSTSUPERSCRIPT, Kong & Ho, 2018). Given these high accretion rates, QSOs are among the most luminous objects in the universe, making it difficult (though not completely impossible) to discriminate the morphology of their host galaxies at any redshift (Dunlop et al., 2003). QSOs are among the earliest and most distant observable objects (e.g., zsimilar-to𝑧absentz\simitalic_z ∼7.64, Wang et al., 2021), and are important probes of the early universe and the formation and growth of galaxies and their SMBHs (Inayoshi et al., 2020).

The Wide-field Infrared Survey Explorer (WISE, Wright et al., 2010) was a NASA IR--Wavelength astronomical space telescope that surveyed the entire sky in four IR bands, with central wavelengths at 3.4 μ𝜇\muitalic_μm, 4.6 μ𝜇\muitalic_μm, 12 μ𝜇\muitalic_μm, and 22 μ𝜇\muitalic_μm (W1, W2, W3 and W4, respectively). These bands can be used to study several features of a galaxy. The shorter wavelength bands (W1 and W2) predominantly capture emission from stars (e.g., Jarrett et al., 2011, 2012; Norris et al., 2014) and warm dust (e.g., Lyu et al., 2019; Noda et al., 2020; Li & Shen, 2023), providing insights into stellar populations in galaxies (e.g., Kettlety et al., 2018). The longer wavelength bands (W3 and W4) are sensitive to emission from cooler dust, revealing regions of colder dust associated with older stars (e.g., Singh et al., 2021; Li et al., 2023), allow estimation of star formation rates (SFRs, e.g., Lee et al., 2013; Cluver et al., 2017) and potentially highlighting the presence of AGN (e.g., Lyu & Rieke, 2022b; Hviding et al., 2022). By analyzing the relative intensities and the spatial distribution of emission across these bands, it is possible to obtain the temperature, composition, and distribution of a galaxy’s dust and stars, thereby unraveling its evolutionary history and physical characteristics.

One of the most relevant physical characteristics that can be estimated with WISE photometry is the total stellar mass of a galaxy (Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT). As stated by Cluver et al. (2014, hereafter C14) the W1 band is the most sensitive to the light emitted by the bulk of stellar population in galaxies (e.g., Meidt et al., 2012; Norris et al., 2014), thus allowing a determination of the mass attributed to the mass-dominant stellar population by using a mass-to-light (M/L) ratio (e.g., Kettlety et al., 2018). The M/L ratio can be constrained using the W1--W2 color. More recently, Jarrett et al. (2023, hereafter J23) have refined this process, developing a more stringent method to obtain stellar mass estimates from only the W1 band and assuming a global M/L ratio of similar-to\sim0.35 for all galaxies. Since early-type galaxies have higher M/L (similar-to\sim0.8) ratios, WISE-based estimates obtained using individually tailored M/L ratios give results that are significantly different from the W1-only estimate. In instances of low uncertainty in WISE colors, the M/L ratio can be obtained through the use of the W1--W2 or W1--W3 color, and hence better correct for the range of M/L that is observed from early to late-types.

Refer to caption
Refer to caption
Figure 2: Redshift distribution of the WISE2MBH parent sample. Left: Separated by object type. Right: Separated by source extension (as listed in the AllWISE catalog). Galaxies and unknown type sources dominate by a few orders of magnitude over other object types, while point-like (3-0) sources dominate at z greater-than-or-equivalent-to\gtrsim 0.1.

Morphology and its evolution is another relevant property of galaxies and (e.g., Abraham et al., 1994; Abraham & van den Bergh, 2001; Willett et al., 2013). Many studies (e.g., Abraham et al., 1996; Whyte et al., 2002; Pahre et al., 2004) suggest that IR morphological classifications of galaxies can be superior to optical classifications, due to the physical properties that can be studied in the mid-IR, e.g., SFR and stellar populations, which evolve with the morphological type of the galaxy. Using WISE, distinct populations of early-type and late-type galaxies shown clearly different IR colors (e.g., Wright et al., 2010; Lee et al., 2017; Yao et al., 2020). Recently, Jarrett et al. (2019) showed a Hubble sequence-like morphological evolution with the W2--W3 color for a sample of well-known nearby galaxies, where early-type galaxies showed redder (stellar-dominated) colors compared to late-type (ISM + stellar colors) galaxies, with a clear sequence from the early-type to late-type. When combined with Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT estimates (e.g., C14, J23), the most massive galaxies were shown to be dominated by high fractions (0.8absent0.8\geq 0.8≥ 0.8) of spheroid-like galaxies. These authors found the same behavior when exploring the bulge-to-total ratios (B/T𝐵𝑇B/Titalic_B / italic_T): galaxies with B/T𝐵𝑇B/Titalic_B / italic_T 0.9absent0.9\geq 0.9≥ 0.9 are dominantly at W2--W3 1.5absent1.5\leq 1.5≤ 1.5, which was set as the cutoff between spheroid galaxies and intermediate disks. Other studies have also shown that early-type and late-type galaxies showed distinct distributions of W2--W3 color (e.g., Sadler et al., 2014; Cluver et al., 2020), with the former being once again the most massive galaxies. In the sample of Cluver et al. (2020) at logM11.2subscript𝑀11.2\log M_{*}\geq 11.2roman_log italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT ≥ 11.2, the percentage of early-type galaxies reaches 80%similar-toabsentpercent80\sim 80\%∼ 80 %.

Powerful AGN are characterized by their high IR luminosity, resulting in strong WISE detections. WISE can differentiate between these AGN and galaxies using WISE color-color criteria (e.g., Jarrett et al., 2011; Hviding et al., 2022). This differentiation is important as AGN play a crucial role in the evolution of galaxies: they have the capability to heat the surrounding gas and dust, suppress the formation of new stars, and ultimately quench star formation. Stern et al. (2012) proposed a simple W1--W2 cutoff to successfully identify AGN to a depth of W2 15similar-toabsent15\sim 15∼ 15. This cutoff was exploited, improved, and used for the creation of the WISE AGN catalog (Assef et al., 2018) and to derive new criteria to identify low luminosity AGN (logLbolsubscript𝐿bol\log L_{\text{bol}}roman_log italic_L start_POSTSUBSCRIPT bol end_POSTSUBSCRIPT in the range of 40 to 44 erg s140 to 44superscript erg s140\text{ to }44\text{ erg s}^{-1}40 to 44 erg s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT; Hviding et al., 2022).

WISE colors are useful in identifying AGN, being notably good for type 2 obscured AGN (e.g., Hviding et al., 2022), but are poor in identifying relatively weak AGN and highly obscured AGN at modest to large redshifts (z>2absent2>2> 2 e.g., Hickox & Alexander, 2018; Lyu & Rieke, 2022a; Lyu et al., 2023). The latter may constitute a significant fraction (\geq40%) of the QSO population at high redshifts (Lyu et al., 2023). For complete identification of AGN, observations at multiple wavelengths are required (e.g., Yao et al., 2022). In this work AGN identification is not a goal in itself; it is used to evaluate and correct AGN contamination to WISE W1 and W2 based host galaxy stellar mass estimates. Since highly obscured (and relatively weak) AGN are expected to contribute only a small fraction of the total (AGN plus galaxy) flux at wavelengths shorter than 5µm (e.g., Lyu & Rieke, 2022a, Fig. 6) they are not expected to significantly contaminate the host galaxy stellar mass estimates calculated here.

Empirically, a strong correlation is observed between the MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT and the bulge mass (MBulgesubscript𝑀BulgeM_{\text{Bulge}}italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT) in elliptical galaxies, as well as in spiral galaxies with pseudo-bulges and classical bulges (e.g., Häring & Rix, 2004; Kormendy & Ho, 2013; Schutte et al., 2019) with previous theoretical approaches supporting this idea (e.g., Croton, 2006). Combining this with the observed evolution of B/T𝐵𝑇B/Titalic_B / italic_T with the TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT of a galaxy (e.g., Wang et al., 2019; Dimauro et al., 2022; Quilley & de Lapparent, 2022), it is possible to exploit the WISE magnitudes and colors to estimate MBulgesubscript𝑀BulgeM_{\text{Bulge}}italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT and then MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT, from Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT.

Combining all of the above, WISE photometry can be used to distinguish between different types of extragalactic objects, e.g. QSOs, ULIRGS, AGN, and galaxies, and then for galaxies, to estimate the total stellar mass and morphology (using colors between the W1, W2, and W3 bands, e.g., Wright et al., 2010, C14, J23) and thus estimate MBulgesubscript𝑀BulgeM_{\text{Bulge}}italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT and MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT.

In this work, we introduce a new algorithm, WISE2MBH (see Fig. 1), which takes advantage of existing relationships derived from WISE data, the proportionality between the masses of the galaxy bulge and its SMBH, and new scaling relationships derived here. The WISE2MBH algorithm is capable of classifying regular galaxies, estimating their morphological type, and thus their bulge to total mass ratio, and estimating the mass of the SMBH. Additionally, it can identify QSOs from WISE colors; due to the AGN contamination, the algorithm provides upper limit values for these. This algorithm and the resulting sample of SMBH masses are relevant to the study of individual sources using powerful instruments such as the EHT and its next-generation upgrade (ngEHT, Johnson et al., 2023; Doeleman et al., 2023), as well as to studies of SMBH populations and evolution. In Sect. 2 we introduce the data used as input to the algorithm, in Sect. 3 to 9 we explain, in detail, the main steps of the algorithm. In Sect. 10 we present the main results and statistics of the WISE2MBH final sample generated by the algorithm, in Sect. 11 we briefly discuss the results, their relevance, main assumptions and limitations, and lastly in Sect. 12 we present our conclusions.

Throughout this work, we use Vega magnitudes and adopt the cosmological parameters of Planck Collaboration et al. (2020): Ωm=0.31subscriptΩm0.31\Omega_{\text{m}}=0.31roman_Ω start_POSTSUBSCRIPT m end_POSTSUBSCRIPT = 0.31, ΩΛ=0.69subscriptΩΛ0.69\Omega_{\Lambda}=0.69roman_Ω start_POSTSUBSCRIPT roman_Λ end_POSTSUBSCRIPT = 0.69, and H0=67.66 km s1Mpc1subscript𝐻067.66superscript km s1superscriptMpc1H_{0}=67.66\text{ km s}^{-1}\text{Mpc}^{-1}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 67.66 km s start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT.

2 Data

Table 1: Statistics of the WISE2MBH parent sample.
Object type N𝑁Nitalic_N z𝑧zitalic_z MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT ex W1 W2 W3
Galaxy 601658 99.80 60.39 30.95 / 69.05 99.98 / <0.01 99.49 / 0.51 69.98 / 29.97
QSO 16200 99.27 55.98 0.81 / 99.18 99.96 / <0.01 99.87 / 0.13 95.41 / 4.54
RS 9360 71.83 3.31 0.33 / 99.66 99.83 / 0.13 99.48 / 0.52 67.13 / 32.83
Unknown 1468364 98.85 59.40 5.95 / 94.05 99.81 / 0.15 99.40 / 0.60 48.91 / 51.05
Total 2095582 99.01 59.41 13.06 / 86.94 99.86 / 0.12 99.43 / 0.57 55.40 / 44.56
Total ETHER 2202339 –.– –.– –.– / –.– –.– / –.– –.– / –.– –.– / –.–

2.1 ETHER sample

The Event Horizon and Environs sample (ETHER) aims to be the definitive sample and database from which to choose targets for the EHT and ngEHT. The database, its algorithms, and references for data sources, was first presented in Ramakrishnan et al. (2023, hereafter R23). ETHER has since been expanded by including the following literature and database samples: (a) all galaxies in the HyperLeda database (Makarov et al., 2014) with recessional velocity (defined nonrelativistically, i.e., z=v/c𝑧𝑣𝑐z=v/citalic_z = italic_v / italic_c) less than 100.000 km/s; (b) the Million Quasar catalog (Milliquas, Flesch, 2023); (c) the Veron-Cetty and Veron AGN catalog (Version 13, Véron-Cetty & Véron, 2010) with updates from (Flesch, 2013); (d) the 2M++ redshift survey (Lavaux & Hudson, 2011); (e) the ROMA BZCAT blazar catalog (5th Edition, Massaro et al., 2015); (f) the 2MRS sample (Huchra et al., 2012); and (g) the similar-to\sim million galaxies with SDSS and WISE photometry in the catalog of Chang et al. (2015). Several other individual black hole masses and radio fluxes from the literature have also been incorporated. Full details on the updated ETHER sample will be published in \textcolorblueNagar et al. (\textcolorbluein prep).

Given the above updates and after consolidating multiple entries from the same source, the ETHER sample currently contains 3.8 million extragalactic sources, of which 233 have MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT measurements, including methods such as stellar dynamics (e.g., Thater et al., 2019), gas kinematics (e.g., Boizelle et al., 2021), and reverberation mapping (Bentz & Katz, 2015), and similar-to\sim860,000 have MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates. Of the estimates, similar-to\sim331,000 are from the M-sigma relationship, similar-to\sim525,000 are from ‘single-epoch’ spectroscopy and using scaling relationships from reverberation mapping (e.g., Dalla Bontà et al., 2020; Rakshit et al., 2021), similar-to\sim3,000 are from M-Lbulge estimations, and similar-to\sim600 are from other ‘fundamental-plane’ type relationships (see R23 for more details).

Astroqueries to the NASA Extragalactic Database (NED) and SIMBAD are used to incorporate and update positions, spectroscopic redshifts (thus luminosity and angular distances for objects at D \geq 50 Mpc), object types, morphological types (TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT), AGN classifications, and radio to X-ray fluxes. The object type comes directly from NED source classifications, which have shown great precision (80%absentpercent80\geq 80\%≥ 80 %) for nearby sources (D11𝐷11D\leq 11italic_D ≤ 11 Mpc, Kuhn et al., 2022). The object types in ETHER, incorporated from NED, are as follows: ‘Galaxies’ cover regular galaxies over the range from elliptical to spiral galaxies, ‘QSOs’ denote galaxies with significant nuclear activity thus luminosity, ‘Radio Sources’ (RS) refer to sources detected in the radio regime, without any distinction between galaxies or QSOs. If a source lacks any NED classification, it is designated as an ‘Unknown’ object type. Through visual inspection of sub-samples using the Sloan Digital Sky Survey (SDSS), and the color-color criteria using WISE colors (see Section 4), we established that this NED source classification was sufficiently accurate for our needs.

Morphological types (TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT) are available for a significant fraction of the sample (22.4%percent22.422.4\%22.4 % via NED and SIMBAD queries and from individual samples e.g., Huchra et al., 2012; Makarov et al., 2014). These are predominantly E (--6 to 44-4- 4) and Sc (4–5), representing 37.3%percent37.337.3\%37.3 % and 23.8%percent23.823.8\%23.8 % of all sources with available morphology, respectively. The large fractions in these two TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT bins is primarily due to the binary classification of TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT in some ingested samples (e.g., Dobrycheva, 2013). When available, this TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT is used as an input to the algorithm; when not available, the TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT is estimated from the W2--W3 color (see Section 5). All ETHER sources at z0.5𝑧0.5z\leq 0.5italic_z ≤ 0.5 (similar-to\sim2.3 M) form the parent sample for this work.

2.2 WISE catalogues

The WISE mission, funded by NASA as a Medium-Class Explorer mission, features a space-based infrared telescope with megapixel cameras cooled by a two-stage solid hydrogen cryostat. This telescope conducted an all-sky survey which simultaneously captured images in four broad spectral bands: W1, W2, W3, and W4, centered on 3.4, 4.6, 12, and 22 μ𝜇\muitalic_μm with an angular resolution of 6.1″, 6.4″, 6.5″and 12.0″, and achieving sensitivities of 0.08, 0.11, 1 and 6 mJy, respectively. In this work, we use both the AllWISE catalog (Cutri et al., 2021) and the WISE Extended Source Catalog (WXSC, Jarrett et al., 2019). The latter includes mid-infrared photometry and measured global properties of the 100 largest (in angular size) galaxies in WISE.

2.2.1 AllWISE catalog

The AllWISE catalog, which combines data from the WISE cryogenic and NEOWISE (Mainzer et al., 2011) catalogues, contains almost 750 million sources, including galaxies, stars, brown dwarfs, and asteroids, making it one of the most comprehensive IR catalogues ever created. This catalog provides photometric quality flags (qph) for a given detection for all sources, making it the best option to extract data for our purposes; we accepted only sources with quality A, B, C or U in the first three bands, which translates to detections with signal-to-noise ratios (S/N) in the ranges of S/N10S/N10\text{S/N}\geq 10S/N ≥ 10, 10>S/N310S/N310>\text{S/N}\geq 310 > S/N ≥ 3, 3>S/N23S/N23>\text{S/N}\geq 23 > S/N ≥ 2 and S/N<2S/N2\text{S/N}<2S/N < 2, respectively, with the last flag considered an upper limit with 95%percent9595\%95 % confidence. Extension flags (ex) are also provided in this catalog, allowing easy differentiation between extended and point sources. This flag is directly related to the 2MASS Extended Source Catalog (XSC, Jarrett et al., 2000), ranging from 5 for completely extended sources to 0 for point sources. This flag contributes to the overall quality flag of our estimates and thus allows the selection of high-quality sub-samples (e.g., using only extended sources) from our overall sample. In this work, sources with ex equal to 4 or 5 are considered extended, while those with values 0 to 3 are considered point sources. Thus, in our definition, a WISE extended source is one whose position falls within 5″of the central position of a 2MASS XSC source, and a WISE point source is one that is not associated with a 2MASS XSC source, or is offset by 5absent5\geq 5\arcsec≥ 5 ″ from the central position of a 2MASS XSC source even if it falls within its isophotes.

2.2.2 WISE Extended Source Catalog (WXSC)

The AllWISE photometric catalogues are optimized for the characterization of point-sources111AllWISE Explanatory Supplement: Cautionary Notes. For highly extended sources, source detection and extraction may not include all the extended components of the galaxy in a single source, thus leading to an underestimation of true brightness.

The WXSC (Jarrett et al., 2019) provides full source characterization in all four WISE bands for the 100 largest galaxies (in angular extent) in the sky. WXSC uses new mosaics with native resolution allowing for precise measurements of both the target galaxy and its local environment. These mosaics are further resampled with 1" pixels, greatly enhancing analysis accuracy. When a galaxy is present in WXSC, we use these magnitudes instead of those from AllWISE.

2.3 WISE2MBH parent sample

All sources in the ETHER sample at z0.5𝑧0.5z\leq 0.5italic_z ≤ 0.5 were cross-matched with the AllWISE and WXSC catalogues with a cone search radius of 3″, preferring WXSC matches over AllWISE to consider better photometric values. After deleting duplicates, 95.2%percent95.295.2\%95.2 % of the ETHER targets matched the two WISE catalogues.

Given that the AllWISE catalog has a high density of sources (on average 5similar-toabsent5\sim 5∼ 5 sources per arcmin2) and the ETHER sample at z <<< 0.5 is also relatively large (similar-to\sim2.3 million galaxies, with a bias towards northern galaxies), it is important to address the probability of false matches in a 333\arcsec3 ″ radius. We generate random samples of 105superscript10510^{5}10 start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT right ascension and declination coordinate pairs, divide them into subsamples in (l𝑙litalic_l between ±15plus-or-minus15\pm 15± 15) and outside the Galactic plane, and test for random matches and possible contamination of W1 magnitudes due to confusion. When matching these random samples to the AllWISE catalog, we find a 4%percent44\%4 % (5%similar-toabsentpercent5\sim 5\%∼ 5 %) chance of matching a random position source out of the plane (in the plane) of the galaxy with an AllWISE source. The distributions of W1 magnitudes for real ETHER galaxies matched to WISE are consistently brighter than the magnitudes obtained in the random matches. The W1 magnitude distributions of both AllWISE sources matched to the random position catalogues (μ=17𝜇17\mu=17italic_μ = 17 mag) and to the ETHER galaxies (μ=14𝜇14\mu=14italic_μ = 14 mag), are significantly fainter than those of Galactic AGB stars in AllWISE (μ=8𝜇8\mu=8italic_μ = 8 mag; Suh, 2021).

The differentiation between early-type galaxies and stars in AllWISE catalogues, in the absence of redshifts, remains a topic of ongoing discussion. Machine learning classification techniques have exhibited significant achievements in this area (e.g., Kurcz et al., 2016). In our case we do not expect to be affected by this problem for two reasons (a) ETHER has both precise (typically sub-arcsec accuracy) coordinates and includes only known extragalactic objects (when redshifts exist these are spectroscopic); (b) the low probability of random matches as explored in the previous paragraph.

A summary of the resulting WISE2MBH parent sample is given in Table LABEL:table:summary where we list the total number of sources of each object type (N𝑁Nitalic_N), and the total number of sources of the WISE2MBH parent sample and ETHER. The other columns represent a percentage of completeness. The distribution of sources in redshift by object type and ex can be seen in Fig. 2.

3 K-corrections for WISE magnitudes and colors

Jarrett et al. (2023) used spectral energy distribution (SED) templates of galaxies with different morphological types (e.g., Brown et al., 2014) to calculate K-corrections for the W1 magnitude, and the W1--W2 and W2--W3 colors, for each available TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT over the range z=03𝑧03z=0-3italic_z = 0 - 3. These K-correction lookup tables are shown in Figs. 9 and 10 of J23, but only for z0.5𝑧0.5z\leq 0.5italic_z ≤ 0.5.

Due to the uncertain morphological type and AGN contamination of objects classified as ‘RS’, and given the tendency of QSOs to continue to reside in the AGN/QSO area of a WISE color-color diagram independent of redshift (e.g., Mateos et al., 2012; Stern et al., 2012), we do not apply K-corrections to the RS and QSO object types.

Of the remaining objects (i.e., galaxies or unknown) only those which reside in the ‘Estimate zone’ of Fig. 3 (see Sect. 4) are K-corrected. This selection is made to avoid a false K-correction for AGN-contaminated sources.

Our algorithm applies the J23 K-corrections for three TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT ranges; ellipticals (--5 to --3), lenticulars (from --3 to 0) and spirals (from 0 to 8). The existing TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT of an individual object, if available, is used to select between these three lookup tables. If no TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT is available, the lookup table to be used is decided as follows. Mateos et al. (2012, see their Fig. 5) use galaxy templates of various morphological types, and with varying amounts of AGN contamination, to derive the redshift evolution of a galaxy’s WISE colors over the redshift range 0 and 2. Although the WISE colors vary with redshift (especially for late-type galaxies), they find that, in the absence of a large AGN contamination, WISE colors can distinguish between early-type and late-type galaxies over this full redshift range. We thus define the following cutoff in the observed W2--W3 color to distinguish between elliptical and spiral galaxies (without AGN contamination) up to z1.5similar-to𝑧1.5z\sim 1.5italic_z ∼ 1.5,

W2W3Limit=2.5(log(FW3FW2)log(fW3fW2))subscriptW2W3Limit2.5subscript𝐹W3subscript𝐹W2subscript𝑓W3subscript𝑓W2\text{W2$-$W3}_{\text{Limit}}=-2.5\left(\log\left(\frac{F_{\text{W3}}}{F_{% \text{W2}}}\right)-\log\left(\frac{f_{\text{W3}}}{f_{\text{W2}}}\right)\right)W2 - W3 start_POSTSUBSCRIPT Limit end_POSTSUBSCRIPT = - 2.5 ( roman_log ( divide start_ARG italic_F start_POSTSUBSCRIPT W3 end_POSTSUBSCRIPT end_ARG start_ARG italic_F start_POSTSUBSCRIPT W2 end_POSTSUBSCRIPT end_ARG ) - roman_log ( divide start_ARG italic_f start_POSTSUBSCRIPT W3 end_POSTSUBSCRIPT end_ARG start_ARG italic_f start_POSTSUBSCRIPT W2 end_POSTSUBSCRIPT end_ARG ) ) (1)

where F0subscript𝐹0F_{0}italic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT are the zero magnitude flux density and observed flux density in the respective band, respectively. We set the logarithmic ratio of observed flux densities in Eq. 1 to 0.10.1-0.1- 0.1, since Fig. 5 of Mateos et al. (2012) shows that this value clearly separates elliptical and spiral galaxies over their full redshift range. This translates to a cutoff value of W2W3Limit1.58similar-tosubscriptW2W3Limit1.58\text{W2$-$W3}_{\text{Limit}}\sim 1.58W2 - W3 start_POSTSUBSCRIPT Limit end_POSTSUBSCRIPT ∼ 1.58 to distinguish elliptical and spiral galaxies (values larger than this imply a spiral galaxy). Note that this cutoff is similar to the values previously derived by Jarrett et al. (2019). The Lenticular lookup table is not used in this case, since lenticulars and spirals are not easily distinguishable in WISE color-color diagrams (see e.g., Sect. 5).

These K-correction look up tables have been shown to be reliable for galaxies at z0.5𝑧0.5z\leq 0.5italic_z ≤ 0.5 (e.g., Jarrett et al., 2023; Karademir et al., 2023).

Refer to caption
Figure 3: WISE color-color plot showing the location of our sample objects and defining the areas over which the stellar mass estimation is an upperlimit (Upper limit zone), is not estimated by the algorithm (Reject zone), or estimated by the algorithm (Estimate zone); these zones are separated by black dashed lines. Note that QSO and RS object types are considered as stellar mass upper limits independent of whether they fall in the "Upper limit" or "Estimate" zone. For clarity, we plot separately the point sources (top panel) and extended sources (bottom panel; see Sect. 2). The background-filled colors and labels are from Fig. 12 of Wright et al. (2010), and are shown for reference. Contours, in colors following the color legend on top of the figure, show the number density of object types in our sample. In the bottom panel, the object types QSO and RS are shown as colored points (instead of contours) of the corresponding color.

4 Distinguishing Galaxies and powerful AGN/QSOs using WISE

If an object type is not already available (which is the case for 76%percent7676\%76 % of the parent sample; Sect. 2), the algorithm uses WISE color-color criteria to distinguish galaxies from powerful AGN/QSOs, and to determine whether the derived values are estimates or upper limits (due to contamination from an AGN).

The algorithm is first tested on targets with an available object type. As mentioned in Sect. 3, we use cutoffs in the observed W1--W2 and W2--W3 colors to identify objects that will be K-corrected. The same cutoffs are now applied to the K-corrected colors to determine if this target falls within the estimate, upper limit or rejection zones in the WISE color-color plot shown in Fig. 3. These objects then, respectively, follow the estimate, upper limit or reject paths of the algorithm shown in Fig. 1 and described in Sect. 7.

Objects with object type QSO and RS are likely to have significant AGN contamination in their W1 mag, which will likely increase their Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT estimates, independent of their extension in WISE. For these objects, we therefore immediately consider the WISE2MBH Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT values as upperlimits. The horizontal dashed line in Fig. 3, which separates the upper limit zone from the estimate zone, is a combination of the widely used W1W2=0.8W1W20.8\text{W1$-$W2}=0.8W1 - W2 = 0.8 limit to separate AGN/QSOs from galaxies (e.g., Stern et al., 2012; Michalik & Lindegren, 2016; Guo et al., 2018), together with a wedge region between W2--W3 similar-to\sim2.2 – 4.4 motivated by previously defined AGN/QSOs regions (Jarrett et al., 2011; Hviding et al., 2022). We slightly modified the wedge region defined by Jarrett et al. (2011) by optimizing the WISE color classification of known QSOs in our sample. The wedge region we chose is defined as follows

2.2<W2W3<4.4W1W2>(0.05[W2W3]+0.38).2.2W2W3expectation4.4W1W20.05delimited-[]W2W30.382.2<\text{W2$-$W3}<4.4\wedge\text{W1$-$W2}>(0.05[\text{W2$-$W3}]+0.38).2.2 < W2 - W3 < 4.4 ∧ W1 - W2 > ( 0.05 [ W2 - W3 ] + 0.38 ) . (2)

which now fully extends to the blue-side of the diagram thanks to the W1W2=0.8W1W20.8\text{W1$-$W2}=0.8W1 - W2 = 0.8 cutoff. With this overall cut, almost 95%percent9595\%95 % of the known QSOs in our sample are classified as QSOs by this color criterion.

The vertical limit at W2W3=4.4W2W34.4\text{W2$-$W3}=4.4W2 - W3 = 4.4 in Fig. 3, which separates the Reject zone from the other two zones, is also set by us. At W2W34.4W2W34.4\text{W2$-$W3}\geq 4.4W2 - W3 ≥ 4.4, the WISE2MBH TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT (see Sect. 5) is 8 or larger, and the estimated B/T𝐵𝑇B/Titalic_B / italic_T would be very low (Sect. 6). While this would be correct for, e.g., irregular galaxies, it is incorrect for, e.g., extremely dusty, hybrid-starburst-AGN galaxies (Tsai et al., 2015) or newborn AGN (\textcolorblueArevalo et al. in prep)222P. Arevalo showed in XVII LARIM on December 2023, the evolution in W1--W2 color for a typical star-forming non-active galaxy to AGN-like in a time-span of years, together with spectroscopic data from SDSS and SOAR that support the idea of new AGN activity in ZTF20aaglfpy, which was also classified as a type I AGN by the ALeRCE light curve classifier..

The WISE color-color distributions of different object types (including unknown types) in our sample are shown in the top (WISE point sources) and bottom (WISE extended sources) panels of Fig. 3.

The top panel of Fig. 3 shows that most QSOs and RS reside at high W1--W2 colors, in an area populated by a variety of AGN and also ULIRGs, making it a challenge to correctly identify them using only WISE data. On the other hand, known galaxies, even if point-like for WISE, show clear overdensities in the region where elliptical and spiral galaxies are expected to reside.

For the bottom panel of Fig. 3 (WISE extended sources), all (99%similar-toabsentpercent99\sim 99\%∼ 99 %) known galaxies lie in the Estimate zone, at WISE colors expected of elliptical and spiral galaxies, and away from the regions of starbursts, LINERs and (U)LIRGs. Only a few QSOs and RS are present in this figure: the QSOs (orange dots) do not clump in the expected QSO area, but are instead distributed over a large range of colors, overlapping with regions of galaxies, ULIRGs, and Seyferts; RS (green dots) are predominantly situated in the galaxies area, with a few in the LINER area. This tendency for both QSOs and RS to not only reside in the expected wedge region described above is not unexpected; since these extended sources are most likely weaker AGNs whose emission is not sufficient to change the galaxy color so as to be classified as a QSO. This was shown in Mateos et al. (2012) who tested different percentages of AGN activity, and showed that only powerful AGNs (80%percent8080\%80 % fraction) up to z=2𝑧2z=2italic_z = 2 reside in an area similar to the wedge shaped region defined here or in previous studies.

The bulk of sources with unknown type (76%percent7676\%76 % of the parent sample; red contours in both panels) sit in the Estimate zone, mostly following the expected distribution of galaxies. The clear separations seen for known object types give us confidence that the classification of these unknown types as galaxies is reliable.

5 Using W2-W3 color to estimate T-type

While TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT is available for some (22%similar-toabsentpercent22\sim 22\%∼ 22 %) sources in the WISE2MBH parent sample, a reliable estimation of this is required for most of the sample. To obtain these TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT, we exploit the fact that the W2--W3 color shows clearly separated regions where elliptical and spiral galaxies reside (e.g., Wright et al., 2010; Jarrett et al., 2019; Cluver et al., 2020). Although these regions partially overlap with other classifications based on star formation activity, the trend is sufficient to estimate the morphology of the galaxy, i.e., TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT.

Refer to caption
Figure 4: TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT as a function of W2--W3 color for galaxies in the 2MRS sample. For each TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT bin we plot the median value (black circle) and one sigma dispersion (horizontal bar) of the W2--W3 colors of galaxies in the bin. Distributions of W2--W3 and TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT are shown in the panels at the top and right of the figure, respectively. The red line in the right panel marks the threshold number of galaxies in a bin in order for that bin’s median to be used for the fit (filled black circles in the main panel). The black dot-dashed line shows the best fit logit function to the filled black circles: this is used for the W2--W3 to TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT conversion when TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT is previously unknown. The estimated TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT is limited to the range --5 and 8; when a galaxy’s W2-W3 color lies beyond the range of the logit function shown, the TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT is clipped at these values. The colored areas distinguishing morphologies listed in the inset are from Jarrett et al. (2019). Given the similarity of the color dispersions in the three bins at each extreme end of the x𝑥xitalic_x-axis, we define two vertical dashed lines which delineate galaxies we refer to as high bulge fractions (HBF; bulge fractions between 0.4 and 1) and low bulge fractions (LBF; bulge fractions between 0.1 and 0.3).
Refer to caption
Figure 5: Distributions of the bulge-to-total mass ratio (B/T𝐵𝑇B/Titalic_B / italic_T) as a function of TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT for different literature samples. A decreasing exponential is fitted to the data points: the gray dashed curve is the original fit and the black dashed curve is the fit when one fixes B/T=1𝐵𝑇1B/T=1italic_B / italic_T = 1 for TType=subscript𝑇TypeT_{\text{Type}}=-italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT = -5. For clarity, small shifts on the x-axis are used to avoid overlapping symbols and error bars. Data points and the horizontal pink dashed line are from Caramete & Biermann (2010); Mendel et al. (2014); Sofue (2016); Morell et al. (2020); Gao et al. (2020) following the colors listed in the inset.

Our conversion of W2--W3 color to TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT is trained using similar-to\sim18,000 galaxies from the 2MRS catalog (Huchra et al., 2012) for which manually classified TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT are available. The median W2--W3 colors of 2MRS galaxies in each TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT bin between 55-5- 5 and 8888 (open and filled black circles in the main panel of Fig. 4) show a clear S-shape curve. The number of galaxies in each TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT bin is shown in the right histogram of Fig. 4. Given the S-shape, i.e., the lack of differentiation in the x𝑥xitalic_x-axis for the three to four bins at each extreme end of W2-W3 colors, a sufficiently large number of objects per bin is required for a clear result. We therefore use statistical power analysis to define the required sample size threshold; details of this analysis can be found in Appendix B. For this power analysis, a power P=0.8𝑃0.8P=0.8italic_P = 0.8 and a significance threshold α=0.05𝛼0.05\alpha=0.05italic_α = 0.05 were assigned. The effect size (Essubscript𝐸𝑠E_{s}italic_E start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT) was calculated for each set of two consecutive bins in W2--W3 (in order of increasing TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT), and the resulting median Essubscript𝐸𝑠E_{s}italic_E start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT (0.15) implies that the sample size per bin must be N700greater-than-or-equivalent-to𝑁700N\gtrsim 700italic_N ≳ 700. Therefore, all bins with a sample size greater than this N𝑁Nitalic_N were accepted (filled black circles in Fig. 4).

A logit function was fit to these accepted median values, providing us with our W2--W3 to TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT conversion. Since the logit function’s domain goes from 0 to 1, the W2--W3 color is shifted and normalized (W2W3SNsubscriptW2W3SN\text{W2$-$W3}_{\text{SN}}W2 - W3 start_POSTSUBSCRIPT SN end_POSTSUBSCRIPT) before fitting:

W2W3SN=W2W30.752.71TType=(1.21±0.01) logit(W2W3SN)+(1.36±0.02)subscriptW2W3SNW2W30.752.71subscript𝑇Typeplus-or-minus1.210.01 logitsubscriptW2W3SNplus-or-minus1.360.02\begin{split}&\text{W2$-$W3}_{\text{SN}}=\frac{\text{W2$-$W3}-0.75}{2.71}\\ &T_{\text{Type}}=(1.21\pm 0.01)\text{ logit}\left(\text{W2$-$W3}_{\text{SN}}% \right)+(1.36\pm 0.02)\end{split}start_ROW start_CELL end_CELL start_CELL W2 - W3 start_POSTSUBSCRIPT SN end_POSTSUBSCRIPT = divide start_ARG W2 - W3 - 0.75 end_ARG start_ARG 2.71 end_ARG end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT = ( 1.21 ± 0.01 ) logit ( W2 - W3 start_POSTSUBSCRIPT SN end_POSTSUBSCRIPT ) + ( 1.36 ± 0.02 ) end_CELL end_ROW (3)

this logit function is shown as a dashed-dotted line in Fig. 4. Despite leaving out three late-type TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT bins from the fit (TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT = 6, 7 and 8), the logit function fits almost perfectly to all medians.

The morphological limits of Fig. 10 of Jarrett et al. (2019) are presented for comparison as colored regions in the figure. While the overall fit is S-shaped, we find two clearly separated regions in the graph: the high bulge fraction (HBF) region, which is delimited by the 84thth{}^{\text{th}}start_FLOATSUPERSCRIPT th end_FLOATSUPERSCRIPT percentile values of the bins centered on TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT = 55-5- 5 to 33-3- 3, and the low bulge fraction (LBF) region, which is delimited by the 16thth{}^{\text{th}}start_FLOATSUPERSCRIPT th end_FLOATSUPERSCRIPT percentile values of the bins centered on TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT = 7777 and 8888. Our HBF region limit (W2W3=1.58W2W31.58\text{W2$-$W3}=1.58W2 - W3 = 1.58; vertical dashed line in the figure) is similar to the cutoff used by Jarrett et al. (2019) to distinguish between spheroids and intermediate disks (the division between pink and green regions in the figure), and also similar to the value of W2W3LimitsubscriptW2W3Limit\text{W2$-$W3}_{\text{Limit}}W2 - W3 start_POSTSUBSCRIPT Limit end_POSTSUBSCRIPT which we use to classify galaxies with unknown morphological type into elliptical and spiral galaxies in order to select the K-correction lookup table to be used (see Eq. 1 and Sect. 3).

The logit function in Eq. 3 is used whenever a source does not have an available TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT or if the available TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT comes from a binary classification (e.g., Dobrycheva, 2013). The WISE2MBH algorithm uses TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT in the range 55-5- 5 to 8888. Available TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT values outside this range are clipped to the closest limit value, in case these really define a morphology, e.g., 99-9- 9 is often used to define a QSO (ZCAT convention), so those values are not clipped. If a measured W2--W3 color is outside the range of the logit function presented here, the estimated TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT is also clipped to the closest limit value TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT. This is most relevant for elliptical galaxies, whose W2--W3 colors are often less than 0.70.70.70.7, which the algorithm converts to TType=5subscript𝑇Type5T_{\text{Type}}=-5italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT = - 5.

This TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT estimator in WISE2MBH is an auxiliary function: it prefers an input value but will provide a WISE-based value when necessary. The S-shape curve makes the classifications at extreme TTypessubscript𝑇TypesT_{\text{Types}}italic_T start_POSTSUBSCRIPT Types end_POSTSUBSCRIPT very uncertain at the moment of distinguishing between consecutive TTypessubscript𝑇TypesT_{\text{Types}}italic_T start_POSTSUBSCRIPT Types end_POSTSUBSCRIPT, but the distinction between bulge-dominated and disk-dominated galaxies is clear.

Refer to caption
Figure 6: WISE2MBH Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT compared to the low redshift (z0.5𝑧0.5z\leq 0.5italic_z ≤ 0.5) galaxy samples of Chang et al. (2015, Top left) and Mendel et al. (2014, Top right) and the SDSS Value Added catalogues of the Portsmouth (Maraston et al., 2013, Middle), Granada (Chen et al., 2012, Bottom right) and Wisconsin (Montero-Dorta et al., 2016, Bottom left) groups. The black dashed line shows the line of equality and red dots with error bars represent the median and 1σ𝜎\sigmaitalic_σ dispersion of the difference between the respective masses for bins of 0.05 in redshift. Colors represent the counts following the color bar to the right of each panel.

6 Bulge-to-Total ratio from T-Type

The morphological type of a galaxy within the Hubble sequence has been shown to be a good proxy of B/T𝐵𝑇B/Titalic_B / italic_T. This inverse behavior (recently discussed in Quilley & de Lapparent, 2022, 2023) shows that early-type galaxies tend to be almost pure bulges (B/T1similar-to𝐵𝑇1B/T\sim 1italic_B / italic_T ∼ 1), while very late-type galaxies and irregulars tend to have small to null bulge fractions (B/T0.01similar-to𝐵𝑇0.01B/T\sim 0.01italic_B / italic_T ∼ 0.01). This inverse behavior also supports the posited scenarios of galaxy bulge growth via mergers: late-type galaxies merge consecutively until lenticular, elliptical, and finally, the brightest cluster galaxies (BCGs) are formed (Edwards et al., 2020), leading not only to the formation of the most massive galaxies (Bluck et al., 2014), but also to the most massive SMBHs (Mezcua et al., 2018).

By nature, elliptical galaxies are expected to have quenched their star formation, leading to a decrease in their SFR and specific SFR (sSFR), despite environmental effects (e.g., Casado et al., 2015). Recently, Ge et al. (2018) showed that galaxies with lower sSFR tend to be more massive and have higher B/T𝐵𝑇B/Titalic_B / italic_T (0.7absent0.7\geq 0.7≥ 0.7) compared to galaxies with higher sSFR, and also found a trend with galaxy age where the oldest galaxies have higher B/T𝐵𝑇B/Titalic_B / italic_T. Morell et al. (2020) showed similar results, showing that their passive galaxy sample (made up of 70% ellipticals and 15% lenticulars) is the one with higher B/T𝐵𝑇B/Titalic_B / italic_T (0.7similar-toabsent0.7\sim 0.7∼ 0.7).

Massive elliptical galaxies are the most relevant sources for future EHT observations, and while varying their estimated B/T𝐵𝑇B/Titalic_B / italic_T between 0.8–1 does not considerably affect the final MBulgesubscript𝑀BulgeM_{\text{Bulge}}italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT estimate, a misclassification of TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT could result in the incorrect use of a spiral-like B/T𝐵𝑇B/Titalic_B / italic_T (0.2absent0.2\leq 0.2≤ 0.2), leading to an incorrectly low MBulgesubscript𝑀BulgeM_{\text{Bulge}}italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT, thus MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT (Bluck et al., 2014). Despite many references pointing to a B/T0.8𝐵𝑇0.8B/T\leq 0.8italic_B / italic_T ≤ 0.8 for early-type galaxies (e.g., Ge et al., 2018; Morell et al., 2020), we will impose a limit of B/T=1𝐵𝑇1B/T=1italic_B / italic_T = 1 for TType=5subscript𝑇Type5T_{\text{Type}}=-5italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT = - 5 (e.g., Caramete & Biermann, 2010; Quilley & de Lapparent, 2023), with an exponential decrease with TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT as shown in Fig. 5. We note that Caramete & Biermann (2010) derived correction factors to convert near-IR luminosities to MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT for galaxies of a variety of TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT in the nearby universe (z0.025𝑧0.025z\leq 0.025italic_z ≤ 0.025) with similar impositions for elliptical galaxies; our B/T𝐵𝑇B/Titalic_B / italic_T ratios are analogous to their conversion factors but calibrated with a larger sample.

We calibrate the B/T𝐵𝑇B/Titalic_B / italic_T ratio as a function of TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT using several samples from the literature. Mendel et al. (2014) provide total, bulge, and disk masses, for a large sample of SDSS galaxies. Their values, combined with TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT from ETHER, give us distributions of B/T𝐵𝑇B/Titalic_B / italic_T over a wide range of TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT, although only 55-5- 5 and 5555 have enough statistics to be considered robust. The ETHER TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT unfortunately comes primarily from the binary classification of Dobrycheva (2013), so we expect that the relationship between TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT and B/T𝐵𝑇B/Titalic_B / italic_T is biased, i.e., underestimated for ellipticals and overestimated for spirals. Sofue (2016) provide both bulge and disk masses for a small sample of nearby galaxies (z0.03𝑧0.03z\leq 0.03italic_z ≤ 0.03) for which we obtained TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT from NED. Morell et al. (2020) provide an average value of B/T𝐵𝑇B/Titalic_B / italic_T for a sample of passive galaxies, which are composed of specific fractions of ellipticals, lenticulars and spirals, mostly dominated by the first two. The TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT value we used in this case is weighted by these fractions. From Gao et al. (2020) we take values of B/T𝐵𝑇B/Titalic_B / italic_T only for TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT equal to 11-1- 1, 1111 and 2222 (S0, Sa and Sab) and consider TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT equal to 8888 (Sdm) as a limit to secure reasonable bulge masses even for very late-type galaxies.

A plot of TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT as a function of B/T𝐵𝑇B/Titalic_B / italic_T for all these samples is shown in Fig. 5. A direct fit to all these points is shown with the dashed gray line in the figure. When we force a value of B/T=1𝐵𝑇1B/T=1italic_B / italic_T = 1 for TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT = 55-5- 5, our final fit is functionally similar to that of Caramete & Biermann (2010, Fig. 1), but now for significantly larger and more recent datasets. The final fit is,

B/T=0.05+0.36(7.720.1TType)𝐵𝑇0.050.36superscript7.720.1subscript𝑇TypeB/T=0.05+0.36\left(7.72^{-0.1\cdot T_{\text{Type}}}\right)italic_B / italic_T = 0.05 + 0.36 ( 7.72 start_POSTSUPERSCRIPT - 0.1 ⋅ italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) (4)

which results in a 13%percent1313\%13 % increase in B/T𝐵𝑇B/Titalic_B / italic_T at TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT = 55-5- 5 compared to the direct fit to all datapoints, while for TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT 1absent1\geq-1≥ - 1 the difference is negligible.

Refer to caption
Figure 7: Same as Fig. 6, but now comparing MBulgesubscript𝑀BulgeM_{\text{Bulge}}italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT from WISE with its equivalent value in the low redshift (z0.4𝑧0.4z\leq 0.4italic_z ≤ 0.4) sample of Mendel et al. (2014).

7 Stellar and Bulge mass from WISE photometry

The process of converting WISE photometry to stellar mass is described in C14, who take advantage of the fact that W1 is an exceptional tracer of the bulk of stellar population in galaxies and that the W1--W2 color can constrain the M/L ratio. For their (and our) calculation, the W1 absolute magnitude of the Sun is taken from Willmer (2018). J23 have presented an updated Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT estimator valid across a larger redshift range, making use of multi-color criteria and K-corrections. We use the K-corrections of the latter (see Section 3) for W1 and W1--W2 colors together with the stellar mass estimator of C14 to derive the total stellar mass (Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT).

To avoid excessive (and likely erroneous) M/L values estimated from W1--W2 colors, our algorithm limits the input W1--W2 values to the range 0.20.2-0.2- 0.2 to 0.60.60.60.6 (corresponding to high and low M/L). Any source with a W1--W2 color outside this range is clipped to the nearest limit M/L, i.e., the distribution of W1--W2 (generated by the random normal samples of W1 and W2, see Fig. 1 and Sect. 9) is shifted until the median reaches the closest limit value.

Once Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT and its errors are calculated following the process outlined above, the value is stored unless the estimated mass is logM6.5subscript𝑀6.5\log M_{*}\leq 6.5roman_log italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT ≤ 6.5 or logM13subscript𝑀13\log M_{*}\geq 13roman_log italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT ≥ 13. This range is more strictly constrained at the low mass end than other catalogues, and more lax at the high mass end (e.g., Dimauro et al., 2018; Durbala et al., 2020). The flexibility at the high mass end is in order to not lose extremely rare extreme SMBHs, often called ultra massive black holes (UMBHs, e.g, Runge & Walker, 2021) and SMBH upper-limits, e.g., QSO with high M/L, which produce very high MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates, but which are flagged as upper limits. The lower limit value is extracted directly from J23.

The WISE2MBH Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT estimates derived here are compared to two low redshift control samples from Mendel et al. (2014, z0.4𝑧0.4z\leq 0.4italic_z ≤ 0.4) and Chang et al. (2015, z0.5𝑧0.5z\leq 0.5italic_z ≤ 0.5) in Fig. 6 and to a group of SDSS samples (Chen et al., 2012; Maraston et al., 2013; Montero-Dorta et al., 2016), all for z0.5𝑧0.5z\leq 0.5italic_z ≤ 0.5. For the samples of Mendel et al. and Chang et al., the agreement is relatively good, with a scatter of 0.2similar-toabsent0.2\sim 0.2∼ 0.2 dex in both cases. There is a slight tendency for WISE2MBH Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT to be underestimated with increasing redshift compared to some of the SDSS samples, No strong evidence was found to suggest that the estimates from SDSS are superior to those from WISE2MBH; correlations and scatters between these samples did not demonstrate a preference. Appendix C contains a more detailed discussion of this WISE to SDSS comparison.

The MBulgesubscript𝑀BulgeM_{\text{Bulge}}italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT estimations are obtained by combining B/T𝐵𝑇B/Titalic_B / italic_T with Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT. Effectively, the estimated B/T𝐵𝑇B/Titalic_B / italic_T allows the estimation of both MBulgesubscript𝑀BulgeM_{\text{Bulge}}italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT and MDisksubscript𝑀DiskM_{\text{Disk}}italic_M start_POSTSUBSCRIPT Disk end_POSTSUBSCRIPT. In the WISE2MBH final catalog, we provide only Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT and B/T𝐵𝑇B/Titalic_B / italic_T for simplicity. Figure 7 presents a comparison of WISE2MBH bulge masses with those derived by Mendel et al. (2014) in a low redshift sample: once again, the agreement is relatively good, but now shows an increased scatter. This can be explained by our assumption of a simple evolution of B/T𝐵𝑇B/Titalic_B / italic_T with TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT, which does not consider other important factors in galaxy evolution such as gas availability, molecular gas content, size distribution, stellar age, and the impact of bars and bulges (e.g., Laurikainen et al., 2007; Fisher & Drory, 2011; Koyama et al., 2019).

Refer to caption
Figure 8: A comparison of measured black hole masses (Top, crosses) and highly reliable black hole mass estimates (Middle, boxes) vs. WISE2MBH MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT values in the WISE2MBH final sample. Each data point is marked with its 1σ1𝜎1\sigma1 italic_σ error bars. Subsamples of HBF and LBF galaxies are distinguished by color following the inset. Gray dotted lines are the RMSE scatter bands. The bottom panel shows the distribution of the mass ratios for subsamples of measured and estimates and a KDE for the complete distribution; the mean ratio and 1 σ𝜎\sigmaitalic_σ dispersion (1.00±0.08plus-or-minus1.000.081.00\pm 0.081.00 ± 0.08) of the latter are shown with red and gray dashed lines.
Refer to caption
Figure 9: Mean MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT errors as a function of B/T𝐵𝑇B/Titalic_B / italic_T. In LBF, there are few cases with larger error compared to the mean errors for both HBF and LBF. This comes directly from the detection of small sources that have worst quality than the bulk of sources detected by WISE in the AllWISE catalog, producing greater errors due to error propagation.
Table 2: Control sample of MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT measurements and reliable Mσ𝜎-\sigma- italic_σ estimations.
N𝑁Nitalic_N MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT / WISE MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT Method Publication
46 1.01±0.08plus-or-minus1.010.081.01\pm 0.081.01 ± 0.08
Stellar / Gas dynamics
Megamaser
RM
van den Bosch (2016)
13 1.03±0.10plus-or-minus1.030.101.03\pm 0.101.03 ± 0.10 Stellar dynamics Saglia et al. (2016)
6 0.97±0.05plus-or-minus0.970.050.97\pm 0.050.97 ± 0.05 Stellar dynamics Thater et al. (2019)
71 1.00±0.10plus-or-minus1.000.101.00\pm 0.101.00 ± 0.10
Stellar / Gas dynamics
Megamaser
Gültekin et al. (2019)
16 1.06±0.08plus-or-minus1.060.081.06\pm 0.081.06 ± 0.08 Gas dynamics ETHER compilationa
647 1.00±0.08plus-or-minus1.000.081.00\pm 0.081.00 ± 0.08 Mσ𝜎-\sigma- italic_σ van den Bosch et al. (2015)
799 1.00±0.08plus-or-minus1.000.081.00\pm 0.081.00 ± 0.08

8 Black Hole Mass from WISE photometry

The value of MBulgesubscript𝑀BulgeM_{\text{Bulge}}italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT calculated in the previous section is used to derive a first estimate of MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT using the MBHMBulgesubscript𝑀BHsubscript𝑀BulgeM_{\text{BH}}-M_{\text{Bulge}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT - italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT relationship of Schutte et al. (2019). These first MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates were compared with a control sample presented in Table 2. The control sample consists of 152152152152 galaxies with directly measured MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT from different methods (e.g., Saglia et al., 2016; van den Bosch, 2016) and 647647647647 galaxies for which high-quality stellar velocity dispersions (σ𝜎\sigmaitalic_σ) were available, obtained via observations with the Hobby-Eberly Telescope (HET; van den Bosch et al., 2015). Using these values of σ𝜎\sigmaitalic_σ allowed us to accurately estimate MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT using the M-σ𝜎\sigmaitalic_σ relationship of Saglia et al. (2016).

The control sample was selected according to the following criteria: (a) the value of MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT is flagged as a measurement or high-quality estimate from σ𝜎\sigmaitalic_σ, (b) the MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimate from WISE2MBH is not an upper limit, (c) the source has an ex flag equal to 5 and (d) the control sample source must have logMBH10.32subscript𝑀BH10.32\log M_{\text{BH}}\leq 10.32roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT ≤ 10.32. While points (a) and (b) are self-explanatory, (c) is required to consider only completely extended sources, and (d) is necessary since van den Bosch et al. (2015) contains a few very large σ𝜎\sigmaitalic_σ values. To avoid those, we consider the maximum value from van den Bosch (2016), who used the same observations from HET as part of the study to measure MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT, as a limit for our control sample. The control sample covers a mass range from logMBHsubscript𝑀BH\log M_{\text{BH}}roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT of similar-to\sim6 to similar-to\sim10, this being almost the complete mass range for SMBH. The heterogeneity of this control sample is discussed in Sect. 11.3.

Linear regression done to the control sample MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT (dependent variable) and WISE2MBH MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates (independent variable) revealed a slope of 0.9 and an intercept of 0.98. A t-test was then used to check the statistical significance of these results compared to a linear regression close to the equality line (expected slope of 1 and intercept of 0) with a p-value of p=0.05𝑝0.05p=0.05italic_p = 0.05. The results showed that the slope did not differ significantly from the equality line, while the intercept was significantly different from zero, suggesting the need for a compensation factor. These findings demonstrate the presence of subtle, yet systematic, discrepancies between the WISE2MBH MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates (when the Schutte et al. (2019) scaling is used) and the control sample values. To address these systematic offsets empirically, a compensation factor (Cfsubscript𝐶𝑓C_{f}italic_C start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT) is defined as follows,

Cf=0.104logMBH+0.98subscript𝐶𝑓0.104subscript𝑀BH0.98C_{f}=-0.104\log M_{\text{BH}}+0.98italic_C start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT = - 0.104 roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT + 0.98 (5)

and added to the estimate. After this empirical correction, a Spearman score of 0.78 and a root mean squared error (RMSE) of 0.63 dex (see Fig. 8) were calculated for the set of compensated estimates and control sample. Since the majority of the control sample are HBF, the small offset of LBF shown at lower MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT ranges does not affect the overall comparison, but may be interpreted as the need for a specific Cfsubscript𝐶𝑓C_{f}italic_C start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT for LBF sources or a misbehavior of previous steps for these types of galaxies, e.g. underestimation of B/T𝐵𝑇B/Titalic_B / italic_T or Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT.

The MBHMBulgesubscript𝑀BHsubscript𝑀BulgeM_{\text{BH}}-M_{\text{Bulge}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT - italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT scaling relation of Schutte et al. can be combined with our Cfsubscript𝐶𝑓C_{f}italic_C start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT, to obtain the following relationship, which is effectively that used in WISE2MBH:

logMBH=1.12log(MBulge1011)+8.84subscript𝑀BH1.12subscript𝑀Bulgesuperscript10118.84\log M_{\text{BH}}=1.12\log\left(\frac{M_{\text{Bulge}}}{10^{11}}\right)+8.84roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT = 1.12 roman_log ( divide start_ARG italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT end_ARG start_ARG 10 start_POSTSUPERSCRIPT 11 end_POSTSUPERSCRIPT end_ARG ) + 8.84 (6)

Finally, in case the algorithm estimates a logMBH5subscript𝑀BH5\log M_{\text{BH}}\leq 5roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT ≤ 5, the source is dropped. While the MBHMBulgesubscript𝑀BHsubscript𝑀BulgeM_{\text{BH}}-M_{\text{Bulge}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT - italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT relation of Schutte et al. (2019) can reach logMBH5subscript𝑀BH5\log M_{\text{BH}}\leq 5roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT ≤ 5, such black holes go down to the limits of intermediate mass black holes (IMBH). IMBHs and their host populations are an active topic of research (for a review, see Greene et al., 2020), and these populations do not necessarily follow the several scaling relations used in WISE2MBH. This also means that estimates slightly higher than MBH=5subscript𝑀BH5M_{\text{BH}}=5italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT = 5 should be treated with caution.

9 Algorithm

The WISE2MBH algorithm was conceived with the purpose of addressing the lack of MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates for more than 80%percent8080\%80 % of the ETHER sample (see Sect. 11.4). Nevertheless, it provides a simple and uniform tool with wide-ranging applications in studies of morphology and galaxy and black hole evolution. It is useful for both generation of large samples from existing data, and of sub-samples for future observations and monitoring with observational facilities.

This main steps of the algorithm are summarized in Fig. 1. In summary, the process is as follows:

  • W1, W2 and W3 magnitudes and a spectroscopic redshift are used to calculate the K-corrected W1 absolute magnitude, W1--W2 and W2--W3 colors with the use of lookup tables from J23 (see Sect. 3).

  • The K-corrected W1 absolute magnitude and the W1--W2 color are used to estimate Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT of the source with the process described in C14. For that calculation, the W1 absolute magnitude of the Sun is taken from Willmer (2018) (see Sect. 7).

  • The source is placed into the estimate or upper limit zone, making use of several inputs (see Sect. 4).

    • If a source is placed in the estimate zone and its TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT is available, it continues in the algorithm.

    • If a source is placed in the estimate zone and no TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT is available, the latter is estimated from the W2--W3 color (see Sect. 5).

    • If a source is placed in the upper limit zone, its TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT is not considered if available, nor calculated if not available. In this case, values calculated later in the algorithm are considered upper limits.

  • The TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT of the source is used to estimate B/T𝐵𝑇B/Titalic_B / italic_T; for sources in the upper limit zone B/T𝐵𝑇B/Titalic_B / italic_T is set to 1 (see Sect. 6).

  • MBulgesubscript𝑀BulgeM_{\text{Bulge}}italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT is estimated using the previously derived Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT and B/T𝐵𝑇B/Titalic_B / italic_T (see Sect. 7).

  • A first MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimate is obtained using the MBHMBulgesubscript𝑀BHsubscript𝑀BulgeM_{\text{BH}}-M_{\text{Bulge}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT - italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT relation of Schutte et al. (2019).

  • The final estimate of MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT is obtained after adding Cfsubscript𝐶𝑓C_{f}italic_C start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT to the first estimate. Sources with MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT in the IMBH range are removed.

Not every source enters the algorithm. There are two main reasons for a source to be rejected during the algorithms pre-processing: (a) the source does not have a spectroscopic z𝑧zitalic_z available, or (b) the quality of the WISE magnitudes are not considered ‘usable’, i.e., qph from the AllWISE catalog is not equal to A, B, C or U, for the W1, W2 and W3 bands. A source can be dropped during an intermediate step of the algorithm if Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT or MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT are considered outliers (described in Sects. 7 and 8).

The algorithm uses a Monte Carlo approach to estimate errors. It generates random normal samples (using arrays of size 104superscript10410^{4}10 start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT) for the W1, W2, and W3 magnitudes and respective mean photometric errors for each source. These distributions are propagated through the algorithm, where errors in the scaling relations used, if available, are considered, thus delivering a final MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT (or other estimated quantity) with asymmetric error bands for each source that was not rejected. The Cfsubscript𝐶𝑓C_{f}italic_C start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT is applied to the final distribution of values and not only to the nominal value. The nominal, low, and high values reported are the median and 1σ1𝜎1\sigma1 italic_σ percentile values of the final distribution, respectively. Two examples of the final error distributions can be seen in Fig. 14.

10 Results

10.1 WISE2MBH final sample

Table 3: Statistics of the WISE2MBH final sample.
All MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT Galaxy QSO RS Unknown
Total New (%) Total New (%) Total New (%) Total New (%) Total New (%)
Any 2010718 1577941 (78.48) 586013 435903 (74.38) 15800 7372 (46.66) 6577 6439 (97.90) 1402328 1128227 (80.45)
Est. 1901419 1492582 555924 411201 1345495 1081381
Est. (%) 94.56 74.23 94.87 70.17 95.95 77.11
Uplim. 109299 85359 30089 24702 15800 7372 6577 6439 56833 46846
Uplim. (%) 5.44 4.26 5.13 4.22 100 46.66 100 97.90 4.05 3.34
Rejected 84869
Refer to caption
Figure 10: Distribution of Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT (left panel) and MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT (middle panel) for the WISE2MBH final sample. For Galaxies and Unknown, two distributions are shown: the dashed line for estimates and the solid line for both estimates and upper limits. The distributions change slightly from one panel to the other, due to the use of B/T𝐵𝑇B/Titalic_B / italic_T to obtain MBulgesubscript𝑀BulgeM_{\text{Bulge}}italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT. The right panel shows a stacked histogram illustrating the number of MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates and upper limits for each object type.

When used with the WISE2MBH parent sample, the algorithm generates similar-to\sim2 million MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates and upper limits, rejecting only 3.9%percent3.93.9\%3.9 % of the parent sample. A summary of the statistics in the final sample is shown in Table 3.

Percentages listed represent the percentage with respect to the total value of a given object type. New WISE2MBH estimations are those for which no previous measurement / estimate of MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT existed in the ETHER sample. Almost 80%percent8080\%80 % of the final sample are first-time MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates or upper limits, most of which come from galaxies and unknown object-type sources that reside in the estimate zone of Fig. 3.

We provide a table, in FITS format, of our WISE2MBH final sample and its derived quantities. The table provides the source name, RA and DEC in degrees from the AllWISE catalog, spectroscopic redshift, object type, TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT used (either from ETHER or calculated by the algorithm), plus the estimates of Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT, B/T𝐵𝑇B/Titalic_B / italic_T, median plus 1σ𝜎\sigmaitalic_σ values of MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT as estimated by the algorithm and a quality flag for each source. An excerpt of the online table is shown in Table 4.

The 7-digit quality flag provided in the table stores information useful for the selection of subsamples. The first four digits of this flag sequentially report the photometric quality of the measurements in the W1, W2, and W3 bands, as well as the extension flag of the source of AllWISE. The fifth digit serves as a binary indicator for the upper limit condition, where 0 denotes an estimate and 1 an upper limit. The sixth digit characterizes the K-correction quality; a value of 0 denotes an optimal correction, 1 is a suboptimal correction, and 2 indicates that K-correction was not applied. Lastly, the seventh digit denotes the origin of the TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT estimate. A value of 0 implies a source with a known TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT, 1 indicates a TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT estimated by the algorithm, and 2 indicates that TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT was not estimated by the algorithm due to an upper limit condition. In particular, it is important to clarify that the seventh digit does not inherently rank the quality of the TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT estimate as superior or inferior, as the available TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT values can come from binary classifications, as described in Dobrycheva (2013), or consider all morphological categories, as in the case of 2MRS. This is discussed in Sect. 11.3.

The highest quality sources (HQS) are classified as AAA500 in the first six digits, while the lowest quality sources (LQS) have the fourth and fifth digits equal to 01. Examples of HQS and LQS sources can be seen in Fig. 15. These sources illustrate the wide range of sources which the algorithm deals with, with HQS being mostly nearby galaxies large enough to have top quality in W1, W2, W3 and also being considered completely extended, while LQS are mostly QSO and compact very late-type galaxies. While HQS is not a strict proxy of the best MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates, we recommend using this sub-sample (118367 HQS, more than half of them (82812) also classified as galaxies) when a high reliability sample is required. Most QSO and RS are classified as LQS (all of their MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT are upper limits), due to the resolution limits of WISE and 2MASS, and/or AGN contamination (see Sect. 11.1).

The errors in the estimates tend to be smaller in HBF sources than in LBF sources, since the extension of the sources allows for better quality in the WISE magnitudes. In Fig. 9 it is possible to see that behavior for a few tens to thousands of sources at LBF.

Final distributions of Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT and MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT for each object type can be seen in Fig. 10. It is clear that QSO and RS appear to have more massive Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT and MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates: this is due to these being upper limits (i.e. assuming B/T=1𝐵𝑇1B/T=1italic_B / italic_T = 1). Galaxies are primarily estimates: the B/T𝐵𝑇B/Titalic_B / italic_T ratio is used here to estimate MBulgesubscript𝑀BulgeM_{\text{Bulge}}italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT and MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT. More than 80%percent8080\%80 % of sources with unknown type are estimates: their distribution is almost the same as galaxies but very different from QSO and RS.

Table 4: Excerpt of the WISE2MBH final sample.
Name (WISEA) RA DEC z𝑧zitalic_z Object type TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT B/T𝐵𝑇B/Titalic_B / italic_T MBH50subscriptsubscript𝑀BH50{}_{50}M_{\text{BH}}start_FLOATSUBSCRIPT 50 end_FLOATSUBSCRIPT italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT MBH16subscriptsubscript𝑀BH16{}_{16}M_{\text{BH}}start_FLOATSUBSCRIPT 16 end_FLOATSUBSCRIPT italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT MBH84subscriptsubscript𝑀BH84{}_{84}M_{\text{BH}}start_FLOATSUBSCRIPT 84 end_FLOATSUBSCRIPT italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT Quality
() () (logMsubscript𝑀direct-product\log M_{\odot}roman_log italic_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT) (logMsubscript𝑀direct-product\log M_{\odot}roman_log italic_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT) (logMsubscript𝑀direct-product\log M_{\odot}roman_log italic_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT) (logMsubscript𝑀direct-product\log M_{\odot}roman_log italic_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT)
J005254.81-260228.5 13.228403 -26.041252 0.075047 Galaxy 8.00 10.33 0.13 7.13 6.49 7.76 AAA0001
J085853.20+381944.2 134.721705 38.328948 0.089774 Unknown --5.0 10.40 1.00 8.19 7.58 8.83 AAA0000
J090734.40+502953.9 136.893367 50.498308 0.149000 QSO –.– 9.96 1.00 7.70 7.14 8.34 AAB0122
J143516.80-482147.8 218.820003 -48.363285 0.164000 RadioSource –.– 10.86 1.00 8.74 8.14 9.36 AAB0122
J090536.54+612254.9 136.402256 61.381936 0.178334 Unknown 1.49 11.11 0.32 8.44 7.81 9.06 AAU0001
J072846.20+404954.0 112.192538 40.831674 0.217425 Unknown 1.99 11.10 0.29 8.42 7.82 9.04 AAU0001
J231318.65+094133.4 348.327736 9.692628 0.328817 Unknown 2.58 11.18 0.27 8.39 7.72 9.07 AAU0001
J120245.58+071133.3 180.689952 7.192605 0.400000 Galaxy –.– 10.86 1.00 8.74 8.06 9.32 AAA0122
J120939.83+344601.5 182.415971 34.767088 0.434100 QSO –.– 11.07 1.00 9.00 8.34 9.64 AAA0122
J230846.43+173546.2 347.193464 17.596178 0.470583 Unknown 4.81 11.12 0.19 8.19 7.59 8.82 AAU0001

10.2 Local black hole mass function

Refer to caption
Figure 11: Left: Solid lines show the black hole mass function (BHMF) in the WISE2MBH final sample for shells of width 30 Mpc ending at distances of 30 to 300 Mpc, following the color bar at right; at high black hole masses, the BHMFs are uncertain and are shown with dashed lines. ETHER+WISE and ETHER points (see inset) represent the closest BHMF (0 -- 30 Mpc) present in ETHER with and without considering WISE MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates. Right: Same as left, but now for three shells ending at 30, 180 and 300 Mpc, showing the BHMF using median values (solid) and ten curves showing the BHMF created using random sampling from the PDF of each MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimate (dashed; see text). For reference both panels show four BHMFs independently derived by Shankar et al. (2016); Gallo & Sesana (2019); Pesce et al. (2021); Yao et al. (2023).

The local black hole mass functions (BHMFs) produced using the MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimated from the WISE2MBH final sample can be seen in Fig. 11 for shells of increasing distance out to 300 Mpc. These WISE2MBH final sample BHMFs are compared to the local BHMF in R23, and other BHMFs from independent methods: Shankar et al. (2016, compensated MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT-σ𝜎\sigmaitalic_σ), Gallo & Sesana (2019, X-ray), Pesce et al. (2021, MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT-Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT) and Yao et al. (2023, TDE). The WISE2MBH final sample BHMF was derived using only estimates from WISE. At logMBH>10subscript𝑀BH10\log M_{\text{BH}}>10roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT > 10, the WISE2MBH final sample has only a few (elliptical) galaxies, so that the derived BHMFs shows large fluctuations given the small number statistics. For this reason, the BHMFs at logMBH10greater-than-or-equivalent-tosubscript𝑀BH10\log M_{\text{BH}}\gtrsim 10roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT ≳ 10 are shown as dashed lines.

The symbols in the figure show two cases of the BHMF for BHs at D30𝐷30D\leq 30italic_D ≤ 30 Mpc: (a) using only estimates present in ETHER before the WISE2MBH algorithm was used (black symbols), and (b) using ETHER estimates, and in the absence of these, using estimates from WISE2MBH (red symbols). The difference between these two cases, and the WISE2MBH BHMF at D30𝐷30D\leq 30italic_D ≤ 30 Mpc (darkest blue line) is clear. At low masses (logMBH7subscript𝑀BH7\log M_{\text{BH}}\leq 7roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT ≤ 7) the ETHER-only BHMF is significantly lower than the other two. The ETHER+WISE (red symbols) and WISE2MBH BHMF agree well between each other and primarily sit on or between previously derived literature expectations. At the highest masses, the ETHER and ETHER+WISE BHMFs are above previous literature predictions: of the 17 SMBH with log MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT \geq 9, the majority are M-σ𝜎\sigmaitalic_σ estimates with σ𝜎\sigmaitalic_σ from LAMOST and SDSS, so their true MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT are sensitive to both errors in σ𝜎\sigmaitalic_σ and in the high-mass M-σ𝜎\sigmaitalic_σ relationship. These BHMFs in the highest mass bins should thus be treated with caution, and will be explored in more detail in \textcolorblueHernández-Yévenes et al. (\textcolorbluein prep).

At the high mass end, the WISE2MBH BHMFs in all distance shells except for the closest one, drop significantly below the ETHER and ETHER+WISE BHMFs and also the literature expectations. That is, when considering median values of MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT, pure WISE-based estimates do not find the few massive known black holes in the local universe. The increasing incompleteness of the WISE2MBH BHMFs with distance, especially at lower masses, is due to the redshift distribution in the sample.

Why does WISE2MBH not recover the shape of the BHMF at the massive end? We explain this as a result of not considering the intrinsic scatter of the relationship between the bulge mass and the SMBH mass. Indeed, when we consider the predicted probability density function (PDF) of each MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimate from WISE2MBH (the dispersion of each PDF is dominated by the dispersion in the bulge-to-SMBH mass conversion) and randomly select a value from this PDF (instead of its median), the high-mass black hole populations are recovered and the WISE2MBH BHMFs are even higher than previous literature BHMFs (right panel of Fig. 11). We do the above exercise 10 times in each distance shell to create different PDF-sampled BHMFs. Even through the closest distance shell shows considerable fluctuations in its ten PDF-sampled BHMFs, all three shells consistently show BHMFs close to, but higher than, previous literature BHMFs.

The combination of less massive Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT estimates and low B/T𝐵𝑇B/Titalic_B / italic_T for spiral galaxies in the sample can lead to differences at the low mass end when comparing WISE2MBH BHMFs with other BHMFs in the literature. In any case the ETHER-only BHMF is lower than literature expectations due to the construction of the sample: galaxies with an estimated black hole mass of logMBH5.5subscript𝑀BH5.5\log M_{\text{BH}}\leq 5.5roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT ≤ 5.5 were eliminated from the sample, but note that galaxies without black hole masses remained in the sample and for these WISE2MBH could later add a mass estimate. The ETHER+WISE BHMF is in good agreement with other BHMF that obtain similar densities.

The BHMFs from Gallo & Sesana (2019) and Yao et al. (2023) are mostly focused in lower mass ranges. The former used the stellar mass function derived from Galaxy And Mass Assembly (GAMA) together X-ray imaging from the Chandra Observatory, to constrain the BHMF down to low-mass regimes (logMBH4subscript𝑀BH4\log M_{\text{BH}}\geq 4roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT ≥ 4). The latter was derived by studying tidal disruption events (TDE), which are less frequent in SMBH with higher masses (logMBH7subscript𝑀BH7\log M_{\text{BH}}\geq 7roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT ≥ 7); they argue that their BHMF over logMBHsubscript𝑀BH\log M_{\text{BH}}roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT 5 to 7 could be considered as an upper limit. With that information, a limit was established and our BHMF is in good agreement with that.

The BHMF of Shankar et al. (2016) is the most widely used for comparisons, due to the compensated phenomenology used to derive it, as compared to the BHMF previously derived in Shankar et al. (2009). The Shankar et al. (2016) BHMF corrects the low-mass range, now showing a clear downward trend, but it is still limited in MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT range. Pesce et al. (2021) used the Shankar et al. BHMF to build an updated BHMF which extrapolates to higher mass ranges, but does not update the low-mass end BHMF. The WISE2MBH BHMFs are lower at both the high and low mass end as compared to Shankar et al. (2016) and Pesce et al. (2021).

Given recent interest in black hole binaries (BHB) over a wide range of masses (for a review, see De Rosa et al., 2019), our predicted BHMF favors recent studies. Sato-Polito et al. (2023) discussed the need for a larger population of inspiraling supermassive BHB (SMBHB) compared to current predictions to explain the stochastic gravitational wave background found by pulsar timing array (PTA) collaborations. Their prediction agrees with our closest WISE2MBH PDF-sampled BHMF; favoring the hypothesis. Izquierdo-Villalba et al. (2024) predict a similar local BHMF when trying to connect LISA-detectable binaries (logMBH<7subscript𝑀BH7\log M_{\text{BH}}<7roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT < 7) to SMBHB populations: our ETHER+WISE and most-local PDF-sampled BHMF presents almost perfect agreement with the predicted low-mass population in their work, while our WISE2MBH PDF-sampled BHMF is almost one dex lower at the high mass end (see their Fig. B1).

11 Discussion

11.1 Assumptions and limitations

The algorithm makes several assumptions and approximations, potentially introducing biases, particularly for certain types of galaxies. As highlighted by Jarrett et al. (2019) and Cluver et al. (2020), despite the reduced sensitivity of the W3 band compared to W1 and W2, leading to fewer detections (as noted by Jarrett et al., 2013), the color W2--W3 has been shown to be an effective indicator of morphology. Therefore, we choose to use this color to estimate the morphology. The galaxy-averaged W1--W2 color (rather than the colors of the individual components, e.g., bulge and disk) is assumed to correctly trace the averaged M/L ratio as C14 and J23 suggest, and is effectively used for this purpose.

Refer to caption
Figure 12: Comparison of different MBHMBulgesubscript𝑀BHsubscript𝑀BulgeM_{\text{BH}}-M_{\text{Bulge}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT - italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT scaling relations from the literature, including Kormendy & Ho (2013); Saglia et al. (2016); Schutte et al. (2019) and the modified scaling presented in this work. Grey area represents the limit of the WISE2MBH algorithm for logMBH<5subscript𝑀BH5\log M_{\text{BH}}<5roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT < 5, where it drops all estimates.

Our estimations of Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT come from the method of C14, which presents a different dependence of M/L𝑀𝐿M/Litalic_M / italic_L on W1--W2 compared to J23, which was the method for which the K-corrections were developed. Despite this, our most important goal with respect to the ETHER sample is not Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT estimates, but rather MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates. The latter showed better agreement with measured MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT values in the literature when using the method of C14.

We initially used the empirical relationship of Schutte et al. (2019) to convert the WISE-based bulge masses to a SMBH mass, given that this is the most recent study of the relationship, and extends to relatively low black hole masses. For this conversion, a comparison of WISE2MBH MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT to the control sample of SMBH measurements and high quality estimates shows a slight offset from equality. The best fit compensation factor (Cfsubscript𝐶𝑓C_{f}italic_C start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT) is positive for an initial MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimate with logMBH<9.42subscript𝑀BH9.42\log M_{\text{BH}}<9.42roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT < 9.42. When the first estimate of logMBHsubscript𝑀BH\log M_{\text{BH}}roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT is close to the realm of IMBH, this can lead to a large compensation factor. For example, when the first MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimate is logMBH=4.6subscript𝑀BH4.6\log M_{\text{BH}}=4.6roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT = 4.6, thus a Cf=0.5subscript𝐶𝑓0.5C_{f}=0.5italic_C start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT = 0.5, then a final MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimate of logMBH=5.1subscript𝑀BH5.1\log M_{\text{BH}}=5.1roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT = 5.1 is stored. In this example, if Cfsubscript𝐶𝑓C_{f}italic_C start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT was not applied, the source would have been dropped from the final sample.

When combining the Schutte et al. scaling relationship with the compensation factor (Eq. 6; see also Fig. 12), it is clear that the final scaling relationship we derive and use lies in-between (Fig. 12) the MBulgeMBHsubscript𝑀Bulgesubscript𝑀BHM_{\text{Bulge}}-M_{\text{BH}}italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT - italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT scaling relations of Kormendy & Ho (2013) and Schutte et al. (2019), being more in agreement with the former (latter) at lower (higher) masses. This shows that our final estimates effectively show a population of SMBH with masses larger than that predicted by Schutte et al.. Given that we use only galaxies with detections in all of W1, W2, and W3, and that SMBH with final mass estimates at logMBH<5subscript𝑀BH5\log M_{\text{BH}}<5roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT < 5 are deleted, the WISE2MBH algorithm is highly incomplete in estimates for dwarf galaxies, so the compensation factor and thus the final MBulgeMBHsubscript𝑀Bulgesubscript𝑀BHM_{\text{Bulge}}-M_{\text{BH}}italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT - italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT scaling relationship are calibrated using only more massive SMBH.

The compensation factor obviously biases the final WISE2MBH MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates to predict a population of SMBH mostly similar to the few local sources that have measurements or reliable estimates of MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT. Given that we use this same Cfsubscript𝐶𝑓C_{f}italic_C start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT for all redshifts in WISE2MBH, this can lead to systematic errors at higher redshifts. This underlines the need for more and better direct measurements of MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT over a range of redshifts, plus a review of systematics in previous measurements (e.g., Liepold et al., 2023; Osorno et al., 2023).

The algorithm is limited by the relatively low angular resolution of WISE (6.1absent6.1\geq 6.1\arcsec≥ 6.1 ″), i.e., its limited capacity to constrain the extension or morphology of compact sources. This factor, combined with a certain degree of AGN contamination in some cases, exacerbates the classification of some targets as upper limits (rather than estimates) using the color-color criteria described in Sect. 3, causing us to use unnecessarily large values of MBulgesubscript𝑀BulgeM_{\text{Bulge}}italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT (due to fixing B/T=1𝐵𝑇1B/T=1italic_B / italic_T = 1) and thus providing overestimated MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT values as an upperlimit.

11.2 Using WISE2MBH at higher redshifts

Refer to caption
Figure 13: WISE2MBH-derived Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT compared to the high redshift (z0.5𝑧0.5z\geq 0.5italic_z ≥ 0.5) samples of Tacconi et al. (2013); Liu et al. (2019); Bacon et al. (2023); López et al. (2023); Mei et al. (2023); Poitevineau et al. (2023), with colors specified in the inset. The black dashed line is the line of equality. Datapoints are colored by redshift following the color bar on the right (blue is z=0.5𝑧0.5z=0.5italic_z = 0.5 and red is z3similar-to𝑧3z\sim 3italic_z ∼ 3). Filled and unfilled markers are used for estimates and upper limits of WISE2MBH Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT, respectively; all literature values are estimates.

In this work we have presented and discussed the use of the WISE2MBH algorithm up to redshift 0.5. This since K-corrections to the WISE photometry are relatively reliable up to this redshift, and the scaling relationships used are derived primarily in local galaxy samples.

In principle, the WISE2MBH algorithm can be extended to the redshift range 0.5–3 by using the K-correction lookup tables from J23. However, the results over this redshift range have significantly larger uncertainties both due to the larger uncertainties in the K-corrections and due to the several other scaling relationships used being derived and calibrated for lower redshift samples (z0.5𝑧0.5z\leq 0.5italic_z ≤ 0.5).

Specifically, the K-corrections, as presented in J23, have been made available for redshifts up to z=3𝑧3z=3italic_z = 3, but have been shown to be reliable only at z0.5𝑧0.5z\leq 0.5italic_z ≤ 0.5. The zone selection and object criteria explained in Sect. 4, have been applied systematically to every source within the WISE2MBH parent sample. Object selection and upper limit or rejection criteria are based on K-corrected fluxes and are thus more reliable at z0.5𝑧0.5z\leq 0.5italic_z ≤ 0.5. The TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT estimator, an essential component of the algorithm, was calibrated using the 2MRS sample, whose sources are z0.15𝑧0.15z\leq 0.15italic_z ≤ 0.15. Lastly, the method used to estimate Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT, as explained in C14, has been calibrated for z0.12𝑧0.12z\leq 0.12italic_z ≤ 0.12.

To test the algorithm at higher redshifts, we use WISE2MBH to derive estimates for ETHER galaxies at redshifts 0.5–3. Figure 13 compares WISE2MBH Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT estimates with literature values of Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT for galaxies at z0.5𝑧0.5z\geq 0.5italic_z ≥ 0.5 from a control sample from multiple catalogues (Tacconi et al., 2013; Liu et al., 2019; Bacon et al., 2023; López et al., 2023; Mei et al., 2023; Poitevineau et al., 2023). In this figure, the filled markers show sources with WISE Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT estimates and the unfilled markers show sources with WISE Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT upper limits. All literature values are Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT estimates.

In the context of these comparisons between estimates and upper limits against the control sample, the following observations hold:

  • WISE-based estimates, having undergone prior K-correction, predominantly show a slight underestimation in their Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT values.

  • WISE-based upper limits, which did not undergo K-correction, typically show an overestimation in their WISE2MBH Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT values. This tendency is consistent with the possible AGN contamination of these sources and the absence of K-correction. Since these are explicitly marked as upper limits in our algorithm, this does not lead to errors.

The WISE2MBH algorithm has been shown to be reliable in recognizing sources that are contaminated by powerful AGN emission. To err on the safe side, and given that these sources have insignificant color-color evolution across redshift (e.g., Mateos et al., 2012), we do not use K-corrections for these sources, and mark their mass estimates as upper limits. Nevertheless, it should be noted that the algorithm has a significantly decreased accuracy in estimating Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT at redshifts higher than z=0.5𝑧0.5z=0.5italic_z = 0.5.

11.3 Building on the algorithm and heterogeneity

The WISE2MBH algorithm can be considered as an auxiliary tool for obtaining MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates from a homogeneous dataset. This dataset provides consistent estimates of MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT using WISE data for most sources and a consistent set of relations, following accepted ideas and scaling relations applicable to the majority of extragalactic sources. Although the WISE2MBH final sample is heterogeneous in its composition (i.e. different extragalactic objects), users can define subsamples to recover homogeneity.

The use of a single, well-calibrated dataset from observations with the same instrument and method can ensure consistency and comparability of results, but may limit their generalizability and bias the final conclusions. The process described in this work does not necessarily rely solely on WISE data. WISE is used to obtain physical properties of extragalactic sources in the ETHER sample, but external data, such as source classification and morphological types, are also employed in some cases.

Independently derived physical properties, such as Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT or MBulgesubscript𝑀BulgeM_{\text{Bulge}}italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT can be input to an intermediate stage of WISE2MBH in order to obtain MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates. This approach injects heterogeneity into the algorithm and its results.

Despite the use of a homogeneous parent sample, both MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT (low-z) and Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT (low- and high-z) were compared with heterogeneous control samples, the former being more important to discuss. The MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT control sample is described in Table 2. The primary methods used to measure MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT in the control sample are stellar dynamics, gas dynamics, and reverberation mapping (RM); all known for their reliability and precision in measuring MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT. Poor quality σ𝜎\sigmaitalic_σ, single-epoch RM, and other methods used to obtain MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT in R23 were excluded. Despite the heterogeneity of the control sample, the ratio between the literature measured or estimated MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT and the WISE2MBH MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimate was calculated in every case, and no prominent differences or scatter was found. Furthermore, the WISE-based bulge luminosity to black hole mass scaling closely follows the relationship of Kormendy & Ho (2013), which is widely used in the literature.

Authors who wish to work with a completely homogeneous subsample of the WISE2MBH parent sample which is based only on WISE data and spectroscopic redshift, could define the subsample as follows: (a) only sources with object type galaxy or unknown, and (b) only sources with a quality flag ending in 2. These constraints ensure that authors work with estimates that only used WISE data for the classification of upper limits and omit estimates that made use of previously known TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT.

11.4 Relevance for the EHT and ngEHT

The EHT (and future ngEHT) is the best facility for the imaging of the innermost environments of black holes in terms of sensitivity and resolution (similar-to\sim10 mJy and similar-to\sim15 μ𝜇\muitalic_μas, Doeleman et al., 2019; Pesce et al., 2022; Johnson et al., 2023; Doeleman et al., 2023) for the next decade, opening the possibility of imaging (and making movies of) tens to hundreds of SMBH in the nearby Universe. Relevant science goals include testing general relativity, the role of magnetic fields in black hole accretion and jet formation. Recently, Pesce et al. (2021) have demonstrated that with current EHT facilities (at 230GHz), we can expect to resolve similar-to\sim5 new SMBH shadows, while with ngEHT observing at 345GHz, this number can be increased by factor similar-to\sim3. The challenge is to identify these very rare sources.

In the context of scientific exploitation of the EHT, R23 have developed the ETHER sample and database. Combining greater-than-or-equivalent-to\gtrsim3M sources in a parent sample of galaxies and AGN, comprehensive radio to X-ray observed spectral energy densities (SEDs), and jet and accretion flow SED modelling to predict the expected EHT flux, ETHER can provide target samples for the EHT for any given science goal. WISE2MBH was originally developed to fill large gaps in the ETHER sample: delivering both black hole mass estimates and upper limits, and galaxy morphologies. Its accuracy and ease of use, combined with its relevance to galaxy evolution studies, especially at high redshift, motivate its publication as a separate entity from ETHER.

This algorithm does not necessarily intend to replace previous estimates of MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT in the literature (except for values based on poor quality σ𝜎\sigmaitalic_σ or relatively unreliable ’fundamental planes’), but rather to increase the completeness of the ETHER sample. As more precise estimations or measurements become available (e.g., from SDSS Black Hole Mapper, Kollmeier et al., 2017, Sect. 2.2), WISE MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates can be replaced in ETHER.

In R23, the authors detail a methodology that requires MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates (or upper limits) and X-ray flux data, primarily from the Chandra Source Catalog (CSC) to predict the EHT-observable flux of an SMBH. With the release of the eROSITA all-sky survey (eRASS, Merloni et al., 2024) DR1 an estimated 1similar-toabsent1\sim 1∼ 1 million new hard X-ray flux measurements have become available. Integrating eROSITA, WISE2MBH and ETHER, will allow SED fitting, thus radio flux estimates, for almost all sources in eROSITA. This is a critical step for target selection for the ngEHT.

12 Summary

This work presents a simple and new algorithm to obtain stellar masses (Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT), morphological types (TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT), bulge-to-total ratios (B/T𝐵𝑇B/Titalic_B / italic_T), and black hole masses (MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT) estimates of a galaxy. The WISE2MBH algorithm is publicly available at GitHub333A general use version of the algorithm is available at the following GitHub repository: https://github.com/joacoh/wise2mbh. This algorithm, which only requires WISE catalog data, classifies sources as galaxies or QSOs, and estimates multiple physical quantities such as Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT and MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT. The algorithm uses previously derived scaling relations and our own derived relations to obtain a final MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimate or upper limit. Using a parent sample of similar-to\sim2.1 million sources from the ETHER sample post-cross-match to the AllWISE and WISE extended source (WXSC) catalogues, a final sample of similar-to\sim1.9 million MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates and similar-to\sim100 thousand upper limits were calculated. Among the estimates (i.e. not considering upper limits), similar-to\sim78.5%percent78.578.5\%78.5 % are first-time estimates of known galaxies or unclassified sources. QSOs and radio sources (RS), as classified by NED, are also part of the sample, but due to the nature of their emission and the quality or extension of these sources in WISE, all of their final values of MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT are marked as upper limits. The final sample table is available online via CDS.

The detailed manual morphological classifications (TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT) of galaxies in the 2MASS redshift survey (2MRS) were used to derive a relation between TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT and WISE W2--W3 color, with the objective of estimating TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT for sources that do not have one previously assigned in the literature. All available and estimated TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT are used to obtain B/T𝐵𝑇B/Titalic_B / italic_T using an exponential relation described in Sect. 6 that is consistent with previous studies. The obtained B/T𝐵𝑇B/Titalic_B / italic_T are used to calculate MBulgesubscript𝑀BulgeM_{\text{Bulge}}italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT from a WISE derived Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT. Finally, we use our MBHMBulgesubscript𝑀BHsubscript𝑀BulgeM_{\text{BH}}-M_{\text{Bulge}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT - italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT scaling relation - which lies between accepted literature scaling relationships - to estimate MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT. All uncertainties are propagated through the algorithm using a Monte Carlo approach, delivering the 1σ1𝜎1\sigma1 italic_σ (upper and lower) errors of the final distributions as low and high values, respectively.

The MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates were compared to a control sample of MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT measurements and reliable estimates, showing a significant difference in the linear regression analysis with respect to the equality line, i.e., an offset that causes some values to be overestimated and others underestimated. To compensate for this offset, we implement a compensation factor (Cfsubscript𝐶𝑓C_{f}italic_C start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT) derived with the use of the control sample. After compensation, the comparison achieves a Spearman score of 0.78similar-toabsent0.78\sim 0.78∼ 0.78 and a RMSE of 0.63similar-toabsent0.63\sim 0.63∼ 0.63. The final scaling relationship used (eqn. 6) lies between the relationships of Kormendy & Ho (2013) and Schutte et al. (2019), being more consistent with the former at low SMBH masses and the latter at high SMBH masses. The mean uncertainty was calculated for the MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates, considering a simple mean between low and high errors and then taking the mean value of the distribution of means, obtaining a value of 0.5similar-toabsent0.5\sim 0.5∼ 0.5 dex, showing more scatter in low bulge fraction (LBF) sources, compared to high bulge fraction (HBF) sources (see Fig. 9).

The black hole mass function (BHMF) of the WISE2MBH final sample is in good agreement with other previously and independently derived BHMFs. The ETHER-only sample has few low mass estimates (logMBH6subscript𝑀BH6\log M_{\text{BH}}\leq 6roman_log italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT ≤ 6), while the WISE2MBH final sample can provide this population of sources, and the overall combination of both samples generates the most complete BHMF. We find evidence that high mass SMBH are more common than predicted by literature BHMFs.

When using the WISE2MBH algorithm or the final sample described in this work, it is important to take into account the assumptions and associated limitations. The algorithm provided on GitHub can be easily modified to change the scaling relations used or incorporate new ones, tailored to the user’s requirements.

Regarding the final sample, we recommend not considering all Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT or MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates if the main goal is to study only a few sources and/or restrict the sample to only high quality sources (HQS). In case of population studies, almost the complete final sample can be used, depending on the redshift limit, quality flag, and the requirements of the user.

The final sample was generated in a homogeneous manner, i.e. all estimates come from relations that make use of WISE cataloged data to derive physical quantities, except for the use of TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT from the literature in some cases. This gives confidence that the derived values are consistent from one to another and no externally derived physical parameters were used to obtain the final MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT.

The WISE2MBH final sample is already incorporated into the ETHER sample, providing almost 3 million new MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT estimates and upper limits that, and it will be used iteratively to provide up-to-date values in case new sources are ingested into the ETHER sample. These estimates are crucial for the selection of samples of interest for the Event Horizon Telescope (EHT) and the next-generation EHT (ngEHT), and are used on each update of the sample. Its high percentage of success in estimating a new MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT, combined with spectral fitting of accretion and jet models to hard X-ray data from Chandra, XMM, and eROSITA, allows one to predict radio fluxes from the accretion inflow and jets, and thus obtain a first selection of sources detectable with the EHT or ngEHT.

Acknowledgements

We thank Yuri Kovalev, Angelo Ricarte, Dominic Pesce, and Priyamvada Natarajan for useful discussions and feedback, and Yuhan Yao for providing black hole mass functions for comparison. We acknowledge funding from ANID Chile via Nucleo Milenio TITANs (Project NCN2023_002), Fondecyt Regular (Project 1221421) and Basal (Project FB210003). T.H.J. acknowledges support from the National Research Foundation (South Africa). This research has used the VizieR catalog access tool, CDS, Strasbourg, France. We acknowledge the usage of the HyperLeda database (http://leda.univ-lyon1.fr). This research made use of the following software: Pandas (Reback et al., 2022), Astropy (Astropy Collaboration et al., 2022), Numpy (Harris et al., 2020), Scipy (Virtanen et al., 2020), Matplotlib (Hunter, 2007), Seaborn (Waskom, 2021), StatsModels (Seabold & Perktold, 2010) and Topcat (Taylor, 2005).

Data Availability

The WISE2MBH algorithm and the K-correction tables from J23 for eliptical, lenticular and spiral morphologies, can be found in the GitHub repository. The WISE2MBH final sample, which includes all the data described in Table 3, is distributed through CDS.

References

  • Abraham & van den Bergh (2001) Abraham R. G., van den Bergh S., 2001, Science, 293, 1273
  • Abraham et al. (1994) Abraham R. G., Valdes F., Yee H. K. C., van den Bergh S., 1994, ApJ, 432, 75
  • Abraham et al. (1996) Abraham R. G., van den Bergh S., Glazebrook K., Ellis R. S., Santiago B. X., Surma P., Griffiths R. E., 1996, ApJS, 107, 1
  • Assef et al. (2018) Assef R. J., Stern D., Noirot G., Jun H. D., Cutri R. M., Eisenhardt P. R. M., 2018, ApJS, 234, 23
  • Astropy Collaboration et al. (2022) Astropy Collaboration et al., 2022, ApJ, 935, 167
  • Bacon et al. (2023) Bacon R., et al., 2023, A&A, 670, A4
  • Bentz & Katz (2015) Bentz M. C., Katz S., 2015, PASP, 127, 67
  • Bluck et al. (2014) Bluck A. F. L., Mendel J. T., Ellison S. L., Moreno J., Simard L., Patton D. R., Starkenburg E., 2014, MNRAS, 441, 599
  • Boizelle et al. (2021) Boizelle B. D., et al., 2021, ApJ, 908, 19
  • Borenstein et al. (2009) Borenstein M., Hedges L., Higgins J., Rothstein H., 2009, Introduction to Meta-Analysis. Wiley, doi:10.1002/9780470743386
  • Broderick et al. (2015) Broderick A. E., Narayan R., Kormendy J., Perlman E. S., Rieke M. J., Doeleman S. S., 2015, ApJ, 805, 179
  • Brown et al. (2014) Brown M. J. I., et al., 2014, ApJS, 212, 18
  • Bruzual & Charlot (2003) Bruzual G., Charlot S., 2003, MNRAS, 344, 1000
  • Caramete & Biermann (2010) Caramete L. I., Biermann P. L., 2010, A&A, 521, A55
  • Casado et al. (2015) Casado J., Ascasibar Y., Gavilán M., Terlevich R., Terlevich E., Hoyos C., Díaz A. I., 2015, MNRAS, 451, 888
  • Chang et al. (2015) Chang Y.-Y., van der Wel A., da Cunha E., Rix H.-W., 2015, ApJS, 219, 8
  • Chen et al. (2012) Chen Y.-M., et al., 2012, MNRAS, 421, 314
  • Cluver et al. (2014) Cluver M. E., et al., 2014, ApJ, 782, 90
  • Cluver et al. (2017) Cluver M. E., Jarrett T. H., Dale D. A., Smith J. D. T., August T., Brown M. J. I., 2017, ApJ, 850, 68
  • Cluver et al. (2020) Cluver M. E., et al., 2020, ApJ, 898, 20
  • Croton (2006) Croton D., 2006, MNRAS, 369, 1808
  • Cutri et al. (2021) Cutri R. M., et al., 2021, VizieR Online Data Catalog, p. II/328
  • Dalla Bontà et al. (2020) Dalla Bontà E., et al., 2020, ApJ, 903, 112
  • De Rosa et al. (2019) De Rosa A., et al., 2019, New Astron. Rev., 86, 101525
  • Dimauro et al. (2018) Dimauro P., et al., 2018, MNRAS, 478, 5410
  • Dimauro et al. (2022) Dimauro P., et al., 2022, MNRAS, 513, 256
  • Dobrycheva (2013) Dobrycheva D. V., 2013, Odessa Astronomical Publications, 26, 187
  • Doeleman et al. (2019) Doeleman S., et al., 2019, Bulletin of the AAS, 51
  • Doeleman et al. (2023) Doeleman S. S., et al., 2023, Galaxies, 11, 107
  • Dunlop et al. (2003) Dunlop J. S., McLure R. J., Kukula M. J., Baum S. A., O’Dea C. P., Hughes D. H., 2003, MNRAS, 340, 1095
  • Durbala et al. (2020) Durbala A., Finn R. A., Crone Odekon M., Haynes M. P., Koopmann R. A., O’Donoghue A. A., 2020, AJ, 160, 271
  • Edwards et al. (2020) Edwards L. O. V., et al., 2020, MNRAS, 491, 2617
  • Event Horizon Telescope Collaboration et al. (2019) Event Horizon Telescope Collaboration et al., 2019, ApJ, 875, L1
  • Event Horizon Telescope Collaboration et al. (2022) Event Horizon Telescope Collaboration et al., 2022, ApJ, 930, L12
  • Ferrarese & Ford (2005) Ferrarese L., Ford H., 2005, Space Sci. Rev., 116, 523
  • Fisher & Drory (2011) Fisher D., Drory N., 2011, ApJ, 733, L47
  • Flesch (2013) Flesch E., 2013, Publ. Astron. Soc. Australia, 30, e004
  • Flesch (2023) Flesch E. W., 2023, The Open Journal of Astrophysics, 6, 49
  • Gallo & Sesana (2019) Gallo E., Sesana A., 2019, ApJ, 883, L18
  • Gao et al. (2020) Gao H., Ho L. C., Barth A. J., Li Z.-Y., 2020, ApJS, 247, 20
  • Ge et al. (2018) Ge X., Gu Q.-S., Chen Y.-Y., Ding N., 2018, arXiv e-prints, p. arXiv:1808.01709
  • Genzel et al. (2010) Genzel R., Eisenhauer F., Gillessen S., 2010, Reviews of Modern Physics, 82, 3121
  • Graham (2016) Graham A. W., 2016, Astrophysics and Space Science Library, pp 263–313
  • Greene et al. (2020) Greene J. E., Strader J., Ho L. C., 2020, ARA&A, 58, 257
  • Grundler et al. (2022) Grundler A., Dazer M., Herzig T., 2022, Applied Sciences, 12
  • Gültekin et al. (2019) Gültekin K., King A. L., Cackett E. M., Nyland K., Miller J. M., Di Matteo T., Markoff S., Rupen M. P., 2019, ApJ, 871, 80
  • Guo et al. (2018) Guo S., Qi Z., Liao S., Cao Z., Lattanzi M. G., Bucciarelli B., Tang Z., Yan Q.-Z., 2018, A&A, 618, A144
  • Häring & Rix (2004) Häring N., Rix H.-W., 2004, ApJ, 604, L89
  • Harris et al. (2020) Harris C. R., et al., 2020, Nature, 585, 357
  • Hickox & Alexander (2018) Hickox R. C., Alexander D. M., 2018, ARA&A, 56, 625
  • Huchra et al. (2012) Huchra J. P., et al., 2012, ApJS, 199, 26
  • Hunter (2007) Hunter J. D., 2007, Computing in Science & Engineering, 9, 90
  • Hviding et al. (2022) Hviding R. E., Hainline K. N., Rieke M., Juneau S., Lyu J., Pucha R., 2022, AJ, 163, 224
  • Inayoshi et al. (2020) Inayoshi K., Visbal E., Haiman Z., 2020, ARA&A, 58, 27
  • Izquierdo-Villalba et al. (2024) Izquierdo-Villalba D., Sesana A., Colpi M., Spinoso D., Bonetti M., Bonoli S., Valiante R., 2024, arXiv e-prints, p. arXiv:2401.10983
  • Jarrett et al. (2000) Jarrett T. H., Chester T., Cutri R., Schneider S., Skrutskie M., Huchra J. P., 2000, AJ, 119, 2498
  • Jarrett et al. (2011) Jarrett T. H., et al., 2011, ApJ, 735, 112
  • Jarrett et al. (2012) Jarrett T. H., et al., 2012, AJ, 145, 6
  • Jarrett et al. (2013) Jarrett T. H., et al., 2013, AJ, 145, 6
  • Jarrett et al. (2019) Jarrett T. H., Cluver M. E., Brown M. J. I., Dale D. A., Tsai C. W., Masci F., 2019, ApJS, 245, 25
  • Jarrett et al. (2023) Jarrett T. H., Cluver M. E., Taylor E. N., Bellstedt S., Robotham A. S. G., Yao H. F. M., 2023, ApJ, 946, 95
  • Johnson et al. (2023) Johnson M. D., et al., 2023, Galaxies, 11
  • Karademir et al. (2023) Karademir G. S., Taylor E. N., Blake C., Cluver M. E., Jarrett T. H., Triani D. P., 2023, MNRAS, 522, 3693
  • Kettlety et al. (2018) Kettlety T., et al., 2018, MNRAS, 473, 776
  • Kollmeier et al. (2017) Kollmeier J. A., et al., 2017, arXiv e-prints, p. arXiv:1711.03234
  • Kong & Ho (2018) Kong M., Ho L. C., 2018, ApJ, 859, 116
  • Kormendy & Ho (2013) Kormendy J., Ho L. C., 2013, ARA&A, 51, 511
  • Koyama et al. (2019) Koyama S., et al., 2019, ApJ, 874, 142
  • Kroupa (2001) Kroupa P., 2001, MNRAS, 322, 231
  • Kuhn et al. (2022) Kuhn L., Shubat M., Barmby P., 2022, MNRAS, 515, 807
  • Kurcz et al. (2016) Kurcz A., Bilicki M., Solarz A., Krupa M., Pollo A., Małek K., 2016, A&A, 592, A25
  • Laurikainen et al. (2007) Laurikainen E., Salo H., Buta R., Knapen J., 2007, MNRAS, 381, 401
  • Lavaux & Hudson (2011) Lavaux G., Hudson M. J., 2011, MNRAS, 416, 2840
  • Lee et al. (2013) Lee J. C., Hwang H. S., Ko J., 2013, ApJ, 774, 62
  • Lee et al. (2017) Lee G.-H., Hwang H. S., Sohn J., Lee M. G., 2017, ApJ, 835, 280
  • Li & Shen (2023) Li J., Shen Y., 2023, ApJ, 950, 122
  • Li et al. (2023) Li F., Wang J., Xu F., Kong X., Chen X., Lin Z., Wang S., 2023, ApJ, 950, 84
  • Liepold et al. (2023) Liepold E. R., Ma C.-P., Walsh J. L., 2023, ApJ, 945, L35
  • Liu et al. (2019) Liu D., et al., 2019, ApJS, 244, 40
  • Liu et al. (2022) Liu T., et al., 2022, A&A, 661, A5
  • López et al. (2023) López I. E., et al., 2023, A&A, 672, A137
  • Lyu & Rieke (2022a) Lyu J., Rieke G., 2022a, Universe, 8, 304
  • Lyu & Rieke (2022b) Lyu J., Rieke G. H., 2022b, ApJ, 940, L31
  • Lyu et al. (2019) Lyu J., Rieke G. H., Smith P. S., 2019, ApJ, 886, 33
  • Lyu et al. (2023) Lyu J., et al., 2023, arXiv e-prints, p. arXiv:2310.12330
  • Mainzer et al. (2011) Mainzer A., et al., 2011, ApJ, 731, 53
  • Makarov et al. (2014) Makarov D., Prugniel P., Terekhova N., Courtois H., Vauglin I., 2014, A&A, 570, A13
  • Maraston & Strömbäck (2011) Maraston C., Strömbäck G., 2011, MNRAS, 418, 2785
  • Maraston et al. (2013) Maraston C., et al., 2013, MNRAS, 435, 2764
  • Massaro et al. (2015) Massaro E., Maselli A., Leto C., Marchegiani P., Perri M., Giommi P., Piranomonte S., 2015, Ap&SS, 357, 75
  • Mateos et al. (2012) Mateos S., et al., 2012, MNRAS, 426, 3271
  • Mei et al. (2023) Mei S., et al., 2023, A&A, 670, A58
  • Meidt et al. (2012) Meidt S. E., et al., 2012, ApJ, 744, 17
  • Mendel et al. (2014) Mendel J. T., Simard L., Palmer M., Ellison S. L., Patton D. R., 2014, ApJS, 210, 3
  • Merloni et al. (2024) Merloni A., et al., 2024, A&A, 682, A34
  • Mezcua et al. (2018) Mezcua M., Hlavacek-Larrondo J., Lucey J. R., Hogan M. T., Edge A. C., McNamara B. R., 2018, MNRAS, 474, 1342
  • Michalik & Lindegren (2016) Michalik D., Lindegren L., 2016, A&A, 586, A26
  • Montero-Dorta et al. (2016) Montero-Dorta A. D., et al., 2016, MNRAS, 461, 1131
  • Morell et al. (2020) Morell D. F., Ribeiro A. L. B., de Carvalho R. R., Rembold S. B., Lopes P. A. A., Costa A. P., 2020, MNRAS, 494, 3317
  • Netzer (2015) Netzer H., 2015, ARA&A, 53, 365
  • Noda et al. (2020) Noda H., Kawamuro T., Kokubo M., Minezaki T., 2020, MNRAS, 495, 2921
  • Norris et al. (2014) Norris M. A., Meidt S., Van de Ven G., Schinnerer E., Groves B., Querejeta M., 2014, ApJ, 797, 55
  • Osorno et al. (2023) Osorno J., Nagar N., Richtler T., Humire P., Gebhardt K., Gultekin K., 2023, A&A, 679, A37
  • Padovani et al. (2017) Padovani P., et al., 2017, A&ARv, 25, 2
  • Pahre et al. (2004) Pahre M. A., Ashby M. L. N., Fazio G. G., Willner S. P., 2004, ApJS, 154, 235
  • Pesce et al. (2021) Pesce D. W., et al., 2021, ApJ, 923, 260
  • Pesce et al. (2022) Pesce D. W., Palumbo D. C. M., Ricarte A., Broderick A. E., Johnson M. D., Nagar N. M., Natarajan P., Gómez J. L., 2022, Galaxies, 10
  • Planck Collaboration et al. (2020) Planck Collaboration et al., 2020, A&A, 641, A6
  • Poitevineau et al. (2023) Poitevineau R., Castignani G., Combes F., 2023, A&A, 672, A164
  • Quilley & de Lapparent (2022) Quilley L., de Lapparent V., 2022, A&A, 666, A170
  • Quilley & de Lapparent (2023) Quilley L., de Lapparent V., 2023, A&A, 680, A49
  • Rakshit et al. (2021) Rakshit S., Stalin C. S., Kotilainen J., Shin J., 2021, ApJS, 253, 28
  • Ramakrishnan et al. (2023) Ramakrishnan V., et al., 2023, Galaxies, 11
  • Reback et al. (2022) Reback J., et al., 2022, pandas-dev/pandas: Pandas 1.4.2, Zenodo, doi:10.5281/zenodo.3509134
  • Runge & Walker (2021) Runge J., Walker S. A., 2021, MNRAS, 502, 5487
  • Sadler et al. (2014) Sadler E. M., Ekers R. D., Mahony E. K., Mauch T., Murphy T., 2014, MNRAS, 438, 796
  • Saglia et al. (2016) Saglia R. P., et al., 2016, ApJ, 818, 47
  • Sato-Polito et al. (2023) Sato-Polito G., Zaldarriaga M., Quataert E., 2023, arXiv e-prints, p. arXiv:2312.06756
  • Schutte et al. (2019) Schutte Z., Reines A. E., Greene J. E., 2019, ApJ, 887, 245
  • Seabold & Perktold (2010) Seabold S., Perktold J., 2010, in 9th Python in Science Conference.
  • Shankar et al. (2009) Shankar F., Weinberg D. H., Miralda-Escudé J., 2009, ApJ, 690, 20
  • Shankar et al. (2016) Shankar F., et al., 2016, MNRAS, 460, 3119
  • Singh et al. (2021) Singh S., et al., 2021, MNRAS, 504, 4143
  • Sofue (2016) Sofue Y., 2016, PASJ, 68, 2
  • Stern et al. (2012) Stern D., et al., 2012, ApJ, 753, 30
  • Suh (2021) Suh K.-W., 2021, ApJS, 256, 43
  • Tacconi et al. (2013) Tacconi L. J., et al., 2013, ApJ, 768, 74
  • Taylor (2005) Taylor M., 2005, ASPC, 347, 29
  • Thater et al. (2019) Thater S., Krajnović D., Cappellari M., Davis T. A., de Zeeuw P. T., McDermid R. M., Sarzi M., 2019, A&A, 625, A62
  • Tsai et al. (2015) Tsai C.-W., et al., 2015, ApJ, 805, 90
  • Vavilova et al. (2021) Vavilova I. B., Dobrycheva D. V., Vasylenko M. Y., Elyiv A. A., Melnyk O. V., Khramtsov V., 2021, A&A, 648, A122
  • Vavilova et al. (2022) Vavilova I. B., Khramtsov V., Dobrycheva D. V., Vasylenko M. Y., Elyiv A. A., Melnyk O. V., 2022, Kosmichna Nauka i Tekhnologiya, 28, 03
  • Véron-Cetty & Véron (2010) Véron-Cetty M. P., Véron P., 2010, A&A, 518, A10
  • Virtanen et al. (2020) Virtanen P., et al., 2020, Nature Methods, 17, 261
  • Wang et al. (2019) Wang L., Xu D., Gao L., Guo Q., Qu Y., Pan J., 2019, MNRAS, 485, 2083
  • Wang et al. (2021) Wang F., et al., 2021, ApJ, 907, L1
  • Waskom (2021) Waskom M. L., 2021, Journal of Open Source Software, 6, 3021
  • Whyte et al. (2002) Whyte L. F., Abraham R. G., Merrifield M. R., Eskridge P. B., Frogel J. A., Pogge R. W., 2002, MNRAS, 336, 1281
  • Willett et al. (2013) Willett K. W., et al., 2013, MNRAS, 435, 2835
  • Willmer (2018) Willmer C. N. A., 2018, ApJS, 236, 47
  • Wright et al. (2010) Wright E. L., et al., 2010, AJ, 140, 1868
  • Yao et al. (2020) Yao H. F. M., et al., 2020, ApJ, 903, 91
  • Yao et al. (2022) Yao H. F. M., et al., 2022, ApJ, 939, 26
  • Yao et al. (2023) Yao Y., et al., 2023, ApJ, 955, L6
  • van den Bosch (2016) van den Bosch R. C. E., 2016, ApJ, 831, 134
  • van den Bosch et al. (2015) van den Bosch R. C. E., Gebhardt K., Gültekin K., Yıldırım A., Walsh J. L., 2015, ApJS, 218, 10

Appendix A Diversity of objects in the WISE2MBH final sample

As stated in Secs. 4, 5 and 10.1, the object type is an optional input to the algorithm. The algorithm estimates multiple parameters or classifications for the objects, e.g., high and low bulge fractions (HBF and LBF) and high and low quality sources (HQS and LQS). An example comparing an LBF and a HBF object is shown in Fig. 14. Figure 15 contrasts two examples of HQS sources (top row) with two examples of LQS sources (bottom row).

Refer to caption
Figure 14: Top: WISE two-color images with FOV of 10’ of NGC 7626 (left; a HBF galaxy) and NGC 7773 (right; an LBF galaxy): blue represents the W2 band and red the W3 band. The W2--W3 color clearly distinguishes between the HBF and LBF galaxies. In each panel, the SDSS DR16 image of the galaxy is shown as an insert in the upper right corner for reference. Bottom: The corresponding WISE2MBH MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT probability density function (PDF) provided by our algorithm for each source in the top row. The red vertical line denotes our final (median) MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT value and the dashed vertical lines represent the 1σ1𝜎1\sigma1 italic_σ of the distribution; the reported values for the lower and upper values of MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT. The HBF galaxy has smaller MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT uncertainties as compared to the LBF galaxy; a trend seen in general for LBF galaxies, e.g., Fig. 9.
Refer to caption
Refer to caption
Figure 15: SDSS DR16 images of HQS (top) and LQS (bottom) examples. From left to right, at the top are NGC 4849 (HBF) and SDSS J082825.42+262350.4 (LBF). At bottom are SDSS J075226.52+251020.1 and SDSS J122309.61+251036.7, both are have final MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT upper limits.

Appendix B Comparison for various samples in W2-W3 color to T-type conversion

Our conversion of W2--W3 color to TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT was calibrated with galaxies in the 2MRS sample (Sect. 5). To explore the reliability and errors of this conversion, we used the same method of Sect. 5, but for other samples present in Hyperleda (Makarov et al., 2014).

Hyperleda includes the Huchra et al. (2012, 2MRS), Willett et al. (2013, GZ2), and Dobrycheva (2013) samples, which collectively represent more than 70%percent7070\%70 % of the available values for TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT in Hyperleda (25k, 310k, and 350k values, respectively). Each sample was tested following the same approach described in Sect. 5 for 2MRS, with results shown in Fig. 16.

The GZ2 sample exhibits a clear bimodality in T (bottom row of Fig. 16). The authors provide detailed classifications for spiral galaxies, allowing clear differentiation from Sa to almost Sd classifications (1 to 6 in TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT). However, in the morphological range from lenticular to elliptical, the level of detail is completely lost: all except 530 (i.e. 98%) of these sources are classified as TType=5subscript𝑇Type5T_{\text{Type}}=-5italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT = - 5.

Dobrycheva classified their sample galaxies into two bins: TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT equal --5 and 5 (i.e. ellipticals and spirals; bottom row of Fig. 16). Although this binary classification has demonstrated efficacy for machine learning training and subsequent classification of different samples (e.g., Vavilova et al., 2021, 2022), it does not provide the level of detail required for our W2--W3 color to TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT calibration. Nevertheless, we use this sample here only for comparison purposes.

In contrast, the 2MRS sample shows a relatively smooth and well populated distribution of TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT values (bottom row of Fig. 16).

Refer to caption
Figure 16: Top: As in Fig. 4 but for all samples described in Appendix B. Blue dots are slightly shifted in the Y-axis to distinguish between error bars. Colors follow the legend in the top left of the panel. Middle: Effect size (Essubscript𝐸𝑠E_{s}italic_E start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT) for every consecutive distribution of W2--W3 color, following the order of TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT. The position on the X-axis is the middle value between consecutive TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT. Bottom: TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT distributions of the three samples shown in the top panel are shown in colored histograms of the corresponding color. Black connected dots denote the sample sizes (N𝑁Nitalic_N) required to establish a distinction between consecutive TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT bins, as derived from the medians of the 2MRS W2--W3 distributions in each TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT bin.

The distributions (median values and 1σ1𝜎1\sigma1 italic_σ intervals) of W2--W3 for each TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT in 2MRS and GZ2 are presented in the top panel of Fig. 16. For Dobrycheva, only the medians are shown for comparison. The GZ2 fit shows a similar trend to 2MRS in the range of TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT 0 to 5, covering the same range but with smaller error bars. At the upper end, the fits differ by approximately 0.2 in W2--W3, which has a negligible impact on the final MBulgesubscript𝑀BulgeM_{\text{Bulge}}italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT estimates when comparing the GZ2 fit to 2MRS fit. At the lower end, there is a difference of almost 0.4 in W2--W3, leading to significant variations in the estimated B/T𝐵𝑇B/Titalic_B / italic_T. For a source with a W2--W3 color of 1, the 2MRS fit gives a B/T0.5similar-to𝐵𝑇0.5B/T\sim 0.5italic_B / italic_T ∼ 0.5 estimate, while the GZ2 fit results in a B/T1similar-to𝐵𝑇1B/T\sim 1italic_B / italic_T ∼ 1, corresponding to a difference of 0.3 dex in the MBulgesubscript𝑀BulgeM_{\text{Bulge}}italic_M start_POSTSUBSCRIPT Bulge end_POSTSUBSCRIPT estimates, which are the first estimates affected by the B/T𝐵𝑇B/Titalic_B / italic_T value. These systematic changes in fit have significant implications for the final estimates and the overall statistics of the sample. Due to the lack of detail and biased representation, the GZ2 sample and its fit were discarded from our analysis. For the Dobrycheva sample, elliptical galaxies exhibit distinct shifts compared to both 2MRS and GZ2, primarily due to the binarity of the classification used.

Therefore, it is crucial to use a sample that is large enough to accurately discriminate between TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT and then to rely on the trend shown by that specific sample. We defined this ‘large enough’ sample size to be N𝑁Nitalic_N, and determined it to be approximately N700similar-to𝑁700N\sim 700italic_N ∼ 700 per bin by statistical power analysis, which is a widely used statistical tool for sample size determination in meta-analyses (e.g., Borenstein et al., 2009; Grundler et al., 2022). In statistical power analysis, the three parameters to be set are statistical power, significance threshold, and effect size. The statistical power (P𝑃Pitalic_P) is often defined as the probability that a study can correctly detect a real effect (i.e., probability of avoiding a Type II error). The significance threshold (α𝛼\alphaitalic_α) is defined as the highest level of acceptable risk in terms of incorrectly rejecting a null hypothesis that is actually true (i.e., Type I error probability). Effect size (Essubscript𝐸𝑠E_{s}italic_E start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT) is a standardized technique available to measure the expected strength of the results in a study, regardless of the sample size. This Essubscript𝐸𝑠E_{s}italic_E start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT can only be calculated for two samples, so in multisample scenarios this has to be calculated for each pair of samples in a predefined order, e.g., if the samples represent an evolution from 0 to 10 in a quantity, the pairs to calculate Essubscript𝐸𝑠E_{s}italic_E start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT must be 0--1, 1--2, and so on.

For this analysis, we used the median values of the W2--W3 distribution for each TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT value in 2MRS, with parameters; P=0.8𝑃0.8P=0.8italic_P = 0.8, α=0.05𝛼0.05\alpha=0.05italic_α = 0.05 and Es=0.15subscript𝐸𝑠0.15E_{s}=0.15italic_E start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = 0.15. However, the value of Essubscript𝐸𝑠E_{s}italic_E start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT varies for each pair or consecutive distribution of W2--W3 following the order of TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT, as shown in the middle plot of Fig. 16. The median value of Essubscript𝐸𝑠E_{s}italic_E start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT was approximately 0.17, which corresponds to N550similar-to𝑁550N\sim 550italic_N ∼ 550. To ensure stricter statistical power, we reduced Essubscript𝐸𝑠E_{s}italic_E start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT to 0.15. For the lower and upper ends of TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT, Essubscript𝐸𝑠E_{s}italic_E start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT can go as low as 0.001, requiring a sample size of N107𝑁superscript107N\geq 10^{7}italic_N ≥ 10 start_POSTSUPERSCRIPT 7 end_POSTSUPERSCRIPT to confidently claim that the consecutive distributions are, in fact, two distinct populations and not two samples from the same parent sample. However, the chosen value of Essubscript𝐸𝑠E_{s}italic_E start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT is sufficiently strict to clearly discriminate between each distribution of W2--W3 for TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT ranging from -3 to 6.

In the bottom plots of Fig. 16 one can see that the required N𝑁Nitalic_N (black dots) is surpassed in most cases by the three samples (colored histograms), but notoriously larger samples are needed for the most extreme values of TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT. It is clear that the results obtained from the 2MRS statistical power analysis are not directly applicable to limit the use of other samples, but similar median values are expected for the whole population of galaxies and the different TTypesubscript𝑇TypeT_{\text{Type}}italic_T start_POSTSUBSCRIPT Type end_POSTSUBSCRIPT between samples, thus needing similar N𝑁Nitalic_N, independent of the sample used.

We decided to use 2MRS over the other samples tested due to the completeness in the lenticular-spiral regime and since both 2MRS and GZ2 samples showed similar behavior in the late-type regime, even though 2MRS is almost 10 times smaller in overall sample size.

Appendix C Correlations and scatter between WISE2MBH and SDSS estimates of stellar mass

Refer to caption
Figure 17: Correlation (top) and scatter (bottom) matrices when comparing WISE2MBH Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT estimates with those from Montero-Dorta et al. (2016, Gra_Early and Gra_Wide), Maraston & Strömbäck (2011, Port_Passive and Port_SF) and Chen et al. (2012, Wis_BC03 and Wis_M11) for three redshift ranges of z<0.1𝑧0.1z<0.1italic_z < 0.1 (left), z<0.3𝑧0.3z<0.3italic_z < 0.3 (middle) and z<0.5𝑧0.5z<0.5italic_z < 0.5 (right). Boxes are colored following the colorbar on top of each panel; for the scatter matrices (bottom row) the colorbar is inverted for easier understanding.

Since deriving Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT is the first step towards estimating MBHsubscript𝑀BHM_{\text{BH}}italic_M start_POSTSUBSCRIPT BH end_POSTSUBSCRIPT, the WISE-estimated Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT must be accurate as compared to estimates obtained with different homogeneous methods applied to very large samples. We found that using only the C14 method, or only the J23 method, did not lead to optimal results when comparing to literature values. Instead combining the Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT estimation from C14 with prior K-corrections derived by J23 gave the most accurate results.

To test for bias and consistency, we compare the WISE2MBH derived Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT estimates to those of three different value added catalogues (VACs) of Msubscript𝑀M_{*}italic_M start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT available via SDSS: Portsmouth group (Maraston et al., 2013) who used SED fitting with passive and star-forming templates with the Kroupa initial mass function (IMF, Kroupa, 2001), Wisconsin group (Chen et al., 2012) who used the principal component analysis (PCA) method in the optical rest-frame spectral region (3700--5500 Å) with two different single stellar population models from Bruzual & Charlot (2003, BC03) and Maraston & Strömbäck (2011, M11), and finally the Granada group (Montero-Dorta et al., 2016) who used the flexible stellar population synthesis (FSPS) method for early and wide formation times.

The correlation and scatter matrices considering all catalogues mentioned above and WISE2MBH for three different redshift bins are presented in Fig. 17.

For correlations (upper row of Fig. 17), Spearman scores range from 0.60.9similar-toabsent0.60.9\sim 0.6-0.9∼ 0.6 - 0.9 for almost every method considered in the lowest redshift bin (z<0.1𝑧0.1z<0.1italic_z < 0.1). When considering a larger redshift bin (z<0.3𝑧0.3z<0.3italic_z < 0.3), all correlation scores decreased, particularly for methods that take into account completely different methodologies, such as Port_SF and Gra_Early. When considering the entire redshift range (z<0.5𝑧0.5z<0.5italic_z < 0.5), the scores decreased once again, now down to 0.340.340.340.34 for the case of WISE2MBH and Gra_Wide.

The worst scores are found between Gra_Wide and Port_SF in the z<0.1𝑧0.1z<0.1italic_z < 0.1 (score = 0.620.620.620.62) and z<0.3𝑧0.3z<0.3italic_z < 0.3 (score = 0.470.470.470.47) bins. Port_SF performs the worst in intercomparisons in the z<0.3𝑧0.3z<0.3italic_z < 0.3 bin. WISE2MBH performs well compared to SDSS VACs except in the highest (z<0.5𝑧0.5z<0.5italic_z < 0.5) redshift bin.

When considering the scatter between multiple methods (lower row of Fig. 17) WISE2MBH shows the lowest RMSE across all the methods and redshift bins, i.e., it is the method which overall most closely agrees with all VACs on average. For WISE2MBH and Port_Passive, the RMSE is 0.240.240.240.24 for z<0.1𝑧0.1z<0.1italic_z < 0.1 and even lower (0.220.220.220.22) when considering the entire redshift range. However, for WISE2MBH and Gra_Early, the RMSE increases with redshift.

For the specific case of WISE2MBH and Gra_Early, both correlation and scatter got worse with increasing redshift. The correlation starts reasonably well (score = 0.760.760.760.76) in the lowest redshift bin, then becomes one of the worst scores in the entire z<0.5𝑧0.5z<0.5italic_z < 0.5 bin (score = 0.440.440.440.44). The same pattern is seen in the scatter, which increases from RMSE of 0.320.320.320.32 to 0.480.480.480.48, the latter being one of the highest RMSE in the entire matrix. In Fig. 6 it is clear that there is a systematic shift with redshift that causes the low correlation score. It is the only pair that shows this strong tendency to get worse in both metrics with increasing redshift.