-
The Extended Mapping Obscuration to Reionization with ALMA (Ex-MORA) Survey: 5$σ$ Source Catalog and Redshift Distribution
Authors:
Arianna S. Long,
Caitlin M. Casey,
Jed McKinney,
Jorge A. Zavala,
Hollis B. Akins,
Olivia R. Cooper,
Matthieu Bethermin Erini L. Lambrides,
Maximilien Franco,
Karina Caputi,
Jaclyn B. Champagne,
Allison W. S. Man,
Ezequiel Treister,
Sinclaire M. Manning,
David B. Sanders,
Margherita Talia,
Manuel Aravena,
D. L. Clements,
Elisabete da Cunha,
Andreas L. Faisst,
Fabrizio Gentile,
Jacqueline Hodge,
Gabriel Brammer,
Marcella Brusa,
Steven L. Finkelstein,
Seiji Fujimoto
, et al. (19 additional authors not shown)
Abstract:
One of the greatest challenges in galaxy evolution over the last decade has been constraining the prevalence of heavily dust-obscured galaxies in the early Universe. At $z>3$, these galaxies are increasingly rare, and difficult to identify as they are interspersed among the more numerous dust-obscured galaxy population at $z=1-3$, making efforts to secure confident spectroscopic redshifts expensiv…
▽ More
One of the greatest challenges in galaxy evolution over the last decade has been constraining the prevalence of heavily dust-obscured galaxies in the early Universe. At $z>3$, these galaxies are increasingly rare, and difficult to identify as they are interspersed among the more numerous dust-obscured galaxy population at $z=1-3$, making efforts to secure confident spectroscopic redshifts expensive, and sometimes unsuccessful. In this work, we present the Extended Mapping Obscuration to Reionization with ALMA (Ex-MORA) Survey -- a 2mm blank-field survey in the COSMOS-Web field, and the largest ever ALMA blank-field survey to-date covering 577 arcmin$^2$. Ex-MORA is an expansion of the MORA survey designed to identify primarily $z>3$ dusty, star-forming galaxies while simultaneously filtering out the more numerous $z<3$ population by leveraging the very negative $K$-correction at observed-frame 2mm. We identify 37 significant ($>$5$σ$) sources, 33 of which are robust thermal dust emitters. We measure a median redshift of $\langle z \rangle = 3.6^{+0.1}_{-0.2}$, with two-thirds of the sample at $z>3$, and just under half at $z>4$, demonstrating the overall success of the 2mm-selection technique. The integrated $z>3$ volume density of Ex-MORA sources is $\sim1-3\times10^{-5}$ Mpc$^{-3}$, consistent with other surveys of infrared luminous galaxies at similar epochs. We also find that techniques using rest-frame optical emission (or lack thereof) to identify $z>3$ heavily dust-obscured galaxies miss at least half of Ex-MORA galaxies. This supports the idea that the dusty galaxy population is heterogeneous, and that synergies across observatories spanning multiple energy regimes are critical to understanding their formation and evolution at $z>3$.
△ Less
Submitted 26 August, 2024;
originally announced August 2024.
-
Stripe 82-XL: the $\sim$54.8 deg$^2$ and $\sim$18.8 Ms Chandra and XMM-Newton point source catalog and number of counts
Authors:
Alessandro Peca,
Nico Cappelluti,
Stephanie LaMassa,
C. Megan Urry,
Massimo Moscetti,
Stefano Marchesi,
David Sanders,
Connor Auge,
Aritra Ghosh,
Tonima Tasnim Ananna,
Núria Torres-Albà,
Ezequiel Treister
Abstract:
We present an enhanced version of the publicly-available Stripe 82X catalog (S82-XL), featuring a comprehensive set of 22,737 unique X-ray point sources identified with a significance $\gtrsim 4σ$. This catalog is four times larger than the original Stripe 82X catalog, by including additional archival data from the Chandra and XMM-Newton telescopes. Now covering $\sim54.8$ deg$^2$ of non-overlappi…
▽ More
We present an enhanced version of the publicly-available Stripe 82X catalog (S82-XL), featuring a comprehensive set of 22,737 unique X-ray point sources identified with a significance $\gtrsim 4σ$. This catalog is four times larger than the original Stripe 82X catalog, by including additional archival data from the Chandra and XMM-Newton telescopes. Now covering $\sim54.8$ deg$^2$ of non-overlapping sky area, the S82-XL catalog roughly doubles the area and depth of the original catalog, with limiting fluxes (half-area fluxes) of 3.4$\times 10^{-16}$ (2.4$\times 10^{-15}$), 2.9$\times 10^{-15}$ (1.5$\times 10^{-14}$), and 1.4$\times 10^{-15}$ (9.5$\times 10^{-15}$) erg s$^{-1}$ cm$^{-2}$ across the soft (0.5-2 keV), hard (2-10 keV), and full (0.5-10 keV) bands, respectively. S82-XL occupies a unique region of flux-area parameter space compared to other X-ray surveys, identifying sources with rest-frame luminosities from $1.2\times 10^{38}$ to $1.6\times 10^{47}$ erg s$^{-1}$ in the 2-10 keV band (median X-ray luminosity, $7.2\times 10^{43}$ erg s$^{-1}$), and spectroscopic redshifts up to $z\sim6$. By using hardness ratios, we derived Active Galactic Nuclei (AGNs) obscuration obtaining a median value of $N_H=21.6_{-1.6}^{+1.0}$, and an overall, obscured fraction ($\log N_H/\mathrm{cm^{-2}}>22$) of $\sim 36.9\%$. S82-XL serves as a benchmark in X-ray surveys and, with its extensive multiwavelength data, is especially valuable for comprehensive studies of luminous AGNs.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Wavelet Convolutions for Large Receptive Fields
Authors:
Shahaf E. Finder,
Roy Amoyal,
Eran Treister,
Oren Freifeld
Abstract:
In recent years, there have been attempts to increase the kernel size of Convolutional Neural Nets (CNNs) to mimic the global receptive field of Vision Transformers' (ViTs) self-attention blocks. That approach, however, quickly hit an upper bound and saturated way before achieving a global receptive field. In this work, we demonstrate that by leveraging the Wavelet Transform (WT), it is, in fact,…
▽ More
In recent years, there have been attempts to increase the kernel size of Convolutional Neural Nets (CNNs) to mimic the global receptive field of Vision Transformers' (ViTs) self-attention blocks. That approach, however, quickly hit an upper bound and saturated way before achieving a global receptive field. In this work, we demonstrate that by leveraging the Wavelet Transform (WT), it is, in fact, possible to obtain very large receptive fields without suffering from over-parameterization, e.g., for a $k \times k$ receptive field, the number of trainable parameters in the proposed method grows only logarithmically with $k$. The proposed layer, named WTConv, can be used as a drop-in replacement in existing architectures, results in an effective multi-frequency response, and scales gracefully with the size of the receptive field. We demonstrate the effectiveness of the WTConv layer within ConvNeXt and MobileNetV2 architectures for image classification, as well as backbones for downstream tasks, and show it yields additional properties such as robustness to image corruption and an increased response to shapes over textures. Our code is available at https://github.com/BGU-CS-VIL/WTConv.
△ Less
Submitted 15 July, 2024; v1 submitted 8 July, 2024;
originally announced July 2024.
-
Graph Neural Reaction Diffusion Models
Authors:
Moshe Eliasof,
Eldad Haber,
Eran Treister
Abstract:
The integration of Graph Neural Networks (GNNs) and Neural Ordinary and Partial Differential Equations has been extensively studied in recent years. GNN architectures powered by neural differential equations allow us to reason about their behavior, and develop GNNs with desired properties such as controlled smoothing or energy conservation. In this paper we take inspiration from Turing instabiliti…
▽ More
The integration of Graph Neural Networks (GNNs) and Neural Ordinary and Partial Differential Equations has been extensively studied in recent years. GNN architectures powered by neural differential equations allow us to reason about their behavior, and develop GNNs with desired properties such as controlled smoothing or energy conservation. In this paper we take inspiration from Turing instabilities in a Reaction Diffusion (RD) system of partial differential equations, and propose a novel family of GNNs based on neural RD systems. We \textcolor{black}{demonstrate} that our RDGNN is powerful for the modeling of various data types, from homophilic, to heterophilic, and spatio-temporal datasets. We discuss the theoretical properties of our RDGNN, its implementation, and show that it improves or offers competitive performance to state-of-the-art methods.
△ Less
Submitted 16 June, 2024;
originally announced June 2024.
-
Global-Local Graph Neural Networks for Node-Classification
Authors:
Moshe Eliasof,
Eran Treister
Abstract:
The task of graph node classification is often approached by utilizing a local Graph Neural Network (GNN), that learns only local information from the node input features and their adjacency. In this paper, we propose to improve the performance of node classification GNNs by utilizing both global and local information, specifically by learning label- and node- features. We therefore call our metho…
▽ More
The task of graph node classification is often approached by utilizing a local Graph Neural Network (GNN), that learns only local information from the node input features and their adjacency. In this paper, we propose to improve the performance of node classification GNNs by utilizing both global and local information, specifically by learning label- and node- features. We therefore call our method Global-Local-GNN (GLGNN). To learn proper label features, for each label, we maximize the similarity between its features and nodes features that belong to the label, while maximizing the distance between nodes that do not belong to the considered label. We then use the learnt label features to predict the node classification map. We demonstrate our GLGNN using three different GNN backbones, and show that our approach improves baseline performance, revealing the importance of global information utilization for node classification.
△ Less
Submitted 16 June, 2024;
originally announced June 2024.
-
Physics-guided Full Waveform Inversion using Encoder-Solver Convolutional Neural Networks
Authors:
Matan Goren,
Eran Treister
Abstract:
Full Waveform Inversion (FWI) is an inverse problem for estimating the wave velocity distribution in a given domain, based on observed data on the boundaries. The inversion is computationally demanding because we are required to solve multiple forward problems, either in time or frequency domains, to simulate data that are then iteratively fitted to the observed data. We consider FWI in the freque…
▽ More
Full Waveform Inversion (FWI) is an inverse problem for estimating the wave velocity distribution in a given domain, based on observed data on the boundaries. The inversion is computationally demanding because we are required to solve multiple forward problems, either in time or frequency domains, to simulate data that are then iteratively fitted to the observed data. We consider FWI in the frequency domain, where the Helmholtz equation is used as a forward model, and its repeated solution is the main computational bottleneck of the inversion process. To ease this cost, we integrate a learning process of an encoder-solver preconditioner that is based on convolutional neural networks (CNNs). The encoder-solver is trained to effectively precondition the discretized Helmholtz operator given velocity medium parameters. Then, by re-training the CNN between the iterations of the optimization process, the encoder-solver is adapted to the iteratively evolving velocity medium as part of the inversion. Without retraining, the performance of the solver deteriorates as the medium changes. Using our light retraining procedures, we obtain the forward simulations effectively throughout the process. We demonstrate our approach to solving FWI problems using 2D geophysical models with high-frequency data.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
The NuSTAR Serendipitous Survey: the 80-month catalog and source properties of the high-energy emitting AGN and quasar population
Authors:
Claire L. Greenwell,
Lizelke Klindt,
George B. Lansbury,
David J. Rosario,
David M. Alexander,
James Aird,
Daniel Stern,
Karl Forster,
Michael J. Koss,
Franz E. Bauer,
Claudio Ricci,
John Tomsick,
William N. Brandt,
Thomas Connor,
Peter G. Boorman,
Adlyka Annuar,
David R. Ballantyne,
Chien-Ting Chen,
Francesca Civano,
Andrea Comastri,
Victoria A. Fawcett,
Francesca M. Fornasini,
Poshak Gandhi,
Fiona Harrison,
Marianne Heida
, et al. (10 additional authors not shown)
Abstract:
We present a catalog of hard X-ray serendipitous sources detected in the first 80 months of observations by the Nuclear Spectroscopic Telescope Array (NuSTAR). The NuSTAR serendipitous survey 80-month (NSS80) catalog has an unprecedented $\sim$ 62 Ms of effective exposure time over 894 unique fields (a factor of three increase over the 40-month catalog), with an areal coverage of $\sim $36 deg…
▽ More
We present a catalog of hard X-ray serendipitous sources detected in the first 80 months of observations by the Nuclear Spectroscopic Telescope Array (NuSTAR). The NuSTAR serendipitous survey 80-month (NSS80) catalog has an unprecedented $\sim$ 62 Ms of effective exposure time over 894 unique fields (a factor of three increase over the 40-month catalog), with an areal coverage of $\sim $36 deg$^2$, larger than all NuSTAR extragalactic surveys. NSS80 provides 1274 hard X-ray sources in the $3-24$ keV band (822 new detections compared to the previous 40-month catalog). Approximately 76% of the NuSTAR sources have lower-energy ($<10$ keV) X-ray counterparts from Chandra, XMM-Newton, and Swift-XRT. We have undertaken an extensive campaign of ground-based spectroscopic follow-up to obtain new source redshifts and classifications for 427 sources. Combining these with existing archival spectroscopy provides redshifts for 550 NSS80 sources, of which 547 are classified. The sample is primarily composed of active galactic nuclei (AGN), detected over a large range in redshift ($z$ = 0.012-3.43), but also includes 58 spectroscopically confirmed Galactic sources. In addition, five AGN/galaxy pairs, one dual AGN system, one BL Lac candidate, and a hotspot of 4C 74.26 (radio quasar) have been identified. The median rest-frame $10-40$ keV luminosity and redshift of the NSS80 are $\langle{L_\mathrm{10-40 keV}}\rangle$ = 1.2 $\times$ 10$^{44}$ erg s$^{-1}$ and $\langle z \rangle = 0.56$. We investigate the optical properties and construct composite optical spectra to search for subtle signatures not present in the individual spectra, finding an excess of redder BL AGN compared to optical quasar surveys predominantly due to the presence of the host-galaxy and, at least in part, due to dust obscuration.
△ Less
Submitted 26 April, 2024;
originally announced April 2024.
-
Stellar Abundances at the Center of Early Type Galaxies with Fine Structure
Authors:
Nicholas Barth,
George C. Privon,
Rana Ezzeddine,
Aaron S. Evans,
Ezequiel Treister
Abstract:
Our understanding of early-type galaxies (ETGs) has grown in the past decade with the advance of full-spectrum fitting techniques used to infer the properties of the stellar populations that make-up the galaxy. We present ages, central velocity dispersions, and abundance ratios relative to Fe of C, N, O, Mg, Si, Ca, Ti, Cr, Mn, Co, Ni, Cu, Sr, Ba, and Eu, derived using full-spectrum fitting techni…
▽ More
Our understanding of early-type galaxies (ETGs) has grown in the past decade with the advance of full-spectrum fitting techniques used to infer the properties of the stellar populations that make-up the galaxy. We present ages, central velocity dispersions, and abundance ratios relative to Fe of C, N, O, Mg, Si, Ca, Ti, Cr, Mn, Co, Ni, Cu, Sr, Ba, and Eu, derived using full-spectrum fitting techniques for three ETGs NGC 2865, NGC 3818, and NGC 4915. Each of these three galaxies were selected because they have optical, disturbed structures (fine structure) that are linked to major merger events that occurred 1, 7, and 6 Gyr ago, respectively. Two of the ETGs, NGC 3818 and NGC 4915, show chemical signatures similar to ETGs without fine structure, which is consistent with a gas-poor merger of elliptical galaxies in which substantial star formation is not expected. For NGC 2865, we find a statistically higher abundance of Ca (an $α$-element) and Cr and Mn (Fe-peak elements). We show that for NGC 2865, a simple gas-rich merger scenario fails to explain the larger abundance ratios compared to ETGs without fine structure. These three early-type galaxies with fine structure exhibit a range of abundances, suggesting ETGs with fine structure can form via multiple pathways and types of galaxy mergers.
△ Less
Submitted 20 April, 2024;
originally announced April 2024.
-
Distribution of merging and post-merging galaxies in nearby galaxy clusters
Authors:
Duho Kim,
Yun-Kyeong Sheen,
Yara L. Jaffé,
Kshitija Kelkar,
Adarsh Ranjan,
Franco Piraino-Cerda,
Jacob P. Crossett,
Ana Carolina Costa Lourenço,
Garreth Martin,
Julie B. Nantais,
Ricardo Demarco,
Ezequiel Treister,
Sukyoung K. Yi
Abstract:
We study the incidence and spatial distribution of galaxies that are currently undergoing gravitational merging (M) or that have signs of a post merger (PM) in six galaxy clusters (A754, A2399, A2670, A3558, A3562, and A3716) within the redshift range, 0.05$\lesssim$$z$$\lesssim$0.08. To this aim, we obtained Dark Energy Camera (DECam) mosaics in $u^{\prime}$, $g^{\prime}$, and $r^{\prime}$-bands…
▽ More
We study the incidence and spatial distribution of galaxies that are currently undergoing gravitational merging (M) or that have signs of a post merger (PM) in six galaxy clusters (A754, A2399, A2670, A3558, A3562, and A3716) within the redshift range, 0.05$\lesssim$$z$$\lesssim$0.08. To this aim, we obtained Dark Energy Camera (DECam) mosaics in $u^{\prime}$, $g^{\prime}$, and $r^{\prime}$-bands covering up to $3\times R_{200}$ of the clusters, reaching 28 mag/arcsec$^2$ surface brightness limits. We visually inspect $u^{\prime}$$g^{\prime}$$r^{\prime}$ color-composite images of volume-limited ($M_r < -20$) cluster-member galaxies to identify whether galaxies are of M or PM types. We find 4% M-type and 7% PM-type galaxies in the galaxy clusters studied. By adding spectroscopic data and studying the projected phase space diagram (PPSD) of the projected clustocentric radius and the line-of-sight velocity, we find that PM-type galaxies are more virialized than M-type galaxies, having 1--5% point higher fraction within the escape-velocity region, while the fraction of M-type was $\sim$10% point higher than PM-type in the intermediate environment. Similarly, on a substructure analysis, M types were found in the outskirt groups, while PM types populated groups in ubiquitous regions of the PPSD. Adopting literature-derived dynamical state indicator values, we observed a higher abundance of M types in dynamically relaxed clusters. This finding suggests that galaxies displaying post-merging features within clusters likely merged in low-velocity environments, including cluster outskirts and dynamically relaxed clusters.
△ Less
Submitted 3 May, 2024; v1 submitted 11 March, 2024;
originally announced March 2024.
-
Feedback and ionized gas outflows in four low-radio power AGN at z $\sim$0.15
Authors:
L. Ulivi,
G. Venturi,
G. Cresci,
A. Marconi,
C. Marconcini,
A. Amiri,
F. Belfiore,
E. Bertola,
S. Carniani,
Q. D Amato,
E. Di Teodoro,
M. Ginolfi,
A. Girdhar,
C. Harrison,
R. Maiolino,
F. Mannucci,
M. Mingozzi,
M. Perna,
M. Scialpi,
N. Tomicic,
G. Tozzi,
E. Treister
Abstract:
An increasing number of observations and simulations suggests that low-power (<10$^{44}$ erg s$^{-1}$) jets may be a significant channel of feedback produced by active galactic nuclei (AGN), but little is known about their actual effect on their host galaxies from the observational point of view. We targeted four luminous type 2 AGN hosting moderately powerful radio emission ($\sim$10$^{44}$ erg s…
▽ More
An increasing number of observations and simulations suggests that low-power (<10$^{44}$ erg s$^{-1}$) jets may be a significant channel of feedback produced by active galactic nuclei (AGN), but little is known about their actual effect on their host galaxies from the observational point of view. We targeted four luminous type 2 AGN hosting moderately powerful radio emission ($\sim$10$^{44}$ erg s$^{-1}$), two of which and possibly a third are associated with jets, with optical integral field spectroscopy observations from the Multi Unit Spectroscopic Explorer (MUSE) at the Very Large Telescope (VLT) to analyze the properties of their ionized gas as well as the properties and effects of ionized outflows. We combined these observations with Very Large Array (VLA) and e-MERLIN data to investigate the relations and interactions between the radio jets and host galaxies. We detected ionized outflows as traced by the fast bulk motion of the gas. The outflows extended over kiloparsec scales in the direction of the jet, when present. In the two sources with resolved radio jets, we detected a strong enhancement in the emission-line velocity dispersion (up to 1000 km s$^{-1}$) perpendicular to the direction of the radio jets. We also found a correlation between the mass and the energetics of this high-velocity dispersion gas and the radio power, which supports the idea that the radio emission may cause the enhanced turbulence. This phenomenon, which is now being observed in an increasing number of objects, might represent an important channel for AGN feedback on galaxies.
△ Less
Submitted 2 March, 2024;
originally announced March 2024.
-
An Over Complete Deep Learning Method for Inverse Problems
Authors:
Moshe Eliasof,
Eldad Haber,
Eran Treister
Abstract:
Obtaining meaningful solutions for inverse problems has been a major challenge with many applications in science and engineering. Recent machine learning techniques based on proximal and diffusion-based methods have shown promising results. However, as we show in this work, they can also face challenges when applied to some exemplary problems. We show that similar to previous works on over-complet…
▽ More
Obtaining meaningful solutions for inverse problems has been a major challenge with many applications in science and engineering. Recent machine learning techniques based on proximal and diffusion-based methods have shown promising results. However, as we show in this work, they can also face challenges when applied to some exemplary problems. We show that similar to previous works on over-complete dictionaries, it is possible to overcome these shortcomings by embedding the solution into higher dimensions. The novelty of the work proposed is that we jointly design and learn the embedding and the regularizer for the embedding vector. We demonstrate the merit of this approach on several exemplary and common inverse problems.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
On The Temporal Domain of Differential Equation Inspired Graph Neural Networks
Authors:
Moshe Eliasof,
Eldad Haber,
Eran Treister,
Carola-Bibiane Schönlieb
Abstract:
Graph Neural Networks (GNNs) have demonstrated remarkable success in modeling complex relationships in graph-structured data. A recent innovation in this field is the family of Differential Equation-Inspired Graph Neural Networks (DE-GNNs), which leverage principles from continuous dynamical systems to model information flow on graphs with built-in properties such as feature smoothing or preservat…
▽ More
Graph Neural Networks (GNNs) have demonstrated remarkable success in modeling complex relationships in graph-structured data. A recent innovation in this field is the family of Differential Equation-Inspired Graph Neural Networks (DE-GNNs), which leverage principles from continuous dynamical systems to model information flow on graphs with built-in properties such as feature smoothing or preservation. However, existing DE-GNNs rely on first or second-order temporal dependencies. In this paper, we propose a neural extension to those pre-defined temporal dependencies. We show that our model, called TDE-GNN, can capture a wide range of temporal dynamics that go beyond typical first or second-order methods, and provide use cases where existing temporal models are challenged. We demonstrate the benefit of learning the temporal dependencies using our method rather than using pre-defined temporal dynamics on several graph benchmarks.
△ Less
Submitted 19 January, 2024;
originally announced January 2024.
-
The determinant of the Laplacian matrix of a quaternion unit gain graph
Authors:
Ivan I. Kyrchei,
Eran Treister,
Volodymyr O. Pelykh
Abstract:
A quaternion unit gain graph is a graph where each orientation of an edge is given a quaternion unit, and the opposite orientation is assigned the inverse of this quaternion unit. In this paper, we provide a combinatorial description of the determinant of the Laplacian matrix of a quaternion unit gain graph by using row-column noncommutative determinants recently introduced by one of the authors.…
▽ More
A quaternion unit gain graph is a graph where each orientation of an edge is given a quaternion unit, and the opposite orientation is assigned the inverse of this quaternion unit. In this paper, we provide a combinatorial description of the determinant of the Laplacian matrix of a quaternion unit gain graph by using row-column noncommutative determinants recently introduced by one of the authors. A numerical example is presented for illustrating our results.
△ Less
Submitted 15 December, 2023;
originally announced December 2023.
-
BASS XLII: The relation between the covering factor of dusty gas and the Eddington ratio in nearby active galactic nuclei
Authors:
C. Ricci,
K. Ichikawa,
M. Stalevski,
T. Kawamuro,
S. Yamada,
Y. Ueda,
R. Mushotzky,
G. C. Privon,
M. J. Koss,
B. Trakhtenbrot,
A. C. Fabian,
L. C. Ho,
D. Asmus,
F. E. Bauer,
C. S. Chang,
K. K. Gupta,
K. Oh,
M. Powell,
R. W. Pfeifle,
A. Rojas,
F. Ricci,
M. J. Temple,
Y. Toba,
A. Tortosa,
E. Treister
, et al. (3 additional authors not shown)
Abstract:
Accreting supermassive black holes (SMBHs) located at the center of galaxies are typically surrounded by large quantities of gas and dust. The structure and evolution of this circumnuclear material can be studied at different wavelengths, from the submillimeter to the X-rays. Recent X-ray studies have shown that the covering factor of the obscuring material tends to decrease with increasing Edding…
▽ More
Accreting supermassive black holes (SMBHs) located at the center of galaxies are typically surrounded by large quantities of gas and dust. The structure and evolution of this circumnuclear material can be studied at different wavelengths, from the submillimeter to the X-rays. Recent X-ray studies have shown that the covering factor of the obscuring material tends to decrease with increasing Eddington ratio, likely due to radiative feedback on dusty gas. Here we study a sample of 549 nearby (z<0.1) hard X-ray (14-195 keV) selected non-blazar active galactic nuclei (AGN), and use the ratio between the AGN infrared and bolometric luminosity as a proxy of the covering factor. We find that, in agreement with what has been found by X-ray studies of the same sample, the covering factor decreases with increasing Eddington ratio. We also confirm previous findings which showed that obscured AGN typically have larger covering factors than unobscured sources. Finally, we find that the median covering factors of AGN located in different regions of the column density-Eddington ratio diagram are in good agreement with what would be expected from a radiation-regulated growth of SMBHs.
△ Less
Submitted 11 November, 2023; v1 submitted 2 November, 2023;
originally announced November 2023.
-
Complex AGN feedback in the Teacup galaxy. A powerful ionised galactic outflow, jet-ISM interaction, and evidence for AGN-triggered star formation in a giant bubble
Authors:
G. Venturi,
E. Treister,
C. Finlez,
G. D'Ago,
F. Bauer,
C. M. Harrison,
C. Ramos Almeida,
M. Revalski,
F. Ricci,
L. F. Sartori,
A. Girdhar,
W. C. Keel,
D. Tubín
Abstract:
The $z$~0.1 type-2 QSO J1430+1339 (the 'Teacup') is a complex galaxy showing a loop of ionised gas ~10 kpc in diameter, co-spatial radio bubbles, a compact (~1 kpc) jet, and outflow activity. We used VLT/MUSE optical integral field spectroscopic observations to characterise the properties and effects of the galactic ionised outflow from kpc up to tens of kpc scales and compare them with those of t…
▽ More
The $z$~0.1 type-2 QSO J1430+1339 (the 'Teacup') is a complex galaxy showing a loop of ionised gas ~10 kpc in diameter, co-spatial radio bubbles, a compact (~1 kpc) jet, and outflow activity. We used VLT/MUSE optical integral field spectroscopic observations to characterise the properties and effects of the galactic ionised outflow from kpc up to tens of kpc scales and compare them with those of the radio jet. We detect a velocity dispersion enhancement (>300 km/s) elongated over several kpc perpendicular to the radio jet, the AGN ionisation lobes, and the fast outflow, similar to what is found in other galaxies hosting compact, low-power jets, indicating that the jet strongly perturbs the host ISM. The mass outflow rate decreases with distance from the nucleus, from around 100 $M_\odot$/yr in the inner 1-2 kpc to <0.1 $M_\odot$/yr at 30 kpc. The ionised mass outflow rate is ~1-8 times higher than the molecular one, in contrast with what is often quoted in AGN. The driver of the multi-phase outflow is likely a combination of AGN radiation and the jet. The outflow mass-loading factor (~5-10) and the molecular gas depletion time (<10$^8$ yr) indicate that the outflow can significantly affect the star formation and the gas reservoir in the galaxy. However, the fraction of the ionised outflow that is able to escape the dark matter halo potential is likely negligible. We detect blue-coloured continuum emission co-spatial with the ionised gas loop. Here, stellar populations are younger (<100-150 Myr) than in the rest of the galaxy (~0.5-1 Gyr). This constitutes possible evidence for star formation triggered at the edge of the bubble due to the compressing action of the jet and outflow ('positive feedback'), as predicted by theory. All in all, the Teacup constitutes a rich system in which AGN feedback from outflows and jets, in both its negative and positive flavours, co-exist.
△ Less
Submitted 5 September, 2023;
originally announced September 2023.
-
The Accretion History of AGN: The Spectral Energy Distributions of X-ray Luminous AGN
Authors:
Connor Auge,
David Sanders,
Ezequiel Treister,
C. Megan Urry,
Allison Kirkpatrick,
Nico Cappelluti,
Tonima Tasnim Ananna,
Médéric Boquien,
Mislav Baloković,
Francesca Civano,
Brandon Coleman,
Aritra Ghosh,
Jeyhan Kartaltepe,
Michael Koss,
Stephanie LaMassa,
Stefano Marchesi,
Alessandro Peca,
Meredith Powell,
Benny Trakhtenbrot,
Tracey Jane Turner
Abstract:
Spectral energy distributions (SEDs) from X-ray to far-infrared (FIR) wavelengths are presented for a sample of 1246 X-ray luminous active galactic nuclei (AGN; $L_{0.5-10\rm{keV}}>10^{43}$ erg s$^{-1}$), with $z_{\rm{spec}}<1.2$, selected from Stripe 82X, COSMOS, and GOODS-N/S. The rest-frame SEDs show a wide spread ($\sim2.5$ dex) in the relative strengths of broad continuum features at X-ray, u…
▽ More
Spectral energy distributions (SEDs) from X-ray to far-infrared (FIR) wavelengths are presented for a sample of 1246 X-ray luminous active galactic nuclei (AGN; $L_{0.5-10\rm{keV}}>10^{43}$ erg s$^{-1}$), with $z_{\rm{spec}}<1.2$, selected from Stripe 82X, COSMOS, and GOODS-N/S. The rest-frame SEDs show a wide spread ($\sim2.5$ dex) in the relative strengths of broad continuum features at X-ray, ultraviolet (UV), mid-infrared (MIR), and FIR wavelengths. A linear correlation (log-log slope of 0.7$\pm0.04$) is found between $L_{\rm{MIR}}$ and $L_{\rm{X}}$. There is significant scatter in the relation between the $L_{\rm{UV}}$ and $L_{\rm{X}}$ due to heavy obscuration, however the most luminous and unobscured AGN show a linear correlation (log-log slope of 0.8$\pm0.06$) in the relation above this scatter. The relation between $L_{\rm{FIR}}$ and $L_{\rm{X}}$ is predominantly flat, but with decreasing dispersion at $L_{\rm{X}}>10^{44}$ erg s$^{-1}$. The ratio between the "galaxy subtracted" bolometric luminosity and the intrinsic $L_{\rm{X}}$ increases from a factor of $\sim$$10-70$ from log $L_{\rm{bol}}/{\rm(erg\; s}^{-1})=44.5-46.5$. Characteristic SED shapes have been determined by grouping AGN based on relative strengths of the UV and MIR emission. The average $L_{1μ\rm{m}}$ is constant for the majority of these SED shapes, while AGN with the strongest UV and MIR emission have elevated $L_{1μ\rm{m}}$, consistent with the AGN emission dominating their SEDs at optical and NIR wavelengths. A strong correlation is found between the SED shape and both the $L_{\rm{X}}$ and $L_{\rm{bol}}$, such that $L_{\rm{bol}}/L_{\rm{X}}=20.4\pm1.8$, independent of the SED shape. This is consistent with an evolutionary scenario of increasing $L_{\rm{bol}}$ with decreasing obscuration as the AGN blows away circumnuclear gas.
△ Less
Submitted 18 August, 2023;
originally announced August 2023.
-
Feature Transportation Improves Graph Neural Networks
Authors:
Moshe Eliasof,
Eldad Haber,
Eran Treister
Abstract:
Graph neural networks (GNNs) have shown remarkable success in learning representations for graph-structured data. However, GNNs still face challenges in modeling complex phenomena that involve feature transportation. In this paper, we propose a novel GNN architecture inspired by Advection-Diffusion-Reaction systems, called ADR-GNN. Advection models feature transportation, while diffusion captures…
▽ More
Graph neural networks (GNNs) have shown remarkable success in learning representations for graph-structured data. However, GNNs still face challenges in modeling complex phenomena that involve feature transportation. In this paper, we propose a novel GNN architecture inspired by Advection-Diffusion-Reaction systems, called ADR-GNN. Advection models feature transportation, while diffusion captures the local smoothing of features, and reaction represents the non-linear transformation between feature channels. We provide an analysis of the qualitative behavior of ADR-GNN, that shows the benefit of combining advection, diffusion, and reaction. To demonstrate its efficacy, we evaluate ADR-GNN on real-world node classification and spatio-temporal datasets, and show that it improves or offers competitive performance compared to state-of-the-art networks.
△ Less
Submitted 20 December, 2023; v1 submitted 29 July, 2023;
originally announced July 2023.
-
LFA-tuned matrix-free multigrid method for the elastic Helmholtz equation
Authors:
Rachel Yovel,
Eran Treister
Abstract:
We present an efficient matrix-free geometric multigrid method for the elastic Helmholtz equation, and a suitable discretization. Many discretization methods had been considered in the literature for the Helmholtz equations, as well as many solvers and preconditioners, some of which are adapted for the elastic version of the equation. However, there is very little work considering the reciprocity…
▽ More
We present an efficient matrix-free geometric multigrid method for the elastic Helmholtz equation, and a suitable discretization. Many discretization methods had been considered in the literature for the Helmholtz equations, as well as many solvers and preconditioners, some of which are adapted for the elastic version of the equation. However, there is very little work considering the reciprocity of discretization and a solver. In this work, we aim to bridge this gap. By choosing an appropriate stencil for re-discretization of the equation on the coarse grid, we develop a multigrid method that can be easily implemented as matrix-free, relying on stencils rather than sparse matrices. This is crucial for efficient implementation on modern hardware. Using two-grid local Fourier analysis, we validate the compatibility of our discretization with our solver, and tune a choice of weights for the stencil for which the convergence rate of the multigrid cycle is optimal. It results in a scalable multigrid preconditioner that can tackle large real-world 3D scenarios.
△ Less
Submitted 3 December, 2023; v1 submitted 6 July, 2023;
originally announced July 2023.
-
Multigrid-Augmented Deep Learning Preconditioners for the Helmholtz Equation using Compact Implicit Layers
Authors:
Bar Lerer,
Ido Ben-Yair,
Eran Treister
Abstract:
We present a deep learning-based iterative approach to solve the discrete heterogeneous Helmholtz equation for high wavenumbers. Combining classical iterative multigrid solvers and convolutional neural networks (CNNs) via preconditioning, we obtain a learned neural solver that is faster and scales better than a standard multigrid solver. Our approach offers three main contributions over previous n…
▽ More
We present a deep learning-based iterative approach to solve the discrete heterogeneous Helmholtz equation for high wavenumbers. Combining classical iterative multigrid solvers and convolutional neural networks (CNNs) via preconditioning, we obtain a learned neural solver that is faster and scales better than a standard multigrid solver. Our approach offers three main contributions over previous neural methods of this kind. First, we construct a multilevel U-Net-like encoder-solver CNN with an implicit layer on the coarsest grid of the U-Net, where convolution kernels are inverted. This alleviates the field of view problem in CNNs and allows better scalability. Second, we improve upon the previous CNN preconditioner in terms of the number of parameters, computation time, and convergence rates. Third, we propose a multiscale training approach that enables the network to scale to problems of previously unseen dimensions while still maintaining a reasonable training procedure. Our encoder-solver architecture can be used to generalize over different slowness models of various difficulties and is efficient at solving for many right-hand sides per slowness model. We demonstrate the benefits of our novel architecture with numerical experiments on a variety of heterogeneous two-dimensional problems at high wavenumbers.
△ Less
Submitted 6 March, 2024; v1 submitted 30 June, 2023;
originally announced June 2023.
-
Lack of Correlations between Cold Molecular Gas and AGN Properties in Type 1 AGNs at $z \lesssim 0.5$
Authors:
Juan Molina,
Jinyi Shangguan,
Ran Wang,
Luis C. Ho,
Franz E. Bauer,
Ezequiel Treister
Abstract:
We present new NOrthern Extended Millimeter Array (NOEMA) observations of the CO(2--1) emission in eight of the brightest Palomar-Green quasars at $z \lesssim 0.5$ to investigate the role of active galactic nuclei (AGN) feedback in luminous quasars detected at low redshifts. We detect CO(2--1) emission in three objects, from which we derive CO luminosities, molecular gas masses and fractions, and…
▽ More
We present new NOrthern Extended Millimeter Array (NOEMA) observations of the CO(2--1) emission in eight of the brightest Palomar-Green quasars at $z \lesssim 0.5$ to investigate the role of active galactic nuclei (AGN) feedback in luminous quasars detected at low redshifts. We detect CO(2--1) emission in three objects, from which we derive CO luminosities, molecular gas masses and fractions, and gas depletion times. In combination with data available in the literature, we build a total sample of 138 local type 1 AGNs with CO(2--1) measurements. We compare the AGN properties with the host galaxy molecular gas properties, considering the systems non-detected in CO emission. We find that the CO luminosity does not correlate with AGN luminosity and Eddington ratio, while the molecular gas fraction is weakly correlated with Eddington ratio. The type 1 AGNs can be roughly separated into two populations in terms of infrared-to-CO luminosity ratio, one population presenting values typically found in normal star-forming systems, while the other have lower ratio values, comparable to those measured for starbursts. We find no evidence that AGN feedback rapidly quenches star formation in type 1 AGNs. Our results may imply an underlying the role of host galaxy gravitational instabilities or the fast inflow of cold gas in triggering AGN activity.
△ Less
Submitted 3 April, 2023;
originally announced April 2023.
-
DRIP: Deep Regularizers for Inverse Problems
Authors:
Moshe Eliasof,
Eldad Haber,
Eran Treister
Abstract:
In this paper we consider inverse problems that are mathematically ill-posed. That is, given some (noisy) data, there is more than one solution that approximately fits the data. In recent years, deep neural techniques that find the most appropriate solution, in the sense that it contains a-priori information, were developed. However, they suffer from several shortcomings. First, most techniques ca…
▽ More
In this paper we consider inverse problems that are mathematically ill-posed. That is, given some (noisy) data, there is more than one solution that approximately fits the data. In recent years, deep neural techniques that find the most appropriate solution, in the sense that it contains a-priori information, were developed. However, they suffer from several shortcomings. First, most techniques cannot guarantee that the solution fits the data at inference. Second, while the derivation of the techniques is inspired by the existence of a valid scalar regularization function, such techniques do not in practice rely on such a function, and therefore veer away from classical variational techniques. In this work we introduce a new family of neural regularizers for the solution of inverse problems. These regularizers are based on a variational formulation and are guaranteed to fit the data. We demonstrate their use on a number of highly ill-posed problems, from image deblurring to limited angle tomography.
△ Less
Submitted 25 August, 2023; v1 submitted 30 March, 2023;
originally announced April 2023.
-
Graph Positional Encoding via Random Feature Propagation
Authors:
Moshe Eliasof,
Fabrizio Frasca,
Beatrice Bevilacqua,
Eran Treister,
Gal Chechik,
Haggai Maron
Abstract:
Two main families of node feature augmentation schemes have been explored for enhancing GNNs: random features and spectral positional encoding. Surprisingly, however, there is still no clear understanding of the relation between these two augmentation schemes. Here we propose a novel family of positional encoding schemes which draws a link between the above two approaches and improves over both. T…
▽ More
Two main families of node feature augmentation schemes have been explored for enhancing GNNs: random features and spectral positional encoding. Surprisingly, however, there is still no clear understanding of the relation between these two augmentation schemes. Here we propose a novel family of positional encoding schemes which draws a link between the above two approaches and improves over both. The new approach, named Random Feature Propagation (RFP), is inspired by the power iteration method and its generalizations. It concatenates several intermediate steps of an iterative algorithm for computing the dominant eigenvectors of a propagation matrix, starting from random node features. Notably, these propagation steps are based on graph-dependent propagation operators that can be either predefined or learned. We explore the theoretical and empirical benefits of RFP. First, we provide theoretical justifications for using random features, for incorporating early propagation steps, and for using multiple random initializations. Then, we empirically demonstrate that RFP significantly outperforms both spectral PE and random features in multiple node classification and graph classification benchmarks.
△ Less
Submitted 19 July, 2023; v1 submitted 6 March, 2023;
originally announced March 2023.
-
Efficient Graph Laplacian Estimation by Proximal Newton
Authors:
Yakov Medvedovsky,
Eran Treister,
Tirza Routtenberg
Abstract:
The Laplacian-constrained Gaussian Markov Random Field (LGMRF) is a common multivariate statistical model for learning a weighted sparse dependency graph from given data. This graph learning problem can be formulated as a maximum likelihood estimation (MLE) of the precision matrix, subject to Laplacian structural constraints, with a sparsity-inducing penalty term. This paper aims to solve this lea…
▽ More
The Laplacian-constrained Gaussian Markov Random Field (LGMRF) is a common multivariate statistical model for learning a weighted sparse dependency graph from given data. This graph learning problem can be formulated as a maximum likelihood estimation (MLE) of the precision matrix, subject to Laplacian structural constraints, with a sparsity-inducing penalty term. This paper aims to solve this learning problem accurately and efficiently. First, since the commonly used $\ell_1$-norm penalty is inappropriate in this setting and may lead to a complete graph, we employ the nonconvex minimax concave penalty (MCP), which promotes sparse solutions with lower estimation bias. Second, as opposed to existing first-order methods for this problem, we develop a second-order proximal Newton approach to obtain an efficient solver, utilizing several algorithmic features, such as using Conjugate Gradients, preconditioning, and splitting to active/free sets. Numerical experiments demonstrate the advantages of the proposed method in terms of both computational complexity and graph learning accuracy compared to existing methods.
△ Less
Submitted 12 April, 2024; v1 submitted 13 February, 2023;
originally announced February 2023.
-
Dynamics of Molecular Gas in the Central Region of the Quasar I$\,$Zwicky$\,$1
Authors:
Qinyue Fei,
Ran Wang,
Juan Molina,
Jinyi Shangguan,
Luis C. Ho,
Franz E. Bauer,
Ezequiel Treister
Abstract:
We present a study of the molecular gas distribution and kinematics in the cicumnuclear region (radii $\lesssim 2\,$kpc) of the $z\approx0.061$ quasar I$\,$Zwicky$\,$1 using a collection of available Atacama Large Millimeter/submillimeter Array (ALMA) observations of the carbon monoxide (CO) emission. With an angular resolution of $\sim0.36''$ (corresponding to $\sim\,400\,\rm pc$), the host galax…
▽ More
We present a study of the molecular gas distribution and kinematics in the cicumnuclear region (radii $\lesssim 2\,$kpc) of the $z\approx0.061$ quasar I$\,$Zwicky$\,$1 using a collection of available Atacama Large Millimeter/submillimeter Array (ALMA) observations of the carbon monoxide (CO) emission. With an angular resolution of $\sim0.36''$ (corresponding to $\sim\,400\,\rm pc$), the host galaxy sub-structures including the nuclear molecular gas disk, spiral arms, and a compact bar-like component are resolved. We analyzed the gas kinematics based on the CO image cube and obtained the rotation curve and radial distribution of velocity dispersion. The velocity dispersion is about $30\,\rm km\,s^{-1}$ in the outer CO disk region and rises up to $\gtrsim 100\,\rm km\,s^{-1}$ at radius $\lesssim 1\,$kpc, suggesting that the central region of disk is dynamically hot. We constrain the CO-to-$\rm H_2$ conversion factor, $α_{\rm CO}$, by modeling the cold gas disk dynamics. We find that, with prior knowledge about the stellar and dark matter components, the $α_{\rm CO}$ value in the circumnuclear region of this quasar host galaxy is $1.55_{-0.49}^{+0.47}\,M_\odot\,\left(\rm K\,km\,s^{-1}\,pc^2\right)^{-1}$, which is between the value reported in ultra-luminous infrared galaxies and in the Milky-Way. The central 1$\,$kpc region of this quasar host galaxy has significant star formation activity, which can be identified as a nuclear starburst. We further investigate the high velocity dispersion in the central region. We find that the ISM turbulent pressure derived from the gas velocity dispersion is in equilibrium with the weight of the ISM. This argues against extra power from AGN feedback that significantly affects the kinematics of the cold molecular gas.
△ Less
Submitted 8 February, 2023; v1 submitted 8 February, 2023;
originally announced February 2023.
-
UGC 4211: A Confirmed Dual Active Galactic Nucleus in the Local Universe at 230 pc Nuclear Separation
Authors:
Michael J. Koss,
Ezequiel Treister,
Darshan Kakkad,
J. Andrew Casey-Clyde,
Taiki Kawamuro,
Jonathan Williams,
Adi Foord,
Benny Trakhtenbrot,
Franz E. Bauer,
George C. Privon,
Claudio Ricci,
Richard Mushotzky,
Loreto Barcos-Munoz,
Laura Blecha,
Thomas Connor,
Fiona Harrison,
Tingting Liu,
Macon Magno,
Chiara M. F. Mingarelli,
Francisco Muller-Sanchez,
Kyuseok Oh,
T. Taro Shimizu,
Krista L. Smith,
Daniel Stern,
Miguel Parra Tello
, et al. (1 additional authors not shown)
Abstract:
We present multi-wavelength high-spatial resolution (~0.1'', 70 pc) observations of UGC 4211 at z=0.03474, a late-stage major galaxy merger at the closest nuclear separation yet found in near-IR imaging (0.32'', ~230 pc projected separation). Using Hubble Space Telescope/STIS, VLT/MUSE+AO, Keck/OSIRIS+AO spectroscopy, and ALMA observations, we show that the spatial distribution, optical and NIR em…
▽ More
We present multi-wavelength high-spatial resolution (~0.1'', 70 pc) observations of UGC 4211 at z=0.03474, a late-stage major galaxy merger at the closest nuclear separation yet found in near-IR imaging (0.32'', ~230 pc projected separation). Using Hubble Space Telescope/STIS, VLT/MUSE+AO, Keck/OSIRIS+AO spectroscopy, and ALMA observations, we show that the spatial distribution, optical and NIR emission lines, and millimeter continuum emission are all consistent with both nuclei being powered by accreting supermassive black holes (SMBHs). Our data, combined with common black hole mass prescriptions, suggests that both SMBHs have similar masses, log MBH~8.1 (south) and log MBH~8.3 (north), respectively. The projected separation of 230 pc (~6X the black hole sphere of influence) represents the closest-separation dual AGN studied to date with multi-wavelength resolved spectroscopy and shows the potential of nuclear (<50 pc) continuum observations with ALMA to discover hidden growing SMBH pairs. While the exact occurrence rate of close-separation dual AGN is not yet known, it may be surprisingly high, given that UGC 4211 was found within a small, volume-limited sample of nearby hard X-ray detected AGN. Observations of dual SMBH binaries in the sub-kpc regime at the final stages of dynamical friction provide important constraints for future gravitational wave observatories.
△ Less
Submitted 9 January, 2023;
originally announced January 2023.
-
NeRN -- Learning Neural Representations for Neural Networks
Authors:
Maor Ashkenazi,
Zohar Rimon,
Ron Vainshtein,
Shir Levi,
Elad Richardson,
Pinchas Mintz,
Eran Treister
Abstract:
Neural Representations have recently been shown to effectively reconstruct a wide range of signals from 3D meshes and shapes to images and videos. We show that, when adapted correctly, neural representations can be used to directly represent the weights of a pre-trained convolutional neural network, resulting in a Neural Representation for Neural Networks (NeRN). Inspired by coordinate inputs of p…
▽ More
Neural Representations have recently been shown to effectively reconstruct a wide range of signals from 3D meshes and shapes to images and videos. We show that, when adapted correctly, neural representations can be used to directly represent the weights of a pre-trained convolutional neural network, resulting in a Neural Representation for Neural Networks (NeRN). Inspired by coordinate inputs of previous neural representation methods, we assign a coordinate to each convolutional kernel in our network based on its position in the architecture, and optimize a predictor network to map coordinates to their corresponding weights. Similarly to the spatial smoothness of visual scenes, we show that incorporating a smoothness constraint over the original network's weights aids NeRN towards a better reconstruction. In addition, since slight perturbations in pre-trained model weights can result in a considerable accuracy loss, we employ techniques from the field of knowledge distillation to stabilize the learning process. We demonstrate the effectiveness of NeRN in reconstructing widely used architectures on CIFAR-10, CIFAR-100, and ImageNet. Finally, we present two applications using NeRN, demonstrating the capabilities of the learned representations.
△ Less
Submitted 21 April, 2023; v1 submitted 27 December, 2022;
originally announced December 2022.
-
Enhanced Star Formation Efficiency in the Central Regions of Nearby Quasars Hosts
Authors:
Juan Molina,
Luis C. Ho,
Ran Wang,
Jinyi Shangguan,
Franz E. Bauer,
Ezequiel Treister
Abstract:
We combine Atacama Large Millimeter/submillimeter Array and Multi Unit Spectroscopic Explorer observations tracing the molecular gas, millimeter continuum, and ionized gas emission in six low-redshift ($z \lesssim 0.06$) Palomar-Green quasar host galaxies to investigate their ongoing star formation at $\sim$kpc-scale resolution. The AGN contribution to the cold dust emission and the optical emissi…
▽ More
We combine Atacama Large Millimeter/submillimeter Array and Multi Unit Spectroscopic Explorer observations tracing the molecular gas, millimeter continuum, and ionized gas emission in six low-redshift ($z \lesssim 0.06$) Palomar-Green quasar host galaxies to investigate their ongoing star formation at $\sim$kpc-scale resolution. The AGN contribution to the cold dust emission and the optical emission-line flux is carefully removed to derive spatial distributions of the star formation rate (SFR), which, complemented with the molecular gas data, enables the mapping of the depletion time ($t_{\rm dep}$). We report ubiquitous star formation activity within the quasar host galaxies, with the majority of the ongoing star formation occurring in the galaxy center. The rise of the star formation rate surface density ($Σ_{\rm SFR}$) toward the nucleus is steeper than that observed for the cold molecular gas surface density, reaching values up to $Σ_{\rm SFR} \approx 0.15-0.80\,M_\odot\,$yr$^{-1}\,$kpc$^{-2}$. The gas in the nuclear regions is converted into stars at a shortened depletion time ($t_{\rm dep} \approx 0.2-2.0\,$Gyr), suggesting that those zones can be deemed as starbursts. At large galactocentric radius, we find that the ongoing star formation takes place within spiral arms or H$\,$II region complexes, with an efficiency comparable to that reported for nearby inactive spirals ($t_{\rm dep} \approx 1.8\,$Gyr). We find no evidence of star formation activity shutoff in the PG quasar host galaxies. On the contrary, these observations shed light on how the central environments of galaxies hosting actively accreting supermassive black holes builds up stellar mass.
△ Less
Submitted 10 December, 2022;
originally announced December 2022.
-
Morphological Parameters and Associated Uncertainties for 8 Million Galaxies in the Hyper Suprime-Cam Wide Survey
Authors:
Aritra Ghosh,
C. Megan Urry,
Aayush Mishra,
Laurence Perreault-Levasseur,
Priyamvada Natarajan,
David B. Sanders,
Daisuke Nagai,
Chuan Tian,
Nico Cappelluti,
Jeyhan S. Kartaltepe,
Meredith C. Powell,
Amrit Rau,
Ezequiel Treister
Abstract:
We use the Galaxy Morphology Posterior Estimation Network (GaMPEN) to estimate morphological parameters and associated uncertainties for $\sim 8$ million galaxies in the Hyper Suprime-Cam (HSC) Wide survey with $z \leq 0.75$ and $m \leq 23$. GaMPEN is a machine learning framework that estimates Bayesian posteriors for a galaxy's bulge-to-total light ratio ($L_B/L_T$), effective radius ($R_e$), and…
▽ More
We use the Galaxy Morphology Posterior Estimation Network (GaMPEN) to estimate morphological parameters and associated uncertainties for $\sim 8$ million galaxies in the Hyper Suprime-Cam (HSC) Wide survey with $z \leq 0.75$ and $m \leq 23$. GaMPEN is a machine learning framework that estimates Bayesian posteriors for a galaxy's bulge-to-total light ratio ($L_B/L_T$), effective radius ($R_e$), and flux ($F$). By first training on simulations of galaxies and then applying transfer learning using real data, we trained GaMPEN with $<1\%$ of our dataset. This two-step process will be critical for applying machine learning algorithms to future large imaging surveys, such as the Rubin-Legacy Survey of Space and Time (LSST), the Nancy Grace Roman Space Telescope (NGRST), and Euclid. By comparing our results to those obtained using light-profile fitting, we demonstrate that GaMPEN's predicted posterior distributions are well-calibrated ($\lesssim 5\%$ deviation) and accurate. This represents a significant improvement over light profile fitting algorithms which underestimate uncertainties by as much as $\sim60\%$. For an overlapping sub-sample, we also compare the derived morphological parameters with values in two external catalogs and find that the results agree within the limits of uncertainties predicted by GaMPEN. This step also permits us to define an empirical relationship between the Sérsic index and $L_B/L_T$ that can be used to convert between these two parameters. The catalog presented here represents a significant improvement in size ($\sim10 \times $), depth ($\sim4$ magnitudes), and uncertainty quantification over previous state-of-the-art bulge+disk decomposition catalogs. With this work, we also release GaMPEN's source code and trained models, which can be adapted to other datasets.
△ Less
Submitted 1 March, 2024; v1 submitted 30 November, 2022;
originally announced December 2022.
-
Every Node Counts: Improving the Training of Graph Neural Networks on Node Classification
Authors:
Moshe Eliasof,
Eldad Haber,
Eran Treister
Abstract:
Graph Neural Networks (GNNs) are prominent in handling sparse and unstructured data efficiently and effectively. Specifically, GNNs were shown to be highly effective for node classification tasks, where labelled information is available for only a fraction of the nodes. Typically, the optimization process, through the objective function, considers only labelled nodes while ignoring the rest. In th…
▽ More
Graph Neural Networks (GNNs) are prominent in handling sparse and unstructured data efficiently and effectively. Specifically, GNNs were shown to be highly effective for node classification tasks, where labelled information is available for only a fraction of the nodes. Typically, the optimization process, through the objective function, considers only labelled nodes while ignoring the rest. In this paper, we propose novel objective terms for the training of GNNs for node classification, aiming to exploit all the available data and improve accuracy. Our first term seeks to maximize the mutual information between node and label features, considering both labelled and unlabelled nodes in the optimization process. Our second term promotes anisotropic smoothness in the prediction maps. Lastly, we propose a cross-validating gradients approach to enhance the learning from labelled data. Our proposed objectives are general and can be applied to various GNNs and require no architectural modifications. Extensive experiments demonstrate our approach using popular GNNs like GCN, GAT and GCNII, reading a consistent and significant accuracy improvement on 10 real-world node classification datasets.
△ Less
Submitted 29 November, 2022;
originally announced November 2022.
-
Improving Graph Neural Networks with Learnable Propagation Operators
Authors:
Moshe Eliasof,
Lars Ruthotto,
Eran Treister
Abstract:
Graph Neural Networks (GNNs) are limited in their propagation operators. In many cases, these operators often contain non-negative elements only and are shared across channels, limiting the expressiveness of GNNs. Moreover, some GNNs suffer from over-smoothing, limiting their depth. On the other hand, Convolutional Neural Networks (CNNs) can learn diverse propagation filters, and phenomena like ov…
▽ More
Graph Neural Networks (GNNs) are limited in their propagation operators. In many cases, these operators often contain non-negative elements only and are shared across channels, limiting the expressiveness of GNNs. Moreover, some GNNs suffer from over-smoothing, limiting their depth. On the other hand, Convolutional Neural Networks (CNNs) can learn diverse propagation filters, and phenomena like over-smoothing are typically not apparent in CNNs. In this paper, we bridge these gaps by incorporating trainable channel-wise weighting factors $ω$ to learn and mix multiple smoothing and sharpening propagation operators at each layer. Our generic method is called $ω$GNN, and is easy to implement. We study two variants: $ω$GCN and $ω$GAT. For $ω$GCN, we theoretically analyse its behaviour and the impact of $ω$ on the obtained node features. Our experiments confirm these findings, demonstrating and explaining how both variants do not over-smooth. Additionally, we experiment with 15 real-world datasets on node- and graph-classification tasks, where our $ω$GCN and $ω$GAT perform on par with state-of-the-art methods.
△ Less
Submitted 5 May, 2023; v1 submitted 31 October, 2022;
originally announced October 2022.
-
Unsupervised Image Semantic Segmentation through Superpixels and Graph Neural Networks
Authors:
Moshe Eliasof,
Nir Ben Zikri,
Eran Treister
Abstract:
Unsupervised image segmentation is an important task in many real-world scenarios where labelled data is of scarce availability. In this paper we propose a novel approach that harnesses recent advances in unsupervised learning using a combination of Mutual Information Maximization (MIM), Neural Superpixel Segmentation and Graph Neural Networks (GNNs) in an end-to-end manner, an approach that has n…
▽ More
Unsupervised image segmentation is an important task in many real-world scenarios where labelled data is of scarce availability. In this paper we propose a novel approach that harnesses recent advances in unsupervised learning using a combination of Mutual Information Maximization (MIM), Neural Superpixel Segmentation and Graph Neural Networks (GNNs) in an end-to-end manner, an approach that has not been explored yet. We take advantage of the compact representation of superpixels and combine it with GNNs in order to learn strong and semantically meaningful representations of images. Specifically, we show that our GNN based approach allows to model interactions between distant pixels in the image and serves as a strong prior to existing CNNs for an improved accuracy. Our experiments reveal both the qualitative and quantitative advantages of our approach compared to current state-of-the-art methods over four popular datasets.
△ Less
Submitted 21 October, 2022;
originally announced October 2022.
-
Probing the Structure and Evolution of BASS AGN through Eddington Ratios
Authors:
Tonima Tasnim Ananna,
C. Megan Urry,
Claudio Ricci,
Priyamvada Natarajan,
Ryan C. Hickox,
Benny Trakhtenbrot,
Ezequiel Treister,
Anna K. Weigel,
Yoshihiro Ueda,
Michael J. Koss,
F. E. Bauer,
Matthew J. Temple,
Mislav Balokovic,
Richard Mushotzky,
Connor Auge,
David B. Sanders,
Darshan Kakkad,
Lia F. Sartori,
Stefano Marchesi,
Fiona Harrison,
Daniel Stern,
Kyuseok Oh,
Turgay Caglar,
Meredith C. Powell,
Stephanie A. Podjed
, et al. (1 additional authors not shown)
Abstract:
We constrain the intrinsic Eddington ratio (\lamEdd ) distribution function for local AGN in bins of low and high obscuration (log NH <= 22 and 22 < log NH < 25), using the Swift-BAT 70-month/BASS DR2 survey. We interpret the fraction of obscured AGN in terms of circum-nuclear geometry and temporal evolution. Specifically, at low Eddington ratios (log lamEdd < -2), obscured AGN outnumber unobscure…
▽ More
We constrain the intrinsic Eddington ratio (\lamEdd ) distribution function for local AGN in bins of low and high obscuration (log NH <= 22 and 22 < log NH < 25), using the Swift-BAT 70-month/BASS DR2 survey. We interpret the fraction of obscured AGN in terms of circum-nuclear geometry and temporal evolution. Specifically, at low Eddington ratios (log lamEdd < -2), obscured AGN outnumber unobscured ones by a factor of ~4, reflecting the covering factor of the circum-nuclear material (0.8, or a torus opening angle of ~ 34 degrees). At high Eddington ratios (\log lamEdd > -1), the trend is reversed, with < 30% of AGN having log NH > 22, which we suggest is mainly due to the small fraction of time spent in a highly obscured state. Considering the Eddington ratio distribution function of narrow-line and broad-line AGN from our prior work, we see a qualitatively similar picture. To disentangle temporal and geometric effects at high lamEdd, we explore plausible clearing scenarios such that the time-weighted covering factors agree with the observed population ratio. We find that the low fraction of obscured AGN at high lamEdd is primarily due to the fact that the covering factor drops very rapidly, with more than half the time is spent with < 10% covering factor. We also find that nearly all obscured AGN at high-lamEdd exhibit some broad-lines. We suggest that this is because the height of the depleted torus falls below the height of the broad-line region, making the latter visible from all lines of sight.
△ Less
Submitted 15 October, 2022;
originally announced October 2022.
-
On the cosmic evolution of AGN obscuration and the X-ray luminosity function: XMM-Newton and Chandra spectral analysis of the 31.3 deg$^2$ Stripe 82X
Authors:
Alessandro Peca,
Nico Cappelluti,
Meg Urry,
Stephanie LaMassa,
Stefano Marchesi,
Tonima Ananna,
Mislav Baloković,
David Sanders,
Connor Auge,
Ezequiel Treister,
Meredith Powell,
Tracey Jane Turner,
Allison Kirkpatrick,
Chuan Tian
Abstract:
We present X-ray spectral analysis of XMM and Chandra observations in the 31.3 deg$^2$ Stripe-82X (S82X) field. Of the 6181 X-ray sources in this field, we analyze a sample of 2937 active galactic nuclei (AGN) with solid redshifts and sufficient counts determined by simulations. Our results show a population with median values of spectral index $Γ=1.94_{-0.39}^{+0.31}$, column density log…
▽ More
We present X-ray spectral analysis of XMM and Chandra observations in the 31.3 deg$^2$ Stripe-82X (S82X) field. Of the 6181 X-ray sources in this field, we analyze a sample of 2937 active galactic nuclei (AGN) with solid redshifts and sufficient counts determined by simulations. Our results show a population with median values of spectral index $Γ=1.94_{-0.39}^{+0.31}$, column density log$\,N_{\mathrm{H}}/\mathrm{cm}^{-2}=20.7_{-0.5}^{+1.2}$ and intrinsic, de-absorbed, 2-10 keV luminosity log$\,L_{\mathrm{X}}/\mathrm{erg\,s}^{-1}=44.0_{-1.0}^{+0.7}$, in the redshift range 0-4. We derive the intrinsic fraction of AGN that are obscured ($22\leq\mathrm{log}\,N_{\mathrm{H}}/\mathrm{cm}^{-2}<24$), finding a significant increase in the obscured AGN fraction with redshift and a decline with increasing luminosity. The average obscured AGN fraction is $57\pm4\%$ for log$\,L_{\mathrm{X}}/\mathrm{erg\,s}^{-1}>43$. This work constrains the AGN obscuration and spectral shape of the still uncertain high-luminosity and high-redshift regimes (log$\,L_{\mathrm{X}}/\mathrm{erg\,s}^{-1}>45.5$, $z>3$), where the obscured AGN fraction rises to $64\pm12\%$. We report a luminosity and density evolution of the X-ray luminosity function, with obscured AGN dominating at all luminosities at $z>2$ and unobscured sources prevailing at log$\,L_{\mathrm{X}}/\mathrm{erg\,s}^{-1}>45$ at lower redshifts. Our results agree with evolutionary models in which the bulk of AGN activity is triggered by gas-rich environments and in a downsizing scenario. Also, the black hole accretion density (BHAD) is found to evolve similarly to the star formation rate density, confirming the co-evolution between AGN and host-galaxy, but suggesting different time scales in their growing history. The derived BHAD evolution shows that Compton-thick AGN contribute to the accretion history of AGN as much as all other AGN populations combined.
△ Less
Submitted 21 November, 2022; v1 submitted 14 October, 2022;
originally announced October 2022.
-
The hidden side of cosmic star formation at z > 3: Bridging optically-dark and Lyman break galaxies with GOODS-ALMA
Authors:
Mengyuan Xiao,
David Elbaz,
Carlos Gómez-Guijarro,
Lucas Leroy,
Longji Bing,
Emanuele Daddi,
Benjamin Magnelli,
Maximilien Franco,
Luwenjia Zhou,
Mark Dickinson,
Tao Wang,
Wiphu Rujopakarn,
Georgios E. Magdis,
Ezequiel Treister,
Hanae Inami,
Ricardo Demarco,
Mark T. Sargent,
Xinwen Shu,
Jeyhan S. Kartaltepe,
David M. Alexander,
Matthieu Béthermin,
Frederic Bournaud,
Laure Ciesla,
Henry C. Ferguson,
Steven L. Finkelstein
, et al. (15 additional authors not shown)
Abstract:
Our current understanding of the cosmic star formation history at z>3 is primarily based on UV-selected galaxies (i.e., LBGs). Recent studies of H-dropouts have revealed that we may be missing a large proportion of star formation that is taking place in massive galaxies at z>3. In this work, we extend the H-dropout criterion to lower masses to select optically dark/faint galaxies (OFGs), in order…
▽ More
Our current understanding of the cosmic star formation history at z>3 is primarily based on UV-selected galaxies (i.e., LBGs). Recent studies of H-dropouts have revealed that we may be missing a large proportion of star formation that is taking place in massive galaxies at z>3. In this work, we extend the H-dropout criterion to lower masses to select optically dark/faint galaxies (OFGs), in order to complete the census between LBGs and H-dropouts. Our criterion (H> 26.5 mag & [4.5] < 25 mag) combined with a de-blending technique is designed to select not only extremely dust-obscured massive galaxies but also normal star-forming galaxies. In total, we identified 27 OFGs at z_phot > 3 (z_med=4.1) in the GOODS-ALMA field, covering a wide distribution of stellar masses with log($M_{\star}$/$M_{\odot}$) = 9.4-11.1. We find that up to 75% of the OFGs with log($M_{\star}$/$M_{\odot}$) = 9.5-10.5 were neglected by previous LBGs and H-dropout selection techniques. After performing stacking analyses, the OFGs exhibit shorter gas depletion timescales, slightly lower gas fractions, and lower dust temperatures than typical star-forming galaxies. Their SFR_tot (SFR_ IR+SFR_UV) is much larger than SFR_UVcorr (corrected for dust extinction), with SFR_tot/SFR_UVcorr = $8\pm1$, suggesting the presence of hidden dust regions in the OFGs that absorb all UV photons. The average dust size measured by a circular Gaussian model fit is R_e(1.13 mm)=1.01$\pm$0.05 kpc. We find that the cosmic SFRD at z>3 contributed by massive OFGs is at least two orders of magnitude higher than the one contributed by equivalently massive LBGs. Finally, we calculate the combined contribution of OFGs and LBGs to the cosmic SFRD at z=4-5 to be 4 $\times$ 10$^{-2}$ $M_{\odot}$ yr$^{-1}$Mpc$^{-3}$, which is about 0.15 dex (43%) higher than the SFRD derived from UV-selected samples alone at the same redshift.
△ Less
Submitted 10 February, 2023; v1 submitted 6 October, 2022;
originally announced October 2022.
-
Investigating the Effect of Galaxy Interactions on Star Formation at 0.5<z<3.0
Authors:
Ekta A. Shah,
Jeyhan S. Kartaltepe,
Christina T. Magagnoli,
Isabella G. Cox,
Caleb T. Wetherell,
Brittany N. Vanderhoof,
Kevin C. Cooke,
Antonello Calabro,
Nima Chartab,
Christopher J. Conselice,
Darren J. Croton,
Alexander de la Vega,
Nimish P. Hathi,
Olivier Ilbert,
Hanae Inami,
Dale D. Kocevski,
Anton M. Koekemoer,
Brian C. Lemaux,
Lori Lubin,
Kameswara Bharadwaj Mantha,
Stefano Marchesi,
Marie Martig,
Jorge Moreno,
Belen Alcalde Pampliega,
David R. Patton
, et al. (2 additional authors not shown)
Abstract:
Observations and simulations of interacting galaxies and mergers in the local universe have shown that interactions can significantly enhance the star formation rates (SFR) and fueling of Active Galactic Nuclei (AGN). However, at higher redshift, some simulations suggest that the level of star formation enhancement induced by interactions is lower due to the higher gas fractions and already increa…
▽ More
Observations and simulations of interacting galaxies and mergers in the local universe have shown that interactions can significantly enhance the star formation rates (SFR) and fueling of Active Galactic Nuclei (AGN). However, at higher redshift, some simulations suggest that the level of star formation enhancement induced by interactions is lower due to the higher gas fractions and already increased SFRs in these galaxies. To test this, we measure the SFR enhancement in a total of 2351 (1327) massive ($M_*>10^{10}M_\odot$) major ($1<M_1/M_2<4$) spectroscopic galaxy pairs at 0.5<z<3.0 with $ΔV <5000$ km s$^{-1}$ (1000 km s$^{-1}$) and projected separation <150 kpc selected from the extensive spectroscopic coverage in the COSMOS and CANDELS fields. We find that the highest level of SFR enhancement is a factor of 1.23$^{+0.08}_{-0.09}$ in the closest projected separation bin (<25 kpc) relative to a stellar mass-, redshift-, and environment-matched control sample of isolated galaxies. We find that the level of SFR enhancement is a factor of $\sim1.5$ higher at 0.5<z<1 than at 1<z<3 in the closest projected separation bin. Among a sample of visually identified mergers, we find an enhancement of a factor of 1.86$^{+0.29}_{-0.18}$ for coalesced systems. For this visually identified sample, we see a clear trend of increased SFR enhancement with decreasing projected separation (2.40$^{+0.62}_{-0.37}$ vs.\ 1.58$^{+0.29}_{-0.20}$ for 0.5<z<1.6 and 1.6<z<3.0, respectively). The SFR enhancement seen in our interactions and mergers are all lower than the level seen in local samples at the same separation, suggesting that the level of interaction-induced star formation evolves significantly over this time period.
△ Less
Submitted 30 September, 2022;
originally announced September 2022.
-
BASS XXXVII: The role of radiative feedback in the growth and obscuration properties of nearby supermassive black holes
Authors:
C. Ricci,
T. T. Ananna,
M. J. Temple,
C. M. Urry,
M. J. Koss,
B. Trakhtenbrot,
Y. Ueda,
D. Stern,
F. E. Bauer,
E. Treister,
G. C. Privon,
K. Oh,
S. Paltani,
M. Stalevski,
L. C. Ho,
A. C. Fabian,
R. Mushotzky,
C. S. Chang,
F. Ricci,
D. Kakkad,
L. Sartori,
R. Baer,
T. Caglar,
M. Powell,
F. Harrison
Abstract:
We study the relation between obscuration and supermassive black hole (SMBH) growth using a large sample of hard X-ray selected Active Galactic Nuclei (AGN). We find a strong decrease in the fraction of obscured sources above the Eddington limit for dusty gas ($\log λ_{\rm Edd}\gtrsim -2$) confirming earlier results, and consistent with the radiation-regulated unification model. This also explains…
▽ More
We study the relation between obscuration and supermassive black hole (SMBH) growth using a large sample of hard X-ray selected Active Galactic Nuclei (AGN). We find a strong decrease in the fraction of obscured sources above the Eddington limit for dusty gas ($\log λ_{\rm Edd}\gtrsim -2$) confirming earlier results, and consistent with the radiation-regulated unification model. This also explains the difference in the Eddington ratio distribution functions (ERDFs) of type 1 and type 2 AGN obtained by a recent study. The break in the ERDF of nearby AGN is at $\log λ_{\rm Edd}^{*}=-1.34\pm0.07$. This corresponds to the $λ_{\rm Edd}$ where AGN transition from having most of their sky covered by obscuring material to being mostly devoid of absorbing material. A similar trend is observed for the luminosity function, which implies that most of the SMBH growth in the local Universe happens when the AGN is covered by a large reservoir of gas and dust. These results could be explained with a radiation-regulated growth model, in which AGN move in the $N_{\rm H}-λ_{\rm Edd}$ plane during their life cycle. The growth episode starts with the AGN mostly unobscured and accreting at low $λ_{\rm Edd}$. As the SMBH is further fueled, $λ_{\rm Edd}$, $N_{\rm H}$ and covering factor increase, leading AGN to be preferentially observed as obscured. Once $λ_{\rm Edd}$ reaches the Eddington limit for dusty gas, the covering factor and $N_{\rm H}$ rapidly decrease, leading the AGN to be typically observed as unobscured. As the remaining fuel is depleted, the SMBH goes back into a quiescent phase.
△ Less
Submitted 31 August, 2022;
originally announced September 2022.
-
Signatures of Feedback in the Spectacular Extended Emission Region of NGC 5972
Authors:
Thomas Harvey,
W. Peter Maksym,
William Keel,
Michael Koss,
Vardha N. Bennert,
S. D. Chojnowski,
Ezequiel Treister,
Carolina Finlez,
Chris J. Lintott,
Alexei Moiseev,
Brooke D. Simmons,
Lia F. Sartori,
Megan Urry
Abstract:
We present Chandra X-ray Observatory observations and Space Telescope Imaging Spectrograph spectra of NGC 5972, one of the 19 "Voorwerpjes" galaxies. This galaxy contains an Extended Emission Line Region (EELR) and an arc-second scale nuclear bubble. NGC 5972 is a faded AGN, with EELR luminosity suggesting a 2.1 dex decrease in L$_{\textrm{bol}}$ in the last $\sim5\times10^{4}$ yr. We investigate…
▽ More
We present Chandra X-ray Observatory observations and Space Telescope Imaging Spectrograph spectra of NGC 5972, one of the 19 "Voorwerpjes" galaxies. This galaxy contains an Extended Emission Line Region (EELR) and an arc-second scale nuclear bubble. NGC 5972 is a faded AGN, with EELR luminosity suggesting a 2.1 dex decrease in L$_{\textrm{bol}}$ in the last $\sim5\times10^{4}$ yr. We investigate the role of AGN feedback in exciting the EELR and bubble given the long-term variability and potential accretion state changes. We detect broadband (0.3-8 keV) nuclear X-ray emission coincident with the [OIII] bubble, as well as diffuse soft X-ray emission coincident with the EELR. The soft nuclear (0.5-1.5 keV) emission is spatially extended and the spectra are consistent with two APEC thermal populations ($\sim$0.80,$\sim$0.10 keV). We find a bubble age >2.2 Myr, suggesting formation before the current variability. We find evidence for efficient feedback with L$_{\textrm{kin}}/L_{\textrm{bol}}\sim0.8\%$, which may be overestimated given the recent L$_{\textrm{bol}}$ variation. Kinematics suggest an out-flowing 300 km s$^{-1}$ high-ionization [OIII]-emitting gas which may be the line of sight component of a $\sim$780 km s$^{-1}$ thermal X-ray outflow capable of driving strong shocks that could photoionize the precursor material. We explore possibilities to explain the overall jet, radio lobe and EELR misalignment including evidence for a double SMBH which could support a complex misaligned system.
△ Less
Submitted 11 August, 2022;
originally announced August 2022.
-
Detailed accretion history of the supermassive black hole in NGC 5972 over the past $\gtrsim$10$^4$ years through the extended emission line region
Authors:
C. Finlez,
E. Treister,
F. Bauer,
W. Keel,
M. Koss,
N. Nagar,
L. Sartori,
W. P. Maksym,
G. Venturi,
D. Tubin,
T. Harvey
Abstract:
We present integral field spectroscopic observations of NGC 5972 obtained with the Multi Unit Spectroscopic Explorer (MUSE) at VLT. NGC 5972 is a nearby galaxy containing both an active galactic nucleus (AGN), and an extended emission line region (EELR) reaching out to $\sim 17$ kpc from the nucleus. We analyze the physical conditions of the EELR using spatially-resolved spectra, focusing on the r…
▽ More
We present integral field spectroscopic observations of NGC 5972 obtained with the Multi Unit Spectroscopic Explorer (MUSE) at VLT. NGC 5972 is a nearby galaxy containing both an active galactic nucleus (AGN), and an extended emission line region (EELR) reaching out to $\sim 17$ kpc from the nucleus. We analyze the physical conditions of the EELR using spatially-resolved spectra, focusing on the radial dependence of ionization state together with the light travel time distance to probe the variability of the AGN on $\gtrsim 10^{4}$ yr timescales. The kinematic analysis suggests multiple components: (a) a faint component following the rotation of the large scale disk; (b) a component associated with the EELR suggestive of extraplanar gas connected to tidal tails; (c) a kinematically decoupled nuclear disk. Both the kinematics and the observed tidal tails suggest a major past interaction event. Emission line diagnostics along the EELR arms typically evidence Seyfert-like emission, implying that the EELR was primarily ionized by the AGN. We generate a set of photoionization models and fit these to different regions along the EELR. This allows us to estimate the bolometric luminosity required at different radii to excite the gas to the observed state. Our results suggests that NGC 5972 is a fading quasar, showing a steady gradual decrease in intrinsic AGN luminosity, and hence the accretion rate onto the SMBH, by a factor $\sim 100$ over the past $5 \times 10^{4}$ yr.
△ Less
Submitted 9 August, 2022;
originally announced August 2022.
-
BASS XXVI: DR2 Host Galaxy Stellar Velocity Dispersions
Authors:
Michael J. Koss,
Benny Trakhtenbrot,
Claudio Ricci,
Kyuseok Oh,
Franz E. Bauer,
Daniel Stern,
Turgay Caglar,
Jakob S. den Brok,
Richard Mushotzky,
Federica Ricci,
Julian E. Mejia-Restrepo,
Isabella Lamperti,
Ezequiel Treister,
Rudolf E. Bar,
Fiona Harrison,
Meredith C. Powell,
George C. Privon,
Rogerio Riffel,
Alejandra F. Rojas,
Kevin Schawinski,
C. Megan Urry
Abstract:
We present new central stellar velocity dispersions for 484 Sy 1.9 and Sy 2 from the second data release of the Swift/BAT AGN Spectroscopic Survey (BASS DR2). This constitutes the largest study of velocity dispersion measurements in X-ray selected, obscured AGN with 956 independent measurements of the Ca H+K and Mg b region (3880-5550A) and the Ca triplet region (8350-8730A) from 642 spectra mainl…
▽ More
We present new central stellar velocity dispersions for 484 Sy 1.9 and Sy 2 from the second data release of the Swift/BAT AGN Spectroscopic Survey (BASS DR2). This constitutes the largest study of velocity dispersion measurements in X-ray selected, obscured AGN with 956 independent measurements of the Ca H+K and Mg b region (3880-5550A) and the Ca triplet region (8350-8730A) from 642 spectra mainly from VLT/Xshooter or Palomar/DoubleSpec. Our sample spans velocity dispersions of 40-360 km/s, corresponding to 4-5 orders of magnitude in black holes mass (MBH=10^5.5-9.6 Msun), bolometric luminosity (LBol~10^{42-46 ergs/s), and Eddington ratio (L/Ledd~10^{-5}-2). For 281 AGN, our data provide the first published central velocity dispersions, including 6 AGN with low mass black holes (MBH=10^5.5-6.5 Msun), discovered thanks to our high spectral resolution observations (sigma~25 km/s). The survey represents a significant advance with a nearly complete census of hard-X-ray selected obscured AGN with measurements for 99% of nearby AGN (z<0.1) outside the Galactic plane. The BASS AGN have higher velocity dispersions than the more numerous optically selected narrow line AGN (i.e., ~150 vs. ~100 km/s), but are not biased towards the highest velocity dispersions of massive ellipticals (i.e., >250 km/s). Despite sufficient spectral resolution to resolve the velocity dispersions associated with the bulges of small black holes (~10^4-5 Msun), we do not find a significant population of super-Eddington AGN. Using estimates of the black hole sphere of influence, direct stellar and gas black hole mass measurements could be obtained with existing facilities for more than ~100 BASS AGN.
△ Less
Submitted 25 July, 2022;
originally announced July 2022.
-
BASS XXII: The BASS DR2 AGN Catalog and Data
Authors:
Michael J. Koss,
Claudio Ricci,
Benny Trakhtenbrot,
Kyuseok Oh,
Jakob S. den Brok,
Julian E. Mejia-Restrepo,
Daniel Stern,
George C. Privon,
Ezequiel Treister,
Meredith C. Powell,
Richard Mushotzky,
Franz E. Bauer,
Tonima T. Ananna,
Mislav Balokovic,
Rudolf E. Bar,
George Becker,
Patricia Bessiere,
Leonard Burtscher,
Turgay Caglar,
Enrico Congiu,
Phil Evans,
Fiona Harrison,
Marianne Heida,
Kohei Ichikawa,
Nikita Kamraj
, et al. (10 additional authors not shown)
Abstract:
We present the AGN catalog and optical spectroscopy for the second data release of the Swift BAT AGN Spectroscopic Survey (BASS DR2). With this DR2 release we provide 1425 optical spectra, of which 1181 are released for the first time, for the 858 hard X-ray selected AGN in the Swift BAT 70-month sample. The majority of the spectra (813/1425, 57%) are newly obtained from VLT/Xshooter or Palomar/Do…
▽ More
We present the AGN catalog and optical spectroscopy for the second data release of the Swift BAT AGN Spectroscopic Survey (BASS DR2). With this DR2 release we provide 1425 optical spectra, of which 1181 are released for the first time, for the 858 hard X-ray selected AGN in the Swift BAT 70-month sample. The majority of the spectra (813/1425, 57%) are newly obtained from VLT/Xshooter or Palomar/Doublespec. Many of the spectra have both higher resolution (R>2500, N~450) and/or very wide wavelength coverage (3200-10000 A, N~600) that are important for a variety of AGN and host galaxy studies. We include newly revised AGN counterparts for the full sample and review important issues for population studies, with 44 AGN redshifts determined for the first time and 780 black hole mass and accretion rate estimates. This release is spectroscopically complete for all AGN (100%, 858/858) with 99.8% having redshift measurements (857/858) and 96% completion in black hole mass estimates of unbeamed AGN (outside the Galactic plane). This AGN sample represents a unique census of the brightest hard X-ray selected AGN in the sky, spanning many orders of magnitude in Eddington ratio (Ledd=10^-5-100), black hole mass (MBH=10^5-10^10 Msun), and AGN bolometric luminosity (Lbol=10^40-10^47 ergs/s).
△ Less
Submitted 25 July, 2022;
originally announced July 2022.
-
BAT AGN Spectroscopic Survey XXI: The Data Release 2 Overview
Authors:
Michael J. Koss,
Benny Trakhtenbrot,
Claudio Ricci,
Franz E. Bauer,
Ezequiel Treister,
Richard Mushotzky,
C. Megan Urry,
Tonima T. Ananna,
Mislav Balokovic,
Jakob S. den Brok,
S. Bradley Cenko,
Fiona Harrison,
Kohei Ichikawa,
Isabella Lamperti,
Amy Lein,
Julian E. Mejia-Restrepo,
Kyuseok Oh,
Fabio Pacucci,
Ryan W. Pfeifle,
Meredith C. Powell,
George C. Privon,
Federica Ricci,
Mara Salvato,
Kevin Schawinski,
Taro Shimizu
, et al. (2 additional authors not shown)
Abstract:
The BAT AGN Spectroscopic Survey (BASS) is designed to provide a highly complete census of the key physical parameters of supermassive black holes (SMBHs) that power local active galactic nuclei (AGN) (z<0.3), including their bolometric luminosity, black hole mass, accretion rates, and line-of-sight gas obscuration, and the distinctive properties of their host galaxies (e.g., star formation rates,…
▽ More
The BAT AGN Spectroscopic Survey (BASS) is designed to provide a highly complete census of the key physical parameters of supermassive black holes (SMBHs) that power local active galactic nuclei (AGN) (z<0.3), including their bolometric luminosity, black hole mass, accretion rates, and line-of-sight gas obscuration, and the distinctive properties of their host galaxies (e.g., star formation rates, masses, and gas fractions). We present an overview of the BASS data release 2 (DR2), an unprecedented spectroscopic survey in spectral range, resolution, and sensitivity, including 1449 optical (3200-10000 A) and 233 NIR (1-2.5 um) spectra for the brightest 858 ultra-hard X-ray (14-195 keV) selected AGN across the entire sky and essentially all levels of obscuration. This release provides a highly complete set of key measurements (emission line measurements and central velocity dispersions), with 99.9% measured redshifts and 98% black hole masses estimated (for unbeamed AGN outside the Galactic plane). The BASS DR2 AGN sample represents a unique census of nearby powerful AGN, spanning over 5 orders of magnitude in AGN bolometric luminosity, black hole mass, Eddington ratio, and obscuration. The public BASS DR2 sample and measurements can thus be used to answer fundamental questions about SMBH growth and its links to host galaxy evolution and feedback in the local universe, as well as open questions concerning SMBH physics. Here we provide a brief overview of the survey strategy, the key BASS DR2 measurements, data sets and catalogs, and scientific highlights from a series of DR2-based works.
△ Less
Submitted 25 July, 2022;
originally announced July 2022.
-
BASS XXVIII: Near-infrared Data Release 2, High-Ionization and Broad Lines in Active Galactic Nuclei
Authors:
Jakob den Brok,
Michael J. Koss,
Benny Trakhtenbrot,
Daniel Stern,
Sebastiano Cantalupo,
Isabella Lamperti,
Federica Ricci,
Claudio Ricci,
Kyuseok Oh,
Franz E. Bauer,
Rogerio Riffel,
Alberto Rodriguez-Ardila,
Rudolf Baer,
Fiona Harrison,
Kohei Ichikawa,
Julian E. Mejia-Restrepo,
Richard Mushotzky,
Meredith C. Powell,
Rozenn Boissay-Malaquin,
Marko Stalevski,
Ezequiel Treister,
C. Megan Urry,
Sylvain Veilleux
Abstract:
We present the BAT AGN Spectroscopic Survey (BASS) Near-infrared Data Release 2 (DR2), a study of 168 nearby ($\bar z$ = 0.04, $z$ < 0.6) active galactic nuclei (AGN) from the all-sky Swift Burst Array Telescope X-ray survey observed with Very Large Telescope (VLT)/X-shooter in the near-infrared (NIR; 0.8 - 2.4 $μ$m). We find that 49/109 (45%) Seyfert 2 and 35/58 (60%) Seyfert 1 galaxies observed…
▽ More
We present the BAT AGN Spectroscopic Survey (BASS) Near-infrared Data Release 2 (DR2), a study of 168 nearby ($\bar z$ = 0.04, $z$ < 0.6) active galactic nuclei (AGN) from the all-sky Swift Burst Array Telescope X-ray survey observed with Very Large Telescope (VLT)/X-shooter in the near-infrared (NIR; 0.8 - 2.4 $μ$m). We find that 49/109 (45%) Seyfert 2 and 35/58 (60%) Seyfert 1 galaxies observed with VLT/X-shooter show at least one NIR high-ionization coronal line (CL, ionization potential $χ$ > 100 eV). Comparing the emission of the [Si vi] $λ$1.9640 CL with the X-ray emission for the DR2 AGN, we find a significantly tighter correlation, with a lower scatter (0.37 dex) than for the optical [O iii] $λ$5007 line (0.71 dex). We do not find any correlation between CL emission and the X-ray photon index $Γ$. We find a clear trend of line blueshifts with increasing ionization potential in several CLs, such as [Si vi] $λ$1.9640, [Si x] $λ$1.4300, [S viii] $λ$0.9915, and [S ix] $λ$1.2520, indicating the radial structure of the CL region. Finally, we find a strong underestimation bias in black hole mass measurements of Sy 1.9 using broad H$α$ due to the presence of significant dust obscuration. In contrast, the broad Pa$α$ and Pa$β$ emission lines are in agreement with the $M$-$σ$ relation. Based on the combined DR1 and DR2 X-shooter sample, the NIR BASS sample now comprises 266 AGN with rest-frame NIR spectroscopic observations, the largest set assembled to date.
△ Less
Submitted 25 July, 2022;
originally announced July 2022.
-
pathGCN: Learning General Graph Spatial Operators from Paths
Authors:
Moshe Eliasof,
Eldad Haber,
Eran Treister
Abstract:
Graph Convolutional Networks (GCNs), similarly to Convolutional Neural Networks (CNNs), are typically based on two main operations - spatial and point-wise convolutions. In the context of GCNs, differently from CNNs, a pre-determined spatial operator based on the graph Laplacian is often chosen, allowing only the point-wise operations to be learnt. However, learning a meaningful spatial operator i…
▽ More
Graph Convolutional Networks (GCNs), similarly to Convolutional Neural Networks (CNNs), are typically based on two main operations - spatial and point-wise convolutions. In the context of GCNs, differently from CNNs, a pre-determined spatial operator based on the graph Laplacian is often chosen, allowing only the point-wise operations to be learnt. However, learning a meaningful spatial operator is critical for developing more expressive GCNs for improved performance. In this paper we propose pathGCN, a novel approach to learn the spatial operator from random paths on the graph. We analyze the convergence of our method and its difference from existing GCNs. Furthermore, we discuss several options of combining our learnt spatial operator with point-wise convolutions. Our extensive experiments on numerous datasets suggest that by properly learning both the spatial and point-wise convolutions, phenomena like over-smoothing can be inherently avoided, and new state-of-the-art performance is achieved.
△ Less
Submitted 15 July, 2022;
originally announced July 2022.
-
GaMPEN: A Machine Learning Framework for Estimating Bayesian Posteriors of Galaxy Morphological Parameters
Authors:
Aritra Ghosh,
C. Megan Urry,
Amrit Rau,
Laurence Perreault-Levasseur,
Miles Cranmer,
Kevin Schawinski,
Dominic Stark,
Chuan Tian,
Ryan Ofman,
Tonima Tasnim Ananna,
Connor Auge,
Nico Cappelluti,
David B. Sanders,
Ezequiel Treister
Abstract:
We introduce a novel machine learning framework for estimating the Bayesian posteriors of morphological parameters for arbitrarily large numbers of galaxies. The Galaxy Morphology Posterior Estimation Network (GaMPEN) estimates values and uncertainties for a galaxy's bulge-to-total light ratio ($L_B/L_T$), effective radius ($R_e$), and flux ($F$). To estimate posteriors, GaMPEN uses the Monte Carl…
▽ More
We introduce a novel machine learning framework for estimating the Bayesian posteriors of morphological parameters for arbitrarily large numbers of galaxies. The Galaxy Morphology Posterior Estimation Network (GaMPEN) estimates values and uncertainties for a galaxy's bulge-to-total light ratio ($L_B/L_T$), effective radius ($R_e$), and flux ($F$). To estimate posteriors, GaMPEN uses the Monte Carlo Dropout technique and incorporates the full covariance matrix between the output parameters in its loss function. GaMPEN also uses a Spatial Transformer Network (STN) to automatically crop input galaxy frames to an optimal size before determining their morphology. This will allow it to be applied to new data without prior knowledge of galaxy size. Training and testing GaMPEN on galaxies simulated to match $z < 0.25$ galaxies in Hyper Suprime-Cam Wide $g$-band images, we demonstrate that GaMPEN achieves typical errors of $0.1$ in $L_B/L_T$, $0.17$ arcsec ($\sim 7\%$) in $R_e$, and $6.3\times10^4$ nJy ($\sim 1\%$) in $F$. GaMPEN's predicted uncertainties are well-calibrated and accurate ($<5\%$ deviation) -- for regions of the parameter space with high residuals, GaMPEN correctly predicts correspondingly large uncertainties. We also demonstrate that we can apply categorical labels (i.e., classifications such as "highly bulge-dominated") to predictions in regions with high residuals and verify that those labels are $\gtrsim 97\%$ accurate. To the best of our knowledge, GaMPEN is the first machine learning framework for determining joint posterior distributions of multiple morphological parameters and is also the first application of an STN to optical imaging in astronomy.
△ Less
Submitted 11 July, 2022;
originally announced July 2022.
-
Ionized Outflows in Nearby Quasars are Poorly Coupled to their Host Galaxies
Authors:
Juan Molina,
Luis C. Ho,
Ran Wang,
Jinyi Shangguan,
Franz E. Bauer,
Ezequiel Treister,
Ming-Yang Zhuang,
Claudio Ricci,
Fuyan Bian
Abstract:
We analyze Multi Unit Spectroscopic Explorer observations of nine low-redshift (z < 0.1) Palomar-Green quasar host galaxies to investigate the spatial distribution and kinematics of the warm, ionized interstellar medium, with the goal of searching for and constraining the efficiency of active galactic nucleus (AGN) feedback. After separating the bright AGN from the starlight and nebular emission,…
▽ More
We analyze Multi Unit Spectroscopic Explorer observations of nine low-redshift (z < 0.1) Palomar-Green quasar host galaxies to investigate the spatial distribution and kinematics of the warm, ionized interstellar medium, with the goal of searching for and constraining the efficiency of active galactic nucleus (AGN) feedback. After separating the bright AGN from the starlight and nebular emission, we use pixel-wise, kpc-scale diagnostics to determine the underlying excitation mechanism of the line emission, and we measure the kinematics of the narrow-line region (NLR) to estimate the physical properties of the ionized outflows. The radial size of the NLR correlates with the AGN luminosity, reaching scales of $\sim 5\,$kpc and beyond. The geometry of the NLR is well-represented by a projected biconical structure, suggesting that the AGN radiation preferably escapes through the ionization cone. We find enhanced velocity dispersions ($\sim 100\,$km$\,$s$^{-1}$) traced by the H$α$ emission line in localized zones within the ionization cones. Interpreting these kinematic features as signatures of interaction between an AGN-driven ionized gas outflow and the host galaxy interstellar medium, we derive mass outflow rates of $\sim 0.008-1.6\, M_\odot \,$yr$^{-1}$ and kinetic injection rates of $\sim 10^{39}-10^{42} \,$erg$\,$s$^{-1}$, which yield extremely low coupling efficiencies of $\lesssim 10^{-3}$. These findings add to the growing body of recent observational evidence that AGN feedback is highly ineffective in the host galaxies of nearby AGNs.
△ Less
Submitted 27 June, 2022;
originally announced June 2022.
-
Rethinking Unsupervised Neural Superpixel Segmentation
Authors:
Moshe Eliasof,
Nir Ben Zikri,
Eran Treister
Abstract:
Recently, the concept of unsupervised learning for superpixel segmentation via CNNs has been studied. Essentially, such methods generate superpixels by convolutional neural network (CNN) employed on a single image, and such CNNs are trained without any labels or further information. Thus, such approach relies on the incorporation of priors, typically by designing an objective function that guides…
▽ More
Recently, the concept of unsupervised learning for superpixel segmentation via CNNs has been studied. Essentially, such methods generate superpixels by convolutional neural network (CNN) employed on a single image, and such CNNs are trained without any labels or further information. Thus, such approach relies on the incorporation of priors, typically by designing an objective function that guides the solution towards a meaningful superpixel segmentation. In this paper we propose three key elements to improve the efficacy of such networks: (i) the similarity of the \emph{soft} superpixelated image compared to the input image, (ii) the enhancement and consideration of object edges and boundaries and (iii) a modified architecture based on atrous convolution, which allow for a wider field of view, functioning as a multi-scale component in our network. By experimenting with the BSDS500 dataset, we find evidence to the significance of our proposal, both qualitatively and quantitatively.
△ Less
Submitted 21 June, 2022;
originally announced June 2022.
-
Accretion History of AGN: Estimating the Host Galaxy Properties in X-ray Luminous AGN from z=0-3
Authors:
Brandon Coleman,
Allison Kirkpatrick,
Kevin C. Cooke,
Eilat Glikman,
Stephanie La Massa,
Stefano Marchesi,
Alessandro Peca,
Ezequiel Treister,
Connor Auge,
C. Megan Urry,
Dave Sanders,
Tracey Jane Turner,
Tonima Tasnim Ananna
Abstract:
We aim to determine the intrinsic far-Infrared (far-IR) emission of X-ray-luminous quasars over cosmic time. Using a 16 deg^2 region of the Stripe 82 field surveyed by XMM-Newton and Herschel Space Observatory, we identify 2905 X-ray luminous (LX > 10^42 erg/s) Active Galactic Nuclei (AGN) in the range z ~ 0-3. The IR is necessary to constrain host galaxy properties such as star formation rate (SF…
▽ More
We aim to determine the intrinsic far-Infrared (far-IR) emission of X-ray-luminous quasars over cosmic time. Using a 16 deg^2 region of the Stripe 82 field surveyed by XMM-Newton and Herschel Space Observatory, we identify 2905 X-ray luminous (LX > 10^42 erg/s) Active Galactic Nuclei (AGN) in the range z ~ 0-3. The IR is necessary to constrain host galaxy properties such as star formation rate (SFR) and gas mass. However, only 10% of our AGN are detected both in the X-ray and IR. Because 90% of the sample is undetected in the far-IR by Herschel, we explore the mean IR emission of these undetected sources by stacking their Herschel/SPIRE images in bins of X-ray luminosity and redshift. We create stacked spectral energy distributions from the optical to the far-IR, and estimate the median star formation rate, dust mass, stellar mass, and infrared luminosity using a fitting routine. We find that the stacked sources on average have similar SFR/L_bol ratios as IR detected sources. The majority of our sources fall on or above the main sequence line suggesting that X-ray selection alone does not predict the location of a galaxy on the main sequence. We also find that the gas depletion timescales of our AGN are similar to those of dusty star forming galaxies. This suggests that X-ray selected AGN host high star formation and that there are no signs of declining star formation.
△ Less
Submitted 13 June, 2022;
originally announced June 2022.
-
Fairness and Unfairness in Binary and Multiclass Classification: Quantifying, Calculating, and Bounding
Authors:
Sivan Sabato,
Eran Treister,
Elad Yom-Tov
Abstract:
We propose a new interpretable measure of unfairness, that allows providing a quantitative analysis of classifier fairness, beyond a dichotomous fair/unfair distinction. We show how this measure can be calculated when the classifier's conditional confusion matrices are known. We further propose methods for auditing classifiers for their fairness when the confusion matrices cannot be obtained or ev…
▽ More
We propose a new interpretable measure of unfairness, that allows providing a quantitative analysis of classifier fairness, beyond a dichotomous fair/unfair distinction. We show how this measure can be calculated when the classifier's conditional confusion matrices are known. We further propose methods for auditing classifiers for their fairness when the confusion matrices cannot be obtained or even estimated. Our approach lower-bounds the unfairness of a classifier based only on aggregate statistics, which may be provided by the owner of the classifier or collected from freely available data. We use the equalized odds criterion, which we generalize to the multiclass case. We report experiments on data sets representing diverse applications, which demonstrate the effectiveness and the wide range of possible uses of the proposed methodology. An implementation of the procedures proposed in this paper and as the code for running the experiments are provided in https://github.com/sivansabato/unfairness.
△ Less
Submitted 5 April, 2024; v1 submitted 7 June, 2022;
originally announced June 2022.
-
Wavelet Feature Maps Compression for Image-to-Image CNNs
Authors:
Shahaf E. Finder,
Yair Zohav,
Maor Ashkenazi,
Eran Treister
Abstract:
Convolutional Neural Networks (CNNs) are known for requiring extensive computational resources, and quantization is among the best and most common methods for compressing them. While aggressive quantization (i.e., less than 4-bits) performs well for classification, it may cause severe performance degradation in image-to-image tasks such as semantic segmentation and depth estimation. In this paper,…
▽ More
Convolutional Neural Networks (CNNs) are known for requiring extensive computational resources, and quantization is among the best and most common methods for compressing them. While aggressive quantization (i.e., less than 4-bits) performs well for classification, it may cause severe performance degradation in image-to-image tasks such as semantic segmentation and depth estimation. In this paper, we propose Wavelet Compressed Convolution (WCC) -- a novel approach for high-resolution activation maps compression integrated with point-wise convolutions, which are the main computational cost of modern architectures. To this end, we use an efficient and hardware-friendly Haar-wavelet transform, known for its effectiveness in image compression, and define the convolution on the compressed activation map. We experiment with various tasks that benefit from high-resolution input. By combining WCC with light quantization, we achieve compression rates equivalent to 1-4bit activation quantization with relatively small and much more graceful degradation in performance. Our code is available at https://github.com/BGUCompSci/WaveletCompressedConvolution.
△ Less
Submitted 16 October, 2022; v1 submitted 24 May, 2022;
originally announced May 2022.
-
pISTA: preconditioned Iterative Soft Thresholding Algorithm for Graphical Lasso
Authors:
Gal Shalom,
Eran Treister,
Irad Yavneh
Abstract:
We propose a novel quasi-Newton method for solving the sparse inverse covariance estimation problem also known as the graphical least absolute shrinkage and selection operator (GLASSO). This problem is often solved using a second-order quadratic approximation. However, in such algorithms the Hessian term is complex and computationally expensive to handle. Therefore, our method uses the inverse of…
▽ More
We propose a novel quasi-Newton method for solving the sparse inverse covariance estimation problem also known as the graphical least absolute shrinkage and selection operator (GLASSO). This problem is often solved using a second-order quadratic approximation. However, in such algorithms the Hessian term is complex and computationally expensive to handle. Therefore, our method uses the inverse of the Hessian as a preconditioner to simplify and approximate the quadratic element at the cost of a more complex \(\ell_1\) element. The variables of the resulting preconditioned problem are coupled only by the \(\ell_1\) sub-derivative of each other, which can be guessed with minimal cost using the gradient itself, allowing the algorithm to be parallelized and implemented efficiently on GPU hardware accelerators. Numerical results on synthetic and real data demonstrate that our method is competitive with other state-of-the-art approaches.
△ Less
Submitted 17 October, 2023; v1 submitted 20 May, 2022;
originally announced May 2022.