Search | arXiv e-print repository

arXiv:2405.20389 [pdf, other]

Designing an Evaluation Framework for Large Language Models in Astronomy Research

Authors: John F. Wu, Alina Hyk, Kiera McCormick, Christine Ye, Simone Astarita, Elina Baral, Jo Ciuca, Jesse Cranney, Anjalie Field, Kartheik Iyer, Philipp Koehn, Jenn Kotler, Sandor Kruk, Michelle Ntampaka, Charles O'Neill, Joshua E. G. Peek, Sanjib Sharma, Mikaeel Yunus

Abstract: Large Language Models (LLMs) are shifting how scientific research is done. It is imperative to understand how researchers interact with these models and how scientific sub-communities like astronomy might benefit from them. However, there is currently no standard for evaluating the use of LLMs in astronomy. Therefore, we present the experimental design for an evaluation study on how astronomy rese… ▽ More Large Language Models (LLMs) are shifting how scientific research is done. It is imperative to understand how researchers interact with these models and how scientific sub-communities like astronomy might benefit from them. However, there is currently no standard for evaluating the use of LLMs in astronomy. Therefore, we present the experimental design for an evaluation study on how astronomy researchers interact with LLMs. We deploy a Slack chatbot that can answer queries from users via Retrieval-Augmented Generation (RAG); these responses are grounded in astronomy papers from arXiv. We record and anonymize user questions and chatbot answers, user upvotes and downvotes to LLM responses, user feedback to the LLM, and retrieved documents and similarity scores with the query. Our data collection method will enable future dynamic evaluations of LLM tools for astronomy. △ Less

Submitted 30 May, 2024; originally announced May 2024.

Comments: 7 pages, 3 figures. Code available at https://github.com/jsalt2024-evaluating-llms-for-astronomy/astro-arxiv-bot

arXiv:2402.10337 [pdf, other]

LoVoCCS. II. Weak Lensing Mass Distributions, Red-Sequence Galaxy Distributions, and Their Alignment with the Brightest Cluster Galaxy in 58 Nearby X-ray-Luminous Galaxy Clusters

Authors: Shenming Fu, Ian Dell'Antonio, Zacharias Escalante, Jessica Nelson, Anthony Englert, Søren Helhoski, Rahul Shinde, Julia Brockland, Philip LaDuca, Christelyn Larkin, Lucca Paris, Shane Weiner, William K. Black, Ranga-Ram Chary, Douglas Clowe, M. C. Cooper, Megan Donahue, August Evrard, Mark Lacy, Tod Lauer, Binyang Liu, Jacqueline McCleary, Massimo Meneghetti, Hironao Miyatake, Mireia Montes , et al. (9 additional authors not shown)

Abstract: The Local Volume Complete Cluster Survey (LoVoCCS) is an on-going program to observe nearly a hundred low-redshift X-ray-luminous galaxy clusters (redshifts $0.03<z<0.12$ and X-ray luminosities in the 0.1-2.4 keV band $L_{\rm X500c}>10^{44}$ erg/s) with the Dark Energy Camera (DECam), capturing data in $u,g,r,i,z$ bands with a $5σ$ point source depth of approximately 25-26th AB magnitudes. Here, w… ▽ More The Local Volume Complete Cluster Survey (LoVoCCS) is an on-going program to observe nearly a hundred low-redshift X-ray-luminous galaxy clusters (redshifts $0.03<z<0.12$ and X-ray luminosities in the 0.1-2.4 keV band $L_{\rm X500c}>10^{44}$ erg/s) with the Dark Energy Camera (DECam), capturing data in $u,g,r,i,z$ bands with a $5σ$ point source depth of approximately 25-26th AB magnitudes. Here, we map the aperture masses in 58 galaxy cluster fields using weak gravitational lensing. These clusters span a variety of dynamical states, from nearly relaxed to merging systems, and approximately half of them have not been subject to detailed weak lensing analysis before. In each cluster field, we analyze the alignment between the 2D mass distribution described by the aperture mass map, the 2D red-sequence (RS) galaxy distribution, and the brightest cluster galaxy (BCG). We find that the orientations of the BCG and the RS distribution are strongly aligned throughout the interiors of the clusters: the median misalignment angle is 19 deg within 2 Mpc. We also observe the alignment between the orientations of the RS distribution and the overall cluster mass distribution (by a median difference of 32 deg within 1 Mpc), although this is constrained by galaxy shape noise and the limitations of our cluster sample size. These types of alignment suggest long-term dynamical evolution within the clusters over cosmic timescales. △ Less

Submitted 1 August, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

Comments: 40 pages, 16 figures, 5 tables; revised and accepted for publication in ApJ

arXiv:2310.12528 [pdf, other]

Constructing Impactful Machine Learning Research for Astronomy: Best Practices for Researchers and Reviewers

Authors: D. Huppenkothen, M. Ntampaka, M. Ho, M. Fouesneau, B. Nord, J. E. G. Peek, M. Walmsley, J. F. Wu, C. Avestruz, T. Buck, M. Brescia, D. P. Finkbeiner, A. D. Goulding, T. Kacprzak, P. Melchior, M. Pasquato, N. Ramachandra, Y. -S. Ting, G. van de Ven, S. Villar, V. A. Villar, E. Zinger

Abstract: Machine learning has rapidly become a tool of choice for the astronomical community. It is being applied across a wide range of wavelengths and problems, from the classification of transients to neural network emulators of cosmological simulations, and is shifting paradigms about how we generate and report scientific results. At the same time, this class of method comes with its own set of best pr… ▽ More Machine learning has rapidly become a tool of choice for the astronomical community. It is being applied across a wide range of wavelengths and problems, from the classification of transients to neural network emulators of cosmological simulations, and is shifting paradigms about how we generate and report scientific results. At the same time, this class of method comes with its own set of best practices, challenges, and drawbacks, which, at present, are often reported on incompletely in the astrophysical literature. With this paper, we aim to provide a primer to the astronomical community, including authors, reviewers, and editors, on how to implement machine learning models and report their results in a way that ensures the accuracy of the results, reproducibility of the findings, and usefulness of the method. △ Less

Submitted 19 October, 2023; originally announced October 2023.

Comments: 14 pages, 3 figures; submitted to the Bulletin of the American Astronomical Society

arXiv:2307.16733 [pdf, other]

doi 10.1093/mnras/stad2596

Painting baryons onto N-body simulations of galaxy clusters with image-to-image deep learning

Authors: Urmila Chadayammuri, Michelle Ntampaka, John ZuHone, Àkos Bogdàn, Ralph Kraft

Abstract: Galaxy cluster mass functions are a function of cosmology, but mass is not a direct observable, and systematic errors abound in all its observable proxies. Mass-free inference can bypass this challenge, but it requires large suites of simulations spanning a range of cosmologies and models for directly observable quantities. In this work, we devise a U-net - an image-to-image machine learning algor… ▽ More Galaxy cluster mass functions are a function of cosmology, but mass is not a direct observable, and systematic errors abound in all its observable proxies. Mass-free inference can bypass this challenge, but it requires large suites of simulations spanning a range of cosmologies and models for directly observable quantities. In this work, we devise a U-net - an image-to-image machine learning algorithm - to ``paint'' the IllustrisTNG model of baryons onto dark-matter-only simulations of galaxy clusters. Using 761 galaxy clusters with $M_{200c} \gtrsim 10^{14}M_\odot$ from the TNG-300 simulation at $z<1$, we train the algorithm to read in maps of projected dark matter mass and output maps of projected gas density, temperature, and X-ray flux. The models train in under an hour on two GPUs, and then predict baryonic images for $\sim2700$ dark matter maps drawn from the TNG-300 dark-matter-only (DMO) simulation in under two minutes. Despite being trained on individual images, the model reproduces the true scaling relation and scatter for the $M_{DM}-L_X$, as well as the distribution functions of the cluster X-ray luminosity and gas mass. For just one decade in cluster mass, the model reproduces three orders of magnitude in $L_X$. The model is biased slightly high when using dark matter maps from the DMO simulation. The model performs well on inputs from TNG-300-2, whose mass resolution is 8 times coarser; further degrading the resolution biases the predicted luminosity function high. We conclude that U-net-based baryon painting is a promising technique to build large simulated cluster catalogs which can be used to improve cluster cosmology by combining existing full-physics and large $N$-body simulations. △ Less

Submitted 24 August, 2023; v1 submitted 31 July, 2023; originally announced July 2023.

Comments: Accepted to MNRAS

arXiv:2303.00005 [pdf, other]

doi 10.1093/mnras/stad2005

Benchmarks and Explanations for Deep Learning Estimates of X-ray Galaxy Cluster Masses

Authors: Matthew Ho, John Soltis, Arya Farahi, Daisuke Nagai, August Evrard, Michelle Ntampaka

Abstract: We evaluate the effectiveness of deep learning (DL) models for reconstructing the masses of galaxy clusters using X-ray photometry data from next-generation surveys. We establish these constraints using a catalogue of realistic mock eROSITA X-ray observations which use hydrodynamical simulations to model realistic cluster morphology, background emission, telescope response, and AGN sources. Using… ▽ More We evaluate the effectiveness of deep learning (DL) models for reconstructing the masses of galaxy clusters using X-ray photometry data from next-generation surveys. We establish these constraints using a catalogue of realistic mock eROSITA X-ray observations which use hydrodynamical simulations to model realistic cluster morphology, background emission, telescope response, and AGN sources. Using bolometric X-ray photon maps as input, DL models achieve a predictive mass scatter of $σ_{\ln M_\mathrm{500c}} = 17.8\%$, a factor of two improvements on scalar observables such as richness $N_\mathrm{gal}$, 1D velocity dispersion $σ_\mathrm{v,1D}$, and photon count $N_\mathrm{phot}$ as well as a $32\%$ improvement upon idealised, volume-integrated measurements of the bolometric X-ray luminosity $L_X$. We then show that extending this model to handle multichannel X-ray photon maps, separated in low, medium, and high energy bands, further reduces the mass scatter to $16.2\%$. We also tested a multimodal DL model incorporating both dynamical and X-ray cluster probes and achieved marginal gains at a mass scatter of $15.9\%$. Finally, we conduct a quantitative interpretability study of our DL models and find that they greatly down-weight the importance of pixels in the centres of clusters and at the location of AGN sources, validating previous claims of DL modelling improvements and suggesting practical and theoretical benefits for using DL in X-ray mass inference. △ Less

Submitted 25 July, 2023; v1 submitted 28 February, 2023; originally announced March 2023.

Comments: 14 pages, 9 figures, 3 tables, accepted in MNRAS

Journal ref: 2023 MNRAS, 524, 3, 3289-3302

arXiv:2301.02231 [pdf, other]

Predicting the impact of feedback on matter clustering with machine learning in CAMELS

Authors: Ana Maria Delgado, Daniel Angles-Alcazar, Leander Thiele, Shivam Pandey, Kai Lehman, Rachel S. Somerville, Michelle Ntampaka, Shy Genel, Francisco Villaescusa-Navarro, Lars Hernquist

Abstract: Extracting information from the total matter power spectrum with the precision needed for upcoming cosmological surveys requires unraveling the complex effects of galaxy formation processes on the distribution of matter. We investigate the impact of baryonic physics on matter clustering at $z=0$ using a library of power spectra from the Cosmology and Astrophysics with MachinE Learning Simulations… ▽ More Extracting information from the total matter power spectrum with the precision needed for upcoming cosmological surveys requires unraveling the complex effects of galaxy formation processes on the distribution of matter. We investigate the impact of baryonic physics on matter clustering at $z=0$ using a library of power spectra from the Cosmology and Astrophysics with MachinE Learning Simulations (CAMELS) project, containing thousands of $(25\,h^{-1}{\rm Mpc})^3$ volume realizations with varying cosmology, initial random field, stellar and AGN feedback strength and sub-grid model implementation methods. We show that baryonic physics affects matter clustering on scales $k \gtrsim 0.4\,h\,\mathrm{Mpc}^{-1}$ and the magnitude of this effect is dependent on the details of the galaxy formation implementation and variations of cosmological and astrophysical parameters. Increasing AGN feedback strength decreases halo baryon fractions and yields stronger suppression of power relative to N-body simulations, while stronger stellar feedback often results in weaker effects by suppressing black hole growth and therefore the impact of AGN feedback. We find a broad correlation between mean baryon fraction of massive halos ($M_{\rm 200c} > 10^{13.5}$\,\Msun) and suppression of matter clustering but with significant scatter compared to previous work owing to wider exploration of feedback parameters and cosmic variance effects. We show that a random forest regressor trained on the baryon content and abundance of halos across the full mass range $10^{10} \leq M_\mathrm{halo}/$\Msun$< 10^{15}$ can predict the effect of galaxy formation on the matter power spectrum on scales $k = 1.0$--20.0\,$h\,\mathrm{Mpc}^{-1}$. △ Less

Submitted 5 October, 2023; v1 submitted 5 January, 2023; originally announced January 2023.

arXiv:2207.14324 [pdf, other]

doi 10.3847/1538-4357/ac9b1b

A Machine Learning Approach to Enhancing eROSITA Observations

Authors: John Soltis, Michelle Ntampaka, John Wu, John ZuHone, August Evrard, Arya Farahi, Matthew Ho, Daisuke Nagai

Abstract: The eROSITA X-ray telescope, launched in 2019, is predicted to observe roughly 100,000 galaxy clusters. Follow-up observations of these clusters from Chandra, for example, will be needed to resolve outstanding questions about galaxy cluster physics. Deep Chandra cluster observations are expensive and follow-up of every eROSITA cluster is infeasible, therefore, objects chosen for follow-up must be… ▽ More The eROSITA X-ray telescope, launched in 2019, is predicted to observe roughly 100,000 galaxy clusters. Follow-up observations of these clusters from Chandra, for example, will be needed to resolve outstanding questions about galaxy cluster physics. Deep Chandra cluster observations are expensive and follow-up of every eROSITA cluster is infeasible, therefore, objects chosen for follow-up must be chosen with care. To address this, we have developed an algorithm for predicting longer duration, background-free observations based on mock eROSITA observations. We make use of the hydrodynamic cosmological simulation Magneticum, have simulated eROSITA instrument conditions using SIXTE, and have applied a novel convolutional neural network to output a deep Chandra-like "super observation" of each cluster in our simulation sample. Any follow-up merit assessment tool should be designed with a specific use case in mind; our model produces observations that accurately and precisely reproduce the cluster morphology, which is a critical ingredient for determining cluster dynamical state and core type. Our model will advance our understanding of galaxy clusters by improving follow-up selection and demonstrates that image-to-image deep learning algorithms are a viable method for simulating realistic follow-up observations. △ Less

Submitted 9 November, 2022; v1 submitted 28 July, 2022; originally announced July 2022.

Comments: 21 pages, 11 figures, 3 tables. Minor changes upon revision. Corrected caption of Figure 3. Added discussion of alternative asymmetry metrics. To be published in the Astrophysical Journal

arXiv:2206.14834 [pdf, other]

doi 10.1038/s41550-022-01711-1

The Dynamical Mass of the Coma Cluster from Deep Learning

Authors: Matthew Ho, Michelle Ntampaka, Markus Michael Rau, Minghan Chen, Alexa Lansberry, Faith Ruehle, Hy Trac

Abstract: In 1933, Fritz Zwicky's famous investigations of the mass of the Coma cluster led him to infer the existence of dark matter \cite{1933AcHPh...6..110Z}. His fundamental discoveries have proven to be foundational to modern cosmology; as we now know such dark matter makes up 85\% of the matter and 25\% of the mass-energy content in the universe. Galaxy clusters like Coma are massive, complex systems… ▽ More In 1933, Fritz Zwicky's famous investigations of the mass of the Coma cluster led him to infer the existence of dark matter \cite{1933AcHPh...6..110Z}. His fundamental discoveries have proven to be foundational to modern cosmology; as we now know such dark matter makes up 85\% of the matter and 25\% of the mass-energy content in the universe. Galaxy clusters like Coma are massive, complex systems of dark matter in addition to hot ionized gas and thousands of galaxies, and serve as excellent probes of the dark matter distribution. However, empirical studies show that the total mass of such systems remains elusive and difficult to precisely constrain. Here, we present new estimates for the dynamical mass of the Coma cluster based on Bayesian deep learning methodologies developed in recent years. Using our novel data-driven approach, we predict Coma's $\mthc$ mass to be $10^{15.10 \pm 0.15}\ \hmsun$ within a radius of $1.78 \pm 0.03\ h^{-1}\mathrm{Mpc}$ of its center. We show that our predictions are rigorous across multiple training datasets and statistically consistent with historical estimates of Coma's mass. This measurement reinforces our understanding of the dynamical state of the Coma cluster and advances rigorous analyses and verification methods for empirical applications of machine learning in astronomy. △ Less

Submitted 29 June, 2022; originally announced June 2022.

Comments: 15 pages, 3 figures, 1 table, accepted for publication at Nature Astronomy, see https://www.nature.com/articles/s41550-022-01711-1

arXiv:2205.14270 [pdf, ps, other]

A Referee Primer for Early Career Astronomers

Authors: Michelle Ntampaka, Ana Bonaca, Sownak Bose, Daniel J. Eisenstein, Boryana Hadzhiyska, Charlotte Mason, Daisuke Nagai, Joshua S. Speagle

Abstract: Refereeing is a crucial component of publishing astronomical research, but few professional astronomers receive formal training on how to effectively referee a manuscript. In this article, we lay out considerations and best practices for referees. This document is intended as a tool for early career researchers to develop a fair, effective, and efficient approach to refereeing. Refereeing is a crucial component of publishing astronomical research, but few professional astronomers receive formal training on how to effectively referee a manuscript. In this article, we lay out considerations and best practices for referees. This document is intended as a tool for early career researchers to develop a fair, effective, and efficient approach to refereeing. △ Less

Submitted 27 May, 2022; originally announced May 2022.

Comments: Submitted to the Bulletin of the AAS

arXiv:2202.12311 [pdf, other]

R2-D2: Roman and Rubin -- From Data to Discovery

Authors: Suvi Gezari, Misty Bentz, Kishalay De, K. Decker French, Aaron Meisner, Michelle Ntampaka, Robert Jedicke, Ekta Patel, Daniel Perley, Robyn Sanderson, Christian Aganze, Igor Andreoni, Eric F. Bell, Edo Berger, Ian Dell'Antonio, Ryan Foley, Henry Hsieh, Mansi Kasliwal, Joel Kastner, Charles D. Kilpatrick, J. Davy Kirkpatrick, Casey Lam, Karen Meech, Dante Minniti, Ethan O. Nadler , et al. (6 additional authors not shown)

Abstract: The NASA Nancy Grace Roman Space Telescope (Roman) and the Vera C. Rubin Observatory Legacy Survey of Space and Time (Rubin), will transform our view of the wide-field sky, with similar sensitivities, but complementary in wavelength, spatial resolution, and time domain coverage. Here we present findings from the AURA Roman+Rubin Synergy Working group, charged by the STScI and NOIRLab Directors to… ▽ More The NASA Nancy Grace Roman Space Telescope (Roman) and the Vera C. Rubin Observatory Legacy Survey of Space and Time (Rubin), will transform our view of the wide-field sky, with similar sensitivities, but complementary in wavelength, spatial resolution, and time domain coverage. Here we present findings from the AURA Roman+Rubin Synergy Working group, charged by the STScI and NOIRLab Directors to identify frontier science questions in General Astrophysics, beyond the well-covered areas of Dark Energy and Cosmology, that can be uniquely addressed with Roman and Rubin synergies in observing strategy, data products and archiving, joint analysis, and community engagement. This analysis was conducted with input from the community in the form of brief (1-2 paragraph) "science pitches" (see Appendix), and testimony from "outside experts" (included as co-authors). We identify a rich and broad landscape of potential discoveries catalyzed by the combination of exceptional quality and quantity of Roman and Rubin data, and summarize implementation requirements that would facilitate this bounty of additional science with coordination of survey fields, joint coverage of the Galactic plane, bulge, and ecliptic, expansion of General Investigator and Target of Opportunity observing modes, co-location of Roman and Rubin data, and timely distribution of data, transient alerts, catalogs, value-added joint analysis products, and simulations to the broad astronomical community. △ Less

Submitted 24 February, 2022; originally announced February 2022.

Comments: 29 pages, 12 figures, Table of Implementation Recommendations, Appendix of Community Science Pitches, AURA-commissioned whitepaper submitted to the Director of STScI (Ken Sembach) and the Director of NOIRLab (Pat McCarthy)

arXiv:2112.05768 [pdf, other]

doi 10.3847/1538-4357/ac423e

The Importance of Being Interpretable: Toward An Understandable Machine Learning Encoder for Galaxy Cluster Cosmology

Authors: Michelle Ntampaka, Alexey Vikhlinin

Abstract: We present a deep machine learning (ML) approach to constraining cosmological parameters with multi-wavelength observations of galaxy clusters. The ML approach has two components: an encoder that builds a compressed representation of each galaxy cluster and a flexible CNN to estimate the cosmological model from a cluster sample. It is trained and tested on simulated cluster catalogs built from the… ▽ More We present a deep machine learning (ML) approach to constraining cosmological parameters with multi-wavelength observations of galaxy clusters. The ML approach has two components: an encoder that builds a compressed representation of each galaxy cluster and a flexible CNN to estimate the cosmological model from a cluster sample. It is trained and tested on simulated cluster catalogs built from the Magneticum simulations. From the simulated catalogs, the ML method estimates the amplitude of matter fluctuations, sigma_8, at approximately the expected theoretical limit. More importantly, the deep ML approach can be interpreted. We lay out three schemes for interpreting the ML technique: a leave-one-out method for assessing cluster importance, an average saliency for evaluating feature importance, and correlations in the terse layer for understanding whether an ML technique can be safely applied to observational data. These interpretation schemes led to the discovery of a previously unknown self-calibration mode for flux- and volume-limited cluster surveys. We describe this new mode, which uses the amplitude and peak of the cluster mass PDF as anchors for mass calibration. We introduce the term "overspecialized" to describe a common pitfall in astronomical applications of machine learning in which the ML method learns simulation-specific details, and we show how a carefully constructed architecture can be used to check for this source of systematic error. △ Less

Submitted 10 December, 2021; originally announced December 2021.

Comments: Accepted for publication in The Astrophysical Journal

arXiv:2111.14566 [pdf, ps, other]

Building Trustworthy Machine Learning Models for Astronomy

Authors: Michelle Ntampaka, Matthew Ho, Brian Nord

Abstract: Astronomy is entering an era of data-driven discovery, due in part to modern machine learning (ML) techniques enabling powerful new ways to interpret observations. This shift in our scientific approach requires us to consider whether we can trust the black box. Here, we overview methods for an often-overlooked step in the development of ML models: building community trust in the algorithms. Trust… ▽ More Astronomy is entering an era of data-driven discovery, due in part to modern machine learning (ML) techniques enabling powerful new ways to interpret observations. This shift in our scientific approach requires us to consider whether we can trust the black box. Here, we overview methods for an often-overlooked step in the development of ML models: building community trust in the algorithms. Trust is an essential ingredient not just for creating more robust data analysis techniques, but also for building confidence within the astronomy community to embrace machine learning methods and results. △ Less

Submitted 29 November, 2021; originally announced November 2021.

Comments: Prepared for the Astronomical Data Analysis Software and Systems (ADASS) XXXI Proceedings

arXiv:2110.02232 [pdf, other]

doi 10.1093/mnras/stac438

Emulating Sunyaev-Zeldovich Images of Galaxy Clusters using Auto-Encoders

Authors: Tibor Rothschild, Daisuke Nagai, Han Aung, Sheridan B. Green, Michelle Ntampaka, John ZuHone

Abstract: We develop a machine learning algorithm that generates high-resolution thermal Sunyaev-Zeldovich (SZ) maps of novel galaxy clusters given only halo mass and mass accretion rate. The algorithm uses a conditional variational autoencoder (CVAE) in the form of a convolutional neural network and is trained with SZ maps generated from the IllustrisTNG simulation. Our method can reproduce many of the det… ▽ More We develop a machine learning algorithm that generates high-resolution thermal Sunyaev-Zeldovich (SZ) maps of novel galaxy clusters given only halo mass and mass accretion rate. The algorithm uses a conditional variational autoencoder (CVAE) in the form of a convolutional neural network and is trained with SZ maps generated from the IllustrisTNG simulation. Our method can reproduce many of the details of galaxy clusters that analytical models usually lack, such as internal structure and aspherical distribution of gas created by mergers, while achieving the same computational feasibility, allowing us to generate mock SZ maps for over $10^5$ clusters in 30 seconds on a laptop. We show that the model is capable of generating novel clusters (i.e. not found in the training set) and that the model accurately reproduces the effects of mass and mass accretion rate on the SZ images, such as scatter, asymmetry, and concentration, in addition to modeling merging sub-clusters. This work demonstrates the viability of machine-learning--based methods for producing the number of realistic, high-resolution maps of galaxy clusters necessary to achieve statistical constraints from future SZ surveys. △ Less

Submitted 9 February, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

Comments: 13 pages, 12 figures, 1 tables, accepted for publication in MNRAS

arXiv:2008.04921 [pdf, other]

doi 10.3847/1538-4357/abc6fd

SuperRAENN: A Semi-supervised Supernova Photometric Classification Pipeline Trained on Pan-STARRS1 Medium Deep Survey Supernovae

Authors: V. Ashley Villar, Griffin Hosseinzadeh, Edo Berger, Michelle Ntampaka, David O. Jones, Peter Challis, Ryan Chornock, Maria R. Drout, Ryan J. Foley, Robert P. Kirshner, Ragnhild Lunnan, Raffaella Margutti, Dan Milisavljevic, Nathan Sanders, Yen-Chen Pan, Armin Rest, Daniel M. Scolnic, Eugene Magnier, Nigel Metcalfe, Richard Wainscoat, Christopher Waters

Abstract: Automated classification of supernovae (SNe) based on optical photometric light curve information is essential in the upcoming era of wide-field time domain surveys, such as the Legacy Survey of Space and Time (LSST) conducted by the Rubin Observatory. Photometric classification can enable real-time identification of interesting events for extended multi-wavelength follow-up, as well as archival p… ▽ More Automated classification of supernovae (SNe) based on optical photometric light curve information is essential in the upcoming era of wide-field time domain surveys, such as the Legacy Survey of Space and Time (LSST) conducted by the Rubin Observatory. Photometric classification can enable real-time identification of interesting events for extended multi-wavelength follow-up, as well as archival population studies. Here we present the complete sample of 5,243 "SN-like" light curves (in griz) from the Pan-STARRS1 Medium-Deep Survey (PS1-MDS). The PS1-MDS is similar to the planned LSST Wide-Fast-Deep survey in terms of cadence, filters and depth, making this a useful training set for the community. Using this dataset, we train a novel semi-supervised machine learning algorithm to photometrically classify 2,315 new SN-like light curves with host galaxy spectroscopic redshifts. Our algorithm consists of a random forest supervised classification step and a novel unsupervised step in which we introduce a recurrent autoencoder neural network (RAENN). Our final pipeline, dubbed SuperRAENN, has an accuracy of 87% across five SN classes (Type Ia, Ibc, II, IIn, SLSN-I). We find the highest accuracy rates for Type Ia SNe and SLSNe and the lowest for Type Ibc SNe. Our complete spectroscopically- and photometrically-classified samples break down into: 62.0% Type Ia (1839 objects), 19.8% Type II (553 objects), 4.8% Type IIn (136 objects), 11.7% Type Ibc (291 objects), and 1.6% Type I SLSNe (54 objects). Finally, we discuss how this algorithm can be modified for online LSST data streams. △ Less

Submitted 11 August, 2020; originally announced August 2020.

Comments: Submitted to ApJ; Companion paper to Hosseinzadeh et al.; Tables 1 and 2 available on ashleyvillar.com prior to publication

arXiv:2007.05144 [pdf, other]

doi 10.1093/mnras/staa2690

A deep learning view of the census of galaxy clusters in IllustrisTNG

Authors: Y. Su, Y. Zhang, G. Liang, J. A. ZuHone, D. J. Barnes, N. B. Jacobs, M. Ntampaka, W. R. Forman, P. E. J. Nulsen, R. P. Kraft, C. Jones

Abstract: The origin of the diverse population of galaxy clusters remains an unexplained aspect of large-scale structure formation and cluster evolution. We present a novel method of using X-ray images to identify cool core (CC), weak cool core (WCC), and non cool core (NCC) clusters of galaxies, that are defined by their central cooling times. We employ a convolutional neural network, ResNet-18, which is c… ▽ More The origin of the diverse population of galaxy clusters remains an unexplained aspect of large-scale structure formation and cluster evolution. We present a novel method of using X-ray images to identify cool core (CC), weak cool core (WCC), and non cool core (NCC) clusters of galaxies, that are defined by their central cooling times. We employ a convolutional neural network, ResNet-18, which is commonly used for image analysis, to classify clusters. We produce mock Chandra X-ray observations for a sample of 318 massive clusters drawn from the IllustrisTNG simulations. The network is trained and tested with low resolution mock Chandra images covering a central 1 Mpc square for the clusters in our sample. Without any spectral information, the deep learning algorithm is able to identify CC, WCC, and NCC clusters, achieving balanced accuracies (BAcc) of 92%, 81%, and 83%, respectively. The performance is superior to classification by conventional methods using central gas densities, with an average BAcc = 81%, or surface brightness concentrations, giving BAcc = 73%. We use Class Activation Mapping to localize discriminative regions for the classification decision. From this analysis, we observe that the network has utilized regions from cluster centers out to r~300 kpc and r~500 kpc to identify CC and NCC clusters, respectively. It may have recognized features in the intracluster medium that are associated with AGN feedback and disruptive major mergers. △ Less

Submitted 25 August, 2020; v1 submitted 9 July, 2020; originally announced July 2020.

Comments: Accepted for publication in MNRAS

arXiv:1911.02479 [pdf, ps, other]

Algorithms and Statistical Models for Scientific Discovery in the Petabyte Era

Authors: Brian Nord, Andrew J. Connolly, Jamie Kinney, Jeremy Kubica, Gautaum Narayan, Joshua E. G. Peek, Chad Schafer, Erik J. Tollerud, Camille Avestruz, G. Jogesh Babu, Simon Birrer, Douglas Burke, João Caldeira, Douglas A. Caldwell, Joleen K. Carlberg, Yen-Chi Chen, Chuanfei Dong, Eric D. Feigelson, V. Zach Golkhou, Vinay Kashyap, T. S. Li, Thomas Loredo, Luisa Lucie-Smith, Kaisey S. Mandel, J. R. Martínez-Galarza , et al. (13 additional authors not shown)

Abstract: The field of astronomy has arrived at a turning point in terms of size and complexity of both datasets and scientific collaboration. Commensurately, algorithms and statistical models have begun to adapt --- e.g., via the onset of artificial intelligence --- which itself presents new challenges and opportunities for growth. This white paper aims to offer guidance and ideas for how we can evolve our… ▽ More The field of astronomy has arrived at a turning point in terms of size and complexity of both datasets and scientific collaboration. Commensurately, algorithms and statistical models have begun to adapt --- e.g., via the onset of artificial intelligence --- which itself presents new challenges and opportunities for growth. This white paper aims to offer guidance and ideas for how we can evolve our technical and collaborative frameworks to promote efficient algorithmic development and take advantage of opportunities for scientific discovery in the petabyte era. We discuss challenges for discovery in large and complex data sets; challenges and requirements for the next stage of development of statistical methodologies and algorithmic tool sets; how we might change our paradigms of collaboration and education; and the ethical implications of scientists' contributions to widely applicable algorithms and computational modeling. We start with six distinct recommendations that are supported by the commentary following them. This white paper is related to a larger corpus of effort that has taken place within and around the Petabytes to Science Workshops (https://petabytestoscience.github.io/). △ Less

Submitted 4 November, 2019; originally announced November 2019.

Comments: arXiv admin note: substantial text overlap with arXiv:1905.05116

Report number: FERMILAB-FN-1093-A-AE-SCD

arXiv:1909.10527 [pdf, other]

doi 10.3847/1538-4357/ab5f5e

A Hybrid Deep Learning Approach to Cosmological Constraints From Galaxy Redshift Surveys

Authors: Michelle Ntampaka, Daniel J. Eisenstein, Sihan Yuan, Lehman H. Garrison

Abstract: We present a deep machine learning (ML)-based technique for accurately determining $σ_8$ and $Ω_m$ from mock 3D galaxy surveys. The mock surveys are built from the AbacusCosmos suite of $N$-body simulations, which comprises 40 cosmological volume simulations spanning a range of cosmological models, and we account for uncertainties in galaxy formation scenarios through the use of generalized halo o… ▽ More We present a deep machine learning (ML)-based technique for accurately determining $σ_8$ and $Ω_m$ from mock 3D galaxy surveys. The mock surveys are built from the AbacusCosmos suite of $N$-body simulations, which comprises 40 cosmological volume simulations spanning a range of cosmological models, and we account for uncertainties in galaxy formation scenarios through the use of generalized halo occupation distributions (HODs). We explore a trio of ML models: a 3D convolutional neural network (CNN), a power-spectrum-based fully connected network, and a hybrid approach that merges the two to combine physically motivated summary statistics with flexible CNNs. We describe best practices for training a deep model on a suite of matched-phase simulations and we test our model on a completely independent sample that uses previously unseen initial conditions, cosmological parameters, and HOD parameters. Despite the fact that the mock observations are quite small ($\sim0.07h^{-3}\,\mathrm{Gpc}^3$) and the training data span a large parameter space (6 cosmological and 6 HOD parameters), the CNN and hybrid CNN can constrain $σ_8$ and $Ω_m$ to $\sim3\%$ and $\sim4\%$, respectively. △ Less

Submitted 23 September, 2019; originally announced September 2019.

Comments: Submitted to The Astrophysical Journal

arXiv:1908.02765 [pdf, other]

doi 10.3847/1538-4357/ab426f

Using X-Ray Morphological Parameters to Strengthen Galaxy Cluster Mass Estimates via Machine Learning

Authors: Sheridan B. Green, Michelle Ntampaka, Daisuke Nagai, Lorenzo Lovisari, Klaus Dolag, Dominique Eckert, John A. ZuHone

Abstract: We present a machine learning approach for estimating galaxy cluster masses, trained using both Chandra and eROSITA mock X-ray observations of 2,041 clusters from the Magneticum simulations. We train a random forest regressor, an ensemble learning method based on decision tree regression, to predict cluster masses using an input feature set. The feature set uses core-excised X-ray luminosity and a… ▽ More We present a machine learning approach for estimating galaxy cluster masses, trained using both Chandra and eROSITA mock X-ray observations of 2,041 clusters from the Magneticum simulations. We train a random forest regressor, an ensemble learning method based on decision tree regression, to predict cluster masses using an input feature set. The feature set uses core-excised X-ray luminosity and a variety of morphological parameters, including surface brightness concentration, smoothness, asymmetry, power ratios, and ellipticity. The regressor is cross-validated and calibrated on a training sample of 1,615 clusters (80% of sample), and then results are reported as applied to a test sample of 426 clusters (20% of sample). This procedure is performed for two different mock observation series in an effort to bracket the potential enhancement in mass predictions that can be made possible by including dynamical state information. The first series is computed from idealized Chandra-like mock cluster observations, with high spatial resolution, long exposure time (1 Ms), and the absence of background. The second series is computed from realistic-condition eROSITA mocks with lower spatial resolution, short exposures (2 ks), instrument effects, and background photons modeled. We report a 20% reduction in the mass estimation scatter when either series is used in our random forest model compared to a standard regression model that only employs core-excised luminosity. The morphological parameters that hold the highest feature importance are smoothness, asymmetry, and surface brightness concentration. Hence, these parameters, which encode the dynamical state of the cluster, can be used to make more accurate predictions of cluster masses in upcoming surveys, offering a crucial step forward for cosmological analyses. △ Less

Submitted 30 September, 2019; v1 submitted 7 August, 2019; originally announced August 2019.

Comments: 12 pages, 5 figures, 3 tables. Accepted to ApJ

Journal ref: ApJ 884, 33 (2019)

arXiv:1907.01676 [pdf, other]

Astro2020 APC White Paper: The Early Career Perspective on the Coming Decade, Astrophysics Career Paths, and the Decadal Survey Process

Authors: Emily Moravec, Ian Czekala, Kate Follette, Zeeshan Ahmed, Mehmet Alpaslan, Alexandra Amon, Will Armentrout, Giada Arney, Darcy Barron, Eric Bellm, Amy Bender, Joanna Bridge, Knicole Colon, Rahul Datta, Casey DeRoo, Wanda Feng, Michael Florian, Travis Gabriel, Kirsten Hall, Erika Hamden, Nimish Hathi, Keith Hawkins, Keri Hoadley, Rebecca Jensen-Clem, Melodie Kao , et al. (31 additional authors not shown)

Abstract: In response to the need for the Astro2020 Decadal Survey to explicitly engage early career astronomers, the National Academies of Sciences, Engineering, and Medicine hosted the Early Career Astronomer and Astrophysicist Focus Session (ECFS) on October 8-9, 2018 under the auspices of Committee of Astronomy and Astrophysics. The meeting was attended by fifty six pre-tenure faculty, research scientis… ▽ More In response to the need for the Astro2020 Decadal Survey to explicitly engage early career astronomers, the National Academies of Sciences, Engineering, and Medicine hosted the Early Career Astronomer and Astrophysicist Focus Session (ECFS) on October 8-9, 2018 under the auspices of Committee of Astronomy and Astrophysics. The meeting was attended by fifty six pre-tenure faculty, research scientists, postdoctoral scholars, and senior graduate students, as well as eight former decadal survey committee members, who acted as facilitators. The event was designed to educate early career astronomers about the decadal survey process, to solicit their feedback on the role that early career astronomers should play in Astro2020, and to provide a forum for the discussion of a wide range of topics regarding the astrophysics career path. This white paper presents highlights and themes that emerged during two days of discussion. In Section 1, we discuss concerns that emerged regarding the coming decade and the astrophysics career path, as well as specific recommendations from participants regarding how to address them. We have organized these concerns and suggestions into five broad themes. These include (sequentially): (1) adequately training astronomers in the statistical and computational techniques necessary in an era of "big data", (2) responses to the growth of collaborations and telescopes, (3) concerns about the adequacy of graduate and postdoctoral training, (4) the need for improvements in equity and inclusion in astronomy, and (5) smoothing and facilitating transitions between early career stages. Section 2 is focused on ideas regarding the decadal survey itself, including: incorporating early career voices, ensuring diverse input from a variety of stakeholders, and successfully and broadly disseminating the results of the survey. △ Less

Submitted 12 July, 2019; v1 submitted 2 July, 2019; originally announced July 2019.

Comments: 9 pages; Astro2020 APC White Paper: State of the Profession Consideration

arXiv:1906.07729 [pdf, other]

doi 10.3847/1538-4357/ab2a00

Cluster Cosmology with the Velocity Distribution Function of the HeCS-SZ Sample

Authors: Michelle Ntampaka, Ken Rines, Hy Trac

Abstract: We apply the Velocity Distribution Function (VDF) to a sample of Sunyaev-Zel'dovich (SZ)-selected clusters, and we report preliminary cosmological constraints in the $σ_8$-$Ω_m$ cosmological parameter space. The VDF is a forward-modeled test statistic that can be used to constrain cosmological models directly from galaxy cluster dynamical observations. The method was introduced in Ntampaka et al.… ▽ More We apply the Velocity Distribution Function (VDF) to a sample of Sunyaev-Zel'dovich (SZ)-selected clusters, and we report preliminary cosmological constraints in the $σ_8$-$Ω_m$ cosmological parameter space. The VDF is a forward-modeled test statistic that can be used to constrain cosmological models directly from galaxy cluster dynamical observations. The method was introduced in Ntampaka et al. (2017) and employs line-of-sight velocity measurements to directly constrain cosmological parameters; it is less sensitive to measurement error than a standard halo mass function approach. The method is applied to the Hectospec Survey of Sunyaev-Zeldovich-Selected Clusters (HeCS-SZ) sample, which is a spectroscopic follow up of a Planck-selected sample of 83 galaxy clusters. Credible regions are calculated by comparing the VDF of the observed cluster sample to that of mock observations, yielding $\mathcal{S}_8 \equiv σ_8 \left(Ω_m/0.3\right)^{0.25} = 0.751\pm0.037$. These constraints are in tension with the Planck Cosmic Microwave Background (CMB) TT fiducial value, which lies outside of our 95% credible region, but are in agreement with some recent analyses of large scale structure that observe fewer massive clusters than are predicted by the Planck fiducial cosmological parameters. △ Less

Submitted 18 June, 2019; originally announced June 2019.

Comments: Accepted for publication in The Astrophysical Journal

arXiv:1903.06796 [pdf, ps, other]

Astro2020 Science White Paper: The Next Decade of Astroinformatics and Astrostatistics

Authors: A. Siemiginowska, G. Eadie, I. Czekala, E. Feigelson, E. B. Ford, V. Kashyap, M. Kuhn, T. Loredo, M. Ntampaka, A. Stevens, A. Avelino, K. Borne, T. Budavari, B. Burkhart, J. Cisewski-Kehe, F. Civano, I. Chilingarian, D. A. van Dyk, G. Fabbiano, D. P. Finkbeiner, D. Foreman-Mackey, P. Freeman, A. Fruscione, A. A. Goodman, M. Graham , et al. (27 additional authors not shown)

Abstract: Over the past century, major advances in astronomy and astrophysics have been largely driven by improvements in instrumentation and data collection. With the amassing of high quality data from new telescopes, and especially with the advent of deep and large astronomical surveys, it is becoming clear that future advances will also rely heavily on how those data are analyzed and interpreted. New met… ▽ More Over the past century, major advances in astronomy and astrophysics have been largely driven by improvements in instrumentation and data collection. With the amassing of high quality data from new telescopes, and especially with the advent of deep and large astronomical surveys, it is becoming clear that future advances will also rely heavily on how those data are analyzed and interpreted. New methodologies derived from advances in statistics, computer science, and machine learning are beginning to be employed in sophisticated investigations that are not only bringing forth new discoveries, but are placing them on a solid footing. Progress in wide-field sky surveys, interferometric imaging, precision cosmology, exoplanet detection and characterization, and many subfields of stellar, Galactic and extragalactic astronomy, has resulted in complex data analysis challenges that must be solved to perform scientific inference. Research in astrostatistics and astroinformatics will be necessary to develop the state-of-the-art methodology needed in astronomy. Overcoming these challenges requires dedicated, interdisciplinary research. We recommend: (1) increasing funding for interdisciplinary projects in astrostatistics and astroinformatics; (2) dedicating space and time at conferences for interdisciplinary research and promotion; (3) developing sustainable funding for long-term astrostatisics appointments; and (4) funding infrastructure development for data archives and archive support, state-of-the-art algorithms, and efficient computing. △ Less

Submitted 15 March, 2019; originally announced March 2019.

Comments: Submitted to the Astro2020 Decadal Survey call for science white papers

arXiv:1903.06634 [pdf]

Increasing the Discovery Space in Astrophysics - A Collation of Six Submitted White Papers

Authors: G. Fabbiano, M. Elvis, A. Accomazzi, G. B. Berriman, N. Brickhouse, S. Bose, D. Carrera, I. Chilingarian, F. Civano, B. Czerny, R. D'Abrusco, B. Diemer, J. Drake, R. Emami Meibody, J. R. Farah, G. G. Fazio, E. Feigelson, F. Fornasini, Jay Gallagher, J. Grindlay, L. Hernquist, D. J. James, M. Karovska, V. Kashyap, D. -W. Kim , et al. (24 additional authors not shown)

Abstract: We write in response to the call from the 2020 Decadal Survey to submit white papers illustrating the most pressing scientific questions in astrophysics for the coming decade. We propose exploration as the central question for the Decadal Committee's discussions.The history of astronomy shows that paradigm changing discoveries are not driven by well formulated scientific questions, based on the kn… ▽ More We write in response to the call from the 2020 Decadal Survey to submit white papers illustrating the most pressing scientific questions in astrophysics for the coming decade. We propose exploration as the central question for the Decadal Committee's discussions.The history of astronomy shows that paradigm changing discoveries are not driven by well formulated scientific questions, based on the knowledge of the time. They were instead the result of the increase in discovery space fostered by new telescopes and instruments. An additional tool for increasing the discovery space is provided by the analysis and mining of the increasingly larger amount of archival data available to astronomers. Revolutionary observing facilities, and the state of the art astronomy archives needed to support these facilities, will open up the universe to new discovery. Here we focus on exploration for compact objects and multi messenger science. This white paper includes science examples of the power of the discovery approach, encompassing all the areas of astrophysics covered by the 2020 Decadal Survey. △ Less

Submitted 18 March, 2019; v1 submitted 15 March, 2019; originally announced March 2019.

arXiv:1902.10159 [pdf, ps, other]

The Role of Machine Learning in the Next Decade of Cosmology

Authors: Michelle Ntampaka, Camille Avestruz, Steven Boada, Joao Caldeira, Jessi Cisewski-Kehe, Rosanne Di Stefano, Cora Dvorkin, August E. Evrard, Arya Farahi, Doug Finkbeiner, Shy Genel, Alyssa Goodman, Andy Goulding, Shirley Ho, Arthur Kosowsky, Paul La Plante, Francois Lanusse, Michelle Lochner, Rachel Mandelbaum, Daisuke Nagai, Jeffrey A. Newman, Brian Nord, J. E. G. Peek, Austin Peel, Barnabas Poczos , et al. (5 additional authors not shown)

Abstract: In recent years, machine learning (ML) methods have remarkably improved how cosmologists can interpret data. The next decade will bring new opportunities for data-driven cosmological discovery, but will also present new challenges for adopting ML methodologies and understanding the results. ML could transform our field, but this transformation will require the astronomy community to both foster an… ▽ More In recent years, machine learning (ML) methods have remarkably improved how cosmologists can interpret data. The next decade will bring new opportunities for data-driven cosmological discovery, but will also present new challenges for adopting ML methodologies and understanding the results. ML could transform our field, but this transformation will require the astronomy community to both foster and promote interdisciplinary research endeavors. △ Less

Submitted 14 January, 2021; v1 submitted 26 February, 2019; originally announced February 2019.

Comments: Submitted to the Astro2020 call for science white papers

arXiv:1902.05950 [pdf, other]

doi 10.3847/1538-4357/ab4f82

A Robust and Efficient Deep Learning Method for Dynamical Mass Measurements of Galaxy Clusters

Authors: Matthew Ho, Markus Michael Rau, Michelle Ntampaka, Arya Farahi, Hy Trac, Barnabas Poczos

Abstract: We demonstrate the ability of convolutional neural networks (CNNs) to mitigate systematics in the virial scaling relation and produce dynamical mass estimates of galaxy clusters with remarkably low bias and scatter. We present two models, CNN$_\mathrm{1D}$ and CNN$_\mathrm{2D}$, which leverage this deep learning tool to infer cluster masses from distributions of member galaxy dynamics. Our first m… ▽ More We demonstrate the ability of convolutional neural networks (CNNs) to mitigate systematics in the virial scaling relation and produce dynamical mass estimates of galaxy clusters with remarkably low bias and scatter. We present two models, CNN$_\mathrm{1D}$ and CNN$_\mathrm{2D}$, which leverage this deep learning tool to infer cluster masses from distributions of member galaxy dynamics. Our first model, CNN$_\text{1D}$, infers cluster mass directly from the distribution of member galaxy line-of-sight velocities. Our second model, CNN$_\text{2D}$, extends the input space of CNN$_\text{1D}$ to learn on the joint distribution of galaxy line-of-sight velocities and projected radial distances. We train each model as a regression over cluster mass using a labeled catalog of realistic mock cluster observations generated from the MultiDark simulation and UniverseMachine catalog. We then evaluate the performance of each model on an independent set of mock observations selected from the same simulated catalog. The CNN models produce cluster mass predictions with lognormal residuals of scatter as low as $0.132$ dex, greater than a factor of 2 improvement over the classical $M$-$σ$ power-law estimator. Furthermore, the CNN model reduces prediction scatter relative to similar machine learning approaches by up to $17\%$ while executing in drastically shorter training and evaluation times (by a factor of 30) and producing considerably more robust mass predictions (improving prediction stability under variations in galaxy sampling rate by $30\%$). △ Less

Submitted 22 December, 2020; v1 submitted 15 February, 2019; originally announced February 2019.

Comments: 22 pages, 10 figures, 4 tables, accepted for publication at ApJ

Journal ref: 2019 ApJ, 887, 25

arXiv:1810.08211 [pdf, other]

doi 10.3847/1538-4357/ab2983

Machine Learning Applied to the Reionization History of the Universe in the 21 cm Signal

Authors: Paul La Plante, Michelle Ntampaka

Abstract: The Epoch of Reionization (EoR) features a rich interplay between the first luminous sources and the low-density gas of the intergalactic medium (IGM), where photons from these sources ionize the IGM. There are currently few observational constraints on key observables related to the EoR, such as the midpoint and duration of reionization. Although upcoming observations of the 21 cm power spectrum… ▽ More The Epoch of Reionization (EoR) features a rich interplay between the first luminous sources and the low-density gas of the intergalactic medium (IGM), where photons from these sources ionize the IGM. There are currently few observational constraints on key observables related to the EoR, such as the midpoint and duration of reionization. Although upcoming observations of the 21 cm power spectrum with next-generation radio interferometers such as the Hydrogen Epoch of Reionization Array (HERA) and the Square Kilometre Array (SKA) are expected to provide information about the midpoint of reionization readily, extracting the duration from the power spectrum alone is a more difficult proposition. As an alternative method for extracting information about reionization, we present an application of convolutional neural networks (CNNs) to images of reionization. These images are two-dimensional in the plane of the sky, and extracted at a series of redshift values to generate "image cubes" that are qualitatively similar to those of the HERA and the SKA will generate in the near future. Additionally, we include the impact that the bright foreground signal from the the Milky Way imparts on such image cubes from interferometers, but do not include the noise induced from observations. We show that we are able to recover the duration of reionization $Δ$z to within 5% using CNNs, assuming that the midpoint of reionization is already relatively well constrained. These results have exciting impacts for estimating $τ$, the optical depth to the cosmic microwave background, which can help constrain other cosmological parameters. △ Less

Submitted 5 August, 2019; v1 submitted 18 October, 2018; originally announced October 2018.

Comments: 11 pages, 7 figures, published at ApJ

Journal ref: ApJ 810:110 (9pp), 2019 August 1

arXiv:1810.07703 [pdf, other]

doi 10.3847/1538-4357/ab14eb

A Deep Learning Approach to Galaxy Cluster X-ray Masses

Authors: M. Ntampaka, J. ZuHone, D. Eisenstein, D. Nagai, A. Vikhlinin, L. Hernquist, F. Marinacci, D. Nelson, R. Pakmor, A. Pillepich, P. Torrey, M. Vogelsberger

Abstract: We present a machine-learning approach for estimating galaxy cluster masses from Chandra mock images. We utilize a Convolutional Neural Network (CNN), a deep machine learning tool commonly used in image recognition tasks. The CNN is trained and tested on our sample of 7,896 Chandra X-ray mock observations, which are based on 329 massive clusters from the IllustrisTNG simulation. Our CNN learns fro… ▽ More We present a machine-learning approach for estimating galaxy cluster masses from Chandra mock images. We utilize a Convolutional Neural Network (CNN), a deep machine learning tool commonly used in image recognition tasks. The CNN is trained and tested on our sample of 7,896 Chandra X-ray mock observations, which are based on 329 massive clusters from the IllustrisTNG simulation. Our CNN learns from a low resolution spatial distribution of photon counts and does not use spectral information. Despite our simplifying assumption to neglect spectral information, the resulting mass values estimated by the CNN exhibit small bias in comparison to the true masses of the simulated clusters (-0.02 dex) and reproduce the cluster masses with low intrinsic scatter, 8% in our best fold and 12% averaging over all. In contrast, a more standard core-excised luminosity method achieves 15-18% scatter. We interpret the results with an approach inspired by Google DeepDream and find that the CNN ignores the central regions of clusters, which are known to have high scatter with mass. △ Less

Submitted 18 June, 2019; v1 submitted 17 October, 2018; originally announced October 2018.

Comments: 10 pages, 6 figures, accepted for publication in The Astrophysical Journal

arXiv:1602.01837 [pdf, other]

doi 10.3847/1538-4357/835/1/106

The Velocity Distribution Function of Galaxy Clusters as a Cosmological Probe

Authors: M. Ntampaka, H. Trac, J. Cisewski, L. C. Price

Abstract: We present a new approach for quantifying the abundance of galaxy clusters and constraining cosmological parameters using dynamical measurements. In the standard method, galaxy line-of-sight (LOS) velocities, $v$, or velocity dispersions are used to infer cluster masses, $M$, in order to quantify the halo mass function (HMF), $dn(M)/d\log(M)$, which is strongly affected by mass measurement errors.… ▽ More We present a new approach for quantifying the abundance of galaxy clusters and constraining cosmological parameters using dynamical measurements. In the standard method, galaxy line-of-sight (LOS) velocities, $v$, or velocity dispersions are used to infer cluster masses, $M$, in order to quantify the halo mass function (HMF), $dn(M)/d\log(M)$, which is strongly affected by mass measurement errors. In our new method, the probability distribution of velocities for each cluster in the sample are summed to create a new statistic called the velocity distribution function (VDF), $dn(v)/dv$. The VDF can be measured more directly and precisely than the HMF and it can also be robustly predicted with cosmological simulations which capture the dynamics of subhalos or galaxies. We apply these two methods to mock cluster catalogs and forecast the bias and constraints on the matter density parameter $Ω_m$ and the amplitude of matter fluctuations $σ_8$ in flat $Λ$CDM cosmologies. For an example observation of 200 massive clusters, the VDF with (without) velocity errors constrains the parameter combination $σ_8Ω_m^{0.29\ (0.29)} = 0.587 \pm 0.011\ (0.583 \pm 0.011)$ and shows only minor bias. However, the HMF with dynamical mass errors is biased to low $Ω_m$ and high $σ_8$ and the fiducial model lies well outside of the forecast constraints, prior to accounting for Eddington bias. When the VDF is combined with constraints from the cosmic microwave background (CMB), the degeneracy between cosmological parameters can be significantly reduced. Upcoming spectroscopic surveys that probe larger volumes and fainter magnitudes will provide a larger number of clusters for applying the VDF as a cosmological probe. △ Less

Submitted 20 October, 2016; v1 submitted 4 February, 2016; originally announced February 2016.

Comments: 10 pages, 4 figures, accepted for publication at ApJ

arXiv:1509.05409 [pdf, other]

doi 10.3847/0004-637X/831/2/135

Dynamical Mass Measurements of Contaminated Galaxy Clusters Using Machine Learning

Authors: M. Ntampaka, H. Trac, D. J. Sutherland, S. Fromenteau, B. Poczos, J. Schneider

Abstract: We study dynamical mass measurements of galaxy clusters contaminated by interlopers and show that a modern machine learning (ML) algorithm can predict masses by better than a factor of two compared to a standard scaling relation approach. We create two mock catalogs from Multidark's publicly available $N$-body MDPL1 simulation, one with perfect galaxy cluster membership information and the other w… ▽ More We study dynamical mass measurements of galaxy clusters contaminated by interlopers and show that a modern machine learning (ML) algorithm can predict masses by better than a factor of two compared to a standard scaling relation approach. We create two mock catalogs from Multidark's publicly available $N$-body MDPL1 simulation, one with perfect galaxy cluster membership information and the other where a simple cylindrical cut around the cluster center allows interlopers to contaminate the clusters. In the standard approach, we use a power-law scaling relation to infer cluster mass from galaxy line-of-sight (LOS) velocity dispersion. Assuming perfect membership knowledge, this unrealistic case produces a wide fractional mass error distribution, with a width of $Δε\approx0.87$. Interlopers introduce additional scatter, significantly widening the error distribution further ($Δε\approx2.13$). We employ the support distribution machine (SDM) class of algorithms to learn from distributions of data to predict single values. Applied to distributions of galaxy observables such as LOS velocity and projected distance from the cluster center, SDM yields better than a factor-of-two improvement ($Δε\approx0.67$) for the contaminated case. Remarkably, SDM applied to contaminated clusters is better able to recover masses than even the scaling relation approach applied to uncontaminated clusters. We show that the SDM method more accurately reproduces the cluster mass function, making it a valuable tool for employing cluster observations to evaluate cosmological models. △ Less

Submitted 25 October, 2016; v1 submitted 17 September, 2015; originally announced September 2015.

Comments: 18 pages, 12 figures, accepted for publication at ApJ

arXiv:1410.0686 [pdf, other]

doi 10.1088/0004-637X/803/2/50

A Machine Learning Approach for Dynamical Mass Measurements of Galaxy Clusters

Authors: Michelle Ntampaka, Hy Trac, Danica J. Sutherland, Nicholas Battaglia, Barnabas Poczos, Jeff Schneider

Abstract: We present a modern machine learning approach for cluster dynamical mass measurements that is a factor of two improvement over using a conventional scaling relation. Different methods are tested against a mock cluster catalog constructed using halos with mass >= 10^14 Msolar/h from Multidark's publicly-available N-body MDPL halo catalog. In the conventional method, we use a standard M(sigma_v) pow… ▽ More We present a modern machine learning approach for cluster dynamical mass measurements that is a factor of two improvement over using a conventional scaling relation. Different methods are tested against a mock cluster catalog constructed using halos with mass >= 10^14 Msolar/h from Multidark's publicly-available N-body MDPL halo catalog. In the conventional method, we use a standard M(sigma_v) power law scaling relation to infer cluster mass, M, from line-of-sight (LOS) galaxy velocity dispersion, sigma_v. The resulting fractional mass error distribution is broad, with width=0.87 (68% scatter), and has extended high-error tails. The standard scaling relation can be simply enhanced by including higher-order moments of the LOS velocity distribution. Applying the kurtosis as a correction term to log(sigma_v) reduces the width of the error distribution to 0.74 (16% improvement). Machine learning can be used to take full advantage of all the information in the velocity distribution. We employ the Support Distribution Machines (SDMs) algorithm that learns from distributions of data to predict single values. SDMs trained and tested on the distribution of LOS velocities yield width=0.46 (47% improvement). Furthermore, the problematic tails of the mass error distribution are effectively eliminated. Decreasing cluster mass errors will improve measurements of the growth of structure and lead to tighter constraints on cosmological parameters. △ Less

Submitted 14 January, 2021; v1 submitted 2 October, 2014; originally announced October 2014.

Comments: Published in The Astrophysical Journal, 13 pages, 8 figures. Support Distribution Machines is publicly available at https://github.com/djsutherland/py-sdm

arXiv:1303.1055 [pdf, ps, other]

doi 10.1088/0004-637X/772/2/147

A First Look at creating mock catalogs with machine learning techniques

Authors: Xiaoying Xu, Shirley Ho, Hy Trac, Jeff Schneider, Barnabas Poczos, Michelle Ntampaka

Abstract: We investigate machine learning (ML) techniques for predicting the number of galaxies (N_gal) that occupy a halo, given the halo's properties. These types of mappings are crucial for constructing the mock galaxy catalogs necessary for analyses of large-scale structure. The ML techniques proposed here distinguish themselves from traditional halo occupation distribution (HOD) modeling as they do not… ▽ More We investigate machine learning (ML) techniques for predicting the number of galaxies (N_gal) that occupy a halo, given the halo's properties. These types of mappings are crucial for constructing the mock galaxy catalogs necessary for analyses of large-scale structure. The ML techniques proposed here distinguish themselves from traditional halo occupation distribution (HOD) modeling as they do not assume a prescribed relationship between halo properties and N_gal. In addition, our ML approaches are only dependent on parent halo properties (like HOD methods), which are advantageous over subhalo-based approaches as identifying subhalos correctly is difficult. We test 2 algorithms: support vector machines (SVM) and k-nearest-neighbour (kNN) regression. We take galaxies and halos from the Millennium simulation and predict N_gal by training our algorithms on the following 6 halo properties: number of particles, M_200, σ_v, v_max, half-mass radius and spin. For Millennium, our predicted N_gal values have a mean-squared-error (MSE) of ~0.16 for both SVM and kNN. Our predictions match the overall distribution of halos reasonably well and the galaxy correlation function at large scales to ~5-10%. In addition, we demonstrate a feature selection algorithm to isolate the halo parameters that are most predictive, a useful technique for understanding the mapping between halo properties and N_gal. Lastly, we investigate these ML-based approaches in making mock catalogs for different galaxy subpopulations (e.g. blue, red, high M_star, low M_star). Given its non-parametric nature as well as its powerful predictive and feature selection capabilities, machine learning offers an interesting alternative for creating mock catalogs. △ Less

Submitted 5 March, 2013; originally announced March 2013.

Comments: 11 pages, 6 figures

Showing 1–30 of 30 results for author: Ntampaka, M