Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

A publishing partnership

A Comprehensive Study of Lyα Emission in the High-redshift Galaxy Population

, , , , and

Published 2017 July 13 © 2017. The American Astronomical Society. All rights reserved.
, , Citation Grecco A. Oyarzún et al 2017 ApJ 843 133 DOI 10.3847/1538-4357/aa7552

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/843/2/133

Abstract

We present an exhaustive census of Lyman alpha (Lyα) emission in the general galaxy population at $3\lt z\lt 4.6$. We use the Michigan/Magellan Fiber System (M2FS) spectrograph to study a stellar mass (M${}_{* }$) selected sample of 625 galaxies homogeneously distributed in the range $7.6\lt \mathrm{log}\,{M}_{* }/{M}_{\odot }\lt 10.6$. Our sample is selected from the 3D-HST/CANDELS survey, which provides the complementary data to estimate Lyα equivalent widths (${W}_{\mathrm{Ly}\alpha }$) and escape fractions (fesc) for our galaxies. We find both quantities to anti-correlate with M*, star formation rate (SFR), UV luminosity, and UV slope (β). We then model the ${W}_{\mathrm{Ly}\alpha }$ distribution as a function of MUV and β using a Bayesian approach. Based on our model and matching the properties of typical Lyman break galaxy (LBG) selections, we conclude that the ${W}_{\mathrm{Ly}\alpha }$ distribution in such samples is heavily dependent on the limiting MUV of the survey. Regarding narrowband surveys, we find their ${W}_{\mathrm{Ly}\alpha }$ selections to bias samples toward low M*, while their line-flux limitations preferentially leave out low-SFR galaxies. We can also use our model to predict the fraction of Lyα-emitting LBGs at 4 ≤ z ≤ 7. We show that reported drops in the Lyα fraction at z ≥ 6, usually attributed to the rapidly increasing neutral gas fraction of the universe, can also be explained by survey MUV incompleteness. This result does not dismiss reionization occurring at z ∼ 7, but highlights that current data is not inconsistent with this process taking place at z > 7.

Export citation and abstract BibTeX RIS

1. Introduction

In hand with observational progress, the understanding of high-redshift galaxies has progressed immensely in the last few decades. Almost 20 years ago, the only efficient way of observing these galaxies was by detecting breaks in their spectra with broadband photometry. For example, the two main classes of rest-frame UV-selected galaxies are Lyman Break Galaxies (LBGs; Steidel et al. 1996, 1999, 2003; Shapley et al. 2003; Stark et al. 2009; Bouwens et al. 2015) and Lyα Emitters (LAEs; Cowie et al. 1996; Cowie & Hu 1998; Hu et al. 1998; Kashikawa et al. 2006; Shimasaku et al. 2006; Gronwall et al. 2007; Ouchi et al. 2008; Blanc et al. 2011; Sobral et al. 2015). While the former are observed by their Lyman break at 912 Å, the latter are detected by their Lyman alpha (Lyα) emission at 1216 Å, which is produced in star-forming regions and is subject to resonant scattering in the neutral hydrogen of the ISM. After being, at first, used as a galaxy detection method, Lyα emission from galaxies is now widely used to derive a wide variety of information about the physical properties of galaxies and the universe as a whole.

A lot of effort has been dedicated to study the process of Lyα emission within galaxies. Radiative transfer analysis focuses on the dependence of Lyα escape on ISM kinematics, clumpiness, and outflows (Dijkstra & Kramer 2012; Duval et al. 2014; Rivera-Thorsen et al. 2015; Gronke & Dijkstra 2016; Gronke et al. 2016). These studies yield elementary insights that, when combined with observations of the Lyα escape fraction (fesc; Hayes et al. 2011; Ciardullo et al. 2014), provide a coherent picture for Lyα emission. Still, despite the complex modeling required to explain this particular type of radiation at small scales, Lyα is actively used as a technique for solving cosmological scale paradigms. Among other things, Lyα emission is being used to study the properties of neutral hydrogen in the interstellar, circumgalactic, and intergalactic mediums. For instance, the fraction of LAEs as a function of redshift is used as a proxy for the fraction of neutral gas in the IGM (Stark et al. 2011; Ono et al. 2012; Tilvi et al. 2014). If successful, this insight can be used to trace the epoch of reionization, which we have been able to constrain using QSO sightlines (Fan et al. 2006) and cosmic microwave background (CMB) measurements (Bennett et al. 2013; Planck Collaboration et al. 2016a, 2016b). Moreover, with the hope of tracing the sources responsible for reionization, Lyα emission is also conveniently used for characterizing luminosity functions (LFs) at high redshift (Ouchi et al. 2008; Sobral et al. 2009; Dressler et al. 2015; Santos et al. 2016). Nevertheless, the cosmological questions that we are trying to answer by means of this complex emission are not restricted to the reionization of the universe. For instance, Lyα emission can be used to trace the cosmic web (Gould & Weinberg 1996; Cantalupo et al. 2005, 2012, 2014; Kollmeier et al. 2010) and young, metal-poor stellar populations in the early universe (Sobral et al. 2015). Even more, Lyα emission will also be used to study dark energy through baryonic acoustic oscillations in the clustering of LAEs (Adams et al. 2011; Blanc et al. 2011).

As evidenced by small-scale studies, Lyα emission is highly stochastic and complex. The observed equivalent width (${W}_{\mathrm{Ly}\alpha }$) of Lyα lines is highly dependent on neutral gas opacity and kinematics, clumpiness of the ISM, dust distribution, and, therefore, line of sight toward the observer. Furthermore, typical Lyα emitters are extreme objects in terms of their luminosity, metallicity, and emission line diagnostics (Trainor et al. 2016). As a consequence, if we are to use Lyα emission as a tracer, a careful and thorough approach is required. However, limitations in survey design make such an approach difficult. For instance, spectroscopic studies of UV continuum detected galaxies, typically LBGs (e.g., Shapley et al. 2003; Stark et al. 2010; Ono et al. 2012), reject galaxies with no significant Lyman break and/or red rest-frame UV spectral energy distributions (SEDs), i.e., these studies do not account for passive and heavily extincted galaxies. Observationally, this technique is limited by the MUV sensitivity of the survey, which translates to incompleteness at low SFR. On the other hand, samples where galaxies are detected directly in Lyα using narrowband imaging (e.g., Gronwall et al. 2007; Ouchi et al. 2008) or blind spectroscopy (e.g., Blanc et al. 2011; Dressler et al. 2011) select on Lyα equivalent width (W${}_{\mathrm{Ly}\alpha }$) and emission line flux. Such methodology does not require continuum selection, which allows for line detection in faint objects. However, follow-up rarely allows for more than spectroscopic confirmation of the line, limiting the use of this technique to the measurement of line fluxes, ${W}_{\mathrm{Ly}\alpha }$, and emission profiles. Even more, such surveys can have non-negligible amounts of line interlopers (Sobral et al. 2017).

Comprehensive studies of Lyα emission in the overall galaxy population are, therefore, of crucial importance. Correlations between galaxy properties and emission are the statistical manifestation of the radiative escape of Lyα photons. Properties such as stellar mass (M${}_{* }$), star formation rate (SFR), UV slope (β), and merger rate reveal information of how the escape of Lyα radiation is affected by stellar population ages, gas fraction, dust content, and ISM turbulence, respectively. Acknowledgment of such relations is required to draw conclusions from high-redshift Lyα surveys. Oyarzún et al. (2016) show how the normalization and e-folding scale of the ${W}_{\mathrm{Ly}\alpha }$ distribution of 3 < z < 4.6 galaxies anti-correlate with M*. In other words, higher M* objects typically have lower ${W}_{\mathrm{Ly}\alpha }$. More massive galaxies typically have lower gas fractions, but higher gas mass (Tacconi et al. 2013; Song et al. 2014; Genzel et al. 2015). More neutral gas contributes to the increase in the scatter of Lyα photons, spreading this radiation throughout the galaxy and toward the circumgalactic medium. Higher M* galaxies also have more dust extinction (Franx et al. 2003; Daddi et al. 2004, 2007; Förster Schreiber et al. 2004; Pérez-González et al. 2008), which leads to more Lyα photon absorption. The escape fraction of this radiation is, therefore, severely affected by the stellar mass of galaxies. These results even explain some discrepancies between LBG and narrowband studies of Lyα emission that originated in sample selection effects. This example highlights the need for proper assessment.

In this work, we present a comprehensive analysis of Lyα emission in the 3 < z < 4.6 galaxy population. To this end, we further analyze the data introduced by Oyarzún et al. (2016). This sample is composed of 625 galaxies in the M* range $7.6\lt \mathrm{log}({M}_{* }[{M}_{\odot }])\lt 10.6$ from the 3D-HST/CANDELS survey (Grogin et al. 2011; Koekemoer et al. 2011). Our work is based on spectroscopic observations of the sample using the Michigan/Magellan Fiber System (M2FS; Mateo et al. 2012), allowing us to measure Lyα fluxes and use 3D-HST/CANDELS ancillary data. In particular, we study the Lyα emission dependence on the M*, SFR, UV luminosity, and dust extinction of galaxies. To do so, we introduce a Bayesian approach to properly compare ${W}_{\mathrm{Ly}\alpha }$ distribution models, account for incompleteness, and quantify the significance of observed correlations. We also use our results to simulate high-redshift Lyα emission surveys, allowing us to infer their biases and imprints on the resulting samples. We finally use the correlations we recover to predict the fraction of LAEs as a function of redshift. This semi-analytic approach provides a baseline for comparing observational drops in the fraction of LAEs at high redshift. This work is structured as follows. In Section 2, we describe our sample and data set. In Section 3, we describe our line detection and measurement methodologies. We explain our Bayesian ${W}_{\mathrm{Ly}\alpha }$ distribution analysis in Section 4. We show our results on the ${W}_{\mathrm{Ly}\alpha }$ distribution dependence on different properties and selection techniques in Sections 5 and 6. We state in Section 7 our inferences on higher redshift Lyα emission. We present our conclusions in Section 8. Throughout this work, all magnitudes are in the AB system (Oke & Gunn 1983). A CDM cosmology with H0 = 70 km s−1 Mpc−1, ${{\rm{\Omega }}}_{m}=0.3$, and ${{\rm{\Omega }}}_{{\rm{\Lambda }}}=0.7$ was assumed whenever needed.

2. Data Set

2.1. Sample Selection

Our sample was initially composed of 629 galaxies in the COSMOS, GOODS-S, and UDS fields. Every object was observed under the 3D-HST/CANDELS program, providing HST and Spitzer photometry from 3800 Å to 7.9 μm (9 bands for COSMOS, 14 for GOODS-S, and 9 for UDS). We construct our sample using 3D-HST outputs (Skelton et al. 2014). According to these, our 629 photometric redshifts satisfy $3.25\lt {z}_{3{\rm{D}} \mbox{-} {\text{}}{\rm{HST}}}\lt 4.25$ and have a 95% probability of 2.9 < z < 4.25. Every galaxy also complies with a photometric redshift reliability parameter ${Q}_{z}\leqslant 3$ selection to remove catastrophic outliers (Brammer et al. 2008). In terms of M*, our galaxies are homogeneously distributed in the range $8\lt \mathrm{log}{({M}_{* }[{M}_{\odot }])}_{3{\rm{D}} \mbox{-} {\text{}}{\rm{HST}}}\lt 10.4$. These 3D-HST output values are based on the Bruzual & Charlot (2003) stellar population synthesis model library with a Chabrier (2003) IMF and solar metallicity. Exponentially declining star formation histories (SFHs) with a minimum e-folding time of ${\mathrm{log}}_{10}(\tau /\mathrm{yr})=7$ and a Calzetti et al. (2000) dust attenuation law were also assumed for the calculation (Skelton et al. 2014). We further emphasize that sample selection was performed based on the values of ${z}_{3{\rm{D}} \mbox{-} {\text{}}{\rm{HST}}}$ and M* detailed in this section, whereas all further analysis in this paper is based on our own estimates (Section 2.3).

2.2. Data

Spectroscopy of the full sample was conducted at the Magellan Clay 6.5 m telescope during 2014 December and 2015 February. To this end, we used the M2FS, a multi-object fiber-fed spectrograph. The 1farcs2 fibers of this instrument allow the observation of 256 targets within 30 arcmin in a single exposure. We are then able to observe 210 targets in each field simultaneously, while using 40 fibers for sky apertures and five for calibration stars. We used this spectrograph in LoRes mode, which features an expected resolution of R = 2000 and a continuum sensitivity of V = 24 with S/N = 5 in 2 hr (Mateo et al. 2012). The final data set consists of six exposure hours on each of the three fields with an average seeing of 0farcs6.

For data reduction, we developed a custom M2FS pipeline. This routine features standard bias subtraction and dark correction. For wavelength calibration, we use HgArNe lamps observed on each night. The wavelength solution is obtained separately for each fiber, with a typical rms uncertainty of 0.03 Å. We further correct the solution for each fiber using the sky lines in the science spectra. For flat-fielding, we use sky-flats obtained during either twilight or dawn. The correction is calculated separately for each fiber, and features illumination, fiber profile, and a correction to account for the shifting of fiber spectra on the detector due to thermal effects. Despite the fact that we are not using dome flats, we estimate the uncertainties in our flat-fielding to be <5%.

Sky subtraction is performed using the 40 sky fibers homogeneously distributed over the field of view and across the detector, where fibers are grouped in blocks. We find the sky solution to be more dependent on fiber location in the CCD than in the sky, leading us to perform sky subtraction separately for each frame fiber block. The solution is computed using a non-parametric spline fit, yielding satisfactory sky subtraction for most sky lines. For the bright sky lines that we are not able to properly subtract, we build emission line masks. For consistency, we use the same mask for all spectra, except for particularly noisy fibers. In those rare cases (11), we mask broad sections of the spectrum. Due to fiber malfunction, we could not obtain spectra for 4 of the 629 targets, leaving our final sample in 625 objects.

Flux calibration is performed using five ${M}_{V}=19\mbox{--}22$ calibration stars on each exposure. Due to atmospheric differential refraction (ADR) comparable to the 1farcs2 fiber size, we need to recover the intrinsic spectrum of each star. To do so, we fit stellar templates from the Pickles library (Pickles 1998) to the continuum-normalized instrumental spectra of the calibration stars. We then use the five stars on each exposure to obtain an average sensitivity curve. We estimate the rms uncertainty of our method to be about ∼15%. After this calibration, we correct the fluxes in our spectra for Galactic extinction. Samples of sky-subtracted spectra are shown in Figure 1.

Figure 1. Refer to the following caption and surrounding text.

Figure 1. Reduced and continuum subtracted spectra of six Lyα lines (black) from our study. These examples show the wide range of profiles and galaxy properties encompassed by our 120 detections (S/N ≥ 5.5). We also include in these plots the line profiles we fit to each case (red, dashed). The purpose of profile fitting in this work is limited to line-flux measurements.

Standard image High-resolution image

Considering the three fields, the science area we survey is ∼550 arcmin2. The resulting spectral FWHM line resolution is ∼2 Å, and we reach a 1σ continuum flux density limit of $\sim 4\times {10}^{-19}$ erg s−1 cm−2 Å−1 per pixel in our 6 hr of exposure. We estimate a 5σ emission line-flux sensitivity of $\sim 8\times {10}^{-18}$ erg s−1 cm−2 in our final spectra (see Figure 1). We identify 120 Lyα emission lines with S/N ≥ 5.5 in our data (details in Section 3.1). Given that we have 625 observed targets, we are then recovering a Lyα-emitting galaxy with S/N ≥ 5.5 every ∼5 objects.

2.3. Sample Properties

As stated by Skelton et al. (2014), 3D-HST outputs for these galaxies were obtained using FAST (Kriek et al. 2009). These calculations assume exponentially declining SFHs with a minimum e-folding time of ${\mathrm{log}}_{10}(\tau /\mathrm{yr})=7$ (Skelton et al. 2014). However, since recent studies suggest that it is more adequate to reproduce high-redshift observations with constant SFHs (cSFHs; González et al. 2014), or even rising SFHs (Maraston et al. 2010), we perform our own executions of FAST assuming cSFHs. For a detailed discussion on this topic, we refer the reader to Conroy (2013). Our FAST executions also adopt the Bruzual & Charlot (2003) stellar population synthesis model library, a Chabrier (2003) IMF, and solar metallicity, similarly to Skelton et al. (2014). We do not account for nebular emission lines in our SED fitting, which, in principle, can overestimate our reported M* by a factor of up to 4 (Atek et al. 2011; Conroy 2013; Stark et al. 2013) or even ∼7 for strong emitters (de Barros et al. 2014). However, our galaxies are at z ∼ 4. At this redshift, the overestimate is within a factor of ≲1.1 (Stark et al. 2013; Salmon et al. 2015), since the 4.5 μm Spitzer/IRAC band is unaffected.7 Our FAST outputs also include extinction-corrected SFRs. The SFRs are derived from the SED, and the extinction correction assumes a Calzetti et al. (2000) attenuation law with ${R}_{V}=4.05$.

We are also required to run EAZY (Brammer et al. 2008) on the photometry from CANDELS/IRAC, since FAST requires redshifts as input. The executions of EAZY yield a most probable redshift zEAZY. Our 625 objects satisfy $3\lt {z}_{\mathrm{EAZY}}\lt 4.25$, with a median uncertainty ${\sigma }_{\mathrm{EAZY}}=0.1$ (Figure 2). EAZY outputs also include 2σ constraints, which for our sample are limited to 2.95 < z < 4.5. Our FAST executions yield a mass coverage of $7.6\lt \mathrm{log}({M}_{* }[{M}_{\odot }])\lt 10.6$ (Figure 3), with a characteristic uncertainty of $\mathrm{log}({M}_{* }[{M}_{\odot }])\sim 0.3$. Most of our SFR values are in the range $1\mbox{--}100\,{M}_{\odot }\,{\mathrm{yr}}^{-1}$ (further analysis and plots below). From now on, we use the FAST outputs based on the spectroscopic redshifts (${z}_{\mathrm{Ly}\alpha }$) for the 120 detections (S/N ≥ 5.5; Section 3.1) and the outputs based on zEAZY for the 505 non-detections.

Figure 2. Refer to the following caption and surrounding text.

Figure 2. Measured spectroscopic Lyα redshifts (${z}_{\mathrm{Ly}\alpha }$) as a function of the corresponding photometric counterparts (zEAZY) for the 120 Lyα detections (S/N ≥ 5.5). Low-, medium-, and high-${W}_{\mathrm{Ly}\alpha }$ galaxies are shown as cyan circles, blue diamonds, and magenta pentagons, respectively. We find a median deviation of ${\rm{\Delta }}z={z}_{\mathrm{Ly}\alpha }\mbox{--}{z}_{\mathrm{EAZY}}=0.24$ from the 1:1 relation (dashed). As revealed by this figure, we find the deviation to correlate with ${W}_{\mathrm{Ly}\alpha }$. Hence, we associate this deviation with photometric redshift fitting biases when a strong Lyα emission line is present.

Standard image High-resolution image
Figure 3. Refer to the following caption and surrounding text.

Figure 3. Redshift−M${}_{* }$ distribution of the galaxies composing our sample, according to our executions of EAZY and FAST (Section 2.3). We divide our sample into three M* bins using as boundaries log(${M}_{* }[{M}_{\odot }]$) = 8.8, 9.6. We find 120 Lyα detections (${\rm{S}}/{\rm{N}}\geqslant 5.5$), of which 60 are low M* (blue circles), 40 are medium M* (green diamonds), and 20 are high M* (red pentagons). Non-detections are shown as gray points. For detections, we plot (${z}_{\mathrm{Ly}\alpha }$, ${M}_{* }({z}_{\mathrm{Ly}\alpha })$), whereas for non-detections we use (zEAZY, ${M}_{* }({z}_{\mathrm{EAZY}})$).

Standard image High-resolution image

2.4. Considerations

The flux calibration procedure performed in our spectra is based on using stars; therefore, it corrects for a ∼32% fiber flux loss, which corresponds to a point-source Lyα surface brightness distribution. In cases of extended Lyα emission halos (Steidel et al. 2011; Matsuda et al. 2012; Feldmeier et al. 2013; Hayes et al. 2013; Momose et al. 2014; Caminha et al. 2016; Patrício et al. 2016; Wisotzki et al. 2016), the fluxes we derive are mostly associated with the galaxies themselves and the inner parts of their Lyα halos. Furthermore, Lyα emission from a galaxy can show significant misalignment with the UV continuum (Rauch et al. 2011), although such cases are not the norm (Finkelstein et al. 2011; Jiang et al. 2013a). At the cosmic time of our sample, the fiber diameter corresponds to a scale of ∼8 kpc, roughly a factor of four larger than the typical effective diameter of galaxies (Bond et al. 2012; Law et al. 2012).

We find a median redshift offset of ${\rm{\Delta }}z={z}_{\mathrm{Ly}\alpha }\mbox{--}{z}_{\mathrm{EAZY}}\,=0.24$ in our detections (see Figure 2). We show in Oyarzún et al. (2016) that this offset does not have any noticeable effects on our sample dependence on M*, which is the primary selection criterion for our galaxies. A very similar offset has also been found in the MUSE-Wide Survey (Herenz et al. 2017). We find the offset to correlate with ${W}_{\mathrm{Ly}\alpha }$ (see Figure 2), hinting at biases in photometric redshift fitting when a strong Lyα emission line is present. A plausible scenario is one in which the Lyman break from the template is fitted slightly blueshifted to account for the flux excess in the redder band. Thorough simulations beyond the scope of this work are required to explore the causes behind this bias.

We stress that the 3D-HST/CANDELS mass incompleteness is restricted to $\mathrm{log}({M}_{* }[{M}_{\odot }])\lt 8.5$ at z ∼ 4 (at least for GOODS-S; Duncan et al. 2014), which corresponds to about one-quarter of the sample. We want to stress that this is a homogeneously M*-selected sample, designed to study Lyα emission statistics dependence on galaxy properties. As a consequence, it is by no means representative of the M* or Lyα LFs at 3 < z < 4.6. This must be taken into account when comparing this sample to analogues directly drawn from the galaxy population (i.e., LBGs or narrowband samples).

3. Lyα Measurements

3.1. Line Detection

For line detection, we use an automated maximum likelihood fitting routine after continuum subtraction. We assume intrinsic Gaussian profiles of the form

Equation (1)

where F, ${\lambda }_{0}$, and ${\sigma }_{\lambda }$ compose the parameter space explored by the maximum likelihood.

In order to account for false positives, we run the line detection routine on the 115 sky fibers. The results are shown in Figure 4. We detect four lines above 4σ and none above 5σ. Therefore, down to 5σ, we expect at most two false detections in our 120 lines, translating into ≲2% contamination using signal-to-noise S/N* = 5.5 as our threshold. We also characterize our detection completeness (see Figure 4). To obtain it, we use p(S/N${}_{i}\,\gt $ S/N ${}^{* }\,| $S/N), with S/Ni the measured signal-to-noise ratio and S/N* the imposed detection threshold. We define the simulated signal to noise as ${\rm{S}}/{{\rm{N}}}_{\mathrm{sim}}=F/\sqrt{\sum d{\lambda }^{2}{\sigma }_{k}^{2}}$, with $d\lambda $ the wavelength dispersion in the spectrum and ${\sigma }_{k}$ the flux uncertainty for pixel k. We find the most accurate representation by summing over an interval of 20 Å centered at 5500 Å. To recover the actual completeness shown in Figure 4, we simulate $\sim {10}^{3}$ lines on the 115 sky-spectra sampling fluxes of ${10}^{-19}\mbox{--}{10}^{-17}$ ergs s−1 cm−2, FWHMs between 5–13 Å, and wavelengths of 4800–6700 Å. The fraction for which we measure S/Ni > S/N* is our completeness. We use S/N* = 5.5 as our detection threshold, which corresponds to a line-flux sensitivity in the final spectra of $\sim 1\times {10}^{-17}$ erg s−1 cm−2 (see Figure 1).

Figure 4. Refer to the following caption and surrounding text.

Figure 4. Completeness (black) and false positives (gray) yielded by our line detection method. The completeness histogram is recovered by simulating $\sim {10}^{3}$ Gaussian emission lines on the 115 reduced sky spectra. Hence, for this curve, the x-axis corresponds to S/Nsim. The false positives histogram is obtained by running our line detection code on the 115 sky spectra. The dashed red line shows our line detection threshold S/N = 5.5, for which we have a completeness of ∼40%.

Standard image High-resolution image

3.2. Line Profiles

The radiative transfer and escape of Lyα radiation from galaxies can be highly complicated. As a matter of fact, the resonant nature of this line has led to thorough modeling of its radiative escape (e.g., Verhamme et al. 2006; Dijkstra & Kramer 2012). Such complications imply that the flux profile of a Lyα line is not always well reproduced by the usual Gaussian profile (Chonis et al. 2013; Trainor et al. 2015). Hence, for flux measurements, we adopt a more sophisticated model. Similarly to McLinden et al. (2011) and Chonis et al. (2013), we fit double-peaked Gaussian profiles of the form

Equation (2)

where ${f}_{\mathrm{blue}}(\lambda )$ represents the blue emission component and ${f}_{\mathrm{red}}(\lambda )$ represents the red component. We assume each of these components to be asymmetric, i.e., they follow Equation (1), with σ defined as

Equation (3)

Before fitting, we convolve the profiles given by Equations (2) and (3) with the spectral resolution. This allows us to properly characterize the errors in our measurements and methodology. In case there is some sky contamination (which happens for 28 of the 120 detections), we only fit single-peaked profiles. The resulting fits to six of our emission lines are shown in Figure 1.

3.3. Lyα Equivalent Width and Escape Fraction

There are typically two diagnostics used to characterize the prominence of Lyα emission in galaxies: the equivalent width (${W}_{\mathrm{Ly}\alpha }$) and the Lyα escape fraction (fesc). The rest-frame ${W}_{\mathrm{Ly}\alpha }$ is defined as the fraction between line flux and UV continuum flux in the rest frame of the galaxy. Explicitly,

Equation (4)

with F the Lyα flux we measure in the spectra and ${f}_{\lambda }$ the observed flux at rest frame 1700 Å from 3D-HST rest-frame colors (Skelton et al. 2014). It must be noted that the 3D-HST rest-frame colors come from the best-fit template to the photometry, i.e., they are not direct measurements. Still, this allows us to use, in principle, the same rest-frame wavelength for every object, regardless of its actual redshift. In addition, we can also derive a value of ${f}_{\lambda }$ even if the rest-frame 1700 Å photometry is missing. For reference, we find that the templates match the photometry at ∼1700 Å of every object to a typical agreement of ∼90%. For the uncertainties on ${f}_{\lambda }$, however, we use the errors on the photometry. We do not use any ${f}_{\lambda }$ directly measured from our data, since every galaxy has a continuum fainter or comparable to the 1σ errors in the spectra.

On the other hand, fesc is the fraction of the number of Lyα photons that escape the galaxy from the number produced. This diagnostic is typically indirectly recovered using SFRs derived from the Lyα line and intrinsic SFRs. The latter are typically calculated from UV continuum measurements or Hα fluxes, subject to extinction correction. In our case, we use the intrinsic (i.e., extinction-corrected) SFRs from FAST. The observed SFRs are derived from the SED, whereas the extinction corrections assume a Calzetti et al. (2000) attenuation law with ${R}_{V}=4.05$. The explicit definition is then (e.g., Blanc et al. 2011)

Equation (5)

with ${L}_{\mathrm{Ly}\alpha }$ the luminosity associated with the Lyα line and SFR(UV)corr the extinction-corrected SFRs. This assumes a Lyα to Hα ratio of 8.7 and the Hα SFR calibration for a Chabrier (2003) IMF of $4.4\times {10}^{-42}$ (Matthee et al. 2016). In case of non-detections, we use the corresponding S/N* = 5.5 line fluxes to derive upper limits on ${W}_{\mathrm{Ly}\alpha }$ and fesc. Our calculations do not account for the UV excess associated with binaries, which induce uncertainties on fesc that can be very difficult to account for (Stanway et al. 2016).

3.4. AGN Contamination

We do not measure any objects to have more than one emission line with ${\rm{S}}/{{\rm{N}}}_{i}\gt {\rm{S}}/{{\rm{N}}}^{* }$, dismissing the existence of evident active galactic nuclei (AGNs) in our sample. We do not find any of the 120 Lyα-emitting galaxies to have S/N > 1 emission potentially associated with the C iv λ1549 line. Moreover, we find no broad-line Lyα emission (${\rm{\Delta }}v\,\gt 1000$ km s−1), consistent with the rarity of Type I AGNs at high redshift (Dawson et al. 2004), and in strong contrast with the AGN fractions at low redshift (∼0.4; Finkelstein et al. 2009). This is consistent with the fact that bright AGNs are found in ≲1% of z > 3 LAEs (Malhotra et al. 2003; Wang et al. 2004; Gawiser et al. 2007; Ouchi et al. 2008; Zheng et al. 2010). Low-luminosity AGN contamination in LAE samples is more difficult to constrain, but Wang et al. (2004) estimate the total AGN fraction to be <17%. We use our fesc measurements to further dismiss the existence of evident AGNs. We perform cross-matching with the NASA/IPAC Extragalactic Database (NED8 ) for all objects with ${f}_{\mathrm{esc}}\gt 1$, and we find none of them to be a reported X-ray source.

4. The Lyα Equivalent Width Distribution

4.1. Bayesian Inference

Measurement of ${W}_{\mathrm{Ly}\alpha }$ for a galaxy sample yields the ${W}_{\mathrm{Ly}\alpha }$ distribution. Since ${W}_{\mathrm{Ly}\alpha }$ is directly measured from the data, characterization of the ${W}_{\mathrm{Ly}\alpha }$ distribution can be naively considered straightforward. However, careful consideration of uncertainties and completeness can yield important insights into the underlying information. Therefore, for proper characterization of uncertainties, significance, and trends in our results, we use Bayesian statistics. In this section, we explain how to recover the ${W}_{\mathrm{Ly}\alpha }$ distribution within this framework, complementary to the one introduced in Treu et al. (2012).

Different probability distribution models can be adopted to reproduce ${W}_{\mathrm{Ly}\alpha }$ distribution measurements. For instance, studies use Gaussian (e.g., Guaita et al. 2010), exponential (e.g., Jiang et al. 2013b; Zheng et al. 2014), and log-normal distributions. For our analysis, we define the probability distribution as $p({W}_{\mathrm{Ly}\alpha }| {W}_{n})$. Let Wn be the parameter space associated with the model. From now on, we describe the Bayesian approach to recover the posterior distribution of Wn. By means of this approach, we include the uncertainties in sample size, flux measurements, and photometry in the estimation of the posterior. The description we provide is not limited to this particular work, allowing for further application in similar data sets. We only present here the fundamental equations, as this procedure is already described in detail in Oyarzún et al. (2016).

Our Bayesian analysis is based on the Lyα line flux F instead of ${W}_{\mathrm{Ly}\alpha }$, as introduced in Equation (4). This approach simplifies the equations, since we can assume F and ${f}_{\lambda }$ to be normally distributed, which cannot be done for ${W}_{\mathrm{Ly}\alpha }$. According to Bayes' theorem, the posterior distribution $p({W}_{n}| \{F\})$, i.e., the parameter space probability distribution given our data set $\{F\}$, is

Equation (6)

The likelihood is just the product of the individual likelihood for every galaxy, i.e., $p(\{F\}| {W}_{n})=\prod p({F}_{i}| {W}_{n})$. For a detection, it is given by

Equation (7)

where $p({F}_{i}| F)$ is the line-flux probability distribution for the corresponding galaxy valued at a flux F, which we consider to distribute normally. On the other hand, the term $p(F| {W}_{n})$ is just the probability distribution of F given Wn. Using the definition of ${W}_{\mathrm{Ly}\alpha }$ from Equation (4), the term translates to the product distribution $p(F| {W}_{n})=p({W}_{\mathrm{Ly}\alpha }| {W}_{n})p({f}_{\lambda })$. At this point, we include the probability distribution for the continuum, which we also assume to be Gaussian.

The limiting line flux ${F}_{i}^{* }$ for discerning detections from noise is given by our S/N threshold, i.e., ${F}_{i}^{* }={\rm{S}}/{{\rm{N}}}^{* }{\sigma }_{i}$. For galaxies with no detections above ${F}_{i}^{* }$, we adopt the following value for the likelihood:

Equation (8)

with $p({F}_{i}\gt {F}_{i}^{* }| F)$ the detection completeness at a line flux F (see Section 3.1).

Using the expressions for detections and non-detections, the posterior distribution takes the final form:

Equation (9)

with $p({W}_{n})$ the priors of the model parameters and $p(\{F\})$ a normalization constant reflecting the likelihood of the model. The form of Equation (9) is general and will be used as a starting point for multiple analysis throughout.

4.2. Model Comparison

As we introduced in the previous section, multiple probability distributions can be adopted for the representation of the ${W}_{\mathrm{Ly}\alpha }$ distribution. In order to perform model selection, several elements are taken into account, such as model complexity and number of parameters. In this section, we describe our methodology to perform such a selection from a quantitative standpoint. By means of a Bayesian approach, we recover probability ratios for the different models, providing insight into how we perform the selection given our measurements. The analysis presented here is a quantitative implementation of Occam's Razor and is not unique to our data set, i.e., it can be applied to any data set modeling.

Every model we discuss here is composed of a scaled probability distribution and a Dirac delta, as proposed in Treu et al. (2012). If we define the standard probability distributions as ${p}_{0}({W}_{\mathrm{Ly}\alpha }| {W}_{n})$, the modified counterparts we consider are given by

Equation (10)

where the first term is the scaled probability distribution. It is multiplied by the Heaviside $H({W}_{\mathrm{Ly}\alpha })$ to ensure it only represents positive ${W}_{\mathrm{Ly}\alpha }$ values. Hence, this scaled term integrates A, i.e., A can only adopt values between 0 and 1. Note that this term is different from the fraction of detections in our sample, as we consider upper limits for our non-detections. The second term groups the fraction of galaxies that do not emit in Lyα (i.e., no line and/or absorption). As our data is restricted to emission lines, we represent this term using the Dirac delta.

We explore here exponential-, Gaussian-, and log-normal-based distributions. The first two are two-parameter models, while the log-normal is three-parameter dependent. Hence, we generalize our parameter space as ${W}_{n}=(A,{W}_{0},\mu )$. Then, the expressions for our exponential, Gaussian, and log-normal models are, respectively,

Equation (11)

Equation (12)

Equation (13)

We now describe our approach for comparing the three models. For a set of measurements, Bayes' theorem gives the probability of model Mi in the model space $\{M\}$:

Equation (14)

with $p({M}_{i})$ the prior for model Mi in the set $\{M\}$, which we assume to be equal for the three models. Once again, $p(\{F\})$ is a normalization constant. Therefore, the probability for model Mi is proportional to the likelihood of the model, i.e.,

Equation (15)

The absolute probability of each model Mi within the set $\{M\}$ is obtained by imposing that the models explored cover all possible choices, i.e., $\sum p({M}_{i}| \{F\})=1$. Hence, the probability of Mi given our data set is

Equation (16)

For an analytically correct model comparison, analysis of Equation (16) is required. Still, the term in Equation (15) is strongly dependent on the priors assumed for the parameter space Wn of every model. Therefore, different prior selections can have significant effects on the odds for each model. As a workaround, we rewrite the model probabilities as

Equation (17)

Effectively, this simplification conveniently limits our analysis to a pure likelihood comparison, i.e., we adopt constant, uninformative priors. This is equivalent to assuming ignorance in linear scales for every parameter. Since our distributions are smooth and single peaked, we are confident in this assumption. The dimensions of our parameter spaces do not go beyond three, and the uncertainties in our parameters are of the order of the most probable values, providing further assurance to our assumptions.

In Oyarzún et al. (2016), we show that a galaxy sample with a broad M* range yields a composite ${W}_{\mathrm{Ly}\alpha }$ distribution. For the rest of this section, we divide our sample into three M* bins, as shown in Figure 3. We use the complete sample and these three subsamples to contrast the models. The outcomes are presented in Table 1 and Figure 5. Table 1 gives evidence that the best likelihood is obtained with the log-normal model for three of the four distributions. This can be verified in our distribution simulations for the complete sample in Figure 5, especially toward the high ${W}_{\mathrm{Ly}\alpha }$ tail. Still, when integrating the likelihoods, the log-normal distribution is the least probable. This is a consequence of the extra parameter needed by the model, which penalizes the likelihood when integrating over the parameter space. Then, according to our analysis, the preferred model is the exponential. While it models the distribution better than the Gaussian, it also reproduces our ${W}_{\mathrm{Ly}\alpha }$ measurements fairly well, despite depending on only two parameters. In addition, the uncertainties in Figure 5, especially for low ${W}_{\mathrm{Ly}\alpha }$, confirm this model is the most adequate to reproduce our measurements. We remark that the procedure for model selection described here considers the lower ${W}_{\mathrm{Ly}\alpha }$ end of the distribution, which includes our completeness and non-detections. Nonetheless, further ${W}_{\mathrm{Ly}\alpha }$ distribution analysis in this paper is mostly focused on the higher ${W}_{\mathrm{Ly}\alpha }$ end and is not strongly dependent on model preference.

Figure 5. Refer to the following caption and surrounding text.

Figure 5. Shown in histograms are the observed rest-frame ${W}_{\mathrm{Ly}\alpha }$ distribution for the whole sample. These histograms account for the 120 measured ${W}_{\mathrm{Ly}\alpha }$ with S/N ≥ 5.5. From left to right, we overplot the exponential, Gaussian, and log-normal model constraints, respectively. The solid colored lines correspond to the peak of the likelihood, i.e., the most probable solutions. The dashed lines represent these solutions corrected for our completeness. The shaded regions show the 1σ and 3σ confidence levels yielded by the likelihood. Our Bayesian model comparison prefers the exponential model, since it reproduces our ${W}_{\mathrm{Ly}\alpha }$ measurements fairly well despite having only two free parameters.

Standard image High-resolution image

Table 1.  Lyα Equivalent Width Distribution Model Comparison

Likelihood Low Mass Medium Mass High Mass Complete Sample
Output Exp Gaussian Log n Exp Gaussian Log n Exp Gaussian Log n Exp Gaussian Log n
Model oddsa 0.93 0.06 0.01 0.97 0.01 0.02 1 10−3 10−5 0.98 10−6 0.02
Peak oddsb 0.25 0.02 0.73 0.23 10−3 0.77 0.97 10−3 0.03 0.19 10−7 0.81
Ac 0.75 0.55 0.55 0.55 0.4 0.5 0.35 0.2 0.2 0.4 0.3 0.45
W0c 46 74 0.7 26 46 0.85 14 26 0.75 38 64 1.05
μc 3.85 3.05 3 3.05

Notes.

aObtained by integrating the likelihood over the whole parameter space. bCalculated with the maximum of the likelihood. cModel parameters for the corresponding likelihood maximum. Note that these values are different from the ones in Oyarzún et al. (2016), since here we are just working with the likelihood.

Download table as:  ASCIITypeset image

From now on, we perform our ${W}_{\mathrm{Ly}\alpha }$ distribution analysis using the exponential model of Equation (11). We stress that this expression is dependent on the parameters A and W0, with the first being the fraction of galaxies showing emission and the second the e-folding scale of the distribution. We advise caution when using A as a proxy for the fraction of line emitters in the parent population, since it is tied to the adequacy of an exponential profile.

5. Lyα Emission Dependence on Galaxy Properties

5.1. Stellar Mass

Evidence suggests Lyα emission is strongly dependent on the M* of galaxies. Galaxies with higher M* have been forming stars for longer, leading to greater ISM dust that presumably forms in supernovae and AGB stars (Silva et al. 1998). A greater dust content leads to more Lyα photon absorption, decreasing ${W}_{\mathrm{Ly}\alpha }$. This effect has already been observed, at least for high ${W}_{\mathrm{Ly}\alpha }$, in Blanc et al. (2011) and Hagen et al. (2014). Similarly, the bulk of M* is dominated by older stars, which do not contribute significantly to the Lyα photon budget of galaxies. As a matter of fact, Lyα emission decreases steadily with the age of stellar populations, as seen in Charlot & Fall (1993) and Schaerer (2003). Lyα radiative transfer is also severely affected by the neutral gas structure and kinematics of the ISM and circumgalactic medium (Verhamme et al. 2006). Since more massive star-forming galaxies are bound to have higher gas mass (e.g., Kereš et al. 2005; Finlator et al. 2007), Lyα photons should be subject to more resonant scattering, therefore decreasing their ${W}_{\mathrm{Ly}\alpha }$. The trends we find in Oyarzún et al. (2016) confirm this qualitative scheme at $z\sim 4$. In this section, we perform a more detailed and robust characterization of Lyα emission dependence on M*.

Our sample is especially designed to study the dependence of Lyα emission in M*. As shown in Figure 3, our objects are selected in redshift and M*, homogeneously covering the range $7.6\lt \mathrm{log}({M}_{* }[{M}_{\odot }])\lt 10.6$. For further clarity, we plot in Figure 6 the M* and SFRs of our complete sample and detections. Comparison of the M* histograms slightly hints at an anti-correlation between the LAE fraction and M*, at least down to our detection limit. However, since our completeness depends on M*, a more thorough analysis is required (see below). The existence of any Lyα emission dependence on M* becomes clearer in Figure 7, where we plot ${W}_{\mathrm{Ly}\alpha }$ and fesc as a function of M*. We also plot in Figure 7 our upper limits for non-detections. The regions sampled by our non-detections reveal how our completeness is not independent of M*. Our detections are flux limited, so we achieve lower ${W}_{\mathrm{Ly}\alpha }$ for galaxies with brighter UV continuum. Therefore, even though we observe lower M* galaxies to have higher ${W}_{\mathrm{Ly}\alpha }$ and fesc, any qualitative conclusions we can draw involving the fraction of LAEs as a function of M* are affected by our completeness. Still, there is a clear upper envelope to the distribution of galaxies in this plot, where we are not affected by incompleteness. For ${M}_{* }\lt {10}^{8.5}$, there is a clear anti-correlation between ${W}_{\mathrm{Ly}\alpha }$ and M*. For more massive galaxies, however, the trend is mostly flat, except for the presence of a few interlopers. Still, our qualitative result is that both Lyα emission diagnostics show an anti-correlation with M*. For the rest of this section, we focus on how to thoroughly quantify the effect of M* on ${W}_{\mathrm{Ly}\alpha }$.

Figure 6. Refer to the following caption and surrounding text.

Figure 6. Detail of the M* and SFR of our galaxies. Left: M*–SFR plane location of our objects. Non-detections are shown in gray. Low, medium, and high ${W}_{\mathrm{Ly}\alpha }$ detections are plotted as cyan circles, blue diamonds, and magenta pentagons, respectively. The values of M* and SFR we show come from SED fitting of 3D-HST photometry using FAST under the assumption of constant SFHs. The dashed lines indicate the minimum and maximum M* that galaxies can reach assuming that they have been forming stars at a constant rate. Right: M* (top) and SFR (bottom) histograms of the 625 galaxies in our sample (gray) and 120 detections (black). For better comparison, we divide the histogram values for the full sample by five. Note that inferences on the fraction of detections must take into account that incompleteness is dependent on both M* and SFR. In this paper, we use the exponential ${W}_{\mathrm{Ly}\alpha }$ distribution normalization (A; Equations (18) and (19)) as a proxy for the dependence of the Lyα fraction on M*. Our measurements are more than 2σ consistent with a decrease in the fraction going to higher M* (see Figure 8).

Standard image High-resolution image
Figure 7. Refer to the following caption and surrounding text.

Figure 7. Top: plot of our ${W}_{\mathrm{Ly}\alpha }$ as a function of M*. Bottom: plot of our Lyα escape fractions as a function of M*. The detections corresponding to the low-, medium-, and high-mass subsamples are shown as blue circles, green diamonds, and red pentagons, respectively. We observe both Lyα diagnostics, ${W}_{\mathrm{Ly}\alpha }$ and fesc, to anti-correlate with M*. The upper limits associated with non-detections are plotted as gray triangles. We use these non-detections to give a rough estimate of our 40% completeness (dashed). Note how our completeness is not independent of M*, which has to be considered when making inferences on the M* distribution and characteristic ${W}_{\mathrm{Ly}\alpha }$ (fesc) of LAEs. We also show in yellow the typical rest-frame selection threshold of ${W}_{\mathrm{Ly}\alpha }\gt 20$ Å imposed by narrowband surveys (e.g., Gronwall et al. 2007; Guaita et al. 2010; Adams et al. 2011).

Standard image High-resolution image

The overall dependence of Lyα emission on M* implies that ${W}_{\mathrm{Ly}\alpha }$ distributions in the literature (e.g., Gronwall et al. 2007; Zheng et al. 2014) are influenced by the M* distribution of the sample. In comparison to deeper MUV surveys, shallower samples are bound to observe lower ${W}_{\mathrm{Ly}\alpha }$. This can lead to incorrect contrast of surveys and misinterpretation of trends. To verify these claims, we divide our sample into three M* bins (see Figure 3) and plot the resulting ${W}_{\mathrm{Ly}\alpha }$ distributions in Figure 8. As expected, there is an apparent anti-correlation between M* and both the tail of the ${W}_{\mathrm{Ly}\alpha }$ distribution and the normalization. We perform our first quantitative characterization of the ${W}_{\mathrm{Ly}\alpha }$ distribution dependence on M* in Oyarzún et al. (2016). In that study, we divide our sample into three M* bins and obtain the posterior distribution for the exponential parameters separately. This procedure allowed us to fit a linear relation to the final parameters, recovering A(M${}_{* }$) and W0(M${}_{* }$) using expressions of the form

Equation (18)

Equation (19)

In this section, we recover these linear relations directly from the complete sample, i.e., we recover the posterior distribution for the four-parameter space composed of the linear coefficients in Equations (18) and (19). This more robust methodology does not rely on binning, while also allowing us to constrain the errors on the coefficients directly from the model and measurements. We once again start from Equation (9). As mentioned, our parameter space is now ${W}_{n}=({A}_{{M}_{* }},{A}_{C},{W}_{{M}_{* }},{W}_{C})$. As these linear coefficients represent the exponential parameters of Equation (11), deriving non-informative priors is highly complicated. Therefore, in order to determine our priors, we only consider linear scales ignorance. Therefore, the posterior translates to

Equation (20)

with C a normalization constant.

Figure 8. Refer to the following caption and surrounding text.

Figure 8. Rest-frame ${W}_{\mathrm{Ly}\alpha }$ distributions of the low-, medium-, and high-mass bins, from left to right. We use Monte Carlo simulations to characterize the exponential ${W}_{\mathrm{Ly}\alpha }$ distribution dependence on M* following Equations (18) and (19). We can then simulate ${W}_{\mathrm{Ly}\alpha }$ distributions for every subsample and obtain 1σ constraints (shaded contours). Keep in mind that these constraints are given by collections of curves resulting from our Monte Carlo simulations. Our best results correspond to the median distribution we expect by considering every object in the bin. We also plot as dotted lines the completeness-corrected counterparts of the median distributions. In contrast to Oyarzún et al. (2016), where the modeling is based on the characteristic M* of each bin, the results here take into account the M* of every galaxy. As we show in Equations (21) and (22), we find both parameters, the fraction of Lyα-emitting galaxies and characteristic ${W}_{\mathrm{Ly}\alpha }$ scale, to anti-correlate with M* with significances higher than 2σ and 6σ, respectively.

Standard image High-resolution image

We use MCMC simulations to characterize this four-parameter posterior. Its maximum gives the best solution for $A({M}_{* })$ and ${W}_{0}({M}_{* })$, while the collapsed posteriors yield the uncertainties on the parameters. We can then write Equations (18) and (19) as

Equation (21)

Equation (22)

In the framework of an exponential profile, these relations we recover yield the ${W}_{\mathrm{Ly}\alpha }$ probability distribution for an object with known M*. Hence, we can simulate the expected ${W}_{\mathrm{Ly}\alpha }$ distribution for each of the three M* subsamples and compare with our direct measurements. The results are presented in Figure 8. Our constraints are consistent with the observed ${W}_{\mathrm{Ly}\alpha }$ distributions.

Our results regarding the dependence of Lyα on M* are conclusive. In the range ${10}^{8}\mbox{--}{10}^{10.5}\,{M}_{\odot }$, the ${W}_{\mathrm{Ly}\alpha }$ probability distribution extends to higher ${W}_{\mathrm{Ly}\alpha }$ for lower M* galaxies. In other words, more massive galaxies tend to have lower ${W}_{\mathrm{Ly}\alpha }$. A similar trend is observed for fesc, further highlighting the role of dust and gas mass in the escape of Lyα photons. At z ∼ 2.2, Matthee et al. (2016) also observe the fesc anti-correlation from Figure 7, although only when stacking galaxies. Their massive objects showing high fesc, which they associate with dusty gas outflows, seem to lie below the M*–SFR sequence at z ∼ 2. These inferences, combined with the significantly higher fesc they measure for larger apertures, make sense in a Lyα diffuse halo scheme, which we do not observe due to our aperture size. Studies on the dependence of fesc on M* also explore 1.9 < z < 3.6 (Hagen et al. 2014). Their ${W}_{\mathrm{Ly}\alpha }$-selected LAEs follow a trend similar to the one we find at z ∼ 4. In summary, the evidence for an anti-correlation between ${W}_{\mathrm{Ly}\alpha }$ (or fesc) and M* is significant, but the scatter seems to depend on measurement methodology and sample selection.

Inferences on the M* distribution of LAEs are not as evident. Hagen et al. (2014) do not find their 1.9 < z < 3.6 Lyα luminosity-selected LAE number distribution to depend on M*. Their results agree with the z ∼ 3.1 narrowband-selected survey of McLinden et al. (2014). However, we show in Figure 7 that spectroscopic completeness is not independent of M*. Therefore, most LAEs survey follow-ups could have higher incompleteness toward lower M*. Since our Bayesian analysis takes into account our completeness for every object, we can test the significance of this claim. The coefficient ${A}_{{M}_{* }}$ in Equation (18) represents the exponential fraction of LAE dependence on M*. As evidenced by Equation (21), our measurements are more than $2\sigma $ consistent with a decrease in the fraction going to higher M*. In the scheme of an exponential model, this translates to an LAE distribution dominated by lower M* galaxies. This result complements the much more significant anti-correlation between W0 and M* we find (see Equation (22)).

5.2. SFR

High-redshift galaxies have been observed to follow a correlation between SFR and M*, known as the star-forming main sequence (e.g., Kereš et al. 2005; Finlator et al. 2007; Stark et al. 2009; González et al. 2011; Whitaker et al. 2012; see our Figure 6). In terms of the underlying physics, more massive objects dominate gas accretion in their neighborhood, feeding and triggering star formation. Such gas infall seems to dominate over galaxy growth at high redshift (Kereš et al. 2005; Finlator et al. 2007). This scheme implies that more massive objects form stars at higher rates, at least down to our observational limitations and modeling of high-redshift ISM. Given our results on M* from the previous section, we expect similar trends between ${W}_{\mathrm{Ly}\alpha }$ (fesc) and SFR (Figure 6). Even more, star-forming galaxies have a higher neutral gas mass, which can hamper the escape of Lyα photons from galaxies (Verhamme et al. 2006). In fact, it has also been suggested that photoelectric absorption rules Lyα depletion, even over dust attenuation (Reddy et al. 2016). In this section, we explore any Lyα dependence on SFR within our data set. We remark that our SFRs come from SED fitting of 3D-HST photometry using FAST (see Section 2.3), i.e., they have typical associated timescales of 100 Myr (Kennicutt 1998). We stress that our derived SFRs differ from 3D-HST SFRs, since our calculation assumes cSFHs instead of exponentially declining SFHs (Skelton et al. 2014).

We show the ${W}_{\mathrm{Ly}\alpha }$ and fesc dependence on SFR in Figures 6 and 9. In the latter, we include upper limits for our non-detections to give an insight into how our incompleteness depends on SFR. A clear anti-correlation between ${W}_{\mathrm{Ly}\alpha }$ (fesc) and SFR is observed. These results come as no surprise, as they have been previously reported. Most studies of ${W}_{\mathrm{Ly}\alpha }$ dependence on SFR involve uncorrected SFRs (Pettini et al. 2002; Shapley et al. 2003; Yamada et al. 2005; Gronwall et al. 2007; Tapken et al. 2007; Ouchi et al. 2008; all compiled in Verhamme et al. 2008). Even without dust correction, the anti-correlation is still present in these studies (refer to Figure 19 in Verhamme et al. 2008 and Section 5.4 of this work). Based on a z ∼ 2 Hα emitters sample, Matthee et al. (2016) also observe a clear anti-correlation between Lyα fesc and SFR. Interestingly, they do not only observe such a trend in their individual objects, but likewise on their stacks when using different apertures (galaxy diameters of 12 and 24 kpc). As their data set includes Hα fluxes, they can recover SFRs and fesc using Hα luminosities. The fact that they observe similar trends with such a different sample suggests that the anti-correlation between ${W}_{\mathrm{Ly}\alpha }$ (fesc) and SFR is not only independent of redshift, but also observational constraints like aperture and methodology for recovering SFRs. Their comparison, however, is restricted to SFRs higher than $\sim 5\,{M}_{\odot }$ yr−1. Most of our low-mass objects have SFRs lower than $\sim 5\,{M}_{\odot }$ yr−1, but they seem to follow the same regime as the rest of our sample. Even though fesc uncertainties and incompleteness increase toward lower-SFR, UV-fainter galaxies (Figure 9), our results suggest that fesc reaches values of 100% toward $\mathrm{SFR}\sim 1\mbox{--}3\,{M}_{\odot }$ yr−1. These numbers are consistent with the analysis by Atek et al. (2014). They compare their z < 0.5 SFR(Lyα)/SFR(UV) measurements with the literature at z > 2 (Taniguchi et al. 2005; Gronwall et al. 2007; Guaita et al. 2010; Curtis-Lake et al. 2012; Jiang et al. 2013b). Our finding that fesc reaches values of 100% toward $\mathrm{SFR}\sim 1\mbox{--}3\,{M}_{\odot }$ yr−1 overestimates SFR(Lyα)/SFR(UV) for z < 0.5, but is consistent with higher redshift. As pointed by Atek et al. (2014), it seems that fesc at fixed SFR(UV) increases with redshift. Then again, it must be kept in mind that these literature results consider uncorrected SFRs.

Figure 9. Refer to the following caption and surrounding text.

Figure 9. Plot of our ${W}_{\mathrm{Ly}\alpha }$ (top panel) and fesc (bottom panel) as a function of intrinsic SFR. Detections corresponding to our low-, medium-, and high-mass bins are shown as the blue circles, green diamonds, and red pentagons, respectively. We find both measurements to anti-correlate with the SFR we measure from SED fitting. For the plot in the bottom panel, we use the same value for SFRs when determining the escape fractions and galaxy SFRs. For reference, we show our non-detection upper limits as gray triangles. We use these non-detections to give a rough estimate of our 40% completeness (dashed), confirming how strongly our sensitivity depends on SFR.

Standard image High-resolution image

5.3. UV Luminosity

In this section, we analyze the MUV distribution of our sample, while also exploring any correlations between ${W}_{\mathrm{Ly}\alpha }$ and UV luminosities. It must be noted, though, that our sample is not representative of the galaxy population at 3 < z < 4. First, our galaxies are homogeneously distributed in M*, i.e., they are not a random sample from 3 < z < 4 CANDELS objects. Second, we are affected by CANDELS completeness, which decreases toward lower M* galaxies. Since more massive galaxies tend to have higher UV luminosities (Stark et al. 2009; González et al. 2014), our sample has a higher contribution of bright MUV galaxies than a population-representative subsample.

In order to determine MUV for our objects, we use the CANDELS i775 band. We show in Figure 10 the corresponding distribution of the complete sample and detections. We also present the dependence of ${W}_{\mathrm{Ly}\alpha }$ on MUV in this figure. As expected from the $\mathrm{SFR}\mbox{--}{W}_{\mathrm{Ly}\alpha }$ anti-correlations we recover in Section 5.2, a similar trend is observed for UV luminosities. This anti-correlation comes as no surprise, since brighter UV galaxies tend to have higher M* at the cosmic time of our sample (Stark et al. 2009; González et al. 2014). Galaxies brighter in the UV have been subject to more intensive star formation events in 100 Myr timescales. Typically, higher neutral gas, turbulence, and bulk gas motions are associated with higher SFRs, boosting the scatter of Lyα photons. In combination, higher dust extinction, older age of stellar populations, and greater neutral gas mass in UV brighter galaxies seem to rule Lyα statistics dependence on SFR.

Figure 10. Refer to the following caption and surrounding text.

Figure 10. Top: histograms of sample (gray) and detections (black) as a function of UV absolute magnitude. We use the CANDELS i775 band for the derivation of MUV. For better comparison, we divide the histogram values for the full sample by five. The photometric measurement on this particular band is available for 621 of the 625 galaxies. Center: dependence of ${W}_{\mathrm{Ly}\alpha }$ on UV absolute magnitude. Bottom: dependence of fesc on MUV. Blue circles, green diamonds, and red pentagons correspond to low-, medium-, and high-mass subsample detections, respectively. Non-detection upper limits are shown as gray triangles, and we use them to give a rough estimate of our 40% completeness (dashed). Our results suggest that ${W}_{\mathrm{Ly}\alpha }$ and fesc anti-correlate with UV luminosity. These trends are consistent with studies performed on LBG samples (Stark et al. 2010; Schaerer et al. 2011) and narrowband surveys (Ouchi et al. 2008).

Standard image High-resolution image

Most analyses of Lyα emission dependence on MUV have been performed using uncorrected SFRs. The literature compilation shown in Verhamme et al. (2008) reveals how observed UV SFRs anti-correlate with ${W}_{\mathrm{Ly}\alpha }$ from z ∼ 2.7 to 5.7 (Pettini et al. 2002; Shapley et al. 2003; Yamada et al. 2005; Gronwall et al. 2007; Tapken et al. 2007; Ouchi et al. 2008). Analyses explicitly using MUV have also been performed in the surveys of Shimasaku et al. (2006), Ouchi et al. (2008), Vanzella et al. (2009), Balestra et al. (2010), Stark et al. (2010), Schaerer et al. (2011), and Cassata et al. (2011), yielding similar trends up to z ∼ 6. More recent studies confirm these trends (Jiang et al. 2013b, 2016; Zheng et al. 2014). Simulations likewise predict such correlations (see Shimizu et al. 2011). However, regardless of the methodology, the scatter in these correlations is non-negligible. Moreover, Atek et al. (2014) question the existence of the correlation in their sample. We argue that the scatter observed in the ${W}_{\mathrm{Ly}\alpha }$ dependence on MUV/SFRobs can be a consequence of the role played by dust. Galaxies brighter in the UV naturally have a greater Lyα photon production, but the anti-correlation with M* affects the escape fraction (probably through increased dust extinction). In this scenario, Lyα photon escape is a complex process simultaneously ruled by different properties of high-redshift galaxies. We explore such a property space in our analysis of the ${W}_{\mathrm{Ly}\alpha }$ distribution dependence on the ${M}_{\mathrm{UV}}\mbox{--}\beta $ relationship in Section 5.5.

Implications involving the fraction of detections as a function of MUV are not straightforward. In principle, this fraction seems to correlate with the UV luminosities of our galaxies, as opposed to what is observed for characteristic ${W}_{\mathrm{Ly}\alpha }$. Nevertheless, our Lyα measurements are flux limited. Therefore, our detection completeness in ${W}_{\mathrm{Ly}\alpha }$ is higher for brighter objects, leading to biases difficult to account for, as noted in Nilsson et al. (2009). Under these circumstances, the ideal approach is to consider both detections and non-detections, while taking into account the uncertainties for line and continuum fluxes (F and fλ, respectively). Hence, we encourage further interpretations of these results to focus on the analysis performed on the ${M}_{\mathrm{UV}}\mbox{--}\beta $ plane (Section 5.5).

5.4. UV Slope

Measurement of the UV slopes of high-redshift galaxies is a direct way of tracing the amount of dust inside galaxies, given the assumption of an extinction law and an intrinsic spectral shape. This is of particular interest for Lyα surveys, since simulations (e.g., Verhamme et al. 2008) and observations at low redshift (Hayes et al. 2011; Atek et al. 2014) suggest that dust plays an important role in the escape of Lyα photons. Since we have rest-frame UV photometry from CANDELS for all our objects, we can determine their UV slopes and study their effect on Lyα emission at z ∼ 4. In this section, we detail our method to estimate the UV slopes for our objects and show our results on the Lyα dependence on this galaxy property.

To determine UV slopes, we fit a power law ${f}_{\lambda }={f}_{0}{\lambda }^{\beta }$ (Calzetti et al. 1994) to the photometry of each object. For fitting, we just use a standard least-squares routine on the photometry between rest frame 1400 and 3500 Å. These calculations correspond, in principle, to 8–15 bands between 1500 and 3600 Å. However, since we only use fluxes with S/N > 3, our median number of bands is eight. Naturally, we further require at least two bands to associate a slope to our targets, which means we can measure the UV slope for 611 of the 625 observed objects composing our sample.

Our FAST outputs include dust extinction in the V-band, AV, for every object. However, if we want to associate a reddening $E(B-V)$ with every galaxy, we need to use our derived UV slopes. We use the relation

Equation (23)

where we assume a pristine slope of β = −2.23 (Meurer et al. 1999). We also require the adoption of an attenuation law ${k}_{\lambda }$ (e.g., Calzetti et al. 2000). Reddy et al. (2015) recover a more appropriate attenuation law than Calzetti et al. (2000) for high-redshift galaxies, so we use the ${k}_{\lambda }$ of the former. It is worth noting, nevertheless, that both attenuation laws yield almost identical results.

Since our targets are M* selected, we have a higher contribution of massive objects in comparison to the M* distribution of the galaxy population. As more massive objects tend to have higher $E(B-V)$ and redder UV slopes, we expect z ∼ 4 samples representative of the galaxy population to have a lower contribution from such galaxies. In any case, we show in Figure 11 the UV slope histogram of our sample and our Lyα emitters. Just by comparing both distributions, it is clear that the fraction of emitters increases toward bluer galaxies. We present a quantitative analysis of this claim in Section 5.5.

Figure 11. Refer to the following caption and surrounding text.

Figure 11. Top: histograms of the sample (gray) and detections (black) as a function of UV slope (β) and reddening $E(B-V)$. For better comparison, we divide the histogram values for the full sample by five. Center: ${W}_{\mathrm{Ly}\alpha }$ dependence on β. Bottom: fesc dependence on β. We are able to measure the UV slope for 611 of the 625 objects composing our sample, and obtain $E(B-V)$ assuming a Reddy et al. (2015) attenuation law with a pristine slope of β = −2.23 (Meurer et al. 1999). Blue circles, green diamonds, and red pentagons symbolize our low-mass, medium-mass, and high-mass detections, respectively. Gray triangles represent our upper limits from non-detections, which we use to estimate our 40% completeness (dashed). Our results clearly show that high-fesc objects are associated with extremely blue slopes, consistent with the notion that dust boosts the absorption of Lyα photons. We also include some trends from the literature for comparison.

Standard image High-resolution image

We also show in Figure 11 our results on ${W}_{\mathrm{Ly}\alpha }$ and fesc as a function of β and $E(B-V)$. Since lower mass galaxies have bluer UV slopes than more massive ones, the trends we find are complementary to our previous results. There is a correlation between the steepness of the UV spectrum and ${W}_{\mathrm{Ly}\alpha }$/fesc, although with significant scatter. As extinction seems to play a major role in Lyα photon escape from galaxies, mainly through scattering and absorption (Blanc et al. 2011; Hagen et al. 2014), these correlations come as no surprise. Qualitatively, our results agree with the measurements from Shapley et al. (2003), Pentericci et al. (2009), Blanc et al. (2011), and Atek et al. (2014). Regarding the scatter we find at fixed β (see also Blanc et al. 2011), it is consistent with a scenario where the observed Lyα flux is mostly affected by the dependence of the dust-covering fraction on the line of sight. We discuss this picture in detail in Section 5.5.

When comparing the more dusty Lyα emitters in our sample with results from the literature, however, some differences show up. The 1.9 < z < 3.8 survey from Blanc et al. (2011) and the z < 0.5 study from Atek et al. (2014) find Lyα emitters up to $E(B-V)\sim 1$. Similarly, Matthee et al. (2016) find a population of dusty LAEs at z ∼ 2.23, and they speculate on how dusty gas outflows might be the feature driving the escape of Lyα radiation. Lyα sources up to z ∼ 3 are Herschel (Oteo et al. 2011, 2012; Casey et al. 2012; Sandberg et al. 2015) and SCUBA (Geach et al. 2005; Hine et al. 2016) detected. Moreover, Lyα emission has also been measured in submillimeter galaxies (e.g., Chapman et al. 2005). On the other hand, our 625 object sample features ∼5 objects with $E(B-V)\gt 0.4$, but none of them qualifies as a detection. We cannot state whether the absence of such LAEs in our sample is representative of z ∼ 4. True enough, the fraction of dusty galaxies at z ∼ 4 is expected to be lower than at z < 4. Along such lines, cosmic evolution in the fraction of dusty LAEs has already been discussed in the literature. Blanc et al. (2011) study any dust evolution in their 1.9 < z < 3.8 LAEs sample and find no significant trend. Hagen et al. (2014) study the same sample, and show that there is little anti-correlation, if any, between $E(B-V)$ and redshift. Shapley et al. (2003) do observe such evolution in the range 2 < z < 3.5, but question the validity given the selection effects associated with LBG surveys. We conclude that even though Lyα emitting galaxies are mostly low-dust objects (e.g., Song et al. 2014), there is also a population of dustier, low-${W}_{\mathrm{Ly}\alpha }$ LAEs. Still, the significance of their numbers at z ≳ 4 is still an open question.

Our results also confirm at z ∼ 4 a trend in fesc that has already been observed at lower redshift. Atek et al. (2014) perform a fesc study at z < 0.5 and recover a similar anti-correlation to the one we show in Figure 11. Hayes et al. (2011) at z ∼ 2.2, Blanc et al. (2011) at 1.9 < z < 3.8, Song et al. (2014) at z ∼ 2.1–2.5, and Matthee et al. (2016) at z ∼ 2.23 also observe the same trends. Verhamme et al. (2008) replicate such trends using radiative simulations of galaxies in the range 2.8 < z < 5, confirming that qualitative explanations for these observational relations are well supported by theory. We show several best-fit relations from the literature in our lower plot of Figure 11. Considering that our results are dominated by upper limits, the two higher redshift relations are roughly consistent with our measurements. Still, of particular interest might be the potential redshift evolution suggested by these relations. There is a clear decrease in fesc at high dust contents when going from low to high redshift. If real, this trend could back our previous analysis on the fraction of more dusty LAEs and their evolution as a function of cosmic time. At higher redshift (Verhamme et al. 2008; this work), low-${W}_{\mathrm{Ly}\alpha }$, very dusty LAEs do not seem to be common, driving the $E(B-V)-{f}_{\mathrm{esc}}$ relation down significantly for $E(B-V)\gt 0.2$. However, at lower redshift, such objects are actually observed, driving the relation up for $E(B-V)\gt 0.2$.

5.5. MUV–β Sequence

We have characterized in this paper the dependence of Lyα emission on M*, SFR, UV luminosity, and UV slope. We find ${W}_{\mathrm{Ly}\alpha }$ and fesc to anti-correlate with these four properties. However, these results might not be independent, since these properties are correlated (see Figure 6). For instance, M* and SFR are known to follow a relation at high redshift referred to as the main sequence (Kereš et al. 2005; Finlator et al. 2007; Noeske et al. 2007; Daddi et al. 2007). Similarly, a relation between MUV and β has been studied in LBGs (Bouwens et al. 2009, 2014). As we discuss throughout this work, Lyα escape from galaxies is likely to be a process ruled by many parameters, such as age of the population, gas column density, extinction, and SFR. Therefore, it is interesting to explore Lyα emission dependence on multi-dimensional spaces. In this section, we study ${W}_{\mathrm{Ly}\alpha }$ dependence on MUV and β. There are two reasons to justify exploring this parameter space in particular. First, both properties are observables, i.e., the amount of assumptions involved in their calculation is kept to a minimum (unlike, for example, M* and SFR). Second, these two properties are directly related to elements that rule Lyα escape. The UV luminosity of a galaxy traces its ionizing output and gas content, whereas the UV slope is a proxy for dust. Hence, in this section, we focus on characterizing Lyα emission in the ${M}_{\mathrm{UV}}\mbox{--}\beta $ plane. Similarly to previous analyses in this paper, we take advantage of a Bayesian approach to obtain our results.

We first present in Figure 12 the location of our detections and non-detections in this plane. In qualitative terms, our detections sample most of the sequence initially covered by our targets. For further insight into how our observations depend on this sequence, we construct four subsamples for visualization. The anti-correlation we observe between ${W}_{\mathrm{Ly}\alpha }$ and MUV is still clearly observed when comparing the low MUV with the high MUV subsamples, independent of β. For the UV slope, however, the trend between ${W}_{\mathrm{Ly}\alpha }$ and β does not seem that clear anymore. The normalization of the distribution, however, looks to be highly dependent on the UV slope. To characterize the significance of these insights, we simultaneously model the exponential profile parameters of Equation (11) as a function of MUV and β. We start again with linear expressions of the form

Equation (24)

Equation (25)

Once the parameterization and priors are set, we can obtain the posterior distribution of $\{{W}_{n}\}=({A}_{{M}_{\mathrm{UV}}},{A}_{\beta },{A}_{C},{W}_{{M}_{\mathrm{UV}}},{W}_{\beta },{W}_{C})$ using Equation (9). We assume the priors to be independent, which translates to $p(\{{W}_{n}\})\propto \prod p({W}_{i})$. Once again, we impose ignorance on the parameters, i.e., we adopt non-informative priors. We study the posterior distribution assuming parameter ignorance in both linear and logarithmic scales. We decide for the first, since logarithmic priors diverge for parameters than can adopt values close to zero (${A}_{{M}_{\mathrm{UV}}}$, AC and ${W}_{\beta }$). Therefore, our prior is simply a constant $p(\{{W}_{n}\})$. Considering that $p(\{F\})$ is just a normalization factor in Equation (9), the posterior distribution of the six-parameter space can be obtained:

Equation (26)

Figure 12. Refer to the following caption and surrounding text.

Figure 12. Left: location of our objects in the ${M}_{\mathrm{UV}}\mbox{--}\beta $ plane. Galaxies with detections are plotted as colored circles, whereas non-detections are shown as gray points. Only galaxies for which we can measure MUV and β are considered (607 of 625). To show the dependence of ${W}_{\mathrm{Ly}\alpha }$ on this plane, we divide our sample into the four subsets of virtually the same number of galaxies that are delimited by the dashed black lines. We plot in red the 3σ contours associated with the relation followed by z ∼ 4 LBGs, according to the UV LFs from Bouwens et al. (2015) and the ${M}_{\mathrm{UV}}\mbox{--}\beta $ relations from Bouwens et al. (2014). Right: observed ${W}_{\mathrm{Ly}\alpha }$ distributions for the respective ${M}_{\mathrm{UV}}\mbox{--}\beta $ subsets. We plot as black lines the median distributions for each subset given by the best solution of our ${M}_{\mathrm{UV}}\mbox{--}\beta $ model (Equations (27) and (28)). The dotted lines are recovered after applying the incompleteness of every subsample. The shaded contours represent 1σ constraints according to our Monte Carlo simulations of Equations (24) and (25).

Standard image High-resolution image

The value of the parameters A and W0 is, by definition, constrained to the intervals $A=[0,1]$ and ${W}_{0}=(0,\infty )$. Still, in the case of extreme luminosities and/or slopes, our linear parameterizations can yield values outside these intervals. As a solution, we just impose A and W0 to saturate outside their corresponding ranges. This condition can be considered simply as an indirect prior on the parameters. For the particular case of ${W}_{0}\lt 0$, we impose A = 0, i.e., the ${W}_{\mathrm{Ly}\alpha }$ distribution is a Dirac delta (i.e., no Lyα emission and/or absorption).

We recover our best solution from Monte Carlo simulations and obtain the uncertainties on our parameters using the collapsed distributions. The results are as follows:

Equation (27)

Equation (28)

We show with more detail the results for these coefficients in Figure 13. The early analysis based on the sample binning from Figure 12 is verified in Figure 13. The normalization factor A anti-correlates with β with a significance ≳2σ. There also seems to be an anti-correlation between A and MUV, but to a level of ∼1σ. For W0, the e-folding scale of the distribution, the behavior is the opposite. The extent of the distribution correlates with MUV and shows a weak anti-correlation with β.

We now focus on the interpretation of these results. The fact that beta correlates mostly with A suggests that the UV slope is somehow related to a stochastic probability of being able to observe or not a particular object in Lyα. On the other hand, the stronger correlation of MUV with ${W}_{\mathrm{Ly}\alpha }$ suggests that the UV luminosity is associated with physical processes that determine the magnitude of the resulting ${W}_{\mathrm{Ly}\alpha }$. A possible scenario is one in which MUV traces SFR and, therefore, the total cold gas mass of galaxies. Hence, higher UV luminosities translate into increased levels of Lyα photon scattering into an extended halo, smoothly reducing the ${W}_{\mathrm{Ly}\alpha }$ of the central source. At the same time, in a significantly clumpy ISM in which dust is well mixed with the gas, a large contrast in gas density/column will imply that dust can effectively block the bulk of both UV and Lyα radiation in some parts of the disk. Such impact will be significant toward certain lines of sight, with little to no effect on others. Such a "covering factor" scenario could explain that β correlates more significantly with the normalization (A) than with the shape of the distribution (W0). This is consistent with the finding that Lyα and UV suffer from similar levels of extinction by dust in LAEs (e.g., Blanc et al. 2011). Moreover, the scatter in the fesc of Lyα photons at fixed reddening (see Figure 11 and Blanc et al. 2011) is also consistent with this scenario where wide ranges of dust absorption lines of sight and photon scattering halos rule the escape of Lyα radiation.

Naturally, this interpretation is a simplified description of Lyα escape from galaxies. Going no further, Figure 13 reveals how both distribution parameters do not solely depend on one property. Ideally, studies of ${W}_{\mathrm{Ly}\alpha }$ and fesc in three-parameter spaces (e.g., M*, MUV, β) can yield predictive and more accurate parameterization of Lyα emission. However, such analysis must be performed in much larger data sets than ours, at least if significant enough results are to be obtained. Furthermore, hydrodynamical simulations can also give insight into how Lyα escape depends on the line of sight and the distribution of gas, dust, and star-forming regions (e.g., Verhamme et al. 2012), especially if performed in a statistically significant sample.

Figure 13. Refer to the following caption and surrounding text.

Figure 13. Results from our Monte Carlo simulation on the dependence of Lyα emission on the ${M}_{\mathrm{UV}}\mbox{--}\beta $ plane. The six parameters correspond to the coefficients in Equations (24) and (25). The histograms are obtained after collapsing the posterior distribution for every parameter. Left: $A,{W}_{0}$ dependence on MUV. Center: $A,{W}_{0}$ dependence on β. Right: additive constants for A and W0. These results suggest that the normalization of the distribution is mostly ruled by dust, whereas ${W}_{\mathrm{Ly}\alpha }$ is mostly determined by UV luminosity (see Figure 12).

Standard image High-resolution image

6. Lyα Dependence on Sample Selection

6.1. LBG Samples

The Lyman break selection technique has proven to be a very efficient method for detecting high-redshift galaxies (e.g., Steidel et al. 1996; Shapley et al. 2003; Stark et al. 2010; Ono et al. 2012). The fact that the Lyman break is in the optical region of the observed spectrum for galaxies at redshift z ≥ 3 allows for efficient detection from ground telescopes. By only requiring the use of a few broadband filters, several galaxies can be detected in a deep single exposure. Still, to avoid aliasing with the Balmer break, there are unavoidable biases associated with this technique. First, galaxies with no prominent Lyman break are, by construction, not detected. As a consequence, either extremely young or passive, heavily extincted galaxies are underrepresented in LBG surveys. Second, these surveys also impose color restrictions on the slope of galaxies, further increasing the selection toward, in principle, bluer UV objects. Third, this technique is limited by the MUV sensitivity of the survey, creating incompleteness at low SFR. In the case of Lyα emission, this observational limit can have a significant downside. As shown in Section 5.2, there is a clear anti-correlation between UV luminosity and ${W}_{\mathrm{Ly}\alpha }$, leading to the possibility that Lyα studies in LBG samples are missing the highest ${W}_{\mathrm{Ly}\alpha }$ galaxies in the universe. Indeed, the deepest surveys nowadays reach ∼50% completeness at observed ${m}_{\mathrm{UV}}\sim 29.5$ (e.g., Bouwens et al. 2015; Bowler et al. 2015, 2017; Ishigaki et al. 2017). These limits translate to higher redshift ∼50% completeness at ${M}_{\mathrm{UV}}\sim -18.3$, −19, −19.5, and −19.7 (z ∼ 4, 5, 6, and 7, respectively). These limitations become problematic when comparing the results between LBG and narrowband samples. Presumably due to not requiring a continuum detection, the ${W}_{\mathrm{Ly}\alpha }$ observed in narrowband surveys are systematically higher (e.g., Zheng et al. 2014).

Our data set and results can be useful for characterizing the effects that selection techniques can have on Lyα emission. Even though it is possible to identify these selections directly in our sample, any comprehensive analysis must consider our galaxy selection procedure. Since our sample follows neither the M* or MUV functions of the galaxy population, correcting requires assumption of M* and/or MUV distributions. Nevertheless, we can still simulate Lyα emission samples on CANDELS galaxies using our results of ${W}_{\mathrm{Ly}\alpha }$ dependence on the ${M}_{\mathrm{UV}}\mbox{--}\beta $ plane given by Equations (27) and (28). In this section, we describe our simulation of a z ∼ 4 LBG sample from the 3D-HST catalogs. We then compare the properties of LBGs with the parent distribution, focusing on M*, SFR, MUV, and β. We finally simulate Lyα fractions for each sample and conclude on the effects of LBG selection at z ∼ 4.

LBGs at z ∼ 4 are typically selected using B-dropouts and imposing color selections on redder filters. As we are simulating this survey in CANDELS, our detection limits are given by the depth of their H160 images. For the z ∼ 4 LBG selection, we adopt the same methodology applied in Bouwens et al. (2012):

Equation (29)

After color selection, we impose 5σ detections in the V606 filter, to which we now refer to as the detection band. As commonly applied in LBG surveys, we also require all candidates to have <1.5σ measurements in the U band, since the Lyman break must be redshifted out of this filter. Before performing any analysis on the CANDELS LBG sample, we run EAZY and FAST according to our prescriptions (see Section 2.3). Of the ∼3200 CANDELS galaxies that comply with our selection, ∼800 classify as low-redshift interlopers (${z}_{\mathrm{EAZY}}\lt 3$), according to our outputs. We check EAZY 3σ constraints for these interlopers and find most to have ${z}_{99}\lt 4$ . Hence, we find a low-redshift contamination in z ∼ 4 CANDELS LBGs that is higher than the typically reported number of ∼10% (Bouwens et al. 2007). From now on, we remove these presumed ${z}_{\mathrm{EAZY}}\lt 3$ contaminants and work with a z ∼ 4 LBG sample of nearly 2300 objects.

Using our outputs, we plot relevant properties of CANDELS LBGs in Figure 14. To ensure that we do not venture way beyond CANDELS completeness, we restrict the upcoming analysis to galaxies with ${M}_{* }\gt {10}^{8.5}\,{M}_{\odot }$. We construct two $3\lt z\lt 4.2$ CANDELS samples for comparison: all galaxies with ${M}_{* }\gt {10}^{8.5}\,{M}_{\odot }$ and all V-band detected ${M}_{* }\gt {10}^{8.5}\,{M}_{\odot }$ objects. As seen in the central panel, the major LBG selection effect is associated with the detection band threshold. These left-out objects are either red, heavily extincted galaxies, or blue, intrinsically faint objects (Quadri et al. 2007). Indeed, as revealed by the comparison between LBGs and the detection-band-selected sample, any selection imprints associated with color requirements are minor. It is still worth noting, however, that these color criteria seem to neglect blue rather than red galaxies at z ∼ 4 (center of Figure 14). We find the $(B-V\gt 1.1)$ Lyman break cut to be the driving criterion behind this selection effect (see Equation (29)). We remark that these insights might not be valid at different redshifts or lower M*, however. Some studies have indeed explored differences at higher redshift (e.g., Jiang et al. 2013a, 2013b, 2016). In analyses that explore Lyα emission strength, UV continuum properties, morphologies, and sizes, they find no significant differences between LBGs and LAEs at z ≥ 6.

Figure 14. Refer to the following caption and surrounding text.

Figure 14. Selection biases induced by LBG surveys. We show in red all 3 < z < 4.2 CANDELS galaxies. Blue contours represent galaxies that satisfy z ∼ 4 LBG detection band selections (${V}_{606}\gt 5\sigma $). LBGs (detection band and color selections) are shown in black, with opacity a proxy for number density. To avoid incompleteness in these samples, we only consider CANDELS objects with ${M}_{* }\gt {10}^{8.5}\,{M}_{\odot }$. Left: M*–SFR sequence for the three samples (contours represent the 1σ and 2σ confidence regions). Center: ${M}_{\mathrm{UV}}\mbox{--}\beta $ sequence for the three samples (1σ, 2σ). Right: fraction of Lyα emitters with equivalent width higher than ${W}_{\mathrm{Ly}\alpha }$ for each sample (1σ and 2σ contours; 1σ error bars). We do not observe any major biases in LBG samples besides unavoidable detection band limitations. If anything, the color selection $(B-V\gt 1.1)$ removes low-β galaxies, lowering the Lyα fraction at all ${W}_{\mathrm{Ly}\alpha }$. We remark that this analysis is only valid at $z\sim 4$ for galaxies with ${M}_{* }\gt {10}^{8.5}\,{M}_{\odot }$.

Standard image High-resolution image

The insights we present can be further tested by simulating the fraction of LAEs above an ${W}_{\mathrm{Ly}\alpha }$ threshold. We use our results from Section 5.5 to estimate the Lyα fraction for each sample and plot our results in the right panel of Figure 14. Indeed, depending on the depth of the detection band, the fraction of low UV luminosity galaxies composing the sample can vary significantly. As a consequence, deeper surveys can potentially recover a higher fraction of line emitters at high ${W}_{\mathrm{Ly}\alpha }$. To deal with this selection, it has been proposed to compare the Lyα fractions over narrow UV luminosity samples (Stark et al. 2011; Mallery et al. 2012; Ono et al. 2012; Schenker et al. 2012). Our results in this paper further emphasize the need to compare galaxies of the same luminosity and properly characterize completeness when studying the evolution of the Lyα fraction with cosmic time. Regarding color selections, we find the Lyα fraction to be slightly lower for LBGs than for the V-band-detected sample. As we suggested, this is a consequence of LBG color cuts leaving out some of the bluest galaxies.

6.2. Narrowband Samples

Lyα-emitting galaxies can also be selected using narrowband imaging or blind spectroscopy (e.g., Gronwall et al. 2007; Ouchi et al. 2008; Adams et al. 2011). By establishing a ${W}_{\mathrm{Ly}\alpha }$ detection threshold between the narrow- and broadband flux measurement, this technique allows for efficient line-emitter selection. Since only line detection is required, such surveys can trace fainter objects than the LBG technique, possibly leading to selection of the youngest and faintest galaxies at high redshift. However, even though most narrowband measurements of LAEs are followed up by spectroscopy, any sample selection effects induced by the narrowband technique are already present in the sample. The ${W}_{\mathrm{Ly}\alpha }$ threshold used for detection, which is determined by the ability to separate low-redshift interlopers from high-redshift LAEs, can adopt a wide range of values. Depending on redshift, observer frame thresholds translate into different rest-frame ${W}_{\mathrm{Ly}\alpha }$ cuts. For instance, Vargas et al. (2014) select sources with ${W}_{\mathrm{Ly}\alpha }\gt 20$ Å at $z\sim 2.1$, Zheng et al. (2014) select ${W}_{\mathrm{Ly}\alpha }\gt 9$ Å at z ∼ 4.5, and Sobral et al. (2017) go as low as ${W}_{\mathrm{Ly}\alpha }\gt 5$ Å at z ∼ 2.23. If ${W}_{\mathrm{Ly}\alpha }$ selections induce important biases on galaxy samples, the comparison of different surveys is not straightforward.

In this section, we explore the effects that narrowband selections have on the population of LAEs, focusing on the M*–SFR plane. The insights we present here are based on Equations (27) and (28), i.e., our ${M}_{\mathrm{UV}}\mbox{--}\beta $ model. We show in Figure 15 the outcome of ${W}_{\mathrm{Ly}\alpha }$ (top) and line-flux selections (bottom) on the M*–SFR sequence. The red contours show 3.5 < z < 4.5 CANDELS galaxies with $M\gt {10}^{8.5}\,{M}_{\odot }$. For clarity, however, we concentrate our analysis on the main sequence, removing objects with young and old ages (red dots). This approach allows our results to be dominated by objects optimally fitted by our FAST executions.

Figure 15. Refer to the following caption and surrounding text.

Figure 15. Selection effects produced by narrowband-selected samples of LAEs. For the parent sample, we show as red contours the M*–SFR sequence of 3.5 < z < 4.5 CANDELS galaxies with ${M}_{* }\gt {10}^{8.5}\,{M}_{\odot }$. We adopt this value as the threshold for which CANDELS at z ∼ 4 is fairly complete in GOODS-S (Duncan et al. 2014). For clarity, we also remove CANDELS galaxies in the young and old age tracks (red data points). Top: effect of ${W}_{\mathrm{Ly}\alpha }$ selections on LAE samples. We simulate the resulting LAE samples under no ${W}_{\mathrm{Ly}\alpha }$ selections (black), ${W}_{\mathrm{Ly}\alpha }\gt 5$ Å (Sobral et al. 2017; blue), and ${W}_{\mathrm{Ly}\alpha }\gt 20$ Å (green). ${W}_{\mathrm{Ly}\alpha }$ cuts are more likely to remove high-M*, high-SFR LAEs, but the effect for typical survey thresholds (${W}_{\mathrm{Ly}\alpha }\leqslant 20$) is minor. Bottom: effect of Lyα luminosity (${L}_{\mathrm{Ly}\alpha }$) selections on LAE samples. We simulate the region sampled by the narrowband-selected surveys described in Vargas et al. (2014) and Hagen et al. (2014; blue and green, respectively). We predict the deeper observations of Vargas et al. (2014) to statistically measure lower-SFR LAEs than Hagen et al. (2014). Hence, luminosity selections remove mostly low-SFR objects. The flux limitations of these surveys have been translated to luminosity in order to be simulated at z ∼ 4.

Standard image High-resolution image

To explore the effect of ${W}_{\mathrm{Ly}\alpha }$ selections (top panel of Figure 15), we base our analysis on the z ∼ 2.23 narrowband survey of Sobral et al. (2017), whose galaxies are limited to ${W}_{\mathrm{Ly}\alpha }\gt 5$ Å and fluxes $\gt 2\times {10}^{-17}$ erg cm−2 s−1 (5σ). We simulate a flux-limited-only Lyα sample and use it as baseline (black). We then show the region where ${W}_{\mathrm{Ly}\alpha }\,\gt 5$ Å (1σ; blue) and ${W}_{\mathrm{Ly}\alpha }\gt 20$ Å (1σ; green) objects lie. The ${W}_{\mathrm{Ly}\alpha }\gt 5$ Å contours are intended to reproduce Sobral et al. (2017) selections, whereas the more restrictive selection ${W}_{\mathrm{Ly}\alpha }\gt 20$ Å is representative of multiple surveys (Gronwall et al. 2007; Hagen et al. 2014; Vargas et al. 2014). As confirmed by this plot, galaxies with higher characteristic ${W}_{\mathrm{Ly}\alpha }$ (i.e., low M*, low SFR) are more likely to be selected by narrowband samples, even though the effect is minor for the typical narrowband cuts of ${W}_{\mathrm{Ly}\alpha }\leqslant 20$ Å. Still, selections based solely on ${W}_{\mathrm{Ly}\alpha }$ systematically fail to remove AGNs and neglect bright Lyα emitters (Sobral et al. 2017). All these insights remark on the importance of using low ${W}_{\mathrm{Ly}\alpha }$ cuts and thorough interloper controls.

After narrowband outcomes are used for sample selection, spectroscopic observations follow. However, these follow-ups of Lyα emitting galaxies are flux limited, just like the ones presented in this work. Still, as we consider completeness in our modeling and simulations of Section 5.5, we can still make inferences on flux limited studies. We use our Monte Carlo simulation outputs to also assess the effects of line luminosity selections. Using the corresponding M*, SFR, and fUV for every object, we obtain the probability of ${L}_{\mathrm{Ly}\alpha }\gt {L}_{\mathrm{Ly}\alpha }^{* }$ for every galaxy. The outcome (bottom panel of Figure 15) reveals that flux selections bias samples toward high-SFR LAEs.

Discrepancies in the location of LAEs in the M*–SFR plane have already been observed in the literature. In Figure 10 of their paper, Hagen et al. (2014) plot the M*–SFR relation of their z ∼ 2–3 LAEs alongside the z ∼ 2.1 counterparts of Vargas et al. (2014). The Hagen et al. (2014) objects are part of the HETDEX Pilot Survey (Adams et al. 2011), for which detections are constrained to line fluxes $\gt 7\mbox{--}10\,\times {10}^{-17}$ erg cm−2 s−1 (5σ) and ${W}_{\mathrm{Ly}\alpha }\gt 20$ Å. In contrast, the Vargas et al. (2014) flux depth is higher than that of HETDEX, reaching $2\times {10}^{-17}$ erg cm−2 s−1 (5σ), while also selecting sources with ${W}_{\mathrm{Ly}\alpha }\gt 20\,\mathring{\rm A} $. Therefore, Hagen et al. (2014) are comparing two ${W}_{\mathrm{Ly}\alpha }$-selected surveys with different line-flux depths. Our results in this section would, then, predict Vargas et al. (2014) LAEs to sample lower M* and SFRs because they go deeper (see $\sim {10}^{9}\,{M}_{\odot }$ galaxies in Figure 15). This is in fact the pattern observed in Figure 10 of Hagen et al. (2014), i.e., we can qualitatively reproduce their comparison with our simulations from the results of Section 5.5. This explanation is backed by the fact that these discrepancies are not caused by inconsistent M* or SFR derivations. Both studies assume a Salpeter (1955) IMF, cSFHs, and a fixed metallicity of $Z=0.2\,{Z}_{\odot }$.

Vargas et al. (2014) and Hagen et al. (2014) find ${M}_{* }\lt {10}^{9}\,{M}_{\odot }$ LAEs to lie above the M*–SFR relation. A similar trend is reported in the work of Karman et al. (2017), who study Lyα emitters down to 106 M. Karman et al. (2017) point out that this offset can be a consequence of how uncertain SFRs are for low-M* starburst galaxies. This explanation is consistent with the fact that most of our ${M}_{* }\lt {10}^{9}\,{M}_{\odot }$ Lyα emitters are constrained to the lowest age locus of the M*–SFR plane (see Figure 6). Moreover, Finkelstein et al. (2015) observe the same lowest age, low -M* Lyα emitters at $z\sim 4.5$. To answer whether low -M* LAEs lie above star-forming galaxies or the slope of the sequence changes toward ${M}_{* }\lt {10}^{9}\,{M}_{\odot }$, different approaches are required. For instance, no discrepancies are found between z ∼ 2 LAEs and Hα sources (see Matthee et al. 2016), although most of their galaxies have ${M}_{* }\gt {10}^{9}\,{M}_{\odot }$. In summary, evidence suggests that z > 2, ${M}_{* }\lt {10}^{9}\,{M}_{\odot }$ LAEs lie above extrapolations of the M*–SFR relation. It is unclear whether this offset is real or the position of the relation at low M* is far from certain.

7. Inferences on the ${\bf{4}}{\boldsymbol{\leqslant }}{\bf{z}}{\boldsymbol{\leqslant }}{\bf{7}}$ Lyα Fraction

Probably the most important use for Lyα emission is tracing the neutral hydrogen fraction in the IGM. Several studies over the last decade have constrained the fraction of Lyα-emitting galaxies as a function of redshift (Stark et al. 2010, 2011; Ono et al. 2012; Schenker et al. 2012; Tilvi et al. 2014; Cassata et al. 2015), bringing us closer to the goal of constraining the epoch of reionization. The Lyα fraction is fairly well understood at z < 5 (Stark et al. 2011; Cassata et al. 2015), with most efforts nowadays focusing on z ∼ 7, 8 (Ono et al. 2012; Treu et al. 2013; Tilvi et al. 2014; Furusawa et al. 2016). Along these lines, our characterization of ${W}_{\mathrm{Ly}\alpha }$ on the ${M}_{\mathrm{UV}}\mbox{--}\beta $ plane can be used to simulate ${W}_{\mathrm{Ly}\alpha }$ distributions at higher redshift. In this section, we apply the observed ${M}_{\mathrm{UV}}\mbox{--}\beta $ relations from Bouwens et al. (2014) and LFs from Bouwens et al. (2015) to simulate high-redshift ${W}_{\mathrm{Ly}\alpha }$ distributions. By means of these simulations, we can predict the Lyα fraction in galaxies up to z ∼ 7, providing the first semi-analytical constraint to this tracer toward the reionization epoch. Furthermore, we also simulate dropouts from CANDELS photometry to explore the effects of observational limitations on the inferred Lyα fractions at high redshift.

Our results from this section are based on assuming the same z ∼ 4 ${W}_{\mathrm{Ly}\alpha }$ dependence on ${M}_{\mathrm{UV}}\mbox{--}\beta $ at every redshift. We remark that testing this assumption with currently available data sets is challenging, since different sample selections and incompleteness levels can affect the observed ${W}_{\mathrm{Ly}\alpha }$ dependence on the ${M}_{\mathrm{UV}}\mbox{--}\beta $ plane. It is also worth noting that the analysis described in this section does not account for any effects related to changes in the merger fraction, IGM opacity (e.g., Gunn & Peterson 1965; Becker et al. 2001; Fan et al. 2006), and/or conditions of the ISM with cosmic time (e.g., Carilli & Walter 2013). As the number of complete, unbiased Lyα surveys grows (e.g., Cassata et al. 2015; Hathi et al. 2016; this work), we will be able to test these assumptions and explore their dependence on redshift.

We now detail our simulation of high-redshift LBG samples. For the LFs, we use Bouwens et al.'s (2015) best Schechter parameters (excluding CANDELS-EGS). We start by drawing objects following the LF and then associate a UV slope according to the best-fit relations from Table 3 of Bouwens et al. (2014). We finish by adding an intrinsic scatter of 0.35 to the slopes at every redshift (Bouwens et al. 2012, 2014; Castellano et al. 2012). Similarly, we perform an analogue procedure following the MUV distribution of CANDELS LBGs, which allows us to assess the effect of magnitude incompleteness. In order to do so, we make use of the optical and IR photometry publicly available from the 3D-HST catalogs (Skelton et al. 2014). For every redshift, we use the selections from Bouwens et al. (2015), since they are based on CANDELS photometry:

Equation (30)

Equation (31)

Equation (32)

Equation (33)

Similarly to our Section 6.1 selection, we impose S/N cuts in the detection bands. For z ∼ 4 and 5 dropouts, we require 3σ detections in the V606, z850 bands. For z ∼ 6 and 7 dropouts, we impose at least 3σ detections in the Y102 and J125 bands, respectively. To associate a dropout redshift with every galaxy, however, we use the photometric redshifts from 3D-HST instead of the actual dropout from where it was selected. By doing so, we are in agreement with the methodology of Bouwens et al. (2015). We finally complete these CANDELS LBG samples by associating UV slopes to every galaxy. Just as for the complete sample, we associate the slopes following the ${M}_{\mathrm{UV}}\mbox{--}\beta $ relations from Bouwens et al. (2014).

We present in Figure 16 the assumed ${M}_{\mathrm{UV}}\mbox{--}\beta $ sequences and the UV luminosity distribution for both sets under consideration. Since we are starting from the Bouwens et al. (2014) ${M}_{\mathrm{UV}}\mbox{--}\beta $ characterization, the steepness of the linear relation increases with redshift. The contours we show in Figure 16 (left panel) are obtained after incorporating the intrinsic scatter, which we assume to be of 0.35. Note that the addition of such a scatter does not render the redshift evolution of the ${M}_{\mathrm{UV}}\mbox{--}\beta $ distribution negligible. According to our Equations (27) and (28), this implies that the probability of observing ${W}_{\mathrm{Ly}\alpha }$ in emission at fixed MUV should increase with redshift. The right panel of Figure 16 shows the absolute magnitude distribution of 4 ≤ z ≤ 7 LBGs as found by Bouwens et al. (2015; solid lines). Note the growth of the LF as a function of redshift. Since the LF becomes steeper as the redshift increases, the probability of observing a high-${W}_{\mathrm{Ly}\alpha }$ galaxy should also increase. One more important point regarding Figure 16 must be noted. As initially suggested, CANDELS incompleteness affects the observed MUV distribution of LBGs (shown as crosses). The fact that there is more contribution from brighter galaxies could, in principle, bias samples toward lower ${W}_{\mathrm{Ly}\alpha }$ when compared to the LFs. We now focus on quantifying all these effects within the framework of our model.

Figure 16. Refer to the following caption and surrounding text.

Figure 16. Simulated LBGs that we use to predict the Lyα fraction in LBG samples as a function of redshift. Left: 1σ contours of the ${M}_{\mathrm{UV}}\mbox{--}\beta $ relations we use for these 4 ≤ z ≤ 7 simulations. We start from the best-fit relations found by Bouwens et al. (2014) and then add an intrinsic scatter of 0.35 to the relation (Bouwens et al. 2012, 2014; Castellano et al. 2012). Right: MUV distribution of our simulated LBGs. The UV complete samples (solid lines) are drawn following the LFs from Bouwens et al. (2015). The incomplete samples (crosses) are obtained after simulating an LBG selection from CANDELS. The LFs are normalized to the number of CANDELS LBGs at ${M}_{\mathrm{UV}}\sim -20.25$. In order to construct our samples, we first draw from the corresponding MUV distribution (right panel) and then associate a UV slope following the ${M}_{\mathrm{UV}}\mbox{--}\beta $ relation (left panel). For the resulting Lyα fractions, refer to Figure 17.

Standard image High-resolution image

We simulate the ${W}_{\mathrm{Ly}\alpha }$ distributions for the two sample sets we constructed. To do so, we use the MUV and β of each object to draw ${W}_{\mathrm{Ly}\alpha }$ distributions following our sampling of Equations (27) and (28). This methodology allows us to simulate the ${W}_{\mathrm{Ly}\alpha }$ distribution for the number of galaxies in every redshift subsample, properly taking into account the uncertainties on each ${M}_{\mathrm{UV}}\mbox{--}\beta $ domain and the errors in our modeling. We present the corresponding 4 ≤ z ≤ 7 Lyα fractions for such galaxies in Figure 17. To this end, we select using the thresholds ${W}_{\mathrm{Ly}\alpha }^{* }=25$ Å and ${W}_{\mathrm{Ly}\alpha }^{* }=55$ Å (Stark et al. 2011). Still, as we note in Section 5.3, LBGs with MUV distributions dominated by fainter objects can yield higher fractions of Lyα. In recent works, characterization has been performed separately in galaxies with ${M}_{\mathrm{UV}}\lt -20.25$ and ${M}_{\mathrm{UV}}\gt -20.25$ (Stark et al. 2011; Ono et al. 2012; Schenker et al. 2012; Tilvi et al. 2014; Cassata et al. 2015). Following the same approach, we now focus our analysis on samples in the ranges $-21.75\lt {M}_{\mathrm{UV}}\lt -20.25$ and $-20.25\lt {M}_{\mathrm{UV}}\lt -18.75$ (Figure 16), which are well covered by our data set (see Figure 12).

Figure 17. Refer to the following caption and surrounding text.

Figure 17. Lyα fraction in brighter ($-21.75\lt {M}_{\mathrm{UV}}\lt -20.25$; left) and fainter ($-20.25\lt {M}_{\mathrm{UV}}\lt -18.75$; right) LBGs as a function of redshift. We show the fraction for ${W}_{\mathrm{Ly}\alpha }\gt 25$ Å (top) and ${W}_{\mathrm{Ly}\alpha }\gt 55$ Å (bottom). Plotted data points include observational constraints from Fontana et al. (2010), Pentericci et al. (2011), Stark et al. (2011), Ono et al. (2012), Schenker et al. (2012), and Treu et al. (2013). The data point from Ono et al. (2012) is a compilation of results from Fontana et al. (2010), Pentericci et al. (2011), Vanzella et al. (2011), Schenker et al. (2012), and Ono et al. (2012). Our z ∼ 4 characterization of ${W}_{\mathrm{Ly}\alpha }$ in the ${M}_{\mathrm{UV}}\mbox{--}\beta $ plane allows us to predict the Lyα fraction as if the universe were ionized. We show as black circles our constraints based on the ${M}_{\mathrm{UV}}\mbox{--}\beta $ relations from Bouwens et al. (2014) and UV LFs from Bouwens et al. (2015). These simulations suggest that the fraction of Lyα-emitting LBGs steadily increases up to z ∼ 7. If we perform the exact same analysis adopting the MUV distribution of CANDELS LBGs (refer to Figure 16), we recover the fractions shown as crosses. Note how MUV incompleteness artificially lowers the Lyα fraction. This effect must be accounted for when associating Lyα drops with reionization. Keep in mind that our model assumes an exponential ${W}_{\mathrm{Ly}\alpha }$ distribution, which might underestimate the fraction for the tail of the distribution (e.g., ${W}_{\mathrm{Ly}\alpha }\gt 55$ Å; see Figure 5), especially in populations with high characteristic ${W}_{\mathrm{Ly}\alpha }$ (e.g., $-20.25\lt {M}_{\mathrm{UV}}\lt -18.75$; see Figure 12).

Standard image High-resolution image

We plot in Figure 17 the resulting Lyα fractions. We find the complete fractions (i.e., derived from the LFs; black circles) to be consistent with a steady increase up to z ∼ 7, which has been previously assumed as baseline for measuring drops in the Lyα fraction (Stark et al. 2010, 2011; Ono et al. 2012). Apart from backing this assumption, our result provides a more robust baseline for comparison. We reiterate that a closer look at Figure 16 reveals the explanation for such a trend, at least within our approach. First, the LF of LBGs becomes steeper as redshift increases (Bouwens et al. 2015). Second, the ${M}_{\mathrm{UV}}\mbox{--}\beta $ relation favors bluer slopes as redshift increases. The combination of both relationships explain an increase in the Lyα fraction as a function of redshift for LBGs. It has already been discussed (Blanc et al. 2011; Cassata et al. 2015) how the growth in the Lyα fraction is likely tied to changes in the fesc of Lyα photons, at least for 3 < z < 5. This change in fesc is likely tied to evolution in the dust content and ISM of galaxies, which we are empirically accounting for in this model. Under this picture, Lyman continuum (LyC) photon escape should follow a similar trend. Indeed, Faisst (2016) empirically predicts that LyC photon escape fraction increases with redshift at fixed M*. In summary, our results are in strong agreement with the picture that UV photon escape fraction from galaxies tends to decrease with cosmic time.

From the literature, we include in Figure 17 the measurements from Fontana et al. (2010), Pentericci et al. (2011), Stark et al. (2011), Vanzella et al. (2011), Ono et al. (2012), Schenker et al. (2012), and Treu et al. (2013). The overall literature trend is an increase in the Lyα fraction up to z ∼ 5, which flattens at z ∼ 6 and drops toward z ∼ 7. Departures from this increasing regime are typically associated with changes in the transparency of the IGM, with drops toward z ∼ 7 associated with a more neutral universe (Stark et al. 2010; Ono et al. 2012). Even though such inferences might be correct, the contrast between the completeness-corrected (black circles) and -uncorrected (black crosses) fractions in Figure 17 suggests that the driving factor behind these z ∼ 7 drops might be sample MUV incompleteness. Our simulated z ≥ 6 CANDELS LBG samples do not follow the corresponding LF as a result of survey depth (see Figure 16), leading them to include more luminous objects than a population-representative sample. Therefore, as seen in Figure 17, drops in the Lyα fraction are induced. As we point out in Section 6.1, the deepest LBG surveys nowadays (e.g., Bouwens et al. 2015; Bowler et al. 2015, 2017; Ishigaki et al. 2017) reach ∼50% completeness at ${M}_{\mathrm{UV}}\sim -18.3$, −19, −19.5, and −19.7 for z ∼ 4, 5, 6, and 7, respectively. Therefore, Lyα studies of even the deepest LBG surveys are sufficiently incomplete to bias the reported Lyα fractions (see Figures 16 and 17). Taking into account this effect is essential if we are to use Lyα to further constrain the reionization epoch during the next decade.

It is important to clarify the limitations of our modeling. First, our characterization is based on the whole ${W}_{\mathrm{Ly}\alpha }$ distribution, which may misrepresent the actual fraction of high-${W}_{\mathrm{Ly}\alpha }$ galaxies. In fact, the analysis we perform in Section 4 reveals that a log-normal profile might be better for reproducing the tail of the distribution (Figure 5). For example, Zheng et al. (2014) find an exponential profile to only adequately represent the low-${W}_{\mathrm{Ly}\alpha }$ end of the distribution, leading them to model the high-${W}_{\mathrm{Ly}\alpha }$ tail separately. As a consequence, we encourage the reader to focus most of their analysis on our ${W}_{\mathrm{Ly}\alpha }^{* }=25$ Å results (top panel in Figure 17). Along those lines, we expect our modeling to underestimate the fraction for populations with significantly higher-${W}_{\mathrm{Ly}\alpha }$ tails. Indeed, our fractions for galaxies with $-20.25\,\lt {M}_{\mathrm{UV}}\lt -18.75$ are systematically lower than Stark et al. (2011) measurements (see Figure 17).

In summary, our results suggest that the Lyα fraction steadily increases up to z ∼ 7 in an ionized universe. However, drops in this fraction are not necessarily related to changes in the opacity of the IGM, since we find that sample MUV incompleteness can alternatively explain them. In fact, the uncertainties in both literature measurements and our predictions are still large enough to set assertive constraints on the Lyα trends toward z ∼ 7 (see Robertson et al. 2015). Our results, therefore, further highlight that Lyα constraints on zre (the redshift at which the fraction of ionized hydrogen is 0.5) are still inconclusive.

It is worth discussing whether ${z}_{{re}}\gt 7$ is in agreement with observational reionization constraints besides Lyα. Recently, the Planck Collaboration et al. (2016a) analysis of CMB anisotropies has yielded ${z}_{{re}}\sim {8.2}_{-1.2}^{+1.0}$. Systematic detection of Lyα-emitting galaxies in the range z ∼ 7–9 (Zitrin et al. 2015; Furusawa et al. 2016; Stark et al. 2017) still leads to the question whether reionization is already in place at z ∼ 7. Several studies have also explored Lyα LFs toward z ∼ 9, although observational limitations require deeper/wider surveys to obtain more meaningful conclusions (Sobral et al. 2009). Therefore, and since reionization is predicted to happen so rapidly (Finlator et al. 2009), claims of ${z}_{{re}}\gt 7$ are not in tension with measurements of the Lyα fraction at z ∼ 7. Interestingly enough, it is not even clear whether reionization happens first around fainter, lower density environments ("outside-in," Kashikawa et al. 2006), or brighter, higher density regions ("inside-out," Santos et al. 2016). This overall picture suggests that the role of MUV in reionization studies will no longer restrict itself to sample control, but it will also extend to parameterizing property. Therefore, the natural approach for future studies would be to constrain reionization as a smooth function of both redshift and UV luminosity/environment (e.g., Robertson et al. 2013).

8. Conclusions

In this work, we present an exhaustive analysis of Lyα emission at $3\lt z\lt 4.6$. To this end, we M*-select 625 galaxies from the CANDELS survey, allowing us to study Lyα emission over a diverse and heterogeneous galaxy sample. We conduct spectroscopic observations of our targets with the M2FS, a multi-fiber spectrograph at the Clay 6.5 m telescope. We then use a Bayesian approach for proper statistical handling of our results. By means of this framework, we are capable of characterizing Lyα emission in the high-redshift galaxy population. In summary, our conclusions are the following.

1. We present a Bayesian methodology to measure the ${W}_{\mathrm{Ly}\alpha }$ distribution considering the completeness in fluxes measured and the uncertainties in both spectroscopy and photometry. We combine this approach with Monte Carlo simulations for robust modeling of observed trends. Combining all of these features allows us to properly state the significance and uncertainties in our results.

2. We take advantage of the Bayesian framework to give an insight into how to compare multiple ${W}_{\mathrm{Ly}\alpha }$ distribution models from a quantitative standpoint. In this paper, we explore the extent to which exponential, Gaussian, and log-normal probability distributions can reproduce ${W}_{\mathrm{Ly}\alpha }$ measurements. We conclude that, in our case, an exponential profile is the most adequate. However, as we find in Section 5, this profile can struggle to reproduce the tail of high-${W}_{\mathrm{Ly}\alpha }$ populations (e.g., samples with low M*, SFR, UV luminosity, or β).

3. Our measured ${W}_{\mathrm{Ly}\alpha }$ and fesc strongly anti-correlate with M*. We associate these trends with the higher dust fraction and gas mass in more massive galaxies, boosting the scattering and absorption of Lyα photons. We model both exponential ${W}_{\mathrm{Ly}\alpha }$ distribution parameters ($A,{W}_{0}$) using linear relations dependent on M*, and find them both to anti-correlate with M*. Our modeling is also capable of reproducing our observed ${W}_{\mathrm{Ly}\alpha }$ distributions when binning the sample (Figure 8).

4. We also explore the dependence of Lyα emission on MUV. We find ${W}_{\mathrm{Ly}\alpha }$ and fesc to be typically higher for UV faint objects, which have been previously observed in the literature (Stark et al. 2010; Schaerer et al. 2011). Since z ∼ 4 galaxies seem to follow an ${M}_{* }-{M}_{\mathrm{UV}}$ sequence (González et al. 2014), this result is consistent with our M* trends. This confirms that the role played by higher dust fraction and gas mass rules ${W}_{\mathrm{Ly}\alpha }$ dependence on UV luminosity. We also observe ${W}_{\mathrm{Ly}\alpha }$ and fesc to anti-correlate with the SFR and UV slope, confirming the qualitative Lyα escape scheme presented.

5. The relatively uniform location of our targets in the ${M}_{\mathrm{UV}}\mbox{--}\beta $ plane allows us to characterize the ${W}_{\mathrm{Ly}\alpha }$ distribution in this space. Once more, we use a Bayesian framework and Monte Carlo simulations to obtain linear representations of exponential ${W}_{\mathrm{Ly}\alpha }$ distribution parameters. Our results suggest that the probability of observing Lyα emission from a galaxy is determined by its dust content, whereas the actual magnitude of ${W}_{\mathrm{Ly}\alpha }$ is mostly ruled by the UV luminosity of the object. This characterization also allows us to simulate ${W}_{\mathrm{Ly}\alpha }$ distributions from ${M}_{\mathrm{UV}}\mbox{--}\beta $ galaxy samples (see below).

6. Using CANDELS, we mimic a $z\sim 4$ LBG survey based on B-dropouts. We simulate ${W}_{\mathrm{Ly}\alpha }$ distributions from the LBG sample and verify that the lower probability of measuring high ${W}_{\mathrm{Ly}\alpha }$ when compared to the population is mostly a consequence of detection band limitations. In other words, shallower LBG surveys are bound to include fewer low-M* galaxies in their samples, decreasing the probability of including high-${W}_{\mathrm{Ly}\alpha }$ objects. We find color constraints imposed by the LBG technique to also lower the Lyα fraction, although the effect is minor. These results highlight the importance of comparing surveys with similar sensitivities and MUV distributions.

7. Our results on the ${M}_{\mathrm{UV}}\mbox{--}\beta $ plane also allow us to explore the effects ${W}_{\mathrm{Ly}\alpha }$ and line-flux limitations induce on Lyα narrowband surveys. We find ${W}_{\mathrm{Ly}\alpha }$ cuts bias samples toward low-M* objects, whereas flux limitations seem to preferentially leave out low-SFR galaxies. These insights contribute to explain the location of LAEs within the main sequence (e.g., some of the differences between Vargas et al. 2014 and Hagen et al. 2014).

8. We generate 4 ≤ z ≤ 7 LBG samples following reported ${M}_{\mathrm{UV}}\mbox{--}\beta $ relations and LFs from the literature (Bouwens et al. 2014, 2015). Assuming the same ${W}_{\mathrm{Ly}\alpha }$ dependence on ${M}_{\mathrm{UV}}\mbox{--}\beta $ that we find at z ∼ 4, we estimate the fraction of Lyα-emitting galaxies in LBG samples above an ${W}_{\mathrm{Ly}\alpha }$ threshold. Our findings are consistent with observational measurements in the literature, that suggest an increase in the fraction up to z ∼ 6 (Stark et al. 2010, 2011; Cassata et al. 2015). This result constitutes the first semi-analytical constraint to the Lyα fraction at z ∼ 7, replacing extrapolations of lower redshift regimes.

9. We simulate the 4 ≤ z ≤ 7 Lyα fraction in LBG samples drawing from the CANDELS MUV distribution. We conclude that MUV incompleteness can lower the Lyα fraction, reproducing drops at z ∼ 7. Claims of drops at this redshift need to thoroughly account for this effect if they are to be used to constrain the reionization epoch. This result highlights that reported drops in the Lyα fraction are not inconsistent with ${z}_{{re}}\gt 7$. Given the uncertainties in both predictions and measurements, more work is needed to conclude on the actual Lyα trends toward z ∼ 7 and beyond.

We thank the referee for useful feedback that improved the quality of this paper. G.O. was supported by CONICYT, Beca Magíster Nacional 2014, Folio 22140924. G.B. was supported by CONICYT/FONDECYT, Programa de Iniciación, Folio 11150220. V.G. was supported by CONICYT/FONDECYT, Programa de Iniciación, Folio 11160832. We also acknowledge the NSF/MRI grant for the M2FS, number AST 0923160. This work is based on observations taken by the 3D-HST Treasury Program (GO 12177 and 12328) with the NASA/ESA HST, which is operated by the Association of Universities for Research in Astronomy, Inc., under NASA contract NAS5-26555. This research has made use of the NASA/IPAC Extragalactic Database (NED), which is operated by the Jet Propulsion Laboratory, California Institute of Technology, under contract with the National Aeronautics and Space Administration. This paper includes data gathered with the 6.5 meter Magellan Telescopes located at Las Campanas Observatory, Chile.

Footnotes

  • For the redshift range of our galaxies, the Hβ, O ii, and O iii emission lines fall between the H160 and 3.6 μm bands. Hα only contributes to the 3.6 μm Spitzer/IRAC band for z > 3.75.

  • The NASA/IPAC Extragalactic Database (NED) is operated by the Jet Propulsion Laboratory, California Institute of Technology, under contract with the National Aeronautics and Space Administration.

Please wait… references are loading.
10.3847/1538-4357/aa7552