Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

A publishing partnership

The following article is Open access

TESS Eclipsing Binary Stars. I. Short-cadence Observations of 4584 Eclipsing Binaries in Sectors 1–26

, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and

Published 2022 January 12 © 2022. The Author(s). Published by the American Astronomical Society.
, , Citation Andrej Prša et al 2022 ApJS 258 16 DOI 10.3847/1538-4365/ac324a

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0067-0049/258/1/16

Abstract

In this paper we present a catalog of 4584 eclipsing binaries observed during the first two years (26 sectors) of the TESS survey. We discuss selection criteria for eclipsing binary candidates, detection of hitherto unknown eclipsing systems, determination of the ephemerides, the validation and triage process, and the derivation of heuristic estimates for the ephemerides. Instead of keeping to the widely used discrete classes, we propose a binary star morphology classification based on a dimensionality reduction algorithm. Finally, we present statistical properties of the sample, we qualitatively estimate completeness, and we discuss the results. The work presented here is organized and performed within the TESS Eclipsing Binary Working Group, an open group of professional and citizen scientists; we conclude by describing ongoing work and future goals for the group. The catalog is available from http://tessEBs.villanova.edu and from MAST.

Export citation and abstract BibTeX RIS

Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

The Transiting Exoplanet Survey Satellite (TESS; Ricker et al. 2015) was launched in 2018 April; during its 2 yr prime mission, it monitored ∼200,000 bright stars for exoplanets across the sky with a 2 minute short cadence. In addition, TESS acquired full-frame images (FFIs) every 30 minutes. TESS is currently in its extended mission, where targets are observed with a 2 minute and a 20 s cadence, and FFIs are acquired every 10 minutes. The primary TESS mission has been to discover and characterize exoplanets, but TESS data enable a much broader swath of science—all fields that benefit from precise time series of bright stars, in fact—including eclipsing binary systems.

Eclipsing binaries (EBs) serve as one of the pillars of stellar astrophysics. The well-understood laws of motion that govern binarity and the alignment with the line of sight make their analysis a tractable geometrical problem (Prša 2018), yielding accurate masses, radii, temperatures, and luminosities of EB components (Torres et al. 2010). Because of that, EBs are used to calibrate stellar models across the Hertzsprung–Russell diagram (Serenelli et al. 2021), rendering them important to essentially every field in astronomy.

Binary stars are ubiquitous: more than half the stars with masses of 1 M or higher are found in binary or multiple systems (Raghavan et al. 2010; Sana' et al. 2012; Moe & Di Stefano 2017). Binaries are thus a natural product of star formation and make up a large fraction of the visible universe. Understanding binaries means understanding stellar formation and evolution (Stacy et al. 2010), internal stellar structure by way of tidal interactions, and/or tidally induced pulsations (Thompson et al. 2012), accretion physics in semi-detached binaries (Bisikalo 2010), and much more. At the same time, there are many open questions that remain; for example, what mechanism (or combination of mechanisms) drives multiplicity rates (Duchêne & Kraus 2013), what determines the distribution of mass ratios (Wells & Prša 2021), how does (close) binarity affect stellar evolution of stars (blue stragglers, yellow giants, magnetic interaction; Mathieu & Geller 2009), how does orbital tightening work given that the Kozai–Lidov cycles might not fully explain it (Hwang & Zakamska 2020), and how do exoplanets that orbit binary stars form and evolve (Paardekooper et al. 2012)? It is eclipsing binaries that hold the answer to these and similar questions. Astronomers have studied EBs for over two centuries, ever since John Goodricke suggested in 1782 that eclipses are responsible for Algol's brightness variation. So why are the answers so elusive?

The principal reason for the remaining open questions is that traditional observational techniques (ground-based photometry, follow-up spectroscopy, long baseline interferometry) are best suited to single objects and are time-consuming. Thus, it is difficult to draw inferences on the entire population of binaries. The first mission that made a significant breakthrough in the EB science was Kepler (Kirk et al. 2016a), but the downside of Kepler is that its targets are faint (i.e., difficult to follow up), limited to a single ∼100 deg2 field, and telemetry and onboard storage did not allow FFIs to be sent back to Earth. Those obstacles are largely overcome by TESS: TESS observes on the bright end, targets are sourced across the sky, and we have 10 minute FFIs. These benefits come at the expense of large (21'') pixels adversely affecting crowded fields, but TESS still serves as a proverbial gold mine for EBs away from the Galactic plane.

EBs play a less celebrated role in exoplanetary science, where ∼40% of false positives at low Galactic latitudes are attributed to their diluted light curves (Morton & Johnson 2011). About 25% of TESS objects of interest (TOIs) that were examined by ground-based photometric follow-up turned out to be background binaries. Thus, having a good census of EBs feeds back to identifying false positives before pointing costly follow-up telescopes in their direction.

In this paper, the first in the TESS EB series, we present a sample of 4584 EBs observed by TESS in the first 26 sectors of observation. In Section 2 we describe light-curve detection and identification, in Section 3 we explain how ephemerides for each system are determined and refined, in Section 4 we present an automated data validation pipeline called ICED LATTE, in Section 5 we focus on statistical properties of the EB sample and qualitatively assess completeness, in Section 6 we describe the contents of the catalog, and finally, in Section 7 we discuss some of the most interesting results and provide a list of ongoing projects and future goals for the working group.

2. Detection of TESS EBs

Extracting a sample of EBs from the observations of ∼200,000 2 minute cadence light curves involved multiple complementing efforts. We describe these efforts here.

Proposed targets: As part of the TESS Guest Investigator program, we proposed 3889 targets in Cycle 1 and 3067 targets in Cycle 2. The targets were selected from all public binary star catalogs served on VizieR (Ochsenbein et al. 2000), the General Catalog of Variable Stars (GCVS; Samus et al. 2017), and the Spectroscopic Binary Catalog (SB9; Pourbaix et al. 2004). The proposed targets were prioritized by a multitude of factors: (1) spatial position in the sky, i.e., the number of visits, (2) T magnitude, as it appears in the TESS Input Catalog (TIC; Stassun et al. 2018), (3) membership in the Detached Eclipsing Binary Catalog (DEBCAT; Southworth 2015), (4) classification certainty, and (5) scientific importance. Each criterion was assigned a numerical value (positive or negative) and their sum determined a priority value in the target list. Thus, bright DEBCAT members in the continuous viewing zone were the highest priority targets. Targets on the faint end observed in a single sector and without certain classification were the lowest priority targets. Of the proposed targets, 745 were selected for observations in Cycle 1 and 999 were selected for observations in Cycle 2.

A total of 6699 EB candidates were identified via the Planet Hunters TESS (PHT) citizen science project (Eisner et al. 2021), which is hosted by the Zooniverse platform (Lintott et al. 2008, 2011). The project engages over 30,000 registered citizen scientists in the search for transiting exoplanets in the 2 minute cadence light curve obtained by TESS. The identification of eclipsing binaries and multi-stellar systems is a natural by-product of this large-scale visual vetting effort. In brief, each 2 minute cadence TESS light curve is visually inspected by 15 citizen scientists who identify the times of any transit-like signals before moving on to the next light curve. Once all of the data from a given TESS sector have been classified, the classifications from the individual volunteers are combined using an unsupervised machine-learning algorithm. This allows us to identify times of potential transit-like events in each light curve and rank all of the candidates from most to least likely to contain a transit signal (for details, see Eisner et al. 2021). The 500–700 highest ranked candidates per sector are visually inspected by the PHT science team and grouped into "planet candidates," "EB candidates," and "other." In total, we identified 2720 EB candidates using this methodology.

In addition to this classification pipeline, each target has an independent discussion forum where the citizen scientists can discuss the data and flag the signals to the science team using searchable hashtags. By the end of the primary mission, the tags "EB," "eclipsing binary," or similar versions thereof had appeared over 46,200 times on the discussion forums, corresponding to 5759 individual TESS targets. All of these targets were considered as potential EB candidates and were kept for further vetting. A total of 1780 candidates were discovered via both of these methods, bringing the total number of EB candidates identified via PHT to 6699.

A second manual search for TESS EBs was carried out by another team of seven citizen scientists, the Visual Survey Group (VSG), independently of the PHT effort. Up until Sector 21, the VSG collectively scrutinized the Candidate Target List light curves (CTL; Stassun et al. 2018) which were binned at six points per hour. Data from subsequent sectors were binned at two points per hour. All data were prepared and surveyed with the LcTools software (Schmitt & Vanderburg 2021) from FITS files stored at the Mikulski Archive for Space Telescopes (MAST).

The Weird Detector pipeline: Wheeler & Kipping (2019) introduced the Weird Detector, a phase-dispersion-based periodic signal-detection algorithm with minimal requirements for signal morphology. The merit function (ζ) of a given trial period (Ptrial) is calculated using (I) local decrease in χ2 of the binned, phase-folded light curve and (II) the kurtosis characterizing the tailedness of the binned flux distribution to ensure a mostly flat baseline with one (or a few) excursion(s) representing a dimming event. Chakraborty et al. (2020) applied the algorithm to 248,000 2 minute light curves from the first 13 sectors of TESS data; given the highly general nature of the algorithm's candidate signal-finding goal—no particular target shape is optimized for in the pipeline's signal-finding process—the relatively higher signal-to-noise ratio (S/N) of eclipsing binaries compared with many other periodic sources makes them a large fraction (313/377) of the candidate signals. All candidates from this pool were manually vetted, ultimately yielding no novel detections, but with an overlap of 265 true-positive signals.

SPOC pipeline: All TESS 2 minute pixel stamps are processed by the Science Processing Operations Center (SPOC) pipeline (Jenkins et al. 2016). The pipeline runs optimal photometric extraction, followed by two types of light curves: simple aperture photometry (SAP) and pre-search data-conditioning (PDC) light curves. The SAP light curves are background-corrected, but have no additional detrending, while constructing the PDC light curves includes detrending for common-mode instrumental systematics using co-trending basis vectors empirically calculated from other sources on the corresponding detector (Smith et al. 2012; Stumpe et al. 2014). The PDC light curves are also corrected for flux contamination from nearby stars. Here we chose to use the SAP light curves to avoid cases where the detrending affects the astrophysical signal (Twicken et al. 2010; Stumpe et al. 2012; Morris et al. 2020).

The SPOC pipeline also searches the PDC light curves for transiting planet signatures using an adaptive, wavelet-based matched filter (Jenkins 2002; Jenkins et al. 2010), and the resulting TCEs are fitted with an initial limb-darkened transit model (Li et al. 2019) and subjected to a suite of diagnostic tests to help determine whether the transit signature is due to a planet, an eclipsing binary, stellar variability, or an instrumental effect.

3. The Determination of EB Ephemerides

3.1. The QATS Code

The Quasiperiodic Automated Transit Search (QATS) is a pipeline originally developed to find planets with transit timing variations in the Kepler data (Carter & Agol 2013). It was subsequently updated and revised to be a more general planet and eclipsing binary search tool, as first applied to K2 by Kruse et al. (2019). The implementation used here is identical to that of Kruse et al. (2019), so we refer the reader there for full details. In brief, QATS models a transit at every cadence in a light curve and compares that model's χ2 fit to a pure polynomial continuum. It then runs a period-folding search over that sequence of Δχ2 to identify periodic signals where the transit fit is better than the continuum. Once it has identified a candidate periodic signal, it runs a more thorough transit fit to accurately measure the transit or eclipse parameters.

3.2. The ECLIPSR Code

ECLIPSR (Eclipse Candidates in Light curves and Inference of Period at a Speedy Rate; IJspeert et al. 2021) is an algorithm that operates in two main stages: finding eclipses in the light curve and subsequently determining the periodicity in those eclipses. Finding (individual) eclipses in the light curve is achieved by looking for peaks in its time derivatives. This enables the successful identification of eclipses in light curves that show strong additional (intrinsic) variability compared to the eclipse signal. This process is fully automated and produces a score at the end for each light curve that can be used to separate light curves that show eclipses from those that do not contain an eclipse signal. Here, we start off from a list of pre-determined EB candidates and use the ECLIPSR algorithm mainly for its ability to determine ephemerides.

3.3. The BLS Run

The final algorithm employed a traditional and well-tested approach to searching for transit-like signals: the box least squares (BLS) periodogram (Kovács et al. 2002). In the BLS periodogram, a sliding box-like signal is passed over the light curve for a range of orbital periods and the likelihood of the model is recorded at each orbital period. The resulting periodogram is expected to peak at integer multiples of any box-like periodic signals in the light curve, thus it is well-suited to the detection of eclipses. We used the BLS implementation in Astropy (Astropy Collaboration et al. 2013) with the lightkurve package (Barentsen et al. 2019) to normalize and prepare the light curves. No further pre-processing was applied to the light curve beyond a simple normalization of flux. Although straightforward to implement and run, the BLS algorithm relies on strictly periodic signals and thus cannot identify single eclipse events. The orbital periods searched by BLS ranged from 0.1 days to half the timespan of the light curve, with an oversampling factor of 20.

3.4. Triage

All candidate targets were vetted manually by at least one of the authors through a custom web application. For each TIC entry, phased plots were shown at the period, as well as half period and double period, for each of the automated ephemeris algorithms discussed above.

The triage user would then choose which period (if any) correctly represented the signal and classify whether the signal was that of an eclipsing binary or some other variable source. Subcategories were available to flag eclipsing binaries where the period was ambiguous (whether there are two nearly identical primary and secondary eclipses or no visible secondary eclipse) and whether there was an insufficient number of eclipses in the data to accurately determine the orbital period.

Ephemerides for a single TIC target with periods (or period multiples) within 1% were automatically flagged as representing the same underlying signal. All other multiple ephemerides classified as eclipsing binary signals were then treated independently as either blended EBs or true hierarchical systems.

A total of 27,496 candidate ephemerides (from 10,477 unique TICs) and their half- and double-period counterparts were triaged. Of those, 4592 ephemerides (from 4584 unique TICs) were manually classified as likely being caused by an eclipsing binary signal (see Figure 1) and were passed on to the ephemeris refinement algorithm described below. Additionally, 520 input ephemerides (from 457 unique TICs) were classified as likely eclipsing binaries but with an insufficient number of eclipses to determine an orbital period. We flagged 1872 input ephemerides (from 1725 unique TICs) as requiring further follow-up to determine whether they were eclipsing binaries or pulsating stars (due to sinusoidal signals), 9434 input ephemerides (from 6029 unique TICs) were classified as having no eclipsing binary signal, and 11,078 input ephemerides were marked as duplicates of another ephemeris entry for the same TIC. Figure 2 depicts several examplary EB light curves from the TESS catalog.

Figure 1.

Figure 1. Map of TESS EBs observed in sectors 1–26 in the Galactic reference frame. Depicted in green are all vetted and validated EBs observed with the 2 minute cadence. Depicted in cyan are the simulated EBs brighter than T = 12 (Wells & Prša 2021). The dearth of systems in the region north of the ecliptic plane is due to the change in boresight in sectors 14–16 and 24–26, where the satellite was pointed at +85° instead of the nominal +54° to mitigate excessive contamination by stray Earth light and moonlight in cameras 1 and 2.

Standard image High-resolution image

3.5. Refinement and Heuristic Error Estimates

To further refine the ephemerides determined from triage, we fit analytical models to the phase-folded light curves and sample the posteriors of the period used for phase-folding and the model parameters. The two analytical models used are two-Gaussian (Mowlavi et al. 2017) and polyfit (Prša et al. 2008).

Figure 2.

Figure 2. A showcase of interesting objects observed by TESS. The panels on the left display time series and the panels on the right depict phase plots. The objects are, top to bottom: (a) wide eclipsing binary TIC 24935204, (b) benchmark-grade eclipsing binary TIC 33419790, (c) totally eclipsing contact binary TIC 5674169, (d) high-reflection subdwarf binary TIC 31690845, (e) eccentric ellipsoidal binary TIC 11046410, (f) flaring M-dwarf pair TIC 436869712, (g) eclipsing binary with a pulsating component TIC 10891640, (h) multiple signal eclipsing binary TIC 375422201, and (i) quadruple star system TIC 424508303.

Standard image High-resolution image

The two-Gaussian models fit a phased light curve by using one or two Gaussian functions and/or a cosine function with its maximum coinciding with one of the eclipses. The Gaussians are supposed to fit eclipses and the cosine function is supposed to fit symmetric ellipsoidal variability. The light curve is fit by all combinations of functions and it finds the solution that minimizes the degrees of freedom while retaining a satisfactory fit. The model with the highest Bayesian information criterion (BIC) is chosen as the best fit and used in the subsequent analysis. The left panels in Figure 3 depict an example of a two-Gaussian fit.

Figure 3.

Figure 3. Demonstration of a successful (top panels) and unsuccessful (bottom panels) fit with the two models. The top panels show a system with asymmetric variability outside of eclipse, which cannot be modeled by the two-Gaussian model. However, this does not prevent the model from fitting the eclipses correctly. Polyfit, due to its higher degree of freedom, manages to capture both the eclipses and variability outside of eclipse. The bottom panels show the deficiencies of the two models that cause it to not correctly fit the shallow secondary eclipse: the two-Gaussian model considers it part of the asymmetric variability outside of eclipse that it is unable to model, while polyfit fits the variability as an eclipse instead.

Standard image High-resolution image

The polyfit analytical model relies on fitting a piecewise-connected chain of polynomials to the phased light curve. The constraints imposed on the chain are that it should be connected and smoothly wrapped in phase space. There is no requirement that the chain be differentiable in the knots, which allows it to easily fit the discrete breaks caused by eclipses. As such, the knots are typically positioned at the ingress and egress of the eclipses. We use four quadratic polynomials connected at four knots. The right panels in Figure 3 depict an example of a polyfit.

The light-curve geometries induced by the eclipses, ellipsoidal variability, spots, and other potentially present signals are in reality more complex than those that can be modeled with these analytical functions. However, they are sufficient for this preliminary analysis which focuses on period refinement and simple geometrical parameter estimates.

To estimate the initial distributions for the model parameters, we first fit the model to the phase-folded data. For the two-Gaussian model, the best-fitting combination of eclipses and a cosine term are chosen and their corresponding parameters passed on to a Markov Chain Monte Carlo (MCMC) search with walkers initialized in a tight ball around the fitted parameter values. If there are no eclipses or ellipsoidal variability detected by two Gaussians (which results in a constant function fit), the light curve is not passed on to MCMC and flagged as "failed two-Gaussian." Similarly, we fit an initial polyfit to the phase-folded light curve, and if successful, we initialize a sample around the fitted knot positions and polynomial coefficients. A failed polyfit is rare, but if the algorithm raises any errors that do not result in a fit, we flag that system as a "failed polyfit."

We run MCMC with emcee (Foreman-Mackey et al. 2019) on each candidate EB light curve with 96 walkers for 2000 iterations. The burn-in typically takes under 100 iterations, but we discard the first 1000 iterations for the mean and standard deviation computation of the parameter posteriors. We also check for potential multiple "branches" of walkers at different log probabilities and only use the one with the lowest log probability of the posteriors. Out of the 4584 EB candidates, 98.08% were successfully fitted with a two-Gaussian model and 99.85% with polyfit. Out of the fitted two Gaussians, 99.24% within 1% of the triage period, and 98.89% of the fitted polyfits are within 1% of the triage period. Additionally, 98.93% of fitted two Gaussians are within 1% of the polyfit periods.

As depicted in Figure 3, two-Gaussian and polyfit have their deficiencies which hinder their application to all systems. Fortunately, they tend to complement each other (unlike two-Gaussian, polyfit can fit a wide range of out-of-eclipse signals, including highly asymmetric ones; while two-Gaussians will tend to fit eclipses even in noisy data, where polyfit underperforms). We also run additional checks to ensure that a fitted Gaussian or polyfit truly fits an eclipse and not some other feature of the light curve:

  • 1.  
    overlapping eclipses check—ensures that a single eclipse is not fitted by two Gaussians or two polynomials;
  • 2.  
    noise check—ensures that a Gaussian is not fitted to the data noise;
  • 3.  
    ELV check—ensures that a Gaussian is not fitted to ellipsoidal variations (ELVs) or another out-of-eclipse variability;
  • 4.  
    polyfit only: check that the eclipse coincides with a polynomial minimum and not a knot position. If a knot exists with a lower flux value than the polynomial minimum, the eclipse is discarded.

Where any one of these checks fails, the eclipse parameters are not reported, even if a two-Gaussian or polyfit model solution exists.

To fully utilize the information returned from both models and their period and parameter distribution, we compute combined ephemerides, which are ultimately reported in the catalog. The combined ephemerides are computed by sampling both the two-Gaussian and polyfit distributions, under the constraint that the number of samples drawn from each distribution is inversely proportional to the reduced χ2 value (chi2 r ) of the mean fit (Figure 4). The mean and standard deviation of the new combined distribution are then chosen as the final ephemerides. We perform an additional check to ensure that the models whose distributions we sample are reasonable. If a model reports uncertainty of the time of superior conjunction larger than half the fitted period, the time of superior conjunction for that model is discarded. This happens more often in the case of polyfit, as the eclipse model has more degrees of freedom.

Figure 4.

Figure 4. Combined period distribution for a two-Gaussian and polyfit with comparable ${\chi }_{r}^{2}$ (left) and one where polyfit is significantly better (right).

Standard image High-resolution image

3.6. Morphology Classification

For morphological classification of the light curves, we rely on the morphology parameter as defined in Matijevič et al. (2012). The morphology parameter is defined as a continuous variable in the range [0,1], where 0 corresponds to the widest detached systems and 1 to ellipsoidal variables. We use the Kepler EBs data set to train a neural-network (NN) model on the light-curve geometry and output a value of the morphology parameter. The Kepler and TESS EBs data sets are both preprocessed in the same way: phase-folded with the final catalog ephemerides and binned in 1000 phase bins over the range [−0.5, 0.5]. The NN architecture consists of an input layer of size 1000, corresponding to the phased light curves; three dense layers of size 300, 100, and 30, respectively; and an output layer of size 1, corresponding to the morphology parameter. The NN is trained on 60% of the Kepler data set and tested on the remaining 40%, yielding a mean squared error loss of 0.0035.

Figure 5 demonstrates the relationship between the morphology, periods, and primary eclipse widths in the catalog. Long-period systems have morphology parameters close to zero, signaling wide detached binaries, while the shortest period systems tend toward a morphology of 1, which corresponds to contact binaries and ellipsoidal variables. The primary eclipse widths, as determined by the two-Gaussian model, are encoded in the color map, further confirming that the morphology parameter captures the expected variability in eclipse widths: narrow eclipses in systems with a morphology close to 0 and wider eclipses going toward a morphology of 1.

Figure 5.

Figure 5. The relationship between morphology parameter, logarithm of the period, and primary eclipse widths, as determined from the two-Gaussian model.

Standard image High-resolution image

4. EB Validation

Data validation reports, for the purposes of this project, are cumulative accounts of analyses carried out using TESS data for each individual target flagged as an EB. Each report contains, but is not limited to, the full light curve, target parameters, background flux plots, aperture size plots, power spectra, evolutionary tracks, and results from several eclipsing binary data validation tests. The purpose of these validation reports is to support the eclipsing binary classifications of the TESS EB working group.

Eclipsing binary data validation resulted primarily from reformatting the interactive code, LATTE (Light curve Analysis Tool for Transiting Exoplanets; Eisner et al. 2020). LATTE's initial purpose involved identifying false positives in exoplanet classifications due to instrumentation and astrophysical productions of exoplanet-like signatures. The development of ICED LATTE (Interlacing Code for Eclipsing binary Data validation) redirected the purpose of the analysis tool to validate classifications of TESS EBs and flag the presence of false positives. Additionally, LATTE was modified to work without its interactive plots with the aim of producing quick validation reports for the ∼5000 classified EBs. Additions to ICED LATTE that were not in LATTE include numerous EB validation tests designed to identify whether the signal is on target, tests to determine whether the signal favors the planet or the EB scenario, and a generated likelihood that the target is an EB.

A significant portion of the validation tests are based on light-curve analysis. However, the most telling of the validation tests is based on target pixel file analysis (see Figure 6). The large, 21'' pixels substantially increase the probability of background contamination from sources other than the TESS target being observed. In the event that an EB is captured in the same aperture as the target, the target itself may appear to be an EB from light-curve analysis. The centroid movement test identifies the presence of other objects in the aperture of the target and measures any movement of the central point of light. If this centroid appears to move slightly between the background object and the target, then one of the two is likely an EB. This is due to the change in flux from the EB. During times of eclipse, the EB appears dimmer and the centroid of light moves toward the stable-flux object. Intuitively, during noneclipse times, the EB appears brighter and the centroid of light moves toward the EB. By analyzing this centroid movement, we are able to determine which of the objects is the true EB and thus confirm or dismiss the EB classification of the TESS target.

In addition to results from the validation tests, a rough likelihood that the target is an EB is also calculated and included in the data validation report. It should be noted that these likelihoods are extremely general in nature and only reflect the results of the validation tests which are run on the targets. Each test results in a numerical value between 0 and 1; 0 is generally assigned to the test result "Unlikely EB," 0.5 is generally assigned to "Possible EB," and 1 is assigned to "Probable EB." In brief, the validation tests that contribute to this score consist of the following:

  • 1.  
    Centroid motion. The centroid test makes use of the open-source python package contaminante, which executes a pixel-level modeling of the TESS target pixel files to determine the most likely location of the source of the eclipses. The score for this test is scaled inversely with the distance between the location-calculated source of signal and the location of the target star.
  • 2.  
    Contamination. The amount of contamination to the TESS aperture from nearby stars. This provides an indication of how crowded the field is and the likelihood of the signal originating from a nearby companion star. The score for this test is scaled inversely with the contamination of nearby source.
  • 3.  
    Out-of-transit variability. EBs with orbital periods shorter than around 3 days are expected to show out-of-transit variability. As such, we search for this variability for the short-period candidates. The significance of such a detection is proportional to the score given for this test. Candidates with periods greater than 3 days are given a score of 0.5 for this test by default.
  • 4.  
    Archival classification. Candidates that have previously been listed as an EB on Simbad are awarded a score of 1 while all other candidates are awarded a score of 0.5.
  • 5.  
    TCE/TOI. The TESS automated search pipeline flags light curves containing a periodic signals, including both planetary and stellar, as threshold-crossing events (TCEs) and TCEs that pass a large number of rigorous planet-vetting tests are promoted to TOI status. As such, candidates that are TOIs are given a score of 0; candidates that are TCEs but not TOIs are given a score of 0.75, and candidates that are neither TOIs nor TCEs receive a score of 0.5.

Figure 6.

Figure 6. A light curve extracted for each pixel around the target star, where the red line indicates the aperture used to extract the 2 minute cadence SPOC light curve. Left: example of an EB that is on target (TIC 255700967). The overall EB score for this target is 0.87. Right: example of a signal that is not on target (TIC 230530979). The overall EB score for this target is 0.42.

Standard image High-resolution image

Due to the large number of signals that are off target, we found the centroid motion test to be the most informative to determine whether the EB signal was real. As such, the final EB likelihood was calculated as the average between all of these test scores, with the centroid motion test given a factor of 3 extra weight. The resulting distribution of the final EB likelihood and of all of the individual test results is shown in Figure 7.

Figure 7.

Figure 7. Histogram showing the distribution of the individual test scores (dashed outlines), which combined, give the overall likelihood of the candidate being a real EB (solid black outlines).

Standard image High-resolution image

5. Statistical Properties of the EB Sample

While nonuniform and magnitude-limited, the sample of 4584 EBs observed by TESS allows us to do preliminary bulk analysis on the data set.

  • 1.  
    Orbital periods. The distribution is largely affected by the temporal coverage of observations. Figure 8 shows a bimodal distribution, with a narrow peak at around 0.25 days and a broad peak at around 3 days. The narrow peak corresponds to contact binaries, while the broad peak corresponds to close, detached EBs. We discuss this further in Section 7. The shades of blue correspond to the number of sectors that the targets were observed in; it shows the comparative rates of EBs in the continuous viewing zones (lightest shade) and single sector visits (darkest shade); the histogram is stacked, i.e., the values per bin are cumulative over all sector visits.
  • 2.  
    Eclipse depths. Figure 9 depicts their distribution, along with the distribution of eclipse depth ratios. By definition, the primary eclipse depth is deeper, so we expect the distribution to have a longer tail. The systems with shallow primary eclipses and undetectable secondary eclipses enhance the first bin of the primary depth distribution. Depth ratios depend on the surface brightness ratio of the two components, and on eccentricity. Surface brightness itself depends on many second-order effects such as gravity-darkening, limb-darkening, and reflection. Thus, relating eclipse depth ratios to underlying parameters is complicated, but in the broadest sense, the distribution of the depth ratios is aligned with the ratios of effective temperatures. The distribution is bimodal, with a peak at 1, corresponding predominantly 42 to the equal luminosity class pairs, and broad, near-flat distribution between 0.2 and 0.8, corresponding to evolved components.
  • 3.  
    Eclipse separations. Phase-space separations of eclipses are a direct measure of the tangential component of orbital eccentricity, $e\cos \omega $. It is reasonably well determined because eclipse positions could be measured reliably. Figure 10 depicts their distribution; note that the y scale is logarithmic. The strong peak at 0.5 corresponds to circular orbits, which are expected to dominate the EB population at these relatively short periods.
  • 4.  
    Eclipse widths. The least well-determined parameters because of their pronounced sensitivity to the exact points of ingress and egress, it is a measure of the radial component of eccentricity, $e\sin \omega $. Figure 11 depicts their distributions. The differences between the two models, polyfit and two-Gaussian, are most pronounced in eclipse widths; nevertheless, the correlation between the two (bottom panel of Figure 11) shows the expected trend (distribution around y = x) with an expected scatter due to eccentricity effects.

Figure 8.

Figure 8. The distribution of orbital periods for 4584 EBs observed in sectors 1 through 26. Shades correspond to the number of sectors in which the target was observed.

Standard image High-resolution image
Figure 9.

Figure 9. The distributions of primary and secondary eclipse depths and their ratios, derived from the polyfit and two-Gaussian models. Note the log scale of the y-axis. Eclipse depth ratios serve as rough proxies for the surface brightness ratios.

Standard image High-resolution image
Figure 10.

Figure 10. The distribution of eclipse separations in phase space. Eclipse separations serve as proxies for the tangential component of eccentricity, $e\cos \omega $.

Standard image High-resolution image
Figure 11.

Figure 11. The distributions of primary and secondary eclipse widths, and their correlation, derived from the polyfit and two-Gaussian models. Eclipse widths serve as proxies for the radial component of the eccentricity, $e\sin \omega $.

Standard image High-resolution image

5.1. Catalog Completeness

The sample of EBs presented in this paper is inherently nonuniform: mission targets are selected to maximize the exoplanet detection yield and guest investigator targets are selected through proposal competition across a great many science goals. It is thus impossible to use these data to get statistically representative distributions of parameters; we defer that goal until the time when we have a sample of detected and characterized EBs from full-frame images. Thus, when we say "catalog completeness," we mean the overall EB detection success in the data set of 2 minute cadence target observations. Quantifying completeness properly thus remains beyond the scope of this paper, but we provide qualitative estimates here.

To estimate completeness qualitatively, we start with combined differential photometric precision (CDPP; Gilliland et al. 2011). CDPP is a measure of light-curve variation—the rms noise of the result of filtering the time series with a high-pass Savitzky–Golay filter and then applying a moving average of 0.5 hr. The top panel in Figure 12 shows CDPP as a function of magnitude for Sector 1 observations. The CDPP values are inherent to aperture light, so they are not corrected for crowding. The second panel in Figure 12 provides a statistical estimate of per-target crowding, i.e., the fraction of light in the aperture coming from the target itself. Thus, a value of 1 means that all light is due to the observed target, while a value of, say, 0.6, means that 60% of the light comes from the target and 40% of the light comes from background sources. Thus, to account for dilution, we correct the CDPP for crowding by dividing it by the crowding factor. This implicitly assumes that the extraneous source of light in the aperture is constant, and that all variability comes from the target itself, which may or may not be true, but overall it plays a small role in correction because crowding is "top-heavy": 90% of the targets have crowding factors larger than 0.95. The corrected CDPP is depicted in the third panel of Figure 12. We then fit the bottom envelope by sigma-clipping and divide the corrected CDPP by the envelope in order to flatten out the dependence on magnitude (bottom panel). This way, we can assess the distribution of amplitudes for EBs and its qualitative resemblance to the expected geometric distribution (i.e., the probability of eclipses as a function of orbital elements). We emphasize the word "qualitative" here because, in order to calculate the true geometric distribution, we would need to be able to simulate target list selection, generate a synthetic sample of EBs and, from there, calculate the corresponding CDPP values; this, unfortunately, is not tractable because of the non-prescriptive target selection. The top panel in Figure 13 shows this distribution; we observe a monotonically decreasing trend, in line with expectations. For validation purposes, we check the trend of all targets observed in Sector 1; as we see no systematic features in the flattened CDPP, we conclude that the number distribution is a fair reflection of the true CDPP distribution for the observed EBs.

Figure 12.

Figure 12. The TESS combined differential photometric precision (CDPP). The top panel depicts the measured CDPP with a 0.5 hr running average, in ppm. The second panel provides a crowding measure per target, i.e., the fraction of light in the aperture from the observed target. The third panel corrects the original CDPP for crowding: corrected CDPP = CDPP/crowding. This assumes that the contribution of background light does not vary with time, which may or may not be true. Depicted in blue are all observed targets, and in red are the EBs. The bottom panel is the flattened version of the previous panel, divided by the bottom envelope.

Standard image High-resolution image
Figure 13.

Figure 13. The number of EB targets (top) compared to all Sector 1 targets (bottom) as a function of flattened CDPP. Note the logarithmic y scale.

Standard image High-resolution image

While not statistically significant, we took a closer look at the slight dip in the fourth bin of the EB distribution; there are 19 EBs out of 299 targets in that bin that were found. We manually went through the entire list of 299 targets again but have failed to find any additional EBs.

Finally, we perform one last check: we test whether the magnitude distribution of EB targets is representative of the overall magnitude distribution; any detected bias might raise completeness questions. Figure 14 demonstrates qualitative equivalence, noting of course that there are no EBs brighter than T = 2.5, and the diminishing signal-to-noise ratio suppresses eclipse detection on the faint end.

Figure 14.

Figure 14. The distribution of TESS magnitudes for all 2 minute targets observed in sectors 1–26 (blue) and all 4584 EBs (amber).

Standard image High-resolution image

At some level eclipses will blend into the noise background and we will no longer be able to (reliably) detect them; to estimate that level, we take the distribution of primary eclipse depths (see Figure 9) and choose the bin size that yields equal occurrence rates for the first two bins. Phenomenologically that implies that the monotonic increase toward lower S/N stops and that we can no longer reliably detect eclipses. That occurs at S/N ∼ 13. We take that to be our limit. We conclude that the catalog is largely complete to S/N ∼ 13.

6. Contents of the Catalog

The TESS eclipsing binary catalog presented here contains all targets observed with a 2 minute cadence in sectors 1 through 26 that are consistent with eclipsing binary signatures. The count as of this writing is 4584 EBs.

All data are available from two official servers: the project's main webpage, http://tessEBs.villanova.edu, and at MAST as a high-level science product via doi:10.17909/t9-9gm4-fx30. 43 The official webpage hosts the "rolling" version of the catalog that is updated in real time as new targets are validated or existing targets are refuted. Once every quarter we take a snapshot of the rolling version, tag it with a release number, and deliver it to MAST. Copies of the tagged releases are also kept on the project webpage.

Table 1 summarizes the database tables and columns contained in the catalog along with the relationships between database tables. All entries in the TIC table are copied verbatim from the latest TESS input catalog (v8.1 at the time of this writing; Stassun et al. 2018); all entries in the Sector table are taken from the latest version of the official pointing table 44 ; table Origin contains identifiers for people or groups who contributed EB candidates (see Section 2). Table EB is the central table of the database: it contains links to the TIC, all Sectors the EB is observed in, and all Origins. In addition, it provides the data source 45 (FFI for full-frame images, TPF for target pixel files, and LCF for light-curve files). It also gives the refined ephemerides (see Section 3.5) and morphology coefficient (see Section 3.6). Table EphemerisSource aggregates all automated ephemeris-finding algorithms (see Section 3) and table Ephemeris keeps all records on the proposed ephemerides. Note that these are not final, refined ephemerides; rather, these are "raw" ephemerides output by the ephemeris-finding algorithms. The Ephemeris table also stores triage information: when the triage was done, by whom, what the disposition was, and what proposed ephemerides are likely correct. Finally, the Comment table contains all subjective comments made on Ephemeris, EB, or TIC entries.

Table 1. Database Structure of the EB Catalog

TableColumnUnitsValue TypeDescription
TIC: tess_id long integerTIC identifier
  ra degfloat [0, 360)R.A.
  dec degfloat [−90, 90]decl. in deg
  glon degfloat [0, 360)Galactic longitude
  glat degfloat [−90, 90]Galactic latitude
  Tmag floatTESS magnitude
  teff Kfloatestimated effective temperature
  logg log(cgs)floatestimated surface gravity
  abun log(sol)floatestimated metal abundances
  pmra mas/yrfloatproper motion in R.A.
  pmdec mas/yrfloatproper motion in decl.
Sector: sector_id integerSector identifier
  date_start isodatestart date of sector observations
  date_end isodateend date of sector observations
  spacecraft_ra degfloat [0, 360)satellite R.A.
  spacecraft_dec degfloat [−90, 90]satellite decl.
  spacecraft_roll degfloat 0, 360]satellite roll from reference point
  cameraN_ra degfloat [0, 360)camera N R.A., N = 1 ...4
  cameraN_dec degfloat [−90, 90]camera N decl., N = 1 ...4
  cameraN_roll degfloat 0, 360]camera N roll from reference point, N = 1 ...4
Origin: name stringdata/classification origin string
EB: tic TIC link to the TIC entry
  origin Origin link to the origin entry
  sectors Sector list of sectors the EB was observed in
  signal_id integerEB signal enumerator
  in_catalog booleanflags the EB as validated
  date_added isodatetime and date of the database entry
  date_modified isodatetime and date of the database modification
  source choicedata source: 'FFI', 'TPF' or 'LCF'
  bjd0 daysfloatBJD−2457000 of the primary eclipse
  bjd0_uncert daysfloatuncertainty in bjd0
  period daysfloatorbital period
  period_uncert daysfloatuncertainty in orbital period
  morph_coeff floatmorphology coefficient
  morph_dist floatorthogonal distance from the morphology manifold
EphemerisSource: model stringModel used for the ephemeris estimate
  version stringModel version
  author stringModel author
  reference stringModel reference
Ephemeris: date_added isodateEphemeris estimate date
  source EphemerisSource Ephemeris source
  eb EB link to the EB entry
  bjd0 daysfloatBJD−2457000 of the primary eclipse
  bjd0_uncert daysfloatuncertainty in bjd0
  period daysfloatorbital period
  period_uncert daysfloatuncertainty in orbital period
  triage_timestamp isodatetime of performed triage
  triage_status stringtriage disposition
  triage_period choicechoice of 'period', 'half', 'double', 'other'
  triage_username stringperson completing the triage
Comment: author stringcomment author
  text stringcomment text
  timestamp isodatecomment date and time
  ephem Ephemeris link to the ephemeris entry
  eb EB link to the EB entry
  tic TIC link to the TIC entry

Download table as:  ASCIITypeset image

7. Discussion

In addition to curating information on individual EB targets observed by TESS, the catalog can be used for bulk analysis, similar to what was presented in Section 5. Here we present another two interesting observations.

Figure 15 depicts Galactic latitude distribution of EBs, with two notable features. The first is a near-constant distribution of southern Galactic hemisphere EBs (−90 < b < −10), and the second is the lopsidedness between the southern and the northern Galactic hemisphere. The near-constant distribution of EBs is at odds with observations from, say, Kepler (Prša et al. 2011), where Galactic latitude dependence was pronounced. This is a consequence of (1) the nonuniform target selection presented in Section 2, (2) the relatively bright magnitude breakdown limit for TESS, significantly reducing the dependence of eclipse probability on stellar populations, and (3) the fraction of time the Galactic plane was in or near the CVZ. We expect that EBs detected in FFIs will comprise a more uniform sample that will be more representative of true Galactic distributions of EBs (see Figure 21). The lopsidedness is largely due to the change in boresight in sectors 14–16 and 24–26 (see Figure 1).

Figure 15.

Figure 15. Detection frequency of EBs as a function of Galactic latitude. The frequency is computed by counting all detected 2 minute cadence EBs in a given 10° Galactic latitude strip and dividing the count with the number of all 2 minute cadence targets observed in the same strip.

Standard image High-resolution image

The second observation is biased in orbital periods of TESS EBs. Figure 16 (top) shows a comparison between the orbital period distributions for TESS EBs (blue) and Kepler EBs (amber). The long tail for the Kepler sample on the long-period end is expected given that Kepler observed the same field for ∼4 yr; on the short-period end we see concordance of the narrow peak around P ∼ 0.4 days between the two data sets, corresponding to contact binary stars, but a different distribution of EBs with periods around 5 days: Kepler's distribution is mostly flat, while the TESS distribution features a wide peak. This is again a consequence of temporal coverage: the 5 day orbital periods are the sweet spot for TESS with ∼5 cycles per observed sector, compared to ∼300 cycles for Kepler. Thus, Kepler's extended sensitivity toward longer periods means that its detection rates are suppressed almost exclusively by the diminishing geometric probability of eclipses; in contrast, TESS suffers from duty-cycle suppression at orbital periods as short as ∼13 days, where geometric suppression is not yet dominant. Figure 16 (bottom) depicts a comparison of TESS EBs against the ground-based survey OGLE (green). In this case, we see that the contact binary peak in the OGLE data set is severely enhanced by the diurnal selection effect. Longer periods are significantly suppressed both by the duty-cycle limitations and the geometric probability of eclipses which is now more pronounced because of the sparser, irregular observing cadence. This bias clearly highlights the benefits of space-based surveys such as TESS.

Figure 16.

Figure 16. The comparison of orbital period distributions from TESS, Kepler, and OGLE. All distributions are normalized by their surface areas for comparison purposes.

Standard image High-resolution image

7.1. Goals of the Paper Series

In this first part of the series of papers we presented the sample of 4584 EBs observed by TESS in sectors 1–26. The main science goals that the entire series aims to achieve are as follows:

  • Study gold-standard EBs. These are known, bright, detached, double-lined binaries with deep eclipses (Torres et al. 2010; Southworth 2015). These are posited to have had minimal interaction histories and have evolved largely independently as single stars. Such systems are considered "gold standards" because they hold promise for the most accurate fundamental parameters. These EBs have spectroscopic data already available and TESS provides us with an improvement in the photometric light-curve quality and temporal coverage.
  • Estimate photometric elements for all TESS EBs. Eclipsing binary light curves hold a wealth of information about a system and its stellar components. Using PHOEBE and its framework for inverse problem solvers (Conroy et al. 2020), we can estimate photometric parameters of all TESS EBs with minimal manual intervention. Using the ephemerides and geometrical estimates from the catalog as a starting point, we can determine the sum and ratio of fractional radii; temperature ratio of the components; the inclination, eccentricity, and argument of periastron of the orbit; and passband luminosities and potential third-light contribution. In addition, for closed systems, the mass ratio and semimajor axis can be constrained from light curves alone (Wilson 1994; Terrell & Wilson 2005). A full Bayesian analysis with MCMC also yields posterior distributions for second-order effects, like the effective temperature of the primary, gravity-darkening coefficients, and albedoes, which, given the photometric precision of TESS, can be used to refine the current empirical laws.The main perceived impact of this study will be the period–eccentricity distribution of the TESS EBs sample, which is crucial in studies of binary star populations. In addition, interesting and benchmark systems will be identified for radial-velocity follow-up and more advanced analysis.
  • Extend the temporal coverage with archival photometry. TESS observations last about 27 days for each sector. Roughly 70% of the sky was observed during the prime mission (Guerrero et al. 2021) and of the 232,705 stars observed at 2 minute cadence, 152,993 (∼66%) were observed for just one sector, while the rest were observed in multiple sectors, and 3874 (∼1.7%) were observed for 13 consecutive sectors over 351 days. While the observations are not fully continuous, with gaps between orbits and some time lost for engineering or safe modes, in general, a given time baseline of observations means that it is possible to detect EBs with orbital periods up to half the time baseline of observations.At the long end of that detectability time frame, ambiguities exist in the characterization of the EBs. In some cases, it is not clear whether two distinct eclipses are seen. In others, spot modulation or other variability can interfere with the identification or measurement of an eclipse. For EBs with orbital periods greater than half the observing time baseline, only a single eclipse may be detected, yielding a likely identification of an EB without a reliable period measurement. See examples in Figure 17. For all these reasons, the set of TESS-detected EBs can benefit from cross-identification with other photometric surveys.A number of such surveys have been conducted and can be combined with TESS observations. Many surveys were carried out to search for transiting exoplanets, including WASP (Pollacco et al. 2006), HAT (Bakos et al. 2004, 2013), KELT (Pepper et al. 2007, 2012), and others 46 Other surveys have been conducted to search for stellar variability, such as ASAS (Pojmanski 1997), ASAS-SN (Shappee et al. 2014), ATLAS (Tonry et al. 2018), and ZTF (Bellm et al. 2019). While none of these surveys have the photometric precision or cadence of TESS, they all have greater observing time baselines over large fractions of the sky. EBs identified in TESS data with ambiguous ephemerides can be matched to the archival photometric observations to identify eclipses and refine the ephemerides. Such an effort will typically only recover EBs with deep eclipses, but that still represents a large fraction of the EB population. The archival photometry can determine or refine the ephemerides of the EBs and can be used to search for eclipse time variations (ETVs; Conroy et al. 2014) or apsidal motion [REF](Orosz 2015; Borkovits et al. 2016a; Kirk et al. 2016b). Further development of the TESS EB catalog will incorporate such analysis.In addition to the use of archival photometry, individual systems of interest can be followed up for astrophysical investigation with modest observing facilities. While broader scientific themes are outlined below, small-telescope observatories are well equipped to observe the bright EBs identified in this catalog. An example of that kind of investigation is multiband photometric follow-up of the O'Connell effect (Milone 1968), in which light curves show unequal maxima before and after the primary eclipse. Follow-up observations in different bandpasses are necessary to understand the underlying cause of this phenomenon. Furthermore, while archival photometry can recover the orbital period for some long-period EBs, as described above, follow-up photometry can readily refine the orbital parameters more precisely.
  • Study asteroseismic components in EBs. The exquisite precision, cadence, and duty cycle of TESS observations have revealed resolved pulsations in stars across all spectral types and evolutionary phases (Antoci et al. 2019; Pedersen et al. 2019; Córsico et al. 2021). Given the ubiquity of both stellar pulsations and stellar multiplicity, binary systems with at least one pulsating component are commonly found. The complementary nature of asteroseismic (interior) and binary (bulk) information enables highly detailed evolutionary modeling. Such synergistic combined modeling approaches have led to detailed inference on the structure of stars from solar-like oscillators to massive β-Cep pulsators to evolved stars that have undergone mass transfer (De Cat et al. 2004; Guo et al. 2016; Schmid & Aerts 2016; Guo et al. 2017; White et al. 2017; Beck et al. 2018; Streamer et al. 2018; Johnston et al. 2019, 2021). Furthermore, detailed studies have demonstrated tension between the asteroseismic and binary results for both main sequence and evolved stars, indicating that evolutionary models still have room for improvement (Gaulme et al. 2016; Themeßl et al. 2018; Benbakoura et al. 2021; Sekaran et al. 2021). The large data set provided by TESS will enable synergistic asteroseismic and binary modeling across a much larger parameter range than was previously possible with the more limited samples, allowing investigation into the relationship between pulsational properties and orbital properties (Liakos & Niarchos 2017; Gaulme & Guzik 2019; Sekaran et al. 2020).Ground-based radial-velocity follow-up of binary systems is time-consuming, resource-intensive, and simply expensive. Due to their stability over long time bases, heat-driven pulsations act as regular clocks, whose phase (and frequency) is modulated due to the light-travel-time effect from the orbital motion. By measuring the observed phases of pulsations across the orbit, one can reliably derive the radial-velocity curve of the pulsating component from the photometry alone (Shibahashi & Kurtz 2012; Murphy et al. 2014; Hey et al. 2020). The application of this method to this database (where appropriate, given the ratio of pulsation period to orbital period) can produce a wealth of radial-velocity curves without needing to apply for costly ground-based telescope time.
  • Explore the impact of dynamical tides. Kepler discovered 172 heartbeat stars (HBs), and about one fifth show tidally excited oscillations (Kirk et al. 2016a; Guo et al. 2020), i.e., the direct manifestation of dynamical tides. A handful of them have been studied in detail (Welsh et al. 2011; Hambleton et al. 2013, 2016, 2018; Shporer et al. 2016; Fuller et al. 2017; Guo et al. 2017, 2019). Recently, Guo (2021) reviewed the current status of heartbeat binaries with tidally excited oscillations. Most Kepler HBs are of A–F types due to selection biases. TESS is destined to discover more massive O–B type heartbeat binaries. Systems already published include Jayasinghe et al. (2019, 2021) and Kołaczek-Szymański et al. (2021). To date, we have identified 25 HB systems in the 2 minute TESS data up to Sector 26, and over 200 systems in the 2 minute and FFI data combined. Given that 5% of the binary stars in the Kepler catalog with periods less than 27.3 days (the length of a TESS sector) were classified as heartbeat stars, we anticipate that the number we have identified is a lower limit.The advent of the TESS mission has also revealed a new class of pulsating stars in binaries, wherein the pulsation axis is "tidally tilted," producing a characteristic amplitude and phase modulation (Fuller et al. 2020; Handler et al. 2020; Kurtz et al. 2020; Rappaport et al. 2021). Additionally, a few studies have reported the detection of so-called "tidally perturbed" pulsations, where a series of pulsations that are nearly equidistantly spaced by the orbital frequency were detected (Bowman et al. 2019; Southworth et al. 2020; Steindl et al. 2021). Such systems cannot currently be explained by the same model as the "tidally tilted" pulsators, and seem to have a separate connection to the interplay between tides and pulsations. While only a handful of these objects have been identified to date, they signify a unique contribution of TESS to the progress of tidal asteroseismology. The compilation of this database serves to streamline the detection of even more systems.
  • Stellar multiples and circumbinary planets. A large number of eclipsing binaries have third-body companions that are detected via eclipse timing variations (ETVs), either light-travel-time effects (LTTE, the most common) or dynamical effect delays. A fraction of these exhibit tertiary eclipses as well. For example, Borkovits et al. (2016b) found good evidence for third bodies in 222 eclipsing binaries out of 2600 studied using Kepler data. In most cases, the ETV signal becomes significant on timescales similar to or longer than the period of the outer orbit, which is typically a few to several hundred days for the systems found by Borkovits et al. (2016b).Rather than being a star, in some cases, the third body is a planet—a so-called circumbinary planet. Kepler revealed 13 such planets that transited their host stars (e.g., see Doyle et al. (2011), Welsh et al. (2012), Orosz et al. (2019)). While few in number, these systems provide a great deal of information, e.g., extremely accurate (not just precise) stellar parameters and planetary radii. But with only a handful of systems, we cannot understand the characteristics of the population of circumbinary planets. TESS's all-sky survey could remedy this small sample-size problem, but for most of the sky, only ∼27 days of near-continuous coverage per two years is available and this window is much shorter than the typical circumbinary planet orbital period. Fortunately, the duty cycle at the ecliptic poles is much better and the first TESS circumbinary planet, TOI-1338 (Kostov et al. 2020a), was detected in observation from TESS's continuous viewing zone (CVZ) at the southern ecliptic cap. In addition to the long-duration observations at the CVZ, we can search for circumbinary planets by taking advantage of the extra information provided when the planet transits both stars in the binary. Unlike a planet orbiting a single star, a circumbinary planet can produce two, and possibly more, transits during one conjunction's pass. This "1–2 punch" technique (Jenkins et al. 1996; Kostov et al. 2020b) was used to detect the second TESS circumbinary planet, TIC 172900988 (Kostov et al. 2021a). Roughly 140±110 new circumbinary planets may potentially be found via this technique, in addition to the 40 ± 32 expected in the two CVZs (Kostov et al. 2020b). Thus TESS should be able to boost the number of circumbinary planets by an order of magnitude, allowing a much more robust estimate of the occurrence rate, population statistics, and searches for correlation with various binary star parameters (e.g., metallicity, eccentricity, mass ratio, etc.).
  • Gaia data overlap. Of the 4584 EBs in the catalog, a total of 4550 have entries in the early third data release of the Gaia catalog (Gaia EDR3; Gaia Collaboration et al. 2021). The top three panels in Figure 18 show histograms of the G-band magnitudes and the GBPGRP colors for the combined light of each binary and of their distances, where the latter are derived here simply as the reciprocal of the trigonometric parallax (see Bailer-Jones et al. 2021). The distribution of magnitudes up to about G = 12 largely reflects the content of the TIC candidate target list, while the fainter stars are special interest targets that come from the Guest Investigator Program. The typical color index is very near solar. As expected from their brightness, most of these objects are relatively nearby, with the peak of the distribution located at ∼150 pc, and a long tail toward larger distances (truncated in the figure).The bottom panel displays the distribution of the renormalized unit weight error (RUWE) from Gaia, which is regarded as a useful measure of the quality and reliability of the astrometric solution 47 . RUWE values for well-behaved solutions cluster around 1.0, whereas values larger than about 1.4 typically indicate unmodeled excess scatter in the astrometric observations that can be caused, e.g., by binarity of the source or perhaps other effects. Gaia does not spatially resolve any of the EBs in this catalog; it only measures the center of light. In many cases, the binary motion will cause the flux centroid to wobble enough to be detected by Gaia, perturbing the astrometric solution.Stassun & Torres (2021) have shown that even in the 1.0–1.4 range the RUWE values tend to correlate with the semimajor axis of the photocentric motion. For the binaries in the catalog, we do not have sufficient information to estimate the semimajor axis of the photocenter. However, we can place the EB sample into the broader context of typical RUWE values for Gaia stars across the HR diagram. This is shown in Figure 19, where the left panel represents the RUWE in grayscale for the ∼1 million Gaia EDR3 stars within 400 pc having 5 < G < 19 and ∣ b∣ > 60°, and the right panel overlays in green the EB sample. In both panels, a piecewise linear fit is shown over the "spine" of the single-star main sequence, for which the RUWE values are at a minimum (dark gray). Various populations with relatively high binary star fractions appear above and below the single-star main sequence (lighter shades of gray), as described in detail by Belokurov et al. (2020).Relative to the single-star main sequence, the EBs identified in this paper are in almost all cases displaced upward/redward, as expected for the combined light of pairs of stars having various relative luminosities and colors. And relative to the minimal RUWE values typical of the single-star main sequence, the EBs occupy regions that are characterized by higher RUWE values on average.We can also use the recent work of Belokurov et al. (2020) to assess the likely nature and evolutionary states of the EBs. In Figure 20, we represent the EB sample in the color–magnitude diagram with theoretical evolutionary tracks overlaid to provide a sense of the mass range of the systems, as well as boxes with annotations indicating EBs in evolutionary states that may be of particular interest for various astrophysical applications (see, for example, the work on extreme horizontal branch stars by Sahoo et al. 2020 and Baran et al. 2021).Finally, it is now well established that short-period binary stars are very frequently found in hierarchical triple systems, with a tertiary companion on a much larger orbit around the tight binary (see, e.g., Tokovinin et al. 2006; Laos et al. 2020, and references therein). We cross matched our EB sample with the catalog of wide binaries in Gaia EDR3 from El-Badry et al. (2021), to assess if any of the EBs are in fact members of hierarchical triples.We cross-matched our EB sample with the catalog of wide binaries in Gaia EDR3 from El-Badry et al. (2021), to assess if any of the EBs are in fact members of hierarchical triples. This yielded 711 wide binary candidates in which at least one component is in our TESS EB sample. Of these, 665 have greater than 90% probability of being gravitationally bound (Rchance < 0.1). The TESS EB is the brighter (fainter) component in 632 (64) of the high-probability binaries. Thirty-one high-probability binaries have both components in our TESS EB sample. We defer further analysis of this sample to future work.
  • Use calibrated 2 minute cadence targets for FFI extraction. There are several community-led projects that focus on light-curve extraction from TESS FFIs. The tools and the data products stemming from those projects have been released to the public; we base our extractions on an adapted version of eleanor (Feinstein et al. 2019). We use the EBs observed with the 2 minute cadence as the training set for the back-propagating neural network that will sift through all FFI light curves and find those that resemble EBs. The amount of cyan in Figure 1 clearly showcases why manual vetting is not tractable and why we must resort to automated techniques. We will extract FFI light curves for each of the 2 minute cadence EBs, formulate a fidelity metric based on the comparison between FFI-extracted and 2 minutes cadence light curves that can be inspected manually, train a classifying, back-propagating neural network on the FFI-extracted light curves of the 2 minute cadence EBs, and deploy the network on all FFI light curves to automatically detect EBs lurking in the images.
  • Analyze contents of a catalog of EBs extracted from the FFIs. Several of our co-authors have constructed all light curves from TESS FFIs up to 15th magnitude, resulting in more than 72 million light curves from sectors 1–26. The authors constructed a one-dimensional convolutional neural network, first described in Powell et al. (2021), to extract candidate EBs from these light curves, resulting in a catalog of over 460,000 sources, of which we expect 250,000 to be true sources (i.e., not due to light-curve contamination). The coordinates of these candidates are shown in Figure 21. Among these candidates were found several interesting multiple-star systems (Kostov et al. 2021b; Powell et al. 2021), with many others currently being researched. The greatest value of this effort, however, may indeed lie in the use of the full catalog of candidates. Cross matching with other catalogs could identify higher-order multiples or binaries containing particular stellar types. Statistical analysis of the catalog could also provide tremendous insight into the growing field of EB research.

Figure 17.

Figure 17. Examples of single-transit EBs, TIC 190630888 and TIC 198385324.

Standard image High-resolution image
Figure 18.

Figure 18. Distributions of the Gaia EDR3 brightness, color, distance, and RUWE for the eclipsing binaries in the catalog. The histograms are truncated on the right (and some on the left) for clarity.

Standard image High-resolution image
Figure 19.

Figure 19. The color–magnitude diagram in the Gaia passbands. The grayscale denotes the renormalized unit weight error (RUWE) parameter for ∼1 million Gaia EDR3 targets within ∼400 pc. Left: Gaia-only targets, with the piecewise linear line showing the fit to the lowest RUWE values. Right: TESS EBs plotted over Gaia targets; most EBs are displaced to the brighter, redder region of the CMD.

Standard image High-resolution image
Figure 20.

Figure 20. EB sample in the color–magnitude diagram with theoretical evolutionary tracks spanning 1–10 solar masses and boxes indicating evolutionary states of particular interest (see, e.g., Belokurov et al. 2020), including white dwarfs (WD), hot subdwarfs (sd), extreme horizontal branch (EHB) stars, RS CVn systems, and a remarkably large number of bright giants (BGs; i.e., giant stars of intermediate mass).

Standard image High-resolution image
Figure 21.

Figure 21. Galactic coordinates of each of the 460,000+ EB candidates found using a neural network on all TESS light curves up to 15th magnitude in sectors 1–26.

Standard image High-resolution image

The catalog of EBs observed by TESS in short cadence is thus only the first step in the exploration of the physical, dynamical, and statistical properties of EBs across the entire sky. The catalog will continue to be updated with new sectors of data as they become available and it will be broadened to include all EB sources detected in the TESS full-frame images. For the latest version of the catalog, please refer to http://tessEBs.villanova.edu.

This paper makes use of data collected by the TESS mission, which are publicly available from the Mikulski Archive for Space Telescopes (MAST). Funding for the TESS mission is provided by NASA's Science Mission directorate. We acknowledge the use of public TESS data from pipelines at the TESS Science Office and at the TESS Science Processing Operations Center. Resources supporting this work were provided by the NASA High-End Computing (HEC) Program through the NASA Advanced Supercomputing (NAS) Division at Ames Research Center for the production of the SPOC data products.

A.P. acknowledges support from NASA TESS GI programs 18-TESS2-19-0032 and 20-TESS20-0024. L.I.J. acknowledges funding from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation program (grant agreement No. 670519: MAMSIE) and from the Research Foundation Flanders (FWO) under grant agreement 1124321N (Aspirant Fellowship). C.J. has received funding from NOVA, the European Research Council under the European Union's Horizon 2020 research and innovation program (No. 670519: MAMSIE), and from the Research Foundation Flanders under grant agreement G0A2917N (BlackGEM). K.H. would like to thank NASA for their continued support through NASA ADAP program 80NSSC19K0594. We sincerely thank Kareem El-Badry for his help in comparing our catalog with his Gaia Wide Binary catalog.

This work makes use of Python (Python Software Foundation. Available at www.python.org) and the Python packages Django (Django Software Foundation 2020), Numpy (Harris et al. 2020), Numba (Lam et al. 2015), Scipy (Virtanen et al. 2020), and Matplotlib (Hunter 2007).

Footnotes

Please wait… references are loading.
10.3847/1538-4365/ac324a