Implementing the Palomar Transient Factory Real-Time Detection Pipeline in GLADE: Results and Observations

Rusu, Florin; Nugent, Peter; Wu, Kesheng

doi:10.1007/978-3-319-05693-7_4


Part of the book series: Lecture Notes in Computer Science (LNISA, volume 8381)

Included in the following conference series:

International Workshop on Databases in Networked Information Systems


Abstract

The Palomar Transient Factory is a comprehensive detection system for the identification and classification of transient astrophysical objects. The central piece of the identification pipeline is an automated classifier that distinguishes between real and bogus objects with high accuracy. Because the classifier has to identify the most significant transients out of a large number of candidates in near real-time, its response time is of critical importance. In this paper, we present an experimental study that evaluates a novel implementation of the classifier in GLADE, a parallel data processing system that combines the efficiency of a database with the extensibility of Map-Reduce. We show how each stage in the classifier (candidate identification, pruning, and contextual realbogus) maps optimally into GLADE tasks by taking advantage of the unique features of the system: range-based data partitioning, columnar storage, multi-query execution, and in-database support for complex aggregate computation. The result is an efficient classifier implementation capable of processing a new set of acquired images in a matter of minutes, even on a low-end server. For comparison, an optimized PostgreSQL implementation of the classifier takes hours on the same machine.
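As a rough illustration of two of the features named above, range-based data partitioning and aggregate computation over partitions, the following Python sketch splits candidates into ranges by position and combines per-partition partial results. It is not GLADE code and does not reflect the paper's implementation: the record layout `(position, score)`, the class `CountAboveThreshold`, and the functions `range_partition` and `run_aggregate` are all hypothetical names chosen for this example.

```python
from bisect import bisect_right
from dataclasses import dataclass


@dataclass
class CountAboveThreshold:
    """Hypothetical mergeable aggregate: counts candidates whose score passes a cutoff."""
    threshold: float
    count: int = 0

    def accumulate(self, score: float) -> None:
        # Called once per candidate within a single partition.
        if score >= self.threshold:
            self.count += 1

    def merge(self, other: "CountAboveThreshold") -> None:
        # Combines partial states computed independently on different partitions.
        self.count += other.count


def range_partition(candidates, boundaries):
    """Split (position, score) pairs into len(boundaries) + 1 ranges by position."""
    parts = [[] for _ in range(len(boundaries) + 1)]
    for position, score in candidates:
        # bisect_right picks the index of the range that contains this position.
        parts[bisect_right(boundaries, position)].append((position, score))
    return parts


def run_aggregate(candidates, boundaries, threshold):
    """Aggregate each partition separately, then merge the partial states."""
    partials = []
    for part in range_partition(candidates, boundaries):
        state = CountAboveThreshold(threshold)
        for _, score in part:
            state.accumulate(score)
        partials.append(state)
    total = CountAboveThreshold(threshold)
    for state in partials:
        total.merge(state)
    return total.count
```

Because each partition only needs its own state and a `merge` step, the per-partition loops in this sketch could run in parallel; here they run sequentially to keep the example short.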



Author information

Authors and Affiliations

University of California, Merced, CA, 95343, USA
Florin Rusu
Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
Peter Nugent & Kesheng Wu


Editor information

Editors and Affiliations

University of Aizu, Aizu Wakamatsu Shi, 965-8580, Fukushima Ken, Japan
Aastha Madaan
University of Aizu, Aizu Wakamatsu, 965-8580, Fukushima, Japan
Shinji Kikuchi
Graduate Department of Computer and Information Systems, University of Aizu, Ikki Machi, Aizu-Wakamatsu, 965-8580, Fukushima, Japan
Subhash Bhalla

About this paper

Cite this paper

Rusu, F., Nugent, P., Wu, K. (2014). Implementing the Palomar Transient Factory Real-Time Detection Pipeline in GLADE: Results and Observations. In: Madaan, A., Kikuchi, S., Bhalla, S. (eds) Databases in Networked Information Systems. DNIS 2014. Lecture Notes in Computer Science, vol 8381. Springer, Cham. https://doi.org/10.1007/978-3-319-05693-7_4


DOI: https://doi.org/10.1007/978-3-319-05693-7_4
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-05692-0
Online ISBN: 978-3-319-05693-7
eBook Packages: Computer Science, Computer Science (R0)
