Showing 1–50 of 59 results for author: Alonso, G

  1. arXiv:2407.00839  [pdf, ps, other]

    cs.DC cs.NI cs.OS

    Imaginary Machines: A Serverless Model for Cloud Applications

    Authors: Michael Wawrzoniak, Rodrigo Bruno, Ana Klimovic, Gustavo Alonso

    Abstract: Serverless Function-as-a-Service (FaaS) platforms provide applications with resources that are highly elastic, quick to instantiate, accounted at fine granularity, and without the need for explicit runtime resource orchestration. This combination of the core properties underpins the success and popularity of the serverless FaaS paradigm. However, these benefits are not available to most cloud appl…

    Submitted 30 June, 2024; originally announced July 2024.

  2. arXiv:2407.00832  [pdf, other]

    cs.DC cs.NI cs.OS

    Boxer: FaaSt Ephemeral Elasticity for Off-the-Shelf Cloud Applications

    Authors: Michael Wawrzoniak, Rodrigo Bruno, Ana Klimovic, Gustavo Alonso

    Abstract: Elasticity is a key property of cloud computing. However, elasticity is offered today at the granularity of virtual machines, which take tens of seconds to start. This is insufficient to react to load spikes and sudden failures in latency sensitive applications, leading users to resort to expensive overprovisioning. Function-as-a-Service (FaaS) provides significantly higher elasticity than VMs, bu…

    Submitted 30 June, 2024; originally announced July 2024.

  3. arXiv:2406.12385  [pdf, other]

    cs.AR

    Accelerating Graph-based Vector Search via Delayed-Synchronization Traversal

    Authors: Wenqi Jiang, Hang Hu, Torsten Hoefler, Gustavo Alonso

    Abstract: Vector search systems are indispensable in large language model (LLM) serving, search engines, and recommender systems, where minimizing online search latency is essential. Among various algorithms, graph-based vector search (GVS) is particularly popular due to its high search performance and quality. To efficiently serve low-latency GVS, we propose a hardware-algorithm co-design solution includin…

    Submitted 18 June, 2024; originally announced June 2024.

  4. arXiv:2403.15839  [pdf, other]

    cs.LG cs.DB cs.DC

    TablePuppet: A Generic Framework for Relational Federated Learning

    Authors: Lijie Xu, Chulin Xie, Yiran Guo, Gustavo Alonso, Bo Li, Guoliang Li, Wei Wang, Wentao Wu, Ce Zhang

    Abstract: Current federated learning (FL) approaches view decentralized training data as a single table, divided among participants either horizontally (by rows) or vertically (by columns). However, these approaches are inadequate for handling distributed relational tables across databases. This scenario requires intricate SQL operations like joins and unions to obtain the training data, which is either cos…

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: 14 pages, 8 figures

  5. arXiv:2402.15158  [pdf, ps, other]

    math.AG

    General infinitesimal variations of Hodge structure of ample curves in surfaces

    Authors: Víctor González Alonso, Sara Torelli

    Abstract: Given a smooth projective complex curve inside a smooth projective surface, one can ask how its Hodge structure varies when the curve moves inside the surface. In this paper we develop a general theory to study the infinitesimal version of this question in the case of ample curves. We can then apply the machinery to show that the infinitesimal variation of Hodge structure of a general deformation…

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: 30 pages. Comments Welcome!

    MSC Class: 14D07; 32G10

  6. EUSO-SPB1 Mission and Science

    Authors: JEM-EUSO Collaboration: G. Abdellaoui, S. Abe, J. H. Adams. Jr., D. Allard, G. Alonso, L. Anchordoqui, A. Anzalone, E. Arnone, K. Asano, R. Attallah, H. Attoui, M. Ave Pernas, R. Bachmann, S. Bacholle, M. Bagheri, M. Bakiri, J. Baláz, D. Barghini, S. Bartocci, M. Battisti, J. Bayer, B. Beldjilali, T. Belenguer , et al. (271 additional authors not shown)

    Abstract: The Extreme Universe Space Observatory on a Super Pressure Balloon 1 (EUSO-SPB1) was launched in 2017 April from Wanaka, New Zealand. The plan of this mission of opportunity on a NASA super pressure balloon test flight was to circle the southern hemisphere. The primary scientific goal was to make the first observations of ultra-high-energy cosmic-ray extensive air showers (EASs) by looking down on…

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: 18 pages, 19 figures

    Journal ref: Astropart Phys 154 (2024) 102891

  7. CXL and the Return of Scale-Up Database Engines

    Authors: Alberto Lerner, Gustavo Alonso

    Abstract: The growing trend towards specialization has led to a proliferation of accelerators and alternative processing devices. When embedded in conventional computer architectures, the PCIe link connecting the CPU to these devices becomes a bottleneck. Several proposals for alternative designs have been put forward, with these efforts having now converged into the Compute Express Link (CXL) specification…

    Submitted 2 January, 2024; originally announced January 2024.

  8. arXiv:2312.11742  [pdf, other]

    cs.DC cs.AR cs.LG cs.NI

    ACCL+: an FPGA-Based Collective Engine for Distributed Applications

    Authors: Zhenhao He, Dario Korolija, Yu Zhu, Benjamin Ramhorst, Tristan Laan, Lucian Petrica, Michaela Blott, Gustavo Alonso

    Abstract: FPGAs are increasingly prevalent in cloud deployments, serving as Smart NICs or network-attached accelerators. Despite their potential, developing distributed FPGA-accelerated applications remains cumbersome due to the lack of appropriate infrastructure and communication abstractions. To facilitate the development of distributed applications with FPGAs, in this paper we propose ACCL+, an open-sour…

    Submitted 18 December, 2023; originally announced December 2023.

  9. arXiv:2312.03862  [pdf, other]

    quant-ph

    An inductive bias from quantum mechanics: learning order effects with non-commuting measurements

    Authors: Kaitlin Gili, Guillermo Alonso, Maria Schuld

    Abstract: There are two major approaches to building good machine learning algorithms: feeding lots of data into large models, or picking a model class with an "inductive bias" that suits the structure of the data. When taking the second approach as a starting point to design quantum algorithms for machine learning, it is important to understand how mathematical structures in quantum mechanics can lead to…

    Submitted 6 December, 2023; originally announced December 2023.

  10. arXiv:2312.00720  [pdf, other]

    cs.DB

    Efficiently Processing Large Relational Joins on GPUs

    Authors: Bowen Wu, Dimitrios Koutsoukos, Gustavo Alonso

    Abstract: With the growing interest in Machine Learning (ML), Graphic Processing Units (GPUs) have become key elements of any computing infrastructure. Their widespread deployment in data centers and the cloud raises the question of how to use them beyond ML use cases, with growing interest in employing them in a database context. In this paper, we explore and analyze the implementation of relational joins…

    Submitted 1 December, 2023; originally announced December 2023.

  11. arXiv:2310.17726  [pdf, other]

    hep-ex hep-ph

    Post-LS3 Experimental Options in ECN3

    Authors: C. Ahdida, G. Arduini, K. Balazs, H. Bartosik, J. Bernhard, A. Boyarsky, J. Brod, M. Brugger, M. Calviani, A. Ceccucci, A. Crivellin, G. D'Ambrosio, G. De Lellis, B. Döbrich, M. Fraser, R. Franqueira Ximenes, A. Golutvin, M. Gonzalez Alonso, E. Goudzovski, J. -L. Grenard, J. Heeck, J. Jaeckel, R. Jacobsson, Y. Kadi, F. Kahlhoefer , et al. (25 additional authors not shown)

    Abstract: The Experimental Cavern North 3 (ECN3) is an underground experimental cavern on the CERN Prévessin site. ECN3 currently hosts the NA62 experiment, with a physics programme devoted to rare kaon decays and searches of hidden particles approved until Long Shutdown 3 (LS3). Several options are proposed on the longer term in order to make best use of the worldwide unique potential of the high-intensity…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 113 pages, 39 figures

  12. arXiv:2310.09949  [pdf, other]

    cs.LG cs.AI cs.AR cs.CL

    Chameleon: a heterogeneous and disaggregated accelerator system for retrieval-augmented language models

    Authors: Wenqi Jiang, Marco Zeller, Roger Waleffe, Torsten Hoefler, Gustavo Alonso

    Abstract: A Retrieval-Augmented Language Model (RALM) augments a generative language model by retrieving context-specific knowledge from an external database. This strategy facilitates impressive text generation quality even with smaller models, thus reducing orders of magnitude of computational demands. However, RALMs introduce unique system design challenges due to (a) the diverse workload characteristics…

    Submitted 29 November, 2023; v1 submitted 15 October, 2023; originally announced October 2023.

  13. arXiv:2309.16520  [pdf, other]

    cs.DB cs.AR

    SwiftSpatial: Spatial Joins on Modern Hardware

    Authors: Wenqi Jiang, Martin Parvanov, Gustavo Alonso

    Abstract: Spatial joins are among the most time-consuming queries in spatial data management systems. In this paper, we propose SwiftSpatial, a specialized accelerator architecture tailored for spatial joins. SwiftSpatial contains multiple high-performance join units with innovative hybrid parallelism, several efficient memory management units, and an integrated on-chip join scheduler. We prototype SwiftSpa…

    Submitted 28 September, 2023; originally announced September 2023.

  14. arXiv:2306.11182  [pdf, other]

    cs.LG cs.DB cs.IR

    Co-design Hardware and Algorithm for Vector Search

    Authors: Wenqi Jiang, Shigang Li, Yu Zhu, Johannes de Fine Licht, Zhenhao He, Runbin Shi, Cedric Renggli, Shuai Zhang, Theodoros Rekatsinas, Torsten Hoefler, Gustavo Alonso

    Abstract: Vector search has emerged as the foundation for large-scale information retrieval and machine learning systems, with search engines like Google and Bing processing tens of thousands of queries per second on petabyte-scale document datasets by evaluating vector similarities between encoded query texts and web documents. As performance demands for vector search systems surge, accelerated hardware of…

    Submitted 6 July, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 11 pages

  15. Data Processing with FPGAs on Modern Architectures

    Authors: Wenqi Jiang, Dario Korolija, Gustavo Alonso

    Abstract: Trends in hardware, the prevalence of the cloud, and the rise of highly demanding applications have ushered an era of specialization that quickly changes how data is processed at scale. These changes are likely to continue and accelerate in the next years as new technologies are adopted and deployed: smart NICs, smart storage, smart memory, disaggregated storage, disaggregated memory, specialized…

    Submitted 24 June, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

  16. arXiv:2208.09519  [pdf, other]

    cs.DB

    Resource Allocation in Serverless Query Processing

    Authors: Simon Kassing, Ingo Müller, Gustavo Alonso

    Abstract: Data lakes hold a growing amount of cold data that is infrequently accessed, yet require interactive response times. Serverless functions are seen as a way to address this use case since they offer an appealing alternative to maintaining (and paying for) a fixed infrastructure. Recent research has analyzed the potential of serverless for data processing. In this paper, we expand on such work by lo…

    Submitted 19 August, 2022; originally announced August 2022.

  17. arXiv:2208.07124  [pdf, other]

    cs.AR cs.DC

    ECI: a Customizable Cache Coherency Stack for Hybrid FPGA-CPU Architectures

    Authors: Abishek Ramdas, Michael Giardino, Runbin Shi, Adam Turowski, David Cock, Gustavo Alonso, Timothy Roscoe

    Abstract: Unlike other accelerators, FPGAs are capable of supporting cache coherency, thereby turning them into a more powerful architectural option than just a peripheral accelerator. However, most existing deployments of FPGAs are either non-cache coherent or support only an asymmetric design where cache coherency is controlled from the CPU. Taking advantage of a recently released two socket CPU-FPGA arch…

    Submitted 15 August, 2022; originally announced August 2022.

  18. arXiv:2202.06646  [pdf, other]

    cs.DC

    Short-lived Datacenter

    Authors: Michael Wawrzoniak, Ingo Müller, Rodrigo Bruno, Ana Klimovic, Gustavo Alonso

    Abstract: Serverless platforms have attracted attention due to their promise of elasticity, low cost, and fast deployment. Instead of using a fixed virtual machine (VM) infrastructure, which can incur considerable costs to operate and run, serverless platforms support short computations, triggered on demand, with cost proportional to fine-grain function execution time. However, serverless platforms offer a…

    Submitted 14 February, 2022; originally announced February 2022.

  19. arXiv:2201.12246   

    astro-ph.HE

    JEM-EUSO Collaboration contributions to the 37th International Cosmic Ray Conference

    Authors: G. Abdellaoui, S. Abe, J. H. Adams Jr., D. Allard, G. Alonso, L. Anchordoqui, A. Anzalone, E. Arnone, K. Asano, R. Attallah, H. Attoui, M. Ave Pernas, M. Bagheri, J. Baláz, M. Bakiri, D. Barghini, S. Bartocci, M. Battisti, J. Bayer, B. Beldjilali, T. Belenguer, N. Belkhalfa, R. Bellotti, A. A. Belov, K. Benmessai , et al. (267 additional authors not shown)

    Abstract: Compilation of papers presented by the JEM-EUSO Collaboration at the 37th International Cosmic Ray Conference (ICRC), held on July 12-23, 2021 (online) in Berlin, Germany.

    Submitted 28 January, 2022; originally announced January 2022.

    Comments: html page with links to the JEM-EUSO Collaboration papers presented at ICRC-2021, Berlin, Germany

  20. arXiv:2201.12171  [pdf, ps, other]

    cond-mat.stat-mech physics.bio-ph

    Modelling Active Non-Markovian Oscillations

    Authors: Gennaro Tucci, Édgar Roldán, Andrea Gambassi, Roman Belousov, Florian Berger, Rodrigo Gogui Alonso, A. James Hudspeth

    Abstract: Modelling noisy oscillations of active systems is one of the current challenges in physics and biology. Because the physical mechanisms of such processes are often difficult to identify, we propose a linear stochastic model driven by a non-Markovian bistable noise that is capable of generating self-sustained periodic oscillation. We derive analytical predictions for most relevant dynamical and the…

    Submitted 28 January, 2022; originally announced January 2022.

    Comments: 29 pages, 13 figures

    Journal ref: Phys. Rev. Lett. 129, 030603 (2022)

  21. arXiv:2201.08175  [pdf, other]

    quant-ph

    Diagnosing quantum chaos with out-of-time-ordered-correlator quasiprobability in the kicked-top model

    Authors: José Raúl González Alonso, Nathan Shammah, Shahnawaz Ahmed, Franco Nori, Justin Dressel

    Abstract: While classical chaos has been successfully characterized with consistent theories and intuitive techniques, such as with the use of Lyapunov exponents, quantum chaos is still poorly understood, as well as its relation with multi-partite entanglement and information scrambling. We consider a benchmark system, the kicked top model, which displays chaotic behaviour in the classical version, and proc…

    Submitted 20 January, 2022; originally announced January 2022.

    Comments: 12 pages, 8 figures

  22. arXiv:2112.12638  [pdf, other]

    cs.DB

    RumbleML: program the lakehouse with JSONiq

    Authors: Ghislain Fourny, David Dao, Can Berker Cikis, Ce Zhang, Gustavo Alonso

    Abstract: Lakehouse systems have reached in the past few years unprecedented size and heterogeneity and have been embraced by many industry players. However, they are often difficult to use as they lack the declarative language and optimization possibilities of relational engines. This paper introduces RumbleML, a high-level, declarative library integrated into the RumbleDB engine and with the JSONiq langua…

    Submitted 23 December, 2021; originally announced December 2021.

    Comments: 8 pages + references

    MSC Class: 68N99 ACM Class: H.2.3; C.2.4

  23. arXiv:2112.00425  [pdf, other]

    cs.DB

    How to use Persistent Memory in your Database

    Authors: Dimitrios Koutsoukos, Raghav Bhartia, Ana Klimovic, Gustavo Alonso

    Abstract: Persistent or Non Volatile Memory (PMEM or NVM) has recently become commercially available under several configurations with different purposes and goals. Despite the attention to the topic, we are not aware of a comprehensive empirical analysis of existing relational database engines under different PMEM configurations. Such a study is important to understand the performance implications of the v…

    Submitted 1 December, 2021; originally announced December 2021.

  24. From Research to Proof-of-Concept: Analysis of a Deployment of FPGAs on a Commercial Search Engine

    Authors: Fabio Maschi, Gustavo Alonso, Anthony Hock-Koon, Nicolas Bondoux, Teddy Roy, Mourad Boudia, Matteo Casalino

    Abstract: FPGAs are quickly becoming available in the cloud as a one more heterogeneous processing element complementing CPUs and GPUs. There are many reports in the literature showing the potential for FPGAs to accelerate a wide variety of algorithms, which combined with their growing availability, would seem to also indicate a widespread use in many applications. Unfortunately, there is not much published…

    Submitted 20 August, 2021; originally announced August 2021.

  25. arXiv:2106.07102  [pdf, other]

    cs.DB

    Farview: Disaggregated Memory with Operator Off-loading for Database Engines

    Authors: Dario Korolija, Dimitrios Koutsoukos, Kimberly Keeton, Konstantin Taranov, Dejan Milojičić, Gustavo Alonso

    Abstract: Cloud deployments disaggregate storage from compute, providing more flexibility to both the storage and compute layers. In this paper, we explore disaggregation by taking it one step further and applying it to memory (DRAM). Disaggregated memory uses network attached DRAM as a way to decouple memory from CPU. In the context of databases, such a design offers significant advantages in terms of maki…

    Submitted 13 June, 2021; originally announced June 2021.

    Comments: 12 pages

  26. Towards Demystifying Serverless Machine Learning Training

    Authors: Jiawei Jiang, Shaoduo Gan, Yue Liu, Fanlin Wang, Gustavo Alonso, Ana Klimovic, Ankit Singla, Wentao Wu, Ce Zhang

    Abstract: The appeal of serverless (FaaS) has triggered a growing interest on how to use it in data-intensive applications such as ETL, query processing, or machine learning (ML). Several systems exist for training large-scale ML models on top of serverless infrastructures (e.g., AWS Lambda) but with inconclusive results in terms of their performance and relative advantage over "serverful" infrastructures (…

    Submitted 17 May, 2021; originally announced May 2021.

  27. Evaluating Query Languages and Systems for High-Energy Physics Data [Extended Version]

    Authors: Dan Graur, Ingo Müller, Mason Proffitt, Ghislain Fourny, Gordon T. Watts, Gustavo Alonso

    Abstract: In the domain of high-energy physics (HEP), query languages in general and SQL in particular have found limited acceptance. This is surprising since HEP data analysis matches the SQL model well: the data is fully structured and queried using mostly standard operators. To gain insights on why this is the case, we perform a comprehensive analysis of six diverse, general-purpose data processing platf…

    Submitted 30 October, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: This is the extended version of a full paper to appear in PVLDB 15.2 (VLDB 2022)

  28. arXiv:2104.04824  [pdf]

    astro-ph.IM astro-ph.EP

    Ariel: Enabling planetary science across light-years

    Authors: Giovanna Tinetti, Paul Eccleston, Carole Haswell, Pierre-Olivier Lagage, Jérémy Leconte, Theresa Lüftinger, Giusi Micela, Michel Min, Göran Pilbratt, Ludovic Puig, Mark Swain, Leonardo Testi, Diego Turrini, Bart Vandenbussche, Maria Rosa Zapatero Osorio, Anna Aret, Jean-Philippe Beaulieu, Lars Buchhave, Martin Ferus, Matt Griffin, Manuel Guedel, Paul Hartogh, Pedro Machado, Giuseppe Malaguti, Enric Pallé , et al. (293 additional authors not shown)

    Abstract: Ariel, the Atmospheric Remote-sensing Infrared Exoplanet Large-survey, was adopted as the fourth medium-class mission in ESA's Cosmic Vision programme to be launched in 2029. During its 4-year mission, Ariel will study what exoplanets are made of, how they formed and how they evolve, by surveying a diverse sample of about 1000 extrasolar planets, simultaneously in visible and infrared wavelengths.…

    Submitted 10 April, 2021; originally announced April 2021.

    Comments: Ariel Definition Study Report, 147 pages. Reviewed by ESA Science Advisory Structure in November 2020. Original document available at: https://www.cosmos.esa.int/documents/1783156/3267291/Ariel_RedBook_Nov2020.pdf/

    Report number: ESA/SCI(2020)1

  29. arXiv:2010.05894  [pdf, other]

    cs.AR cs.AI cs.IR cs.LG

    MicroRec: Efficient Recommendation Inference by Hardware and Data Structure Solutions

    Authors: Wenqi Jiang, Zhenhao He, Shuai Zhang, Thomas B. Preußer, Kai Zeng, Liang Feng, Jiansong Zhang, Tongxuan Liu, Yong Li, Jingren Zhou, Ce Zhang, Gustavo Alonso

    Abstract: Deep neural networks are widely used in personalized recommendation systems. Unlike regular DNN inference workloads, recommendation inference is memory-bound due to the many random memory accesses needed to lookup the embedding tables. The inference is also heavily constrained in terms of latency because producing a recommendation for a user must be done in about tens of milliseconds. In this pape…

    Submitted 19 February, 2021; v1 submitted 12 October, 2020; originally announced October 2020.

    Comments: Accepted by MLSys'21 (the 4th Conference on Machine Learning and Systems)

  30. arXiv:2007.14394  [pdf, other]

    cs.GR

    Signed Distance Fields Dynamic Diffuse Global Illumination

    Authors: Jinkai Hu, Milo Yip, G. Elias Alonso, Shihao Gu, Xiangjun Tang, Xiaogang Jin

    Abstract: Global Illumination (GI) is of utmost importance in the field of photo-realistic rendering. However, its computation has always been very complex, especially diffuse GI. State of the art real-time GI methods have limitations of different nature, such as light leaking, performance issues, special hardware requirements, noise corruption, bounce number limitations, among others. To overcome these lim…

    Submitted 28 July, 2020; originally announced July 2020.

  31. arXiv:2005.13332  [pdf, other]

    cs.DC

    HyperLogLog Sketch Acceleration on FPGA

    Authors: Amit Kulkarni, Monica Chiosa, Thomas B. Preußer, Kaan Kara, David Sidler, Gustavo Alonso

    Abstract: Data sketches are a set of widely used approximated data summarizing techniques. Their fundamental property is sub-linear memory complexity on the input cardinality, an important aspect when processing streams or data sets with a vast base domain (URLs, IP addresses, user IDs, etc.). Among the many data sketches available, HyperLogLog has become the reference for cardinality counting (how many dis…

    Submitted 20 October, 2020; v1 submitted 24 May, 2020; originally announced May 2020.

    Comments: This paper was accepted as a full paper to FPL 2020. The latest/full version of this paper is available: https://ieeexplore.ieee.org/document/9221525

  32. arXiv:2005.04324  [pdf, other]

    cs.AR

    Benchmarking High Bandwidth Memory on FPGAs

    Authors: Zeke Wang, Hongjing Huang, Jie Zhang, Gustavo Alonso

    Abstract: FPGAs are starting to be enhanced with High Bandwidth Memory (HBM) as a way to reduce the memory bandwidth bottleneck encountered in some applications and to give the FPGA more capacity to deal with application state. However, the performance characteristics of HBM are still not well specified, especially in the context of FPGAs. In this paper, we bridge the gap between nominal specifications and…

    Submitted 8 May, 2020; originally announced May 2020.

  33. arXiv:2004.11080  [pdf, ps, other]

    cs.AR eess.SY

    Using DSP Slices as Content-Addressable Update Queues

    Authors: Thomas B. Preußer, Monica Chiosa, Alexander Weiss, Gustavo Alonso

    Abstract: Content-Addressable Memory (CAM) is a powerful abstraction for building memory caches, routing tables and hazard detection logic. Without a native CAM structure available on FPGA devices, their functionality must be emulated using the structural primitives at hand. Such an emulation causes significant overhead in the consumption of the underlying resources, typically general-purpose fabric and on-…

    Submitted 23 April, 2020; originally announced April 2020.

    Comments: Submitted to FPL 2020

  34. arXiv:2004.03488  [pdf, other]

    cs.DB

    Modularis: Modular Relational Analytics over Heterogeneous Distributed Platforms

    Authors: Dimitrios Koutsoukos, Ingo Müller, Renato Marroquín, Ana Klimovic, Gustavo Alonso

    Abstract: The enormous quantity of data produced every day together with advances in data analytics has led to a proliferation of data management and analysis systems. Typically, these systems are built around highly specialized monolithic operators optimized for the underlying hardware. While effective in the short term, such an approach makes the operators cumbersome to port and adapt, which is increasing…

    Submitted 29 September, 2021; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: Accepted at PVLDB vol. 14

  35. arXiv:2004.01908  [pdf, other]

    cs.DB cs.DC cs.PL

    The Collection Virtual Machine: An Abstraction for Multi-Frontend Multi-Backend Data Analysis

    Authors: Ingo Müller, Renato Marroquín, Dimitrios Koutsoukos, Mike Wawrzoniak, Sabir Akhadov, Gustavo Alonso

    Abstract: Getting the best performance from the ever-increasing number of hardware platforms has been a recurring challenge for data processing systems. In recent years, the advent of data science with its increasingly numerous and complex types of analytics has made this challenge even more difficult. In practice, system designers are overwhelmed by the number of combinations and typically implement only o…

    Submitted 8 April, 2020; v1 submitted 4 April, 2020; originally announced April 2020.

    Comments: This paper is currently under review at DaMoN'20

  36. arXiv:2004.01635  [pdf, other]

    cs.DC cs.AR

    High Bandwidth Memory on FPGAs: A Data Analytics Perspective

    Authors: Kaan Kara, Christoph Hagleitner, Dionysios Diamantopoulos, Dimitris Syrivelis, Gustavo Alonso

    Abstract: FPGA-based data processing in datacenters is increasing in popularity due to the demands of modern workloads and the ensuing necessity for specialization in hardware. Driven by this trend, vendors are rapidly adapting reconfigurable devices to suit data and compute intensive workloads. Inclusion of High Bandwidth Memory (HBM) in FPGA devices is a recent example. HBM promises overcoming the bandwid…

    Submitted 2 April, 2020; originally announced April 2020.

  37. arXiv:2002.02837  [pdf, other]

    hep-ex hep-ph

    Report on the ECFA Early-Career Researchers Debate on the 2020 European Strategy Update for Particle Physics

    Authors: N. Andari, L. Apolinário, K. Augsten, E. Bakos, I. Bellafont, L. Beresford, A. Bethani, J. Beyer, L. Bianchini, C. Bierlich, B. Bilin, K. L. Bjørke, E. Bols, P. A. Brás, L. Brenner, E. Brondolin, P. Calvo, B. Capdevila, I. Cioara, L. N. Cojocariu, F. Collamati, A. de Wit, F. Dordei, M. Dordevic, T. A. du Pree , et al. (96 additional authors not shown)

    Abstract: A group of Early-Career Researchers (ECRs) has been given a mandate from the European Committee for Future Accelerators (ECFA) to debate the topics of the current European Strategy Update (ESU) for Particle Physics and to summarise the outcome in a brief document [1]. A full-day debate with 180 delegates was held at CERN, followed by a survey collecting quantitative input. During the debate, the E…

    Submitted 7 February, 2020; originally announced February 2020.

    Comments: Editors: A. Bethani, E. Brondolin, A. A. Elliot, J. García Pardiñas, G. Gilles, L. Gouskos, E. Gouveia, E. Graverini, N. Hermansson-Truedsson, A. Irles, H. Jansen, K. H. Mankinen, E. Manoni, A. Mathad, J. McFayden, M. Queitsch-Maitland, J. Rembser, E. T. J. Reynolds, R. Schöfbeck, P. Schwendimann, S. Sekmen, P. Sznajder, S. L. Williams, D. Zanzi

    Report number: CERN-OPEN-2020-006

  38. arXiv:2001.10390  [pdf]

    eess.SY

    Diseño de un controlador de ángulo en un balancín

    Authors: Alvarado Moreno, Jose David, Delgadillo Romero, Kevin Andrey, Galvis Reyna, David Enrique, Poblador Parra, Gustavo Alonso, Rodríguez Cortés, César Alejandro

    Abstract: This document describes the design of a PID controller for a rotation pitch plant of a degree freedom. In the controller design, the tuning methods of Aströn Hägglund (AH), Kaiser Chaira (KC) and Kaiser Rajka (KR) will be used, verifying the performance in simulations and in the plant. Finally, the development for the implementation of an analog PID controller through circuits with operational amp…

    Submitted 27 January, 2020; originally announced January 2020.

    Comments: 20 pages

    Journal ref: 978-958-8928-47-0, Proyectronica, 2017

  39. arXiv:1912.08666   

    astro-ph.HE

    Contributions to the 36th International Cosmic Ray Conference (ICRC 2019) of the JEM-EUSO Collaboration

    Authors: G. Abdellaoui, S. Abe, J. H. Adams Jr., A. Ahriche, D. Allard, L. Allen, G. Alonso, L. Anchordoqui, A. Anzalone, Y. Arai, K. Asano, R. Attallah, H. Attoui, M. Ave Pernas, S. Bacholle, M. Bakiri, P. Baragatti, P. Barrillon, S. Bartocci, J. Bayer, B. Beldjilali, T. Belenguer, N. Belkhalfa, R. Bellotti, A. Belov , et al. (287 additional authors not shown)

    Abstract: Compilation of papers presented by the JEM-EUSO Collaboration at the 36th International Cosmic Ray Conference (ICRC), held July 24 through August 1, 2019 in Madison, Wisconsin.

    Submitted 18 December, 2019; originally announced December 2019.

    Comments: links to the 24 papers published in arXiv

    Journal ref: all published in PoS(ICRC2019)

  40. Lambada: Interactive Data Analytics on Cold Data using Serverless Cloud Infrastructure

    Authors: Ingo Müller, Renato Marroquín, Gustavo Alonso

    Abstract: The promise of ultimate elasticity and operational simplicity of serverless computing has recently lead to an explosion of research in this area. In the context of data analytics, the concept sounds appealing, but due to the limitations of current offerings, there is no consensus yet on whether or not this approach is technically and economically viable. In this paper, we identify interactive data…

    Submitted 2 December, 2019; originally announced December 2019.

    Report number: https://doi.org/10.3929/ethz-b-000413183

  41. arXiv:1910.11582  [pdf, other]

    cs.DB

    Rumble: Data Independence for Large Messy Data Sets

    Authors: Ingo Müller, Ghislain Fourny, Stefan Irimescu, Can Berker Cikis, Gustavo Alonso

    Abstract: This paper introduces Rumble, a query execution engine for large, heterogeneous, and nested collections of JSON objects built on top of Apache Spark. While data sets of this type are more and more wide-spread, most existing tools are built around a tabular data model, creating an impedance mismatch for both the engine and the query interface. In contrast, Rumble uses JSONiq, a standardized languag…

    Submitted 19 October, 2020; v1 submitted 25 October, 2019; originally announced October 2019.

    Comments: In revision for PVLDB 14, 2020

    MSC Class: 68N99

    ACM Class: H.2.3; C.2.4

  42. arXiv:1910.09017  [pdf, other]

    cs.DB cs.DC

    Demystifying Graph Databases: Analysis and Taxonomy of Data Organization, System Designs, and Graph Queries

    Authors: Maciej Besta, Robert Gerstenberger, Emanuel Peter, Marc Fischer, Michał Podstawski, Claude Barthels, Gustavo Alonso, Torsten Hoefler

    Abstract: Graph processing has become an important part of multiple areas of computer science, such as machine learning, computational sciences, medical applications, social network analysis, and many others. Numerous graphs such as web or social networks may contain up to trillions of edges. Often, these graphs are also dynamic (their structure changes over time) and have domain-specific rich data associat…

    Submitted 30 August, 2023; v1 submitted 20 October, 2019; originally announced October 2019.

    Comments: A shorter version of the survey was published in the journal ACM Computing Surveys

  43. Strongly measuring qubit quasiprobabilities behind out-of-time-ordered correlators

    Authors: Razieh Mohseninia, José Raúl González Alonso, Justin Dressel

    Abstract: Out-of-time-ordered correlators (OTOCs) have been proposed as a tool to witness quantum information scrambling in many-body system dynamics. These correlators can be understood as averages over nonclassical multi-time quasi-probability distributions (QPDs). These QPDs have more information, and their nonclassical features witness quantum information scrambling in a more nuanced way. However, their…

    Submitted 24 July, 2019; originally announced July 2019.

    Comments: 7 pages, 1 figure

  44. arXiv:1904.03257  [pdf, ps, other]

    cs.LG cs.DB cs.DC cs.SE stat.ML

    MLSys: The New Frontier of Machine Learning Systems

    Authors: Alexander Ratner, Dan Alistarh, Gustavo Alonso, David G. Andersen, Peter Bailis, Sarah Bird, Nicholas Carlini, Bryan Catanzaro, Jennifer Chayes, Eric Chung, Bill Dally, Jeff Dean, Inderjit S. Dhillon, Alexandros Dimakis, Pradeep Dubey, Charles Elkan, Grigori Fursin, Gregory R. Ganger, Lise Getoor, Phillip B. Gibbons, Garth A. Gibson, Joseph E. Gonzalez, Justin Gottschlich, Song Han, Kim Hazelwood, et al. (44 additional authors not shown)

    Abstract: Machine learning (ML) techniques are enjoying rapidly increasing adoption. However, designing and implementing the systems that support ML models in real-world deployments remains a significant obstacle, in large part due to the radically different development and deployment profile of modern ML methods, and the range of practical concerns that come with broader adoption. We propose to foster a ne…

    Submitted 1 December, 2019; v1 submitted 29 March, 2019; originally announced April 2019.

  45. arXiv:1903.11061  [pdf, other]

    astro-ph.IM astro-ph.GA

    The Polarimetric and Helioseismic Imager on Solar Orbiter

    Authors: S. K. Solanki, J. C. del Toro Iniesta, J. Woch, A. Gandorfer, J. Hirzberger, A. Alvarez-Herrero, T. Appourchaux, V. Martínez Pillet, I. Pérez-Grande, E. Sanchis Kilders, W. Schmidt, J. M. Gómez Cama, H. Michalik, W. Deutsch, G. Fernandez-Rico, B. Grauf, L. Gizon, K. Heerlein, M. Kolleck, A. Lagg, R. Meller, R. Müller, U. Schühle, J. Staub, K. Albert, et al. (99 additional authors not shown)

    Abstract: This paper describes the Polarimetric and Helioseismic Imager on the Solar Orbiter mission (SO/PHI), the first magnetograph and helioseismology instrument to observe the Sun from outside the Sun-Earth line. It is the key instrument meant to address the top-level science question: How does the solar dynamo work and drive connections between the Sun and the heliosphere? SO/PHI will also play an impo…

    Submitted 26 March, 2019; originally announced March 2019.

    Comments: 36 pages, 41 figures

    Journal ref: A&A 642, A11 (2020)

  46. Accelerating Generalized Linear Models with MLWeaving: A One-Size-Fits-All System for Any-precision Learning (Technical Report)

    Authors: Zeke Wang, Kaan Kara, Hantian Zhang, Gustavo Alonso, Onur Mutlu, Ce Zhang

    Abstract: Learning from the data stored in a database is an important function increasingly available in relational engines. Methods using lower precision input data are of special interest given their overall higher efficiency but, in databases, these methods have a hidden cost: the quantization of the real value into a smaller number is an expensive step. To address the issue, in this paper we present MLW…

    Submitted 28 March, 2019; v1 submitted 8 March, 2019; originally announced March 2019.

    Comments: 18 pages

    Journal ref: PVLDB, 2019

  47. Pay One, Get Hundreds for Free: Reducing Cloud Costs through Shared Query Execution

    Authors: Renato Marroquín, Ingo Müller, Darko Makreshanski, Gustavo Alonso

    Abstract: Cloud-based data analysis is nowadays common practice because of the lower system management overhead as well as the pay-as-you-go pricing model. The pricing model, however, is not always suitable for query processing as heavy use results in high costs. For example, in query-as-a-service systems, where users are charged per processed byte, collections of queries accessing the same data frequently…

    Submitted 1 September, 2018; originally announced September 2018.

    Journal ref: Proceedings of the ACM Symposium on Cloud Computing (SoCC) 2018, pages 439-450

  48. First observations of speed of light tracks by a fluorescence detector looking down on the atmosphere

    Authors: G. Abdellaoui, S. Abe, J. H. Adams Jr., A. Ahriche, D. Allard, L. Allen, G. Alonso, L. Anchordoqui, A. Anzalone, Y. Arai, K. Asano, R. Attallah, H. Attoui, M. Ave Pernas, S. Bacholle, M. Bakiri, P. Baragatti, P. Barrillon, S. Bartocci, J. Bayer, B. Beldjilali, T. Belenguer, N. Belkhalfa, R. Bellotti, A. Belov, et al. (289 additional authors not shown)

    Abstract: EUSO-Balloon is a pathfinder mission for the Extreme Universe Space Observatory onboard the Japanese Experiment Module (JEM-EUSO). It was launched on the moonless night of the 25th of August 2014 from Timmins, Canada. The flight ended successfully after maintaining the target altitude of 38 km for five hours. One part of the mission was a 2.5 hour underflight using a helicopter equipped with…

    Submitted 7 August, 2018; originally announced August 2018.

    Comments: 18 pages, 14 figures

    Journal ref: 2018 JINST 13 P05023

  49. arXiv:1806.09637  [pdf, other]

    quant-ph cond-mat.dis-nn hep-th

    Out-of-Time-Ordered-Correlator Quasiprobabilities Robustly Witness Scrambling

    Authors: José Raúl González Alonso, Nicole Yunger Halpern, Justin Dressel

    Abstract: Out-of-time-ordered correlators (OTOCs) have received considerable recent attention as qualitative witnesses of information scrambling in many-body quantum systems. Theoretical discussions of OTOCs typically focus on closed systems, raising the question of their suitability as scrambling witnesses in realistic open systems. We demonstrate empirically that the nonclassical negativity of the quasipr…

    Submitted 2 February, 2019; v1 submitted 25 June, 2018; originally announced June 2018.

    Comments: 12 pages, 8 figures; v2: published version

    Journal ref: Phys. Rev. Lett. 122, 040404 (2019)

  50. Strengthening weak measurements of qubit out-of-time-order correlators

    Authors: Justin Dressel, José Raúl González Alonso, Mordecai Waegell, Nicole Yunger Halpern

    Abstract: For systems of controllable qubits, we provide a method for experimentally obtaining a useful class of multitime correlators using sequential generalized measurements of arbitrary strength. Specifically, if a correlator can be expressed as an average of nested (anti)commutators of operators that square to the identity, then that correlator can be determined exactly from the average of a measuremen…

    Submitted 4 October, 2018; v1 submitted 2 May, 2018; originally announced May 2018.

    Comments: 12 pages, 6 figures, published version

    Journal ref: Phys. Rev. A 98, 012132 (2018)