Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–9 of 9 results for author: Grezes, F

.
  1. arXiv:2405.10725  [pdf, other

    cs.CL cs.IR

    INDUS: Effective and Efficient Language Models for Scientific Applications

    Authors: Bishwaranjan Bhattacharjee, Aashka Trivedi, Masayasu Muraoka, Muthukumaran Ramasubramanian, Takuma Udagawa, Iksha Gurung, Rong Zhang, Bharath Dandala, Rahul Ramachandran, Manil Maskey, Kaylin Bugbee, Mike Little, Elizabeth Fancher, Lauren Sanders, Sylvain Costes, Sergi Blanco-Cuaresma, Kelly Lockhart, Thomas Allen, Felix Grezes, Megan Ansdell, Alberto Accomazzi, Yousef El-Kurdi, Davis Wertheimer, Birgit Pfitzmann, Cesar Berrospi Ramis , et al. (9 additional authors not shown)

    Abstract: Large language models (LLMs) trained on general domain corpora showed remarkable results on natural language processing (NLP) tasks. However, previous research demonstrated LLMs trained using domain-focused corpora perform better on specialized tasks. Inspired by this pivotal insight, we developed INDUS, a comprehensive suite of LLMs tailored for the Earth science, biology, physics, heliophysics,… ▽ More

    Submitted 20 May, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

  2. arXiv:2312.14211  [pdf, ps, other

    cs.CL astro-ph.IM cs.AI

    Experimenting with Large Language Models and vector embeddings in NASA SciX

    Authors: Sergi Blanco-Cuaresma, Ioana Ciucă, Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Kelly E. Lockhart, Felix Grezes, Thomas Allen, Golnaz Shapurian, Carolyn S. Grant, Donna M. Thompson, Timothy W. Hostetler, Matthew R. Templeton, Shinyi Chen, Jennifer Koch, Taylor Jacovich, Daniel Chivvis, Fernanda de Macedo Alves, Jean-Claude Paquin, Jennifer Bartlett, Mugdha Polimera, Stephanie Jarmak

    Abstract: Open-source Large Language Models enable projects such as NASA SciX (i.e., NASA ADS) to think out of the box and try alternative approaches for information retrieval and data augmentation, while respecting data copyright and users' privacy. However, when large language models are directly prompted with questions without any context, they are prone to hallucination. At NASA SciX we have developed a… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: To appear in the proceedings of the 33th annual international Astronomical Data Analysis Software & Systems (ADASS XXXIII)

  3. arXiv:2306.07796  [pdf, other

    cs.LG cs.CR

    Finite Gaussian Neurons: Defending against adversarial attacks by making neural networks say "I don't know"

    Authors: Felix Grezes

    Abstract: Since 2014, artificial neural networks have been known to be vulnerable to adversarial attacks, which can fool the network into producing wrong or nonsensical outputs by making humanly imperceptible alterations to inputs. While defenses against adversarial attacks have been proposed, they usually involve retraining a new neural network from scratch, a costly task. In this work, I introduce the Fin… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: PhD thesis

  4. arXiv:2212.00744  [pdf, ps, other

    cs.CL astro-ph.IM

    Improving astroBERT using Semantic Textual Similarity

    Authors: Felix Grezes, Thomas Allen, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Golnaz Shapurian, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Shinyi Chen, Jennifer Koch, Taylor Jacovich, Pavlos Protopapas

    Abstract: The NASA Astrophysics Data System (ADS) is an essential tool for researchers that allows them to explore the astronomy and astrophysics scientific literature, but it has yet to exploit recent advances in natural language processing. At ADASS 2021, we introduced astroBERT, a machine learning language model tailored to the text used in astronomy papers in ADS. In this work we: - announce the first… ▽ More

    Submitted 29 November, 2022; originally announced December 2022.

  5. arXiv:2202.00777  [pdf, ps, other

    cs.HC astro-ph.IM

    Web accessibility trends and implementation in dynamic web applications

    Authors: Timothy W. Hostetler, Shinyi Chen, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Carolyn S. Grant, Edwin Henneken, Donna M. Thompson, Roman Chyla, Golnaz Shapurian, Matthew R. Templeton, Kelly E. Lockhart, Nemanja Martinovic, Stephen McDonald, Felix Grezes

    Abstract: The NASA Astrophysics Data System (ADS), a critical research service for the astrophysics community, strives to provide the most accessible and inclusive environment for the discovery and exploration of the astronomical literature. Part of this goal involves creating a digital platform that can accommodate everybody, including those with disabilities that would benefit from alternative ways to pre… ▽ More

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: Submitted to ADASS XXXI (2021)

  6. arXiv:2112.00590  [pdf, ps, other

    cs.CL astro-ph.IM

    Building astroBERT, a language model for Astronomy & Astrophysics

    Authors: Felix Grezes, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Golnaz Shapurian, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Stephen McDonald, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Nemanja Martinovic, Shinyi Chen, Chris Tanner, Pavlos Protopapas

    Abstract: The existing search tools for exploring the NASA Astrophysics Data System (ADS) can be quite rich and empowering (e.g., similar and trending operators), but researchers are not yet allowed to fully leverage semantic search. For example, a query for "results from the Planck mission" should be able to distinguish between all the various meanings of Planck (person, mission, constant, institutions and… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

  7. arXiv:2012.03388  [pdf, other

    cs.SD cs.LG eess.AS

    Combining Spatial Clustering with LSTM Speech Models for Multichannel Speech Enhancement

    Authors: Felix Grezes, Zhaoheng Ni, Viet Anh Trinh, Michael Mandel

    Abstract: Recurrent neural networks using the LSTM architecture can achieve significant single-channel noise reduction. It is not obvious, however, how to apply them to multi-channel inputs in a way that can generalize to new microphone configurations. In contrast, spatial clustering techniques can achieve such generalization, but lack a strong signal model. This paper combines the two approaches to attain… ▽ More

    Submitted 2 December, 2020; originally announced December 2020.

    Comments: arXiv admin note: text overlap with arXiv:2012.01576, arXiv:2012.02191

  8. arXiv:2012.02191  [pdf, other

    cs.SD cs.LG eess.AS

    Improved MVDR Beamforming Using LSTM Speech Models to Clean Spatial Clustering Masks

    Authors: Zhaoheng Ni, Felix Grezes, Viet Anh Trinh, Michael I. Mandel

    Abstract: Spatial clustering techniques can achieve significant multi-channel noise reduction across relatively arbitrary microphone configurations, but have difficulty incorporating a detailed speech/noise model. In contrast, LSTM neural networks have successfully been trained to recognize speech from noise on single-channel inputs, but have difficulty taking full advantage of the information in multi-chan… ▽ More

    Submitted 2 December, 2020; originally announced December 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:2012.01576

  9. arXiv:2012.01576  [pdf, other

    cs.SD cs.LG eess.AS

    Enhancement of Spatial Clustering-Based Time-Frequency Masks using LSTM Neural Networks

    Authors: Felix Grezes, Zhaoheng Ni, Viet Anh Trinh, Michael Mandel

    Abstract: Recent works have shown that Deep Recurrent Neural Networks using the LSTM architecture can achieve strong single-channel speech enhancement by estimating time-frequency masks. However, these models do not naturally generalize to multi-channel inputs from varying microphone configurations. In contrast, spatial clustering techniques can achieve such generalization but lack a strong signal model. Ou… ▽ More

    Submitted 2 December, 2020; originally announced December 2020.