Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–6 of 6 results for author: Veiga, M H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.04929  [pdf, other

    cs.CL cs.AI cs.DL cs.LG

    An Interdisciplinary Outlook on Large Language Models for Scientific Research

    Authors: James Boyko, Joseph Cohen, Nathan Fox, Maria Han Veiga, Jennifer I-Hsiu Li, Jing Liu, Bernardo Modenesi, Andreas H. Rauch, Kenneth N. Reid, Soumi Tribedi, Anastasia Visheratina, Xin Xie

    Abstract: In this paper, we describe the capabilities and constraints of Large Language Models (LLMs) within disparate academic disciplines, aiming to delineate their strengths and limitations with precision. We examine how LLMs augment scientific inquiry, offering concrete examples such as accelerating literature review by summarizing vast numbers of publications, enhancing code development through automat… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

  2. arXiv:2303.12785  [pdf, other

    cs.LG cs.AI

    Matryoshka Policy Gradient for Entropy-Regularized RL: Convergence and Global Optimality

    Authors: François Ged, Maria Han Veiga

    Abstract: A novel Policy Gradient (PG) algorithm, called Matryoshka Policy Gradient (MPG), is introduced and studied, in the context of max-entropy reinforcement learning, where an agent aims at maximising entropy bonuses additional to its cumulative rewards. MPG differs from standard PG in that it trains a sequence of policies to learn finite horizon tasks simultaneously, instead of a single policy for the… ▽ More

    Submitted 25 June, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    MSC Class: 68T07 ACM Class: I.2.0; I.2.6

  3. arXiv:2107.09082  [pdf, other

    astro-ph.CO astro-ph.IM cs.LG

    Reconstruction of the Density Power Spectrum from Quasar Spectra using Machine Learning

    Authors: Maria Han Veiga, Xi Meng, Oleg Y. Gnedin, Nickolay Y. Gnedin, Xun Huan

    Abstract: We describe a novel end-to-end approach using Machine Learning to reconstruct the power spectrum of cosmological density perturbations at high redshift from observed quasar spectra. State-of-the-art cosmological simulations of structure formation are used to generate a large synthetic dataset of line-of-sight absorption spectra paired with 1-dimensional fluid quantities along the same line-of-sigh… ▽ More

    Submitted 19 July, 2021; originally announced July 2021.

    Comments: 10 pages, 9 figures

  4. arXiv:1901.02646  [pdf, other

    cs.CL

    What do Language Representations Really Represent?

    Authors: Johannes Bjerva, Robert Östling, Maria Han Veiga, Jörg Tiedemann, Isabelle Augenstein

    Abstract: A neural language model trained on a text corpus can be used to induce distributed representations of words, such that similar words end up with similar representations. If the corpus is multilingual, the same model can be used to learn distributed representations of languages, such that similar languages end up with similar representations. We show that this holds even when the multilingual corpu… ▽ More

    Submitted 9 January, 2019; originally announced January 2019.

    Comments: 8 pages, accepted for publication in Computational Linguistics (squib)

  5. A Cross-Platform Collection of Social Network Profiles

    Authors: Maria Han Veiga, Carsten Eickhoff

    Abstract: The proliferation of Internet-enabled devices and services has led to a shifting balance between digital and analogue aspects of our everyday lives. In the face of this development there is a growing demand for the study of privacy hazards, the potential for unique user de-anonymization and information leakage between the various social media profiles many of us maintain. To enable the structured… ▽ More

    Submitted 12 July, 2016; originally announced July 2016.

    Comments: 4 pages, 5 figures, SIGIR 2016, short paper. SIGIR 2016 Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval

  6. arXiv:1607.02714  [pdf, other

    cs.SI cs.CY

    Privacy Leakage through Innocent Content Sharing in Online Social Networks

    Authors: Maria Han Veiga, Carsten Eickhoff

    Abstract: The increased popularity and ubiquitous availability of online social networks and globalised Internet access have affected the way in which people share content. The information that users willingly disclose on these platforms can be used for various purposes, from building consumer models for advertising, to inferring personal, potentially invasive, information. In this work, we use Twitter, Ins… ▽ More

    Submitted 10 July, 2016; originally announced July 2016.

    Comments: 8 pages, 10 figures, submitted to Privacy Preserving Workshop, Sigir