Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 249 results for author: Anand, A

.
  1. arXiv:2406.17158  [pdf, other

    cs.CL cs.IR

    DEXTER: A Benchmark for open-domain Complex Question Answering using LLMs

    Authors: Venktesh V. Deepali Prabhu, Avishek Anand

    Abstract: Open-domain complex Question Answering (QA) is a difficult task with challenges in evidence retrieval and reasoning. The complexity of such questions could stem from questions being compositional, hybrid evidence, or ambiguity in questions. While retrieval performance for classical QA tasks is well explored, their capabilities for heterogeneous complex retrieval tasks, especially in an open-domain… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: under submission, 22 pages

  2. arXiv:2406.15335  [pdf, other

    cs.CV cs.CY

    Keystroke Dynamics Against Academic Dishonesty in the Age of LLMs

    Authors: Debnath Kundu, Atharva Mehta, Rajesh Kumar, Naman Lal, Avinash Anand, Apoorv Singh, Rajiv Ratn Shah

    Abstract: The transition to online examinations and assignments raises significant concerns about academic integrity. Traditional plagiarism detection systems often struggle to identify instances of intelligent cheating, particularly when students utilize advanced generative AI tools to craft their responses. This study proposes a keystroke dynamics-based method to differentiate between bona fide and assist… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Accepted for publication at The IEEE International Joint Conference on Biometrics (IJCB2024), contains 9 pages, 3 figures, 3 tables

    ACM Class: I.5.4

  3. arXiv:2406.13325  [pdf

    physics.app-ph cond-mat.mtrl-sci

    Advances in perovskite nanocrystals and nanocomposites for scintillation applications

    Authors: Abhinav Anand, Matteo L. Zaffalon, Andrea Erroi, Francesca Cova, Francesco Carulli, Sergio Brovelli

    Abstract: In recent years, the field of radiation detection has witnessed a paradigm shift with the emergence of plastic scintillators incorporating perovskite nanocrystals (PNCs). This innovative class of scintillators not only capitalizes on the superior luminescent properties of PNCs but also harnesses the flexibility and processability of polymers. This review explores the intricate landscape of synthes… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Journal ref: ACS Energy Letters 2024

  4. arXiv:2406.12203  [pdf, other

    cs.AI

    InterIntent: Investigating Social Intelligence of LLMs via Intention Understanding in an Interactive Game Context

    Authors: Ziyi Liu, Abhishek Anand, Pei Zhou, Jen-tse Huang, Jieyu Zhao

    Abstract: Large language models (LLMs) have demonstrated the potential to mimic human social intelligence. However, most studies focus on simplistic and static self-report or performance-based tests, which limits the depth and validity of the analysis. In this paper, we developed a novel framework, InterIntent, to assess LLMs' social intelligence by mapping their ability to understand and manage intentions… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  5. arXiv:2406.11930  [pdf, other

    cs.SE cs.AI cs.CL

    A Critical Study of What Code-LLMs (Do Not) Learn

    Authors: Abhinav Anand, Shweta Verma, Krishna Narasimhan, Mira Mezini

    Abstract: Large Language Models trained on code corpora (code-LLMs) have demonstrated impressive performance in various coding assistance tasks. However, despite their increased size and training dataset, code-LLMs still have limitations such as suggesting codes with syntactic errors, variable misuse etc. Some studies argue that code-LLMs perform well on coding tasks because they use self-attention and hidd… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  6. arXiv:2406.09175  [pdf, other

    cs.CV cs.CL

    ReMI: A Dataset for Reasoning with Multiple Images

    Authors: Mehran Kazemi, Nishanth Dikkala, Ankit Anand, Petar Devic, Ishita Dasgupta, Fangyu Liu, Bahare Fatemi, Pranjal Awasthi, Dee Guo, Sreenivas Gollapudi, Ahmed Qureshi

    Abstract: With the continuous advancement of large language models (LLMs), it is essential to create new benchmarks to effectively evaluate their expanding capabilities and identify areas for improvement. This work focuses on multi-image reasoning, an emerging capability in state-of-the-art LLMs. We introduce ReMI, a dataset designed to assess LLMs' ability to Reason with Multiple Images. This dataset encom… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  7. arXiv:2406.08606  [pdf, other

    cs.CL cs.AI

    End-to-End Argument Mining as Augmented Natural Language Generation

    Authors: Nilmadhab Das, Vishal Choudhary, V. Vijaya Saradhi, Ashish Anand

    Abstract: Argument Mining (AM) is a crucial aspect of computational argumentation, which deals with the identification and extraction of Argumentative Components (ACs) and their corresponding Argumentative Relations (ARs). Most prior works have solved these problems by dividing them into multiple subtasks. And the available end-to-end setups are mostly based on the dependency parsing approach. This work pro… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  8. arXiv:2406.02724  [pdf, other

    astro-ph.IM astro-ph.CO physics.ins-det

    The LiteBIRD mission to explore cosmic inflation

    Authors: T. Ghigna, A. Adler, K. Aizawa, H. Akamatsu, R. Akizawa, E. Allys, A. Anand, J. Aumont, J. Austermann, S. Azzoni, C. Baccigalupi, M. Ballardini, A. J. Banday, R. B. Barreiro, N. Bartolo, S. Basak, A. Basyrov, S. Beckman, M. Bersanelli, M. Bortolami, F. Bouchet, T. Brinckmann, P. Campeti, E. Carinos, A. Carones , et al. (134 additional authors not shown)

    Abstract: LiteBIRD, the next-generation cosmic microwave background (CMB) experiment, aims for a launch in Japan's fiscal year 2032, marking a major advancement in the exploration of primordial cosmology and fundamental physics. Orbiting the Sun-Earth Lagrangian point L2, this JAXA-led strategic L-class mission will conduct a comprehensive mapping of the CMB polarization across the entire sky. During its 3-… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 23 pages, 9 figures, 1 table, SPIE Astronomical Telescopes + Instrumentation 2024

  9. arXiv:2405.19288  [pdf, other

    astro-ph.CO astro-ph.IM

    Archetype-Based Redshift Estimation for the Dark Energy Spectroscopic Instrument Survey

    Authors: Abhijeet Anand, Julien Guy, Stephen Bailey, John Moustakas, J. Aguilar, S. Ahlen, A. Bolton, A. Brodzeller, D. Brooks, T. Claybaugh, S. Cole, B. Dey, K. Fanning, J. Forero-Romero, E. Gaztañaga, S. Gontcho A Gontcho, L. Le Guillou, G. Gutierrez, K. Honscheid, C. Howlett, S. Juneau, D. Kirkby, T. Kisner, A. Kremin, A. Lambert , et al. (24 additional authors not shown)

    Abstract: We present a computationally efficient galaxy archetype-based redshift estimation and spectral classification method for the Dark Energy Survey Instrument (DESI) survey. The DESI survey currently relies on a redshift fitter and spectral classifier using a linear combination of PCA-derived templates, which is very efficient in processing large volumes of DESI spectra within a short time frame. Howe… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: submitted to AAS journals, 29 pages, 13 figures

  10. arXiv:2405.17892  [pdf, ps, other

    math.OC

    Data-Driven Predictive Control and MPC: Do we achieve optimality?

    Authors: Akhil S Anand, Shambhuraj Sawant, Dirk Reinhardt, Sebastien Gros

    Abstract: In this paper, we explore the interplay between Predictive Control and closed-loop optimality, spanning from Model Predictive Control to Data-Driven Predictive Control. Predictive Control in general relies on some form of prediction scheme on the real system trajectories. However, these predictions may not accurately capture the real system dynamics, for e.g., due to stochasticity, resulting in su… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  11. arXiv:2405.16593  [pdf, other

    astro-ph.CO

    The Construction of Large-scale Structure Catalogs for the Dark Energy Spectroscopic Instrument

    Authors: A. J. Ross, J. Aguilar, S. Ahlen, S. Alam, A. Anand, S. Bailey, D. Bianchi, S. Brieden, D. Brooks, E. Burtin, A. Carnero Rosell, E. Chaussidon, T. Claybaugh, S. Cole, K. Dawson, A. de la Macorra, A. de Mattia, Arjun Dey, Biprateep Dey, P. Doel, K. Fanning, S. Ferraro, J. Ereza, A. Font-Ribera, J. E. Forero-Romero , et al. (59 additional authors not shown)

    Abstract: We present the technical details on how large-scale structure (LSS) catalogs are constructed from redshifts measured from spectra observed by the Dark Energy Spectroscopic Instrument (DESI). The LSS catalogs provide the information needed to determine the relative number density of DESI tracers as a function of redshift and celestial coordinates and, e.g., determine clustering statistics. We produ… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: Supporting publication of DESI 2024II: Sample definitions, characteristics, and two-point clustering statistics

  12. arXiv:2405.15421  [pdf, other

    cs.LG physics.optics

    Model-free reinforcement learning with noisy actions for automated experimental control in optics

    Authors: Lea Richtmann, Viktoria-S. Schmiesing, Dennis Wilken, Jan Heine, Aaron Tranter, Avishek Anand, Tobias J. Osborne, Michèle Heurs

    Abstract: Experimental control involves a lot of manual effort with non-trivial decisions for precise adjustments. Here, we study the automatic experimental alignment for coupling laser light into an optical fiber using reinforcement learning (RL). We face several real-world challenges, such as time-consuming training, partial observability, and noisy actions due to imprecision in the mirror steering motors… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 10 pages + 10 pages appendices, 3 + 11 figures

    ACM Class: J.2; I.2.1

  13. arXiv:2405.10296  [pdf, other

    cs.FL

    Verifying Unboundedness via Amalgamation

    Authors: Ashwani Anand, Sylvain Schmitz, Lia Schütze, Georg Zetzsche

    Abstract: Well-structured transition systems (WSTS) are an abstract family of systems that encompasses a vast landscape of infinite-state systems. By requiring a well-quasi-ordering (wqo) on the set of states, a WSTS enables generic algorithms for classic verification tasks such as coverability and termination. However, even for systems that are WSTS like vector addition systems (VAS), the framework is noto… ▽ More

    Submitted 20 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: Erratum: Updated test for negative SUP instances in Section 4.1

    ACM Class: F.4.3

  14. arXiv:2405.08314  [pdf, other

    astro-ph.GA

    Probing the impact of radio-mode feedback on the properties of the cool circumgalactic medium

    Authors: Yu-Ling Chang, Ting-Wen Lan, J. Xavier Prochaska, Lucas Napolitano, Abhijeet Anand, J. Aguilar, S. Ahlen, D. Brooks, T. Claybaugh, A. de la Macorra, Arjun Dey, P. Doel, S. Gontcho A Gontcho, J. Guy, S. Juneau, T. Kisner, A. Lambert, M. Landriau, L. Le Guillou, M. Manera, P. Martini, A. Meisner, R. Miquel, J. Moustakas, A. D. Myers , et al. (11 additional authors not shown)

    Abstract: We explore the influence of radio-mode feedback on the properties of the cool circumgalactic medium (CGM). To this end, we assemble a statistical sample of approximately 30,000 radio galaxies with background quasars by combining optical spectroscopic measurements of luminous red galaxies (LRGs) and quasars from the year 1 dataset of Dark Energy Spectroscopic Instrument (DESI) and radio sources fro… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 20 pages, 12 figures

  15. Is Interpretable Machine Learning Effective at Feature Selection for Neural Learning-to-Rank?

    Authors: Lijun Lyu, Nirmal Roy, Harrie Oosterhuis, Avishek Anand

    Abstract: Neural ranking models have become increasingly popular for real-world search and recommendation systems in recent years. Unlike their tree-based counterparts, neural models are much less interpretable. That is, it is very difficult to understand their inner workings and answer questions like how do they make their ranking decisions? or what document features do they find important? This is particu… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: Published at ECIR 2024 as a long paper. 13 pages excl. reference, 20 pages incl. reference

    Journal ref: Advances in Information Retrieval - 46th European Conference on Information Retrieval, {ECIR} 2024, Glasgow, UK, March 24-28, 2024, Proceedings, Part {IV}

  16. Context-Enhanced Language Models for Generating Multi-Paper Citations

    Authors: Avinash Anand, Kritarth Prasad, Ujjwal Goel, Mohit Gupta, Naman Lal, Astha Verma, Rajiv Ratn Shah

    Abstract: Citation text plays a pivotal role in elucidating the connection between scientific documents, demanding an in-depth comprehension of the cited paper. Constructing citations is often time-consuming, requiring researchers to delve into extensive literature and grapple with articulating relevant content. To address this challenge, the field of citation text generation (CTG) has emerged. However, whi… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 14 pages, 7 figures, 11th International Conference, BDA 2023, Delhi, India

    Journal ref: Big Data and Artificial Intelligence 2023, Delhi, India, December 7, 80 94

  17. arXiv:2404.13099  [pdf, other

    cs.CL cs.AI

    Mathify: Evaluating Large Language Models on Mathematical Problem Solving Tasks

    Authors: Avinash Anand, Mohit Gupta, Kritarth Prasad, Navya Singla, Sanjana Sanjeev, Jatin Kumar, Adarsh Raj Shivam, Rajiv Ratn Shah

    Abstract: The rapid progress in the field of natural language processing (NLP) systems and the expansion of large language models (LLMs) have opened up numerous opportunities in the field of education and instructional methods. These advancements offer the potential for tailored learning experiences and immediate feedback, all delivered through accessible and cost-effective services. One notable application… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 10 pages, 3 figures, NeurIPS 2023 Workshop on Generative AI for Education (GAIED)

    Journal ref: NeurIPS 2023 Workshop on Generative AI for Education (GAIED)

  18. arXiv:2404.12926  [pdf, other

    cs.AI

    MM-PhyRLHF: Reinforcement Learning Framework for Multimodal Physics Question-Answering

    Authors: Avinash Anand, Janak Kapuriya, Chhavi Kirtani, Apoorv Singh, Jay Saraf, Naman Lal, Jatin Kumar, Adarsh Raj Shivam, Astha Verma, Rajiv Ratn Shah, Roger Zimmermann

    Abstract: Recent advancements in LLMs have shown their significant potential in tasks like text summarization and generation. Yet, they often encounter difficulty while solving complex physics problems that require arithmetic calculation and a good understanding of concepts. Moreover, many physics problems include images that contain important details required to understand the problem's context. We propose… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  19. arXiv:2404.11018  [pdf, other

    cs.LG cs.AI cs.CL

    Many-Shot In-Context Learning

    Authors: Rishabh Agarwal, Avi Singh, Lei M. Zhang, Bernd Bohnet, Luis Rosias, Stephanie Chan, Biao Zhang, Ankesh Anand, Zaheer Abbas, Azade Nova, John D. Co-Reyes, Eric Chu, Feryal Behbahani, Aleksandra Faust, Hugo Larochelle

    Abstract: Large language models (LLMs) excel at few-shot in-context learning (ICL) -- learning from a few examples provided in context at inference, without any weight updates. Newly expanded context windows allow us to investigate ICL with hundreds or thousands of examples -- the many-shot regime. Going from few-shot to many-shot, we observe significant performance gains across a wide variety of generative… ▽ More

    Submitted 22 May, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  20. TC-OCR: TableCraft OCR for Efficient Detection & Recognition of Table Structure & Content

    Authors: Avinash Anand, Raj Jaiswal, Pijush Bhuyan, Mohit Gupta, Siddhesh Bangar, Md. Modassir Imam, Rajiv Ratn Shah, Shin'ichi Satoh

    Abstract: The automatic recognition of tabular data in document images presents a significant challenge due to the diverse range of table styles and complex structures. Tables offer valuable content representation, enhancing the predictive capabilities of various systems such as search engines and Knowledge Graphs. Addressing the two main problems, namely table detection (TD) and table structure recognition… ▽ More

    Submitted 19 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: 8 pages, 2 figures, Workshop of 1st MMIR Deep Multimodal Learning for Information Retrieval

  21. arXiv:2404.09763  [pdf, other

    cs.CL cs.AI

    KG-CTG: Citation Generation through Knowledge Graph-guided Large Language Models

    Authors: Avinash Anand, Mohit Gupta, Kritarth Prasad, Ujjwal Goel, Naman Lal, Astha Verma, Rajiv Ratn Shah

    Abstract: Citation Text Generation (CTG) is a task in natural language processing (NLP) that aims to produce text that accurately cites or references a cited document within a source document. In CTG, the generated text draws upon contextual cues from both the source document and the cited paper, ensuring accurate and relevant citation information is provided. Previous work in the field of citation generati… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  22. RanLayNet: A Dataset for Document Layout Detection used for Domain Adaptation and Generalization

    Authors: Avinash Anand, Raj Jaiswal, Mohit Gupta, Siddhesh S Bangar, Pijush Bhuyan, Naman Lal, Rajeev Singh, Ritika Jha, Rajiv Ratn Shah, Shin'ichi Satoh

    Abstract: Large ground-truth datasets and recent advances in deep learning techniques have been useful for layout detection. However, because of the restricted layout diversity of these datasets, training on them requires a sizable number of annotated instances, which is both expensive and time-consuming. As a result, differences between the source and target domains may significantly impact how well these… ▽ More

    Submitted 19 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: 8 pages, 6 figures, MMAsia 2023 Proceedings of the 5th ACM International Conference on Multimedia in Asia

    Journal ref: In Proceedings of the 5th ACM International Conference on Multimedia in Asia 2023. Association for Computing Machinery, NY, USA, Article 74, pp. 1-6

  23. arXiv:2404.08704  [pdf, other

    cs.CL cs.AI

    MM-PhyQA: Multimodal Physics Question-Answering With Multi-Image CoT Prompting

    Authors: Avinash Anand, Janak Kapuriya, Apoorv Singh, Jay Saraf, Naman Lal, Astha Verma, Rushali Gupta, Rajiv Shah

    Abstract: While Large Language Models (LLMs) can achieve human-level performance in various tasks, they continue to face challenges when it comes to effectively tackling multi-step physics reasoning tasks. To identify the shortcomings of existing models and facilitate further research in this area, we curated a novel dataset, MM-PhyQA, which comprises well-constructed, high schoollevel multimodal physics pr… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  24. arXiv:2404.03002  [pdf, other

    astro-ph.CO

    DESI 2024 VI: Cosmological Constraints from the Measurements of Baryon Acoustic Oscillations

    Authors: DESI Collaboration, A. G. Adame, J. Aguilar, S. Ahlen, S. Alam, D. M. Alexander, M. Alvarez, O. Alves, A. Anand, U. Andrade, E. Armengaud, S. Avila, A. Aviles, H. Awan, B. Bahr-Kalus, S. Bailey, C. Baltay, A. Bault, J. Behera, S. BenZvi, A. Bera, F. Beutler, D. Bianchi, C. Blake, R. Blum , et al. (178 additional authors not shown)

    Abstract: We present cosmological results from the measurement of baryon acoustic oscillations (BAO) in galaxy, quasar and Lyman-$α$ forest tracers from the first year of observations from the Dark Energy Spectroscopic Instrument (DESI), to be released in the DESI Data Release 1. DESI BAO provide robust measurements of the transverse comoving distance and Hubble rate, or their combination, relative to the s… ▽ More

    Submitted 24 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: This DESI Collaboration Key Publication is part of the 2024 publication series using the first year of observations (see https://data.desi.lbl.gov/doc/papers). Typos corrected and a new figure and discussion added to Appendix A

  25. arXiv:2404.03001  [pdf, other

    astro-ph.CO

    DESI 2024 IV: Baryon Acoustic Oscillations from the Lyman Alpha Forest

    Authors: DESI Collaboration, A. G. Adame, J. Aguilar, S. Ahlen, S. Alam, D. M. Alexander, M. Alvarez, O. Alves, A. Anand, U. Andrade, E. Armengaud, S. Avila, A. Aviles, H. Awan, S. Bailey, C. Baltay, A. Bault, J. Bautista, J. Behera, S. BenZvi, F. Beutler, D. Bianchi, C. Blake, R. Blum, S. Brieden , et al. (174 additional authors not shown)

    Abstract: We present the measurement of Baryon Acoustic Oscillations (BAO) from the Lyman-$α$ (Ly$α$) forest of high-redshift quasars with the first-year dataset of the Dark Energy Spectroscopic Instrument (DESI). Our analysis uses over $420\,000$ Ly$α$ forest spectra and their correlation with the spatial distribution of more than $700\,000$ quasars. An essential facet of this work is the development of a… ▽ More

    Submitted 12 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: This DESI Collaboration Key Publication is part of the 2024 publication series using the first year of observations (see https://data.desi.lbl.gov/doc/papers)

  26. arXiv:2404.03000  [pdf, other

    astro-ph.CO

    DESI 2024 III: Baryon Acoustic Oscillations from Galaxies and Quasars

    Authors: DESI Collaboration, A. G. Adame, J. Aguilar, S. Ahlen, S. Alam, D. M. Alexander, M. Alvarez, O. Alves, A. Anand, U. Andrade, E. Armengaud, S. Avila, A. Aviles, H. Awan, S. Bailey, C. Baltay, A. Bault, J. Behera, S. BenZvi, F. Beutler, D. Bianchi, C. Blake, R. Blum, S. Brieden, A. Brodzeller , et al. (171 additional authors not shown)

    Abstract: We present the DESI 2024 galaxy and quasar baryon acoustic oscillations (BAO) measurements using over 5.7 million unique galaxy and quasar redshifts in the range 0.1<z<2.1. Divided by tracer type, we utilize 300,017 galaxies from the magnitude-limited Bright Galaxy Survey with 0.1<z<0.4, 2,138,600 Luminous Red Galaxies with 0.4<z<1.1, 2,432,022 Emission Line Galaxies with 0.8<z<1.6, and 856,652 qu… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: This DESI Collaboration Key Publication is part of the 2024 publication series using the first year of observations (see https://data.desi.lbl.gov/doc/papers)

  27. arXiv:2404.02587  [pdf, ps, other

    cs.IR cs.AI

    The Surprising Effectiveness of Rankers Trained on Expanded Queries

    Authors: Abhijit Anand, Venktesh V, Vinay Setty, Avishek Anand

    Abstract: An important problem in text-ranking systems is handling the hard queries that form the tail end of the query distribution. The difficulty may arise due to the presence of uncommon, underspecified, or incomplete queries. In this work, we improve the ranking performance of hard or difficult queries without compromising the performance of other queries. Firstly, we do LLM based query enrichment for… ▽ More

    Submitted 12 June, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

  28. arXiv:2403.17169  [pdf, other

    cs.CL cs.AI

    QuanTemp: A real-world open-domain benchmark for fact-checking numerical claims

    Authors: Venktesh V, Abhijit Anand, Avishek Anand, Vinay Setty

    Abstract: Automated fact checking has gained immense interest to tackle the growing misinformation in the digital era. Existing systems primarily focus on synthetic claims on Wikipedia, and noteworthy progress has also been made on real-world claims. In this work, we release QuanTemp, a diverse, multi-domain dataset focused exclusively on numerical claims, encompassing temporal, statistical and diverse aspe… ▽ More

    Submitted 1 May, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: 11 pages, 1 figure,Accepted for publication at the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2024)

  29. arXiv:2403.16763  [pdf, other

    astro-ph.CO

    LiteBIRD Science Goals and Forecasts: Primordial Magnetic Fields

    Authors: D. Paoletti, J. Rubino-Martin, M. Shiraishi, D. Molinari, J. Chluba, F. Finelli, C. Baccigalupi, J. Errard, A. Gruppuso, A. I. Lonappan, A. Tartari, E. Allys, A. Anand, J. Aumont, M. Ballardini, A. J. Banday, R. B. Barreiro, N. Bartolo, M. Bersanelli, M. Bortolami, T. Brinckmann, E. Calabrese, P. Campeti, A. Carones, F. J. Casas , et al. (75 additional authors not shown)

    Abstract: We present detailed forecasts for the constraints on primordial magnetic fields (PMFs) that will be obtained with the LiteBIRD satellite. The constraints are driven by the effects of PMFs on the CMB anisotropies: the gravitational effects of magnetically-induced perturbations; the effects on the thermal and ionization history of the Universe; the Faraday rotation imprint on the CMB polarization; a… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 51 pages, 24 figures, abstract shortened

  30. arXiv:2403.16085  [pdf, other

    cs.IR

    RankingSHAP -- Listwise Feature Attribution Explanations for Ranking Models

    Authors: Maria Heuss, Maarten de Rijke, Avishek Anand

    Abstract: Feature attributions are a commonly used explanation type, when we want to posthoc explain the prediction of a trained model. Yet, they are not very well explored in IR. Importantly, feature attribution has rarely been rigorously defined, beyond attributing the most important feature the highest value. What it means for a feature to be more important than others is often left vague. Consequently,… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  31. arXiv:2403.13687  [pdf, ps, other

    gr-qc

    Modified gravity theories from the Barrow hypothesis

    Authors: Ankit Anand, Ruben Campos Delgado

    Abstract: Barrow proposed that quantum gravity effects might introduce fractal corrections to the area of the event horizon of black holes. The area law gets modified as $S \propto A^{1+Δ/2}$, with $0\leqΔ\leq 1$. It was so far unclear whether this assumption could lead to meaningful quantum gravity theories beyond general relativity. In this paper, we argue that this is indeed the case. In particular, assu… ▽ More

    Submitted 15 May, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: 10 pages; v2: final version accepted for publication in EPL

  32. arXiv:2403.08983  [pdf, ps, other

    cs.DS

    Approximating Small Sparse Cuts

    Authors: Aditya Anand, Euiwoong Lee, Jason Li, Thatchaphol Saranurak

    Abstract: We study polynomial-time approximation algorithms for (edge/vertex) Sparsest Cut and Small Set Expansion in terms of $k$, the number of edges or vertices cut in the optimal solution. Our main results are $\mathcal{O}(\text{polylog}\, k)$-approximation algorithms for various versions in this setting. Our techniques involve an extension of the notion of sample sets (Feige and Mahdian STOC'06), ori… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 49 Pages, to appear at STOC 2024

  33. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  34. arXiv:2403.04085  [pdf, other

    cs.CL cs.CY

    Don't Blame the Data, Blame the Model: Understanding Noise and Bias When Learning from Subjective Annotations

    Authors: Abhishek Anand, Negar Mokhberian, Prathyusha Naresh Kumar, Anweasha Saha, Zihao He, Ashwin Rao, Fred Morstatter, Kristina Lerman

    Abstract: Researchers have raised awareness about the harms of aggregating labels especially in subjective tasks that naturally contain disagreements among human annotators. In this work we show that models that are only provided aggregated labels show low confidence on high-disagreement data instances. While previous studies consider such instances as mislabeled, we argue that the reason the high-disagreem… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  35. arXiv:2403.00472  [pdf

    stat.AP

    Frailty or Frailties: Exploring Frailty Index Subdimensions in the English Longitudinal Study of Ageing

    Authors: Lara Johnson, Bruce Guthrie, Paul A T Kelly, Atul Anand, Alan Marshall, Sohan Seth

    Abstract: Background: Frailty, a state of increased vulnerability to adverse health outcomes, has garnered significant attention in research and clinical practice. Existing constructs aggregate clinical features or health deficits into a single score. While simple and interpretable, this approach may overlook the complexity of frailty and not capture the full range of variation between individuals. Method… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 39 pages, 4 figures

  36. arXiv:2402.04764  [pdf, other

    cs.LG

    Code as Reward: Empowering Reinforcement Learning with VLMs

    Authors: David Venuto, Sami Nur Islam, Martin Klissarov, Doina Precup, Sherry Yang, Ankit Anand

    Abstract: Pre-trained Vision-Language Models (VLMs) are able to understand visual concepts, describe and decompose complex tasks into sub-tasks, and provide feedback on task completion. In this paper, we aim to leverage these capabilities to support the training of reinforcement learning (RL) agents. In principle, VLMs are well suited for this purpose, as they can naturally analyze image-based observations… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  37. arXiv:2401.15222  [pdf, other

    cs.CL cs.AI cs.LG

    Transfer Learning for the Prediction of Entity Modifiers in Clinical Text: Application to Opioid Use Disorder Case Detection

    Authors: Abdullateef I. Almudaifer, Whitney Covington, JaMor Hairston, Zachary Deitch, Ankit Anand, Caleb M. Carroll, Estera Crisan, William Bradford, Lauren Walter, Eaton Ellen, Sue S. Feldman, John D. Osborne

    Abstract: Background: The semantics of entities extracted from a clinical text can be dramatically altered by modifiers, including entity negation, uncertainty, conditionality, severity, and subject. Existing models for determining modifiers of clinical entities involve regular expression or features weights that are trained independently for each modifier. Methods: We develop and evaluate a multi-task tr… ▽ More

    Submitted 5 February, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: 18 pages, 2 figures, 6 tables. To be submitted to the Journal of Biomedical Semantics

  38. arXiv:2401.13819  [pdf, ps, other

    cs.DS

    Separating $k$-Median from the Supplier Version

    Authors: Aditya Anand, Euiwoong Lee

    Abstract: Given a metric space $(V, d)$ along with an integer $k$, the $k$-Median problem asks to open $k$ centers $C \subseteq V$ to minimize $\sum_{v \in V} d(v, C)$, where $d(v, C) := \min_{c \in C} d(v, c)$. While the best-known approximation ratio of $2.613$ holds for the more general supplier version where an additional set $F \subseteq V$ is given with the restriction $C \subseteq F$, the best known… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: 20 pages; To appear at IPCO 2024

  39. arXiv:2401.12078  [pdf, other

    cs.CL

    Temporal Blind Spots in Large Language Models

    Authors: Jonas Wallat, Adam Jatowt, Avishek Anand

    Abstract: Large language models (LLMs) have recently gained significant attention due to their unparalleled ability to perform various natural language processing tasks. These models, benefiting from their advanced natural language understanding capabilities, have demonstrated impressive zero-shot performance. However, the pre-training data utilized in LLMs is often confined to a specific corpus, resulting… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: accepted at WSDM'24

  40. arXiv:2312.17146  [pdf, other

    quant-ph physics.chem-ph

    Hamiltonians, groups, graphs and ansätze

    Authors: Abhinav Anand, Kenneth R. Brown

    Abstract: One promising application of near-term quantum devices is to prepare trial wavefunctions using short circuits for solving different problems via variational algorithms. For this purpose, we introduce a new circuit design that combines graph-based diagonalization circuits with arbitrary single-qubit rotation gates to get Hamiltonian-based graph states ansätze (H-GSA). We test the accuracy of the pr… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

  41. arXiv:2312.12241  [pdf, other

    cs.CV cs.CL

    GeomVerse: A Systematic Evaluation of Large Models for Geometric Reasoning

    Authors: Mehran Kazemi, Hamidreza Alvari, Ankit Anand, Jialin Wu, Xi Chen, Radu Soricut

    Abstract: Large language models have shown impressive results for multi-hop mathematical reasoning when the input question is only textual. Many mathematical reasoning problems, however, contain both text and image. With the ever-increasing adoption of vision language models (VLMs), understanding their reasoning abilities for such problems is crucial. In this paper, we evaluate the reasoning capabilities of… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  42. arXiv:2312.12038  [pdf, other

    physics.flu-dyn cond-mat.stat-mech

    Flow cross-overs under surface fluctuations in cylindrical nano-channel

    Authors: Aakash Anand, A. Bhattacharyay

    Abstract: We analyse surface-fluctuations-driven fluid flow through nano-channels to investigate the interplay between boundary layer flow structures and the bulk flow of fluid under a pressure-head. Surface fluctuations of a wide range of frequencies (up to several thousands of Hertz) in a nano-channel keep the flow in the low Reynolds number regime. Using this advantage of low Reynolds number flow, we dev… ▽ More

    Submitted 6 April, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: 8 pages, 2 figure

  43. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  44. arXiv:2312.08502  [pdf, other

    quant-ph physics.chem-ph

    Leveraging commuting groups for an efficient variational Hamiltonian ansatz

    Authors: Abhinav Anand, Kenneth R. Brown

    Abstract: Efficiently calculating the low-lying eigenvalues of Hamiltonians, written as sums of Pauli operators, is a fundamental challenge in quantum computing. While various methods have been proposed to reduce the complexity of quantum circuits for this task, there remains room for further improvement. In this article, we introduce a new circuit design using commuting groups within the Hamiltonian to fur… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  45. arXiv:2312.06585  [pdf, other

    cs.LG

    Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

    Authors: Avi Singh, John D. Co-Reyes, Rishabh Agarwal, Ankesh Anand, Piyush Patil, Xavier Garcia, Peter J. Liu, James Harrison, Jaehoon Lee, Kelvin Xu, Aaron Parisi, Abhishek Kumar, Alex Alemi, Alex Rizkowsky, Azade Nova, Ben Adlam, Bernd Bohnet, Gamaleldin Elsayed, Hanie Sedghi, Igor Mordatch, Isabelle Simpson, Izzeddin Gur, Jasper Snoek, Jeffrey Pennington, Jiri Hron , et al. (16 additional authors not shown)

    Abstract: Fine-tuning language models~(LMs) on human-generated data remains a prevalent practice. However, the performance of such models is often limited by the quantity and diversity of high-quality human data. In this paper, we explore whether we can go beyond human data on tasks where we have access to scalar feedback, for example, on math problems where one can verify correctness. To do so, we investig… ▽ More

    Submitted 17 April, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted to TMLR. Camera-ready version. First three authors contributed equally

  46. arXiv:2312.05194  [pdf, other

    astro-ph.CO

    LiteBIRD Science Goals and Forecasts: Improving Sensitivity to Inflationary Gravitational Waves with Multitracer Delensing

    Authors: T. Namikawa, A. I. Lonappan, C. Baccigalupi, N. Bartolo, D. Beck, K. Benabed, A. Challinor, P. Diego-Palazuelos, J. Errard, S. Farrens, A. Gruppuso, N. Krachmalnicoff, M. Migliaccio, E. Martínez-González, V. Pettorino, G. Piccirilli, M. Ruiz-Granda, B. Sherwin, J. Starck, P. Vielva, R. Akizawa, A. Anand, J. Aumont, R. Aurlien, S. Azzoni , et al. (97 additional authors not shown)

    Abstract: We estimate the efficiency of mitigating the lensing $B$-mode polarization, the so-called delensing, for the $LiteBIRD$ experiment with multiple external data sets of lensing-mass tracers. The current best bound on the tensor-to-scalar ratio, $r$, is limited by lensing rather than Galactic foregrounds. Delensing will be a critical step to improve sensitivity to $r$ as measurements of $r$ become mo… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 21 pages, 7 figures

  47. LiteBIRD Science Goals and Forecasts: A full-sky measurement of gravitational lensing of the CMB

    Authors: A. I. Lonappan, T. Namikawa, G. Piccirilli, P. Diego-Palazuelos, M. Ruiz-Granda, M. Migliaccio, C. Baccigalupi, N. Bartolo, D. Beck, K. Benabed, A. Challinor, J. Errard, S. Farrens, A. Gruppuso, N. Krachmalnicoff, E. Martínez-González, V. Pettorino, B. Sherwin, J. Starck, P. Vielva, R. Akizawa, A. Anand, J. Aumont, R. Aurlien, S. Azzoni , et al. (97 additional authors not shown)

    Abstract: We explore the capability of measuring lensing signals in $LiteBIRD$ full-sky polarization maps. With a $30$ arcmin beam width and an impressively low polarization noise of $2.16\,μ$K-arcmin, $LiteBIRD$ will be able to measure the full-sky polarization of the cosmic microwave background (CMB) very precisely. This unique sensitivity also enables the reconstruction of a nearly full-sky lensing map u… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  48. arXiv:2312.00717  [pdf, other

    astro-ph.CO gr-qc

    LiteBIRD Science Goals and Forecasts. A Case Study of the Origin of Primordial Gravitational Waves using Large-Scale CMB Polarization

    Authors: P. Campeti, E. Komatsu, C. Baccigalupi, M. Ballardini, N. Bartolo, A. Carones, J. Errard, F. Finelli, R. Flauger, S. Galli, G. Galloni, S. Giardiello, M. Hazumi, S. Henrot-Versillé, L. T. Hergt, K. Kohri, C. Leloup, J. Lesgourgues, J. Macias-Perez, E. Martínez-González, S. Matarrese, T. Matsumura, L. Montier, T. Namikawa, D. Paoletti , et al. (85 additional authors not shown)

    Abstract: We study the possibility of using the $LiteBIRD$ satellite $B$-mode survey to constrain models of inflation producing specific features in CMB angular power spectra. We explore a particular model example, i.e. spectator axion-SU(2) gauge field inflation. This model can source parity-violating gravitational waves from the amplification of gauge field fluctuations driven by a pseudoscalar "axionlike… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 22 pages, 13 figures. Submitted to JCAP

  49. arXiv:2311.15426  [pdf, other

    cs.IR

    Data Augmentation for Sample Efficient and Robust Document Ranking

    Authors: Abhijit Anand, Jurek Leonhardt, Jaspreet Singh, Koustav Rudra, Avishek Anand

    Abstract: Contextual ranking models have delivered impressive performance improvements over classical models in the document ranking task. However, these highly over-parameterized models tend to be data-hungry and require large amounts of data even for fine-tuning. In this paper, we propose data-augmentation methods for effective and robust ranking performance. One of the key benefits of using data augmenta… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

  50. arXiv:2311.12298  [pdf, other

    cs.CL cs.AI

    Noise in Relation Classification Dataset TACRED: Characterization and Reduction

    Authors: Akshay Parekh, Ashish Anand, Amit Awekar

    Abstract: The overarching objective of this paper is two-fold. First, to explore model-based approaches to characterize the primary cause of the noise. in the RE dataset TACRED Second, to identify the potentially noisy instances. Towards the first objective, we analyze predictions and performance of state-of-the-art (SOTA) models to identify the root cause of noise in the dataset. Our analysis of TACRED sho… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: Work in Progress