Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–20 of 20 results for author: Gur, I

.
  1. arXiv:2408.07852  [pdf, other

    cs.CL cs.AI cs.LG

    Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability

    Authors: Jiri Hron, Laura Culp, Gamaleldin Elsayed, Rosanne Liu, Ben Adlam, Maxwell Bileschi, Bernd Bohnet, JD Co-Reyes, Noah Fiedel, C. Daniel Freeman, Izzeddin Gur, Kathleen Kenealy, Jaehoon Lee, Peter J. Liu, Gaurav Mishra, Igor Mordatch, Azade Nova, Roman Novak, Aaron Parisi, Jeffrey Pennington, Alex Rizkowsky, Isabelle Simpson, Hanie Sedghi, Jascha Sohl-dickstein, Kevin Swersky , et al. (6 additional authors not shown)

    Abstract: While many capabilities of language models (LMs) improve with increased training budget, the influence of scale on hallucinations is not yet fully understood. Hallucinations come in many forms, and there is no universally accepted definition. We thus focus on studying only those hallucinations where a correct answer appears verbatim in the training set. To fully control the training data content,… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: Published at COLM 2024. 16 pages, 11 figures

  2. arXiv:2407.05872  [pdf, other

    cs.LG

    Scaling Exponents Across Parameterizations and Optimizers

    Authors: Katie Everett, Lechao Xiao, Mitchell Wortsman, Alexander A. Alemi, Roman Novak, Peter J. Liu, Izzeddin Gur, Jascha Sohl-Dickstein, Leslie Pack Kaelbling, Jaehoon Lee, Jeffrey Pennington

    Abstract: Robust and effective scaling of models from small to large width typically requires the precise adjustment of many algorithmic and architectural details, such as parameterization and optimizer choices. In this work, we propose a new perspective on parameterization by investigating a key assumption in prior work about the alignment between parameters and data and derive new theoretical results unde… ▽ More

    Submitted 16 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: 63 pages, International Conference on Machine Learning 2024

  3. arXiv:2312.06585  [pdf, other

    cs.LG

    Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

    Authors: Avi Singh, John D. Co-Reyes, Rishabh Agarwal, Ankesh Anand, Piyush Patil, Xavier Garcia, Peter J. Liu, James Harrison, Jaehoon Lee, Kelvin Xu, Aaron Parisi, Abhishek Kumar, Alex Alemi, Alex Rizkowsky, Azade Nova, Ben Adlam, Bernd Bohnet, Gamaleldin Elsayed, Hanie Sedghi, Igor Mordatch, Isabelle Simpson, Izzeddin Gur, Jasper Snoek, Jeffrey Pennington, Jiri Hron , et al. (16 additional authors not shown)

    Abstract: Fine-tuning language models~(LMs) on human-generated data remains a prevalent practice. However, the performance of such models is often limited by the quantity and diversity of high-quality human data. In this paper, we explore whether we can go beyond human data on tasks where we have access to scalar feedback, for example, on math problems where one can verify correctness. To do so, we investig… ▽ More

    Submitted 17 April, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted to TMLR. Camera-ready version. First three authors contributed equally

  4. arXiv:2311.18751  [pdf, other

    cs.LG cs.AI cs.CL

    Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web

    Authors: Hiroki Furuta, Yutaka Matsuo, Aleksandra Faust, Izzeddin Gur

    Abstract: Language model agents (LMA) recently emerged as a promising paradigm on muti-step decision making tasks, often outperforming humans and other reinforcement learning agents. Despite the promise, their performance on real-world applications that often involve combinations of tasks is still underexplored. In this work, we introduce a new benchmark, called CompWoB -- 50 new compositional web automatio… ▽ More

    Submitted 4 February, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: Code: https://github.com/google-research/google-research/tree/master/compositional_rl/compwob

  5. arXiv:2311.07587  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?

    Authors: C. Daniel Freeman, Laura Culp, Aaron Parisi, Maxwell L Bileschi, Gamaleldin F Elsayed, Alex Rizkowsky, Isabelle Simpson, Alex Alemi, Azade Nova, Ben Adlam, Bernd Bohnet, Gaurav Mishra, Hanie Sedghi, Igor Mordatch, Izzeddin Gur, Jaehoon Lee, JD Co-Reyes, Jeffrey Pennington, Kelvin Xu, Kevin Swersky, Kshiteej Mahajan, Lechao Xiao, Rosanne Liu, Simon Kornblith, Noah Constant , et al. (5 additional authors not shown)

    Abstract: We introduce and study the problem of adversarial arithmetic, which provides a simple yet challenging testbed for language model alignment. This problem is comprised of arithmetic questions posed in natural language, with an arbitrary adversarial string inserted before the question is complete. Even in the simple setting of 1-digit addition problems, it is easy to find adversarial prompts that mak… ▽ More

    Submitted 15 November, 2023; v1 submitted 8 November, 2023; originally announced November 2023.

  6. arXiv:2309.14322  [pdf, other

    cs.LG

    Small-scale proxies for large-scale Transformer training instabilities

    Authors: Mitchell Wortsman, Peter J. Liu, Lechao Xiao, Katie Everett, Alex Alemi, Ben Adlam, John D. Co-Reyes, Izzeddin Gur, Abhishek Kumar, Roman Novak, Jeffrey Pennington, Jascha Sohl-dickstein, Kelvin Xu, Jaehoon Lee, Justin Gilmer, Simon Kornblith

    Abstract: Teams that have trained large Transformer-based models have reported training instabilities at large scale that did not appear when training with the same hyperparameters at smaller scales. Although the causes of such instabilities are of scientific interest, the amount of resources required to reproduce them has made investigation difficult. In this work, we seek ways to reproduce and study train… ▽ More

    Submitted 16 October, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

  7. arXiv:2307.12856  [pdf, other

    cs.LG cs.AI cs.CL

    A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis

    Authors: Izzeddin Gur, Hiroki Furuta, Austin Huang, Mustafa Safdari, Yutaka Matsuo, Douglas Eck, Aleksandra Faust

    Abstract: Pre-trained large language models (LLMs) have recently achieved better generalization and sample efficiency in autonomous web automation. However, the performance on real-world websites has still suffered from (1) open domainness, (2) limited context length, and (3) lack of inductive bias on HTML. We introduce WebAgent, an LLM-driven agent that learns from self-experience to complete tasks on real… ▽ More

    Submitted 25 February, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: Accepted to ICLR 2024 (Oral)

  8. arXiv:2305.11854  [pdf, other

    cs.LG cs.AI stat.ML

    Multimodal Web Navigation with Instruction-Finetuned Foundation Models

    Authors: Hiroki Furuta, Kuang-Huei Lee, Ofir Nachum, Yutaka Matsuo, Aleksandra Faust, Shixiang Shane Gu, Izzeddin Gur

    Abstract: The progress of autonomous web navigation has been hindered by the dependence on billions of exploratory interactions via online reinforcement learning, and domain-specific model designs that make it difficult to leverage generalization from rich out-of-domain data. In this work, we study data-driven offline training for web agents with vision-language foundation models. We propose an instruction-… ▽ More

    Submitted 25 February, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Accepted to ICLR 2024. Website: https://sites.google.com/view/mm-webnav/

  9. arXiv:2211.16385  [pdf, other

    cs.AR cs.AI cs.LG cs.MA

    Multi-Agent Reinforcement Learning for Microprocessor Design Space Exploration

    Authors: Srivatsan Krishnan, Natasha Jaques, Shayegan Omidshafiei, Dan Zhang, Izzeddin Gur, Vijay Janapa Reddi, Aleksandra Faust

    Abstract: Microprocessor architects are increasingly resorting to domain-specific customization in the quest for high-performance and energy-efficiency. As the systems grow in complexity, fine-tuning architectural parameters across multiple sub-systems (e.g., datapath, memory blocks in different hierarchies, interconnects, compiler optimization, etc.) quickly results in a combinatorial explosion of design s… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: Workshop on ML for Systems at NeurIPS 2022

  10. CLUTR: Curriculum Learning via Unsupervised Task Representation Learning

    Authors: Abdus Salam Azad, Izzeddin Gur, Jasper Emhoff, Nathaniel Alexis, Aleksandra Faust, Pieter Abbeel, Ion Stoica

    Abstract: Reinforcement Learning (RL) algorithms are often known for sample inefficiency and difficult generalization. Recently, Unsupervised Environment Design (UED) emerged as a new paradigm for zero-shot generalization by simultaneously learning a task distribution and agent policies on the generated tasks. This is a non-stationary process where the task distribution evolves along with agent policies; cr… ▽ More

    Submitted 7 March, 2023; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: Preprint, Currently Under Review

  11. arXiv:2210.03945  [pdf, other

    cs.LG cs.AI

    Understanding HTML with Large Language Models

    Authors: Izzeddin Gur, Ofir Nachum, Yingjie Miao, Mustafa Safdari, Austin Huang, Aakanksha Chowdhery, Sharan Narang, Noah Fiedel, Aleksandra Faust

    Abstract: Large language models (LLMs) have shown exceptional performance on a variety of natural language tasks. Yet, their capabilities for HTML understanding -- i.e., parsing the raw HTML of a webpage, with applications to automation of web-based tasks, crawling, and browser-assisted retrieval -- have not been fully explored. We contribute HTML understanding models (fine-tuned LLMs) and an in-depth analy… ▽ More

    Submitted 19 May, 2023; v1 submitted 8 October, 2022; originally announced October 2022.

  12. arXiv:2205.12648  [pdf, other

    cs.LG cs.AI

    Fast Inference and Transfer of Compositional Task Structures for Few-shot Task Generalization

    Authors: Sungryull Sohn, Hyunjae Woo, Jongwook Choi, lyubing qiang, Izzeddin Gur, Aleksandra Faust, Honglak Lee

    Abstract: We tackle real-world problems with complex structures beyond the pixel-based game or simulator. We formulate it as a few-shot reinforcement learning problem where a task is characterized by a subtask graph that defines a set of subtasks and their dependencies that are unknown to the agent. Different from the previous meta-rl methods trying to directly infer the unstructured task embedding, our mul… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted to UAI 2022 as an oral presentation

  13. arXiv:2201.08896  [pdf, other

    cs.LG cs.AI

    Environment Generation for Zero-Shot Compositional Reinforcement Learning

    Authors: Izzeddin Gur, Natasha Jaques, Yingjie Miao, Jongwook Choi, Manoj Tiwari, Honglak Lee, Aleksandra Faust

    Abstract: Many real-world problems are compositional - solving them requires completing interdependent sub-tasks, either in series or in parallel, that can be represented as a dependency graph. Deep reinforcement learning (RL) agents often struggle to learn such complex tasks due to the long time horizons and sparse rewards. To address this problem, we present Compositional Design of Environments (CoDE), wh… ▽ More

    Submitted 21 January, 2022; originally announced January 2022.

    Comments: Published in NeurIPS 2021

  14. arXiv:2111.12872  [pdf, other

    cs.CV cs.CL

    Less is More: Generating Grounded Navigation Instructions from Landmarks

    Authors: Su Wang, Ceslee Montgomery, Jordi Orbay, Vighnesh Birodkar, Aleksandra Faust, Izzeddin Gur, Natasha Jaques, Austin Waters, Jason Baldridge, Peter Anderson

    Abstract: We study the automatic generation of navigation instructions from 360-degree images captured on indoor routes. Existing generators suffer from poor visual grounding, causing them to rely on language priors and hallucinate objects. Our MARKY-MT5 system addresses this by focusing on visual landmarks; it comprises a first stage landmark detector and a second stage generator -- a multimodal, multiling… ▽ More

    Submitted 4 April, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

    Comments: CVPR 2022 Camera-ready

  15. arXiv:2103.01991  [pdf, other

    cs.LG cs.AI cs.MA

    Adversarial Environment Generation for Learning to Navigate the Web

    Authors: Izzeddin Gur, Natasha Jaques, Kevin Malta, Manoj Tiwari, Honglak Lee, Aleksandra Faust

    Abstract: Learning to autonomously navigate the web is a difficult sequential decision making task. The state and action spaces are large and combinatorial in nature, and websites are dynamic environments consisting of several pages. One of the bottlenecks of training web navigation agents is providing a learnable curriculum of training environments that can cover the large variety of real-world websites. T… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

    Comments: Presented at Deep RL Workshop, NeurIPS, 2020

  16. arXiv:2011.14004  [pdf, other

    cs.CV cs.LG eess.IV

    Assessing Post-Disaster Damage from Satellite Imagery using Semi-Supervised Learning Techniques

    Authors: Jihyeon Lee, Joseph Z. Xu, Kihyuk Sohn, Wenhan Lu, David Berthelot, Izzeddin Gur, Pranav Khaitan, Ke-Wei, Huang, Kyriacos Koupparis, Bernhard Kowatsch

    Abstract: To respond to disasters such as earthquakes, wildfires, and armed conflicts, humanitarian organizations require accurate and timely data in the form of damage assessments, which indicate what buildings and population centers have been most affected. Recent research combines machine learning with remote sensing to automatically extract such information from satellite imagery, reducing manual labor… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

    Comments: NeurIPS 2020 Artificial Intelligence for Humanitarian Assistance and Disaster Response Workshop

    ACM Class: I.2.10; I.2.1; I.5.4

  17. arXiv:1812.09195  [pdf, other

    cs.LG cs.CL stat.ML

    Learning to Navigate the Web

    Authors: Izzeddin Gur, Ulrich Rueckert, Aleksandra Faust, Dilek Hakkani-Tur

    Abstract: Learning in environments with large state and action spaces, and sparse rewards, can hinder a Reinforcement Learning (RL) agent's learning through trial-and-error. For instance, following natural language instructions on the Web (such as booking a flight ticket) leads to RL settings where input vocabulary and number of actionable elements on a page can grow very large. Even though recent approache… ▽ More

    Submitted 21 December, 2018; originally announced December 2018.

    Comments: International Conference on Learning Representations (ICLR), 2019

  18. arXiv:1811.04369  [pdf, other

    cs.CL cs.AI cs.LG

    User Modeling for Task Oriented Dialogues

    Authors: Izzeddin Gur, Dilek Hakkani-Tur, Gokhan Tur, Pararth Shah

    Abstract: We introduce end-to-end neural network based models for simulating users of task-oriented dialogue systems. User simulation in dialogue systems is crucial from two different perspectives: (i) automatic evaluation of different dialogue models, and (ii) training task-oriented dialogue systems. We design a hierarchical sequence-to-sequence model that first encodes the initial user goal and system tur… ▽ More

    Submitted 11 November, 2018; originally announced November 2018.

    Comments: Accepted at SLT 2018

  19. arXiv:1704.05958  [pdf, ps, other

    cs.CL

    Global Relation Embedding for Relation Extraction

    Authors: Yu Su, Honglei Liu, Semih Yavuz, Izzeddin Gur, Huan Sun, Xifeng Yan

    Abstract: We study the problem of textual relation embedding with distant supervision. To combat the wrong labeling problem of distant supervision, we propose to embed textual relations with global statistics of relations, i.e., the co-occurrence statistics of textual and knowledge base relations collected from the entire corpus. This approach turns out to be more robust to the training noise introduced by… ▽ More

    Submitted 19 April, 2018; v1 submitted 19 April, 2017; originally announced April 2017.

    Comments: Accepted to NAACL HLT 2018

  20. arXiv:1606.05647  [pdf, other

    physics.soc-ph cs.NE cs.SI

    Collective Decision Dynamics in Group Evacuation: Behavioral Experiment and Machine Learning Models

    Authors: Chantal Nguyen, Fangqiu Han, Kimberly J. Schlesinger, Izzeddin Gür, Jean M. Carlson

    Abstract: Identifying factors that affect human decision making and quantifying their influence remain essential and challenging tasks for the design and implementation of social and technological communication systems. We report results of a behavioral experiment involving decision making in the face of an impending natural disaster. In a controlled laboratory setting, we characterize individual and group… ▽ More

    Submitted 30 November, 2016; v1 submitted 17 June, 2016; originally announced June 2016.

    Comments: formerly part 1 of a 2-part series, now presented individually