Showing 1–32 of 32 results for author: Ribeiro, T

Searching in archive cs.
  1. arXiv:2409.00013  [pdf, other]

    stat.CO cs.MS math.OC stat.ME

    CEopt: A MATLAB Package for Non-convex Optimization with the Cross-Entropy Method

    Authors: Americo Cunha Jr, Marcos Vinicius Issa, Julio Cesar Basilio, José Geraldo Telles Ribeiro

    Abstract: This paper introduces CEopt (https://ceopt.org), a MATLAB tool leveraging the Cross-Entropy method for non-convex optimization. Due to the relative simplicity of the algorithm, it provides a kind of transparent "gray-box" optimization solver, with intuitive control parameters. Unique in its approach, CEopt effectively handles both equality and inequality constraints using an augmented Lagrangian…

    Submitted 15 August, 2024; originally announced September 2024.

    MSC Class: 90-04 ACM Class: G.4

  2. arXiv:2406.07662  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG q-bio.NC

    Progress Towards Decoding Visual Imagery via fNIRS

    Authors: Michel Adamic, Wellington Avelino, Anna Brandenberger, Bryan Chiang, Hunter Davis, Stephen Fay, Andrew Gregory, Aayush Gupta, Raphael Hotter, Grace Jiang, Fiona Leng, Stephen Polcyn, Thomas Ribeiro, Paul Scotti, Michelle Wang, Marley Xiong, Jonathan Xu

    Abstract: We demonstrate the possibility of reconstructing images from fNIRS brain activity and start building a prototype to match the required specs. By training an image reconstruction model on downsampled fMRI data, we discovered that cm-scale spatial resolution is sufficient for image generation. We obtained 71% retrieval accuracy with 1-cm resolution, compared to 93% on the full-resolution fMRI, and 2…

    Submitted 22 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

  3. arXiv:2405.10490  [pdf]

    stat.ME cs.AI cs.IR cs.LG math.OC

    Neural Optimization with Adaptive Heuristics for Intelligent Marketing System

    Authors: Changshuai Wei, Benjamin Zelditch, Joyce Chen, Andre Assuncao Silva T Ribeiro, Jingyi Kenneth Tay, Borja Ocejo Elizondo, Keerthi Selvaraj, Aman Gupta, Licurgo Benemann De Almeida

    Abstract: Computational marketing has become increasingly important in today's digital world, facing challenges such as massive heterogeneous data, multi-channel customer journeys, and limited marketing budgets. In this paper, we propose a general framework for marketing AI systems, the Neural Optimization with Adaptive Heuristics (NOAH) framework. NOAH is the first general framework for marketing optimizat…

    Submitted 25 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: KDD 2024

    ACM Class: G.3; G.1.6; I.2

  4. arXiv:2404.12415  [pdf]

    eess.IV cs.CV cs.LG

    Prediction of soil fertility parameters using USB-microscope imagery and portable X-ray fluorescence spectrometry

    Authors: Shubhadip Dasgupta, Satwik Pate, Divya Rathore, L. G. Divyanth, Ayan Das, Anshuman Nayak, Subhadip Dey, Asim Biswas, David C. Weindorf, Bin Li, Sergio Henrique Godinho Silva, Bruno Teixeira Ribeiro, Sanjay Srivastava, Somsubhra Chakraborty

    Abstract: This study investigated the use of portable X-ray fluorescence (PXRF) spectrometry and soil image analysis for rapid soil fertility assessment, with a focus on key indicators such as available boron (B), organic carbon (OC), available manganese (Mn), available sulfur (S), and the sulfur availability index (SAI). A total of 1,133 soil samples from diverse agro-climatic zones in Eastern India were a…

    Submitted 5 September, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: Published in 'Soil Advances'

    Journal ref: Soil Advances, Volume 2, 2024, 100016

  5. arXiv:2402.19133  [pdf, other]

    cs.CL

    Evaluating Webcam-based Gaze Data as an Alternative for Human Rationale Annotations

    Authors: Stephanie Brandl, Oliver Eberle, Tiago Ribeiro, Anders Søgaard, Nora Hollenstein

    Abstract: Rationales in the form of manually annotated input spans usually serve as ground truth when evaluating explainability methods in NLP. They are, however, time-consuming and often biased by the annotation process. In this paper, we debate whether human gaze, in the form of webcam-based eye-tracking recordings, poses a valid alternative when evaluating importance scores. We evaluate the additional in…

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted to LREC-COLING 2024

  6. Reconstructing Spatiotemporal Data with C-VAEs

    Authors: Tiago F. R. Ribeiro, Fernando Silva, Rogério Luís de C. Costa

    Abstract: The continuous representation of spatiotemporal data commonly relies on using abstract data types, such as moving regions, to represent entities whose shape and position continuously change over time. Creating this representation from discrete snapshots of real-world entities requires using interpolation methods to compute in-between data representations and estimate the position and shap…

    Submitted 28 August, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: Update acknowledgments to include published article information

    Journal ref: Advances in Databases and Information Systems 13985 (2023) 59-73

  7. arXiv:2306.03280  [pdf, other]

    cs.HC

    AHA!: Facilitating AI Impact Assessment by Generating Examples of Harms

    Authors: Zana Buçinca, Chau Minh Pham, Maurice Jakesch, Marco Tulio Ribeiro, Alexandra Olteanu, Saleema Amershi

    Abstract: While demands for change and accountability for harmful AI consequences mount, foreseeing the downstream effects of deploying AI systems remains a challenging task. We developed AHA! (Anticipating Harms of AI), a generative framework to assist AI practitioners and decision-makers in anticipating potential harms and unintended consequences of AI systems prior to development or deployment. Given an…

    Submitted 5 June, 2023; originally announced June 2023.

  8. arXiv:2305.17804  [pdf, other]

    cs.CL

    Targeted Data Generation: Finding and Fixing Model Weaknesses

    Authors: Zexue He, Marco Tulio Ribeiro, Fereshte Khani

    Abstract: Even when aggregate accuracy is high, state-of-the-art NLP models often fail systematically on specific subgroups of data, resulting in unfair outcomes and eroding user trust. Additional data collection may not help in addressing these weaknesses, as such challenging subgroups may be unknown to users, and underrepresented in the existing and new data. We propose Targeted Data Generation (TDG), a f…

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023

  9. arXiv:2305.12219  [pdf, other]

    cs.LG cs.AI cs.CL

    Collaborative Development of NLP models

    Authors: Fereshte Khani, Marco Tulio Ribeiro

    Abstract: Despite substantial advancements, Natural Language Processing (NLP) models often require post-training adjustments to enforce business rules, rectify undesired behavior, and align with user values. These adjustments involve operationalizing "concepts"--dictating desired model responses to certain inputs. However, it's difficult for a single entity to enumerate and define all possible concepts, ind…

    Submitted 24 May, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

  10. arXiv:2304.09991  [pdf, other]

    cs.HC cs.AI cs.CL

    Supporting Human-AI Collaboration in Auditing LLMs with LLMs

    Authors: Charvi Rastogi, Marco Tulio Ribeiro, Nicholas King, Harsha Nori, Saleema Amershi

    Abstract: Large language models are becoming increasingly pervasive and ubiquitous in society via deployment in sociotechnical systems. Yet these language models, be it for classification or generation, have been shown to be biased and behave irresponsibly, causing harm to people at scale. It is crucial to audit these language models rigorously. Existing auditing tools leverage either or both humans and AI…

    Submitted 30 November, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: 21 pages, 3 figures

    Journal ref: In Proceedings of the 2023 AAAI and ACM Conference on AI, Ethics, and Society. Association for Computing Machinery, New York, NY, USA, 913-926

  11. arXiv:2303.17876  [pdf, other]

    cs.CL

    WebQAmGaze: A Multilingual Webcam Eye-Tracking-While-Reading Dataset

    Authors: Tiago Ribeiro, Stephanie Brandl, Anders Søgaard, Nora Hollenstein

    Abstract: We present WebQAmGaze, a multilingual low-cost eye-tracking-while-reading dataset, designed as the first webcam-based eye-tracking corpus of reading to support the development of explainable computational language processing models. WebQAmGaze includes webcam eye-tracking data from 600 participants of a wide age range naturally reading English, German, Spanish, and Turkish texts. Each participant…

    Submitted 15 March, 2024; v1 submitted 31 March, 2023; originally announced March 2023.

  12. arXiv:2303.12712  [pdf, other]

    cs.CL cs.AI

    Sparks of Artificial General Intelligence: Early experiments with GPT-4

    Authors: Sébastien Bubeck, Varun Chandrasekaran, Ronen Eldan, Johannes Gehrke, Eric Horvitz, Ece Kamar, Peter Lee, Yin Tat Lee, Yuanzhi Li, Scott Lundberg, Harsha Nori, Hamid Palangi, Marco Tulio Ribeiro, Yi Zhang

    Abstract: Artificial intelligence (AI) researchers have been developing and refining large language models (LLMs) that exhibit remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. The latest model developed by OpenAI, GPT-4, was trained using an unprecedented scale of compute and data. In this paper, we report on our investigation of an earl…

    Submitted 13 April, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

  13. arXiv:2303.09014  [pdf, other]

    cs.CL

    ART: Automatic multi-step reasoning and tool-use for large language models

    Authors: Bhargavi Paranjape, Scott Lundberg, Sameer Singh, Hannaneh Hajishirzi, Luke Zettlemoyer, Marco Tulio Ribeiro

    Abstract: Large language models (LLMs) can perform complex reasoning in few- and zero-shot settings by generating intermediate chain of thought (CoT) reasoning steps. Further, each reasoning step can rely on external tools to support computation beyond the core LLM capabilities (e.g. search/running code). Prior work on CoT prompting and tool use typically requires hand-crafting task-specific demonstrations…

    Submitted 15 March, 2023; originally announced March 2023.

  14. ScatterShot: Interactive In-context Example Curation for Text Transformation

    Authors: Tongshuang Wu, Hua Shen, Daniel S. Weld, Jeffrey Heer, Marco Tulio Ribeiro

    Abstract: The in-context learning capabilities of LLMs like GPT-3 allow annotators to customize an LLM to their specific tasks with a small number of examples. However, users tend to include only the most obvious patterns when crafting examples, resulting in underspecified in-context functions that fall short on unseen cases. Further, it is hard to know when "enough" examples have been included even for kno…

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: IUI 2023: 28th International Conference on Intelligent User Interfaces

  15. arXiv:2212.04089  [pdf, other]

    cs.LG cs.CL cs.CV

    Editing Models with Task Arithmetic

    Authors: Gabriel Ilharco, Marco Tulio Ribeiro, Mitchell Wortsman, Suchin Gururangan, Ludwig Schmidt, Hannaneh Hajishirzi, Ali Farhadi

    Abstract: Changing how pre-trained models behave -- e.g., improving their performance on a downstream task or mitigating biases learned during pre-training -- is a common practice when developing machine learning systems. In this work, we propose a new paradigm for steering the behavior of neural networks, centered around task vectors. A task vector specifies a direction in the weight space of a pr…

    Submitted 31 March, 2023; v1 submitted 8 December, 2022; originally announced December 2022.

    Comments: In Proceedings of the 11th International Conference on Learning Representations (ICLR 2023)

  16. arXiv:2212.02774  [pdf, other]

    cs.CV

    Adaptive Testing of Computer Vision Models

    Authors: Irena Gao, Gabriel Ilharco, Scott Lundberg, Marco Tulio Ribeiro

    Abstract: Vision models often fail systematically on groups of data that share common semantic characteristics (e.g., rare objects or unusual scenes), but identifying these failure modes is a challenge. We introduce AdaVision, an interactive process for testing vision models which helps users identify and fix coherent failure modes. Given a natural language description of a coherent group, AdaVision retriev…

    Submitted 16 August, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: ICCV camera-ready

  17. arXiv:2211.03318  [pdf, other]

    cs.CL

    Fixing Model Bugs with Natural Language Patches

    Authors: Shikhar Murty, Christopher D. Manning, Scott Lundberg, Marco Tulio Ribeiro

    Abstract: Current approaches for fixing systematic problems in NLP models (e.g. regex patches, finetuning on more data) are either brittle, or labor-intensive and liable to shortcuts. In contrast, humans often provide corrections to each other through natural language. Taking inspiration from this, we explore natural language patches -- declarative statements that allow developers to provide corrective feed…

    Submitted 20 November, 2022; v1 submitted 7 November, 2022; originally announced November 2022.

    Comments: Accepted at EMNLP 2022 [Fixed fig-1]

  18. arXiv:2205.00130  [pdf, other]

    cs.CL cs.LG

    ExSum: From Local Explanations to Model Understanding

    Authors: Yilun Zhou, Marco Tulio Ribeiro, Julie Shah

    Abstract: Interpretability methods are developed to understand the working mechanisms of black-box models, which is crucial to their responsible deployment. Fulfilling this goal requires both that the explanations generated by these methods are correct and that people can easily and reliably understand them. While the former has been addressed in prior work, the latter is often overlooked, resulting in info…

    Submitted 29 April, 2022; originally announced May 2022.

    Comments: NAACL 2022. The project website is at https://yilunzhou.github.io/exsum/

  19. arXiv:2203.01176  [pdf, other]

    cs.RO cs.AI cs.HC

    Avant-Satie! Using ERIK to encode task-relevant expressivity into the animation of autonomous social robots

    Authors: Tiago Ribeiro, Ana Paiva

    Abstract: ERIK is an expressive inverse kinematics technique that has been previously presented and evaluated both algorithmically and in a limited user-interaction scenario. It allows autonomous social robots to convey posture-based expressive information while gaze-tracking users. We have developed a new scenario aimed at further validating some of the unsupported claims from the previous scenario. Our ex…

    Submitted 2 March, 2022; originally announced March 2022.

    MSC Class: 68T40 (Primary) 68U99 (Secondary) ACM Class: I.2.9; I.2.1; I.3.6; J.0

  20. arXiv:2106.02112  [pdf, other]

    cs.LG

    Finding and Fixing Spurious Patterns with Explanations

    Authors: Gregory Plumb, Marco Tulio Ribeiro, Ameet Talwalkar

    Abstract: Image classifiers often use spurious patterns, such as "relying on the presence of a person to detect a tennis racket", which do not generalize. In this work, we present an end-to-end pipeline for identifying and mitigating spurious patterns for such models, under the assumption that we have access to pixel-wise object-annotations. We start by identifying patterns such as "the model's prediction fo…

    Submitted 17 August, 2022; v1 submitted 3 June, 2021; originally announced June 2021.

  21. arXiv:2104.14403  [pdf, other]

    cs.LG cs.CV

    Do Feature Attribution Methods Correctly Attribute Features?

    Authors: Yilun Zhou, Serena Booth, Marco Tulio Ribeiro, Julie Shah

    Abstract: Feature attribution methods are popular in interpretable machine learning. These methods compute the attribution of each input feature to represent its importance, but there is no consensus on the definition of "attribution", leading to many competing methods with little systematic evaluation, complicated in particular by the lack of ground truth attribution. To address this, we propose a dataset…

    Submitted 15 December, 2021; v1 submitted 27 April, 2021; originally announced April 2021.

    Comments: AAAI 2022. Video summary at https://www.youtube.com/watch?v=kAodFw6jvvo

  22. arXiv:2101.00288  [pdf, other]

    cs.CL

    Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models

    Authors: Tongshuang Wu, Marco Tulio Ribeiro, Jeffrey Heer, Daniel S. Weld

    Abstract: While counterfactual examples are useful for analysis and training of NLP models, current generation methods either rely on manual labor to create very few counterfactuals, or only instantiate limited types of perturbations such as paraphrases or word substitutions. We present Polyjuice, a general-purpose counterfactual generator that allows for control over perturbation types and locations, train…

    Submitted 1 June, 2021; v1 submitted 1 January, 2021; originally announced January 2021.

    Comments: ACL 2021, main conference, long paper

  23. arXiv:2012.00360  [pdf, other]

    cs.AI cs.LG

    Symbolic AI for XAI: Evaluating LFIT Inductive Programming for Fair and Explainable Automatic Recruitment

    Authors: Alfonso Ortega, Julian Fierrez, Aythami Morales, Zilong Wang, Tony Ribeiro

    Abstract: Machine learning methods are growing in relevance for biometrics and personal information processing in domains such as forensics, e-health, recruitment, and e-learning. In these domains, white-box (human-readable) explanations of systems built on machine learning methods can become crucial. Inductive Logic Programming (ILP) is a subfield of symbolic AI that aims to automatically learn declarative the…

    Submitted 1 December, 2020; originally announced December 2020.

    Comments: WACV21 Workshop on Explainable & Interpretable Artificial Intelligence for Biometrics (xAI4Biom)

  24. arXiv:2006.14779  [pdf, other]

    cs.AI cs.CL cs.HC cs.LG

    Does the Whole Exceed its Parts? The Effect of AI Explanations on Complementary Team Performance

    Authors: Gagan Bansal, Tongshuang Wu, Joyce Zhou, Raymond Fok, Besmira Nushi, Ece Kamar, Marco Tulio Ribeiro, Daniel S. Weld

    Abstract: Many researchers motivate explainable AI with studies showing that human-AI team performance on decision-making tasks improves when the AI explains its recommendations. However, prior studies observed improvements from explanations only when the AI, alone, outperformed both the human and the best team. Can explanations help lead to complementary performance, where team accuracy is higher than eith…

    Submitted 12 January, 2021; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: CHI'21

  25. arXiv:2005.04118  [pdf, other]

    cs.CL cs.LG

    Beyond Accuracy: Behavioral Testing of NLP models with CheckList

    Authors: Marco Tulio Ribeiro, Tongshuang Wu, Carlos Guestrin, Sameer Singh

    Abstract: Although measuring held-out accuracy has been the primary approach to evaluate generalization, it often overestimates the performance of NLP models, while alternative approaches for evaluating models either focus on individual tasks or on specific behaviors. Inspired by principles of behavioral testing in software engineering, we introduce CheckList, a task-agnostic methodology for testing NLP mod…

    Submitted 8 May, 2020; originally announced May 2020.

    Journal ref: Association for Computational Linguistics (ACL), 2020

  26. arXiv:1909.13875  [pdf, other]

    cs.RO cs.GR cs.HC

    Expressive Inverse Kinematics Solving in Real-time for Virtual and Robotic Interactive Characters

    Authors: Tiago Ribeiro, Ana Paiva

    Abstract: With new advancements in interaction techniques, character animation also requires new methods, to support fields such as robotics, and VR/AR. Interactive characters in such fields are becoming driven by AI which opens up the possibility of non-linear and open-ended narratives that may even include interaction with the real, physical world. This paper presents and describes ERIK, an expressive inv…

    Submitted 18 November, 2019; v1 submitted 30 September, 2019; originally announced September 2019.

    ACM Class: I.2.9; H.5.2; I.3.8; J.5

  27. arXiv:1907.09873  [pdf, other]

    cs.RO cs.HC

    An extended framework for characterizing social robots

    Authors: Kim Baraka, Patrícia Alves-Oliveira, Tiago Ribeiro

    Abstract: Social robots are becoming increasingly diverse in their design, behavior, and usage. In this chapter, we provide a broad-ranging overview of the main characteristics that arise when one considers social robots and their interactions with humans. We specifically contribute a framework for characterizing social robots along 7 dimensions that we found to be most relevant to their design. These dimen…

    Submitted 23 July, 2019; originally announced July 2019.

    Comments: 44 pages

  28. arXiv:1904.02898  [pdf, other]

    cs.RO cs.GR cs.HC

    Nutty-based Robot Animation -- Principles and Practices

    Authors: Tiago Ribeiro, Ana Paiva

    Abstract: Robot animation is a new form of character animation that extends the traditional process by allowing the animated motion to become more interactive and adaptable during interaction with users in real-world settings. This paper reviews how this new type of character animation has evolved and been shaped from character animation principles and practices. We outline some new paradigms that aim at al…

    Submitted 3 June, 2019; v1 submitted 5 April, 2019; originally announced April 2019.

    ACM Class: I.2.9; H.5.2; I.3.8; J.5

  29. arXiv:1611.07579  [pdf, other]

    stat.ML cs.AI cs.LG

    Programs as Black-Box Explanations

    Authors: Sameer Singh, Marco Tulio Ribeiro, Carlos Guestrin

    Abstract: Recent work in model-agnostic explanations of black-box machine learning has demonstrated that interpretability of complex models does not have to come at the cost of accuracy or model flexibility. However, it is not clear what kind of explanations, such as linear models, decision trees, and rule lists, are the appropriate family to consider, and different tasks and models may benefit from differe…

    Submitted 22 November, 2016; originally announced November 2016.

    Comments: Presented at NIPS 2016 Workshop on Interpretable Machine Learning in Complex Systems

  30. arXiv:1611.05817  [pdf, other]

    stat.ML cs.AI cs.LG

    Nothing Else Matters: Model-Agnostic Explanations By Identifying Prediction Invariance

    Authors: Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin

    Abstract: At the core of interpretable machine learning is the question of whether humans are able to make accurate predictions about a model's behavior. Assumed in this question are three properties of the interpretable output: coverage, precision, and effort. Coverage refers to how often humans think they can predict the model's behavior, precision to how accurate humans are in those predictions, and effo…

    Submitted 17 November, 2016; originally announced November 2016.

    Comments: Presented at NIPS 2016 Workshop on Interpretable Machine Learning in Complex Systems

  31. arXiv:1606.05386  [pdf, other]

    stat.ML cs.LG

    Model-Agnostic Interpretability of Machine Learning

    Authors: Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin

    Abstract: Understanding why machine learning models behave the way they do empowers both system designers and end-users in many ways: in model selection, feature engineering, in order to trust and act upon the predictions, and in more intuitive user interfaces. Thus, interpretability has become a vital concern in machine learning, and work in the area of interpretable models has found renewed interest. In s…

    Submitted 16 June, 2016; originally announced June 2016.

    Comments: Presented at 2016 ICML Workshop on Human Interpretability in Machine Learning (WHI 2016), New York, NY

  32. arXiv:1602.04938  [pdf, other]

    cs.LG cs.AI stat.ML

    "Why Should I Trust You?": Explaining the Predictions of Any Classifier

    Authors: Marco Tulio Ribeiro, Sameer Singh, Carlos Guestrin

    Abstract: Despite widespread adoption, machine learning models remain mostly black boxes. Understanding the reasons behind predictions is, however, quite important in assessing trust, which is fundamental if one plans to take action based on a prediction, or when choosing whether to deploy a new model. Such understanding also provides insights into the model, which can be used to transform an untrustworthy…

    Submitted 9 August, 2016; v1 submitted 16 February, 2016; originally announced February 2016.