Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleJanuary 2025
LLM Comparator: Interactive Analysis of Side-by-Side Evaluation of Large Language Models
- Minsuk Kahng,
- Ian Tenney,
- Mahima Pushkarna,
- Michael Xieyang Liu,
- James Wexler,
- Emily Reif,
- Krystal Kallarackal,
- Minsuk Chang,
- Michael Terry,
- Lucas Dixon
IEEE Transactions on Visualization and Computer Graphics (ITVC), Volume 31, Issue 1Pages 503–513https://doi.org/10.1109/TVCG.2024.3456354Evaluating large language models (LLMs) presents unique challenges. While automatic side-by-side evaluation, also known as LLM-as-a-judge, has become a promising solution, model developers and researchers face difficulties with scalability and ...
- research-articleJune 2024
Adversarial Nibbler: An Open Red-Teaming Method for Identifying Diverse Harms in Text-to-Image Generation
- Jessica Quaye,
- Alicia Parrish,
- Oana Inel,
- Charvi Rastogi,
- Hannah Rose Kirk,
- Minsuk Kahng,
- Erin Van Liemt,
- Max Bartolo,
- Jess Tsang,
- Justin White,
- Nathan Clement,
- Rafael Mosquera,
- Juan Ciro,
- Vijay Janapa Reddi,
- Lora Aroyo
FAccT '24: Proceedings of the 2024 ACM Conference on Fairness, Accountability, and TransparencyPages 388–406https://doi.org/10.1145/3630106.3658913With text-to-image (T2I) generative AI models reaching wide audiences, it is critical to evaluate model robustness against non-obvious attacks to mitigate the generation of offensive images. By focusing on “implicitly adversarial” prompts (those that ...
- Work in ProgressMay 2024
Automatic Histograms: Leveraging Language Models for Text Dataset Exploration
CHI EA '24: Extended Abstracts of the CHI Conference on Human Factors in Computing SystemsArticle No.: 53, Pages 1–9https://doi.org/10.1145/3613905.3650798Making sense of unstructured text datasets is perennially difficult, yet increasingly relevant with Large Language Models. Data practitioners often rely on dataset summaries, especially distributions of various derived features. Some features, like ...
- Work in ProgressMay 2024
Understanding the Dataset Practitioners Behind Large Language Models
CHI EA '24: Extended Abstracts of the CHI Conference on Human Factors in Computing SystemsArticle No.: 350, Pages 1–7https://doi.org/10.1145/3613905.3651007As large language models (LLMs) become more advanced and impactful, it is increasingly important to scrutinize the data that they rely upon and produce. What is it to be a dataset practitioner doing this work? We approach this in two parts: first, we ...
- Work in ProgressMay 2024
LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large Language Models
- Minsuk Kahng,
- Ian Tenney,
- Mahima Pushkarna,
- Michael Xieyang Liu,
- James Wexler,
- Emily Reif,
- Krystal Kallarackal,
- Minsuk Chang,
- Michael Terry,
- Lucas Dixon
CHI EA '24: Extended Abstracts of the CHI Conference on Human Factors in Computing SystemsArticle No.: 216, Pages 1–7https://doi.org/10.1145/3613905.3650755Automatic side-by-side evaluation has emerged as a promising approach to evaluating the quality of responses from large language models (LLMs). However, analyzing the results from this evaluation approach raises scalability and interpretability ...
-
- posterMarch 2023
VIVA: Visual Exploration and Analysis of Videos with Interactive Annotation
- Anita Ruangrotsakun,
- Dayeon Oh,
- Thuy-Vy Nguyen,
- Kristina Lee,
- Mark Ser,
- Arthur Hiew,
- Rogers Ngo,
- Zeyad Shureih,
- Roli Khanna,
- Minsuk Kahng
IUI '23 Companion: Companion Proceedings of the 28th International Conference on Intelligent User InterfacesPages 162–165https://doi.org/10.1145/3581754.3584160This paper presents VIVA, a novel interactive tool for visually exploring long videos and searching for specific moments. Previous work on video data exploration and analytics often assumes that manually-created, rich annotations are available. However, ...
- research-articleApril 2022
FitVid: Responsive and Flexible Video Content Adaptation
CHI '22: Proceedings of the 2022 CHI Conference on Human Factors in Computing SystemsArticle No.: 501, Pages 1–16https://doi.org/10.1145/3491102.3501948Mobile video-based learning attracts many learners with its mobility and ease of access. However, most lectures are designed for desktops. Our formative study reveals mobile learners’ two major needs: more readable content and customizable video design. ...
- research-articleMarch 2022
Finding AI’s Faults with AAR/AI: An Empirical Study
- Roli Khanna,
- Jonathan Dodge,
- Andrew Anderson,
- Rupika Dikkala,
- Jed Irvine,
- Zeyad Shureih,
- Kin-Ho Lam,
- Caleb R. Matthews,
- Zhengxian Lin,
- Minsuk Kahng,
- Alan Fern,
- Margaret Burnett
ACM Transactions on Interactive Intelligent Systems (TIIS), Volume 12, Issue 1Article No.: 1, Pages 1–33https://doi.org/10.1145/3487065Would you allow an AI agent to make decisions on your behalf? If the answer is “not always,” the next question becomes “in what circumstances”? Answering this question requires human users to be able to assess an AI agent—and not just with overall pass/...
- letterDecember 2021
From heatmaps to structured explanations of image classifiers
AbstractThis paper summarizes our endeavors in the past few years in terms of explaining image classifiers, with the aim of including negative results and insights we have gained. The paper starts with describing the explainable neural network (XNN), ...
This paper summarizes our endeavors explaining image classifiers, including XNN, I‐GOS, iGOS++ (a) and SAG(b). We present tricks of the trade to make explanations work and lessons learned from these research endeavors. image image
- research-articleDecember 2021
One explanation is not enough: structured attention graphs for image classification
NIPS '21: Proceedings of the 35th International Conference on Neural Information Processing SystemsArticle No.: 868, Pages 11352–11363Saliency maps are popular tools for explaining the decisions of convolutional neural networks (CNNs) for image classification. Typically, for each image of interest, a single saliency map is produced, which assigns weights to pixels based on their ...
- letterSeptember 2021
From “no clear winner” to an effective Explainable Artificial Intelligence process: An empirical journey
- Jonathan Dodge,
- Andrew Anderson,
- Roli Khanna,
- Jed Irvine,
- Rupika Dikkala,
- Kin‐Ho Lam,
- Delyar Tabatabai,
- Anita Ruangrotsakun,
- Zeyad Shureih,
- Minsuk Kahng,
- Alan Fern,
- Margaret Burnett
Abstract“In what circumstances would you want this AI to make decisions on your behalf?” We have been investigating how to enable a user of an Artificial Intelligence‐powered system to answer questions like this through a series of empirical studies, a ...
"In what circumstances would you want this Artificial Intelligence (AI) to make decisions on your behalf?" This paper summarizes multiple empirical investigations of how to enable a user of an AI‐powered system to answer questions like this. image image ...
- editorialDecember 2020
- abstractApril 2020
CNN 101: Interactive Visual Learning for Convolutional Neural Networks
- Zijie J. Wang,
- Robert Turko,
- Omar Shaikh,
- Haekyu Park,
- Nilaksh Das,
- Fred Hohman,
- Minsuk Kahng,
- Duen Horng Chau
CHI EA '20: Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing SystemsPages 1–7https://doi.org/10.1145/3334480.3382899The success of deep learning solving previously-thought hard problems has inspired many non-experts to learn and understand this exciting technology. However, it is often challenging for learners to take the first steps due to the complexity of deep ...
- research-articleAugust 2019
Visual Analytics in Deep Learning: An Interrogative Survey for the Next Frontiers
IEEE Transactions on Visualization and Computer Graphics (ITVC), Volume 25, Issue 8Pages 2674–2693https://doi.org/10.1109/TVCG.2018.2843369Deep learning has recently seen rapid development and received significant attention due to its state-of-the-art performance on previously-thought hard problems. However, because of the internal complexity and nonlinear structure of deep neural networks,...
- research-articleJanuary 2019
GAN Lab: Understanding Complex Deep Generative Models using Interactive Visual Experimentation
IEEE Transactions on Visualization and Computer Graphics (ITVC), Volume 25, Issue 1Pages 310–320https://doi.org/10.1109/TVCG.2018.2864500Recent success in deep learning has generated immense interest among practitioners and students, inspiring many to learn about this new technology. While visual and interactive approaches have been successfully developed to help people more easily learn ...
- research-articleFebruary 2018
Chronodes: Interactive Multifocus Exploration of Event Sequences
- Peter J. Polack Jr.,
- Shang-Tse Chen,
- Minsuk Kahng,
- Kaya De Barbaro,
- Rahul Basole,
- Moushumi Sharmin,
- Duen Horng Chau
ACM Transactions on Interactive Intelligent Systems (TIIS), Volume 8, Issue 1Article No.: 2, Pages 1–21https://doi.org/10.1145/3152888The advent of mobile health (mHealth) technologies challenges the capabilities of current visualizations, interactive tools, and algorithms. We present Chronodes, an interactive system that unifies data mining and human-centric visualization techniques ...
- demonstrationSeptember 2017
mHealth visual discovery dashboard
- Dezhi Fang,
- Fred Hohman,
- Peter Polack,
- Hillol Sarker,
- Minsuk Kahng,
- Moushumi Sharmin,
- Mustafa al'Absi,
- Duen Horng Chau
UbiComp '17: Proceedings of the 2017 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2017 ACM International Symposium on Wearable ComputersPages 237–240https://doi.org/10.1145/3123024.3123170We present Discovery Dashboard, a visual analytics system for exploring large volumes of time series data from mobile medical field studies. Discovery Dashboard offers interactive exploration tools and a data mining motif discovery algorithm to help ...
- research-articleAugust 2016
Interactive browsing and navigation in relational databases
Proceedings of the VLDB Endowment (PVLDB), Volume 9, Issue 12Pages 1017–1028https://doi.org/10.14778/2994509.2994520Although researchers have devoted considerable attention to helping database users formulate queries, many users still find it challenging to specify queries that involve joining tables. To help users construct join queries for exploring relational ...
- research-articleJune 2016
Visual exploration of machine learning results using data cube analysis
HILDA '16: Proceedings of the Workshop on Human-In-the-Loop Data AnalyticsArticle No.: 1, Pages 1–6https://doi.org/10.1145/2939502.2939503As complex machine learning systems become more widely adopted, it becomes increasingly challenging for users to understand models or interpret the results generated from the models. We present our ongoing work on developing interactive and visual ...
- posterMarch 2016
STEPS: A Spatio-temporal Electric Power Systems Visualization
IUI '16 Companion: Companion Publication of the 21st International Conference on Intelligent User InterfacesPages 32–35https://doi.org/10.1145/2876456.2879480As the bulk electric grid becomes more complex, power system operators and engineers have more information to process and interpret than ever before. The information overload they experience can be mitigated by effective visualizations that facilitate ...