Showing 1–24 of 24 results for author: Betke, M

Searching in archive cs.
  1. arXiv:2407.10091

    cs.CL

    Enhancing Emotion Prediction in News Headlines: Insights from ChatGPT and Seq2Seq Models for Free-Text Generation

    Authors: Ge Gao, Jongin Kim, Sejin Paik, Ekaterina Novozhilova, Yi Liu, Sarah T. Bonna, Margrit Betke, Derry Tanti Wijaya

    Abstract: Predicting emotions elicited by news headlines can be challenging as the task is largely influenced by the varying nature of people's interpretations and backgrounds. Previous works have explored classifying discrete emotions directly from news headlines. We provide a different approach to tackling this problem by utilizing people's explanations of their emotion, written in free-text, on how they…

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: published at LREC-COLING 2024

    ACM Class: I.2.7

    Journal ref: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) 5944-5955

  2. Detecting Frames in News Headlines and Lead Images in U.S. Gun Violence Coverage

    Authors: Isidora Chara Tourni, Lei Guo, Hengchang Hu, Edward Halim, Prakash Ishwar, Taufiq Daryanto, Mona Jalal, Boqi Chen, Margrit Betke, Fabian Zhafransyah, Sha Lai, Derry Tanti Wijaya

    Abstract: News media structure their reporting of events or issues using certain perspectives. When describing an incident involving gun violence, for example, some journalists may focus on mental health or gun regulation, while others may emphasize the discussion of gun rights. Such perspectives are called "frames" in communication research. We study, for the first time, the value of combining lead i…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: published at Findings of the Association for Computational Linguistics: EMNLP 2021

  3. arXiv:2306.04736

    cs.CV

    BU-CVKit: Extendable Computer Vision Framework for Species Independent Tracking and Analysis

    Authors: Mahir Patel, Lucas Carstensen, Yiwen Gu, Michael E. Hasselmo, Margrit Betke

    Abstract: A major bottleneck of interdisciplinary computer vision (CV) research is the lack of a framework that eases the reuse and abstraction of state-of-the-art CV models by CV and non-CV researchers alike. We present here BU-CVKit, a computer vision framework that allows the creation of research pipelines with chainable Processors. The community can create plugins of their work for the framework, hence…

    Submitted 7 June, 2023; originally announced June 2023.

  4. arXiv:2211.14703

    cs.CV

    Exploring Consistency in Cross-Domain Transformer for Domain Adaptive Semantic Segmentation

    Authors: Kaihong Wang, Donghyun Kim, Rogerio Feris, Kate Saenko, Margrit Betke

    Abstract: While transformers have greatly boosted performance in semantic segmentation, domain adaptive transformers are not yet well explored. We identify that the domain gap can cause discrepancies in self-attention. Due to this gap, the transformer attends to spurious regions or pixels, which deteriorates accuracy on the target domain. We propose to perform adaptation on attention maps with cross-domain…

    Submitted 20 December, 2022; v1 submitted 26 November, 2022; originally announced November 2022.

  5. arXiv:2205.09671

    cs.CV

    A graph-transformer for whole slide image classification

    Authors: Yi Zheng, Rushin H. Gindra, Emily J. Green, Eric J. Burks, Margrit Betke, Jennifer E. Beane, Vijaya B. Kolachalama

    Abstract: Deep learning is a powerful tool for whole slide image (WSI) analysis. Typically, when performing supervised deep learning, a WSI is divided into small patches, trained and the outcomes are aggregated to estimate disease grade. However, patch-based methods introduce label noise during training by assuming that each patch is independent with the same label as the WSI and neglect overall WSI-level i…

    Submitted 19 May, 2022; originally announced May 2022.

  6. arXiv:2204.00172

    cs.CV cs.LG

    A Unified Framework for Domain Adaptive Pose Estimation

    Authors: Donghyun Kim, Kaihong Wang, Kate Saenko, Margrit Betke, Stan Sclaroff

    Abstract: While pose estimation is an important computer vision task, it requires expensive annotation and suffers from domain shift. In this paper, we investigate the problem of domain adaptive 2D pose estimation that transfers knowledge learned on a synthetic source domain to a target domain without supervision. While several domain adaptive pose estimation models have been proposed recently, they are not…

    Submitted 5 August, 2022; v1 submitted 31 March, 2022; originally announced April 2022.

  7. arXiv:2009.08610

    cs.CV

    Consistency Regularization with High-dimensional Non-adversarial Source-guided Perturbation for Unsupervised Domain Adaptation in Segmentation

    Authors: Kaihong Wang, Chenhongyi Yang, Margrit Betke

    Abstract: Unsupervised domain adaptation for semantic segmentation has been intensively studied due to the low cost of the pixel-level annotation for synthetic data. The most common approaches try to generate images or features mimicking the distribution in the target domain while preserving the semantic contents in the source domain so that a model can be trained with annotations from the latter. However,…

    Submitted 17 September, 2020; originally announced September 2020.

  8. arXiv:2008.06974

    cs.CL cs.IR cs.LG

    OpenFraming: We brought the ML; you bring the data. Interact with your data and discover its frames

    Authors: Alyssa Smith, David Assefa Tofu, Mona Jalal, Edward Edberg Halim, Yimeng Sun, Vidya Akavoor, Margrit Betke, Prakash Ishwar, Lei Guo, Derry Wijaya

    Abstract: When journalists cover a news story, they can cover the story from multiple angles or perspectives. A news article written about COVID-19 for example, might focus on personal preventative actions such as mask-wearing, while another might focus on COVID-19's impact on the economy. These perspectives are called "frames," which when used may influence public perception and opinion of the issue. We in…

    Submitted 16 August, 2020; originally announced August 2020.

    Comments: 8 pages, 8 figures, EMNLP 2020 demonstration papers

  9. arXiv:2008.05955

    cs.CV cs.GR cs.LG cs.RO eess.IV

    SIDOD: A Synthetic Image Dataset for 3D Object Pose Recognition with Distractors

    Authors: Mona Jalal, Josef Spjut, Ben Boudaoud, Margrit Betke

    Abstract: We present a new, publicly-available image dataset generated by the NVIDIA Deep Learning Data Synthesizer intended for use in object detection, pose estimation, and tracking applications. This dataset contains 144k stereo image pairs that synthetically combine 18 camera viewpoints of three photorealistic virtual environments with up to 10 objects (chosen randomly from the 21 object models of the Y…

    Submitted 11 August, 2020; originally announced August 2020.

    Comments: 3 pages, 4 figures, 1 table, Accepted at CVPR 2019 Workshop

  10. arXiv:2002.05242

    cs.CV cs.HC cs.LG

    Leveraging Affect Transfer Learning for Behavior Prediction in an Intelligent Tutoring System

    Authors: Nataniel Ruiz, Hao Yu, Danielle A. Allessio, Mona Jalal, Ajjen Joshi, Thomas Murray, John J. Magee, Jacob R. Whitehill, Vitaly Ablavsky, Ivon Arroyo, Beverly P. Woolf, Stan Sclaroff, Margrit Betke

    Abstract: In this work, we propose a video-based transfer learning approach for predicting problem outcomes of students working with an intelligent tutoring system (ITS). By analyzing a student's face and gestures, our method predicts the outcome of a student answering a problem in an ITS from a video feed. Our work is motivated by the reasoning that the ability to predict such outcomes enables tutoring sys…

    Submitted 8 April, 2022; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: Published at IEEE International Conference on Automatic Face and Gesture Recognition (FG), 2021 - Best Poster Award (4% award rate)

  11. arXiv:2002.04181

    cs.CL cs.LG cs.SI

    Performance Comparison of Crowdworkers and NLP Tools on Named-Entity Recognition and Sentiment Analysis of Political Tweets

    Authors: Mona Jalal, Kate K. Mays, Lei Guo, Margrit Betke

    Abstract: We report results of a comparison of the accuracy of crowdworkers and seven Natural Language Processing (NLP) toolkits in solving two important NLP tasks, named-entity recognition (NER) and entity-level sentiment (ELS) analysis. We here focus on a challenging dataset, 1,000 political tweets that were collected during the U.S. presidential primary election in February 2016. Each tweet refers to at…

    Submitted 11 August, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

    Comments: 4 pages, 1 figure, Accepted at WiNLP Workshop at NAACL 2018

  12. arXiv:1912.01674

    cs.CV

    Learning to Separate: Detecting Heavily-Occluded Objects in Urban Scenes

    Authors: Chenhongyi Yang, Vitaly Ablavsky, Kaihong Wang, Qi Feng, Margrit Betke

    Abstract: While visual object detection with deep learning has received much attention in the past decade, cases when heavy intra-class occlusions occur have not been studied thoroughly. In this work, we propose a Non-Maximum-Suppression (NMS) algorithm that dramatically improves the detection recall while maintaining high precision in scenes with heavy occlusions. Our NMS algorithm is derived from a novel…

    Submitted 19 July, 2020; v1 submitted 3 December, 2019; originally announced December 2019.

    Comments: ECCV 2020

  13. arXiv:1911.07046

    cs.CV

    A method for detecting text of arbitrary shapes in natural scenes that improves text spotting

    Authors: Qitong Wang, Yi Zheng, Margrit Betke

    Abstract: Understanding the meaning of text in images of natural scenes like highway signs or store front emblems is particularly challenging if the text is foreshortened in the image or the letters are artistically distorted. We introduce a pipeline-based text spotting framework that can both detect and recognize text in various fonts, shapes, and orientations in natural scene images with complicated backg…

    Submitted 27 May, 2020; v1 submitted 16 November, 2019; originally announced November 2019.

    Comments: Accepted by IEEE CVPR-W 2020

  14. arXiv:1909.00134

    cs.CV

    Scraping Social Media Photos Posted in Kenya and Elsewhere to Detect and Analyze Food Types

    Authors: Kaihong Wang, Mona Jalal, Sankara Jefferson, Yi Zheng, Elaine O. Nsoesie, Margrit Betke

    Abstract: Monitoring population-level changes in diet could be useful for education and for implementing interventions to improve health. Research has shown that data from social media sources can be used for monitoring dietary behavior. We propose a scrape-by-location methodology to create food image datasets from Instagram posts. We used it to collect 3.56 million images over a period of 20 days in March…

    Submitted 31 August, 2019; originally announced September 2019.

    Comments: Another version of the paper was submitted to the ACM International Conference on Multimedia (ACMMM2019)

  15. arXiv:1908.01403

    cs.CV

    Deep Neural Network for Semantic-based Text Recognition in Images

    Authors: Yi Zheng, Qitong Wang, Margrit Betke

    Abstract: State-of-the-art text spotting systems typically aim to detect isolated words or word-by-word text in images of natural scenes and ignore the semantic coherence within a region of text. However, when interpreted together, seemingly isolated words may be easier to recognize. On this basis, we propose a novel "semantic-based text recognition" (STR) deep learning model that reads text in images with…

    Submitted 9 December, 2019; v1 submitted 4 August, 2019; originally announced August 2019.

  16. arXiv:1905.00060

    cs.CV

    Predicting How to Distribute Work Between Algorithms and Humans to Segment an Image Batch

    Authors: Danna Gurari, Yinan Zhao, Suyog Dutt Jain, Margrit Betke, Kristen Grauman

    Abstract: Foreground object segmentation is a critical step for many image analysis tasks. While automated methods can produce high-quality results, their failures disappoint users in need of practical solutions. We propose a resource allocation framework for predicting how best to allocate a fixed budget of human annotation effort in order to collect higher quality segmentations for a given batch of images…

    Submitted 30 April, 2019; originally announced May 2019.

  17. arXiv:1901.06237

    cs.HC cs.LG stat.ML

    BUOCA: Budget-Optimized Crowd Worker Allocation

    Authors: Mehrnoosh Sameki, Sha Lai, Kate K. Mays, Lei Guo, Prakash Ishwar, Margrit Betke

    Abstract: Due to concerns about human error in crowdsourcing, it is standard practice to collect labels for the same data point from multiple internet workers. We here show that the resulting budget can be used more effectively with a flexible worker assignment strategy that asks fewer workers to analyze easy-to-label data and more workers to analyze data that requires extra scrutiny. Our main contribution…

    Submitted 11 January, 2019; originally announced January 2019.

  18. arXiv:1810.01771

    cs.CV

    SAVOIAS: A Diverse, Multi-Category Visual Complexity Dataset

    Authors: Elham Saraee, Mona Jalal, Margrit Betke

    Abstract: Visual complexity identifies the level of intricacy and details in an image or the level of difficulty to describe the image. It is an important concept in a variety of areas such as cognitive psychology, computer vision and visualization, and advertisement. Yet, efforts to create large, downloadable image datasets with diverse content and unbiased groundtruthing are lacking. In this work, we intr…

    Submitted 3 October, 2018; originally announced October 2018.

    Comments: 10 pages, 4 figures, 4 tables

  19. arXiv:1705.00366

    cs.CV

    Predicting Foreground Object Ambiguity and Efficiently Crowdsourcing the Segmentation(s)

    Authors: Danna Gurari, Kun He, Bo Xiong, Jianming Zhang, Mehrnoosh Sameki, Suyog Dutt Jain, Stan Sclaroff, Margrit Betke, Kristen Grauman

    Abstract: We propose the ambiguity problem for the foreground object segmentation task and motivate the importance of estimating and accounting for this ambiguity when designing vision systems. Specifically, we distinguish between images which lead multiple annotators to segment different foreground objects (ambiguous) versus minor inter-annotator differences of the same object. Taking images from eight wid…

    Submitted 30 April, 2017; originally announced May 2017.

  20. arXiv:1702.00583

    cs.CV

    Automating Image Analysis by Annotating Landmarks with Deep Neural Networks

    Authors: Mikhail Breslav, Tyson L. Hedrick, Stan Sclaroff, Margrit Betke

    Abstract: Image and video analysis is often a crucial step in the study of animal behavior and kinematics. Often these analyses require that the position of one or more animal landmarks are annotated (marked) in numerous images. The process of annotating landmarks can require a significant amount of time and tedious labor, which motivates the need for algorithms that can automatically annotate landmarks. In…

    Submitted 2 February, 2017; originally announced February 2017.

    Comments: 30 pages

  21. arXiv:1608.08953

    cs.HC cs.CL cs.SI

    Dynamic Allocation of Crowd Contributions for Sentiment Analysis during the 2016 U.S. Presidential Election

    Authors: Mehrnoosh Sameki, Mattia Gentil, Kate K. Mays, Lei Guo, Margrit Betke

    Abstract: Opinions about the 2016 U.S. Presidential Candidates have been expressed in millions of tweets that are challenging to analyze automatically. Crowdsourcing the analysis of political tweets effectively is also difficult, due to large inter-rater disagreements when sarcasm is involved. Each tweet is typically analyzed by a fixed number of workers and majority voting. We here propose a crowdsourcing…

    Submitted 9 February, 2017; v1 submitted 31 August, 2016; originally announced August 2016.

    Comments: 10 pages, 3 figures

  22. arXiv:1607.07525

    cs.CV

    Salient Object Subitizing

    Authors: Jianming Zhang, Shugao Ma, Mehrnoosh Sameki, Stan Sclaroff, Margrit Betke, Zhe Lin, Xiaohui Shen, Brian Price, Radomir Mech

    Abstract: We study the problem of Salient Object Subitizing, i.e. predicting the existence and the number of salient objects in an image using holistic cues. This task is inspired by the ability of people to quickly and accurately identify the number of items within the subitizing range (1-4). To this end, we present a salient object subitizing image dataset of about 14K everyday images which are annotated…

    Submitted 25 July, 2016; originally announced July 2016.

  23. arXiv:1605.00707

    cs.CV

    Discovering Useful Parts for Pose Estimation in Sparsely Annotated Datasets

    Authors: Mikhail Breslav, Tyson L. Hedrick, Stan Sclaroff, Margrit Betke

    Abstract: Our work introduces a novel way to increase pose estimation accuracy by discovering parts from unannotated regions of training images. Discovered parts are used to generate more accurate appearance likelihoods for traditional part-based models like Pictorial Structures [13] and its derivatives. Our experiments on images of a hawkmoth in flight show that our proposed approach significantly improves…

    Submitted 2 May, 2016; originally announced May 2016.

    Comments: Accepted at WACV 2016

  24. arXiv:1107.0998

    cs.IT cs.AI

    An Information Theoretic Representation of Agent Dynamics as Set Intersections

    Authors: Samuel Epstein, Margrit Betke

    Abstract: We represent agents as sets of strings. Each string encodes a potential interaction with another agent or environment. We represent the total set of dynamics between two agents as the intersection of their respective strings, we prove complexity properties of player interactions using Algorithmic Information Theory. We show how the proposed construction is compatible with Universal Artificial Inte…

    Submitted 5 July, 2011; originally announced July 2011.