Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–17 of 17 results for author: Radke, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10842  [pdf, other

    cs.CL cs.AI cs.HC

    Large Language Models for Automatic Milestone Detection in Group Discussions

    Authors: Zhuoxu Duan, Zhengye Yang, Samuel Westby, Christoph Riedl, Brooke Foucault Welles, Richard J. Radke

    Abstract: Large language models like GPT have proven widely successful on natural language understanding tasks based on written text documents. In this paper, we investigate an LLM's performance on recordings of a group oral communication task in which utterances are often truncated or not well-formed. We propose a new group task experiment involving a puzzle with several milestones that can be achieved in… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  2. arXiv:2404.07887  [pdf, other

    cs.CV

    Context-aware Video Anomaly Detection in Long-Term Datasets

    Authors: Zhengye Yang, Richard Radke

    Abstract: Video anomaly detection research is generally evaluated on short, isolated benchmark videos only a few minutes long. However, in real-world environments, security cameras observe the same scene for months or years at a time, and the notion of anomalous behavior critically depends on context, such as the time of day, day of week, or schedule of events. Here, we propose a context-aware video anomaly… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  3. arXiv:2401.15194  [pdf

    cs.HC

    Multimodality in Group Communication Research

    Authors: Robin Lange, Brooke Foucault Welles, Gyanendra Sharma, Richard J. Radke, Javier O. Garcia, Christoph Riedl

    Abstract: Team interactions are often multisensory, requiring members to pick up on verbal, visual, spatial and body language cues. Multimodal research, research that captures multiple modes of communication such as audio and visual signals, is therefore integral to understanding these multisensory group communication processes. This type of research has gained traction in biomedical engineering and neurosc… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 27 pages, 3 figures

  4. arXiv:2308.11786  [pdf, other

    cs.HC

    Building Better Human-Agent Teams: Balancing Human Resemblance and Contribution in Voice Assistants

    Authors: Samuel Westby, Richard J. Radke, Christoph Riedl, Brooke Foucault Welles

    Abstract: Voice assistants are increasingly prevalent, from personal devices to team environments. This study explores how voice type and contribution quality influence human-agent team performance and perceptions of anthropomorphism, animacy, intelligence, and trustworthiness. By manipulating both, we reveal mechanisms of perception and clarify ambiguity in previous work. Our results show that the human re… ▽ More

    Submitted 16 May, 2024; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: 12 pages, 4 figures

  5. arXiv:2207.04398  [pdf, other

    cs.CV cs.AI

    Self-supervised Learning with Local Contrastive Loss for Detection and Semantic Segmentation

    Authors: Ashraful Islam, Ben Lundell, Harpreet Sawhney, Sudipta Sinha, Peter Morales, Richard J. Radke

    Abstract: We present a self-supervised learning (SSL) method suitable for semi-global tasks such as object detection and semantic segmentation. We enforce local consistency between self-learned features, representing corresponding image locations of transformed versions of the same image, by minimizing a pixel-level local contrastive (LC) loss during training. LC-loss can be added to existing self-supervise… ▽ More

    Submitted 7 December, 2022; v1 submitted 10 July, 2022; originally announced July 2022.

    Comments: accepted to WACV 2023

  6. arXiv:2106.07807  [pdf, other

    cs.CV

    Dynamic Distillation Network for Cross-Domain Few-Shot Recognition with Unlabeled Data

    Authors: Ashraful Islam, Chun-Fu Chen, Rameswar Panda, Leonid Karlinsky, Rogerio Feris, Richard J. Radke

    Abstract: Most existing works in few-shot learning rely on meta-learning the network on a large base dataset which is typically from the same domain as the target dataset. We tackle the problem of cross-domain few-shot learning where there is a large shift between the base and target domain. The problem of cross-domain few-shot recognition with unlabeled target data is largely unaddressed in the literature.… ▽ More

    Submitted 1 November, 2021; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: Accepted to NeurIPS 2021

  7. arXiv:2103.13517  [pdf, other

    cs.CV

    A Broad Study on the Transferability of Visual Representations with Contrastive Learning

    Authors: Ashraful Islam, Chun-Fu Chen, Rameswar Panda, Leonid Karlinsky, Richard Radke, Rogerio Feris

    Abstract: Tremendous progress has been made in visual representation learning, notably with the recent success of self-supervised contrastive learning methods. Supervised contrastive learning has also been shown to outperform its cross-entropy counterparts by leveraging labels for choosing where to contrast. However, there has been little work to explore the transfer capability of contrastive learning to a… ▽ More

    Submitted 15 August, 2021; v1 submitted 24 March, 2021; originally announced March 2021.

    Comments: accepted to ICCV 2021

  8. arXiv:2101.00545  [pdf, other

    cs.CV cs.AI

    A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization

    Authors: Ashraful Islam, Chengjiang Long, Richard Radke

    Abstract: Weakly supervised temporal action localization is a challenging vision task due to the absence of ground-truth temporal locations of actions in the training videos. With only video-level supervision during training, most existing methods rely on a Multiple Instance Learning (MIL) framework to predict the start and end frame of each action category in a video. However, the existing MIL-based approa… ▽ More

    Submitted 24 March, 2021; v1 submitted 2 January, 2021; originally announced January 2021.

    Comments: Extended version/preprint of a AAAI 2021 paper

  9. arXiv:2008.06035  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Towards Visually Explaining Similarity Models

    Authors: Meng Zheng, Srikrishna Karanam, Terrence Chen, Richard J. Radke, Ziyan Wu

    Abstract: We consider the problem of visually explaining similarity models, i.e., explaining why a model predicts two images to be similar in addition to producing a scalar score. While much recent work in visual model interpretability has focused on gradient-based attention, these methods rely on a classification module to generate visual explanations. Consequently, they cannot readily explain other kinds… ▽ More

    Submitted 13 October, 2020; v1 submitted 13 August, 2020; originally announced August 2020.

    Comments: 13 pages, 10 figures, 4 tables. arXiv admin note: substantial text overlap with arXiv:1911.07381

  10. arXiv:2001.07793  [pdf, other

    cs.CV cs.LG

    Weakly Supervised Temporal Action Localization Using Deep Metric Learning

    Authors: Ashraful Islam, Richard J. Radke

    Abstract: Temporal action localization is an important step towards video understanding. Most current action localization methods depend on untrimmed videos with full temporal annotations of action instances. However, it is expensive and time-consuming to annotate both action labels and temporal boundaries of videos. To this end, we propose a weakly supervised temporal action localization method that only r… ▽ More

    Submitted 21 January, 2020; originally announced January 2020.

    Comments: accepted to WACV 2020

  11. arXiv:1911.11751  [pdf, other

    cs.CV cs.HC

    Multi-person Spatial Interaction in a Large Immersive Display Using Smartphones as Touchpads

    Authors: Gyanendra Sharma, Richard J Radke

    Abstract: In this paper, we present a multi-user interaction interface for a large immersive space that supports simultaneous screen interactions by combining (1) user input via personal smartphones and Bluetooth microphones, (2) spatial tracking via an overhead array of Kinect sensors, and (3) WebSocket interfaces to a webpage running on the large screen. Users are automatically, dynamically assigned perso… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: 8 pages with references

    ACM Class: H.5.2; H.5.1

  12. arXiv:1911.07389  [pdf, other

    cs.CV cs.LG

    Towards Visually Explaining Variational Autoencoders

    Authors: Wenqian Liu, Runze Li, Meng Zheng, Srikrishna Karanam, Ziyan Wu, Bir Bhanu, Richard J. Radke, Octavia Camps

    Abstract: Recent advances in Convolutional Neural Network (CNN) model interpretability have led to impressive progress in visualizing and understanding model predictions. In particular, gradient-based visual attention methods have driven much recent effort in using visual attention maps as a means for visual explanations. A key problem, however, is these methods are designed for classification and categoriz… ▽ More

    Submitted 14 April, 2020; v1 submitted 17 November, 2019; originally announced November 2019.

    Comments: 10 pages, 9 figures, 2 tables, CVPR 2020

  13. arXiv:1911.07381  [pdf, other

    cs.CV cs.LG

    Visual Similarity Attention

    Authors: Meng Zheng, Srikrishna Karanam, Terrence Chen, Richard J. Radke, Ziyan Wu

    Abstract: While there has been substantial progress in learning suitable distance metrics, these techniques in general lack transparency and decision reasoning, i.e., explaining why the input set of images is similar or dissimilar. In this work, we solve this key problem by proposing the first method to generate generic visual similarity explanations with gradient-based attention. We demonstrate that our te… ▽ More

    Submitted 3 May, 2022; v1 submitted 17 November, 2019; originally announced November 2019.

    Comments: 10 pages, 7 figures, 4 tables

  14. arXiv:1811.07487  [pdf, other

    cs.CV cs.LG

    Re-Identification with Consistent Attentive Siamese Networks

    Authors: Meng Zheng, Srikrishna Karanam, Ziyan Wu, Richard J. Radke

    Abstract: We propose a new deep architecture for person re-identification (re-id). While re-id has seen much recent progress, spatial localization and view-invariant representation learning for robust cross-view matching remain key, unsolved problems. We address these questions by means of a new attention-driven Siamese learning architecture, called the Consistent Attentive Siamese Network. Our key innovati… ▽ More

    Submitted 11 April, 2019; v1 submitted 18 November, 2018; originally announced November 2018.

    Comments: 10 pages, 8 figures, 3 tables, to appear in CVPR 2019

  15. arXiv:1808.05499  [pdf, other

    cs.CV

    Measuring the Temporal Behavior of Real-World Person Re-Identification

    Authors: Meng Zheng, Srikrishna Karanam, Richard J. Radke

    Abstract: Designing real-world person re-identification (re-id) systems requires attention to operational aspects not typically considered in academic research. Typically, the probe image or image sequence is matched to a gallery set with a fixed candidate list. On the other hand, in real-world applications of re-id, we would search for a person of interest in a gallery set that is continuously populated by… ▽ More

    Submitted 16 August, 2018; originally announced August 2018.

    Comments: 14 pages, 14 figures

  16. arXiv:1706.00553  [pdf, other

    cs.CV

    Rank Persistence: Assessing the Temporal Performance of Real-World Person Re-Identification

    Authors: Srikrishna Karanam, Eric Lam, Richard J. Radke

    Abstract: Designing useful person re-identification systems for real-world applications requires attention to operational aspects not typically considered in academic research. Here, we focus on the temporal aspect of re-identification; that is, instead of finding a match to a probe person of interest in a fixed candidate gallery, we consider the more realistic scenario in which the gallery is continuously… ▽ More

    Submitted 4 June, 2017; v1 submitted 2 June, 2017; originally announced June 2017.

    Comments: 8 pages, 7 figures

  17. arXiv:1605.09653  [pdf, other

    cs.CV

    A Systematic Evaluation and Benchmark for Person Re-Identification: Features, Metrics, and Datasets

    Authors: Srikrishna Karanam, Mengran Gou, Ziyan Wu, Angels Rates-Borras, Octavia Camps, Richard J. Radke

    Abstract: Person re-identification (re-id) is a critical problem in video analytics applications such as security and surveillance. The public release of several datasets and code for vision algorithms has facilitated rapid progress in this area over the last few years. However, directly comparing re-id algorithms reported in the literature has become difficult since a wide variety of features, experimental… ▽ More

    Submitted 14 February, 2018; v1 submitted 31 May, 2016; originally announced May 2016.

    Comments: Preliminary work on person Re-Id benchmark. S. Karanam and M. Gou contributed equally. 14 pages, 6 figures, 4 tables. For supplementary material, see http://robustsystems.coe.neu.edu/sites/robustsystems.coe.neu.edu/files/systems/supmat/ReID_benchmark_supp.zip