Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–30 of 30 results for author: Smith, J S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.00078  [pdf, other

    eess.SP cs.LG cs.NI

    SGP-RI: A Real-Time-Trainable and Decentralized IoT Indoor Localization Model Based on Sparse Gaussian Process with Reduced-Dimensional Inputs

    Authors: Zhe Tang, Sihao Li, Zichen Huang, Guandong Yang, Kyeong Soo Kim, Jeremy S. Smith

    Abstract: Internet of Things (IoT) devices are deployed in the filed, there is an enormous amount of untapped potential in local computing on those IoT devices. Harnessing this potential for indoor localization, therefore, becomes an exciting research area. Conventionally, the training and deployment of indoor localization models are based on centralized servers with substantial computational resources. Thi… ▽ More

    Submitted 24 August, 2024; originally announced September 2024.

    Comments: 10 pages, 4 figures, under review for journal publication

  2. arXiv:2408.09632  [pdf, other

    cs.LG cs.CL stat.ML

    MoDeGPT: Modular Decomposition for Large Language Model Compression

    Authors: Chi-Heng Lin, Shangqian Gao, James Seale Smith, Abhishek Patel, Shikhar Tuli, Yilin Shen, Hongxia Jin, Yen-Chang Hsu

    Abstract: Large Language Models (LLMs) have reshaped the landscape of artificial intelligence by demonstrating exceptional performance across various tasks. However, substantial computational requirements make their deployment challenging on devices with limited resources. Recently, compression methods using low-rank matrix techniques have shown promise, yet these often lead to degraded accuracy or introduc… ▽ More

    Submitted 20 August, 2024; v1 submitted 18 August, 2024; originally announced August 2024.

    Comments: 31 pages, 9 figures

    MSC Class: 15A23 (Primary) ACM Class: I.2.7

  3. arXiv:2407.13303  [pdf, other

    cs.LG

    Mean Teacher based SSL Framework for Indoor Localization Using Wi-Fi RSSI Fingerprinting

    Authors: Sihao Li, Zhe Tang, Kyeong Soo Kim, Jeremy S. Smith

    Abstract: Wi-Fi fingerprinting is widely applied for indoor localization due to the widespread availability of Wi-Fi devices. However, traditional methods are not ideal for multi-building and multi-floor environments due to the scalability issues. Therefore, more and more researchers have employed deep learning techniques to enable scalable indoor localization. This paper introduces a novel semi-supervised… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 12 pages, 10 figures, under preparation for a journal publication

  4. arXiv:2407.13288  [pdf, other

    cs.LG

    Hierarchical Stage-Wise Training of Linked Deep Neural Networks for Multi-Building and Multi-Floor Indoor Localization Based on Wi-Fi RSSI Fingerprinting

    Authors: Sihao Li, Kyeong Soo Kim, Zhe Tang, Graduate, Jeremy S. Smith

    Abstract: In this paper, we present a new solution to the problem of large-scale multi-building and multi-floor indoor localization based on linked neural networks, where each neural network is dedicated to a sub-problem and trained under a hierarchical stage-wise training framework. When the measured data from sensors have a hierarchical representation as in multi-building and multi-floor indoor localizati… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 9 pages, 5 figures, under review for journal publication

  5. arXiv:2404.12526  [pdf, other

    cs.LG cs.CL cs.CV

    Adaptive Memory Replay for Continual Learning

    Authors: James Seale Smith, Lazar Valkov, Shaunak Halbe, Vyshnavi Gutta, Rogerio Feris, Zsolt Kira, Leonid Karlinsky

    Abstract: Foundation Models (FMs) have become the hallmark of modern AI, however, these models are trained on massive data, leading to financially expensive training. Updating FMs as new data becomes available is important, however, can lead to `catastrophic forgetting', where models underperform on tasks related to data sub-populations observed too long ago. This continual learning (CL) phenomenon has been… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: CVPR-W 2024 (Spotlight)

  6. arXiv:2402.12756  [pdf, other

    cs.LG cs.NI

    Static vs. Dynamic Databases for Indoor Localization based on Wi-Fi Fingerprinting: A Discussion from a Data Perspective

    Authors: Zhe Tang, Ruocheng Gu, Sihao Li, Kyeong Soo Kim, Jeremy S. Smith

    Abstract: Wi-Fi fingerprinting has emerged as the most popular approach to indoor localization. The use of ML algorithms has greatly improved the localization performance of Wi-Fi fingerprinting, but its success depends on the availability of fingerprint databases composed of a large number of RSSIs, the MAC addresses of access points, and the other measurement information. However, most fingerprint databas… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: 10 pages, 5 figures, Invited paper with Excellent Paper Award to be presented at ICAIIC 2024, Osaka, Japan, Feb. 19--22, 2023

  7. arXiv:2311.18763  [pdf, other

    cs.CV cs.AI cs.LG

    Continual Diffusion with STAMINA: STack-And-Mask INcremental Adapters

    Authors: James Seale Smith, Yen-Chang Hsu, Zsolt Kira, Yilin Shen, Hongxia Jin

    Abstract: Recent work has demonstrated a remarkable ability to customize text-to-image diffusion models to multiple, fine-grained concepts in a sequential (i.e., continual) manner while only providing a few example images for each concept. This setting is known as continual diffusion. Here, we ask the question: Can we scale these methods to longer concept sequences without forgetting? Although prior work mi… ▽ More

    Submitted 2 May, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: CVPR-W 2024

  8. arXiv:2310.19182  [pdf, other

    cs.CV

    Fast Trainable Projection for Robust Fine-Tuning

    Authors: Junjiao Tian, Yen-Cheng Liu, James Seale Smith, Zsolt Kira

    Abstract: Robust fine-tuning aims to achieve competitive in-distribution (ID) performance while maintaining the out-of-distribution (OOD) robustness of a pre-trained model when transferring it to a downstream task. Recently, projected gradient descent has been successfully used in robust fine-tuning by constraining the deviation from the initialization of the fine-tuned model explicitly through projection.… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  9. arXiv:2306.09970  [pdf, other

    cs.CV cs.AI cs.LG

    HePCo: Data-Free Heterogeneous Prompt Consolidation for Continual Federated Learning

    Authors: Shaunak Halbe, James Seale Smith, Junjiao Tian, Zsolt Kira

    Abstract: In this paper, we focus on the important yet understudied problem of Continual Federated Learning (CFL), where a server communicates with a set of clients to incrementally learn new concepts over time without sharing or storing any data. The complexity of this problem is compounded by challenges from both the Continual and Federated Learning perspectives. Specifically, models trained in a CFL setu… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

  10. arXiv:2304.06027  [pdf, other

    cs.CV cs.AI cs.LG

    Continual Diffusion: Continual Customization of Text-to-Image Diffusion with C-LoRA

    Authors: James Seale Smith, Yen-Chang Hsu, Lingyu Zhang, Ting Hua, Zsolt Kira, Yilin Shen, Hongxia Jin

    Abstract: Recent works demonstrate a remarkable ability to customize text-to-image diffusion models while only providing a few example images. What happens if you try to customize such models using multiple, fine-grained concepts in a sequential (i.e., continual) manner? In our work, we show that recent state-of-the-art customization of text-to-image models suffer from catastrophic forgetting when new conce… ▽ More

    Submitted 2 May, 2024; v1 submitted 12 April, 2023; originally announced April 2023.

    Comments: Transactions on Machine Learning Research (TMLR) 2024

  11. arXiv:2303.17590  [pdf, other

    cs.CV cs.CL

    Going Beyond Nouns With Vision & Language Models Using Synthetic Data

    Authors: Paola Cascante-Bonilla, Khaled Shehada, James Seale Smith, Sivan Doveh, Donghyun Kim, Rameswar Panda, Gül Varol, Aude Oliva, Vicente Ordonez, Rogerio Feris, Leonid Karlinsky

    Abstract: Large-scale pre-trained Vision & Language (VL) models have shown remarkable performance in many applications, enabling replacing a fixed set of supported classes with zero-shot open vocabulary reasoning over (almost arbitrary) natural language prompts. However, recent works have uncovered a fundamental weakness of these models. For example, their difficulty to understand Visual Language Concepts (… ▽ More

    Submitted 30 August, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: Accepted to ICCV 2023. Project page: https://synthetic-vic.github.io/

  12. arXiv:2211.13218  [pdf, other

    cs.CV cs.AI cs.LG

    CODA-Prompt: COntinual Decomposed Attention-based Prompting for Rehearsal-Free Continual Learning

    Authors: James Seale Smith, Leonid Karlinsky, Vyshnavi Gutta, Paola Cascante-Bonilla, Donghyun Kim, Assaf Arbelle, Rameswar Panda, Rogerio Feris, Zsolt Kira

    Abstract: Computer vision models suffer from a phenomenon known as catastrophic forgetting when learning novel concepts from continuously shifting training data. Typical solutions for this continual learning problem require extensive rehearsal of previously seen data, which increases memory costs and may violate data privacy. Recently, the emergence of large-scale pre-trained vision transformer models has e… ▽ More

    Submitted 30 March, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: Accepted by the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023)

  13. arXiv:2211.12494  [pdf, other

    cs.CV cs.LG

    On the Transferability of Visual Features in Generalized Zero-Shot Learning

    Authors: Paola Cascante-Bonilla, Leonid Karlinsky, James Seale Smith, Yanjun Qi, Vicente Ordonez

    Abstract: Generalized Zero-Shot Learning (GZSL) aims to train a classifier that can generalize to unseen classes, using a set of attributes as auxiliary information, and the visual features extracted from a pre-trained convolutional neural network. While recent GZSL methods have explored various techniques to leverage the capacity of these features, there has been an extensive growth of representation learn… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

  14. arXiv:2211.09790  [pdf, other

    cs.LG cs.AI cs.CV

    ConStruct-VL: Data-Free Continual Structured VL Concepts Learning

    Authors: James Seale Smith, Paola Cascante-Bonilla, Assaf Arbelle, Donghyun Kim, Rameswar Panda, David Cox, Diyi Yang, Zsolt Kira, Rogerio Feris, Leonid Karlinsky

    Abstract: Recently, large-scale pre-trained Vision-and-Language (VL) foundation models have demonstrated remarkable capabilities in many zero-shot downstream tasks, achieving competitive results for recognizing objects defined by as little as short text prompts. However, it has also been shown that VL models are still brittle in Structured VL Concept (SVLC) reasoning, such as the ability to recognize object… ▽ More

    Submitted 30 March, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: Accepted by the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023)

  15. arXiv:2209.10537  [pdf, other

    cs.LG cs.AI cs.CV

    FedFOR: Stateless Heterogeneous Federated Learning with First-Order Regularization

    Authors: Junjiao Tian, James Seale Smith, Zsolt Kira

    Abstract: Federated Learning (FL) seeks to distribute model training across local clients without collecting data in a centralized data-center, hence removing data-privacy concerns. A major challenge for FL is data heterogeneity (where each client's data distribution can differ) as it can lead to weight divergence among local clients and slow global convergence. The current SOTA FL methods designed for data… ▽ More

    Submitted 21 September, 2022; originally announced September 2022.

  16. arXiv:2207.10895  [pdf, other

    cs.CV

    3D Random Occlusion and Multi-Layer Projection for Deep Multi-Camera Pedestrian Localization

    Authors: Rui Qiu, Ming Xu, Yuyao Yan, Jeremy S. Smith, Xi Yang

    Abstract: Although deep-learning based methods for monocular pedestrian detection have made great progress, they are still vulnerable to heavy occlusions. Using multi-view information fusion is a potential solution but has limited applications, due to the lack of annotated training samples in existing multi-view datasets, which increases the risk of overfitting. To address this problem, a data augmentation… ▽ More

    Submitted 25 July, 2022; v1 submitted 22 July, 2022; originally announced July 2022.

    Journal ref: European Conference on Computer Vision 2022

  17. arXiv:2205.09875  [pdf, other

    cs.LG cs.AI

    Incremental Learning with Differentiable Architecture and Forgetting Search

    Authors: James Seale Smith, Zachary Seymour, Han-Pang Chiu

    Abstract: As progress is made on training machine learning models on incrementally expanding classification tasks (i.e., incremental learning), a next step is to translate this progress to industry expectations. One technique missing from incremental learning is automatic architecture design via Neural Architecture Search (NAS). In this paper, we show that leveraging NAS for incremental learning results in… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: Accepted by the 2022 International Joint Conference on Neural Networks (IJCNN 2022)

  18. arXiv:2203.17269  [pdf, other

    cs.LG cs.AI cs.CV

    A Closer Look at Rehearsal-Free Continual Learning

    Authors: James Seale Smith, Junjiao Tian, Shaunak Halbe, Yen-Chang Hsu, Zsolt Kira

    Abstract: Continual learning is a setting where machine learning models learn novel concepts from continuously shifting training data, while simultaneously avoiding degradation of knowledge on previously seen classes which may disappear from the training data for extended periods of time (a phenomenon known as the catastrophic forgetting problem). Current approaches for continual learning of a single expand… ▽ More

    Submitted 3 April, 2023; v1 submitted 31 March, 2022; originally announced March 2022.

    Comments: Accepted by the 2023 IEEE/CVF Conference on Computer Vision and Pattern (CVPR) Workshop on Continual Learning in Computer Vision (CLVision 2023)

  19. arXiv:2103.01464  [pdf, other

    cs.RO

    NavTuner: Learning a Scene-Sensitive Family of Navigation Policies

    Authors: Haoxin Ma, Justin S. Smith, Patricio A. Vela

    Abstract: The advent of deep learning has inspired research into end-to-end learning for a variety of problem domains in robotics. For navigation, the resulting methods may not have the generalization properties desired let alone match the performance of traditional methods. Instead of learning a navigation policy, we explore learning an adaptive policy in the parameter space of an existing navigation modul… ▽ More

    Submitted 1 March, 2021; originally announced March 2021.

  20. arXiv:2008.10123  [pdf, other

    cs.CV cs.RO

    Good Graph to Optimize: Cost-Effective, Budget-Aware Bundle Adjustment in Visual SLAM

    Authors: Yipu Zhao, Justin S. Smith, Patricio A. Vela

    Abstract: The cost-efficiency of visual(-inertial) SLAM (VSLAM) is a critical characteristic of resource-limited applications. While hardware and algorithm advances have been significantly improved the cost-efficiency of VSLAM front-ends, the cost-efficiency of VSLAM back-ends remains a bottleneck. This paper describes a novel, rigorous method to improve the cost-efficiency of local BA in a BA-based VSLAM b… ▽ More

    Submitted 23 August, 2020; originally announced August 2020.

    Comments: 20 pages, 14 figures, 8 tables. Submitted to IEEE Transactions on Robotics, for the provided open-source software see https://github.com/ivalab/gf_orb_slam2

  21. arXiv:2003.04934  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.comp-ph

    Automated discovery of a robust interatomic potential for aluminum

    Authors: Justin S. Smith, Benjamin Nebgen, Nithin Mathew, Jie Chen, Nicholas Lubbers, Leonid Burakovsky, Sergei Tretiak, Hai Ah Nam, Timothy Germann, Saryu Fensin, Kipton Barros

    Abstract: Accuracy of molecular dynamics simulations depends crucially on the interatomic potential used to generate forces. The gold standard would be first-principles quantum mechanics (QM) calculations, but these become prohibitively expensive at large simulation scales. Machine learning (ML) based potentials aim for faithful emulation of QM at drastically reduced computational cost. The accuracy and rob… ▽ More

    Submitted 24 August, 2020; v1 submitted 10 March, 2020; originally announced March 2020.

  22. arXiv:2003.01317  [pdf, other

    cs.RO

    Closed-Loop Benchmarking of Stereo Visual-Inertial SLAM Systems: Understanding the Impact of Drift and Latency on Tracking Accuracy

    Authors: Yipu Zhao, Justin S. Smith, Sambhu H. Karumanchi, Patricio A. Vela

    Abstract: Visual-inertial SLAM is essential for robot navigation in GPS-denied environments, e.g. indoor, underground. Conventionally, the performance of visual-inertial SLAM is evaluated with open-loop analysis, with a focus on the drift level of SLAM systems. In this paper, we raise the question on the importance of visual estimation latency in closed-loop navigation tasks, such as accurate trajectory tra… ▽ More

    Submitted 7 March, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: 8 pages, 7 figures. Accepted for publication in ICRA 2020

  23. arXiv:1908.07101  [pdf, other

    cs.RO cs.CV

    Autonomous, Monocular, Vision-Based Snake Robot Navigation and Traversal of Cluttered Environments using Rectilinear Gait Motion

    Authors: Alexander H. Chang, Shiyu Feng, Yipu Zhao, Justin S. Smith, Patricio A. Vela

    Abstract: Rectilinear forms of snake-like robotic locomotion are anticipated to be an advantage in obstacle-strewn scenarios characterizing urban disaster zones, subterranean collapses, and other natural environments. The elongated, laterally-narrow footprint associated with these motion strategies is well-suited to traversal of confined spaces and narrow pathways. Navigation and path planning in the absenc… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.

  24. arXiv:1811.05253  [pdf, other

    cs.CV

    Image Captioning Based on a Hierarchical Attention Mechanism and Policy Gradient Optimization

    Authors: Shiyang Yan, Yuan Xie, Fangyu Wu, Jeremy S. Smith, Wenjin Lu, Bailing Zhang

    Abstract: Automatically generating the descriptions of an image, i.e., image captioning, is an important and fundamental topic in artificial intelligence, which bridges the gap between computer vision and natural language processing. Based on the successful deep learning models, especially the CNN model and Long Short-Term Memories (LSTMs) with attention mechanism, we propose a hierarchical attention model… ▽ More

    Submitted 10 January, 2019; v1 submitted 13 November, 2018; originally announced November 2018.

  25. arXiv:1801.09319  [pdf

    physics.comp-ph cs.LG physics.chem-ph stat.ML

    Less is more: sampling chemical space with active learning

    Authors: Justin S. Smith, Ben Nebgen, Nicholas Lubbers, Olexandr Isayev, Adrian E. Roitberg

    Abstract: The development of accurate and transferable machine learning (ML) potentials for predicting molecular energetics is a challenging task. The process of data generation to train such ML potentials is a task neither well understood nor researched in detail. In this work, we present a fully automated approach for the generation of datasets with the intent of training universal ML potentials. It is ba… ▽ More

    Submitted 9 April, 2018; v1 submitted 28 January, 2018; originally announced January 2018.

    Comments: Accepted at J. Chem. Phys

    Journal ref: J. Chem. Phys. 148, 241733 (2018)

  26. arXiv:1801.05132  [pdf, other

    cs.RO

    Learning to Navigate: Exploiting Deep Networks to Inform Sample-Based Planning During Vision-Based Navigation

    Authors: Justin S. Smith, Jin-Ha Hwang, Fu-Jen Chu, Patricio A. Vela

    Abstract: Recent applications of deep learning to navigation have generated end-to-end navigation solutions whereby visual sensor input is mapped to control signals or to motion primitives. The resulting visual navigation strategies work very well at collision avoidance and have performance that matches traditional reactive navigation algorithms while operating in real-time. It is accepted that these soluti… ▽ More

    Submitted 16 January, 2018; originally announced January 2018.

    Comments: 7 pages, 6 figures

  27. arXiv:1708.07590  [pdf, other

    cs.CV

    Hierarchical Multi-scale Attention Networks for Action Recognition

    Authors: Shiyang Yan, Jeremy S. Smith, Wenjin Lu, Bailing Zhang

    Abstract: Recurrent Neural Networks (RNNs) have been widely used in natural language processing and computer vision. Among them, the Hierarchical Multi-scale RNN (HM-RNN), a kind of multi-scale hierarchical RNN proposed recently, can learn the hierarchical temporal structure from data automatically. In this paper, we extend the work to solve the computer vision task of action recognition. However, in sequen… ▽ More

    Submitted 28 August, 2017; v1 submitted 24 August, 2017; originally announced August 2017.

  28. arXiv:1708.04987  [pdf

    physics.chem-ph cs.LG physics.data-an

    ANI-1: A data set of 20M off-equilibrium DFT calculations for organic molecules

    Authors: Justin S. Smith, Olexandr Isayev, Adrian E. Roitberg

    Abstract: One of the grand challenges in modern theoretical chemistry is designing and implementing approximations that expedite ab initio methods without loss of accuracy. Machine learning (ML), in particular neural networks, are emerging as a powerful approach to constructing various forms of transferable atomistic potentials. They have been successfully applied in a variety of applications in chemistry,… ▽ More

    Submitted 12 December, 2017; v1 submitted 16 August, 2017; originally announced August 2017.

    Journal ref: Scientific Data 4, Article number: 170193 (2017)

  29. arXiv:1707.07411  [pdf

    cs.CV

    Traffic scene recognition based on deep cnn and vlad spatial pyramids

    Authors: Fang-Yu Wu, Shi-Yang Yan, Jeremy S. Smith, Bai-Ling Zhang

    Abstract: Traffic scene recognition is an important and challenging issue in Intelligent Transportation Systems (ITS). Recently, Convolutional Neural Network (CNN) models have achieved great success in many applications, including scene classification. The remarkable representational learning capability of CNN remains to be further explored for solving real-world problems. Vector of Locally Aggregated Descr… ▽ More

    Submitted 24 July, 2017; originally announced July 2017.

    Comments: 6 pages,4 figures, 2017 9th International Conference on Machine Learning and Computing (ICMLC 2017)

  30. arXiv:1705.03146  [pdf, other

    cs.CV

    CHAM: action recognition using convolutional hierarchical attention model

    Authors: Shiyang Yan, Jeremy S. Smith, Wenjin Lu, Bailing Zhang

    Abstract: Recently, the soft attention mechanism, which was originally proposed in language processing, has been applied in computer vision tasks like image captioning. This paper presents improvements to the soft attention model by combining a convolutional LSTM with a hierarchical system architecture to recognize action categories in videos. We call this model the Convolutional Hierarchical Attention Mode… ▽ More

    Submitted 19 May, 2017; v1 submitted 8 May, 2017; originally announced May 2017.

    Comments: accepted by ICIP2017