
Showing 1–17 of 17 results for author: Zang, H

Searching in archive cs.
  1. arXiv:2405.06263  [pdf, other]

    cs.LG cs.AI

    Learning Latent Dynamic Robust Representations for World Models

    Authors: Ruixiang Sun, Hongyu Zang, Xin Li, Riashat Islam

    Abstract: Visual Model-Based Reinforcement Learning (MBRL) promises to encapsulate agent's knowledge about the underlying dynamics of the environment, enabling learning a world model as a useful planner. However, top MBRL agents such as Dreamer often struggle with visual pixel-based inputs in the presence of exogenous or irrelevant noise in the observation space, due to failure to capture task-specific feat…

    Submitted 30 May, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

    Journal ref: ICML 2024

  2. arXiv:2403.16212  [pdf, other]

    eess.IV cs.CV cs.LG

    Leveraging Deep Learning and Xception Architecture for High-Accuracy MRI Classification in Alzheimer Diagnosis

    Authors: Shaojie Li, Haichen Qu, Xinqi Dong, Bo Dang, Hengyi Zang, Yulu Gong

    Abstract: Exploring the application of deep learning technologies in the field of medical diagnostics, Magnetic Resonance Imaging (MRI) provides a unique perspective for observing and diagnosing complex neurodegenerative diseases such as Alzheimer Disease (AD). With advancements in deep learning, particularly in Convolutional Neural Networks (CNNs) and the Xception network architecture, we are now able to a…

    Submitted 24 March, 2024; originally announced March 2024.

  3. arXiv:2403.14483  [pdf, other]

    cs.LG cs.AI q-fin.ST

    Utilizing the LightGBM Algorithm for Operator User Credit Assessment Research

    Authors: Shaojie Li, Xinqi Dong, Danqing Ma, Bo Dang, Hengyi Zang, Yulu Gong

    Abstract: Mobile Internet user credit assessment is an important way for communication operators to establish decisions and formulate measures, and it is also a guarantee for operators to obtain expected benefits. However, credit evaluation methods have long been monopolized by financial industries such as banks and credit. As supporters and providers of platform network technology and network resources, co…

    Submitted 21 March, 2024; originally announced March 2024.

  4. arXiv:2403.13703  [pdf]

    cs.CV cs.AI

    Fostc3net:A Lightweight YOLOv5 Based On the Network Structure Optimization

    Authors: Danqing Ma, Shaojie Li, Bo Dang, Hengyi Zang, Xinqi Dong

    Abstract: Transmission line detection technology is crucial for automatic monitoring and ensuring the safety of electrical facilities. The YOLOv5 series is currently one of the most advanced and widely used methods for object detection. However, it faces inherent challenges, such as high computational load on devices and insufficient detection accuracy. To address these concerns, this paper presents an enha…

    Submitted 20 March, 2024; originally announced March 2024.

  5. arXiv:2310.17139  [pdf, other]

    cs.LG

    Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning

    Authors: Hongyu Zang, Xin Li, Leiji Zhang, Yang Liu, Baigui Sun, Riashat Islam, Remi Tachet des Combes, Romain Laroche

    Abstract: While bisimulation-based approaches hold promise for learning robust state representations for Reinforcement Learning (RL) tasks, their efficacy in offline RL tasks has not been up to par. In some instances, their performance has even significantly underperformed alternative methods. We aim to understand why bisimulation methods succeed in online settings, but falter in offline tasks. Our analysis…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  6. arXiv:2310.16655  [pdf, other]

    cs.LG

    Towards Control-Centric Representations in Reinforcement Learning from Images

    Authors: Chen Liu, Hongyu Zang, Xin Li, Yong Heng, Yifei Wang, Zhen Fang, Yisen Wang, Mingzhong Wang

    Abstract: Image-based Reinforcement Learning is a practical yet challenging task. A major hurdle lies in extracting control-centric representations while disregarding irrelevant information. While approaches that follow the bisimulation principle exhibit the potential in learning state representations to address this issue, they still grapple with the limited expressive capacity of latent dynamics and the i…

    Submitted 27 October, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

  7. arXiv:2212.13835  [pdf, other]

    cs.LG

    Representation Learning in Deep RL via Discrete Information Bottleneck

    Authors: Riashat Islam, Hongyu Zang, Manan Tomar, Aniket Didolkar, Md Mofijul Islam, Samin Yeasar Arnob, Tariq Iqbal, Xin Li, Anirudh Goyal, Nicolas Heess, Alex Lamb

    Abstract: Several self-supervised representation learning methods have been proposed for reinforcement learning (RL) with rich observations. For real-world applications of RL, recovering underlying latent states is crucial, particularly when sensory inputs contain irrelevant and exogenous information. In this work, we study how information bottlenecks can be used to construct latent states efficiently in th…

    Submitted 30 May, 2023; v1 submitted 28 December, 2022; originally announced December 2022.

    Comments: AISTATS 2023

  8. arXiv:2211.00863  [pdf, other]

    cs.LG cs.AI

    Behavior Prior Representation learning for Offline Reinforcement Learning

    Authors: Hongyu Zang, Xin Li, Jie Yu, Chen Liu, Riashat Islam, Remi Tachet Des Combes, Romain Laroche

    Abstract: Offline reinforcement learning (RL) struggles in environments with rich and noisy inputs, where the agent only has access to a fixed dataset without environment interactions. Past works have proposed common workarounds based on the pre-training of state representations, followed by policy training. In this work, we introduce a simple, yet effective approach for learning state representations. Our…

    Submitted 27 February, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: ICLR 2023

  9. arXiv:2211.00247  [pdf, other]

    cs.LG cs.AI

    Discrete Factorial Representations as an Abstraction for Goal Conditioned Reinforcement Learning

    Authors: Riashat Islam, Hongyu Zang, Anirudh Goyal, Alex Lamb, Kenji Kawaguchi, Xin Li, Romain Laroche, Yoshua Bengio, Remi Tachet Des Combes

    Abstract: Goal-conditioned reinforcement learning (RL) is a promising direction for training agents that are capable of solving multiple tasks and reach a diverse set of objectives. How to specify and ground these goals in such a way that we can both reliably reach goals during training as well as generalize to new goals during evaluation remains an open area of research. Defining goals in…

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: NeurIPS 2022

  10. arXiv:2211.00164  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO

    Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information

    Authors: Riashat Islam, Manan Tomar, Alex Lamb, Yonathan Efroni, Hongyu Zang, Aniket Didolkar, Dipendra Misra, Xin Li, Harm van Seijen, Remi Tachet des Combes, John Langford

    Abstract: Learning to control an agent from data collected offline in a rich pixel-based visual observation space is vital for real-world applications of reinforcement learning (RL). A major challenge in this setting is the presence of input information that is hard to model and irrelevant to controlling the agent. This problem has been approached by the theoretical RL community through the lens of exogenou…

    Submitted 13 August, 2023; v1 submitted 31 October, 2022; originally announced November 2022.

    Comments: ICML 2023

  11. arXiv:2112.15303  [pdf, other]

    cs.LG cs.AI

    SimSR: Simple Distance-based State Representation for Deep Reinforcement Learning

    Authors: Hongyu Zang, Xin Li, Mingzhong Wang

    Abstract: This work explores how to learn robust and generalizable state representation from image-based observations with deep reinforcement learning methods. Addressing the computational complexity, stringent assumptions and representation collapse challenges in existing work of bisimulation metric, we devise Simple State Representation (SimSR) operator. SimSR enables us to design a stochastic approximati…

    Submitted 27 February, 2022; v1 submitted 30 December, 2021; originally announced December 2021.

    Comments: Accepted by AAAI 2022, Preprint version with Appendix

  12. arXiv:2011.04050  [pdf, other]

    cs.LG cs.DC

    Adaptive Federated Dropout: Improving Communication Efficiency and Generalization for Federated Learning

    Authors: Nader Bouacida, Jiahui Hou, Hui Zang, Xin Liu

    Abstract: With more regulations tackling users' privacy-sensitive data protection in recent years, access to such data has become increasingly restricted and controversial. To exploit the wealth of data generated and located at distributed entities such as mobile phones, a revolutionary decentralized machine learning setting, known as Federated Learning, enables multiple clients located at different geograp…

    Submitted 8 November, 2020; originally announced November 2020.

  13. arXiv:1909.10616  [pdf, other]

    cs.LG cs.NE stat.ML

    Compiler-Level Matrix Multiplication Optimization for Deep Learning

    Authors: Huaqing Zhang, Xiaolin Cheng, Hui Zang, Dae Hoon Park

    Abstract: An important linear algebra routine, GEneral Matrix Multiplication (GEMM), is a fundamental operator in deep learning. Compilers need to translate these routines into low-level code optimized for specific hardware. Compiler-level optimization of GEMM has significant performance impact on training and executing deep learning models. However, most deep learning frameworks rely on hardware-specific o…

    Submitted 23 September, 2019; originally announced September 2019.

  14. arXiv:1909.10413  [pdf, other]

    cs.CL

    Automated Chess Commentator Powered by Neural Chess Engine

    Authors: Hongyu Zang, Zhiwei Yu, Xiaojun Wan

    Abstract: In this paper, we explore a new approach for automated chess commentary generation, which aims to generate chess commentary texts in different categories (e.g., description, comparison, planning, etc.). We introduce a neural chess engine into text generation models to help with encoding boards, predicting moves, and analyzing situations. By jointly training the neural chess engine and the generati…

    Submitted 23 September, 2019; originally announced September 2019.

    Comments: The first two authors contributed equally to this paper

  15. arXiv:1906.00584  [pdf, other]

    cs.CL

    A Semi-Supervised Approach for Low-Resourced Text Generation

    Authors: Hongyu Zang, Xiaojun Wan

    Abstract: Recently, encoder-decoder neural models have achieved great success on text generation tasks. However, one problem of this kind of models is that their performances are usually limited by the scale of well-labeled data, which are very expensive to get. The low-resource (of labeled data) problem is quite common in different task generation tasks, but unlabeled data are usually abundant. In this pap…

    Submitted 3 June, 2019; originally announced June 2019.

    Comments: Finished in 2017, a foundation work for "Massive Styles Transfer with Limited Labeled Data"

  16. arXiv:1906.00580  [pdf, other]

    cs.CL cs.AI cs.MA

    Massive Styles Transfer with Limited Labeled Data

    Authors: Hongyu Zang, Xiaojun Wan

    Abstract: Language style transfer has attracted more and more attention in the past few years. Recent researches focus on improving neural models targeting at transferring from one style to the other with labeled data. However, transferring across multiple styles is often very useful in real-life applications. Previous researches of language style transfer have two main deficiencies: dependency on massive l…

    Submitted 3 June, 2019; originally announced June 2019.

  17. arXiv:1810.05552  [pdf, other]

    cs.CV

    Effects of Image Degradations to CNN-based Image Classification

    Authors: Yanting Pei, Yaping Huang, Qi Zou, Hao Zang, Xingyuan Zhang, Song Wang

    Abstract: Just like many other topics in computer vision, image classification has achieved significant progress recently by using deep-learning neural networks, especially the Convolutional Neural Networks (CNN). Most of the existing works are focused on classifying very clear natural images, evidenced by the widely used image databases such as Caltech-256, PASCAL VOCs and ImageNet. However, in many real a…

    Submitted 12 October, 2018; originally announced October 2018.