Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 68 results for author: Qin, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.02813  [pdf, other

    cs.CV cs.AI cs.LG

    Data Overfitting for On-Device Super-Resolution with Dynamic Algorithm and Compiler Co-Design

    Authors: Gen Li, Zhihao Shu, Jie Ji, Minghai Qin, Fatemeh Afghah, Wei Niu, Xiaolong Ma

    Abstract: Deep neural networks (DNNs) are frequently employed in a variety of computer vision applications. Nowadays, an emerging trend in the current video distribution system is to take advantage of DNN's overfitting properties to perform video resolution upscaling. By splitting videos into chunks and applying a super-resolution (SR) model to overfit each chunk, this scheme of SR models plus video chunks… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: ECCV2024

  2. arXiv:2406.14537  [pdf, other

    cs.LG q-fin.TR

    MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading

    Authors: Chuqiao Zong, Chaojie Wang, Molei Qin, Lei Feng, Xinrun Wang, Bo An

    Abstract: High-frequency trading (HFT) that executes algorithmic trading in short time scales, has recently occupied the majority of cryptocurrency market. Besides traditional quantitative trading methods, reinforcement learning (RL) has become another appealing approach for HFT due to its terrific ability of handling high-dimensional financial data and solving sophisticated sequential decision-making probl… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted to KDD 2024

  3. arXiv:2406.09098  [pdf, other

    cs.CL

    SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models

    Authors: Kehua Feng, Keyan Ding, Weijie Wang, Xiang Zhuang, Zeyuan Wang, Ming Qin, Yu Zhao, Jianhua Yao, Qiang Zhang, Huajun Chen

    Abstract: The burgeoning utilization of Large Language Models (LLMs) in scientific research necessitates advanced benchmarks capable of evaluating their understanding and application of scientific knowledge comprehensively. To address this need, we introduce the SciKnowEval benchmark, a novel framework that systematically evaluates LLMs across five progressive levels of scientific knowledge: studying extens… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 48 pages, 2 figures

  4. arXiv:2405.20277  [pdf, other

    cs.SI

    Pre-train and Refine: Towards Higher Efficiency in K-Agnostic Community Detection without Quality Degradation

    Authors: Meng Qin, Chaorui Zhang, Yu Gao, Weixi Zhang, Dit-Yan Yeung

    Abstract: Community detection (CD) is a classic graph inference task that partitions nodes of a graph into densely connected groups. While many CD methods have been proposed with either impressive quality or efficiency, balancing the two aspects remains a challenge. This study explores the potential of deep graph learning to achieve a better trade-off between the quality and efficiency of K-agnostic CD, whe… ▽ More

    Submitted 7 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted by ACM KDD 2024

  5. arXiv:2405.15125  [pdf, other

    cs.CV

    HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting

    Authors: Yuanhao Cai, Zihao Xiao, Yixun Liang, Minghan Qin, Yulun Zhang, Xiaokang Yang, Yaoyao Liu, Alan Yuille

    Abstract: High dynamic range (HDR) novel view synthesis (NVS) aims to create photorealistic images from novel viewpoints using HDR imaging techniques. The rendered HDR images capture a wider range of brightness levels containing more details of the scene than normal low dynamic range (LDR) images. Existing HDR NVS methods are mainly based on NeRF. They suffer from long training time and slow inference speed… ▽ More

    Submitted 27 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: The first 3D Gaussian Splatting-based method for HDR imaging

  6. arXiv:2405.14192  [pdf, other

    cs.CV

    IB-AdCSCNet:Adaptive Convolutional Sparse Coding Network Driven by Information Bottleneck

    Authors: He Zou, Meng'en Qin, Yu Song, Xiaohui Yang

    Abstract: In the realm of neural network models, the perpetual challenge remains in retaining task-relevant information while effectively discarding redundant data during propagation. In this paper, we introduce IB-AdCSCNet, a deep learning model grounded in information bottleneck theory. IB-AdCSCNet seamlessly integrates the information bottleneck trade-off strategy into deep networks by dynamically adjust… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  7. arXiv:2405.08423  [pdf, other

    eess.IV cs.CV

    NAFRSSR: a Lightweight Recursive Network for Efficient Stereo Image Super-Resolution

    Authors: Yihong Chen, Zhen Fan, Shuai Dong, Zhiwei Chen, Wenjie Li, Minghui Qin, Min Zeng, Xubing Lu, Guofu Zhou, Xingsen Gao, Jun-Ming Liu

    Abstract: Stereo image super-resolution (SR) refers to the reconstruction of a high-resolution (HR) image from a pair of low-resolution (LR) images as typically captured by a dual-camera device. To enhance the quality of SR images, most previous studies focused on increasing the number and size of feature maps and introducing complex and computationally intensive structures, resulting in models with high co… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  8. arXiv:2405.02861  [pdf, other

    cs.CL cs.AI cs.LG

    Revisiting a Pain in the Neck: Semantic Phrase Processing Benchmark for Language Models

    Authors: Yang Liu, Melissa Xiaohui Qin, Hongming Li, Chao Huang

    Abstract: We introduce LexBench, a comprehensive evaluation suite enabled to test language models (LMs) on ten semantic phrase processing tasks. Unlike prior studies, it is the first work to propose a framework from the comparative perspective to model the general semantic phrase (i.e., lexical collocation) and three fine-grained semantic phrases, including idiomatic expression, noun compound, and verbal co… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: 24 pages, 17 figures, 10 tables

    MSC Class: 68T50 ACM Class: I.2.7

  9. Deep Reinforcement Learning Based Toolpath Generation for Thermal Uniformity in Laser Powder Bed Fusion Process

    Authors: Mian Qin, Junhao Ding, Shuo Qu, Xu Song, Charlie C. L. Wang, Wei-Hsin Liao

    Abstract: Laser powder bed fusion (LPBF) is a widely used metal additive manufacturing technology. However, the accumulation of internal residual stress during printing can cause significant distortion and potential failure. Although various scan patterns have been studied to reduce possible accumulated stress, such as zigzag scanning vectors with changing directions or a chessboard-based scan pattern with… ▽ More

    Submitted 16 February, 2024; originally announced April 2024.

    Journal ref: Additive Manufacturing, vol.79, 103937 (12 pages), January 2024

  10. arXiv:2403.15704  [pdf, other

    cs.CV

    Gaussian in the Wild: 3D Gaussian Splatting for Unconstrained Image Collections

    Authors: Dongbin Zhang, Chuming Wang, Weitao Wang, Peihao Li, Minghan Qin, Haoqian Wang

    Abstract: Novel view synthesis from unconstrained in-the-wild images remains a meaningful but challenging task. The photometric variation and transient occluders in those unconstrained images make it difficult to reconstruct the original scene accurately. Previous approaches tackle the problem by introducing a global appearance feature in Neural Radiance Fields (NeRF). However, in the real world, the unique… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 14 pages, 5 figures

  11. arXiv:2403.03186  [pdf, other

    cs.AI

    Cradle: Empowering Foundation Agents Towards General Computer Control

    Authors: Weihao Tan, Wentao Zhang, Xinrun Xu, Haochong Xia, Ziluo Ding, Boyu Li, Bohan Zhou, Junpeng Yue, Jiechuan Jiang, Yewen Li, Ruyi An, Molei Qin, Chuqiao Zong, Longtao Zheng, Yujie Wu, Xiaoqiang Chai, Yifei Bi, Tianbao Xie, Pengjie Gu, Xiyun Li, Ceyao Zhang, Long Tian, Chaojie Wang, Xinrun Wang, Börje F. Karlsson , et al. (3 additional authors not shown)

    Abstract: Despite the success in specific scenarios, existing foundation agents still struggle to generalize across various virtual scenarios, mainly due to the dramatically different encapsulations of environments with manually designed observation and action spaces. To handle this issue, we propose the General Computer Control (GCC) setting to restrict foundation agents to interact with software through t… ▽ More

    Submitted 2 July, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  12. arXiv:2402.18892  [pdf, other

    cs.CV cs.RO

    Aligning Knowledge Graph with Visual Perception for Object-goal Navigation

    Authors: Nuo Xu, Wen Wang, Rong Yang, Mengjie Qin, Zheyuan Lin, Wei Song, Chunlong Zhang, Jason Gu, Chao Li

    Abstract: Object-goal navigation is a challenging task that requires guiding an agent to specific objects based on first-person visual observations. The ability of agent to comprehend its surroundings plays a crucial role in achieving successful object finding. However, existing knowledge-graph-based navigators often rely on discrete categorical one-hot vectors and vote counting strategy to construct graph… ▽ More

    Submitted 25 April, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted to ICRA 2024

  13. arXiv:2402.18485  [pdf, other

    q-fin.TR cs.AI

    A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist

    Authors: Wentao Zhang, Lingxuan Zhao, Haochong Xia, Shuo Sun, Jiaze Sun, Molei Qin, Xinyi Li, Yuqing Zhao, Yilei Zhao, Xinyu Cai, Longtao Zheng, Xinrun Wang, Bo An

    Abstract: Financial trading is a crucial component of the markets, informed by a multimodal information landscape encompassing news, prices, and Kline charts, and encompasses diverse tasks such as quantitative trading and high-frequency trading with various assets. While advanced AI techniques like deep learning and reinforcement learning are extensively utilized in finance, their application in financial t… ▽ More

    Submitted 28 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  14. arXiv:2401.14656  [pdf, other

    cs.CL

    Scientific Large Language Models: A Survey on Biological & Chemical Domains

    Authors: Qiang Zhang, Keyang Ding, Tianwen Lyv, Xinda Wang, Qingyu Yin, Yiwen Zhang, Jing Yu, Yuhao Wang, Xiaotong Li, Zhuoyi Xiang, Xiang Zhuang, Zeyuan Wang, Ming Qin, Mengyao Zhang, Jinlu Zhang, Jiyu Cui, Renjun Xu, Hongyang Chen, Xiaohui Fan, Huabin Xing, Huajun Chen

    Abstract: Large Language Models (LLMs) have emerged as a transformative power in enhancing natural language comprehension, representing a significant stride toward artificial general intelligence. The application of LLMs extends beyond conventional linguistic boundaries, encompassing specialized linguistic systems developed within various scientific disciplines. This growing interest has led to the advent o… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

  15. How Are Paid and Volunteer Open Source Developers Different? A Study of the Rust Project

    Authors: Yuxia Zhang, Mian Qin, Klaas-Jan Stol, Minghui Zhou, Hui Liu

    Abstract: It is now commonplace for organizations to pay developers to work on specific open source software (OSS) projects to pursue their business goals. Such paid developers work alongside voluntary contributors, but given the different motivations of these two groups of developers, conflict may arise, which may pose a threat to a project's sustainability. This paper presents an empirical study of paid d… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  16. arXiv:2401.07410  [pdf, other

    cs.SI cs.CR

    Multi-Task DNS Security Analysis via High-Order Heterogeneous Graph Embedding

    Authors: Meng Qin

    Abstract: DNS is an essential Internet infrastructure to support network applications and services, but is also a significant tool exploited by various cyberattacks. Existing DNS security analysis techniques mostly focus on one specific task associated with one single entity (e.g., domain) via conventional feature engineering. They rely heavily on the labor-intensive feature selection and largely ignore the… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

  17. arXiv:2401.03444  [pdf, other

    cs.NI cs.SI

    Towards a Unified Method for Network Dynamic via Adversarial Weighted Link Prediction

    Authors: Meng Qin

    Abstract: Network dynamic (e.g., traffic burst in data center networks and channel fading in cellular WiFi networks) has a great impact on the performance of communication networks (e.g., throughput, capacity, delay, and jitter). This article proposes a unified prediction-based method to handle the dynamic of various network systems. From the view of graph deep learning, I generally formulate the dynamic pr… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

  18. arXiv:2401.00651  [pdf, other

    cs.SI

    IRWE: Inductive Random Walk for Joint Inference of Identity and Position Network Embedding

    Authors: Meng Qin, Dit-Yan Yeung

    Abstract: Network embedding, which maps graphs to distributed representations, is a unified framework for various graph inference tasks. According to the topology properties (e.g., structural roles and community memberships of nodes) to be preserved, it can be categorized into the identity and position embedding. However, existing methods can only capture one type of property. Some approaches can support th… ▽ More

    Submitted 12 May, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

  19. arXiv:2312.16084  [pdf, other

    cs.CV

    LangSplat: 3D Language Gaussian Splatting

    Authors: Minghan Qin, Wanhua Li, Jiawei Zhou, Haoqian Wang, Hanspeter Pfister

    Abstract: Humans live in a 3D world and commonly use natural language to interact with a 3D scene. Modeling a 3D language field to support open-ended language queries in 3D has gained increasing attention recently. This paper introduces LangSplat, which constructs a 3D language field that enables precise and efficient open-vocabulary querying within 3D spaces. Unlike existing methods that ground CLIP langua… ▽ More

    Submitted 31 March, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: CVPR 2024. Project Page: https://langsplat.github.io

  20. arXiv:2312.08554  [pdf, other

    cs.RO cs.MA

    Adaptive Robot Coordination: A Subproblem-based Approach for Hybrid Multi-Robot Motion Planning

    Authors: Irving Solis, James Motes, Mike Qin, Marco Morales, Nancy M. Amato

    Abstract: This work presents Adaptive Robot Coordination (ARC), a novel hybrid framework for multi-robot motion planning (MRMP) that employs local subproblems to resolve inter-robot conflicts. ARC creates subproblems centered around conflicts, and the solutions represent the robot motions required to resolve these conflicts. The use of subproblems enables an inexpensive hybrid exploration of the multi-robot… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: This work has been submitted for review

  21. arXiv:2312.04360  [pdf, other

    quant-ph cs.CC

    The Computational Advantage of MIP* Vanishes in the Presence of Noise

    Authors: Yangjing Dong, Honghao Fu, Anand Natarajan, Minglong Qin, Haochen Xu, Penghui Yao

    Abstract: Quantum multiprover interactive proof systems with entanglement MIP* are much more powerful than their classical counterpart MIP (Babai et al. '91, Ji et al. '20): while MIP = NEXP, the quantum class MIP* is equal to RE, a class including the halting problem. This is because the provers in MIP* can share unbounded quantum entanglement. However, recent works of Qin and Yao '21 and '23 have shown th… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: Comments are welcome!

  22. arXiv:2312.01560  [pdf, ps, other

    cs.SI

    RaftGP: Random Fast Graph Partitioning

    Authors: Yu Gao, Meng Qin, Yibin Ding, Li Zeng, Chaorui Zhang, Weixi Zhang, Wei Han, Rongqian Zhao, Bo Bai

    Abstract: Graph partitioning (GP), a.k.a. community detection, is a classic problem that divides the node set of a graph into densely-connected blocks. Following prior work on the IEEE HPEC Graph Challenge benchmark and recent advances in graph machine learning, we propose a novel RAndom FasT Graph Partitioning (RaftGP) method based on an efficient graph embedding scheme. It uses the Gaussian random project… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

  23. arXiv:2311.16482  [pdf, other

    cs.CV cs.GR

    Animatable 3D Gaussian: Fast and High-Quality Reconstruction of Multiple Human Avatars

    Authors: Yang Liu, Xiang Huang, Minghan Qin, Qinwei Lin, Haoqian Wang

    Abstract: Neural radiance fields are capable of reconstructing high-quality drivable human avatars but are expensive to train and render. To reduce consumption, we propose Animatable 3D Gaussian, which learns human avatars from input images and poses. We extend 3D Gaussians to dynamic human scenes by modeling a set of skinned 3D Gaussians and a corresponding skeleton in canonical space and deforming 3D Gaus… ▽ More

    Submitted 29 November, 2023; v1 submitted 27 November, 2023; originally announced November 2023.

  24. arXiv:2311.10840  [pdf

    cs.AI

    Integration and Implementation Strategies for AI Algorithm Deployment with Smart Routing Rules and Workflow Management

    Authors: Barbaros Selnur Erdal, Vikash Gupta, Mutlu Demirer, Kim H. Fair, Richard D. White, Jeff Blair, Barbara Deichert, Laurie Lafleur, Ming Melvin Qin, David Bericat, Brad Genereaux

    Abstract: This paper reviews the challenges hindering the widespread adoption of artificial intelligence (AI) solutions in the healthcare industry, focusing on computer vision applications for medical imaging, and how interoperability and enterprise-grade scalability can be used to address these challenges. The complex nature of healthcare workflows, intricacies in managing large and secure medical imaging… ▽ More

    Submitted 21 November, 2023; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: 13 pages, 6 figures

    ACM Class: I.2.m

  25. arXiv:2310.07683  [pdf, other

    cs.LG cs.AI

    Controllable Data Generation Via Iterative Data-Property Mutual Mappings

    Authors: Bo Pan, Muran Qin, Shiyu Wang, Yifei Zhang, Liang Zhao

    Abstract: Deep generative models have been widely used for their ability to generate realistic data samples in various areas, such as images, molecules, text, and speech. One major goal of data generation is controllability, namely to generate new data with desired properties. Despite growing interest in the area of controllable generation, significant challenges still remain, including 1) disentangling des… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  26. arXiv:2310.06275  [pdf, other

    cs.CV

    High-Fidelity 3D Head Avatars Reconstruction through Spatially-Varying Expression Conditioned Neural Radiance Field

    Authors: Minghan Qin, Yifan Liu, Yuelang Xu, Xiaochen Zhao, Yebin Liu, Haoqian Wang

    Abstract: One crucial aspect of 3D head avatar reconstruction lies in the details of facial expressions. Although recent NeRF-based photo-realistic 3D head avatar methods achieve high-quality avatar rendering, they still encounter challenges retaining intricate facial expression details because they overlook the potential of specific expression variations at different spatial positions when conditioning the… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 9 pages, 5 figures

  27. CCAE: A Corpus of Chinese-based Asian Englishes

    Authors: Yang Liu, Melissa Xiaohui Qin, Long Wang, Chao Huang

    Abstract: Language models have been foundations in various scenarios of NLP applications, but it has not been well applied in language variety studies, even for the most popular language like English. This paper represents one of the few initial efforts to utilize the NLP technology in the paradigm of World Englishes, specifically in creating a multi-variety corpus for studying Asian Englishes. We present a… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: NLPCC'2023 (12 pages, 3 figures, 4 charts)

    MSC Class: 68T50 ACM Class: I.2.7

  28. arXiv:2310.03269  [pdf, other

    q-bio.BM cs.CL

    InstructProtein: Aligning Human and Protein Language via Knowledge Instruction

    Authors: Zeyuan Wang, Qiang Zhang, Keyan Ding, Ming Qin, Xiang Zhuang, Xiaotong Li, Huajun Chen

    Abstract: Large Language Models (LLMs) have revolutionized the field of natural language processing, but they fall short in comprehending biological sequences such as proteins. To address this challenge, we propose InstructProtein, an innovative LLM that possesses bidirectional generation capabilities in both human and protein languages: (i) taking a protein sequence as input to predict its textual function… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  29. arXiv:2309.08941  [pdf, ps, other

    quant-ph cs.CC cs.CR

    Quantum Pseudorandom Scramblers

    Authors: Chuhan Lu, Minglong Qin, Fang Song, Penghui Yao, Mingnan Zhao

    Abstract: Quantum pseudorandom state generators (PRSGs) have stimulated exciting developments in recent years. A PRSG, on a fixed initial (e.g., all-zero) state, produces an output state that is computationally indistinguishable from a Haar random state. However, pseudorandomness of the output state is not guaranteed on other initial states. In fact, known PRSG constructions provably fail on some initial st… ▽ More

    Submitted 16 September, 2023; originally announced September 2023.

  30. BotanicGarden: A High-Quality Dataset for Robot Navigation in Unstructured Natural Environments

    Authors: Yuanzhi Liu, Yujia Fu, Minghui Qin, Yufeng Xu, Baoxin Xu, Fengdong Chen, Bart Goossens, Poly Z. H. Sun, Hongwei Yu, Chun Liu, Long Chen, Wei Tao, Hui Zhao

    Abstract: The rapid developments of mobile robotics and autonomous navigation over the years are largely empowered by public datasets for testing and upgrading, such as sensor odometry and SLAM tasks. Impressive demos and benchmark scores have arisen, which may suggest the maturity of existing navigation techniques. However, these results are primarily based on moderate structured scenario testing. When tra… ▽ More

    Submitted 2 March, 2024; v1 submitted 25 June, 2023; originally announced June 2023.

    Comments: This article has been accepted for publication in IEEE Robotics and Automation Letters

  31. arXiv:2305.09089  [pdf, other

    cs.SI

    Adaptive Network Embedding with Arbitrary Multiple Information Sources in Attributed Graphs

    Authors: Meng Qin

    Abstract: Graph representation learning (a.k.a. network embedding) is a significant topic of network analysis, due to its effectiveness to support various graph inference tasks. In this paper, we study the representation learning with multiple information sources in attributed graphs. Recent studies usually focus on several specific sources (e.g., high-order proximity and node attributes) but few of them ca… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  32. arXiv:2305.06531  [pdf, other

    cs.SI cs.LG

    Semantic Random Walk for Graph Representation Learning in Attributed Graphs

    Authors: Meng Qin

    Abstract: In this study, we focus on the graph representation learning (a.k.a. network embedding) in attributed graphs. Different from existing embedding methods that treat the incorporation of graph structure and semantic as the simple combination of two optimization objectives, we propose a novel semantic graph representation (SGR) method to formulate the joint optimization of the two heterogeneous source… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

  33. arXiv:2303.08331  [pdf, other

    cs.CV cs.LG cs.NE eess.IV

    Towards High-Quality and Efficient Video Super-Resolution via Spatial-Temporal Data Overfitting

    Authors: Gen Li, Jie Ji, Minghai Qin, Wei Niu, Bin Ren, Fatemeh Afghah, Linke Guo, Xiaolong Ma

    Abstract: As deep convolutional neural networks (DNNs) are widely used in various fields of computer vision, leveraging the overfitting ability of the DNN to achieve video resolution upscaling has become a new trend in the modern video delivery system. By dividing videos into chunks and overfitting each chunk with a super-resolution model, the server encodes videos before transmitting them to the clients, t… ▽ More

    Submitted 18 June, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: CVPR 2023 Highlight Paper

  34. arXiv:2302.11180  [pdf, ps, other

    cs.CV

    DISCO: Distributed Inference with Sparse Communications

    Authors: Minghai Qin, Chao Sun, Jaco Hofmann, Dejan Vucinic

    Abstract: Deep neural networks (DNNs) have great potential to solve many real-world problems, but they usually require an extensive amount of computation and memory. It is of great difficulty to deploy a large DNN model to a single resource-limited device with small memory capacity. Distributed computing is a common approach to reduce single-node memory consumption and to accelerate the inference of DNN mod… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

  35. arXiv:2302.00586  [pdf, other

    q-fin.TR cs.AI cs.LG

    PRUDEX-Compass: Towards Systematic Evaluation of Reinforcement Learning in Financial Markets

    Authors: Shuo Sun, Molei Qin, Xinrun Wang, Bo An

    Abstract: The financial markets, which involve more than $90 trillion market capitals, attract the attention of innumerable investors around the world. Recently, reinforcement learning in financial markets (FinRL) has emerged as a promising direction to train agents for making profitable investment decisions. However, the evaluation of most FinRL methods only focuses on profit-related measures and ignores m… ▽ More

    Submitted 2 March, 2023; v1 submitted 14 January, 2023; originally announced February 2023.

  36. arXiv:2212.14177  [pdf, other

    cs.AI cs.CY eess.IV

    Current State of Community-Driven Radiological AI Deployment in Medical Imaging

    Authors: Vikash Gupta, Barbaros Selnur Erdal, Carolina Ramirez, Ralf Floca, Laurence Jackson, Brad Genereaux, Sidney Bryson, Christopher P Bridge, Jens Kleesiek, Felix Nensa, Rickmer Braren, Khaled Younis, Tobias Penzkofer, Andreas Michael Bucher, Ming Melvin Qin, Gigon Bae, Hyeonhoon Lee, M. Jorge Cardoso, Sebastien Ourselin, Eric Kerfoot, Rahul Choudhury, Richard D. White, Tessa Cook, David Bericat, Matthew Lungren , et al. (2 additional authors not shown)

    Abstract: Artificial Intelligence (AI) has become commonplace to solve routine everyday tasks. Because of the exponential growth in medical imaging data volume and complexity, the workload on radiologists is steadily increasing. We project that the gap between the number of imaging exams and the number of expert radiologist readers required to cover this increase will continue to expand, consequently introd… ▽ More

    Submitted 8 May, 2023; v1 submitted 29 December, 2022; originally announced December 2022.

    Comments: 21 pages; 5 figures

    MSC Class: eess.IV

  37. arXiv:2212.05122  [pdf, other

    cs.LG cs.AI cs.CV

    All-in-One: A Highly Representative DNN Pruning Framework for Edge Devices with Dynamic Power Management

    Authors: Yifan Gong, Zheng Zhan, Pu Zhao, Yushu Wu, Chao Wu, Caiwen Ding, Weiwen Jiang, Minghai Qin, Yanzhi Wang

    Abstract: During the deployment of deep neural networks (DNNs) on edge devices, many research efforts are devoted to the limited hardware resource. However, little attention is paid to the influence of dynamic power management. As edge devices typically only have a budget of energy with batteries (rather than almost unlimited energy support on servers or workstations), their dynamic power management often c… ▽ More

    Submitted 9 December, 2022; originally announced December 2022.

  38. arXiv:2211.12005  [pdf, other

    cs.LG cs.CR stat.ML

    Self-Ensemble Protection: Training Checkpoints Are Good Data Protectors

    Authors: Sizhe Chen, Geng Yuan, Xinwen Cheng, Yifan Gong, Minghai Qin, Yanzhi Wang, Xiaolin Huang

    Abstract: As data becomes increasingly vital, a company would be very cautious about releasing data, because the competitors could use it to train high-performance models, thereby posing a tremendous threat to the company's commercial competence. To prevent training good models on the data, we could add imperceptible perturbations to it. Since such perturbations aim at hurting the entire training process, t… ▽ More

    Submitted 12 April, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: ICLR 2023

  39. arXiv:2211.10801  [pdf, other

    cs.CV cs.AI cs.LG

    Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training

    Authors: Zhenglun Kong, Haoyu Ma, Geng Yuan, Mengshu Sun, Yanyue Xie, Peiyan Dong, Xin Meng, Xuan Shen, Hao Tang, Minghai Qin, Tianlong Chen, Xiaolong Ma, Xiaohui Xie, Zhangyang Wang, Yanzhi Wang

    Abstract: Vision transformers (ViTs) have recently obtained success in many applications, but their intensive computation and heavy memory usage at both training and inference time limit their generalization. Previous compression algorithms usually start from the pre-trained dense models and only focus on efficient inference, while time-consuming training is still unavoidable. In contrast, this paper points… ▽ More

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: AAAI 2023

  40. arXiv:2211.01484  [pdf, other

    cs.CV cs.LG

    Data Level Lottery Ticket Hypothesis for Vision Transformers

    Authors: Xuan Shen, Zhenglun Kong, Minghai Qin, Peiyan Dong, Geng Yuan, Xin Meng, Hao Tang, Xiaolong Ma, Yanzhi Wang

    Abstract: The conventional lottery ticket hypothesis (LTH) claims that there exists a sparse subnetwork within a dense neural network and a proper random initialization method called the winning ticket, such that it can be trained from scratch to almost as good as the dense counterpart. Meanwhile, the research of LTH in vision transformers (ViTs) is scarcely evaluated. In this paper, we first show that the… ▽ More

    Submitted 29 May, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: Accepted by IJCAI 2023

  41. arXiv:2210.08765  [pdf, other

    cs.SI

    Temporal Link Prediction: A Unified Framework, Taxonomy, and Review

    Authors: Meng Qin, Dit-Yan Yeung

    Abstract: Dynamic graphs serve as a generic abstraction and description of the evolutionary behaviors of various complex systems (e.g., social networks and communication networks). Temporal link prediction (TLP) is a classic yet challenging inference task on dynamic graphs, which predicts possible future linkage based on historical topology. The predicted future topology can be used to support some advanced… ▽ More

    Submitted 29 June, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

  42. arXiv:2209.14825  [pdf, other

    cs.SI cs.LG

    Trading off Quality for Efficiency of Community Detection: An Inductive Method across Graphs

    Authors: Meng Qin, Chaorui Zhang, Bo Bai, Gong Zhang, Dit-Yan Yeung

    Abstract: Many network applications can be formulated as NP-hard combinatorial optimization problems of community detection (CD). Due to the NP-hardness, to balance the CD quality and efficiency remains a challenge. Most existing CD methods are transductive, which are independently optimized only for the CD on a single graph. Some of these methods use advanced machine learning techniques to obtain high-qual… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

  43. arXiv:2207.12577  [pdf, other

    cs.CV cs.AR cs.LG eess.IV

    Compiler-Aware Neural Architecture Search for On-Mobile Real-time Super-Resolution

    Authors: Yushu Wu, Yifan Gong, Pu Zhao, Yanyu Li, Zheng Zhan, Wei Niu, Hao Tang, Minghai Qin, Bin Ren, Yanzhi Wang

    Abstract: Deep learning-based super-resolution (SR) has gained tremendous popularity in recent years because of its high image quality performance and wide application scenarios. However, prior methods typically suffer from large amounts of computations and huge power consumption, causing difficulties for real-time inference, especially on resource-limited platforms such as mobile devices. To mitigate this,… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

  44. arXiv:2204.12190  [pdf, other

    cs.AI

    Multi-Agent Reinforcement Learning for Traffic Signal Control through Universal Communication Method

    Authors: Qize Jiang, Minhao Qin, Shengmin Shi, Weiwei Sun, Baihua Zheng

    Abstract: How to coordinate the communication among intersections effectively in real complex traffic scenarios with multi-intersection is challenging. Existing approaches only enable the communication in a heuristic manner without considering the content/importance of information to be shared. In this paper, we propose a universal communication form UniComm between intersections. UniComm embeds massive obs… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: IJCAI 2022

  45. arXiv:2203.15794  [pdf, other

    cs.CV

    CHEX: CHannel EXploration for CNN Model Compression

    Authors: Zejiang Hou, Minghai Qin, Fei Sun, Xiaolong Ma, Kun Yuan, Yi Xu, Yen-Kuang Chen, Rong Jin, Yuan Xie, Sun-Yuan Kung

    Abstract: Channel pruning has been broadly recognized as an effective technique to reduce the computation and memory cost of deep convolutional neural networks. However, conventional pruning methods have limitations in that: they are restricted to pruning process only, and they require a fully pre-trained large model. Such limitations may lead to sub-optimal model quality as well as excessive memory and tra… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022

  46. arXiv:2203.05016  [pdf, other

    cs.DC

    Shfl-BW: Accelerating Deep Neural Network Inference with Tensor-Core Aware Weight Pruning

    Authors: Guyue Huang, Haoran Li, Minghai Qin, Fei Sun, Yufei Ding, Yuan Xie

    Abstract: Weight pruning in deep neural networks (DNNs) can reduce storage and computation cost, but struggles to bring practical speedup to the model inference time. Tensor-cores can significantly boost the throughput of GPUs on dense computation, but exploiting tensor-cores for sparse DNNs is very challenging. Compared to existing CUDA-cores, tensor-cores require higher data reuse and matrix-shaped instru… ▽ More

    Submitted 11 March, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: To-appear in Design Automation Conference (DAC), July 2022

  47. Adaptive Read Thresholds for NAND Flash

    Authors: Borja Peleato, Rajiv Agarwal, John Cioffi, Minghai Qin, Paul H. Siegel

    Abstract: A primary source of increased read time on NAND flash comes from the fact that in the presence of noise, the flash medium must be read several times using different read threshold voltages for the decoder to succeed. This paper proposes an algorithm that uses a limited number of re-reads to characterize the noise distribution and recover the stored information. Both hard and soft decoding are cons… ▽ More

    Submitted 11 February, 2022; originally announced February 2022.

    Journal ref: IEEE Transactions on Communications ( Volume: 63, Issue: 9, Sept. 2015, Pages: 3069 - 3081)

  48. arXiv:2112.13890  [pdf, other

    cs.CV cs.AI cs.AR cs.LG

    SPViT: Enabling Faster Vision Transformers via Soft Token Pruning

    Authors: Zhenglun Kong, Peiyan Dong, Xiaolong Ma, Xin Meng, Mengshu Sun, Wei Niu, Xuan Shen, Geng Yuan, Bin Ren, Minghai Qin, Hao Tang, Yanzhi Wang

    Abstract: Recently, Vision Transformer (ViT) has continuously established new milestones in the computer vision field, while the high computation and memory cost makes its propagation in industrial production difficult. Pruning, a traditional model compression paradigm for hardware efficiency, has been widely applied in various DNN structures. Nevertheless, it stays ambiguous on how to perform exclusive pru… ▽ More

    Submitted 20 September, 2022; v1 submitted 27 December, 2021; originally announced December 2021.

    Comments: ECCV 2022

  49. arXiv:2112.10930  [pdf, other

    cs.NE cs.AI cs.LG

    Compact Multi-level Sparse Neural Networks with Input Independent Dynamic Rerouting

    Authors: Minghai Qin, Tianyun Zhang, Fei Sun, Yen-Kuang Chen, Makan Fardad, Yanzhi Wang, Yuan Xie

    Abstract: Deep neural networks (DNNs) have shown to provide superb performance in many real life applications, but their large computation cost and storage requirement have prevented them from being deployed to many edge and internet-of-things (IoT) devices. Sparse deep neural networks, whose majority weight parameters are zeros, can substantially reduce the computation complexity and memory consumption of… ▽ More

    Submitted 20 December, 2021; originally announced December 2021.

  50. arXiv:2112.10898  [pdf, other

    cs.LG cs.AI

    Load-balanced Gather-scatter Patterns for Sparse Deep Neural Networks

    Authors: Fei Sun, Minghai Qin, Tianyun Zhang, Xiaolong Ma, Haoran Li, Junwen Luo, Zihao Zhao, Yen-Kuang Chen, Yuan Xie

    Abstract: Deep neural networks (DNNs) have been proven to be effective in solving many real-life problems, but its high computation cost prohibits those models from being deployed to edge devices. Pruning, as a method to introduce zeros to model weights, has shown to be an effective method to provide good trade-offs between model accuracy and computation efficiency, and is a widely-used method to generate c… ▽ More

    Submitted 20 December, 2021; originally announced December 2021.