
Showing 1–50 of 75 results for author: Hao, M

Searching in archive cs.

  1. arXiv:2406.19531  [pdf, other]

    stat.ML cs.LG

    Forward and Backward State Abstractions for Off-policy Evaluation

    Authors: Meiling Hao, Pingfan Su, Liyuan Hu, Zoltan Szabo, Qingyuan Zhao, Chengchun Shi

    Abstract: Off-policy evaluation (OPE) is crucial for evaluating a target policy's impact offline before its deployment. However, achieving accurate OPE in large state spaces remains challenging. This paper studies state abstractions, originally designed for policy learning, in the context of OPE. Our contributions are three-fold: (i) We define a set of irrelevance conditions central to learning state abstracti…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 42 pages, 5 figures

    ACM Class: G.3; I.2.6; G.1.2

  2. arXiv:2406.14844  [pdf, other]

    cs.LG cs.AI

    DN-CL: Deep Symbolic Regression against Noise via Contrastive Learning

    Authors: Jingyi Liu, Yanjie Li, Lina Yu, Min Wu, Weijun Li, Wenqiang Li, Meilan Hao, Yusong Deng, Shu Wei

    Abstract: Noise ubiquitously exists in signals due to numerous factors including physical, electronic, and environmental effects. Traditional methods of symbolic regression, such as genetic programming or deep learning models, aim to find the most fitting expressions for these signals. However, these methods often overlook the noise present in real-world data, leading to reduced fitting accuracy. To tackle…

    Submitted 20 June, 2024; originally announced June 2024.

  3. arXiv:2406.11208  [pdf]

    cs.NI

    Privacy-preserving Pseudonym Schemes for Personalized 3D Avatars in Mobile Social Metaverses

    Authors: Cheng Su, Xiaofeng Luo, Zhenmou Liu, Jiawen Kang, Min Hao, Zehui Xiong, Zhaohui Yang, Chongwen Huang

    Abstract: The emergence of mobile social metaverses, a novel paradigm bridging physical and virtual realms, has led to the widespread adoption of avatars as digital representations for Social Metaverse Users (SMUs) within virtual spaces. Equipped with immersive devices, SMUs leverage Edge Servers (ESs) to deploy their avatars and engage with other SMUs in virtual spaces. To enhance immersion, SMUs incline t…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 6 pages, 4 figures

  4. arXiv:2406.05874  [pdf, other]

    cs.CR

    Stealthy Targeted Backdoor Attacks against Image Captioning

    Authors: Wenshu Fan, Hongwei Li, Wenbo Jiang, Meng Hao, Shui Yu, Xiao Zhang

    Abstract: In recent years, there has been an explosive growth in multimodal learning. Image captioning, a classical multimodal task, has demonstrated promising applications and attracted extensive research attention. However, recent studies have shown that image caption models are vulnerable to some security threats such as backdoor attacks. Existing backdoor attacks against image captioning typically pair…

    Submitted 9 June, 2024; originally announced June 2024.

  5. arXiv:2405.20710  [pdf, other]

    cs.IR

    Information Maximization via Variational Autoencoders for Cross-Domain Recommendation

    Authors: Xuying Ning, Wujiang Xu, Xiaolei Liu, Mingming Ha, Qiongxu Ma, Youru Li, Linxun Chen, Yongfeng Zhang

    Abstract: Cross-Domain Sequential Recommendation (CDSR) methods aim to address the data sparsity and cold-start problems present in Single-Domain Sequential Recommendation (SDSR). Existing CDSR methods typically rely on overlapping users, designing complex cross-domain modules to capture users' latent interests that can propagate across different domains. However, their propagated informative information is…

    Submitted 31 May, 2024; originally announced May 2024.

  6. arXiv:2405.15403  [pdf, other]

    cs.LG stat.ML

    Fine-Grained Dynamic Framework for Bias-Variance Joint Optimization on Data Missing Not at Random

    Authors: Mingming Ha, Xuewen Tao, Wenfang Lin, Qionxu Ma, Wujiang Xu, Linxun Chen

    Abstract: In most practical applications such as recommendation systems, display advertising, and so forth, the collected data often contains missing values and those missing values are generally missing-not-at-random, which deteriorates the prediction performance of models. Some existing estimators and regularizers attempt to achieve unbiased estimation to improve the predictive performance. However, varia…

    Submitted 24 May, 2024; originally announced May 2024.

  7. arXiv:2405.14620  [pdf, other]

    cs.LG

    Closed-form Symbolic Solutions: A New Perspective on Solving Partial Differential Equations

    Authors: Shu Wei, Yanjie Li, Lina Yu, Min Wu, Weijun Li, Meilan Hao, Wenqiang Li, Jingyi Liu, Yusong Deng

    Abstract: Solving partial differential equations (PDEs) in Euclidean space with closed-form symbolic solutions has long been a dream for mathematicians. Inspired by deep learning, Physics-Informed Neural Networks (PINNs) have shown great promise in numerically solving PDEs. However, since PINNs essentially approximate solutions within the continuous function space, their numerical solutions fall short in bo…

    Submitted 23 May, 2024; originally announced May 2024.

  8. arXiv:2404.14687  [pdf, other]

    cs.MM cs.AI cs.CL cs.CV

    Pegasus-v1 Technical Report

    Authors: Raehyuk Jung, Hyojun Go, Jaehyuk Yi, Jiho Jang, Daniel Kim, Jay Suh, Aiden Lee, Cooper Han, Jae Lee, Jeff Kim, Jin-Young Kim, Junwan Kim, Kyle Park, Lucas Lee, Mars Ha, Minjoon Seo, Abraham Jo, Ed Park, Hassan Kianinejad, SJ Kim, Tony Moon, Wade Jeong, Andrei Popescu, Esther Kim, EK Yoon, et al. (19 additional authors not shown)

    Abstract: This technical report introduces Pegasus-1, a multimodal language model specialized in video content understanding and interaction through natural language. Pegasus-1 is designed to address the unique challenges posed by video data, such as interpreting spatiotemporal information, to offer nuanced video content comprehension across various lengths. This technical report overviews Pegasus-1's archi…

    Submitted 22 April, 2024; originally announced April 2024.

  9. arXiv:2404.11816  [pdf, other]

    cs.LG

    Tailoring Generative Adversarial Networks for Smooth Airfoil Design

    Authors: Joyjit Chattoraj, Jian Cheng Wong, Zhang Zexuan, Manna Dai, Xia Yingzhi, Li Jichao, Xu Xinxing, Ooi Chin Chun, Yang Feng, Dao My Ha, Liu Yong

    Abstract: In the realm of aerospace design, achieving smooth curves is paramount, particularly when crafting objects such as airfoils. Generative Adversarial Network (GAN), a widely employed generative AI technique, has proven instrumental in synthesizing airfoil designs. However, a common limitation of GAN is the inherent lack of smoothness in the generated airfoil surfaces. To address this issue, we prese…

    Submitted 17 April, 2024; originally announced April 2024.

  10. arXiv:2404.06330  [pdf, other]

    cs.LG cs.AI

    Generative Pre-Trained Transformer for Symbolic Regression Base In-Context Reinforcement Learning

    Authors: Yanjie Li, Weijun Li, Lina Yu, Min Wu, Jingyi Liu, Wenqiang Li, Meilan Hao, Shu Wei, Yusong Deng

    Abstract: The mathematical formula is the human language to describe nature and is the essence of scientific research. Finding mathematical formulas from observational data is a major demand of scientific research and a major challenge of artificial intelligence. This area is called symbolic regression. Originally symbolic regression was often formulated as a combinatorial optimization problem and solved us…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 21 pages

  11. arXiv:2403.04264  [pdf, other]

    cs.AI

    Competitive Facility Location under Random Utilities and Routing Constraints

    Authors: Hoang Giang Pham, Tien Thanh Dam, Ngan Ha Duong, Tien Mai, Minh Hoang Ha

    Abstract: In this paper, we study a facility location problem within a competitive market context, where customer demand is predicted by a random utility choice model. Unlike prior research, which primarily focuses on simple constraints such as a cardinality constraint on the number of selected locations, we introduce routing constraints that necessitate the selection of locations in a manner that guarantee…

    Submitted 9 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  12. arXiv:2402.18603  [pdf, other]

    cs.LG cs.AI cs.CL

    MMSR: Symbolic Regression is a Multimodal Task

    Authors: Yanjie Li, Jingyi Liu, Weijun Li, Lina Yu, Min Wu, Wenqiang Li, Meilan Hao, Su Wei, Yusong Deng

    Abstract: Mathematical formulas are the crystallization of human wisdom in exploring the laws of nature for thousands of years. Describing the complex laws of nature with a concise mathematical formula is a constant pursuit of scientists and a great challenge for artificial intelligence. This field is called symbolic regression. Symbolic regression was originally formulated as a combinatorial optimization p…

    Submitted 14 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: 12 pages

  13. arXiv:2402.13718  [pdf, other]

    cs.CL

    $\infty$Bench: Extending Long Context Evaluation Beyond 100K Tokens

    Authors: Xinrong Zhang, Yingfa Chen, Shengding Hu, Zihang Xu, Junhao Chen, Moo Khai Hao, Xu Han, Zhen Leng Thai, Shuo Wang, Zhiyuan Liu, Maosong Sun

    Abstract: Processing and reasoning over long contexts is crucial for many practical applications of Large Language Models (LLMs), such as document comprehension and agent construction. Despite recent strides in making LLMs process contexts with more than 100K tokens, there is currently a lack of a standardized benchmark to evaluate this long-context capability. Existing public benchmarks typically focus on…

    Submitted 24 February, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Journal ref: 2023.12.15ARR

  14. arXiv:2402.12175  [pdf, other]

    cs.LG cs.NE

    Learning Discretized Bayesian Networks with GOMEA

    Authors: Damy M. F. Ha, Tanja Alderliesten, Peter A. N. Bosman

    Abstract: Bayesian networks model relationships between random variables under uncertainty and can be used to predict the likelihood of events and outcomes while incorporating observed evidence. From an eXplainable AI (XAI) perspective, such models are interesting as they tend to be compact. Moreover, captured relations can be directly inspected by domain experts. In practice, data is often real-valued. Unl…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: The code is available at: https://github.com/damyha/dbn_gomea

  15. arXiv:2402.10937  [pdf]

    cs.AR cs.AI cs.CE cs.GT cs.LG

    A Lightweight Inception Boosted U-Net Neural Network for Routability Prediction

    Authors: Hailiang Li, Yan Huo, Yan Wang, Xu Yang, Miaohui Hao, Xiao Wang

    Abstract: As the modern CPU, GPU, and NPU chip design complexity and transistor counts keep increasing, and with the relentless shrinking of semiconductor technology nodes to nearly 1 nanometer, the placement and routing have gradually become the two most pivotal processes in modern very-large-scale-integrated (VLSI) circuit back-end design. How to evaluate routability efficiently and accurately in advance…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: The paper is submitted to the International Symposium of EDA (2024, XiAn, China)

  16. arXiv:2401.15103  [pdf, other]

    cs.LG cs.AI

    PruneSymNet: A Symbolic Neural Network and Pruning Algorithm for Symbolic Regression

    Authors: Min Wu, Weijun Li, Lina Yu, Wenqiang Li, Jingyi Liu, Yanjie Li, Meilan Hao

    Abstract: Symbolic regression aims to derive interpretable symbolic expressions from data in order to better understand and interpret data. In this study, a symbolic network called PruneSymNet is proposed for symbolic regression. This is a novel neural network whose activation function consists of common elementary f…

    Submitted 25 January, 2024; originally announced January 2024.

  17. arXiv:2401.14424  [pdf, other]

    cs.LG cs.AI

    Discovering Mathematical Formulas from Data via GPT-guided Monte Carlo Tree Search

    Authors: Yanjie Li, Weijun Li, Lina Yu, Min Wu, Jingyi Liu, Wenqiang Li, Meilan Hao, Shu Wei, Yusong Deng

    Abstract: Finding a concise and interpretable mathematical formula that accurately describes the relationship between each variable and the predicted value in the data is a crucial task in scientific research, as well as a significant challenge in artificial intelligence. This problem is referred to as symbolic regression, which is an NP-hard problem. In the previous year, a novel symbolic regression method…

    Submitted 30 January, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: 24 pages

  18. arXiv:2401.04246  [pdf, other]

    cs.LG q-bio.BM

    Scalable Normalizing Flows Enable Boltzmann Generators for Macromolecules

    Authors: Joseph C. Kim, David Bloore, Karan Kapoor, Jun Feng, Ming-Hong Hao, Mengdi Wang

    Abstract: The Boltzmann distribution of a protein provides a roadmap to all of its functional states. Normalizing flows are a promising tool for modeling this distribution, but current methods are intractable for typical pharmacological targets; they become computationally intractable due to the size of the system, heterogeneity of intra-molecular potential energy, and long-range interactions. To remedy the…

    Submitted 8 January, 2024; originally announced January 2024.

  19. arXiv:2401.03968  [pdf, other]

    q-bio.QM cs.LG q-bio.GN

    scDiffusion: conditional generation of high-quality single-cell data using diffusion model

    Authors: Erpai Luo, Minsheng Hao, Lei Wei, Xuegong Zhang

    Abstract: Single-cell RNA sequencing (scRNA-seq) data are important for studying the laws of life at single-cell level. However, it is still challenging to obtain enough high-quality scRNA-seq data. To mitigate the limited availability of data, generative models have been proposed to computationally generate synthetic scRNA-seq data. Nevertheless, the data generated with current models are not very realisti…

    Submitted 4 March, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

  20. arXiv:2401.01772  [pdf, other]

    cs.AI cs.NI

    A Novel Paradigm for Neural Computation: X-Net with Learnable Neurons and Adaptable Structure

    Authors: Yanjie Li, Weijun Li, Lina Yu, Min Wu, Jinyi Liu, Wenqiang Li, Meilan Hao, Shu Wei, Yusong Deng, Liping Zhang, Xiaoli Dong, Hong Qin, Xin Ning, Yugui Zhang, Baoli Lu, Jian Xu, Shuang Li

    Abstract: Multilayer perceptron (MLP) has permeated various disciplinary domains, ranging from bioinformatics to financial analytics, where its application has become an indispensable facet of contemporary scientific research endeavors. However, MLP has obvious drawbacks. 1) The type of activation function is single and relatively fixed, which leads to poor 'representation ability' of the network, and it…

    Submitted 12 July, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

    Comments: 35 pages

  21. arXiv:2311.15156  [pdf, other]

    cs.LG cs.AI q-bio.GN

    xTrimoGene: An Efficient and Scalable Representation Learner for Single-Cell RNA-Seq Data

    Authors: Jing Gong, Minsheng Hao, Xingyi Cheng, Xin Zeng, Chiming Liu, Jianzhu Ma, Xuegong Zhang, Taifeng Wang, Le Song

    Abstract: Advances in high-throughput sequencing technology have led to significant progress in measuring gene expressions at the single-cell level. The amount of publicly available single-cell RNA-seq (scRNA-seq) data is already surpassing 50M records for humans with each record measuring 20,000 genes. This highlights the need for unsupervised representation learning to fully ingest these data, yet classic…

    Submitted 24 February, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

    Comments: Accepted by NeurIPS 2023

  22. arXiv:2311.07326  [pdf, other]

    cs.LG cs.AI

    MetaSymNet: A Dynamic Symbolic Regression Network Capable of Evolving into Arbitrary Formulations

    Authors: Yanjie Li, Weijun Li, Lina Yu, Min Wu, Jinyi Liu, Wenqiang Li, Meilan Hao, Shu Wei, Yusong Deng

    Abstract: Mathematical formulas serve as the means of communication between humans and nature, encapsulating the operational laws governing natural phenomena. The concise formulation of these laws is a crucial objective in scientific research and an important challenge for artificial intelligence (AI). While traditional artificial neural networks (MLP) excel at data fitting, they often yield uninterpretable…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 16 pages

  23. arXiv:2311.04760  [pdf, other]

    cs.IR cs.LG

    Towards Open-world Cross-Domain Sequential Recommendation: A Model-Agnostic Contrastive Denoising Approach

    Authors: Wujiang Xu, Xuying Ning, Wenfang Lin, Mingming Ha, Qiongxu Ma, Qianqiao Liang, Xuewen Tao, Linxun Chen, Bing Han, Minnan Luo

    Abstract: Cross-domain sequential recommendation (CDSR) aims to address the data sparsity problems that exist in traditional sequential recommendation (SR) systems. The existing approaches aim to design a specific cross-domain unit that can transfer and propagate information across multiple domains by relying on overlapping users with abundant behaviors. However, in real-world recommender systems, CDSR sc…

    Submitted 5 June, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

  24. Rethinking Cross-Domain Sequential Recommendation under Open-World Assumptions

    Authors: Wujiang Xu, Qitian Wu, Runzhong Wang, Mingming Ha, Qiongxu Ma, Linxun Chen, Bing Han, Junchi Yan

    Abstract: Cross-Domain Sequential Recommendation (CDSR) methods aim to tackle the data sparsity and cold-start problems present in Single-Domain Sequential Recommendation (SDSR). Existing CDSR works design their elaborate structures relying on overlapping users to propagate the cross-domain information. However, current CDSR methods make closed-world assumptions, assuming fully overlapping users across mult…

    Submitted 12 April, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Journal ref: Proceedings of the ACM Web Conference 2024 (WWW '24)

  25. arXiv:2309.13705  [pdf, other]

    cs.LG cs.AI

    A Neural-Guided Dynamic Symbolic Network for Exploring Mathematical Expressions from Data

    Authors: Wenqiang Li, Weijun Li, Lina Yu, Min Wu, Linjun Sun, Jingyi Liu, Yanjie Li, Shu Wei, Yusong Deng, Meilan Hao

    Abstract: Symbolic regression (SR) is a powerful technique for discovering the underlying mathematical expressions from observed data. Inspired by the success of deep learning, recent deep generative SR methods have shown promising results. However, these methods face difficulties in processing high-dimensional problems and learning constants due to the large search space, and they don't scale well to unsee…

    Submitted 1 June, 2024; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: This paper has been accepted by ICML 2024

  26. arXiv:2309.10361  [pdf, other]

    cs.CV cs.LG cs.MM

    Improving CLIP Robustness with Knowledge Distillation and Self-Training

    Authors: Clement Laroudie, Andrei Bursuc, Mai Lan Ha, Gianni Franchi

    Abstract: This paper examines the robustness of a multi-modal computer vision model, CLIP (Contrastive Language-Image Pretraining), in the context of unsupervised learning. The main objective is twofold: first, to evaluate the robustness of CLIP, and second, to explore strategies for augmenting its robustness. To achieve this, we introduce a novel approach named LP-CLIP. This technique involves the distilla…

    Submitted 19 September, 2023; originally announced September 2023.

  27. arXiv:2308.04823  [pdf]

    cs.CL

    Evaluating the Generation Capabilities of Large Chinese Language Models

    Authors: Hui Zeng, Jingyuan Xue, Meng Hao, Chen Sun, Bin Ning, Na Zhang

    Abstract: This paper unveils CG-Eval, the first-ever comprehensive and automated evaluation framework designed for assessing the generative capabilities of large Chinese language models across a spectrum of academic disciplines. CG-Eval stands out for its automated process, which critically assesses models based on their proficiency in generating precise and contextually relevant responses to a diverse arra…

    Submitted 29 January, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

  28. arXiv:2308.02870  [pdf, other]

    cs.CL cs.SD eess.AS

    ApproBiVT: Lead ASR Models to Generalize Better Using Approximated Bias-Variance Tradeoff Guided Early Stopping and Checkpoint Averaging

    Authors: Fangyuan Wang, Ming Hao, Yuhai Shi, Bo Xu

    Abstract: The conventional recipe for Automatic Speech Recognition (ASR) models is to 1) train multiple checkpoints on a training set while relying on a validation set to prevent overfitting using early stopping and 2) average several last checkpoints or that of the lowest validation losses to obtain the final model. In this paper, we rethink and update the early stopping and checkpoint averaging from the p…

    Submitted 5 August, 2023; originally announced August 2023.

  29. arXiv:2306.04192  [pdf, other]

    cs.CR

    Extracting Cloud-based Model with Prior Knowledge

    Authors: Shiqian Zhao, Kangjie Chen, Meng Hao, Jian Zhang, Guowen Xu, Hongwei Li, Tianwei Zhang

    Abstract: Machine Learning-as-a-Service, a pay-as-you-go business pattern, is widely accepted by third-party users and developers. However, the open inference APIs may be utilized by malicious customers to conduct model extraction attacks, i.e., attackers can replicate a cloud-based black-box model merely via querying malicious examples. Existing model extraction attacks mainly depend on the posterior knowl…

    Submitted 13 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

  30. arXiv:2305.19569  [pdf]

    cs.LG cs.AI cs.CY eess.SP

    Domain knowledge-informed Synthetic fault sample generation with Health Data Map for cross-domain Planetary Gearbox Fault Diagnosis

    Authors: Jong Moon Ha, Olga Fink

    Abstract: Extensive research has been conducted on fault diagnosis of planetary gearboxes using vibration signals and deep learning (DL) approaches. However, DL-based methods are susceptible to the domain shift problem caused by varying operating conditions of the gearbox. Although domain adaptation and data synthesis methods have been proposed to overcome such domain shifts, they are often not directly app…

    Submitted 26 November, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: Under review / added arXiv identifier / Updated to revised version

    Journal ref: Published in Mechanical Systems and Signal Processing Volume 202, 1 November 2023, 110680

  31. Blockchain-enabled Parametric Solar Energy Insurance via Remote Sensing

    Authors: Mingyu Hao, Keyang Qian, Sid Chi-Kin Chau

    Abstract: Despite its popularity, the nature of solar energy is highly uncertain and weather dependent, affecting the business viability and investment of solar energy generation, especially for household users. To stabilize the income from solar energy generation, there have been limited traditional options, such as using energy storage to pool excessive solar energy in off-peak periods or financial deriva…

    Submitted 17 May, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: To appear in ACM e-Energy 2023

  32. arXiv:2305.08384  [pdf, other]

    cs.CR cs.NI

    Privacy-preserving Blockchain-enabled Parametric Insurance via Remote Sensing and IoT

    Authors: Mingyu Hao, Keyang Qian, Sid Chi-Kin Chau

    Abstract: Traditional Insurance, a popular approach of financial risk management, has suffered from the issues of high operational costs, opaqueness, inefficiency and a lack of trust. Recently, blockchain-enabled "parametric insurance" through authorized data sources (e.g., remote sensing and IoT) aims to overcome these issues by automating the underwriting and claim processes of insurance policies on a blo…

    Submitted 15 May, 2023; originally announced May 2023.

  33. arXiv:2304.02472  [pdf, other]

    q-fin.RM cs.LG q-fin.TR

    Learning to Predict Short-Term Volatility with Order Flow Image Representation

    Authors: Artem Lensky, Mingyu Hao

    Abstract: Introduction: The paper addresses the challenging problem of predicting the short-term realized volatility of the Bitcoin price using order flow information. The inherent stochastic nature and anti-persistence of price pose difficulties in accurate prediction. Methods: To address this, we propose a method that transforms order flow data over a fixed time interval (snapshots) into images. The ord…

    Submitted 20 March, 2024; v1 submitted 4 April, 2023; originally announced April 2023.

  34. arXiv:2303.05565  [pdf, other]

    cs.RO eess.SY

    Towards Generalized Robot Assembly through Compliance-Enabled Contact Formations

    Authors: Andrew S. Morgan, Quentin Bateux, Mei Hao, Aaron M. Dollar

    Abstract: Contact can be conceptualized as a set of constraints imposed on two bodies that are interacting with one another in some way. The nature of a contact, whether a point, line, or surface, dictates how these bodies are able to move with respect to one another given a force, and a set of contacts can provide either partial or full constraint on a body's motion. Decades of work have explored how to ex…

    Submitted 9 March, 2023; originally announced March 2023.

    Journal ref: IEEE International Conference on Robotics and Automation (ICRA), 2023

  35. arXiv:2302.05919  [pdf, other]

    cs.IR

    Neural Node Matching for Multi-Target Cross Domain Recommendation

    Authors: Wujiang Xu, Shaoshuai Li, Mingming Ha, Xiaobo Guo, Qiongxu Ma, Xiaolei Liu, Linxun Chen, Zhenfeng Zhu

    Abstract: Multi-Target Cross Domain Recommendation (CDR) has attracted a surge of interest recently, which intends to improve the recommendation performance in multiple domains (or systems) simultaneously. Most existing multi-target CDR frameworks primarily rely on the existence of the majority of overlapped users across domains. However, general practical CDR scenarios cannot meet the strictly overlapping r…

    Submitted 12 February, 2023; originally announced February 2023.

    Comments: 13 pages

    Journal ref: The IEEE International Conference on Data Engineering 2023

  36. arXiv:2302.05114  [pdf]

    cs.CV

    Exploiting Neighborhood Structural Features for Change Detection

    Authors: Mengmeng Wang, Zhiqiang Han, Peizhen Yang, Bai Zhu, Ming Hao, Jianwei Fan, Yuanxin Ye

    Abstract: In this letter, a novel method for change detection is proposed using neighborhood structure correlation. Because structure features are insensitive to the intensity differences between bi-temporal images, we perform the correlation analysis on structure features rather than intensity information. First, we extract the structure feature maps by using multi-orientated gradient information. Then, th…

    Submitted 10 February, 2023; originally announced February 2023.

  37. arXiv:2302.03731  [pdf, other]

    cs.LG q-bio.QM

    MMA-RNN: A Multi-level Multi-task Attention-based Recurrent Neural Network for Discrimination and Localization of Atrial Fibrillation

    Authors: Yifan Sun, Jingyan Shen, Yunfan Jiang, Zhaohui Huang, Minsheng Hao, Xuegong Zhang

    Abstract: The automatic detection of atrial fibrillation based on electrocardiograph (ECG) signals has received wide attention both clinically and practically. It is challenging to process ECG signals with cyclical pattern, varying length and unstable quality due to noise and distortion. Besides, there has been insufficient research on separating persistent atrial fibrillation from paroxysmal atrial fibrill…

    Submitted 8 February, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: 9 pages, 5 figures

  38. Primal-Dual Cops and Robber

    Authors: Minh Tuan Ha, Paul Jungeblut, Torsten Ueckerdt, Paweł Żyliński

    Abstract: Cops and Robber is a family of two-player games played on graphs in which one player controls a number of cops and the other player controls a robber. In alternating turns, each player moves (all) their figures. The cops try to capture the robber while the latter tries to flee indefinitely. In this paper we consider a variant of the game played on a planar graph where the robber moves between adja…

    Submitted 10 January, 2024; v1 submitted 13 January, 2023; originally announced January 2023.

    Comments: Equal to the published version

    Journal ref: Computing in Geometry and Topology, 3(2), 4:1-4:12 (2024)

  39. arXiv:2301.02494  [pdf, other]

    cs.LG cs.AI

    Adaptive Pattern Extraction Multi-Task Learning for Multi-Step Conversion Estimations

    Authors: Xuewen Tao, Mingming Ha, Xiaobo Guo, Qiongxu Ma, Hongwei Cheng, Wenfang Lin

    Abstract: Multi-task learning (MTL), which aims to simultaneously solve multiple tasks with a single model, has been successfully used in many real-world applications. The general idea of multi-task learning is to design global parameter-sharing mechanisms and task-specific feature extractors to improve the performance of all tasks. However, a challenge still remains in balancing the trade-off of variou…

    Submitted 23 January, 2023; v1 submitted 6 January, 2023; originally announced January 2023.

    Comments: 18 pages, 9 figures

  40. arXiv:2212.00024  [pdf, other]

    cs.LG cs.AI

    Semi-Supervised Heterogeneous Graph Learning with Multi-level Data Augmentation

    Authors: Ying Chen, Siwei Qiang, Mingming Ha, Xiaolei Liu, Shaoshuai Li, Lingfeng Yuan, Xiaobo Guo, Zhenfeng Zhu

    Abstract: In recent years, semi-supervised graph learning with data augmentation (DA) has become the most commonly used and best-performing method to enhance model robustness in sparse scenarios with few labeled samples. Unlike in homogeneous graphs, DA in heterogeneous graphs faces greater challenges: heterogeneity of information requires DA strategies to effectively handle heterogeneous relations, whic…

    Submitted 30 November, 2022; originally announced December 2022.

  41. arXiv:2211.07166  [pdf, other]

    cs.LG cs.CR cs.DC

    Optimal Privacy Preserving for Federated Learning in Mobile Edge Computing

    Authors: Hai M. Nguyen, Nam H. Chu, Diep N. Nguyen, Dinh Thai Hoang, Van-Dinh Nguyen, Minh Hoang Ha, Eryk Dutkiewicz, Marwan Krunz

    Abstract: Federated Learning (FL) with quantization and deliberately added noise over wireless networks is a promising approach to preserve user differential privacy (DP) while reducing wireless resources. Specifically, an FL process can be fused with quantized Binomial mechanism-based updates contributed by multiple users. However, optimizing quantization parameters, communication resources (e.g., transmit…

    Submitted 20 May, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: 16 pages, 10 figures

  42. arXiv:2211.05405  [pdf, other]

    cs.CV cs.CL

    VieCap4H-VLSP 2021: ObjectAoA-Enhancing performance of Object Relation Transformer with Attention on Attention for Vietnamese image captioning

    Authors: Nghia Hieu Nguyen, Duong T. D. Vo, Minh-Quan Ha

    Abstract: Image captioning is a challenging task that requires the ability both to understand the visual information in an image and to describe it in human language. In this paper, we propose an efficient way to improve the image understanding ability of transformer-based methods by extending the Object Relation Transformer architecture with the Attention on Attention mechanism. Experim…

    Submitted 20 March, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

    Comments: Accepted for publishing at the VNU Journal of Science: Computer Science and Communication Engineering

  43. arXiv:2201.01684  [pdf, other]

    cs.DC cs.AI cs.LG cs.PF

    Dynamic GPU Energy Optimization for Machine Learning Training Workloads

    Authors: Farui Wang, Weizhe Zhang, Shichao Lai, Meng Hao, Zheng Wang

    Abstract: GPUs are widely used to accelerate the training of machine learning workloads. As modern machine learning models grow larger, they take longer to train, leading to higher GPU energy consumption. This paper presents GPOEO, an online GPU energy optimization framework for machine learning training workloads. GPOEO dynamically determines the optimal energy configuration by emp…

    Submitted 5 January, 2022; originally announced January 2022.

    Comments: Accepted for publication in IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS)

  44. arXiv:2111.11307  [pdf, ps, other]

    cs.DS

    An efficient branch-and-cut algorithm for the parallel drone scheduling traveling salesman problem

    Authors: Minh Anh Nguyen, Hai Long Luong, Minh Hoàng Hà, Ha-Bang Ban

    Abstract: We propose an efficient branch-and-cut algorithm to exactly solve the parallel drone scheduling traveling salesman problem. Our algorithm can find optimal solutions for all but two existing instances with up to 229 customers in a reasonable running time. To make the problem more challenging for future methods, we introduce two new sets of 120 larger instances with the number of customers varying f…

    Submitted 9 November, 2021; originally announced November 2021.

  45. Capacity Enhancement for Reconfigurable Intelligent Surface-Aided Wireless Network: from Regular Array to Irregular Array

    Authors: Ruochen Su, Linglong Dai, Jingbo Tan, Mo Hao, Richard MacKenzie

    Abstract: Reconfigurable intelligent surface (RIS) is promising for future 6G wireless communications. However, an increased number of RIS elements results in high overhead for channel acquisition and non-negligible power consumption. Therefore, improving the system capacity with a limited number of RIS elements is essential. Unlike the classical regular RIS whose elements are arranged on a regular grid,…

    Submitted 13 January, 2023; v1 submitted 30 September, 2021; originally announced September 2021.

    Comments: Accepted by IEEE Transactions on Vehicular Technology. Simulation codes are provided at: http://oa.ee.tsinghua.edu.cn/dailinglong/publications/publications.html

  46. arXiv:2108.00968  [pdf, other]

    cs.CV cs.AI stat.ML

    Robust Semantic Segmentation with Superpixel-Mix

    Authors: Gianni Franchi, Nacim Belkhir, Mai Lan Ha, Yufei Hu, Andrei Bursuc, Volker Blanz, Angela Yao

    Abstract: Along with predictive performance and runtime speed, reliability is a key requirement for real-world semantic segmentation. Reliability encompasses robustness, predictive uncertainty and reduced bias. To improve reliability, we introduce Superpixel-mix, a new superpixel-based data augmentation method with teacher-student consistency training. Unlike other mixing-based augmentation techniques, mixi…

    Submitted 21 October, 2021; v1 submitted 2 August, 2021; originally announced August 2021.

    Comments: Accepted to BMVC2021

  47. arXiv:2107.06209  [pdf, other]

    cs.CV cs.LG

    Learning a Discriminant Latent Space with Neural Discriminant Analysis

    Authors: Mai Lan Ha, Gianni Franchi, Emanuel Aldea, Volker Blanz

    Abstract: Discriminative features play an important role in image and object classification and also in other fields of research such as semi-supervised learning, fine-grained classification, and out-of-distribution detection. Inspired by Linear Discriminant Analysis (LDA), we propose an optimization called Neural Discriminant Analysis (NDA) for Deep Convolutional Neural Networks (DCNNs). NDA transforms deep fe…

    Submitted 13 July, 2021; originally announced July 2021.

  48. arXiv:2107.06187  [pdf, other]

    cs.CV cs.LG

    Deep Ranking with Adaptive Margin Triplet Loss

    Authors: Mai Lan Ha, Volker Blanz

    Abstract: We propose a simple modification that turns a fixed-margin triplet loss into an adaptive-margin triplet loss. While the original triplet loss is used widely in classification problems such as face recognition, face re-identification and fine-grained similarity, our proposed loss is well suited for rating datasets in which the ratings are continuous values. In contrast to the original triplet loss where we hav…

    Submitted 13 July, 2021; originally announced July 2021.

  49. arXiv:2107.02133  [pdf, other]

    cs.CV

    Test-Time Personalization with a Transformer for Human Pose Estimation

    Authors: Yizhuo Li, Miao Hao, Zonglin Di, Nitesh B. Gundavarapu, Xiaolong Wang

    Abstract: We propose to personalize a human pose estimator given a set of test images of a person without using any manual annotations. While there has been significant advancement in human pose estimation, it is still very challenging for a model to generalize to different unknown environments and unseen persons. Instead of using a fixed model for every test case, we adapt our pose estimator during test time t…

    Submitted 7 November, 2021; v1 submitted 5 July, 2021; originally announced July 2021.

    Comments: Project page: http://liyz15.github.io/TTP/

  50. arXiv:2106.01921  [pdf, ps, other]

    stat.ML cs.LG stat.AP

    Sample Selection Bias in Evaluation of Prediction Performance of Causal Models

    Authors: James P. Long, Min Jin Ha

    Abstract: Causal models are notoriously difficult to validate because they make untestable assumptions regarding confounding. New scientific experiments offer the possibility of evaluating causal models using prediction performance. Prediction performance measures are typically robust to violations in causal assumptions. However, prediction performance does depend on the selection of training and test sets.…

    Submitted 26 October, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: 12 pages, 4 figures, 2 tables