Showing 1–50 of 162 results for author: Hou, C

  1. arXiv:2408.17214  [pdf, other]

    cs.IR

    Efficient Multi-task Prompt Tuning for Recommendation

    Authors: Ting Bai, Le Huang, Yue Yu, Cheng Yang, Cheng Hou, Zhe Zhao, Chuan Shi

    Abstract: With the expansion of business scenarios, real recommender systems are facing challenges in dealing with the constantly emerging new tasks in multi-task learning frameworks. In this paper, we attempt to improve the generalization ability of multi-task recommendations when dealing with new tasks. We find that joint training will enhance the performance of the new task but always negatively impact e…

    Submitted 30 August, 2024; originally announced August 2024.

  2. arXiv:2408.10124  [pdf, other]

    cs.LG cs.AI cs.IR physics.chem-ph q-bio.BM

    Molecular Graph Representation Learning Integrating Large Language Models with Domain-specific Small Models

    Authors: Tianyu Zhang, Yuxiang Ren, Chengbin Hou, Hairong Lv, Xuegong Zhang

    Abstract: Molecular property prediction is a crucial foundation for drug discovery. In recent years, pre-trained deep learning models have been widely applied to this task. Some approaches that incorporate prior biological domain knowledge into the pre-training framework have achieved impressive results. However, these methods heavily rely on biochemical experts, and retrieving and summarizing vast amounts…

    Submitted 19 August, 2024; originally announced August 2024.

  3. arXiv:2408.07004  [pdf, other]

    cs.CR cs.AI

    Casper: Prompt Sanitization for Protecting User Privacy in Web-Based Large Language Models

    Authors: Chun Jie Chong, Chenxi Hou, Zhihao Yao, Seyed Mohammadjavad Seyed Talebi

    Abstract: Web-based Large Language Model (LLM) services have been widely adopted and have become an integral part of our Internet experience. Third-party plugins enhance the functionalities of LLM by enabling access to real-world data and services. However, the privacy consequences associated with these services and their third-party plugins are not well understood. Sensitive prompt data are stored, process…

    Submitted 13 August, 2024; originally announced August 2024.

  4. Towards High-resolution 3D Anomaly Detection via Group-Level Feature Contrastive Learning

    Authors: Hongze Zhu, Guoyang Xie, Chengbin Hou, Tao Dai, Can Gao, Jinbao Wang, Linlin Shen

    Abstract: High-resolution point clouds (HRPCD) anomaly detection (AD) plays a critical role in precision machining and high-end equipment manufacturing. Despite considerable 3D-AD methods that have been proposed recently, they still cannot meet the requirements of the HRPCD-AD task. There are several challenges: i) It is difficult to directly capture HRPCD information due to large amounts of points at the s…

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: ACMMM24, 12 pages, 5 figures

  5. arXiv:2407.14491  [pdf, other]

    cs.CV

    PD-APE: A Parallel Decoding Framework with Adaptive Position Encoding for 3D Visual Grounding

    Authors: Chenshu Hou, Liang Peng, Xiaopei Wu, Xiaofei He, Wenxiao Wang

    Abstract: 3D visual grounding aims to identify objects in 3D point cloud scenes that match specific natural language descriptions. This requires the model to not only focus on the target object itself but also to consider the surrounding environment to determine whether the descriptions are met. Most previous works attempt to accomplish both tasks within the same module, which can easily lead to a distracti…

    Submitted 2 September, 2024; v1 submitted 19 July, 2024; originally announced July 2024.

  6. arXiv:2407.02542  [pdf, other]

    cs.IR cs.AI cs.LG

    ECAT: A Entire space Continual and Adaptive Transfer Learning Framework for Cross-Domain Recommendation

    Authors: Chaoqun Hou, Yuanhang Zhou, Yi Cao, Tong Liu

    Abstract: In industrial recommendation systems, there are several mini-apps designed to meet the diverse interests and needs of users. The sample space of them is merely a small subset of the entire space, making it challenging to train an efficient model. In recent years, there have been many excellent studies related to cross-domain recommendation aimed at mitigating the problem of data sparsity. However,…

    Submitted 2 July, 2024; originally announced July 2024.

  7. arXiv:2406.10126  [pdf, other]

    cs.CV

    Training-free Camera Control for Video Generation

    Authors: Chen Hou, Guoqiang Wei, Yan Zeng, Zhibo Chen

    Abstract: We propose a training-free and robust solution to offer camera movement control for off-the-shelf video diffusion models. Unlike previous work, our method does not require any supervised finetuning on camera-annotated datasets or self-supervised training via data augmentation. Instead, it can be plugged and played with most pretrained video diffusion models and generate camera controllable videos…

    Submitted 14 June, 2024; originally announced June 2024.

  8. arXiv:2406.08698  [pdf, other]

    astro-ph.HE hep-ph

    Constraints on Ultra Heavy Dark Matter Properties from Dwarf Spheroidal Galaxies with LHAASO Observations

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: In this work we search for signals generated by ultra-heavy dark matter in Large High Altitude Air Shower Observatory (LHAASO) data. We look for possible gamma rays from dark matter annihilation or decay in 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter which have low fluxes…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 17 pages, 12 figures, accepted by PRL

  9. arXiv:2406.03695  [pdf, other]

    cs.CR

    FACOS: Enabling Privacy Protection Through Fine-Grained Access Control with On-chain and Off-chain System

    Authors: Chao Liu, Cankun Hou, Tianyu Jiang, Jianting Ning, Hui Qiao, Yusen Wu

    Abstract: In the data-driven landscape across finance, government, and healthcare, the continuous generation of information demands robust solutions for secure storage, efficient dissemination, and fine-grained access control. Blockchain technology emerges as a significant tool, offering decentralized storage while upholding the tenets of data security and accessibility. However, on-chain and off-chain strategies…

    Submitted 5 June, 2024; originally announced June 2024.

  10. arXiv:2406.02958  [pdf, other]

    cs.LG cs.AI cs.CL cs.CR cs.DC

    PrE-Text: Training Language Models on Private Federated Data in the Age of LLMs

    Authors: Charlie Hou, Akshat Shrivastava, Hongyuan Zhan, Rylan Conway, Trang Le, Adithya Sagar, Giulia Fanti, Daniel Lazar

    Abstract: On-device training is currently the most common approach for training machine learning (ML) models on private, distributed user data. Despite this, on-device training has several drawbacks: (1) most user devices are too small to train large models on-device, (2) on-device training is communication- and computation-intensive, and (3) on-device training can be difficult to debug and deploy. To addre…

    Submitted 17 July, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: ICML 2024 (Oral)

  11. arXiv:2405.11826  [pdf, other]

    astro-ph.IM hep-ex physics.ins-det

    Data quality control system and long-term performance monitor of the LHAASO-KM2A

    Authors: Zhen Cao, F. Aharonian, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, H. X. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen , et al. (263 additional authors not shown)

    Abstract: The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To…

    Submitted 13 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 15 pages, 9 figures

  12. arXiv:2405.10626  [pdf, other]

    cs.CL

    Dynamic data sampler for cross-language transfer learning in large language models

    Authors: Yudong Li, Yuhao Feng, Wen Zhou, Zhe Zhao, Linlin Shen, Cheng Hou, Xianxu Hou

    Abstract: Large Language Models (LLMs) have gained significant attention in the field of natural language processing (NLP) due to their wide range of applications. However, training LLMs for languages other than English poses significant challenges, due to the difficulty in acquiring large-scale corpus and the requisite computing resources. In this paper, we propose ChatFlow, a cross-language transfer-based…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted by ICASSP 2024

  13. arXiv:2405.10103  [pdf]

    physics.app-ph cond-mat.mtrl-sci

    One-step Pulsed Laser Deposition of Metal oxynitride/Carbon Composites for Supercapacitor Application

    Authors: Subrata Ghosh, Giacomo Pagani, Massimilano Righi, Chengxi Hou, Valeria Russo, Carlo S. Casari

    Abstract: Advanced material composite of nanocarbons and metal-based materials provides a synergistic effect to obtain excellent electrochemical charge-storage performance and other properties. Herein, 3D porous carbon-metal oxynitride nanocomposites with tunable carbon/metal and oxygen/nitrogen ratio are synthesized uniquely by simultaneous ablation from two different targets by single-step pulsed laser de…

    Submitted 29 August, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: 18 pages, 8 Figures, 1 table, 9 figures in supplementary figures

    Journal ref: Journal of Physics D: Applied Physics 2024

  14. arXiv:2405.07691  [pdf, other]

    astro-ph.HE

    Discovery of Very-high-energy Gamma-ray Emissions from the Low Luminosity AGN NGC 4278 by LHAASO

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: The first source catalog of Large High Altitude Air Shower Observatory reported the detection of a very-high-energy gamma ray source, 1LHAASO J1219+2915. In this paper a further detailed study of the spectral and temporal behavior of this point-like source has been carried out. The best-fit position of the TeV source ($\rm{RA}=185.05^{\circ}\pm0.04^{\circ}$, $\rm{Dec}=29.25^{\circ}\pm0.03^{\circ}$) i…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 11 pages, 5 figures

  15. arXiv:2404.14002  [pdf, ps, other]

    math.OA

    A note on the orbit equivalence of injective actions

    Authors: Xiangqi Qiang, Chengjun Hou

    Abstract: We characterise the groupoid $C^*$-algebras associated to the transformation groupoids of injective actions of discrete countable Ore semi-groups on compact topological spaces in terms of the reduced crossed product from the dual actions, and characterise the continuous orbit equivalence for injective actions by means of the transformation groupoids, as well as their reduced groupoid $C^*$-algebra…

    Submitted 22 April, 2024; originally announced April 2024.

  16. arXiv:2404.04801  [pdf, ps, other]

    astro-ph.IM astro-ph.HE

    LHAASO-KM2A detector simulation using Geant4

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (254 additional authors not shown)

    Abstract: KM2A is one of the main sub-arrays of LHAASO, working on gamma ray astronomy and cosmic ray physics at energies above 10 TeV. Detector simulation is an important foundation for estimating detector performance and data analysis. It is a big challenge to simulate the KM2A detector in the framework of Geant4 due to the need to track numerous photons from a large number of detector units (>6000) with…

    Submitted 7 April, 2024; originally announced April 2024.

  17. arXiv:2403.10127  [pdf, other]

    cs.CV

    TransLandSeg: A Transfer Learning Approach for Landslide Semantic Segmentation Based on Vision Foundation Model

    Authors: Changhong Hou, Junchuan Yu, Daqing Ge, Liu Yang, Laidian Xi, Yunxuan Pang, Yi Wen

    Abstract: Landslides are one of the most destructive natural disasters in the world, posing a serious threat to human life and safety. The development of foundation models has provided a new research paradigm for large-scale landslide detection. The Segment Anything Model (SAM) has garnered widespread attention in the field of image segmentation. However, our experiment found that SAM performed poorly in th…

    Submitted 15 March, 2024; originally announced March 2024.

  18. Measurements of All-Particle Energy Spectrum and Mean Logarithmic Mass of Cosmic Rays from 0.3 to 30 PeV with LHAASO-KM2A

    Authors: The LHAASO Collaboration, Zhen Cao, F. Aharonian, Q. An, A. Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen , et al. (256 additional authors not shown)

    Abstract: We present the measurements of all-particle energy spectrum and mean logarithmic mass of cosmic rays in the energy range of 0.3-30 PeV using data collected from LHAASO-KM2A between September 2021 and December 2022, which is based on a nearly composition-independent energy reconstruction method, achieving unprecedented accuracy. Our analysis reveals the position of the knee at…

    Submitted 26 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: 8 pages, 3 figures

    Journal ref: Physical Review Letters 132, 131002 (2024)

  19. arXiv:2402.18905  [pdf, other]

    cs.LG cs.AI cs.CR math.OC

    On the Convergence of Differentially-Private Fine-tuning: To Linearly Probe or to Fully Fine-tune?

    Authors: Shuqi Ke, Charlie Hou, Giulia Fanti, Sewoong Oh

    Abstract: Differentially private (DP) machine learning pipelines typically involve a two-phase process: non-private pre-training on a public dataset, followed by fine-tuning on private data using DP optimization techniques. In the DP setting, it has been observed that full fine-tuning may not always yield the best test accuracy, even for in-distribution data. This paper (1) analyzes the training dynamics of…

    Submitted 29 February, 2024; originally announced February 2024.

  20. Label Informed Contrastive Pretraining for Node Importance Estimation on Knowledge Graphs

    Authors: Tianyu Zhang, Chengbin Hou, Rui Jiang, Xuegong Zhang, Chenghu Zhou, Ke Tang, Hairong Lv

    Abstract: Node Importance Estimation (NIE) is a task of inferring importance scores of the nodes in a graph. Due to the availability of richer data and knowledge, recent research interest in NIE has been devoted to knowledge graphs for predicting future or missing node importance scores. Existing state-of-the-art NIE methods train the model using available labels, and they consider every interested node e…

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted by IEEE TNNLS

  21. arXiv:2401.01065  [pdf, other]

    cs.CV cs.AI

    BEV-TSR: Text-Scene Retrieval in BEV Space for Autonomous Driving

    Authors: Tao Tang, Dafeng Wei, Zhengyu Jia, Tian Gao, Changwei Cai, Chengkai Hou, Peng Jia, Kun Zhan, Haiyang Sun, Jingchen Fan, Yixing Zhao, Fu Liu, Xiaodan Liang, Xianpeng Lang, Yang Wang

    Abstract: The rapid development of the autonomous driving industry has led to a significant accumulation of autonomous driving data. Consequently, there comes a growing demand for retrieving data to provide specialized optimization. However, directly applying previous image retrieval methods faces several challenges, such as the lack of global feature representation and inadequate text retrieval ability for…

    Submitted 18 June, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

  22. arXiv:2312.15707  [pdf, other]

    cs.CV

    High-Fidelity Diffusion-based Image Editing

    Authors: Chen Hou, Guoqiang Wei, Zhibo Chen

    Abstract: Diffusion models have attained remarkable success in the domains of image generation and editing. It is widely recognized that employing larger inversion and denoising steps in diffusion model leads to improved image reconstruction quality. However, the editing performance of diffusion models tends to be no more satisfactory even with increasing denoising steps. The deficiency in editing could be…

    Submitted 4 January, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI 2024

  23. Influence of initial states on memory effects: A study of early-time superradiance

    Authors: S. C. Hou, G. Q. Shuai, X. Y. Zhang, J. Shen, X. X. Yi

    Abstract: The initial state of a quantum system can significantly influence its future dynamics, especially in non-Markovian quantum processes due to the environmental memory effects. Based on a previous work of ours, we propose a method to quantify the memory effects of a non-Markovian quantum process conditioned on a particular system initial state. We apply our method to study the early-time dynamics of…

    Submitted 21 May, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 17 pages, 12 figures

    Journal ref: Phys. Rev. A 109, 053708 (2024)

  24. arXiv:2312.01637  [pdf]

    physics.ao-ph physics.geo-ph

    Near-real-time monitoring of global ocean carbon sink

    Authors: Piyu Ke, Xiaofan Gui, Wei Cao, Dezhi Wang, Ce Hou, Lixing Wang, Xuanren Song, Yun Li, Biqing Zhu, Jiang Bian, Stephen Sitch, Philippe Ciais, Pierre Friedlingstein, Zhu Liu

    Abstract: Mitigation of climate change will highly rely on a carbon emission trajectory that achieves carbon neutrality by the 2050s. The ocean plays a critical role in modulating climate change by sequestering CO2 from the atmosphere. Relying on the multidisciplinary cutting-edge methodologies and technologies, the near-real-time monitoring of global ocean carbon sinks from January 2022 to July 2023 aims t…

    Submitted 4 December, 2023; originally announced December 2023.

  25. arXiv:2310.17082  [pdf, ps, other]

    astro-ph.HE

    Does or did the supernova remnant Cassiopeia A operate as a PeVatron?

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: For decades, supernova remnants (SNRs) have been considered the prime sources of Galactic Cosmic rays (CRs). But whether SNRs can accelerate CR protons to PeV energies and thus dominate CR flux up to the knee is currently under intensive theoretical and phenomenological debate. The direct test of the ability of SNRs to operate as CR PeVatrons can be provided by ultrahigh-energy (UHE;…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 11 pages, 3 figures, Accepted by the APJL

  26. arXiv:2310.10311  [pdf, other]

    physics.flu-dyn

    Dynamics-augmented cluster-based network model

    Authors: Chang Hou, Nan Deng, Bernd R. Noack

    Abstract: In this study, we propose a novel data-driven reduced-order model for complex dynamics, including nonlinear, multi-attractor, multi-frequency, and multiscale behaviours. The starting point is a fully automatable cluster-based network model (CNM) (Li et al. J. Fluid Mech. vol.906, 2021, A21) which kinematically coarse-grains the state with clusters and dynamically predicts the transitions in a netw…

    Submitted 1 March, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

  27. Very high energy gamma-ray emission beyond 10 TeV from GRB 221009A

    Authors: Zhen Cao, F. Aharonian, Q. An, A. Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: The highest energy gamma-rays from gamma-ray bursts (GRBs) have important implications for their radiation mechanism. Here we report for the first time the detection of gamma-rays up to 13 TeV from the brightest GRB 221009A by the Large High Altitude Air-shower Observatory (LHAASO). The LHAASO-KM2A detector registered more than 140 gamma-rays with energies above 3 TeV during 230$-$900s after the t…

    Submitted 22 November, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: 49 pages, 11 figures

    Journal ref: Science Advances, 9, eadj2778 (2023) 15 November 2023

  28. arXiv:2309.15376  [pdf, other]

    cs.LG

    ADGym: Design Choices for Deep Anomaly Detection

    Authors: Minqi Jiang, Chaochuan Hou, Ao Zheng, Songqiao Han, Hailiang Huang, Qingsong Wen, Xiyang Hu, Yue Zhao

    Abstract: Deep learning (DL) techniques have recently found success in anomaly detection (AD) across various fields such as finance, medical services, and cloud computing. However, most of the current research tends to view deep AD algorithms as a whole, without dissecting the contributions of individual design choices like loss functions and network architectures. This view tends to diminish the value of p…

    Submitted 29 October, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: NeurIPS 2023. The first three authors contribute equally. Code available at https://github.com/Minqi824/ADGym

  29. arXiv:2309.08148  [pdf, other]

    math.CA

    Multifractal analysis of a class of self-affine Moran sets

    Authors: Yifei Gu, Chuanyan Hou, Jun Jie Miao

    Abstract: In the paper, we investigate the fine multifractal spectrum of a class of self-affine Moran sets with fixed frequencies, and we prove that under certain separation conditions, the fine multifractal spectrum $H(α)$ is given by the formula $$ H(α)=\inf_{-\infty<t<+\infty} \{αt+β(t)\}. $$

    Submitted 15 September, 2023; originally announced September 2023.

  30. arXiv:2309.08145  [pdf, other]

    math.CA

    Dimensions of a class of self-affine Moran sets and measures in $\R^2$

    Authors: Yifei Gu, Chuanyan Hou, Jun Jie Miao

    Abstract: For each integer $k>0$, let $n_k$ and $m_k$ be integers such that $n_k\geq 2, m_k\geq 2$, and let $\mathcal{D}_k$ be a subset of $\{0,\dots,n_k-1\}\times \{0,\dots,m_k-1\}$. For each $w=(i,j)\in \mathcal{D}_k$, we define an affine transformation on $\R^2$ by $$ Φ_w(x)=T_k(x+w), \qquad w\in\mathcal{D}_k, $$ where $T_k=\operatorname{diag}(n_k^{-1},m_k^{-1})$. The non-empty compact set…

    Submitted 15 September, 2023; originally announced September 2023.

  31. arXiv:2309.04747  [pdf, other]

    cs.CV

    When to Learn What: Model-Adaptive Data Augmentation Curriculum

    Authors: Chengkai Hou, Jieyu Zhang, Tianyi Zhou

    Abstract: Data augmentation (DA) is widely used to improve the generalization of neural networks by enforcing the invariances and symmetries to pre-defined transformations applied to input data. However, a fixed augmentation policy may have different effects on each sample in different training stages but existing approaches cannot adjust the policy to be adaptive to each sample and the training model. In t…

    Submitted 30 September, 2023; v1 submitted 9 September, 2023; originally announced September 2023.

    Comments: Our paper is accepted by ICCV 2023

  32. arXiv:2308.00177  [pdf, other]

    cs.LG cs.AI

    Pretrained deep models outperform GBDTs in Learning-To-Rank under label scarcity

    Authors: Charlie Hou, Kiran Koshy Thekumparampil, Michael Shavlovsky, Giulia Fanti, Yesh Dattatreya, Sujay Sanghavi

    Abstract: On tabular data, a significant body of literature has shown that current deep learning (DL) models perform at best similarly to Gradient Boosted Decision Trees (GBDTs), while significantly underperforming them on outlier data. However, these works often study idealized problem settings which may fail to capture complexities of real-world scenarios. We identify a natural tabular data setting where…

    Submitted 25 June, 2024; v1 submitted 31 July, 2023; originally announced August 2023.

    Comments: ICML-MFPL 2023 Workshop Oral, SPIGM@ICML2024

  33. MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving

    Authors: Zirui Wu, Tianyu Liu, Liyi Luo, Zhide Zhong, Jianteng Chen, Hongmin Xiao, Chao Hou, Haozhe Lou, Yuantao Chen, Runyi Yang, Yuxin Huang, Xiaoyu Ye, Zike Yan, Yongliang Shi, Yiyi Liao, Hao Zhao

    Abstract: Nowadays, autonomous cars can drive smoothly in ordinary cases, and it is widely recognized that realistic sensor simulation will play a critical role in solving remaining corner cases by simulating them. To this end, we propose an autonomous driving simulator based upon neural radiance fields (NeRFs). Compared with existing works, ours has three notable features: (1) Instance-aware. Our simulator…

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: CICAI 2023, project page with code: https://open-air-sun.github.io/mars/

  34. arXiv:2306.15925  [pdf, other]

    cs.CV

    Subclass-balancing Contrastive Learning for Long-tailed Recognition

    Authors: Chengkai Hou, Jieyu Zhang, Haonan Wang, Tianyi Zhou

    Abstract: Long-tailed recognition with imbalanced class distribution naturally emerges in practical machine learning applications. Existing methods such as data reweighing, resampling, and supervised contrastive learning enforce the class balance with a price of introducing imbalance between instances of head class and tail class, which may ignore the underlying rich semantic substructures of the former and…

    Submitted 9 September, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

  35. arXiv:2305.18616  [pdf]

    cs.CY cs.AI

    Embrace Opportunities and Face Challenges: Using ChatGPT in Undergraduate Students' Collaborative Interdisciplinary Learning

    Authors: Gaoxia Zhu, Xiuyi Fan, Chenyu Hou, Tianlong Zhong, Peter Seow, Annabel Chen Shen-Hsing, Preman Rajalingam, Low Kin Yew, Tan Lay Poh

    Abstract: ChatGPT, launched in November 2022, has gained widespread attention from students and educators globally, with an online report by Hu (2023) stating it as the fastest-growing consumer application in history. While discussions on the use of ChatGPT in higher education are abundant, empirical studies on its impact on collaborative interdisciplinary learning are rare. To investigate its potential, we…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 33 pages, 2 figures, 5 tables

  36. arXiv:2305.17030  [pdf, other]

    astro-ph.HE hep-ph

    The First LHAASO Catalog of Gamma-Ray Sources

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: We present the first catalog of very-high energy and ultra-high energy gamma-ray sources detected by the Large High Altitude Air Shower Observatory (LHAASO). The catalog was compiled using 508 days of data collected by the Water Cherenkov Detector Array (WCDA) from March 2021 to September 2022 and 933 days of data recorded by the Kilometer Squared Array (KM2A) from January 2020 to September 2022.…

    Submitted 27 November, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: 40 pages, 13 figures, 4 tables

    Journal ref: The Astrophysical Journal Supplement Series, 271 (2024) 25

  37. Adaptive Learning based Upper-Limb Rehabilitation Training System with Collaborative Robot

    Authors: Jun Hong Lim, Kaibo He, Zeji Yi, Chen Hou, Chen Zhang, Yanan Sui, Luming Li

    Abstract: Rehabilitation training for patients with motor disabilities usually requires specialized devices in rehabilitation centers. Home-based multi-purpose training would significantly increase treatment accessibility and reduce medical costs. While it is unlikely to equip a set of rehabilitation robots at home, we investigate the feasibility to use the general-purpose collaborative robot for rehabilita…

    Submitted 12 July, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Journal ref: EMBC2023

  38. arXiv:2305.09098  [pdf, other]

    cs.CL cs.LG

    Weight-Inherited Distillation for Task-Agnostic BERT Compression

    Authors: Taiqiang Wu, Cheng Hou, Shanshan Lao, Jiayi Li, Ngai Wong, Zhe Zhao, Yujiu Yang

    Abstract: Knowledge Distillation (KD) is a predominant approach for BERT compression. Previous KD-based methods focus on designing extra alignment losses for the student model to mimic the behavior of the teacher model. These methods transfer the knowledge in an indirect way. In this paper, we propose a novel Weight-Inherited Distillation (WID), which directly transfers knowledge from the teacher. WID does…

    Submitted 20 March, 2024; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: 9 pages, 4 figures, NAACL2024 findings

  39. arXiv:2305.07527  [pdf, other]

    physics.optics

    Evaporation characteristics of Er$^{3+}$ doped silica fiber and its application in the preparation of whispering gallery mode lasers

    Authors: Angzhen Li, Jonathan M. Ward, Ke Tian, Jibo Yu, Shengfei She, Chaoqi Hou, Haitao Guo, Síle Nic Chormaic, Pengfei Wang

    Abstract: The fabrication of whispering gallery lasers (WGL) is used to experimentally evaluate the evaporation rate (mol/$μ$m) and ratio (mol/mol) of erbium and silica lost from a doped fiber during heating. Fixed lengths of doped silica fiber are spliced to different lengths of undoped fiber and then evaporated by feeding into the focus of a CO$_{2}$ laser. During evaporation, erbium ions are precipitated…

    Submitted 12 May, 2023; originally announced May 2023.

  40. Measurement of ultra-high-energy diffuse gamma-ray emission of the Galactic plane from 10 TeV to 1 PeV with LHAASO-KM2A

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: The diffuse Galactic $γ$-ray emission, mainly produced via interactions between cosmic rays and the interstellar medium and/or radiation field, is a very important probe of the distribution, propagation, and interaction of cosmic rays in the Milky Way. In this work we report the measurements of diffuse $γ$-rays from the Galactic plane between 10 TeV and 1 PeV energies, with the square kilometer ar…

    Submitted 19 August, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 12 pages, 8 figures, 5 tables; accepted for publication in Physical Review Letters; source mask file provided as ancillary file

    Journal ref: Phys. Rev. Lett. 131, 151001 (2023)

  41. arXiv:2304.01234  [pdf, other]

    astro-ph.SR astro-ph.EP cs.LG physics.plasm-ph physics.space-ph

    Prediction of solar wind speed by applying convolutional neural network to potential field source surface (PFSS) magnetograms

    Authors: Rong Lin, Zhekai Luo, Jiansen He, Lun Xie, Chuanpeng Hou, Shuwei Chen

    Abstract: An accurate solar wind speed model is important for space weather predictions, catastrophic event warnings, and other issues concerning solar wind - magnetosphere interaction. In this work, we construct a model based on convolutional neural network (CNN) and Potential Field Source Surface (PFSS) magnetograms, considering a solar wind source surface of $R_{\rm SS}=2.5R_\odot$, aiming to predict the…

    Submitted 3 April, 2023; originally announced April 2023.

  42. arXiv:2302.09042  [pdf, other]

    cs.LG cs.AI cs.DC

    Privately Customizing Prefinetuning to Better Match User Data in Federated Learning

    Authors: Charlie Hou, Hongyuan Zhan, Akshat Shrivastava, Sid Wang, Aleksandr Livshits, Giulia Fanti, Daniel Lazar

    Abstract: In Federated Learning (FL), accessing private client data incurs communication and privacy costs. As a result, FL deployments commonly prefinetune pretrained foundation models on a (large, possibly public) dataset that is held by the central server; they then FL-finetune the model on a private, federated dataset held by clients. Evaluating prefinetuning dataset quality reliably and privately is th…

    Submitted 22 February, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

  43. arXiv:2302.08062  [pdf]

    cs.CV cs.AI q-bio.PE

    Fossil Image Identification using Deep Learning Ensembles of Data Augmented Multiviews

    Authors: Chengbin Hou, Xinyu Lin, Hanhui Huang, Sheng Xu, Junxuan Fan, Yukun Shi, Hairong Lv

    Abstract: Identification of fossil species is crucial to evolutionary studies. Recent advances from deep learning have shown promising prospects in fossil image identification. However, the quantity and quality of labeled fossil images are often limited due to fossil preservation, conditioned sampling, and expensive and inconsistent label annotation by domain experts, which pose great challenges to training…

    Submitted 1 February, 2024; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: published in Methods in Ecology and Evolution

    Journal ref: Methods in Ecology and Evolution, 14, 3020-3034 (2023)

  44. arXiv:2302.04549  [pdf, other]

    cs.LG cs.AI

    Weakly Supervised Anomaly Detection: A Survey

    Authors: Minqi Jiang, Chaochuan Hou, Ao Zheng, Xiyang Hu, Songqiao Han, Hailiang Huang, Xiangnan He, Philip S. Yu, Yue Zhao

    Abstract: Anomaly detection (AD) is a crucial task in machine learning with various applications, such as detecting emerging diseases, identifying financial frauds, and detecting fake news. However, obtaining complete, accurate, and precise labels for AD tasks can be expensive and challenging due to the cost and difficulties in data annotation. To address this issue, researchers have developed AD methods th…

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: Code available at https://github.com/yzhao062/wsad

  45. arXiv:2212.06385  [pdf, other]

    cs.CL

    TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities

    Authors: Zhe Zhao, Yudong Li, Cheng Hou, Jing Zhao, Rong Tian, Weijie Liu, Yiren Chen, Ningyuan Sun, Haoyan Liu, Weiquan Mao, Han Guo, Weigang Guo, Taiqiang Wu, Tao Zhu, Wenhang Shi, Chen Chen, Shan Huang, Sihong Chen, Liqun Liu, Feifei Li, Xiaoshuai Chen, Xingwu Sun, Zhanhui Kang, Xiaoyong Du, Linlin Shen, et al. (1 additional author not shown)

    Abstract: Recently, the success of pre-training in text domain has been fully extended to vision, audio, and cross-modal scenarios. The proposed pre-training models of different modalities are showing a rising trend of homogeneity in their model structures, which brings the opportunity to implement different pre-training models within a uniform framework. In this paper, we present TencentPretrain, a toolkit…

    Submitted 11 July, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

  46. arXiv:2211.07459  [pdf, other]

    cs.CV cs.RO

    Self-Aligning Depth-regularized Radiance Fields for Asynchronous RGB-D Sequences

    Authors: Yuxin Huang, Andong Yang, Zirui Wu, Yuantao Chen, Runyi Yang, Zhenxin Zhu, Chao Hou, Hao Zhao, Guyue Zhou

    Abstract: It has been shown that learning radiance fields with depth rendering and depth supervision can effectively promote the quality and convergence of view synthesis. However, this paradigm requires input RGB-D sequences to be synchronized, hindering its usage in the UAV city modeling scenario. As there exists asynchrony between RGB images and depth images due to high-speed flight, we propose a novel t…

    Submitted 4 April, 2024; v1 submitted 14 November, 2022; originally announced November 2022.

  47. arXiv:2210.12914  [pdf, other]

    cs.LG math.NA

    A Novel Adaptive Causal Sampling Method for Physics-Informed Neural Networks

    Authors: Jia Guo, Haifeng Wang, Chenping Hou

    Abstract: Physics-Informed Neural Networks (PINNs) have become a kind of attractive machine learning method for obtaining solutions of partial differential equations (PDEs). Training PINNs can be seen as a semi-supervised learning task, in which only exact values of initial and boundary points can be obtained in solving forward problems, and in the whole spatio-temporal domain collocation points are sampled…

    Submitted 23 October, 2022; originally announced October 2022.

  48. arXiv:2210.12020  [pdf, other]

    cs.LG cs.AI

    HCL: Improving Graph Representation with Hierarchical Contrastive Learning

    Authors: Jun Wang, Weixun Li, Changyu Hou, Xin Tang, Yixuan Qiao, Rui Fang, Pengyong Li, Peng Gao, Guotong Xie

    Abstract: Contrastive learning has emerged as a powerful tool for graph representation learning. However, most contrastive learning methods learn features of graphs with fixed coarse-grained scale, which might underestimate either local or global information. To capture more hierarchical and richer representation, we propose a novel Hierarchical Contrastive Learning (HCL) framework that explicitly learns gr…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: published at The 21st International Semantic Web Conference (ISWC 2022)

  49. arXiv:2210.10324  [pdf, ps, other]

    astro-ph.SR physics.space-ph

    Detecting the oscillation and propagation of the nascent dynamic solar wind structure at 2.6 solar radii using VLBI radio telescopes

    Authors: Maoli Ma, Guifre Molera Calves, Giuseppe Cimo, Ming Xiong, Peijia Li, Jing Kong, Peijin Zhang, Jiansen He, Lijia Liu, Pradyumna Kummamuru, Chuanpeng Hou, Jasper Edwards, Qinghui Liu, Zhong Chen, Zhanghu Chu, De Wu, Xu Zhao, Zhichao Wang, Songtao Han Quanquan Zhi, Yingkai Liu, Jonathan Quick, Javier Gonzalez, Cristina Garcia Miro, Mikhail Kharinov, Andrey Mikhailov, et al. (7 additional authors not shown)

    Abstract: Probing the solar corona is crucial to study the coronal heating and solar wind acceleration. However, the transient and inhomogeneous solar wind flows carry large-amplitude inherent Alfven waves and turbulence, which make detection more difficult. We report the oscillation and propagation of the solar wind at 2.6 solar radii (Rs) by observation of China Tianwen and ESA Mars Express with radio tel…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 13 pages, 9 figures

  50. arXiv:2209.08498  [pdf, other]

    cs.CV cs.RO

    LATITUDE: Robotic Global Localization with Truncated Dynamic Low-pass Filter in City-scale NeRF

    Authors: Zhenxin Zhu, Yuantao Chen, Zirui Wu, Chao Hou, Yongliang Shi, Chuxuan Li, Pengfei Li, Hao Zhao, Guyue Zhou

    Abstract: Neural Radiance Fields (NeRFs) have made great success in representing complex 3D scenes with high-resolution details and efficient memory. Nevertheless, current NeRF-based pose estimators have no initial pose prediction and are prone to local optima during optimization. In this paper, we present LATITUDE: Global Localization with Truncated Dynamic Low-pass Filter, which introduces a two-stage loc…

    Submitted 27 February, 2023; v1 submitted 18 September, 2022; originally announced September 2022.

    Comments: 7 pages, 6 figures, ICRA 2023