Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 938 results for author: Gong, Y

.
  1. arXiv:2406.13205  [pdf

    eess.IV cs.CV

    Application of Computer Deep Learning Model in Diagnosis of Pulmonary Nodules

    Authors: Yutian Yang, Hongjie Qiu, Yulu Gong, Xiaoyi Liu, Yang Lin, Muqing Li

    Abstract: The 3D simulation model of the lung was established by using the reconstruction method. A computer aided pulmonary nodule detection model was constructed. The process iterates over the images to refine the lung nodule recognition model based on neural networks. It is integrated with 3D virtual modeling technology to improve the interactivity of the system, so as to achieve intelligent recognition… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    MSC Class: 68T10; 92C50

  2. arXiv:2406.11158  [pdf, other

    eess.SY

    Dynamic Modeling and Control for an Offshore Semisubmersible Floating Wind Turbine

    Authors: Yingjie Gong, Qinmin Yang, Hua Geng, Wenchao Meng, Lin Wang

    Abstract: Floating wind turbines (FWTs) hold significant potential for the exploitation of offshore renewable energy resources. Nevertheless, prior to the construction of FWTs, it is imperative to tackle several critical challenges, especially the issue of performance degradation under combined wind and wave loads. This study initiates with the development of a simplified nonlinear dynamical model for a sem… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  3. arXiv:2406.10082  [pdf, other

    eess.AS cs.CV cs.SD

    Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation

    Authors: Andrew Rouditchenko, Yuan Gong, Samuel Thomas, Leonid Karlinsky, Hilde Kuehne, Rogerio Feris, James Glass

    Abstract: Audio-Visual Speech Recognition (AVSR) uses lip-based video to improve performance in noise. Since videos are harder to obtain than audio, the video training data of AVSR models is usually limited to a few thousand hours. In contrast, speech models such as Whisper are trained with hundreds of thousands of hours of data, and thus learn a better speech-to-text decoder. The huge training data differe… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Interspeech 2024. Code https://github.com/roudimit/whisper-flamingo

  4. arXiv:2406.09710  [pdf, other

    cs.CV cs.AI

    Fine-Grained Urban Flow Inference with Multi-scale Representation Learning

    Authors: Shilu Yuan, Dongfeng Li, Wei Liu, Xinxin Zhang, Meng Chen, Junjie Zhang, Yongshun Gong

    Abstract: Fine-grained urban flow inference (FUFI) is a crucial transportation service aimed at improving traffic efficiency and safety. FUFI can infer fine-grained urban traffic flows based solely on observed coarse-grained data. However, most of existing methods focus on the influence of single-scale static geographic information on FUFI, neglecting the interactions and dynamic information between differe… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  5. arXiv:2406.09321  [pdf, other

    cs.CR cs.AI cs.CL

    JailbreakEval: An Integrated Toolkit for Evaluating Jailbreak Attempts Against Large Language Models

    Authors: Delong Ran, Jinyuan Liu, Yichen Gong, Jingyi Zheng, Xinlei He, Tianshuo Cong, Anyu Wang

    Abstract: Jailbreak attacks aim to induce Large Language Models (LLMs) to generate harmful responses for forbidden instructions, presenting severe misuse threats to LLMs. Up to now, research into jailbreak attacks and defenses is emerging, however, there is (surprisingly) no consensus on how to evaluate whether a jailbreak attempt is successful. In other words, the methods to assess the harmfulness of an LL… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Our code is available at https://github.com/ThuCCSLab/JailbreakEval

  6. arXiv:2406.06558  [pdf, other

    cs.CL cs.AI

    Enhancing Text Authenticity: A Novel Hybrid Approach for AI-Generated Text Detection

    Authors: Ye Zhang, Qian Leng, Mengran Zhu, Rui Ding, Yue Wu, Jintong Song, Yulu Gong

    Abstract: The rapid advancement of Large Language Models (LLMs) has ushered in an era where AI-generated text is increasingly indistinguishable from human-generated content. Detecting AI-generated text has become imperative to combat misinformation, ensure content authenticity, and safeguard against malicious uses of AI. In this paper, we propose a novel hybrid approach that combines traditional TF-IDF tech… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  7. arXiv:2406.06007  [pdf, other

    cs.LG cs.CL cs.CV cs.CY

    CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models

    Authors: Peng Xia, Ze Chen, Juanxi Tian, Yangrui Gong, Ruibo Hou, Yue Xu, Zhenbang Wu, Zhiyuan Fan, Yiyang Zhou, Kangyu Zhu, Wenhao Zheng, Zhaoyang Wang, Xiao Wang, Xuchao Zhang, Chetan Bansal, Marc Niethammer, Junzhou Huang, Hongtu Zhu, Yun Li, Jimeng Sun, Zongyuan Ge, Gang Li, James Zou, Huaxiu Yao

    Abstract: Artificial intelligence has significantly impacted medical applications, particularly with the advent of Medical Large Vision Language Models (Med-LVLMs), sparking optimism for the future of automated and personalized healthcare. However, the trustworthiness of Med-LVLMs remains unverified, posing significant risks for future model deployment. In this paper, we introduce CARES and aim to comprehen… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  8. arXiv:2406.05759  [pdf, ps, other

    math.CO math.PR math.SP

    Chebyshev Moment Method for Regular Graphs I: Kesten-McKay and Semicircle distributions

    Authors: Yulin Gong, Wenbo Li, Shiping Liu

    Abstract: We develop the Chebyshev moment method to study the spectrum of regular graphs, motivated by the work of SerrĂ©. By this method, we give an elementary proof of the weak convergence to the Kesten-McKay distribution for the normalized spectral measures of random $N$-lifts in probability as $N$ tends to infinity. For a sequence of random $(q_n+1)$-regular graphs $G_n$ with $n$ vertices, we show that i… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    MSC Class: 05C31; 05C50; 05C80; 60B20

  9. arXiv:2406.01719  [pdf, other

    astro-ph.IM astro-ph.GA

    Imputation of Missing Photometric Data and Photometric Redshift Estimation for CSST

    Authors: Zhijian Luo, Zhirui Tang, Zhu Chen, Liping Fu, Wei Du, Shaohua Zhang, Yan Gong, Chenggang Shu, Junhao Lu, Yicheng Li, Xian-Min Meng, Xingchen Zhou, Zuhui Fan

    Abstract: Accurate photometric redshift (photo-$z$) estimation requires support from multi-band observational data. However, in the actual process of astronomical observations and data processing, some sources may have missing observational data in certain bands for various reasons. This could greatly affect the accuracy and reliability of photo-$z$ estimation for these sources, and even render some estimat… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  10. arXiv:2405.21045  [pdf

    cs.LG

    An Attention-Based Multi-Context Convolutional Encoder-Decoder Neural Network for Work Zone Traffic Impact Prediction

    Authors: Qinhua Jiang, Xishun Liao, Yaofa Gong, Jiaqi Ma

    Abstract: Work zone is one of the major causes of non-recurrent traffic congestion and road incidents. Despite the significance of its impact, studies on predicting the traffic impact of work zones remain scarce. In this paper, we propose a data integration pipeline that enhances the utilization of work zone and traffic data from diversified platforms, and introduce a novel deep learning model to predict th… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  11. arXiv:2405.20234  [pdf, other

    cs.AI

    Context Injection Attacks on Large Language Models

    Authors: Cheng'an Wei, Kai Chen, Yue Zhao, Yujia Gong, Lu Xiang, Shenchen Zhu

    Abstract: Large Language Models (LLMs) such as ChatGPT and Llama-2 have become prevalent in real-world applications, exhibiting impressive text generation performance. LLMs are fundamentally developed from a scenario where the input data remains static and lacks a clear structure. To behave interactively over time, LLM-based chat systems must integrate additional contextual information (i.e., chat history)… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  12. arXiv:2405.19943  [pdf, other

    cs.CV

    Multi-View People Detection in Large Scenes via Supervised View-Wise Contribution Weighting

    Authors: Qi Zhang, Yunfei Gong, Daijie Chen, Antoni B. Chan, Hui Huang

    Abstract: Recent deep learning-based multi-view people detection (MVD) methods have shown promising results on existing datasets. However, current methods are mainly trained and evaluated on small, single scenes with a limited number of multi-view frames and fixed camera views. As a result, these methods may not be practical for detecting people in larger, more complex scenes with severe occlusions and came… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: AAAI 2024

  13. arXiv:2405.18767  [pdf, other

    astro-ph.GA

    Kinetic temperature of massive star-forming molecular clumps measured with formaldehyde V. The massive filament DR21

    Authors: X. Zhao, X. D. Tang, C. Henkel, Y. Gong, Y. Lin, D. L. Li, Y. X. He, Y. P. Ao, X. Lu, T. Liu, Y. Sun, K. Wang, X. P. Chen, J. Esimbek, J. J. Zhou, J. W. Wu, J. J. Qiu, X. W. Zheng, J. S. Li, C. S. Luo, Q. Zhao

    Abstract: The kinetic temperature structure of the massive filament DR21 has been mapped using the IRAM 30 m telescope. This mapping employed the para-H$_2$CO triplet ($J_{\rm K_aK_c}$ = 3$_{03}$--2$_{02}$, 3$_{22}$--2$_{21}$, and 3$_{21}$--2$_{20}$) on a scale of $\sim$0.1 pc. By modeling the averaged line ratios of para-H$_{2}$CO with RADEX under non-LTE assumptions, the kinetic temperature of the dense g… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 16 pages, 8 figures, 3 tabels. Accepted for publication by Astronomy & Astrophysics

  14. arXiv:2405.16093  [pdf, other

    cs.CV

    Diverse Teacher-Students for Deep Safe Semi-Supervised Learning under Class Mismatch

    Authors: Qikai Wang, Rundong He, Yongshun Gong, Chunxiao Ren, Haoliang Sun, Xiaoshui Huang, Yilong Yin

    Abstract: Semi-supervised learning can significantly boost model performance by leveraging unlabeled data, particularly when labeled data is scarce. However, real-world unlabeled data often contain unseen-class samples, which can hinder the classification of seen classes. To address this issue, mainstream safe SSL methods suggest detecting and discarding unseen-class samples from unlabeled data. Nevertheles… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  15. arXiv:2405.13158  [pdf

    cond-mat.mtrl-sci

    Towards establishing best practice in the analysis of hydrogen and deuterium by atom probe tomography

    Authors: Baptiste Gault, Aparna Saksena, Xavier Sauvage, Paul Bagot, Leonardo S. Aota, Jonas Arlt, Lisa T. Belkacemi, Torben Boll, Yi-Sheng Chen, Luke Daly, Milos B. Djukic, James O. Douglas, Maria J. Duarte, Peter J. Felfer, Richard G. Forbes, Jing Fu, Hazel M. Gardner, Ryota Gemma, Stephan S. A. Gerstl, Yilun Gong, Guillaume Hachet, Severin Jakob, Benjamin M. Jenkins, Megan E. Jones, Heena Khanchandani , et al. (20 additional authors not shown)

    Abstract: As hydrogen is touted as a key player in the decarbonization of modern society, it is critical to enable quantitative H analysis at high spatial resolution, if possible at the atomic scale. Indeed, H has a known deleterious impact on the mechanical properties (strength, ductility, toughness) of most materials that can hinder their use as part of the infrastructure of a hydrogen-based economy. Enab… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  16. MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels

    Authors: Qi Chen, Xiubo Geng, Corby Rosset, Carolyn Buractaon, Jingwen Lu, Tao Shen, Kun Zhou, Chenyan Xiong, Yeyun Gong, Paul Bennett, Nick Craswell, Xing Xie, Fan Yang, Bryan Tower, Nikhil Rao, Anlei Dong, Wenqi Jiang, Zheng Liu, Mingqin Li, Chuanjie Liu, Zengzhong Li, Rangan Majumder, Jennifer Neville, Andy Oakley, Knut Magne Risvik , et al. (6 additional authors not shown)

    Abstract: Recent breakthroughs in large models have highlighted the critical significance of data scale, labels and modals. In this paper, we introduce MS MARCO Web Search, the first large-scale information-rich web dataset, featuring millions of real clicked query-document labels. This dataset closely mimics real-world web document and query distribution, provides rich information for various kinds of down… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 10 pages, 6 figures, for associated dataset, see http://github.com/microsoft/MS-MARCO-Web-Search

  17. arXiv:2405.07022  [pdf, other

    cs.LG cs.DB

    DTMamba : Dual Twin Mamba for Time Series Forecasting

    Authors: Zexue Wu, Yifeng Gong, Aoqian Zhang

    Abstract: We utilized the Mamba model for time series data prediction tasks, and the experimental results indicate that our model performs well.

    Submitted 11 May, 2024; originally announced May 2024.

  18. arXiv:2405.06389  [pdf, other

    cs.CV cs.AI

    Continual Novel Class Discovery via Feature Enhancement and Adaptation

    Authors: Yifan Yu, Shaokun Wang, Yuhang He, Junzhe Chen, Yihong Gong

    Abstract: Continual Novel Class Discovery (CNCD) aims to continually discover novel classes without labels while maintaining the recognition capability for previously learned classes. The main challenges faced by CNCD include the feature-discrepancy problem, the inter-session confusion problem, etc. In this paper, we propose a novel Feature Enhancement and Adaptation method for the CNCD to tackle the above… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  19. arXiv:2405.05446  [pdf, other

    cs.CV cs.AI cs.GR cs.LG eess.IV

    GDGS: Gradient Domain Gaussian Splatting for Sparse Representation of Radiance Fields

    Authors: Yuanhao Gong

    Abstract: The 3D Gaussian splatting methods are getting popular. However, they work directly on the signal, leading to a dense representation of the signal. Even with some techniques such as pruning or distillation, the results are still dense. In this paper, we propose to model the gradient of the original signal. The gradients are much sparser than the original signal. Therefore, the gradients use much le… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2404.09105

  20. arXiv:2405.04719  [pdf, other

    astro-ph.GA

    First detection of CF$^{+}$ in the Large Magellanic Cloud

    Authors: Yan Gong, Karl M. Menten, Arshia M. Jacob, Christian Henkel, C. -H. Rosie Chen

    Abstract: CF$^{+}$ has been established as a valuable diagnostic tool for investigating photo-dissociation regions (PDRs) and fluorine abundances in the Milky Way. However, its role in extragalactic environments remains largely uncharted. Our objective is to explore the significance of CF$^{+}$ in the Large Magellanic Cloud (LMC) and assess its utility as a valuable probe for examining C$^{+}$ and fluorine… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 9 pages, 6 figures, 1 table, accepted for publication in A&A

  21. How to Gain Commit Rights in Modern Top Open Source Communities?

    Authors: Xin Tan, Yan Gong, Geyu Huang, Haohua Wu, Li Zhang

    Abstract: The success of open source software (OSS) projects relies on voluntary contributions from various community roles.Being a committer signifies gaining trust and higher privileges. Substantial studies have focused on the requirements of becoming a committer, but most of them are based on interviews or several hypotheses, lacking a comprehensive understanding of committers' qualifications.We explore… ▽ More

    Submitted 16 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: 23 pages,5 figures,FSE 2024

    Journal ref: Proceedings of the ACM on Software Engineering (PACMSE) Issue FSE 2024

  22. arXiv:2405.00026  [pdf

    cs.CE cs.AI

    Enhancing Credit Card Fraud Detection A Neural Network and SMOTE Integrated Approach

    Authors: Mengran Zhu, Ye Zhang, Yulu Gong, Changxin Xu, Yafei Xiang

    Abstract: Credit card fraud detection is a critical challenge in the financial sector, demanding sophisticated approaches to accurately identify fraudulent transactions. This research proposes an innovative methodology combining Neural Networks (NN) and Synthet ic Minority Over-sampling Technique (SMOTE) to enhance the detection performance. The study addresses the inherent imbalance in credit card transact… ▽ More

    Submitted 26 February, 2024; originally announced May 2024.

  23. arXiv:2404.19087  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Deep Reinforcement Learning for Advanced Longitudinal Control and Collision Avoidance in High-Risk Driving Scenarios

    Authors: Dianwei Chen, Yaobang Gong, Xianfeng Yang

    Abstract: Existing Advanced Driver Assistance Systems primarily focus on the vehicle directly ahead, often overlooking potential risks from following vehicles. This oversight can lead to ineffective handling of high risk situations, such as high speed, closely spaced, multi vehicle scenarios where emergency braking by one vehicle might trigger a pile up collision. To overcome these limitations, this study i… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  24. arXiv:2404.18548  [pdf, ps, other

    gr-qc

    On the duality in constant-roll inflation

    Authors: Yue Wang, Qing Gao, Shengqing Gao, Yungui Gong

    Abstract: There is a duality in the observables $n_s$, $r$ and the inflaton potential between large and small $η_H$ for the constant-roll inflation if the slow-roll parameter $ε_H$ is negligible. In general, the duality between $η_H$ and $\barη_H$ does not hold for the background evolution of the inflation. For some particular solutions for the constant-roll inflation with $η_H$ being a constant, we find th… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 15 pages

  25. arXiv:2404.18419  [pdf

    cs.CV cs.AI

    Research on Intelligent Aided Diagnosis System of Medical Image Based on Computer Deep Learning

    Authors: Jiajie Yuan, Linxiao Wu, Yulu Gong, Zhou Yu, Ziang Liu, Shuyao He

    Abstract: This paper combines Struts and Hibernate two architectures together, using DAO (Data Access Object) to store and access data. Then a set of dual-mode humidity medical image library suitable for deep network is established, and a dual-mode medical image assisted diagnosis method based on the image is proposed. Through the test of various feature extraction methods, the optimal operating characteris… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  26. arXiv:2404.14678  [pdf, other

    cs.CV

    3DBench: A Scalable 3D Benchmark and Instruction-Tuning Dataset

    Authors: Junjie Zhang, Tianci Hu, Xiaoshui Huang, Yongshun Gong, Dan Zeng

    Abstract: Evaluating the performance of Multi-modal Large Language Models (MLLMs), integrating both point cloud and language, presents significant challenges. The lack of a comprehensive assessment hampers determining whether these models truly represent advancements, thereby impeding further progress in the field. Current evaluations heavily rely on classification and caption tasks, falling short in provid… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  27. arXiv:2404.13576  [pdf, other

    cs.CV cs.LG

    I2CANSAY:Inter-Class Analogical Augmentation and Intra-Class Significance Analysis for Non-Exemplar Online Task-Free Continual Learning

    Authors: Songlin Dong, Yingjie Chen, Yuhang He, Yuhan Jin, Alex C. Kot, Yihong Gong

    Abstract: Online task-free continual learning (OTFCL) is a more challenging variant of continual learning which emphasizes the gradual shift of task boundaries and learns in an online mode. Existing methods rely on a memory buffer composed of old samples to prevent forgetting. However,the use of memory buffers not only raises privacy concerns but also hinders the efficient learning of new samples. To addres… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  28. arXiv:2404.09318  [pdf, other

    stat.AP

    Unraveling stochastic fundamental diagrams considering empirical knowledge: modeling, limitation and further discussion

    Authors: Yuan-Zheng Lei, Yaobang Gong, Xianfeng Terry Yang

    Abstract: Traffic flow modeling relies heavily on fundamental diagrams. However, deterministic fundamental diagrams, such as single or multi-regime models, cannot capture the uncertainty pattern that underlies traffic flow. To address this limitation, a sparse non-parametric regression model is proposed in this paper to formulate the stochastic fundamental diagram. Unlike parametric stochastic fundamental d… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  29. arXiv:2404.09155  [pdf, other

    cs.LG cs.AI cs.CL

    Mitigating Heterogeneity among Factor Tensors via Lie Group Manifolds for Tensor Decomposition Based Temporal Knowledge Graph Embedding

    Authors: Jiang Li, Xiangdong Su, Yeyun Gong, Guanglai Gao

    Abstract: Recent studies have highlighted the effectiveness of tensor decomposition methods in the Temporal Knowledge Graphs Embedding (TKGE) task. However, we found that inherent heterogeneity among factor tensors in tensor decomposition significantly hinders the tensor fusion process and further limits the performance of link prediction. To overcome this limitation, we introduce a novel method that maps f… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  30. arXiv:2404.09105  [pdf, other

    cs.CV cs.AI cs.GR eess.IV

    EGGS: Edge Guided Gaussian Splatting for Radiance Fields

    Authors: Yuanhao Gong

    Abstract: The Gaussian splatting methods are getting popular. However, their loss function only contains the $\ell_1$ norm and the structural similarity between the rendered and input images, without considering the edges in these images. It is well-known that the edges in an image provide important information. Therefore, in this paper, we propose an Edge Guided Gaussian Splatting (EGGS) method that levera… ▽ More

    Submitted 22 April, 2024; v1 submitted 13 April, 2024; originally announced April 2024.

  31. arXiv:2404.08242  [pdf, other

    cs.NE cs.AI

    RLEMMO: Evolutionary Multimodal Optimization Assisted By Deep Reinforcement Learning

    Authors: Hongqiao Lian, Zeyuan Ma, Hongshu Guo, Ting Huang, Yue-Jiao Gong

    Abstract: Solving multimodal optimization problems (MMOP) requires finding all optimal solutions, which is challenging in limited function evaluations. Although existing works strike the balance of exploration and exploitation through hand-crafted adaptive strategies, they require certain expert knowledge, hence inflexible to deal with MMOP with different properties. In this paper, we propose RLEMMO, a Meta… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Accepted as full paper at GECCO 2024

  32. arXiv:2404.08239  [pdf, other

    cs.NE cs.AI

    Auto-configuring Exploration-Exploitation Tradeoff in Evolutionary Computation via Deep Reinforcement Learning

    Authors: Zeyuan Ma, Jiacheng Chen, Hongshu Guo, Yining Ma, Yue-Jiao Gong

    Abstract: Evolutionary computation (EC) algorithms, renowned as powerful black-box optimizers, leverage a group of individuals to cooperatively search for the optimum. The exploration-exploitation tradeoff (EET) plays a crucial role in EC, which, however, has traditionally been governed by manually designed rules. In this paper, we propose a deep reinforcement learning-based framework that autonomously conf… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Accepted as a full paper at GECCO 2024

  33. arXiv:2404.07965  [pdf, other

    cs.CL cs.AI

    Rho-1: Not All Tokens Are What You Need

    Authors: Zhenghao Lin, Zhibin Gou, Yeyun Gong, Xiao Liu, Yelong Shen, Ruochen Xu, Chen Lin, Yujiu Yang, Jian Jiao, Nan Duan, Weizhu Chen

    Abstract: Previous language model pre-training methods have uniformly applied a next-token prediction loss to all training tokens. Challenging this norm, we posit that ''Not all tokens in a corpus are equally important for language model training''. Our initial analysis examines token-level training dynamics of language model, revealing distinct loss patterns for different tokens. Leveraging these insights,… ▽ More

    Submitted 23 May, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: First two authors equal contribution

  34. arXiv:2404.07121  [pdf, other

    cs.IT eess.SP

    Digital Over-the-Air Computation: Achieving High Reliability via Bit-Slicing

    Authors: Jiawei Liu, Yi Gong, Kaibin Huang

    Abstract: 6G mobile networks aim to realize ubiquitous intelligence at the network edge via distributed learning, sensing, and data analytics. Their common operation is to aggregate high-dimensional data, which causes a communication bottleneck that cannot be resolved using traditional orthogonal multi-access schemes. A promising solution, called over-the-air computation (AirComp), exploits channels' wavefo… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  35. arXiv:2404.05236  [pdf, other

    cs.CV cs.GR

    Stylizing Sparse-View 3D Scenes with Hierarchical Neural Representation

    Authors: Y. Wang, A. Gao, Y. Gong, Y. Zeng

    Abstract: Recently, a surge of 3D style transfer methods has been proposed that leverage the scene reconstruction power of a pre-trained neural radiance field (NeRF). To successfully stylize a scene this way, one must first reconstruct a photo-realistic radiance field from collected images of the scene. However, when only sparse input views are available, pre-trained few-shot NeRFs often suffer from high-fr… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  36. arXiv:2404.05188  [pdf, other

    cs.CR cs.AI cs.CL

    Have You Merged My Model? On The Robustness of Large Language Model IP Protection Methods Against Model Merging

    Authors: Tianshuo Cong, Delong Ran, Zesen Liu, Xinlei He, Jinyuan Liu, Yichen Gong, Qi Li, Anyu Wang, Xiaoyun Wang

    Abstract: Model merging is a promising lightweight model empowerment technique that does not rely on expensive computing devices (e.g., GPUs) or require the collection of specific training data. Instead, it involves editing different upstream model parameters to absorb their downstream task capabilities. However, uncertified model merging can infringe upon the Intellectual Property (IP) rights of the origin… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Technical Report

  37. Constant-roll inflation with non-minimally derivative coupling

    Authors: Jie Liu, Yungui Gong, Zhu Yi

    Abstract: We investigate the constant-roll inflation with non-minimally kinetic coupling to the Einstein tensor. With the slow-roll parameter $η_φ= -\ddotφ/(H\dotφ)$ being a constant, we calculate the power spectra for scalar and tensor perturbations, and derive the expressions for the scalar spectral tilt $n_s$, the tensor spectral tilt $n_T$, and the tensor-to-scalar ratio $r$. We find that the expression… ▽ More

    Submitted 3 June, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

    Comments: 8 pages, accepted by Communications in Theoretical Physics

  38. arXiv:2404.04118  [pdf, other

    cs.LG cs.DC

    GNNBENCH: Fair and Productive Benchmarking for Single-GPU GNN System

    Authors: Yidong Gong, Pradeep Kumar

    Abstract: We hypothesize that the absence of a standardized benchmark has allowed several fundamental pitfalls in GNN System design and evaluation that the community has overlooked. In this work, we propose GNNBench, a plug-and-play benchmarking platform focused on system innovation. GNNBench presents a new protocol to exchange their captive tensor data, supports custom classes in System APIs, and allows au… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  39. arXiv:2404.01067  [pdf, other

    cs.CL

    Exploring the Mystery of Influential Data for Mathematical Reasoning

    Authors: Xinzhe Ni, Yeyun Gong, Zhibin Gou, Yelong Shen, Yujiu Yang, Nan Duan, Weizhu Chen

    Abstract: Selecting influential data for fine-tuning on downstream tasks is a key factor for both performance and computation efficiency. Recent works have shown that training with only limited data can show a superior performance on general tasks. However, the feasibility on mathematical reasoning tasks has not been validated. To go further, there exist two open questions for mathematical reasoning: how to… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  40. arXiv:2404.00323  [pdf, other

    cs.CV cs.LG

    CLIP-driven Outliers Synthesis for few-shot OOD detection

    Authors: Hao Sun, Rundong He, Zhongyi Han, Zhicong Lin, Yongshun Gong, Yilong Yin

    Abstract: Few-shot OOD detection focuses on recognizing out-of-distribution (OOD) images that belong to classes unseen during training, with the use of only a small number of labeled in-distribution (ID) images. Up to now, a mainstream strategy is based on large-scale vision-language models, such as CLIP. However, these methods overlook a crucial issue: the lack of reliable OOD supervision information, whic… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 9 pages,5 figures

  41. arXiv:2403.18201  [pdf, other

    cs.CV

    Few-shot Online Anomaly Detection and Segmentation

    Authors: Shenxing Wei, Xing Wei, Zhiheng Ma, Songlin Dong, Shaochen Zhang, Yihong Gong

    Abstract: Detecting anomaly patterns from images is a crucial artificial intelligence technique in industrial applications. Recent research in this domain has emphasized the necessity of a large volume of training data, overlooking the practical scenario where, post-deployment of the model, unlabeled data containing both normal and abnormal samples can be utilized to enhance the model's performance. Consequ… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  42. Discovery of widespread non-metastable ammonia masers in the Milky Way

    Authors: Y. T. Yan, C. Henkel, K. M. Menten, T. L. Wilson, A. Wootten, Y. Gong, F. Wyrowski, W. Yang, A. Brunthaler, A. Kraus, B. Winkel

    Abstract: We present the results of a search for ammonia maser emission in 119 Galactic high-mass star-forming regions (HMSFRs) known to host 22 GHz H$_2$O maser emission. Our survey has led to the discovery of non-metastable NH$_3$ inversion line masers toward 14 of these sources. This doubles the number of known non-metastable ammonia masers in our Galaxy, including nine new very high excitation ($J,K$)~=… ▽ More

    Submitted 12 May, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: 14 pages, 4 tables, 9 figures. Accepted for publication in A&A

    Journal ref: A&A 686, A205 (2024)

  43. arXiv:2403.17549  [pdf

    cs.AI cs.CV

    Practical Applications of Advanced Cloud Services and Generative AI Systems in Medical Image Analysis

    Authors: Jingyu Xu, Binbin Wu, Jiaxin Huang, Yulu Gong, Yifan Zhang, Bo Liu

    Abstract: The medical field is one of the important fields in the application of artificial intelligence technology. With the explosive growth and diversification of medical data, as well as the continuous improvement of medical needs and challenges, artificial intelligence technology is playing an increasingly important role in the medical field. Artificial intelligence technologies represented by computer… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  44. arXiv:2403.16443  [pdf, other

    cs.CL cs.AI cs.SE

    CodeS: Natural Language to Code Repository via Multi-Layer Sketch

    Authors: Daoguang Zan, Ailun Yu, Wei Liu, Dong Chen, Bo Shen, Wei Li, Yafen Yao, Yongshun Gong, Xiaolin Chen, Bei Guan, Zhiguang Yang, Yongji Wang, Qianxiang Wang, Lizhen Cui

    Abstract: The impressive performance of large language models (LLMs) on code-related tasks has shown the potential of fully automated software development. In light of this, we introduce a new software engineering task, namely Natural Language to code Repository (NL2Repo). This task aims to generate an entire code repository from its natural language requirements. To address this task, we propose a simple y… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: https://github.com/NL2Code/CodeS

  45. arXiv:2403.16409  [pdf

    astro-ph.IM astro-ph.CO

    Large-scale Array for Radio Astronomy on the Farside

    Authors: Xuelei Chen, Feng Gao, Fengquan Wu, Yechi Zhang, Tong Wang, Weilin Liu, Dali Zou, Furen Deng, Yang Gong, Kai He, Jixia Li, Shijie Sun, Nanben Suo, Yougang Wang, Pengju Wu, Jiaqin Xu, Yidong Xu, Bin Yue, Cong Zhang, Jia Zhou, Minquan Zhou, Chenguang Zhu, Jiacong Zhu

    Abstract: At the Royal Society meeting in 2023, we have mainly presented our lunar orbit array concept called DSL, and also briefly introduced a concept of a lunar surface array, LARAF. As the DSL concept had been presented before, in this article we introduce the LARAF. We propose to build an array in the far side of the Moon, with a master station which handles the data collection and processing, and 20 s… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: final submission version, 30 pages, 16 figures

    Journal ref: Phil. Trans. R. Soc. A.382,20230094(2024)

  46. arXiv:2403.16212  [pdf, other

    eess.IV cs.CV cs.LG

    Leveraging Deep Learning and Xception Architecture for High-Accuracy MRI Classification in Alzheimer Diagnosis

    Authors: Shaojie Li, Haichen Qu, Xinqi Dong, Bo Dang, Hengyi Zang, Yulu Gong

    Abstract: Exploring the application of deep learning technologies in the field of medical diagnostics, Magnetic Resonance Imaging (MRI) provides a unique perspective for observing and diagnosing complex neurodegenerative diseases such as Alzheimer Disease (AD). With advancements in deep learning, particularly in Convolutional Neural Networks (CNNs) and the Xception network architecture, we are now able to a… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  47. arXiv:2403.14775  [pdf, ps, other

    cs.IT eess.SP

    RIS-Aided Cooperative Mobile Edge Computing: Computation Efficiency Maximization via Joint Uplink and Downlink Resource Allocation

    Authors: Zhenrong Liu, Zongze Li, Yi Gong, Yik-Chung Wu

    Abstract: In mobile edge computing (MEC) systems, the wireless channel condition is a critical factor affecting both the communication power consumption and computation rate of the offloading tasks. This paper exploits the idea of cooperative transmission and employing reconfigurable intelligent surface (RIS) in MEC to improve the channel condition and maximize computation efficiency (CE). The resulting pro… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: This paper has been accepted for publication in IEEE Transactions on Wireless Communications

  48. arXiv:2403.14483  [pdf, other

    cs.LG cs.AI q-fin.ST

    Utilizing the LightGBM Algorithm for Operator User Credit Assessment Research

    Authors: Shaojie Li, Xinqi Dong, Danqing Ma, Bo Dang, Hengyi Zang, Yulu Gong

    Abstract: Mobile Internet user credit assessment is an important way for communication operators to establish decisions and formulate measures, and it is also a guarantee for operators to obtain expected benefits. However, credit evaluation methods have long been monopolized by financial industries such as banks and credit. As supporters and providers of platform network technology and network resources, co… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  49. arXiv:2403.14244  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Isotropic Gaussian Splatting for Real-Time Radiance Field Rendering

    Authors: Yuanhao Gong, Lantao Yu, Guanghui Yue

    Abstract: The 3D Gaussian splatting method has drawn a lot of attention, thanks to its high performance in training and high quality of the rendered image. However, it uses anisotropic Gaussian kernels to represent the scene. Although such anisotropic kernels have advantages in representing the geometry, they lead to difficulties in terms of computation, such as splitting or merging two kernels. In this pap… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  50. arXiv:2403.13619  [pdf

    cs.DC cs.AI

    Dynamic Resource Allocation for Virtual Machine Migration Optimization using Machine Learning

    Authors: Yulu Gong, Jiaxin Huang, Bo Liu, Jingyu Xu, Binbin Wu, Yifan Zhang

    Abstract: The paragraph is grammatically correct and logically coherent. It discusses the importance of mobile terminal cloud computing migration technology in meeting the demands of evolving computer and cloud computing technologies. It emphasizes the need for efficient data access and storage, as well as the utilization of cloud computing migration technology to prevent additional time delays. The paragra… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.