
Showing 1–50 of 468 results for author: Tan, K

  1. arXiv:2407.19430  [pdf, other]

    cs.CV

    Progressive Domain Adaptation for Thermal Infrared Object Tracking

    Authors: Qiao Li, Kanlun Tan, Qiao Liu, Di Yuan, Xin Li, Yunpeng Liu

    Abstract: Due to the lack of large-scale labeled Thermal InfraRed (TIR) training datasets, most existing TIR trackers are trained directly on RGB datasets. However, tracking methods trained on RGB datasets suffer a significant drop-off in TIR data due to the domain shift issue. To this end, in this work, we propose a Progressive Domain Adaptation framework for TIR Tracking (PDAT), which transfers useful kno…

    Submitted 28 July, 2024; originally announced July 2024.

    Comments: 10 pages, 8 figures

  2. arXiv:2407.06857  [pdf, other]

    eess.SY

    Enhanced Battery Degradation-Aware Scheduling for Distribution Network with Electric Vehicle Load

    Authors: Vijay Babu Pamshetti, Wei Zhang, Andy Man-Fai Ng, Qingyu Yan, Kuan Tak Tan

    Abstract: Batteries play a key role in today's power grid. In this paper, we investigate the impact of battery degradation on the distribution network. We formulate a multi-objective framework for optimizing battery scheduling with the goals of minimizing monetary costs and improving network performance. Our framework incorporates energy purchase and battery degradation into the costs and measures the netwo…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 3 figures

  3. arXiv:2407.03085  [pdf, other]

    stat.ME stat.CO stat.ML

    Accelerated Inference for Partially Observed Markov Processes using Automatic Differentiation

    Authors: Kevin Tan, Giles Hooker, Edward L. Ionides

    Abstract: Automatic differentiation (AD) has driven recent advances in machine learning, including deep neural networks and Hamiltonian Markov Chain Monte Carlo methods. Partially observed nonlinear stochastic dynamical systems have proved resistant to AD techniques because widely used particle filter algorithms yield an estimated likelihood function that is discontinuous as a function of the model paramete…

    Submitted 3 July, 2024; originally announced July 2024.

  4. arXiv:2407.02639  [pdf, other]

    cs.CV

    Holistically-Nested Structure-Aware Graph Neural Network for Road Extraction

    Authors: Tinghuai Wang, Guangming Wang, Kuan Eeik Tan

    Abstract: Convolutional neural networks (CNN) have made significant advances in detecting roads from satellite images. However, existing CNN approaches are generally repurposed semantic segmentation architectures and suffer from the poor delineation of long and curved regions. Lack of overall road topology and structure information further deteriorates their performance on challenging remote sensing images.…

    Submitted 8 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

  5. arXiv:2406.14359  [pdf, other]

    cs.NE

    Learning to Transfer for Evolutionary Multitasking

    Authors: Sheng-Hao Wu, Yuxiao Huang, Xingyu Wu, Liang Feng, Zhi-Hui Zhan, Kay Chen Tan

    Abstract: Evolutionary multitasking (EMT) is an emerging approach for solving multitask optimization problems (MTOPs) and has garnered considerable research interest. The implicit EMT is a significant research branch that utilizes evolution operators to enable knowledge transfer (KT) between tasks. However, current approaches in implicit EMT face challenges in adaptability, due to the use of a limited numbe…

    Submitted 22 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Under review

  6. arXiv:2406.12313  [pdf]

    cs.DB

    A framework for developing a knowledge management platform

    Authors: Marie Lisandra Zepeda Mendoza, Sonali Agarwal, James A. Blackshaw, Vanesa Bol, Audrey Fazzi, Filippo Fiorini, Amy Louise Foreman, Nancy George, Brett R. Johnson, Brian Martin, Dave McComb, Euphemia Mutasa-Gottgens, Helen Parkinson, Martin Romacker, Rolf Russell, Valérien Ségard, Shawn Zheng Kai Tan, Wei Kheng Teh, F. P. Winstanley, Benedict Wong, Adrian M. Smith

    Abstract: Knowledge management (KM) involves collecting, organizing, storing, and disseminating information to improve decision-making, innovation, and performance. Implementing KM at scale has become essential for organizations to effectively leverage vast accessible data. This paper is a compilation of concepts that emerged from KM workshops hosted by EMBL-EBI, attended by SMEs and industry. We provide gu…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 18 pages, 1 figure

  7. arXiv:2406.11809  [pdf, other]

    cs.LG cs.RO eess.SY

    Physics-Constrained Learning for PDE Systems with Uncertainty Quantified Port-Hamiltonian Models

    Authors: Kaiyuan Tan, Peilun Li, Thomas Beckers

    Abstract: Modeling the dynamics of flexible objects has become an emerging topic in the community as these objects become more present in many applications, e.g., soft robotics. Due to the properties of flexible materials, the movements of soft objects are often highly nonlinear and, thus, complex to predict. Data-driven approaches seem promising for modeling those complex dynamics but often neglect basic p…

    Submitted 17 June, 2024; originally announced June 2024.

  8. arXiv:2406.11619  [pdf, other]

    eess.AS cs.LG

    AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling

    Authors: Vahid Ahmadi Kalkhorani, Cheng Yu, Anurag Kumar, Ke Tan, Buye Xu, DeLiang Wang

    Abstract: Adding visual cues to audio-based speech separation can improve separation performance. This paper introduces AV-CrossNet, an audiovisual (AV) system for speech enhancement, target speaker extraction, and multi-talker speaker separation. AV-CrossNet is extended from the CrossNet architecture, which is a recently proposed network that performs complex spectral mapping for speech separation by lever…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 10 pages, 4 Figures, and 4 Tables

  9. arXiv:2406.11414  [pdf, other]

    cs.LO cs.AI

    Formally Certified Approximate Model Counting

    Authors: Yong Kiam Tan, Jiong Yang, Mate Soos, Magnus O. Myreen, Kuldeep S. Meel

    Abstract: Approximate model counting is the task of approximating the number of solutions to an input Boolean formula. The state-of-the-art approximate model counter for formulas in conjunctive normal form (CNF), ApproxMC, provides a scalable means of obtaining model counts with probably approximately correct (PAC)-style guarantees. Nevertheless, the validity of ApproxMC's approximation relies on a careful…

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: The extended version, including the appendix, of the paper to be published in CAV24. The associated artifact is available at https://doi.org/10.5281/zenodo.10948839

  10. arXiv:2406.08987  [pdf, other]

    cs.NE

    Autonomous Multi-Objective Optimization Using Large Language Model

    Authors: Yuxiao Huang, Shenghao Wu, Wenjie Zhang, Jibin Wu, Liang Feng, Kay Chen Tan

    Abstract: Multi-objective optimization problems (MOPs) are ubiquitous in real-world applications, presenting a complex challenge of balancing multiple conflicting objectives. Traditional evolutionary algorithms (EAs), though effective, often rely on domain-specific expertise and iterative fine-tuning, hindering adaptability to unseen MOPs. In recent years, the advent of Large Language Models (LLMs) has revo…

    Submitted 26 July, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: 14 pages, 11 figures, 6 tables

  11. arXiv:2406.07390  [pdf, other]

    eess.SP cs.IT eess.IV

    DiffCom: Channel Received Signal is a Natural Condition to Guide Diffusion Posterior Sampling

    Authors: Sixian Wang, Jincheng Dai, Kailin Tan, Xiaoqi Qin, Kai Niu, Ping Zhang

    Abstract: End-to-end visual communication systems typically optimize a trade-off between channel bandwidth costs and signal-level distortion metrics. However, under challenging physical conditions, this traditional discriminative communication paradigm often results in unrealistic reconstructions with perceptible blurring and aliasing artifacts, despite the inclusion of perceptual or adversarial losses for…

    Submitted 11 June, 2024; originally announced June 2024.

  12. arXiv:2406.07069  [pdf, other]

    cs.RO eess.SY

    Optimal Gait Control for a Tendon-driven Soft Quadruped Robot by Model-based Reinforcement Learning

    Authors: Xuezhi Niu, Kaige Tan, Lei Feng

    Abstract: This study presents an innovative approach to optimal gait control for a soft quadruped robot enabled by four Compressible Tendon-driven Soft Actuators (CTSAs). Improving our previous studies of using model-free reinforcement learning for gait control, we employ model-based reinforcement learning (MBRL) to further enhance the performance of the gait controller. Compared to rigid robots, the propos…

    Submitted 11 June, 2024; originally announced June 2024.

  13. arXiv:2406.07065  [pdf, other]

    cs.RO eess.SY

    Optimal Gait Design for a Soft Quadruped Robot via Multi-fidelity Bayesian Optimization

    Authors: Kaige Tan, Xuezhi Niu, Qinglei Ji, Lei Feng, Martin Törngren

    Abstract: This study focuses on the locomotion capability improvement in a tendon-driven soft quadruped robot through an online adaptive learning approach. Leveraging the inverse kinematics model of the soft quadruped robot, we employ a central pattern generator to design a parametric gait pattern, and use Bayesian optimization (BO) to find the optimal parameters. Further, to address the challenges of model…

    Submitted 11 June, 2024; originally announced June 2024.

  14. arXiv:2406.00812  [pdf, other]

    stat.ML cs.LG

    Covariance-Adaptive Sequential Black-box Optimization for Diffusion Targeted Generation

    Authors: Yueming Lyu, Kim Yong Tan, Yew Soon Ong, Ivor W. Tsang

    Abstract: Diffusion models have demonstrated great potential in generating high-quality content for images, natural language, protein domains, etc. However, how to perform user-preferred targeted generation via diffusion models with only black-box target scores of users remains challenging. To address this issue, we first formulate the fine-tuning of the targeted reverse-time stochastic differential equatio…

    Submitted 8 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

  15. arXiv:2405.17324  [pdf, other]

    cs.LG cs.AI stat.ML

    Leveraging Offline Data in Linear Latent Bandits

    Authors: Chinmaya Kausik, Kevin Tan, Ambuj Tewari

    Abstract: Sequential decision-making domains such as recommender systems, healthcare and education often have unobserved heterogeneity in the population that can be modeled using latent bandits $-$ a framework where an unobserved latent state determines the model for a trajectory. While the latent bandit framework is compelling, the extent of its generality is unclear. We first address this by establishing…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 40 pages. 14 pages for main paper, 26 pages for references + appendix

  16. arXiv:2405.16041  [pdf, other]

    cs.LG cs.AI

    Explainable Molecular Property Prediction: Aligning Chemical Concepts with Predictions via Language Models

    Authors: Zhenzhong Wang, Zehui Lin, Wanyu Lin, Ming Yang, Minggang Zeng, Kay Chen Tan

    Abstract: Providing explainable molecule property predictions is critical for many scientific domains, such as drug discovery and material science. Though transformer-based language models have shown great potential in accurate molecular property prediction, they neither provide chemically meaningful explanations nor faithfully reveal the molecular structure-property relationships. In this work, we develop…

    Submitted 31 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

  17. arXiv:2405.15252  [pdf, other]

    cs.LG

    Fast 3D Molecule Generation via Unified Geometric Optimal Transport

    Authors: Haokai Hong, Wanyu Lin, Kay Chen Tan

    Abstract: This paper proposes a new 3D molecule generation framework, called GOAT, for fast and effective 3D molecule generation based on the flow-matching optimal transport objective. Specifically, we formulate a geometric transport formula for measuring the cost of mapping multi-modal features (e.g., continuous atom coordinates and categorical atom types) between a base distribution and a target data dist…

    Submitted 24 May, 2024; originally announced May 2024.

  18. arXiv:2405.11349  [pdf, other]

    cs.LG

    Unlock the Power of Algorithm Features: A Generalization Analysis for Algorithm Selection

    Authors: Xingyu Wu, Yan Zhong, Jibin Wu, Yuxiao Huang, Sheng-hao Wu, Kay Chen Tan

    Abstract: In the algorithm selection research, the discussion surrounding algorithm features has been significantly overshadowed by the emphasis on problem features. Although a few empirical studies have yielded evidence regarding the effectiveness of algorithm features, the potential benefits of incorporating algorithm features into algorithm selection models and their suitability for different scenarios r…

    Submitted 3 June, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

  19. arXiv:2405.05767  [pdf]

    cs.NE

    Large Language Model-Aided Evolutionary Search for Constrained Multiobjective Optimization

    Authors: Zeyi Wang, Songbai Liu, Jianyong Chen, Kay Chen Tan

    Abstract: Evolutionary algorithms excel in solving complex optimization problems, especially those with multiple objectives. However, their stochastic nature can sometimes hinder rapid convergence to the global optima, particularly in scenarios involving constraints. In this study, we employ a large language model (LLM) to enhance evolutionary search for solving constrained multi-objective optimization prob…

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 15 pages, 6 figures, 2024 International Conference on Intelligent Computing

  20. arXiv:2405.05413  [pdf]

    cs.DB

    Digital Evolution: Novo Nordisk's Shift to Ontology-Based Data Management

    Authors: Shawn Zheng Kai Tan, Shounak Baksi, Thomas Gade Bjerregaard, Preethi Elangovan, Thrishna Kuttikattu Gopalakrishnan, Darko Hric, Joffrey Joumaa, Beidi Li, Kashif Rabbani, Santhosh Kannan Venkatesan, Joshua Daniel Valdez, Saritha Vettikunnel Kuriakose

    Abstract: Biomedical data is growing exponentially, and managing it is increasingly challenging. While Findable, Accessible, Interoperable and Reusable (FAIR) data principles provide guidance, their adoption has proven difficult, especially in larger enterprises like pharmaceutical companies. In this manuscript, we describe how we leverage an Ontology-Based Data Management (OBDM) strategy for digital transf…

    Submitted 10 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: 14 pages, 2 figures

  21. arXiv:2405.05343  [pdf, other]

    cs.DS cs.LG math.NA

    Distributed Least Squares in Small Space via Sketching and Bias Reduction

    Authors: Sachin Garg, Kevin Tan, Michał Dereziński

    Abstract: Matrix sketching is a powerful tool for reducing the size of large data matrices. Yet there are fundamental limitations to this size reduction when we want to recover an accurate estimator for a task such as least square regression. We show that these limitations can be circumvented in the distributed setting by designing sketching methods that minimize the bias of the estimator, rather than its e…

    Submitted 8 May, 2024; originally announced May 2024.

  22. arXiv:2405.03924  [pdf, other]

    cs.DB cs.AI cs.LG

    NeurDB: An AI-powered Autonomous Data System

    Authors: Beng Chin Ooi, Shaofeng Cai, Gang Chen, Yanyan Shen, Kian-Lee Tan, Yuncheng Wu, Xiaokui Xiao, Naili Xing, Cong Yue, Lingze Zeng, Meihui Zhang, Zhanhao Zhao

    Abstract: In the wake of rapid advancements in artificial intelligence (AI), we stand on the brink of a transformative leap in data systems. The imminent fusion of AI and DB (AIxDB) promises a new generation of data systems, which will relieve the burden on end-users across all industry sectors by featuring AI-enhanced functionalities, such as personalized and automated in-database AI-powered analytics, sel…

    Submitted 4 July, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  23. arXiv:2404.17856  [pdf, other]

    stat.ML cs.LG math.ST stat.CO stat.ME

    Uncertainty quantification for iterative algorithms in linear models with application to early stopping

    Authors: Pierre C. Bellec, Kai Tan

    Abstract: This paper investigates the iterates $\hbb^1,\dots,\hbb^T$ obtained from iterative algorithms in high-dimensional linear regression problems, in the regime where the feature dimension $p$ is comparable with the sample size $n$, i.e., $p \asymp n$. The analysis and proposed estimators are applicable to Gradient Descent (GD), proximal GD and their accelerated variants such as Fast Iterative Soft-Thr…

    Submitted 27 April, 2024; originally announced April 2024.

  24. arXiv:2404.17316  [pdf, other]

    cs.AI

    Certified MaxSAT Preprocessing

    Authors: Hannes Ihalainen, Andy Oertel, Yong Kiam Tan, Jeremias Berg, Matti Järvisalo, Jakob Nordström

    Abstract: Building on the progress in Boolean satisfiability (SAT) solving over the last decades, maximum satisfiability (MaxSAT) has become a viable approach for solving NP-hard optimization problems, but ensuring correctness of MaxSAT solvers has remained an important concern. For SAT, this is largely a solved problem thanks to the use of proof logging, meaning that solvers emit machine-verifiable proofs…

    Submitted 26 April, 2024; originally announced April 2024.

  25. arXiv:2404.16885  [pdf]

    cs.CV cs.AI cs.CY cs.LG

    Adapting an Artificial Intelligence Sexually Transmitted Diseases Symptom Checker Tool for Mpox Detection: The HeHealth Experience

    Authors: Rayner Kay Jin Tan, Dilruk Perera, Salomi Arasaratnam, Yudara Kularathne

    Abstract: Artificial Intelligence applications have shown promise in the management of pandemics and have been widely used to assist the identification, classification, and diagnosis of medical images. In response to the global outbreak of Monkeypox (Mpox), the HeHealth.ai team leveraged an existing tool to screen for sexually transmitted diseases to develop a digital screening test for symptomatic Mpox thr…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 15 pages, 4 figures

  26. arXiv:2404.16745  [pdf, other]

    stat.ME

    Statistical Inference for Covariate-Adjusted and Interpretable Generalized Factor Model with Application to Testing Fairness

    Authors: Jing Ouyang, Chengyu Cui, Kean Ming Tan, Gongjun Xu

    Abstract: In the era of data explosion, statisticians have been developing interpretable and computationally efficient statistical methods to measure latent factors (e.g., skills, abilities, and personalities) using large-scale assessment data. In addition to understanding the latent information, the covariate effect on responses controlling for latent factors is also of great scientific interest and has wi…

    Submitted 25 April, 2024; originally announced April 2024.

  27. arXiv:2404.13818  [pdf, other]

    q-fin.GN

    Joint Liability Model with Adaptation to Climate Change

    Authors: Jiayue Zhang, Ken Seng Tan, Tony S. Wirjanto, Lysa Porth

    Abstract: This paper extends the application of ESG score assessment methodologies from large corporations to individual farmers' production, within the context of climate change. Our proposal involves the integration of crucial agricultural sustainability variables into conventional personal credit evaluation frameworks, culminating in the formulation of a holistic sustainable credit rating referred to as…

    Submitted 21 April, 2024; originally announced April 2024.

  28. arXiv:2404.12569  [pdf, other]

    cs.LG cs.AI

    Multi-View Subgraph Neural Networks: Self-Supervised Learning with Scarce Labeled Data

    Authors: Zhenzhong Wang, Qingyuan Zeng, Wanyu Lin, Min Jiang, Kay Chen Tan

    Abstract: While graph neural networks (GNNs) have become the de-facto standard for graph-based node classification, they impose a strong assumption on the availability of sufficient labeled samples. This assumption restricts the classification performance of prevailing GNNs on many real-world applications suffering from low-data regimes. Specifically, features extracted from scarce labeled nodes could not p…

    Submitted 18 April, 2024; originally announced April 2024.

  29. arXiv:2404.06418  [pdf, other]

    cs.LG cs.AI

    Studying the Impact of Latent Representations in Implicit Neural Networks for Scientific Continuous Field Reconstruction

    Authors: Wei Xu, Derek Freeman DeSantis, Xihaier Luo, Avish Parmar, Klaus Tan, Balu Nadiga, Yihui Ren, Shinjae Yoo

    Abstract: Learning a continuous and reliable representation of physical fields from sparse sampling is challenging and it affects diverse scientific disciplines. In a recent work, we present a novel model called MMGN (Multiplicative and Modulated Gabor Network) with implicit neural networks. In this work, we design additional studies leveraging explainability methods to complement the previous experiments a…

    Submitted 9 April, 2024; originally announced April 2024.

  30. arXiv:2404.06349  [pdf, other]

    cs.LG

    CausalBench: A Comprehensive Benchmark for Causal Learning Capability of Large Language Models

    Authors: Yu Zhou, Xingyu Wu, Beicheng Huang, Jibin Wu, Liang Feng, Kay Chen Tan

    Abstract: Causality reveals fundamental principles behind data distributions in real-world scenarios, and the capability of large language models (LLMs) to understand causality directly impacts their efficacy across explaining outputs, adapting to new evidence, and generating counterfactuals. With the proliferation of LLMs, the evaluation of this capacity is increasingly garnering attention. However, the ab…

    Submitted 9 April, 2024; originally announced April 2024.

  31. arXiv:2404.06290  [pdf, other]

    cs.NE

    Exploring the True Potential: Evaluating the Black-box Optimization Capability of Large Language Models

    Authors: Beichen Huang, Xingyu Wu, Yu Zhou, Jibin Wu, Liang Feng, Ran Cheng, Kay Chen Tan

    Abstract: Large language models (LLMs) have demonstrated exceptional performance not only in natural language processing tasks but also in a great variety of non-linguistic domains. In diverse optimization scenarios, there is also a rising trend of applying LLMs. However, whether the application of LLMs in the black-box optimization problems is genuinely beneficial remains unexplored. This paper endeavors t…

    Submitted 6 July, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  32. arXiv:2404.02062  [pdf, other]

    cs.CR cs.AI cs.LG

    Digital Forgetting in Large Language Models: A Survey of Unlearning Methods

    Authors: Alberto Blanco-Justicia, Najeeb Jebreel, Benet Manzanares, David Sánchez, Josep Domingo-Ferrer, Guillem Collell, Kuan Eeik Tan

    Abstract: The objective of digital forgetting is, given a model with undesirable knowledge or behavior, obtain a new model where the detected issues are no longer present. The motivations for forgetting include privacy protection, copyright protection, elimination of biases and discrimination, and prevention of harmful content generation. Effective digital forgetting has to be effective (meaning how well th…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 70 pages

    MSC Class: 68

    ACM Class: K.4.1; I.2.6; I.2.7

  33. arXiv:2404.00962  [pdf, other]

    cs.LG physics.chem-ph q-bio.BM

    Diffusion-Driven Domain Adaptation for Generating 3D Molecules

    Authors: Haokai Hong, Wanyu Lin, Kay Chen Tan

    Abstract: Can we train a molecule generator that can generate 3D molecules from a new domain, circumventing the need to collect data? This problem can be cast as the problem of domain adaptive molecule generation. This work presents a novel and principled diffusion-based approach, called GADM, that allows shifting a generative model to desired new domains without the need to collect even a single molecule.…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 11 pages, 3 figures, and 3 tables

  34. arXiv:2403.09701  [pdf, other]

    cs.LG stat.ML

    A Natural Extension To Online Algorithms For Hybrid RL With Limited Coverage

    Authors: Kevin Tan, Ziping Xu

    Abstract: Hybrid Reinforcement Learning (RL), leveraging both online and offline data, has garnered recent interest, yet research on its provable benefits remains sparse. Additionally, many existing hybrid RL algorithms (Song et al., 2023; Nakamoto et al., 2023; Amortila et al., 2024) impose coverage assumptions on the offline dataset, but we show that this is unnecessary. A well-designed online algorithm s…

    Submitted 17 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: Submitted to the reinforcement learning conference

  35. arXiv:2403.04133  [pdf, other]

    cs.CV cs.RO

    Towards learning-based planning: The nuPlan benchmark for real-world autonomous driving

    Authors: Napat Karnchanachari, Dimitris Geromichalos, Kok Seang Tan, Nanxiang Li, Christopher Eriksen, Shakiba Yaghoubi, Noushin Mehdipour, Gianmarco Bernasconi, Whye Kit Fong, Yiluan Guo, Holger Caesar

    Abstract: Machine Learning (ML) has replaced traditional handcrafted methods for perception and prediction in autonomous vehicles. Yet for the equally important planning task, the adoption of ML-based techniques is slow. We present nuPlan, the world's first real-world autonomous driving dataset, and benchmark. The benchmark is designed to test the ability of ML-based planners to handle diverse driving situa…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: ICRA 2024 camera ready incl. supplementary material

  36. arXiv:2403.01757  [pdf, other]

    cs.AI cs.CL cs.LG cs.NE math.OC

    How Multimodal Integration Boost the Performance of LLM for Optimization: Case Study on Capacitated Vehicle Routing Problems

    Authors: Yuxiao Huang, Wenjie Zhang, Liang Feng, Xingyu Wu, Kay Chen Tan

    Abstract: Recently, large language models (LLMs) have notably positioned them as capable tools for addressing complex optimization challenges. Despite this recognition, a predominant limitation of existing LLM-based optimization methods is their struggle to capture the relationships among decision variables when relying exclusively on numerical text prompts, especially in high-dimensional problems. Keeping…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 8 pages, 3 figures, 2 tables

  37. arXiv:2403.01369  [pdf, other]

    eess.AS cs.AI cs.LG

    A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement

    Authors: Ravi Shankar, Ke Tan, Buye Xu, Anurag Kumar

    Abstract: Self-supervised learned models have been found to be very effective for certain speech tasks such as automatic speech recognition, speaker identification, keyword spotting and others. While the features are undeniably useful in speech recognition and associated tasks, their utility in speech enhancement systems is yet to be firmly established, and perhaps not properly understood. In this paper, we…

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 8 pages; Shorter form accepted in ICASSP 2024

  38. arXiv:2402.19095  [pdf]

    q-bio.BM cs.LG

    A Protein Structure Prediction Approach Leveraging Transformer and CNN Integration

    Authors: Yanlin Zhou, Kai Tan, Xinyu Shen, Zheng He, Haotian Zheng

    Abstract: Proteins are essential for life, and their structure determines their function. The protein secondary structure is formed by the folding of the protein primary structure, and the protein tertiary structure is formed by the bending and folding of the secondary structure. Therefore, the study of protein secondary structure is very helpful to the overall understanding of protein structure. Although t…

    Submitted 8 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

  39. arXiv:2402.18292  [pdf, other]

    cs.CV cs.AI cs.LG

    FSL-Rectifier: Rectify Outliers in Few-Shot Learning via Test-Time Augmentation

    Authors: Yunwei Bai, Ying Kiat Tan, Tsuhan Chen

    Abstract: Few-shot-learning (FSL) commonly requires a model to identify images (queries) that belong to classes unseen during training, based on a few labelled samples of the new classes (support set) as reference. So far, plenty of algorithms involve training data augmentation to improve the generalization capability of FSL models, but outlier query or support images during inference can still pose great g…

    Submitted 21 July, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  40. arXiv:2402.17318  [pdf, other]

    cs.NE cs.CV cs.LG

    Scaling Supervised Local Learning with Augmented Auxiliary Networks

    Authors: Chenxiang Ma, Jibin Wu, Chenyang Si, Kay Chen Tan

    Abstract: Deep neural networks are typically trained using global error signals that backpropagate (BP) end-to-end, which is not only biologically implausible but also suffers from the update locking problem and requires huge memory consumption. Local learning, which updates each layer independently with a gradient-isolated auxiliary network, offers a promising alternative to address the above problems. How…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted by ICLR 2024

  41. arXiv:2402.16567  [pdf, other]

    cs.CL cs.AI cs.DB

    Aligning Large Language Models to a Domain-specific Graph Database

    Authors: Yuanyuan Liang, Keren Tan, Tingyu Xie, Wenbiao Tao, Siyuan Wang, Yunshi Lan, Weining Qian

    Abstract: Graph Databases (Graph DB) are widely applied in various fields, including finance, social networks, and medicine. However, translating Natural Language (NL) into the Graph Query Language (GQL), commonly known as NL2GQL, proves to be challenging due to its inherent complexity and specialized nature. Some approaches have sought to utilize Large Language Models (LLMs) to address analogous tasks like…

    Submitted 28 February, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: 13 pages, 2 figures

  42. arXiv:2402.16250  [pdf, ps, other]

    gr-qc math-ph math.AP math.DG

    A Proof of Weak Cosmic Censorship Conjecture for the Spherically Symmetric Einstein-Maxwell-Charged Scalar Field System

    Authors: Xinliang An, Hong Kiat Tan

    Abstract: Under spherical symmetry, we show that the weak cosmic censorship holds for the gravitational collapse of the Einstein-Maxwell-charged scalar field system. Namely, for this system, with generic initial data, the formed spacetime singularities are concealed inside black-hole regions. This generalizes Christodoulou's celebrated results to the charged case. Due to the presence of charge $Q$ and the c…

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: 148 pages

  43. arXiv:2402.15969  [pdf, other]

    cs.NE

    Efficient Online Learning for Networks of Two-Compartment Spiking Neurons

    Authors: Yujia Yin, Xinyi Chen, Chenxiang Ma, Jibin Wu, Kay Chen Tan

    Abstract: The brain-inspired Spiking Neural Networks (SNNs) have garnered considerable research interest due to their superior performance and energy efficiency in processing temporal signals. Recently, a novel multi-compartment spiking neuron model, namely the Two-Compartment LIF (TC-LIF) model, has been proposed and exhibited a remarkable capacity for sequential modelling. However, training the TC-LIF mod…

    Submitted 24 February, 2024; originally announced February 2024.

  44. arXiv:2402.14704  [pdf, other]

    cs.CL

    An LLM-Enhanced Adversarial Editing System for Lexical Simplification

    Authors: Keren Tan, Kangyang Luo, Yunshi Lan, Zheng Yuan, Jinlong Shu

    Abstract: Lexical Simplification (LS) aims to simplify text at the lexical level. Existing methods rely heavily on annotated data, making it challenging to apply in low-resource scenarios. In this paper, we propose a novel LS method without parallel corpora. This method employs an Adversarial Editing System with guidance from a confusion loss and an invariance loss to predict lexical edits in the original s…

    Submitted 22 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted by COLING 2024 main conference

  45. arXiv:2401.10034  [pdf, other]

    cs.NE cs.AI cs.CL

    Evolutionary Computation in the Era of Large Language Model: Survey and Roadmap

    Authors: Xingyu Wu, Sheng-hao Wu, Jibin Wu, Liang Feng, Kay Chen Tan

    Abstract: Large language models (LLMs) have not only revolutionized natural language processing but also extended their prowess to various domains, marking a significant stride towards artificial general intelligence. LLMs and evolutionary algorithms (EAs), despite differing in objectives and methodologies, share a common pursuit of applicability in complex problems. Meanwhile, EA can…

    Submitted 29 May, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: evolutionary algorithm (EA), large language model (LLM), optimization problem, prompt engineering, algorithm generation, neural architecture search

  46. arXiv:2401.04962  [pdf, other]

    cs.CV

    Large Model based Sequential Keyframe Extraction for Video Summarization

    Authors: Kailong Tan, Yuxiang Zhou, Qianchen Xia, Rui Liu, Yong Chen

    Abstract: Keyframe extraction aims to sum up a video's semantics with the minimum number of its frames. This paper puts forward a Large Model based Sequential Keyframe Extraction for video summarization, dubbed LMSKE, which contains three stages as below. First, we use the large model "TransNetV2" to cut the video into consecutive shots, and employ the large model "CLIP" to generate each frame's visual fe…

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: This paper has been accepted for CDIVP 2024

  47. arXiv:2401.03508  [pdf, ps, other]

    quant-ph

    Kirkwood-Dirac Type Quasiprobabilities as Universal Identifiers of Nonclassical Quantum Resources

    Authors: Kok Chuan Tan, Souradeep Sasmal

    Abstract: We show that a Kirkwood-Dirac type quasiprobability distribution is sufficient to reveal any arbitrary quantum resource. This is achieved by demonstrating that it is always possible to identify a set of incompatible measurements that distinguishes between resourceful states and nonresourceful states. The quasiprobability reveals a resourceful quantum state by having at least one quasiprobability ou…

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: 7 pages

  48. arXiv:2401.01563  [pdf, other]

    cs.NE

    Towards Multi-Objective High-Dimensional Feature Selection via Evolutionary Multitasking

    Authors: Yinglan Feng, Liang Feng, Songbai Liu, Sam Kwong, Kay Chen Tan

    Abstract: The Evolutionary Multitasking (EMT) paradigm, an emerging research topic in evolutionary computation, has recently been applied successfully to high-dimensional feature selection (FS) problems. However, existing EMT-based FS methods suffer from several limitations, such as a single mode of multitask generation, conducting the same generic evolutionary search for all tasks, relying on implicit…

    Submitted 3 January, 2024; originally announced January 2024.

  49. arXiv:2312.16884  [pdf]

    eess.AS cs.SD

    Binaural recording methods with analysis on inter-aural time, level, and phase differences

    Authors: Johann Kay Ann Tan

    Abstract: Binaural recording is a stereophonic recording method that replicates how human ears perceive sound. These recordings create a 3D aural image around the listener and are extremely immersive when well recorded and listened to appropriately with headphones. They have wide applications in video, podcast, and gaming formats, allowing the listener to feel like they are there. Although…

    Submitted 28 December, 2023; originally announced December 2023.

  50. arXiv:2312.15602  [pdf]

    physics.app-ph

    The effects of aural and visual factors on appropriateness ratings of residential spaces in an urban city

    Authors: Johann Kay Ann Tan, Siu-Kit Lau, Yoshimi Hasegawa

    Abstract: This study investigates the aural and visual factors that influence appropriateness perception in soundscape evaluations in residential spaces, where people may spend most of their time. Appropriateness in soundscape is derived from the expectation of sound sources in a specific environment, place, or function heard by a listener. The appropriateness of soundscapes in 30 locations in an urban r…

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: Internoise 2021