Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–24 of 24 results for author: Tan, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12923  [pdf, other

    cs.LG cs.MA

    Interpretable Cascading Mixture-of-Experts for Urban Traffic Congestion Prediction

    Authors: Wenzhao Jiang, Jindong Han, Hao Liu, Tao Tao, Naiqiang Tan, Hui Xiong

    Abstract: Rapid urbanization has significantly escalated traffic congestion, underscoring the need for advanced congestion prediction services to bolster intelligent transportation systems. As one of the world's largest ride-hailing platforms, DiDi places great emphasis on the accuracy of congestion prediction to enhance the effectiveness and reliability of their real-time services, such as travel time esti… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  2. arXiv:2406.10203  [pdf, other

    cs.CL

    A Fundamental Trade-off in Aligned Language Models and its Relation to Sampling Adaptors

    Authors: Naaman Tan, Josef Valvoda, Anej Svete, Tianyu Liu, Yanxia Qin, Kan Min-Yen, Ryan Cotterell

    Abstract: The relationship between the quality of a string and its probability $p(\boldsymbol{y})$ under a language model has been influential in the development of techniques to build good text generation systems. For example, several decoding algorithms have been motivated to manipulate $p(\boldsymbol{y})$ to produce higher-quality text. In this work, we examine the probability--quality relationship in la… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  3. arXiv:2403.10955  [pdf, other

    cs.RO

    Agonist-Antagonist Pouch Motors: Bidirectional Soft Actuators Enhanced by Thermally Responsive Peltier Elements

    Authors: Trevor Exley, Rashmi Wijesundara, Nathan Tan, Akshay Sunkara, Xinyu He, Shuopu Wang, Bonnie Chan, Aditya Jain, Luis Espinosa, Amir Jafari

    Abstract: In this study, we introduce a novel Mylar-based pouch motor design that leverages the reversible actuation capabilities of Peltier junctions to enable agonist-antagonist muscle mimicry in soft robotics. Addressing the limitations of traditional silicone-based materials, such as leakage and phase-change fluid degradation, our pouch motors filled with Novec 7000 provide a durable and leak-proof solu… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: submitted to IROS 2024, 7 pages, 9 figures

  4. arXiv:2403.05112  [pdf, other

    cs.AI

    RLPeri: Accelerating Visual Perimetry Test with Reinforcement Learning and Convolutional Feature Extraction

    Authors: Tanvi Verma, Linh Le Dinh, Nicholas Tan, Xinxing Xu, Chingyu Cheng, Yong Liu

    Abstract: Visual perimetry is an important eye examination that helps detect vision problems caused by ocular or neurological conditions. During the test, a patient's gaze is fixed at a specific location while light stimuli of varying intensities are presented in central and peripheral vision. Based on the patient's responses to the stimuli, the visual field mapping and sensitivity are determined. However,… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Published at AAAI-24

    Journal ref: The 38th Annual AAAI Conference on Artificial Intelligence, 2024

  5. arXiv:2310.09430  [pdf, ps, other

    cs.CL cs.AI

    Assessing and Enhancing the Robustness of Large Language Models with Task Structure Variations for Logical Reasoning

    Authors: Qiming Bao, Gael Gendron, Alex Yuxuan Peng, Wanjun Zhong, Neset Tan, Yang Chen, Michael Witbrock, Jiamou Liu

    Abstract: Large language models (LLMs), such as LLaMA, Alpaca, Vicuna, GPT-3.5 and GPT-4, have advanced the performance of AI systems on various natural language processing tasks to human-like levels. However, their generalisation and robustness when performing logical reasoning has not been sufficiently assessed. To comprehensively evaluate this ability, we develop three new logical reasoning datasets name… ▽ More

    Submitted 30 March, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: The short version (v3) was accepted for oral presentation at the first LLM@IJCAI 2023 non-archival symposium; the full version is under review

  6. arXiv:2309.15507  [pdf, other

    cs.IT eess.SP

    Approximate Message Passing with Rigorous Guarantees for Pooled Data and Quantitative Group Testing

    Authors: Nelvin Tan, Pablo Pascual Cobo, Jonathan Scarlett, Ramji Venkataramanan

    Abstract: In the pooled data problem, the goal is to identify the categories associated with a large collection of items via a sequence of pooled tests. Each pooled test reveals the number of items of each category within the pool. We study an approximate message passing (AMP) algorithm for estimating the categories and rigorously characterize its performance, in both the noiseless and noisy settings. For t… ▽ More

    Submitted 21 February, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: 62 pages, 11 figures

  7. arXiv:2308.00946  [pdf, other

    cs.CL cs.AI

    Teaching Smaller Language Models To Generalise To Unseen Compositional Questions

    Authors: Tim Hartill, Neset Tan, Michael Witbrock, Patricia J. Riddle

    Abstract: We equip a smaller Language Model to generalise to answering challenging compositional questions that have not been seen in training. To do so we propose a combination of multitask supervised pretraining on up to 93 tasks designed to instill diverse reasoning abilities, and a dense retrieval system that aims to retrieve a set of evidential paragraph fragments. Recent progress in question-answering… ▽ More

    Submitted 20 August, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

  8. arXiv:2305.12599  [pdf, other

    cs.CL cs.AI

    Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical Reasoning

    Authors: Qiming Bao, Alex Yuxuan Peng, Zhenyun Deng, Wanjun Zhong, Gael Gendron, Timothy Pistotti, Neset Tan, Nathan Young, Yang Chen, Yonghua Zhu, Paul Denny, Michael Witbrock, Jiamou Liu

    Abstract: Combining large language models with logical reasoning enhances their capacity to address problems in a robust and reliable manner. Nevertheless, the intricate nature of logical reasoning poses challenges when gathering reliable data from the web to build comprehensive training datasets, subsequently affecting performance on downstream tasks. To address this, we introduce a novel logic-driven data… ▽ More

    Submitted 6 June, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: 21 pages, 8 figures, the Findings of ACL 2024

  9. arXiv:2304.14501  [pdf, other

    cs.CV cs.AI cs.HC

    Read My Mind: A Multi-Modal Dataset for Human Belief Prediction

    Authors: Jiafei Duan, Samson Yu, Nicholas Tan, Yi Ru Wang, Cheston Tan

    Abstract: Understanding human intentions is key to enabling effective and efficient human-robot interaction (HRI) in collaborative settings. To enable developments and evaluation of the ability of artificial intelligence (AI) systems to infer human beliefs, we introduce a large-scale multi-modal video dataset for intent prediction based on object-context relations.

    Submitted 7 March, 2023; originally announced April 2023.

    Comments: Accepted to ICRA 2023 Communicating Robot Learning Across Human-Robot Interaction Workshop

  10. arXiv:2304.02229  [pdf, other

    stat.ML cs.IT cs.LG math.ST

    Mixed Regression via Approximate Message Passing

    Authors: Nelvin Tan, Ramji Venkataramanan

    Abstract: We study the problem of regression in a generalized linear model (GLM) with multiple signals and latent variables. This model, which we call a matrix GLM, covers many widely studied problems in statistical learning, including mixed linear regression, max-affine regression, and mixture-of-experts. In mixed linear regression, each observation comes from one of $L$ signal vectors (regressors), but we… ▽ More

    Submitted 15 August, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: 44 pages. To appear in the Journal of Machine Learning Research. A shorter version of this paper appeared in the proceedings of AISTATS 2023

    Journal ref: Journal of Machine Learning Research, vol. 24, no. 317, pp. 1-44, 2023

  11. arXiv:2303.07585  [pdf, other

    cs.CL

    Input-length-shortening and text generation via attention values

    Authors: Neşet Özkan Tan, Alex Yuxuan Peng, Joshua Bensemann, Qiming Bao, Tim Hartill, Mark Gahegan, Michael Witbrock

    Abstract: Identifying words that impact a task's performance more than others is a challenge in natural language processing. Transformers models have recently addressed this issue by incorporating an attention mechanism that assigns greater attention (i.e., relevance) scores to some words than others. Because of the attention mechanism's high computational cost, transformer models usually have an input-leng… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: 7 pages, 4 figures. AAAI23-EMC2

  12. arXiv:2207.14000  [pdf, other

    cs.CL cs.AI cs.LG cs.LO

    Multi-Step Deductive Reasoning Over Natural Language: An Empirical Study on Out-of-Distribution Generalisation

    Authors: Qiming Bao, Alex Yuxuan Peng, Tim Hartill, Neset Tan, Zhenyun Deng, Michael Witbrock, Jiamou Liu

    Abstract: Combining deep learning with symbolic logic reasoning aims to capitalize on the success of both fields and is drawing increasing attention. Inspired by DeepLogic, an end-to-end model trained to perform inference on logic programs, we introduce IMA-GloVe-GA, an iterative neural inference network for multi-step reasoning expressed in natural language. In our model, reasoning is performed using an it… ▽ More

    Submitted 30 March, 2024; v1 submitted 28 July, 2022; originally announced July 2022.

    Comments: 10 pages, 3 figures, The 2nd International Joint Conference on Learning & Reasoning and 16th International Workshop on Neural-Symbolic Learning and Reasoning (IJCLR-NeSy 2022)

  13. arXiv:2207.13848  [pdf, other

    cs.DC cs.LG cs.PF math.NA

    Predicting the Output Structure of Sparse Matrix Multiplication with Sampled Compression Ratio

    Authors: Zhaoyang Du, Yijin Guan, Tianchan Guan, Dimin Niu, Nianxiong Tan, Xiaopeng Yu, Hongzhong Zheng, Jianyi Meng, Xiaolang Yan, Yuan Xie

    Abstract: Sparse general matrix multiplication (SpGEMM) is a fundamental building block in numerous scientific applications. One critical task of SpGEMM is to compute or predict the structure of the output matrix (i.e., the number of nonzero elements per output row) for efficient memory allocation and load balance, which impact the overall performance of SpGEMM. Existing work either precisely calculates the… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: This paper has been submitted to the IEEE International Conference on Parallel and Distributed Systems (ICPADS). 8 pages, 2 fgures, 3 tables

    ACM Class: F.2.1; G.3; D.1.3; G.1.3

  14. arXiv:2206.10665  [pdf, other

    cs.CV

    BOSS: A Benchmark for Human Belief Prediction in Object-context Scenarios

    Authors: Jiafei Duan, Samson Yu, Nicholas Tan, Li Yi, Cheston Tan

    Abstract: Humans with an average level of social cognition can infer the beliefs of others based solely on the nonverbal communication signals (e.g. gaze, gesture, pose and contextual information) exhibited during social interactions. This social cognitive ability to predict human beliefs and intentions is more important than ever for ensuring safe human-robot interaction and collaboration. This paper uses… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

    Comments: 9 pages, 5 figures

  15. arXiv:2205.10267  [pdf, other

    astro-ph.IM astro-ph.HE cs.DC gr-qc

    Reproducibility of the First Image of a Black Hole in the Galaxy M87 from the Event Horizon Telescope (EHT) Collaboration

    Authors: Ria Patel, Brandan Roachell, Silvina Caino-Lores, Ross Ketron, Jacob Leonard, Nigel Tan, Duncan Brown, Ewa Deelman, Michela Taufer

    Abstract: This paper presents an interdisciplinary effort aiming to develop and share sustainable knowledge necessary to analyze, understand, and use published scientific results to advance reproducibility in multi-messenger astrophysics. Specifically, we target the breakthrough work associated with the generation of the first image of a black hole, called M87. The image was computed by the Event Horizon Te… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

  16. arXiv:2201.03745  [pdf, other

    cs.IT

    Performance Bounds for Group Testing With Doubly-Regular Designs

    Authors: Nelvin Tan, Way Tan, Jonathan Scarlett

    Abstract: In the group testing problem, the goal is to identify a subset of defective items within a larger set of items based on tests whose outcomes indicate whether any defective item is present. This problem is relevant in areas such as medical testing, DNA sequencing, and communications. In this paper, we study a doubly-regular design in which the number of tests-per-item and the number of items-per-te… ▽ More

    Submitted 27 September, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

    Comments: IEEE Transactions on Information Theory

  17. arXiv:2106.00308  [pdf, other

    cs.IT cs.DS

    Fast Splitting Algorithms for Sparsity-Constrained and Noisy Group Testing

    Authors: Eric Price, Jonathan Scarlett, Nelvin Tan

    Abstract: In group testing, the goal is to identify a subset of defective items within a larger set of items based on tests whose outcomes indicate whether at least one defective item is present. This problem is relevant in areas such as medical testing, DNA sequencing, communication protocols, and many more. In this paper, we study (i) a sparsity-constrained version of the problem, in which the testing pro… ▽ More

    Submitted 20 October, 2022; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: Information and Inference: A Journal of the IMA

  18. VPIC 2.0: Next Generation Particle-in-Cell Simulations

    Authors: Robert Bird, Nigel Tan, Scott V. Luedtke, Stephen Lien Harrell, Michela Taufer, Brian Albright

    Abstract: VPIC is a general purpose Particle-in-Cell simulation code for modeling plasma phenomena such as magnetic reconnection, fusion, solar weather, and laser-plasma interaction in three dimensions using large numbers of particles. VPIC's capacity in both fidelity and scale makes it particularly well-suited for plasma research on pre-exascale and exascale platforms. In this paper we demonstrate the uniq… ▽ More

    Submitted 25 February, 2021; originally announced February 2021.

  19. arXiv:2004.11860  [pdf, other

    cs.DS cs.DM cs.IT math.CO

    Near optimal sparsity-constrained group testing: improved bounds and algorithms

    Authors: Oliver Gebhard, Max Hahn-Klimroth, Olaf Parczyk, Manuel Penschuck, Maurice Rolvien, Jonathan Scarlett, Nelvin Tan

    Abstract: Recent advances in noiseless non-adaptive group testing have led to a precise asymptotic characterization of the number of tests required for high-probability recovery in the sublinear regime $k = n^θ$ (with $θ\in (0,1)$), with $n$ individuals among which $k$ are infected. However, the required number of tests may increase substantially under real-world practical constraints, notably including bou… ▽ More

    Submitted 22 December, 2021; v1 submitted 24 April, 2020; originally announced April 2020.

    Comments: Accepted for publication at IEEE Transactions on Information Theory

    MSC Class: 05C80; 60B20; 68P30

  20. arXiv:2004.03119  [pdf, other

    cs.IT math.PR

    Improved Bounds and Algorithms for Sparsity-Constrained Group Testing

    Authors: Nelvin Tan, Jonathan Scarlett

    Abstract: In group testing, the goal is to identify a subset of defective items within a larger set of items based on tests whose outcomes indicate whether any defective item is present. This problem is relevant in areas such as medical testing, data science, communications, and many more. Motivated by physical considerations, we consider a sparsity-based constrained setting (Gandikota et al., 2019) in whic… ▽ More

    Submitted 10 November, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: This paper has been merged with concurrent work to form arXiv:2004.11860. See v2 (arXiv:2004.03119v2) for a 5-page ISIT version with the adaptive setting only

  21. arXiv:1904.05644  [pdf

    cs.CV eess.IV

    Retinal Vessels Segmentation Based on Dilated Multi-Scale Convolutional Neural Network

    Authors: Yun Jiang, Ning Tan, Tingting Peng, Hai Zhang

    Abstract: Accurate segmentation of retinal vessels is a basic step in Diabetic retinopathy(DR) detection. Most methods based on deep convolutional neural network (DCNN) have small receptive fields, and hence they are unable to capture global context information of larger regions, with difficult to identify lesions. The final segmented retina vessels contain more noise with low classification accuracy. There… ▽ More

    Submitted 11 April, 2019; originally announced April 2019.

  22. arXiv:1805.04224  [pdf

    cs.CV

    Retinal Vessel Segmentation Based on Conditional Deep Convolutional Generative Adversarial Networks

    Authors: Yun Jiang, Ning Tan

    Abstract: The segmentation of retinal vessels is of significance for doctors to diagnose the fundus diseases. However, existing methods have various problems in the segmentation of the retinal vessels, such as insufficient segmentation of retinal vessels, weak anti-noise interference ability, and sensitivity to lesions, etc. Aiming to the shortcomings of existed methods, this paper proposes the use of condi… ▽ More

    Submitted 10 May, 2018; originally announced May 2018.

    Comments: in Chinese

  23. arXiv:1711.04498  [pdf, other

    cs.IR cs.AI cs.CL

    Targeted Advertising Based on Browsing History

    Authors: Yong Zhang, Hongming Zhou, Nganmeng Tan, Saeed Bagheri, Meng Joo Er

    Abstract: Audience interest, demography, purchase behavior and other possible classifications are ex- tremely important factors to be carefully studied in a targeting campaign. This information can help advertisers and publishers deliver advertisements to the right audience group. How- ever, it is not easy to collect such information, especially for the online audience with whom we have limited interaction… ▽ More

    Submitted 13 November, 2017; originally announced November 2017.

  24. arXiv:1110.1064  [pdf, ps, other

    cs.DS

    Approximating CSPs with Global Cardinality Constraints Using SDP Hierarchies

    Authors: Prasad Raghavendra, Ning Tan

    Abstract: This work is concerned with approximating constraint satisfaction problems (CSPs) with an additional global cardinality constraints. For example, \maxcut is a boolean CSP where the input is a graph $G = (V,E)$ and the goal is to find a cut $S \cup \bar S = V$ that maximizes the numberof crossing edges, $|E(S,\bar S)|$. The \maxbisection problem is a variant of \maxcut with an additional global con… ▽ More

    Submitted 5 October, 2011; originally announced October 2011.