Showing 1–50 of 160 results for author: Tran, V

Searching in archive cs.

  1. arXiv:2409.01062  [pdf, other]

    cs.LG cs.CR cs.CV

    Defending against Model Inversion Attacks via Random Erasing

    Authors: Viet-Hung Tran, Ngoc-Bao Nguyen, Son T. Mai, Hans Vandierendonck, Ngai-man Cheung

    Abstract: Model Inversion (MI) is a type of privacy violation that focuses on reconstructing private training data through abusive exploitation of machine learning models. To defend against MI attacks, state-of-the-art (SOTA) MI defense methods rely on regularizations that conflict with the training loss, creating explicit tension between privacy protection and model utility. In this paper, we present a n…

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: Under review. The first two authors contributed equally

  2. arXiv:2408.16737  [pdf, other]

    cs.CL cs.AI

    Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling

    Authors: Hritik Bansal, Arian Hosseini, Rishabh Agarwal, Vinh Q. Tran, Mehran Kazemi

    Abstract: Training on high-quality synthetic data from strong language models (LMs) is a common strategy to improve the reasoning performance of LMs. In this work, we revisit whether this strategy is compute-optimal under a fixed inference budget (e.g., FLOPs). To do so, we investigate the trade-offs between generating synthetic data using a stronger but more expensive (SE) model versus a weaker but cheaper…

    Submitted 29 August, 2024; originally announced August 2024.

  3. arXiv:2408.16578  [pdf, other]

    cs.IR cs.LG

    Transformers Meet ACT-R: Repeat-Aware and Sequential Listening Session Recommendation

    Authors: Viet-Anh Tran, Guillaume Salha-Galvan, Bruno Sguerra, Romain Hennequin

    Abstract: Music streaming services often leverage sequential recommender systems to predict the best music to showcase to users based on past sequences of listening sessions. Nonetheless, most sequential recommendation methods ignore or insufficiently account for repetitive behaviors. This is a crucial limitation for music recommendation, as repeatedly listening to the same song over time is a common phenom…

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: 11 pages. Accepted by RecSys'2024, full paper

  4. arXiv:2407.17790  [pdf, other]

    cs.LG cs.AR

    Exploring the Limitations of Kolmogorov-Arnold Networks in Classification: Insights to Software Training and Hardware Implementation

    Authors: Van Duy Tran, Tran Xuan Hieu Le, Thi Diem Tran, Hoai Luan Pham, Vu Trung Duong Le, Tuan Hai Vu, Van Tinh Nguyen, Yasuhiko Nakashima

    Abstract: Kolmogorov-Arnold Networks (KANs), a novel type of neural network, have recently gained popularity and attention due to the ability to substitute multi-layer perceptrons (MLPs) in artificial intelligence (AI) with higher accuracy and interoperability. However, KAN assessment is still limited and cannot provide an in-depth analysis of a specific domain. Furthermore, no study has been conducted on t…

    Submitted 25 July, 2024; v1 submitted 25 July, 2024; originally announced July 2024.

    Comments: 6 pages, 3 figures, 2 tables

  5. arXiv:2407.01308  [pdf, other]

    cs.RO cs.MA

    Active Sensing Strategy: Multi-Modal, Multi-Robot Source Localization and Mapping in Real-World Settings with Fixed One-Way Switching

    Authors: Vu Phi Tran, Asanka G. Perera, Matthew A. Garratt, Kathryn Kasmarik, Sreenatha G. Anavatti

    Abstract: This paper introduces a state-machine model for a multi-modal, multi-robot environmental sensing algorithm tailored to dynamic real-world settings. The algorithm uniquely combines two exploration strategies for gas source localization and mapping: (1) an initial exploration phase using multi-robot coverage path planning with variable formations for early gas field indication; and (2) a subsequent…

    Submitted 1 July, 2024; originally announced July 2024.

  6. arXiv:2406.17335  [pdf, other]

    cs.IR cs.LG

    A Thorough Performance Benchmarking on Lightweight Embedding-based Recommender Systems

    Authors: Hung Vinh Tran, Tong Chen, Quoc Viet Hung Nguyen, Zi Huang, Lizhen Cui, Hongzhi Yin

    Abstract: Since the creation of the Web, recommender systems (RSs) have been an indispensable mechanism in information filtering. State-of-the-art RSs primarily depend on categorical features, which are encoded by embedding vectors, resulting in excessively large embedding tables. To prevent over-parameterized embedding tables from harming scalability, both academia and industry have seen increasing efforts in c…

    Submitted 25 June, 2024; originally announced June 2024.

  7. arXiv:2406.13725  [pdf, other]

    cs.LG cs.AI stat.ML

    Tree-Sliced Wasserstein Distance on a System of Lines

    Authors: Viet-Hoang Tran, Trang Pham, Tho Tran, Tam Le, Tan M. Nguyen

    Abstract: Sliced Wasserstein (SW) distance in Optimal Transport (OT) is widely used in various applications thanks to its statistical effectiveness and computational efficiency. On the other hand, Tree Wasserstein (TW) and Tree-sliced Wasserstein (TSW) are instances of OT for probability measures where the ground cost is a tree metric. TSW also has a low computational complexity, i.e. linear to the number o…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 33 pages, 6 figures, 2 tables, 4 algorithms

  8. arXiv:2406.04140  [pdf, other]

    cs.SD eess.AS

    STraDa: A Singer Traits Dataset

    Authors: Yuexuan Kong, Viet-Anh Tran, Romain Hennequin

    Abstract: There is a limited number of large-scale public datasets that contain downloadable music audio files and rich lead singer metadata. To provide such a dataset to benefit research in singing voices, we created Singer Traits Dataset (STraDa) with two subsets: automatic-strada and annotated-strada. The automatic-strada contains twenty-five thousand tracks across numerous genres and languages of more t…

    Submitted 6 June, 2024; originally announced June 2024.

  9. arXiv:2406.01457  [pdf, other]

    cs.LG cs.CL cs.CR

    Differentially Private Tabular Data Synthesis using Large Language Models

    Authors: Toan V. Tran, Li Xiong

    Abstract: Synthetic tabular data generation with differential privacy is a crucial problem to enable data sharing with formal privacy. Despite a rich history of methodological research and development, developing differentially private tabular data generators that can provide realistic synthetic datasets remains challenging. This paper introduces DP-LLMTGen -- a novel framework for differentially private ta…

    Submitted 3 June, 2024; originally announced June 2024.

  10. arXiv:2405.03903  [pdf, other]

    cs.AI cs.CY

    Unified Locational Differential Privacy Framework

    Authors: Aman Priyanshu, Yash Maurya, Suriya Ganesh, Vy Tran

    Abstract: Aggregating statistics over geographical regions is important for many applications, such as analyzing income, election results, and disease spread. However, the sensitive nature of this data necessitates strong privacy protections to safeguard individuals. In this work, we present a unified locational differential privacy (DP) framework to enable private aggregation of various data types, includi…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 10 pages, 7 figures

  11. arXiv:2403.17225  [pdf]

    cs.HC cs.CR cs.CY

    Measuring Compliance with the California Consumer Privacy Act Over Space and Time

    Authors: Van Tran, Aarushi Mehrotra, Marshini Chetty, Nick Feamster, Jens Frankenreiter, Lior Strahilevitz

    Abstract: The widespread sharing of consumers' personal information with third parties raises significant privacy concerns. The California Consumer Privacy Act (CCPA) mandates that online businesses offer consumers the option to opt out of the sale and sharing of personal information. Our study automatically tracks the presence of the opt-out link longitudinally across multiple states after the California Pr…

    Submitted 25 March, 2024; originally announced March 2024.

  12. arXiv:2403.03435  [pdf, ps, other]

    cs.CL

    VLSP 2023 -- LTER: A Summary of the Challenge on Legal Textual Entailment Recognition

    Authors: Vu Tran, Ha-Thanh Nguyen, Trung Vo, Son T. Luu, Hoang-Anh Dang, Ngoc-Cam Le, Thi-Thuy Le, Minh-Tien Nguyen, Truong-Son Nguyen, Le-Minh Nguyen

    Abstract: In this new era of rapid AI development, especially in language processing, the demand for AI in the legal domain is increasingly critical. In the context where research in other languages such as English, Japanese, and Chinese has been well-established, we introduce the first fundamental research for the Vietnamese language in the legal domain: legal textual entailment recognition through the Vie…

    Submitted 5 March, 2024; originally announced March 2024.

  13. arXiv:2403.01454  [pdf, ps, other]

    cs.IT

    Maximum Length RLL Sequences in de Bruijn Graph

    Authors: Yeow Meng Chee, Tuvi Etzion, Tien Long Nguyen, Duy Hoang Ta, Vinh Duc Tran, Van Khu Vu

    Abstract: A timing and synchronization system based on a de Bruijn sequence has been proposed and studied recently for a channel associated with quantum communication that requires reliable synchronization. To avoid a long period of no-pulse in such a system, on-off pulses are used to simulate a zero and on-on pulses are used to simulate a one. However, these sequences have high redundancy. To reduce the red…

    Submitted 3 March, 2024; originally announced March 2024.

  14. How good are my search strings? Reflections on using an existing review as a quasi-gold standard

    Authors: Huynh Khanh Vi Tran, Jürgen Börstler, Nauman Bin Ali, Michael Unterkalmsteiner

    Abstract: Background: Systematic literature studies (SLS) have become a core research methodology in Evidence-based Software Engineering (EBSE). Search completeness, i.e., finding all relevant papers on the topic of interest, has been recognized as one of the most commonly discussed validity issues of SLSs. Aim: This study aims at raising awareness on the issues related to search string construction and on se…

    Submitted 16 February, 2024; originally announced February 2024.

    Journal ref: e-Informatica Softw. Eng. J. 16(1) (2022)

  15. Assessing test artifact quality -- A tertiary study

    Authors: Huynh Khanh Vi Tran, Michael Unterkalmsteiner, Jürgen Börstler, Nauman bin Ali

    Abstract: Context: Modern software development increasingly relies on software testing for an ever more frequent delivery of high quality software. This puts high demands on the quality of the central artifacts in software testing, test suites and test cases. Objective: We aim to develop a comprehensive model for capturing the dimensions of test case/suite quality, which are relevant for a variety of perspe…

    Submitted 14 February, 2024; originally announced February 2024.

    Journal ref: Information and Software Technology 139 (2021): 106620

  16. arXiv:2402.01825  [pdf, other]

    cs.CL cs.AI

    Fractal Patterns May Illuminate the Success of Next-Token Prediction

    Authors: Ibrahim Alabdulmohsin, Vinh Q. Tran, Mostafa Dehghani

    Abstract: We study the fractal structure of language, aiming to provide a precise formalism for quantifying properties that may have been previously suspected but not formally shown. We establish that language is: (1) self-similar, exhibiting complexities at all levels of granularity, with no particular characteristic context length, and (2) long-range dependent (LRD), with a Hurst parameter of approximatel…

    Submitted 22 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 15 pages, 10 tables, 6 figures

  17. arXiv:2312.03493  [pdf, other]

    cs.RO

    Radio Source Localization using Sparse Signal Measurements from Uncrewed Ground Vehicles

    Authors: Asanka Perera, Vu Phi Tran, Sreenatha Anavatti, Kathryn Kasmarik, Matthew Garratt

    Abstract: Radio source localization can benefit many fields, including wireless communications, radar, radio astronomy, wireless sensor networks, positioning systems, and surveillance systems. However, accurately estimating the position of a radio transmitter using a remote sensor is not an easy task, as many factors contribute to the highly dynamic behavior of radio signals. In this study, we investigate t…

    Submitted 6 December, 2023; originally announced December 2023.

  18. Ex2Vec: Characterizing Users and Items from the Mere Exposure Effect

    Authors: Bruno Sguerra, Viet-Anh Tran, Romain Hennequin

    Abstract: The traditional recommendation framework seeks to connect user and content by finding the best match possible based on users' past interactions. However, a good content recommendation is not necessarily similar to what the user has chosen in the past. As humans, users naturally evolve, learn, forget, get bored, they change their perspective of the world and in consequence, of the recommendable cont…

    Submitted 17 November, 2023; originally announced November 2023.

    Journal ref: In Seventeenth ACM Conference on Recommender Systems (RecSys 2023)

  19. arXiv:2311.05716  [pdf, other]

    cs.AR

    ML-based Real-Time Control at the Edge: An Approach Using hls4ml

    Authors: R. Shi, S. Ogrenci, J. M. Arnold, J. R. Berlioz, P. Hanlet, K. J. Hazelwood, M. A. Ibrahim, H. Liu, V. P. Nagaslaev, A. Narayanan, D. J. Nicklaus, J. Mitrevski, G. Pradhan, A. L. Saewert, B. A. Schupbach, K. Seiya, M. Thieme, R. M. Thurman-Keup, N. V. Tran

    Abstract: This study focuses on implementing a real-time control system for a particle accelerator facility that performs high energy physics experiments. A critical operating parameter in this facility is beam loss, which is the fraction of particles deviating from the accelerated proton beam into a cascade of secondary particles. Accelerators employ a large number of sensors to monitor beam loss. The data…

    Submitted 9 November, 2023; originally announced November 2023.

  20. arXiv:2310.18046  [pdf, other]

    cs.CL cs.CV

    ViCLEVR: A Visual Reasoning Dataset and Hybrid Multimodal Fusion Model for Visual Question Answering in Vietnamese

    Authors: Khiem Vinh Tran, Hao Phu Phan, Kiet Van Nguyen, Ngan Luu Thuy Nguyen

    Abstract: In recent years, Visual Question Answering (VQA) has gained significant attention for its diverse applications, including intelligent car assistance, aiding visually impaired individuals, and document image information retrieval using natural language queries. VQA requires effective integration of information from questions and images to generate accurate answers. Neural models for VQA have made r…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: A pre-print version and submitted to journal

  21. arXiv:2310.14602  [pdf, ps, other]

    cs.CL

    Generative Pre-trained Transformer for Vietnamese Community-based COVID-19 Question Answering

    Authors: Tam Minh Vo, Khiem Vinh Tran

    Abstract: Recent studies have provided empirical evidence of the wide-ranging potential of Generative Pre-trained Transformer (GPT), a pretrained language model, in the field of natural language processing. GPT has been effectively employed as a decoder within state-of-the-art (SOTA) question answering systems, yielding exceptional performance across various tasks. However, the current research landscape co…

    Submitted 31 October, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

  22. Test-Case Quality -- Understanding Practitioners' Perspectives

    Authors: Huynh Khanh Vi Tran, Nauman Bin Ali, Jürgen Börstler, Michael Unterkalmsteiner

    Abstract: Background: Test-case quality has always been one of the major concerns in software testing. To improve test-case quality, it is important to better understand how practitioners perceive the quality of test-cases. Objective: Motivated by that need, we investigated how practitioners define test-case quality and which aspects of test-cases are important for quality assessment. Method: We conducted s…

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: PROFES 2019: 37-52

  23. Encoded Summarization: Summarizing Documents into Continuous Vector Space for Legal Case Retrieval

    Authors: Vu Tran, Minh Le Nguyen, Satoshi Tojo, Ken Satoh

    Abstract: We present our method for tackling a legal case retrieval task by introducing our method of encoding documents by summarizing them into continuous vector space via our phrase scoring framework utilizing deep neural networks. On the other hand, we explore the benefits from combining lexical features and latent features generated with neural networks. Our experiments show that lexical features and l…

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: Published 2020-01-25 in AI and Law. arXiv admin note: text overlap with arXiv:2009.14083

  24. arXiv:2309.03223  [pdf]

    cs.HC cs.CL cs.LG

    Examining the Effectiveness of Chatbots in Gathering Family History Information in Comparison to the Standard In-Person Interview-Based Approach

    Authors: Kieron Drumm, Vincent Tran

    Abstract: One of the most common things that a genealogist is tasked with is the gathering of a person's initial family history, normally via in-person interviews or with the use of a platform such as ancestry.com, as this can provide a strong foundation upon which a genealogist may build. However, the ability to conduct these interviews can often be hindered by both geographical constraints and the technic…

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: 10 pages, 2 figures, 5 tables

  25. arXiv:2308.07601  [pdf, ps, other]

    cs.CL

    VBD-MT Chinese-Vietnamese Translation Systems for VLSP 2022

    Authors: Hai Long Trieu, Song Kiet Bui, Tan Minh Tran, Van Khanh Tran, Hai An Nguyen

    Abstract: We present the systems with which we participated in the VLSP 2022 machine translation shared task. In the shared task this year, we participated in both translation tasks, i.e., Chinese-Vietnamese and Vietnamese-Chinese translations. We build our systems based on the neural-based Transformer model with the powerful multilingual denoising pre-trained model mBART. The systems are enhanced by a sampling method fo…

    Submitted 15 August, 2023; originally announced August 2023.

  26. arXiv:2307.15335  [pdf, other]

    cs.CL cs.CV

    BARTPhoBEiT: Pre-trained Sequence-to-Sequence and Image Transformers Models for Vietnamese Visual Question Answering

    Authors: Khiem Vinh Tran, Kiet Van Nguyen, Ngan Luu Thuy Nguyen

    Abstract: Visual Question Answering (VQA) is an intricate and demanding task that integrates natural language processing (NLP) and computer vision (CV), capturing the interest of researchers. The English language, renowned for its wealth of resources, has witnessed notable advancements in both datasets and models designed for VQA. However, there is a lack of models that target specific countries such as Vie…

    Submitted 28 July, 2023; originally announced July 2023.

  27. arXiv:2306.15634  [pdf, other]

    cs.CL

    Automatic Annotation of Direct Speech in Written French Narratives

    Authors: Noé Durandard, Viet-Anh Tran, Gaspard Michel, Elena V. Epure

    Abstract: The automatic annotation of direct speech (AADS) in written text has often been used in computational narrative understanding. Methods based on either rules or deep neural networks have been explored, in particular for English or German. Yet, for French, our target language, not many works exist. Our goal is to create a unified framework to design and evaluate AADS models in French. For…

    Submitted 28 June, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: 9 pages, ACL 2023

  28. arXiv:2306.04083  [pdf, other]

    cs.MA cs.RO

    Coverage Path Planning with Budget Constraints for Multiple Unmanned Ground Vehicles

    Authors: Vu Phi Tran, Asanka Perera, Matthew A. Garratt, Kathryn Kasmarik, Sreenatha Anavatti

    Abstract: This paper proposes a state-machine model for a multi-modal, multi-robot environmental sensing algorithm. This multi-modal algorithm integrates two different exploration algorithms: (1) coverage path planning using variable formations and (2) collaborative active sensing using multi-robot swarms. The state machine provides the logic for when to switch between these different sensing algorithms. We…

    Submitted 6 June, 2023; originally announced June 2023.

  29. arXiv:2305.11841  [pdf, other]

    cs.IR cs.CL

    How Does Generative Retrieval Scale to Millions of Passages?

    Authors: Ronak Pradeep, Kai Hui, Jai Gupta, Adam D. Lelkes, Honglei Zhuang, Jimmy Lin, Donald Metzler, Vinh Q. Tran

    Abstract: Popularized by the Differentiable Search Index, the emerging paradigm of generative retrieval re-frames the classic information retrieval problem into a sequence-to-sequence modeling task, forgoing external indices and encoding an entire document corpus within a single Transformer. Although many different approaches have been proposed to improve the effectiveness of generative retrieval, they have…

    Submitted 19 May, 2023; originally announced May 2023.

  30. arXiv:2305.05065  [pdf, other]

    cs.IR cs.LG

    Recommender Systems with Generative Retrieval

    Authors: Shashank Rajput, Nikhil Mehta, Anima Singh, Raghunandan H. Keshavan, Trung Vu, Lukasz Heldt, Lichan Hong, Yi Tay, Vinh Q. Tran, Jonah Samost, Maciej Kula, Ed H. Chi, Maheswaran Sathiamoorthy

    Abstract: Modern recommender systems perform large-scale retrieval by first embedding queries and item candidates in the same unified space, followed by approximate nearest neighbor search to select top candidates given a query embedding. In this paper, we propose a novel generative retrieval approach, where the retrieval model autoregressively decodes the identifiers of the target candidates. To that end,…

    Submitted 3 November, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: To appear in The 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  31. arXiv:2304.13464  [pdf]

    cs.LG cs.CY

    A Comparative Analysis of Multiple Methods for Predicting a Specific Type of Crime in the City of Chicago

    Authors: Deborah Djon, Jitesh Jhawar, Kieron Drumm, Vincent Tran

    Abstract: Researchers regard crime as a social phenomenon that is influenced by several physical, social, and economic factors. Different types of crimes are said to have different motivations. Theft, for instance, is a crime that is based on opportunity, whereas murder is driven by emotion. In accordance with this, we examine how well a model can perform with only spatiotemporal information at hand when it…

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: 9 pages, 1 figure

  32. arXiv:2304.08158  [pdf, other]

    cs.IR cs.LG

    Attention Mixtures for Time-Aware Sequential Recommendation

    Authors: Viet-Anh Tran, Guillaume Salha-Galvan, Bruno Sguerra, Romain Hennequin

    Abstract: Transformers emerged as powerful methods for sequential recommendation. However, existing architectures often overlook the complex dependencies between user preferences and the temporal context. In this short paper, we introduce MOJITO, an improved Transformer sequential recommender system that addresses this limitation. MOJITO leverages Gaussian mixtures of attention-based temporal context and it…

    Submitted 3 July, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: SIGIR 2023

  33. arXiv:2304.04835  [pdf, other]

    cs.CR cs.CY cs.NI

    Measuring and Evading Turkmenistan's Internet Censorship: A Case Study in Large-Scale Measurements of a Low-Penetration Country

    Authors: Sadia Nourin, Van Tran, Xi Jiang, Kevin Bock, Nick Feamster, Nguyen Phong Hoang, Dave Levin

    Abstract: Since 2006, Turkmenistan has been listed as one of the few Internet enemies by Reporters without Borders due to its extensively censored Internet and strictly regulated information control policies. Existing reports of filtering in Turkmenistan rely on a small number of vantage points or test a small number of websites. Yet, the country's poor Internet adoption rates and small population can make…

    Submitted 17 April, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: To appear in Proceedings of The 2023 ACM Web Conference (WWW 2023)

  34. arXiv:2303.16507  [pdf, other]

    cs.CV

    Improving Object Detection in Medical Image Analysis through Multiple Expert Annotators: An Empirical Investigation

    Authors: Hieu H. Pham, Khiem H. Le, Tuan V. Tran, Ha Q. Nguyen

    Abstract: The work discusses the use of machine learning algorithms for anomaly detection in medical image analysis and how the performance of these algorithms depends on the number of annotators and the quality of labels. To address the issue of subjectivity in labeling with a single annotator, we introduce a simple and effective approach that aggregates annotations from multiple annotators with varying le…

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: This is a short version submitted to the Midwest Machine Learning Symposium (MMLS 2023), Chicago, IL, USA

  35. arXiv:2303.05739  [pdf, other]

    cs.CV cs.LG

    LEDetection: A Simple Framework for Semi-Supervised Few-Shot Object Detection

    Authors: Phi Vu Tran

    Abstract: Few-shot object detection (FSOD) is a challenging problem aimed at detecting novel concepts from few exemplars. Existing approaches to FSOD all assume abundant base labels to adapt to novel objects. This paper studies the new task of semi-supervised FSOD by considering a realistic scenario in which both base and novel labels are simultaneously scarce. We explore the utility of unlabeled data withi…

    Submitted 14 February, 2024; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: AISTATS 2024. The code is available at https://github.com/lexisnexis-risk-open-source/ledetection

  36. arXiv:2302.12020  [pdf, other]

    cs.LG

    Personalized Privacy-Preserving Framework for Cross-Silo Federated Learning

    Authors: Van-Tuan Tran, Huy-Hieu Pham, Kok-Seng Wong

    Abstract: Federated learning (FL) is recently surging as a promising decentralized deep learning (DL) framework that enables DL-based approaches trained collaboratively across clients without sharing private data. However, in the context of the central party being active and dishonest, the data of individual clients might be perfectly reconstructed, leading to the high possibility of sensitive information b…

    Submitted 22 February, 2023; originally announced February 2023.

  37. arXiv:2302.02031  [pdf, other]

    cs.LG cs.AI cs.CY cs.NI

    Augmenting Rule-based DNS Censorship Detection at Scale with Machine Learning

    Authors: Jacob Brown, Xi Jiang, Van Tran, Arjun Nitin Bhagoji, Nguyen Phong Hoang, Nick Feamster, Prateek Mittal, Vinod Yegneswaran

    Abstract: The proliferation of global censorship has led to the development of a plethora of measurement platforms to monitor and expose it. Censorship of the domain name system (DNS) is a key mechanism used across different countries. It is currently detected by applying heuristics to samples of DNS queries and responses (probes) for specific destinations. These heuristics, however, are both platform-speci…

    Submitted 15 June, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: To appear in Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '23)

  38. arXiv:2301.13169  [pdf, other]

    quant-ph cs.LG physics.comp-ph

    Improved machine learning algorithm for predicting ground state properties

    Authors: Laura Lewis, Hsin-Yuan Huang, Viet T. Tran, Sebastian Lehner, Richard Kueng, John Preskill

    Abstract: Finding the ground state of a quantum many-body system is a fundamental problem in quantum physics. In this work, we give a classical machine learning (ML) algorithm for predicting ground state properties with an inductive bias encoding geometric locality. The proposed ML model can efficiently predict ground state properties of an $n$-qubit gapped local Hamiltonian after learning from only…

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: 8 pages, 5 figures + 32-page appendix

  39. Attentive Deep Neural Networks for Legal Document Retrieval

    Authors: Ha-Thanh Nguyen, Manh-Kien Phi, Xuan-Bach Ngo, Vu Tran, Le-Minh Nguyen, Minh-Phuong Tu

    Abstract: Legal text retrieval serves as a key component in a wide range of legal text processing tasks such as legal question answering, legal case entailment, and statute law retrieval. The performance of legal text retrieval depends, to a large extent, on the representation of text, both query and legal documents. Based on good representations, a legal text retrieval model can effectively match the query…

    Submitted 12 December, 2022; originally announced December 2022.

    Comments: Preprint version. The official version will be published in Artificial Intelligence and Law journal

  40. arXiv:2212.13898  [pdf, other]

    cs.IR cs.AI cs.LG

    Dense Feature Memory Augmented Transformers for COVID-19 Vaccination Search Classification

    Authors: Jai Gupta, Yi Tay, Chaitanya Kamath, Vinh Q. Tran, Donald Metzler, Shailesh Bavadekar, Mimi Sun, Evgeniy Gabrilovich

    Abstract: With the devastating outbreak of COVID-19, vaccines are one of the crucial lines of defense against mass infection in this global pandemic. Given the protection they provide, vaccines are becoming mandatory in certain social and professional settings. This paper presents a classification model for detecting COVID-19 vaccination related search queries, a machine learning model that is used to gener…

    Submitted 16 December, 2022; originally announced December 2022.

    Comments: EMNLP 2022

    MSC Class: I.2.7

  41. arXiv:2212.09744  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    DSI++: Updating Transformer Memory with New Documents

    Authors: Sanket Vaibhav Mehta, Jai Gupta, Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Jinfeng Rao, Marc Najork, Emma Strubell, Donald Metzler

    Abstract: Differentiable Search Indices (DSIs) encode a corpus of documents in model parameters and use the same model to answer user queries directly. Despite the strong performance of DSI models, deploying them in situations where the corpus changes over time is computationally expensive because reindexing the corpus requires re-training the model. In this work, we introduce DSI++, a continual learning ch…

    Submitted 8 December, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: Accepted at EMNLP 2023 main conference

  42. arXiv:2212.08335  [pdf, other]

    cs.CL

    Law to Binary Tree -- An Formal Interpretation of Legal Natural Language

    Authors: Ha-Thanh Nguyen, Vu Tran, Ngoc-Cam Le, Thi-Thuy Le, Quang-Huy Nguyen, Le-Minh Nguyen, Ken Satoh

    Abstract: Knowledge representation and reasoning in law are essential to facilitate the automation of legal analysis and decision-making tasks. In this paper, we propose a new approach based on legal science, specifically legal taxonomy, for representing and reasoning with legal documents. Our approach interprets the regulations in legal documents as binary trees, which facilitates legal reasoning systems t…

    Submitted 16 December, 2022; originally announced December 2022.

    Comments: LN2FR 2022

  43. arXiv:2212.08037  [pdf, other]

    cs.CL

    Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models

    Authors: Bernd Bohnet, Vinh Q. Tran, Pat Verga, Roee Aharoni, Daniel Andor, Livio Baldini Soares, Massimiliano Ciaramita, Jacob Eisenstein, Kuzman Ganchev, Jonathan Herzig, Kai Hui, Tom Kwiatkowski, Ji Ma, Jianmo Ni, Lierni Sestorain Saralegui, Tal Schuster, William W. Cohen, Michael Collins, Dipanjan Das, Donald Metzler, Slav Petrov, Kellie Webster

    Abstract: Large language models (LLMs) have shown impressive results while requiring little or no direct supervision. Further, there is mounting evidence that LLMs may have potential in information-seeking scenarios. We believe the ability of an LLM to attribute the text that it generates is likely to be crucial in this setting. We formulate and study Attributed QA as a key first step in the development of…

    Submitted 10 February, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

  44. arXiv:2211.08170  [pdf, other]

    cs.CL cs.DB cs.IR cs.LG

    A Comparative Study of Question Answering over Knowledge Bases

    Authors: Khiem Vinh Tran, Hao Phu Phan, Khang Nguyen Duc Quach, Ngan Luu-Thuy Nguyen, Jun Jo, Thanh Tam Nguyen

    Abstract: Question answering over knowledge bases (KBQA) has become a popular approach to help users extract information from knowledge bases. Although several systems exist, choosing one suitable for a particular application scenario is difficult. In this article, we provide a comparative study of six representative KBQA systems on eight benchmark datasets. In that, we study various question types, propert…

    Submitted 15 November, 2022; originally announced November 2022.

  45. Discovery Dynamics: Leveraging Repeated Exposure for User and Music Characterization

    Authors: Bruno Sguerra, Viet-Anh Tran, Romain Hennequin

    Abstract: Repetition in music consumption is a common phenomenon. It is notably more frequent when compared to the consumption of other media, such as books and movies. In this paper, we show that one particularly interesting repetitive behavior arises when users are consuming new items. Users' interest tends to rise with the first repetitions and attains a peak after which interest will decrease with subse…

    Submitted 28 October, 2022; originally announced October 2022.

    Journal ref: In Sixteenth ACM Conference on Recommender Systems (RecSys 2022)

  46. arXiv:2210.14607  [pdf, other]

    cs.CL

    A practical method for occupational skills detection in Vietnamese job listings

    Authors: Viet-Trung Tran, Hai-Nam Cao, Tuan-Dung Cao

    Abstract: Vietnamese labor market has been under an imbalanced development. The number of university graduates is growing, but so is the unemployment rate. This situation is often caused by the lack of accurate and timely labor market information, which leads to skill miss-matches between worker supply and the actual market demands. To build a data monitoring and analytic platform for the labor market, one…

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: 10 pages

  47. An Effective Deep Network for Head Pose Estimation without Keypoints

    Authors: Chien Thai, Viet Tran, Minh Bui, Huong Ninh, Hai Tran

    Abstract: Human head pose estimation is an essential problem in facial analysis in recent years that has a lot of computer vision applications such as gaze estimation, virtual reality, and driver assistance. Because of the importance of the head pose estimation problem, it is necessary to design a compact model to resolve this task in order to reduce the computational cost when deploying on facial analysis-…

    Submitted 24 October, 2022; originally announced October 2022.

    Journal ref: In Proceedings of the 11th International Conference on Pattern Recognition Applications and Methods - ICPRAM 2022, ISBN 978-989-758-549-4; ISSN 2184-4313, pages 90-98

  48. arXiv:2210.13700  [pdf, other]

    eess.AS cs.CL cs.LG

    Does Joint Training Really Help Cascaded Speech Translation?

    Authors: Viet Anh Khoa Tran, David Thulke, Yingbo Gao, Christian Herold, Hermann Ney

    Abstract: Currently, in speech translation, the straightforward approach - cascading a recognition system with a translation system - delivers state-of-the-art results. However, fundamental challenges such as error propagation from the automatic speech recognition system still remain. To mitigate these problems, recently, people turn their attention to direct data and propose various joint training methods.…

    Submitted 24 November, 2022; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022

  49. arXiv:2210.11399  [pdf, other]

    cs.CL cs.AI cs.LG

    Transcending Scaling Laws with 0.1% Extra Compute

    Authors: Yi Tay, Jason Wei, Hyung Won Chung, Vinh Q. Tran, David R. So, Siamak Shakeri, Xavier Garcia, Huaixiu Steven Zheng, Jinfeng Rao, Aakanksha Chowdhery, Denny Zhou, Donald Metzler, Slav Petrov, Neil Houlsby, Quoc V. Le, Mostafa Dehghani

    Abstract: Scaling language models improves performance but comes with significant computational costs. This paper proposes UL2R, a method that substantially improves existing language models and their scaling curves with a relatively tiny amount of extra compute. The key idea is to continue training a state-of-the-art large language model (e.g., PaLM) on a few more steps with UL2's mixture-of-denoiser objec…

    Submitted 16 November, 2022; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: V2 has updated references/related work

  50. arXiv:2209.08868  [pdf, other]

    physics.comp-ph cs.DC hep-ex hep-lat hep-th

    Snowmass 2021 Computational Frontier CompF4 Topical Group Report: Storage and Processing Resource Access

    Authors: W. Bhimji, D. Carder, E. Dart, J. Duarte, I. Fisk, R. Gardner, C. Guok, B. Jayatilaka, T. Lehman, M. Lin, C. Maltzahn, S. McKee, M. S. Neubauer, O. Rind, O. Shadura, N. V. Tran, P. van Gemmeren, G. Watts, B. A. Weaver, F. Würthwein

    Abstract: Computing plays a significant role in all areas of high energy physics. The Snowmass 2021 CompF4 topical group's scope is facilities R&D, where we consider "facilities" as the computing hardware and software infrastructure inside the data centers plus the networking between data centers, irrespective of who owns them, and what policies are applied for using them. In other words, it includes commer…

    Submitted 29 September, 2022; v1 submitted 19 September, 2022; originally announced September 2022.

    Comments: Snowmass 2021 Computational Frontier CompF4 topical group report. v2: Expanded introduction. Updated author list. 52 pages, 6 figures