Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 263 results for author: Laan, T

.
  1. arXiv:2409.03215  [pdf, other

    cs.CL cs.AI cs.LG

    xLAM: A Family of Large Action Models to Empower AI Agent Systems

    Authors: Jianguo Zhang, Tian Lan, Ming Zhu, Zuxin Liu, Thai Hoang, Shirley Kokane, Weiran Yao, Juntao Tan, Akshara Prabhakar, Haolin Chen, Zhiwei Liu, Yihao Feng, Tulika Awalgaonkar, Rithesh Murthy, Eric Hu, Zeyuan Chen, Ran Xu, Juan Carlos Niebles, Shelby Heinecke, Huan Wang, Silvio Savarese, Caiming Xiong

    Abstract: Autonomous agents powered by large language models (LLMs) have attracted significant research interest. However, the open-source community faces many challenges in developing specialized models for agent tasks, driven by the scarcity of high-quality agent datasets and the absence of standard protocols in this area. We introduce and publicly release xLAM, a series of large action models designed fo… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

    Comments: Technical report for the Salesforce xLAM model series

  2. arXiv:2409.00115  [pdf

    eess.SP cs.AI cs.ET cs.LG

    Quantum Kernel Principal Components Analysis for Compact Readout of Chemiresistive Sensor Arrays

    Authors: Zeheng Wang, Timothy van der Laan, Muhammad Usman

    Abstract: The rapid growth of Internet of Things (IoT) devices necessitates efficient data compression techniques to handle the vast amounts of data generated by these devices. In this context, chemiresistive sensor arrays (CSAs), a simple-to-fabricate but crucial component in IoT systems, generate large volumes of data due to their simultaneous multi-sensor operations. Classical principal component analysi… ▽ More

    Submitted 28 August, 2024; originally announced September 2024.

  3. arXiv:2408.09456  [pdf, other

    cs.AR cs.AI cs.ET cs.LG

    In-Memory Learning Automata Architecture using Y-Flash Cell

    Authors: Omar Ghazal, Tian Lan, Shalman Ojukwu, Komal Krishnamurthy, Alex Yakovlev, Rishad Shafik

    Abstract: The modern implementation of machine learning architectures faces significant challenges due to frequent data transfer between memory and processing units. In-memory computing, primarily through memristor-based analog computing, offers a promising solution to overcome this von Neumann bottleneck. In this technology, data processing and storage are located inside the memory. Here, we introduce a no… ▽ More

    Submitted 18 August, 2024; originally announced August 2024.

  4. arXiv:2408.07094  [pdf

    cs.LG stat.ML

    Overcoming Imbalanced Safety Data Using Extended Accident Triangle

    Authors: Kailai Sun, Tianxiang Lan, Yang Miang Goh, Yueng-Hsiang Huang

    Abstract: There is growing interest in using safety analytics and machine learning to support the prevention of workplace incidents, especially in high-risk industries like construction and trucking. Although existing safety analytics studies have made remarkable progress, they suffer from imbalanced datasets, a common problem in safety analytics, resulting in prediction inaccuracies. This can lead to manag… ▽ More

    Submitted 11 August, 2024; originally announced August 2024.

  5. arXiv:2408.07060  [pdf, other

    cs.SE cs.AI cs.CL cs.LG

    Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents

    Authors: Kexun Zhang, Weiran Yao, Zuxin Liu, Yihao Feng, Zhiwei Liu, Rithesh Murthy, Tian Lan, Lei Li, Renze Lou, Jiacheng Xu, Bo Pang, Yingbo Zhou, Shelby Heinecke, Silvio Savarese, Huan Wang, Caiming Xiong

    Abstract: Large language model (LLM) agents have shown great potential in solving real-world software engineering (SWE) problems. The most advanced open-source SWE agent can resolve over 27% of real GitHub issues in SWE-Bench Lite. However, these sophisticated agent frameworks exhibit varying strengths, excelling in certain tasks while underperforming in others. To fully harness the diversity of these agent… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

  6. arXiv:2408.00930  [pdf, other

    cs.LG cs.AI

    Enabling High Data Throughput Reinforcement Learning on GPUs: A Domain Agnostic Framework for Data-Driven Scientific Research

    Authors: Tian Lan, Huan Wang, Caiming Xiong, Silvio Savarese

    Abstract: We introduce WarpSci, a domain agnostic framework designed to overcome crucial system bottlenecks encountered in the application of reinforcement learning to intricate environments with vast datasets featuring high-dimensional observation or action spaces. Notably, our framework eliminates the need for data transfer between the CPU and GPU, enabling the concurrent execution of thousands of simulat… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

  7. arXiv:2407.17809  [pdf, other

    astro-ph.GA

    Tracing the evolution of the cool gas in CGM and IGM environments through Mg II absorption from redshift z=0.75 to z=1.65 using DESI-Y1 data

    Authors: X. Wu, Z. Cai, T. -W. Lan, S. Zou, A. Anand, Biprateep Dey, Z. Li, J. Aguilar, S. Ahlen, D. Brooks, T. Claybaugh, A. de la Macorra, P. Doel, S. Ferraro, J. E. Forero-Romero, S. Gontcho A Gontcho, K. Honscheid, S. Juneau, R. Kehoe, T. Kisner, A. Lambert, M. Landriau, L. Le Guillou, M. Manera, A. Meisner , et al. (13 additional authors not shown)

    Abstract: We present a measurement of the mean absorption of cool gas traced by Mg II (${λλ2796, 2803}$) around emission line galaxies (ELGs), spanning spatial scales from 20 kpc to 10 Mpc. The measurement is based on cross-matching the positions of about 2.5 million ELGs at $z = 0.75-1.65$ and the metal absorption in the spectra of 1.4 million background quasars with data provided by the Year 1 sample of t… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

  8. arXiv:2407.11477  [pdf, other

    cs.LG cs.AI

    XTraffic: A Dataset Where Traffic Meets Incidents with Explainability and More

    Authors: Xiaochuan Gou, Ziyue Li, Tian Lan, Junpeng Lin, Zhishuai Li, Bingyu Zhao, Chen Zhang, Di Wang, Xiangliang Zhang

    Abstract: Long-separated research has been conducted on two highly correlated tracks: traffic and incidents. Traffic track witnesses complicating deep learning models, e.g., to push the prediction a few percent more accurate, and the incident track only studies the incidents alone, e.g., to infer the incident risk. We, for the first time, spatiotemporally aligned the two tracks in a large-scale region (16,9… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  9. arXiv:2407.02031  [pdf, other

    cs.DC cs.AI cs.LG

    SwiftDiffusion: Efficient Diffusion Model Serving with Add-on Modules

    Authors: Suyi Li, Lingyun Yang, Xiaoxiao Jiang, Hanfeng Lu, Zhipeng Di, Weiyi Lu, Jiawei Chen, Kan Liu, Yinghao Yu, Tao Lan, Guodong Yang, Lin Qu, Liping Zhang, Wei Wang

    Abstract: This paper documents our characterization study and practices for serving text-to-image requests with stable diffusion models in production. We first comprehensively analyze inference request traces for commercial text-to-image applications. It commences with our observation that add-on modules, i.e., ControlNets and LoRAs, that augment the base stable diffusion models, are ubiquitous in generatin… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  10. arXiv:2406.18518  [pdf, other

    cs.CL cs.AI cs.LG cs.SE

    APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets

    Authors: Zuxin Liu, Thai Hoang, Jianguo Zhang, Ming Zhu, Tian Lan, Shirley Kokane, Juntao Tan, Weiran Yao, Zhiwei Liu, Yihao Feng, Rithesh Murthy, Liangwei Yang, Silvio Savarese, Juan Carlos Niebles, Huan Wang, Shelby Heinecke, Caiming Xiong

    Abstract: The advancement of function-calling agent models requires diverse, reliable, and high-quality datasets. This paper presents APIGen, an automated data generation pipeline designed to synthesize verifiable high-quality datasets for function-calling applications. We leverage APIGen and collect 3,673 executable APIs across 21 different categories to generate diverse function-calling datasets in a scal… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  11. arXiv:2405.19878  [pdf, other

    cs.LG cs.GT

    Learning from Random Demonstrations: Offline Reinforcement Learning with Importance-Sampled Diffusion Models

    Authors: Zeyu Fang, Tian Lan

    Abstract: Generative models such as diffusion have been employed as world models in offline reinforcement learning to generate synthetic data for more effective learning. Existing work either generates diffusion models one-time prior to training or requires additional interaction data to update it. In this paper, we propose a novel approach for offline reinforcement learning with closed-loop policy evaluati… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  12. arXiv:2405.16657  [pdf, other

    astro-ph.CO

    ELG Spectroscopic Systematics Analysis of the DESI Data Release 1

    Authors: Jiaxi Yu, Ashley J. Ross, Antoine Rocher, Otávio Alves, Arnaud de Mattia, Daniel Forero-Sánchez, Jean-Paul Kneib, Alex Krolewski, TingWen Lan, Michael Rashkovetskyi, Jessica Nicole Aguilar, Steven Ahlen, Stephen Bailey, David Brooks, Edmond Chaussidon, Todd Claybaugh, Axel de la Macorra, Arjun Dey, Biprateep Dey, Peter Doel, Kevin Fanning, Jaime E. Forero-Romero, Enrique Gaztañaga, Satya Gontcho A Gontcho, Klaus Honscheid , et al. (36 additional authors not shown)

    Abstract: Dark Energy Spectroscopic Instrument (DESI) uses more than 2.4 million Emission Line Galaxies (ELGs) for 3D large-scale structure (LSS) analyses in its Data Release 1 (DR1). Such large statistics enable thorough research on systematic uncertainties. In this study, we focus on spectroscopic systematics of ELGs. The redshift success rate ($f_{\rm goodz}$) is the relative fraction of secure redshifts… ▽ More

    Submitted 26 August, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

  13. arXiv:2405.16386  [pdf, other

    cs.LG cs.AI

    Variational Offline Multi-agent Skill Discovery

    Authors: Jiayu Chen, Bhargav Ganguly, Tian Lan, Vaneet Aggarwal

    Abstract: Skills are effective temporal abstractions established for sequential decision making tasks, which enable efficient hierarchical learning for long-horizon tasks and facilitate multi-task learning through their transferability. Despite extensive research, research gaps remain in multi-agent scenarios, particularly for automatically extracting subgroup coordination patterns in a multi-agent task. In… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  14. arXiv:2405.14122  [pdf, other

    cs.GT

    Modeling Other Players with Bayesian Beliefs for Games with Incomplete Information

    Authors: Zuyuan Zhang, Mahdi Imani, Tian Lan

    Abstract: Bayesian games model interactive decision-making where players have incomplete information -- e.g., regarding payoffs and private data on players' strategies and preferences -- and must actively reason and update their belief models (with regard to such information) using observation and interaction history. Existing work on counterfactual regret minimization have shown great success for games wit… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2105.08440 by other authors

  15. arXiv:2405.13748  [pdf, other

    cs.CV

    Monocular Gaussian SLAM with Language Extended Loop Closure

    Authors: Tian Lan, Qinwei Lin, Haoqian Wang

    Abstract: Recently,3DGaussianSplattinghasshowngreatpotentialin visual Simultaneous Localization And Mapping (SLAM). Existing methods have achieved encouraging results on RGB-D SLAM, but studies of the monocular case are still scarce. Moreover, they also fail to correct drift errors due to the lack of loop closure and global optimization. In this paper, we present MG-SLAM, a monocular Gaussian SLAM with a la… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  16. arXiv:2405.08314  [pdf, other

    astro-ph.GA

    Probing the impact of radio-mode feedback on the properties of the cool circumgalactic medium

    Authors: Yu-Ling Chang, Ting-Wen Lan, J. Xavier Prochaska, Lucas Napolitano, Abhijeet Anand, J. Aguilar, S. Ahlen, D. Brooks, T. Claybaugh, A. de la Macorra, Arjun Dey, P. Doel, S. Gontcho A Gontcho, J. Guy, S. Juneau, T. Kisner, A. Lambert, M. Landriau, L. Le Guillou, M. Manera, P. Martini, A. Meisner, R. Miquel, J. Moustakas, A. D. Myers , et al. (11 additional authors not shown)

    Abstract: We explore the influence of radio-mode feedback on the properties of the cool circumgalactic medium (CGM). To this end, we assemble a statistical sample of approximately 30,000 radio galaxies with background quasars by combining optical spectroscopic measurements of luminous red galaxies (LRGs) and quasars from the year 1 dataset of Dark Energy Spectroscopic Instrument (DESI) and radio sources fro… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 20 pages, 12 figures

  17. arXiv:2405.03967  [pdf, other

    cs.LG cs.AI cs.AR

    SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems

    Authors: Kailash Gogineni, Sai Santosh Dayapule, Juan Gómez-Luna, Karthikeya Gogineni, Peng Wei, Tian Lan, Mohammad Sadrosadati, Onur Mutlu, Guru Venkataramani

    Abstract: Reinforcement Learning (RL) trains agents to learn optimal behavior by maximizing reward signals from experience datasets. However, RL training often faces memory limitations, leading to execution latencies and prolonged training times. To overcome this, SwiftRL explores Processing-In-Memory (PIM) architectures to accelerate RL workloads. We achieve near-linear performance scaling by implementing… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  18. arXiv:2404.13836  [pdf, other

    stat.ME

    MultiFun-DAG: Multivariate Functional Directed Acyclic Graph

    Authors: Tian Lan, Ziyue Li, Junpeng Lin, Zhishuai Li, Lei Bai, Man Li, Fugee Tsung, Rui Zhao, Chen Zhang

    Abstract: Directed Acyclic Graphical (DAG) models efficiently formulate causal relationships in complex systems. Traditional DAGs assume nodes to be scalar variables, characterizing complex systems under a facile and oversimplified form. This paper considers that nodes can be multivariate functional data and thus proposes a multivariate functional DAG (MultiFun-DAG). It constructs a hidden bilinear multivar… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  19. arXiv:2404.03002  [pdf, other

    astro-ph.CO

    DESI 2024 VI: Cosmological Constraints from the Measurements of Baryon Acoustic Oscillations

    Authors: DESI Collaboration, A. G. Adame, J. Aguilar, S. Ahlen, S. Alam, D. M. Alexander, M. Alvarez, O. Alves, A. Anand, U. Andrade, E. Armengaud, S. Avila, A. Aviles, H. Awan, B. Bahr-Kalus, S. Bailey, C. Baltay, A. Bault, J. Behera, S. BenZvi, A. Bera, F. Beutler, D. Bianchi, C. Blake, R. Blum , et al. (178 additional authors not shown)

    Abstract: We present cosmological results from the measurement of baryon acoustic oscillations (BAO) in galaxy, quasar and Lyman-$α$ forest tracers from the first year of observations from the Dark Energy Spectroscopic Instrument (DESI), to be released in the DESI Data Release 1. DESI BAO provide robust measurements of the transverse comoving distance and Hubble rate, or their combination, relative to the s… ▽ More

    Submitted 24 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: This DESI Collaboration Key Publication is part of the 2024 publication series using the first year of observations (see https://data.desi.lbl.gov/doc/papers). Typos corrected and a new figure and discussion added to Appendix A

  20. arXiv:2404.03001  [pdf, other

    astro-ph.CO

    DESI 2024 IV: Baryon Acoustic Oscillations from the Lyman Alpha Forest

    Authors: DESI Collaboration, A. G. Adame, J. Aguilar, S. Ahlen, S. Alam, D. M. Alexander, M. Alvarez, O. Alves, A. Anand, U. Andrade, E. Armengaud, S. Avila, A. Aviles, H. Awan, S. Bailey, C. Baltay, A. Bault, J. Bautista, J. Behera, S. BenZvi, F. Beutler, D. Bianchi, C. Blake, R. Blum, S. Brieden , et al. (174 additional authors not shown)

    Abstract: We present the measurement of Baryon Acoustic Oscillations (BAO) from the Lyman-$α$ (Ly$α$) forest of high-redshift quasars with the first-year dataset of the Dark Energy Spectroscopic Instrument (DESI). Our analysis uses over $420\,000$ Ly$α$ forest spectra and their correlation with the spatial distribution of more than $700\,000$ quasars. An essential facet of this work is the development of a… ▽ More

    Submitted 12 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: This DESI Collaboration Key Publication is part of the 2024 publication series using the first year of observations (see https://data.desi.lbl.gov/doc/papers)

  21. arXiv:2404.03000  [pdf, other

    astro-ph.CO

    DESI 2024 III: Baryon Acoustic Oscillations from Galaxies and Quasars

    Authors: DESI Collaboration, A. G. Adame, J. Aguilar, S. Ahlen, S. Alam, D. M. Alexander, M. Alvarez, O. Alves, A. Anand, U. Andrade, E. Armengaud, S. Avila, A. Aviles, H. Awan, S. Bailey, C. Baltay, A. Bault, J. Behera, S. BenZvi, F. Beutler, D. Bianchi, C. Blake, R. Blum, S. Brieden, A. Brodzeller , et al. (171 additional authors not shown)

    Abstract: We present the DESI 2024 galaxy and quasar baryon acoustic oscillations (BAO) measurements using over 5.7 million unique galaxy and quasar redshifts in the range 0.1<z<2.1. Divided by tracer type, we utilize 300,017 galaxies from the magnitude-limited Bright Galaxy Survey with 0.1<z<0.4, 2,138,600 Luminous Red Galaxies with 0.4<z<1.1, 2,432,022 Emission Line Galaxies with 0.8<z<1.6, and 856,652 qu… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: This DESI Collaboration Key Publication is part of the 2024 publication series using the first year of observations (see https://data.desi.lbl.gov/doc/papers)

  22. arXiv:2403.15341  [pdf, other

    cs.AI cs.MA

    Collaborative AI Teaming in Unknown Environments via Active Goal Deduction

    Authors: Zuyuan Zhang, Hanhan Zhou, Mahdi Imani, Taeyoung Lee, Tian Lan

    Abstract: With the advancements of artificial intelligence (AI), we're seeing more scenarios that require AI to work closely with other agents, whose goals and strategies might not be known beforehand. However, existing approaches for training collaborative agents often require defined and known reward signals and cannot address the problem of teaming with unknown agents that often have latent objectives/re… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  23. arXiv:2403.01954  [pdf, other

    cs.CL cs.AI cs.LO

    DECIDER: A Dual-System Rule-Controllable Decoding Framework for Language Generation

    Authors: Chen Xu, Tian Lan, Changlong Yu, Wei Wang, Jun Gao, Yu Ji, Qunxi Dong, Kun Qian, Piji Li, Wei Bi, Bin Hu

    Abstract: Constrained decoding approaches aim to control the meaning or style of text generated by a Pre-trained Language Model (PLM) using specific target words during inference. However, these methods often guide plausible continuations by greedily selecting targets, which, while completing the task, may disrupt the natural patterns of human language generation. In this work, we propose a novel decoding f… ▽ More

    Submitted 7 July, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: Submitted to IEEE TKDE (Major Revision), 13 pages, 6 figures

  24. arXiv:2403.01890  [pdf, other

    cs.RO

    Aerial Tensile Perching and Disentangling Mechanism for Long-Term Environmental Monitoring

    Authors: Tian Lan, Luca Romanello, Mirko Kovac, Sophie F. Armanini, Basaran Bahadir Kocer

    Abstract: Aerial robots show significant potential for forest canopy research and environmental monitoring by providing data collection capabilities at high spatial and temporal resolutions. However, limited flight endurance hinders their application. Inspired by natural perching behaviours, we propose a multi-modal aerial robot system that integrates tensile perching for energy conservation and a suspended… ▽ More

    Submitted 5 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: 7 pages, 8 figures, Accepted in IEEE International Conference on Robotics and Automation (ICRA) 2024

  25. arXiv:2403.01642  [pdf

    cs.LG cs.CE eess.SY

    Blue and Green-Mode Energy-Efficient Chemiresistive Sensor Array Realized by Rapid Ensemble Learning

    Authors: Zeheng Wang, James Cooper, Muhammad Usman, Timothy van der Laan

    Abstract: The rapid advancement of Internet of Things (IoT) necessitates the development of optimized Chemiresistive Sensor (CRS) arrays that are both energy-efficient and capable. This study introduces a novel optimization strategy that employs a rapid ensemble learning-based model committee approach to achieve these goals. Utilizing machine learning models such as Elastic Net Regression, Random Forests, a… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: First version before submission

  26. arXiv:2403.01577  [pdf, ps, other

    cond-mat.str-el math-ph

    Torus algebra and logical operators at low energy

    Authors: Ying Chan, Tian Lan, Linqian Wu

    Abstract: Given a modular tensor category $\mathscr{C}$, we construct an associative algebra $\mathrm{Tor({\mathscr{C}}})$, which we call the torus algebra. We prove that the torus algebra is semisimple by explicitly constructing all the simple modules. Suppose that a topological ordered phase described by $\mathscr{C}$ is put on a torus. Physically, each simple module over $\mathrm{Tor({\mathscr{C}}})$ con… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: 22 pages, 1 figure

  27. arXiv:2402.19253  [pdf, ps, other

    cond-mat.str-el hep-th

    Condensation Completion and Defects in 2+1D Topological Orders

    Authors: Gen Yue, Longye Wang, Tian Lan

    Abstract: We review the condensation completion of a modular tensor category, which yields a fusion 2-category of codimension-1 and higher defects in a $2+1$D topological order. We apply the condensation completion to $2+1$D toric code model and a $\mathbbm Z_4$ chiral topological order. In both cases, we explicitly enumerate the $1$d and $0$d defects present in these topological orders, along with their fu… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  28. arXiv:2402.15538  [pdf, other

    cs.MA cs.AI

    AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System

    Authors: Zhiwei Liu, Weiran Yao, Jianguo Zhang, Liangwei Yang, Zuxin Liu, Juntao Tan, Prafulla K. Choubey, Tian Lan, Jason Wu, Huan Wang, Shelby Heinecke, Caiming Xiong, Silvio Savarese

    Abstract: The booming success of LLMs initiates rapid development in LLM agents. Though the foundation of an LLM agent is the generative model, it is critical to devise the optimal reasoning strategies and agent architectures. Accordingly, LLM agent research advances from the simple chain-of-thought prompting to more complex ReAct and Reflection reasoning strategy; agent architecture also evolves from singl… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: preprint. Library is available at https://github.com/SalesforceAIResearch/AgentLite

  29. arXiv:2402.15506  [pdf, other

    cs.AI cs.CL cs.LG

    AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning

    Authors: Jianguo Zhang, Tian Lan, Rithesh Murthy, Zhiwei Liu, Weiran Yao, Juntao Tan, Thai Hoang, Liangwei Yang, Yihao Feng, Zuxin Liu, Tulika Awalgaonkar, Juan Carlos Niebles, Silvio Savarese, Shelby Heinecke, Huan Wang, Caiming Xiong

    Abstract: Autonomous agents powered by large language models (LLMs) have garnered significant research attention. However, fully harnessing the potential of LLMs for agent-based tasks presents inherent challenges due to the heterogeneous nature of diverse data sources featuring multi-turn trajectories. In this paper, we introduce \textbf{AgentOhana} as a comprehensive solution to address these challenges. \… ▽ More

    Submitted 20 March, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Add GitHub repo link at \url{https://github.com/SalesforceAIResearch/xLAM} and HuggingFace model link at \url{https://huggingface.co/Salesforce/xLAM-v0.1-r}

  30. arXiv:2402.13777  [pdf, other

    cs.LG cs.AI

    Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions

    Authors: Jiayu Chen, Bhargav Ganguly, Yang Xu, Yongsheng Mei, Tian Lan, Vaneet Aggarwal

    Abstract: Deep generative models (DGMs) have demonstrated great success across various domains, particularly in generating texts, images, and videos using models trained from offline data. Similarly, data-driven decision-making and robotic control also necessitate learning a generator function from the offline data to serve as the strategy or policy. In this case, applying deep generative models in offline… ▽ More

    Submitted 25 May, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: We restructured the paper and added more discussion

  31. arXiv:2402.13764  [pdf, other

    cs.CL cs.AI

    CriticBench: Evaluating Large Language Models as Critic

    Authors: Tian Lan, Wenwei Zhang, Chen Xu, Heyan Huang, Dahua Lin, Kai Chen, Xian-ling Mao

    Abstract: Critique ability are crucial in the scalable oversight and self-improvement of Large Language Models (LLMs). While many recent studies explore the critique ability of LLMs to judge and refine flaws in generations, how to comprehensively and reliably measure the critique abilities of LLMs is under-explored. This paper introduces CriticBench, a novel benchmark designed to comprehensively and reliabl… ▽ More

    Submitted 22 February, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  32. arXiv:2402.12417  [pdf

    cs.LG cs.AI

    Predicting trucking accidents with truck drivers 'safety climate perception across companies: A transfer learning approach

    Authors: Kailai Sun, Tianxiang Lan, Say Hong Kam, Yang Miang Goh, Yueng-Hsiang Huang

    Abstract: There is a rising interest in using artificial intelligence (AI)-powered safety analytics to predict accidents in the trucking industry. Companies may face the practical challenge, however, of not having enough data to develop good safety analytics models. Although pretrained models may offer a solution for such companies, existing safety research using transfer learning has mostly focused on comp… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: submitted to journal: accident analysis and prevention

  33. arXiv:2402.10941  [pdf, other

    cs.CL cs.AI cs.LG

    Text2Data: Low-Resource Data Generation with Textual Control

    Authors: Shiyu Wang, Yihao Feng, Tian Lan, Ning Yu, Yu Bai, Ran Xu, Huan Wang, Caiming Xiong, Silvio Savarese

    Abstract: Natural language serves as a common and straightforward control signal for humans to interact seamlessly with machines. Recognizing the importance of this interface, the machine learning community is investing considerable effort in generating data that is semantically coherent with textual instructions. While strides have been made in text-to-data generation spanning image editing, audio synthesi… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: We propose a method that can achieve text-to-data generation under low-resource situation

  34. Effects of Magnetic Helicity on 3D Equilibria and Self-Organized States in KTX Reversed Field Pinch

    Authors: Ke Liu, Guodong Yu, Yuhua Huang, Wenzhe Mao, Yidong Xie, Xianyi Nie, Hong Li, Tao Lan, Jinlin Xie, Weixing Ding, Wandong Liu, Ge Zhuang, Caoxiang Zhu

    Abstract: The RFP is a toroidal magnetic configuration in which plasmas can spontaneously transform into different self-organized states. Among various states, the QSH state has a dominant component for the magnetic field and significantly improves confinement. Many theoretical and experimental efforts have investigated the transitions among different states. This paper employs the MRxMHD model to study the… ▽ More

    Submitted 6 April, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

  35. arXiv:2401.14544  [pdf, other

    cs.LG math.FA math.PR

    Bayesian Optimization through Gaussian Cox Process Models for Spatio-temporal Data

    Authors: Yongsheng Mei, Mahdi Imani, Tian Lan

    Abstract: Bayesian optimization (BO) has established itself as a leading strategy for efficiently optimizing expensive-to-evaluate functions. Existing BO methods mostly rely on Gaussian process (GP) surrogate models and are not applicable to (doubly-stochastic) Gaussian Cox processes, where the observation process is modulated by a latent intensity function modeled as a GP. In this paper, we propose a novel… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 2024 International Conference on Learning Representations (ICLR)

  36. arXiv:2312.15958  [pdf, ps, other

    cond-mat.str-el hep-th math-ph

    Category of SET orders

    Authors: Tian Lan, Gen Yue, Longye Wang

    Abstract: We propose the representation principle to study physical systems with a given symmetry. In the context of symmetry enriched topological orders, we give the appropriate representation category, the category of SET orders. For fusion n-category symmetries, we show that the category of SET orders encodes almost all information about the interplay between symmetry and topological orders, in a natural… ▽ More

    Submitted 18 July, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: 37 pages, 8 figures, 1 table. Theorem 3.27 and Corollary 3.28 are added, clarifying the relation between center and Morita classes of higher fusion categories

  37. arXiv:2312.15947  [pdf, ps, other

    hep-th math-ph

    On a class of fusion 2-category symmetry: condensation completion of braided fusion category

    Authors: Wenjie Xi, Tian Lan, Longye Wang, Chenjie Wang, Wei-Qiang Chen

    Abstract: Recently, many studies are focused on generalized global symmetry, a mixture of both invertible and non-invertible symmetries in various space-time dimensions. The complete structure of generalized global symmetry is described by higher fusion category theory. In this paper, We first review the construction of fusion 2-category symmetry $Σ\cal B$ where $\cal B$ is a a braided fusion category. In p… ▽ More

    Submitted 10 May, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: 42 pages, 3 figures, All the 10j-symbols of $Σ\mathrm{sVec}$ and the complete computer program has been uploaded on github: https://github.com/WJXI/2sVec.git

  38. arXiv:2312.15555  [pdf, other

    cs.MA

    ConcaveQ: Non-Monotonic Value Function Factorization via Concave Representations in Deep Multi-Agent Reinforcement Learning

    Authors: Huiqun Li, Hanhan Zhou, Yifei Zou, Dongxiao Yu, Tian Lan

    Abstract: Value function factorization has achieved great success in multi-agent reinforcement learning by optimizing joint action-value functions through the maximization of factorized per-agent utilities. To ensure Individual-Global-Maximum property, existing works often focus on value factorization using monotonic functions, which are known to result in restricted representation expressiveness. In this p… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: Accepted at AAAI 2024

    Journal ref: AAAI 2024

  39. arXiv:2312.11742  [pdf, other

    cs.DC cs.AR cs.LG cs.NI

    ACCL+: an FPGA-Based Collective Engine for Distributed Applications

    Authors: Zhenhao He, Dario Korolija, Yu Zhu, Benjamin Ramhorst, Tristan Laan, Lucian Petrica, Michaela Blott, Gustavo Alonso

    Abstract: FPGAs are increasingly prevalent in cloud deployments, serving as Smart NICs or network-attached accelerators. Despite their potential, developing distributed FPGA-accelerated applications remains cumbersome due to the lack of appropriate infrastructure and communication abstractions. To facilitate the development of distributed applications with FPGAs, in this paper we propose ACCL+, an open-sour… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  40. arXiv:2312.07696  [pdf, ps, other

    cs.CR cs.AI

    Real-time Network Intrusion Detection via Decision Transformers

    Authors: Jingdi Chen, Hanhan Zhou, Yongsheng Mei, Gina Adam, Nathaniel D. Bastian, Tian Lan

    Abstract: Many cybersecurity problems that require real-time decision-making based on temporal observations can be abstracted as a sequence modeling problem, e.g., network intrusion detection from a sequence of arriving packets. Existing approaches like reinforcement learning may not be suitable for such cybersecurity decision problems, since the Markovian property may not necessarily hold and the underlyin… ▽ More

    Submitted 16 December, 2023; v1 submitted 12 December, 2023; originally announced December 2023.

  41. arXiv:2312.07060  [pdf, other

    cs.DC

    Layered Randomized Quantization for Communication-Efficient and Privacy-Preserving Distributed Learning

    Authors: Guangfeng Yan, Tan Li, Tian Lan, Kui Wu, Linqi Song

    Abstract: Next-generation wireless networks, such as edge intelligence and wireless distributed learning, face two critical challenges: communication efficiency and privacy protection. In this work, our focus is on addressing these issues in a distributed learning framework. We consider a new approach that simultaneously achieves communication efficiency and privacy protection by exploiting the privacy adva… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  42. arXiv:2312.02515  [pdf, other

    cs.LG cs.AI

    ASPEN: High-Throughput LoRA Fine-Tuning of Large Language Models with a Single GPU

    Authors: Zhengmao Ye, Dengchun Li, Jingqi Tian, Tingfeng Lan, Jie Zuo, Lei Duan, Hui Lu, Yexi Jiang, Jian Sha, Ke Zhang, Mingjie Tang

    Abstract: Transformer-based large language models (LLMs) have demonstrated outstanding performance across diverse domains, particularly when fine-turned for specific domains. Recent studies suggest that the resources required for fine-tuning LLMs can be economized through parameter-efficient methods such as Low-Rank Adaptation (LoRA). While LoRA effectively reduces computational burdens and resource demands… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 14 pages, 14 figures

  43. arXiv:2311.17630  [pdf, other

    cs.NI eess.SP

    Optimization in Mobile Augmented Reality Systems for the Metaverse over Wireless Communications

    Authors: Tianming Lan, Jun Zhao

    Abstract: As the essential technical support for Metaverse, Mobile Augmented Reality (MAR) has attracted the attention of many researchers. MAR applications rely on real-time processing of visual and audio data, and thus those heavy workloads can quickly drain the battery of a mobile device. To address such problem, edge-based solutions have appeared for handling some tasks that require more computing power… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: This paper appears in IEEE Global Communications Conference (GLOBECOM) 2023

  44. arXiv:2311.16018  [pdf, other

    cs.CR cs.AI

    RIDE: Real-time Intrusion Detection via Explainable Machine Learning Implemented in a Memristor Hardware Architecture

    Authors: Jingdi Chen, Lei Zhang, Joseph Riem, Gina Adam, Nathaniel D. Bastian, Tian Lan

    Abstract: Deep Learning (DL) based methods have shown great promise in network intrusion detection by identifying malicious network traffic behavior patterns with high accuracy, but their applications to real-time, packet-level detections in high-speed communication networks are challenging due to the high computation time and resource requirements of Deep Neural Networks (DNNs), as well as lack of explaina… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  45. arXiv:2311.13235  [pdf

    physics.app-ph physics.chem-ph

    Strong Light-Matter Coupling Facilitated Charge Carrier Transport in Cavity Organic Solar Cells

    Authors: Yahui Tang, Alexandra Stuart, Timothy van der Laan, Girish Lakhwani

    Abstract: Strong light-matter coupling has shown great potential for modifying the electro-optical properties of semiconducting materials in recent years. In the strong coupling regime, excitons and cavity photons form new states named exciton-polaritons, with their properties a hybrid of each constituent. Herein, we report strong coupling observed in solution-processed donor:acceptor bulk-heterojunction or… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  46. arXiv:2310.19841  [pdf

    cs.LG

    An interpretable clustering approach to safety climate analysis: examining driver group distinction in safety climate perceptions

    Authors: Kailai Sun, Tianxiang Lan, Yang Miang Goh, Sufiana Safiena, Yueng-Hsiang Huang, Bailey Lytle, Yimin He

    Abstract: The transportation industry, particularly the trucking sector, is prone to workplace accidents and fatalities. Accidents involving large trucks accounted for a considerable percentage of overall traffic fatalities. Recognizing the crucial role of safety climate in accident prevention, researchers have sought to understand its factors and measure its impact within organizations. While existing data… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Submitted to Journal:Accident Analysis and Prevention

  47. arXiv:2310.10226  [pdf, other

    cs.CL

    Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective

    Authors: Huayang Li, Tian Lan, Zihao Fu, Deng Cai, Lemao Liu, Nigel Collier, Taro Watanabe, Yixuan Su

    Abstract: There are a number of diverging hypotheses about the neural text degeneration problem, i.e., generating repetitive and dull loops, which makes this problem both interesting and confusing. In this work, we aim to advance our understanding by presenting a straightforward and fundamental explanation from the data perspective. Our preliminary investigation reveals a strong correlation between the dege… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  48. arXiv:2310.08670  [pdf, other

    cs.LG cs.DC

    Every Parameter Matters: Ensuring the Convergence of Federated Learning with Dynamic Heterogeneous Models Reduction

    Authors: Hanhan Zhou, Tian Lan, Guru Venkataramani, Wenbo Ding

    Abstract: Cross-device Federated Learning (FL) faces significant challenges where low-end clients that could potentially make unique contributions are excluded from training large models due to their resource bottlenecks. Recent research efforts have focused on model-heterogeneous FL, by extracting reduced-size models from the global model and applying them to local clients accordingly. Despite the empirica… ▽ More

    Submitted 26 October, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: Accepted at NeurIPS 2023

  49. arXiv:2309.12606  [pdf, other

    math.NA

    Stable Reconstruction of Anisotropic Objects from Near-Field Electromagnetic Data

    Authors: Tran H. Lan, Dinh-Liem Nguyen

    Abstract: This paper addresses the electromagnetic inverse scattering problem of determining the location and shape of anisotropic objects from near-field data. We investigate both cases involving the Helmholtz equation and Maxwell's equations for this inverse problem. Our study focuses on developing efficient imaging functionals that enable a fast and stable recovery of the anisotropic object. The implemen… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: 22 pages

  50. arXiv:2309.04707  [pdf, other

    cs.AI cs.LG

    Advantage Actor-Critic with Reasoner: Explaining the Agent's Behavior from an Exploratory Perspective

    Authors: Muzhe Guo, Feixu Yu, Tian Lan, Fang Jin

    Abstract: Reinforcement learning (RL) is a powerful tool for solving complex decision-making problems, but its lack of transparency and interpretability has been a major challenge in domains where decisions have significant real-world consequences. In this paper, we propose a novel Advantage Actor-Critic with Reasoner (A2CR), which can be easily applied to Actor-Critic-based RL models and make them interpre… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.