Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 165 results for author: Hong, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.19871  [pdf, ps, other

    cs.CR cs.NI

    Fast Private Location-based Information Retrieval Over the Torus

    Authors: Joon Soo Yoo, Mi Yeon Hong, Ji Won Heo, Kang Hoon Lee, Ji Won Yoon

    Abstract: Location-based services offer immense utility, but also pose significant privacy risks. In response, we propose LocPIR, a novel framework using homomorphic encryption (HE), specifically the TFHE scheme, to preserve user location privacy when retrieving data from public clouds. Our system employs TFHE's expertise in non-polynomial evaluations, crucial for comparison operations. LocPIR showcases min… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: Accepted at the IEEE International Conference on Advanced Video and Signal-Based Surveillance (AVSS) 2024

  2. arXiv:2407.02906  [pdf, other

    cs.CV

    Single Image Rolling Shutter Removal with Diffusion Models

    Authors: Zhanglei Yang, Haipeng Li, Mingbo Hong, Bing Zeng, Shuaicheng Liu

    Abstract: We present RS-Diffusion, the first Diffusion Models-based method for single-frame Rolling Shutter (RS) correction. RS artifacts compromise visual quality of frames due to the row wise exposure of CMOS sensors. Most previous methods have focused on multi-frame approaches, using temporal information from consecutive frames for the motion rectification. However, few approaches address the more challe… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  3. arXiv:2407.00817  [pdf

    cs.AR

    Multi-Objective Optimization for Common-Centroid Placement of Analog Transistors

    Authors: Supriyo Maji, Hyungjoo Park, Gi moon Hong, Souradip Poddar, David Z. Pan

    Abstract: In analog circuits, process variation can cause unpredictability in circuit performance. Common-centroid (CC) type layouts have been shown to mitigate process-induced variations and are widely used to match circuit elements. Nevertheless, selecting the most suitable CC topology necessitates careful consideration of important layout constraints. Manual handling of these constraints becomes challeng… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  4. arXiv:2406.14017  [pdf, other

    cs.IR

    EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration

    Authors: Ye Wang, Jiahao Xun, Minjie Hong, Jieming Zhu, Tao Jin, Wang Lin, Haoyuan Li, Linjun Li, Yan Xia, Zhou Zhao, Zhenhua Dong

    Abstract: Generative retrieval has recently emerged as a promising approach to sequential recommendation, framing candidate item retrieval as an autoregressive sequence generation problem. However, existing generative methods typically focus solely on either behavioral or semantic aspects of item information, neglecting their complementary nature and thus resulting in limited effectiveness. To address this… ▽ More

    Submitted 3 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted by KDD 2024. Code available at https://reczoo.github.io/EAGER

  5. arXiv:2406.09841  [pdf, other

    cs.LG q-bio.BM

    Learning Multi-view Molecular Representations with Structured and Unstructured Knowledge

    Authors: Yizhen Luo, Kai Yang, Massimo Hong, Xing Yi Liu, Zikun Nie, Hao Zhou, Zaiqing Nie

    Abstract: Capturing molecular knowledge with representation learning approaches holds significant potential in vast scientific fields such as chemistry and life science. An effective and generalizable molecular representation is expected to capture the consensus and complementary molecular expertise from diverse views and perspectives. However, existing works fall short in learning multi-view molecular repr… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 12 pages, 4 figures

  6. arXiv:2406.06874  [pdf, other

    cs.AI cs.HC cs.RO

    Joint Demonstration and Preference Learning Improves Policy Alignment with Human Feedback

    Authors: Chenliang Li, Siliang Zeng, Zeyi Liao, Jiaxiang Li, Dongyeop Kang, Alfredo Garcia, Mingyi Hong

    Abstract: Aligning human preference and value is an important requirement for building contemporary foundation models and embodied AI. However, popular approaches such as reinforcement learning with human feedback (RLHF) break down the task into successive stages, such as supervised fine-tuning (SFT), reward modeling (RM), and reinforcement learning (RL), each performing one specific learning task. Such a s… ▽ More

    Submitted 19 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  7. arXiv:2406.02214  [pdf, other

    cs.LG

    SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining

    Authors: Andi Han, Jiaxiang Li, Wei Huang, Mingyi Hong, Akiko Takeda, Pratik Jawanpuria, Bamdev Mishra

    Abstract: Large language models (LLMs) have shown impressive capabilities across various tasks. However, training LLMs from scratch requires significant computational power and extensive memory capacity. Recent studies have explored low-rank structures on weights for efficient fine-tuning in terms of parameters and memory, either through low-rank adaptation or factorization. While effective for fine-tuning,… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  8. arXiv:2405.18881  [pdf, other

    cs.LG cs.AI

    Tuning-Free Alignment of Diffusion Models with Direct Noise Optimization

    Authors: Zhiwei Tang, Jiangweizhi Peng, Jiasheng Tang, Mingyi Hong, Fan Wang, Tsung-Hui Chang

    Abstract: In this work, we focus on the alignment problem of diffusion models with a continuous reward function, which represents specific objectives for downstream tasks, such as improving human preference. The central goal of the alignment problem is to adjust the distribution learned by diffusion models such that the generated samples maximize the target reward function. We propose a novel alignment appr… ▽ More

    Submitted 3 July, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  9. arXiv:2405.17888  [pdf, other

    cs.AI

    Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment

    Authors: Jiaxiang Li, Siliang Zeng, Hoi-To Wai, Chenliang Li, Alfredo Garcia, Mingyi Hong

    Abstract: Aligning human preference and value is an important requirement for contemporary foundation models. State-of-the-art techniques such as Reinforcement Learning from Human Feedback (RLHF) often consist of two stages: 1) supervised fine-tuning (SFT), where the model is fine-tuned by learning from human demonstration data; 2) Preference learning, where preference data is used to learn a reward model,… ▽ More

    Submitted 29 May, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  10. arXiv:2405.15234  [pdf, other

    cs.CV cs.CR

    Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models

    Authors: Yimeng Zhang, Xin Chen, Jinghan Jia, Yihua Zhang, Chongyu Fan, Jiancheng Liu, Mingyi Hong, Ke Ding, Sijia Liu

    Abstract: Diffusion models (DMs) have achieved remarkable success in text-to-image generation, but they also pose safety risks, such as the potential generation of harmful content and copyright violations. The techniques of machine unlearning, also known as concept erasing, have been developed to address these risks. However, these techniques remain vulnerable to adversarial prompt attacks, which can prompt… ▽ More

    Submitted 14 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: Codes are available at https://github.com/OPTML-Group/AdvUnlearn

  11. arXiv:2404.10575  [pdf, other

    cs.LG cs.AI cs.CV math.OC

    EMC$^2$: Efficient MCMC Negative Sampling for Contrastive Learning with Global Convergence

    Authors: Chung-Yiu Yau, Hoi-To Wai, Parameswaran Raman, Soumajyoti Sarkar, Mingyi Hong

    Abstract: A key challenge in contrastive learning is to generate negative samples from a large sample set to contrast with positive samples, for learning better encoding of the data. These negative samples often follow a softmax distribution which are dynamically updated during the training process. However, sampling from this distribution is non-trivial due to the high computational costs in computing the… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 20 pages

  12. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  13. arXiv:2403.18774  [pdf, other

    cs.CV cs.CR cs.LG

    RAW: A Robust and Agile Plug-and-Play Watermark Framework for AI-Generated Images with Provable Guarantees

    Authors: Xun Xian, Ganghua Wang, Xuan Bi, Jayanth Srinivasa, Ashish Kundu, Mingyi Hong, Jie Ding

    Abstract: Safeguarding intellectual property and preventing potential misuse of AI-generated images are of paramount importance. This paper introduces a robust and agile plug-and-play watermark detection framework, dubbed as RAW. As a departure from traditional encoder-decoder methods, which incorporate fixed binary codes as watermarks within latent representations, our approach introduces learnable waterma… ▽ More

    Submitted 23 January, 2024; originally announced March 2024.

  14. arXiv:2403.00282  [pdf, other

    cs.LG

    Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning

    Authors: Dohyeong Kim, Mineui Hong, Jeongho Park, Songhwai Oh

    Abstract: In many real-world applications, a reinforcement learning (RL) agent should consider multiple objectives and adhere to safety guidelines. To address these considerations, we propose a constrained multi-objective RL algorithm named Constrained Multi-Objective Gradient Aggregator (CoMOGA). In the field of multi-objective optimization, managing conflicts between the gradients of the multiple objectiv… ▽ More

    Submitted 31 May, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

    Comments: 25 pages

  15. arXiv:2402.18752  [pdf, other

    cs.LG cs.CR

    Pre-training Differentially Private Models with Limited Public Data

    Authors: Zhiqi Bu, Xinwei Zhang, Mingyi Hong, Sheng Zha, George Karypis

    Abstract: The superior performance of large foundation models relies on the use of massive amounts of high-quality data, which often contain sensitive, private and copyrighted material that requires formal protection. While differential privacy (DP) is a prominent method to gauge the degree of security provided to the models, its application is commonly limited to the model fine-tuning stage, due to the per… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  16. arXiv:2402.15997  [pdf, other

    cs.HC cs.GR cs.LG

    Cieran: Designing Sequential Colormaps via In-Situ Active Preference Learning

    Authors: Matt-Heun Hong, Zachary N. Sunberg, Danielle Albers Szafir

    Abstract: Quality colormaps can help communicate important data patterns. However, finding an aesthetically pleasing colormap that looks "just right" for a given scenario requires significant design and technical expertise. We introduce Cieran, a tool that allows any data analyst to rapidly find quality colormaps while designing charts within Jupyter Notebooks. Our system employs an active preference learni… ▽ More

    Submitted 29 February, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: CHI 2024. 12 pages/9 figures

  17. arXiv:2402.11592  [pdf, other

    cs.LG cs.CL

    Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark

    Authors: Yihua Zhang, Pingzhi Li, Junyuan Hong, Jiaxiang Li, Yimeng Zhang, Wenqing Zheng, Pin-Yu Chen, Jason D. Lee, Wotao Yin, Mingyi Hong, Zhangyang Wang, Sijia Liu, Tianlong Chen

    Abstract: In the evolving landscape of natural language processing (NLP), fine-tuning pre-trained Large Language Models (LLMs) with first-order (FO) optimizers like SGD and Adam has become standard. Yet, as LLMs grow {in size}, the substantial memory overhead from back-propagation (BP) for FO gradient computation presents a significant challenge. Addressing this issue is crucial, especially for applications… ▽ More

    Submitted 27 May, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  18. arXiv:2402.11424  [pdf, other

    cs.CV cs.AI

    Data Distribution Distilled Generative Model for Generalized Zero-Shot Recognition

    Authors: Yijie Wang, Mingjian Hong, Luwen Huangfu, Sheng Huang

    Abstract: In the realm of Zero-Shot Learning (ZSL), we address biases in Generalized Zero-Shot Learning (GZSL) models, which favor seen data. To counter this, we introduce an end-to-end generative GZSL framework called D$^3$GZSL. This framework respects seen and synthesized unseen data as in-distribution and out-of-distribution data, respectively, for a more balanced model. D$^3$GZSL comprises two core modu… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

    Comments: accepted as AAAI 2024 oral paper

  19. arXiv:2402.08821  [pdf, other

    math.OC cs.DC

    Problem-Parameter-Free Decentralized Nonconvex Stochastic Optimization

    Authors: Jiaxiang Li, Xuxing Chen, Shiqian Ma, Mingyi Hong

    Abstract: Existing decentralized algorithms usually require knowledge of problem parameters for updating local iterates. For example, the hyperparameters (such as learning rate) usually require the knowledge of Lipschitz constant of the global gradient or topological information of the communication networks, which are usually not accessible in practice. In this paper, we propose D-NASA, the first algorithm… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  20. arXiv:2401.12025  [pdf, other

    cs.IT eess.SP math.OC

    A Survey of Recent Advances in Optimization Methods for Wireless Communications

    Authors: Ya-Feng Liu, Tsung-Hui Chang, Mingyi Hong, Zheyu Wu, Anthony Man-Cho So, Eduard A. Jorswieck, Wei Yu

    Abstract: Mathematical optimization is now widely regarded as an indispensable modeling and solution tool for the design of wireless communications systems. While optimization has played a significant role in the revolutionary progress in wireless communication and networking technologies from 1G to 5G and onto the future 6G, the innovations in wireless technologies have also substantially transformed the n… ▽ More

    Submitted 7 June, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: 39 pages, 5 figures, accepted for publication in IEEE Journal on Selected Areas in Communications

  21. arXiv:2401.11380  [pdf, other

    cs.LG math.ST stat.ME stat.ML

    MoMA: Model-based Mirror Ascent for Offline Reinforcement Learning

    Authors: Mao Hong, Zhiyue Zhang, Yue Wu, Yanxun Xu

    Abstract: Model-based offline reinforcement learning methods (RL) have achieved state-of-the-art performance in many decision-making problems thanks to their sample efficiency and generalizability. Despite these advancements, existing model-based offline RL approaches either focus on theoretical studies without developing practical algorithms or rely on a restricted parametric policy space, thus not fully l… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

  22. arXiv:2401.08893  [pdf, other

    cs.LG math.OC

    MADA: Meta-Adaptive Optimizers through hyper-gradient Descent

    Authors: Kaan Ozkara, Can Karakus, Parameswaran Raman, Mingyi Hong, Shoham Sabach, Branislav Kveton, Volkan Cevher

    Abstract: Following the introduction of Adam, several novel adaptive optimizers for deep learning have been proposed. These optimizers typically excel in some tasks but may not outperform Adam uniformly across all tasks. In this work, we introduce Meta-Adaptive Optimizers (MADA), a unified optimizer framework that can generalize several known optimizers and dynamically learn the most suitable one during tra… ▽ More

    Submitted 17 June, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

  23. arXiv:2401.04133  [pdf, other

    cs.LG cs.AI cs.SI

    SynHING: Synthetic Heterogeneous Information Network Generation for Graph Learning and Explanation

    Authors: Ming-Yi Hong, Yi-Hsiang Huang, Shao-En Lin, You-Chen Teng, Chih-Yu Wang, Che Lin

    Abstract: Graph Neural Networks (GNNs) excel in delineating graph structures in diverse domains, including community analysis and recommendation systems. As the interpretation of GNNs becomes increasingly important, the demand for robust baselines and expansive graph datasets is accentuated, particularly in the context of Heterogeneous Information Networks (HIN). Addressing this, we introduce SynHING, a nov… ▽ More

    Submitted 29 May, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

    Comments: Update figures, tables, and content

  24. arXiv:2401.03058  [pdf, other

    math.OC cs.LG stat.ML

    Krylov Cubic Regularized Newton: A Subspace Second-Order Method with Dimension-Free Convergence Rate

    Authors: Ruichen Jiang, Parameswaran Raman, Shoham Sabach, Aryan Mokhtari, Mingyi Hong, Volkan Cevher

    Abstract: Second-order optimization methods, such as cubic regularized Newton methods, are known for their rapid convergence rates; nevertheless, they become impractical in high-dimensional problems due to their substantial memory requirements and computational costs. One promising approach is to execute second-order updates within a lower-dimensional subspace, giving rise to subspace second-order methods.… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 27 pages, 2 figures

  25. arXiv:2312.11388  [pdf, other

    cs.HC

    BioSpark: An End-to-End Generative System for Biological-Analogical Inspirations and Ideation

    Authors: Hyeonsu B. Kang, David Chuan-En Lin, Nikolas Martelaro, Aniket Kittur, Yan-Ying Chen, Matthew K. Hong

    Abstract: Nature is often used to inspire solutions for complex engineering problems, but achieving its full potential is challenging due to difficulties in discovering relevant analogies and synthesizing from them. Here, we present an end-to-end system, BioSpark, that generates biological-analogical mechanisms and provides an interactive interface to comprehend and synthesize from them. BioSpark pipeline s… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023 Workshop on Machine Learning for Creativity and Design

  26. arXiv:2312.06519  [pdf, other

    cs.LG cs.AI cs.SI

    A GAN Approach for Node Embedding in Heterogeneous Graphs Using Subgraph Sampling

    Authors: Hung Chun Hsu, Bo-Jun Wu, Ming-Yi Hong, Che Lin, Chih-Yu Wang

    Abstract: Our research addresses class imbalance issues in heterogeneous graphs using graph neural networks (GNNs). We propose a novel method combining the strengths of Generative Adversarial Networks (GANs) with GNNs, creating synthetic nodes and edges that effectively balance the dataset. This approach directly targets and rectifies imbalances at the data level. The proposed framework resolves issues such… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  27. arXiv:2312.03395  [pdf, other

    cs.RO cs.AI cs.LG

    Diffused Task-Agnostic Milestone Planner

    Authors: Mineui Hong, Minjae Kang, Songhwai Oh

    Abstract: Addressing decision-making problems using sequence modeling to predict future trajectories shows promising results in recent years. In this paper, we take a step further to leverage the sequence predictive method in wider areas such as long-term planning, vision-based control, and multi-task decision-making. To this end, we propose a method to utilize a diffusion-based generative sequence model to… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: 37th Conference on Neural Information Processing Systems

  28. arXiv:2311.14632  [pdf, other

    cs.LG cs.CR

    Differentially Private SGD Without Clipping Bias: An Error-Feedback Approach

    Authors: Xinwei Zhang, Zhiqi Bu, Zhiwei Steven Wu, Mingyi Hong

    Abstract: Differentially Private Stochastic Gradient Descent with Gradient Clipping (DPSGD-GC) is a powerful tool for training deep learning models using sensitive data, providing both a solid theoretical privacy guarantee and high efficiency. However, using DPSGD-GC to ensure Differential Privacy (DP) comes at the cost of model performance degradation due to DP noise injection and gradient clipping. Existi… ▽ More

    Submitted 17 April, 2024; v1 submitted 24 November, 2023; originally announced November 2023.

  29. arXiv:2311.05590  [pdf, other

    cs.HC cs.AI

    Conversational AI Threads for Visualizing Multidimensional Datasets

    Authors: Matt-Heun Hong, Anamaria Crisan

    Abstract: Generative Large Language Models (LLMs) show potential in data analysis, yet their full capabilities remain uncharted. Our work explores the capabilities of LLMs for creating and refining visualizations via conversational interfaces. We used an LLM to conduct a re-analysis of a prior Wizard-of-Oz study examining the use of chatbots for conducting visual analysis. We surfaced the strengths and weak… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  30. arXiv:2310.10780  [pdf, other

    cs.CR cs.AI cs.LG

    Demystifying Poisoning Backdoor Attacks from a Statistical Perspective

    Authors: Ganghua Wang, Xun Xian, Jayanth Srinivasa, Ashish Kundu, Xuan Bi, Mingyi Hong, Jie Ding

    Abstract: The growing dependence on machine learning in real-world applications emphasizes the importance of understanding and ensuring its safety. Backdoor attacks pose a significant security risk due to their stealthy nature and potentially serious consequences. Such attacks involve embedding triggers within a learning model with the intention of causing malicious behavior when an active trigger is presen… ▽ More

    Submitted 17 October, 2023; v1 submitted 16 October, 2023; originally announced October 2023.

  31. arXiv:2310.08782  [pdf, other

    cs.LG cs.AI

    Selectivity Drives Productivity: Efficient Dataset Pruning for Enhanced Transfer Learning

    Authors: Yihua Zhang, Yimeng Zhang, Aochuan Chen, Jinghan Jia, Jiancheng Liu, Gaowen Liu, Mingyi Hong, Shiyu Chang, Sijia Liu

    Abstract: Massive data is often considered essential for deep learning applications, but it also incurs significant computational and infrastructural costs. Therefore, dataset pruning (DP) has emerged as an effective way to improve data efficiency by identifying and removing redundant training samples without sacrificing performance. In this work, we aim to address the problem of DP for transfer learning, i… ▽ More

    Submitted 18 November, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)

  32. arXiv:2309.08571  [pdf, other

    cs.LG

    A Bayesian Approach to Robust Inverse Reinforcement Learning

    Authors: Ran Wei, Siliang Zeng, Chenliang Li, Alfredo Garcia, Anthony McDonald, Mingyi Hong

    Abstract: We consider a Bayesian approach to offline model-based inverse reinforcement learning (IRL). The proposed framework differs from existing offline model-based IRL approaches by performing simultaneous estimation of the expert's reward function and subjective model of environment dynamics. We make use of a class of prior distributions which parameterizes how accurate the expert's model of the enviro… ▽ More

    Submitted 6 April, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

  33. arXiv:2308.00788  [pdf, other

    cs.LG math.OC

    An Introduction to Bi-level Optimization: Foundations and Applications in Signal Processing and Machine Learning

    Authors: Yihua Zhang, Prashant Khanduri, Ioannis Tsaknakis, Yuguang Yao, Mingyi Hong, Sijia Liu

    Abstract: Recently, bi-level optimization (BLO) has taken center stage in some very exciting developments in the area of signal processing (SP) and machine learning (ML). Roughly speaking, BLO is a classical optimization problem that involves two levels of hierarchy (i.e., upper and lower levels), wherein obtaining the solution to the upper-level problem requires solving the lower-level one. BLO has become… ▽ More

    Submitted 20 December, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

  34. arXiv:2307.09484  [pdf, other

    q-bio.BM cs.CE cs.LG physics.chem-ph

    MolFM: A Multimodal Molecular Foundation Model

    Authors: Yizhen Luo, Kai Yang, Massimo Hong, Xing Yi Liu, Zaiqing Nie

    Abstract: Molecular knowledge resides within three different modalities of information sources: molecular structures, biomedical documents, and knowledge bases. Effective incorporation of molecular knowledge from these modalities holds paramount significance in facilitating biomedical research. However, existing multimodal molecular foundation models exhibit limitations in capturing intricate connections be… ▽ More

    Submitted 21 July, 2023; v1 submitted 6 June, 2023; originally announced July 2023.

    Comments: 31 pages, 15 figures, and 15 tables

  35. arXiv:2306.15774  [pdf

    cs.HC cs.CL cs.CV cs.LG

    Next Steps for Human-Centered Generative AI: A Technical Perspective

    Authors: Xiang 'Anthony' Chen, Jeff Burke, Ruofei Du, Matthew K. Hong, Jennifer Jacobs, Philippe Laban, Dingzeyu Li, Nanyun Peng, Karl D. D. Willis, Chien-Sheng Wu, Bolei Zhou

    Abstract: Through iterative, cross-disciplinary discussions, we define and propose next-steps for Human-centered Generative AI (HGAI). We contribute a comprehensive research agenda that lays out future directions of Generative AI spanning three levels: aligning with human values; assimilating human intents; and augmenting human abilities. By identifying these next-steps, we intend to draw interdisciplinary… ▽ More

    Submitted 22 December, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

  36. arXiv:2306.01217  [pdf, ps, other

    cs.HC

    Generative AI for Product Design: Getting the Right Design and the Design Right

    Authors: Matthew K. Hong, Shabnam Hakimi, Yan-Ying Chen, Heishiro Toyoda, Charlene Wu, Matt Klenk

    Abstract: Generative AI (GenAI) models excel in their ability to recognize patterns in existing data and generate new and unexpected content. Recent advances have motivated applications of GenAI tools (e.g., Stable Diffusion, ChatGPT) to professional practice across industries, including product design. While these generative capabilities may seem enticing on the surface, certain barriers limit their practi… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

  37. arXiv:2305.17083  [pdf, other

    stat.ML cs.LG econ.EM math.ST stat.ME

    A Policy Gradient Method for Confounded POMDPs

    Authors: Mao Hong, Zhengling Qi, Yanxun Xu

    Abstract: In this paper, we propose a policy gradient method for confounded partially observable Markov decision processes (POMDPs) with continuous state and observation spaces in the offline setting. We first establish a novel identification result to non-parametrically estimate any history-dependent policy gradient under POMDPs using the offline data. The identification enables us to solve a sequence of c… ▽ More

    Submitted 30 November, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: 95 pages, 3 figures

  38. arXiv:2305.12817  [pdf, other

    cs.LG

    Conservative Physics-Informed Neural Networks for Non-Conservative Hyperbolic Conservation Laws Near Critical States

    Authors: Reyna Quita, Yu-Shuo Chen, Hsin-Yi Lee Alex C. Hu, John M. Hong

    Abstract: In this paper, a modified version of conservative Physics-informed Neural Networks (cPINN for short) is provided to construct the weak solutions of Riemann problem for the hyperbolic scalar conservation laws in non-conservative form. To demonstrate the results, we use the model of generalized Buckley-Leverett equation (GBL equation for short) with discontinuous porosity in porous media. By inventi… ▽ More

    Submitted 22 May, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: 23 pages, 26 figures

    MSC Class: 35L03; 35L45; 65M99

  39. arXiv:2305.04241  [pdf, other

    cs.CL cs.LG

    Vcc: Scaling Transformers to 128K Tokens or More by Prioritizing Important Tokens

    Authors: Zhanpeng Zeng, Cole Hawkins, Mingyi Hong, Aston Zhang, Nikolaos Pappas, Vikas Singh, Shuai Zheng

    Abstract: Transformers are central in modern natural language processing and computer vision applications. Despite recent works devoted to reducing the quadratic cost of such models (as a function of the sequence length), dealing with ultra long sequences (e.g., with more than 16K tokens) remains challenging. Applications such as answering questions based on a book or summarizing a scientific article are in… ▽ More

    Submitted 27 May, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

    Comments: 10 pages main text, 12 pages appendix, preprint

  40. arXiv:2305.01523  [pdf, other

    cs.LG cs.AI cs.CE

    Towards Unified AI Drug Discovery with Multiple Knowledge Modalities

    Authors: Yizhen Luo, Xing Yi Liu, Kai Yang, Kui Huang, Massimo Hong, Jiahuan Zhang, Yushuai Wu, Zaiqing Nie

    Abstract: In recent years, AI models that mine intrinsic patterns from molecular structures and protein sequences have shown promise in accelerating drug discovery. However, these methods partly lag behind real-world pharmaceutical approaches of human experts that additionally grasp structured knowledge from knowledge bases and unstructured knowledge from biomedical literature. To bridge this gap, we propos… ▽ More

    Submitted 14 October, 2023; v1 submitted 17 April, 2023; originally announced May 2023.

    Comments: 10 pages, 6 figures

  41. arXiv:2303.16513  [pdf, other

    cs.CV cs.AI

    Cascaded Local Implicit Transformer for Arbitrary-Scale Super-Resolution

    Authors: Hao-Wei Chen, Yu-Syuan Xu, Min-Fong Hong, Yi-Min Tsai, Hsien-Kai Kuo, Chun-Yi Lee

    Abstract: Implicit neural representation has recently shown a promising ability in representing images with arbitrary resolutions. In this paper, we present a Local Implicit Transformer (LIT), which integrates the attention mechanism and frequency encoding technique into a local implicit image function. We design a cross-scale local attention block to effectively aggregate local features. To further improve… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

  42. arXiv:2303.09531  [pdf, other

    cs.LG stat.ML

    GLASU: A Communication-Efficient Algorithm for Federated Learning with Vertically Distributed Graph Data

    Authors: Xinwei Zhang, Mingyi Hong, Jie Chen

    Abstract: Vertical federated learning (VFL) is a distributed learning paradigm, where computing clients collectively train a model based on the partial features of the same set of samples they possess. Current research on VFL focuses on the case when samples are independent, but it rarely addresses an emerging scenario when samples are interrelated through a graph. For graph-structured data, graph neural ne… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

  43. arXiv:2303.02343  [pdf, other

    cs.LG cs.AI

    What Is Missing in IRM Training and Evaluation? Challenges and Solutions

    Authors: Yihua Zhang, Pranay Sharma, Parikshit Ram, Mingyi Hong, Kush Varshney, Sijia Liu

    Abstract: Invariant risk minimization (IRM) has received increasing attention as a way to acquire environment-agnostic data representations and predictions, and as a principled solution for preventing spurious correlations from being learned and for improving models' out-of-distribution generalization. Yet, recent works have found that the optimality of the originally-proposed IRM optimization (IRM) may be… ▽ More

    Submitted 4 March, 2023; originally announced March 2023.

    Comments: Accepted to Eleventh International Conference on Learning Representations (ICLR 2023)

  44. arXiv:2302.07457  [pdf, other

    cs.LG cs.AI

    When Demonstrations Meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning

    Authors: Siliang Zeng, Chenliang Li, Alfredo Garcia, Mingyi Hong

    Abstract: Offline inverse reinforcement learning (Offline IRL) aims to recover the structure of rewards and environment dynamics that underlie observed actions in a fixed, finite set of demonstrations from an expert agent. Accurate models of expertise in executing a task has applications in safety-sensitive applications such as clinical decision making and autonomous driving. However, the structure of an ex… ▽ More

    Submitted 28 February, 2024; v1 submitted 14 February, 2023; originally announced February 2023.

  45. arXiv:2212.04365  [pdf, other

    cs.LG cs.AI cs.NI

    Alleviating neighbor bias: augmenting graph self-supervise learning with structural equivalent positive samples

    Authors: Jiawei Zhu, Mei Hong, Ronghua Du, Haifeng Li

    Abstract: In recent years, using a self-supervised learning framework to learn the general characteristics of graphs has been considered a promising paradigm for graph representation learning. The core of self-supervised learning strategies for graph neural networks lies in constructing suitable positive sample selection strategies. However, existing GNNs typically aggregate information from neighboring nod… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: 8 pages, 5 figures, 8 tables

  46. arXiv:2210.12001  [pdf, other

    cs.LG

    When Expressivity Meets Trainability: Fewer than $n$ Neurons Can Work

    Authors: Jiawei Zhang, Yushun Zhang, Mingyi Hong, Ruoyu Sun, Zhi-Quan Luo

    Abstract: Modern neural networks are often quite wide, causing large memory and computation costs. It is thus of great interest to train a narrower network. However, training narrow neural nets remains a challenging task. We ask two theoretical questions: Can narrow networks have as strong expressivity as wide ones? If so, does the loss function exhibit a benign optimization landscape? In this work, we prov… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 39 Pages

  47. arXiv:2210.07749   

    eess.AS cs.SD

    LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge

    Authors: Yan Jia, Mi Hong, Jingyu Hou, Kailong Ren, Sifan Ma, Jin Wang, Fangzhen Peng, Yinglin Ji, Lin Yang, Junjie Wang

    Abstract: This paper describes LeVoice automatic speech recognition systems to track2 of intelligent cockpit speech recognition challenge 2022. Track2 is a speech recognition task without limits on the scope of model size. Our main points include deep learning based speech enhancement, text-to-speech based speech generation, training data augmentation via various techniques and speech recognition model fusi… ▽ More

    Submitted 16 October, 2022; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: There are experimental errors

  48. arXiv:2210.04092  [pdf, other

    cs.LG

    Advancing Model Pruning via Bi-level Optimization

    Authors: Yihua Zhang, Yuguang Yao, Parikshit Ram, Pu Zhao, Tianlong Chen, Mingyi Hong, Yanzhi Wang, Sijia Liu

    Abstract: The deployment constraints in practical applications necessitate the pruning of large-scale deep learning models, i.e., promoting their weight sparsity. As illustrated by the Lottery Ticket Hypothesis (LTH), pruning also has the potential of improving their generalization ability. At the core of LTH, iterative magnitude pruning (IMP) is the predominant pruning method to successfully find 'winning… ▽ More

    Submitted 21 April, 2023; v1 submitted 8 October, 2022; originally announced October 2022.

    Comments: Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022)

  49. arXiv:2210.01808   

    cs.LG stat.ML

    Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees

    Authors: Siliang Zeng, Chenliang Li, Alfredo Garcia, Mingyi Hong

    Abstract: Inverse reinforcement learning (IRL) aims to recover the reward function and the associated optimal policy that best fits observed sequences of states and actions implemented by an expert. Many algorithms for IRL have an inherently nested structure: the inner loop finds the optimal policy given parametrized rewards while the outer loop updates the estimates towards optimizing a measure of fit. For… ▽ More

    Submitted 31 October, 2022; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: Two different versions (arXiv:2210.01808 and arXiv:2210.01282) of the same paper have been submitted to arxiv. To avoid to the overlap between two versions, we withdraw this version. For this paper, readers could refer to arXiv:2210.01282

  50. arXiv:2210.01282  [pdf, other

    cs.LG cs.AI econ.EM stat.ML

    Structural Estimation of Markov Decision Processes in High-Dimensional State Space with Finite-Time Guarantees

    Authors: Siliang Zeng, Mingyi Hong, Alfredo Garcia

    Abstract: We consider the task of estimating a structural model of dynamic decisions by a human agent based upon the observable history of implemented actions and visited states. This problem has an inherent nested structure: in the inner problem, an optimal policy for a given reward function is identified while in the outer problem, a measure of fit is maximized. Several approaches have been proposed to al… ▽ More

    Submitted 1 March, 2024; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: This conference version of this paper refers to "Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees" in NeurIPS 2022