Showing 1–50 of 221 results for author: Leskovec, J

  1. arXiv:2407.20060  [pdf, other]

    cs.LG cs.AI cs.DB

    RelBench: A Benchmark for Deep Learning on Relational Databases

    Authors: Joshua Robinson, Rishabh Ranjan, Weihua Hu, Kexin Huang, Jiaqi Han, Alejandro Dobles, Matthias Fey, Jan E. Lenssen, Yiwen Yuan, Zecheng Zhang, Xinwei He, Jure Leskovec

    Abstract: We present RelBench, a public benchmark for solving predictive tasks over relational databases with graph neural networks. RelBench provides databases and tasks spanning diverse domains and scales, and is intended to be a foundational infrastructure for future research. We use RelBench to conduct the first comprehensive study of Relational Deep Learning (RDL) (Fey et al., 2024), which combines gra…

    Submitted 29 July, 2024; originally announced July 2024.

  2. arXiv:2407.18772  [pdf, other]

    cs.LG cs.CY cs.SI

    Learning production functions for supply chains with graph neural networks

    Authors: Serina Chang, Zhiyin Lin, Benjamin Yan, Swapnil Bembde, Qi Xiu, Chi Heem Wong, Yu Qin, Frank Kloster, Alex Luo, Raj Palleti, Jure Leskovec

    Abstract: The global economy relies on the flow of goods over supply chain networks, with nodes as firms and edges as transactions between firms. While we may observe these external transactions, they are governed by unseen production functions, which determine how firms internally transform the input products they receive into output products that they sell. In this setting, it can be extremely valuable to…

    Submitted 26 July, 2024; originally announced July 2024.

  3. arXiv:2407.01648  [pdf, other]

    q-bio.BM cs.LG q-bio.QM

    Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization

    Authors: Siyi Gu, Minkai Xu, Alexander Powers, Weili Nie, Tomas Geffner, Karsten Kreis, Jure Leskovec, Arash Vahdat, Stefano Ermon

    Abstract: Generating ligand molecules for specific protein targets, known as structure-based drug design, is a fundamental problem in therapeutics development and biological discovery. Recently, target-aware generative models, especially diffusion models, have shown great promise in modeling protein-ligand interactions and generating candidate drugs. However, existing models primarily focus on learning the…

    Submitted 1 July, 2024; originally announced July 2024.

  4. arXiv:2406.11200  [pdf, other]

    cs.LG cs.CL

    AvaTaR: Optimizing LLM Agents for Tool-Assisted Knowledge Retrieval

    Authors: Shirley Wu, Shiyu Zhao, Qian Huang, Kexin Huang, Michihiro Yasunaga, Kaidi Cao, Vassilis N. Ioannidis, Karthik Subbian, Jure Leskovec, James Zou

    Abstract: Large language model (LLM) agents have demonstrated impressive capability in utilizing external tools and knowledge to boost accuracy and reduce hallucinations. However, developing the prompting techniques that make LLM agents able to effectively use external tools and knowledge is a heuristic and laborious task. Here, we introduce AvaTaR, a novel and automatic framework that optimizes an LLM agen…

    Submitted 17 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 19 pages, 8 figures, 6 tables

  5. arXiv:2406.04557  [pdf, other]

    cs.CY

    Countrywide natural experiment reveals impact of built environment on physical activity

    Authors: Tim Althoff, Boris Ivanovic, Jennifer L. Hicks, Scott L. Delp, Abby C. King, Jure Leskovec

    Abstract: While physical activity is critical to human health, most people do not meet recommended guidelines. More walkable built environments have the potential to increase activity across the population. However, previous studies on the built environment and physical activity have led to mixed findings, possibly due to methodological limitations such as small cohorts, few or single locations, over-relian…

    Submitted 6 June, 2024; originally announced June 2024.

  6. arXiv:2405.18740  [pdf, other]

    cs.CL

    Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs

    Authors: Jialiang Xu, Michael Moor, Jure Leskovec

    Abstract: Despite impressive advances in recent multimodal large language models (MLLMs), state-of-the-art models such as from the GPT-4 suite still struggle with knowledge-intensive tasks. To address this, we consider Reverse Image Retrieval (RIR) augmented generation, a simple yet effective strategy to augment MLLMs with web-scale reverse image search results. RIR robustly improves knowledge-intensive vis…

    Submitted 29 May, 2024; originally announced May 2024.

  7. arXiv:2405.17631  [pdf, other]

    cs.AI cs.CE cs.MA

    BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation Experiments

    Authors: Yusuf Roohani, Jian Vora, Qian Huang, Zachary Steinhart, Alexander Marson, Percy Liang, Jure Leskovec

    Abstract: Agents based on large language models have shown great potential in accelerating scientific discovery by leveraging their rich background knowledge and reasoning capabilities. Here, we develop BioDiscoveryAgent, an agent that designs new experiments, reasons about their outcomes, and efficiently navigates the hypothesis space to reach desired solutions. We demonstrate our agent on the problem of d…

    Submitted 27 May, 2024; originally announced May 2024.

  8. arXiv:2404.13207  [pdf, other]

    cs.IR cs.LG

    STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases

    Authors: Shirley Wu, Shiyu Zhao, Michihiro Yasunaga, Kexin Huang, Kaidi Cao, Qian Huang, Vassilis N. Ioannidis, Karthik Subbian, James Zou, Jure Leskovec

    Abstract: Answering real-world complex queries, such as complex product search, often requires accurate retrieval from semi-structured knowledge bases that involve blend of unstructured (e.g., textual descriptions of products) and structured (e.g., entity relations of products) information. However, previous works have mostly studied textual and relational retrieval tasks as separate topics. To address the…

    Submitted 20 May, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

    Comments: 26 pages, 6 figures

  9. arXiv:2404.01340  [pdf, other]

    cs.LG cs.AI

    From Similarity to Superiority: Channel Clustering for Time Series Forecasting

    Authors: Jialin Chen, Jan Eric Lenssen, Aosong Feng, Weihua Hu, Matthias Fey, Leandros Tassiulas, Jure Leskovec, Rex Ying

    Abstract: Time series forecasting has attracted significant attention in recent decades. Previous studies have demonstrated that the Channel-Independent (CI) strategy improves forecasting performance by treating different channels individually, while it leads to poor generalization on unseen instances and ignores potentially necessary interactions between channels. Conversely, the Channel-Dependent (CD) str…

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 20 pages, 6 figures

  10. arXiv:2404.00776  [pdf, other]

    cs.LG cs.DB stat.ML

    PyTorch Frame: A Modular Framework for Multi-Modal Tabular Learning

    Authors: Weihua Hu, Yiwen Yuan, Zecheng Zhang, Akihiro Nitta, Kaidi Cao, Vid Kocijan, Jure Leskovec, Matthias Fey

    Abstract: We present PyTorch Frame, a PyTorch-based framework for deep learning over multi-modal tabular data. PyTorch Frame makes tabular deep learning easy by providing a PyTorch-based data structure to handle complex tabular data, introducing a model abstraction to enable modular implementation of tabular models, and allowing external foundation models to be incorporated to handle complex columns (e.g.,…

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: https://github.com/pyg-team/pytorch-frame

  11. arXiv:2402.18697  [pdf, other]

    stat.ML cs.LG cs.SI math.OC math.ST

    Inferring Dynamic Networks from Marginals with Iterative Proportional Fitting

    Authors: Serina Chang, Frederic Koehler, Zhaonan Qu, Jure Leskovec, Johan Ugander

    Abstract: A common network inference problem, arising from real-world data constraints, is how to infer a dynamic network from its time-aggregated adjacency matrix and time-varying marginals (i.e., row and column sums). Prior approaches to this problem have repurposed the classic iterative proportional fitting (IPF) procedure, also known as Sinkhorn's algorithm, with promising empirical results. However, th…

    Submitted 19 August, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: Conference version available from https://proceedings.mlr.press/v235/chang24b.html

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, PMLR 235:6202-6252, 2024

  12. arXiv:2402.14367  [pdf, other]

    cs.LG cs.SI

    Representation Learning for Frequent Subgraph Mining

    Authors: Rex Ying, Tianyu Fu, Andrew Wang, Jiaxuan You, Yu Wang, Jure Leskovec

    Abstract: Identifying frequent subgraphs, also called network motifs, is crucial in analyzing and predicting properties of real-world networks. However, finding large commonly-occurring motifs remains a challenging problem not only due to its NP-hard subroutine of subgraph counting, but also the exponential growth of the number of possible subgraphs patterns. Here we present Subgraph Pattern Miner (SPMiner)…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: Oral Presentation in The Graph Representation Learning and Beyond (GRL+) Workshop from The 37th International Conference on Machine Learning, 2020

  13. arXiv:2402.08383  [pdf, other]

    cs.LG cs.AI

    Uncertainty Quantification for Forward and Inverse Problems of PDEs via Latent Global Evolution

    Authors: Tailin Wu, Willie Neiswanger, Hongtao Zheng, Stefano Ermon, Jure Leskovec

    Abstract: Deep learning-based surrogate models have demonstrated remarkable advantages over classical solvers in terms of speed, often achieving speedups of 10 to 1000 times over traditional partial differential equation (PDE) solvers. However, a significant challenge hindering their widespread adoption in both scientific and industrial domains is the lack of understanding about their prediction uncertainti…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Accepted by AAAI 2024 (Oral)

  14. arXiv:2401.13171  [pdf, other]

    cs.LG cs.AI cs.CE

    Compositional Generative Inverse Design

    Authors: Tailin Wu, Takashi Maruyama, Long Wei, Tao Zhang, Yilun Du, Gianluca Iaccarino, Jure Leskovec

    Abstract: Inverse design, where we seek to design input variables in order to optimize an underlying objective function, is an important problem that arises across fields such as mechanical engineering to aerospace engineering. Inverse design is typically formulated as an optimization problem, with recent works leveraging optimization across learned dynamics models. However, as models are optimized they ten…

    Submitted 11 March, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: ICLR 2024 spotlight. 30 pages, 17 figures

  15. arXiv:2401.11037  [pdf, other]

    cs.LG math.NA q-bio.QM

    Equivariant Graph Neural Operator for Modeling 3D Dynamics

    Authors: Minkai Xu, Jiaqi Han, Aaron Lou, Jean Kossaifi, Arvind Ramanathan, Kamyar Azizzadenesheli, Jure Leskovec, Stefano Ermon, Anima Anandkumar

    Abstract: Modeling the complex three-dimensional (3D) dynamics of relational systems is an important problem in the natural sciences, with applications ranging from molecular simulations to particle mechanics. Machine learning methods have achieved good success by learning graph neural networks to model spatial interactions. However, these approaches do not faithfully capture temporal correlations since the…

    Submitted 2 June, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024. Copyright 2024 by the author(s)

  16. arXiv:2401.03134  [pdf, other]

    cs.LG cs.AI

    TimeGraphs: Graph-based Temporal Reasoning

    Authors: Paridhi Maheshwari, Hongyu Ren, Yanan Wang, Rok Sosic, Jure Leskovec

    Abstract: Many real-world systems exhibit temporal, dynamic behaviors, which are captured as time series of complex agent interactions. To perform temporal reasoning, current methods primarily encode temporal dynamics through simple sequence-based models. However, in general these models fail to efficiently capture the full spectrum of rich dynamics in the input, since the dynamics is not uniformly distribu…

    Submitted 6 January, 2024; originally announced January 2024.

  17. arXiv:2312.04693  [pdf, other]

    cs.LG

    GraphMETRO: Mitigating Complex Graph Distribution Shifts via Mixture of Aligned Experts

    Authors: Shirley Wu, Kaidi Cao, Bruno Ribeiro, James Zou, Jure Leskovec

    Abstract: Graph data are inherently complex and heterogeneous, leading to a high natural diversity of distributional shifts. However, it remains unclear how to build machine learning architectures that generalize to complex non-synthetic distributional shifts naturally occurring in the real world. Here we develop GraphMETRO, a Graph Neural Network architecture, that reliably models natural diversity and cap…

    Submitted 5 February, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: Graph Neural Networks, Mixture-of-experts, Distribution Shifts, Generalization

  18. arXiv:2312.04615  [pdf, other]

    cs.LG cs.DB

    Relational Deep Learning: Graph Representation Learning on Relational Databases

    Authors: Matthias Fey, Weihua Hu, Kexin Huang, Jan Eric Lenssen, Rishabh Ranjan, Joshua Robinson, Rex Ying, Jiaxuan You, Jure Leskovec

    Abstract: Much of the world's most valued data is stored in relational databases and data warehouses, where the data is organized into many tables connected by primary-foreign key relations. However, building machine learning models using this data is both challenging and time consuming. The core problem is that no machine learning method is capable of learning on multiple tables interconnected by primary-f…

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: https://relbench.stanford.edu

  19. arXiv:2311.04287  [pdf, other]

    cs.CV cs.LG

    Holistic Evaluation of Text-To-Image Models

    Authors: Tony Lee, Michihiro Yasunaga, Chenlin Meng, Yifan Mai, Joon Sung Park, Agrim Gupta, Yunzhi Zhang, Deepak Narayanan, Hannah Benita Teufel, Marco Bellagente, Minguk Kang, Taesung Park, Jure Leskovec, Jun-Yan Zhu, Li Fei-Fei, Jiajun Wu, Stefano Ermon, Percy Liang

    Abstract: The stunning qualitative improvement of recent text-to-image models has led to their widespread attention and adoption. However, we lack a comprehensive quantitative understanding of their capabilities and risks. To fill this gap, we introduce a new benchmark, Holistic Evaluation of Text-to-Image Models (HEIM). Whereas previous evaluations focus mostly on text-image alignment and image quality, we…

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: NeurIPS 2023. First three authors contributed equally

  20. arXiv:2310.10971  [pdf, other]

    cs.LG cs.CV

    Context-Aware Meta-Learning

    Authors: Christopher Fifty, Dennis Duan, Ronald G. Junkins, Ehsan Amid, Jure Leskovec, Christopher Re, Sebastian Thrun

    Abstract: Large Language Models like ChatGPT demonstrate a remarkable capacity to learn new concepts during inference without any fine-tuning. However, visual models trained to detect new objects during inference have been unable to replicate this ability, and instead either perform poorly or require meta-training and/or fine-tuning on similar objects. In this work, we propose a meta-learning algorithm that…

    Submitted 25 March, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  21. arXiv:2310.08863  [pdf, other]

    cs.LG

    In-Context Learning for Few-Shot Molecular Property Prediction

    Authors: Christopher Fifty, Jure Leskovec, Sebastian Thrun

    Abstract: In-context learning has become an important approach for few-shot learning in Large Language Models because of its ability to rapidly adapt to new tasks without fine-tuning model parameters. However, it is restricted to applications in natural language and inapplicable to other domains. In this paper, we adapt the concepts underpinning in-context learning to develop a new algorithm for few-shot mo…

    Submitted 13 October, 2023; originally announced October 2023.

  22. arXiv:2310.03302  [pdf, other]

    cs.LG cs.AI

    MLAgentBench: Evaluating Language Agents on Machine Learning Experimentation

    Authors: Qian Huang, Jian Vora, Percy Liang, Jure Leskovec

    Abstract: A central aspect of machine learning research is experimentation, the process of designing and running experiments, analyzing the results, and iterating towards some positive outcome (e.g., improving accuracy). Could agents driven by powerful language models perform machine learning experimentation effectively? To answer this question, we introduce MLAgentBench, a suite of 13 tasks ranging from im…

    Submitted 14 April, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

  23. arXiv:2310.01714  [pdf, other]

    cs.LG

    Large Language Models as Analogical Reasoners

    Authors: Michihiro Yasunaga, Xinyun Chen, Yujia Li, Panupong Pasupat, Jure Leskovec, Percy Liang, Ed H. Chi, Denny Zhou

    Abstract: Chain-of-thought (CoT) prompting for language models demonstrates impressive performance across reasoning tasks, but typically needs labeled exemplars of the reasoning process. In this work, we introduce a new prompting approach, analogical prompting, designed to automatically guide the reasoning process of large language models. Inspired by analogical reasoning, a cognitive process in which human…

    Submitted 9 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Published at ICLR 2024

  24. arXiv:2308.03209  [pdf, other]

    cs.LG

    Communication-Free Distributed GNN Training with Vertex Cut

    Authors: Kaidi Cao, Rui Deng, Shirley Wu, Edward W Huang, Karthik Subbian, Jure Leskovec

    Abstract: Training Graph Neural Networks (GNNs) on real-world graphs consisting of billions of nodes and edges is quite challenging, primarily due to the substantial memory needed to store the graph and its intermediate node and edge features, and there is a pressing need to speed up the training process. A common approach to achieve speed up is to divide the graph into many smaller subgraphs, which are the…

    Submitted 6 August, 2023; originally announced August 2023.

  25. arXiv:2308.02117  [pdf, other]

    cs.LG cs.AI cs.CV

    VQGraph: Rethinking Graph Representation Space for Bridging GNNs and MLPs

    Authors: Ling Yang, Ye Tian, Minkai Xu, Zhongyi Liu, Shenda Hong, Wei Qu, Wentao Zhang, Bin Cui, Muhan Zhang, Jure Leskovec

    Abstract: GNN-to-MLP distillation aims to utilize knowledge distillation (KD) to learn computationally-efficient multi-layer perceptron (student MLP) on graph data by mimicking the output representations of teacher GNN. Existing methods mainly make the MLP to mimic the GNN predictions over a few class labels. However, the class space may not be expressive enough for covering numerous diverse local graph str…

    Submitted 6 March, 2024; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: ICLR 2024. Code: https://github.com/YangLing0818/VQGraph

  26. arXiv:2307.15189  [pdf, other]

    cs.CV cs.AI

    Med-Flamingo: a Multimodal Medical Few-shot Learner

    Authors: Michael Moor, Qian Huang, Shirley Wu, Michihiro Yasunaga, Cyril Zakka, Yash Dalmia, Eduardo Pontes Reis, Pranav Rajpurkar, Jure Leskovec

    Abstract: Medicine, by its nature, is a multifaceted domain that requires the synthesis of information across various modalities. Medical generative vision-language models (VLMs) make a first step in this direction and promise many exciting clinical applications. However, existing models typically have to be fine-tuned on sizeable down-stream datasets, which poses a significant limitation as in many medical…

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: Preprint

  27. arXiv:2307.08423  [pdf, other]

    cs.LG physics.comp-ph

    Artificial Intelligence for Science in Quantum, Atomistic, and Continuum Systems

    Authors: Xuan Zhang, Limei Wang, Jacob Helwig, Youzhi Luo, Cong Fu, Yaochen Xie, Meng Liu, Yuchao Lin, Zhao Xu, Keqiang Yan, Keir Adams, Maurice Weiler, Xiner Li, Tianfan Fu, Yucheng Wang, Haiyang Yu, YuQing Xie, Xiang Fu, Alex Strasser, Shenglong Xu, Yi Liu, Yuanqi Du, Alexandra Saxton, Hongyi Ling, Hannah Lawrence, et al. (38 additional authors not shown)

    Abstract: Advances in artificial intelligence (AI) are fueling a new paradigm of discoveries in natural sciences. Today, AI has started to advance natural sciences by improving, accelerating, and enabling our understanding of natural phenomena at a wide range of spatial and temporal scales, giving rise to a new area of research known as AI for science (AI4Science). Being an emerging research paradigm, AI4Sc…

    Submitted 15 November, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

  28. arXiv:2307.01026  [pdf, other]

    cs.LG cs.AI

    Temporal Graph Benchmark for Machine Learning on Temporal Graphs

    Authors: Shenyang Huang, Farimah Poursafaei, Jacob Danovitch, Matthias Fey, Weihua Hu, Emanuele Rossi, Jure Leskovec, Michael Bronstein, Guillaume Rabusseau, Reihaneh Rabbany

    Abstract: We present the Temporal Graph Benchmark (TGB), a collection of challenging and diverse benchmark datasets for realistic, reproducible, and robust evaluation of machine learning models on temporal graphs. TGB datasets are of large scale, spanning years in duration, incorporate both node and edge-level prediction tasks and cover a diverse set of domains including social, trade, transaction, and tran…

    Submitted 27 September, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: 20 pages, 7 figures, 7 tables, accepted at NeurIPS 2023 Datasets and Benchmarks Track

  29. arXiv:2306.04766  [pdf, other]

    cs.LG cs.AI

    Enabling tabular deep learning when $d \gg n$ with an auxiliary knowledge graph

    Authors: Camilo Ruiz, Hongyu Ren, Kexin Huang, Jure Leskovec

    Abstract: Machine learning models exhibit strong performance on datasets with abundant labeled samples. However, for tabular datasets with extremely high $d$-dimensional features but limited $n$ samples (i.e. $d \gg n$), machine learning models struggle to achieve strong performance due to the risk of overfitting. Here, our key insight is that there is often abundant, auxiliary domain information describing…

    Submitted 7 June, 2023; originally announced June 2023.

  30. arXiv:2305.14535  [pdf, other]

    cs.LG stat.ML

    Uncertainty Quantification over Graph with Conformalized Graph Neural Networks

    Authors: Kexin Huang, Ying Jin, Emmanuel Candès, Jure Leskovec

    Abstract: Graph Neural Networks (GNNs) are powerful machine learning prediction models on graph-structured data. However, GNNs lack rigorous uncertainty estimates, limiting their reliable deployment in settings where the cost of errors is significant. We propose conformalized GNN (CF-GNN), extending conformal prediction (CP) to graph-based models for guaranteed uncertainty estimates. Given an entity in the…

    Submitted 30 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Published at NeurIPS 2023

  31. arXiv:2305.12600  [pdf, other]

    cs.LG cs.AI

    PRODIGY: Enabling In-context Learning Over Graphs

    Authors: Qian Huang, Hongyu Ren, Peng Chen, Gregor Kržmanc, Daniel Zeng, Percy Liang, Jure Leskovec

    Abstract: In-context learning is the ability of a pretrained model to adapt to novel and diverse downstream tasks by conditioning on prompt examples, without optimizing any parameters. While large language models have demonstrated this ability, how in-context learning could be performed over graphs is unexplored. In this paper, we develop \textbf{Pr}etraining \textbf{O}ver \textbf{D}iverse \textbf{I}n-Conte…

    Submitted 21 May, 2023; originally announced May 2023.

  32. arXiv:2305.12322  [pdf, other]

    cs.LG cs.SI

    Learning Large Graph Property Prediction via Graph Segment Training

    Authors: Kaidi Cao, Phitchaya Mangpo Phothilimthana, Sami Abu-El-Haija, Dustin Zelle, Yanqi Zhou, Charith Mendis, Jure Leskovec, Bryan Perozzi

    Abstract: Learning to predict properties of large graphs is challenging because each prediction requires the knowledge of an entire graph, while the amount of memory available during training is bounded. Here we propose Graph Segment Training (GST), a general framework that utilizes a divide-and-conquer approach to allow learning large graph property prediction with a constant memory footprint. GST first di…

    Submitted 5 November, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

  33. arXiv:2305.01140  [pdf, other]

    cs.LG q-bio.QM

    Geometric Latent Diffusion Models for 3D Molecule Generation

    Authors: Minkai Xu, Alexander Powers, Ron Dror, Stefano Ermon, Jure Leskovec

    Abstract: Generative models, especially diffusion models (DMs), have achieved promising results for generating feature-rich geometries and advancing foundational science problems such as molecule design. Inspired by the recent huge success of Stable (latent) Diffusion models, we propose a novel and principled method for 3D molecule generation named Geometric Latent Diffusion Models (GeoLDM). GeoLDM is the f…

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: Published at ICML 2023

  34. arXiv:2305.01122  [pdf, other]

    cs.LG cs.CE

    Learning Controllable Adaptive Simulation for Multi-resolution Physics

    Authors: Tailin Wu, Takashi Maruyama, Qingqing Zhao, Gordon Wetzstein, Jure Leskovec

    Abstract: Simulating the time evolution of physical systems is pivotal in many scientific and engineering problems. An open challenge in simulating such systems is their multi-resolution dynamics: a small fraction of the system is extremely dynamic, and requires very fine-grained resolution, while a majority of the system is changing slowly and can be modeled by coarser spatial scales. Typical learning-base…

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: ICLR 2023, notable top-25% (spotlight), 19 pages, 9 figures

  35. arXiv:2304.14274  [pdf, other]

    cs.SI cs.LG

    When Do Graph Neural Networks Help with Node Classification? Investigating the Impact of Homophily Principle on Node Distinguishability

    Authors: Sitao Luan, Chenqing Hua, Minkai Xu, Qincheng Lu, Jiaqi Zhu, Xiao-Wen Chang, Jie Fu, Jure Leskovec, Doina Precup

    Abstract: Homophily principle, i.e., nodes with the same labels are more likely to be connected, has been believed to be the main reason for the performance superiority of Graph Neural Networks (GNNs) over Neural Networks on node classification tasks. Recent research suggests that, even in the absence of homophily, the advantage of GNNs still exists as long as nodes from the same class share similar neighbo…

    Submitted 1 January, 2024; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: Accepted by 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  36. arXiv:2303.14617  [pdf, other]

    cs.DB cs.AI cs.LG

    Neural Graph Reasoning: Complex Logical Query Answering Meets Graph Databases

    Authors: Hongyu Ren, Mikhail Galkin, Michael Cochez, Zhaocheng Zhu, Jure Leskovec

    Abstract: Complex logical query answering (CLQA) is a recently emerged task of graph machine learning that goes beyond simple one-hop link prediction and solves a far more complex task of multi-hop logical reasoning over massive, potentially incomplete graphs in a latent space. The task received a significant traction in the community; numerous works expanded the field along theoretical and practical axes t…

    Submitted 26 March, 2023; originally announced March 2023.

  37. arXiv:2303.07669  [pdf, other]

    cs.LG

    AutoTransfer: AutoML with Knowledge Transfer -- An Application to Graph Neural Networks

    Authors: Kaidi Cao, Jiaxuan You, Jiaju Liu, Jure Leskovec

    Abstract: AutoML has demonstrated remarkable success in finding an effective neural architecture for a given machine learning task defined by a specific dataset and an evaluation metric. However, most present AutoML techniques consider each task independently from scratch, which requires exploring many architectures, leading to high computational cost. Here we propose AutoTransfer, an AutoML solution that i…

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: ICLR 2023

  38. arXiv:2303.07666  [pdf, other]

    cs.LG

    Relational Multi-Task Learning: Modeling Relations between Data and Tasks

    Authors: Kaidi Cao, Jiaxuan You, Jure Leskovec

    Abstract: A key assumption in multi-task learning is that at the inference time the multi-task model only has access to a given data point but not to the data point's labels from other tasks. This presents an opportunity to extend multi-task learning to utilize data point's labels from other auxiliary tasks, and this way improves performance on the new task. Here we introduce a novel relational multi-task l…

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: ICLR 2022 Spotlight

  39. arXiv:2302.02055  [pdf, other]

    cs.LG

    Implicit Geometry and Interaction Embeddings Improve Few-Shot Molecular Property Prediction

    Authors: Christopher Fifty, Joseph M. Paggi, Ehsan Amid, Jure Leskovec, Ron Dror

    Abstract: Few-shot learning is a promising approach to molecular property prediction as supervised data is often very limited. However, many important molecular properties depend on complex molecular characteristics -- such as the various 3D geometries a molecule may adopt or the types of chemical interactions it can form -- that are not explicitly encoded in the feature space and must be approximated from…

    Submitted 6 October, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

  40. arXiv:2301.12292  [pdf, other]

    cs.LG cs.AI cs.CY cs.HC

    Zero-shot causal learning

    Authors: Hamed Nilforoshan, Michael Moor, Yusuf Roohani, Yining Chen, Anja Šurina, Michihiro Yasunaga, Sara Oblak, Jure Leskovec

    Abstract: Predicting how different interventions will causally affect a specific individual is important in a variety of domains such as personalized medicine, public policy, and online marketing. There are a large number of methods to predict the effect of an existing intervention based on historical data from individuals who received it. However, in many settings it is important to predict the effects of…

    Submitted 22 February, 2024; v1 submitted 28 January, 2023; originally announced January 2023.

  41. arXiv:2212.06224  [pdf, other]

    cs.CY cs.SI

    Estimating Geographic Spillover Effects of COVID-19 Policies From Large-Scale Mobility Networks

    Authors: Serina Chang, Damir Vrabac, Jure Leskovec, Johan Ugander

    Abstract: Many policies in the US are determined locally, e.g., at the county-level. Local policy regimes provide flexibility between regions, but may become less effective in the presence of geographic spillovers, where populations circumvent local restrictions by traveling to less restricted regions nearby. Due to the endogenous nature of policymaking, there have been few opportunities to reliably estimat…

    Submitted 12 December, 2022; originally announced December 2022.

    Comments: This is the extended version of a paper accepted to AAAI 2023

    Journal ref: AAAI 2023

  42. arXiv:2212.03978  [pdf, other]

    cs.LG cs.AI

    Learning Graph Search Heuristics

    Authors: Michal Pándy, Weikang Qiu, Gabriele Corso, Petar Veličković, Rex Ying, Jure Leskovec, Pietro Liò

    Abstract: Searching for a path between two nodes in a graph is one of the most well-studied and fundamental problems in computer science. In numerous domains such as robotics, AI, or biology, practitioners develop search heuristics to accelerate their pathfinding algorithms. However, it is a laborious and complex process to hand-design heuristics based on the problem and the structure of a given use case. H…

    Submitted 10 January, 2023; v1 submitted 7 December, 2022; originally announced December 2022.

  43. arXiv:2211.12561  [pdf, other]

    cs.CV cs.CL cs.LG

    Retrieval-Augmented Multimodal Language Modeling

    Authors: Michihiro Yasunaga, Armen Aghajanyan, Weijia Shi, Rich James, Jure Leskovec, Percy Liang, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih

    Abstract: Recent multimodal models such as DALL-E and CM3 have achieved remarkable progress in text-to-image and image-to-text generation. However, these models store all learned knowledge (e.g., the appearance of the Eiffel Tower) in the model parameters, requiring increasingly larger models and training data to capture more knowledge. To integrate knowledge in a more scalable and modular way, we propose a…

    Submitted 5 June, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: Published at ICML 2023. Blog post available at https://cs.stanford.edu/~myasu/blog/racm3/

  44. arXiv:2210.17283  [pdf, other]

    cs.LG

    CausalBench: A Large-scale Benchmark for Network Inference from Single-cell Perturbation Data

    Authors: Mathieu Chevalley, Yusuf Roohani, Arash Mehrjou, Jure Leskovec, Patrick Schwab

    Abstract: Causal inference is a vital aspect of multiple scientific disciplines and is routinely applied to high-impact applications such as medicine. However, evaluating the performance of causal inference methods in real-world environments is challenging due to the need for observations under both interventional and control conditions. Traditional evaluations conducted on synthetic datasets do not reflect…

    Submitted 3 July, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

  45. arXiv:2210.14843  [pdf, other]

    stat.ML cs.AI cs.LG

    TuneUp: A Simple Improved Training Strategy for Graph Neural Networks

    Authors: Weihua Hu, Kaidi Cao, Kexin Huang, Edward W Huang, Karthik Subbian, Kenji Kawaguchi, Jure Leskovec

    Abstract: Despite recent advances in Graph Neural Networks (GNNs), their training strategies remain largely under-explored. The conventional training strategy learns over all nodes in the original graph(s) equally, which can be sub-optimal as certain nodes are often more difficult to learn than others. Here we present TuneUp, a simple curriculum-based training strategy for improving the predictive performan…

    Submitted 26 August, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

  46. arXiv:2210.12257  [pdf, other]

    cs.LG cs.AI

    Efficient Automatic Machine Learning via Design Graphs

    Authors: Shirley Wu, Jiaxuan You, Jure Leskovec, Rex Ying

    Abstract: Despite the success of automated machine learning (AutoML), which aims to find the best design, including the architecture of deep networks and hyper-parameters, conventional AutoML methods are computationally expensive and hardly provide insights into the relations of different model design choices. To tackle the challenges, we propose FALCON, an efficient sample-based method to search for the op…

    Submitted 23 May, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022 New Frontiers in Graph Learning Workshop (NeurIPS GLFrontiers 2022). 20 Pages

  47. arXiv:2210.09338  [pdf, other]

    cs.CL cs.AI cs.LG

    Deep Bidirectional Language-Knowledge Graph Pretraining

    Authors: Michihiro Yasunaga, Antoine Bosselut, Hongyu Ren, Xikun Zhang, Christopher D Manning, Percy Liang, Jure Leskovec

    Abstract: Pretraining a language model (LM) on text has been shown to help various downstream NLP tasks. Recent works show that a knowledge graph (KG) can complement text data, offering structured background knowledge that provides a useful scaffold for reasoning. However, these works are not pretrained to learn a deep fusion of the two modalities at scale, limiting the potential to acquire fully joint repr…

    Submitted 18 October, 2022; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: Published at NeurIPS 2022. Code, data, and trained models are available at https://github.com/michiyasunaga/dragon

  48. arXiv:2210.07414  [pdf, other]

    cs.SI physics.soc-ph

    Human mobility networks reveal increased segregation in large cities

    Authors: Hamed Nilforoshan, Wenli Looi, Emma Pierson, Blanca Villanueva, Nic Fishman, Yiling Chen, John Sholar, Beth Redbird, David Grusky, Jure Leskovec

    Abstract: A long-standing expectation is that large, dense, and cosmopolitan areas support socioeconomic mixing and exposure between diverse individuals. It has been difficult to assess this hypothesis because past approaches to measuring socioeconomic mixing have relied on static residential housing data rather than real-life exposures between people at work, in places of leisure, and in home neighborhoods…

    Submitted 24 July, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

  49. arXiv:2210.06722  [pdf, other]

    cs.LG cs.AI cs.CL

    Few-shot Relational Reasoning via Connection Subgraph Pretraining

    Authors: Qian Huang, Hongyu Ren, Jure Leskovec

    Abstract: The few-shot knowledge graph (KG) completion task aims to perform inductive reasoning over the KG: given only a few support triplets of a new relation $\bowtie$ (e.g., (chop,$\bowtie$,kitchen), (read,$\bowtie$,library)), the goal is to predict the query triplets of the same unseen relation $\bowtie$, e.g., (sleep,$\bowtie$,?). Current approaches cast the problem in a meta-learning framework, where the…

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022. Open source implementation at https://github.com/snap-stanford/csr

  50. arXiv:2208.07239  [pdf, other]

    cs.LG cs.AI cs.SI

    ROLAND: Graph Learning Framework for Dynamic Graphs

    Authors: Jiaxuan You, Tianyu Du, Jure Leskovec

    Abstract: Graph Neural Networks (GNNs) have been successfully applied to many real-world static graphs. However, the success of static graphs has not fully translated to dynamic graphs due to the limitations in model design, evaluation settings, and training strategies. Concretely, existing dynamic GNNs do not incorporate state-of-the-art designs from static GNNs, which limits their performance. Current eva…

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: Published in SIGKDD 2022 (Research Track)