Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleJanuary 2025
Diversifying training pool predictability for zero-shot coordination: a theory of mind approach
IJCAI '24: Proceedings of the Thirty-Third International Joint Conference on Artificial IntelligenceArticle No.: 19, Pages 166–174https://doi.org/10.24963/ijcai.2024/19The challenge in constructing artificial social agents is to enable the ability to adapt to novel agents, and is called zero-shot coordination (ZSC). A promising approach is to train the adaptive agents by interacting with a diverse pool of collaborators,...
- research-articleMarch 2024
Learning evolving relations for multivariate time series forecasting
Applied Intelligence (KLU-APIN), Volume 54, Issue 5Pages 3918–3932https://doi.org/10.1007/s10489-023-05220-0AbstractMultivariate time series forecasting is essential in various fields, including healthcare and traffic management, but it is a challenging task due to the strong dynamics in both intra-channel relations (temporal patterns within individual ...
- research-articleJanuary 2025
Root cause explanation of outliers under noisy mechanisms
AAAI'24/IAAI'24/EAAI'24: Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial IntelligenceArticle No.: 2287, Pages 20508–20515https://doi.org/10.1609/aaai.v38i18.30035Identifying root causes of anomalies in causal processes is vital across disciplines. Once identified, one can isolate the root causes and implement necessary measures to restore the normal operation. Causal processes are often modelled as graphs with ...
- research-articleDecember 2023
Balanced Q-learning: Combining the influence of optimistic and pessimistic targets
- Thommen George Karimpanal,
- Hung Le,
- Majid Abdolshah,
- Santu Rana,
- Sunil Gupta,
- Truyen Tran,
- Svetha Venkatesh
AbstractThe optimistic nature of the Q−learning target leads to an overestimation bias, which is an inherent problem associated with standard Q−learning. Such a bias fails to account for the possibility of low returns, particularly in risky scenarios. ...
- research-articleAugust 2023
Social motivation for modelling other agents under partial observability in decentralised training
IJCAI '23: Proceedings of the Thirty-Second International Joint Conference on Artificial IntelligenceArticle No.: 454, Pages 4082–4090https://doi.org/10.24963/ijcai.2023/454Understanding other agents is a key challenge in constructing artificial social agents. Current works focus on centralised training, wherein agents are allowed to know all the information about others and the environmental state during training. In ...
-
- research-articleFebruary 2023
Memory-augmented theory of mind network
AAAI'23/IAAI'23/EAAI'23: Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence and Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence and Thirteenth Symposium on Educational Advances in Artificial IntelligenceArticle No.: 1305, Pages 11630–11637https://doi.org/10.1609/aaai.v37i10.26374Social reasoning necessitates the capacity of theory of mind (ToM), the ability to contextualise and attribute mental states to others without having access to their internal cognitive structure. Recent machine learning approaches to ToM have demonstrated ...
- research-articleApril 2024
Momentum adversarial distillation: handling large distribution shifts in data-free knowledge distillation
NIPS '22: Proceedings of the 36th International Conference on Neural Information Processing SystemsArticle No.: 730, Pages 10055–10067Data-free Knowledge Distillation (DFKD) has attracted attention recently thanks to its appealing capability of transferring knowledge from a teacher network to a student network without using training data. The main idea is to use a generator to ...
- research-articleApril 2024
Functional indirection neural estimator for better out-of-distribution generalization
NIPS '22: Proceedings of the 36th International Conference on Neural Information Processing SystemsArticle No.: 216, Pages 2984–2996The capacity to achieve out-of-distribution (OOD) generalization is a hallmark of human intelligence and yet remains out of reach for machines. This remarkable capability has been attributed to our abilities to make conceptual abstraction and analogy, ...
- ArticleOctober 2022
Towards Effective and Robust Neural Trojan Defenses via Input Filtering
- Kien Do,
- Haripriya Harikumar,
- Hung Le,
- Dung Nguyen,
- Truyen Tran,
- Santu Rana,
- Dang Nguyen,
- Willy Susilo,
- Svetha Venkatesh
AbstractTrojan attacks on deep neural networks are both dangerous and surreptitious. Over the past few years, Trojan attacks have advanced from using only a single input-agnostic trigger and targeting only one class to using multiple, input-specific ...
- ArticleOctober 2022
- research-articleJuly 2022
Explaining Black Box Drug Target Prediction Through Model Agnostic Counterfactual Samples
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB), Volume 20, Issue 2Pages 1020–1029https://doi.org/10.1109/TCBB.2022.3190266Many high-performance DTA deep learning models have been proposed, but they are mostly black-box and thus lack human interpretability. Explainable AI (XAI) can make DTA models more trustworthy, and allows to distill biological knowledge from the models. ...
- research-articleMay 2022
Learning to Transfer Role Assignment Across Team Sizes
AAMAS '22: Proceedings of the 21st International Conference on Autonomous Agents and Multiagent SystemsPages 963–971Multi-agent reinforcement learning holds the key for solving complex tasks that demand the coordination of learning agents. However, strong coordination often leads to expensive exploration over the exponentially large state-action space. A powerful ...
- research-articleMay 2022
Learning Theory of Mind via Dynamic Traits Attribution
AAMAS '22: Proceedings of the 21st International Conference on Autonomous Agents and Multiagent SystemsPages 954–962Machine learning of Theory of Mind (ToM) is essential to build social agents that co-live with humans and other agents. This capacity, once acquired, will help machines infer the mental states of others from observed contextual action trajectories, ...
- research-articleJune 2024
Model-based episodic memory induces dynamic hybrid controls
NIPS '21: Proceedings of the 35th International Conference on Neural Information Processing SystemsArticle No.: 2319, Pages 30313–30325Episodic control enables sample efficiency in reinforcement learning by recalling past experiences from an episodic memory. We propose a new model-based episodic memory of trajectories addressing current limitations of episodic control. Our memory ...
- ArticleNovember 2021
DeepProcess: Supporting Business Process Execution Using a MANN-Based Recommender System
AbstractProcess-aware Recommender systems can provide critical decision support functionality to aid business process execution by recommending what actions to take next. Based on recent advances in the field of deep learning, we present a novel memory-...
- research-articleNovember 2021
Hierarchical Conditional Relation Networks for Multimodal Video Question Answering
International Journal of Computer Vision (IJCV), Volume 129, Issue 11Pages 3027–3050https://doi.org/10.1007/s11263-021-01514-3AbstractVideo Question Answering (Video QA) challenges modelers in multiple fronts. Modeling video necessitates building not only spatio-temporal models for the dynamic visual channel but also multimodal structures for associated information channels such ...
- ArticleSeptember 2021
Fast Conditional Network Compression Using Bayesian HyperNetworks
- Phuoc Nguyen,
- Truyen Tran,
- Ky Le,
- Sunil Gupta,
- Santu Rana,
- Dang Nguyen,
- Trong Nguyen,
- Shannon Ryan,
- Svetha Venkatesh
Machine Learning and Knowledge Discovery in Databases. Research TrackPages 330–345https://doi.org/10.1007/978-3-030-86523-8_20AbstractWe introduce a conditional compression problem and propose a fast framework for tackling it. The problem is how to quickly compress a pretrained large neural network into optimal smaller networks given target contexts, e.g., a context involving ...
- ArticleSeptember 2021
Variational Hyper-encoding Networks
Machine Learning and Knowledge Discovery in Databases. Research TrackPages 100–115https://doi.org/10.1007/978-3-030-86520-7_7AbstractWe propose a framework called HyperVAE for encoding distributions of distributions. When a target distribution is modeled by a VAE, its neural network parameters are sampled from a distribution in the model space modeled by a hyper-level VAE. We ...
- ArticleSeptember 2021
Knowledge Distillation with Distribution Mismatch
- Dang Nguyen,
- Sunil Gupta,
- Trong Nguyen,
- Santu Rana,
- Phuoc Nguyen,
- Truyen Tran,
- Ky Le,
- Shannon Ryan,
- Svetha Venkatesh
Machine Learning and Knowledge Discovery in Databases. Research TrackPages 250–265https://doi.org/10.1007/978-3-030-86520-7_16AbstractKnowledge distillation (KD) is one of the most efficient methods to compress a large deep neural network (called teacher) to a smaller network (called student). Current state-of-the-art KD methods assume that the distributions of training data of ...
- abstractAugust 2021
From Deep Learning to Deep Reasoning
KDD '21: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data MiningPages 4076–4077https://doi.org/10.1145/3447548.3470803The rise of big data and big compute has brought modern neural networks to many walks of digital life, thanks to the relative ease of constructing large models that scale to the real world. Current successes of Transformers and self-supervised ...