-
Explicit Contact Optimization in Whole-Body Contact-Rich Manipulation
Authors:
Victor Leve,
João Moura,
Namiko Saito,
Steve Tonneau,
Sethu Vijayakumar
Abstract:
Humans can exploit contacts anywhere on their body surface to manipulate large and heavy items, objects normally out of reach or multiple objects at once. However, such manipulation through contacts using the whole surface of the body remains extremely challenging to achieve on robots. This can be labelled as Whole-Body Contact-Rich Manipulation (WBCRM) problem. In addition to the high-dimensional…
▽ More
Humans can exploit contacts anywhere on their body surface to manipulate large and heavy items, objects normally out of reach or multiple objects at once. However, such manipulation through contacts using the whole surface of the body remains extremely challenging to achieve on robots. This can be labelled as Whole-Body Contact-Rich Manipulation (WBCRM) problem. In addition to the high-dimensionality of the Contact-Rich Manipulation problem due to the combinatorics of contact modes, admitting contact creation anywhere on the body surface adds complexity, which hinders planning of manipulation within a reasonable time. We address this computational problem by formulating the contact and motion planning of planar WBCRM as hierarchical continuous optimization problems. To enable this formulation, we propose a novel continuous explicit representation of the robot surface, that we believe to be foundational for future research using continuous optimization for WBCRM. Our results demonstrate a significant improvement of convergence, planning time and feasibility - with, on the average, 99% less iterations and 96% reduction in time to find a solution over considered scenarios, without recourse to prone-to-failure trajectory refinement steps.
△ Less
Submitted 28 August, 2024;
originally announced August 2024.
-
Mabuchi rays, test configurations and quantization for toric manifolds
Authors:
António Gouveia,
José M. Mourão,
João P. Nunes
Abstract:
We consider Mabuchi rays of toric Kähler structures on symplectic toric manifolds which are associated to toric test configurations and that are generated by convex functions on themoment polytope, $P$, whose second derivative has support given by a compact subset $K<P$. Associated to the test configuration there is a polyhedral decomposition of $P$ whose components are approximated by the compone…
▽ More
We consider Mabuchi rays of toric Kähler structures on symplectic toric manifolds which are associated to toric test configurations and that are generated by convex functions on themoment polytope, $P$, whose second derivative has support given by a compact subset $K<P$. Associated to the test configuration there is a polyhedral decomposition of $P$ whose components are approximated by the components of $P \setminus K$. Along such Mabuchi rays, the toric complex structure remains unchanged on the inverse image under the moment map of $(P \setminus \check {K})$, where $\check {K}$ denotes the interior of $K$. At infinite geodesic time, the Kähler polarizations along the ray converge to interesting new toric mixed polarizations. The quantization in these limit polarizations is given by restrictions of the monomial holomorphic sections of the Kähler quantization, for monomials corresponding to integral points in $P \setminus \check {K}$, and by sections on the fibers of the moment map over the integral points contained in $\check {K}$, which, along the directions parallel to $K$ are holomorphic and which along the directions transverse to $K$ are distributional. These quantizations correspond to quantizations of the central fiber of the test family, in the symplectic picture. We present the case of $S2$ in detail and then generalize to higher dimensional symplectic toric manifolds. Metrically, at infinite Mabuchi geodesic time, the sphere decomposes into two discs and a collection of cylinders, separated by infinitely long lines. Correspondingly, the quantization in the limit polarization decomposes into a direct sum of the contributions from the quantizations of each of these components.
△ Less
Submitted 8 July, 2024;
originally announced July 2024.
-
User Story Tutor (UST) to Support Agile Software Developers
Authors:
Giseldo da Silva Neo,
José Antão Beltrão Moura,
Hyggo Oliveira de Almeida,
Alana Viana Borges da Silva Neo,
Olival de Gusmão Freitas Júnior
Abstract:
User Stories record what must be built in projects that use agile practices. User Stories serve both to estimate effort, generally measured in Story Points, and to plan what should be done in a Sprint. Therefore, it is essential to train software engineers on how to create simple, easily readable, and comprehensive User Stories. For that reason, we designed, implemented, applied, and evaluated a w…
▽ More
User Stories record what must be built in projects that use agile practices. User Stories serve both to estimate effort, generally measured in Story Points, and to plan what should be done in a Sprint. Therefore, it is essential to train software engineers on how to create simple, easily readable, and comprehensive User Stories. For that reason, we designed, implemented, applied, and evaluated a web application called User Story Tutor (UST). UST checks the description of a given User Story for readability, and if needed, recommends appropriate practices for improvement. UST also estimates a User Story effort in Story Points using Machine Learning techniques. As such UST may support the continuing education of agile development teams when writing and reviewing User Stories. UST's ease of use was evaluated by 40 agile practitioners according to the Technology Acceptance Model (TAM) and AttrakDiff. The TAM evaluation averages were good in almost all considered variables. Application of the AttrakDiff evaluation framework produced similar good results. Apparently, UST can be used with good reliability. Applying UST to assist in the construction of User Stories is a viable technique that, at the very least, can be used by agile developments to complement and enhance current User Story creation.
△ Less
Submitted 23 June, 2024;
originally announced June 2024.
-
Fibering polarizations and Mabuchi rays on symmetric spaces of compact type
Authors:
Thomas Baier,
Ana Cristina Ferreira,
Joachim Hilgert,
José M. Mourão,
João P. Nunes
Abstract:
In this paper, we describe holomorphic quantizations of the cotangent bundle of a symmetric space of compact type $T^*(U/K)\cong U_\mathbb{C}/K_\mathbb{C}$, along Mabuchi rays of $U$-invariant Kähler structures. At infinite geodesic time, the Kähler polarizations converge to a mixed polarization $\mathcal{P}_\infty$. We show how a generalized coherent state transform relates the quantizations alon…
▽ More
In this paper, we describe holomorphic quantizations of the cotangent bundle of a symmetric space of compact type $T^*(U/K)\cong U_\mathbb{C}/K_\mathbb{C}$, along Mabuchi rays of $U$-invariant Kähler structures. At infinite geodesic time, the Kähler polarizations converge to a mixed polarization $\mathcal{P}_\infty$. We show how a generalized coherent state transform relates the quantizations along the Mabuchi geodesics such that holomorphic sections converge, as geodesic time goes to infinity, to distributional $\mathcal{P}_\infty$-polarized sections. Unlike in the case of $T^*U$, the gCST mapping from the Hilbert space of vertically polarized sections are not asymptotically unitary due to the appearance of representation dependent factors associated to the isotypical decomposition for the $U$-action. In agreement with the general program outlined in [Bai+23], we also describe how the quantization in the limit polarization $\mathcal{P}_\infty$ is given by the direct sum of the quantizations for all the symplectic reductions relative to the invariant torus action associated to the Hamiltonian action of $U$.
△ Less
Submitted 30 April, 2024;
originally announced April 2024.
-
The Positivity of the Neural Tangent Kernel
Authors:
Luís Carvalho,
João L. Costa,
José Mourão,
Gonçalo Oliveira
Abstract:
The Neural Tangent Kernel (NTK) has emerged as a fundamental concept in the study of wide Neural Networks. In particular, it is known that the positivity of the NTK is directly related to the memorization capacity of sufficiently wide networks, i.e., to the possibility of reaching zero loss in training, via gradient descent. Here we will improve on previous works and obtain a sharp result concerni…
▽ More
The Neural Tangent Kernel (NTK) has emerged as a fundamental concept in the study of wide Neural Networks. In particular, it is known that the positivity of the NTK is directly related to the memorization capacity of sufficiently wide networks, i.e., to the possibility of reaching zero loss in training, via gradient descent. Here we will improve on previous works and obtain a sharp result concerning the positivity of the NTK of feedforward networks of any depth. More precisely, we will show that, for any non-polynomial activation function, the NTK is strictly positive definite. Our results are based on a novel characterization of polynomial functions which is of independent interest.
△ Less
Submitted 19 April, 2024;
originally announced April 2024.
-
Gradient Networks
Authors:
Shreyas Chaudhari,
Srinivasa Pranav,
José M. F. Moura
Abstract:
Directly parameterizing and learning gradients of functions has widespread significance, with specific applications in optimization, generative modeling, and optimal transport. This paper introduces gradient networks (GradNets): novel neural network architectures that parameterize gradients of various function classes. GradNets exhibit specialized architectural constraints that ensure corresponden…
▽ More
Directly parameterizing and learning gradients of functions has widespread significance, with specific applications in optimization, generative modeling, and optimal transport. This paper introduces gradient networks (GradNets): novel neural network architectures that parameterize gradients of various function classes. GradNets exhibit specialized architectural constraints that ensure correspondence to gradient functions. We provide a comprehensive GradNet design framework that includes methods for transforming GradNets into monotone gradient networks (mGradNets), which are guaranteed to represent gradients of convex functions. We establish the approximation capabilities of the proposed GradNet and mGradNet. Our results demonstrate that these networks universally approximate the gradients of (convex) functions. Furthermore, these networks can be customized to correspond to specific spaces of (monotone) gradient functions, including gradients of transformed sums of (convex) ridge functions. Our analysis leads to two distinct GradNet architectures, GradNet-C and GradNet-M, and we describe the corresponding monotone versions, mGradNet-C and mGradNet-M. Our empirical results show that these architectures offer efficient parameterizations and outperform popular methods in gradient field learning tasks.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
Learning Goal-Directed Object Pushing in Cluttered Scenes with Location-Based Attention
Authors:
Nils Dengler,
Juan Del Aguila Ferrandis,
João Moura,
Sethu Vijayakumar,
Maren Bennewitz
Abstract:
Non-prehensile planar pushing is a challenging task due to its underactuated nature with hybrid-dynamics, where a robot needs to reason about an object's long-term behaviour and contact-switching, while being robust to contact uncertainty. The presence of clutter in the environment further complicates this task, introducing the need to include more sophisticated spatial analysis to avoid collision…
▽ More
Non-prehensile planar pushing is a challenging task due to its underactuated nature with hybrid-dynamics, where a robot needs to reason about an object's long-term behaviour and contact-switching, while being robust to contact uncertainty. The presence of clutter in the environment further complicates this task, introducing the need to include more sophisticated spatial analysis to avoid collisions. Building upon prior work on reinforcement learning (RL) with multimodal categorical exploration for planar pushing, in this paper we incorporate location-based attention to enable robust navigation through clutter. Unlike previous RL literature addressing this obstacle avoidance pushing task, our framework requires no predefined global paths and considers the target orientation of the manipulated object. Our results demonstrate that the learned policies successfully navigate through a wide range of complex obstacle configurations, including dynamic obstacles, with smooth motions, achieving the desired target object pose. We also validate the transferability of the learned policies to robotic hardware using the KUKA iiwa robot arm.
△ Less
Submitted 26 March, 2024;
originally announced March 2024.
-
Impact-Aware Bimanual Catching of Large-Momentum Objects
Authors:
Lei Yan,
Theodoros Stouraitis,
João Moura,
Wenfu Xu,
Michael Gienger,
Sethu Vijayakumar
Abstract:
This paper investigates one of the most challenging tasks in dynamic manipulation -- catching large-momentum moving objects. Beyond the realm of quasi-static manipulation, dealing with highly dynamic objects can significantly improve the robot's capability of interacting with its surrounding environment. Yet, the inevitable motion mismatch between the fast moving object and the approaching robot w…
▽ More
This paper investigates one of the most challenging tasks in dynamic manipulation -- catching large-momentum moving objects. Beyond the realm of quasi-static manipulation, dealing with highly dynamic objects can significantly improve the robot's capability of interacting with its surrounding environment. Yet, the inevitable motion mismatch between the fast moving object and the approaching robot will result in large impulsive forces, which lead to the unstable contacts and irreversible damage to both the object and the robot. To address the above problems, we propose an online optimization framework to: 1) estimate and predict the linear and angular motion of the object; 2) search and select the optimal contact locations across every surface of the object to mitigate impact through sequential quadratic programming (SQP); 3) simultaneously optimize the end-effector motion, stiffness, and contact force for both robots using multi-mode trajectory optimization (MMTO); and 4) realise the impact-aware catching motion on the compliant robotic system based on indirect force controller. We validate the impulse distribution, contact selection, and impact-aware MMTO algorithms in simulation and demonstrate the benefits of the proposed framework in real-world experiments including catching large-momentum moving objects with well-defined motion, constrained motion and free-flying motion.
△ Less
Submitted 25 March, 2024;
originally announced March 2024.
-
An Analytic Solution to Covariance Propagation in Neural Networks
Authors:
Oren Wright,
Yorie Nakahira,
José M. F. Moura
Abstract:
Uncertainty quantification of neural networks is critical to measuring the reliability and robustness of deep learning systems. However, this often involves costly or inaccurate sampling methods and approximations. This paper presents a sample-free moment propagation technique that propagates mean vectors and covariance matrices across a network to accurately characterize the input-output distribu…
▽ More
Uncertainty quantification of neural networks is critical to measuring the reliability and robustness of deep learning systems. However, this often involves costly or inaccurate sampling methods and approximations. This paper presents a sample-free moment propagation technique that propagates mean vectors and covariance matrices across a network to accurately characterize the input-output distributions of neural networks. A key enabler of our technique is an analytic solution for the covariance of random variables passed through nonlinear activation functions, such as Heaviside, ReLU, and GELU. The wide applicability and merits of the proposed technique are shown in experiments analyzing the input-output distributions of trained neural networks and training Bayesian neural networks.
△ Less
Submitted 24 March, 2024;
originally announced March 2024.
-
Latent Object Characteristics Recognition with Visual to Haptic-Audio Cross-modal Transfer Learning
Authors:
Namiko Saito,
Joao Moura,
Hiroki Uchida,
Sethu Vijayakumar
Abstract:
Recognising the characteristics of objects while a robot handles them is crucial for adjusting motions that ensure stable and efficient interactions with containers. Ahead of realising stable and efficient robot motions for handling/transferring the containers, this work aims to recognise the latent unobservable object characteristics. While vision is commonly used for object recognition by robots…
▽ More
Recognising the characteristics of objects while a robot handles them is crucial for adjusting motions that ensure stable and efficient interactions with containers. Ahead of realising stable and efficient robot motions for handling/transferring the containers, this work aims to recognise the latent unobservable object characteristics. While vision is commonly used for object recognition by robots, it is ineffective for detecting hidden objects. However, recognising objects indirectly using other sensors is a challenging task. To address this challenge, we propose a cross-modal transfer learning approach from vision to haptic-audio. We initially train the model with vision, directly observing the target object. Subsequently, we transfer the latent space learned from vision to a second module, trained only with haptic-audio and motor data. This transfer learning framework facilitates the representation of object characteristics using indirect sensor data, thereby improving recognition accuracy. For evaluating the recognition accuracy of our proposed learning framework we selected shape, position, and orientation as the object characteristics. Finally, we demonstrate online recognition of both trained and untrained objects using the humanoid robot Nextage Open.
△ Less
Submitted 15 March, 2024;
originally announced March 2024.
-
Peer-to-Peer Learning + Consensus with Non-IID Data
Authors:
Srinivasa Pranav,
José M. F. Moura
Abstract:
Peer-to-peer deep learning algorithms are enabling distributed edge devices to collaboratively train deep neural networks without exchanging raw training data or relying on a central server. Peer-to-Peer Learning (P2PL) and other algorithms based on Distributed Local-Update Stochastic/mini-batch Gradient Descent (local DSGD) rely on interleaving epochs of training with distributed consensus steps.…
▽ More
Peer-to-peer deep learning algorithms are enabling distributed edge devices to collaboratively train deep neural networks without exchanging raw training data or relying on a central server. Peer-to-Peer Learning (P2PL) and other algorithms based on Distributed Local-Update Stochastic/mini-batch Gradient Descent (local DSGD) rely on interleaving epochs of training with distributed consensus steps. This process leads to model parameter drift/divergence amongst participating devices in both IID and non-IID settings. We observe that model drift results in significant oscillations in test performance evaluated after local training and consensus phases. We then identify factors that amplify performance oscillations and demonstrate that our novel approach, P2PL with Affinity, dampens test performance oscillations in non-IID settings without incurring any additional communication cost.
△ Less
Submitted 21 December, 2023;
originally announced December 2023.
-
Inferring the Graph of Networked Dynamical Systems under Partial Observability and Spatially Colored Noise
Authors:
Augusto Santos,
Diogo Rente,
Rui Seabra,
José M. F. Moura
Abstract:
In a Networked Dynamical System (NDS), each node is a system whose dynamics are coupled with the dynamics of neighboring nodes. The global dynamics naturally builds on this network of couplings and it is often excited by a noise input with nontrivial structure. The underlying network is unknown in many applications and should be inferred from observed data. We assume: i) Partial observability -- t…
▽ More
In a Networked Dynamical System (NDS), each node is a system whose dynamics are coupled with the dynamics of neighboring nodes. The global dynamics naturally builds on this network of couplings and it is often excited by a noise input with nontrivial structure. The underlying network is unknown in many applications and should be inferred from observed data. We assume: i) Partial observability -- time series data is only available over a subset of the nodes; ii) Input noise -- it is correlated across distinct nodes while temporally independent, i.e., it is spatially colored. We present a feasibility condition on the noise correlation structure wherein there exists a consistent network inference estimator to recover the underlying fundamental dependencies among the observed nodes. Further, we describe a structure identification algorithm that exhibits competitive performance across distinct regimes of network connectivity, observability, and noise correlation.
△ Less
Submitted 18 December, 2023;
originally announced December 2023.
-
PhyOT: Physics-informed object tracking in surveillance cameras
Authors:
Kawisorn Kamtue,
Jose M. F. Moura,
Orathai Sangpetch,
Paulo Garcia
Abstract:
While deep learning has been very successful in computer vision, real world operating conditions such as lighting variation, background clutter, or occlusion hinder its accuracy across several tasks. Prior work has shown that hybrid models -- combining neural networks and heuristics/algorithms -- can outperform vanilla deep learning for several computer vision tasks, such as classification or trac…
▽ More
While deep learning has been very successful in computer vision, real world operating conditions such as lighting variation, background clutter, or occlusion hinder its accuracy across several tasks. Prior work has shown that hybrid models -- combining neural networks and heuristics/algorithms -- can outperform vanilla deep learning for several computer vision tasks, such as classification or tracking. We consider the case of object tracking, and evaluate a hybrid model (PhyOT) that conceptualizes deep neural networks as ``sensors'' in a Kalman filter setup, where prior knowledge, in the form of Newtonian laws of motion, is used to fuse sensor observations and to perform improved estimations. Our experiments combine three neural networks, performing position, indirect velocity and acceleration estimation, respectively, and evaluate such a formulation on two benchmark datasets: a warehouse security camera dataset that we collected and annotated and a traffic camera open dataset. Results suggest that our PhyOT can track objects in extreme conditions that the state-of-the-art deep neural networks fail while its performance in general cases does not degrade significantly from that of existing deep learning approaches. Results also suggest that our PhyOT components are generalizable and transferable.
△ Less
Submitted 13 December, 2023;
originally announced December 2023.
-
Learning the Causal Structure of Networked Dynamical Systems under Latent Nodes and Structured Noise
Authors:
Augusto Santos,
Diogo Rente,
Rui Seabra,
José M. F. Moura
Abstract:
This paper considers learning the hidden causal network of a linear networked dynamical system (NDS) from the time series data at some of its nodes -- partial observability. The dynamics of the NDS are driven by colored noise that generates spurious associations across pairs of nodes, rendering the problem much harder. To address the challenge of noise correlation and partial observability, we ass…
▽ More
This paper considers learning the hidden causal network of a linear networked dynamical system (NDS) from the time series data at some of its nodes -- partial observability. The dynamics of the NDS are driven by colored noise that generates spurious associations across pairs of nodes, rendering the problem much harder. To address the challenge of noise correlation and partial observability, we assign to each pair of nodes a feature vector computed from the time series data of observed nodes. The feature embedding is engineered to yield structural consistency: there exists an affine hyperplane that consistently partitions the set of features, separating the feature vectors corresponding to connected pairs of nodes from those corresponding to disconnected pairs. The causal inference problem is thus addressed via clustering the designed features. We demonstrate with simple baseline supervised methods the competitive performance of the proposed causal inference mechanism under broad connectivity regimes and noise correlation levels, including a real world network. Further, we devise novel technical guarantees of structural consistency for linear NDS under the considered regime.
△ Less
Submitted 12 February, 2024; v1 submitted 10 December, 2023;
originally announced December 2023.
-
Peer-to-Peer Deep Learning for Beyond-5G IoT
Authors:
Srinivasa Pranav,
José M. F. Moura
Abstract:
We present P2PL, a practical multi-device peer-to-peer deep learning algorithm that, unlike the federated learning paradigm, does not require coordination from edge servers or the cloud. This makes P2PL well-suited for the sheer scale of beyond-5G computing environments like smart cities that otherwise create range, latency, bandwidth, and single point of failure issues for federated approaches.…
▽ More
We present P2PL, a practical multi-device peer-to-peer deep learning algorithm that, unlike the federated learning paradigm, does not require coordination from edge servers or the cloud. This makes P2PL well-suited for the sheer scale of beyond-5G computing environments like smart cities that otherwise create range, latency, bandwidth, and single point of failure issues for federated approaches.
P2PL introduces max norm synchronization to catalyze training, retains on-device deep model training to preserve privacy, and leverages local inter-device communication to implement distributed consensus. Each device iteratively alternates between two phases: 1) on-device learning and 2) peer-to-peer cooperation where they combine model parameters with nearby devices. We empirically show that all participating devices achieve the same test performance attained by federated and centralized training -- even with 100 devices and relaxed singly stochastic consensus weights. We extend these experimental results to settings with diverse network topologies, sparse and intermittent communication, and non-IID data distributions.
△ Less
Submitted 3 May, 2024; v1 submitted 28 October, 2023;
originally announced October 2023.
-
Online Estimation of Articulated Objects with Factor Graphs using Vision and Proprioceptive Sensing
Authors:
Russell Buchanan,
Adrian Röfer,
João Moura,
Abhinav Valada,
Sethu Vijayakumar
Abstract:
From dishwashers to cabinets, humans interact with articulated objects every day, and for a robot to assist in common manipulation tasks, it must learn a representation of articulation. Recent deep learning learning methods can provide powerful vision-based priors on the affordance of articulated objects from previous, possibly simulated, experiences. In contrast, many works estimate articulation…
▽ More
From dishwashers to cabinets, humans interact with articulated objects every day, and for a robot to assist in common manipulation tasks, it must learn a representation of articulation. Recent deep learning learning methods can provide powerful vision-based priors on the affordance of articulated objects from previous, possibly simulated, experiences. In contrast, many works estimate articulation by observing the object in motion, requiring the robot to already be interacting with the object. In this work, we propose to use the best of both worlds by introducing an online estimation method that merges vision-based affordance predictions from a neural network with interactive kinematic sensing in an analytical model. Our work has the benefit of using vision to predict an articulation model before touching the object, while also being able to update the model quickly from kinematic sensing during the interaction. In this paper, we implement a full system using shared autonomy for robotic opening of articulated objects, in particular objects in which the articulation is not apparent from vision alone. We implemented our system on a real robot and performed several autonomous closed-loop experiments in which the robot had to open a door with unknown joint while estimating the articulation online. Our system achieved an 80% success rate for autonomous opening of unknown articulated objects.
△ Less
Submitted 28 September, 2023;
originally announced September 2023.
-
Few-Shot Learning of Force-Based Motions From Demonstration Through Pre-training of Haptic Representation
Authors:
Marina Y. Aoyama,
João Moura,
Namiko Saito,
Sethu Vijayakumar
Abstract:
In many contact-rich tasks, force sensing plays an essential role in adapting the motion to the physical properties of the manipulated object. To enable robots to capture the underlying distribution of object properties necessary for generalising learnt manipulation tasks to unseen objects, existing Learning from Demonstration (LfD) approaches require a large number of costly human demonstrations.…
▽ More
In many contact-rich tasks, force sensing plays an essential role in adapting the motion to the physical properties of the manipulated object. To enable robots to capture the underlying distribution of object properties necessary for generalising learnt manipulation tasks to unseen objects, existing Learning from Demonstration (LfD) approaches require a large number of costly human demonstrations. Our proposed semi-supervised LfD approach decouples the learnt model into an haptic representation encoder and a motion generation decoder. This enables us to pre-train the first using large amount of unsupervised data, easily accessible, while using few-shot LfD to train the second, leveraging the benefits of learning skills from humans. We validate the approach on the wiping task using sponges with different stiffness and surface friction. Our results demonstrate that pre-training significantly improves the ability of the LfD model to recognise physical properties and generate desired wiping motions for unseen sponges, outperforming the LfD method without pre-training. We validate the motion generated by our semi-supervised LfD model on the physical robot hardware using the KUKA iiwa robot arm. We also validate that the haptic representation encoder, pre-trained in simulation, captures the properties of real objects, explaining its contribution to improving the generalisation of the downstream task.
△ Less
Submitted 8 September, 2023;
originally announced September 2023.
-
Nonprehensile Planar Manipulation through Reinforcement Learning with Multimodal Categorical Exploration
Authors:
Juan Del Aguila Ferrandis,
João Moura,
Sethu Vijayakumar
Abstract:
Developing robot controllers capable of achieving dexterous nonprehensile manipulation, such as pushing an object on a table, is challenging. The underactuated and hybrid-dynamics nature of the problem, further complicated by the uncertainty resulting from the frictional interactions, requires sophisticated control behaviors. Reinforcement Learning (RL) is a powerful framework for developing such…
▽ More
Developing robot controllers capable of achieving dexterous nonprehensile manipulation, such as pushing an object on a table, is challenging. The underactuated and hybrid-dynamics nature of the problem, further complicated by the uncertainty resulting from the frictional interactions, requires sophisticated control behaviors. Reinforcement Learning (RL) is a powerful framework for developing such robot controllers. However, previous RL literature addressing the nonprehensile pushing task achieves low accuracy, non-smooth trajectories, and only simple motions, i.e. without rotation of the manipulated object. We conjecture that previously used unimodal exploration strategies fail to capture the inherent hybrid-dynamics of the task, arising from the different possible contact interaction modes between the robot and the object, such as sticking, sliding, and separation. In this work, we propose a multimodal exploration approach through categorical distributions, which enables us to train planar pushing RL policies for arbitrary starting and target object poses, i.e. positions and orientations, and with improved accuracy. We show that the learned policies are robust to external disturbances and observation noise, and scale to tasks with multiple pushers. Furthermore, we validate the transferability of the learned policies, trained entirely in simulation, to a physical robot hardware using the KUKA iiwa robot arm. See our supplemental video: https://youtu.be/vTdva1mgrk4.
△ Less
Submitted 4 August, 2023;
originally announced August 2023.
-
RobôCIn Small Size League Extended Team Description Paper for RoboCup 2023
Authors:
Aline Lima de Oliveira,
Cauê Addae da Silva Gomes,
Cecília Virginia Santos da Silva,
Charles Matheus de Sousa Alves,
Danilo Andrade Martins de Souza,
Driele Pires Ferreira Araújo Xavier,
Edgleyson Pereira da Silva,
Felipe Bezerra Martins,
Lucas Henrique Cavalcanti Santos,
Lucas Dias Maciel,
Matheus Paixão Gumercindo dos Santos,
Matheus Lafayette Vasconcelos,
Matheus Vinícius Teotonio do Nascimento Andrade,
João Guilherme Oliveira Carvalho de Melo,
João Pedro Souza Pereira de Moura,
José Ronald da Silva,
José Victor Silva Cruz,
Pedro Henrique Santana de Morais,
Pedro Paulo Salman de Oliveira,
Riei Joaquim Matos Rodrigues,
Roberto Costa Fernandes,
Ryan Vinicius Santos Morais,
Tamara Mayara Ramos Teobaldo,
Washington Igor dos Santos Silva,
Edna Natividade Silva Barros
Abstract:
RobôCIn has participated in RoboCup Small Size League since 2019, won its first world title in 2022 (Division B), and is currently a three-times Latin-American champion. This paper presents our improvements to defend the Small Size League (SSL) division B title in RoboCup 2023 in Bordeaux, France. This paper aims to share some of the academic research that our team developed over the past year. Ou…
▽ More
RobôCIn has participated in RoboCup Small Size League since 2019, won its first world title in 2022 (Division B), and is currently a three-times Latin-American champion. This paper presents our improvements to defend the Small Size League (SSL) division B title in RoboCup 2023 in Bordeaux, France. This paper aims to share some of the academic research that our team developed over the past year. Our team has successfully published 2 articles related to SSL at two high-impact conferences: the 25th RoboCup International Symposium and the 19th IEEE Latin American Robotics Symposium (LARS 2022). Over the last year, we have been continuously migrating from our past codebase to Unification. We will describe the new architecture implemented and some points of software and AI refactoring. In addition, we discuss the process of integrating machined components into the mechanical system, our development for participating in the vision blackout challenge last year and what we are preparing for this year.
△ Less
Submitted 19 July, 2023;
originally announced July 2023.
-
Regularizing Self-training for Unsupervised Domain Adaptation via Structural Constraints
Authors:
Rajshekhar Das,
Jonathan Francis,
Sanket Vaibhav Mehta,
Jean Oh,
Emma Strubell,
Jose Moura
Abstract:
Self-training based on pseudo-labels has emerged as a dominant approach for addressing conditional distribution shifts in unsupervised domain adaptation (UDA) for semantic segmentation problems. A notable drawback, however, is that this family of approaches is susceptible to erroneous pseudo labels that arise from confirmation biases in the source domain and that manifest as nuisance factors in th…
▽ More
Self-training based on pseudo-labels has emerged as a dominant approach for addressing conditional distribution shifts in unsupervised domain adaptation (UDA) for semantic segmentation problems. A notable drawback, however, is that this family of approaches is susceptible to erroneous pseudo labels that arise from confirmation biases in the source domain and that manifest as nuisance factors in the target domain. A possible source for this mismatch is the reliance on only photometric cues provided by RGB image inputs, which may ultimately lead to sub-optimal adaptation. To mitigate the effect of mismatched pseudo-labels, we propose to incorporate structural cues from auxiliary modalities, such as depth, to regularise conventional self-training objectives. Specifically, we introduce a contrastive pixel-level objectness constraint that pulls the pixel representations within a region of an object instance closer, while pushing those from different object categories apart. To obtain object regions consistent with the true underlying object, we extract information from both depth maps and RGB-images in the form of multimodal clustering. Crucially, the objectness constraint is agnostic to the ground-truth semantic labels and, hence, appropriate for unsupervised domain adaptation. In this work, we show that our regularizer significantly improves top performing self-training methods (by up to $2$ points) in various UDA benchmarks for semantic segmentation. We include all code in the supplementary.
△ Less
Submitted 28 April, 2023;
originally announced May 2023.
-
Wide neural networks: From non-gaussian random fields at initialization to the NTK geometry of training
Authors:
Luís Carvalho,
João Lopes Costa,
José Mourão,
Gonçalo Oliveira
Abstract:
Recent developments in applications of artificial neural networks with over $n=10^{14}$ parameters make it extremely important to study the large $n$ behaviour of such networks. Most works studying wide neural networks have focused on the infinite width $n \to +\infty$ limit of such networks and have shown that, at initialization, they correspond to Gaussian processes. In this work we will study t…
▽ More
Recent developments in applications of artificial neural networks with over $n=10^{14}$ parameters make it extremely important to study the large $n$ behaviour of such networks. Most works studying wide neural networks have focused on the infinite width $n \to +\infty$ limit of such networks and have shown that, at initialization, they correspond to Gaussian processes. In this work we will study their behavior for large, but finite $n$. Our main contributions are the following:
(1) The computation of the corrections to Gaussianity in terms of an asymptotic series in $n^{-\frac{1}{2}}$. The coefficients in this expansion are determined by the statistics of parameter initialization and by the activation function.
(2) Controlling the evolution of the outputs of finite width $n$ networks, during training, by computing deviations from the limiting infinite width case (in which the network evolves through a linear flow). This improves previous estimates and yields sharper decay rates for the (finite width) NTK in terms of $n$, valid during the entire training procedure. As a corollary, we also prove that, with arbitrarily high probability, the training of sufficiently wide neural networks converges to a global minimum of the corresponding quadratic loss function.
(3) Estimating how the deviations from Gaussianity evolve with training in terms of $n$. In particular, using a certain metric in the space of measures we find that, along training, the resulting measure is within $n^{-\frac{1}{2}}(\log n)^{1+}$ of the time dependent Gaussian process corresponding to the infinite width network (which is explicitly given by precomposing the initial Gaussian process with the linear flow corresponding to training in the infinite width limit).
△ Less
Submitted 6 April, 2023;
originally announced April 2023.
-
Planning for Electric Vehicles Coupled with Urban Mobility
Authors:
Yanyan Xu,
Serdar Colak,
Emre C. Kara,
Scott J. Moura,
Marta C. Gonzalez
Abstract:
The rising adoption of plug-in electric vehicles (PEVs) leads to the alignment of their electricity and their mobility demands. Therefore, transportation and power infrastructures are becoming increasingly interdependent. In this work, we uncover patterns of PEV mobility by integrating for the first time two unique data sets: (i) mobile phone activity of 1.39 million Bay Area residents and (ii) ch…
▽ More
The rising adoption of plug-in electric vehicles (PEVs) leads to the alignment of their electricity and their mobility demands. Therefore, transportation and power infrastructures are becoming increasingly interdependent. In this work, we uncover patterns of PEV mobility by integrating for the first time two unique data sets: (i) mobile phone activity of 1.39 million Bay Area residents and (ii) charging activity of PEVs in 580,000 sessions obtained in the same region. We present a method to estimate individual mobility of PEV drivers at fine temporal and spatial resolution integrating survey data with mobile phone data and income information obtained from census. Thereupon, we recommend changes in PEVs charging times of commuters at their work stations that take into account individual travel needs and shave the pronounced peak in power demand. Informed by the tariff of electricity, we calculate the12 monetary gains to incentivize the adoption of the recommendations. These results open avenues for planning for the future of coupled transportation and electricity needs using personalized data.
△ Less
Submitted 27 March, 2023;
originally announced March 2023.
-
Graph Signal Processing: History, Development, Impact, and Outlook
Authors:
Geert Leus,
Antonio G. Marques,
José M. F. Moura,
Antonio Ortega,
David I Shuman
Abstract:
Graph signal processing (GSP) generalizes signal processing (SP) tasks to signals living on non-Euclidean domains whose structure can be captured by a weighted graph. Graphs are versatile, able to model irregular interactions, easy to interpret, and endowed with a corpus of mathematical results, rendering them natural candidates to serve as the basis for a theory of processing signals in more irre…
▽ More
Graph signal processing (GSP) generalizes signal processing (SP) tasks to signals living on non-Euclidean domains whose structure can be captured by a weighted graph. Graphs are versatile, able to model irregular interactions, easy to interpret, and endowed with a corpus of mathematical results, rendering them natural candidates to serve as the basis for a theory of processing signals in more irregular domains. In this article, we provide an overview of the evolution of GSP, from its origins to the challenges ahead. The first half is devoted to reviewing the history of GSP and explaining how it gave rise to an encompassing framework that shares multiple similarities with SP. A key message is that GSP has been critical to develop novel and technically sound tools, theory, and algorithms that, by leveraging analogies with and the insights of digital SP, provide new ways to analyze, process, and learn from graph signals. In the second half, we shift focus to review the impact of GSP on other disciplines. First, we look at the use of GSP in data science problems, including graph learning and graph-based deep learning. Second, we discuss the impact of GSP on applications, including neuroscience and image and video processing. We conclude with a brief discussion of the emerging and future directions of GSP.
△ Less
Submitted 21 March, 2023;
originally announced March 2023.
-
GSP = DSP + Boundary Conditions -- The Graph Signal Processing Companion Model
Authors:
John Shi,
Jose M. F. Moura
Abstract:
The paper presents the graph signal processing (GSP) companion model that naturally replicates the basic tenets of classical signal processing (DSP) for GSP. The companion model shows that GSP can be made equivalent to DSP 'plus' appropriate boundary conditions (bc) - this is shown under broad conditions and holds for arbitrary undirected or directed graphs. This equivalence suggests how to broade…
▽ More
The paper presents the graph signal processing (GSP) companion model that naturally replicates the basic tenets of classical signal processing (DSP) for GSP. The companion model shows that GSP can be made equivalent to DSP 'plus' appropriate boundary conditions (bc) - this is shown under broad conditions and holds for arbitrary undirected or directed graphs. This equivalence suggests how to broaden GSP - extend naturally a DSP concept to the GSP companion model and then transfer it back to the common graph vertex and graph Fourier domains. The paper shows that GSP unrolls as two distinct models that coincide in DSP, the companion model based on (Hadamard or pointwise) powers of what we will introduce as the spectral frequency vector $λ$, and the traditional graph vertex model, based on the adjacency matrix and its eigenvectors. The paper expands GSP in several directions, including showing that convolution in the graph companion model can be achieved with the FFT and that GSP modulation with appropriate choice of carriers exhibits the DSP translation effect that enables multiplexing by modulation of graph signals.
△ Less
Submitted 5 February, 2024; v1 submitted 4 March, 2023;
originally announced March 2023.
-
OpTaS: An Optimization-based Task Specification Library for Trajectory Optimization and Model Predictive Control
Authors:
Christopher E. Mower,
João Moura,
Nazanin Zamani Behabadi,
Sethu Vijayakumar,
Tom Vercauteren,
Christos Bergeles
Abstract:
This paper presents OpTaS, a task specification Python library for Trajectory Optimization (TO) and Model Predictive Control (MPC) in robotics. Both TO and MPC are increasingly receiving interest in optimal control and in particular handling dynamic environments. While a flurry of software libraries exists to handle such problems, they either provide interfaces that are limited to a specific probl…
▽ More
This paper presents OpTaS, a task specification Python library for Trajectory Optimization (TO) and Model Predictive Control (MPC) in robotics. Both TO and MPC are increasingly receiving interest in optimal control and in particular handling dynamic environments. While a flurry of software libraries exists to handle such problems, they either provide interfaces that are limited to a specific problem formulation (e.g. TracIK, CHOMP), or are large and statically specify the problem in configuration files (e.g. EXOTica, eTaSL). OpTaS, on the other hand, allows a user to specify custom nonlinear constrained problem formulations in a single Python script allowing the controller parameters to be modified during execution. The library provides interface to several open source and commercial solvers (e.g. IPOPT, SNOPT, KNITRO, SciPy) to facilitate integration with established workflows in robotics. Further benefits of OpTaS are highlighted through a thorough comparison with common libraries. An additional key advantage of OpTaS is the ability to define optimal control tasks in the joint space, task space, or indeed simultaneously. The code for OpTaS is easily installed via pip, and the source code with examples can be found at https://github.com/cmower/optas.
△ Less
Submitted 31 January, 2023;
originally announced January 2023.
-
Learning Gradients of Convex Functions with Monotone Gradient Networks
Authors:
Shreyas Chaudhari,
Srinivasa Pranav,
José M. F. Moura
Abstract:
While much effort has been devoted to deriving and analyzing effective convex formulations of signal processing problems, the gradients of convex functions also have critical applications ranging from gradient-based optimization to optimal transport. Recent works have explored data-driven methods for learning convex objective functions, but learning their monotone gradients is seldom studied. In t…
▽ More
While much effort has been devoted to deriving and analyzing effective convex formulations of signal processing problems, the gradients of convex functions also have critical applications ranging from gradient-based optimization to optimal transport. Recent works have explored data-driven methods for learning convex objective functions, but learning their monotone gradients is seldom studied. In this work, we propose C-MGN and M-MGN, two monotone gradient neural network architectures for directly learning the gradients of convex functions. We show that, compared to state of the art methods, our networks are easier to train, learn monotone gradient fields more accurately, and use significantly fewer parameters. We further demonstrate their ability to learn optimal transport mappings to augment driving image data.
△ Less
Submitted 17 March, 2023; v1 submitted 25 January, 2023;
originally announced January 2023.
-
Quantization in fibering polarizations, Mabuchi rays and geometric Peter--Weyl theorem
Authors:
Thomas Baier,
Joachim Hilgert,
Oğuzhan Kaya,
José M. Mourão,
João P. Nunes
Abstract:
In this paper we use techniques of geometric quantization to give a geometric interpretation of the Peter--Weyl theorem. We present a novel approach to half-form corrected geometric quantization in a specific type of non-Kähler polarizations and study one important class of examples, namely cotangent bundles of compact semi-simple groups $K$. Our main results state that this canonically defined po…
▽ More
In this paper we use techniques of geometric quantization to give a geometric interpretation of the Peter--Weyl theorem. We present a novel approach to half-form corrected geometric quantization in a specific type of non-Kähler polarizations and study one important class of examples, namely cotangent bundles of compact semi-simple groups $K$. Our main results state that this canonically defined polarization occurs in the geodesic boundary of the space of $K\times K$-invariant Kähler polarizations equipped with Mabuchi's metric, and that its half-form corrected quantization is isomorphic to the Kähler case. An important role is played by invariance of the limit polarization under a torus action.
Unitary parallel transport on the bundle of quantum states along a specific Mabuchi geodesic, given by the coherent state transform of Hall, relates the non-commutative Fourier transform for $K$ with the Borel--Weil description of irreducible representations of $K$.
△ Less
Submitted 25 January, 2023;
originally announced January 2023.
-
Networked Signal and Information Processing
Authors:
Stefan Vlaski,
Soummya Kar,
Ali H. Sayed,
José M. F. Moura
Abstract:
The article reviews significant advances in networked signal and information processing, which have enabled in the last 25 years extending decision making and inference, optimization, control, and learning to the increasingly ubiquitous environments of distributed agents. As these interacting agents cooperate, new collective behaviors emerge from local decisions and actions. Moreover, and signific…
▽ More
The article reviews significant advances in networked signal and information processing, which have enabled in the last 25 years extending decision making and inference, optimization, control, and learning to the increasingly ubiquitous environments of distributed agents. As these interacting agents cooperate, new collective behaviors emerge from local decisions and actions. Moreover, and significantly, theory and applications show that networked agents, through cooperation and sharing, are able to match the performance of cloud or federated solutions, while offering the potential for improved privacy, increasing resilience, and saving resources.
△ Less
Submitted 18 April, 2023; v1 submitted 25 October, 2022;
originally announced October 2022.
-
ROS-PyBullet Interface: A Framework for Reliable Contact Simulation and Human-Robot Interaction
Authors:
Christopher E. Mower,
Theodoros Stouraitis,
João Moura,
Christian Rauch,
Lei Yan,
Nazanin Zamani Behabadi,
Michael Gienger,
Tom Vercauteren,
Christos Bergeles,
Sethu Vijayakumar
Abstract:
Reliable contact simulation plays a key role in the development of (semi-)autonomous robots, especially when dealing with contact-rich manipulation scenarios, an active robotics research topic. Besides simulation, components such as sensing, perception, data collection, robot hardware control, human interfaces, etc. are all key enablers towards applying machine learning algorithms or model-based a…
▽ More
Reliable contact simulation plays a key role in the development of (semi-)autonomous robots, especially when dealing with contact-rich manipulation scenarios, an active robotics research topic. Besides simulation, components such as sensing, perception, data collection, robot hardware control, human interfaces, etc. are all key enablers towards applying machine learning algorithms or model-based approaches in real world systems. However, there is a lack of software connecting reliable contact simulation with the larger robotics ecosystem (i.e. ROS, Orocos), for a more seamless application of novel approaches, found in the literature, to existing robotic hardware. In this paper, we present the ROS-PyBullet Interface, a framework that provides a bridge between the reliable contact/impact simulator PyBullet and the Robot Operating System (ROS). Furthermore, we provide additional utilities for facilitating Human-Robot Interaction (HRI) in the simulated environment. We also present several use-cases that highlight the capabilities and usefulness of our framework. Please check our video, source code, and examples included in the supplementary material. Our full code base is open source and can be found at https://github.com/cmower/ros_pybullet_interface.
△ Less
Submitted 13 October, 2022;
originally announced October 2022.
-
Valuation of Public Bus Electrification with Open Data
Authors:
Upadhi Vijay,
Soomin Woo,
Scott J. Moura,
Akshat Jain,
David Rodriguez,
Sergio Gambacorta,
Giuseppe Ferrara,
Luigi Lanuzza,
Christian Zulberti,
Erika Mellekas,
Carlo Papa
Abstract:
This research provides a novel framework to estimate the economic, environmental, and social values of electrifying public transit buses, for cities across the world, based on open-source data. Electric buses are a compelling candidate to replace diesel buses for the environmental and social benefits. However, the state-of-art models to evaluate the value of bus electrification are limited in appl…
▽ More
This research provides a novel framework to estimate the economic, environmental, and social values of electrifying public transit buses, for cities across the world, based on open-source data. Electric buses are a compelling candidate to replace diesel buses for the environmental and social benefits. However, the state-of-art models to evaluate the value of bus electrification are limited in applicability because they require granular and bespoke data on bus operation that can be difficult to procure. Our valuation tool uses General Transit Feed Specification, a standard data format used by transit agencies worldwide, to provide high-level guidance on developing a prioritization strategy for electrifying a bus fleet. We develop physics-informed machine learning models to evaluate the energy consumption, the carbon emissions, the health impacts, and the total cost of ownership for each transit route. We demonstrate the scalability of our tool with a case study of the bus lines in the Greater Boston and Milan metropolitan areas.
△ Less
Submitted 24 September, 2022;
originally announced September 2022.
-
Recovering the Graph Underlying Networked Dynamical Systems under Partial Observability: A Deep Learning Approach
Authors:
Sérgio Machado,
Anirudh Sridhar,
Paulo Gil,
Jorge Henriques,
José M. F. Moura,
Augusto Santos
Abstract:
We study the problem of graph structure identification, i.e., of recovering the graph of dependencies among time series. We model these time series data as components of the state of linear stochastic networked dynamical systems. We assume partial observability, where the state evolution of only a subset of nodes comprising the network is observed. We devise a new feature vector computed from the…
▽ More
We study the problem of graph structure identification, i.e., of recovering the graph of dependencies among time series. We model these time series data as components of the state of linear stochastic networked dynamical systems. We assume partial observability, where the state evolution of only a subset of nodes comprising the network is observed. We devise a new feature vector computed from the observed time series and prove that these features are linearly separable, i.e., there exists a hyperplane that separates the cluster of features associated with connected pairs of nodes from those associated with disconnected pairs. This renders the features amenable to train a variety of classifiers to perform causal inference. In particular, we use these features to train Convolutional Neural Networks (CNNs). The resulting causal inference mechanism outperforms state-of-the-art counterparts w.r.t. sample-complexity. The trained CNNs generalize well over structurally distinct networks (dense or sparse) and noise-level profiles. Remarkably, they also generalize well to real-world networks while trained over a synthetic network (realization of a random graph). Finally, the proposed method consistently reconstructs the graph in a pairwise manner, that is, by deciding if an edge or arrow is present or absent in each pair of nodes, from the corresponding time series of each pair. This fits the framework of large-scale systems, where observation or processing of all nodes in the network is prohibitive.
△ Less
Submitted 12 April, 2023; v1 submitted 8 August, 2022;
originally announced August 2022.
-
Resilience Enhancement at Edge Cloud Systems
Authors:
Jose Moura,
David Hutchison
Abstract:
It is becoming common practice to push interactive and location-based services from remote datacenters to resource-constrained edge domains. This trend creates new management challenges at the network edge, not least to ensure resilience. These challenges now need to be investigated and overcome. In this paper, we explore the use of open-source programmable asset orchestration at edge cloud system…
▽ More
It is becoming common practice to push interactive and location-based services from remote datacenters to resource-constrained edge domains. This trend creates new management challenges at the network edge, not least to ensure resilience. These challenges now need to be investigated and overcome. In this paper, we explore the use of open-source programmable asset orchestration at edge cloud systems to guarantee operational resilience and a satisfactory performance level despite system incidents such as faults, congestion, or cyber-attacks. We discuss the design and deployment of a new cross-level configurable solution, Resilient Edge Cloud Systems (RECS). Results from appropriate tests made on RECS highlight the positive effects of deploying novel service and resource management algorithms at both data and control planes of the programmable edge system to mitigate against disruptive events such as control channel issues, service overload, or link congestion. RECS offers the following benefits: i) the switch automatically selects the standalone operation mode after its disconnection from the upper-level controllers; ii) deployment of edge virtualized services is made, according to client requests; iii) the client requests are served by edge services and the related traffic is balanced among the alternative on-demand routing paths to the edge location where each service is available for its clients; iv) the TCP traffic quality is protected from unfair competitiveness of UDP flows; and v) a set of redundant controllers is orchestrated by a top-level multi-thread cluster manager, using a novel management protocol with low overhead.
△ Less
Submitted 18 May, 2022;
originally announced May 2022.
-
The Companion Model -- a Canonical Model in Graph Signal Processing
Authors:
John Shi,
Jose M. F. Moura
Abstract:
This paper introduces a $\textit{canonical}$ graph signal model defined by a $\textit{canonical}$ graph and a $\textit{canonical}$ shift, the $\textit{companion}$ graph and the $\textit{companion}$ shift. These are canonical because, under standard conditions, we show that any graph signal processing (GSP) model can be transformed into the canonical model. The transform that obtains this is the gr…
▽ More
This paper introduces a $\textit{canonical}$ graph signal model defined by a $\textit{canonical}$ graph and a $\textit{canonical}$ shift, the $\textit{companion}$ graph and the $\textit{companion}$ shift. These are canonical because, under standard conditions, we show that any graph signal processing (GSP) model can be transformed into the canonical model. The transform that obtains this is the graph $z$-transform ($\textrm{G$z$T}$) that we introduce. The GSP canonical model comes closest to the discrete signal processing (DSP) time signal models: the structure of the companion shift decomposes into a line shift and a signal continuation just like the DSP shift and the GSP canonical graph is a directed line graph with a terminal condition reflecting the signal continuation condition. We further show that, surprisingly, in the canonical model, convolution of graph signals is fast convolution by the DSP FFT.
△ Less
Submitted 25 March, 2022;
originally announced March 2022.
-
Controlling the average degree in random power-law networks
Authors:
Allan Vieira,
Judson Moura,
Celia Anteneodo
Abstract:
We describe a procedure that allows continuously tuning the average degree $\langle k \rangle$ of uncorrelated networks with power-law degree distribution $p(k)$. Inn order to do this, we modify the low-$k$ region of $p(k)$, while preserving the large-$k$ tail up to a cutoff. Then, we use the modified $p(k)$ to obtain the degree sequence required to construct networks through the configuration mod…
▽ More
We describe a procedure that allows continuously tuning the average degree $\langle k \rangle$ of uncorrelated networks with power-law degree distribution $p(k)$. Inn order to do this, we modify the low-$k$ region of $p(k)$, while preserving the large-$k$ tail up to a cutoff. Then, we use the modified $p(k)$ to obtain the degree sequence required to construct networks through the configuration model. We analyze the resulting nearest-neighbor degree and local clustering to verify the absence of $k$-dependencies. Finally, a further modification is introduced to eliminate the sample fluctuations in the average degree.
△ Less
Submitted 22 March, 2022;
originally announced March 2022.
-
Definition and Implications of the Digital Near-Death Experience: A Theoretical Essay on Preliminary Empiricism
Authors:
Pedro Jacome de Moura Jr
Abstract:
Purpose: People are increasingly adhering to social networking platforms (SNP), and this adhesion is often unreflective, which makes them alienate data, actions, and decisions to tech giants. This essay discusses what happens when, eventually, someone chooses to cancel their participation in a large SNP. Methodology/design: This is a theoretical essay, whose narrative resembles a theoretical-empir…
▽ More
Purpose: People are increasingly adhering to social networking platforms (SNP), and this adhesion is often unreflective, which makes them alienate data, actions, and decisions to tech giants. This essay discusses what happens when, eventually, someone chooses to cancel their participation in a large SNP. Methodology/design: This is a theoretical essay, whose narrative resembles a theoretical-empirical manuscript, grounded on the author's experience and his subjective perceptions regarding being out of the WhatsApp network (nowadays, the main SNP instance in the world). Findings/highlights: This study proposes a definition and implications of the supposedly new "digital near-death experience" concept, a metaphor for the classic near-death experience (NDE). A research agenda is also proposed. Limitations: The resulting propositions are grounded on a set of assumptions, that if falsified, make the findings invalid.
△ Less
Submitted 1 August, 2022; v1 submitted 22 October, 2021;
originally announced October 2021.
-
Non-prehensile Planar Manipulation via Trajectory Optimization with Complementarity Constraints
Authors:
João Moura,
Theodoros Stouraitis,
Sethu Vijayakumar
Abstract:
Contact adaption is an essential capability when manipulating objects. Two key contact modes of non-prehensile manipulation are sticking and sliding. This paper presents a Trajectory Optimization (TO) method formulated as a Mathematical Program with Complementarity Constraints (MPCC), which is able to switch between these two modes. We show that this formulation can be applicable to both planning…
▽ More
Contact adaption is an essential capability when manipulating objects. Two key contact modes of non-prehensile manipulation are sticking and sliding. This paper presents a Trajectory Optimization (TO) method formulated as a Mathematical Program with Complementarity Constraints (MPCC), which is able to switch between these two modes. We show that this formulation can be applicable to both planning and Model Predictive Control (MPC) for planar manipulation tasks. We numerically compare: (i) our planner against a mixed integer alternative, showing that the MPCC planer converges faster, scales better with respect to time horizon, and can handle environments with obstacles; (ii) our controller against a state-of-the-art mixed integer approach, showing that the MPCC controller achieves better tracking and more consistent computation times. Additionally, we experimentally validate both our planner and controller with the KUKA LWR robot on a range of planar manipulation tasks.
△ Less
Submitted 18 March, 2022; v1 submitted 27 September, 2021;
originally announced September 2021.
-
On the Importance of Distractors for Few-Shot Classification
Authors:
Rajshekhar Das,
Yu-Xiong Wang,
JoséM. F. Moura
Abstract:
Few-shot classification aims at classifying categories of a novel task by learning from just a few (typically, 1 to 5) labelled examples. An effective approach to few-shot classification involves a prior model trained on a large-sample base domain, which is then finetuned over the novel few-shot task to yield generalizable representations. However, task-specific finetuning is prone to overfitting…
▽ More
Few-shot classification aims at classifying categories of a novel task by learning from just a few (typically, 1 to 5) labelled examples. An effective approach to few-shot classification involves a prior model trained on a large-sample base domain, which is then finetuned over the novel few-shot task to yield generalizable representations. However, task-specific finetuning is prone to overfitting due to the lack of enough training examples. To alleviate this issue, we propose a new finetuning approach based on contrastive learning that reuses unlabelled examples from the base domain in the form of distractors. Unlike the nature of unlabelled data used in prior works, distractors belong to classes that do not overlap with the novel categories. We demonstrate for the first time that inclusion of such distractors can significantly boost few-shot generalization. Our technical novelty includes a stochastic pairing of examples sharing the same category in the few-shot task and a weighting term that controls the relative influence of task-specific negatives and distractors. An important aspect of our finetuning objective is that it is agnostic to distractor labels and hence applicable to various base domain settings. Compared to state-of-the-art approaches, our method shows accuracy gains of up to $12\%$ in cross-domain and up to $5\%$ in unsupervised prior-learning settings.
△ Less
Submitted 20 September, 2021;
originally announced September 2021.
-
Laughlin states change under large geometry deformations and imaginary time Hamiltonian dynamics
Authors:
Gabriel Matos,
Bruno Mera,
José M. Mourão,
Paulo D. Mourão,
João P. Nunes
Abstract:
We study the change of the Laughlin states under large deformations of the geometry of the sphere and the plane, associated with Mabuchi geodesics on the space of metrics with Hamiltonian $S^1$-symmetry.
For geodesics associated with the square of the symmetry generator, as the geodesic time goes to infinity, the geometry of the sphere becomes that of a thin cigar collapsing to a line and the La…
▽ More
We study the change of the Laughlin states under large deformations of the geometry of the sphere and the plane, associated with Mabuchi geodesics on the space of metrics with Hamiltonian $S^1$-symmetry.
For geodesics associated with the square of the symmetry generator, as the geodesic time goes to infinity, the geometry of the sphere becomes that of a thin cigar collapsing to a line and the Laughlin states become concentrated on a discrete set of $S^1$--orbits, corresponding to Bohr-Sommerfeld orbits of geometric quantization.
The lifting of the Mabuchi geodesics to the bundle of quantum states, to which the Laughlin states belong, is achieved via generalized coherent state transforms, which correspond to the KZ parallel transport of Chern-Simons theory.
△ Less
Submitted 5 August, 2021; v1 submitted 23 July, 2021;
originally announced July 2021.
-
A Learning-based Optimal Market Bidding Strategy for Price-Maker Energy Storage
Authors:
Mathilde D. Badoual,
Scott J. Moura
Abstract:
Load serving entities with storage units reach sizes and performances that can significantly impact clearing prices in electricity markets. Nevertheless, price endogeneity is rarely considered in storage bidding strategies and modeling the electricity market is a challenging task. Meanwhile, model-free reinforcement learning such as the Actor-Critic are becoming increasingly popular for designing…
▽ More
Load serving entities with storage units reach sizes and performances that can significantly impact clearing prices in electricity markets. Nevertheless, price endogeneity is rarely considered in storage bidding strategies and modeling the electricity market is a challenging task. Meanwhile, model-free reinforcement learning such as the Actor-Critic are becoming increasingly popular for designing energy system controllers. Yet implementation frequently requires lengthy, data-intense, and unsafe trial-and-error training. To fill these gaps, we implement an online Supervised Actor-Critic (SAC) algorithm, supervised with a model-based controller -- Model Predictive Control (MPC). The energy storage agent is trained with this algorithm to optimally bid while learning and adjusting to its impact on the market clearing prices. We compare the supervised Actor-Critic algorithm with the MPC algorithm as a supervisor, finding that the former reaps higher profits via learning. Our contribution, thus, is an online and safe SAC algorithm that outperforms the current model-based state-of-the-art.
△ Less
Submitted 4 June, 2021;
originally announced June 2021.
-
SDN-Based Intrusion Detection System for Early Detection and Mitigation of DDoS Attacks
Authors:
Pedro Manso,
Jose Moura,
Carlos Serrao
Abstract:
The current paper addresses relevant network security vulnerabilities introduced by network devices within the emerging paradigm of Internet of Things (IoT) as well as the urgent need to mitigate the negative effects of some types of Distributed Denial of Service (DDoS) attacks that try to explore those security weaknesses. We design and implement a Software-Defined Intrusion Detection System (IDS…
▽ More
The current paper addresses relevant network security vulnerabilities introduced by network devices within the emerging paradigm of Internet of Things (IoT) as well as the urgent need to mitigate the negative effects of some types of Distributed Denial of Service (DDoS) attacks that try to explore those security weaknesses. We design and implement a Software-Defined Intrusion Detection System (IDS) that reactively impairs the attacks at its origin, ensuring the normal operation of the network infrastructure. Our proposal includes an IDS that automatically detects several DDoS attacks, and then as an attack is detected, it notifies a Software Defined Networking (SDN) controller. The current proposal also downloads some convenient traffic forwarding decisions from the SDN controller to network devices. The evaluation results suggest that our proposal timely detects several types of cyber-attacks based on DDoS, mitigates their negative impacts on the network performance, and ensures the correct data delivery of normal traffic. Our work sheds light on the programming relevance over an abstracted view of the network infrastructure to timely detect a Botnet exploitation, mitigate malicious traffic at its source, and protect benign traffic.
△ Less
Submitted 15 April, 2021;
originally announced April 2021.
-
PMIPv6 Integrated with MIH for Flow Mobility Management: a Real Testbed with Simultaneous Multi-Access in Heterogeneous Mobile Networks
Authors:
Hugo Alves,
Luis Silva,
Rui Marinheiro,
Jose Moura
Abstract:
The exponential growth of the number of multihomed mobile devices is changing the way how we can connect to the Internet. Our mobile devices are demanding for more network resources, in terms of traffic volume and QoS requirements. Unfortunately, it is very hard to a multihomed device to be simultaneously connected to the network through multiple links. The current work enhances the network access…
▽ More
The exponential growth of the number of multihomed mobile devices is changing the way how we can connect to the Internet. Our mobile devices are demanding for more network resources, in terms of traffic volume and QoS requirements. Unfortunately, it is very hard to a multihomed device to be simultaneously connected to the network through multiple links. The current work enhances the network access of multihomed devices agnostically to the deployed access technologies. This enhancement is achieved by using simultaneously all of the mobile devices interfaces, and by routing each individual data flow through the most convenient access technology. The proposed solution is only deployed at the network side and it extends Proxy Mobile IPv6 with flow mobility in a completely transparent way to mobile nodes. In fact, it gives particular attention to the handover mechanisms, by improving the detection and attachment of nodes in the network, with the inclusion of the IEEE 802.21 standard in the solution. This provides the necessary implementation and integration details to extend a network topology with femtocell devices. Each femtocell is equipped with various network interfaces supporting a diverse set of access technologies. There is also a decision entity that manages individually each data flow according to its QoS / QoE requisites. The proposed solution has been developed and extensively tested with a real prototype. Evaluation results evidence that the overhead for using the solution is negligible as compared to the offered advantages such as: the support of flow mobility, the fulfil of VoIP functional requisites, the session continuity in spite of flows mobility, its low overhead, its high scalability, and the complete transparency of the proposed solution to the user terminals.
△ Less
Submitted 13 April, 2021;
originally announced April 2021.
-
GSA-Forecaster: Forecasting Graph-Based Time-Dependent Data with Graph Sequence Attention
Authors:
Yang Li,
Di Wang,
José M. F. Moura
Abstract:
Forecasting graph-based time-dependent data has many practical applications. This task is challenging as models need not only to capture spatial dependency and temporal dependency within the data, but also to leverage useful auxiliary information for accurate predictions. In this paper, we analyze limitations of state-of-the-art models on dealing with temporal dependency. To address this limitatio…
▽ More
Forecasting graph-based time-dependent data has many practical applications. This task is challenging as models need not only to capture spatial dependency and temporal dependency within the data, but also to leverage useful auxiliary information for accurate predictions. In this paper, we analyze limitations of state-of-the-art models on dealing with temporal dependency. To address this limitation, we propose GSA-Forecaster, a new deep learning model for forecasting graph-based time-dependent data. GSA-Forecaster leverages graph sequence attention (GSA), a new attention mechanism proposed in this paper, for effectively capturing temporal dependency. GSA-Forecaster embeds the graph structure of the data into its architecture to address spatial dependency. GSA-Forecaster also accounts for auxiliary information to further improve predictions. We evaluate GSA-Forecaster with large-scale real-world graph-based time-dependent data and demonstrate its effectiveness over state-of-the-art models with 6.7% RMSE and 5.8% MAPE reduction.
△ Less
Submitted 29 August, 2022; v1 submitted 12 April, 2021;
originally announced April 2021.
-
Shuffler: A Large Scale Data Management Tool for ML in Computer Vision
Authors:
Evgeny Toropov,
Paola A. Buitrago,
Jose M. F. Moura
Abstract:
Datasets in the computer vision academic research community are primarily static. Once a dataset is accepted as a benchmark for a computer vision task, researchers working on this task will not alter it in order to make their results reproducible. At the same time, when exploring new tasks and new applications, datasets tend to be an ever changing entity. A practitioner may combine existing public…
▽ More
Datasets in the computer vision academic research community are primarily static. Once a dataset is accepted as a benchmark for a computer vision task, researchers working on this task will not alter it in order to make their results reproducible. At the same time, when exploring new tasks and new applications, datasets tend to be an ever changing entity. A practitioner may combine existing public datasets, filter images or objects in them, change annotations or add new ones to fit a task at hand, visualize sample images, or perhaps output statistics in the form of text or plots. In fact, datasets change as practitioners experiment with data as much as with algorithms, trying to make the most out of machine learning models. Given that ML and deep learning call for large volumes of data to produce satisfactory results, it is no surprise that the resulting data and software management associated to dealing with live datasets can be quite complex. As far as we know, there is no flexible, publicly available instrument to facilitate manipulating image data and their annotations throughout a ML pipeline. In this work, we present Shuffler, an open source tool that makes it easy to manage large computer vision datasets. It stores annotations in a relational, human-readable database. Shuffler defines over 40 data handling operations with annotations that are commonly useful in supervised learning applied to computer vision and supports some of the most well-known computer vision datasets. Finally, it is easily extensible, making the addition of new operations and datasets a task that is fast and easy to accomplish.
△ Less
Submitted 11 April, 2021;
originally announced April 2021.
-
Risk-Aware Lane Selection on Highway with Dynamic Obstacles
Authors:
Sangjae Bae,
David Isele,
Kikuo Fujimura,
Scott J. Moura
Abstract:
This paper proposes a discretionary lane selection algorithm. In particular, highway driving is considered as a targeted scenario, where each lane has a different level of traffic flow. When lane-changing is discretionary, it is advised not to change lanes unless highly beneficial, e.g., reducing travel time significantly or securing higher safety. Evaluating such "benefit" is a challenge, along w…
▽ More
This paper proposes a discretionary lane selection algorithm. In particular, highway driving is considered as a targeted scenario, where each lane has a different level of traffic flow. When lane-changing is discretionary, it is advised not to change lanes unless highly beneficial, e.g., reducing travel time significantly or securing higher safety. Evaluating such "benefit" is a challenge, along with multiple surrounding vehicles in dynamic speed and heading with uncertainty. We propose a real-time lane-selection algorithm with careful cost considerations and with modularity in design. The algorithm is a search-based optimization method that evaluates uncertain dynamic positions of other vehicles under a continuous time and space domain. For demonstration, we incorporate a state-of-the-art motion planner framework (Neural Networks integrated Model Predictive Control) under a CARLA simulation environment.
△ Less
Submitted 8 April, 2021;
originally announced April 2021.
-
Finite-Time In-Network Computation of Linear Transforms
Authors:
Soummya Kar,
Markus Püschel,
José M. F. Moura
Abstract:
This paper focuses on finite-time in-network computation of linear transforms of distributed graph data. Finite-time transform computation problems are of interest in graph-based computing and signal processing applications in which the objective is to compute, by means of distributed iterative methods, various (linear) transforms of the data distributed at the agents or nodes of the graph. While…
▽ More
This paper focuses on finite-time in-network computation of linear transforms of distributed graph data. Finite-time transform computation problems are of interest in graph-based computing and signal processing applications in which the objective is to compute, by means of distributed iterative methods, various (linear) transforms of the data distributed at the agents or nodes of the graph. While finite-time computation of consensus-type or more generally rank-one transforms have been studied, systematic approaches toward scalable computing of general linear transforms, specifically in the case of heterogeneous agent objectives in which each agent is interested in obtaining a different linear combination of the network data, are relatively less explored. In this paper, by employing ideas from algebraic geometry, we develop a systematic characterization of linear transforms that are amenable to distributed in-network computation in finite-time using linear iterations. Further, we consider the general case of directed inter-agent communication graphs. Specifically, it is shown that \emph{almost all} linear transformations of data distributed on the nodes of a digraph containing a Hamiltonian cycle may be computed using at most $N$ linear distributed iterations. Finally, by studying an associated matrix factorization based reformulation of the transform computation problem, we obtain, as a by-product, certain results and characterizations on sparsity-constrained matrix factorization that are of independent interest.
△ Less
Submitted 3 April, 2021;
originally announced April 2021.
-
Probing the screening of the Casimir interaction with optical tweezers
Authors:
L. B. Pires,
D. S. Ether,
B. Spreng,
G. R. S. Araújo,
R. S. Decca,
R. S. Dutra,
M. Borges,
F. S. S. Rosa,
G. -L. Ingold,
M. J. B. Moura,
S. Frases,
B. Pontes,
H. M. Nussenzveig,
S. Reynaud,
N. B. Viana,
P. A. Maia Neto
Abstract:
We measure the colloidal interaction between two silica microspheres in aqueous solution in the distance range from $0.2\,μ$m to $0.5\,μ$m with the help of optical tweezers. When employing a sample with a low salt concentration, the resulting interaction is dominated by the repulsive double-layer interaction which is fully characterized. The double-layer interaction is suppressed when adding…
▽ More
We measure the colloidal interaction between two silica microspheres in aqueous solution in the distance range from $0.2\,μ$m to $0.5\,μ$m with the help of optical tweezers. When employing a sample with a low salt concentration, the resulting interaction is dominated by the repulsive double-layer interaction which is fully characterized. The double-layer interaction is suppressed when adding $0.22\,$M of salt to our sample, thus leading to a purely attractive Casimir signal. When analyzing the experimental data for the potential energy and force, we find good agreement with theoretical results based on the scattering approach. At the distance range probed experimentally, the interaction arises mainly from the unscreened transverse magnetic contribution in the zero-frequency limit, with nonzero Matsubara frequencies providing a negligible contribution. In contrast, such unscreened contribution is not included by the standard theoretical model of the Casimir interaction in electrolyte solutions, in which the zero-frequency term is treated separately as an electrostatic fluctuational effect. As a consequence, the resulting attraction is too weak in this standard model, by approximately one order of magnitude, to explain the experimental data. Overall, our experimental results shed light on the nature of the thermal zero-frequency contribution and indicate that the Casimir attraction across polar liquids has a longer range than previously predicted.
△ Less
Submitted 24 June, 2021; v1 submitted 31 March, 2021;
originally announced April 2021.
-
Graph Signal Processing: Dualizing GSP Sampling in the Vertex and Spectral Domains
Authors:
John Shi,
Jose M. F. Moura
Abstract:
Vertex based and spectral based GSP sampling has been studied recently. The literature recognizes that methods in one domain do not have a counterpart in the other domain. This paper shows that in fact one can develop a unified graph signal sampling theory with analogous interpretations in both domains just like sampling in traditional DSP. To achieve it, we introduce a spectral shift $M$ acting i…
▽ More
Vertex based and spectral based GSP sampling has been studied recently. The literature recognizes that methods in one domain do not have a counterpart in the other domain. This paper shows that in fact one can develop a unified graph signal sampling theory with analogous interpretations in both domains just like sampling in traditional DSP. To achieve it, we introduce a spectral shift $M$ acting in the spectral domain rather than shift $A$ that acts in the vertex domain. This leads to a GSP theory that starts from the spectral domain, for example, linear shift invariant (LSI) filtering in the spectral domain is with polynomials $P(M)$. We then develop GSP vertex and spectral domain dual versions for each of the four standard sampling steps of subsampling, decimation, upsampling, and interpolation. We show how GSP sampling reduces to DSP sampling when the graph is the directed time cycle graph. Simple examples illustrate the impact of choices that are available in GSP sampling.
△ Less
Submitted 18 February, 2022; v1 submitted 19 March, 2021;
originally announced March 2021.
-
Unsupervised Clustering of Time Series Signals using Neuromorphic Energy-Efficient Temporal Neural Networks
Authors:
Shreyas Chaudhari,
Harideep Nair,
José M. F. Moura,
John Paul Shen
Abstract:
Unsupervised time series clustering is a challenging problem with diverse industrial applications such as anomaly detection, bio-wearables, etc. These applications typically involve small, low-power devices on the edge that collect and process real-time sensory signals. State-of-the-art time-series clustering methods perform some form of loss minimization that is extremely computationally intensiv…
▽ More
Unsupervised time series clustering is a challenging problem with diverse industrial applications such as anomaly detection, bio-wearables, etc. These applications typically involve small, low-power devices on the edge that collect and process real-time sensory signals. State-of-the-art time-series clustering methods perform some form of loss minimization that is extremely computationally intensive from the perspective of edge devices. In this work, we propose a neuromorphic approach to unsupervised time series clustering based on Temporal Neural Networks that is capable of ultra low-power, continuous online learning. We demonstrate its clustering performance on a subset of UCR Time Series Archive datasets. Our results show that the proposed approach either outperforms or performs similarly to most of the existing algorithms while being far more amenable for efficient hardware implementation. Our hardware assessment analysis shows that in 7 nm CMOS the proposed architecture, on average, consumes only about 0.005 mm^2 die area and 22 uW power and can process each signal with about 5 ns latency.
△ Less
Submitted 18 February, 2021;
originally announced February 2021.
-
State Estimation for a Zero-Dimensional Electrochemical Model of Lithium-Sulfur Batteries
Authors:
Zhijia Huang,
Dong Zhang,
Luis D. Couto,
Quan-Hong Yang,
Scott J. Moura
Abstract:
Lithium-sulfur (Li-S) batteries have become one of the most attractive alternatives over conventional Li-ion batteries due to their high theoretical specific energy density (2500 Wh/kg for Li-S vs. $\sim$250 Wh/kg for Li-ion). Accurate state estimation in Li-S batteries is urgently needed for safe and efficient operation. To the best of the authors' knowledge, electrochemical model-based observers…
▽ More
Lithium-sulfur (Li-S) batteries have become one of the most attractive alternatives over conventional Li-ion batteries due to their high theoretical specific energy density (2500 Wh/kg for Li-S vs. $\sim$250 Wh/kg for Li-ion). Accurate state estimation in Li-S batteries is urgently needed for safe and efficient operation. To the best of the authors' knowledge, electrochemical model-based observers have not been reported for Li-S batteries, primarily due to the complex dynamics that make state observer design a challenging problem. In this work, we demonstrate a state estimation scheme based on a zero-dimensional electrochemical model for Li-S batteries. The nonlinear differential-algebraic equation (DAE) model is incorporated into an extend Kalman filter. This observer design estimates both differential and algebraic states that represent the dynamic behavior inside the cell, from voltage and current measurements only. The effectiveness of the proposed estimation algorithm is illustrated by numerical simulation results. Our study unlocks how an electrochemical model can be utilized for practical state estimation of Li-S batteries.
△ Less
Submitted 25 January, 2021;
originally announced January 2021.
-
Edge Entropy as an Indicator of the Effectiveness of GNNs over CNNs for Node Classification
Authors:
Lavender Yao Jiang,
John Shi,
Mark Cheung,
Oren Wright,
José M. F. Moura
Abstract:
Graph neural networks (GNNs) extend convolutional neural networks (CNNs) to graph-based data. A question that arises is how much performance improvement does the underlying graph structure in the GNN provide over the CNN (that ignores this graph structure). To address this question, we introduce edge entropy and evaluate how good an indicator it is for possible performance improvement of GNNs over…
▽ More
Graph neural networks (GNNs) extend convolutional neural networks (CNNs) to graph-based data. A question that arises is how much performance improvement does the underlying graph structure in the GNN provide over the CNN (that ignores this graph structure). To address this question, we introduce edge entropy and evaluate how good an indicator it is for possible performance improvement of GNNs over CNNs. Our results on node classification with synthetic and real datasets show that lower values of edge entropy predict larger expected performance gains of GNNs over CNNs, and, conversely, higher edge entropy leads to expected smaller improvement gains.
△ Less
Submitted 15 December, 2020;
originally announced December 2020.