-
Explorative Imitation Learning: A Path Signature Approach for Continuous Environments
Authors:
Nathan Gavenski,
Juarez Monteiro,
Felipe Meneguzzi,
Michael Luck,
Odinaldo Rodrigues
Abstract:
Some imitation learning methods combine behavioural cloning with self-supervision to infer actions from state pairs. However, most rely on a large number of expert trajectories to increase generalisation and human intervention to capture key aspects of the problem, such as domain constraints. In this paper, we propose Continuous Imitation Learning from Observation (CILO), a new method augmenting imitation learning with two important features: (i) exploration, allowing for more diverse state transitions, requiring fewer expert trajectories and resulting in fewer training iterations; and (ii) path signatures, allowing for automatic encoding of constraints, through the creation of non-parametric representations of agent and expert trajectories. We compared CILO with a baseline and two leading imitation learning methods in five environments. It had the best overall performance of all methods in all environments, outperforming the expert in two of them.
Submitted 22 July, 2024; v1 submitted 5 July, 2024;
originally announced July 2024.
-
A Survey of Imitation Learning Methods, Environments and Metrics
Authors:
Nathan Gavenski,
Felipe Meneguzzi,
Michael Luck,
Odinaldo Rodrigues
Abstract:
Imitation learning is an approach in which an agent learns how to execute a task by trying to mimic how one or more teachers perform it. This learning approach offers a compromise between the time it takes to learn a new task and the effort needed to collect teacher samples for the agent. It achieves this by balancing learning from the teacher, who has some information on how to perform the task, and deviating from their examples when necessary, such as states not present in the teacher samples. Consequently, the field of imitation learning has received much attention from researchers in recent years, resulting in many new methods and applications. However, with this increase in published work and past surveys focusing mainly on methodology, a lack of standardisation became more prominent in the field. This non-standardisation is evident in the use of environments, which appear in no more than two works, and evaluation processes, such as qualitative analysis, that have become rare in current literature. In this survey, we systematically review current imitation learning literature and present our findings by (i) classifying imitation learning techniques, environments and metrics by introducing novel taxonomies; (ii) reflecting on main problems from the literature; and (iii) presenting challenges and future directions for researchers.
Submitted 30 July, 2024; v1 submitted 30 April, 2024;
originally announced April 2024.
-
Visual Analytics for Fine-grained Text Classification Models and Datasets
Authors:
Munkhtulga Battogtokh,
Yiwen Xing,
Cosmin Davidescu,
Alfie Abdul-Rahman,
Michael Luck,
Rita Borgo
Abstract:
In natural language processing (NLP), text classification tasks are increasingly fine-grained, as datasets are fragmented into a larger number of classes that are more difficult to differentiate from one another. As a consequence, the semantic structures of datasets have become more complex, and model decisions more difficult to explain. Existing tools, suited for coarse-grained classification, falter under these additional challenges. In response to this gap, we worked closely with NLP domain experts in an iterative design-and-evaluation process to characterize and tackle the growing requirements in their workflow of developing fine-grained text classification models. The result of this collaboration is the development of SemLa, a novel visual analytics system tailored for 1) dissecting complex semantic structures in a dataset when it is spatialized in model embedding space, and 2) visualizing fine-grained nuances in the meaning of text samples to faithfully explain model reasoning. This paper details the iterative design study and the resulting innovations featured in SemLa. The final design allows contrastive analysis at different levels by unearthing lexical and conceptual patterns including biases and artifacts in data. Expert feedback on our final design and case studies confirm that SemLa is a useful tool for supporting model validation and debugging as well as data annotation.
Submitted 21 March, 2024;
originally announced March 2024.
-
Imitation Learning Datasets: A Toolkit For Creating Datasets, Training Agents and Benchmarking
Authors:
Nathan Gavenski,
Michael Luck,
Odinaldo Rodrigues
Abstract:
The imitation learning field requires expert data to train agents in a task. Most often, this learning approach suffers from the absence of available data, which results in each technique being tested on its own dataset. Creating datasets is a cumbersome process requiring researchers to train expert agents from scratch, record their interactions and test each benchmark method with newly created data. Moreover, creating new datasets for each new technique results in a lack of consistency in the evaluation process since each dataset can drastically vary in state and action distribution. In response, this work aims to address these issues by creating Imitation Learning Datasets, a toolkit that allows for: (i) curated expert policies with multithreaded support for faster dataset creation; (ii) readily available datasets and techniques with precise measurements; and (iii) sharing implementations of common imitation learning techniques. Demonstration link: https://nathangavenski.github.io/#/il-datasets-video
Submitted 1 March, 2024;
originally announced March 2024.
-
AgentCoder: Multi-Agent-based Code Generation with Iterative Testing and Optimisation
Authors:
Dong Huang,
Jie M. Zhang,
Michael Luck,
Qingwen Bu,
Yuhao Qing,
Heming Cui
Abstract:
The advancement of natural language processing (NLP) has been significantly boosted by the development of transformer-based large language models (LLMs). These models have revolutionized NLP tasks, particularly in code generation, aiding developers in creating software with enhanced efficiency. Despite their advancements, challenges in balancing code snippet generation with effective test case generation and execution persist. To address these issues, this paper introduces Multi-Agent Assistant Code Generation (AgentCoder), a novel solution comprising a multi-agent framework with specialized agents: the programmer agent, the test designer agent, and the test executor agent. During the coding procedure, the programmer agent will focus on the code generation and refinement based on the test executor agent's feedback. The test designer agent will generate test cases for the generated code, and the test executor agent will run the code with the test cases and write the feedback to the programmer. This collaborative system ensures robust code generation, surpassing the limitations of single-agent models and traditional methodologies. Our extensive experiments on 9 code generation models and 12 enhancement approaches showcase AgentCoder's superior performance over existing code generation models and prompt engineering techniques across various benchmarks. For example, AgentCoder (GPT-4) achieves 96.3% and 91.8% pass@1 in HumanEval and MBPP datasets with an overall token overhead of 56.9K and 66.3K, while state-of-the-art obtains only 90.2% and 78.9% pass@1 with an overall token overhead of 138.2K and 206.5K.
Submitted 24 May, 2024; v1 submitted 20 December, 2023;
originally announced December 2023.
-
Resolving social dilemmas with minimal reward transfer
Authors:
Richard Willis,
Yali Du,
Joel Z Leibo,
Michael Luck
Abstract:
Multi-agent cooperation is an important topic, and is particularly challenging in mixed-motive situations where it does not pay to be nice to others. Consequently, self-interested agents often avoid collective behaviour, resulting in suboptimal outcomes for the group. In response, in this paper we introduce a metric to quantify the disparity between what is rational for individual agents and what is rational for the group, which we call the general self-interest level. This metric represents the maximum proportion of individual rewards that all agents can retain while ensuring that achieving the social welfare optimum becomes a dominant strategy. By aligning the individual and group incentives, rational agents acting to maximise their own reward will simultaneously maximise the collective reward. As agents transfer their rewards to motivate others to consider their welfare, we diverge from traditional concepts of altruism or prosocial behaviours. The general self-interest level is a property of a game that is useful for assessing the propensity of players to cooperate and understanding how features of a game impact this. We illustrate the effectiveness of our method on several novel game representations of social dilemmas with arbitrary numbers of players.
Submitted 21 March, 2024; v1 submitted 19 October, 2023;
originally announced October 2023.
-
Collaborative filtering to capture AI user's preferences as norms
Authors:
Marc Serramia,
Natalia Criado,
Michael Luck
Abstract:
Customising AI technologies to each user's preferences is fundamental to them functioning well. Unfortunately, current methods require too much user involvement and fail to capture their true preferences. In fact, to avoid the nuisance of manually setting preferences, users usually accept the default settings even if these do not conform to their true preferences. Norms can be useful to regulate behaviour and ensure it adheres to user preferences but, while the literature has thoroughly studied norms, most proposals take a formal perspective. Indeed, while there has been some research on constructing norms to capture a user's privacy preferences, these methods rely on domain knowledge which, in the case of AI technologies, is difficult to obtain and maintain. We argue that a new perspective is required when constructing norms, which is to exploit the large amount of preference information readily available from whole systems of users. Inspired by recommender systems, we believe that collaborative filtering can offer a suitable approach to identifying a user's norm preferences without excessive user involvement.
Submitted 10 August, 2023; v1 submitted 1 August, 2023;
originally announced August 2023.
-
Predicting Privacy Preferences for Smart Devices as Norms
Authors:
Marc Serramia,
William Seymour,
Natalia Criado,
Michael Luck
Abstract:
Smart devices, such as smart speakers, are becoming ubiquitous, and users expect these devices to act in accordance with their preferences. In particular, since these devices gather and manage personal data, users expect them to adhere to their privacy preferences. However, the current approach of gathering these preferences consists in asking the users directly, which usually triggers automatic responses failing to capture their true preferences. In response, in this paper we present a collaborative filtering approach to predict user preferences as norms. These preference predictions can be readily adopted or can serve to assist users in determining their own preferences. Using a dataset of privacy preferences of smart assistant users, we test the accuracy of our predictions.
Submitted 21 February, 2023;
originally announced February 2023.
-
A renewal approach to configurational entropy in one dimension
Authors:
P. L. Krapivsky,
J. M. Luck
Abstract:
We introduce a novel approach, inspired by the theory of renewal processes, to determine the configurational entropy of ensembles of constrained configurations of particles on the one-dimensional lattice. The proposed method can deal with all local rules involving only the lengths of clusters of occupied and empty sites. Within this scope, this method is both more systematic and easier to implement than the transfer-matrix approach. It is illustrated in detail on the $k$-mer deposition model and on ensembles of trapped Rydberg atoms with blockade range $b$.
Submitted 20 February, 2023; v1 submitted 17 February, 2023;
originally announced February 2023.
-
Jamming and metastability in one dimension: from the kinetically constrained Ising chain to the Riviera model
Authors:
P. L. Krapivsky,
J. M. Luck
Abstract:
The Ising chain with kinetic constraints provides many examples of totally irreversible zero-temperature dynamics leading to metastability with an exponentially large number of attractors. In most cases, the constrained zero-temperature dynamics can be mapped onto a model of random sequential adsorption. We provide a brief didactic review, based on the example of the constrained Glauber-Ising chain, of the exact results on the dynamics of these models and on their attractors that have been obtained by means of the above mapping. The Riviera model introduced recently by Puljiz et al. behaves similarly to the kinetically constrained Ising chains. This totally irreversible deposition model however does not enjoy the shielding property characterising models of random sequential adsorption. It can therefore neither be mapped onto such a model nor (in all likelihood) be solved by analytical means. We present a range of novel results on the attractors of the Riviera model, obtained by means of an exhaustive enumeration for smaller systems and of extensive simulations for larger ones, and put these results in perspective with the exact ones which are available for kinetically constrained Ising chains.
Submitted 23 November, 2022;
originally announced November 2022.
-
Self-supervised multimodal neuroimaging yields predictive representations for a spectrum of Alzheimer's phenotypes
Authors:
Alex Fedorov,
Eloy Geenjaar,
Lei Wu,
Tristan Sylvain,
Thomas P. DeRamus,
Margaux Luck,
Maria Misiura,
R Devon Hjelm,
Sergey M. Plis,
Vince D. Calhoun
Abstract:
Recent neuroimaging studies that focus on predicting brain disorders via modern machine learning approaches commonly include a single modality and rely on supervised over-parameterized models. However, a single modality provides only a limited view of the highly complex brain. Critically, supervised models in clinical settings lack accurate diagnostic labels for training. Coarse labels do not capture the long-tailed spectrum of brain disorder phenotypes, which leads to a loss of generalizability of the models that makes them less useful in diagnostic settings. This work presents a novel multi-scale coordinated framework for learning multiple representations from multimodal neuroimaging data. We propose a general taxonomy of informative inductive biases to capture unique and joint information in multimodal self-supervised fusion. The taxonomy forms a family of decoder-free models with reduced computational complexity and a propensity to capture multi-scale relationships between local and global representations of the multimodal inputs. We conduct a comprehensive evaluation of the taxonomy using functional and structural magnetic resonance imaging (MRI) data across a spectrum of Alzheimer's disease phenotypes and show that self-supervised models reveal disorder-relevant brain regions and multimodal links without access to the labels during pre-training. The proposed multimodal self-supervised learning yields representations with improved classification performance for both modalities. The concomitant rich and flexible unsupervised deep learning framework captures complex multimodal relationships and provides predictive performance that meets or exceeds that of a more narrow supervised classification analysis. We present elaborate quantitative evidence of how this framework can significantly advance our search for missing links in complex brain disorders.
Submitted 6 September, 2022;
originally announced September 2022.
-
Self-Supervised Multimodal Domino: in Search of Biomarkers for Alzheimer's Disease
Authors:
Alex Fedorov,
Tristan Sylvain,
Eloy Geenjaar,
Margaux Luck,
Lei Wu,
Thomas P. DeRamus,
Alex Kirilin,
Dmitry Bleklov,
Vince D. Calhoun,
Sergey M. Plis
Abstract:
Sensory input from multiple sources is crucial for robust and coherent human perception. Different sources contribute complementary explanatory factors. Similarly, research studies often collect multimodal imaging data, each of which can provide shared and unique information. This observation motivated the design of powerful multimodal self-supervised representation-learning algorithms. In this paper, we unify recent work on multimodal self-supervised learning under a single framework. Observing that most self-supervised methods optimize similarity metrics between a set of model components, we propose a taxonomy of all reasonable ways to organize this process. We first evaluate models on toy multimodal MNIST datasets and then apply them to a multimodal neuroimaging dataset with Alzheimer's disease patients. We find that (1) multimodal contrastive learning has significant benefits over its unimodal counterpart, (2) the specific composition of multiple contrastive objectives is critical to performance on a downstream task, and (3) maximization of the similarity between representations has a regularizing effect on a neural network, which can sometimes lead to reduced downstream performance but still reveal multimodal relations. Results show that the proposed approach outperforms previous self-supervised encoder-decoder methods based on canonical correlation analysis (CCA) or the mixture-of-experts multimodal variational autoencoder (MMVAE) on various datasets with a linear evaluation protocol. Importantly, we find a promising solution to uncover connections between modalities through a jointly shared subspace that can help advance work in our search for neuroimaging biomarkers.
Submitted 16 June, 2021; v1 submitted 25 December, 2020;
originally announced December 2020.
-
On self-supervised multi-modal representation learning: An application to Alzheimer's disease
Authors:
Alex Fedorov,
Lei Wu,
Tristan Sylvain,
Margaux Luck,
Thomas P. DeRamus,
Dmitry Bleklov,
Sergey M. Plis,
Vince D. Calhoun
Abstract:
Introspection of deep supervised predictive models trained on functional and structural brain imaging may uncover novel markers of Alzheimer's disease (AD). However, supervised training is prone to learning from spurious features (shortcut learning) impairing its value in the discovery process. Deep unsupervised and, recently, contrastive self-supervised approaches, not biased to classification, are better candidates for the task. Their multimodal options specifically offer additional regularization via modality interactions. In this paper, we introduce a way to exhaustively consider multimodal architectures for contrastive self-supervised fusion of fMRI and MRI of AD patients and controls. We show that this multimodal fusion results in representations that improve the results of the downstream classification for both modalities. We investigate the fused self-supervised features projected into the brain space and introduce a numerically stable way to do so.
Submitted 22 May, 2022; v1 submitted 25 December, 2020;
originally announced December 2020.
-
Cross-Modal Information Maximization for Medical Imaging: CMIM
Authors:
Tristan Sylvain,
Francis Dutil,
Tess Berthier,
Lisa Di Jorio,
Margaux Luck,
Devon Hjelm,
Yoshua Bengio
Abstract:
In hospitals, data are siloed to specific information systems that make the same information available under different modalities such as the different medical imaging exams the patient undergoes (CT scans, MRI, PET, Ultrasound, etc.) and their associated radiology reports. This offers unique opportunities to obtain and use at train-time those multiple views of the same information that might not always be available at test-time.
In this paper, we propose an innovative framework that makes the most of available data by learning good representations of a multi-modal input that are resilient to modality dropping at test-time, using recent advances in mutual information maximization. By maximizing cross-modal information at train time, we are able to outperform several state-of-the-art baselines in two different settings, medical image classification, and segmentation. In particular, our method is shown to have a strong impact on the inference-time performance of weaker modalities.
Submitted 1 February, 2021; v1 submitted 20 October, 2020;
originally announced October 2020.
-
On the Complexity of Horn and Krom Fragments of Second-Order Boolean Logic
Authors:
Miika Hannula,
Juha Kontinen,
Martin Lück,
Jonni Virtema
Abstract:
Second-order Boolean logic is a generalization of QBF, whose constant alternation fragments are known to be complete for the levels of the exponential time hierarchy. We consider two types of restriction of this logic: 1) restrictions to term constructions, 2) restrictions to the form of the Boolean matrix. Of the first sort, we consider two kinds of restrictions: firstly, disallowing nested use of proper function variables, and secondly stipulating that each function variable must appear with a fixed sequence of arguments. Of the second sort, we consider Horn, Krom, and core fragments of the Boolean matrix. We classify the complexity of logics obtained by combining these two types of restrictions. We show that, in most cases, logics with k alternating blocks of function quantifiers are complete for the kth or (k-1)th level of the exponential time hierarchy. Furthermore, we establish NL-completeness for the Krom and core fragments, when k=1 and both restrictions of the first sort are in effect.
△ Less
Submitted 7 July, 2020;
originally announced July 2020.
-
On the Complexity of Linear Temporal Logic with Team Semantics
Authors:
Martin Lück
Abstract:
A specification given as a formula in linear temporal logic (LTL) defines a system by its set of traces. However, certain features such as information flow security constraints are rather modeled as so-called hyperproperties, which are sets of sets of traces. One logical approach to this is team logic, which is a logical framework for the specification of dependence and independence of information. LTL with team semantics has recently been discovered as a logic for hyperproperties. We study the complexity theoretic aspects of LTL with so-called synchronous team semantics and Boolean negation, and prove that both its model checking and satisfiability problems are highly undecidable, and equivalent to the decision problem of third-order arithmetic. Furthermore, we prove that this complexity already appears at small temporal depth and with only the "future" modality F. Finally, we also introduce a team-semantical generalization of stutter-invariance.
Submitted 27 April, 2020;
originally announced April 2020.
-
On multidimensional record patterns
Authors:
P. L. Krapivsky,
J. M. Luck
Abstract:
Multidimensional record patterns are random sets of lattice points defined by means of a recursive stochastic construction. The patterns thus generated owe their richness to the fact that the construction is not based on a total order, except in one dimension, where usual records in sequences of independent random variables are recovered. We derive many exact results on the statistics of multidimensional record patterns on finite samples drawn on hypercubic lattices in any dimension $D$. The most detailed analysis concerns the two-dimensional situation, where we also investigate the distribution of the landing position of the record point which is closest to the origin. Asymptotic expressions for the full distribution and the moments of the number of records on large hypercubic samples are also obtained. The latter distribution is related to that of the largest of $D$ standard Gaussian variables.
Submitted 9 June, 2020; v1 submitted 9 December, 2019;
originally announced December 2019.
-
Navigation Agents for the Visually Impaired: A Sidewalk Simulator and Experiments
Authors:
Martin Weiss,
Simon Chamorro,
Roger Girgis,
Margaux Luck,
Samira E. Kahou,
Joseph P. Cohen,
Derek Nowrouzezahrai,
Doina Precup,
Florian Golemo,
Chris Pal
Abstract:
Millions of blind and visually-impaired (BVI) people navigate urban environments every day, using smartphones for high-level path-planning and white canes or guide dogs for local information. However, many BVI people still struggle to travel to new places. In our endeavor to create a navigation assistant for the BVI, we found that existing Reinforcement Learning (RL) environments were unsuitable for the task. This work introduces SEVN, a sidewalk simulation environment and a neural network-based approach to creating a navigation agent. SEVN contains panoramic images with labels for house numbers, doors, and street name signs, and formulations for several navigation tasks. We study the performance of an RL algorithm (PPO) in this setting. Our policy model fuses multi-modal observations in the form of variable resolution images, visible text, and simulated GPS data to navigate to a goal door. We hope that this dataset, simulator, and experimental results will provide a foundation for further research into the creation of agents that can assist members of the BVI community with outdoor navigation.
Submitted 29 October, 2019;
originally announced October 2019.
-
Quantum scattering by a disordered target -- The mean cross section
Authors:
D Boosé,
J Y Fortin,
J M Luck
Abstract:
We study the variation of the mean cross section with the density of the samples in the quantum scattering of a particle by a disordered target. The target consists of a set of pointlike scatterers, each having an equal probability of being anywhere inside a sphere whose radius may be modified. We first prove that scattering by a pointlike scatterer is characterized by a single phase shift $δ$ which takes on its values in $]0 \, , π[$ and that the scattering by ${\rm N}$ pointlike scatterers is described by a system of only ${\rm N}$ equations. We then show with the help of numerical calculations that there are two stages in the variation of the mean cross section as the density of the samples (the radius of the target) increases (decreases). Depending on the value of $δ$, the mean cross section first either increases or decreases, each one of the two behaviours being originated by double scattering; it decreases uniformly for any value of $δ$ as the density increases further on, a behaviour which results from multiple scattering and which follows that of the cross section for diffusion by a hard sphere potential of decreasing radius. The expression of the mean cross section is derived in the particular case of an unlimited number of contributions of successive scatterings.
Submitted 13 July, 2021; v1 submitted 28 August, 2019;
originally announced August 2019.
-
Parrondo games as disordered systems
Authors:
J. M. Luck
Abstract:
Parrondo's paradox refers to the counter-intuitive situation where a winning strategy results from a suitable combination of losing ones. Simple stochastic games exhibiting this paradox have been introduced around the turn of the millennium. The common setting of these Parrondo games is that two rules, $A$ and $B$, are played at discrete time steps, following either a periodic pattern or an aperiodic one, be it deterministic or random. These games can be mapped onto 1D random walks. In capital-dependent games, the probabilities of moving right or left depend on the walker's position modulo some integer $K$. In history-dependent games, each step is correlated with the $Q$ previous ones. In both cases the gain identifies with the velocity of the walker's ballistic motion, which depends non-linearly on model parameters, allowing for the possibility of Parrondo's paradox. Calculating the gain involves products of non-commuting Markov matrices, which are somehow analogous to the transfer matrices used in the physics of 1D disordered systems. Elaborating upon this analogy, we study a paradigmatic Parrondo game of each class in the neutral situation where each rule, when played alone, is fair. The main emphasis of this systematic approach is on the dependence of the gain on the remaining parameters and, above all, on the game, i.e., the rule pattern, be it periodic or aperiodic, deterministic or random. One of the most original sides of this work is the identification of weak-contrast regimes for capital-dependent and history-dependent Parrondo games, and a detailed quantitative investigation of the gain in the latter scaling regimes.
Submitted 19 August, 2019; v1 submitted 10 May, 2019;
originally announced May 2019.
-
On the Succinctness of Atoms of Dependency
Authors:
Martin Lück,
Miikka Vilander
Abstract:
Propositional team logic is the propositional analog to first-order team logic. Non-classical atoms of dependence, independence, inclusion, exclusion and anonymity can be expressed in it, but for all atoms except dependence only exponential translations are known. In this paper, we systematically compare their succinctness in the existential fragment, where the splitting disjunction only occurs positively, and in full propositional team logic with unrestricted negation. By introducing a variant of the Ehrenfeucht-Fraïssé game called formula size game into team logic, we obtain exponential lower bounds in the existential fragment for all atoms. In the full fragment, we present polynomial upper bounds also for all atoms.
Submitted 19 August, 2019; v1 submitted 6 March, 2019;
originally announced March 2019.
-
Coverage fluctuations in theater models
Authors:
P. L. Krapivsky,
J. M. Luck
Abstract:
We introduce the theater model, which is the simplest variant of directed random sequential adsorption in one dimension with point source and steric interactions. Particles enter sequentially an initially empty row of $L$ sites and adsorb irreversibly at randomly chosen places. If two particles occupy adjacent sites, they prevent further particles from passing them. A jammed configuration without available empty sites is eventually reached. More generally, we investigate the class of models parametrized by $b$, the number of consecutive particles needed to form a blockage. We show analytically that the occupations of different sites in jammed configurations exhibit long-range correlations obeying scaling laws, for all integers $b\ge2$, so that the total number of particles grows as a subextensive power of $L$, with exponent $(b-1)/b$, and keeps fluctuating even for very large systems. The exactly known relative number variance measuring this lack of self-averaging is maximal for the theater model {\it stricto sensu} ($b=2$). In the special case where $b=1$, so that each adsorbed particle is a blockage, the model can be mapped onto the statistics of records in sequences of random variables and of cycles in random permutations. A two-sided variant of the model is also considered. In both situations the number of particles grows only logarithmically with $L$, and it is self-averaging.
Submitted 21 June, 2019; v1 submitted 12 February, 2019;
originally announced February 2019.
-
Scaling laws for weakly disordered 1D flat bands
Authors:
J. M. Luck
Abstract:
We investigate Anderson localization on various 1D structures having flat bands. The main focus is on the scaling laws obeyed by the localization length at weak disorder in the vicinity of flat-band energies. A careful distinction is made between situations where the scaling functions are universal (i.e., depend on the disorder distribution only through its width) and where they keep depending on the full shape of the disorder distribution, even in the weak-disorder scaling regime. Three examples are analyzed in detail. On the stub chain, one central flat band is isolated from two lateral dispersive ones. The localization length remains microscopic at weak disorder and exhibits disorder-specific features. On the pyrochlore ladder, the two flat bands are tangent to a dispersive one. The localization length diverges with exponent 1/2 and a non-universal scaling law, whose dependence on the disorder distribution is predicted analytically. On the diamond chain, a central flat band intersects two symmetric dispersive ones. The localization length exhibits two successive scaling regimes, diverging first with exponent 4/3 and a universal law, and then (i.e., further away from the pristine flat band) with exponent 1 and a non-universal law. Both scaling functions are also derived by analytical means.
Submitted 25 April, 2019; v1 submitted 7 December, 2018;
originally announced December 2018.
-
A Survey of Mobile Computing for the Visually Impaired
Authors:
Martin Weiss,
Margaux Luck,
Roger Girgis,
Chris Pal,
Joseph Paul Cohen
Abstract:
The number of visually impaired or blind (VIB) people in the world is estimated at several hundred million. Based on a series of interviews with the VIB and developers of assistive technology, this paper provides a survey of machine-learning based mobile applications and identifies the most relevant applications. We discuss the functionality of these apps, how they align with the needs and requirements of the VIB users, and how they can be improved with techniques such as federated learning and model compression. As a result of this study we identify promising future directions of research in mobile perception, micro-navigation, and content-summarization.
Submitted 27 November, 2018; v1 submitted 25 November, 2018;
originally announced November 2018.
-
Return probability of $N$ fermions released from a 1D confining potential
Authors:
P L Krapivsky,
J M Luck,
K Mallick
Abstract:
We consider $N$ non-interacting fermions prepared in the ground state of a 1D confining potential and submitted to an instantaneous quench consisting in releasing the trapping potential. We show that the quantum return probability of finding the fermions in their initial state at a later time falls off as a power law in the long-time regime, with a universal exponent depending only on $N$ and on whether the free fermions expand over the full line or over a half-line. In both geometries the amplitudes of this power-law decay are expressed in terms of finite determinants of moments of the one-body bound-state wavefunctions in the potential. These amplitudes are worked out explicitly for the harmonic and square-well potentials. At large fermion numbers they obey scaling laws involving the Fermi energy of the initial state. The use of the Selberg-Mehta integrals stemming from random matrix theory has been instrumental in the derivation of these results.
Submitted 14 February, 2019; v1 submitted 22 October, 2018;
originally announced October 2018.
-
Learning to rank for censored survival data
Authors:
Margaux Luck,
Tristan Sylvain,
Joseph Paul Cohen,
Heloise Cardinal,
Andrea Lodi,
Yoshua Bengio
Abstract:
Survival analysis is a type of semi-supervised ranking task where the target output (the survival time) is often right-censored. Utilizing this information is a challenge because it is not obvious how to correctly incorporate these censored examples into a model. We study how three categories of loss functions, namely partial likelihood methods, rank methods, and our classification method based on a Wasserstein metric (WM) and the non-parametric Kaplan-Meier estimate of the probability density to impute the labels of censored examples, can take advantage of this information. The proposed method allows us to have a model that predicts the probability distribution of an event. If a clinician had access to the detailed probability of an event over time, this would help in treatment planning, for example by determining whether the risk of kidney graft rejection is constant or peaked after some time. Also, we demonstrate that this approach directly optimizes the expected C-index, which is the most common evaluation metric for ranking survival models.
Submitted 8 June, 2018; v1 submitted 5 June, 2018;
originally announced June 2018.
-
Distribution Matching Losses Can Hallucinate Features in Medical Image Translation
Authors:
Joseph Paul Cohen,
Margaux Luck,
Sina Honari
Abstract:
This paper discusses how distribution matching losses, such as those used in CycleGAN, when used to synthesize medical images can lead to misdiagnosis of medical conditions. It seems appealing to use these new image synthesis methods for translating images from a source to a target domain because they can produce high quality images and some do not even require paired data. However, the basis of how these image translation models work is through matching the translation output to the distribution of the target domain. This can cause an issue when the data provided in the target domain has an over- or under-representation of some classes (e.g. healthy or sick). When the output of an algorithm is a transformed image, there are uncertainties about whether all known and unknown class labels have been preserved or changed. Therefore, we recommend that these translated images should not be used for direct interpretation (e.g. by doctors) because they may lead to misdiagnosis of patients based on hallucinated image features by an algorithm that matches a distribution. However, there are many recent papers that seem to treat this as the goal.
Submitted 3 October, 2018; v1 submitted 22 May, 2018;
originally announced May 2018.
-
On the Complexity of Team Logic and its Two-Variable Fragment
Authors:
Martin Lück
Abstract:
We study the logic FO(~), the extension of first-order logic with team semantics by unrestricted Boolean negation. It was recently shown axiomatizable, but otherwise has not yet received much attention in questions of computational complexity.
In this paper, we consider its two-variable fragment FO2(~) and prove that its satisfiability problem is decidable, and in fact complete for the recently introduced non-elementary class TOWER(poly). Moreover, we classify the complexity of model checking of FO(~) with respect to the number of variables and the quantifier rank, and prove a dichotomy between PSPACE- and ATIME-ALT(exp, poly)-completeness.
To achieve the lower bounds, we propose a translation from modal team logic MTL to FO2(~) that extends the well-known standard translation from modal logic ML to FO2. For the upper bounds, we translate to a fragment of second-order logic.
Submitted 13 April, 2018;
originally announced April 2018.
-
Quantum return probability of a system of $N$ non-interacting lattice fermions
Authors:
P. L. Krapivsky,
J. M. Luck,
K. Mallick
Abstract:
We consider $N$ non-interacting fermions performing continuous-time quantum walks on a one-dimensional lattice. The system is launched from a most compact configuration where the fermions occupy neighboring sites. We calculate exactly the quantum return probability (sometimes referred to as the Loschmidt echo) of observing the very same compact state at a later time $t$. Remarkably, this probability depends on the parity of the fermion number -- it decays as a power of time for even $N$, while for odd $N$ it exhibits periodic oscillations modulated by a decaying power law. The exponent also slightly depends on the parity of $N$, and is roughly half of what it would be in the continuum limit. We also consider the same problem, and obtain similar results, in the presence of an impenetrable wall at the origin constraining the particles to remain on the positive half-line. We derive closed-form expressions for the amplitudes of the power-law decay of the return probability in all cases. The key point in the derivation is the use of Mehta integrals, which are limiting cases of the Selberg integral.
Submitted 12 February, 2018; v1 submitted 23 October, 2017;
originally announced October 2017.
-
Canonical Models and the Complexity of Modal Team Logic
Authors:
Martin Lück
Abstract:
We study modal team logic MTL, the team-semantical extension of modal logic ML closed under Boolean negation. Its fragments, such as modal dependence, independence, and inclusion logic, are well-understood. However, due to the unrestricted Boolean negation, the satisfiability problem of full MTL has been notoriously resistant to a complexity theoretical classification.
In our approach, we introduce the notion of canonical models into the team-semantical setting. By construction of such a model, we reduce the satisfiability problem of MTL to simple model checking. Afterwards, we show that this approach is optimal in the sense that MTL-formulas can efficiently enforce canonicity.
Furthermore, to capture these results in terms of complexity, we introduce a non-elementary complexity class, TOWER(poly), and prove that it contains satisfiability and validity of MTL as complete problems. We also prove that the fragments of MTL with bounded modal depth are complete for the levels of the elementary hierarchy (with polynomially many alternations). The respective hardness results hold for both strict or lax semantics of the modal operators and the splitting disjunction, and also over the class of reflexive and transitive frames.
Submitted 10 April, 2019; v1 submitted 15 September, 2017;
originally announced September 2017.
-
Rule-Mining based classification: a benchmark study
Authors:
Margaux Luck,
Nicolas Pallet,
Cecilia Damon
Abstract:
This study proposes an exhaustive stable/reproducible rule-mining algorithm combined with a classifier to generate both accurate and interpretable models. Our method first extracts rules (i.e., a conjunction of conditions about the values of a small number of input features) with our exhaustive rule-mining algorithm, then constructs a new feature space based on the most relevant rules, called "local features", and finally builds a local predictive model by training a standard classifier on the new local feature space. This local feature space is easily interpretable, providing a human-understandable explanation in the explicit form of rules. Furthermore, our local predictive approach is as powerful as global classical ones like logistic regression (LR) and support vector machine (SVM), and rule-based methods like random forest (RF) and gradient boosted tree (GBT).
Submitted 30 June, 2017;
originally announced June 2017.
-
How the fittest compete for leadership: A tale of tails
Authors:
J. M. Luck,
A. Mehta
Abstract:
We investigate how leaders emerge as a consequence of the competitive dynamics between coupled papers in a model citation network. Every paper is allocated an initial fitness depending on its intrinsic quality. Its fitness then evolves dynamically as a consequence of the competition between itself and all the other papers in the field. It picks up citations as a result of this adaptive dynamics, becoming a leader if it has the highest citation count at a given time. Extensive analytical and numerical investigations of this model suggest the existence of a universal phase diagram, divided into regions of weak and strong coupling. In the former, we find an `extended' and rather structureless distribution of citation counts among many fit papers; leaders are not necessarily those with the maximal fitness at any given time. By contrast, the strong-coupling region is characterised by a strongly hierarchical distribution of citation counts, that are `localised' among only a few extremely fit papers, and exhibit strong history-to-history fluctuations, as a result of the complex dynamics among papers in the tail of the fitness distribution.
Submitted 26 June, 2017; v1 submitted 13 June, 2017;
originally announced June 2017.
-
Deep Learning for Patient-Specific Kidney Graft Survival Analysis
Authors:
Margaux Luck,
Tristan Sylvain,
Héloïse Cardinal,
Andrea Lodi,
Yoshua Bengio
Abstract:
An accurate model of patient-specific kidney graft survival distributions can help to improve shared decision-making in the treatment and care of patients. In this paper, we propose a deep learning method that directly models the survival function instead of estimating the hazard function to predict survival times for graft patients based on the principle of multi-task learning. By learning to jointly predict the time of the event and its rank in the Cox partial log-likelihood framework, our deep learning approach outperforms, in terms of survival time prediction quality and concordance index, other common methods for survival analysis, including the Cox Proportional Hazards model and a network trained on the Cox partial log-likelihood.
Submitted 29 May, 2017;
originally announced May 2017.
-
Equilibration properties of small quantum systems: further examples
Authors:
J. M. Luck
Abstract:
It has been proposed to investigate the equilibration properties of a small isolated quantum system by means of the matrix of asymptotic transition probabilities in some preferential basis. The trace $T$ of this matrix measures the degree of equilibration of the system prepared in a typical state of the preferential basis. This quantity may vary between unity (ideal equilibration) and the dimension $N$ of the Hilbert space (no equilibration at all). Here we analyze several examples of simple systems where the behavior of $T$ can be investigated by analytical means. We first study the statistics of $T$ when the Hamiltonian governing the dynamics is random and drawn from a distribution invariant under the group U$(N)$ or O$(N)$. We then investigate a quantum spin $S$ in a tilted magnetic field making an arbitrary angle with the preferred quantization axis, as well as a tight-binding particle on a finite electrified chain. The last two cases provide examples of the interesting situation where varying a system parameter -- such as the tilt angle or the electric field -- through some scaling regime induces a continuous transition from good to bad equilibration properties.
Submitted 18 August, 2017; v1 submitted 20 February, 2017;
originally announced February 2017.
-
On Quantified Propositional Logics and the Exponential Time Hierarchy
Authors:
Miika Hannula,
Juha Kontinen,
Martin Lück,
Jonni Virtema
Abstract:
We study quantified propositional logics from the complexity theoretic point of view. First we introduce alternating dependency quantified boolean formulae (ADQBF) which generalize both quantified and dependency quantified boolean formulae. We show that the truth evaluation for ADQBF is AEXPTIME(poly)-complete. We also identify fragments for which the problem is complete for the levels of the exponential hierarchy. Second we study propositional team-based logics. We show that DQBF formulae correspond naturally to quantified propositional dependence logic and present a general NEXPTIME upper bound for quantified propositional logic with a large class of generalized dependence atoms. Moreover we show AEXPTIME(poly)-completeness for extensions of propositional team logic with generalized dependence atoms.
Submitted 13 September, 2016;
originally announced September 2016.
-
Quantum centipedes: collective dynamics of interacting quantum walkers
Authors:
P. L. Krapivsky,
J. M. Luck,
K. Mallick
Abstract:
We consider the quantum centipede made of $N$ fermionic quantum walkers on the one-dimensional lattice interacting by means of the simplest of all hard-bound constraints: the distance between two consecutive fermions is either one or two lattice spacings. This composite quantum walker spreads ballistically, just as the simple quantum walk. However, because of the interactions between the internal degrees of freedom, the distribution of its center-of-mass velocity displays numerous ballistic fronts in the long-time limit, corresponding to singularities in the empirical velocity distribution. The spectrum of the centipede and the corresponding group velocities are analyzed by direct means for the first few values of $N$. Some analytical results are obtained for arbitrary $N$ by exploiting an exact mapping of the problem onto a free-fermion system. We thus derive the maximal velocity describing the ballistic spreading of the two extremal fronts of the centipede wavefunction, including its non-trivial value in the large-$N$ limit.
Submitted 8 July, 2016; v1 submitted 15 March, 2016;
originally announced March 2016.
-
Axiomatizations of Team Logics
Authors:
Martin Lück
Abstract:
In a modular approach, we lift Hilbert-style proof systems for propositional, modal and first-order logic to generalized systems for their respective team-based extensions. We obtain sound and complete axiomatizations for the dependence-free fragment FO(~) of Väänänen's first-order team logic TL, for propositional team logic PTL, quantified propositional team logic QPTL, modal team logic MTL, and for the corresponding logics of dependence, independence, inclusion and exclusion.
As a crucial step in the completeness proof, we show that the above logics admit, in a particular sense, a semantics-preserving elimination of modalities and quantifiers from formulas.
Submitted 26 March, 2018; v1 submitted 16 February, 2016;
originally announced February 2016.
-
Complete Problems of Propositional Logic for the Exponential Hierarchy
Authors:
Martin Lück
Abstract:
Large complexity classes, like the exponential time hierarchy, received little attention in terms of finding complete problems. In this work a generalization of propositional logic is investigated which fills this gap with the introduction of Boolean higher-order quantifiers or equivalently Boolean Skolem functions. This builds on the important results of Wrathall and Stockmeyer regarding complete problems, namely QBF and QBF-k, for the polynomial hierarchy. Furthermore it generalizes the Dependency QBF problem introduced by Peterson, Reif and Azhar which is complete for NEXP, the first level of the exponential hierarchy. Also it turns out that the hardness results do not collapse at the consideration of conjunctive and disjunctive normal forms, in contrast to plain QBF.
Submitted 27 May, 2016; v1 submitted 9 February, 2016;
originally announced February 2016.
-
L1 logistic regression as a feature selection step for training stable classification trees for the prediction of severity criteria in imported malaria
Authors:
Luca Talenti,
Margaux Luck,
Anastasia Yartseva,
Nicolas Argy,
Sandrine Houzé,
Cecilia Damon
Abstract:
Multivariate classification methods using explanatory and predictive models are necessary for characterizing subgroups of patients according to their risk profiles. Popular methods include logistic regression and classification trees with performances that vary according to the nature and the characteristics of the dataset. In the context of imported malaria, we aimed at classifying severity criteria based on a heterogeneous patient population. We investigated these approaches by implementing two different strategies: L1 logistic regression (L1LR) that models a single global solution and classification trees that model multiple local solutions corresponding to discriminant subregions of the feature space. For each strategy, we built a standard model, and a sparser version of it. As an alternative to pruning, we explore a promising approach that first constrains the tree model with an L1LR-based feature selection, an approach we called L1LR-Tree. The objective is to decrease its vulnerability to small data variations by removing variables corresponding to unstable local phenomena. Our study is twofold: i) from a methodological perspective comparing the performances and the stability of the three previous methods, i.e., L1LR, classification trees and L1LR-Tree, for the classification of severe forms of imported malaria, and ii) from an applied perspective improving the actual classification of severe forms of imported malaria by identifying more personalized profiles predictive of several clinical criteria based on variables dismissed for the clinical definition of the disease. The main methodological results show that the combined method L1LR-Tree builds sparse and stable models that significantly predict the different severity criteria and outperforms all the other methods in terms of accuracy.
Submitted 20 November, 2015;
originally announced November 2015.
-
Universality in survivor distributions: Characterising the winners of competitive dynamics
Authors:
J. M. Luck,
A. Mehta
Abstract:
We investigate the survivor distributions of a spatially extended model of competitive dynamics in different geometries. The model consists of a deterministic dynamical system of individual agents at specified nodes, which might or might not survive the predatory dynamics: all stochasticity is brought in by the initial state. Every such initial state leads to a unique and extended pattern of survivors and non-survivors, which is known as an attractor of the dynamics. We show that the number of such attractors grows exponentially with system size, so that their exact characterisation is limited to only very small systems. Given this, we construct an analytical approach based on inhomogeneous mean-field theory to calculate survival probabilities for arbitrary networks. This powerful (albeit approximate) approach shows how universality arises in survivor distributions via a key concept -- the {\it dynamical fugacity}. Remarkably, in the large-mass limit, the survival probability of a node becomes independent of network geometry, and assumes a simple form which depends only on its mass and degree.
Submitted 13 November, 2015;
originally announced November 2015.
-
Quirky Quantifiers: Optimal Models and Complexity of Computation Tree Logic
Authors:
Martin Lück
Abstract:
The satisfiability problem of the branching time logic CTL is studied in terms of computational complexity. Tight upper and lower bounds are provided for each temporal operator fragment. In parallel, the minimal model size is studied with a suitable notion of minimality. Thirdly, flat CTL is investigated, i.e., formulas with very low temporal operator nesting depth. A sharp dichotomy is shown in terms of complexity and minimal models: Temporal depth one has low expressive power, while temporal depth two is equivalent to full CTL.
Submitted 24 February, 2017; v1 submitted 29 October, 2015;
originally announced October 2015.
-
An investigation of equilibration in small quantum systems: the example of a particle in a 1D random potential
Authors:
J. M. Luck
Abstract:
We investigate the equilibration of a small isolated quantum system by means of its matrix of asymptotic transition probabilities in a preferential basis. The trace of this matrix is shown to measure the degree of equilibration of the system launched from a typical state, from the standpoint of the chosen basis. This approach is substantiated by an in-depth study of the example of a tight-binding particle in one dimension. In the regime of free ballistic propagation, the above trace saturates to a finite limit, testifying good equilibration. In the presence of a random potential, the trace grows linearly with the system size, testifying poor equilibration in the insulating regime induced by Anderson localization. In the weak-disorder situation of most interest, a universal finite-size scaling law describes the crossover between the ballistic and localized regimes. The associated crossover exponent 2/3 is dictated by the anomalous band-edge scaling characterizing the most localized energy eigenstates.
Submitted 3 February, 2016; v1 submitted 21 October, 2015;
originally announced October 2015.
-
Interacting quantum walkers: Two-body bosonic and fermionic bound states
Authors:
P. L. Krapivsky,
J. M. Luck,
K. Mallick
Abstract:
We investigate the dynamics of bound states of two interacting particles, either bosons or fermions, performing a continuous-time quantum walk on a one-dimensional lattice. We consider the situation where the distance between both particles has a hard bound, and the richer situation where the particles are bound by a smooth confining potential. The main emphasis is on the velocity characterizing the ballistic spreading of these bound states, and on the structure of the asymptotic distribution profile of their center-of-mass coordinate. The latter profile generically exhibits many internal fronts.
Submitted 5 November, 2015; v1 submitted 6 July, 2015;
originally announced July 2015.
-
LTL Fragments are Hard for Standard Parameterisations
Authors:
Martin Lück,
Arne Meier
Abstract:
We classify the complexity of the LTL satisfiability and model checking problems for several standard parameterisations. The investigated parameters are temporal depth, number of propositional variables and formula treewidth, resp., pathwidth. We show that all operator fragments of LTL under the investigated parameterisations are intractable in the sense of parameterised complexity.
Submitted 22 September, 2015; v1 submitted 23 April, 2015;
originally announced April 2015.
-
Parameterized Complexity of CTL: A Generalization of Courcelle's Theorem
Authors:
Martin Lück,
Arne Meier,
Irina Schindler
Abstract:
We present an almost complete classification of the parameterized complexity of all operator fragments of the satisfiability problem in computation tree logic CTL. The investigated parameterization is the sum of temporal depth and structural pathwidth. The classification shows a dichotomy between W[1]-hard and fixed-parameter tractable fragments. The only real operator fragment which is confirmed to be in FPT is the fragment containing solely AX. We also prove a generalization of Courcelle's theorem to infinite signatures, which is used to prove the FPT-membership case.
Submitted 24 March, 2015; v1 submitted 15 October, 2014;
originally announced October 2014.
-
Slow synaptic dynamics in a network: from exponential to power-law forgetting
Authors:
J. M. Luck,
A. Mehta
Abstract:
We investigate a mean-field model of interacting synapses on a directed neural network. Our interest lies in the slow adaptive dynamics of synapses, which are driven by the fast dynamics of the neurons they connect. Cooperation is modelled from the usual Hebbian perspective, while competition is modelled by an original polarity-driven rule. The emergence of a critical manifold culminating in a tricritical point is crucially dependent on the presence of synaptic competition. This leads to a universal $1/t$ power-law relaxation of the mean synaptic strength along the critical manifold and an equally universal $1/\sqrt{t}$ relaxation at the tricritical point, to be contrasted with the exponential relaxation that is otherwise generic. In turn, this leads to the natural emergence of long- and short-term memory from different parts of parameter space in a synaptic network, which is the most novel and important result of our present investigations.
Submitted 15 September, 2014;
originally announced September 2014.
-
Unusual electronic properties of clean and disordered zigzag graphene nanoribbons
Authors:
J. M. Luck,
Y. Avishai
Abstract:
We revisit the problem of electron transport in clean and disordered zigzag graphene nanoribbons, and expose numerous hitherto unknown peculiar properties of these systems at zero energy, where both sublattices decouple because of chiral symmetry. For clean ribbons, we give a quantitative description of the unusual power-law dispersion of the central energy bands and of its main consequences, including the strong divergence of the density of states near zero energy, and the vanishing of the transverse localization length of the corresponding edge states. In the presence of off-diagonal disorder, which respects the lattice chiral symmetry, all zero-energy localization properties are found to be anomalous. Recasting the problem in terms of coupled Brownian motions enables us to derive numerous asymptotic results by analytical means. In particular the typical conductance $g_N$ of a disordered sample of width $N$ and length $L$ is shown to decay as $\exp(-C_Nw\sqrt{L})$, for arbitrary values of the disorder strength $w$, while the relative variance of $\ln g_N$ approaches a non-trivial constant $K_N$. The dependence of the constants $C_N$ and $K_N$ on the ribbon width $N$ is predicted. From the mere viewpoint of the transfer-matrix formalism, zigzag ribbons provide a case study with many unusual features. The transfer matrix describing propagation through one unit cell of a clean ribbon is not diagonalizable at zero energy. In the disordered case, we encounter non-trivial random matrix products such that all Lyapunov exponents vanish identically.
Submitted 15 December, 2014; v1 submitted 16 June, 2014;
originally announced June 2014.
-
Survival of classical and quantum particles in the presence of traps
Authors:
P. L. Krapivsky,
J. M. Luck,
K. Mallick
Abstract:
We present a detailed comparison of the motion of a classical and of a quantum particle in the presence of trapping sites, within the framework of continuous-time classical and quantum random walk. The main emphasis is on the qualitative differences in the temporal behavior of the survival probabilities of both kinds of particles. As a general rule, static traps are far less efficient at absorbing quantum particles than classical ones. Several lattice geometries are successively considered: an infinite chain with a single trap, a finite ring with a single trap, a finite ring with several traps, and an infinite chain and a higher-dimensional lattice with a random distribution of traps with a given density. For the latter disordered systems, the classical and the quantum survival probabilities obey a stretched exponential asymptotic decay, albeit with different exponents. These results confirm earlier predictions, and the corresponding amplitudes are evaluated. In the one-dimensional geometry of the infinite chain, we obtain a full analytical prediction for the amplitude of the quantum problem, including its dependence on the trap density and strength.
Submitted 11 March, 2014; v1 submitted 25 November, 2013;
originally announced November 2013.
-
On the frequencies of patterns of rises and falls
Authors:
J M Luck
Abstract:
We investigate the probability of observing a given pattern of $n$ rises and falls in a random stationary data series. The data are modelled as a sequence of $n+1$ independent and identically distributed random numbers. This probabilistic approach has a combinatorial equivalent, where the data are modelled by a random permutation on $n+1$ objects. The probability of observing a long pattern of rises and falls decays exponentially with its length $n$ in general. The associated decay rate $α$ is interpreted as the embedding entropy of the pattern. This rate is evaluated exactly for all periodic patterns. In the most general case, it is expressed in terms of a determinant of generalized hyperbolic or trigonometric functions. Alternating patterns have the smallest rate $α_{\rm min}=\ln(π/2)=0.451582\dots$, while other examples lead to arbitrarily large rates. The probabilities of observing uniformly chosen random patterns are demonstrated to obey multifractal statistics. The typical value $α_0=0.806361\dots$ of the rate plays the role of a Lyapunov exponent. A wide range of examples of patterns, either deterministic or random, is also investigated.
Submitted 9 April, 2014; v1 submitted 30 September, 2013;
originally announced September 2013.
-
Asymmetric Langevin dynamics for the ferromagnetic spherical model
Authors:
C Godreche,
J M Luck
Abstract:
The present work pursues the investigation of the role of spatial asymmetry and irreversibility on the dynamical properties of spin systems. We consider the ferromagnetic spherical model with asymmetric linear Langevin dynamics. Such an asymmetric dynamics is irreversible, i.e., breaks detailed balance, because the principle of action and reaction is violated. The fluctuation-dissipation theorem therefore no longer holds. The stationary state is however still Gibbsian, i.e., the weights of configurations are given by the Boltzmann factor corresponding to the ferromagnetic Hamiltonian. The model is exactly solvable in any dimension, enabling an analytical evaluation of time-dependent observables. We show the existence of two regimes of violation of the fluctuation-dissipation theorem in the nonequilibrium stationary state: a regime of weak violation where the stationary fluctuation-dissipation ratio is finite but less than unity and varies continuously with the asymmetry, and a regime of strong violation where the fluctuation-dissipation ratio vanishes asymptotically. This phenomenon was first uncovered in the asymmetric kinetic Ising chain. The present results suggest that this novel kind of dynamical transition in nonequilibrium stationary states might be quite general. We also perform a systematic analysis of several regimes of interest, either stationary or transient, in various dimensions and in the different phases of the model.
Submitted 19 February, 2013;
originally announced February 2013.