-
DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset
Authors:
Alexander Khazatsky,
Karl Pertsch,
Suraj Nair,
Ashwin Balakrishna,
Sudeep Dasari,
Siddharth Karamcheti,
Soroush Nasiriany,
Mohan Kumar Srirama,
Lawrence Yunliang Chen,
Kirsty Ellis,
Peter David Fagan,
Joey Hejna,
Masha Itkina,
Marion Lepert,
Yecheng Jason Ma,
Patrick Tree Miller,
Jimmy Wu,
Suneel Belkhale,
Shivin Dass,
Huy Ha,
Arhan Jain,
Abraham Lee,
Youngwoon Lee,
Marius Memmel,
Sungjae Park
, et al. (74 additional authors not shown)
Abstract:
The creation of large, diverse, high-quality robot manipulation datasets is an important stepping stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a resu…
▽ More
The creation of large, diverse, high-quality robot manipulation datasets is an important stepping stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a result, even the most general robot manipulation policies today are mostly trained on data collected in a small number of environments with limited scene and task diversity. In this work, we introduce DROID (Distributed Robot Interaction Dataset), a diverse robot manipulation dataset with 76k demonstration trajectories or 350 hours of interaction data, collected across 564 scenes and 84 tasks by 50 data collectors in North America, Asia, and Europe over the course of 12 months. We demonstrate that training with DROID leads to policies with higher performance and improved generalization ability. We open source the full dataset, policy learning code, and a detailed guide for reproducing our robot hardware setup.
△ Less
Submitted 19 March, 2024;
originally announced March 2024.
-
On the Complexity of Multi-Agent Decision Making: From Learning in Games to Partial Monitoring
Authors:
Dylan J. Foster,
Dean P. Foster,
Noah Golowich,
Alexander Rakhlin
Abstract:
A central problem in the theory of multi-agent reinforcement learning (MARL) is to understand what structural conditions and algorithmic principles lead to sample-efficient learning guarantees, and how these considerations change as we move from few to many agents. We study this question in a general framework for interactive decision making with multiple agents, encompassing Markov games with fun…
▽ More
A central problem in the theory of multi-agent reinforcement learning (MARL) is to understand what structural conditions and algorithmic principles lead to sample-efficient learning guarantees, and how these considerations change as we move from few to many agents. We study this question in a general framework for interactive decision making with multiple agents, encompassing Markov games with function approximation and normal-form games with bandit feedback. We focus on equilibrium computation, in which a centralized learning algorithm aims to compute an equilibrium by controlling multiple agents that interact with an unknown environment. Our main contributions are:
- We provide upper and lower bounds on the optimal sample complexity for multi-agent decision making based on a multi-agent generalization of the Decision-Estimation Coefficient, a complexity measure introduced by Foster et al. (2021) in the single-agent counterpart to our setting. Compared to the best results for the single-agent setting, our bounds have additional gaps. We show that no "reasonable" complexity measure can close these gaps, highlighting a striking separation between single and multiple agents.
- We show that characterizing the statistical complexity for multi-agent decision making is equivalent to characterizing the statistical complexity of single-agent decision making, but with hidden (unobserved) rewards, a framework that subsumes variants of the partial monitoring problem. As a consequence, we characterize the statistical complexity for hidden-reward interactive decision making to the best extent possible.
Building on this development, we provide several new structural results, including 1) conditions under which the statistical complexity of multi-agent decision making can be reduced to that of single-agent, and 2) conditions under which the so-called curse of multiple agents can be avoided.
△ Less
Submitted 1 May, 2023;
originally announced May 2023.
-
A Dual Threshold Analogue Content Addressable Memory
Authors:
Patrick Foster,
Alex Serb,
Themis Prodromakis
Abstract:
Advances in machine learning and neuromorphic systems are fuelled by the development of architectures required for these applications, such as content addressable memory. In an attempt to address this need, this paper presents a new RRAM tuned window comparator, building upon existing work in reconfigurable computing. The circuit uses a low component count at 6T2R2M, comparable with the most compa…
▽ More
Advances in machine learning and neuromorphic systems are fuelled by the development of architectures required for these applications, such as content addressable memory. In an attempt to address this need, this paper presents a new RRAM tuned window comparator, building upon existing work in reconfigurable computing. The circuit uses a low component count at 6T2R2M, comparable with the most compact existing cells of this type. This paper will present this design, demonstrating its operation with TiOx memristive devices, showing its controllability and specificity. This paper will then simulate the energy dissipated in its operation, showing it to be below 100pJ per test, comparable to existing works.
△ Less
Submitted 5 March, 2023;
originally announced March 2023.
-
Digital identity architectures: comparing goals and vulnerabilities
Authors:
Callum Mole,
Ed Chalstrey,
Peter Foster,
Tim Hobson
Abstract:
Digital identity systems have the promise of efficiently facilitating access to services for a nation's citizens while increasing security and convenience. There are many possible system architectures, each with strengths and weaknesses that should be carefully considered. This report first establishes a set of goals and vulnerabilities faced by any identity system, then evaluates the trade-offs o…
▽ More
Digital identity systems have the promise of efficiently facilitating access to services for a nation's citizens while increasing security and convenience. There are many possible system architectures, each with strengths and weaknesses that should be carefully considered. This report first establishes a set of goals and vulnerabilities faced by any identity system, then evaluates the trade-offs of common digital identity architectures, principally comparing centralised and decentralised systems.
△ Less
Submitted 20 February, 2023;
originally announced February 2023.
-
Linear Reinforcement Learning with Ball Structure Action Space
Authors:
Zeyu Jia,
Randy Jia,
Dhruv Madeka,
Dean P. Foster
Abstract:
We study the problem of Reinforcement Learning (RL) with linear function approximation, i.e. assuming the optimal action-value function is linear in a known $d$-dimensional feature mapping. Unfortunately, however, based on only this assumption, the worst case sample complexity has been shown to be exponential, even under a generative model. Instead of making further assumptions on the MDP or value…
▽ More
We study the problem of Reinforcement Learning (RL) with linear function approximation, i.e. assuming the optimal action-value function is linear in a known $d$-dimensional feature mapping. Unfortunately, however, based on only this assumption, the worst case sample complexity has been shown to be exponential, even under a generative model. Instead of making further assumptions on the MDP or value functions, we assume that our action space is such that there always exist playable actions to explore any direction of the feature space. We formalize this assumption as a ``ball structure'' action space, and show that being able to freely explore the feature space allows for efficient RL. In particular, we propose a sample-efficient RL algorithm (BallRL) that learns an $ε$-optimal policy using only $\tilde{O}\left(\frac{H^5d^3}{ε^3}\right)$ number of trajectories.
△ Less
Submitted 14 November, 2022;
originally announced November 2022.
-
Forecast Hedging and Calibration
Authors:
Dean P. Foster,
Sergiu Hart
Abstract:
Calibration means that forecasts and average realized frequencies are close. We develop the concept of forecast hedging, which consists of choosing the forecasts so as to guarantee that the expected track record can only improve. This yields all the calibration results by the same simple basic argument while differentiating between them by the forecast-hedging tools used: deterministic and fixed p…
▽ More
Calibration means that forecasts and average realized frequencies are close. We develop the concept of forecast hedging, which consists of choosing the forecasts so as to guarantee that the expected track record can only improve. This yields all the calibration results by the same simple basic argument while differentiating between them by the forecast-hedging tools used: deterministic and fixed point based versus stochastic and minimax based. Additional contributions are an improved definition of continuous calibration, ensuing game dynamics that yield Nash equilibria in the long run, and a new calibrated forecasting procedure for binary events that is simpler than all known such procedures.
△ Less
Submitted 13 October, 2022;
originally announced October 2022.
-
Smooth Calibration, Leaky Forecasts, Finite Recall, and Nash Dynamics
Authors:
Dean P. Foster,
Sergiu Hart
Abstract:
We propose to smooth out the calibration score, which measures how good a forecaster is, by combining nearby forecasts. While regular calibration can be guaranteed only by randomized forecasting procedures, we show that smooth calibration can be guaranteed by deterministic procedures. As a consequence, it does not matter if the forecasts are leaked, i.e., made known in advance: smooth calibration…
▽ More
We propose to smooth out the calibration score, which measures how good a forecaster is, by combining nearby forecasts. While regular calibration can be guaranteed only by randomized forecasting procedures, we show that smooth calibration can be guaranteed by deterministic procedures. As a consequence, it does not matter if the forecasts are leaked, i.e., made known in advance: smooth calibration can nevertheless be guaranteed (while regular calibration cannot). Moreover, our procedure has finite recall, is stationary, and all forecasts lie on a finite grid. To construct the procedure, we deal also with the related setups of online linear regression and weak calibration. Finally, we show that smooth calibration yields uncoupled finite-memory dynamics in n-person games "smooth calibrated learning" in which the players play approximate Nash equilibria in almost all periods (by contrast, calibrated learning, which uses regular calibration, yields only that the time-averages of play are approximate correlated equilibria).
△ Less
Submitted 13 October, 2022;
originally announced October 2022.
-
Deep Inventory Management
Authors:
Dhruv Madeka,
Kari Torkkola,
Carson Eisenach,
Anna Luo,
Dean P. Foster,
Sham M. Kakade
Abstract:
This work provides a Deep Reinforcement Learning approach to solving a periodic review inventory control system with stochastic vendor lead times, lost sales, correlated demand, and price matching. While this dynamic program has historically been considered intractable, our results show that several policy learning approaches are competitive with or outperform classical methods. In order to train…
▽ More
This work provides a Deep Reinforcement Learning approach to solving a periodic review inventory control system with stochastic vendor lead times, lost sales, correlated demand, and price matching. While this dynamic program has historically been considered intractable, our results show that several policy learning approaches are competitive with or outperform classical methods. In order to train these algorithms, we develop novel techniques to convert historical data into a simulator. On the theoretical side, we present learnability results on a subclass of inventory control problems, where we provide a provable reduction of the reinforcement learning problem to that of supervised learning. On the algorithmic side, we present a model-based reinforcement learning procedure (Direct Backprop) to solve the periodic review inventory control problem by constructing a differentiable simulator. Under a variety of metrics Direct Backprop outperforms model-free RL and newsvendor baselines, in both simulations and real-world deployments.
△ Less
Submitted 28 November, 2022; v1 submitted 6 October, 2022;
originally announced October 2022.
-
"Calibeating": Beating Forecasters at Their Own Game
Authors:
Dean P. Foster,
Sergiu Hart
Abstract:
In order to identify expertise, forecasters should not be tested by their calibration score, which can always be made arbitrarily small, but rather by their Brier score. The Brier score is the sum of the calibration score and the refinement score; the latter measures how good the sorting into bins with the same forecast is, and thus attests to "expertise." This raises the question of whether one c…
▽ More
In order to identify expertise, forecasters should not be tested by their calibration score, which can always be made arbitrarily small, but rather by their Brier score. The Brier score is the sum of the calibration score and the refinement score; the latter measures how good the sorting into bins with the same forecast is, and thus attests to "expertise." This raises the question of whether one can gain calibration without losing expertise, which we refer to as "calibeating." We provide an easy way to calibeat any forecast, by a deterministic online procedure. We moreover show that calibeating can be achieved by a stochastic procedure that is itself calibrated, and then extend the results to simultaneously calibeating multiple procedures, and to deterministic procedures that are continuously calibrated.
△ Less
Submitted 26 October, 2022; v1 submitted 11 September, 2022;
originally announced September 2022.
-
A bivariate functional copula joint model for longitudinal measurements and time-to-event data
Authors:
Zili Zhang,
Christiana Charalambous,
Peter Foster
Abstract:
A bivariate functional copula joint model, which models the repeatedly measured longitudinal outcome at each time point with the survival data, jointly by both random effects and bivariate functional copulas, is proposed in this paper. A regular joint model normally supposes there are some subject-specific latent random effects or classes shared by the longitudinal and time-to-event processes and…
▽ More
A bivariate functional copula joint model, which models the repeatedly measured longitudinal outcome at each time point with the survival data, jointly by both random effects and bivariate functional copulas, is proposed in this paper. A regular joint model normally supposes there are some subject-specific latent random effects or classes shared by the longitudinal and time-to-event processes and they are assumed to be conditionally independent given these latent random variables. Under this assumption, the joint likelihood of the two processes can be easily derived and the association between them, as well as heterogeneity among population are naturally introduced by the unobservable latent random variables. However, because of the unobservable nature of these latent variables, the conditional independence assumption is difficult to verify. Therefore, a bivariate functional copula is introduced into a regular joint model to account for the cases where there could be extra association between the two processes which cannot be captured by the latent random variables. Our proposed model includes a regular joint model as a special case when the correlation function, which is modelled continuously by B-spline basis functions as a function of time $t,$ is constant at 0 under the bivariate Gaussian copula. Simulation studies and dynamic prediction of survival probabilities are conducted to compare the performance of the proposed model with the regular joint model and a real data application on the Primary biliary cirrhosis (PBC) data is performed.
△ Less
Submitted 11 September, 2022;
originally announced September 2022.
-
A Few Expert Queries Suffices for Sample-Efficient RL with Resets and Linear Value Approximation
Authors:
Philip Amortila,
Nan Jiang,
Dhruv Madeka,
Dean P. Foster
Abstract:
The current paper studies sample-efficient Reinforcement Learning (RL) in settings where only the optimal value function is assumed to be linearly-realizable. It has recently been understood that, even under this seemingly strong assumption and access to a generative model, worst-case sample complexities can be prohibitively (i.e., exponentially) large. We investigate the setting where the learner…
▽ More
The current paper studies sample-efficient Reinforcement Learning (RL) in settings where only the optimal value function is assumed to be linearly-realizable. It has recently been understood that, even under this seemingly strong assumption and access to a generative model, worst-case sample complexities can be prohibitively (i.e., exponentially) large. We investigate the setting where the learner additionally has access to interactive demonstrations from an expert policy, and we present a statistically and computationally efficient algorithm (Delphi) for blending exploration with expert queries. In particular, Delphi requires $\tilde{\mathcal{O}}(d)$ expert queries and a $\texttt{poly}(d,H,|\mathcal{A}|,1/\varepsilon)$ amount of exploratory samples to provably recover an $\varepsilon$-suboptimal policy. Compared to pure RL approaches, this corresponds to an exponential improvement in sample complexity with surprisingly-little expert input. Compared to prior imitation learning (IL) approaches, our required number of expert demonstrations is independent of $H$ and logarithmic in $1/\varepsilon$, whereas all prior work required at least linear factors of both in addition to the same dependence on $d$. Towards establishing the minimal amount of expert queries needed, we show that, in the same setting, any learner whose exploration budget is polynomially-bounded (in terms of $d,H,$ and $|\mathcal{A}|$) will require at least $\tildeΩ(\sqrt{d})$ oracle calls to recover a policy competing with the expert's value function. Under the weaker assumption that the expert's policy is linear, we show that the lower bound increases to $\tildeΩ(d)$.
△ Less
Submitted 17 July, 2022;
originally announced July 2022.
-
An FPGA-based System for Generalised Electron Devices Testing
Authors:
Patrick Foster,
Jinqi Huang,
Alex Serb,
Spyros Stathopoulos,
Christos Papavassiliou,
Themis Prodromakis
Abstract:
Electronic systems are becoming more and more ubiquitous as our world digitises. Simultaneously, even basic components are experiencing a wave of improvements with new transistors, memristors, voltage/current references, data converters, etc, being designed every year by hundreds of R&D groups world-wide. To date, the workhorse for testing all these designs has been a suite of lab instruments incl…
▽ More
Electronic systems are becoming more and more ubiquitous as our world digitises. Simultaneously, even basic components are experiencing a wave of improvements with new transistors, memristors, voltage/current references, data converters, etc, being designed every year by hundreds of R&D groups world-wide. To date, the workhorse for testing all these designs has been a suite of lab instruments including oscilloscopes and signal generators, to mention the most popular. However, as components become more complex and pin numbers soar, the need for more parallel and versatile testing tools also becomes more pressing. In this work, we describe and benchmark an FPGA system developed that addresses this need. This general purpose testing system features a 64-channel source-meter unit (SMU), and 2x banks of 32 digital pins for digital I/O. We demonstrate that this bench-top system can obtain $170 pA$ current noise floor, $40 ns$ pulse delivery at $\pm13.5 V$ and $12 mA$ maximum current drive/channel. We then showcase the instrument's use in performing a selection of three characteristic measurement tasks: a) current-voltage (IV) characterisation of a diode and a transistor, b) fully parallel read-out of a memristor crossbar array and c) an integral non-linearity (INL) test on a DAC. This work introduces a down-scaled electronics laboratory packaged in a single instrument which provides a shift towards more affordable, reliable, compact and multi-functional instrumentation for emerging electronic technologies.
△ Less
Submitted 1 February, 2022;
originally announced February 2022.
-
On Submodular Contextual Bandits
Authors:
Dean P. Foster,
Alexander Rakhlin
Abstract:
We consider the problem of contextual bandits where actions are subsets of a ground set and mean rewards are modeled by an unknown monotone submodular function that belongs to a class $\mathcal{F}$. We allow time-varying matroid constraints to be placed on the feasible sets. Assuming access to an online regression oracle with regret $\mathsf{Reg}(\mathcal{F})$, our algorithm efficiently randomizes…
▽ More
We consider the problem of contextual bandits where actions are subsets of a ground set and mean rewards are modeled by an unknown monotone submodular function that belongs to a class $\mathcal{F}$. We allow time-varying matroid constraints to be placed on the feasible sets. Assuming access to an online regression oracle with regret $\mathsf{Reg}(\mathcal{F})$, our algorithm efficiently randomizes around local optima of estimated functions according to the Inverse Gap Weighting strategy. We show that cumulative regret of this procedure with time horizon $n$ scales as $O(\sqrt{n \mathsf{Reg}(\mathcal{F})})$ against a benchmark with a multiplicative factor $1/2$. On the other hand, using the techniques of (Filmus and Ward 2014), we show that an $ε$-Greedy procedure with local randomization attains regret of $O(n^{2/3} \mathsf{Reg}(\mathcal{F})^{1/3})$ against a stronger $(1-e^{-1})$ benchmark.
△ Less
Submitted 3 December, 2021;
originally announced December 2021.
-
A Gaussian copula joint model for longitudinal and time-to-event data with random effects
Authors:
Zili Zhang,
Christiana Charalambous,
Peter Foster
Abstract:
Longitudinal and survival sub-models are two building blocks for joint modelling of longitudinal and time to event data. Extensive research indicates separate analysis of these two processes could result in biased outputs due to their associations. Conditional independence between measurements of biomarkers and event time process given latent classes or random effects is a common approach for char…
▽ More
Longitudinal and survival sub-models are two building blocks for joint modelling of longitudinal and time to event data. Extensive research indicates separate analysis of these two processes could result in biased outputs due to their associations. Conditional independence between measurements of biomarkers and event time process given latent classes or random effects is a common approach for characterising the association between the two sub-models while taking the heterogeneity among the population into account. However, this assumption is tricky to validate because of the unobservable latent variables. Thus a Gaussian copula joint model with random effects is proposed to accommodate the scenarios where the conditional independence assumption is questionable. In our proposed model, the conventional joint model assuming conditional independence is a special case when the association parameter in the Gaussian copula shrinks to zero. Simulation studies and real data application are carried out to evaluate the performance of our proposed model. In addition, personalised dynamic predictions of survival probabilities are obtained based on the proposed model and comparisons are made to the predictions obtained under the conventional joint model.
△ Less
Submitted 21 September, 2022; v1 submitted 3 December, 2021;
originally announced December 2021.
-
A chiral topological add-drop filter for integrated quantum photonic circuits
Authors:
M. Jalali Mehrabad,
A. P. Foster,
N. J. Martin,
R. Dost,
E. Clarke,
P. K. Patil,
M. S. Skolnick,
L. R. Wilson
Abstract:
The integration of quantum emitters within topological nano-photonic devices opens up new avenues for the control of light-matter interactions at the single photon level. Here, we realise a spin-dependent, chiral light-matter interface using individual semiconductor quantum dots embedded in a topological add-drop filter. The filter is imprinted within a valley-Hall photonic crystal (PhC) membrane…
▽ More
The integration of quantum emitters within topological nano-photonic devices opens up new avenues for the control of light-matter interactions at the single photon level. Here, we realise a spin-dependent, chiral light-matter interface using individual semiconductor quantum dots embedded in a topological add-drop filter. The filter is imprinted within a valley-Hall photonic crystal (PhC) membrane and comprises a resonator evanescently coupled to a pair of access waveguides. We show that the longitudinal modes of the resonator enable the filter to perform wavelength-selective routing of light, protected by the underlying topology. Furthermore, we demonstrate that for a quantum dot located at a chiral point in the resonator, selective coupling occurs between well-defined spin states and specific output ports of the topological device. This behaviour is fundamental to the operation of chiral devices such as a quantum optical circulator. Our device therefore represents a topologically-protected building block with potential to play an enabling role in the development of chiral integrated quantum photonic circuits.
△ Less
Submitted 14 October, 2021;
originally announced October 2021.
-
Joint modelling of longitudinal measurements and survival times via a multivariate copula approach
Authors:
Zili Zhang,
Christiana Charalambous,
Peter Foster
Abstract:
Joint modelling of longitudinal and time-to-event data is usually described by a joint model which uses shared or correlated latent effects to capture associations between the two processes. Under this framework, the joint distribution of the two processes can be derived straightforwardly by assuming conditional independence given the random effects. Alternative approaches to induce interdependenc…
▽ More
Joint modelling of longitudinal and time-to-event data is usually described by a joint model which uses shared or correlated latent effects to capture associations between the two processes. Under this framework, the joint distribution of the two processes can be derived straightforwardly by assuming conditional independence given the random effects. Alternative approaches to induce interdependency into sub-models have also been considered in the literature and one such approach is using copulas to introduce non-linear correlation between the marginal distributions of the longitudinal and time-to-event processes. The multivariate Gaussian copula joint model has been proposed in the literature to fit joint data by applying a Monte Carlo expectation-maximisation algorithm. In this paper, we propose an exact likelihood estimation approach to replace the more computationally expensive Monte Carlo expectation-maximisation algorithm and we consider results based on using both the multivariate Gaussian and $t$ copula functions. We also provide a straightforward way to compute dynamic predictions of survival probabilities, showing that our proposed model is comparable in prediction performance to the shared random effects joint model.
△ Less
Submitted 3 March, 2022; v1 submitted 27 August, 2021;
originally announced August 2021.
-
The Benefits of Implicit Regularization from SGD in Least Squares Problems
Authors:
Difan Zou,
Jingfeng Wu,
Vladimir Braverman,
Quanquan Gu,
Dean P. Foster,
Sham M. Kakade
Abstract:
Stochastic gradient descent (SGD) exhibits strong algorithmic regularization effects in practice, which has been hypothesized to play an important role in the generalization of modern machine learning approaches. In this work, we seek to understand these issues in the simpler setting of linear regression (including both underparameterized and overparameterized regimes), where our goal is to make s…
▽ More
Stochastic gradient descent (SGD) exhibits strong algorithmic regularization effects in practice, which has been hypothesized to play an important role in the generalization of modern machine learning approaches. In this work, we seek to understand these issues in the simpler setting of linear regression (including both underparameterized and overparameterized regimes), where our goal is to make sharp instance-based comparisons of the implicit regularization afforded by (unregularized) average SGD with the explicit regularization of ridge regression. For a broad class of least squares problem instances (that are natural in high-dimensional settings), we show: (1) for every problem instance and for every ridge parameter, (unregularized) SGD, when provided with logarithmically more samples than that provided to the ridge algorithm, generalizes no worse than the ridge solution (provided SGD uses a tuned constant stepsize); (2) conversely, there exist instances (in this wide problem class) where optimally-tuned ridge regression requires quadratically more samples than SGD in order to have the same generalization performance. Taken together, our results show that, up to the logarithmic factors, the generalization performance of SGD is always no worse than that of ridge regression in a wide range of overparameterized problems, and, in fact, could be much better for some problem instances. More generally, our results show how algorithmic regularization has important consequences even in simpler (overparameterized) convex settings.
△ Less
Submitted 10 July, 2022; v1 submitted 10 August, 2021;
originally announced August 2021.
-
Engineering strong chiral light-matter interactions in a waveguide-coupled nanocavity
Authors:
D. Hallett,
A. P. Foster,
D. M. Whittaker,
M. S. Skolnick,
L. R. Wilson
Abstract:
Spin-dependent, directional light-matter interactions form the basis of chiral quantum networks. In the solid state, quantum emitters commonly possess circularly polarised optical transitions with spin-dependent handedness. We demonstrate numerically that spin-dependent chiral coupling can be realised by embedding such an emitter in a waveguide-coupled nanocavity, which supports two near-degenerat…
▽ More
Spin-dependent, directional light-matter interactions form the basis of chiral quantum networks. In the solid state, quantum emitters commonly possess circularly polarised optical transitions with spin-dependent handedness. We demonstrate numerically that spin-dependent chiral coupling can be realised by embedding such an emitter in a waveguide-coupled nanocavity, which supports two near-degenerate, orthogonally-polarised cavity modes. The chiral behaviour arises due to direction-dependent interference between the cavity modes upon coupling to two single-mode output waveguides. Notably, an experimentally realistic cavity design simultaneously supports near-unity chiral contrast, efficient ($β> 0.95$) waveguide coupling and enhanced light-matter interaction strength (Purcell factor $F_P > 70$). In combination, these parameters could enable the development of highly coherent spin-photon interfaces, ready for integration into nanophotonic circuits.
△ Less
Submitted 28 January, 2022; v1 submitted 3 August, 2021;
originally announced August 2021.
-
Efimov-DNA Phase diagram: three stranded DNA on a cubic lattice
Authors:
Somendra M. Bhattacharjee,
Damien Paul Foster
Abstract:
We define a generalised model for three-stranded DNA consisting of two chains of one type and a third chain of a different type. The DNA strands are modelled by random walks on the three-dimensional cubic lattice with different interactions between two chains of the same type and two chains of different types. This model may be thought of as a classical analogue of the quantum three-body problem.…
▽ More
We define a generalised model for three-stranded DNA consisting of two chains of one type and a third chain of a different type. The DNA strands are modelled by random walks on the three-dimensional cubic lattice with different interactions between two chains of the same type and two chains of different types. This model may be thought of as a classical analogue of the quantum three-body problem. In the quantum situation it is known that three identical quantum particles will form a triplet with an infinite tower of bound states at the point where any pair of particles would have zero binding energy. The phase diagram is mapped out, and the different phase transitions examined using finite-size scaling. We look particularly at the scaling of the DNA model at the equivalent Efimov point for chains up to 10000 steps in length. We find clear evidence of several bound states in the finite-size scaling. We compare these states with the expected Efimov behaviour.
△ Less
Submitted 3 June, 2021;
originally announced June 2021.
-
Critical Behaviour of Magnetic Polymers in Two and Three Dimensions
Authors:
Damien Paul Foster,
Debjyoti Majumdar
Abstract:
We explore the critical behaviour of two and three dimensional lattice models of polymers in dilute solution where the monomers carry a magnetic moment which interacts ferromagnetically with near-neighbour monomers. Specifically, the model explored consists of a self-avoiding walk on a square or cubic lattice with Ising spins on the visited sites. In three dimensions we confirm and extend previous…
▽ More
We explore the critical behaviour of two and three dimensional lattice models of polymers in dilute solution where the monomers carry a magnetic moment which interacts ferromagnetically with near-neighbour monomers. Specifically, the model explored consists of a self-avoiding walk on a square or cubic lattice with Ising spins on the visited sites. In three dimensions we confirm and extend previous numerical work, showing clearly the first-order character of both the magnetic transition and polymer collapse, which happen together. We present results for the first time in two dimensions, where the transition is seen to be continuous. Finite-size scaling is used to extract estimates for the critical exponents and transition temperature in the absence of an external magnetic field.
△ Less
Submitted 27 May, 2021;
originally announced May 2021.
-
Odd dynamics of living chiral crystals
Authors:
Tzer Han Tan,
Alexander Mietke,
Junang Li,
Yuchao Chen,
Hugh Higinbotham,
Peter J. Foster,
Shreyas Gokhale,
Jörn Dunkel,
Nikta Fakhri
Abstract:
Active crystals are highly ordered structures that emerge from the self-organization of motile objects, and have been widely studied in synthetic and bacterial active matter. Whether collective crystallization phenomena can occur in groups of autonomously developing multicellular organisms is currently unknown. Here, we show that swimming starfish embryos spontaneously assemble into chiral crystal…
▽ More
Active crystals are highly ordered structures that emerge from the self-organization of motile objects, and have been widely studied in synthetic and bacterial active matter. Whether collective crystallization phenomena can occur in groups of autonomously developing multicellular organisms is currently unknown. Here, we show that swimming starfish embryos spontaneously assemble into chiral crystals that span thousands of spinning organisms and persist for tens of hours. Combining experiments, theory, and simulations, we demonstrate that the formation, dynamics, and dissolution of these living crystals are controlled by the hydrodynamic properties and natural development of embryos. Remarkably, living chiral crystals exhibit self-sustained chiral oscillations as well as various unconventional deformation response behaviors recently predicted for odd elastic materials. Our results provide direct experimental evidence for how nonreciprocal interactions between autonomous multicellular components may facilitate novel nonequilibrium phases of chiral active matter.
△ Less
Submitted 3 March, 2022; v1 submitted 16 May, 2021;
originally announced May 2021.
-
Threshold Martingales and the Evolution of Forecasts
Authors:
Dean P. Foster,
Robert A. Stine
Abstract:
This paper introduces a martingale that characterizes two properties of evolving forecast distributions. Ideal forecasts of a future event behave as martingales, sequen- tially updating the forecast to leverage the available information as the future event approaches. The threshold martingale introduced here measures the proportion of the forecast distribution lying below a threshold. In addition…
▽ More
This paper introduces a martingale that characterizes two properties of evolving forecast distributions. Ideal forecasts of a future event behave as martingales, sequen- tially updating the forecast to leverage the available information as the future event approaches. The threshold martingale introduced here measures the proportion of the forecast distribution lying below a threshold. In addition to being calibrated, a threshold martingale has quadratic variation that accumulates to a total determined by a quantile of the initial forecast distribution. Deviations from calibration or to- tal volatility signal problems in the underlying model. Calibration adjustments are well-known, and we augment these by introducing a martingale filter that improves volatility while guaranteeing smaller mean squared error. Thus, post-processing can rectify problems with calibration and volatility without revisiting the original forecast- ing model. We apply threshold martingales first to forecasts from simulated models and then to models that predict the winner in professional basketball games.
△ Less
Submitted 14 May, 2021;
originally announced May 2021.
-
SK-Tree: a systematic malware detection algorithm on streaming trees via the signature kernel
Authors:
Thomas Cochrane,
Peter Foster,
Varun Chhabra,
Maud Lemercier,
Cristopher Salvi,
Terry Lyons
Abstract:
The development of machine learning algorithms in the cyber security domain has been impeded by the complex, hierarchical, sequential and multimodal nature of the data involved. In this paper we introduce the notion of a streaming tree as a generic data structure encompassing a large portion of real-world cyber security data. Starting from host-based event logs we represent computer processes as s…
▽ More
The development of machine learning algorithms in the cyber security domain has been impeded by the complex, hierarchical, sequential and multimodal nature of the data involved. In this paper we introduce the notion of a streaming tree as a generic data structure encompassing a large portion of real-world cyber security data. Starting from host-based event logs we represent computer processes as streaming trees that evolve in continuous time. Leveraging the properties of the signature kernel, a machine learning tool that recently emerged as a leading technology for learning with complex sequences of data, we develop the SK-Tree algorithm. SK-Tree is a supervised learning method for systematic malware detection on streaming trees that is robust to irregular sampling and high dimensionality of the underlying streams. We demonstrate the effectiveness of SK-Tree to detect malicious events on a portion of the publicly available DARPA OpTC dataset, achieving an AUROC score of 98%.
△ Less
Submitted 29 September, 2021; v1 submitted 15 February, 2021;
originally announced February 2021.
-
An improved set of electron-THFA cross sections refined through a neural network-based analysis of swarm data
Authors:
Peter W. Stokes,
Sean P. Foster,
Madalyn J. E. Casey,
Daniel G. Cocks,
Olmo González-Magaña,
Jaime de Urquijo,
Gustavo García,
Michael J. Brunger,
Ronald D. White
Abstract:
We review experimental and theoretical cross sections for electron transport in $α$-tetrahydrofurfuryl alcohol (THFA) and, in doing so, propose a plausible complete set. To assess the accuracy and self-consistency of our proposed set, we use the pulsed-Townsend technique to measure drift velocities, longitudinal diffusion coefficients and effective Townsend first ionisation coefficients for electr…
▽ More
We review experimental and theoretical cross sections for electron transport in $α$-tetrahydrofurfuryl alcohol (THFA) and, in doing so, propose a plausible complete set. To assess the accuracy and self-consistency of our proposed set, we use the pulsed-Townsend technique to measure drift velocities, longitudinal diffusion coefficients and effective Townsend first ionisation coefficients for electron swarms in admixtures of THFA in argon, across a range of density-reduced electric fields from 1 Td to 450 Td. These measurements are then compared to simulated values derived from our proposed set using a multi-term solution of Boltzmann's equation. We observe discrepancies between the simulation and experiment, which we attempt to address by employing a neural network model that is trained to solve the inverse swarm problem of unfolding the cross sections underpinning our experimental swarm measurements. What results from our neural network-based analysis is a refined set of electron-THFA cross sections, which we confirm is of higher consistency with our swarm measurements than that we initially proposed. We also use our data base to calculate electron transport coefficients in pure THFA, across a range of reduced electric fields from 0.001 Td to 10,000 Td.
△ Less
Submitted 20 January, 2021;
originally announced January 2021.
-
What are the Statistical Limits of Offline RL with Linear Function Approximation?
Authors:
Ruosong Wang,
Dean P. Foster,
Sham M. Kakade
Abstract:
Offline reinforcement learning seeks to utilize offline (observational) data to guide the learning of (causal) sequential decision making strategies. The hope is that offline reinforcement learning coupled with function approximation methods (to deal with the curse of dimensionality) can provide a means to help alleviate the excessive sample complexity burden in modern sequential decision making p…
▽ More
Offline reinforcement learning seeks to utilize offline (observational) data to guide the learning of (causal) sequential decision making strategies. The hope is that offline reinforcement learning coupled with function approximation methods (to deal with the curse of dimensionality) can provide a means to help alleviate the excessive sample complexity burden in modern sequential decision making problems. However, the extent to which this broader approach can be effective is not well understood, where the literature largely consists of sufficient conditions.
This work focuses on the basic question of what are necessary representational and distributional conditions that permit provable sample-efficient offline reinforcement learning. Perhaps surprisingly, our main result shows that even if: i) we have realizability in that the true value function of \emph{every} policy is linear in a given set of features and 2) our off-policy data has good coverage over all features (under a strong spectral condition), then any algorithm still (information-theoretically) requires a number of offline samples that is exponential in the problem horizon in order to non-trivially estimate the value of \emph{any} given policy. Our results highlight that sample-efficient offline policy evaluation is simply not possible unless significantly stronger conditions hold; such conditions include either having low distribution shift (where the offline data distribution is close to the distribution of the policy to be evaluated) or significantly stronger representational conditions (beyond realizability).
△ Less
Submitted 22 October, 2020;
originally announced October 2020.
-
Dimensionless Anomaly Detection on Multivariate Streams with Variance Norm and Path Signature
Authors:
Zhen Shao,
Ryan Sze-Yin Chan,
Thomas Cochrane,
Peter Foster,
Terry Lyons
Abstract:
In this paper, we propose a dimensionless anomaly detection method for multivariate streams. Our method is independent of the unit of measurement for the different stream channels, therefore dimensionless. We first propose the variance norm, a generalisation of Mahalanobis distance to handle infinite-dimensional feature space and singular empirical covariance matrix rigorously. We then combine the…
▽ More
In this paper, we propose a dimensionless anomaly detection method for multivariate streams. Our method is independent of the unit of measurement for the different stream channels, therefore dimensionless. We first propose the variance norm, a generalisation of Mahalanobis distance to handle infinite-dimensional feature space and singular empirical covariance matrix rigorously. We then combine the variance norm with the path signature, an infinite collection of iterated integrals that provide global features of streams, to propose SigMahaKNN, a method for anomaly detection on (multivariate) streams. We show that SigMahaKNN is invariant to stream reparametrisation, stream concatenation and has a graded discrimination power depending on the truncation level of the path signature. We implement SigMahaKNN as an open-source software, and perform extensive numerical experiments, showing significantly improved anomaly detection on streams compared to isolation forest and local outlier factors in applications ranging from language analysis, hand-writing analysis, ship movement paths analysis and univariate time-series analysis.
△ Less
Submitted 6 December, 2023; v1 submitted 5 June, 2020;
originally announced June 2020.
-
Chiral topological photonics with an embedded quantum emitter
Authors:
Mahmoud Jalali Mehrabad,
Andrew P. Foster,
René Dost,
A. Mark Fox,
Maurice S. Skolnick,
Luke R. Wilson
Abstract:
Topological photonic interfaces support topologically non-trivial optical modes with helical character. When combined with an embedded quantum emitter that has a circularly polarised transition dipole moment, a chiral quantum optical interface is formed due to spin-momentum locking. Here, we experimentally realise such an interface by integrating semiconductor quantum dots into a valley-Hall topol…
▽ More
Topological photonic interfaces support topologically non-trivial optical modes with helical character. When combined with an embedded quantum emitter that has a circularly polarised transition dipole moment, a chiral quantum optical interface is formed due to spin-momentum locking. Here, we experimentally realise such an interface by integrating semiconductor quantum dots into a valley-Hall topological photonic crystal waveguide. We harness the robust waveguide transport to create a ring resonator which supports helical modes. Chiral coupling of quantum dot transitions, with directional contrast as high as $75\%$, is demonstrated. The interface also supports a topologically trivial mode, comparison with which allows us to clearly demonstrate the protection afforded by topology to the non-trivial mode.
△ Less
Submitted 28 October, 2020; v1 submitted 20 December, 2019;
originally announced December 2019.
-
A Semiconductor Topological Photonic Ring Resonator
Authors:
M. Jalali Mehrabad,
A. P. Foster,
R. Dost,
E. Clarke,
P. K. Patil,
I. Farrer,
J. Heffernan,
M. S. Skolnick,
L. R. Wilson
Abstract:
Unidirectional photonic edge states arise at the interface between two topologically-distinct photonic crystals. Here, we demonstrate a micron-scale GaAs photonic ring resonator, created using a spin Hall-type topological photonic crystal waveguide. Embedded InGaAs quantum dots are used to probe the mode structure of the device. We map the spatial profile of the resonator modes, and demonstrate co…
▽ More
Unidirectional photonic edge states arise at the interface between two topologically-distinct photonic crystals. Here, we demonstrate a micron-scale GaAs photonic ring resonator, created using a spin Hall-type topological photonic crystal waveguide. Embedded InGaAs quantum dots are used to probe the mode structure of the device. We map the spatial profile of the resonator modes, and demonstrate control of the mode confinement through tuning of the photonic crystal lattice parameters. The intrinsic chirality of the edge states makes them of interest for applications in integrated quantum photonics, and the resonator represents an important building block towards the development of such devices with embedded quantum emitters.
△ Less
Submitted 10 February, 2020; v1 submitted 16 October, 2019;
originally announced October 2019.
-
Actively crosslinked microtubule networks: mechanics, dynamics and filament sliding
Authors:
Sebastian Fürthauer,
Bezia Lemma,
Peter J. Foster,
Stephanie C. Ems-McClung,
Claire E. Walczak,
Zvonimir Dogic,
Daniel J. Needleman,
Michael J. Shelley
Abstract:
Cytoskeletal networks are foundational examples of active matter and central to self-organized structures in the cell. In vivo, these networks are active and heavily crosslinked. Relating their large-scale dynamics to properties of their constituents remains an unsolved problem. Here we study an in vitro system made from microtubules and XCTK2 kinesin motors, which forms an aligned and active gel.…
▽ More
Cytoskeletal networks are foundational examples of active matter and central to self-organized structures in the cell. In vivo, these networks are active and heavily crosslinked. Relating their large-scale dynamics to properties of their constituents remains an unsolved problem. Here we study an in vitro system made from microtubules and XCTK2 kinesin motors, which forms an aligned and active gel. Using photobleaching we demonstrate that the gel's aligned microtubules, driven by motors, continually slide past each other at a speed independent of the local polarity. This phenomenon is also observed, and remains unexplained, in spindles. We derive a general framework for coarse graining microtubule gels crosslinked by molecular motors from microscopic considerations. Using the microtubule-microtubule coupling, and force-velocity relationship for kinesin, this theory naturally explains the experimental results: motors generate an active strain-rate in regions of changing polarity, which allows microtubules of opposite polarities to slide past each other without stressing the material.
△ Less
Submitted 3 December, 2018;
originally announced December 2018.
-
Tunable photon statistics exploiting the Fano effect in a waveguide
Authors:
A. P. Foster,
D. Hallett,
I. V. Iorsh,
S. J. Sheldon,
M. R. Godsland,
B. Royall,
E. Clarke,
I. A. Shelykh,
A. M. Fox,
M. S. Skolnick,
I. E. Itskevich,
L. R. Wilson
Abstract:
A strong optical nonlinearity arises when coherent light is scattered by a semiconductor quantumdot (QD) coupled to a nano-photonic waveguide. We exploit the Fano effect in such a waveguide to control the phase of the quantum interference underpinning the nonlinearity, experimentally demonstrating a tunable quantum optical filter which converts a coherent input state into either a bunched, or anti…
▽ More
A strong optical nonlinearity arises when coherent light is scattered by a semiconductor quantumdot (QD) coupled to a nano-photonic waveguide. We exploit the Fano effect in such a waveguide to control the phase of the quantum interference underpinning the nonlinearity, experimentally demonstrating a tunable quantum optical filter which converts a coherent input state into either a bunched, or antibunched non-classical output state. We show theoretically that the generation of non-classical light is predicated on the formation of a two-photon bound state due to the interaction of the input coherent state with the QD. Our model demonstrates that the tunable photon statistics arise from the dependence of the sign of two-photon interference (either constructive or destructive) on the detuning of the input relative to the Fano resonance.
△ Less
Submitted 30 April, 2019; v1 submitted 21 November, 2018;
originally announced November 2018.
-
Coupled Recurrent Models for Polyphonic Music Composition
Authors:
John Thickstun,
Zaid Harchaoui,
Dean P. Foster,
Sham M. Kakade
Abstract:
This paper introduces a novel recurrent model for music composition that is tailored to the structure of polyphonic music. We propose an efficient new conditional probabilistic factorization of musical scores, viewing a score as a collection of concurrent, coupled sequences: i.e. voices. To model the conditional distributions, we borrow ideas from both convolutional and recurrent neural models; we…
▽ More
This paper introduces a novel recurrent model for music composition that is tailored to the structure of polyphonic music. We propose an efficient new conditional probabilistic factorization of musical scores, viewing a score as a collection of concurrent, coupled sequences: i.e. voices. To model the conditional distributions, we borrow ideas from both convolutional and recurrent neural models; we argue that these ideas are natural for capturing music's pitch invariances, temporal structure, and polyphony. We train models for single-voice and multi-voice composition on 2,300 scores from the KernScores dataset.
△ Less
Submitted 26 November, 2019; v1 submitted 19 November, 2018;
originally announced November 2018.
-
Nonradiative emission and absorption rates of quantum emitters embedded in metallic systems: microscopic description and their determination from electronic transport
Authors:
M. B. Silva Neto,
F. M. D'Angelis,
P. P. P. Foster,
F. A. Pinheiro
Abstract:
We investigate nonradiative emission and absorption rates of two-level quantum emitters embedded in a metal at low temperatures. We obtain the expressions for both nonradiative transition rates and identify a unique, experimentally accessible way to obtain the nonradiative decay rates via electronic transport in the host metallic system. Our findings not only provide a microscopic description of n…
▽ More
We investigate nonradiative emission and absorption rates of two-level quantum emitters embedded in a metal at low temperatures. We obtain the expressions for both nonradiative transition rates and identify a unique, experimentally accessible way to obtain the nonradiative decay rates via electronic transport in the host metallic system. Our findings not only provide a microscopic description of nonradiative decay channels in metals, but they also allows one to identify and differentiate them from other decay channels, which is crucial to understand and control light-matter interactions at the nanoscale.
△ Less
Submitted 29 May, 2018;
originally announced May 2018.
-
Ultrafast Imaging of Laser Driven Shock Waves using Betatron X-rays from a Laser Wakefield Accelerator
Authors:
J. C. Wood,
D. J. Chapman,
K. Poder,
N. C. Lopes,
M. E. Rutherford,
T. G. White,
F. Albert,
K. T. Behm,
N. Booth,
J. S. J. Bryant,
P. S. Foster,
S. Glenzer,
E. Hill,
K. Krushelnick,
Z. Najmudin,
B. B. Pollock,
S. Rose,
W. Schumaker,
R. H. H. Scott,
M. Sherlock,
A. G. R. Thomas,
Z. Zhao,
D. Eakins,
S. P. D. Mangles
Abstract:
Betatron radiation from laser wakefield accelerators is an ultrashort pulsed source of hard, synchrotron-like x-ray radiation. It emanates from a centimetre scale plasma accelerator producing GeV level electron beams. In recent years betatron radiation has been developed as a unique source capable of producing high resolution x-ray images in compact geometries. However, until now, the short pulse…
▽ More
Betatron radiation from laser wakefield accelerators is an ultrashort pulsed source of hard, synchrotron-like x-ray radiation. It emanates from a centimetre scale plasma accelerator producing GeV level electron beams. In recent years betatron radiation has been developed as a unique source capable of producing high resolution x-ray images in compact geometries. However, until now, the short pulse nature of this radiation has not been exploited. This report details the first experiment to utilise betatron radiation to image a rapidly evolving phenomenon by using it to radiograph a laser driven shock wave in a silicon target. The spatial resolution of the image is comparable to what has been achieved in similar experiments at conventional synchrotron light sources. The intrinsic temporal resolution of betatron radiation is below 100 fs, indicating that significantly faster processes could be probed in future without compromising spatial resolution. Quantitative measurements of the shock velocity and material density were made from the radiographs recorded during shock compression and were consistent with the established shock response of silicon, as determined with traditional velocimetry approaches. This suggests that future compact betatron imaging beamlines could be useful in the imaging and diagnosis of high-energy-density physics experiments.
△ Less
Submitted 6 February, 2018;
originally announced February 2018.
-
Electrical control of nonlinear quantum optics in a nano-photonic waveguide
Authors:
D. Hallett,
A. P. Foster,
D. L. Hurst,
B. Royall,
P. Kok,
E. Clarke,
I. E. Itskevich,
A. M. Fox,
M. S. Skolnick,
L. R. Wilson
Abstract:
Local control of the generation and interaction of indistinguishable single photons is a key requirement for photonic quantum networks. Waveguide-based architectures, in which embedded quantum emitters act as both highly coherent single photon sources and as nonlinear elements to mediate photon-photon interactions, offer a scalable route to such networks. However, local electrical control of a qua…
▽ More
Local control of the generation and interaction of indistinguishable single photons is a key requirement for photonic quantum networks. Waveguide-based architectures, in which embedded quantum emitters act as both highly coherent single photon sources and as nonlinear elements to mediate photon-photon interactions, offer a scalable route to such networks. However, local electrical control of a quantum optical nonlinearity has yet to be demonstrated in a waveguide geometry. Here, we demonstrate local electrical tuning and switching of single photon generation and nonlinear interaction by embedding a quantum dot in a nano-photonic waveguide with enhanced light-matter interaction. A power-dependent transmission extinction as large as 40$\pm$2% and clear, voltage-controlled bunching in the photon statistics of the transmitted light demonstrate the single photon character of the nonlinearity. The deterministic nature of the nonlinearity is particularly attractive for the future realization of photonic gates for scalable nano-photonic waveguide-based quantum information processing.
△ Less
Submitted 28 November, 2017; v1 submitted 2 November, 2017;
originally announced November 2017.
-
Enhanced laser-driven ion acceleration by superponderomotive electrons generated from near-critical-density plasma
Authors:
J. H. Bin,
M. Yeung,
Z. Gong,
H. Y. Wang,
C. Kreuzer,
M. L. Zhou,
M. J. V. Streeter,
P. S. Foster,
S. Cousens,
B. Dromey,
J. Meyer-ter-Vehn,
M. Zepf,
J. Schreiber
Abstract:
We report on the experimental studies of laser driven ion acceleration from double-layer target where a near-critical density target with a few-micron thickness is coated in front of a nanometer thin diamond-like carbon foil. A significant enhancement of proton maximum energies from 12 to ~30 MeV is observed when relativistic laser pulse impinge on the double-layer target under linear polarization…
▽ More
We report on the experimental studies of laser driven ion acceleration from double-layer target where a near-critical density target with a few-micron thickness is coated in front of a nanometer thin diamond-like carbon foil. A significant enhancement of proton maximum energies from 12 to ~30 MeV is observed when relativistic laser pulse impinge on the double-layer target under linear polarization. We attributed the enhanced acceleration to superponderomotive electrons that were simultaneously measured in the experiments with energies far beyond the free-electron ponderomotive limit. Our interpretation is supported by two-dimensional simulation results.
△ Less
Submitted 26 October, 2017;
originally announced October 2017.
-
Measuring and modeling polymer gradients argues that spindle microtubules regulate their own nucleation
Authors:
Bryan Kaye,
Olivia Stiehl,
Peter J. Foster,
Michael J. Shelley,
Daniel J. Needleman,
Sebastian Fürthauer
Abstract:
Spindles are self-organized microtubule-based structures that segregate chromosomes during cell division. The mass of the spindle is controlled by the balance between microtubule turnover and nucleation. The mechanisms that control the spatial regulation of microtubule nucleation remain poorly understood. Previous work has found that microtubule nucleators bind to microtubules in the spindle, but…
▽ More
Spindles are self-organized microtubule-based structures that segregate chromosomes during cell division. The mass of the spindle is controlled by the balance between microtubule turnover and nucleation. The mechanisms that control the spatial regulation of microtubule nucleation remain poorly understood. Previous work has found that microtubule nucleators bind to microtubules in the spindle, but it is unclear if this binding regulates the activity of those nucleators. Here we use a combination of experiments and mathematical modeling to investigate this issue. We measure the concentration of tubulin and microtubules in and around the spindle. We found a very sharp decay in microtubules at the spindle interface, which is inconsistent with the activity of microtubule nucleators being independent of their association with microtubules and consistent with a model in which microtubule nucleators are only active when bound to a microtubule. This strongly argues that the activity of microtubule nucleators is greatly enhanced when bound to microtubules. Thus, microtubule nucleators are both localized and activated by the microtubules they generate.
△ Less
Submitted 23 October, 2017;
originally announced October 2017.
-
Electro-mechanical control of an on-chip optical beam splitter containing an embedded quantum emitter
Authors:
Z. K. Bishop,
A. P. Foster,
B. Royall,
C. Bentham,
E. Clarke,
M. S. Skolnick,
L. R. Wilson
Abstract:
We demonstrate electro-mechanical control of an on-chip GaAs optical beam splitter containing a quantum dot single-photon source. The beam splitter consists of two nanobeam waveguides, which form a directional coupler (DC). The splitting ratio of the DC is controlled by varying the out-of-plane separation of the two waveguides using electro-mechanical actuation. We reversibly tune the beam splitte…
▽ More
We demonstrate electro-mechanical control of an on-chip GaAs optical beam splitter containing a quantum dot single-photon source. The beam splitter consists of two nanobeam waveguides, which form a directional coupler (DC). The splitting ratio of the DC is controlled by varying the out-of-plane separation of the two waveguides using electro-mechanical actuation. We reversibly tune the beam splitter between an initial state, with emission into both output arms, and a final state with photons emitted into a single output arm. The device represents a compact and scalable tuning approach for use in III-V semiconductor integrated quantum optical circuits.
△ Less
Submitted 2 May, 2018; v1 submitted 18 September, 2017;
originally announced September 2017.
-
Connecting macroscopic dynamics with microscopic properties in active microtubule network contraction
Authors:
Peter J. Foster,
Wen Yan,
Sebastian Fürthauer,
Michael J. Shelley,
Daniel J. Needleman
Abstract:
The cellular cytoskeleton is an active material, driven out of equilibrium by molecular motor proteins. It is not understood how the collective behaviors of cytoskeletal networks emerge from the properties of the network's constituent motor proteins and filaments. Here we present experimental results on networks of stabilized microtubules in Xenopus oocyte extracts, which undergo spontaneous bulk…
▽ More
The cellular cytoskeleton is an active material, driven out of equilibrium by molecular motor proteins. It is not understood how the collective behaviors of cytoskeletal networks emerge from the properties of the network's constituent motor proteins and filaments. Here we present experimental results on networks of stabilized microtubules in Xenopus oocyte extracts, which undergo spontaneous bulk contraction driven by the motor protein dynein, and investigate the effects of varying the initial microtubule density and length distribution. We find that networks contract to a similar final density, irrespective of the length of microtubules or their initial density, but that the contraction timescale varies with the average microtubule length. To gain insight into why this microscopic property influences the macroscopic network contraction time, we developed simulations where microtubules and motors are explicitly represented. The simulations qualitatively recapitulate the variation of contraction timescale with microtubule length, and allowed stress contributions from different sources to be estimated and decoupled.
△ Less
Submitted 30 June, 2017;
originally announced June 2017.
-
Impartial Predictive Modeling and the Use of Proxy Variables
Authors:
Kory D. Johnson,
Dean P. Foster,
Robert A. Stine
Abstract:
Fairness aware data mining (FADM) aims to prevent algorithms from discriminating against protected groups. The literature has come to an impasse as to what constitutes explainable variability as opposed to discrimination. This distinction hinges on a rigorous understanding of the role of proxy variables; i.e., those variables which are associated both the protected feature and the outcome of inter…
▽ More
Fairness aware data mining (FADM) aims to prevent algorithms from discriminating against protected groups. The literature has come to an impasse as to what constitutes explainable variability as opposed to discrimination. This distinction hinges on a rigorous understanding of the role of proxy variables; i.e., those variables which are associated both the protected feature and the outcome of interest. We demonstrate that fairness is achieved by ensuring impartiality with respect to sensitive characteristics and provide a framework for impartiality by accounting for different perspectives on the data generating process. In particular, fairness can only be precisely defined in a full-data scenario in which all covariates are observed. We then analyze how these models may be conservatively estimated via regression in partial-data settings. Decomposing the regression estimates provides insights into previously unexplored distinctions between explainable variability and discrimination that illuminate the use of proxy variables in fairness aware data mining.
△ Less
Submitted 7 January, 2022; v1 submitted 1 August, 2016;
originally announced August 2016.
-
Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging
Authors:
Yong Xu,
Qiang Huang,
Wenwu Wang,
Peter Foster,
Siddharth Sigtia,
Philip J. B. Jackson,
Mark D. Plumbley
Abstract:
Environmental audio tagging aims to predict only the presence or absence of certain acoustic events in the interested acoustic scene. In this paper we make contributions to audio tagging in two parts, respectively, acoustic modeling and feature learning. We propose to use a shrinking deep neural network (DNN) framework incorporating unsupervised feature learning to handle the multi-label classific…
▽ More
Environmental audio tagging aims to predict only the presence or absence of certain acoustic events in the interested acoustic scene. In this paper we make contributions to audio tagging in two parts, respectively, acoustic modeling and feature learning. We propose to use a shrinking deep neural network (DNN) framework incorporating unsupervised feature learning to handle the multi-label classification task. For the acoustic modeling, a large set of contextual frames of the chunk are fed into the DNN to perform a multi-label classification for the expected tags, considering that only chunk (or utterance) level rather than frame-level labels are available. Dropout and background noise aware training are also adopted to improve the generalization capability of the DNNs. For the unsupervised feature learning, we propose to use a symmetric or asymmetric deep de-noising auto-encoder (sDAE or aDAE) to generate new data-driven features from the Mel-Filter Banks (MFBs) features. The new features, which are smoothed against background noise and more compact with contextual information, can further improve the performance of the DNN baseline. Compared with the standard Gaussian Mixture Model (GMM) baseline of the DCASE 2016 audio tagging challenge, our proposed method obtains a significant equal error rate (EER) reduction from 0.21 to 0.13 on the development set. The proposed aDAE system can get a relative 6.7% EER reduction compared with the strong DNN baseline on the development set. Finally, the results also show that our approach obtains the state-of-the-art performance with 0.15 EER on the evaluation set of the DCASE 2016 audio tagging task while EER of the first prize of this challenge is 0.17.
△ Less
Submitted 29 November, 2016; v1 submitted 13 July, 2016;
originally announced July 2016.
-
Developing and Testing a Bayesian Analysis of Fluorescence Lifetime Measurements
Authors:
Bryan Kaye,
Peter J. Foster,
Tae Yeon Yoo,
Daniel J. Needleman
Abstract:
FRET measurements can provide dynamic spatial information on length scales smaller than the diffraction limit of light. Several methods exist to measure FRET between fluorophores, including Fluorescence Lifetime Imaging Microscopy (FLIM), which relies on the reduction of fluorescence lifetime when a fluorophore is undergoing FRET. FLIM measurements take the form of histograms of photon arrival tim…
▽ More
FRET measurements can provide dynamic spatial information on length scales smaller than the diffraction limit of light. Several methods exist to measure FRET between fluorophores, including Fluorescence Lifetime Imaging Microscopy (FLIM), which relies on the reduction of fluorescence lifetime when a fluorophore is undergoing FRET. FLIM measurements take the form of histograms of photon arrival times, containing contributions from a mixed population of fluorophores both undergoing and not undergoing FRET, with the measured distribution being a mixture of exponentials of different lifetimes. Here, we present an analysis method based on Bayesian inference that rigorously takes into account several experimental complications. We test the precision and accuracy of our analysis on controlled experimental data and verify that we can faithfully extract model parameters, both in the low-photon and low-fraction regimes.
△ Less
Submitted 11 July, 2016;
originally announced July 2016.
-
Kernel ridge vs. principal component regression: minimax bounds and adaptability of regularization operators
Authors:
Lee H. Dicker,
Dean P. Foster,
Daniel Hsu
Abstract:
Regularization is an essential element of virtually all kernel methods for nonparametric regression problems. A critical factor in the effectiveness of a given kernel method is the type of regularization that is employed. This article compares and contrasts members from a general class of regularization techniques, which notably includes ridge regression and principal component regression. We deri…
▽ More
Regularization is an essential element of virtually all kernel methods for nonparametric regression problems. A critical factor in the effectiveness of a given kernel method is the type of regularization that is employed. This article compares and contrasts members from a general class of regularization techniques, which notably includes ridge regression and principal component regression. We derive an explicit finite-sample risk bound for regularization-based estimators that simultaneously accounts for (i) the structure of the ambient function space, (ii) the regularity of the true regression function, and (iii) the adaptability (or qualification) of the regularization. A simple consequence of this upper bound is that the risk of the regularization-based estimators matches the minimax rate in a variety of settings. The general bound also illustrates how some regularization techniques are more adaptable than others to favorable regularity properties that the true regression function may possess. This, in particular, demonstrates a striking difference between kernel ridge regression and kernel principal component regression. Our theoretical results are supported by numerical experiments.
△ Less
Submitted 27 May, 2016;
originally announced May 2016.
-
Electrically pumped single-defect light emitters in WSe$_2$
Authors:
S. Schwarz,
A. Kozikov,
F. Withers,
J. K. Maguire,
A. P. Foster,
S. Dufferwiel,
L. Hague,
M. N. Makhonin,
L. R. Wilson,
A . K. Geim,
K. S. Novoselov,
A. I. Tartakovskii
Abstract:
Recent developments in fabrication of van der Waals heterostructures enable new type of devices assembled by stacking atomically thin layers of two-dimensional materials. Using this approach, we fabricate light-emitting devices based on a monolayer WSe$_2$, and also comprising boron nitride tunnelling barriers and graphene electrodes, and observe sharp luminescence spectra from individual defects…
▽ More
Recent developments in fabrication of van der Waals heterostructures enable new type of devices assembled by stacking atomically thin layers of two-dimensional materials. Using this approach, we fabricate light-emitting devices based on a monolayer WSe$_2$, and also comprising boron nitride tunnelling barriers and graphene electrodes, and observe sharp luminescence spectra from individual defects in WSe$_2$ under both optical and electrical excitation. This paves the way towards the realization of electrically-pumped quantum emitters in atomically thin semiconductors. In addition we demonstrate tuning by more than 1 meV of the emission energy of the defect luminescence by applying a vertical electric field. This provides an estimate of the permanent electric dipole created by the corresponding electron-hole pair. The light-emitting devices investigated in our work can be assembled on a variety of substrates enabling a route to integration of electrically pumped single quantum emitters with existing technologies in nano-photonics and optoelectronics.
△ Less
Submitted 6 May, 2016;
originally announced May 2016.
-
Fitting High-Dimensional Interaction Models with Error Control
Authors:
Kory D. Johnson,
Robert A. Stine,
Dean P. Foster
Abstract:
There is a renewed interest in polynomial regression in the form of identifying influential interactions between features. In many settings, this takes place in a high-dimensional model, making the number of interactions unwieldy or computationally infeasible. Furthermore, it is difficult to analyze such spaces directly as they are often highly correlated. Standard feature selection issues remain…
▽ More
There is a renewed interest in polynomial regression in the form of identifying influential interactions between features. In many settings, this takes place in a high-dimensional model, making the number of interactions unwieldy or computationally infeasible. Furthermore, it is difficult to analyze such spaces directly as they are often highly correlated. Standard feature selection issues remain such as how to determine a final model which generalizes well. This paper solves these problems with a sequential algorithm called Revisiting Alpha-Investing (RAI). RAI is motivated by the principle of marginality and searches the feature-space of higher-order interactions by greedily building upon lower-order terms. RAI controls a notion of false rejections and comes with a performance guarantee relative to the best-subset model. This ensures that signal is identified while providing a valid stopping criterion to prevent over-selection. We apply RAI in a novel setting over a family of regressions in order to select gene-specific interaction models for differential expression profiling.
△ Less
Submitted 18 February, 2020; v1 submitted 21 October, 2015;
originally announced October 2015.
-
A Risk Ratio Comparison of $l_0$ and $l_1$ Penalized Regression
Authors:
Kory D. Johnson,
Dongyu Lin,
Lyle H. Ungar,
Dean P. Foster,
Robert A. Stine
Abstract:
There has been an explosion of interest in using $l_1$-regularization in place of $l_0$-regularization for feature selection. We present theoretical results showing that while $l_1$-penalized linear regression never outperforms $l_0$-regularization by more than a constant factor, in some cases using an $l_1$ penalty is infinitely worse than using an $l_0$ penalty. We also show that the "optimal"…
▽ More
There has been an explosion of interest in using $l_1$-regularization in place of $l_0$-regularization for feature selection. We present theoretical results showing that while $l_1$-penalized linear regression never outperforms $l_0$-regularization by more than a constant factor, in some cases using an $l_1$ penalty is infinitely worse than using an $l_0$ penalty. We also show that the "optimal" $l_1$ solutions are often inferior to $l_0$ solutions found using stepwise regression.
We also compare algorithms for solving these two problems and show that although solutions can be found efficiently for the $l_1$ problem, the "optimal" $l_1$ solutions are often inferior to $l_0$ solutions found using greedy classic stepwise regression. Furthermore, we show that solutions obtained by solving the convex $l_1$ problem can be improved by selecting the best of the $l_1$ models (for different regularization penalties) by using an $l_0$ criterion. In other words, an approximate solution to the right problem can be better than the exact solution to the wrong problem.
△ Less
Submitted 21 October, 2015;
originally announced October 2015.
-
Submodularity in Statistics: Comparing the Success of Model Selection Methods
Authors:
Kory D. Johnson,
Robert A. Stine,
Dean P. Foster
Abstract:
We demonstrate the usefulness of submodularity in statistics as a characterization of the difficulty of the \emph{search} problem of feature selection. The search problem is the ability of a procedure to identify an informative set of features as opposed to the performance of the optimal set of features. Submodularity arises naturally in this setting due to its connection to combinatorial optimiza…
▽ More
We demonstrate the usefulness of submodularity in statistics as a characterization of the difficulty of the \emph{search} problem of feature selection. The search problem is the ability of a procedure to identify an informative set of features as opposed to the performance of the optimal set of features. Submodularity arises naturally in this setting due to its connection to combinatorial optimization. In statistics, submodularity isolates cases in which collinearity makes the choice of model features difficult from those in which this task is routine. Researchers often report the signal-to-noise ratio to measure the difficulty of simulated data examples. A measure of submodularity should also be provided as it characterizes an independent component difficulty. Furthermore, it is closely related to other statistical assumptions used in the development of the Lasso, Dantzig selector, and sure information screening.
△ Less
Submitted 13 May, 2016; v1 submitted 21 October, 2015;
originally announced October 2015.
-
Orbiting Radiation Stars
Authors:
Dean P. Foster,
John Langford,
Gabe Perez-Giz
Abstract:
We study a numerical solution to Einstein's equation for a compact object composed of null particles. The solution avoids quantum scale regimes and hence neither relies upon nor ignores the interaction of quantum mechanics and gravitation. The solution exhibits a deep gravitational well yet remains singularity free. In fact, the solution is geometrically flat in the vicinity of the origin with the…
▽ More
We study a numerical solution to Einstein's equation for a compact object composed of null particles. The solution avoids quantum scale regimes and hence neither relies upon nor ignores the interaction of quantum mechanics and gravitation. The solution exhibits a deep gravitational well yet remains singularity free. In fact, the solution is geometrically flat in the vicinity of the origin with the flat region being of any desirable scale. The solution is also observationally distinct from a black hole because a photon from infinity aimed at an object centered on the origin passes through the origin and escapes to infinity with a time delay.
△ Less
Submitted 20 March, 2016; v1 submitted 23 June, 2015;
originally announced June 2015.
-
Large scale canonical correlation analysis with iterative least squares
Authors:
Yichao Lu,
Dean P. Foster
Abstract:
Canonical Correlation Analysis (CCA) is a widely used statistical tool with both well established theory and favorable performance for a wide range of machine learning problems. However, computing CCA for huge datasets can be very slow since it involves implementing QR decomposition or singular value decomposition of huge matrices. In this paper we introduce L-CCA, a iterative algorithm which can…
▽ More
Canonical Correlation Analysis (CCA) is a widely used statistical tool with both well established theory and favorable performance for a wide range of machine learning problems. However, computing CCA for huge datasets can be very slow since it involves implementing QR decomposition or singular value decomposition of huge matrices. In this paper we introduce L-CCA, a iterative algorithm which can compute CCA fast on huge sparse datasets. Theory on both the asymptotic convergence and finite time accuracy of L-CCA are established. The experiments also show that L-CCA outperform other fast CCA approximation schemes on two real datasets.
△ Less
Submitted 30 December, 2014; v1 submitted 16 July, 2014;
originally announced July 2014.
-
Identifying Cover Songs Using Information-Theoretic Measures of Similarity
Authors:
Peter Foster,
Simon Dixon,
Anssi Klapuri
Abstract:
This paper investigates methods for quantifying similarity between audio signals, specifically for the task of of cover song detection. We consider an information-theoretic approach, where we compute pairwise measures of predictability between time series. We compare discrete-valued approaches operating on quantised audio features, to continuous-valued approaches. In the discrete case, we propose…
▽ More
This paper investigates methods for quantifying similarity between audio signals, specifically for the task of of cover song detection. We consider an information-theoretic approach, where we compute pairwise measures of predictability between time series. We compare discrete-valued approaches operating on quantised audio features, to continuous-valued approaches. In the discrete case, we propose a method for computing the normalised compression distance, where we account for correlation between time series. In the continuous case, we propose to compute information-based measures of similarity as statistics of the prediction error between time series. We evaluate our methods on two cover song identification tasks using a data set comprised of 300 Jazz standards and using the Million Song Dataset. For both datasets, we observe that continuous-valued approaches outperform discrete-valued approaches. We consider approaches to estimating the normalised compression distance (NCD) based on string compression and prediction, where we observe that our proposed normalised compression distance with alignment (NCDA) improves average performance over NCD, for sequential compression algorithms. Finally, we demonstrate that continuous-valued distances may be combined to improve performance with respect to baseline approaches. Using a large-scale filter-and-refine approach, we demonstrate state-of-the-art performance for cover song identification using the Million Song Dataset.
△ Less
Submitted 17 May, 2015; v1 submitted 9 July, 2014;
originally announced July 2014.
-
Fast Ridge Regression with Randomized Principal Component Analysis and Gradient Descent
Authors:
Yichao Lu,
Dean P. Foster
Abstract:
We propose a new two stage algorithm LING for large scale regression problems. LING has the same risk as the well known Ridge Regression under the fixed design setting and can be computed much faster. Our experiments have shown that LING performs well in terms of both prediction accuracy and computational efficiency compared with other large scale regression algorithms like Gradient Descent, Stoch…
▽ More
We propose a new two stage algorithm LING for large scale regression problems. LING has the same risk as the well known Ridge Regression under the fixed design setting and can be computed much faster. Our experiments have shown that LING performs well in terms of both prediction accuracy and computational efficiency compared with other large scale regression algorithms like Gradient Descent, Stochastic Gradient Descent and Principal Component Regression on both simulated and real datasets.
△ Less
Submitted 15 May, 2014;
originally announced May 2014.