Showing 1–39 of 39 results for author: Baxter, J

Searching in archive cs.
  1. arXiv:2404.14460  [pdf, other]

    stat.ML cs.LG stat.ME

    Inference of Causal Networks using a Topological Threshold

    Authors: Filipe Barroso, Diogo Gomes, Gareth J. Baxter

    Abstract: We propose a constraint-based algorithm, which automatically determines causal relevance thresholds, to infer causal networks from data. We call these topological thresholds. We present two methods for determining the threshold: the first seeks a set of edges that leaves no disconnected nodes in the network; the second seeks a causal large connected component in the data. We tested these methods…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 17 pages, 12 figures

  2. arXiv:2402.14045  [pdf, other]

    eess.IV cs.CV cs.LG

    A Systematic Review of Low-Rank and Local Low-Rank Matrix Approximation in Big Data Medical Imaging

    Authors: Sisipho Hamlomo, Marcellin Atemkeng, Yusuf Brima, Chuneeta Nunhokee, Jeremy Baxter

    Abstract: The large volume and complexity of medical imaging datasets are bottlenecks for storage, transmission, and processing. To tackle these challenges, the application of low-rank matrix approximation (LRMA) and its derivative, local LRMA (LLRMA) has demonstrated potential. A detailed analysis of the literature identifies LRMA and LLRMA methods applied to various imaging modalities, and the challenges…

    Submitted 27 May, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  3. arXiv:2402.06194  [pdf, other]

    cs.DC

    SuperBench: Improving Cloud AI Infrastructure Reliability with Proactive Validation

    Authors: Yifan Xiong, Yuting Jiang, Ziyue Yang, Lei Qu, Guoshuai Zhao, Shuguang Liu, Dong Zhong, Boris Pinzur, Jie Zhang, Yang Wang, Jithin Jose, Hossein Pourreza, Jeff Baxter, Kushal Datta, Prabhat Ram, Luke Melton, Joe Chau, Peng Cheng, Yongqiang Xiong, Lidong Zhou

    Abstract: Reliability in cloud AI infrastructure is crucial for cloud service providers, prompting the widespread use of hardware redundancies. However, these redundancies can inadvertently lead to hidden degradation, so-called "gray failure", for AI workloads, significantly affecting end-to-end performance and concealing performance issues, which complicates root cause analysis for failures and regressions…

    Submitted 7 June, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: USENIX ATC '24

  4. arXiv:2210.15723  [pdf, other]

    cs.SI

    Birdwatch: Crowd Wisdom and Bridging Algorithms can Inform Understanding and Reduce the Spread of Misinformation

    Authors: Stefan Wojcik, Sophie Hilgard, Nick Judd, Delia Mocanu, Stephen Ragain, M. B. Fallin Hunzaker, Keith Coleman, Jay Baxter

    Abstract: We present an approach for selecting objectively informative and subjectively helpful annotations to social media posts. We draw on data from an online environment where contributors annotate misinformation and simultaneously rate the contributions of others. Our algorithm uses a matrix-factorization (MF) based approach to identify annotations that appeal broadly across heterogeneous user group…

    Submitted 27 October, 2022; originally announced October 2022.

  5. arXiv:2208.01607  [pdf]

    cs.LG cs.IR

    Enabling scalable clinical interpretation of ML-based phenotypes using real world data

    Authors: Owen Parsons, Nathan E Barlow, Janie Baxter, Karen Paraschin, Andrea Derix, Peter Hein, Robert Dürichen

    Abstract: The availability of large and deep electronic healthcare records (EHR) datasets has the potential to enable a better understanding of real-world patient journeys, and to identify novel subgroups of patients. ML-based aggregation of EHR data is mostly tool-driven, i.e., building on available or newly developed methods. However, these methods, their input requirements, and, importantly, resulting ou…

    Submitted 2 August, 2022; originally announced August 2022.

    Comments: 27 pages, 14 figures

    MSC Class: 92C50; 92C60; 68T09; 62H30 ACM Class: J.3; I.5; I.2

  6. arXiv:2112.07239  [pdf, other]

    cs.LG

    Compensating trajectory bias for unsupervised patient stratification using adversarial recurrent neural networks

    Authors: Avelino Javer, Owen Parsons, Oliver Carr, Janie Baxter, Christian Diedrich, Eren Elçi, Steffen Schaper, Katrin Coboeken, Robert Dürichen

    Abstract: Electronic healthcare records are an important source of information which can be used in patient stratification to discover novel disease phenotypes. However, they can be challenging to work with as data is often sparse and irregularly sampled. One approach to solve these limitations is learning dense embeddings that represent individual patient trajectories using a recurrent neural network autoe…

    Submitted 14 December, 2021; originally announced December 2021.

  7. The Ethics of Biosurveillance

    Authors: S. K. Devitt, P. W. J. Baxter, G. Hamilton

    Abstract: Governments must keep agricultural systems free of pests that threaten agricultural production and international trade. Biosecurity surveillance already makes use of a wide range of technologies, such as insect traps and lures, geographic information systems, and diagnostic biochemical tests. The rise of cheap and usable surveillance technologies such as remotely piloted aircraft systems (RPAS) pr…

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: 32 pages, 2 figures, 2019, Journal of Agricultural and Environmental Ethics

    ACM Class: K.4.0; K.4.1; K.4.2; K.4.3

    Journal ref: Journal of Agricultural and Environmental Ethics, 32(5), 709-740 (2019)

  8. arXiv:2105.09293  [pdf, other]

    cs.IR

    Lessons Learned Addressing Dataset Bias in Model-Based Candidate Generation at Twitter

    Authors: Alim Virani, Jay Baxter, Dan Shiebler, Philip Gautier, Shivam Verma, Yan Xia, Apoorv Sharma, Sumit Binnani, Linlin Chen, Chenguang Yu

    Abstract: Traditionally, heuristic methods are used to generate candidates for large scale recommender systems. Model-based candidate generation promises multiple potential advantages, primarily that we can explicitly optimize the same objective as the downstream ranking model. However, large scale model-based candidate generation approaches suffer from dataset bias problems caused by the infeasibility of o…

    Submitted 13 May, 2021; originally announced May 2021.

  9. arXiv:2012.13233  [pdf, other]

    cs.LG

    Deep Semi-Supervised Embedded Clustering (DSEC) for Stratification of Heart Failure Patients

    Authors: Oliver Carr, Stojan Jovanovic, Luca Albergante, Fernando Andreotti, Robert Dürichen, Nadia Lipunova, Janie Baxter, Rabia Khan, Benjamin Irving

    Abstract: Determining phenotypes of diseases can have considerable benefits for in-hospital patient care and for drug development. The structure of high dimensional data sets such as electronic health records is often represented through an embedding of the data, with clustering methods used to group data of similar structure. If subgroups are known to exist within data, supervised methods may be used to in…

    Submitted 17 January, 2021; v1 submitted 24 December, 2020; originally announced December 2020.

    Comments: 6 pages, 2 figures, HSYS workshop at ICML conference

  10. Theoretical Models of Learning to Learn

    Authors: Jonathan Baxter

    Abstract: A machine can only learn if it is biased in some way. Typically the bias is supplied by hand, for example through the choice of an appropriate set of features. However, if the learning machine is embedded within an {\em environment} of related tasks, then it can {\em learn} its own bias by learning sufficiently many tasks from the environment. In this paper two models of bias learning (or equivale…

    Submitted 27 February, 2020; originally announced February 2020.

    Comments: arXiv admin note: text overlap with arXiv:1106.0245

    Journal ref: in Learning to Learn (edited by Sebastian Thrun and Lorien Pratt), 159-179 (1998)

  11. arXiv:1912.05915  [pdf, ps, other]

    cs.LG stat.ML

    Some observations concerning Off Training Set (OTS) error

    Authors: Jonathan Baxter

    Abstract: A form of generalisation error known as Off Training Set (OTS) error was recently introduced in [Wolpert, 1996b], along with a theorem showing that small training set error does not guarantee small OTS error, unless assumptions are made about the target function. Here it is shown that the applicability of this theorem is limited to models in which the distribution generating training data has no o…

    Submitted 17 November, 2019; originally announced December 2019.

    Comments: Technical Report, Australian National University, August 1999

  12. General Matrix-Matrix Multiplication Using SIMD features of the PIII

    Authors: Douglas Aberdeen, Jonathan Baxter

    Abstract: Generalised matrix-matrix multiplication forms the kernel of many mathematical algorithms. A faster matrix-matrix multiply immediately benefits these algorithms. In this paper we implement efficient matrix multiplication for large matrices using the floating point Intel Pentium SIMD (Single Instruction Multiple Data) architecture. A description of the issues and our solution is presented, paying a…

    Submitted 18 November, 2019; originally announced December 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1911.05181

    Journal ref: Euro-Par '00 Proceedings from the 6th International Euro-Par Conference on Parallel Processing (2000) Pages 980-983

  13. arXiv:1911.07247  [pdf, ps, other]

    cs.LG cs.NE stat.ML

    Hebbian Synaptic Modifications in Spiking Neurons that Learn

    Authors: Peter L. Bartlett, Jonathan Baxter

    Abstract: In this paper, we derive a new model of synaptic plasticity, based on recent algorithms for reinforcement learning (in which an agent attempts to learn appropriate actions to maximize its long-term average reward). We show that these direct reinforcement learning algorithms also give locally optimal performance for the problem of reinforcement learning with multiple agents, without any explicit co…

    Submitted 17 November, 2019; originally announced November 2019.

  14. The Canonical Distortion Measure for Vector Quantization and Function Approximation

    Authors: Jonathan Baxter

    Abstract: To measure the quality of a set of vector quantization points, a means of measuring the distance between a random point and its quantization is required. Common metrics such as the {\em Hamming} and {\em Euclidean} metrics, while mathematically simple, are inappropriate for comparing natural signals such as speech or images. In this paper it is shown how an {\em environment} of functions on an inpu…

    Submitted 14 November, 2019; originally announced November 2019.

    Journal ref: In: Thrun S., Pratt L. (eds) Learning to Learn (1998). Pages 159-177

  15. arXiv:1911.06164  [pdf, ps, other]

    cs.LG stat.ML

    Learning Model Bias

    Authors: Jonathan Baxter

    Abstract: In this paper the problem of {\em learning} appropriate domain-specific bias is addressed. It is shown that this can be achieved by learning many related tasks from the same domain, and a theorem is given bounding the number of tasks that must be learnt. A corollary of the theorem is that if the tasks are known to possess a common {\em internal representation} or {\em preprocessing} then the number o…

    Submitted 14 November, 2019; originally announced November 2019.

    Journal ref: Advances in Neural Information Processing Systems 8, 1995, 169-175

  16. A Bayesian/Information Theoretic Model of Bias Learning

    Authors: Jonathan Baxter

    Abstract: In this paper the problem of learning appropriate bias for an environment of related tasks is examined from a Bayesian perspective. The environment of related tasks is shown to be naturally modelled by the concept of an {\em objective} prior distribution. Sampling from the objective prior corresponds to sampling different learning tasks from the environment. It is argued that for many common machi…

    Submitted 14 November, 2019; originally announced November 2019.

    Journal ref: COLT 96 Proceedings of the ninth annual conference on Computational learning theory (1996) Pages 77-88

  17. Learning Internal Representations (COLT 1995)

    Authors: Jonathan Baxter

    Abstract: Probably the most important problem in machine learning is the preliminary biasing of a learner's hypothesis space so that it is small enough to ensure good generalisation from reasonable training sets, yet large enough that it contains a good solution to the problem being learnt. In this paper a mechanism for {\em automatically} learning or biasing the learner's hypothesis space is introduced. It…

    Submitted 19 December, 2019; v1 submitted 13 November, 2019; originally announced November 2019.

    Journal ref: COLT '95 Proceedings of the eighth annual conference on Computational learning theory (1995) 311-320

  18. arXiv:1911.05181  [pdf, other]

    cs.LG cs.DC cs.PF stat.ML

    92c/MFlops/s, Ultra-Large-Scale Neural-Network Training on a PIII Cluster

    Authors: Douglas Aberdeen, Jonathan Baxter, Robert Edwards

    Abstract: Artificial neural networks with millions of adjustable parameters and a similar number of training examples are a potential solution for difficult, large-scale pattern recognition problems in areas such as speech and face recognition, classification of large volumes of web data, and finance. The bottleneck is that neural network training involves iterative gradient descent and is extremely computa…

    Submitted 12 November, 2019; originally announced November 2019.

    Comments: SC '00: Proceedings of the 2000 ACM/IEEE Conference on Supercomputing

    Journal ref: ACM/IEEE SC 2000 Conference (SC00)

  19. arXiv:1911.03731  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    Learning Internal Representations (PhD Thesis)

    Authors: Jonathan Baxter

    Abstract: Most machine learning theory and practice is concerned with learning a single task. In this thesis it is argued that in general there is insufficient information in a single task for a learner to generalise well and that what is required for good generalisation is information about many similar learning tasks. Similar learning tasks form a body of prior information that can be used to constrain th…

    Submitted 22 November, 2019; v1 submitted 9 November, 2019; originally announced November 2019.

    Comments: PhD Thesis, Jonathan Baxter, 1994

  20. arXiv:1907.07974  [pdf, ps, other]

    cs.LO

    Priorities in tock-CSP

    Authors: Pedro Ribeiro, James Baxter, Ana Cavalcanti

    Abstract: The $tock$-CSP encoding embeds a rich and flexible approach to modelling discrete timed behaviours in CSP where the event $tock$ is interpreted to mark the passage of time. The model checker FDR provides tailored support for $tock$-CSP, including a prioritisation operator that has typically been used to ensure maximal progress, where time only advances after internal activity has stabilised. Prior…

    Submitted 18 July, 2019; originally announced July 2019.

    Comments: 9 pages, submitted to Information Processing Letters, July 2019

  21. Unifying Semantic Foundations for Automated Verification Tools in Isabelle/UTP

    Authors: Simon Foster, James Baxter, Ana Cavalcanti, Jim Woodcock, Frank Zeyda

    Abstract: The growing complexity and diversity of models used in the engineering of dependable systems implies that a variety of formal methods, across differing abstractions, paradigms, and presentations, must be integrated. Such an integration relies on unified semantic foundations for the various notations, and co-ordination of a variety of automated verification tools. The contribution of this paper is…

    Submitted 22 June, 2020; v1 submitted 14 May, 2019; originally announced May 2019.

    Comments: 40 pages, Accepted for Science of Computer Programming, June 2020

  22. arXiv:1809.07703  [pdf, other]

    cs.SI cs.LG stat.ML

    Fighting Redundancy and Model Decay with Embeddings

    Authors: Dan Shiebler, Luca Belli, Jay Baxter, Hanchen Xiong, Abhishek Tayal

    Abstract: Every day, hundreds of millions of new Tweets containing over 40 languages of ever-shifting vernacular flow through Twitter. Models that attempt to extract insight from this firehose of information must face the torrential covariate shift that is endemic to the Twitter platform. While regularly-retrained algorithms can maintain performance in the face of this shift, fixed model features that fail…

    Submitted 18 September, 2018; originally announced September 2018.

    Comments: Presented at the Common Model Infrastructure Workshop at KDD 2018 (link: https://cmi2018.sdsc.edu/)

  23. Automating Verification of State Machines with Reactive Designs and Isabelle/UTP

    Authors: Simon Foster, James Baxter, Ana Cavalcanti, Alvaro Miyazawa, Jim Woodcock

    Abstract: State-machine based notations are ubiquitous in the description of component systems, particularly in the robotic domain. To ensure these systems are safe and predictable, formal verification techniques are important, and can be cost-effective if they are both automated and scalable. In this paper, we present a verification approach for a diagrammatic state machine language that utilises theorem p…

    Submitted 24 August, 2018; v1 submitted 23 July, 2018; originally announced July 2018.

    Comments: 18 pages, 16th Intl. Conf. on Formal Aspects of Component Software (FACS 2018), October 2018, Pohang, South Korea

  24. arXiv:1806.10698  [pdf, other]

    cs.AI cs.LG stat.AP stat.ML

    A comparative study of artificial intelligence and human doctors for the purpose of triage and diagnosis

    Authors: Salman Razzaki, Adam Baker, Yura Perov, Katherine Middleton, Janie Baxter, Daniel Mullarkey, Davinder Sangar, Michael Taliercio, Mobasher Butt, Azeem Majeed, Arnold DoRosario, Megan Mahoney, Saurabh Johri

    Abstract: Online symptom checkers have significant potential to improve patient care; however, their reliability and accuracy remain variable. We hypothesised that an artificial intelligence (AI) powered triage and diagnostic system would compare favourably with human doctors with respect to triage and diagnostic accuracy. We performed a prospective validation study of the accuracy and safety of an AI powere…

    Submitted 27 June, 2018; originally announced June 2018.

  25. arXiv:1802.03992  [pdf, ps, other]

    physics.soc-ph cond-mat.dis-nn cs.SI

    Targeted Damage to Interdependent Networks

    Authors: G. J. Baxter, G. Timár, J. F. F. Mendes

    Abstract: The giant mutually connected component (GMCC) of an interdependent or multiplex network collapses with a discontinuous hybrid transition under random damage to the network. If the nodes to be damaged are selected in a targeted way, the collapse of the GMCC may occur significantly sooner. Finding the minimal damage set which destroys the largest mutually connected component of a given interdependen…

    Submitted 24 September, 2018; v1 submitted 12 February, 2018; originally announced February 2018.

    Comments: 9 pages, 9 figures

    Journal ref: Phys. Rev. E 98, 032307 (2018)

  26. arXiv:1512.05006  [pdf, other]

    cs.AI

    BayesDB: A probabilistic programming system for querying the probable implications of data

    Authors: Vikash Mansinghka, Richard Tibbetts, Jay Baxter, Pat Shafto, Baxter Eaves

    Abstract: Is it possible to make statistical inference broadly accessible to non-statisticians without sacrificing mathematical rigor or inference quality? This paper describes BayesDB, a probabilistic programming platform that aims to enable users to query the probable implications of their data as directly as SQL databases enable them to query the data itself. This paper focuses on four aspects of BayesDB…

    Submitted 15 December, 2015; originally announced December 2015.

  27. arXiv:1511.03629  [pdf, other]

    cs.CV

    A Continuous Max-Flow Approach to Cyclic Field Reconstruction

    Authors: John S. H. Baxter, Jonathan McLeod, Terry M. Peters

    Abstract: Reconstruction of an image from noisy data using Markov Random Field theory has been explored by both the graph-cuts and continuous max-flow community in the form of the Potts and Ishikawa models. However, neither model takes into account the particular cyclic topology of specific intensity types such as the hue in natural colour images, or the phase in complex valued MRI. This paper presents \tex…

    Submitted 11 November, 2015; originally announced November 2015.

    Comments: 8 pages, 1 figure

  28. arXiv:1510.04706  [pdf, other]

    cs.CV

    Shape Complexes in Continuous Max-Flow Hierarchical Multi-Labeling Problems

    Authors: John S. H. Baxter, Jing Yuan, Terry M. Peters

    Abstract: Although topological considerations amongst multiple labels have been previously investigated in the context of continuous max-flow image segmentation, similar investigations have yet to be made about shape considerations in a general and extendable manner. This paper presents shape complexes for segmentation, which capture more complex shapes by combining multiple labels and super-labels constrai…

    Submitted 15 October, 2015; originally announced October 2015.

    Comments: 9 pages, 1 figure

  29. arXiv:1501.07844  [pdf, ps, other]

    cs.CV

    A Proximal Bregman Projection Approach to Continuous Max-Flow Problems Using Entropic Distances

    Authors: John S. H. Baxter, Martin Rajchl, Jing Yuan, Terry M. Peters

    Abstract: One issue limiting the adaption of large-scale multi-region segmentation is the sometimes prohibitive memory requirements. This is especially troubling considering advances in massively parallel computing and commercial graphics processing units because of their already limited memory compared to the current random access memory used in more traditional computation. To address this issue in the fi…

    Submitted 30 January, 2015; originally announced January 2015.

    Comments: 10 pages

  30. arXiv:1405.0892  [pdf, ps, other]

    cs.CV

    A Continuous Max-Flow Approach to Multi-Labeling Problems under Arbitrary Region Regularization

    Authors: John S. H. Baxter, Martin Rajchl, Jing Yuan, Terry M. Peters

    Abstract: The incorporation of region regularization into max-flow segmentation has traditionally focused on ordering and part-whole relationships. A side effect of the development of such models is that it constrained regularization only to those cases, rather than allowing for arbitrary region regularization. Directed Acyclic Graphical Max-Flow (DAGMF) segmentation overcomes these limitations by allowing…

    Submitted 5 June, 2014; v1 submitted 5 May, 2014; originally announced May 2014.

    Comments: 10 pages, 2 figures, 3 algorithms - v2: Fixed typos / grammatical errors and mathematical errors in the primal/dual formulation. Extended methods for weighted DAGs rather than DAGs with edge multiplicity

  31. arXiv:1404.2571  [pdf, other]

    cs.CV

    RANCOR: Non-Linear Image Registration with Total Variation Regularization

    Authors: Martin Rajchl, John S. H. Baxter, Wu Qiu, Ali R. Khan, Aaron Fenster, Terry M. Peters, Jing Yuan

    Abstract: Optimization techniques have been widely used in deformable registration, allowing for the incorporation of similarity metrics with regularization mechanisms. These regularization mechanisms are designed to mitigate the effects of trivial solutions to ill-posed registration problems and to otherwise ensure the resulting deformation fields are well-behaved. This paper introduces a novel deformable…

    Submitted 9 April, 2014; originally announced April 2014.

    Comments: 9 pages, 1 figure, technical note

  32. arXiv:1404.0336  [pdf, ps, other]

    cs.CV

    A Continuous Max-Flow Approach to General Hierarchical Multi-Labeling Problems

    Authors: John S. H. Baxter, Martin Rajchl, Jing Yuan, Terry M. Peters

    Abstract: Multi-region segmentation algorithms often have the onus of incorporating complex anatomical knowledge representing spatial or geometric relationships between objects, and general-purpose methods of addressing this knowledge in an optimization-based manner have thus been lacking. This paper presents Generalized Hierarchical Max-Flow (GHMF) segmentation, which captures simple anatomical part-whole…

    Submitted 5 June, 2014; v1 submitted 1 April, 2014; originally announced April 2014.

    Comments: 11 pages, 1 figure, 3 algorithms -v2: Fixed typos / grammatical errors

  33. arXiv:1312.3814  [pdf, other]

    cond-mat.dis-nn cs.CR math.PR physics.soc-ph

    Weak percolation on multiplex networks

    Authors: Gareth J. Baxter, Sergey N. Dorogovtsev, José F. F. Mendes, Davide Cellai

    Abstract: Bootstrap percolation is a simple but non-trivial model. It has applications in many areas of science and has been explored on random networks for several decades. In single layer (simplex) networks, it has been recently observed that bootstrap percolation, which is defined as an incremental process, can be seen as the opposite of pruning percolation, where nodes are removed according to a connect…

    Submitted 13 April, 2014; v1 submitted 13 December, 2013; originally announced December 2013.

    Comments: 14 pages, 12 figures

    Journal ref: Phys. Rev. E 89, 042801 (2014)

  34. arXiv:1106.0666  [pdf, ps, other]

    cs.AI cs.LG

    Experiments with Infinite-Horizon, Policy-Gradient Estimation

    Authors: J. Baxter, P. L. Bartlett, L. Weaver

    Abstract: In this paper, we present algorithms that perform gradient ascent of the average reward in a partially observable Markov decision process (POMDP). These algorithms are based on GPOMDP, an algorithm introduced in a companion paper (Baxter and Bartlett, this volume), which computes biased estimates of the performance gradient in POMDPs. The algorithm's chief advantages are that it uses only one free…

    Submitted 13 November, 2019; v1 submitted 3 June, 2011; originally announced June 2011.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 15, pages 351-381, 2001

  35. Infinite-Horizon Policy-Gradient Estimation

    Authors: Jonathan Baxter, Peter L. Bartlett

    Abstract: Gradient-based approaches to direct policy search in reinforcement learning have received much recent attention as a means to solve problems of partial observability and to avoid some of the problems associated with policy degradation in value-function methods. In this paper we introduce GPOMDP, a simulation-based algorithm for generating a {\em biased} estimate of the gradient of the {\em average…

    Submitted 15 November, 2019; v1 submitted 3 June, 2011; originally announced June 2011.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 15, pages 319-350, 2001

  36. A Model of Inductive Bias Learning

    Authors: J. Baxter

    Abstract: A major problem in machine learning is that of inductive bias: how to choose a learner's hypothesis space so that it is large enough to contain a solution to the problem being learnt, yet small enough to ensure reliable generalization from reasonably-sized training sets. Typically such bias is supplied by hand through the skill and insights of experts. In this paper a model for aut…

    Submitted 1 June, 2011; originally announced June 2011.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 12, pages 149-198, 2000

  37. arXiv:0802.2306  [pdf, ps, other]

    cs.SE cs.PL

    Software graphs and programmer awareness

    Authors: G. J. Baxter, M. R. Frean

    Abstract: Dependencies between types in object-oriented software can be viewed as directed graphs, with types as nodes and dependencies as edges. The in-degree and out-degree distributions of such graphs have quite different forms, with the former resembling a power-law distribution and the latter an exponential distribution. This effect appears to be independent of application or type relationship. A sim…

    Submitted 15 February, 2008; originally announced February 2008.

    Comments: 9 pages, 8 figures

  38. arXiv:cs/9901002  [pdf, ps, other]

    cs.LG cs.AI

    KnightCap: A chess program that learns by combining TD(lambda) with game-tree search

    Authors: Jonathan Baxter, Andrew Tridgell, Lex Weaver

    Abstract: In this paper we present TDLeaf(lambda), a variation on the TD(lambda) algorithm that enables it to be used in conjunction with game-tree search. We present some experiments in which our chess program "KnightCap" used TDLeaf(lambda) to learn its evaluation function while playing on the Free Internet Chess Server (FICS, fics.onenet.net). The main success we report is that KnightCap improved fro…

    Submitted 9 January, 1999; originally announced January 1999.

    Comments: 9 pages

    ACM Class: I.2.6

    Journal ref: MACHINE LEARNING Proceedings of the Fifteenth International Conference (ICML '98), ISBN 1-55860-556-8, ISSN 1049-1910, Madison WISCONSIN, July 24-27 1998, pages 28-36

  39. arXiv:cs/9901001  [pdf, ps, other]

    cs.LG cs.AI

    TDLeaf(lambda): Combining Temporal Difference Learning with Game-Tree Search

    Authors: Jonathan Baxter, Andrew Tridgell, Lex Weaver

    Abstract: In this paper we present TDLeaf(lambda), a variation on the TD(lambda) algorithm that enables it to be used in conjunction with minimax search. We present some experiments in both chess and backgammon which demonstrate its utility and provide comparisons with TD(lambda) and another less radical variant, TD-directed(lambda). In particular, our chess program, "KnightCap," used TDLeaf(lambda) to…

    Submitted 4 January, 1999; originally announced January 1999.

    Comments: 5 pages. Also in Proceedings of the Ninth Australian Conference on Neural Networks (ACNN'98), Brisbane QLD, February 1998, pages 168-172

    ACM Class: I.2.6

    Journal ref: Australian Journal of Intelligent Information Processing Systems, ISSN 1321-2133, Vol. 5 No. 1, Autumn 1998, pages 39-43