Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Bayesian Programming

Download as pdf or txt
Download as pdf or txt
You are on page 1of 16

Bayesian programming

Bayesian programming is a formalism and a methodology for having a technique to specify probabilistic
models and solve problems when less than the necessary information is available.

Edwin T. Jaynes proposed that probability could be considered as an alternative and an extension of logic
for rational reasoning with incomplete and uncertain information. In his founding book Probability Theory:
The Logic of Science[1] he developed this theory and proposed what he called “the robot,” which was not a
physical device, but an inference engine to automate probabilistic reasoning—a kind of Prolog for
probability instead of logic. Bayesian programming[2] is a formal and concrete implementation of this
"robot".

Bayesian programming may also be seen as an algebraic formalism to specify graphical models such as, for
instance, Bayesian networks, dynamic Bayesian networks, Kalman filters or hidden Markov models.
Indeed, Bayesian Programming is more general than Bayesian networks and has a power of expression
equivalent to probabilistic factor graphs.[3]

Formalism
A Bayesian program is a means of specifying a family of probability distributions.

The constituent elements of a Bayesian program are presented below:[4]

1. A program is constructed from a description and a question.


2. A description is constructed using some specification ( ) as given by the programmer and an
identification or learning process for the parameters not completely specified by the
specification, using a data set ( ).
3. A specification is constructed from a set of pertinent variables, a decomposition and a set of
forms.
4. Forms are either parametric forms or questions to other Bayesian programs.
5. A question specifies which probability distribution has to be computed.

Description

The purpose of a description is to specify an effective method of computing a joint probability distribution
on a set of variables given a set of experimental data and some specification . This
joint distribution is denoted as: .[5]

To specify preliminary knowledge , the programmer must undertake the following:


1. Define the set of relevant variables on which the joint distribution is
defined.
2. Decompose the joint distribution (break it into relevant independent or conditional
probabilities).
3. Define the forms of each of the distributions (e.g., for each variable, one of the list of
probability distributions).

Decomposition

Given a partition of containing subsets, variables are defined ,


each corresponding to one of these subsets. Each variable is obtained as the conjunction of the variables
belonging to the subset. Recursive application of Bayes' theorem leads to:

Conditional independence hypotheses then allow further simplifications. A conditional independence


hypothesis for variable is defined by choosing some variable among the variables appearing in the
conjunction , labelling as the conjunction of these chosen variables and setting:

We then obtain:

Such a simplification of the joint distribution as a product of simpler distributions is called a decomposition,
derived using the chain rule.

This ensures that each variable appears at the most once on the left of a conditioning bar, which is the
necessary and sufficient condition to write mathematically valid decompositions.

Forms

Each distribution appearing in the product is then associated with either a parametric
form (i.e., a function ) or a question to another Bayesian program
.

When it is a form , in general, is a vector of parameters that may depend on or or both.


Learning takes place when some of these parameters are computed using the data set .
An important feature of Bayesian Programming is this capacity to use questions to other Bayesian programs
as components of the definition of a new Bayesian program. is obtained by some
inferences done by another Bayesian program defined by the specifications and the data . This is similar
to calling a subroutine in classical programming and provides an easy way to build hierarchical models.

Question

Given a description (i.e., ), a question is obtained by partitioning


into three sets: the searched variables, the known variables and the free variables.

The 3 variables , and are defined as the conjunction of the variables belonging
to these sets.

A question is defined as the set of distributions:

made of many "instantiated questions" as the cardinal of , each instantiated question being the
distribution:

Inference

Given the joint distribution , it is always possible to compute any


possible question using the following general inference:

where the first equality results from the marginalization rule, the second results from Bayes' theorem and
the third corresponds to a second application of marginalization. The denominator appears to be a
normalization term and can be replaced by a constant .

Theoretically, this allows to solve any Bayesian inference problem. In practice, however, the cost of
computing exhaustively and exactly is too great in almost all cases.

Replacing the joint distribution by its decomposition we get:


which is usually a much simpler expression to compute, as the dimensionality of the problem is
considerably reduced by the decomposition into a product of lower dimension distributions.

Example

Bayesian spam detection

The purpose of Bayesian spam filtering is to eliminate junk e-mails.

The problem is very easy to formulate. E-mails should be classified into one of two categories: non-spam or
spam. The only available information to classify the e-mails is their content: a set of words. Using these
words without taking the order into account is commonly called a bag of words model.

The classifier should furthermore be able to adapt to its user and to learn from experience. Starting from an
initial standard setting, the classifier should modify its internal parameters when the user disagrees with its
own decision. It will hence adapt to the user's criteria to differentiate between non-spam and spam. It will
improve its results as it encounters increasingly classified e-mails.

Variables

The variables necessary to write this program are as follows:

1. : a binary variable, false if the e-mail is not spam and true otherwise.
2. : binary variables. is true if the word of the dictionary is present
in the text.

These binary variables sum up all the information about an e-mail.

Decomposition

Starting from the joint distribution and applying recursively Bayes' theorem we obtain:

This is an exact mathematical expression.

It can be drastically simplified by assuming that the probability of appearance of a word knowing the nature
of the text (spam or not) is independent of the appearance of the other words. This is the naive Bayes
assumption and this makes this spam filter a naive Bayes model.

For instance, the programmer can assume that:


to finally obtain:

This kind of assumption is known as the naive Bayes' assumption. It is "naive" in the sense that the
independence between words is clearly not completely true. For instance, it completely neglects that the
appearance of pairs of words may be more significant than isolated appearances. However, the programmer
may assume this hypothesis and may develop the model and the associated inferences to test how reliable
and efficient it is.

Parametric forms

To be able to compute the joint distribution, the programmer must now specify the distributions
appearing in the decomposition:

1. is a prior defined, for instance, by


2. Each of the forms may be specified using Laplace rule of succession (this
is a pseudocounts-based smoothing technique to counter the zero-frequency problem of
words never-seen-before):

1.

2.

where stands for the number of appearances of the word in non-spam e-mails and stands for the
total number of non-spam e-mails. Similarly, stands for the number of appearances of the word in
spam e-mails and stands for the total number of spam e-mails.

Identification

The forms are not yet completely specified because the parameters
, , and have no values yet.

The identification of these parameters could be done either by batch processing a series of classified e-mails
or by an incremental updating of the parameters using the user's classifications of the e-mails as they arrive.

Both methods could be combined: the system could start with initial standard values of these parameters
issued from a generic database, then some incremental learning customizes the classifier to each individual
user.

Question

The question asked to the program is: "what is the probability for a given text to be spam knowing which
words appear and don't appear in this text?" It can be formalized by:
which can be computed as follows:

The denominator appears to be a normalization constant. It is not necessary to compute it to decide if we


are dealing with spam. For instance, an easy trick is to compute the ratio:

This computation is faster and easier because it requires only products.

Bayesian program

The Bayesian spam filter program is completely defined by:

Bayesian filter, Kalman filter and hidden Markov model


Bayesian filters (often called Recursive Bayesian estimation) are generic probabilistic models for time
evolving processes. Numerous models are particular instances of this generic approach, for instance: the
Kalman filter or the Hidden Markov model (HMM).

Variables

Variables are a time series of state variables considered to be on a time horizon


ranging from to .
Variables are a time series of observation variables on the same horizon.

Decomposition

The decomposition is based:

on , called the system model, transition model or dynamic model, which


formalizes the transition from the state at time to the state at time ;
on , called the observation model, which expresses what can be observed at time
when the system is in state ;
on an initial state at time : .

Parametrical forms

The parametrical forms are not constrained and different choices lead to different well-known models: see
Kalman filters and Hidden Markov models just below.

Question

The typical question for such models is : what is the probability distribution for
the state at time knowing the observations from instant to ?

The most common case is Bayesian filtering where , which searches for the present state, knowing
past observations.

However, it is also possible , to extrapolate a future state from past observations, or to do


smoothing , to recover a past state from observations made either before or after that instant.

More complicated questions may also be asked as shown below in the HMM section.

Bayesian filters have a very interesting recursive property, which contributes greatly to their
attractiveness. may be computed simply from with
the following formula:

Another interesting point of view for this equation is to consider that there are two phases: a prediction
phase and an estimation phase:
During the prediction phase, the state is predicted using the dynamic model and the
estimation of the state at the previous moment:

During the estimation phase, the prediction is either confirmed or invalidated using the last
observation:

Bayesian program

Kalman filter

The very well-known Kalman filters[6] are a special case of Bayesian filters.

They are defined by the following Bayesian program:


Variables are continuous.
The transition model and the observation model are both
specified using Gaussian laws with means that are linear functions of the conditioning
variables.

With these hypotheses and by using the recursive formula, it is possible to solve the inference problem
analytically to answer the usual question. This leads to an extremely
efficient algorithm, which explains the popularity of Kalman filters and the number of their everyday
applications.

When there are no obvious linear transition and observation models, it is still often possible, using a first-
order Taylor's expansion, to treat these models as locally linear. This generalization is commonly called the
extended Kalman filter.

Hidden Markov model

Hidden Markov models (HMMs) are another very popular specialization of Bayesian filters.

They are defined by the following Bayesian program:


Variables are treated as being discrete.
The transition model and the observation model are

both specified using probability matrices.

The question most frequently asked of HMMs is:

What is the most probable series of states that leads to the present state, knowing the past observations?

This particular question may be answered with a specific and very efficient algorithm called the Viterbi
algorithm.

The Baum–Welch algorithm has been developed for HMMs.

Applications

Academic applications

Since 2000, Bayesian programming has been used to develop both robotics applications and life sciences
models.[7]

Robotics

In robotics, bayesian programming was applied to autonomous robotics,[8][9][10][11][12] robotic CAD


systems,[13] advanced driver-assistance systems,[14] robotic arm control, mobile robotics,[15][16] human-
robot interaction,[17] human-vehicle interaction (Bayesian autonomous driver models)[18][19][20][21][22]
video game avatar programming and training [23] and real-time strategy games (AI).[24]

Life sciences

In life sciences, bayesian programming was used in vision to reconstruct shape from motion,[25] to model
visuo-vestibular interaction[26] and to study saccadic eye movements;[27] in speech perception and control
to study early speech acquisition[28] and the emergence of articulatory-acoustic systems;[29] and to model
handwriting perception and control.[30]

Pattern recognition

Bayesian program learning has potential applications voice recognition and synthesis, image recognition
and natural language processing. It employs the principles of compositionality (building abstract
representations from parts), causality (building complexity from parts) and learning to learn (using
previously recognized concepts to ease the creation of new concepts).[31]

Possibility theories
The comparison between probabilistic approaches (not only bayesian programming) and possibility theories
continues to be debated.

Possibility theories like, for instance, fuzzy sets,[32] fuzzy logic[33] and possibility theory[34] are alternatives
to probability to model uncertainty. They argue that probability is insufficient or inconvenient to model
certain aspects of incomplete/uncertain knowledge.

The defense of probability is mainly based on Cox's theorem, which starts from four postulates concerning
rational reasoning in the presence of uncertainty. It demonstrates that the only mathematical framework that
satisfies these postulates is probability theory. The argument is that any approach other than probability
necessarily infringes one of these postulates and the value of that infringement.

Probabilistic programming
The purpose of probabilistic programming is to unify the scope of classical programming languages with
probabilistic modeling (especially bayesian networks) to deal with uncertainty while profiting from the
programming languages' expressiveness to encode complexity.

Extended classical programming languages include logical languages as proposed in Probabilistic Horn
Abduction,[35] Independent Choice Logic,[36] PRISM,[37] and ProbLog which proposes an extension of
Prolog.

It can also be extensions of functional programming languages (essentially Lisp and Scheme) such as IBAL
or CHURCH. The underlying programming languages can be object-oriented as in BLOG and
FACTORIE or more standard ones as in CES and FIGARO.[38]

The purpose of Bayesian programming is different. Jaynes' precept of "probability as logic" argues that
probability is an extension of and an alternative to logic above which a complete theory of rationality,
computation and programming can be rebuilt.[1] Bayesian programming attempts to replace classical
languages with a programming approach based on probability that considers incompleteness and
uncertainty.

The precise comparison between the semantics and power of expression of Bayesian and probabilistic
programming is an open question.

See also
Mathematics
portal
Bayes' rule Hidden Markov model
Bayesian inference Judea Pearl
Bayesian probability Kalman filter
Bayesian spam filtering Naive Bayes classifier
Belief propagation Pierre-Simon de Laplace
Cox's theorem Probabilistic logic
Expectation-maximization algorithm Probabilistic programming language
Factor graph Subjective logic
Graphical model

References
1. Jaynes, E. T. (10 April 2003). Probability Theory: The Logic of Science (https://books.google.
com/books?id=UjsgAwAAQBAJ). Cambridge University Press. ISBN 978-1-139-43516-1.
2. Bessiere, Pierre; Mazer, Emmanuel; Manuel Ahuactzin, Juan; Mekhnacha, Kamel (20
December 2013). Bayesian Programming (https://books.google.com/books?id=4XtcAgAAQ
BAJ). CRC Press. ISBN 978-1-4398-8032-6.
3. "Expression Graphs: Unifying Factor Graphs and Sum-Product Networks" (http://bcf.usc.edu/
~rosenblo/Pubs/agi15_demski.pdf) (PDF). bcf.usc.edu.
4. "Probabilistic Modeling and Bayesian Analysis" (https://ocw.mit.edu/courses/sloan-school-of
-management/15-097-prediction-machine-learning-and-statistics-spring-2012/lecture-notes/
MIT15_097S12_lec15.pdf) (PDF). ocw.mit.edu.
5. "Bayesian Networks" (http://www.cs.brandeis.edu/~cs134/K_F_Ch3.pdf) (PDF).
cs.brandeis.edu.
6. Kalman, R. E. (1960). "A New Approach to Linear Filtering and Prediction Problems".
Journal of Basic Engineering. 82: 33–45. doi:10.1115/1.3662552 (https://doi.org/10.1115%2
F1.3662552). S2CID 1242324 (https://api.semanticscholar.org/CorpusID:1242324).
7. Bessière, Pierre; Laugier, Christian; Siegwart, Roland (15 May 2008). Probabilistic
Reasoning and Decision Making in Sensory-Motor Systems (https://books.google.com/book
s?id=Rk6ZnpmfhrQC). Springer Science & Business Media. ISBN 978-3-540-79006-8.
8. Lebeltel, O.; Bessière, P.; Diard, J.; Mazer, E. (2004). "Bayesian Robot Programming" (http://
cogprints.org/1670/5/Lebeltel2000.pdf) (PDF). Advanced Robotics. 16 (1): 49–79.
doi:10.1023/b:auro.0000008671.38949.43 (https://doi.org/10.1023%2Fb%3Aauro.00000086
71.38949.43). S2CID 18768468 (https://api.semanticscholar.org/CorpusID:18768468).
9. Diard, J.; Gilet, E.; Simonin, E.; Bessière, P. (2010). "Incremental learning of Bayesian
sensorimotor models: from low-level behaviours to large-scale structure of the environment"
(https://hal.archives-ouvertes.fr/hal-00537809/file/diard10_author.pdf) (PDF). Connection
Science. 22 (4): 291–312. Bibcode:2010ConSc..22..291D (https://ui.adsabs.harvard.edu/ab
s/2010ConSc..22..291D). doi:10.1080/09540091003682561 (https://doi.org/10.1080%2F095
40091003682561). S2CID 216035458 (https://api.semanticscholar.org/CorpusID:21603545
8).
10. Pradalier, C.; Hermosillo, J.; Koike, C.; Braillon, C.; Bessière, P.; Laugier, C. (2005). "The
CyCab: a car-like robot navigating autonomously and safely among pedestrians". Robotics
and Autonomous Systems. 50 (1): 51–68. CiteSeerX 10.1.1.219.69 (https://citeseerx.ist.psu.
edu/viewdoc/summary?doi=10.1.1.219.69). doi:10.1016/j.robot.2004.10.002 (https://doi.org/1
0.1016%2Fj.robot.2004.10.002).
11. Ferreira, J.; Lobo, J.; Bessière, P.; Castelo-Branco, M.; Dias, J. (2012). "A Bayesian
Framework for Active Artificial Perception" (https://hal.archives-ouvertes.fr/hal-00747148/file/
A_Bayesian_Framework_for_Active_Artificial_Perception.pdf) (PDF). IEEE Transactions on
Systems, Man, and Cybernetics - Part B: Cybernetics. 99 (2): 1–13.
doi:10.1109/TSMCB.2012.2214477 (https://doi.org/10.1109%2FTSMCB.2012.2214477).
PMID 23014760 (https://pubmed.ncbi.nlm.nih.gov/23014760). S2CID 1808051 (https://api.se
manticscholar.org/CorpusID:1808051).
12. Ferreira, J. F.; Dias, J. M. (2014). Probabilistic Approaches to Robotic Perception. Springer.
ISBN 978-3-319-02005-1.
13. Mekhnacha, K.; Mazer, E.; Bessière, P. (2001). "The design and implementation of a
Bayesian CAD modeler for robotic applications". Advanced Robotics. 15 (1): 45–69.
CiteSeerX 10.1.1.552.3126 (https://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.552.
3126). doi:10.1163/156855301750095578 (https://doi.org/10.1163%2F15685530175009557
8). S2CID 7920387 (https://api.semanticscholar.org/CorpusID:7920387).
14. Coué, C.; Pradalier, C.; Laugier, C.; Fraichard, T.; Bessière, P. (2006). "Bayesian Occupancy
Filtering for Multitarget Tracking: an Automotive Application" (https://hal.inria.fr/inria-0018200
4/file/coue-etal-ijrr-06.pdf) (PDF). International Journal of Robotics Research. 25 (1): 19–30.
doi:10.1177/0278364906061158 (https://doi.org/10.1177%2F0278364906061158).
S2CID 13874685 (https://api.semanticscholar.org/CorpusID:13874685).
15. Vasudevan, S.; Siegwart, R. (2008). "Bayesian space conceptualization and place
classification for semantic maps in mobile robotics". Robotics and Autonomous Systems. 56
(6): 522–537. CiteSeerX 10.1.1.149.4189 (https://citeseerx.ist.psu.edu/viewdoc/summary?do
i=10.1.1.149.4189). doi:10.1016/j.robot.2008.03.005 (https://doi.org/10.1016%2Fj.robot.200
8.03.005).
16. Perrin, X.; Chavarriaga, R.; Colas, F.; Seigwart, R.; Millan, J. (2010). "Brain-coupled
interaction for semi-autonomous navigation of an assistive robot" (http://infoscience.epfl.ch/r
ecord/149091). Robotics and Autonomous Systems. 58 (12): 1246–1255.
doi:10.1016/j.robot.2010.05.010 (https://doi.org/10.1016%2Fj.robot.2010.05.010).
17. Rett, J.; Dias, J.; Ahuactzin, J-M. (2010). "Bayesian reasoning for Laban Movement Analysis
used in human-machine interaction". International Journal of Reasoning-Based Intelligent
Systems. 2 (1): 13–35. CiteSeerX 10.1.1.379.6216 (https://citeseerx.ist.psu.edu/viewdoc/su
mmary?doi=10.1.1.379.6216). doi:10.1504/IJRIS.2010.029812 (https://doi.org/10.1504%2FI
JRIS.2010.029812).
18. Möbus, C.; Eilers, M.; Garbe, H.; Zilinski, M. (2009), "Probabilistic and Empirical Grounded
Modeling of Agents in (Partial) Cooperative Traffic Scenarios", in Duffy, Vincent G. (ed.),
Digital Human Modeling (http://oops.uni-oldenburg.de/1844/1/PartialCooperative20090223_
PCM.pdf) (PDF), Lecture Notes in Computer Science, Volume 5620, vol. 5620, Second
International Conference, ICDHM 2009, San Diego, CA, USA: Springer, pp. 423–432,
doi:10.1007/978-3-642-02809-0_45 (https://doi.org/10.1007%2F978-3-642-02809-0_45),
ISBN 978-3-642-02808-3
19. Möbus, C.; Eilers, M. (2009), "Further Steps Towards Driver Modeling according to the
Bayesian Programming Approach", in Duffy, Vincent G. (ed.), Digital Human Modeling,
Lecture Notes in Computer Science, Volume 5620, vol. 5620, Second International
Conference, ICDHM 2009, San Diego, CA, USA: Springer, pp. 413–422,
CiteSeerX 10.1.1.319.2067 (https://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.319.
2067), doi:10.1007/978-3-642-02809-0_44 (https://doi.org/10.1007%2F978-3-642-02809-0_
44), ISBN 978-3-642-02808-3
20. Eilers, M.; Möbus, C. (2010). "Lernen eines modularen Bayesian Autonomous Driver
Mixture-of-Behaviors (BAD MoB) Modells" (http://www.lks.uni-oldenburg.de/download/Publi
kationen/2010/Eilers&PCM2010_BFFM_BAD_MoB_Modells2010.pdf) (PDF). In Kolrep, H.;
Jürgensohn, Th. (eds.). Fahrermodellierung - Zwischen kinematischen Menschmodellen
und dynamisch-kognitiven Verhaltensmodellen. Fortschrittsbericht des VDI in der Reihe 22
(Mensch-Maschine-Systeme). Düsseldorf, Germany: VDI-Verlag. pp. 61–74. ISBN 978-3-18-
303222-8.
21. Eilers, M.; Möbus, C. (2011). "Learning the Relevant Percepts of Modular Hierarchical
Bayesian Driver Models Using a Bayesian Information Criterion". In Duffy, V.G. (ed.). Digital
Human Modeling. LNCS 6777. Heidelberg, Germany: Springer. pp. 463–472.
doi:10.1007/978-3-642-21799-9_52 (https://doi.org/10.1007%2F978-3-642-21799-9_52).
ISBN 978-3-642-21798-2.
22. Eilers, M.; Möbus, C. (2011). "Learning of a Bayesian Autonomous Driver Mixture-of-
Behaviors (BAD-MoB) Model" (http://www.crcpress.com/product/isbn/9781439835111). In
Duffy, V.G. (ed.). Advances in Applied Digital Human Modeling. LNCS 6777. Boca Raton,
USA: CRC Press, Taylor & Francis Group. pp. 436–445. ISBN 978-1-4398-3511-1.
23. Le Hy, R.; Arrigoni, A.; Bessière, P.; Lebetel, O. (2004). "Teaching Bayesian Behaviours to
Video Game Characters" (http://cogprints.org/3744/1/lehy04.pdf) (PDF). Robotics and
Autonomous Systems. 47 (2–3): 177–185. doi:10.1016/j.robot.2004.03.012 (https://doi.org/1
0.1016%2Fj.robot.2004.03.012). S2CID 16415524 (https://api.semanticscholar.org/CorpusI
D:16415524).
24. Synnaeve, G. (2012). Bayesian Programming and Learning for Multiplayer Video Games (htt
p://tel.archives-ouvertes.fr/docs/00/78/06/35/PDF/29588_SYNNAEVE_2012_archivage.pdf)
(PDF).
25. Colas, F.; Droulez, J.; Wexler, M.; Bessière, P. (2008). "A unified probabilistic model of the
perception of three-dimensional structure from optic flow". Biological Cybernetics. 97 (5–6):
461–77. CiteSeerX 10.1.1.215.1491 (https://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.
1.1.215.1491). doi:10.1007/s00422-007-0183-z (https://doi.org/10.1007%2Fs00422-007-018
3-z). PMID 17987312 (https://pubmed.ncbi.nlm.nih.gov/17987312). S2CID 215821150 (http
s://api.semanticscholar.org/CorpusID:215821150).
26. Laurens, J.; Droulez, J. (2007). "Bayesian processing of vestibular information". Biological
Cybernetics. 96 (4): 389–404. doi:10.1007/s00422-006-0133-1 (https://doi.org/10.1007%2Fs
00422-006-0133-1). PMID 17146661 (https://pubmed.ncbi.nlm.nih.gov/17146661).
S2CID 18138027 (https://api.semanticscholar.org/CorpusID:18138027).
27. Colas, F.; Flacher, F.; Tanner, T.; Bessière, P.; Girard, B. (2009). "Bayesian models of eye
movement selection with retinotopic maps" (https://hal.archives-ouvertes.fr/hal-00384515/fil
e/main.pdf) (PDF). Biological Cybernetics. 100 (3): 203–214. doi:10.1007/s00422-009-0292-
y (https://doi.org/10.1007%2Fs00422-009-0292-y). PMID 19212780 (https://pubmed.ncbi.nl
m.nih.gov/19212780). S2CID 5906668 (https://api.semanticscholar.org/CorpusID:5906668).
28. Serkhane, J.; Schwartz, J-L.; Bessière, P. (2005). "Building a talking baby robot A
contribution to the study of speech acquisition and evolution" (https://hal.archives-ouvertes.f
r/hal-00186575/file/Serkhane_Interaction_Studies_2005.pdf) (PDF). Interaction Studies. 6
(2): 253–286. doi:10.1075/is.6.2.06ser (https://doi.org/10.1075%2Fis.6.2.06ser).
29. Moulin-Frier, C.; Laurent, R.; Bessière, P.; Schwartz, J-L.; Diard, J. (2012). "Adverse
conditions improve distinguishability of auditory, motor and percep-tuo-motor theories of
speech perception: an exploratory Bayesian modeling study" (https://hal.archives-ouvertes.f
r/hal-01059179/file/moulin-frier12.pdf) (PDF). Language and Cognitive Processes. 27 (7–8):
1240–1263. doi:10.1080/01690965.2011.645313 (https://doi.org/10.1080%2F01690965.201
1.645313). S2CID 55504109 (https://api.semanticscholar.org/CorpusID:55504109).
30. Gilet, E.; Diard, J.; Bessière, P. (2011). Sporns, Olaf (ed.). "Bayesian Action–Perception
Computational Model: Interaction of Production and Recognition of Cursive Letters" (https://
www.ncbi.nlm.nih.gov/pmc/articles/PMC3106017). PLOS ONE. 6 (6): e20387.
Bibcode:2011PLoSO...620387G (https://ui.adsabs.harvard.edu/abs/2011PLoSO...620387G).
doi:10.1371/journal.pone.0020387 (https://doi.org/10.1371%2Fjournal.pone.0020387).
PMC 3106017 (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3106017). PMID 21674043
(https://pubmed.ncbi.nlm.nih.gov/21674043).
31. "New algorithm helps machines learn as quickly as humans" (http://www.gizmag.com/artifici
al-intelligence-algorithm-learning/41448). www.gizmag.com. 2016-01-22. Retrieved
2016-01-23.
32. Zadeh, L.A. (June 1965). "Fuzzy sets" (https://doi.org/10.1016%2FS0019-9958%2865%299
0241-X). Information and Control. San Diego. 8 (3): 338–353. doi:10.1016/S0019-
9958(65)90241-X (https://doi.org/10.1016%2FS0019-9958%2865%2990241-X). ISSN 0019-
9958 (https://www.worldcat.org/issn/0019-9958). Wikidata Q25938993.
33. Zadeh, L.A. (September 1975). "Fuzzy logic and approximate reasoning". Synthese.
Springer. 30 (3–4): 407–428. doi:10.1007/BF00485052 (https://doi.org/10.1007%2FBF0048
5052). ISSN 0039-7857 (https://www.worldcat.org/issn/0039-7857). OCLC 714993477 (http
s://www.worldcat.org/oclc/714993477). S2CID 46975216 (https://api.semanticscholar.org/Co
rpusID:46975216). Wikidata Q57275767.
34. Dubois, D.; Prade, H. (2001). "Possibility Theory, Probability Theory and Multiple-Valued
Logics: A Clarification" (ftp://ftp.irit.fr/IRIT/ADRIA/AMAI-Dub.Pra.revised.pdf) (PDF). Ann.
Math. Artif. Intell. 32 (1–4): 35–66. doi:10.1023/A:1016740830286 (https://doi.org/10.1023%2
FA%3A1016740830286). S2CID 10271476 (https://api.semanticscholar.org/CorpusID:1027
1476).
35. Poole, D. (1993). "Probabilistic Horn abduction and Bayesian networks". Artificial
Intelligence. 64: 81–129. doi:10.1016/0004-3702(93)90061-F (https://doi.org/10.1016%2F00
04-3702%2893%2990061-F).
36. Poole, D. (1997). "The Independent Choice Logic for modelling multiple agents under
uncertainty" (https://doi.org/10.1016%2FS0004-3702%2897%2900027-1). Artificial
Intelligence. 94 (1–2): 7–56. doi:10.1016/S0004-3702(97)00027-1 (https://doi.org/10.1016%
2FS0004-3702%2897%2900027-1).
37. Sato, T.; Kameya, Y. (2001). "Parameter learning of logic programs for symbolic-statistical
modeling" (https://web.archive.org/web/20140712033447/http://www.jair.org/media/912/live-
912-2013-jair.pdf) (PDF). Journal of Artificial Intelligence Research. 15 (2001): 391–454.
arXiv:1106.1797 (https://arxiv.org/abs/1106.1797). Bibcode:2011arXiv1106.1797S (https://ui.
adsabs.harvard.edu/abs/2011arXiv1106.1797S). doi:10.1613/jair.912 (https://doi.org/10.161
3%2Fjair.912). S2CID 7857569 (https://api.semanticscholar.org/CorpusID:7857569).
Archived from the original (http://www.jair.org/media/912/live-912-2013-jair.pdf) (PDF) on
2014-07-12. Retrieved 2015-10-18.
38. figaro (https://github.com/p2t2/figaro) on GitHub

Further reading
Kamel Mekhnacha (2013). Bayesian Programming. Chapman and Hall/CRC.
doi:10.1201/b16111 (https://doi.org/10.1201%2Fb16111). ISBN 978-1-4398-8032-6.

External links
A companion site to the Bayesian programming book where to download ProBT an
inference engine dedicated to Bayesian programming. (https://archive.today/201311231627
33/http://www.probayes.com/Bayesian-Programming-Book)
The Bayesian-programming.org site (http://Bayesian-programming.org) Archived (https://arch
ive.today/20131123162815/http://bayesian-programming.org/) 2013-11-23 at archive.today
for the promotion of Bayesian programming with detailed information and numerous
publications.

Retrieved from "https://en.wikipedia.org/w/index.php?title=Bayesian_programming&oldid=1147079993"

You might also like