AI & ML Unit 2 Notes

Unit – II

Acting Under Uncertainty:


Uncertainty:
• In knowledge representation, A -> B means that if A is true then B is true. In a situation where we are not sure whether A is true, we cannot express this statement; such a situation is called uncertainty.
• Agents must therefore act under uncertainty.

Causes for uncertainty:

• Information obtained from unreliable sources
• Experimental errors
• Equipment faults
• Temperature variation
• Climate change

Probabilistic Reasoning:

• It is a way of knowledge representation in which the concept of probability is applied to indicate the uncertainty in knowledge.
Need for Probabilistic Reasoning in AI:
✓ When there are unpredictable outcomes
✓ When an unknown error occurs during an experiment

Ways to solve problems with uncertain knowledge:

✓ Bayes' rule
✓ Bayesian Statistics

Probability:

• It can be defined as the chance that an uncertain event will occur.
• The value of a probability always lies between 0 and 1.
• 0 ≤ P(A) ≤ 1, where P(A) is the probability of an event A.
• P(A) = 0 indicates that event A will not occur (it is impossible).
• P(A) = 1 indicates that event A is certain to occur.

Event: Each possible outcome of a variable is called an event.
Sample Space: The collection of all possible events is called the sample space.
Random Variables: Random variables are used to represent events and objects in the real world.
Prior Probability: The probability of an event computed before observing new information.
Posterior Probability: The probability calculated after all new information has been taken into account.

Conditional Probability:

• It is the probability of an event occurring given that another event has already happened: P(A|B) = P(A ∧ B) / P(B).

Bayesian Inference:

• Bayesian inference is a probabilistic approach to machine learning that provides estimates of the probability of specific events.
• Bayesian inference is a statistical method for understanding the uncertainty inherent in prediction problems.
• Bayesian inference can be carried out with Markov chain Monte Carlo (MCMC) algorithms, which combine prior probability distributions with the likelihood function to approximate the posterior.
• The basis of Bayesian inference is the notion of a priori and a posteriori probabilities.
• The a priori (prior) probability is the probability of an event before any evidence is considered.
• The a posteriori (posterior) probability is the probability of an event after taking into account all available evidence.

Bayes' Theorem / Bayes' Rule:

• Bayes' theorem determines the probability of an event with uncertain knowledge.
• It can be derived using the product rule and the conditional probability of event A with a known event B.

P(A|B) is known as the posterior, P(B|A) is called the likelihood, P(A) is called the prior probability, and P(B) is called the marginal probability.
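The formula itself is not reproduced in these notes; in standard notation, Bayes' rule reads:

```latex
% Bayes' rule: posterior = likelihood * prior / marginal
P(A \mid B) = \frac{P(B \mid A)\, P(A)}{P(B)}
```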

Applications of Bayes' Theorem:

• It is used to calculate the probability of a robot's next step when the step already executed is known.
• It is helpful in weather forecasting.
• It is used to solve the Monty Hall problem.
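As a quick worked illustration of the rule (the numbers below are hypothetical, chosen only to show the arithmetic):

```python
# Minimal worked example of Bayes' rule with made-up numbers.
# Query: P(disease | positive test)
p_disease = 0.01            # prior P(A)
p_pos_given_disease = 0.95  # likelihood P(B | A)
p_pos_given_healthy = 0.06  # P(B | not A)

# Marginal probability of the evidence: P(B) = P(B|A)P(A) + P(B|~A)P(~A)
p_pos = p_pos_given_disease * p_disease + p_pos_given_healthy * (1 - p_disease)

# Posterior via Bayes' rule: P(A|B) = P(B|A) P(A) / P(B)
p_disease_given_pos = p_pos_given_disease * p_disease / p_pos
print(round(p_disease_given_pos, 3))  # ~0.138
```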

Naïve Bayes Theorem:

• It is a classification technique based on Bayes' theorem together with an independence assumption among the features.
• Under this assumption, the full joint distribution can be written as shown below.
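A standard way to write this factorization (assuming a single cause/class variable and effects/features that are conditionally independent given it):

```latex
P(\text{Cause}, \text{Effect}_1, \ldots, \text{Effect}_n)
  = P(\text{Cause}) \prod_{i=1}^{n} P(\text{Effect}_i \mid \text{Cause})
```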

Bayesian Networks:

• "A Bayesian network is a probabilistic graphical model which represents a set of


variables and their conditional dependencies using a directed acyclic graph."
• It is also called a Bayes network, belief network, decision network, or Bayesian
model.
• Bayesian Network can be used for building models from data and experts
opinions, and it consists of two parts: Directed Acyclic Graph, Table of
conditional probabilities.
• It is used to represent conditional dependencies.
• It can also be used in various tasks including prediction, anomaly detection,
diagnostics, automated insight, reasoning, time series prediction.
• A Bayesian network graph is made up of nodes and Arcs.

• Each node corresponds to a random variable, and a variable can be continuous or discrete.
• Arcs (directed arrows) represent the causal relationships or conditional probabilities between random variables.
• These directed links or arrows connect pairs of nodes in the graph.
• A link indicates that one node directly influences the other node.
• The Bayesian network graph does not contain any cycles; hence, it is known as a directed acyclic graph (DAG).
• The Bayesian network has mainly two components: 1. Causal Component 2.
Actual numbers
• Bayesian network is based on Joint probability distribution and conditional
probability.

Joint probability distribution:

• If the variables are x1, x2, x3, ..., xn, then the probabilities of the different combinations of x1, x2, x3, ..., xn are known as the joint probability distribution.
• By the chain rule, P(x1, x2, x3, ..., xn) can be written in terms of conditional probabilities as: P(x1, x2, ..., xn) = P(x1 | x2, x3, ..., xn) · P(x2 | x3, ..., xn) · ... · P(xn-1 | xn) · P(xn).
• Global semantics defines the full joint distribution as the product of the local conditional distributions (see the formula below).
• Local semantics states that each node is conditionally independent of its nondescendants given its parents.
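The global semantics mentioned above corresponds to the standard Bayesian network factorization:

```latex
P(x_1, \ldots, x_n) = \prod_{i=1}^{n} P\!\left(x_i \mid \mathrm{parents}(X_i)\right)
```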

Example:

The network structure shows that Burglary and Earthquake are the parent nodes of Alarm and directly affect the probability of the alarm going off.

Variables: Burglary, Earthquake, Alarm, JohnCalls, MaryCalls

Conditional Probability Table for Alarm (A):

Conditional Probability Table for JohnCalls:

Conditional Probability Table for MaryCalls:
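The conditional probability tables themselves are not reproduced in these notes. The sketch below uses the CPT values commonly quoted for this classic example (treat the specific numbers as assumptions) and evaluates one entry of the full joint distribution via the network factorization P(B, E, A, J, M) = P(B) P(E) P(A | B, E) P(J | A) P(M | A):

```python
# Burglary network sketch. CPT numbers follow the values commonly used for
# this classic example; treat them as assumed here.
P_B = 0.001                      # P(Burglary = true)
P_E = 0.002                      # P(Earthquake = true)
P_A = {                          # P(Alarm = true | Burglary, Earthquake)
    (True, True): 0.95, (True, False): 0.94,
    (False, True): 0.29, (False, False): 0.001,
}
P_J = {True: 0.90, False: 0.05}  # P(JohnCalls = true | Alarm)
P_M = {True: 0.70, False: 0.01}  # P(MaryCalls = true | Alarm)

def joint(b, e, a, j, m):
    """P(B=b, E=e, A=a, J=j, M=m) via the chain-rule factorization of the network."""
    p = (P_B if b else 1 - P_B) * (P_E if e else 1 - P_E)
    p *= P_A[(b, e)] if a else 1 - P_A[(b, e)]
    p *= P_J[a] if j else 1 - P_J[a]
    p *= P_M[a] if m else 1 - P_M[a]
    return p

# Probability that the alarm sounds and both neighbours call,
# with neither a burglary nor an earthquake:
print(joint(False, False, True, True, True))  # ~0.000628
```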

Applications of Bayesian Networks:


• Spam Filtering
• Biomonitoring
• Image processing
• Turbo code
• Document Classification

Exact Inference in BN:

• In exact inference, we analytically compute the conditional probability distribution over the variables of interest.
• The basic task for any probabilistic inference system is to compute the posterior probability distribution for a set of query variables, given an observed event.
• The notation X denotes the query variable, E denotes the set of evidence variables E1, ..., Em, and Y denotes the nonevidence (hidden) variables.
• Conditional probabilities can be computed by summing terms from the full joint distribution, as shown below.
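In standard notation (with α a normalizing constant and y ranging over the hidden variables), this query is:

```latex
\mathbf{P}(X \mid \mathbf{e}) = \alpha\, \mathbf{P}(X, \mathbf{e})
  = \alpha \sum_{\mathbf{y}} \mathbf{P}(X, \mathbf{e}, \mathbf{y})
```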

• Now, a Bayesian network gives a complete representation of the full joint distribution.
• More specifically, the terms P(x, e, y) in the joint distribution can be written as products of conditional probabilities from the network.
• In the burglary example, answering P(Burglary | JohnCalls = true, MaryCalls = true) by enumeration means adding four terms (one for each combination of Earthquake and Alarm values), each computed by multiplying five numbers.
• In the worst case, where we have to sum out almost all the variables, the complexity of enumeration for a network with n Boolean variables is O(n·2^n).
• The expression, shown below, can be evaluated by looping through the variables in order.

Variable Elimination Algorithm:

• The enumeration algorithm can be improved substantially by eliminating repeated calculations.
• The idea is to do each calculation once and save the results for later use.
• This is a form of dynamic programming.
• It works by evaluating expressions such as the one below in right-to-left order.
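Pulling out factors that do not depend on the inner summation variables gives the form that variable elimination evaluates from right to left:

```latex
\mathbf{P}(B \mid j, m)
  = \alpha\, P(B) \sum_{e} P(e) \sum_{a} P(a \mid B, e)\, P(j \mid a)\, P(m \mid a)
```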

Approximate Inference in BN:

• Given the intractability of exact inference in large networks, we will consider approximate inference
methods.

• This section describes randomized sampling algorithms, also called Monte Carlo algorithms.

• They work by generating random events based on the probabilities in the Bayes net and counting
up the different answers found in those random events.

Direct Sampling methods:

• The primitive element in any sampling algorithm is the generation of samples from a known probability distribution.
• For example, an unbiased coin can be thought of as a random variable Coin with
values (heads, tails) and a prior distribution P(Coin) = (0.5,0.5).
• Sampling from this distribution is exactly like flipping the coin: with probability
0.5 it will return heads, and with probability 0.5 it will return tails.
• Given a source of random numbers r uniformly distributed in the range [0,1], it
is a simple matter to sample any distribution on a single variable, whether
discrete or continuous.
• The idea is to sample each variable in turn, in topological order.
• The probability distribution from which the value is sampled is conditioned on
the values already assigned to the variable’s parents.
• Applying this to the sprinkler network with the topological ordering Cloudy, Sprinkler, Rain gives, for example (a code sketch follows below):
• Sample from P(Cloudy) = {0.5, 0.5}; suppose the value is true.
• Sample from P(Sprinkler | Cloudy = true) = {0.1, 0.9}; suppose the value is false.
• Sample from P(Rain | Cloudy = true) = {0.8, 0.2}; suppose the value is true.
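A minimal sketch of this prior-sampling procedure for the Cloudy/Sprinkler/Rain/WetGrass network; the CPT entries not quoted above (P(Sprinkler | Cloudy = false), P(Rain | Cloudy = false), and the WetGrass table) are the values commonly used in the textbook example and are assumptions here:

```python
import random

# CPTs for the Cloudy -> {Sprinkler, Rain} -> WetGrass network.
# Values not stated in the notes are assumed from the usual textbook example.
P_CLOUDY = 0.5
P_SPRINKLER = {True: 0.1, False: 0.5}               # P(Sprinkler=true | Cloudy)
P_RAIN = {True: 0.8, False: 0.2}                    # P(Rain=true | Cloudy)
P_WET = {(True, True): 0.99, (True, False): 0.90,   # P(WetGrass=true | Sprinkler, Rain)
         (False, True): 0.90, (False, False): 0.0}

def bernoulli(p):
    """Return True with probability p, using a uniform random number in [0, 1)."""
    return random.random() < p

def prior_sample():
    """Sample every variable in topological order, conditioning on sampled parents."""
    cloudy = bernoulli(P_CLOUDY)
    sprinkler = bernoulli(P_SPRINKLER[cloudy])
    rain = bernoulli(P_RAIN[cloudy])
    wet = bernoulli(P_WET[(sprinkler, rain)])
    return cloudy, sprinkler, rain, wet

print(prior_sample())  # e.g. (True, False, True, True)
```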

Rejection Sampling in Bayesian Networks:

• Rejection sampling is a general method for producing samples from a hard-to-sample distribution given an easy-to-sample distribution.
• It can be used to compute conditional probabilities, that is, to determine P(X | e).
• First, it generates samples from the prior distribution specified by the network.
• Then it rejects all samples that do not match the evidence and estimates P(X | e) from the remaining samples, as in the sketch below.
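A compact sketch of rejection sampling for the query P(Rain | Sprinkler = true) on the same network (the P(Sprinkler | Cloudy = false) and P(Rain | Cloudy = false) entries are again assumed from the usual example):

```python
import random

P_CLOUDY = 0.5
P_SPRINKLER = {True: 0.1, False: 0.5}   # P(Sprinkler=true | Cloudy); second entry assumed
P_RAIN = {True: 0.8, False: 0.2}        # P(Rain=true | Cloudy); second entry assumed

def estimate_rain_given_sprinkler(n=100_000):
    """Estimate P(Rain = true | Sprinkler = true) by rejecting inconsistent samples."""
    kept = rain_true = 0
    for _ in range(n):
        cloudy = random.random() < P_CLOUDY
        sprinkler = random.random() < P_SPRINKLER[cloudy]
        if not sprinkler:          # sample disagrees with the evidence: reject it
            continue
        rain = random.random() < P_RAIN[cloudy]
        kept += 1
        rain_true += rain
    return rain_true / kept

print(estimate_rain_given_sprinkler())  # close to 0.3 with these numbers
```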

Markov Chain Monte Carlo (MCMC) Algorithm:

• MCMC generates each event by making a random change to the preceding event.
• It is therefore helpful to think of the network as being in a particular current state
specifying a value for every variable.
• The next state is generated by randomly sampling a value for one of the
nonevidence variables Xi, conditioned on the current values of the variables in
the Markov blanket of Xi.
• MCMC therefore wanders randomly around the state space (the space of possible complete assignments), flipping one variable at a time but keeping the evidence variables fixed.
• Consider the query P(Rain | Sprinkler = true, WetGrass = true) applied to the sprinkler network.
• The evidence variables Sprinkler and WetGrass are fixed to their observed values, and the hidden variables Cloudy and Rain are initialized randomly.
• Thus, the initial state is [Cloudy = true, Sprinkler = true, Rain = false, WetGrass = true]. Now the following steps are executed repeatedly:
• Cloudy is sampled, given the current values of its Markov blanket variables: in this case, we sample from P(Cloudy | Sprinkler = true, Rain = false). Suppose the result is Cloudy = false. Then the new current state is [false, true, false, true]. A code sketch of this procedure follows below.
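A minimal self-contained sketch of this Gibbs-sampling procedure for the query above; the CPT entries not quoted in these notes (P(Sprinkler | Cloudy = false), P(Rain | Cloudy = false), and the WetGrass table) are the values commonly used in the textbook example and should be treated as assumptions here:

```python
import random

# CPTs (entries not stated in the notes are assumed from the usual example).
P_CLOUDY = 0.5
P_SPRINKLER = {True: 0.1, False: 0.5}   # P(Sprinkler=true | Cloudy)
P_RAIN = {True: 0.8, False: 0.2}        # P(Rain=true | Cloudy)
P_WET = {(True, True): 0.99, (True, False): 0.90,
         (False, True): 0.90, (False, False): 0.0}  # P(WetGrass=true | Sprinkler, Rain)

def bern(w_true, w_false):
    """Sample True/False in proportion to two unnormalized weights."""
    return random.random() < w_true / (w_true + w_false)

def gibbs_rain_given_evidence(n=100_000):
    """Estimate P(Rain=true | Sprinkler=true, WetGrass=true) by Gibbs sampling (no burn-in)."""
    sprinkler, wet = True, True                                   # evidence, kept fixed
    cloudy, rain = random.random() < 0.5, random.random() < 0.5   # random initial state
    rain_count = 0
    for _ in range(n):
        # Resample Cloudy from P(Cloudy | sprinkler, rain) ∝ P(C) P(sprinkler|C) P(rain|C)
        w_t = P_CLOUDY * P_SPRINKLER[True] * (P_RAIN[True] if rain else 1 - P_RAIN[True])
        w_f = (1 - P_CLOUDY) * P_SPRINKLER[False] * (P_RAIN[False] if rain else 1 - P_RAIN[False])
        cloudy = bern(w_t, w_f)
        # Resample Rain from P(Rain | cloudy, sprinkler, wet) ∝ P(R|cloudy) P(wet|sprinkler, R)
        w_t = P_RAIN[cloudy] * P_WET[(sprinkler, True)]
        w_f = (1 - P_RAIN[cloudy]) * P_WET[(sprinkler, False)]
        rain = bern(w_t, w_f)
        rain_count += rain
    return rain_count / n

print(gibbs_rain_given_evidence())  # roughly 0.32 with these assumed numbers
```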

Causal Networks:

• A causal network is an acyclic digraph arising from an evolution of a substitution system.
• Each substitution event is a vertex in a causal network.
• Two events that are related by causal dependence, meaning one occurs just before the other, have an edge between the corresponding vertices in the causal network.
• The edge is a directed edge leading from the past event to the future event.
• A causal Bayesian network (CBN) is a graph formed by nodes representing random variables, connected by links denoting causal influence.
• Some causal networks are independent of the choice of evolution; these are called causally invariant.

Structural Causal Models (SCMs):

• An SCM consists of two parts: a graph, which visualizes the causal connections, and equations, which express the details of those connections. A graph is a mathematical construction that consists of vertices (nodes) and edges (links).
• SCMs use a special kind of graph called a Directed Acyclic Graph (DAG), in which all edges are directed and no cycles exist.
• DAGs are a common starting place for causal inference.
• Structurally, Bayesian and causal networks are identical; the difference lies in how their edges are interpreted, since in a causal network each edge denotes a direct causal influence.

• Consider a network with 2 nodes and 1 edge.
• Such a network can serve as either a Bayesian network or a causal network, depending on how the edge is interpreted.

Implementing Causal Inference:


1) The do-operator:

• The do-operator is a mathematical representation of a physical intervention.
• If the model starts with Z → X → Y, then intervening with do(X = x) removes the edge Z → X, and the effect on Y is evaluated in the modified model in which X is set to x.
2) Confounding:

• In this example, age is a confounder of education and wealth.
• Adjusting for age means that when looking at age, education and wealth data, one would compare data points within age groups, not between age groups (see the formula below).
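Written out, "adjusting for age" corresponds to the standard adjustment formula (stated here for completeness; the variable names simply mirror the example, with age as the confounder):

```latex
P(\text{Wealth} \mid do(\text{Education}))
  = \sum_{z} P(\text{Wealth} \mid \text{Education},\, \text{Age} = z)\, P(\text{Age} = z)
```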

3) Estimating Causal Effects:

• Treatment Effect = (Outcome under E) minus (Outcome under C).


• It is the difference between the outcome a child would receive if assigned to treatment E and the outcome that same child would receive if assigned to treatment C.
• These two quantities are called potential outcomes.
