AI Fundamentals Finals

What is a finite state machine?
A state machine that can only be in a finite number of states
How does the RProp algorithm handle the initialization of weights in the optimization process?
It uses random weights
What are the biases in the backpropagation algorithm?

Learnable offsets added to each neuron's weighted sum of inputs before the activation function is applied
How does the Quickprop algorithm improve upon traditional gradient descent algorithms?
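It uses the current and previous gradients to approximate the error curve of each weight as a parabola and jumps toward its minimum, so the effective step size varies per weight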
How does the RProp algorithm handle local minima in the optimization process?
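It does not guarantee avoiding local minima; its dynamic per-weight step sizes depend only on the sign of the gradient, which helps it move quickly across flat regions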
What are the hidden layers of a feedforward neural network called?

Hidden layers
What is a decision tree?

A tree-like model used to make predictions in supervised learning
How does the Quickprop algorithm handle weight updates that are too large?
It limits them: a step may be at most a maximum growth factor times the previous step for that weight
How does the RProp algorithm adjust the learning rate?

It increases a weight's step size when the gradient for that weight keeps the same sign as in the
previous iteration and decreases it when the sign changes
How does the backpropagation algorithm work?

By propagating the output error backward through the network to compute gradients, then adjusting the weights and biases of the neural network accordingly
What is a disadvantage of using a Q-learning algorithm?

In its basic tabular form, it cannot handle continuous state spaces
What is a backpropagation network?

A type of artificial neural network that uses supervised learning
How is a state transition function typically represented?

As a function of state and input
What is a Markov decision process (MDP)?

A mathematical model for decision-making in situations where outcomes are partly random and
partly under the control of a decision maker
What is a deterministic finite state machine?
A finite state machine that has a unique next state for each combination of current state and
input
How does the Quickprop algorithm adjust the learning rate for each weight in the neural
network?
It computes each weight's step from the previous weight update and the current and previous gradients
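
The Quickprop cards can be illustrated with a minimal sketch of a single-weight step. It is a simplified version of the secant update, not the full published algorithm; the function name `quickprop_step`, the bootstrap learning rate `lr`, and the fallback to plain gradient descent are assumptions made for this example. The growth limit `mu` caps a step at `mu` times the previous one.

```python
def quickprop_step(grad, prev_grad, prev_step, lr=0.1, mu=1.75):
    """Return the next weight change for one weight (simplified Quickprop sketch)."""
    if prev_step == 0.0:
        # No history yet: bootstrap with an ordinary gradient descent step.
        return -lr * grad
    denom = prev_grad - grad
    if denom == 0.0:
        # The parabola is undefined; falling back is an assumption of this sketch.
        return -lr * grad
    # Jump toward the minimum of the parabola fitted through the two gradients.
    step = prev_step * grad / denom
    # Cap the step at mu times the size of the previous step.
    limit = mu * abs(prev_step)
    return max(-limit, min(limit, step))
```

For the error function (w - 3)^2 starting at w = 0, the uncapped second step would land exactly on the minimum, while the default `mu` shortens that jump.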
What are the weights in the backpropagation algorithm?

The learnable values that scale each connection's input and so determine the output of the neural network
How does a backpropagation network learn?

By adjusting the weights and biases of the network based on the input and output data
What are the components of a Markov decision process (MDP)?

States, actions, transition probabilities, and rewards
What is a Q-learning algorithm?

A reinforcement learning algorithm that uses a Q-table to store the expected rewards for each
action in each state
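
A minimal sketch of the Q-table idea, assuming the standard tabular update rule. The state and action names, the learning rate `alpha`, and the discount factor `gamma` are illustrative values, not part of the original cards.

```python
from collections import defaultdict

def q_update(q, state, action, reward, next_state, actions, alpha=0.5, gamma=0.9):
    """Apply one tabular Q-learning update and return the new Q-value."""
    # Best value the agent could obtain from the next state.
    best_next = max(q[(next_state, a)] for a in actions)
    target = reward + gamma * best_next
    # Move the stored estimate part of the way toward the target.
    q[(state, action)] += alpha * (target - q[(state, action)])
    return q[(state, action)]

# Hypothetical example: an empty Q-table (all zeros) and one observed transition.
q_table = defaultdict(float)
new_value = q_update(q_table, "s0", "right", 1.0, "s1", ["left", "right"])
```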
What is a kernel in the context of support vector machines (SVMs)?

A mathematical function that measures the similarity of two data points as their inner product in a higher-dimensional space
What is a supervised learning algorithm?

An algorithm that can learn from labeled data
What is a policy in a Markov decision process (MDP)?

A set of rules for selecting actions in a given state
What is a kernel function in the context of support vector machines?

A function that returns the inner product of two inputs in a higher-dimensional space without computing the mapping explicitly
What is a state machine?

A machine that changes its state based on a state transition function
What is a nondeterministic finite state machine?

A finite state machine that has multiple possible next states for each combination of current
state and input
What is a linear support vector machine?

A support vector machine that uses a linear kernel function
What is a policy in reinforcement learning?

A mapping from states to actions that determines how the agent behaves
What is a reward in a Markov decision process (MDP)?
A numerical value that represents the utility or value of a state or action
What is a linear regression algorithm?

A model used to make predictions in supervised learning by fitting a linear equation to the data
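
As a rough illustration of fitting a linear equation to data, the sketch below uses the closed-form least-squares solution for one input variable. The function name `fit_line` and the sample data are made up for this example.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance of x and y, and variance of x (both unnormalized).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical data lying exactly on y = 2x + 1.
slope, intercept = fit_line([0, 1, 2, 3], [1, 3, 5, 7])
```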
What is a state in a Markov decision process (MDP)?

A representation of the current situation
What is a state in reinforcement learning?

The current condition or situation of the agent
What is a logistic regression algorithm?

A model used for classification tasks by predicting the probability of an event occurring
What is a support vector in the context of support vector machines?

A training data point that lies closest to the decision boundary and is therefore used to define it
What is a state transition function?

A function that determines the next state of a system based on its current state and input
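
One way to picture a state transition function is as a lookup table keyed by (state, input). The turnstile below is a hypothetical example chosen for illustration; its states and inputs do not come from the original cards.

```python
# Hypothetical turnstile: each (current state, input) pair has one next state.
TRANSITIONS = {
    ("locked", "coin"): "unlocked",
    ("locked", "push"): "locked",
    ("unlocked", "coin"): "unlocked",
    ("unlocked", "push"): "locked",
}

def next_state(state, symbol):
    """State transition function: (current state, input) -> next state."""
    return TRANSITIONS[(state, symbol)]

def run(start, inputs):
    """Feed a sequence of input symbols to the machine and return the final state."""
    state = start
    for symbol in inputs:
        state = next_state(state, symbol)
    return state
```

Because every pair maps to exactly one next state, this table describes a deterministic finite state machine.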
What is a reward in reinforcement learning?

A positive or negative feedback received by the agent for its actions
What is a support vector machine used for?

Classification
What is a multi-class support vector machine?

A support vector machine that can handle multiple classes of data
What is a soft margin in the context of support vector machines?

A margin that tolerates some misclassified or margin-violating points, which are penalized through the hinge loss function
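
The hinge loss mentioned here can be sketched in a few lines, assuming labels of +1 or -1 and a raw score of the form w.x + b. The function name is chosen for this example.

```python
def hinge_loss(label, score):
    """Hinge loss for one example; label is +1 or -1, score is the raw SVM output."""
    # Zero when the point is on the correct side with margin at least 1,
    # growing linearly as the point moves into or across the margin.
    return max(0.0, 1.0 - label * score)
```

A point scored 2.0 with label +1 costs nothing, while the same label with score 0.5 sits inside the margin and is penalized.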
What is a non-linear support vector machine?

A support vector machine that uses a non-linear kernel function
What is reinforcement learning?

A type of machine learning where an agent learns through interacting with its environment and
receiving rewards or punishments
What is an action in reinforcement learning?

A decision made by the agent based on its current state
What is the Bellman equation in a Markov decision process (MDP)?

A recursive equation used to compute the value function of a state
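
A minimal sketch of how the recursion is used in practice: value iteration applies the Bellman optimality update repeatedly until the values stop changing. The MDP layout (a dict of states, actions, and `(probability, next_state, reward)` outcomes), the discount `gamma`, and the toy problem are assumptions for illustration.

```python
def value_iteration(mdp, gamma=0.9, tol=1e-9):
    """Return a dict of state values for a small MDP given as nested dicts."""
    values = {s: 0.0 for s in mdp}
    while True:
        delta = 0.0
        for s, actions in mdp.items():
            if not actions:
                # Terminal state: no actions, value stays 0.
                continue
            # Bellman update: best expected reward plus discounted next value.
            best = max(
                sum(p * (r + gamma * values[s2]) for p, s2, r in outcomes)
                for outcomes in actions.values()
            )
            delta = max(delta, abs(best - values[s]))
            values[s] = best
        if delta < tol:
            return values

# Hypothetical two-state problem: "go" ends the episode with reward 1,
# "stay" loops in place with reward 0.
toy_mdp = {
    "a": {"go": [(1.0, "end", 1.0)], "stay": [(1.0, "a", 0.0)]},
    "end": {},
}
state_values = value_iteration(toy_mdp)
```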
What is a support vector machine?
A type of supervised learning algorithm that separates classes with a maximum-margin decision boundary; it is not a neural network
What is an example of a reinforcement learning problem?

Training a robot to navigate through a maze
What is breadth-first search (BFS)?

An uninformed combinatorial search algorithm that expands all nodes at one depth before moving on to the next depth
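
A short sketch of breadth-first search using a FIFO queue. The adjacency-list graph is a made-up example.

```python
from collections import deque

def bfs_order(graph, start):
    """Return the nodes reachable from start in the order BFS discovers them."""
    visited = [start]
    queue = deque([start])
    while queue:
        # Take the oldest discovered node, so shallower nodes come first.
        node = queue.popleft()
        for neighbor in graph[node]:
            if neighbor not in visited:
                visited.append(neighbor)
                queue.append(neighbor)
    return visited

# Hypothetical graph: A branches to B and C, which both lead to D.
example_graph = {"A": ["B", "C"], "B": ["D"], "C": ["D"], "D": []}
bfs_result = bfs_order(example_graph, "A")
```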
What is the RProp algorithm's convergence criterion?

The error does not improve for a predetermined number of iterations
What is depth-first search (DFS)?

An uninformed combinatorial search algorithm that expands the deepest unexpanded node first and backtracks when it reaches a dead end
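
A short sketch of depth-first search using an explicit stack. The adjacency-list graph is a made-up example.

```python
def dfs_order(graph, start):
    """Return the nodes reachable from start in the order DFS visits them."""
    visited = []
    stack = [start]
    while stack:
        # Take the most recently added node, so the search goes deep first.
        node = stack.pop()
        if node in visited:
            continue
        visited.append(node)
        # Push neighbors in reverse so the first-listed neighbor is visited first.
        stack.extend(reversed(graph[node]))
    return visited

# Hypothetical graph: A branches to B and C, which both lead to D.
example_graph = {"A": ["B", "C"], "B": ["D"], "C": ["D"], "D": []}
dfs_result = dfs_order(example_graph, "A")
```

On this graph the search reaches D through B before it ever looks at C.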
What is an agent in reinforcement learning?

The entity that receives rewards or punishments and learns from them
What is the Quickprop algorithm used for?

Neural network training
What is the "kernel trick" in the context of support vector machines?

A method for computing dot products in a higher-dimensional feature space through a kernel function, without explicitly mapping the input data into that space
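
The idea can be checked numerically with a small sketch. It assumes a degree-2 polynomial kernel on 2-D inputs, for which the explicit feature map is known; the function names and sample points are chosen for this example.

```python
import math

def dot(u, v):
    """Ordinary dot product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))

def poly2_kernel(x, y):
    """Degree-2 polynomial kernel: computed entirely in the original 2-D space."""
    return dot(x, y) ** 2

def explicit_features(x):
    """The 3-D feature map whose dot product the kernel above reproduces."""
    x1, x2 = x
    return (x1 * x1, math.sqrt(2) * x1 * x2, x2 * x2)

x, y = (1.0, 2.0), (3.0, 4.0)
via_kernel = poly2_kernel(x, y)
via_mapping = dot(explicit_features(x), explicit_features(y))
```

Both routes give the same number, but the kernel never builds the higher-dimensional vectors.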
What is the RProp algorithm used for?

Optimization of neural networks
What is an output alphabet?

The set of outputs that a state machine can produce
What is an uninformed combinatorial search algorithm?

A search algorithm that explores the possible solutions in a systematic way, without problem-specific knowledge to guide it
What is an action in a Markov decision process (MDP)?

A choice available to the decision maker in a given state
What is a support vector regression machine?

A support vector machine that is used for regression
What is an example of a system that uses a state transition function?

All of the above
What is an input alphabet?
The set of inputs that a state machine can receive
What is an advantage of using a Q-learning algorithm?

It can learn from its mistakes and improve over time
What is the gradient descent in the backpropagation algorithm?

The process of minimizing the error between the predicted output and the actual output by repeatedly moving the weights and biases in the direction of the negative gradient of the error
What is the RProp algorithm's learning rate update rule?

Each weight's step size is increased when the current and previous iteration's gradients have the
same sign and decreased when their signs differ
What is the backpropagation algorithm used for?

Training artificial neural networks
What is the error function in the backpropagation algorithm?

The function that calculates the error between the predicted output and the actual output
What is the forward pass in the backpropagation algorithm?

The process of predicting the output of the neural network based on the input data
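
A minimal sketch of a forward pass through fully connected layers, assuming sigmoid activations. The layer representation (a list of `(weights, biases)` pairs) and the function names are assumptions for this example.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def layer_forward(inputs, weights, biases):
    """One dense layer: each neuron computes sigmoid(w . x + b)."""
    return [
        sigmoid(sum(w * x for w, x in zip(neuron_weights, inputs)) + b)
        for neuron_weights, b in zip(weights, biases)
    ]

def forward(inputs, layers):
    """Pass the inputs through each layer in turn and return the final activations."""
    activation = inputs
    for weights, biases in layers:
        activation = layer_forward(activation, weights, biases)
    return activation

# Hypothetical 2-2-1 network with all weights and biases set to zero.
zero_net = [([[0.0, 0.0], [0.0, 0.0]], [0.0, 0.0]), ([[0.0, 0.0]], [0.0])]
prediction = forward([1.0, 2.0], zero_net)
```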
What is the RProp algorithm's weight update rule?

The weight moves against the sign of the derivative of the error function by that weight's current step size; the magnitude of the derivative is not used
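
In the commonly described form of RProp, only the sign of each partial derivative is used and every weight keeps its own step size. The sketch below follows a simplified variant without weight backtracking; the factors 1.2 and 0.5 and the step bounds are widely used defaults assumed here, and the function names are chosen for this example.

```python
def sign(x):
    return (x > 0) - (x < 0)

def rprop_step(weight, grad, prev_grad, step,
               eta_plus=1.2, eta_minus=0.5, step_min=1e-6, step_max=50.0):
    """One RProp update for a single weight; returns (weight, grad_to_store, step)."""
    if grad * prev_grad > 0:
        # Same sign as last time: speed up.
        step = min(step * eta_plus, step_max)
    elif grad * prev_grad < 0:
        # Sign flipped, so a minimum was overshot: slow down and skip this move.
        step = max(step * eta_minus, step_min)
        grad = 0.0
    # Move against the sign of the gradient by the current step size.
    weight -= sign(grad) * step
    return weight, grad, step
```

The returned gradient is what the caller should store as `prev_grad` for the next iteration.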
What is the goal of a Markov decision process (MDP)?

To maximize the expected cumulative reward over a given time horizon
What is the cost function used in backpropagation?

Commonly the mean squared error
What is the input layer in the backpropagation algorithm?

The layer that receives the input data and passes it to the hidden layer
What is the input layer of a feedforward neural network called?

Input layer
What is the hidden layer in the backpropagation algorithm?

A layer between the input and output layers whose activations are not observed directly; it processes the data passed on from the previous layer
What is the bias update rule in backpropagation?

B(i) = B(i) - alpha * delta(i)
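
The rule above can be applied to a whole layer at once. In this sketch `alpha` is the learning rate and `deltas` holds the error term delta(i) of each neuron; the function name and sample numbers are assumptions for illustration.

```python
def update_biases(biases, deltas, alpha):
    """Apply B(i) = B(i) - alpha * delta(i) to every bias and return the new list."""
    return [b - alpha * d for b, d in zip(biases, deltas)]

# Hypothetical layer with two neurons.
new_biases = update_biases([0.5, -0.2], [0.1, -0.4], alpha=0.5)
```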
What is the goal of the backpropagation algorithm in each iteration?

To minimize the error between the predicted output and the actual output in each iteration
What is the backpropagation algorithm used for?
All of the above
What is the learning rate in the backpropagation algorithm?

The step-size factor that controls how quickly the model learns from the training data
What is the bias update rule in the backpropagation algorithm?

The mathematical formula that is used to update the biases based on the gradient descent
What is the main advantage of the Quickprop algorithm over the backpropagation algorithm?
It often converges in fewer iterations
What is the backward pass in the backpropagation algorithm?

The process of propagating the error from the output layer back through the network to compute the gradients used to adjust the weights and biases
What is the learning rate in backpropagation?

The hyperparameter that determines the size of the weight and bias updates
What is the goal of reinforcement learning?

To maximize the cumulative reward over time
