AI Fundamentals Finals

What is a finite state machine?

A state machine that can only be in a finite number of states

How does the RProp algorithm handle the initialization of weights in the optimization process?
It uses random weights

What are the biases in the backpropagation algorithm?


Learnable offset terms added to each neuron's weighted input sum before the activation function is applied

How does the Quickprop algorithm improve upon traditional gradient descent algorithms?
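It uses a second-order approximation of the error curve (a parabola fitted from the current and previous gradients), which gives each weight an effectively variable step size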

How does the RProp algorithm handle local minima in the optimization process?
It does not guarantee escaping local minima, but its per-weight dynamic step size can help it move past flat regions and shallow minima

What are the hidden layers of a feedforward neural network called?


Hidden layers

What is a decision tree?


A tree-like model used to make predictions in supervised learning

How does the Quickprop algorithm handle weight updates that are too large?
It limits them with a maximum growth factor, so a step cannot exceed a fixed multiple of the previous step

How does the RProp algorithm adjust the learning rate?


It increases a weight's step size when the gradient keeps the same sign across iterations and decreases
it when the sign flips

How does the backpropagation algorithm work?


By propagating the output error backward through the network (chain rule) to compute gradients, then adjusting the weights and biases

What is a disadvantage of using a Q-learning algorithm?


In its tabular form, it cannot directly handle continuous state spaces

What is a backpropagation network?


A type of artificial neural network that uses supervised learning

How is a state transition function typically represented?


As a function of state and input

What is a Markov decision process (MDP)?


A mathematical model for decision-making in situations where outcomes are partly random and
partly under the control of a decision maker

What is a deterministic finite state machine?
A finite state machine that has a unique next state for each combination of current state and
input
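
The definition above can be illustrated with a minimal sketch. The machine, its state names ("even", "odd") and the helper `run_dfa` are hypothetical examples, not part of the course material; the table holds exactly one next state per (state, input) pair.

```python
# Hypothetical DFA over the input alphabet {"0", "1"} that accepts
# binary strings containing an even number of 1s.
TRANSITIONS = {
    ("even", "0"): "even",
    ("even", "1"): "odd",
    ("odd", "0"): "odd",
    ("odd", "1"): "even",
}

def run_dfa(inputs, start="even", accepting=frozenset({"even"})):
    """Feed each symbol through the table; return (final_state, accepted)."""
    state = start
    for symbol in inputs:
        # Deterministic: the lookup yields a single next state.
        state = TRANSITIONS[(state, symbol)]
    return state, state in accepting

print(run_dfa("1001"))
```

A nondeterministic machine would instead map each (state, input) pair to a set of possible next states.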

How does the Quickprop algorithm adjust the learning rate for each weight in the neural
network?
It adjusts the learning rate based on the previous weight update

What are the weights in the backpropagation algorithm?


The values that are used to determine the output of the neural network

How does a backpropagation network learn?


By adjusting the weights and biases of the network based on the input and output data

What are the components of a Markov decision process (MDP)?


States, actions, transition probabilities, and rewards (often with a discount factor)

What is a Q-learning algorithm?


A reinforcement learning algorithm that uses a Q-table to store the expected cumulative reward (Q-value) for each
action in each state
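
As a rough sketch of how such a Q-table is updated, the snippet below applies the standard one-step Q-learning rule. The state and action names and the values of `alpha` (learning rate) and `gamma` (discount factor) are illustrative assumptions.

```python
from collections import defaultdict

def q_update(q_table, state, action, reward, next_state, actions,
             alpha=0.5, gamma=0.9):
    """Apply one tabular Q-learning update and return the new Q(state, action)."""
    # Value of the best action available in the next state.
    best_next = max(q_table[(next_state, a)] for a in actions)
    td_target = reward + gamma * best_next
    # Move the stored estimate a fraction alpha toward the target.
    q_table[(state, action)] += alpha * (td_target - q_table[(state, action)])
    return q_table[(state, action)]

q = defaultdict(float)          # unseen (state, action) pairs start at 0.0
actions = ["left", "right"]     # hypothetical action set
q_update(q, "s0", "right", 1.0, "s1", actions)
```

Because the table needs one entry per (state, action) pair, this form only works when states and actions can be enumerated.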

What is a kernel in the context of support vector machines (SVMs)?


A mathematical function that measures the similarity of two data points as an inner product in a higher-dimensional space

What is a supervised learning algorithm?


An algorithm that can learn from labeled data

What is a policy in a Markov decision process (MDP)?


A set of rules for selecting actions in a given state

What is a kernel function in the context of support vector machines?


A function that maps the input data into a higher-dimensional space

What is a state machine?


A machine that changes its state based on a state transition function

What is a nondeterministic finite state machine?


A finite state machine that has multiple possible next states for each combination of current
state and input

What is a linear support vector machine?


A support vector machine that uses a linear kernel function

What is a policy in reinforcement learning?


A mapping from states to actions that defines how the agent behaves

What is a reward in a Markov decision process (MDP)?
A numerical value that represents the utility or value of a state or action

What is a linear regression algorithm?


A model used to make predictions in supervised learning by fitting a linear equation to the data
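
A minimal sketch of fitting such a linear equation, assuming a single input feature and ordinary least squares; the function name `fit_line` and the sample data are made up for illustration.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept for one feature."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance of x and y divided by the variance of x gives the slope.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

slope, intercept = fit_line([0, 1, 2, 3], [1, 3, 5, 7])
prediction = slope * 4 + intercept   # predict y for a new input x = 4
```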

What is a state in a Markov decision process (MDP)?


A representation of the current situation

What is a state in reinforcement learning?


The current condition or situation of the agent

What is a logistic regression algorithm?


A model used for classification tasks by predicting the probability of an event occurring

What is a support vector in the context of support vector machines?


A data point that is used to define the decision boundary

What is a state transition function?


A function that determines the next state of a system based on its current state and input

What is a reward in reinforcement learning?


A positive or negative feedback received by the agent for its actions

What is a support vector machine used for?


Classification

What is a multi-class support vector machine?


A support vector machine that can handle multiple classes of data

What is a soft margin in the context of support vector machines?


A margin that allows some training points to violate it, with violations penalized by the hinge loss function
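
To make the hinge loss concrete, here is a small sketch of one common form of the soft-margin objective for a linear SVM. The function names, the penalty parameter `C`, and the data points are illustrative assumptions; labels are taken to be +1 or -1.

```python
def hinge_loss(label, score):
    """Zero when the point is on the correct side with margin >= 1, linear otherwise."""
    return max(0.0, 1.0 - label * score)

def soft_margin_objective(w, b, data, C=1.0):
    """0.5 * ||w||^2 plus C times the summed hinge loss over (features, label) pairs."""
    margin_term = 0.5 * sum(wi * wi for wi in w)
    penalty = sum(
        hinge_loss(label, sum(wi * xi for wi, xi in zip(w, x)) + b)
        for x, label in data
    )
    return margin_term + C * penalty

# Hypothetical points: the first satisfies the margin, the other two violate it.
data = [([2.0, 0.0], 1), ([0.5, 0.0], 1), ([-0.5, 0.0], -1)]
objective = soft_margin_objective([1.0, 0.0], 0.0, data)
```

A larger `C` punishes margin violations more heavily, which pushes the solution toward a hard margin.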

What is a non-linear support vector machine?


A support vector machine that uses a non-linear kernel function

What is reinforcement learning?


A type of machine learning where an agent learns through interacting with its environment and
receiving rewards or punishments

What is an action in reinforcement learning?


A decision made by the agent based on its current state

What is the Bellman equation in a Markov decision process (MDP)?


A recursive equation used to compute the value function of a state
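
One way to see the recursion at work is value iteration, which applies the Bellman optimality backup repeatedly until the values settle. This is a sketch under assumptions: the dictionary layout for `transition` and `reward`, the tolerance, and the one-state example are all hypothetical.

```python
def value_iteration(states, actions, transition, reward, gamma=0.9, tol=1e-8):
    """Repeat the Bellman optimality backup until values change by less than tol.

    transition[(s, a)] is a list of (probability, next_state) pairs and
    reward[(s, a)] is the immediate reward (assumed input format).
    """
    values = {s: 0.0 for s in states}
    while True:
        delta = 0.0
        for s in states:
            # V(s) = max over a of [ R(s, a) + gamma * sum_s' P(s'|s, a) * V(s') ]
            best = max(
                reward[(s, a)]
                + gamma * sum(p * values[s2] for p, s2 in transition[(s, a)])
                for a in actions
            )
            delta = max(delta, abs(best - values[s]))
            values[s] = best
        if delta < tol:
            return values

# One state that loops to itself with reward 1: V = 1 / (1 - gamma).
values = value_iteration(
    ["s"], ["loop"], {("s", "loop"): [(1.0, "s")]}, {("s", "loop"): 1.0}
)
```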

What is a support vector machine?
A type of supervised learning algorithm that separates classes with a maximum-margin decision boundary (it is not a neural network)

What is an example of a reinforcement learning problem?


Training a robot to navigate through a maze

What is breadth-first search (BFS)?


An uninformed combinatorial search algorithm that expands all nodes at one depth level before moving on to the next level
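
A short sketch of BFS on a small graph, assuming an adjacency-list dictionary; the graph and the name `bfs_order` are made up for illustration. The first-in, first-out queue is what produces the level-by-level order.

```python
from collections import deque

def bfs_order(graph, start):
    """Return nodes in the order BFS visits them, level by level."""
    order = [start]
    seen = {start}
    queue = deque([start])
    while queue:
        node = queue.popleft()      # FIFO: oldest discovered node first
        for neighbor in graph.get(node, []):
            if neighbor not in seen:
                seen.add(neighbor)
                order.append(neighbor)
                queue.append(neighbor)
    return order

graph = {"A": ["B", "C"], "B": ["D"], "C": ["E"], "D": [], "E": []}
print(bfs_order(graph, "A"))
```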

What is the RProp algorithm's convergence criterion?


The error does not improve for a predetermined number of iterations

What is depth-first search (DFS)?


An uninformed combinatorial search algorithm that expands the deepest unexpanded node first, backtracking only when a branch is exhausted
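
A comparable sketch for DFS, again assuming an adjacency-list dictionary with hypothetical node names. Here a last-in, first-out stack makes the search follow one branch to the end before backtracking.

```python
def dfs_order(graph, start):
    """Return nodes in the order an iterative DFS visits them."""
    order = []
    seen = set()
    stack = [start]
    while stack:
        node = stack.pop()          # LIFO: most recently discovered node first
        if node in seen:
            continue
        seen.add(node)
        order.append(node)
        # Push neighbors reversed so the first listed neighbor is explored first.
        stack.extend(reversed(graph.get(node, [])))
    return order

graph = {"A": ["B", "C"], "B": ["D"], "C": ["E"], "D": [], "E": []}
print(dfs_order(graph, "A"))
```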

What is an agent in reinforcement learning?


The entity that receives rewards or punishments and learns from them

What is the Quickprop algorithm used for?


Neural network training

What is the "kernel trick" in the context of support vector machines?


A method for avoiding the explicit mapping of the input data into a higher-dimensional space by computing the dot product there directly with a kernel function
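
A small numerical check can make the idea tangible. The sketch below assumes 2-D inputs and the degree-2 polynomial kernel k(x, y) = (x . y)^2, and compares it with a dot product taken after an explicit feature map `phi`; all names and sample vectors are illustrative.

```python
import math

def poly_kernel(x, y):
    """Degree-2 polynomial kernel computed in the original 2-D space."""
    return (x[0] * y[0] + x[1] * y[1]) ** 2

def phi(x):
    """Explicit feature map into 3-D space that matches poly_kernel."""
    return (x[0] ** 2, math.sqrt(2) * x[0] * x[1], x[1] ** 2)

def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))

x, y = (1.0, 2.0), (3.0, 4.0)
via_kernel = poly_kernel(x, y)        # never builds the 3-D vectors
via_mapping = dot(phi(x), phi(y))     # builds them explicitly
```

Both routes give the same number up to floating-point rounding, but the kernel route never constructs the higher-dimensional vectors.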

What is the RProp algorithm used for?


Optimization of neural networks

What is an output alphabet?


The set of outputs that a state machine can produce

What is an uninformed combinatorial search algorithm?


A search algorithm that explores all possible solutions in a systematic way without using problem-specific heuristics

What is an action in a Markov decision process (MDP)?


A choice available to the decision maker in a given state

What is a support vector regression machine?


A support vector machine that is used for regression

What is an example of a system that uses a state transition function?


All of the above

What is an input alphabet?
The set of inputs that a state machine can receive

What is an advantage of using a Q-learning algorithm?


It can learn from its mistakes and improve over time

What is the gradient descent in the backpropagation algorithm?


The process of minimizing the error between the predicted output and the actual output by repeatedly moving the weights in the direction of the negative gradient

What is the RProp algorithm's learning rate update rule?


The per-weight step size is updated based on whether the current and previous
iteration's gradients have the same sign

What is the backpropagation algorithm used for?


Training artificial neural networks

What is the error function in the backpropagation algorithm?


The function that calculates the error between the predicted output and the actual output

What is the forward pass in the backpropagation algorithm?


The process of predicting the output of the neural network based on the input data
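
As an illustration, a forward pass through a network with one hidden layer and a single output might look like the sketch below. The sigmoid activation, the layer sizes, and the weight values are assumptions chosen for the example.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def forward(x, w_hidden, b_hidden, w_out, b_out):
    """Compute hidden activations and the output for one input vector."""
    # Each hidden neuron: weighted sum of the inputs plus its bias, then sigmoid.
    hidden = [
        sigmoid(sum(w * xi for w, xi in zip(row, x)) + b)
        for row, b in zip(w_hidden, b_hidden)
    ]
    # Output neuron: weighted sum of the hidden activations plus its bias.
    output = sigmoid(sum(w * h for w, h in zip(w_out, hidden)) + b_out)
    return hidden, output

# Hypothetical 2-input, 2-hidden-neuron network.
hidden, output = forward(
    [1.0, 0.5],
    w_hidden=[[0.2, -0.4], [0.7, 0.1]],
    b_hidden=[0.0, -0.1],
    w_out=[0.5, -0.3],
    b_out=0.05,
)
```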

What is the RProp algorithm's weight update rule?


The weight moves by its current step size in the direction opposite to the sign of the derivative of the error function; the derivative's magnitude is ignored
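
In RProp as it is commonly described, only the sign of the gradient drives the update, and each weight keeps its own step size. The sketch below follows the iRprop- variant, which is an assumption, as are the function name and the widely used factors 1.2 and 0.5.

```python
def rprop_step(weight, grad, prev_grad, step,
               eta_plus=1.2, eta_minus=0.5, step_min=1e-6, step_max=50.0):
    """One iRprop- style update for a single weight.

    Returns (new_weight, gradient_to_remember, new_step).
    """
    product = grad * prev_grad
    if product > 0:                 # same sign as last time: speed up
        step = min(step * eta_plus, step_max)
    elif product < 0:               # sign flipped: overshot, slow down
        step = max(step * eta_minus, step_min)
        grad = 0.0                  # skip the move and the next adaptation
    sign = (grad > 0) - (grad < 0)
    weight -= sign * step           # gradient magnitude is not used
    return weight, grad, step

# Minimize the hypothetical error E(w) = w**2, whose derivative is 2 * w.
w, prev_g, step = 3.0, 0.0, 0.1
for _ in range(60):
    w, prev_g, step = rprop_step(w, 2.0 * w, prev_g, step)
```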

What is the goal of a Markov decision process (MDP)?


To maximize the expected cumulative reward over a given time horizon

What is the cost function used in backpropagation?


The mean squared error

What is the input layer in the backpropagation algorithm?


The layer that receives the input data and passes it to the hidden layer

What is the input layer of a feedforward neural network called?


Input layer

What is the hidden layer in the backpropagation algorithm?


The layer that is not visible to the user and processes the input data

What is the bias update rule in backpropagation?


B(i) = B(i) - alpha * delta(i)
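
The rule can be sketched in a few lines, assuming `delta(i)` is the error term already computed for neuron i in the backward pass and `alpha` is the learning rate; the function name and sample values are illustrative.

```python
def update_biases(biases, deltas, alpha=0.1):
    """Return new biases after one step of B(i) = B(i) - alpha * delta(i)."""
    return [b - alpha * d for b, d in zip(biases, deltas)]

# Hypothetical layer with two neurons.
new_biases = update_biases([0.5, -0.2], [1.0, -2.0], alpha=0.1)
```

A positive delta lowers the bias and a negative delta raises it, with `alpha` setting the size of the change.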

What is the goal of the backpropagation algorithm in each iteration?


To minimize the error between the predicted output and the actual output in each iteration

What is the backpropagation algorithm used for?
All of the above

What is the learning rate in the backpropagation algorithm?


The rate at which the model learns from the training data

What is the bias update rule in the backpropagation algorithm?


The mathematical formula that is used to update the biases based on the gradient descent

What is the main advantage of the Quickprop algorithm over the backpropagation algorithm?
It is faster to converge

What is the backward pass in the backpropagation algorithm?


The process of adjusting the weights and biases in the backward direction

What is the learning rate in backpropagation?


The hyperparameter that determines the size of the weight and bias updates

What is the goal of reinforcement learning?


To maximize the cumulative reward over time
