University Institute of Engineering Department of Computer Science and Engg
University Institute of Engineering Department of Computer Science and Engg
University Institute of Engineering Department of Computer Science and Engg
ENGINEERING
DEPARTMENT OF COMPUTER SCIENCE
AND ENGG.
Bachelor of Engineering (Computer Science & Engineering)
Artificial Intelligence and Machine Learning(21CSH-316)
Prepared by:
Sitaram patel(E13285)
2
Course Objectives
3
Syllabus
• UNIT-I
4
Probability, Statistics, and Linear Algebra for Machine Learning
5
Probability, Statistics, and Linear Algebra for Machine Learning
• Statistics in Machine Learning
• Definition: Statistics involves collecting, analyzing, interpreting, presenting, and
organizing data.
• Importance in ML:
• Statistical techniques help us make inferences from data and validate ML models.
• Hypothesis testing assesses the significance of relationships between variables.
• Sampling techniques enable efficient data collection and analysis.
Key Statistical Concepts:
• Descriptive Statistics: Summarizing and visualizing data using measures such as mean,
median, mode, and standard deviation.
• Inferential Statistics: Making predictions or drawing conclusions about a population
based on a sample.
• Hypothesis Testing: Assessing the likelihood of a hypothesis given sample data.
• Correlation and Regression: Analyzing the relationship between variables and making
predictions. 6
Probability, Statistics, and Linear Algebra for Machine Learning
• Linear Algebra in Machine Learning
Definition: Linear algebra deals with vectors, matrices, and linear transformations.
• Importance in ML:
ML models often represent data as matrices and utilize linear algebra operations for
optimization and analysis.
Linear algebra provides the foundation for understanding and implementing
Various ML algorithms.
• Linear Algebra Concepts
Vectors and Matrices: Representing and manipulating multi-dimensional data structures.
• Matrix Operations: Addition, subtraction, multiplication, and transposition.
• Eigen values and Eigenvectors: Important concepts for dimensionality reduction
and feature extraction.
• Singular Value Decomposition (SVD): A matrix factorization technique used in
various ML algorithms.
7
convex optimization, data visualization, hypothesis function And testing, data
distributions
• Convex Optimization
• Definition: Convex Optimization refers to the optimization of convex
functions over convex sets. It involves finding the global minimum or
maximum of a convex function, subject to a set of constraints.
• Applications: Parameter estimation, Support Vector Machines, Linear
Regression, Neural Networks, etc.
• Topic: Key Concepts in Convex Optimization
• Convex Sets: Sets that satisfy the condition that a line segment connecting any two
points in the set lies entirely within the set.
• Convex Functions: Functions that satisfy the condition that the line segment
connecting any two points on the function lies above or on the function.
• Global Minima/Maxima: Points at which the convex function attains its lowest or
highest value, respectively.
8
convex optimization, data visualization, hypothesis function And testing, data distributions
Data Visualization
Definition: Data Visualization is the graphical representation of data to uncover patterns, trends, and relationships that
are not immediately evident in raw data.
Importance: Helps in understanding complex datasets, communicating insights effectively, and aiding decision-
making processes.
Techniques: Scatter plots, line graphs, bar charts, histograms, heatmaps, etc.
Topic: Hypothesis Function and Testing
Definition: Hypothesis Function is a function that maps input variables to predicted output values. Hypothesis Testing
involves evaluating the validity of a hypothesis or a claim about a population based on sample data.
Steps in Hypothesis Testing:
Formulate null and alternative hypotheses.
Collect sample data.
Determine a statistical test and significance level.
Calculate test statistics and p-value.
Make a decision based on the p-value and the chosen significance level.
9
convex optimization, data visualization, hypothesis function And testing, data distributions
• Topic: Data Distributions
• Definition: Data Distributions describe the possible values and their probabilities in a dataset.
• Common Distributions:
• Normal Distribution (Gaussian)
• Uniform Distribution
• Binomial Distribution
• Exponential Distribution
• Poisson Distribution
• Log-Normal Distribution
• Importance: Understanding data distributions helps in making assumptions, selecting appropriate
statistical tests, and generating realistic synthetic data.
• Topic: Examples of Data Distributions
• Show visual examples of different data distributions, highlighting their shapes and characteristics.
• Summary: Convex Optimization provides a powerful framework for optimization problems. Data
Visualization enables us to gain insights from complex data. Understanding Hypothesis Function and
Testing is essential for making statistical inferences. Knowledge of Data Distributions helps in
understanding and analyzing datasets effectively. 10
data preprocessing, data augmentation, normalizing data sets
• Video Link-
• https://www.youtube.com/watch?v=9f-GarcDY58
• https://www.youtube.com/watch?v=GwIo3gDZCVQ
• Web Link-
• https://data-flair.training/blogs/types-of-machine-learning-algorithms/
• https://towardsdatascience.com/machine-learning-an-introduction-23b84d51e6d0
• https://towardsdatascience.com/introduction-to-machine-learning-f41aabc55264
14
THANK YOU