Class 9 AI Project Cycle Notes
Class 9 AI Project Cycle Notes
The 4W's of Problem Scoping are Who, What, Where and Why. These Ws helps in identifying
and understanding the problem in a better and efficient manner.
1. Who - "Who" part helps us in comprehending and categorizing who all are affected directly
and indirectly with the problem and who are called the Stake Holders
2. What - "What" part helps us in understanding and identifying the nature of the problem and
under this block, you also gather evidence to prove that the problem you have selected actually
exists.
3. Where - "Where" does the problem arises, situation and the location.
The Problem Statement Template helps us to summarize all the key points into one single.
Template so that in future, whenever there is need to look back at the basis of the problem, we
can take a look at the Problem Statement Template and understand the key elements of it.
Data Acquisition
Data Acquisition is he process of collecting accurate and reliable data to work with. Data Can
be in the format of text, video, images, audio and so on and it can be collected from carious
source like the interest, journals, newspapers and so on.
What is data?
➢ Data can be a piece of information or facts and statistics collected together for reference or analysis.
➢ Whenever we want an AI project to be able to predict an output, we need to train it first using data.
Data features
➢ Data features refer to the type of data you want to collect.
➢ data features would be salary amount, increment percentage, increment period, bonus, etc.
Types of Data:
Data can be classified based on the following:
a) Numerical Data: Mainly used for computation.
i) Discrete data: Discrete data contains only integer numeric data. It does not have any
decimal or fractional value.
ii) Continuous data: It represents data with any range. The uncountable data can be
represented in this category,
b) Text data: mainly represents names, collection of words together, phrases and textual
information.
c) Structural Classification:
i) Structured data: can have a specific pattern or set of rules. • These data have a simple
structure and stores the data in specific forms such as tabular form. Eg: The cricket
scoreboard, school time table, Exam datasheet etc.
ii) Unstructured data: doesn't have any specific pattern or constraints as well as can be stored
in any form is known as unstructured data. • Mostly the data that exists in the world is
unstructured data. • Eg: Videos, Facebook Photos, Dashboard data of any reporting tool.
iii) Semi-structured data: combination of both structured and unstructured data. Some data can
have a structure like a database whereas some data can have markers and tags to identify the
structure of data.
d) Other
i) Time-Stamped data: This structure helps the system to predict the next best action. It is
following a specific time-order to define the sequence. This time can be the time of data
captured or processed or collected.
ii) Machine data: The result or output of a specific program, system or technology considered
as machine data. It consists of data related to a user's interaction with the system like the
user's logged-in session data. , specific search records, user engagement such as comments,
likes and shares etc
iii) Spatiotemporal data: The data which contains information related to geographical location
and time is considered as spatiotemporal data. It records the location through GPS and time-
stamped data where the event is captured or data is collected.
iv) Open data: It is freely available data for everyone. Anyone can reuse this kind of data.
v) Real Time data: The data which is available with the event is considered as real-time data.
vi) Big data: You may hear this word most often. The data which cannot be stored by any
system or traditional data collection software like DBMS or RDBMS software can be
considered as Big data.
Data Features: types of data you want to collect.
Here two terms are associated with this:
1. Training Data: The collected data through the system is known as training data. In other words the input given
by the user in the system can be considered as training data.
2. Testing Data: The result data set or processed data is known as testing data. In other words, the output of the
data is known as testing data.
Eg: to make an Artificially Intelligent system which can predict the salary of any employee based on his previous
salaries.
➢ The previous salary data here is known as Training Data while the next salary prediction data set is known as
the Testing Data
➢ For any AI project to be efficient, the training data should be authentic and relevant to the problem statement
scoped. .
Data Sources
System Maps:
System maps help us to find the relationship between the elements of the problem which we have
scoped.
A system map shows the components and boundary of a system and the components of the
environment at a point in time.
The main use of a system map is to help structure a system and communicate the result to others.
It helps us in strategizing the solution for achieving the goal of our project.
help to understand complex issues with multiple factors that affect each other
Circles represents the elements,
Arrows represents the relationship between the elements. Length of arrow represents time for a
change to happen. This is time delay. The arrow- head depicts the direction of the effect and the
sign (+ or -) shows their relationship. If the arrow goes from X to Y with a + sign, it means that
both are directly related to each other. If the arrow goes from X to Y with a – sign, it means that
both the elements are inversely related to each other.
Loops represent a specific chain of causes and effects.
To change the outcome of a system, as a change maker, either change the elements in a system or
change the relationships between elements.
Data Exploration
Data Exploration is the process of arranging the gathered data uniformly for a better
understanding. Data can be arranged in the form of a table, plotting a chart or making
database.
If we simplify this Data Exploration means that the data which we collected in Data
Acquisition, in Data Exploration we need to arrange it for example if we have data of 50 students
in a class, we have their Mobile Number, Date of Birth, Class, Etc .
In the process of data exploration we can make a chart for that data in which all the names will
be at one place and all the mobile numbers at one etc.
To summarize, data exploration refers to the techniques and tools to visualize data through
complex statistical methods.
Modelling:
As shown in the Venn diagram, Artificial Intelligence is the umbrella terminology which covers
machine and deep learning under it and Deep Learning comes under Machine Learning. It is a funnel
type approach where there are a lot of applications of AI out of which few are those which come under
ML, out of which very few go onto DL.
AI vs ML vs DL
AI modeling refers to developing algorithms, also called models which can be trained to get
intelligent outputs. That is, writing codes to make a machine artificially intelligent.
Machine learning is a subset of artificial Intelligence (AI) which provides machines the ability
to learn automatically and improve from experience without being programmed for it.
Types of Machine Learning
Important Term:
Datasets
Dataset is a collection of related sets of Information that is composed of separate elements but
can be manipulated by a computer as a unit.
Evaluation
Evaluation is the method of understanding the reliability of an AI project and is based on the output
which is received by feeding the data into the model and comparing the output with the actual answers.
There can be different Evaluation techniques, depending of the type and purpose of the model.
Note: If AI model remembers the training set and always predicts the correct label for any point
in the training set, this is known as overfitting.
Performance and efficiency of the model is calculated on the basis of the following parameters:
1. Accuracy
2. Precision
3. Recall
4. F1 Score
Model Evaluation Terminologies
1. True Positive (TP)
2. True Negative (TN)
3. False Positive (FP)
4. False Negative (FN)
Deployment
Deployment is the final stage in the AI project cycle where the AI model or solution is implemented in a
real-world scenario.
Key steps in Deployment are:
1. Testing and validation of AI model
2. Integration of the model with the existing system
3. Monitoring and maintenance of the deployed model