Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
0% found this document useful (0 votes)
564 views

Class 9 AI Project Cycle Notes

Al project cycle notes
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
564 views

Class 9 AI Project Cycle Notes

Al project cycle notes
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 8

AI Project Cycle Overview

What is Project Cycle?


Project Cycle is a step by step process to solve the problems using proven scientific methods
and drawing the inference about it.
Components of the AI Project Cycle:

Problem Scoping Understanding the problem

Data Acquisition Collecting accurate and reliable data

Data Exploration Arranging the data uniformly

Modelling Creating Models from the data

Evaluation Evaluating the project

Deployment Launching the AI Project


Problem Scoping

4Ws of Problem Scoping

The 4W's of Problem Scoping are Who, What, Where and Why. These Ws helps in identifying
and understanding the problem in a better and efficient manner.

1. Who - "Who" part helps us in comprehending and categorizing who all are affected directly
and indirectly with the problem and who are called the Stake Holders

2. What - "What" part helps us in understanding and identifying the nature of the problem and
under this block, you also gather evidence to prove that the problem you have selected actually
exists.

3. Where - "Where" does the problem arises, situation and the location.

4. Why - "Why" is the given problem worth solving.

Problem Statement Template

The Problem Statement Template helps us to summarize all the key points into one single.

Template so that in future, whenever there is need to look back at the basis of the problem, we
can take a look at the Problem Statement Template and understand the key elements of it.
Data Acquisition

Data Acquisition is he process of collecting accurate and reliable data to work with. Data Can
be in the format of text, video, images, audio and so on and it can be collected from carious
source like the interest, journals, newspapers and so on.

What is data?
➢ Data can be a piece of information or facts and statistics collected together for reference or analysis.
➢ Whenever we want an AI project to be able to predict an output, we need to train it first using data.
Data features
➢ Data features refer to the type of data you want to collect.
➢ data features would be salary amount, increment percentage, increment period, bonus, etc.
Types of Data:
Data can be classified based on the following:
a) Numerical Data: Mainly used for computation.
i) Discrete data: Discrete data contains only integer numeric data. It does not have any
decimal or fractional value.
ii) Continuous data: It represents data with any range. The uncountable data can be
represented in this category,
b) Text data: mainly represents names, collection of words together, phrases and textual
information.
c) Structural Classification:
i) Structured data: can have a specific pattern or set of rules. • These data have a simple
structure and stores the data in specific forms such as tabular form. Eg: The cricket
scoreboard, school time table, Exam datasheet etc.
ii) Unstructured data: doesn't have any specific pattern or constraints as well as can be stored
in any form is known as unstructured data. • Mostly the data that exists in the world is
unstructured data. • Eg: Videos, Facebook Photos, Dashboard data of any reporting tool.
iii) Semi-structured data: combination of both structured and unstructured data. Some data can
have a structure like a database whereas some data can have markers and tags to identify the
structure of data.
d) Other
i) Time-Stamped data: This structure helps the system to predict the next best action. It is
following a specific time-order to define the sequence. This time can be the time of data
captured or processed or collected.
ii) Machine data: The result or output of a specific program, system or technology considered
as machine data. It consists of data related to a user's interaction with the system like the
user's logged-in session data. , specific search records, user engagement such as comments,
likes and shares etc
iii) Spatiotemporal data: The data which contains information related to geographical location
and time is considered as spatiotemporal data. It records the location through GPS and time-
stamped data where the event is captured or data is collected.
iv) Open data: It is freely available data for everyone. Anyone can reuse this kind of data.
v) Real Time data: The data which is available with the event is considered as real-time data.
vi) Big data: You may hear this word most often. The data which cannot be stored by any
system or traditional data collection software like DBMS or RDBMS software can be
considered as Big data.
Data Features: types of data you want to collect.
Here two terms are associated with this:
1. Training Data: The collected data through the system is known as training data. In other words the input given
by the user in the system can be considered as training data.
2. Testing Data: The result data set or processed data is known as testing data. In other words, the output of the
data is known as testing data.
Eg: to make an Artificially Intelligent system which can predict the salary of any employee based on his previous
salaries.
➢ The previous salary data here is known as Training Data while the next salary prediction data set is known as
the Testing Data
➢ For any AI project to be efficient, the training data should be authentic and relevant to the problem statement
scoped. .
Data Sources

System Maps:
 System maps help us to find the relationship between the elements of the problem which we have
scoped.
 A system map shows the components and boundary of a system and the components of the
environment at a point in time.
 The main use of a system map is to help structure a system and communicate the result to others.
 It helps us in strategizing the solution for achieving the goal of our project.
 help to understand complex issues with multiple factors that affect each other
 Circles represents the elements,
 Arrows represents the relationship between the elements. Length of arrow represents time for a
change to happen. This is time delay. The arrow- head depicts the direction of the effect and the
sign (+ or -) shows their relationship. If the arrow goes from X to Y with a + sign, it means that
both are directly related to each other. If the arrow goes from X to Y with a – sign, it means that
both the elements are inversely related to each other.
 Loops represent a specific chain of causes and effects.
 To change the outcome of a system, as a change maker, either change the elements in a system or
change the relationships between elements.

Data Exploration
Data Exploration is the process of arranging the gathered data uniformly for a better
understanding. Data can be arranged in the form of a table, plotting a chart or making
database.

If we simplify this Data Exploration means that the data which we collected in Data
Acquisition, in Data Exploration we need to arrange it for example if we have data of 50 students
in a class, we have their Mobile Number, Date of Birth, Class, Etc .

In the process of data exploration we can make a chart for that data in which all the names will
be at one place and all the mobile numbers at one etc.

To summarize, data exploration refers to the techniques and tools to visualize data through
complex statistical methods.

Advantages of Data Visualization


 A better understanding of data
 Provides insights into data
 Allows user interaction
 Provide real-time analysis
 Help to make decisions
 Reduces complexity of data
 Provides the relationships and patterns contained within data
 Define a strategy for your data model
 Provides an effective way of communication among users

Types of Graphical Representation:


 Bar Graph: This uses either horizontal or vertical bars to categorize and compare
different quantities. The length or height of the bar corresponds to the value it
represents.
 Pie Chart: This represents data in the form of slices of a circle, where each slice
represents a category. The size of each slice is proportional to the category it represents.
 Line Chart(Graph): This shows trends over time by connecting data points with
straight lines. It's especially useful for showing changes over a period and trends.
 Pictograph: This type of graph uses pictures or icons to represent data values. Each
image or icon represents a specific number or quantity, allowing for an easy-to-
understand visual comparison.
 Histogram: This is a type of bar graph used to display frequency data. The bars are
adjacent to each other, indicating that the data is in intervals.

Modelling:

As shown in the Venn diagram, Artificial Intelligence is the umbrella terminology which covers
machine and deep learning under it and Deep Learning comes under Machine Learning. It is a funnel
type approach where there are a lot of applications of AI out of which few are those which come under
ML, out of which very few go onto DL.

AI vs ML vs DL

AI modeling refers to developing algorithms, also called models which can be trained to get
intelligent outputs. That is, writing codes to make a machine artificially intelligent.
Machine learning is a subset of artificial Intelligence (AI) which provides machines the ability
to learn automatically and improve from experience without being programmed for it.
Types of Machine Learning

Rule Based Approach


Rule Based Approach Refers to the AI modelling where the relationship or patterns in data are defined
by the developer. The machine follows the rules or instructions mentioned by the developer, and
performs its task accordingly.

Learning Based Approach


Refers to the AI modeling where the relationship or patterns in data are not defined by the developer. In
this approach, random data is fed into the machine and it is left on the machine to figure out patterns and
trends out of it.
Generally this approach is followed when the data is unlabeled and too random for human to make sense
out of it.

Important Term:
Datasets

Dataset is a collection of related sets of Information that is composed of separate elements but
can be manipulated by a computer as a unit.

Evaluation
Evaluation is the method of understanding the reliability of an AI project and is based on the output
which is received by feeding the data into the model and comparing the output with the actual answers.
There can be different Evaluation techniques, depending of the type and purpose of the model.

Note: If AI model remembers the training set and always predicts the correct label for any point
in the training set, this is known as overfitting.
Performance and efficiency of the model is calculated on the basis of the following parameters:
1. Accuracy
2. Precision
3. Recall
4. F1 Score
Model Evaluation Terminologies
1. True Positive (TP)
2. True Negative (TN)
3. False Positive (FP)
4. False Negative (FN)
Deployment
Deployment is the final stage in the AI project cycle where the AI model or solution is implemented in a
real-world scenario.
Key steps in Deployment are:
1. Testing and validation of AI model
2. Integration of the model with the existing system
3. Monitoring and maintenance of the deployed model

You might also like