


AI Assignment

Asad Nasir - 37
Muhammad Usman Ali - 29
Momin - 49
Python Code:

import math
import pandas as pd
import numpy as np

# documents for TF-IDF
document1 = "I want to start learning to charge something in life"
document2 = "reading something about life no one else knows"
document3 = "Never stop learning"

# query
query = "life learning"

(Done By 18-SE-30)
Calculating Term Frequency:
# computing term frequency
def compute_tf(docs_list):
    for doc in docs_list:
        doc1_lst = doc.split(" ")
        wordDict_1 = dict.fromkeys(set(doc1_lst), 0)
        for token in doc1_lst:
            wordDict_1[token] += 1
        df = pd.DataFrame([wordDict_1])
        idx = 0
        new_col = ["Term Frequency"]
        df.insert(loc=idx, column='Document', value=new_col)
        print(df)

compute_tf([document1, document2, document3])
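As a quick hand check on compute_tf, using nothing beyond the sentences defined above: the token "to" occurs twice in document1 and every other token once, so the printed row for document1 should show 2 under "to" and 1 elsewhere.

# hand check of the raw counts for document1
print(document1.split(" ").count("to"))    # expected: 2
print(document1.split(" ").count("life"))  # expected: 1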

Output:

Normalized Term Frequency:


# Normalized Term Frequency
def termFrequency(term, document):
    normalizeDocument = document.lower().split()
    return normalizeDocument.count(term.lower()) / float(len(normalizeDocument))

def compute_normalizedtf(documents):
    tf_doc = []
    for txt in documents:
        sentence = txt.split()
        norm_tf = dict.fromkeys(set(sentence), 0)
        for word in sentence:
            norm_tf[word] = termFrequency(word, txt)
        tf_doc.append(norm_tf)
        df = pd.DataFrame([norm_tf])
        idx = 0
        new_col = ["Normalized TF"]
        df.insert(loc=idx, column='Document', value=new_col)
        print(df)
    return tf_doc

tf_doc = compute_normalizedtf([document1, document2, document3])
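To sanity-check the normalization (count divided by document length): document1 has 10 tokens, so the normalized TF of "life" should be 1/10 = 0.1, while document3 has 3 tokens, so "learning" should get 1/3 ≈ 0.333. A minimal check using only the function above:

# hand check: count / document length
print(termFrequency("life", document1))      # 1/10 = 0.1
print(termFrequency("learning", document3))  # 1/3 ≈ 0.3333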

Output:

(Done by 18-SE-37)

Inverse Document Frequency:


def inverseDocumentFrequency(term, allDocuments):
    numDocumentsWithThisTerm = 0
    for doc in range(0, len(allDocuments)):
        if term.lower() in allDocuments[doc].lower().split():
            numDocumentsWithThisTerm = numDocumentsWithThisTerm + 1

    if numDocumentsWithThisTerm > 0:
        return 1.0 + math.log(float(len(allDocuments)) / numDocumentsWithThisTerm)
    else:
        return 1.0

def compute_idf(documents):
    idf_dict = {}
    for doc in documents:
        sentence = doc.split()
        for word in sentence:
            idf_dict[word] = inverseDocumentFrequency(word, documents)
    return idf_dict

idf_dict = compute_idf([document1, document2, document3])
print(idf_dict)
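The IDF used here is 1 + ln(N / df) with N = 3 documents. "life" appears in document1 and document2, and "learning" in document1 and document3, so both should come out to 1 + ln(3/2) ≈ 1.4055; a word that appears in only one document, such as "stop", should get 1 + ln(3) ≈ 2.0986. Checking against the dictionary built above:

# hand check: 1 + ln(N / document frequency), with N = 3
print(idf_dict["life"])      # 1 + ln(3/2) ≈ 1.4055
print(idf_dict["learning"])  # 1 + ln(3/2) ≈ 1.4055
print(idf_dict["stop"])      # 1 + ln(3/1) ≈ 2.0986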

Output:

TF*IDF:
def compute_tfidf_with_alldocs(documents, query):
    tf_idf = []
    index = 0
    query_tokens = query.split()
    df = pd.DataFrame(columns=['doc'] + query_tokens)
    for doc in documents:
        df['doc'] = np.arange(0, len(documents))
        doc_num = tf_doc[index]
        sentence = doc.split()
        for word in sentence:
            for text in query_tokens:
                if text == word:
                    idx = sentence.index(word)
                    tf_idf_score = doc_num[word] * idf_dict[word]
                    tf_idf.append(tf_idf_score)
                    df.iloc[index, df.columns.get_loc(word)] = tf_idf_score
        index += 1
    df.fillna(0, axis=1, inplace=True)
    return tf_idf, df

documents = [document1, document2, document3]
tf_idf, df = compute_tfidf_with_alldocs(documents, query)
print("\n\nTF*IDF Output:\n")
print(df)
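Hand-checking the combined scores before looking at the printed table (these follow from the normalized TF and IDF checks above, not from any external data): for document3, tf-idf("learning") should be (1/3) × 1.4055 ≈ 0.4685, and for document1, tf-idf("life") should be 0.1 × 1.4055 ≈ 0.1405.

# hand check: normalized tf × idf for the query words
print(df.loc[2, "learning"])  # (1/3) * 1.4055 ≈ 0.4685
print(df.loc[0, "life"])      # 0.1 * 1.4055 ≈ 0.1405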

Output:

Done by 18-SE-49

Cosine Similarity:
Image("image.jpg")
# Normalized TF for the query string ("life learning")
def compute_query_tf(query):
    query_norm_tf = {}
    tokens = query.split()
    for word in tokens:
        query_norm_tf[word] = termFrequency(word, query)
    return query_norm_tf

query_norm_tf = compute_query_tf(query)
print("\n\nNormalized TF for the query string(life learning):\n")
print(query_norm_tf)
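Since each of the two query words occurs exactly once in the two-token query, both normalized TF values should come out to 1/2 = 0.5.

# hand check: one occurrence out of two query tokens
print(query_norm_tf["life"])      # 0.5
print(query_norm_tf["learning"])  # 0.5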

# idf score for the query string ("life learning")
def compute_query_idf(query):
    idf_dict_qry = {}
    sentence = query.split()
    documents = [document1, document2, document3]
    for word in sentence:
        idf_dict_qry[word] = inverseDocumentFrequency(word, documents)
    return idf_dict_qry

idf_dict_qry = compute_query_idf(query)

print("\n\nidf score for the query string(life learning):\n")
print(idf_dict_qry)

Output:
Done by 18-SE-29
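The cosine-similarity combination itself is not shown in this excerpt, so what follows is only a minimal sketch of how the pieces computed above could be combined; it is not the assignment's own code. It assumes the query vector is each query word's normalized TF (query_norm_tf) times its idf (idf_dict_qry), that each document vector is the corresponding row of the df table, and the helper name cosine_similarity is ours.

# Minimal sketch (not the assignment's code): cosine similarity between the
# query's tf-idf vector and each document's tf-idf row from df above.
def cosine_similarity(vec_a, vec_b):
    dot = sum(a * b for a, b in zip(vec_a, vec_b))
    norm_a = math.sqrt(sum(a * a for a in vec_a))
    norm_b = math.sqrt(sum(b * b for b in vec_b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

query_tokens = query.split()
# assumed query vector: normalized tf of each query word times its idf
query_vector = [query_norm_tf[w] * idf_dict_qry[w] for w in query_tokens]
for i in range(len(documents)):
    # document vector: the tf-idf scores of the query words from the df table
    doc_vector = [df.loc[i, w] for w in query_tokens]
    print("cosine(query, document%d) = %.4f" % (i + 1, cosine_similarity(query_vector, doc_vector)))

Under these assumptions, document1 scores highest for the query "life learning", since it is the only document containing both query words.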
