QB104762 2013 Regulation

This document provides lecture notes on information retrieval models and concepts. It defines key terms like cosine similarity, language models, unigram language models, vector space model assumptions, stemming, recall, precision, and latent semantic indexing. It also describes relevance feedback characteristics and disadvantages of the Boolean model. The document consists of questions and answers on various information retrieval techniques.

Uploaded by

Shruthi Nanditha P

Lecture Notes

UNIT II – INFORMATION RETRIEVAL

Part A - Questions

1. What do you mean by information retrieval models?

A retrieval model can be a description of either the computational process or the
human process of retrieval: the process of choosing documents for retrieval, and the
process by which information needs are first articulated and then refined.

2. What is cosine similarity?

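This metric is frequently used to determine the similarity between two documents, or
between a query and a document. Each is represented as a vector of term weights, and the
similarity is the cosine of the angle between the two vectors. Because the measure is
normalized by vector length, long and short documents can be compared fairly, which
simply counting the words they have in common does not allow.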
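
The computation can be sketched in a few lines of Python. This is only an illustration: it assumes raw term counts as vector weights and whitespace tokenization, neither of which is specified in these notes, and the function name is hypothetical.

```python
import math
from collections import Counter


def cosine_similarity(doc_a: str, doc_b: str) -> float:
    """Cosine of the angle between two raw term-count vectors."""
    vec_a = Counter(doc_a.lower().split())
    vec_b = Counter(doc_b.lower().split())

    # Dot product over the terms the two documents share.
    dot = sum(vec_a[t] * vec_b[t] for t in vec_a.keys() & vec_b.keys())

    # Euclidean length of each vector.
    norm_a = math.sqrt(sum(c * c for c in vec_a.values()))
    norm_b = math.sqrt(sum(c * c for c in vec_b.values()))

    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is similar to nothing
    return dot / (norm_a * norm_b)
```

Two documents with identical term proportions score 1, and two documents with no terms in common score 0.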

3. What is language model based IR?

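A language model is a probabilistic mechanism for generating text. Language
models estimate the probability distribution of various natural language phenomena.
In language-model-based IR, a model is estimated for each document, and documents are
ranked by the probability that their model generates the query.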

4. Define unigram language model.

A unigram (1-gram) language model makes the strong independence assumption
that words are generated independently from a multinomial distribution.
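
Under the independence assumption, the probability of a word sequence is simply the product of the individual word probabilities. The sketch below assumes maximum-likelihood estimates with no smoothing and whitespace tokenization; the function names are illustrative only.

```python
from collections import Counter


def unigram_model(text: str) -> dict:
    """Maximum-likelihood unigram model: P(w) = count(w) / total tokens."""
    tokens = text.lower().split()
    counts = Counter(tokens)
    total = len(tokens)
    return {word: count / total for word, count in counts.items()}


def sequence_probability(model: dict, words: list) -> float:
    """Product of per-word probabilities (words are assumed independent)."""
    prob = 1.0
    for word in words:
        # Without smoothing, an unseen word has probability 0.
        prob *= model.get(word, 0.0)
    return prob
```

For the text "the cat sat on the mat", P(the) is 2/6 and P(cat) is 1/6, so the sequence "the cat" has probability 2/6 × 1/6. A practical system would add smoothing so that a single unseen word does not force the whole product to zero.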

5. What are the characteristics of relevance feedback?

It shields the user from the details of the query reformulation process.
It breaks down the whole searching task into a sequence of small steps which are
easier to grasp.
It provides a controlled process designed to emphasize some terms and de-emphasize
others.
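
One well-known way to emphasize some terms and de-emphasize others is the Rocchio formula. These notes do not name it, so treat the sketch below as an assumption about how the reweighting might be done. The weights alpha, beta and gamma are illustrative defaults, and vectors are plain dictionaries mapping terms to weights.

```python
def rocchio(query, relevant, non_relevant, alpha=1.0, beta=0.75, gamma=0.15):
    """Return a reweighted query vector (dict of term -> weight).

    query        : dict of term -> weight for the original query
    relevant     : list of document vectors the user judged relevant
    non_relevant : list of document vectors the user judged non-relevant
    """
    terms = set(query)
    for doc in relevant + non_relevant:
        terms |= doc.keys()

    new_query = {}
    for term in terms:
        # Average weight of the term in each judged set (0 if the set is empty).
        pos = sum(d.get(term, 0.0) for d in relevant) / len(relevant) if relevant else 0.0
        neg = sum(d.get(term, 0.0) for d in non_relevant) / len(non_relevant) if non_relevant else 0.0

        weight = alpha * query.get(term, 0.0) + beta * pos - gamma * neg
        if weight > 0:  # negative weights are dropped
            new_query[term] = weight
    return new_query
```

Terms that occur in relevant documents gain weight, terms that occur only in non-relevant documents are removed, and the user never has to edit the query by hand.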

6. What are the assumptions of vector space model?

Assumptions of the vector space model:
The degree of matching can be used to rank-order documents.
This rank-ordering corresponds to how well a document satisfies a user's
information need.

CS6007 -Information Retrieval Page 1

www.studentsfocus.com

7. What are the disadvantages of Boolean model?

It is not simple to translate an information need into a Boolean expression.
Exact matching may lead to retrieval of too few or too many documents.
The retrieved documents are not ranked.
The model does not use term weights.
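
A small sketch makes these limitations concrete. It assumes a toy inverted index built from whitespace-separated terms, and the function names are illustrative. The result of a query is a plain set: a document either matches exactly or is left out, and nothing orders the matches.

```python
def build_index(docs: dict) -> dict:
    """Map each term to the set of document ids that contain it."""
    index = {}
    for doc_id, text in docs.items():
        for term in set(text.lower().split()):
            index.setdefault(term, set()).add(doc_id)
    return index


def boolean_and(index: dict, *terms: str) -> set:
    """Documents that contain every term (exact match, no ranking)."""
    postings = [index.get(term, set()) for term in terms]
    return set.intersection(*postings) if postings else set()


def boolean_or(index: dict, *terms: str) -> set:
    """Documents that contain at least one of the terms."""
    result = set()
    for term in terms:
        result |= index.get(term, set())
    return result
```

With AND, a document that is missing a single query term is dropped however relevant it may be; with OR, every document containing any one term is returned with equal standing.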

8. Define term frequency.

Term frequency: the number of times a term (for example, a query keyword) occurs in a
document.
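
As a minimal illustration, assuming whitespace tokenization and case-insensitive matching (the function name is hypothetical), term frequency can be counted as follows:

```python
from collections import Counter


def term_frequency(term: str, document: str) -> int:
    """Number of times `term` occurs in `document` (case-insensitive)."""
    counts = Counter(document.lower().split())
    return counts[term.lower()]  # Counter returns 0 for absent terms
```

For the document "Retrieval models for information retrieval", the term "retrieval" has a term frequency of 2.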

9. Explain Luhn’s ideas.

Luhn’s basic idea, to use various properties of texts, including statistical ones, was
critical in opening up the handling of input by computers for IR. Automatic input joined
the already automated output.

10. Define stemming.

Conflation algorithms are used in information retrieval systems for matching the
morphological variants of terms for efficient indexing and faster retrieval operations. The
conflation process can be done either manually or automatically. The automatic
conflation operation is also called stemming.
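
The idea of automatic conflation can be shown with a deliberately naive suffix stripper. This is not the Porter stemmer or any other standard algorithm; the suffix list and the minimum stem length of three letters are assumptions chosen for illustration.

```python
# Suffixes are tried in this order; the first one that matches is removed.
SUFFIXES = ("ing", "ed", "es", "s")


def naive_stem(word: str) -> str:
    """Strip one common English suffix, keeping a stem of at least 3 letters."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word
```

The variants "connect", "connects", "connected" and "connecting" all reduce to the stem "connect", so a single index entry can match all four forms.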

11. What is Recall?

Recall is the ratio of the number of relevant documents retrieved to the total
number of relevant documents in the collection.

12. What is precision?

Precision is the ratio of the number of relevant documents retrieved to the total
number of documents retrieved.
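
Both measures can be computed from two sets of document ids: the documents the system retrieved and the documents that are actually relevant. A minimal sketch, with an illustrative function name:

```python
def precision_recall(retrieved: set, relevant: set) -> tuple:
    """Return (precision, recall) for one query."""
    hits = len(retrieved & relevant)  # relevant documents that were retrieved

    # Precision: share of the retrieved documents that are relevant.
    precision = hits / len(retrieved) if retrieved else 0.0
    # Recall: share of all relevant documents that were retrieved.
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall
```

If a system retrieves documents {1, 2, 3, 4} and the relevant documents are {2, 4, 5, 6, 7}, two of the four retrieved documents are relevant, so precision is 2/4 = 0.5, and two of the five relevant documents were found, so recall is 2/5 = 0.4.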

13. Explain Latent Semantic Indexing.

Latent Semantic Indexing is a technique that projects queries and documents into
a space with “latent” semantic dimensions. It is a statistical method for automatic
indexing and retrieval that attempts to solve the major problems of the current
technology. It is intended to uncover the latent semantic structure that is hidden in the
data. It creates a semantic space wherein terms and documents that are associated are
placed near one another.
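
These notes do not give the formula, but in the standard formulation the projection is obtained from a truncated singular value decomposition of the term-document matrix. The symbols below ($A$ for the term-document matrix, $k$ for the number of latent dimensions kept, $q$ for the query's term vector) are notation introduced here, not taken from the notes.

```latex
% Rank-k approximation of the term-document matrix A (terms x documents)
A \approx A_k = U_k \, \Sigma_k \, V_k^{T}

% Folding a query term vector q into the k-dimensional latent space
\hat{q} = \Sigma_k^{-1} \, U_k^{T} \, q
```

Each document is then represented by its row of $V_k$, and the projected query $\hat{q}$ is compared with those document vectors in the latent space, typically using cosine similarity.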

