Lec 1 - Intro To IR
Lec 1 - Intro To IR
Lec 1 - Intro To IR
Retrieval and
Search Engines
2
1
introduction
Course objectives
and IR systems.
❑ Explore advanced topics like web search algorithms and machine learning for IR.
Effectiveness Efficiency
➢ answers to questions
➢ people
DS414 information Retrieval & Search Engines 26
Main components of IR: Queries
❑ Free text to express user’s information need
❑ Same information need can be described by different queries such as:
Scholar search
Author, title, book,…………
DS414 information Retrieval & Search Engines 28
Main components of IR: Relevance
❑At an abstract level, IR is about:
● does item d match query q? … or …
● is item d relevant to query q?
Information need
• Topic about which the user desires to know more
• In the user’s mind!
Query
• What the user conveys to the computer
• Considered one representation of the information need
Relevance
• Document having a value with respect to the information need
DS414
• i.e., a document is relevant if it satisfies the information need
information Retrieval & Search Engines 30
What is the challenge in relevance?
❑ No clear semantics!
●“William Shakespeare”
•Author history’s? list of plays? a play by him?
Performance
Relevance
-Efficient search and indexing
-Effective ranking
Incorporating new data
Evaluation
-Coverage and freshness
-Testing and
measuring Scalability
Information needs -Growing with data and users
-User interaction Adaptability
-Tuning for applications
Specific problems
-e.g. Spam
DS414 information Retrieval & Search Engines 36
Why Information Retrieval:
Information Overload:
“… The world produces between 1 and 2 exabytes(10 bytes)of unique information per year,
18
which is roughly 250 megabytesfor every man, woman, and child on earth. …“ (Lyman &
Hal 03)
❑ Web Search
❑ Information Visualization: Let user understand the results in the best way
❑ ………………………..