Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
0% found this document useful (0 votes)
127 views

Web Mining

This document discusses web mining. It begins by defining web mining as the discovery and analysis of useful information from the world wide web. It then discusses the differences and similarities between data mining and web mining. Specifically, it notes that in web mining, data is collected from servers, clients, and databases, and includes both structured and unstructured data. The document also outlines reasons for web mining, types of web mining including content, structure, and usage mining, applications of web mining, and advantages and disadvantages.

Uploaded by

Shakir Muhammad
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
127 views

Web Mining

This document discusses web mining. It begins by defining web mining as the discovery and analysis of useful information from the world wide web. It then discusses the differences and similarities between data mining and web mining. Specifically, it notes that in web mining, data is collected from servers, clients, and databases, and includes both structured and unstructured data. The document also outlines reasons for web mining, types of web mining including content, structure, and usage mining, applications of web mining, and advantages and disadvantages.

Uploaded by

Shakir Muhammad
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 20

1

WEB MINING

Submitted By

MOHAMED SHAKIR.P
ROLL NO : 36
CONTENTS 2

 Introduction
 Similarity & Difference between Data Mining And Web

mining
 Reasons for web mining

 Personalization.

 Tools Of Web Mining.

 Types of web mining

 Architecture of web mining

 Applications of web mining

 Conclusion

 Reference
INTRODUCTION 3

 Web Mining can be broadly defined as the


discovery and analysis of useful information from the world
wide web.
 The data is collected from the server, client and
database in web mining.
 Web mining is a subset of data mining.
Difference between data mining and web mining 4
 In DM data is stored in data warehouse while
data stored in web server database and web logos in WM.
 DM uses structuered data while WM uses
structured and unstructured data.
Similarity between data mining and web mining 5

 Their common goal is to extracting,discovering,


finding and mining hidden knowledge.
 Their concept is identification of patterns from
the data available in the system/web.
 Both are useful for decision making and
prediction.
 Both follows the same process.
 Both needs input/source data to complete their
process.
Reasons for web mining 6

 While dealing with the web data we face with the


following problems.
 User side problems.
 Information Providers/Server problem.
 They face problems like:

 Low precision
 Getting an irrevalent information and
 Low recall.
Personalization 7

》Web access or content tuned to better fit the desires of each


user.
》With Personalization advertisements to be sent to the
customers based on specific knowledge.
》GOAL : Make the customers purchase something.
Web Mining Tools 8
 Data Miner (Web Content Mining Tool)
 Google Analytics (Web Usage Mining Tool)
 Majestic (Web structure mining tool)
 Scrappy (Web content mining tool)
 Oracle data Mining (Web Usage Mining Tool)
 Bixo (Web structure mining tool)
 Weka (Web Usage Mining tool )
TYPES OF WEB MINING 9

 Web mining can be generally divided into three


categories based on the data to be mined.

Web Mining

Web Content Web Structure Web usage


Mining Mining Mining

Text and
Hyperlink Web log
Multimedia
structurte records
Documents.
Web Content Mining 10

 Web Content Mining the process of collecting useful


data from websites.
 This content includes news, comments, company
information, product catalogs, etc.
 It is extract information or knowledge from collected
sources.
 This content may consists text, image, video, sound
or structured records such as lists and tables.
Web Structure Mining 11

 It is the process of extracting structural information


from the web.
 Hyperlinks: is a structural component that
connects the web page to a different location.
 Document Structure: organization of content from
the web page in tree-structure format based on HTML and
XML tags with in the page
Web Usage Mining 12

 It is the application of data mining techniques to


discover patterns using the Web to better understand and
meet the needs of the user.
 It is classified in to three based on the kind of data
usage.
 Web Server Data:
 Application Server Data
 Application Level Data
Architecture of Web Mining 13
Applications Of Web Mining 14

 web mining has a lot of application in different sectors


or areas.
Advantages of Web Mining 15

 Increases of profits of companies or


organizations by sealing products.
 Protect user system or logging information.
 Improves capacity of service for consumer.
 improving the process of E-learning
environments.
Cont....... 16

-It opens door for Business Intelligence or Knowledge


economy.
-It supports for Decision Making and prediction.
-Mining and Discovering hidden knowledge.
-Used for data analysis.
Disadvantage 17

》URL’s can be tracked to access the data.


》Multiplicity of events and URL’s.
》 Large amount of data remain unused.
》Since data are updatable it is not good to say they are
untrusted .
Conclusion 18

 As web usage and information source in the World


Wide Web are growing continuously it is a good opportunity
having web miner to extract hidden knowledge's from the web.
As a weakness not all but some researchers are replaced Web
mining by Text mining. It is strongly wrong since web mining is
concentrated with too much multimedia information's but text
mining is only for textual data.
REFERENCE 19

》https://blog.eduonix.com/internet-of-things/web-mining-
text-mining-depth-mining-guide/
》https://www.techopedia.com/definition/15634/web-mining
》https://www.kopykitab.com/blog/web-mining-notes/?amp
20

THANK YOU

You might also like