SNSW Unit-1
SNSW Unit-1
SNSW Unit-1
NETWORKS
&
SEMANTIC WEB
CO 1: The Semantic Web
WHAT IS MEANT BY SEMANTIC WEB?
The Semantic Web is the application of advanced knowledge
technologies to the Web and distributed systems in general.
The Semantic Web is the knowledge graph formed by combining
connected, Linked Data with intelligent content to facilitate
machine understanding and processing of content, metadata, and
other information objects at scale.
The Semantic Web leads to smarter, more effortless customer
experiences by giving content the ability to understand and present
itself in the most useful forms matched to a customer’s need.
Semantic standards unlock a crucial evolution of the web towards
intelligence that allows the content we post online to be presented
in a way that can be understood, connected, and remixed by
machines.
SEMANTIC WEB–CONTENT ENGINEERS
Content engineers are creating a more powerful and agile web of
content and data by first parsing and structuring the discrete
elements of content that constitute websites, such as people,
events, ideas, concepts, products.
These elements are then assigned a “label” describing its meaning
in a standardized language.
When such machine-readable descriptions are present, they can be
linked to build a more robust web of data where computers can
find, read, and even reason about a unit of content.
SO WHAT IS THE NEED?
But why would the Web need any extension or fixing? We will
argue that the reason we do not often raise this question is that we
got used to the limitations of accessing the vast information on the
Web.
LIMITATIONS OF THE CURRENT WEB
To understand the laminations of current web, lets see how a
search engine will respond for 4 types of queries.
Who is Frank van Harlem?
Show me photos of Paris
Find new music that I (might) like
Tell me about music players with a capacity of at least 4GB.
LIMITATIONS
WHO IS FRANK VAN HARLEM?
Only some of the results returned by Google for the search keyword
“Harlem” were what we are looking for because
It’s the name of a number of people, including the (unrelated) Frank van
Harmelen and Mark van Harmelen.
Harmelen is also a small town in the Netherlands (one hit) and the place
for a tragic train accident (one hit).
The keyword harmelen (but even the term Frank van Harmelen) is
polysemous.
Search engines are programmed in such a way that the first page
shows a diversity of the most relevant links related to the keyword.
LIMITATIONS
WHO IS FRANK VAN HARLEM?
This allows the user to quickly realize the ambiguity of the query
and to make it more specific.
But making the query specific will solve the issue? May be not.
LIMITATIONS
SHOW ME PHOTOS OF PARIS
Security
DIAGNOSIS FOR THESE LIMITATIONS
The questions above are arbitrary in their specificity but they illustrate a general
problem in accessing the vast amounts of information on the Web.
Namely, in all four cases we deal with a knowledge gap: what the computer
understands and able to work with is much more limited than the knowledge of
the user.
The handicap of the computer is mostly due to technological difficulties in getting
our computers to understand natural language or to “see” the content of images
and other multimedia.
In most cases, however, the knowledge gap is due to the lack of some kind of
background knowledge that only the human possesses.
The background knowledge is often completely missing from the context of the
Web page.
Further, a query may need aggregated knowledge.
THE SEMANTIC SOLUTION
The idea of the Semantic Web is to apply advanced knowledge technologies in
order to fill the knowledge gap between human and machine.
This means providing knowledge in forms that computers can readily process and
reason with.
This knowledge can either be information that is already described in the content
of the Web pages but difficult to extract or additional background knowledge that
can help to answer queries in some way.
More importantly the user profiles plays a very important role in Semantic Web.
Increasing automatic linking among data
Increasing recall and precision in search
Increasing automation in data integration
Increasing automation in the service life cycle
THE
SEMANTIC
SOLUTION
WHO IS FRANK
VAN HARLEM?
THE SEMANTIC SOLUTION
WHO IS FRANK VAN HARLEM?
The situation can be greatly improved by providing personal
information in a semantic format.
A semantic profile to personal web pages that describe the same
information that appears in the text of the web page but in a
machine processable format.
Assuming that all van Harmelens on the Web would provide
similar information, the confusion among them could also be easily
avoided.
THE SEMANTIC SOLUTION
WHO IS FRANK VAN HARLEM?
In particular, the search engine could alert us to the ambiguity of
our question and ask for some extra information about the person
we are looking for.
Also, a better understanding of the user profile, the search query
and the content of the web pages makes it possible to more
accurately select and customize the advertisements appearing
alongside the queries
THE SEMANTIC
SOLUTION
SHOW ME
PHOTOS OF PARIS
FIND NEW MUSIC THAT I (MIGHT)
LIKE
Here also, attach metadata to the images in question.
For example, the online photo sharing site Flickr allows to
annotate images using geographic coordinates.
After uploading some photos users can add keywords to describe
their images (e.g. “Paris, Eiffel-tower”) and drag and drop the
images on a geographic map to indicate the location where the
photo was taken.
FIND NEW MUSIC THAT I (MIGHT)
LIKE
Like in case of images, the same technique of annotating images
with metadata is used in the MultiMediaN research project to
create a joint catalogue of artworks housed in different collections.
THE
SEMANTIC
SOLUTION
FIND NEW
MUSIC THAT I
(MIGHT) LIKE
FIND NEW MUSIC THAT I (MIGHT)
LIKE
In summary, in all the scenarios we have sketched above the the addition
of knowledge in machine-processable languages has the potential to
improve the access to information by clarifying the meaning of the content.
Besides information retrieval, understanding the meaning of information is
also an important step towards aggregating information from multiple
heterogeneous sources
Aggregation is in turn necessary for performing queries, analysis and
reasoning across several information sources as if they would form a single
uniform database.
RESEARCH, DEVELOPMENT AND
STANDARDIZATION
The vision of extending the current human-focused Web with
machine processable descriptions of web content has been first
formulated in 1996 by Tim Berners-Lee, the original inventor of
the Web.
The Semantic Web has been actively promoted since by the World
Wide Web Consortium (also led by Berners-Lee), the organization
that is chiefly responsible for setting technical standards on the
Web.
RESEARCH, DEVELOPMENT AND
STANDARDIZATION
As a result of this initial impetus and the expected benefits of a
more intelligent Web, the Semantic Web has quickly attracted
significant interest from funding agencies on both sides of the
Atlantic, reshaping much of the AI research agenda in a
relatively short period of time.
In particular, the field of Knowledge Representation and
Reasoning took center stage, but outcomes from other fields of
AI have also been put into to use to support the move towards the
Semantic Web: for example, Natural Language Processing and
Information Retrieval have been applied to acquiring knowledge
from the World Wide Web.
RESEARCH, DEVELOPMENT AND
STANDARDIZATION
As the Semantic Web is a relatively new, dynamic field of
investigation, it is difficult to precisely delineate the boundaries of
this network.
Around 600+ researchers till 2004 worked in this area.
As many other modern technologies, the Semantic Web suffers from what
the economist Kevin Kelly calls the fax-effect (Fax Machines value
increased as they are more purchased unlike the traditional goods like land
or metals like gold).
WEB -
TECHNOLOGY ADAPTION: ISSUES
With the Semantic Web: at the beginning the price of technological
investment is very high.
Semantic web with formal knowledge typically captures only the smaller
part of the intended meaning and thus there needs to be a common
grounding in an external reality that is shared by those at separate ends of
the line.
While the research effort behind the Semantic Web is immense and
growing dynamically, Semantic Web technology has yet to see mainstream
use on the Web and in the enterprise.
THE EMERGENCE OF SOCIAL WEB
The web of the 1990s was much like the combination of a phone
book and the yellow pages and despite the connecting power of
hyperlinks it instilled little sense of community among its users.
This passive attitude toward the Web was broken by a series of
changes in usage patterns and technology that are now referred
to as Web 2.0
The first wave of socialization on the Web was due to the
appearance of blogs, wikis and other forms of web-based
communication and collaboration.
Blogs and wikis attracted mass popularity from around 2003
THE EMERGENCE OF SOCIAL WEB
What they have in common is that they both significantly lower the
requirements for adding content to the Web: editing blogs and wikis
did not require any knowledge of HTML any more.
Blogs and wikis allowed individuals and groups to claim their
personal space on the Web and fill it with content at relative ease.
Even more importantly, despite that weblogs have been first assessed
as purely personal publishing (similar to diaries), nowadays the
blogosphere is widely recognized as a densely interconnected social
network through which news, ideas and influences travel rapidly as
bloggers reference and reflect on each other’s postings.
THE EMERGENCE OF SOCIAL WEB
The first online social networks (also referred to as social
networking services) entered the field at the same time as
blogging and wikis started to take off.
In 2003, the first-mover “Friendster” attracted over five million
registered users in the span of a few months, which was
followed by Google and Microsoft starting or announcing similar
services.
Wiki vs Social Networks?
THE EMERGENCE OF SOCIAL WEB
Although the example of Wikipedia, the online encyclopedia is
outstanding, wikis large and small are used by groups of various
sizes as an effective knowledge management tool for keeping
records, describing best practices or jointly developing ideas.
the collective ownership of a Wiki enforces a sense of community
through the necessary discussions over shared content.
THE EMERGENCE OF SOCIAL WEB
Similarly, the significance of instant messaging (ICQ) is also not
just instant communication (phone is instantaneous, and email
is almost instantaneous), but the ability to see who is online, a
transparency that induces a sense of social responsibility.
Although the newly introduced social web sites feature much of
the same content that appear on personal web pages, they
provide a central point of access and bring structure in the
process of personal information sharing and online socialization
THE EMERGENCE OF SOCIAL WEB
The system also makes it possible to visualize and browse the
resulting network in order to discover friends in common,
friends thought to be lost or potential new friendships based on
shared interests.
These vastly popular systems allow users to maintain large
networks of personal and business contacts. Members have soon
discovered, however, that networking is only a means to an end
in the cyberspace as well.
THE EMERGENCE OF SOCIAL WEB
The idea of network based exchange is based on the sociological
observation that social interaction creates similarity and vice
versa, interaction creates
similarity: friends are likely to have acquired or develop similar
interests.
Explicit user profiles make it possible for these systems to
introduce rating mechanism whereby either the users or their
contributions are ranked according to usefulness or
trustworthiness.
THE EMERGENCE OF SOCIAL WEB
The design and implementation of Web applications have also
evolved in order to make the user experience of interacting with
the Web as smooth as possible.
In line with user friendliness, what can be also observed is a
preference for formats, languages and protocols that are easy to
use and develop with, in particular script languages, formats
such as JSON, protocols such as REST.
This is to support rapid development and prototyping
WEB 2.0 + SEMANTIC WEB = WEB
3.0?
Web 2.0 is often contrasted to the Semantic Web, which is a
more conscious and carefully orchestrated effort on the side of
the W3C to trigger a new stage of developments using semantic
technologies.
Web 2.0 mostly effects how users interact with the Web