
Assignment 1

The document discusses the history and development of XML. It describes how XML was created in 1996 by the W3C to address limitations of HTML and complexity of SGML. The goals were to have a markup language with the power of SGML but simplicity of HTML. Key people in the development of XML are also mentioned. Advantages of XML include the ability to define custom tags suited for specific needs and separating data from presentation.

Uploaded by

Muhammad Sufian


XML

Assignment 1

Group members

1. Sufian Bin Sarin 2009967979

History of XML

Three people at IBM, Charles Goldfarb, Ed Mosher and Ray Lorie, invented GML in 1969. GML is a
way of marking up technical documents with structural tags. Goldfarb said he invented the term
“mark-up language” in order to make better use of their initials. GML later became the Standard
Generalized Markup Language (SGML), which was adopted by the ISO in 1986. There is some confusion
about SGML because it is not a markup language in itself but rather a specification for defining
markup languages. The most popular application of SGML is HTML (Hypertext Markup Language), which
defines a specific set of tags suitable for web pages.

However, SGML is pretty darn complex, especially for the everyday uses of the web. Not only that, but
SGML is pretty expensive. Adding SGML capability to a word processor could double or triple the price.
Finally, the commercial browsers made it pretty clear that they did not intend to ever support SGML.

On the other hand, HTML was free, simple and widely supported. HTML was originally designed
at CERN around 1990 to provide a very simple application of SGML which could be used by many people.
The usage of HTML spread very fast after that.

Although HTML became widely used, it has serious defects, and things soon began to
go horribly wrong. The original idea was to separate content from presentation. For example,
the <em> tag in a web page means “emphasize”. It was left up to the user agent how to render that, say
as bold text, or in a different color, or with a different tone of voice in a speech reader. This type of thing
does not please page designers, who want to nail down the exact appearance of a page. Therefore
HTML got extended with things like <font> tags which went right against the initial concept. Another
problem area was that fierce competition between Netscape and Microsoft led to fragmentation of the
standard, which remains a huge problem for web developers. Web pages began to be used for things
that went wildly beyond the original concept, including multimedia, animation, online applications,
ecommerce and more. Browsers also tried to be tolerant of hastily written web pages that committed
crimes like using an opening tag without a corresponding closing tag. Tolerance is normally
commendable, but the resulting lack of discipline became a barrier to programmatic interpretation of
web content, or the use of HTML for structured data.
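
The cost of that tolerance shows up as soon as the same sloppy markup is handed to a strict parser. The following is only a small sketch, using Python's standard xml.etree.ElementTree module; the sample strings and the function name is_well_formed are made up for this illustration and are not part of any browser.

```python
import xml.etree.ElementTree as ET

# Hypothetical snippets: the first leaves <li> unclosed, which tolerant
# browsers accept; the second closes every tag it opens.
SLOPPY = "<ul><li>Client ID: 001</ul>"
STRICT = "<ul><li>Client ID: 001</li></ul>"


def is_well_formed(markup: str) -> bool:
    """Return True if a strict XML parser accepts the markup."""
    try:
        ET.fromstring(markup)
        return True
    except ET.ParseError:
        # Raised for mismatched or unclosed tags, among other errors.
        return False


if __name__ == "__main__":
    print("sloppy:", is_well_formed(SLOPPY))
    print("strict:", is_well_formed(STRICT))
```

A program can only rely on the structure of a document if every document follows the same rules, which is why a strict parser refuses the first snippet instead of guessing what the author meant.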

In a nutshell, HTML is too limited and terminally polluted, while SGML itself is reckoned to be too
complex for mortals to implement.

So in 1996, discussions began which focused on how to define a markup language with the power and
extensibility of SGML but with the simplicity of HTML. The World Wide Web Consortium (W3C) decided
to sponsor an eleven-member group of SGML gurus, including Jon Bosak from Sun, Tim Bray and James
Clark, along with a hundred-and-fifty-member interest group.

The working group had the following goals:

- Internet usability
- SGML compatibility
- General purpose stability
- Formality
- Conciseness
- Legibility
- Ease of authoring
- Minimization of optional features

Michael Sperberg-McQueen compiled the design decisions and the theory behind them on December 4,
1997. The group came up with XML, the Extensible Markup Language. James Clark was the Technical Lead
of the working group, and his contributions include the name "XML" and the empty element syntax.
Initially the specification co-editors were Tim Bray and Michael Sperberg-McQueen. They were later
joined by Jean Paoli as the third co-editor. XML was designed using emails and weekly
teleconferences, and after twenty weeks (July to November 1996) of hard work and major decisions,
the first working draft was released.

Advantages and Disadvantages of XML

Advantages of XML

There are many advantages of XML. The first advantage is that because we are writing our own markup
language, we are not restricted to a limited set of tags defined by proprietary vendors.

Rather than waiting for standards bodies to adopt tag set enhancements (a process which can take quite
some time), or for browser companies to adopt each other's standards (yeah right!), with XML, you can
create your own set of tags at your own pace.

Of course, not only are you free to develop at your own pace, but you are free to develop tools that
meet your needs exactly.

By defining your own tags, you create the markup language in terms of your specific problem set! Rather
than relying on a generic set of tags which suits everyone's needs adequately, XML allows every
person/organization to build their own tag library which suits their needs perfectly.

That is, though the majority of web designers do not need tags to format musical notation, medical
formulas, or architectural specifications, musicians, doctors and architects might.

XML allows each specific industry to develop its own tag sets to meet its unique needs without forcing
everyone's browser to incorporate the functionality of zillions of tag sets, and without forcing the
developers to settle for a generic tag set that is too generic to be useful.

However cool the idea of escaping the limitations of a basic tag set (like HTML) sounds, it isn't even close
to the best thing about XML.

The real power of XML comes from the fact that with XML, not only can you define your own set of tags,
but the rules specified by those tags need not be limited to formatting rules. XML allows you to define all
sorts of tags with all sorts of rules, such as tags representing business rules or tags representing data
description or data relationships.

Consider again the case of the contact list in SCLML. Using standard HTML, a developer might use
something like the following:

<UL>
  <LI>Gunther Birznieks
    <UL>
      <LI>Client ID: 001
      <LI>Company: Bob's Fish Store
      <LI>Email: gunther@bobsfishstore.com
      <LI>Phone: 662-9999
      <LI>Street Address: 1234 4th St.
      <LI>City: New York
      <LI>State: New York
      <LI>Zip: 10024
    </UL>
  <LI>Susan Czigany
    <UL>
      <LI>Client ID: 002
      <LI>Company: Netscape
      <LI>Email: susan@eudora.org
      <LI>Phone: 555-1234
      <LI>Street Address: 9876 Hazen Blvd.
      <LI>City: San Jose
      <LI>State: California
    </UL>
</UL>

While this may be an acceptable way to store and display your data, it is hardly the most efficient or
powerful. As you are probably aware, there are many potential problems associated with marking up
your data using HTML. Three particularly serious problems come to mind:

1. The GUI is embedded in the data. What happens if you decide that you like a table-based
presentation better than a list-based presentation? In order to change to a table-based
presentation, you must recode all your HTML! This could mean editing many pages.
2. Searching for information in the data is tough. How would you get a quick list of only the clients
in California? Certainly, some type of script would be necessary. But how would that script
work? It would probably have to search through the file word for word looking for the string
"California". And even if it found matches, it would have no way of knowing that California
might have a relationship to "New York" - that they are both states. Forget about the
relationships between pieces of data which are crucial to power searching.
3. The data is tied to the logic and language of HTML. What happens if you want to present your
data in a Java applet? Well, unfortunately, your Java applet would have to parse through the
HTML document stripping out tags and reformat the data. Non-HTML processing applications
should not be burdened with extraneous work.
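
The second problem can be made concrete with a rough sketch of the word-for-word search described above. The HTML_LIST string below is a trimmed, hypothetical copy of the contact list, and lines_mentioning is a made-up helper, not part of any library.

```python
# A trimmed, hypothetical copy of the HTML contact list.
HTML_LIST = """<UL>
<LI>Gunther Birznieks
<UL>
<LI>Company: Bob's Fish Store
<LI>City: New York
<LI>State: New York
</UL>
<LI>Susan Czigany
<UL>
<LI>Company: Netscape
<LI>City: San Jose
<LI>State: California
</UL>
</UL>"""


def lines_mentioning(text: str, word: str) -> list[str]:
    """Naive search: return every line that contains the given string."""
    return [line for line in text.splitlines() if word in line]


if __name__ == "__main__":
    # Both the city and the state match; the script cannot tell them apart
    # except by trusting the human-readable label inside the text.
    print(lines_mentioning(HTML_LIST, "New York"))
    # The match says nothing about which client the line belongs to.
    print(lines_mentioning(HTML_LIST, "California"))
```

The search finds strings, not facts: nothing in the markup says that a matching line is a state, or which client it describes.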

With XML, these problems and similar problems are solved. In XML, the same page would look like the
following:

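<CONTACTS> <!-- root element name is an assumption; XML needs a single root -->
  <CLIENT>
    <NAME>Gunther Birznieks</NAME>
    <ID>001</ID>
    <COMPANY>Bob's Fish Store</COMPANY>
    <EMAIL>gunther@bobsfishstore.com</EMAIL>
    <PHONE>662-9999</PHONE>
    <STREET>1234 4th St.</STREET>
    <CITY>New York</CITY>
    <STATE>New York</STATE>
    <ZIP>10024</ZIP>
  </CLIENT>
  <CLIENT>
    <NAME>Susan Czigany</NAME>
    <ID>002</ID>
    <COMPANY>Netscape</COMPANY>
    <EMAIL>susan@eudora.org</EMAIL>
    <PHONE>555-1234</PHONE>
    <STREET>9876 Hazen Blvd.</STREET>
    <CITY>San Jose</CITY>
    <STATE>California</STATE>
  </CLIENT>
</CONTACTS>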

As you can see, custom tags are used to bring meaning to the data being displayed. When stored this
way, data becomes extremely portable because it carries with it its description rather than its display.
Display is "extracted" from the data and as we will see later, incorporated into a "style sheet".

Let's consider some of the benefits.

1. With XML, the GUI is extracted. Thus, changes to display do not require futzing with the data.
Instead, a separate style sheet will specify a table display or a list display.
2. Searching the data is easy and efficient. Search engines can simply parse the description-bearing
tags rather than muddling in the data. Tags provide the search engines with the intelligence they
lack.
3. Complex relationships like trees and inheritance can be communicated.
4. The code is much more legible to a person coming into the environment with no prior
knowledge. In the above example, it is obvious that <ID>002</ID> represents an ID whereas
<LI>002 might not. XML is self-describing.
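
The second benefit can be sketched in a few lines. This is only an illustration using Python's standard xml.etree.ElementTree module; the CONTACTS root element, the shortened sample data and the function name clients_in_state are assumptions made for this example.

```python
import xml.etree.ElementTree as ET

# Shortened sample data; the <CONTACTS> root element is an assumption.
CONTACTS_XML = """<CONTACTS>
  <CLIENT>
    <NAME>Gunther Birznieks</NAME>
    <ID>001</ID>
    <STATE>New York</STATE>
  </CLIENT>
  <CLIENT>
    <NAME>Susan Czigany</NAME>
    <ID>002</ID>
    <STATE>California</STATE>
  </CLIENT>
</CONTACTS>"""


def clients_in_state(xml_text: str, state: str) -> list[str]:
    """Return the names of all clients whose <STATE> equals the given state."""
    root = ET.fromstring(xml_text)
    return [
        client.findtext("NAME")
        for client in root.findall("CLIENT")
        if client.findtext("STATE") == state
    ]


if __name__ == "__main__":
    print(clients_in_state(CONTACTS_XML, "California"))
```

Because the query names the STATE tag, a city called "New York" can never be mistaken for the state, and each match stays attached to the client it belongs to.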

Disadvantages of XML

However awesome XML is, there are some drawbacks which have hindered it from gaining widespread
use since its inception. Let's look at the biggest drawback: the lack of adequate processing applications.

For one, XML requires a processing application. That is, the nice thing about HTML was that you knew
that if you wrote an HTML document, anyone, anywhere in the world, could read your document using
Netscape. Well, with XML documents, that is not yet the case. There are no XML browsers on the
market yet (although the latest version of IE does a pretty good job of incorporating XSL and XML
documents provided HTML is the output).

Thus, XML documents must either be converted into HTML before distribution or be converted to HTML
on the fly by middleware. Barring translation, developers must code their own processing applications.

The most common tactic used now is to write parsing routines in DHTML, Java, or server-side Perl to
parse through an XML document, apply the formatting rules specified by the style sheet, and "convert"
it all to HTML.
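
As a rough idea of what such a conversion routine does, here is a minimal sketch in Python rather than DHTML, Java or Perl. It hard-codes a table layout where a real solution would read the rules from a style sheet; the CONTACTS root element, the sample data and the function name to_html_table are all assumptions made for this example.

```python
import xml.etree.ElementTree as ET
from html import escape

# Shortened sample data; the <CONTACTS> root element is an assumption.
CONTACTS_XML = """<CONTACTS>
  <CLIENT><NAME>Gunther Birznieks</NAME><STATE>New York</STATE></CLIENT>
  <CLIENT><NAME>Susan Czigany</NAME><STATE>California</STATE></CLIENT>
</CONTACTS>"""


def to_html_table(xml_text: str, fields: list[str]) -> str:
    """Render each <CLIENT> as one table row, one cell per requested field."""
    root = ET.fromstring(xml_text)
    rows = []
    for client in root.findall("CLIENT"):
        # A missing field becomes an empty cell instead of raising an error.
        cells = "".join(
            f"<td>{escape(client.findtext(field, default=''))}</td>"
            for field in fields
        )
        rows.append(f"<tr>{cells}</tr>")
    return "<table>" + "".join(rows) + "</table>"


if __name__ == "__main__":
    print(to_html_table(CONTACTS_XML, ["NAME", "STATE"]))
```

Switching to a list-based display would mean changing only this function; the XML data itself stays untouched.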

"While it's true that browser support is limited, IE 5 and Netscape 5 are expected to fully support XML.
Also, W3C's Amaya browser supports it today, as does the JUMBO browser that was created for the
Chemical Markup Language.

XML isn't about display -- it's about structure. This has implications that make the browser question
secondary. So the whole issue of what is to be displayed and by what means is intentionally left to other
applications. You can target the same XML (with different XSL) for different devices (standard web
browser, palm pilot, printer, etc.). You should not get the impression that XML is useless until browsers
support it. This is definitely not true -- we are using it at NASA in ways where no browser plays any role."
- Ken Sall

However, this takes some magic, and the amount of work necessary even to print "hello world" is
sometimes enough to dissuade developers from adopting the technology.

Nevertheless, parsing algorithms and tools continue to improve over time as more and more people see
the long-term benefits of migrating their data to XML. The backend part of XML will continue to become
simpler and simpler. Already, Internet Explorer and Netscape provide a decent number of built-in XML
parsing tools.
