Getting Started With HTML
Getting Started With HTML
This is a short introduction to writing HTML. What is HTML? It is a special kind of text document that is used by Web
browsers to present text and graphics. The text includes markup tags such as <p> to indicate the start of a paragraph, and
</p> to indicate the end of a paragraph. HTML documents are often refered to as "Web pages". The browser retrieves Web
pages from Web servers that thanks to the Internet, can be pretty much anywhere in World.
Many people still write HTML by hand using tools such as NotePad on Windows, or TextEdit on the Mac. This guide
will get you up and running. Even if you don't intend to edit HTML directly and instead plan to use an HTML editor such as
Netscape Composer, or W3C's Amaya, this guide will enable you to understand enough to make better use of such tools and
how to make your HTML documents accessible on a wide range of browsers. Once you are comfortable with the basics of
authoring HTML, you may want to learn how to add a touch of style using CSS, and to go on to try out features covered in
my page on advanced HTML
p.s. a good way to learn is to look at how other people have coded their html pages. To do this, click on the "View" menu
and then on "Source". On some browsers, you instead need to click on the "File" menu and then on "View Source". Try it
with this page to see how I have applied the ideas I explain below. You will find yourself developing a critical eye as many
pages look rather a mess under the hood!
For Mac users, before you can save a file with the ".html" extension, you will need to ensure that your document is
formatted as plain text. For TextEdit, you can set this with the "Format" menu's "Make Plain Text" option.
If you are looking for something else, try the advanced HTML page.
Start with a title
Every HTML document needs a title. Here is what you need to type:
Change the text from "My first HTML document" to suit your own needs. The title text is preceded by the start tag <title>
and ends with the matching end tag </title>. The title should be placed at the beginning of your document.
To try this out, type the above into a text editor and save the file as "test.html", then view the file in a web browser. If the
file extension is ".html" or ".htm" then the browser will recognize it as HTML. Most browsers show the title in the window
caption bar. With just a title, the browser will show a blank page. Don't worry. The next section will show how to add
displayable content.
Each paragraph you write should start with a <p> tag. The </p> is optional, unlike the end tags for elements like headings.
For example:
The src attribute names the image file. The width and height aren't strictly necessary but help to speed the display
of your Web page. Something is still missing! People who can't see the image need a description they can read in its
absence. You can add a short description as follows:
The alt attribute is used to give the short description, in this case "My friend Peter". For complex images, you may
need to also give a longer description. Assuming this has been written in the file "peter.html", you can add one as follows
using the longdesc attribute:
You can create images in a number of ways, for instance with a digital camera, by scanning an image in, or creating
one with a painting or drawing program. Most browsers understand GIF and JPEG image formats, newer browsers also
understand the PNG image format. To avoid long delays while the image is downloaded over the network, you should avoid
using large image files.
Generally speaking, JPEG is best for photographs and other smoothly varying images, while GIF and PNG are good
for graphics art involving flat areas of color, lines and text. All three formats support options for progressive rendering
where a crude version of the image is sent first and progressively refined.
Links are defined with the <a> tag. Lets define a link to the page defined in the file "peter.html" in the same
folder/directory as the HTML file you are editing:
The text between the <a> and the </a> is used as the caption for the link. It is common for the caption to be in blue
underlined text.
If the file you are linking to is in a parent folder/directory, you need to put "../" in front of it, for instance:
If the file you are linking to is in a subdirectory, you need to put the name of the subdirectory followed by a "/" in front of it,
for instance:
Which first looks in the parent directory for another directory called "college", and then at a subdirectory of that named
"friends" for a file called "john.html".
To link to a page on another Web site you need to give the full Web address (commonly called a URL), for instance to link to
www.w3.org you need to write:
You can turn an image into a hypertext link, for example, the following allows you to click on the company logo to get to the
home page:
This uses "/" to refer to the root of the directory tree, i.e. the home page.
<ul>
</ul>
Note that you always need to end the list with the </ul> end tag, but that the </li> is optional and can be left off.
The second kind of list is a numbered list, often called an ordered list. It uses the <ol> and <li> tags. For instance:
<ol>
</ol>
Like bulletted lists, you always need to end the list with the </ol> end tag, but the </li> end tag is optional and can be left
off.
The third and final kind of list is the definition list. This allows you to list terms and their definitions. This kind of list
starts with a <dl> tag and ends with </dl> Each term starts with a <dt> tag and each definition starts with a <dd>. For
instance:
<dl>
<dd>its definition</dd>
<dd>its definition</dd>
<dd>its definition</dd>
</dl>
The end tags </dt> and </dd> are optional and can be left off. Note that lists can be nested, one within another. For
instance:
<ol>
<li>
<ul>
</ul>
</li>
</ol>
You can also make use of paragraphs and headings etc. for longer list items.
If you use your web browser's view source feature (see the View or File menus) you can see the structure of HTML pages.
The document generally starts with a declaration of which version of HTML has been used, and is then followed by an
<html> tag followed by <head> and at the very end by </html>. The <html> ... </html> acts like a container for the
document. The <head> ... </head> contains the title, and information on style sheets and scripts, while the <body> ...
</body> contains the markup with the visible content. Here is a template you can copy and paste into your text editor for
creating your own pages:
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
"http://www.w3.org/TR/html4/loose.dtd">
<html>
<head>
</head>
<body>
</body>
</html>
A convenient way to automatically fix markup errors is to use HTML Tidy which also tidies the markup making it
easier to read and easier to edit. I recommend you regularly run Tidy over any markup you are editing. Tidy is very effective
at cleaning up markup created by authoring tools with sloppy habits. Tidy is available for a wide range of operating systems
from the TidyLib Sourceforge site, and has also been integrated into a variety of HTML editing tools.
W3C's Recommendation for HTML 4.0 is the authoritative specification for HTML. However, it is a technical
specification. For a less technical source of information you may want to purchase one of the many books on HTML, for
example "Raggett on HTML 4", published 1998 by Addison Wesley. XHTML 1.0 is now a W3C Recommendation.
Copyright © 1994-2003 W3C® (MIT, ERCIM, Keio), All Rights Reserved. W3C liability, trademark, document use and software
licensing rules apply. Your interactions with this site are in accordance with our public and Member privacy statements.