Unit 9 - Data and File Format Standards PDF
Unit 9 - Data and File Format Standards PDF
Data and file format standardisation is standards as well as proprietary, are in use.
crucial We will examine a few very popular and
for sharing data among multiple applica- typical formats:
COMP3600 Multimedia Systems 9. File Formats COMP3600 Multimedia Systems 9. File Formats
(199811) Slide 1 (199811) Slide 2
Rich-Text Format positions, left, right, and first indents Character formatting — Formatting informa-
Simple ASCII text is a limited form of document. These fonts are then mapped relative to document margins, and the tion, including bold, italic, underline (con-
data exchange because when text is moved to the fonts available in the receiving spacing between paragraphs. Paragraph tinuous, dotted, or word), strikethrough,
from one application to another, all formatting application for displaying the text. formatting information also includes style shadow text, outline text, and hidden text,
information is lost. sheets. are specified using control characters.
Colour table — lists the colours used in the
However, this remains one of the most General formatting — Formatting informa-
document for highlighting text (i.e., the
popular and convenient form of data ex- tion in this group includes items such as Special characters — Special characters in-
characters are a specific colour, not
change. footnotes, annotation, bookmarks, and clude hyphens non-breaking space back-
black).
Rich-text format (RTF) expands the range pictures. slashes, and so on.
of information that can be pass between Document formatting — information of the
applications. format applying to the entire document,
This assumes that both the source such as document margins, and so on.
application and destination application have
a reasonable common set of features. Section formatting — Section breaks (and
The key information carried in RTF files page breaks) are used to define separa-
are: tion of groups of paragraphs. The for-
matting information specifies the space
Character sets — determine the characters above and below the section.
that are ported in a particular implemen-
tation. Paragraph formatting — The RTF specifi-
cation defines control characters for
Font table — lists all fonts used in the specifying paragraph justification, tab
COMP3600 Multimedia Systems 9. File Formats COMP3600 Multimedia Systems 9. File Formats
(199811) Slide 3 (199811) Slide 4
TIFF File Format What is big-endian and little-endian?
TIFF file format was originally developed it contains one or more image file
In different processor systems, the orders of data bytes stored in memory are different.
by Aldus Corp. in the Eighties. Currently dictionaries (IFD)
This is known as byte order. Two byte orders are used most often:
widely used TIFF specification is version 6.0
released in 1992. It is on of the most widely it contains a number of blocks of image
used digital image file format. It is very data. The number of image data block is Big-endian Little-endian
portable. no maore than the number of IFD.
The most significant byte of a multi-byte The least significant byte of a multi-byte
TIFF file format can support: object is stored in the memory location of object is stored in the memory location of
colour depth from 1-bit to 24-bit IFH lowest address. For example, a 32-bit word lowest address. For example, a 32-bit word
more than one image in a file 0x12345678 will be stored as: 0x12345678 will be stored as:
endian) number of tags in this IFD data type (word) — this tells what type of ASCII 2 8-bit, NULL-terminated string
– the second word indicates the TIFF – this is followed by the tags data this tag contains SHORT 3 16-bit, unsigned number
tag 0 tag ID tains the data value if the size of the data others are optional. The required tags for
data type
is less than or equal to 4 bytes, otherwise bi-level, gray scale, palette-colour and RGB
tag 1
length it is a point to the data block colour images are listed on the next page.
value
...
tag ID
data type
tag n-2
length
tag n-1 pointer to value
data value
pointer to next IFD
COMP3600 Multimedia Systems 9. File Formats COMP3600 Multimedia Systems 9. File Formats
(199811) Slide 7 (199811) Slide 8
The Required TIFF File Tags The TIFF Image Data
Bi-level and Gray Scale YCbCr colour Image data in a TIFF file can be in any RowsPerStrip
Strip 1
Tag ID Tag Name Data location SHORT
COMP3600 Multimedia Systems 9. File Formats COMP3600 Multimedia Systems 9. File Formats
(199811) Slide 9 (199811) Slide 10
.RDI Device independent bitmap file four character string identifying the type of data
ID
.PAL Palette file the chunk, and the next four bytes indicate SIZE
Subchunk
the size of the data in the chunk in little-
RIFF provides a standard way to endian format.
data
organise data in a file The data of the chunk follow the header ID
SIZE Subchunk
Data in a RIFF file is divided into chunks. immediately. The first four bytes of the data
data
The RIFF specification defines three kinds of is known as the form type which identifies the
chunks: type of data, e.g., WAVE, AVI , and so on.
COMP3600 Multimedia Systems 9. File Formats COMP3600 Multimedia Systems 9. File Formats
(199811) Slide 11 (199811) Slide 12
RIFF Waveform Audio File Format RIFF MIDI File Format
The form type of the RIFF chunk is WAVE The new RIFF WAVE format has the The form type of the RIFF chunk is RMID 2 — one or more sequentially indepen-
It has two mandatory subchunks and a following extra subchunks:
It has a single data subchunk which is the dent tracks
list chunk fact — this is a required subchunk in the MIDI data following the standard MIDI file The remaining chunk(s) is(are) track
The two subchunks are: new format. It contains file dependent format chunck(s)
COMP3600 Multimedia Systems 9. File Formats COMP3600 Multimedia Systems 9. File Formats
(199811) Slide 13 (199811) Slide 14