Module 2
Module 2
Module 2
Study Material
Introduction to Multimedia, BCAC502
_____________________________________________________________________________________________
Table of Contents
6 Kerning
5-6
7 Tracking 6
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 1
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
MODULE 2 [TEXT]
Text:
Text is a human-readable sequence of character s and the words they
form that can be encoded into computer-readable formats such as
ASCII . Text is usually distinguished from non-character encoded data,
such as graphic images in the form of bitmap s and program code,
which is sometimes referred to as being in "binary" (but is actually in
its own computer-readable format).
Types of text
Unformatted Text:
Unformatted text is known as plain text is the contents of an ordinary
sequential file readable as textual material without much processing.
Plain text is different from formatted text, where style information is
included, and "binary files" in which some portions must be interpreted
as binary objects (encoded integers, real numbers, images, etc.).
The encoding has traditionally been either ASCII, one of its many
derivatives such as ISO/IEC 646 etc., or sometimes EBCDIC. Unicode-
based encodings such as UTF-8 and UTF-16 are gradually replacing the
older ASCII derivatives limited to 7 or 8 bit codes.
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 2
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
Formatted Text:
Formatted text are those where apart from the actual alphanumeric
characters, other control characters are used to change the appearance of
the characters, e.g. Bold, underlines, italics, varying shapes, sizes and
colors etc. Most text processing software use such formatting options to
change text appearance. To print such a document, the printer should
also be capable of interpreting these control codes so that the
appropriate appearance may be reproduced.
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 3
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
Leading:
When working with a paragraph, or just more than one line of type,
leading is the distance between the baselines in the paragraph. A
baseline is the imaginary guideline that type sits on. The standard
proportion of leading to type size is typically 120%. So if the type size
is 20 point, then the most standard leading would be 24 point. The term
originated in the days of hand-typesetting, when thin strips of lead were
inserted into the forms to increase the vertical distance between lines of
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 4
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
type. The term is still used in modern page layout software such as
QuarkXPress and Adobe InDesign.
In consumer-oriented word processing software, this concept is usually
referred to as "line spacing“ "interline spacing”.
Kerning:
Kerning is an adjustment of space between two specific letters. The goal
of kerning is to create a consistent rhythm of space within a group of
letters and to create an appearance of even spacing between letters.
Fonts have exact amounts of spacing between letter combinations
already built into it, which is called Metric Kerning. Type takes on
Metric Kerning as a default. The goal of kerning is for the type to look
optically correct. There is no mathematical formula, and often times it
just takes practice
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 5
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
Tracking:
Kerning should not be confused with tracking, which refers to uniform
spacing between all of the letters in a group of text. By increasing
tracking in a word, line of text, or paragraph, a designer can create a
more open and airy element.
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 6
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
Unicode Standard:
Unicode Standard: The Unicode Standard is a new universal character
coding scheme for written characters and text. It defines a consistent
way of encoding multilingual text which enables textual data to be
exchanged universally. The Unicode Consortium was incorporated in
1991 to promote the Unicode standard.The UTC(Unicode Technical
Committee) is the working group within the consortium responsible for
creation, maintenance and quality of the Unicode Standard. For ex. The
hindi characters “Pa” is represented by the Unicode sequence 0000 1001
0010 1010(U+092A), how it will be rendered on the screen will be
decided by the font vendor. The first byte represents the language area
while the next byte represents the actual character.
ASCII Character Set:
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 7
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
Font:
In traditional typography, a font is a particular size, weight and style of
a typeface. In Windows platform, font files are stored in a specific
folder called Fonts under the Windows folder. These files are usually
vector format meaning that character descriptions are stored
mathematically. Windows call these font as True Type Fonts.
Text File Formats:
TXT: TXT(Text) is an unformatted text document created by an
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 9
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
Huffman Coding
LZW coding
Shannon-FANO Coding.
When encoding a byte stream, the first 2 = 256 entries of the string
8
table, numbered 0 through 255, are initialized to hold all the possible
one-byte sequences. The other entries will be filled in as the message
byte stream is processed. First, accumulate message bytes as long as the
accumulated sequences appear as some entry in the string table. At
some
point, appending the next byte b to the accumulated sequence S would
create a sequence S + b that’s not in the string table, where + denotes
appending b to S. The encoder then executes the following steps:
1. It transmits the N-bit code for the sequence S.
2. It adds a new entry to the string table for S + b. If the encoder finds
the table full when it goes to add an entry, it reinitializes the table before
the addition is made.
3. it resets S to contain only the byte b.
This process repeats until all the message bytes are consumed, at which
point the encoder makes a final transmission of the N-bit code for the
current sequence S.
Example:
The following table shows the encoder in action on a repeating
sequence of abc. The string: abcabcabcabcabcabcabcabcabcabcabcabc
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 11
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 12
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 13
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
between data (literal values) and back references and LZ coding is used
for back references.
GIF files
The graphic interface format (GIF) uses a compression algorithm based
on the Lempel- Ziv-Welsh (LZW) compression scheme. When
compressing an image the compression program maintains a list of
substrings that have been found previously. When a repeated string is
found, the referred item is replaced with a pointer to the original. Since
images tend to contain many repeated values, the GIF format is a good
compression technique.
UNIX compress/uncompress
The UNIX programs compress and uncompress use adaptive Lempel-
Ziv coding. They are generally better than pack and unpack which are
based on Huffman coding. Where possible, the compress program adds
a ‘.z’ onto a file when compressed. Compressed files can be restored
using the uncompress or zcat programs.
UNIX archive/zoo
The UNIX-based zoo freeware file compression utility employs the
Lempel- Ziv algorithm. It can store and selectively extract multiple
generations of the same file. Data can thus be recovered from damaged
archives by skipping the damaged portion and locating undamaged data.
CODEC:
After an analog quantity has been digitized, it is stored on the disk as a
digital file. Such files are referred to as a raw or uncompressed media
data. To compress the file and reduce its size, it needs to be filtered
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 14
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
Compression:
The process of converting an input data stream (the source stream or the
original raw data) into another data stream (the output, or the
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 15
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 16
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 17
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 18
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
Huffman Coding:
File compression, particularly for multimedia data, is widely used to
reduce Internet traffic and transfer times. Two common compression
formats for images are GIF and JPEG. Both of these encoding formats
throw away information about the images, so the original image can not
be reconstructed exactly from the compressed image. GIF and
JPEG are lossy compression techniques. Lossy compression can be very
effective for multimedia data.
Huffman Coding is an entropy encoding algorithm used for lossless data
compression. The term entropy is a generic term which refers to the
compression techniques that do not take into account the nature of the
information to be compressed. Lossless compression techniques are also
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 19
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 20
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 21
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 22
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 23
Bachelor of Computer Applications, Sem-V
Introduction to Multimedia, BCAC502
Class
2023-24
If at a leaf, output the leaf's byte value and reset position to the root.
Soumya Roy
Assistant Professor,
Department of Computational Science
Brainware University, Kolkata 24