Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Apache Hadoop

Download as pdf or txt
Download as pdf or txt
You are on page 1of 7

Get More Refcardz! Visit refcardz.

com

#117

Getting Started with

CONTENTS INCLUDE:
n

Introduction
Apache Hadoop
Hadoop Quick Reference
Hadoop Quick How-To
Staying Current
Hot Tips and more...

Apache Hadoop
By Eugene Ciurana and Masoud Kalali

INTRODUCTION

This Refcard presents a basic blueprint for applying


MapReduce to solving large-scale, unstructured data
processing problems by showing how to deploy and use an
Apache Hadoop computational cluster. It complements DZone
Refcardz #43 and #103, which provide introductions to highperformance computational scalability and high-volume data
handling techniques, including MapReduce.

www.dzone.com

What Is MapReduce?
MapReduce refers to a framework that runs on a computational
cluster to mine large datasets. The name derives from the
application of map() and reduce() functions repurposed from
functional programming languages.
APACHE HADOOP

 Map applies to all the members of the dataset and


returns a list of results

Apache Hadoop is an open source, Java framework for


implementing reliable and scalable computational networks.
Hadoop includes several subprojects:

 Reduce collates and resolves the results from one or


more mapping operations executed in parallel
Very large datasets are split into large subsets called splits

Getting Started with Apache Hadoop

A
 parallelized operation performed on all splits yields
the same results as if it were executed against the larger
dataset before turning it into splits
Implementations separate business logic from multiprocessing logic
M
 apReduce framework developers focus on process
dispatching, locking, and logic flow

MapReduce
Pig
ZooKeeper
HBase
HDFS
Hive
Chukwa

This Refcard presents how to deploy and use the common


tools, MapReduce, and HDFS for application development
after a brief overview of all of Hadoops components.

A
 pp developers focus on implementing the business logic
without worrying about infrastructure or scalability issues
Implementation patterns
The Map(k1, v1) -> list(k2, v2) function is applied to every
item in the split. It produces a list of (k2, v2) pairs for each call.
The framework groups all the results with the same key
together in a new split.

Get over 90 DZone Refcardz


FREE from Refcardz.com!

The Reduce(k2, list(v2)) -> list(v3) function is applied


to each intermediate results split to produce a collection
of values v3 in the same domain. This collection may have
zero or more values. The desired result consists of all the v3
collections, often aggregated into one result file.

Hot
Tip

MapReduce frameworks produce lists of values.


Users familiar with functional programming
mistakenly expect a single result from the
mapping operations.
DZone, Inc.

www.dzone.com

Getting Started with Apache Hadoop

Hot
Tip

Z
 ooKeeper - a distributed application management tool
for configuration, event synchronization, naming, and
group services used for managing the nodes in a Hadoop
computational network.

http://hadoop.apache.org is the authoritative


reference for all things Hadoop.

Hadoop comprises tools and utilities for data serialization, file


system access, and interprocess communication pertaining
to MapReduce implementations. Single and clustered
configurations are possible. This configuration almost
always includes HDFS because its better optimized for high
throughput MapReduce I/O than general-purpose file systems.

Hot
Tip

Sqoop is a product released by Cloudera, the most


influential Hadoop commercial vendor, under the
Apache 2.0 license. The source code and binary
packages are available at:
http://wiki.github.com/cloudera/sqoop

Components

Hadoop Cluster Building Blocks

Figure 2 shows how the various Hadoop components relate to


one another:

Hadoop clusters may be deployed in three basic configurations:


Mode

Description

Usage

Local
(default)

Multi-threading components, single


JVM

Development,
test, debug

Pseudodistributed

Multiple JVMs, single node

Development,
test, debug

Distributed

All components run in separate


nodes

Staging,
production

Figure 3 shows how the components are deployed for any of


these configurations:

Essentials
H
 DFS - a scalable, high-performance distributed file
system. It stores its data blocks on top of the native file
system. HDFS is designed for consistency; commits arent
considered complete until data is written to at least two
different configurable volumes. HDFS presents a single
view of multiple physical disks or file systems.
M
 apReduce - A Java-based job tracking, node
management, and application container for mappers and
reducers written in Java or in any scripting language that
supports STDIN and STDOUT for job interaction.

Hot
Tip

Hadoop also supports other file systems


likeAmazon Simple Storage Service (S3), Kosmixs
CloudStore, and IBMs General Parallel File System.
These may be cheaper alternatives to hosting data in
the local data center.

Each node in a Hadoop installation runs one or more daemons


executing MapReduce code or HDFS commands. Each
daemons responsibilities in the cluster are:

Frameworks
C
 hukwa - a data collection system for monitoring, displaying,
and analyzing logs from large distributed systems.

N
 ameNode: manages HDFS and communicates with every
DataNode daemon in the cluster
J obTracker: dispatches jobs and assigns splits (splits) to
mappers or reducers as each stage completes

H
 ive - structured data warehousing infrastructure that
provides a mechanisms for storage, data extraction,
transformation, and loading (ETL), and a SQL-like
language for querying and analysis.

T
 askTracker: executes tasks sent by the JobTracker and
reports status

H
 Base - a column-oriented (NoSQL) database designed for
real-time storage, retrieval, and search of very large tables
(billions of rows/millions of columns) running atop HDFS.

D
 ataNode: Manages HDFS content in the node and
updates status to the NameNode
These daemons execute in the three distinct processing
layers of a Hadoop cluster: master (Name Node), slaves (Data
Nodes), and user applications.

Utilities
P
 ig - a set of tools for programmatic flat-file data
analysis that provides a programming language, data
transformation, and parallelized processing.

Name Node (Master)


Manages the file system name space

S
 qoop - a tool for importing and exporting data stored in
relational databases into Hadoop or Hive, and vice versa
using MapReduce tools and standard JDBC drivers.
DZone, Inc.

Keeps track of job execution


Manages the cluster
|

www.dzone.com

Getting Started with Apache Hadoop

Replicates data blocks and keeps them evenly distributed

configuration from the Hadoop site. All the configuration


files are located in the directory $HADOOP_HOME/conf; the
minimum configuration requirements for each file are:

M
 anages lists of files, list of blocks in each file, list of
blocks per node, and file attributes and other meta-data

h
 adoop-env.sh environmental configuration,
JVM configuration, logging, master and slave
configuration files

T
 racks HDFS file creation and deletion operations in an
activity log
Depending on system load, the NameNode and JobTracker
daemons may run on separate computers.

Hot
Tip

c ore-site.xml site wide configuration, such as users,


groups, sockets

Although there can be two or more Name Nodes in


a cluster, Hadoop supports only one Name Node.
Secondary nodes, at the time of writing, only log
what happened in the primary. The Name Node is a
single point of failure that requires manual fail-over!

h
 dfs-site.xml HDFS block size, Name and Data
node directories
m
 apred-site.xml total MapReduce tasks,
JobTracker address
m
 asters, slaves files NameNode, JobTracker,
DataNodes, and TaskTrackers addresses, as appropriate

Data Nodes (Slaves)


Store blocks of data in their local file system

Serve data and meta-data to the job they execute

Test the Installation


Log on to each server without a passphrase:
ssh servername or ssh localhost

Send periodic status reports to the Name Node

Format a new distributed file system:

Store meta-data for each block

hadoop namenode -format

S
 end data blocks to other nodes required by the
Name Node

Start the Hadoop daemons:


start-all.sh

Data nodes execute the DataNode and TaskTracker daemons


described earlier in this section.

Check the logs for errors at $HADOOP_HOME/logs!


Browse the NameNode and JobTracker interfaces at
(localhost is a valid name for local configurations):

User Applications
D
 ispatch mappers and reducers to the Name Node for
execution in the Hadoop cluster

http://namenode.server.name:50070/
http://jobtracker.server.name:50070/

E
 xecute implementation contracts for Java and for
scripting languages mappers and reducers

HADOOP QUICK REFERENCE

Provide application-specific execution parameters


S
 et Hadoop runtime configuration parameters with
semantics that apply to the Name or the Data nodes

The official commands guide is available from:


http://hadoop.apache.org/common/docs/current/commands_
manual.html

A user application may be a stand-alone executable, a script, a


web application, or any combination of these. The application
is required to implement either the Java or the streaming APIs.

Usage
hadoop [--config confdir] [COMMAND]
[GENERIC_OPTIONS] [COMMAND_OPTIONS]

Hadoop Installation

Hot
Tip

Cygwin is a requirement for any Windows systems


running Hadoop install it before continuing if
youre using this OS.

Hadoop can parse generic options and run classes from the
command line. confdir can override the default $HADOOP_HOME/
conf directory.

Generic Options

Required detailed instructions for this section are available at:


http://hadoop.apache.org/comon/docs/current

-conf <config file>

App configuration file

E
 nsure that Java 6 and both ssh and sshd are running in
all nodes

-D <property=value>

Set a property

G
 et the most recent, stable release from
http://hadoop.apache.org/common/releases.html

-fs <local|namenode:port>

Specify a namenode

-jg <local|jobtracker:port>

Specify a job tracker; applies only


to a job

-files <file1, file2, .., fileN>

Files to copy to the cluster (job only)

-libjars <file1, file2, ..,fileN>

.jar files to include in the classpath


(job only)

-archives <file1, file2, .., fileN>

Archives to unbundle on the


computational nodes (job only)

Decide on local, pseudo-distributed or distributed mode


Install the Hadoop distribution on each server
S
 et the HADOOP_HOME environment variable to the directory
where the distribution is installed
Add $HADOOP_HOME/bin to PATH
Follow the instructions for local, pseudo-clustered, or clustered
DZone, Inc.

$HADOOP_HOME/bin/hadoop
|

www.dzone.com

precedes all commands.

Getting Started with Apache Hadoop

User Commands

Hot
Tip

archive -archiveName file.har


/var/data1 /var/data2

Create an archive

distcp
hdfs://node1:8020/dir_a
hdfs://node2:8020/dir_b

Distributed copy from one or more


node/dirs to a target

fsck -locations /var/data1


fsck -move /var/data1
fsck /var/data

File system checks: list block/


location, move corrupted files to /
lost+found, and general check

job -list [all]


job -submit job_file
job -status 42
job -kill 42

Job list, dispatching, status check,


and kill; submitting a job returns
its ID

pipes -conf file


pipes -map File.class
pipes -map M.class -reduce
R.class -files

Use HDFS and MapReduce from a


C++ program

queue -list

List job queues

Wildcard expansion happens in the hosts shell, not


in the HDFS shell! A command issued to a directory
will affect the directory and all the files in it,
inclusive. Remember this to prevent surprises.

To leverage this quick reference, review and understand all the


Hadoop configuration, deployment, and HDFS management
concepts. The complete documentation is available from
http://hadoop.apache.org.
HADOOP APPS QUICK HOW-TO

A Hadoop application is made up of one or more jobs. A job


consists of a configuration file and one or more Java classes or
a set of scripts. Data must already exist in HDFS.
Figure 4 shows the basic building blocks of a Hadoop
application written in Java:

Administrator Commands
balancer -threshold 50

Cluster balancing at percent of


disk capacity

daemonlog -getlevel host name

Fetch http://host/
logLevel?log=name

datanode

Run a new datanode

jobtracker

Run a new job tracker

namenode -format
namenode -regular
namenode -upgrade
namenode -finalize

Format, start a new instance,


upgrade from a previous version
of Hadoop, or remove previous
version's files and complete
upgrade

An application has one or more mappers and reducers and a


configuration container that describes the job, its stages, and
intermediate results. Classes are submitted and monitored
using the tools described in the previous section.

HDFS shell commands apply to local or HDFS file systems and


take the form:

Input Formats and Types

hadoop dfs -command dfs_command_options

K
 eyValueTextInputFormat Each line represents a key
and value delimited by a separator; if the separator is
missing the key and value are empty

HDFS Shell
du /var/data1 hdfs://node/data2

Display cumulative of files and


directories

lsr

Recursive directory list

cat hdfs://node/file

Types a file to stdout

count hdfs://node/data

Count the directories, files,


and bytes in a path

chmod, chgrp, chown

Permissions

expunge

Empty file system trash

get hdfs://node/data2 /var/data2

Recursive copy files to the


local system

put /var/data2 hdfs://node/data2

Recursive copy files to the


target file system

cp, mv, rm

Copy, move, or delete files in


HDFS only

mkdir hdfs://node/path

Recursively create a new directory in the target

setrep -R -w 3

Recursively set a file or directory replication factor (number


of copies of the file)
DZone, Inc.

TextInputFormat The key is the line number, the value


is the text itself for each line
N
 LineInputFormat N sequential lines represent the
value, the offset is the key
M
 ultiFileInputFormat An abstraction that the user
overrides to define the keys and values in terms of
multiple files
S
 equence Input Format Raw format serialized
key/value pairs
D
 BInputFormat JDBC driver fed data input

Output Formats
The output formats have a 1:1 correspondence with the
input formats and types. The complete list is available from:
http://hadoop.apache.org/common/docs/current/api

Word Indexer Job Example


Applications are often required to index massive amounts
of text. This sample application shows how to build a simple
indexer for text files. The input is free-form text such as:
|

www.dzone.com

Getting Started with Apache Hadoop

Job Driver

hamlet@11141\tKING CLAUDIUS\tWe doubt it nothing: heartily


farewell.

public class Driver {


public static void main(String argV) {
Job job = new Job(new Configuration(), test);
job.setMapper(LineIndexMapper.class);
job.setCombiner(LineIndexReducer.class);
job.setReducer(LineIndexReducer.class);

The map function output should be something like:


<KING, hamlet@11141>
<CLAUDIUS, hamlet@11141>
<We, hamlet@11141>
<doubt, hamlet@11141>

job.waitForCompletion(true);
}
} // Driver

The number represents the line in which the text occurred. The
mapper and reducer/combiner implementations in this section
require the documentation from:

This driver is submitted to the Hadoop cluster for processing,


along with the rest of the code in a .jar file. One or more files
must be available in a reachable hdfs://node/path before
submitting the job using the command:

http://hadoop.apache.org/mapreduce/docs/current/api
The Mapper
The basic Java code implementation for the mapper has the
form:

hadoop jar shakespeare_indexer.jar

Using the Streaming API

public class LineIndexMapper


extends MapReduceBase
implements Mapper<LongWritable,
Text, Text, Text> {

The streaming API is intended for users with very limited Java
knowledge and interacts with any code that supports STDIN
and STDOUT streaming. Java is considered the best choice for
heavy duty jobs. Development speed could be a reason for
using the streaming API instead. Some scripted languages may
work as well or better than Java in specific problem domains.
This section shows how to implement the same mapper and
reducer using awk and compares its performance against
Javas.

public void map(LongWritable k,


Text v, OutputCollector<Text, Text> o,
Reporter r) throws IOException { /* implementation here
*/ }
.
.
}

The Mapper

The implementation itself uses standard Java text manipulation


tools; you can use regular expressions, scanners, whatever is
necessary.

Hot
Tip

#!/usr/bin/gawk -f
{
for (n = 2;n <= NF;n++) {
gsub([,:;)(|!\\[\\]\\.\\?]|--,);
if (length($n) > 0) printf(%s\t%s\n, $n, $1);
}
}

There were significant changes to the method


signatures in Hadoop 0.18, 0.20, and 0.21 - check
the documentation to get the exact signature for the
version you use.

The output is mapped with the key, a tab separator, then the
index occurrence.

The Reducer/Combiner
The combiner is an output handler for the mapper to reduce
the total data transferred over the network. It can be thought
of as a reducer on the local node.

The Reducer
#!/usr/bin/gawk -f
{ wordsList[$1] = ($1 in wordsList) ?
sprintf(%s,%s,wordsList[$1], $2) : $2; }

public class LineIndexReducer


extends MapReduceBase
implements Reducer<Text,
Text, Text, Text> {

END {
for (key in wordsList)
printf(%s\t%s\n, key,wordsList[key]);
}

public void reduce(Text k,


Iterator<Text> v,
OutputCollector<Text, Text> o,
Reporter r) throws IOException {
/* implementation */ }
.
.

The output is a list of all entries for a given word, like in the
previous section:
doubt\thamlet@111141,romeoandjuliet@23445,henryv@426917

Awks main advantage is conciseness and raw text processing


power over other scripting languages and Java. Other
languages, like Python and Perl, are supported if they are
installed in the Data Nodes. Its all about balancing speed of
development and deployment vs. speed of execution.

The reducer iterates over keys and values generated in the


previous step adding a line number to each words occurrence
index. The reduction results have the form:

Job Driver

<KING, hamlet@11141; hamlet@42691; lear@31337>

hadoop jar hadoop-streaming.jar -mapper shakemapper.awk


-reducer shakereducer.awk -input hdfs://node/shakespeareworks

A complete index shows the line where each word occurs, and
the file/work where it occurred.
DZone, Inc.

www.dzone.com

Getting Started with Apache Hadoop

Performance Tradeoff

Hot
Tip

The streamed awk invocation vs. Java are


functionally equivalent and the awk version is only
about 5% slower. This may be a good tradeoff if the
scripted version is significantly faster to develop and
is continuously maintained.

STAYING CURRENT

Do you want to know about specific projects and use cases


where NoSQL and data scalability are the hot topics? Join the
scalability newsletter:


ABOUT THE AUTHOR

http://eugeneciurana.com/scalablesystems

RECOMMENDED BOOKS

Eugene Ciurana (http://eugeneciurana.com) is an open-source


evangelist who specializes in the design and implementation of
mission-critical, high-availability large scale systems. Over the
last two years, Eugene designed and built hybrid cloud scalable
systems and computational networks for leading financial,
software, insurance, and healthcare companies in the US, Japan,
Mexico, and Europe.
Publications
Developing with Google App Engine, Apress
DZone Refcard #105: NoSQL and Data Scalability
DZone Refcard #43: Scalability and High Availability
DZone Refcard #38: SOA Patterns
The Tesla Testament: A Thriller, CIMEntertainment

Hadoop: The Definitive Guide helps you harness the


power of your data. Ideal for processing large datasets,
the Apache Hadoop framework is an open source
implementation of the MapReduce algorithm on which
Google built its empire. This comprehensive resource
demonstrates how to use Hadoop to build reliable,
scalable, distributed systems: programmers will find
details for analyzing large datasets, and administrators will
learn how to set up and run Hadoop clusters.

BUY NOW
books.dzone.com/books/hadoop-definitive-guide

Masoud Kalali (http://kalali.me) is a software engineer and


author. He has been working on software development
projects since 1998. He is experienced in a variety of
technologies and platforms.
Masoud is the author of several DZone Refcardz, including:
Using XML in Java, Berkeley DB Java Edition, Java EE Security,
and GlassFish v3. Masoud is also the author of a book on
GlassFish Security published by Packt. He is one of the founding
members of the NetBeans Dream Team and is a GlassFish
community spotlighted developer.

About Cloud Computing


Usage Scenarios
Underlying Concepts

Aldon

Getting Started with

dz. com

also minimizes the need


to make design changes to support
CON
TEN TS
one time events.
INC

RATION

HTML

LUD E:

Basics

ref car

L vs XHT technologies
Automated growthHTM
& scalable

to isolat
space
n
e Work
riptio
itory
a Privat
Desc
ge
These companies
have
webmana
applications
are in long deployed
trol repos
to
n-con
lop softw
ing and making them
Deve
a versio
that adapt
and scale
bases,
rn
les toto large user
ize merg
Patte
it all fi
minim
le
space
Comm
ine to
tos multip
cloud
computing.
e Work knowledgeable in amany
mainl aspects related
Privat
that utilize
lop on
Deve code lines
a system
sitory
of work
within

om

Valid

ML

Vis it

ation one time events, cloud


Having the capability to support
Usef
Open the gradual growth curves
computing platforms alsoulfacilitate
Page
Source
Structure
Tools
faced by web applications.
Key

Network Security
ALM
Solr
Subversion

Elem
ents
al Elem
Large scale growth scenarios involvingents
specialized
and mor equipment
e... away by
(e.g. load balancers and clusters) are all but abstracted

Structur

rdz !

However, the demands and technology used on such servers


has changed substantially in recent years, especially with
the entrance of service providers like Amazon, Google and
es
Microsoft.
e chang

www.dzone.com

INTEG

Upcoming Refcardz

By Daniel Rubio

ge. Colla
Chan

ABOUT CLOUD COMPUTING

Web applications have always been deployed on servers


connected to what is now deemed the cloud.

active
Repo
are to you to clouddcomputing,
units
This Refcard
will introduce
with an
softw
riente
loping
ine
task-o it
e
Mainl
es by
emphasis onDeve
these
so
youComm
can better understand
ines providers,
chang
softwar
codel
e code
ding
Task Level
sourc es as aplatform
what it is a cloudnize
computing
can offer your
trol
ut web
line Policy
of buil
NT
Orga
Code
e witho
it chang
cess
ion con
e name
sourc
T CO
and subm
applications.
it
the pro jects vers
are from
with uniqu
ABOU
softw
um
(CI) is
evel Comm
the build
build
a pro
minim
Label
Task-L
ies to
gration
ed to
activit
blem
the bare
ion
ate all
cies to
t
ous Inte
committ
USAGE SCENARIOS
to a pro
nden
Autom configurat
ymen
Build
depe
al
tion
deplo
t
Continu ry change
manu
Label
d tool
nmen
a solu ineffective
the same
stalle
)
Build
eve
t, use target enviro
(i.e.,
,
ated
ce pre-in
ymen
with
erns
Redu
problem
Autom
ns (i.e.
ory.
d deploEAR) in each
icular
Pay only
consume
tagge
via patt
or
cieswhat you
-patter
s that
reposit
t
nden
For each (e.g. WAR
es
lained ) and anti
the part solution duce
Depe
nmen
al
ge
librari
x
exp
t
Web application deployment
until
ago was similar
t enviro
packa
Minim
be
nden a few years
text
to fi
ns are
to pro
all targe
rity
all depe
used
CI can ticular con
le that
alizeplans
y Integ
i-patter they tend
to
most phone services:
with
but can
late fialloted resources,
Centr
Binar
ts with an
etimes
,
temp
s. Ant
nt
tices,
nmen
on
in a par hes som
geme
t enviro
e a single
proces in the end bad prac lementing
incurred
cost
whether
resources
were
consumed
or
based
thenot.
Creatsuchare
cy Mana
nt targe
es to
rties
nden
approac ed with the cial, but,
chang
Depe
prope
into differe
essarily ed to imp
itting
er
efi
te builds
e comm
not nec compar
late Verifi
associat to be ben
befor
etc.
Cloud
computing asRun
itsremo
known etoday
has changed this.
n
Temp
y are
Build
ually,
Privat
y, contin
appear effects. The results whe
rm a
nt team
dicall
s
The various
resourcesPerfo
consumed
by webperio
applications
(e.g.
opme
ed
d Build
sitory
Build
gration
r to devel
Repo
Stage
adverse unintend
ration on a per-unit
bandwidth,
memory, CPU)
areInteg
tallied
basis
CI serve
e
ous Inte
e Build
rm an
ack from
produc
Privat
feedb computing platforms.
Continu Refcard
on
(starting
from zero) by Perfo
all majorated
cloud
tern.
term
they occur
tion
ld based
d
le this
d utom
he pat
f he
h s

S
INUOU

Browse our collection of over 100 Free Cheat Sheets

Cloud#64Computing

relying on a cloud computing platforms technology.

HTM

L BAS

Core
HTML

ICSplatforms support data


In addition, several cloud computing
HTM
tier technologies that Lexceed
the precedent set by Relational
and XHT
HTML
MLReduce,
Database Systems (RDBMS):
web service APIs,
are
is usedMap
prog
foundati
as thescalethe
etc. Some platforms ram
support large
deployments.
The src
grapRDBMS
on of
hical
and Java s written in
attribute
all
and the
also rece JavaScri user interfac web develop
desc
as the
e in clien
ment.
ive data pt. Server-s
the ima alt attribute ribes whe
output
t-side
likew
ge is
ide lang
describe re the ima
CLOUD COMPUTING
PLATFORMS
ise use mec
hanism. fromAND
unavaila
web
ge file
s alte
Nested
HTM
was
The eme pages and uages like
ble.
rnate
can be
oncCONCEPTS
L and
UNDERLYING
text that
e
tags
use HTM PHP
rging
found,
XHTML
Tags
standard a very loos
Ajax
is disp
can
L
as thei
tech
ely-defi
ization,
layed
cannot be (and freq
need
ned lang r visual eng nologies
if
but as
for stan
overlap
uen
it has
ine. HTM
b></
Amazon EC2:
Industry standard
software
and uag
virtualization
whether
dards
, so <a>< tly are) nest
e with
a> is
has bec become
you cho platform
very little L
fine.
b></
ed insid
moron
Amazons cloud
computing
isome
heavily based
the curr
a></
e impo
e
b> is
more
ent stan ose to writ
rtant,
not lega each othe
app
industry standard
and virtualization
that will softwaredard
e HTM technology.
HTM
r. Tags
l, but
s will
L or XHT arent. Reg the
L VS
<a><
help
and XHT simplify all
XHTM
b></
ML, und ardless
you prov
your
of
ML
L
othe
erst
Virtualization
allows
a
physical
piece
of
hardware
to
be
are actu
HTML
much
anding
web cod ide a solid
of the
has
ally simp r This
utilized by multiple
operating
job adm been arou
function systems.
ing.resourcesfoundati
ler thanallows
Fortuna
commo
on
nd for
alitytohas
irably,
they
(e.g. bandwidth,
memory,
CPU)
be
allocated
exclusively
to
exp
tely
som
that
n
ected.
used
mov
elem
e time
Every
job has
to be, HTML
entsinstances. ed to CSS
Early
. Whi
pag
Brow
individual operating
system
expand
because
HTM
ser
.
common e (HTML
ed far le it has don
or XHT
web dev manufacture L had very
e its
extensio .) All are
ML shar
limited more than
rs add
elopers
esse
As a user of
Amazons
cloud
result
anyb
layo
es c platform, you are
ed
n HT EC2
nti l computing
c
i
d

By An
dy Ha
rris

DZone, Inc.
140 Preston Executive Dr.
Suite 100
Cary, NC 27513

DZone communities deliver over 6 million pages each month to


more than 3.3 million software developers, architects and decision
makers. DZone offers something for everyone, including news,
tutorials, cheatsheets, blogs, feature articles, source code and more.
DZone is a developers dream, says PC Magazine.

ISBN-13: 978-1-934238-75-2
ISBN-10: 1-934238-75-9

50795

888.678.0399
919.678.0300
Refcardz Feedback Welcome
refcardz@dzone.com
Sponsorship Opportunities
sales@dzone.com

$7.95

Mo re

Re fca

Ge t

tion:
tegra ternvasll
us Ind Anti-PPaat
Du
ul M.
nuorn
By
an
ti
s
n
o
C
atte

CONTENTS INCLUDE:

Cost
by...

youTechnologies
Data
t toTier
Comply.
brough
Platform Management and more...
borate.

Ge t Mo
re

E:
LUD
TS INC
gration
ON TEN tinuous Inte Change
ry
Con
at Eve
ns
About
Software i-patter
Build
and Ant
Patterns Control
ment
Version
e...
Manage s and mor
Build
Practice
d
Buil

Get More Refcardz! Visit refcardz.com

#82

9 781934 238752

Copyright 2010 DZone, Inc. All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by means electronic, mechanical,
photocopying, or otherwise, without prior written permission of the publisher.

Version 1.0

You might also like