Cloud Computing Introduction
Cloud Computing Introduction
AND
DATA CENTERS
Cloud Computing - Some terms
• Term cloud is used as a metaphor for internet
• Concept generally incorporates combinations of the following
– Infrastructure as a service (IaaS)
– Platform as a service (PaaS)
– Software as a service(SaaS)
• Not to be confused with
– Grid Computing – a form of distributed computing
• Cluster of loosely coupled, networked computers acting in concert to perform
very large tasks
– Utility Computing – packaging of computing resources such as computing
power, storage, also a metered services
– Autonomic computing – self managed
Grid Computing
• Share Computers and data
• Evolved to harness inexpensive computers in Data center to solve variety of problems
• Harness power of loosely coupled computers to solve a technical or mathematical problem
• Used in commercial applications for drug discovery, economic forecasting, sesimic
analysis and back-office
• Small to big
– Can be confined to a corporation
– Large public collaboration across many companies and networks
• Most grid solutions are built on
– Computer Agents
– Resource Manager
– Scheduler
• Compute grids
– Batch up jobs
– Submit the job to the scheduler, specifying requirements and SLA(specs) required for
running the job
– Scheduler matches specs with available resources and schedules the job to be run
– Farms could be as large as 10K cpus
• Most financial firms has grids like this
• Grids lack automation, agility, simplicity and SLA guarantees
Utility Computing
• More related to cloud computing
– Applications, storage, computing power and network
• Requires cloud like infrastructure
• Pay by the use model
– Similar to electric service at home
• Pay for extra resources when needed
– To handle expected surge in demand
– Unanticipated surges in demand
• Better economics
Cloud computing – History
• Evolved over a period of time
• Roots traced back to Application Service Providers in
the 1990’s
• Parallels to SaaS
• Evolved from Utility computing and is a broader concept
Cloud computing
• Much more broader concept
• Encompasses
– IAAS, PAAS, SAAS
• Dynamic provision of services/resource pools in a co-ordinated fashion
– On demand computing – No waiting period
– Location of resource is irrelevant
• May be relevant from performance perspective, data locality
• Applications run somewhere on the cloud
– Web applications fulfill these for end user
– However, for application developers and IT
• Allows develop, deploy and run applications that can easily grow
capacity (scalability), work fast (performance), and offer good reliability
• Without concern for the nature and location of underlying infrastructure
– Activate, retire resources
– Dynamically update infrastructure elements without affecting the business
Clouds Versus Grids
2009 2010
2008
2007
Cloud Computing - layers
Layers Architecture
What is a Cloud?
Resources
Services Storage Network OS
Why cloud computing
• Data centers are notoriously underutilized, often idle 85% of the
time
– Over provisioning
– Insufficient capacity planning and sizing
– Improper understanding of scalability requirements etc
• including thought leaders from Gartner, Forrester, and IDC—
agree that this new model offers significant advantages for fast-
paced startups, SMBs and enterprises alike.
• Cost effective solutions to key business demands
• Move workloads to improve efficiency
• Server Message Block (SMB, also known as Common Internet File System,
CIFS) operates as an application-layer network protocol mainly used to provide
shared access to files, printers, serial ports, and miscellaneous communications
between nodes on a network. It also provides an authenticated inter-process
communication mechanism.
Most usage of SMB involves computers running Microsoft Windows, where it is
often known as "Microsoft Windows Network".
How do they work?
• Public clouds are opaque
– What applications will work well in a cloud?
• Many of the advantages offered by Public Clouds appear useful for “on
premise” IT
– Self-service provisioning
– Flexible resource allocation
• What extensions or modifications are required to support a wider variety of
services and applications?
– Data assimilation
– Multiplayer gaming
– Mobile devices
Cloud computing -
Characteristics
• Agility – On demand computing infrastructure
– Linearly scalable – challenge
• Reliability and fault tolerance
– Self healing – Hot backups, etc
– SLA driven – Policies on how quickly requests are processed
• Multi-tenancy – Several customers share infrastructure, without compromising
privacy and security of each of the customer’s data
• Service-oriented – compose applications out of loosely coupled services. One
service failure will not disrupt other services. Expose these services as API’s
• Virtualized – decoupled from underlying hardware. Multiple applications can run in
one computer
• Data, Data, Data
– Distributing, partitioning, security, and synchronization
Public, Private and Hybrid
clouds
Public clouds
• Open for use by general public
– Exist beyond firewall, fully hosted and managed by the vendor
– Individuals, corporations and others
– Amazon's Web Services and Google appEngine are examples
• Offers startups and SMB’s quick setup, scalability, flexibility
and automated management. Pay as you go model helps
startups to start small and go big
• Security and compliance?
• Reliability concerns hinder the adoption of cloud
– Amazon S3 services were down for 6 hours
Public Clouds (Now)
• Simple
– Transparent => need to “see” into the cloud
– Scalable => complexity often limits scalability
– Secure => limits adoptability
• Extensible
– New application classes and service classes may require new features
– Clouds are new => need to extend while retaining useful features
• Commodity-based
– Must leverage extensive catalog of open source software offerings
– New, unstable, and unsupported infrastructure design is a barrier to uptake,
experimentation, and adoption
• Easy
– To install => system administration time is expensive
– To maintain => system administration time is really expensive
Microsoft and Amazon face
challenges
• Globus/Nimbus
– Client-side cloud-computing interface to Globus-enabled TeraPort cluster
– Based on the Globus Virtual Workspace Service
– Shares upsides and downsides of Globus-based grid technologies
• Enomalism (now called ECP)
– Start-up company distributing open source
– REST APIs(Representational State Transfer (REST) is a
style of software architecture for distributed hypermedia
systems such as the World Wide Web.
• Reservoir
– European open cloud project
– Many layers of cloud services and tools
– Ambitious and wide-reaching but not yet accessible as an implementation
• Eucalyptus
– Cloud Computing on Clusters
– Amazon Web Services compatible
– Supports kvm and Xen
• Open Nebulous
• Joyent
– Based on Java Script
Open Source Cloud Ecosystem -
Tools
• RightScale
– Startup focused on providing client tools as SaaS hosted in AWS
– Uses the REST interface
• Canonical
– Ubuntu 9.10 (Karmic Koala)
– Includes KVM and Xen Hypervisors
Open Source Cloud Anatomy
• Extensibility
– Simple architecture and open internal APIs
• Client-side interface
– Amazon’s AWS interface and functionality (familiar and testable)
• Networking
– Virtual private network per cloud
– Must function as an overlay => cannot supplant local networking
• Security
– Must be compatible with local security policies
• Packaging, installation, maintenance
– system administration staff is an important constituency for uptake
Open Source Cloud Anatomy ..
cntd
Physical Infrastructure
Cloud Infrastructure ..contd
• Physical Security
Cooling
Cloud Infrastructure ..contd
• Power infrastructure, Network Cabling, Fire
safety
Cloud computing open issues
• Governance
– Security, Privacy and control
– SLA guarantees
– Ownership and control
– Compliance and auditing
• Sarbanes and Oxley Act
• Reliability
– Good servive provider with 99.999% availability
• Cloud independence – Vendor lockin?
– Cloud provider goes out of business
• Data Security
• Cloud lockin and Loss of control
– Plan for moving data along with Cloud provider
• Cost?
• Simplicity?
• Tools
• Controls on sensitive data?
– Out of business
• Big and small
– Scalability and cost outweigh reliability for small
businesses
– Big businesses may have a problem
Battle in the cloud
• Amazon Web Services
• Google App Engine
– Free upto 500 MB,
• Free for small scale applications?
• Universities?
– Pay when you scale
• GoGrid
• .. Some more Hosting companies
- HP, IBM, Oracle (+sun) and Dell
• Cloud computing entails building massive distributed systems
– They use replicated data, shared relational databases, parallelism
– Brewer’s “CAP theorem:” Must sacrifice Consistency for Availability &
Performance
• On-demand self-service
• Broad network access
• Resource pooling
– Location independence
• Rapid elasticity
• Measured service
3 Cloud Service Models
Cloud Infrastructure
IaaS Infrastructure as a Service (IaaS)
Architectures
4 Cloud Deployment Models
• Private cloud
– enterprise owned or leased
• Community cloud
– shared infrastructure for specific community
• Public cloud
– Sold to the public, mega-scale infrastructure
• Hybrid cloud
– composition of two or more clouds
Common Cloud Characteristics
Hybrid Clouds
Deployment
Models Private Community
Public Cloud
Cloud Cloud
On Demand Self-Service
Essential
Broad Network Access Rapid Elasticity
Characteristics
Resource Pooling Measured Service
• Advantages
– Rapid reconstitution of services
– Enables availability
• Provision in multiple data centers / multiple instances
• Challenges
– Impact of compromising the provisioning service
Data Storage Services
• Advantages
– Data fragmentation and dispersal
– Automated replication
– Provision of data zones (e.g., by country)
– Encryption at rest and in transit
– Automated data retention
• Challenges
– Isolation management / data multi-tenancy
– Storage controller
• Single point of failure / compromise?
– Exposure of data to foreign governments
Cloud Processing Infrastructure
• Advantages
– Ability to secure masters and push out secure images
• Challenges
– Application multi-tenancy
– Reliance on hypervisors
– Process isolation / Application sandboxes
Cloud Support Services
• Advantages
– On demand security controls (e.g., authentication, logging,
firewalls…)
• Challenges
– Additional risk when integrated with customer applications
– Needs certification and accreditation as a separate application
– Code updates
Cloud Network and Perimeter Security
• Advantages
– Distributed denial of service protection
– VLAN capabilities
– Perimeter security (IDS, firewall, authentication)
• Challenges
– Virtual zoning with application mobility
Cloud Security Advantages
Part 1
• Data Fragmentation and Dispersal
• Dedicated Security Team
• Greater Investment in Security Infrastructure
• Fault Tolerance and Reliability
• Hypervisor Protection Against Network Attacks
• Possible Reduction of C&A Activities (Access to Pre-
Accredited Clouds)
Cloud Security Advantages
Part 2
• Simplification of Compliance Analysis
• Data Held by Unbiased Party (cloud vendor assertion)
• Low-Cost Disaster Recovery and Data Storage
Solutions
• On-Demand Security Controls
• Real-Time Detection of System Tampering
• Rapid Re-Constitution of Services
• Advanced Honeynet Capabilities
Cloud Security Challenges Part
1
• Data dispersal and international privacy laws
– EU Data Protection Directive and U.S. Safe Harbor program
– Exposure of data to foreign government and data subpoenas
– Data retention issues
• Need for isolation management
• Multi-tenancy
• Logging challenges
• Data ownership issues
• Quality of service guarantees
Cloud Security Challenges
Part 2
• Dependence on secure hypervisors
• Attraction to hackers (high value target)
• Security of virtual OSs in the cloud
• Possibility for massive outages
• Encryption needs for cloud computing
– Encrypting access to the cloud resource control interface
– Encrypting administrative access to OS instances
– Encrypting access to applications
– Encrypting application data at rest
• Public cloud vs internal cloud security
• Lack of public SaaS version control
Additional Issues
• Issues with moving sensitive data to the cloud
– Privacy impact assessments
• Using SLAs to obtain cloud security
– Suggested requirements for cloud SLAs
– Issues with cloud forensics
• Contingency planning and disaster recovery for cloud
implementations
Secure Migration Paths
for Cloud Computing
The ‘Why’ and ‘How’ of Cloud Migration
• Fungible clouds
– (mutual substitution of services)
– Data and customer application portability
– Common interfaces, semantics, programming models
– Federated security services
– Vendors compete on effective implementations
• Enable and foster value add on services
– Advanced technology
– Vendors compete on innovative capabilities
A Model for Standardization
and Proprietary Implementation
• Core features
Standardized Core
Cloud Capabilities
Proposed Result
84
Towards the Creation of
a Roadmap (I)
• Thoughts on standards:
– Usually more service lock-in as you move up the SPI stack (IaaS-
>PaaS->SaaS)
– IaaS is a natural transition point from traditional enterprise
datacenters
• Base service is typically computation, storage, and networking
– The virtual machine is the best focal point for fungibility
– Security and data privacy concerns are the two critical barriers to
adopting cloud computing
Towards the Creation of
a Roadmap (II)
• Result:
– Focus on an overall IaaS standards roadmap as a first major
deliverable
– Research PaaS and SaaS roadmaps as we move forward
– Provide visibility, encourage collaboration in addressing these
standards as soon as possible
– Identify common needs for security and data privacy standards
across IaaS, PaaS, SaaS
A Roadmap for IaaS
• Needed standards
– VM image distribution (e.g., DMTF OVF)
– VM provisioning and control (e.g., EC2 API)
– Inter-cloud VM exchange (e.g., ??)
– Persistent storage (e.g., Azure Storage, S3, EBS, GFS, Atmos)
– VM SLAs (e.g., ??) – machine readable
• uptime, resource guarantees, storage redundancy
– Secure VM configuration (e.g., SCAP)
A Roadmap for PaaS and SaaS
• More difficult due to proprietary nature
• A future focus for NIST
• “Everything we think of
as a computer today is
really just a device that
connects to the big
computer that we are
all collectively
Thoughts on Cloud Computing
98
Foundational Elements
of Cloud Computing
99
Consumer Software Revolution
Web 2.0
104
Service Level Agreements (SLAs)
105
Autonomic System Computing
106
Grid Computing
• Distributed parallel processing across a network
• Key concept: “the ability to negotiate resource-sharing
arrangements”
• Characteristics of grid computing
– Coordinates independent resources
– Uses open standards and interfaces
– Quality of service
– Allows for heterogeneity of computers
– Distribution across large geographical boundaries
– Loose coupling of computers
107
Platform Virtualization
• “[Cloud computing] relies on separating your applications from
the underlying infrastructure” - Steve Herrod, CTO at VMware
• Host operating system provides an abstraction layer for running
virtual guest OSs
• Key is the “hypervisor” or “virtual machine monitor”
– Enables guest OSs to run in isolation of other OSs
– Run multiple types of OSs
• Increases utilization of physical servers
• Enables portability of virtual servers between physical servers
• Increases security of physical host server
108
Web Services
• Web Services
– Self-describing and stateless modules that perform discrete
units of work and are available over the network
– “Web service providers offer APIs that enable developers to
exploit functionality over the Internet, rather than delivering
full-blown applications.” - Infoworld
– Standards based interfaces (WS-I Basic Profile)
• e.g., SOAP, WSDL, WS-Security
• Enabling state: WS-Transaction, Choreography
– Many loosely coupled interacting modules form a single
logical system (e.g., legos)
Service Oriented Architectures
110
Web application frameworks
• Coding frameworks for enabling dynamic web sites
– Streamline web and DB related programming operations (e.g., web services
support)
– Creation of Web 2.0 applications
• Supported by most major software languages
• Example capabilities
– Separation of business logic from the user interface (e.g., Model-view-controller
architecture)
– Authentication, Authorization, and Role Based Access Control (RBAC)
– Unified APIs for SQL DB interactions
– Session management
– URL mapping
• Wikipedia maintains a list of web application frameworks
111
Free and Open Source Software
112
Public Statistics on Cloud
Economics
113
Cost of Traditional Data Centers
114
Energy Conservation and Data Centers
115
Cloud Economics
116
Cloud Economics
117
Cloud Computing Case Studies
and Security Models
118
Google Cloud User:
City of Washington D.C.
• Vivek Kundra, CTO for the District (now OMB e-gov administrator)
• Migrating 38,000 employees to Google Apps
• Replace office software
– Gmail
– Google Docs (word processing and spreadsheets)
– Google video for business
– Google sites (intranet sites and wikis)
• “It's a fundamental change to the way our government operates by moving to the
cloud. Rather than owning the infrastructure, we can save millions.”, Mr. Kundra
119
Are Hybrid Clouds in our Future?
• OpenNebula
• Zimory
• IBM-Juniper Partnership
– "demonstrate how a hybrid cloud could allow enterprises to
seamlessly extend their private clouds to remote servers in a
secure public cloud...“
• VMWare VCloud
– “Federate resources between internal IT and external clouds”
120
vCloud Initiative
• Goal:
– “Federate resources between internal IT and external
clouds”
– Application portability
– Elasticity and scalability, disaster recovery, service level
management
• vServices provide APIs and technologies
121
Microsoft Azure Services
122
Windows Azure Applications,
Storage, and Roles
n m
Worker
Web Role
LB
Role
123
Case Study: Facebook’s Use of Open Source
and Commodity Hardware (8/08)
• Jonathan Heiliger, Facebook's vice president of technical operations
• 80 million users + 250,000 new users per day
• 50,000 transactions per second, 10,000+ servers
• Built on open source software
– Web and App tier: Apache, PHP, AJAX
– Middleware tier: Memcached (Open source caching)
– Data tier: MySQL (Open source DB)
• Thousands of DB instances store data in distributed fashion (avoids
collisions of many users accessing the same DB)
• “We don't need fancy graphics chips and PCI cards," he said. “We need
one USB port and optimized power and airflow. Give me one CPU, a little
memory and one power supply. If it fails, I don't care. We are solving the
redundancy problem in software.”
124
Case Study: IBM-Google Cloud
125
Case Study: Amazon Cloud
• Amazon cloud components
– Elastic Compute Cloud (EC2)
– Simple Storage Service (S3)
– SimpleDB
• New Features
– Availability zones
• Place applications in multiple locations for failovers
– Elastic IP addresses
• Static IP addresses that can be dynamically remapped to point
to different instances (not a DNS change)
126
Amazon Cloud Users:
New York Times and Nasdaq (4/08)
• Both companies used Amazon’s cloud offering
• New York Times
– Didn’t coordinate with Amazon, used a credit card!
– Used EC2 and S3 to convert 15 million scanned news articles to PDF (4TB data)
– Took 100 Linux computers 24 hours (would have taken months on NYT computers
– “It was cheap experimentation, and the learning curve isn't steep.” – Derrick Gottfrid,
Nasdaq
• Nasdaq
– Uses S3 to deliver historic stock and fund information
– Millions of files showing price changes of entities over 10 minute segments
– “The expenses of keeping all that data online [in Nasdaq servers] was too high.” –
Claude Courbois, Nasdaq VP
– Created lightweight Adobe AIR application to let users view data
127
Case Study:
Salesforce.com in Government
• 5,000+ Public Sector and Nonprofit Customers use Salesforce Cloud
Computing Solutions
128
Case Study:
Salesforce.com in Government
• New Jersey Transit Wins InfoWorld 100 Award for its Cloud
Computing Project
– Use Salesforce.com to run their call center, incident management, complaint tracking,
and service portal
– 600% More Inquiries Handled
– 0 New Agents Required
– 36% Improved Response Time
129
Questions?
• Peter Mell
• NIST, Information Technology Laboratory
• Computer Security Division
• Tim Grance
• NIST, Information Technology Laboratory
• Computer Security Division
130
What is this buzzword?
Hype?
The hype
Cluster Computing
Cloud Computing
Grid Computing
Data Centers
• Modular design
• No expensive buildings needed
• Plug and play: plug power, network, cooling vent
Example: Container DC
• Virtualized servers
– Different sizes / instances
• Storage: Simple storage service (S3)
– Elastic block service (EBS)
• Many other services
– Simple DB
– Database service
– Virtual private cloud