IPD - SQL Server
IPD - SQL Server
Design
Microsoft® SQL Server® 2008
and SQL Server 2008 R2
Version 1.1
This documentation is provided to you for informational purposes only, and is provided to you entirely "AS IS".
Your use of the documentation cannot be understood as substituting for customized service and information
that might be developed by Microsoft Corporation for a particular user based upon that user’s particular
environment. To the extent permitted by law, MICROSOFT MAKES NO WARRANTY OF ANY KIND, DISCLAIMS
ALL EXPRESS, IMPLIED AND STATUTORY WARRANTIES, AND ASSUMES NO LIABILITY TO YOU FOR ANY
DAMAGES OF ANY TYPE IN CONNECTION WITH THESE MATERIALS OR ANY INTELLECTUAL PROPERTY IN THEM.
Microsoft may have patents, patent applications, trademarks, or other intellectual property rights covering
subject matter within this documentation. Except as provided in a separate agreement from Microsoft, your use
of this document does not give you any license to these patents, trademarks or other intellectual property.
Information in this document, including URL and other Internet website references, is subject to change
without notice. Unless otherwise noted, the example companies, organizations, products, domain names, email
addresses, logos, people, places and events depicted herein are fictitious.
Microsoft, Excel, Hyper-V, SharePoint, SQL Azure, SQL Server, Windows, and Windows Server are either
registered trademarks or trademarks of Microsoft Corporation in the United States and/or other countries.
The names of actual companies and products mentioned herein may be the trademarks of their respective
owners.
You have no obligation to give Microsoft any suggestions, comments or other feedback ("Feedback") relating to
the documentation. However, if you do provide any Feedback to Microsoft then you provide to Microsoft,
without charge, the right to use, share and commercialize your Feedback in any way and for any purpose. You
also give to third parties, without charge, any patent rights needed for their products, technologies and
services to use or interface with any specific parts of a Microsoft software or service that includes the Feedback.
You will not give Feedback that is subject to a license that requires Microsoft to license its software or
documentation to third parties because we include your Feedback in them.
microsoft.com/solutionaccelerators
Contents
The Planning and Design Series Approach......................................................1
Introduction to the SQL Server 2008 Guide....................................................2
IPD in Microsoft Operations Framework (MOF 4.0)........................................7
SQL Server 2008 Design Process....................................................................8
Step 1: Determine the Project Scope............................................................10
Step 2: Determine Which Roles Will Be Required..........................................11
Step 3: Design the SQL Server Database Engine Infrastructure....................16
Step 4: Design the SQL Server Integration Services Infrastructure..............25
Step 5: Design the SQL Server Analysis Services Infrastructure...................28
Step 6: Design the SQL Server Reporting Services Infrastructure.................32
Step 7: Design the SQL Server Master Data Services Infrastructure..............36
Conclusion...................................................................................................41
Appendix A: Job Aids...................................................................................42
Appendix B: Server Performance Analyzing and Scaling...............................46
Version History............................................................................................57
Acknowledgments........................................................................................58
microsoft.com/solutionaccelerators
The Planning and Design Series
Approach
This guide is one in a series of planning and design guides that clarify and streamline the planning and design
process for Microsoft® infrastructure technologies.
Each guide in the series addresses a unique infrastructure technology or scenario. These guides include the
following topics:
Defining the technical decision flow (flow chart) through the planning process.
Describing the decisions to be made and the commonly available options to consider in making the
decisions.
Relating the decisions and options to the business in terms of cost, complexity, and other characteristics.
Framing the decision in terms of additional questions to the business to ensure a comprehensive
understanding of the appropriate business landscape.
The guides in this series are intended to complement and augment the product documentation.
microsoft.com/solutionaccelerators
Introduction to the SQL Server 2008
Guide
This guide describes the process of planning a SQL Server 2008 or SQL Server 2008 R2 infrastructure. The
guide addresses the following fundamental decisions and tasks:
Identifying which SQL Server features will be needed.
Design decisions related to the components, layout, and connectivity of the SQL Server infrastructure.
Following the instructions in this guide will result in a design that is sized, configured, and appropriately placed
to deliver the stated business benefits, while also considering the performance, capacity, and fault tolerance of
the system.
This guide addresses the scenarios most likely to be encountered by someone designing a SQL Server
infrastructure. Customers should consider having their architecture reviewed by Microsoft Customer Service
and Support prior to implementation as that organization is best able to comment on the supportability of a
particular design.
SQL Server 2008 includes four primary components that can be implemented separately or as a group to form
a scalable data platform:
The Database Engine. The Database Engine is the core service for storing, processing, and securing
data.
Integration Services. Microsoft SQL Server Integration Services (SSIS) is a platform for building data
integration solutions from heterogeneous sources, including packages that provide extract, transform, and
load (ETL) processing for data warehousing.
Analysis Services. Microsoft SQL Server Analysis Services (SSAS) supports OLAP (online analytical
processing) and data mining functionalities. This allows a database administrator to design and create
multidimensional structures that contain data aggregated from other data sources, such as relational
databases.
Reporting Services. Microsoft SQL Server Reporting Services (SSRS) delivers enterprise reporting
functionality for creating reports that gather content from a variety of data sources, publishing the reports in
various formats, and centrally managing their security and subscriptions.
Additionally, SQL Server 2008 R2 introduces a fifth component:
Master Data Services. Master Data Services is a master data management (MDM) application built from
platform components that may be deployed as an application or extended by use of the platform
components to consistently define and manage the critical data entities of an organization. Master Data
Services is an any-domain hub that supports but is not limited to domains such as product, customer,
location, cost center, equipment, employee, and vendor.
For readers who might not be familiar with functionality other than the core database services, Figure 1
illustrates a possible set of relationships between the components that may comprise a SQL Server 2008
infrastructure. Note that the figure does not provide a comprehensive look at all possible data pathways; it is a
single representation for illustrative purposes.
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
6. Master Data Services is a master data management application used to consistently define and manage
the critical data entities of an organization.
Different editions of SQL Server are available to meet the various needs of an organization. Depending on what
the organization wants to accomplish will determine which version of SQL Server will be implemented to meet
those needs. The following provides an overview of the different SQL Server editions available:
Core Editions:
Standard. A full-featured data platform for running small-scale to medium-scale online transaction
processing (OLTP) applications and basic reporting and analytics.
Enterprise. Provides additional features for scalability and performance, high availability (HA), security,
data warehousing, and business intelligence (BI) tools, and supports a maximum of 50 instances.
Premium Editions (new for SQL Server 2008 R2):
Datacenter. Built on the existing Enterprise edition code base, it is designed to support large SQL Server
implementations by providing support for up to 256 logical processors, 25 instances of SQL Server, and up
to 2 terabytes of RAM. (SQL Server 2008 R2 only.)
Parallel Data Warehouse. A data warehouse hardware appliance that provides the ability to scale data
warehousing services from tens to hundreds of terabytes. (SQL Server 2008 R2 only.)
Specialized Editions:
Workgroup. Provides secure, remote synchronization and capabilities for running branch applications. It
includes the core database features of the SQL Server product line and is easy to upgrade to Standard or
Enterprise.
Web. Specifically designed for highly available, Internet-facing, web-serving environments.
Developer. Includes all the features and functionality found in Datacenter; however, it is strictly meant to
be used for development, testing, and demonstration purposes only.
Express. This free edition is for independent software vendors, nonprofessional developers, and hobbyists
building client applications.
Compact. This edition is typically used to develop mobile and small desktop applications. It is free to use
and is commonly redistributed with embedded and mobile independent software vendor applications.
This guide will focus on the Standard and Enterprise editions, although the infrastructure principles discussed
may still apply to other editions.
Note See http://msdn.microsoft.com/en-us/library/cc645993.aspx for more detailed
information on features in specific versions of SQL Server 2008 and
http://msdn.microsoft.com/en-us/library/cc645993(SQL.105).aspx for SQL Server 2008 R2.
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
Assumptions
In writing this guide, the following assumptions have been made:
This guide does not address the business or technical case for choosing a database solution.
The reader is familiar with Microsoft infrastructure solutions. This guide does not attempt to educate the
reader on the features and capabilities of Microsoft products as the product documentation covers that
information.
The information in this guide applies to SQL Server 2008 only. Earlier versions are not included.
The organization’s database administrator (DBA) will provide the business and technical requirements for
the design.
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
Feedback
Please direct questions and comments about this guide to satfdbk@microsoft.com.
We value your feedback on the usefulness of this guide. Please complete the following Solution Accelerators
Satisfaction Survey, available at http://go.microsoft.com/fwlink/?LinkID=132579, and help us build better
guidance and tools.
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
Decision Flow
Figure 3 provides a graphic overview of the steps involved in designing a SQL Server 2008 infrastructure.
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
Guide Audience
This guide addresses considerations that are related to planning and designing the necessary components for a
successful SQL Server infrastructure and is written to address the needs of the following groups:
Organizations that are implementing SQL Server for a specific application.
Organizations that are implementing SQL Server to provide a business intelligence platform for managers
and decision makers to conduct data integration, reporting, and analysis.
Organizations that are implementing SQL Server as a service to be available to multiple business units.
Organizations that are upgrading an existing SQL Server implementation by migrating to new hardware.
Out of Scope
This guide does not address the following:
In-place upgrades, where an older instance of SQL Server is upgraded to SQL Server 2008 on the existing
server; therefore, no infrastructure changes occur.
Side-by-side upgrades, where a new instance of SQL Server 2008 is installed on the same server as an
instance of SQL Server 2005, and then the data is moved, with no infrastructure changes occurring.
Developer, Express, Compact, Evaluation, and Parallel Data Warehouse editions of SQL Server. The
Developer edition has all the capabilities of the Enterprise edition but is not licensed for any form of
production use. The Express edition is an entry-level database for learning and ISV redistribution. The
Compact edition is an embedded database for developing desktop and mobile applications. The
Evaluation edition is used for evaluating SQL Server. The Parallel Data Warehouse solution comes
preassembled from certified hardware vendors.
Reporting Services can integrate with Microsoft Office SharePoint Server 2007 to provide a user interface
(UI) to administer, secure, manage, view, and deliver reports. SharePoint integration will not be covered in
this guide.
Database design as it addresses the structure of the actual database.
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
Step Summary
In Step 1, the project scope was determined in order to align the goals of the project with the business
motivation. The results of this step will be used in the next step to determine which SQL Server roles will be
required to meet the business goals.
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
Figure 4. SSIS architecture, indicating many of the possibilities for inputs and outputs
Record whether Integration Services will be required in the organization in Table A-1 in Appendix A.
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
Figure 6. SSRS can be used to deliver reports in a multitude of formats from a variety of data sources.
Record whether Reporting Services will be required in the organization in Table A-1 in Appendix A.
If SSRS is required, the Database Engine will also be required and should also be recorded in the same table.
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
Figure 7. Master Data Services is an any-domain hub that supports domains such as product, customer,
location, cost center, equipment, employee, and vendor.
Record whether Master Data Services will be required in the organization in Table A-2 in Appendix A.
If Master Data Services is required, the Database Engine will also be required and should also be recorded in
the same table.
Step Summary
In Step 2, it was determined which SQL Server roles will be required to support the business goals. Note that
the Database Engine may be required by the application or by other SQL Server roles in certain circumstances.
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
Although it may not be possible to calculate the required IOPS (IOs per second) in advance, benchmarks for
some workloads may be available from SAN and disk vendors that may provide a baseline for estimating the
required performance and the disk storage configuration required to deliver that performance level. Additionally,
most vendors, including Microsoft, have test labs that can be scheduled to simulate the production database
scenario using the selected disk subsystem and the projected IOPS. If determining the required IOPS manually,
see Appendix B: “Server Performance Analyzing and Scaling” for a detailed discussion on the calculation.
Record the IOPS and throughput required in Table A-3 in Appendix A.
In general, additional smaller drives are better than one big one for database systems. For example, it is better
to have nine or ten 128-GB drives than one terabyte drive.
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
Whether fault tolerance will be implemented at the database level or instance level.
Scaling Out
Scaling out is the process of distributing a SQL Server load across more than one server; thus the decision to
scale out will increase the number of servers required. Multiple scale-out options are available, and this section
will provide a brief overview of them. Scale-out options require design changes that occur at the database level
and are outside the scope of this guide; therefore, it is expected that any decisions relative to scaling out will
have been made in conjunction with the database administrator. Note that a new server, by definition, is a new
instance.
Scale-out options include:
Scalable shared databases. With scalable shared databases, a read-only copy of the database on a
shared storage area network (SAN) drive is shared between several servers to spread out querying or
reporting loads.
Peer-to-peer replication. Instead of several database engines accessing a single copy of the database,
multiple copies of the database are made, with each database engine maintaining its own copy of the
database. Replication provides scale out, while also allowing updates to the data. Replication is used to
propagate changes to all copies of the data.
Linked servers and distributed queries. SQL Server has the ability to query remote database objects as
if they were local. This means that a scaled-out database can look like a single large database to an
application. The major change to the SQL queries is that the table names will now include the name of the
linked server where the data is located.
Distributed partition views. The data in a table is partitioned among tables in several distributed
databases, based on a partitioning key. For example, a customer table might be partitioned on customer
number ranges: 1–10,000 in one database, 10,001–20,000 in a second database, 20,001–30,000 in a
third, and so on. A query that accesses a particular customer number will be executed only on the
database that contains the desired customer number. Queries that don't reference a particular customer
number will need to be run on all the partitions.
Data dependent routing (DDR). With DDR, the data is partitioned among databases, and either the
application or some middleware services route queries to the correct database. DDR is usually combined
with distributed partitioned views to form a complete solution.
Service-Oriented Data Architecture (SODA). For scaling out, SODA data can be split in multiple ways.
The most common way is to divide services among databases and split the data so that each database
contains the data for the services it contains. Some data will be shared across service boundaries. The
preferred way for the service to access data that it doesn't own is to call the service that owns the data, but
this method can be inefficient in some cases. Instead, replication or linked servers can be used to access
data that a service doesn't own but uses extensively. Direct access of data in another service should be
avoided, if possible. The main exception to this is reference data that can be replicated to all services that
require access to the reference data.
If splitting data on service boundaries doesn't provide enough scale out, multiple copies of each service
database can be created, with each copy handling a data partition. This works the same way as DDR, with
a front-end service that looks at the message data and routes the message to the database that owns the
required data.
Service Broker. Service Broker is a queued management system implemented in the database. It can be
useful to scale out data collection and data processing tasks. Consider using Service Broker when an
application can asynchronously handle the data flow.
The decision to use a scale-out solution is influenced by several factors. The table below summarizes the
importance of these factors for each solution. For further information relative to scaling out, see “Scaling Out
SQL Server 2005” at http://msdn.microsoft.com/en-us/library/aa479364.aspx.
Each of the scale-out options will increase the number of SQL Server-based servers in the organization.
Determine whether any of the scale-out options will be implemented and, if so, record the number of servers
required to support the selected option in Table A-3 in Appendix A.
Table 1. Factors Influencing the Selection of Scale-Out Solutions
Scale-out Update frequency Ability to change Data Data coupling
solution application partitionability
Scalable shared Read only Limited or no No requirement No requirement
databases change required
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
Option 3: Replication
Replication uses a publish-subscribe model. Using this replication model allows a primary server, referred to as
the Publisher, to distribute data to one or more secondary servers, or Subscribers. Replication was not originally
built as a fault-tolerant option but can be used as such.
Replication delivers the following benefits:
Allows filtering in the database to provide a subset of data at the secondary databases because it operates
at the database level.
Allows more than one redundant copy of the database.
Allows real-time availability and scalability across multiple databases, supporting partitioned updates.
Allows complete availability of the secondary databases for reporting or other functions, without query
recovery.
Each of the database-level fault-tolerance options will increase the number of servers running SQL Server in
the organization. Determine whether any of the database-level fault-tolerance options will be implemented and,
if so, record the number of servers required to support the selected option in Table A-3 in Appendix A.
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
Little specific architectural information is available to use in determining the hardware for the servers. However,
sizing calculators are available from hardware vendors to help in designing the server hardware. See the
“Additional Reading” section for links to these tools. In addition to using a calculator, considerations for the
hardware are given below.
CPU
It can be challenging to predict the CPU consumption of a database that doesn’t yet exist, and many variables
can affect CPU utilization. The following are general guidelines when selecting new hardware:
64-bit processors. 64-bit hardware can address additional memory space. A 32-bit processor can only
address 4 GB of memory natively. Address Windowing Extensions (AWE) can be used to access more
memory, but there are limits to what the memory addressed above 4 GB can be used for within a 32-bit
processor running SQL Server.
L2 or L3 cache. Larger L2 or L3 processor caches generally provide better performance and often play a
bigger role than raw CPU frequency.
Multi-core and multiple processors. Research data has shown that two CPUs are not as fast as one
CPU that is twice as fast. Because it is not always possible to get a CPU that is twice as fast, doubling the
number of CPUs is preferred, but this does not guarantee twice the performance. However, SQL Server is
efficient at parallelizing operations where the workload can be divided into multiple steps to provide
increased performance on multiple CPUs.
Record the number of CPUs, architecture, and processor speed expected to be required for handling the
required loads in Table A-3 in Appendix A.
Additional information on CPU scaling can be found in Appendix B: “Server Performance Analyzing and
Scaling.”
Memory
By design, SQL Server uses all available memory to cache data and query plans from the disk, so it isn’t
necessary to recreate the data every time a query is run. In the case of a small database—for example, 2 GB
on the disk—it isn’t necessary to have 16 GB of memory. However, the size of the database will be greater than
the amount of available memory in most cases.
For OLTP applications, the product group recommends 2–4 GB of memory per processor core, and for data
warehouses, 4–8 GB of memory per processor core. Determine the amount of memory that will be used and
record this in Table A-3 in Appendix A.
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
Disk Subsystem
Design the disk subsystem to meet the requirements that were determined in Task 1, in the following areas:
Provide the required storage space.
Deliver the performance to support the required number of IOPS and throughput.
Provide the desired level of protection against hardware failures. If a SAN is used, this may include
redundant host bus adapters (HBAs) and switches.
Record the disk subsystem configuration in Table A-3 in Appendix A.
For more information on calculating disk performance, see Appendix B.
Network Adapters
Network adapters and accompanying drivers are available from some hardware manufacturers to provide fault
tolerance or teaming of network ports for increased throughput. Hardware and driver support for the fault
tolerance and teaming technology is provided by the hardware manufacturer. Determine the number of network
adapters that will be used and record this in Table A-1 in Appendix A.
Step Summary
In Step 3, the database requirements were gathered and the database infrastructure was designed from those
requirements. The capacity and performance requirements were determined, and decisions were made whether
to place the database in a new or existing instance and whether to place the instance on a new or existing
server running SQL Server.
In addition, the scale-out and fault-tolerance requirements were used to determine the number of servers
required, and a determination was made as to the placement of each new instance. Finally, the server hardware
was selected.
The following Step 3 outputs were recorded in Table A-3 in Appendix A:
The disk capacity and performance and IOPS and throughput requirements, and the disk subsystem
configuration to support the requirements.
The number of CPUs, architecture, and processor speed.
The amount of memory required.
The number of network adapters.
Whether the database will be placed in a new or existing instance, and on a new or existing SQL Server-
based server.
The number of servers required to support scale-out and fault-tolerance requirements.
The office or geographical location where each new instance will be placed, and whether an instance will
be on a physical server or in a virtualized environment.
Additional Reading
“Estimating the Size of a Database”: http://msdn.microsoft.com/en-us/library/ms187445.aspx
“Determining SQL Server database storage requirements”:
http://searchsqlserver.techtarget.com/tip/0,289483,sid87_gci1289528_mem1,00.html
“Analyzing I/O Characteristics and Sizing Storage Systems for SQL Server Database Applications”:
http://msdn.microsoft.com/en-us/library/ee410782.aspx
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
Step Summary
If it was determined in Step 2 that SQL Server Integration Services (SSIS) is required in the organization, the
SSIS infrastructure was designed in this step. The resource requirements were determined and the location
where the Integration Services packages will be stored was also determined. The number of SSIS servers
required was determined as well where to place the SSIS server and SSIS roles. If it was determined that SSIS
is not required, the reader was advised to go to the next step.
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
Additional Reading
“Top 10 SQL Server Integration Services Best Practices”:
http://sqlcat.com/top10lists/archive/2008/10/01/top-10-sql-server-integration-services-best-practices.aspx
“Typical Uses of Integration Services”: http://technet.microsoft.com/en-us/library/ms137795.aspx
“SQL Server Integration Services” in SQL Server 2008 Books Online: http://technet.microsoft.com/en-
us/library/ms141026.aspx
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
Step Summary
In Step 5, the SSAS infrastructure was designed if it was determined that SSAS is required in the organization.
The resource requirements, the SQL Server version, whether scalable shared databases will be used, and the
scaling needs were determined. Additionally, whether to cluster and where to place the SSAS server and roles
were determined. If it was determined that SSAS is not required, the reader was advised to go to the next step.
The following Step 5 outputs were recorded in Table A-5 in Appendix A:
The resource requirements.
The SQL Server version.
Whether scalable shared databases will be used, the scaling requirements, and whether clustering will be
used to support fault tolerance.
The SSAS placement and whether the SQL Server role will coexist with other SQL Server roles on the
same server or reside on a dedicated server.
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
Additional Reading
“Server Clusters”: http://technet.microsoft.com/en-us/library/cc783714.aspx
“Getting Started with SQL Server 2008 Failover Clustering”: http://msdn.microsoft.com/en-
us/library/ms189134.aspx
“Requirements and Considerations for Analysis Services Deployment”: http://msdn.microsoft.com/en-
us/library/ms175672.aspx
“Scaling out an Analysis Services Solution”: http://technet.microsoft.com/en-us/library/cc280669.aspx
“Technical Notes: Running Microsoft SQL Server 2008 Analysis Services on Windows Server 2008 vs.
Windows Server 2003 and Memory Pre-allocation: Lessons Learned”:
http://sqlcat.com/technicalnotes/archive/2008/07/16/running-microsoft-sql-server-2008-analysis-services-
on-windows-server-2008-vs-windows-server-2003-and-memory-preallocation-lessons-learned.aspx
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
ReportServer Database
Generally, a moderately sized report definition will take about 100–200 KB of disk space, which is larger than
the actual size of the RDL (Report Definition Language) file where the specifications of the report are stored.
This is because Reporting Services will persist both the RDL and the compiled binary representation in order to
improve report execution performance. In the case of history snapshot sizes, this is a function of the amount of
data that is in the report (that is, the larger the report and datasets, the larger the snapshot). A general guideline
is that Reporting Services has an approximate 5:1 compression ratio; therefore, if a report has 10 MB of data,
then the snapshot will be about 2 MB in size. Note that reports that do not perform aggregations will be slightly
larger than reports that do.
ReportServerTempDB Database
The size of this database varies depending on the number of users who are concurrently using the report
servers. Each live report execution generates a report snapshot that is persisted in the ReportServerTempDB
for the duration of the user’s session. Because of the high degree of variability in the number of users who are
accessing the report servers at any one time, the sizing estimate should be based on the maximum number of
concurrent users that access the Reporting Services environment. A general guideline here is that there will
typically be a maximum of 10–20 percent concurrency of the user base.
For example, if there are 1,000 users, then expect up to 200 concurrent users. If most of these users are
accessing a 10-MB report, then there will need to be at least 400 MB of storage provided when taking
compression into account. Of course, when users are no longer querying for reports and/or report sessions are
timing out, the space will be reclaimed and be made available for new users. But at the same time, to size
properly, the calculation must be based on the maximum number of concurrent users.
Record the estimated size of the ReportServer and ReportServerTempDB databases in Table A-6 in Appendix
A.
Memory Requirements
The product group recommends 2–4 GB of memory per processor core.
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
Step Summary
In Step 6, the SSRS infrastructure was designed if it had been determined in Step 2 that SSRS is required in
the organization. The resource requirements and placement of the report server databases were determined.
Additionally, the scaling and fault-tolerance approach were determined, as well as the placement of the SSRS
server.
The following Step 6 outputs were recorded in Table A-6 in Appendix A:
The database requirements (size of the ReportServer and ReportServerTempDB databases), and whether
the databases will be hosted locally or on a remote database server and, if so, which ones.
The number of servers to support SSRS.
Whether to have the SSRS SQL Server-based server coexist with other SQL Server roles on the same
server, and whether to place the SSRS server in a virtualized or physical environment.
Additional Reading
“Reporting Services Performance Optimizations”:
http://sqlcat.com/technicalnotes/archive/2009/01/14/reporting-services-performance-optimizations.aspx
“Report and Snapshot Size Limits”: http://msdn.microsoft.com/en-us/library/ms156002.aspx
“Estimating Report Server Database Requirements”: http://msdn.microsoft.com/en-
us/library/ms159758.aspx
“Planning for Scalability and Performance with Reporting Services”:
http://www.microsoft.com/technet/prodtechnol/sql/2005/pspsqlrs.mspx
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
Figure 11. An example of scale-out for the Master Data Services web farm using IIS’s ARR
A web farm configuration will require a centrally located Master Data Services database that all web servers are
configured to use.
Note The IIS product group recommends NLB as the primary and preferred method for
improving the scalability and availability of web applications running IIS, as opposed to failover
clustering. For more information, see “Configuring IIS 7.0 World Wide Web Publishing Service in
a Microsoft Windows Server 2008 failover cluster” at http://support.microsoft.com/kb/970759/.
Determine the number of servers that will be required to support the fault tolerance and scaling requirements,
and record this information in Table A-7 in Appendix A. Then consult the Infrastructure Planning and Design
Guide for Internet Information Services at http://go.microsoft.com/fwlink/?LinkId=157703 to design the Master
Data Services web server infrastructure.
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
Information on the roles, role services, and features from IIS, .NET Framework, and PowerShell that are
required for MDS are documented in http://msdn.microsoft.com/en-us/library/ee633744(v=SQL.105).aspx.
Record these decisions in Table A-7 in Appendix A.
Step Summary
In Step 7, the Master Data Services infrastructure was designed. The resource requirements were assessed,
and then the fault tolerance and scaling requirements were planned. Based partly on these decisions, the
database was placed either on the same server as the Master Data Services web service or on a remote
database server.
The following Step 7 outputs were recorded in Table A-7 in Appendix A:
The database requirements (size of the Master Data Services database), and whether the databases will
be hosted locally or on a remote database server and, if so, which ones.
The number of servers to support the Master Data Services web service.
Whether to have the Master Data Services web Service server coexist with other SQL Server roles on the
same server, and whether to place the servers in a virtualized or physical environment.
Additional Reading
“Master Data Services Team: Configuring a Master Data Services load-balanced web farm using IIS v7
ARR”: http://sqlblog.com/blogs/mds_team/archive/2010/02/05/configuring-a-mds-load-balanced-web-farm-
using-iis-v7-arr.aspx
“Microsoft Windows Server 2008 System Requirements”: http://msdn.microsoft.com/en-
us/windowsserver/cc196364.aspx
“Windows Server 2008 R2: System Requirements”:
http://www.microsoft.com/windowsserver2008/en/us/system-requirements.aspx
“The What, Why, and How of Master Data Management”: http://msdn.microsoft.com/en-
us/library/bb190163.aspx
“Master Data Services”: http://msdn.microsoft.com/en-us/library/ee633763(SQL.105).aspx
“Organizational Approaches to Master Data Management”: http://msdn.microsoft.com/en-
us/library/ff626496(v=SQL.100).aspx
“SharePoint Workflow Integration with Master Data Services”: http://msdn.microsoft.com/en-
us/library/ff459274(v=SQL.100).aspx
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
Conclusion
This guide has outlined the step-by-step process for planning a SQL Server 2008 infrastructure. In each step,
major decisions relative to the SQL Server infrastructure were determined and explained. The guide has
explained how to record choices of roles needed, server resources, scaling, and fault tolerance, which can then
be made available to the infrastructure planners. In addition, updated features available in SQL Server 2008 R2
were discussed.
Using the information recorded from the steps completed in this guide, the organization can help ensure that
they meet business and technical requirements for a successful SQL Server 2008 deployment.
Additional Reading
SQL Server 2008 Upgrade Technical Reference Guide: http://www.microsoft.com/downloads/details.aspx?
familyid=66D3E6F5-6902-4FDD-AF75-9975AEA5BEA7&displaylang=en
SQL Server Customer Advisory Team page: www.sqlcat.com
Microsoft SQL Server page: http://www.microsoft.com/sqlserver/2008/en/us/default.aspx
Feedback
Please direct questions and comments about this guide to satfdbk@microsoft.com.
We value your feedback on the usefulness of this guide. Please complete the following Solution Accelerators
Satisfaction Survey, available at http://go.microsoft.com/fwlink/?LinkID=132579, and help us build better
guidance and tools.
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
Step 2. Use the table below to record information relative to the required SQL Server roles.
Table A-2. SQL Server Roles Data Gathering
Requirement description Requirement results
Step 3. Use the table below to record information relative to the SQL Server Database
Engine.
Table A-3. SQL Server Database Engine Data Gathering
Requirement description Requirement results
Disk storage requirements
IOPS and throughput requirements
Database in new or existing instance?
Will instance reside on existing server
running SQL Server or new server?
Number of servers required to support
scale-out options, if selected
Number of servers required to support fault-
tolerance option, if selected
Failover clustering protection required?
In which office or geographical location will
instance be placed?
Will instance be on physical server or
virtualized environment?
Number of CPUs required
Required architecture
Required processor speed
Amount of memory required
Disk subsystem configuration
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
Step 4. Use the table below to record information relative to the SSIS infrastructure.
Table A-4. SQL Server SSIS Infrastructure Data Gathering
Requirement description Requirement results
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
Step 5. Use the table below to record information relative to the SSAS infrastructure.
Table A-5. SQL Server SSAS Infrastructure Data Gathering
Requirement description Requirement results
Step 6. Use the table below to record information relative to the SSRS infrastructure.
Table A-6. SQL Server SSRS Infrastructure Data Gathering
Requirement description Requirement results
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
Step 7. Use the table below to record information relative to the Master Data Services
infrastructure.
Table A-7. SQL Server Master Data Services Infrastructure Data Gathering
Requirement description Requirement results
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
Processor Utilization
Over-committing CPU resources can adversely affect all the workloads on the same server, causing significant
performance issues for a larger number of users. Because CPU resource use patterns can vary significantly, no
single metric or unit can quantify total resource requirements. At the highest level, measurements can be taken
to see how the processor is utilized within the entire system and whether threads are being delayed. The
following table lists the performance counters for capturing the overall average processor utilization and the
number of threads waiting in the processor Ready Queue over the measurement interval.
Table B-1. Performance Monitor Counters for Processor Utilization
Object Counter Instance
Processor % Processor Time _Total
System Processor Queue Length N/A
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
On a machine with a single processor, observations where the queue length is greater than 5 is a warning sign
that there is frequently more work available than the processor can handle readily. When this number is greater
than 10, then it is an extremely strong indicator that the processor is at capacity, particularly when coupled with
high CPU utilization.
On systems with multiprocessors, divide the queue length by the number of physical processors. On a
multiprocessor system configured using hard processor affinity (that is, processes are assigned to specific CPU
cores), which have large values for the queue length, can indicate that the configuration is unbalanced.
Although Processor Queue Length typically is not used for capacity planning, it can be used to identify if
systems within the environment are truly capable of running the loads or if additional processors or faster
processors should be purchased for future servers.
Memory Utilization
In order to sufficiently cover memory utilization on a server, both physical and virtual memory usage needs to
be monitored. Low memory conditions can lead to performance problems, such as excessive paging when
physical memory is low, to catastrophic failures, such as widespread application failures or system crashes
when virtual memory becomes exhausted.
Table B-2. Performance Monitor Counters for Memory Utilization
Object Counter Instance
Memory Pages/sec N/A
Memory Available Mbytes N/A
Memory Pool Paged Bytes N/A
Memory Pool Paged Resident Bytes N/A
Memory Transition Faults/sec N/A
Memory Committed Bytes N/A
Process Working Set <Process Name>
Memory\Pages/sec
As physical RAM becomes scarce, the virtual memory manager will free up RAM by transferring the information
in a memory page to a cache on the disk. Excessive paging to disk might consume too much of the available
disk bandwidth and slow down applications attempting to access their files on the same disk or disks. The
Pages/Sec counter tracks the total paging rates, both read and writes, to disk.
For capacity planning, watch for upward trends in this counter. Excessive paging can usually be reduced by
adding additional memory. Add memory when paging operations absorbs more than 20–50 percent of the total
disk I/O bandwidth. Because disk bandwidth is finite, capacity used for paging operations is unavailable for
application-oriented file operations. The Total Disk I/O Bandwidth is a ratio of the Pages/sec and the Physical
Disk\Disk Transfers/sec for all disks in the system:
Memory\Pages/sec ÷ Physical Disk (_Total)\Disk Transfers/sec
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
Memory\Available Mbytes
The Available Mbytes displays the amount of physical memory, in megabytes, that is immediately available for
allocation to a process or for system use. The percent Available Megabytes can be used to indicate if additional
memory is required. Add memory if this value drops consistently below 10 percent. To calculate the percent of
Available Megabytes:
(Memory\Available Mbytes ÷ System RAM in Megabytes) * 100
This is the primary indicator to determine whether the supply of physical memory is ample. When memory is
scarce, Pages/sec is a better indicator of memory contention. Downward trends can indicate a need for
additional memory. Counters are available for Available Bytes and Available Kbytes.
Memory\Transition Faults/sec
The Transition Faults counter returns the number of soft or transition faults during the sampling interval.
Transition faults occur when a trimmed page on the Standby list is re-referenced. The page is then returned to
the working set. It is important to note that the page was never saved to disk.
An upward trend is an indicator that there may be a developing memory shortage. High rates of transition faults
on their own do not indicate a performance problem. However, if the Available Megabytes is at or near its
minimum threshold value, usually 10 percent, then it indicates that the operating system has to work to maintain
a pool of available pages.
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
Memory\Committed Bytes
The Committed Bytes measures the amount of committed virtual memory. Committed memory is allocated
memory that the system must reserve space for either in physical RAM or on the paging file so that this memory
can be addressed by threads running in the associated process.
A memory contention index called the Committed Bytes:RAM can be calculated to aid in capacity planning and
performance. When the ratio is greater than 1, virtual memory exceeds the size of RAM and some memory
management will be necessary. As the Committed Bytes:RAM ratio grows above 1.5, paging to disk will usually
increase up to a limit imposed by the bandwidth of the paging disks. Memory should be added when the ratio
exceeds 1.5. The Committed Bytes:RAM is calculated by:
Memory\Committed Bytes ÷ System RAM in Bytes
Process\Working Set
The Working Set counter shows the amount of memory allocated to a given process that can be addressed
without causing a page fault to occur. To see how much RAM is allocated overall across all process address
spaces, use the _Total instance of Working Set. Watch for upward trends for important applications.
Some server applications, such as IIS, Exchange Server, and SQL Server, manage their own process working
sets. To measure their working sets, application-specific counters need to be used.
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
Disk Performance
Disk performance is typically expressed as a value of the total number of I/O operations that occur per second
(IOPS), measured over some period of time during peak usage.
To determine the number of disks needed to meet a system’s IOPS requirement, the IOPS of a given drive
needs to be determined. To further complicate matters, IOPS are very dependent upon the access pattern. For
example, a drive will typically have a higher IOPS rating for sequential reads than it will for random writes. For
this reason, it is normal to calculate a “worst case” IOPS measurement based on short random input/output
operations.
To calculate the IOPS of a drive, information about the drive needs to be collected. The following table lists the
information required that normally can be found in the manufacturer’s datasheet about the drive.
Table B-3. Information Required for Calculating IOPS
Required Information Description
Spindle Rotational Speed (RPM) The spindle speed expressed as RPM.
Average Read Seek Time (ms) The average seek time for reads.
Average Write Seek Time (ms) The average seek time for writes.
The first step to calculating the IOPS is to determine the Average Seek Time in milliseconds that the drive is
capable of doing. There is an assumption that there will be a 50/50 mix of read and write operations. If the ratio
of read and write operations is modified, then the Average Seek Time will need to be adjusted. For example, if a
drive has an Average Read of 4.7 ms and an Average Write of 5.3 ms, the Average Seek Time for this drive will
be 5.0 ms:
5.0ms = (4.7ms + 5.3ms) ÷ 2
Next, the IO Latency needs to be calculated. This is calculated by adding the Average Seek Time to the
Average Latency. The following table lists the Average Latency of common spindle speeds of drives on the
market today.
Table B-4. Average Latency Based on Spindle Rotational Speeds
Spindle Rotational Speed (rpm) Average Latency (ms)
4,200 7.2
5,400 5.6
7,200 4.2
10,000 3.0
15,000 2.0
The example drive has a spindle speed of 10,000 RPM. So this drive has an IO Latency of 8.0 ms:
8.0 ms = 5.0ms + 3.0ms
A drive can only perform one IO operation at a time. To calculate the number of IOs that can be performed in a
millisecond, 1 is divided by the IO Latency. Finally, this value is converted to IO per Second by multiplying by
1000. The IOPS calculation for the example drive evaluates to 125 IOPS:
125 IOPS = (1 IO ÷ 8.0ms) * 1000 ms/sec
Storage Requirements
To determine storage requirements, additional information is needed to be collected around the system being
considered. Some of this information is easily identified and self explanatory, while other information may be
more difficult to identify due to lack of quantifiable data. All of the following calculations are for a single server;
although if shared storage systems are being considered, then the information can be scaled up based on the
number of systems sharing that storage. The following table shows the information that needs to be collected.
Table B-5. Information Required for Calculating Storage Requirements
Required Information Description Example
# Users Per Server Total number of users hosted by that server. 700
% Concurrent Users The percentage of users connected to the server 80%
during peak times.
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
The information in the table above is fairly self explanatory with the exception of the IOPS per User Required.
This is a measurement of the number of IOPS that a single user will generate on the system. Most venders do
not have this information for their applications unless the application is extremely IO intensive. This information
may be calculated from observation of a running system, but it is fraught with a number of challenges and is
beyond the scope of this guide. For the purpose of this example, this guide will use the value used with
Exchange Server, which is 0.5 IOPS per user.
Based on the information in Table B-5, there are a number of derived values that need to be calculated. The
following table lists these values.
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
RAID 0+1 and RAID 5 are the most common drive configurations for storing data in a redundant manner on a
system. However, these two RAID systems have different IOPS calculations due to how they operate.
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
RAID 5 Calculations
To calculate the number of drives necessary to meet the storage requirements with RAID 5, the Total Storage
Requirements needs to be multiplied by 1.2 to adjust for parity storage requirements. This value is then divided
by the Drive Actual Size and then rounded up. For this example, 5 drives are needed for RAID 5 to meet the
storage requirements:
5 = ROUNDUP((540*1.2)÷132)
To calculate the number of drives required to meet the performance requirements with RAID 5, multiply the
Number of Write IOPS by 4 and add the Number of Read IOPS. Divide this total by the Calculated Drive IOPS
and round the result up. For this example, 6 drives are needed for RAID 5 to meet the performance
requirements:
6 = ROUNDUP(((140*4)+140)÷125)
Although RAID 5 can meet the storage requirements with just 5 disks, 6 disks are required to meet the capacity
requirements.
The number derived from the above formula is only an estimate. This number must be adjusted down if there is
a write-mostly application because a RAID 5 configuration is affected by heavier write penalties.
Storage Monitoring
IOPS are used to help characterize the performance requirements of the system. However, once the system is
up, additional performance monitoring can be utilized to determine if the disk subsystem is slowing the system
down.
Table B-7. Performance Monitor Counters for Disk Performance
Object Counter Instance
Physical Disk % Idle Time <All Instances>
Physical Disk Disk Transfers/sec <All Instances>
Physical Disk Avg. Disk secs/Transfers <All Instances>
Physical Disk Split IO/sec <All Instances>
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
Disk Throughput
The table below lays out the primary counters for disk throughput. They measure how many bytes per second
are being read from or written to a given disk. Although the “Instance” column reads “All Instances,” it is
recommended that this be run on each individual instance so as to identify throughput performance on each
disk and then tie that to the database files on those disks.
Table B-8. Performance Monitor Counters for Disk Throughput
Object Counter Instance
Physical Disk Disk Bytes/sec <All Instances>
Physical Disk Disk Read Bytes/sec <All Instances>
Physical Disk Disk Write Bytes/sec <All Instances>
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
The following table does not deal directly with throughput, but it can be used to correlate disk throughput to disk
throughput capacity. For example, if a disk is reading 100 MB/sec for the “Disk Read Bytes/sec” counter and the
“% Disk Read Time” is at 50 percent, it is estimated that the disk will be able to handle around 200 MB/sec for
read throughput.
Table B-9. Additional Performance Monitor Counters for Disk Throughput
Object Counter Instance
Physical Disk % Disk Time <All Instances>
Physical Disk % Disk Read Time <All Instances>
Physical Disk % Disk Write Time <All Instances>
Network Performance
Most workloads require access to production networks to ensure communication with other applications and
services and to communicate with users. Network requirements include elements such as throughput—that is,
the total amount of traffic that passes a given point on a network connection per unit of time.
Other network requirements include the presence of multiple network connections.
Table B-10. Performance Monitor Counters for Network Performance
Object Counter Instance
Network Interface Bytes Total/sec (Specific network adapters)
Network Interface Current Bandwidth (Specific network adapters)
Ipv4 & Ipv6 Datagrams/sec N/A
TCPv4 & TCPv6 Connections Established N/A
TCPv4 & TCPv6 Segments Received/sec N/A
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
calculated. This can be used to forecast future load as the number of users grows. The following formula can be
used to calculate the average number of segments received per connection:
TCPvn\Segments Received/sec ÷ TCPvn\Connections Established/sec
microsoft.com/solutionaccelerators
Microsoft SQL Server 2008 and SQL Server 2008 R2
Version History
Version Description Date
1.1 Added What’s New in SQL Server 2008 R2 section. May 2010
Added Step 2, Task 5, “Determine If SQL Server Master Data
Services Is Required.”
Added Step 7, “Design the SQL Server Master Data Services
Infrastructure.”
Updated decision flow and step listings accordingly.
1.0 Original release. February 2009
microsoft.com/solutionaccelerators
Infrastructure Planning and Design
Acknowledgments
The Solution Accelerators–Management and Infrastructure (SA-MI) team acknowledges and thanks the people
who produced the Infrastructure Planning and Design Guide for Microsoft SQL Server 2008 and SQL Server
2008 R2. The following people were either directly responsible for or made a substantial contribution to the
writing, development, and testing of this guide.
Contributors:
Steve Abraham – Xtreme Consulting
Jude Chosnyk – GrandMasters
Michael Kaczmarek – Microsoft
Greg Low – GrandMasters
Robin Maher – Microsoft
Fergus Stewart – Microsoft
Melissa Stowe – Microsoft
Reviewers:
Kevin Cox – Microsoft
Robert McPhail
Reagan Templin – Microsoft
Editors:
Laurie Dunham – Microsoft
Pat Rytkonen – Volt
microsoft.com/solutionaccelerators