LLD Template 7april05
Corporate Headquarters Cisco Systems, Inc. 170 West Tasman Drive San Jose, CA 95134-1706 USA http://www.cisco.com Tel: 408 526-4000 800 553-NETS (6387) Fax: 408 526-4100
THE SPECIFICATIONS AND INFORMATION REGARDING THE PRODUCTS IN THIS MANUAL ARE SUBJECT TO CHANGE WITHOUT NOTICE. ALL STATEMENTS, INFORMATION, AND RECOMMENDATIONS IN THIS MANUAL ARE BELIEVED TO BE ACCURATE BUT ARE PRESENTED WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED. USERS MUST TAKE FULL RESPONSIBILITY FOR THEIR APPLICATION OF ANY PRODUCTS.
THE SOFTWARE LICENSE AND LIMITED WARRANTY FOR THE ACCOMPANYING PRODUCT ARE SET FORTH IN THE INFORMATION PACKET THAT SHIPPED WITH THE PRODUCT AND ARE INCORPORATED HEREIN BY THIS REFERENCE. IF YOU ARE UNABLE TO LOCATE THE SOFTWARE LICENSE OR LIMITED WARRANTY, CONTACT YOUR CISCO REPRESENTATIVE FOR A COPY.
The following information is for FCC compliance of Class A devices: This equipment has been tested and found to comply with the limits for a Class A digital device, pursuant to part 15 of the FCC rules. These limits are designed to provide reasonable protection against harmful interference when the equipment is operated in a commercial environment. This equipment generates, uses, and can radiate radio-frequency energy and, if not installed and used in accordance with the instruction manual, may cause harmful interference to radio communications. Operation of this equipment in a residential area is likely to cause harmful interference, in which case users will be required to correct the interference at their own expense.
The following information is for FCC compliance of Class B devices: The equipment described in this manual generates and may radiate radio-frequency energy. If it is not installed in accordance with Cisco's installation instructions, it may cause interference with radio and television reception. This equipment has been tested and found to comply with the limits for a Class B digital device in accordance with the specifications in part 15 of the FCC rules. These specifications are designed to provide reasonable protection against such interference in a residential installation. However, there is no guarantee that interference will not occur in a particular installation.
You can determine whether your equipment is causing interference by turning it off. If the interference stops, it was probably caused by the Cisco equipment or one of its peripheral devices. If the equipment causes interference to radio or television reception, try to correct the interference by using one or more of the following measures: Turn the television or radio antenna until the interference stops. Move the equipment to one side or the other of the television or radio. Move the equipment farther away from the television or radio. Plug the equipment into an outlet that is on a different circuit from the television or radio. (That is, make certain the equipment and the television or radio are on circuits controlled by different circuit breakers or fuses.)
Modifications to this product not authorized by Cisco Systems, Inc. could void the FCC approval and negate your authority to operate the product.
The following third-party software may be included with your product and will be subject to the software license agreement: CiscoWorks software and documentation are based in part on HP OpenView under license from the Hewlett-Packard Company. HP OpenView is a trademark of the Hewlett-Packard Company. Copyright 1992, 1993 Hewlett-Packard Company.
The Cisco implementation of TCP header compression is an adaptation of a program developed by the University of California, Berkeley (UCB) as part of UCB's public domain version of the UNIX operating system. All rights reserved. Copyright 1981, Regents of the University of California.
Network Time Protocol (NTP). Copyright 1992, David L. Mills. The University of Delaware makes no representations about the suitability of this software for any purpose.
Point-to-Point Protocol. Copyright 1989, Carnegie-Mellon University. All rights reserved. The name of the University may not be used to endorse or promote products derived from this software without specific prior written permission.
The Cisco implementation of TN3270 is an adaptation of the TN3270, curses, and termcap programs developed by the University of California, Berkeley (UCB) as part of UCB's public domain version of the UNIX operating system. All rights reserved. Copyright 1981-1988, Regents of the University of California.
Cisco incorporates Fastmac and TrueView software and the RingRunner chip in some Token Ring products. Fastmac software is licensed to Cisco by Madge Networks Limited, and the RingRunner chip is licensed to Cisco by Madge NV. Fastmac, RingRunner, and TrueView are trademarks and in some jurisdictions registered trademarks of Madge Networks Limited. Copyright 1995, Madge Networks Limited. All rights reserved.
Xremote is a trademark of Network Computing Devices, Inc. Copyright 1989, Network Computing Devices, Inc., Mountain View, California. NCD makes no representations about the suitability of this software for any purpose.
The X Window System is a trademark of the X Consortium, Cambridge, Massachusetts. All rights reserved.
NOTWITHSTANDING ANY OTHER WARRANTY HEREIN, ALL DOCUMENT FILES AND SOFTWARE OF THESE SUPPLIERS ARE PROVIDED "AS IS" WITH ALL FAULTS. CISCO AND THE ABOVE-NAMED SUPPLIERS DISCLAIM ALL WARRANTIES, EXPRESSED OR IMPLIED, INCLUDING, WITHOUT LIMITATION, THOSE OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT OR ARISING FROM A COURSE OF DEALING, USAGE, OR TRADE PRACTICE.
IN NO EVENT SHALL CISCO OR ITS SUPPLIERS BE LIABLE FOR ANY INDIRECT, SPECIAL, CONSEQUENTIAL, OR INCIDENTAL DAMAGES, INCLUDING, WITHOUT LIMITATION, LOST PROFITS OR LOSS OR DAMAGE TO DATA ARISING OUT OF THE USE OR INABILITY TO USE THIS MANUAL, EVEN IF CISCO OR ITS SUPPLIERS HAVE BEEN ADVISED OF THE POSSIBILITY OF SUCH DAMAGES.
AccessPath, AtmDirector, Browse with Me, CCDE, CCIP, CCSI, CD-PAC, CiscoLink, the Cisco NetWorks logo, the Cisco Powered Network logo, Cisco Systems Networking Academy, Fast Step, Follow Me Browsing, FormShare, FrameShare, GigaStack, IGX, Internet Quotient, IP/VC, iQ Breakthrough, iQ Expertise, iQ FastTrack, the iQ logo, iQ Net Readiness Scorecard, MGX, the Networkers logo, Packet, RateMUX, ScriptBuilder, ScriptShare, SlideCast, SMARTnet, TransPath, Unity, Voice LAN, Wavelength Router, and WebViewer are trademarks of Cisco Systems, Inc.; Changing the Way We Work, Live, Play, and Learn, Discover All That's Possible, and Empowering the Internet Generation, are service marks of Cisco Systems, Inc.; and Aironet, ASIST, BPX, Catalyst, CCDA, CCDP, CCIE, CCNA, CCNP, Cisco, the Cisco Certified Internetwork Expert Logo, Cisco IOS, the Cisco IOS logo, Cisco Systems, Cisco Systems Capital, the Cisco Systems logo, Enterprise/Solver, EtherChannel, EtherSwitch, FastHub, FastSwitch, IOS, IP/TV, LightStream, MICA, Network Registrar, PIX, Post-Routing, Pre-Routing, Registrar, StrataView Plus, Stratm, SwitchProbe, TeleRouter, and VCO are registered trademarks of Cisco Systems, Inc. and/or its affiliates in the U.S. and certain other countries. All other trademarks mentioned in this document or Web site are the property of their respective owners. The use of the word partner does not imply a partnership relationship between Cisco and any other company. (0105R)
INTELLECTUAL PROPERTY RIGHTS: THIS DOCUMENT CONTAINS VALUABLE TRADE SECRETS AND CONFIDENTIAL INFORMATION OF CISCO SYSTEMS, INC. AND ITS SUPPLIERS, AND SHALL NOT BE DISCLOSED TO ANY PERSON, ORGANIZATION, OR ENTITY UNLESS SUCH DISCLOSURE IS SUBJECT TO THE PROVISIONS OF A WRITTEN NON-DISCLOSURE AND PROPRIETARY RIGHTS AGREEMENT OR INTELLECTUAL PROPERTY LICENSE AGREEMENT APPROVED BY CISCO SYSTEMS, INC. THE DISTRIBUTION OF THIS DOCUMENT DOES NOT GRANT ANY LICENSE IN OR RIGHTS, IN WHOLE OR IN PART, TO THE CONTENT, THE PRODUCT(S), TECHNOLOGY OR INTELLECTUAL PROPERTY DESCRIBED HEREIN.
Low Level Design Template Copyright 2001-2, Cisco Systems, Inc. All rights reserved. COMMERCIAL IN CONFIDENCE.
Contents
Tables
Figures
Document Control
  History
  Review
  Design Acceptance
About This Design Document
  Document Purpose
  Scope
  Document Usage Guidelines
  Assumptions and Caveats
  Related Documents
Network Overview
  Network Topology
    WAN Overview
  Network Infrastructure
    Core
    Edge
    Access
  Traffic Flow and Characteristic
  Existing Services and SLAs
Proposed Network Architecture
  Design Considerations
    MPLS Network Architecture
    Quality-of-Service
    MPLS/VPN Services
  Detailed Naming and Addressing Specifications
    BGP AS Number
    IP Addressing
    MPLS/VPN Attributes
Deployment Guidelines
  Physical Network Design
    Layer-2 Transport Media
    Central Office Bratislava [BA]
    Central Office Banska Bystrica [BB]
    Central Office Kosice [KE]
    Regional PoPs
    Hardware/Software Release Table
Logical Network Design
  IGP Routing (OSPF or ISIS)
    The Role of OSPF in <Customer name> MPLS network
    OSPF Areas
    OSPF Authentication
    Loopback Addresses
    OSPF Area Summarization
    OSPF Costs
    Designated and Backup Designated Routers
    Default Routes
    OSPF Convergence
    OSPF DNS Lookup
    OSPF Configuration Template
    OSPF Deployment Recommendations Summary for the <CUSTOMER NAME> network
  Backbone Routing and Label Distribution Protocols
    Cisco Express Forwarding (CEF) Switching
    Label Distribution Protocol (LDP)
Network Services
  MPLS/VPN Services
    MPLS-VPN
    MP-iBGP4 (Multi-protocol iBGP) Implementation
  Creating VRF Definitions
    VRF Name
    Route-Distinguisher
    VPN Route Target Communities
  VPN Topologies
    Full Mesh
    Hub and Spoke
    Extranets
      Customers with Unique Addresses
      Customers with Overlapping Addresses
      Extranet NAT at a Common Service Point
      Extranet NAT at Customer Edge
      Controlling route exports in extranets
PE-CE Routing Implementation
  Connectivity via Static Routing
  RIPv2 configuration (PE to CE)
    PE Configuration
    CE Configuration
  eBGP configuration (PE to CE)
    Configuration at the PE
    Controlling number of VRF routes
Additional MPLS VPN Services
  Internet Access for MPLS/VPN customers
    Separate CEs for Internet Access and VPN Access
    Low-cost Internet Access (1CE + one/two access links)
  Shared vrf-aware services
    Network Address Translation for MPLS/VPN customers
    Connecting Downstream ISPs to PE routers
  Remote Access (ASWAN/Security, Dial, DSL, Cable)
  Wireless
  VOIP
  Inter-AS/CsC
Traffic Engineering and Fast Reroute Technology Overview
  Traffic Engineering Basics
  Traffic Trunk Attributes
    Bandwidth
    Path Selection Policy
    Resource Class Affinity
    Adaptability
    Resilience
    Priority
  Resource Attributes
    Available Bandwidth
    Resource Class
  Path Selection
  Path Setup
  Link Protection (FRR) Basics
    Increased Reliability for IP Services
    High Scalability Solution
TE/TE-FRR Design
  Deciding on the tunnel topology and tunnel types
  How to Route Traffic Into TE Tunnels
    Policy Based Routing
    Static Routing Into Tunnels
    Auto-Route
    Forwarding Adjacency
  Using Directed LDP Sessions
  Number of Protected Prefixes
  3 Implementation Of TE-FRR
  3 Network Architecture
    Introduction
    TE-FRR Design
      Primary Tunnels
      Backup Tunnels
    Sample configurations
      Generic Global Commands
      Birmingham P Router
  Quality of Service
    Introduction
    Differentiated Services Model - Introduction
    DiffServ Aware TE
    ST QoS design - An Overview
    CE-to-PE QoS mechanisms (applied on the CE) - PPP or HDLC
    CE-to-PE QoS mechanisms (applied on the PE) - PPP or HDLC
    PE-to-P QoS mechanisms (applied on the PE)
    PE-P, P-P and P-PE QoS mechanisms (applied on the P)
    PE to CE QoS mechanisms (applied on the PE)
    QoS mechanisms on ATM PVCs (applied on the CE and PE)
High Availability
Security
    Password Management
    Console Ports
    Controlling TTYs
    Controlling VTYs and Ensuring VTY Availability
    Logging
    Anti-spoofing
    Controlling Directed Broadcasts
    IP Source Routing
    ICMP Redirects
    CDP
    NTP
Network Management
Appendix I
Appendix II
Tables
Table 1 Revision History
Table 2 Revision Review
Table 3 PE-P Connectivity in CO BA
Table 4 Software Release Table
Table 5 Proposed OSPF Metrics
Table 6 OSPF Timer Default Values
Table 7 RT/RD Allocation
Table 8 BGP Timer Definitions
Table 9 Tunnel Provisioning
Table 10 Class-Selector PHBs
Table 11 Serialisation delay [ms] as function of link speed and packet size
Table 12 Recommended fragment size
Table 13 The components of the end-to-end delay model
Table 14 CoS Mechanisms Overview
Table 15 NB and EB settings
Table 16 WRED Settings for Business Class
Table 17 WRED Settings for Streaming Class
Table 18 WRED Settings for Standard Class
Table 19 WRED - exponential weighting constant
Table 20 MDRR weights
Table 21 WRED Settings for Business Class (ENG-2 GSR)
Table 22 WRED Settings for Streaming Class (ENG-2 GSR)
Table 23 WRED Settings for Standard Class (ENG-2 GSR)
Table 24 ATM Overhead
Table 25 LLQ bandwidths and ATM
Figures
Figure 1 <Company's name> network WAN topology
Figure 2 Architecture of CO BA
Figure 3 HW configuration of ba2-igw-2
Figure 4 HW configuration of ba1-igw-1
Figure 5 HW configuration of ba-six-1
Figure 6 HW configuration of ba1-p-1 and ba2-p-2
Figure 7 HW Configuration of ba2-pe-4
Figure 8 HW Configuration of ba1-pe-5
Figure 9 HW Configuration of ba1-pe-6 and ba1-pe-7
Figure 10 Architecture of CO BB
Figure 11 HW configuration of bb1-p-1
Figure 12 HW configuration of bb2-p-2
Figure 13 HW configuration of bb1-pe-1
Figure 14 HW Configuration of bb2-pe-2
Figure 15 HW Configuration of bb1-cat-1 and bb1-cat-2
Figure 16 Architecture of CO KE
Figure 17 HW configuration of ke1-p-1
Figure 18 HW configuration of ke2-p-2
Figure 19 HW Configuration of ke1-pe-1
Figure 20 HW Configuration of ke2-pe-2
Figure 21 HW Configuration of ke1-cat-1 and ke1-cat-2
Figure 22 New Architecture of Regional PoPs (10k based)
Figure 23 New Architecture of regional PoPs (7206VXR based)
Figure 24 HW Configuration of 10008 in Regional PoPs (ZA, NI, TT, PO)
Figure 25 HW Configuration of 10005 in Regional PoP Trencin (TN)
Figure 26 HW Configuration of 7206VXRs in Regional PoP
Figure 27 OSPF enabled links
Figure 28 Layer 2 Frame with 2 MPLS Labels
Figure 29 MP-BGP VPN Route Distribution example
Figure 30 VPN route distribution using partitioned RRs
Figure 31 Route Reflector Redundancy in the <Customer Name> Networks
Figure 32 Redundant Route Reflectors with same cluster-id
Figure 33 Unique RD per each VPN
Figure 34 Unique RD per site for each VPN
Figure 35 PE-CE eBGP with unique AS
Figure 36 PE-CE eBGP with single network wide AS
Figure 37 Internet Access from a VPN using separate CEs
Figure 38 Internet Access from a VPN Single CE (two links in CEred, single link on CEblue)
Figure 39 NAT in CE router
Figure 40 Traffic Engineering Mechanisms
Figure 41 Traffic Engineering Path Setup
Figure 42 TE FRR Example
Figure 43 Topology Without Tunnels
Figure 44 R1 Routing Table, No MPLS TE
Figure 45 Topology With TE Tunnels
Figure 46 R1 Routing Table With Autoroute Announce
Figure 47 Forwarding Adjacency Topology
Figure 48 "3" Core Network Architecture
Figure 49 Illustration of Primary and Backup TE Tunnels
Figure 50 Various interpretations of the TOS field
Figure 51 DSCP Interpretation
Figure 52 Adaptive jitter buffer
Figure 53 Call admission control
Figure 54 LFI to reduce frame delay and jitter
Figure 55 Overview of end-to-end delay segments
Figure 56 DSCP to EXP mapping
Figure 57 DSCP / MPLS Headers
Figure 58 QoS mechanisms overview
Figure 59 In/Out-contract Marking and Policing (example for Business class)
Figure 60 CAR based In/Out-contract Marking and Policing
Figure 61 WRED Algorithm
Document Control
Authors:
Change Authority:
Reference Number: EDCS-xxxx
History
Table 1 Revision History
Version No. | Issue Date | Status | Reason for Change
Review
Table 2 Revision Review
Version No.: <Version number>
Date: <dd-mmm-yyyy>
Reviewers Details: <Name> <Organisation>
Change Forecast: Medium This document will be kept under revision control.
Design Acceptance
The signatories below confirm that the design meets the requirements specified. The design is subject to change during or following staging.
CISCO SYSTEMS
Slovak Telecom
By:__________________________________
Name:
Title:
Date:________________________________
Date:___________________________________
Note: The above sections may change depending on the customer's needs. The document provides sufficient detail to derive the device configurations that will be documented in the Network Implementation Plan. Some parameters may be determined during network deployment.
Scope
Please refer to Statement of Work documents for exact definition of project deliverables.
Related Documents
Write down the links to CRD, CRDR, SOW, HLD and Site Survey
Network Overview
Describe what kind of customer this is and what their core business is. Also explain their current architecture at a high level, with more details in the following section. This information can be collected from the CRD and HLD.
Network Topology
WAN Overview
The following figure is provided for illustrative purposes and depicts a high-level view of the <Company's name> network. The picture is simplified to make the WAN network topology easier to understand.
Figure 1 <Company's name> network WAN topology
Network Infrastructure
Core
If possible provide the details of current core network. The platforms used, kind of links, what routing protocol etc.
Edge
The following types of devices are installed in the <Company's name> network as Provider Edge (PE) routers:
Access
Customer Edge (CE) routers are classical IPv4 routers that interconnect customer sites with PE routers via leased lines (as described in a later chapter). CE routers usually reside on customer premises. A CE router can be managed by <Company's name> or by the customer.
Design Considerations
This chapter summarizes the design objectives that have been followed throughout the LLD, and the design rules adopted to meet these objectives. The objectives, as dictated by the customer, are listed below.
and backup route processors. Resilient connections between regional PoPs and Core locations will be rolled out in project phase 8A.
Network security: Cisco has implemented best-practice security mechanisms on ST routers to protect the network. Customer security and managed firewall services were not in the scope of any Cisco project.
Simplicity: The ST MPLS network design is clean and simple to understand. Any feature or design element that would increase network complexity but bring only limited overall benefit has been avoided. ST has decided to clean up the existing IP addressing scheme and to migrate from a multi-area OSPF design to a single-area design. MPLS LDP has been chosen for label distribution in the ST MPLS network. LDP is enabled on all core links (P-P, P-PE, P-RR, P-iGW).
Quality-of-Service
Traffic prioritisation: The following Classes of Service are implemented in the <Customer Network> network: Voice, Streaming, Business, Standard. Each class has different QoS attributes and a guaranteed (configured) bandwidth that cannot be utilised by any other class during congestion periods. Backbone links must be provisioned with sufficient capacity for each of the classes.
Flexibility: Modular QoS CLI allows the traffic flows of <Customer Name> customers to be mapped into one of the Classes of Service. Such classification and marking is extremely flexible (different customers can map different applications into any of the classes), but requires an understanding of traffic profiles (e.g. SMTP or any other data traffic must not be mixed with delay-sensitive VoIP packets).
Scalable implementation: The customer-specific QoS configuration is implemented on CE routers. The QoS configuration template on PE and P devices will remain stable and identical for all ST customers. VPNSC shall be used for accurate provisioning of QoS parameters on access (PE-CE) connections.
MPLS/VPN Services
Flexible and scalable managed IP VPN service: Achieved through MPLS technology, properly applied MPLS/VPN functionality and the VPNSC management system [1].
Service resilience: A resilient MPLS backbone, redundant route reflectors and the possibility of fully resilient connectivity scenarios on the access layer (2CE-2PE) in all PoPs are necessary building blocks for a high-availability MPLS/VPN service.
End-to-end Quality of Service: Achieved through the use of various Diffserv mechanisms: classification, marking, policing, queuing and dropping. QoS is implemented on the access layer and in the backbone.
Internet Access for MPLS/VPN customers: Internet access from the MPLS/VPN is provided for customers with such a requirement. For security reasons we recommend implementing the Internet connection only through a dedicated CE router and a dedicated access-layer circuit (see the relevant chapter for a detailed description).
Security: Assuming that the MPLS core is secure, the MPLS/VPN solution offers the same level of security as traditional layer-2 VPN networks.
IP Addressing
Explain in detail the IP addressing scheme in the customer's network.
Loopbacks
The following address block is used for numbering of Loopback0 [2] interfaces in the <Customer Name> MPLS network: X.X.X.X. Give any additional details relating to loopback addresses.
[2] Loopback interfaces are used primarily for OSPF stability and for the creation of iBGP and LDP neighborships.
Backbone Links
Backbone connections are numbered with IP addresses from the following subnets: X.X.X.X. Explain in detail how these subnets would be allocated to different links.
Existing Routers
In this section, explain any renumbering of IP addresses that might be needed for any existing infrastructure.
IP-SLA
Depending on how IP-SLA is deployed, explain the addressing scheme that would be required.
MPLS/VPN Attributes
VPN naming and addressing is described by four attributes: the VRF name, RD, RT and SOO. Detailed naming of MPLS/VPN attributes is given in the NMS document.
VRF Name
The VRF name is locally significant on the PE router. Service providers that provision MPLS/VPNs manually (via CLI on the router console) are advised to use a VRF name that is recognisable across the whole network in order to aid troubleshooting. The name should identify the predominant function of the CEs connected to the VRF. This can be a word that identifies the customer or, in the case of a VRF that is shared by multiple customers, the type of service offered by the attached CEs. Explain how the VRF name would be configured. Also explain whether a provisioning tool such as ISC is being used.
RD
Explain how RD is being assigned. Also explain if any provisioning tool like ISC is being used
RT
RTs define the VPNs; the value is therefore significant across the whole network. A different RT is required for each VPN. Hub-and-spoke VPNs can be considered as two uni-directional VPNs and therefore need two RTs if bi-directional routing is required. Explain how the RT is being assigned. Also explain whether a provisioning tool such as ISC is being used.
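The hub-and-spoke case can be illustrated with a small model. This is a minimal Python sketch, not router configuration: the RT values 65000:100 and 65000:101, the VRF names and the helper imports_from are hypothetical and are not taken from this design.

```python
from dataclasses import dataclass, field

# Hypothetical RT values, chosen only for illustration.
RT_HUB_EXPORT = "65000:100"    # attached to routes exported by the hub VRF
RT_SPOKE_EXPORT = "65000:101"  # attached to routes exported by spoke VRFs


@dataclass
class Vrf:
    name: str
    import_rts: set = field(default_factory=set)
    export_rts: set = field(default_factory=set)


def imports_from(receiver: Vrf, sender: Vrf) -> bool:
    """A VRF installs a route when it imports at least one RT the sender exports."""
    return bool(receiver.import_rts & sender.export_rts)


hub = Vrf("hub", import_rts={RT_SPOKE_EXPORT}, export_rts={RT_HUB_EXPORT})
spoke_a = Vrf("spoke-a", import_rts={RT_HUB_EXPORT}, export_rts={RT_SPOKE_EXPORT})
spoke_b = Vrf("spoke-b", import_rts={RT_HUB_EXPORT}, export_rts={RT_SPOKE_EXPORT})

print(imports_from(hub, spoke_a))      # hub learns spoke routes
print(imports_from(spoke_a, hub))      # spokes learn hub routes
print(imports_from(spoke_a, spoke_b))  # spokes do not learn each other's routes directly
```

With only one RT shared by all VRFs, every site would import every other site's routes, which is the any-to-any case rather than hub-and-spoke.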
SOO
SOO is required in order to provide loop avoidance on multi-homed sites. The same SOO attribute should be added to every routing update originating from the multi-homed site. Routes received by a VRF that carry the same SOO as the one exported by that VRF should be filtered out so that they are not imported. Only multi-homed sites require the SOO attribute, so SOO values will only be allocated as required. Explain how the SOO is being assigned. Also explain whether a provisioning tool such as ISC is being used.
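The SOO filtering rule can be sketched as follows. This is an illustrative Python model only; the SOO value 65000:1, the prefixes and the function name accepted_routes are assumptions made for the example and do not come from this design.

```python
from typing import NamedTuple, Optional


class VpnRoute(NamedTuple):
    prefix: str
    soo: Optional[str]  # Site of Origin carried by the route, if any


def accepted_routes(routes, vrf_soo):
    """Drop routes whose SOO matches the SOO configured for the attached site."""
    if vrf_soo is None:
        # Single-homed site: no SOO allocated, so nothing is filtered.
        return list(routes)
    return [r for r in routes if r.soo != vrf_soo]


SITE_A_SOO = "65000:1"  # hypothetical value for a multi-homed site

received = [
    VpnRoute("10.1.0.0/16", SITE_A_SOO),  # originated at site A, seen again via the second PE
    VpnRoute("10.2.0.0/16", None),        # originated at a single-homed site
]

for route in accepted_routes(received, SITE_A_SOO):
    print(route.prefix)
```

The route that originated at the multi-homed site is not sent back into it, which is what breaks the potential loop.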
BGP AS Numbers
When peering with a customer CE router, the customer can use its own registered ASN if it has one. If not, private ASNs (64512 to 65535) can be used on the CEs. The same ASN can be used for all the sites of a VPN to conserve ASNs; this is recommended and also allows for VPNs that have more than 1024 sites. The as-override feature is used on the PEs so that the same ASN can be reused for all the sites.
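Why as-override is needed when all sites share one ASN can be shown with a short model of the AS path. This is a simplified Python sketch under stated assumptions: the provider ASN 64500 and customer ASN 65001 are hypothetical, and the functions pe_advertise and ce_accepts are illustrative names, not an implementation of any router software.

```python
def ce_accepts(as_path, ce_asn):
    """eBGP loop prevention: a router rejects a path that already contains its own ASN."""
    return ce_asn not in as_path


def pe_advertise(as_path, provider_asn, ce_asn, as_override):
    """Build the AS path a PE sends to a CE, optionally applying as-override."""
    if as_override:
        # Replace every occurrence of the CE's ASN with the provider ASN.
        as_path = [provider_asn if asn == ce_asn else asn for asn in as_path]
    # The PE prepends its own ASN on the eBGP session towards the CE.
    return [provider_asn] + as_path


PROVIDER_ASN = 64500  # hypothetical provider ASN
CUSTOMER_ASN = 65001  # the same private ASN reused at every site of the VPN

path_from_site_a = [CUSTOMER_ASN]

plain = pe_advertise(path_from_site_a, PROVIDER_ASN, CUSTOMER_ASN, as_override=False)
overridden = pe_advertise(path_from_site_a, PROVIDER_ASN, CUSTOMER_ASN, as_override=True)

print(plain, ce_accepts(plain, CUSTOMER_ASN))
print(overridden, ce_accepts(overridden, CUSTOMER_ASN))
```

Without the override, the CE at the second site sees its own ASN in the path and discards the route; with it, the path contains only the provider ASN and the route is accepted.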
Deployment Guidelines
This chapter explains how to deploy the architecture proposed in the previous chapter.
Figure 2 Architecture of CO BA
Peering Layer
Internet Gateway Routers (ba1-igw-1, ba2-igw-2)
Two routers (Cisco 12406 and Cisco 12008) will be installed in BA CO for IP connectivity between the ST MPLS network and:
- Downstream ISPs (eg. a local ASP, Application Service Provider) that pay ST for transit service; these are in fact customers of ST.
- Upstream ISPs (eg. Deutsche Telekom, UTA) that provide global Internet reachability for customers of ST.
Each iGW will have a POS STM-16 back-to-back connection with a different P router. Interconnections with ISPs can be either POS STM-1 or E3 leased lines. Both iGWs are equipped with powerful route processors (primary and redundant) that can handle a large number of BGP sessions, and will have sufficient memory installed to carry one or more copies of the full Internet routing table.
Back-to-back links between iGW routers
Both iGWs will inject a BGP default route towards PE routers. A PE router will select the default route based on the IGP distance to the originating iGW, and will therefore send all Internet traffic to the closest iGW. However, this iGW may not be the best exit point for a given Internet destination, so the packets would have to be re-routed to the neighbouring iGW to be delivered to the upstream ISP. For this reason, two POS STM-1 back-to-back links are installed between the iGWs. No other traffic (eg. packets between two ST PoPs) passes over these two links.
An alternative solution would be to download the full Internet routing table to every PE router, which can then deliver the Internet traffic to the right iGW. This would result in more optimal traffic flows across the ST core and enable a distributed peering system, with the possibility of connecting ISP circuits in any PoP. Assuming that BGP dampening is enabled on border routers, and that the number of routes that can be accepted from any ISP is limited [3], the major drawback is the memory requirement (min. 256 MB) on all PE routers due to the large number of routes in the global Internet routing table.
[3] It is good practice to define the maximum number of prefixes that can be accepted from any eBGP peer. This prevents, for example, the situation where a peering partner at SIX advertises the full Internet routing table to ST.
Figure 3 HW configuration of ba2-igw-2
[Figure: Cisco 12406 chassis layout. Recoverable labels: 1 x POS STM-16, 8 x POS STM-1, 6 x E3, 4 x POS STM-1, GRP, redundant GRP, CSC, SFC, alarm and power modules; connections to upstream/downstream ISPs and to the P router.]
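Figure 4 HW configuration of ba1-igw-1
[Figure: Cisco 12008 chassis layout. Recoverable labels: 1 x POS STM-16, 4 x POS STM-1 (two line cards), GRP, redundant GRP, CSC 0, CSC 1, power; connections to the P router and to upstream/downstream ISPs.]
Figure 5 HW configuration of ba-six-1
[Figure: Cisco 7304 chassis layout. Recoverable labels: NSE100 with GE0 and GE1, 1 x POS STM-16, power; connection to the P router.]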
(either ST's or that of the upstream ISP). Having two redundant Internet connections on separate routers will also permit software and hardware upgrades on the iGWs without long downtimes [4]. The two iGWs distribute BGP routes (default route, and full Internet table if required) to other BGP neighbors in the ST network via two redundant route reflectors. The Internet connectivity scheme with physically separated iGW routers protects against a failure or major disaster in one of the Bratislava firesections. Internet connectivity will remain available through the backup upstream ISP in the other firesection. There is currently a single router installed at SIX premises. If this router or the link between ba2-p-1 and the SIX router fails, direct connectivity with SIX participants will be lost. Nevertheless, this does not represent a single point of failure, because the peering partners' networks can still be reached across upstream ISPs during the failure [5].
Core Layer
MPLS P Routers (ba1-p-1, ba2-p-2)
ST has selected two Cisco 12410 routers as MPLS P devices in Bratislava CO. These P routers do not perform any aggregation layer services (eg. termination of customer links or peering circuits, BGP routing, etc.). P routers also do not route IP packets across the ST core; the only [6] task of the P routers is to label switch MPLS frames through high-speed links, respecting the queuing and dropping attributes that are encoded in the EXP bits of MPLS labels. Each P router links the following devices:
- Remote P router (GSR). POS STM-16 links interconnect the P routers into a high-speed backbone. The underlying technology is DWDM.
- Remote PE. Regional PEs in remote PoPs interconnect with P routers via POS STM-1 links.
- Collocated PEs. These are attached to P routers either via back-to-back POS interfaces or using GE technology in back-to-back mode.
[4] A short downtime will occur because of eBGP convergence throughout the Internet.
[5] Most likely this would introduce higher RTT and jitter, and increased load on the generally very expensive transit connections with upstream ISPs.
[6] This is not entirely true, because locally sourced packets (eg. OSPF updates) will be subject to label imposition.
Figure 6 HW configuration of ba1-p-1 and ba2-p-2
[Figure: Cisco 12410 chassis layout. Recoverable labels: 4 x POS STM-16 (two line cards), 8 x POS STM-1 (two line cards), 4 x GE (two line cards), GRP, redundant GRP, CSC 0 and 1, SFC 0 to 4, alarm 0 and 1, power.]
Note: A new four-port STM-16 LC will be installed in ba1-p-2 during the Phase 8A Project. Three-port GE LCs will be replaced by new four-port Eng3 LCs. This will allow proper QoS implementation on the P-PE connection (on the GSR side).
solution may be to overprovision the class bandwidth for Business data class, on account of under provisioned class bandwidth in Standard class.
Aggregation Layer
10008 MPLS PE - Concentration of Leased Line Customers (ba1-pe-5, ba2-pe-4)
Two ESR 10008 routers are installed in Bratislava CO for termination of leased-line customer connections (MPLS/VPN and Internet customers) and for linking the non-upgraded regional PoPs of ST with the MPLS core. For this purpose, the 10008s are equipped with 24 E1 ports and 6 POS STM-1 ports. The 10008 PEs are connected to the P routers with two GE uplinks in back-to-back mode, as shown in the following table. LX long-haul GBICs connected via single-mode dark fiber (10/9 micron) are used for the longer distances (< 6.2 miles/10 km) between firesections.
Table 3 PE-P Connectivity in CO BA
PE Device | PE Name | Primary P | Backup P
ESR 10008 | ba1-pe-5 | ba1-p-1 | ba2-p-2
ESR 10008 | ba2-pe-4 | ba2-p-2 | ba1-p-1
7609 | ba1-pe-6 | ba1-p-1 | ba2-p-2
7609 | ba1-pe-7 | ba1-p-1 | ba2-p-2
Figure 7 HW Configuration of ba2-pe-4 [8]
[Figure: Cisco 10008 chassis layout. Recoverable labels: 6 x POS STM-1 (two line cards), 24 x E1 (two line cards), 8 x E3, 1 x GE uplinks, PRE A, PRE B, primary and redundant PEM; connections to customers and regional PoPs (CEs), primary P and backup P.]
Figure 8 HW Configuration of ba1-pe-5 [9]
[Figure: Cisco 10008 chassis layout. Recoverable labels: 6 x POS STM-1 (two line cards), 24 x E1 (two line cards), 8 x E3, 1 x GE uplinks, PRE A, PRE B, primary and redundant PEM; connections to customers and regional PoPs (CEs), primary P and backup P.]
[8] A new 10008 chassis with a 6 x POS STM-1 LC, a 24 x E1 LC and two half-slot GE LCs will be installed in BA firesection 2. One 6 x POS STM-1 LC will be taken from the current ba-pe-4 router. One 8 x E3 LC will be taken from the ke1-pe-1 router.
Each 7609 has a primary and backup GE uplink towards the P routers in BA PoP.
Figure 9 HW Configuration of ba1-pe-6 and ba1-pe-7 [10]
[Figure: Cisco 7609 chassis layout. Recoverable labels: SUP2 (PFC2 + MSFC2) with 2 x GE, redundant SUP2 (PFC2 + MSFC2) with 2 x GE, SFM2 (Switch Fabric Module), DFC, OSM 4 x GE, 48 x 10/100 RJ-21, 16 x GE, PS and redundant PS.]
[10] A new 16-port GE module will be installed in ba1-pe-6 and ba1-pe-7 during Project Phase 8A.
Access Layer
Customer Edge CE Routers
There are two types of CE routers connected to ST PE routers: customer-managed and ST-managed CE devices. Cisco recommends limiting the number of different platforms and interfaces in order to minimize operational and management costs. For example:
- Low-end CE router: 800, 1700 and 2600 series
- Mid-range CE router: 3600 series
- High-capacity CE router: 7200 (or a more powerful device when required)
A customer-managed CE router can be any device that supports the leased-line technology and protocols offered by ST.
The following routers in Bratislava PoP are connected with a single FE back-to-back connection to the 7609 PEs: ba2-ce-1, ba2-ce-2, ba2-ce-3, ba2-nas-1, ba2-nas-2. New 5850 access servers will be rolled out in a separate project. Two logically separated access servers (ba2-nas-3 and ba2-nas-4) on the same physical device will be installed in Bratislava firesection 2. They will be dual-homed to the 7609 PEs via two GE uplinks.
Alternative to migration of existing leased-line customers
ST could progressively turn the old IP routers into MPLS PEs and attach them directly to P routers. For this, the old 7206 and 3640 routers would need a software upgrade to a more recent (and stable) IOS release that offers the required MPLS/QoS functionality on the given hardware. Assuming that CPU utilisation on the existing CE routers is not problematic, this might be a more appropriate alternative than migrating hundreds of leased lines to the new platforms.
Figure 11 HW configuration of bb1-p-1 [11]
[Figure: Cisco 12012 chassis layout. Recoverable labels: 1 x POS STM-16 (three line cards), 8 x POS STM-1 (two line cards), 4 x GE, GRP, redundant GRP, alarm, power A and power B; connections to the P router, regional PEs and BB aggregation.]
Figure 12 HW configuration of bb2-p-2 [12]
[Figure: Cisco 12410 chassis layout. Recoverable labels: 4 x POS STM-16, 16 x POS STM-1, 4 x GE, PRP, redundant PRP, CSC 0 and 1, SFC 0 to 4, alarm 0 and 1, power; connections to P routers, regional PEs, the 7507 PE router and BB aggregation.]
[11] A new 1-port STM-16 LC and a new 4-port GE LC will be installed in bb1-p-1 during the Phase 8A Project. The 1 x GE LC will be removed.
[12] Newly installed during the Phase 8A Project.
Figure 13 HW configuration of bb1-pe-1 [13]
[Figure: Cisco 10008 chassis layout. Recoverable labels: 6 x POS STM-1, 24 x E1, 8 x E3, 1 x GE line cards, PRE A, PRE B, primary and redundant PEM; connections to customers and regional PoPs (CEs), ST IP devices (CAR, Dist) via 4503, primary P and backup P.]
Figure 14 HW Configuration of bb2-pe-2 [14]
[Figure: Cisco 10008 chassis layout. Recoverable labels: 6 x POS STM-1, 24 x E1 (two line cards), 8 x E3, 1 x GE line cards, PRE A, PRE B, primary and redundant PEM; connections to customers and regional PoPs (CEs), primary P and backup P, ST IP devices (CAR, Dist) via 4503.]
[13] Two new half-slot GE LCs and one single-slot GE LC will be installed in bb1-pe-1 during the Phase 8A Project. The second single-slot GE LC will be moved from the existing 10005 to bb1-pe-1. One 8 x E3 LC is moved to the ba1-pe-5 router.
[14] A new 10008 will replace the existing 10005 (moved to Trencin) during the Phase 8A Project.
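Figure 15 HW Configuration of bb1-cat-1 and bb1-cat-2 [15]
[Figure: Catalyst 4503 chassis layout. Recoverable labels: SUP IV with 2 x GE, 6 x GE, 48 x FE; GE uplinks to PEs and other Cats; ST IP devices (CAR, Dist).]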
Figure 17 HW configuration of ke1-p-1 [16]
[Figure: Cisco 12012 chassis layout (labelled ke_p_1). Recoverable labels: 1 x POS STM-16 (three line cards), 4 x POS STM-1, 8 x POS STM-1, 4 x GE, GRP, redundant GRP, alarm, power A and power B; connections to the P router, regional PEs and KE aggregation.]
Figure 18 HW configuration of ke2-p-2 [17]
[Figure: Cisco 12410 chassis layout. Recoverable labels: 4 x POS STM-16, 16 x POS STM-1, 4 x GE, PRP, redundant PRP, CSC 0 and 1, SFC 0 to 4, alarm 0 and 1, power; connections to P routers, regional PEs, the 7507 PE router and KE aggregation.]
[16] A new 1-port STM-16 LC and a new 4-port GE LC will be installed in ke1-p-1 during the Phase 8A Project.
[17] Newly installed during the Phase 8A Project.
Figure 19 HW Configuration of ke1-pe-1 [18]
[Figure: Cisco 10008 chassis layout. Recoverable labels: 6 x POS STM-1, 24 x E1, 8 x E3, 1 x GE line cards, PRE A, PRE B, primary and redundant PEM; connections to customers and regional PoPs (CEs), primary P and backup P, ST IP devices (CAR, Dist) via 3550.]
Figure 20 HW Configuration of ke2-pe-2 [19]
[Figure: Cisco 10008 chassis layout. Recoverable labels: 6 x POS STM-1, 24 x E1 (two line cards), 8 x E3, 1 x GE line cards, PRE A, PRE B, primary and redundant PEM; connections to customers and regional PoPs (CEs), primary P and backup P, ST IP devices (CAR, Dist) via 4503.]
[18] Four new half-slot GE LCs will be installed in ke1-pe-1 during the Phase 8A Project. One 8 x E3 LC is moved to ba2-pe-4.
[19] Newly installed during the Phase 8A Project.
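Figure 21 HW Configuration of ke1-cat-1 and ke1-cat-2 [20]
[Figure: Catalyst 4503 chassis layout. Recoverable labels: SUP IV with 2 x GE, 6 x GE, 48 x FE; GE uplinks to PEs and other Cats; ST IP devices (CAR, Dist).]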
Regional PoPs
During the transition phase of the ST IP network to MPLS/VPN technology, the old 7206 (CAR/POP), 3640 (CAR) and AS5300 (NAS) routers have been cascaded behind 7206VXRs as CE devices. Each 7206VXR router has been attached with a single POS STM-1 uplink to the closest P router.
In the scope of phase 2 of the MPLS/VPN project (deployment of 17 new PoPs), ST decided to improve the design of remote PoPs. The design change introduces a new Catalyst 3550 that will aggregate local LAN-attached devices. This will allow direct attachment of old ST IP routers to the new PE router. The connection between a Catalyst and a PE router will be configured with dot1Q encapsulation and several logical sub-interfaces on the PE router. Each sub-interface can be assigned to a different VRF, or to the global routing table, which makes deployment of co-located CEs or customers' web servers straightforward. The number of VLANs in the Catalyst switch will correspond to the number of VRFs implemented on the PE router (+1 for the global routing table).
Part of Project Phase 8A is to achieve higher service availability in the regional PoPs by deploying second PE routers. Two groups of regional PoPs will be deployed:
- A new C10k PE router will be implemented next to the current C7206VXR in 5 PoPs (Zilina, Nitra, Trnava, Presov, Trencin).
- A new 7206VXR will be implemented next to the current C7206VXR in 17 PoPs (Senica, Topolcany, Dunajska Streda, Nove Zamky, Prievidza, Martin, Povazska Bystrica, Liptovsky Mikulas, Levice, Spisska Nova Ves, Bardejov, Poprad, Humenne, Michalovce, Roznava, Lucenec, Zvolen).
The following drawing shows the new design for the remote PoPs where a new 10k PE router will be deployed; Nitra PoP has been taken as a typical example.
Note: The new regional PoP design is described in Appendix II.
Figure 22 New Architecture of Regional PoPs (10k based)
The following drawing shows the new design for the remote PoPs where a new 7206VXR PE router will be deployed; Senica PoP has been taken as a typical example.
Figure 23 New Architecture of regional PoPs (7206VXR based)
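Figure 24 HW Configuration of 10008 in Regional PoPs (ZA, NI, TT, PO)
[Figure: Cisco 10008 chassis layout. Recoverable labels: 6 x POS STM-1, 24 x E1 (two line cards), 8 x E3, 1 x GE (two line cards), PRE A, PRE B, primary and redundant PEM.]
Figure 25 HW Configuration of 10005 in Regional PoP Trencin (TN)
Figure 26 HW Configuration of 7206VXRs in Regional PoP
[Figure: Cisco 7206VXR chassis layout. Recoverable labels: PA-MC-8E1/120, PA-8T-X21, PA-POS-OC3SMI, C7200-I/O-2FE/E; connections to customer CEs, uplink to P router, ST IP devices (CAR, Dist) via 3550.]
Note: Two new half-slot GE LCs will be installed in tn-pe-2 (moved from BB PoP) during the Phase 8A Project.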
New Engine 3-based 4-port GigE linecards (4GE-SFP-LC, aka Tetra) will be implemented in the 12012 and 12410 P routers in the central offices. An IOS upgrade to 12.0(25)S is required to support this new hardware; 12.0(25)S is the first release supporting this linecard. Apart from the new hardware, which is the main reason for the IOS upgrade, some of the IOS releases in ST's network are vulnerable to Denial of Service (DoS) attacks. Refer to the following URL for more information: http://www.cisco.com/warp/public/707/cisco-sa-20030717-blocked.shtml
The core release 12.0(25)S2 is the third maintenance release of 12.0(25)S and should be deployed, after as much testing as possible during the staging phase, on all 12xxx core routers and 7204VXR routers (acting as BGP RRs). Due to IOS bug CSCec57264, the 12.0(25)S2 release is not available for 1000x routers (already running 12.0(25)S1). The 1000x routers should be upgraded to a later maintenance release of 12.0(26)S.
The new edge release 12.1(11b)E12 includes several bug fixes compared to the currently used 12.1(11b)E4b. It should be deployed on 7206VXR PE routers in order to improve stability and to fix the vulnerability.
Once satisfied, the next step should be to deploy the code on one or two devices in a redundant and non-critical area of the network for a one-week pilot phase. During this period, ST should monitor the status of these devices to ensure successful deployment. The code should then be rolled out in a controlled and logical fashion.
From a strategic standpoint, Cisco recommends using as few different IOS releases as possible. Using just one IOS release across all platforms is sometimes not achievable, due to differences in HW architecture and feature requirements. Nevertheless, deploying 12.0(25)S2 on two platforms will be a first step towards limiting the number of different IOS releases in ST's network.
In the medium term (3 to 6 months from now), Cisco recommends carrying out a new IOS evaluation. The aim of this evaluation should be the definition of a long-term IOS strategy for ST's network. The IOS strategy depends heavily on the network evolution in terms of new features and hardware deployments. It should therefore be covered by ongoing Cisco Network Optimization Support (NOS).
<Customer name> already runs OSPF in its current network. <Customer name> is therefore very familiar with OSPF operation, has gained considerable experience in OSPF troubleshooting, and has requested that OSPF be preserved as the IGP in the MPLS network. The choice of OSPF is a very good one as it is standardised, scales well and converges quickly.
OSPF is responsible for interior routing only. It is not used to carry any customer addresses or links. Network addresses of the following links are carried in the OSPF LSAs:
- backbone P-P links
- distribution layer PE-P links
- loopback0 interfaces
- RR-P links
- CE, NAS connections*
Figure 27 [diagram: OSPF Area 0 containing the RR (7200VXR), PE (7609), regional PE (7206VXR), P (GSR), PE (10008) and iGW (GSR) routers with their loopbacks; attached are existing LL customers, dialup users and peering circuits]
In addition to backbone links, the subnets allocated to NMS VLANs and VPNSC LAN are redistributed in the backbone OSPF as connected/static routes. This is required to establish connectivity between NOC sites and P routers, which do not run BGP. <Use the above paragraph if you are not using an out of band connection for management purposes>
OSPF Areas
Single-area or multi-area OSPF will be implemented in the <Customer Network> network. This decision is based on: Give reasons here. Also discuss in this section how the OSPF areas will be numbered. Discuss in detail the maximum number of routers that will be in an area, and the scale numbers for ABRs and ASBRs.
OSPF Authentication
OSPF packets can be authenticated so that routers participate in routing domains based on predefined passwords. By default, a router uses Null authentication, which means that routing exchanges over a network are not authenticated. Two other authentication methods exist: simple password authentication and Message Digest (MD5) authentication. Authentication is not mandatory, but it is strongly recommended for security purposes. MD5 is the recommended method since it provides higher security than plain-text authentication.
Message Digest authentication is a cryptographic authentication. A key (password) and key-id are configured on each router. The router uses an algorithm based on the OSPF packet, the key, and the key-id to generate a message digest that is appended to the packet. Unlike simple authentication, the key is not exchanged over the wire. A non-decreasing sequence number is also included in each OSPF packet to protect against replay attacks.
This method also allows uninterrupted transitions between keys, which is helpful for administrators who wish to change the OSPF password without disrupting communication. If an interface is configured with a new key, the router sends multiple copies of the same packet, each authenticated by a different key. The router stops sending duplicate packets once it detects that all of its neighbors have adopted the new key. The following commands are used for message digest authentication:
interface <interface type-number>
 ip ospf message-digest-key <key-id> md5 <key>
router ospf 19
 area <area-id> authentication message-digest
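As a rough illustration of the message digest mechanism described above, the following Python sketch appends a keyed MD5 digest to a packet and checks it on receipt. It is a simplification and not the OSPF wire format: the function names, the 16-byte key padding and the way the sequence number is passed in are assumptions made for this example.

```python
import hashlib

DIGEST_LEN = 16  # an MD5 digest is always 16 bytes


def pad_key(key: bytes) -> bytes:
    """Pad or truncate the shared key to 16 bytes (assumed layout for this sketch)."""
    return key[:16].ljust(16, b"\x00")


def sign(packet: bytes, key: bytes) -> bytes:
    """Return the packet with the MD5 digest of (packet + key) appended.

    The key itself is only an input to the hash and is never sent.
    """
    return packet + hashlib.md5(packet + pad_key(key)).digest()


def verify(signed: bytes, key: bytes) -> bool:
    """Recompute the digest with the local key and compare it with the received one."""
    packet, received = signed[:-DIGEST_LEN], signed[-DIGEST_LEN:]
    return hashlib.md5(packet + pad_key(key)).digest() == received


def accept(signed: bytes, key: bytes, seq: int, last_seq: int) -> bool:
    """Accept only packets with a valid digest and a non-decreasing sequence number."""
    return verify(signed, key) and seq >= last_seq
```

A receiver configured with a different key computes a different digest and drops the packet, and a replayed packet with an older sequence number is rejected even though its digest is valid.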
Loopback Addresses
Each OSPF speaker has a loopback address configured. It is used to keep the router's OSPF router ID stable. These loopback interfaces are in OSPF passive mode to optimise the routing process.
OSPF Costs
Discuss here the OSPF costs for the different links. Explain any considerations taken into account when deciding the OSPF costs. The following table can be used to define the costs.
Table 5 Proposed OSPF Metrics
Link categories: CE-PE and PE-PE (non-MPLS); P-P and iGW1-iGW2; PE-P primary, RR-P and iGWx-P; PE-P backup, RR-P and iGWx-P; PE-PE (MPLS)
Bandwidth [Mbps]: E1 = 2, E3 = 34, FE = 100
Metric values listed: 31300, 31200, 31100, 5700, 10700
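For comparison with manually assigned metrics such as those proposed in Table 5, the sketch below shows how a default OSPF cost is commonly derived from interface bandwidth (reference bandwidth divided by interface bandwidth, truncated, with a minimum of 1). The 100 Mbps reference value and the nominal bandwidths from the table are assumptions of this example; it does not reproduce the proposed metrics.

```python
def default_ospf_cost(bandwidth_mbps: float, reference_mbps: float = 100.0) -> int:
    """Cost = reference bandwidth / interface bandwidth, truncated, never below 1."""
    return max(1, int(reference_mbps / bandwidth_mbps))


# Nominal bandwidths taken from the table above.
for name, mbps in [("E1", 2), ("E3", 34), ("FE", 100)]:
    print(name, default_ospf_cost(mbps))
# E1 50
# E3 2
# FE 1
```

Because every link at or above the reference bandwidth ends up with a cost of 1, designs with faster links usually assign costs explicitly per link category, which is what the table is intended to capture.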
Default Routes
If there are any default routes, explain how and where they are being injected.
OSPF Convergence
Resiliency and redundancy to circuit failure is provided by the convergence capabilities of OSPF at layer 3. There are two components to OSPF routing convergence: detection of topology changes and recalculation of routes.
Detection of topology changes is supported in two ways by OSPF. The first, and quickest, is a failure or change of status on the physical interface, such as Loss of Carrier. The second is a timeout of the OSPF hello timer. An OSPF neighbor is deemed to have failed if the time spent waiting for a hello packet exceeds the dead timer, which defaults to four times the value of the hello timer. On a Serial, Fast Ethernet or Gigabit Ethernet interface, the default hello timer is 10 seconds; therefore the dead timer is 40 seconds.
Recalculation of routes is done by each router after a failure has been detected. A link-state advertisement (LSA) is sent to all routers in the OSPF area to signal a change in topology. This causes all routers to recalculate all of their routes using the Dijkstra (SPF) algorithm. This is a CPU-intensive task, and a large network with unreliable links could cause a CPU overload.
When a link goes down and layer 2 is not able to detect the failure, convergence in the core can be improved by decreasing the value of the hello timer. The timer should not be set too low, as this may cause phantom failures and hence unnecessary topology recalculations. Remember that these timers are used to detect failures that are not at the physical level, for example when carrier still exists but there is some sort of failure in the intermediate network.
Once a topology change has been detected, an LSA is generated and flooded to the rest of the devices in the network. Recalculation of the routes does not occur until the SPF timer has expired. The default value of this timer is 5 seconds. An SPF hold time is also used to delay consecutive SPF calculations (to give the router some breathing space). The default for this value is 10 seconds. As a result, the minimum time for the routes to converge after a failure is always more than 5 seconds unless the SPF timers are tuned using the OSPF throttle timers. With throttle timers it is possible to schedule the SPF run right after the LSA information is flooded, but this can cause instability in the network; for example, even a very short flash congestion could cause a link to be declared down and trigger an SPF run.
These timers will be left alone in the initial implementation, especially because MPLS Traffic Engineering with Fast-ReRoute (FRR) capability will be deployed in the next phase of this project. Once FRR is implemented, tweaking OSPF timers becomes less of a concern.
A keepalive timer is also associated with the interface and detects failure at a level lower than OSPF. The default for this timer is 10 seconds; again, this will be left as default initially.
In the initial deployment of the core network, all timers will be left at their default values as shown below. These could be slowly lowered, and the behavior of the network monitored, if faster convergence is required.
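The timer arithmetic described above can be sketched as follows. This is only a back-of-the-envelope estimate under stated assumptions: the function names are hypothetical, flooding and SPF run time are ignored, and physical-layer detection is treated as instantaneous.

```python
def dead_interval(hello_s: float, multiplier: int = 4) -> float:
    """Dead timer as a multiple of the hello timer (default four times)."""
    return hello_s * multiplier


def time_until_spf(detection_s: float, spf_delay_s: float = 5.0) -> float:
    """Seconds from the failure until SPF starts: detection time plus SPF delay."""
    return detection_s + spf_delay_s


hello_based = time_until_spf(dead_interval(10))  # failure seen only via missed hellos
carrier_based = time_until_spf(0.0)              # loss of carrier, assumed instantaneous
print(hello_based, carrier_based)                # 45.0 5.0
```

With default values the hello-based case dominates, which is why lowering the hello timer, or relying on a lower-layer detection mechanism, has far more effect than tuning the SPF delay alone.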
If the timers are not left at their defaults, explain why they are being changed and give the values used and the configurations. Discuss the scaling issues in detail, for example the maximum number of prefixes that can converge in a given time. Include any other related work that may have been documented in the DIG from SPSE or from in-house testing of the architecture.
Table 6 OSPF Timer Default Values
Timer                                                      Default Value
ip ospf dead-interval                                      4 x hello interval (40 sec)
ip ospf hello-interval                                     10 sec
ip ospf retransmit-interval                                5 sec
ip ospf transmit-delay                                     1 sec
timers throttle spf <spf-start> <spf-hold> <spf-max-wait>  5000 msec 10000 msec 10000 msec
However, this may result in slow response of various show commands, because of slow response times on DNS queries. The <Customer name> Operations team or Cisco's deployment team should enable OSPF name lookup on a few routers initially, and observe the responsiveness of the DNS system.
However, it is possible to increase the MPLS MTU on an interface to accommodate the switching of packets larger than 1500 bytes. The default MTU on Serial and POS interfaces is 4470 bytes, so a frame increase of 8 bytes is not a big concern on these interfaces. The following command can be used on Gigabit Ethernet interfaces in the <CUSTOMER NAME> network.
mpls mtu 1516
This allows an MPLS frame with up to 4 labels (16 bytes) over the link. If any Ethernet switches carrying MPLS frames are added to the core, their MTU must also be increased.
Note: Four labels have been allowed to cater for future services on the network such as traffic engineering and FRR. In general, each additional service may require an increase in the label stack from 2 to something greater.
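The MTU value used above follows from simple arithmetic: each MPLS label adds 4 bytes to the frame. A minimal sketch, with a hypothetical helper name:

```python
LABEL_BYTES = 4  # size of one MPLS label stack entry


def mpls_mtu(ip_mtu: int = 1500, labels: int = 4) -> int:
    """MTU needed to carry an IP packet of ip_mtu bytes behind a stack of labels."""
    return ip_mtu + labels * LABEL_BYTES


print(mpls_mtu())          # 1516, four labels as configured above
print(mpls_mtu(labels=2))  # 1508, the 8-byte increase of a basic two-label VPN stack
```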
To activate label switching on a router, the mpls ip command must be issued on each interface that connects P and PE routers together. It should not be enabled on PE-CE connections. By default, a Cisco router enables TDP (Tag Distribution Protocol) when mpls ip is enabled on an interface. It is recommended to enable LDP globally by entering the mpls label protocol ldp command in global configuration mode.
For proper operation of MPLS, LDP chooses an IP address as its router ID. The IP address chosen as router ID must be routable; otherwise LDP will not be able to form a neighbor relationship with adjacent nodes. On a Cisco router, the LDP router ID is, by default, determined as follows:
- The IP addresses of all operational interfaces are examined.
- If these IP addresses include loopback interface addresses, the largest such loopback address is selected as the LDP router ID.
- Otherwise, the largest IP address pertaining to an operational interface is selected as the LDP router ID.
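The default selection rules listed above can be sketched in a few lines of Python. The Interface record and the function name are hypothetical and exist only for this illustration; the logic follows the three rules as stated.

```python
from ipaddress import IPv4Address
from typing import List, NamedTuple, Optional


class Interface(NamedTuple):
    name: str
    address: str
    is_up: bool
    is_loopback: bool


def select_ldp_router_id(interfaces: List[Interface]) -> Optional[IPv4Address]:
    """Pick the highest operational loopback address, else the highest operational address."""
    up = [i for i in interfaces if i.is_up]
    loopbacks = [IPv4Address(i.address) for i in up if i.is_loopback]
    if loopbacks:
        return max(loopbacks)
    others = [IPv4Address(i.address) for i in up]
    return max(others) if others else None


router = [
    Interface("Loopback0", "10.0.0.1", True, True),
    Interface("POS0/0", "192.168.1.1", True, False),
]
print(select_ldp_router_id(router))  # 10.0.0.1
```

Note that the result changes to 192.168.1.1 if Loopback0 goes down, which illustrates why pinning the router ID to a specific interface is preferable to relying on the default behaviour.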
However, the normal (default) method for determining the LDP router ID may result in a router ID that is not usable in certain situations. For example, an IP address selected as the LDP router ID might not be advertisable by the routing protocol to a neighboring router. Therefore, for the <CUSTOMER NAME> network, it is recommended to set the router ID manually by entering the mpls ldp router-id <interface> command. The specified interface must be operational for its IP address to be used as the LDP router ID. In addition, the force keyword should be entered to make sure the router ID takes effect immediately upon entering this command. However, care should be taken when using this command, since all existing LDP sessions will be torn down if the router ID of the existing sessions is different from the newly selected ID. It is recommended to enable logging of LDP neighbor state changes using mpls ldp logging neighbor-changes. As with OSPF, MD5-based authentication can be enabled for each LDP session to protect against DoS attacks and to help catch configuration errors.
Note: The IOS CLI accepts the tag and mpls command forms interchangeably in most cases. For example, show tag tdp neighbor and show mpls ldp neighbor produce identical output. In some cases there is only an mpls form of the command, such as mpls label protocol ldp. It is recommended to use the newer mpls form of the commands in the <CUSTOMER NAME> network.
Note: The mpls ip command merely enables LDP on an interface. It does not control whether a packet is switched using the MPLS label at all; this is determined by the Ethertype of the incoming frame and by the various forwarding tables.
The following example shows a sample configuration for enabling MPLS in the <CUSTOMER NAME> network. This is a sample only; use your customer-specific configuration.
hostname <insert hostname>
mpls label protocol ldp
!
mpls ldp logging neighbor-changes
mpls ldp router-id Loopback0 force
! LDP MD5 passwords are set per neighbor in global configuration mode
mpls ldp neighbor <neighbor address> password <CHANGE ME>
mpls ldp neighbor <neighbor address> password <CHANGE ME>
!
interface Loopback0
 ip address <address> <mask>
 no ip directed-broadcast
!
!
interface pos 0/0
 description <insert appropriate descriptor>
 ip address <address> <mask>
 ip ospf message-digest-key 1 md5 <Change Me>
 mpls ip
!
interface gig 2/0
 description <insert appropriate descriptor>
 ip address <address> <mask>
 ip ospf message-digest-key 1 md5 <Change Me>
 mpls ip
 mpls mtu 1516
Network Services
MPLS/VPN Services
This section describes how the VPN services are offered by <Customer Name> using the MPLS-VPN concept.
MPLS-VPN
In MPLS VPN terminology, PE (Provider Edge) refers to the provider edge router, to which the CE (Customer Edge) connects and on which the VPNs are created. Each VPN is associated with one or more VPN routing/forwarding instances (VRFs). A VRF consists of an IP routing table, a derived Cisco Express Forwarding (CEF) table, a set of interfaces that use the forwarding table, and a set of rules and routing protocol parameters that control the information that is included in the routing table.
A one-to-one relationship does not necessarily exist between customer sites and VPNs. A given site can be a member of multiple VPNs. A customer site's VRF contains all the routes available to the site from the VPNs of which it is a member.
Packet forwarding information is stored in the IP routing table and the CEF table for each VRF. A separate set of routing and CEF tables is maintained for each VRF. These tables prevent information from being forwarded outside a VPN, and also prevent packets that are outside a VPN from being forwarded to a router within the VPN.
All MPLS VPN configuration is done at the PE router. The rest of the network merely switches labels and is not aware of the VPN structure or the logical separation of customers. The core network is referred to as the P network in an MPLS VPN. Enabling MPLS VPN involves several implementation steps:
- MP-iBGP Implementation
- VPN Routing & Forwarding Table Definitions
- PE to CE Routing Definition
The following sections discuss each of these areas in more detail and provide recommendations and design guidelines and configuration examples for <CUSTOMER NAME> network.
BGP communication takes place at two levels: within IP domains known as autonomous systems (interior BGP or IBGP), and between autonomous systems (external BGP or EBGP). PE-PE or PE-RR (route reflector) sessions are IBGP sessions, and PE-CE sessions are EBGP sessions. In addition, a PE router binds a label to each customer prefix learned from a CE router and includes the label in the network reachability information for the prefix that it advertises to other PE routers. When a PE router forwards a packet received from a CE router across the provider network, it labels the packet with the label learned from the destination PE router. When the destination PE router receives the labelled packet, it looks up the label to find the corresponding VRF, pops the label and forwards the packet to the correct CE router.
The mechanism for partitioning RRs is via the route-target, using a BGP command called bgp rr-group. With this command, each RR holds only the routes that match the specified route-targets. If RRs are to be partitioned, several design issues must be considered in the <CUSTOMER NAME> network:
Location of Route Reflectors: ideally, reflectors should be deployed in various physical locations so that a single failure does not impact operations.
Partitioning of Route Reflectors: how do you decide which route reflectors carry which partitions (route targets)? Ultimately, in a very large network there will be many route reflectors, each carrying a subset of the VPN partitions, as shown below. The configuration below is a sample only; use your customer-specific configuration.
Route Reflector Configuration
-----------------------------
ip extcommunity-list 1 permit rt 23756:1001
router bgp 23756
 address-family vpnv4
  bgp rr-group 1
  neighbor <pe1> activate
  neighbor <pe1> route-reflector-client
  neighbor <pe1> send-community extended

PE Router Configuration
-----------------------
ip vrf custA
 rd 23756:100
 route-target both 23756:100
 route-target export 23756:1001
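The effect of partitioning by route-target can be sketched as a simple filter: a route reflector keeps a VPNv4 route only if at least one of its route-targets is permitted for that reflector. The data layout and function names below are assumptions for illustration and do not model the actual BGP implementation; the second VRF and its prefix are hypothetical.

```python
from typing import Dict, List, Set


def rr_accepts(route_targets: Set[str], permitted: Set[str]) -> bool:
    """A route is kept if it carries at least one permitted route-target."""
    return bool(route_targets & permitted)


def filter_routes(routes: List[Dict], permitted: Set[str]) -> List[Dict]:
    """Return only the routes this route reflector partition would hold."""
    return [r for r in routes if rr_accepts(r["rts"], permitted)]


# custA exports both RTs from the sample PE configuration; custB is hypothetical.
routes = [
    {"vrf": "custA", "prefix": "10.1.0.0/16", "rts": {"23756:100", "23756:1001"}},
    {"vrf": "custB", "prefix": "10.2.0.0/16", "rts": {"23756:101"}},
]
held = filter_routes(routes, permitted={"23756:1001"})
print([r["vrf"] for r in held])  # ['custA']
```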
Route Reflector redundancy: at least two route reflectors must hold the same information so that, if one fails, the other can still provide VPN route information, as shown in Figure 31 Route Reflector Redundancy in the <Customer Name> Networks. The figure below is for a particular customer. In your LLD you should use the naming convention used by your customer.
There are a total of <Put the actual number of RRs here> RRs in the <Customer Name> network. Each RR is a <Equipment name with the memory>. We recommend deploying the RRs partitioned into two groups (this may change with some customers), with two RRs in each group. Each group of RRs can be assigned to serve a few regions, or the partition can be based on the route-targets that each RR will serve. This way each group of RRs serves only a certain number of VPN customers and carries only a subset of routes instead of the routes for all customers. The PE routers then connect to the two RRs in the corresponding group for the VPN information they require, which removes the overhead of each RR holding all routes and distributing all VPN routes to all peers. Doing this provides <CUSTOMER NAME> with a scalable solution as the network grows. Alternatively, a full mesh can be created between the route reflectors if partitioning is not desired at this time. In addition, it is recommended to configure the two route reflectors within each group with different cluster IDs; using the same cluster ID may create issues if the IBGP sessions between PE and RR fail. The figure below is for a particular customer. In your LLD you should use the naming convention used by your customer.
The paragraph below is for a particular customer. In your LLD you should use the naming convention used by your customer. In the above example, if the iBGP session between the JRC edge router and RR2 and the session between the KMR edge router and RR1 both fail, the VPNv4 routes received by RR1 will be forwarded to RR2, but the updates will be rejected because both RRs share the same cluster ID. It is very unlikely that such a double failure will occur in the network but, as a best practice, it is recommended to place the two RRs in different clusters. By default, the RR cluster ID is the BGP router ID. However, it is advisable to set the cluster ID manually on each RR using the bgp cluster-id configuration command. The configuration below is a sample only; use your customer-specific configuration.
On Route-Reflectors
router bgp 23756
 bgp cluster-id <loopback 0 ip address>
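The cluster ID behaviour described above can be illustrated with a small sketch of the cluster-list check that a route reflector performs. The dictionary layout, the function name and the example addresses are assumptions for this illustration only.

```python
from typing import Dict, Optional


def reflect(update: Dict, cluster_id: str) -> Optional[Dict]:
    """Return the update as it would be reflected, or None if it is dropped.

    A reflector drops any update whose cluster list already contains its own
    cluster ID; otherwise it prepends its cluster ID before reflecting.
    """
    if cluster_id in update["cluster_list"]:
        return None
    return {**update, "cluster_list": [cluster_id] + update["cluster_list"]}


from_pe = {"prefix": "23756:100:10.1.0.0/16", "cluster_list": []}
via_rr1 = reflect(from_pe, "10.0.0.1")

# Same cluster ID on RR2: the update reflected by RR1 is rejected.
print(reflect(via_rr1, "10.0.0.1"))                  # None
# Different cluster ID on RR2: the update is accepted and reflected onwards.
print(reflect(via_rr1, "10.0.0.2")["cluster_list"])  # ['10.0.0.2', '10.0.0.1']
```

With distinct cluster IDs, a PE that has lost its session to one reflector can still learn routes that arrive via the other reflector.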
MP-iBGP Authentication
Cisco's implementation of BGP allows for MD5 authentication between BGP peers. This authentication provides some protection against accidental or malicious BGP peering in the network. It is possible to configure a unique password for every peer. However, this may be administratively difficult to manage, particularly for eBGP links. Hence, a single shared password is recommended, for internal peerings only.
neighbor <xxxPE> password <shared password between peers>
These limitations existed to balance optimal update generation and replication against peer-group configuration. They also caused the network operator to configure smaller peer-groups, which reduced the efficiency of update message generation. The introduction of the BGP Dynamic Update Peer-Groups feature separates BGP update generation from peer-group configuration. The feature introduces an algorithm that dynamically calculates BGP update-group membership based on outbound routing policies. It does not require any configuration by the network operator. Optimal BGP update message generation occurs automatically and independently. BGP neighbor configuration is no longer restricted by outbound routing policies, and update-groups can belong to different address families. As dynamic peer-groups take care of update generation, the configuration can be simplified using either standard peer-group configuration or peer-templates. We therefore recommend dynamic peer-groups (for update generation efficiency) and standard peer-group configuration for the MPLS VPN deployment in the <CUSTOMER NAME> network. Make sure you clearly articulate what is being recommended for this specific customer.
buffered in the interface input queue. This queue is by default only 75 packets deep. In addition, there is an SPD (Selective Packet Discard) algorithm, which plays an important role in selectively removing non-routing packets from the processor's input queue during congestion so that only routing packets are processed. However, the SPD queue is by default only 100 packets deep. Therefore, TCP ACKs can potentially fill all 175 slots of input buffering, leading to a large number of dropped packets. Increasing the interface input queue depth (hold-queue <1-4096> in) helps reduce the number of dropped TCP acknowledgements, which reduces the amount of work BGP has to do to converge. Similarly, the SPD parameters can be tuned using the following commands:
Configure router for more SPD headroom and SPD extended headroom
ip spd mode aggressive
ip spd headroom <a>
ip spd extended-headroom <b>
ip spd queue min-threshold <c>
ip spd queue max-threshold <d>
For a large MPLS VPN network, it is recommended to extend both the input queue and the SPD thresholds. As an initial value, the input hold queue can be increased to 1500-2000. However, the optimum values can be obtained by constantly monitoring and tuning until the input queue drops have stopped. To check whether there are any drops, the following command can be used:
Router#show int <name> | include input queue
MP-BGP Configuration
Once neighbors are listed under the BGP process, Cisco IOS by default assumes that the neighbors exchange IPv4 NLRIs. For MPLS VPN, however, VPNv4 prefixes need to be exchanged, and for that the neighbors must be explicitly activated under the VPNv4 address family. It is also recommended to disable the default BGP behavior using no bgp default ipv4-unicast. The following shows a generic MP-BGP configuration that could be applied to each PE router in the <CUSTOMER NAME> network.
hostname <insert hostname>
!
interface Loopback0
 ip address </32 address>
 no ip directed-broadcast
!
router bgp 23756
 no synchronization
 no bgp default ipv4-unicast
 no auto-summary
 bgp router-id <loopback 0>
 bgp log-neighbor-changes
 ! xxxPE peer group (xxx Site / Region name)
 neighbor <xxxPE> peer-group
 neighbor <xxxPE> remote-as 23756
 neighbor <xxxPE> update-source Loopback0
 neighbor <xxxPE> password <password>
 ! xxxPE peer group members (complete with the PEs' loopback0 addresses)
 neighbor <neighbor 1 address> peer-group <xxxPE>
 neighbor <neighbor 2 address> peer-group <xxxPE>
 ! - - - List all PEs in the cluster / region peer-group xxxPE - - -
 address-family vpnv4
  neighbor xxxPE activate
  neighbor xxxPE send-community extended
 exit-address-family
VRF Name
The VRF name is simply a unique name used to identify the VRF and the routes it contains. It is suggested the name be short and lower case if possible for operational ease. In addition, a description can be added to the vrf using the description command in the vrf sub-mode.
ip vrf vpna
 description customerA vpn
Route-Distinguisher
Each VRF must have a unique RD (Route Distinguisher), which has the following format:
<AS>:<Unique Number>
where
- <AS> is the 16-bit autonomous system number allocated to the network. This will be set to the value <Customer AS #> and will also be used for all MP-BGP configurations.
- <Unique Number> is a unique 32-bit number within the AS that is allocated by <CUSTOMER NAME>.
In the situation where multiple sites of the same customer connect to the same PE router, each interface uses the same VRF definition, as the sites would normally be part of the same routing policy. This gives the customer sites peer access to each other via the PE.
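As a small worked example of the RD format above, the following sketch validates an RD of the form <AS>:<Unique Number> and shows how prepending it keeps identical customer prefixes distinct. The helper names are hypothetical and the string form of the VPNv4 prefix is only a display convention chosen for this example.

```python
from typing import Tuple


def parse_rd(rd: str) -> Tuple[int, int]:
    """Split '<AS>:<Unique Number>' and check the 16-bit and 32-bit ranges."""
    asn_text, number_text = rd.split(":")
    asn, number = int(asn_text), int(number_text)
    if not 0 <= asn < 2**16:
        raise ValueError(f"AS {asn} does not fit in 16 bits")
    if not 0 <= number < 2**32:
        raise ValueError(f"number {number} does not fit in 32 bits")
    return asn, number


def vpnv4_prefix(rd: str, ipv4_prefix: str) -> str:
    """Prepend a validated RD to an IPv4 prefix to make it unique across VPNs."""
    parse_rd(rd)
    return f"{rd}:{ipv4_prefix}"


print(parse_rd("23756:100"))                    # (23756, 100)
print(vpnv4_prefix("23756:100", "10.0.0.0/8"))  # 23756:100:10.0.0.0/8
print(vpnv4_prefix("23756:101", "10.0.0.0/8"))  # 23756:101:10.0.0.0/8
```

Two customers using the same 10.0.0.0/8 address space therefore produce two different VPNv4 prefixes, so their routes do not collide in MP-BGP.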
network. If load sharing is not a requirement, then this scheme may be useful (as it reduces the memory requirements at the PE routers). The figure below is for a particular customer. In your LLD you should use naming convention used by your customer
Approach #2 - Unique RD per PE for each VPN
An alternative to the first approach is to assign a unique RD per PE for each VPN. In other words, for a given VPN, a unique RD value is assigned on each PE. This is illustrated in Figure 34. Note that, with this approach, routes received from multiple interfaces belonging to the same VPN on a particular PE share the same RD value, while each PE assigns a different RD. The main advantage of this approach is that it allows iBGP load balancing. The drawback is that extra memory is required to hold the additional paths at the PE routers. This is the recommended scheme where Route Reflectors are deployed. The figure below is for a particular customer. In your LLD you should use the naming convention used by your customer.
Approach #3 - Unique RD per PE per interface for each VPN
Approaches 1 and 2 can be used for simple or overlapping VPNs requiring any-to-any connectivity. However, implementing topologies such as hub and spoke is not easy using approach 1 or 2. For central-services or hub-and-spoke topologies, a PE may have more than one interface belonging to the same VPN while the connectivity requirement on one interface is different from that on the other interfaces. Approach 3 offers a solution to this problem by assigning a unique RD for each VRF per interface. The main advantage of this approach is that it uniquely identifies the site that has originated a route and enables the implementation of complex topologies. However, this capability comes at a relatively higher cost in terms of memory consumption and the number of VRFs to be configured. Because of these issues, this method is not recommended for simple VPNs. Moreover, BGP communities and Site-of-Origin (SOO) may be used to identify where a particular route originated. This scheme is only recommended for hub-and-spoke scenarios where multiple spoke sites are connected to the same PE router. For the <CUSTOMER NAME> network, we recommend using scheme <Put here the scheme number and why it is being used>
The default route-target for <CUSTOMER NAME> VRFs will be equal to the RD presented above. Additional route-targets may be required given the route import & export policies needed. The Table below is for a particular customer. In your LLD you should use the right nomenclature
Table 7 RT/RD Allocation
VRF Name   RD          Default RT Import   Default RT Export
custA      23756:100   23756:100           23756:100
cust2      23756:101   23756:101           23756:101
Cust3l     23756:102   23756:102           23756:102
VPN Topologies
Full Mesh
An Intranet VPN is the simplest way of deploying a VPN using MPLS. It essentially consists of all sites of the same customer peering directly with each other. From the customer's perspective, all of its sites appear one hop away from each other. In reality a customer's IP packet may transit more than one core node, though the customer will not see this. Each of the sites exchanges VRF routes directly with its peer. Note that only routes that originate from that VRF are exchanged. The result is that the customer's VRF table in each PE holds an identical set of routes and each customer route is reachable via the next hop PE.
Extranets
Customers with Unique Addresses
The creation of an Extranet is simply a matter of importing/exporting routes between the VRF's of two or more customers. If IP address overlap between customers is not an issue, that is, the IP address space is unique between customers, then routes could be imported directly between the VPN_<CUSTOMER> VRF tables.
Intranets. These are the first two route-target commands shown in the configuration for each VRF. Next, each VRF has an export map defined. This export map sets a specific route-target value (carried as an extended community attribute in BGP) for the Extranet routes defined. By using route-targets, we can selectively import only the routes the CE needs to participate in an Extranet. Individual host addresses could also be explicitly specified and exported using route-maps.
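The selective import described above can be sketched as set matching on route-targets. In this illustration one route of custA is additionally tagged with an extranet route-target, as an export map would do; the extranet RT value, the prefixes and the function name are hypothetical.

```python
from typing import Dict, List, Set

EXTRANET_RT = "23756:1001"  # hypothetical extranet route-target for this example


def imported_prefixes(routes: List[Dict], import_rts: Set[str]) -> List[str]:
    """Return the prefixes a VRF installs: those sharing at least one RT with its import list."""
    return [r["prefix"] for r in routes if r["rts"] & import_rts]


# Routes exported by custA: all carry the intranet RT, one also carries the extranet RT.
custa_routes = [
    {"prefix": "10.1.1.0/24", "rts": {"23756:100"}},
    {"prefix": "10.1.2.10/32", "rts": {"23756:100", EXTRANET_RT}},
]

# cust2 imports its own intranet RT plus the extranet RT.
print(imported_prefixes(custa_routes, {"23756:101", EXTRANET_RT}))  # ['10.1.2.10/32']
# Another custA site imports the intranet RT and sees every custA route.
print(imported_prefixes(custa_routes, {"23756:100"}))
```

Only the host route tagged for the extranet leaks into the other customer's VRF; the rest of the custA address space stays private.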
The BGP, RIP and EIGRP protocols have been modified to understand VRF tables through a feature called address families. Address families define the VRF contexts in which the routing protocol operates. Note that the routing protocol that operates between the PE and CE is independent of any IGP that may run inside the VPN customer's network. Routes learnt at the local VPN site by the customer IGP are redistributed into the PE-CE routing protocol to populate the VRF. It is important to understand that no special MPLS configuration is needed at the Customer Edge. Only standard IOS routing commands are required. <CUSTOMER NAME> is planning to use <Put the name of routing protocols that the customer would use> for the PE-CE routing protocols.
The static route in the PE would have the following format indicating name of the VRF table and also the outgoing interface and its IP address.
Static routing configuration - PE
ip route vrf CustomerA 10.0.0.0 255.0.0.0 serial1/0 <CE link address> [permanent]
router bgp 23756
 address-family ipv4 vrf CustomerA
  redistribute static
The CE route would consist of the default route pointing to the next hop ip address/interface of the PE.
Routing Stability
With static routing, if the PE-CE link fails, the static route associated with the interface will be removed from the routing table. In the case of the PE, this will cause an MP-iBGP routing update to be forwarded to all other PE peers. To prevent such a behavior, the keyword permanent can be appended when configuring the static route. This will cause the static route to remain in the routing table regardless of the interface status. This obviously reduces the BGP update messages and improves VPN route convergence, however, such an improvement comes at the cost of unnecessary backbone bandwidth utilization. This is because the packet will get forwarded through the core all the way to the remote PE and only then will get dropped if the directly connected link is down.
PE Configuration
The following example shows the configuration of the PE side of the RIPv2 circuit. In the example, the CE device is connected to a PE via a serial link.
router rip
 version 2
 !
 address-family ipv4 vrf CustomerA
  version 2
  redistribute bgp 23756 metric 5
  network <pe-ce network>
  no auto-summary
 exit-address-family
!
!
interface serial3/0
 description Circuit to Customer1
 ip vrf forwarding CustomerA
 ip address a.b.c.1 <mask>
router bgp 23756
 address-family ipv4 vrf CustomerA
  redistribute rip
The address-family under the RIP process indicates to RIPv2 that this routing instance is associated with the VRF for CustomerA. Any interface on this PE that has this VRF defined will participate in RIPv2 routing if it is part of the <pe-ce> network. The redistribute bgp command allows routes that BGP has learnt from other VRFs and that have been brought into the VRF CustomerA (subject to the policies, route-targets etc.) to be redistributed into the RIPv2 routing instance for forwarding to the CE. Similarly, the local site routes need to be redistributed into MP-BGP so that they are advertised to the remote PE and ultimately to the remote VPN sites.
CE Configuration
The CE uses a standard RIPv2 configuration. No special VRF configurations are necessary
RIPv2 CE Configuration
router rip
 version 2
 network <ce-pe network>
 redistribute <customer IGP>
interface serial0
 description Connection to PE router
 ip address a.b.c.2 <mask>
Configuration at the PE
Unique AS per customer site
The example in this section shows the BGP configuration for connecting CEs from one customer, each of which uses a unique AS. The figure shows a number of CE networks, each with a different AS number. If the network at CE A needs to talk to the network at CE B, traffic has to pass via the MPLS-VPN core, and the AS_PATH seen for the route is 23756 65001. The AS number 23756 appears in the AS_PATH because the CE route transits the <CUSTOMER NAME> core.
In this scenario, if one (or more) of the CEs were dual-homed, routing loops would be avoided by the standard AS path check done on routes the CE receives from the PE. The figure below and the two preceding paragraphs are for a particular customer. In your LLD, use the naming convention used by your customer.
The eBGP configuration for the PE-CE link is shown below. The configuration is for a particular customer. In your LLD, use customer-specific configurations.
router bgp 23756
 .
 address-family ipv4 vrf CustomerA
  neighbor <customer link address> remote-as 65001
  neighbor <customer link address> activate
  no auto-summary
  no synchronization
 exit-address-family
!
.
interface serial3/0
 description Circuit to CE A
 ip vrf forwarding CustomerA
 ip <customer link address>
Note that the above configlet shows only the IPv4 address family section of the BGP configuration related to the VRF. For every CE that requires a BGP peering there must be a corresponding address family with the appropriate neighbor commands. The configuration at the CE is standard BGP (as if it were connecting to another CE).
The figure below is for a particular customer. In your LLD, use the naming convention used by your customer.
As shown in Figure 36, CE B rejects the routes coming from CE A when it sees its own AS number in the BGP AS_PATH. This is the standard BGP loop prevention mechanism. As a result, CE B will not be able to communicate with CE A.
AS-Override
To solve this problem, the PE can be instructed to override the customer's AS number before forwarding the BGP update to the customer. This is achieved with the as-override parameter of the BGP neighbor configuration command, as illustrated in the following configuration example. The configuration below is for a particular customer. In your LLD, use customer-specific configurations.
router bgp 23756
 address-family ipv4 vrf customerA
  neighbor <customer A address> as-override
This configuration is needed at the PE, which then replaces the customer's AS number (in this case AS 65001) with the provider's AS number (AS 23756) so that the receiving CE accepts the routes, as it will not see its own AS in the path. With AS override configured, the PE does the following:
- If the last ASN in the AS_PATH is equal to the neighboring one, it is replaced by the provider ASN.
- If the last ASN has multiple occurrences (due to AS_PATH prepend), all the occurrences are replaced with the provider ASN value.
- After this operation, normal eBGP operation occurs and the provider AS is added to the AS_PATH.
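The rewrite steps above can be sketched in a few lines of Python. This is only an illustration of the logic, not router code: the function names are hypothetical, and the AS_PATH is modelled as a list with the most recently added AS first and the originating AS last.

```python
def as_override(as_path, neighbor_asn, provider_asn):
    """Return the AS_PATH a PE would advertise to a CE with as-override.

    as_path: list of ASNs, most recent first, originating AS last.
    This models the steps described in the text; it is not an IOS API.
    """
    path = list(as_path)
    # Replace the trailing run of the neighbor's ASN (one or more
    # occurrences, e.g. after AS_PATH prepending) with the provider ASN.
    i = len(path) - 1
    while i >= 0 and path[i] == neighbor_asn:
        path[i] = provider_asn
        i -= 1
    # Normal eBGP behaviour: the provider AS is then added to the path.
    return [provider_asn] + path


def ce_accepts(as_path, own_asn):
    """Standard BGP loop check: reject a path containing our own AS."""
    return own_asn not in as_path


# Without as-override, CE B (AS 65001) sees its own AS and rejects the route.
plain = [23756, 65001]
# With as-override, the customer AS is replaced before the update is sent.
overridden = as_override([65001], neighbor_asn=65001, provider_asn=23756)
```

In this sketch `ce_accepts(plain, 65001)` is false while `ce_accepts(overridden, 65001)` is true, which is the effect the section describes.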
Site-of-Origin
Enabling the as-override feature disables loop detection based on the AS_PATH. This causes problems if the CE is dual-homed, as is the case for CE B in Figure 36. A BGP extended community attribute, referred to as the Site-of-Origin (SOO), addresses this issue. The SOO prevents routing loops when a site is multi-homed and the as-override feature is also being used. This is achieved by identifying each customer site with a unique SOO. The SOO, like the route-target, is a BGP extended community and is denoted in the same format as the route-target.
All routes originating from a customer site are tagged with an SOO by the eBGP process on ingress to the PE. If those routes for some reason end up back at the originating PE, they will not be re-advertised to the CE, as the SOO will match that of the site. Note that a site may consist of many routers, each containing the same routing information. If several of these routers are connected to the MPLS-VPN backbone as CEs, they still use the same SOO. A different SOO is used only when the sites are different. The configuration below is for a particular customer. In your LLD, use customer-specific configurations.
router bgp 23756
 .
 address-family ipv4 vrf CustomerA
  neighbor <CE neighbor address> remote-as 65001
  neighbor <CE neighbor address> activate
  neighbor <CE neighbor address> as-override
  neighbor <CE neighbor address> route-map setsoo in
  no auto-summary
  no synchronization
 exit-address-family
!
.
interface serial9/0
 description Circuit to CE A
 ip vrf forwarding CustomerA
 ip <PE-CE link address>
!
route-map setsoo permit 10
 set extcommunity soo 23756:1002
The above example shows the PE configuration when using as-override and SOO. The neighbor <CE neighbor address> as-override command causes AS 65001 in the AS_PATH to be replaced with AS 23756. The neighbor <CE neighbor address> route-map setsoo in command causes all incoming routes from the CE (CE A in this case) to have SOO 23756:1002 set in the extended community attribute.
Note: What is not obvious is that the same command, neighbor <CE neighbor address> route-map setsoo in, also causes the PE to check the routes it is distributing to the CE for the same SOO. If there is an SOO match, the routes are not re-advertised to the CE.
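The two effects of the SOO described above (tagging on ingress, filtering on egress) can be sketched as follows. This is a simplified model under stated assumptions: routes are plain dictionaries, and the function names and the second SOO value 23756:1003 are hypothetical, chosen only for illustration.

```python
def tag_inbound(route, site_soo):
    """Ingress: stamp a route received from a CE with the site's SOO."""
    tagged = dict(route)
    tagged["soo"] = site_soo
    return tagged


def advertise_to_ce(routes, site_soo):
    """Egress: suppress routes whose SOO matches the SOO of the target site."""
    return [r for r in routes if r.get("soo") != site_soo]


# A route learnt from the dual-homed site, tagged with SOO 23756:1002.
from_site = tag_inbound({"prefix": "10.1.0.0/16"}, "23756:1002")
# A route learnt from a different site (hypothetical SOO value).
from_other = tag_inbound({"prefix": "10.2.0.0/16"}, "23756:1003")

# Only the route from the other site is sent back toward site 23756:1002.
sent = advertise_to_ce([from_site, from_other], "23756:1002")
```

The point of the sketch is that the loop check no longer depends on the AS_PATH, which as-override has rewritten, but on a per-site tag that survives the trip across the backbone.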
Routing Stability
The eBGP route dampening feature can control flapping routes from the CE. The maximum routes command described in the following section and the BGP neighbor x.x.x.x maximum-prefix command limit the number of routes installed in the VRF and redistributed into MP-iBGP.
The configuration below is for a particular customer. In your LLD, use customer-specific configurations.
router bgp 23756
 address-family ipv4 vrf customerA
  neighbor <ce neighbor address> {maximum-prefix maximum [threshold]} [restart restart-interval] [warning-only]
With the maximum-prefix command, the BGP session is terminated when the number of prefixes received from the neighbor exceeds the configured maximum. The optional threshold is the percentage of the maximum at which the router starts generating warning messages. Alternatively, the maximum-prefix command can be augmented with the warning-only keyword, in which case the router generates a log message but keeps the BGP session up when the maximum is exceeded. If warning-only is not configured, the session is torn down and stays down indefinitely; manual intervention is required to bring it up again unless restart-interval is configured, in which case the session is re-established automatically after the restart-interval has elapsed. There is no default limit on the number of prefixes; the limit is chosen by the network operator, bounded by the amount of available system resources.
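The decision logic of the maximum-prefix options can be summarised in a small Python sketch. It is a model of the behaviour for discussion purposes, not an IOS interface: the function name and return strings are hypothetical, the threshold is assumed to be the percentage of the maximum at which warnings start, and the default of 75 percent is an assumption to be checked against the IOS release in use.

```python
def on_prefix_count(count, maximum, threshold_pct=75, warning_only=False):
    """Return the action taken for a neighbor that has sent `count` prefixes.

    threshold_pct and its default are assumptions of this sketch; the
    restart-interval option is not modelled.
    """
    if count > maximum:
        # Over the limit: either just log, or drop the session.
        return "log-warning" if warning_only else "tear-down-session"
    if count > maximum * threshold_pct / 100:
        # Approaching the limit: warn the operator, keep the session up.
        return "log-warning"
    return "ok"
```

For example, with a maximum of 1000, a CE sending 800 prefixes would trigger a warning, and one sending 1001 would lose its session unless warning-only is set.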
The BGP maximum-prefix knob controls the number of routes accepted when the PE-CE protocol is BGP. There is no equivalent per-neighbour capability in the other dynamic protocols to control the routes received from the CE sites. However, the total number of routes in a customer VRF can be controlled with the maximum routes command inside the VRF configuration, as follows. The configuration below is for a particular customer. In your LLD, use customer-specific configurations.
ip vrf CustomerA
 rd 23756:1000
 route-target both 23756:1000
 maximum routes 1000 warn-only | warn-threshold
In the above example, the number of routes allowed in the VRF is limited to 1000. The warn keywords do the following:
warn-threshold: Rejects routes when the threshold limit is reached. The threshold limit is a percentage of the limit specified, from 1 to 100.
warn-only: Issues a SYSLOG error message when the maximum number of routes allowed for a VRF exceeds the threshold. However, additional routes are still allowed.
For the <CUSTOMER NAME> network, we recommend that both the BGP per-neighbor maximum-prefix command and the VRF maximum routes command be included with every PE-CE BGP session and VRF definition.
Figure 37
(difficult and expensive). The centralised FW approach with two CE routers is a more appropriate solution.
[Figure: regional site and central site (with FW) connected through the MPLS network and the Internet; routers CE1, CE2, PE1, PE2 and PE3. Legend: default route injected into VPN; data forwarding path from regional sites to Internet; VRF_RED interface (VPNv4); global routing table interface (IPv4).]
It is worth mentioning that default static routes will be injected into the VPN and used by regional sites, but the default route cannot be used for VPN traffic at the central site. In the drawing above, CE2 is configured with a default route pointing to PE3 via the IPv4 interface. For this reason, CE1 (and CE2) must have all the VPN routes in the routing table. The central site should learn the VPN routes dynamically, with BGP4 or RIPv2 between CE1 and PE3. This is the recommended approach as it allows greater flexibility and redundancy. For example, the customer may want to implement two VPN CEs at the central site to improve service availability. If the number of regional prefixes is small, or if all regional prefixes can be summarized in a single aggregate route, a static route can be implemented from CE1 to PE3 for VPN traffic.
Single link option
The option with a single link for VPN and Internet traffic represents a serious risk for that VPN because of the shortcut that has to be created between the global routing table on the PE (i.e. the Internet) and the VRF. No security mechanisms (e.g. packet filtering) are available on this shortcut. CE_Blue in Figure 38 below depicts this situation. Packet leaking between a VRF and the global routing table is implemented with two IOS mechanisms:
- A static route with a global next-hop can be configured in a VRF. Packets following this static route will end in the global address space at the next-hop router. Traffic originated at a customer site can thus be forwarded into the Internet.
- A global static route can be defined pointing to a connected interface which belongs to a VRF. This static route is further redistributed into IGP or BGP. Packets originated in the global address space will follow this route (in the global routing table) and will eventually be forwarded toward a CE router. Traffic originating in the Internet can thus be forwarded to the CE router.
Since the default route in the VPN points to the Internet, no additional default routing can be used in the customer VPN. In addition, when a customer site loses connectivity to the MPLS/VPN backbone, packets from other sites destined for the failed VPN site will be leaked to the Internet. This is another major security issue. In general, this option is also fairly complex to implement.
VPNv4 and IPv4 links
The two links between CE and PE can be implemented as two separate physical circuits (e.g. two E1 circuits) or as two logical connections, for example ATM PVCs. The IPv4 link terminates in the global routing table on the PE router; the VPNv4 link is assigned to the customer's VPN. A static default route is configured on the CE for Internet access, pointing towards the PE via the IPv4 link. VPN routes will in most cases be delivered to the CE by a dynamic routing protocol (eBGP, RIPv2), but can be statically configured on the CE if the number of prefixes is small.
The single-CE solution implemented with separate links for VPN and Internet traffic allows packet filtering to be configured on the IPv4 link on the CE router, but does not offer logical separation of the two security zones (MPLS/VPN and Internet) with a firewall. It is mandatory to define strict packet filtering rules in both directions: to and from the Internet. The outbound filter must, for example, prevent VPN packets from being leaked to the Internet (via the default route) when the VPNv4 connection fails. The inbound filter must clearly define the list of hosts and applications that can be reached from the global Internet. It is up to the customer and the service provider (ST) to define and implement the desired security policy (i.e. packet filters) on a managed CE router. If the customer uses private IP addresses, NAT has to be implemented on the IPv4 link.
Please note that static one-to-one translation is needed only for Internet servers, whereas the clients can be dynamically translated into a pool of IP addresses in a PAT-like mode.
Figure 38 Internet Access from a VPN Single CE (two links in CEred, single link on CEblue)
[Figure: PE holding vrf_red, global_rt and vrf_blue, with MP-BGP toward the backbone.]
Please note that NAT is only required on the IPv4 link. The configuration below is for a particular customer. In your LLD, use customer-specific configurations.
hostname CE
!
interface Ethernet0
 description Customer site x
 ip address 10.10.10.254 255.255.255.0
 !--- This is the inside local IP address and it's a private IP address.
 ip nat inside
!
interface Serial0
 description CE-PE Internet link
 ip address 213.x.x.x 255.255.255.252
 !--- This is the inside global IP address.
 !--- This is public IP address and it is provided by ST.
 ip nat outside
!
interface Serial1
 description CE-PE VPN link
 ip address 213.x.x.x 255.255.255.252
 !--- NAT is not performed on the VPNv4 link
!
!--- This statement makes the router perform PAT to overload the Serial0
!--- IP address for all the End Stations behind the Ethernet interface
!--- that are using private IP addresses defined in access list #1.
ip nat inside source list 1 interface Serial0 overload
!
!--- This statement performs the static address translation for the Web server.
!--- With this statement, users trying to reach 171.68.1.1 port 80 (www) will be
!--- automatically redirected to 10.10.10.5 port 80 (www), which in this case
!--- is the Web server.
ip nat inside source static tcp 10.10.10.5 80 171.68.1.1 80
!
!--- This access list defines the private network
!--- that will be network address translated using PAT overload mode.
access-list 1 deny host 10.10.10.5
access-list 1 permit 10.10.10.0 0.0.0.255
!
ip route 0.0.0.0 0.0.0.0 Serial0
!
The figure below is for a particular customer. In your LLD, use customer-specific figures.
Figure 39 NAT in CE router
[Figure: CE router with E0 (.254) toward the 10.10.10.x/24 LAN (PC and Web server 10.10.10.5/24), S0 on the IPv4 link and S1 on the VPNv4 link toward the PE. Static NAT translation 10.10.10.5 <-> 171.68.1.1.]
[23] A micro-flow refers to the packets travelling from a source to a destination using the same transport protocol and the same port number. For example, an ftp session between two IP hosts constitutes two micro-flows, one from the client to the server, and the other from the server to the client.
Traffic Engineering creates one or more explicit paths with bandwidth assurances for each traffic trunk. It takes into consideration the policy constraints associated with the traffic trunks, the physical network resources, and the topology of the network. This way, packets are no longer routed based on destination alone, but also on resource availability and policy. The following section describes the operation of Traffic Engineering. Figure 40 illustrates the operation of Traffic Engineering. Each step shown in the diagram is explained below.
[Figure: labels shown are traffic statistics; resource attributes; path maintenance; path admission, reservation, and/or LSP creation for calculated paths (via extended RSVP).]
Figure 40 - Traffic Engineering Mechanisms
The network operator must create a traffic model. Based on statistics collected from the routers, as well as administrative policies, the network operator needs to identify the traffic trunks within the network and decide how these traffic trunks should be routed. The operator can use an off-line tool to optimize the traffic model. This does not mean that the operator is required to use the off-line tool to determine the routes for all traffic trunks. Typically, the operator identifies a full mesh of traffic trunks but administratively routes only the "top" N traffic trunks. On-line procedures are used for the rest of the trunks, as well as to handle failure situations. Traffic trunks could also be forwarded along routes computed by the conventional IGP.
The router uses RSVP to set up Label Switching Paths (LSPs) and to reserve bandwidth at each hop along the LSPs. During the LSP setup process, each router along the path must perform admission control and/or preemption to ensure that resources are available to honor the reservation. After the paths are set up, the head-end routers forward the packets belonging to traffic trunks by placing them into the appropriate LSPs.
The following section breaks down Traffic Engineering into components and describes each component.
Bandwidth
This attribute specifies the amount of bandwidth the traffic trunk requires.
Adaptability
This attribute indicates whether the traffic trunk should be re-optimized. The re-optimization procedure is discussed in a later section.
Resilience
This attribute specifies the desired behavior under fault conditions, i.e., when the path carrying the traffic trunk no longer exists due to either network failures or preemption. Traffic Engineering's restoration operation is discussed in a later section.
Priority
Priority is the mechanism by which the operator controls access to resources when the resources are under contention. It is required in order to place all traffic trunks. Another important application of the priority mechanism is supporting multiple classes of service. Two types of priority are assigned to each traffic trunk: holding priority and setup priority. Holding priority determines whether the traffic trunk has the right to hold a resource reservation when other traffic trunks attempt to take away its existing reservation. Setup priority determines whether the traffic trunk has the right to take over the resources already reserved by other traffic trunks.
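The interaction of the two priorities can be expressed as a one-line comparison. The sketch below assumes the RSVP-TE numbering convention, in which priorities range from 0 to 7 and 0 is the strongest; the function name is hypothetical and the text above does not itself state the numeric convention.

```python
def can_preempt(new_setup_priority, existing_hold_priority):
    """True if a new trunk may take resources held by an existing trunk.

    Assumes priorities 0..7 where a lower number means a stronger
    priority (the RSVP-TE convention, an assumption of this sketch).
    """
    return new_setup_priority < existing_hold_priority
```

With this convention a trunk with setup priority 3 can displace a trunk holding at priority 5, while two trunks configured with equal setup and holding priority (for example 5 and 5) cannot displace each other.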
Resource Attributes
Resource attributes are used to describe the network links used for path calculations. There are three resource attributes, each of which is described below.
Available Bandwidth
This attribute describes the amount of bandwidth available at each setup priority. Note that the available bandwidth for a higher setup priority is always at least as large as that for a lower setup priority. This attribute need not reflect the actual available bandwidth. In some cases, the network operator may oversubscribe a link by assigning a value that is larger than the actual bandwidth, e.g., 49.5 Mbps for a DS-3 link.
Resource Class
This attribute indicates the resource class of a link. Recall that the trunk attribute, resource class affinity, is used to allow the operator to administratively include or exclude links in path calculations. This capability is achieved by matching the resource class attribute of links with the resource class affinity of traffic trunks. The resource class is a 32-bit value. The resource class affinity contains a 32-bit resource affinity attribute and an associated 32-bit resource class mask.
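A common way to perform this match is a masked comparison of the two 32-bit values. The sketch below assumes that rule (a link is eligible when its resource class equals the trunk affinity on every bit selected by the mask); the function name and the example bit assignments are hypothetical.

```python
def link_matches(link_class, affinity, mask):
    """Masked match of a link's 32-bit resource class against a trunk's
    affinity: only the bits set in `mask` are compared."""
    return (link_class & mask) == (affinity & mask)


# Hypothetical bit assignment: bit 0 marks "satellite" links.
SATELLITE = 0x1

# A trunk that must avoid satellite links: affinity 0x0 under mask 0x1.
avoids_satellite = link_matches(link_class=0x0, affinity=0x0, mask=SATELLITE)
uses_satellite = link_matches(link_class=0x1, affinity=0x0, mask=SATELLITE)
```

Bits outside the mask are "don't care", so a link with other class bits set is still eligible as long as the masked bits agree.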
Path Selection
Path selection for a traffic trunk takes place at the head-end router of the traffic trunk. Using extended ISIS/OSPF, the edge routers have knowledge of both the network topology and the link resources. For each traffic trunk, the router starts from the destination of the trunk and attempts to find the shortest path toward the source (i.e., using the shortest path first (SPF) algorithm). The SPF calculation does not consider links which are explicitly excluded by the resource class affinities of the trunk, or links which have insufficient bandwidth. The output of the path selection process is an explicit route consisting of a sequence of label switching routers. This path is used as the input to the path setup procedure.
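The "prune, then run SPF" idea can be sketched as below. This is a simplified illustration, not the router's implementation: it prunes on bandwidth only (affinity pruning would be one more condition in the same place), it runs the search from the head-end rather than from the destination, and the topology, costs and bandwidth figures are made up for the example.

```python
import heapq


def cspf(links, src, dst, bandwidth):
    """Constrained shortest path: drop links that cannot carry the trunk,
    then run Dijkstra. `links` holds directed (a, b, cost, available_bw)
    tuples. Returns the explicit route as a list of routers, or None."""
    graph = {}
    for a, b, cost, avail in links:
        if avail >= bandwidth:  # prune links with insufficient bandwidth
            graph.setdefault(a, []).append((b, cost))

    queue = [(0, src, [src])]
    visited = set()
    while queue:
        dist, node, path = heapq.heappop(queue)
        if node == dst:
            return path
        if node in visited:
            continue
        visited.add(node)
        for nxt, cost in graph.get(node, []):
            if nxt not in visited:
                heapq.heappush(queue, (dist + cost, nxt, path + [nxt]))
    return None


# Hypothetical topology: the direct R2-R4 link has little bandwidth left.
LINKS = [
    ("R1", "R2", 1, 100),
    ("R2", "R4", 1, 10),
    ("R2", "R6", 1, 100),
    ("R6", "R7", 1, 100),
    ("R7", "R4", 1, 100),
]
```

A trunk needing 5 units takes the short path through R2-R4, while a trunk needing 50 units is steered around it via R6 and R7, and a trunk needing more than any link offers gets no path at all.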
Path Setup
Path setup is initiated by the head-end routers. RSVP [24] is the protocol which establishes the forwarding state along the path computed in the path selection process. The head-end router sends a PATH message for each traffic trunk it originates. The PATH message carries the explicit route computed for this traffic trunk and, as a result, always follows this explicit route. Each intermediate router along the path performs trunk admission control after receiving the PATH message. Once the router at the end of the path receives the PATH message, it sends a RESV message in the reverse direction towards the head-end of the traffic trunk. As the RESV message flows toward the sender, each intermediate node reserves bandwidth and allocates labels for the trunk. Thus, when the RESV message reaches the sender, the LSP is already established. The following diagram is an example of the path setup procedure.
[Figure: routers R1 to R9 with labels 49, 17, 32, 22 and Pop marked along the path. Setup: Path (R1->R2->R6->R7->R4->R9). Reply: Resv communicates tags and reserves bandwidth on each link.]
Figure 41 - Traffic Engineering Path Setup
Once you have decided to set up an LSP for a tunnel, you do so using RSVP with certain extensions to support this feature. In RSVP, the forward leg of the signaling message is called the path message, and the reverse leg is called the reservation message. One of the extensions is that the path message can carry the source route in a new object. Resources are actually allocated on the reverse leg with the reservation message. In addition to bandwidth, which is an existing RSVP resource, there are extensions so that labels can be allocated and transmitted in the reverse direction on the reservation message.
[24] Note that the usage of RSVP in Traffic Engineering deviates from the original design goal of RSVP. Extensions to RSVP and the justification for using RSVP are discussed in a later section.
In Figure 41 we are establishing a tunnel from R1 to R9 along the path shown in the figure. That path is included in the path message generated by R1, and it directs the message along the marked path from the head of the tunnel to the tail. In the reverse direction, the reservation message flows back along whatever series of hops was established by the path message. At each hop, the tag from the hop closer to the tail is received and programmed into the MPLS forwarding table. A new tag is allocated, and that new tag or label is sent upstream towards the head, until eventually the head is reached and knows that, to send traffic down the tunnel, it should use label 49.
One feature of interest about the resulting LSP, and about MPLS tunnels under IOS in general, is that they are unidirectional. Traffic flows from the head to the tail, but there is no automatic reverse direction. So you could not, for instance, run an adjacency over one of these MPLS tunnels, because the traffic is one way.
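The label allocation on the reservation leg can be modelled with a short loop that walks the explicit route from the tail back to the head. This is a sketch only: the function names are hypothetical, and although the label values are the ones visible in the figure, only label 49 at the head-end is stated in the text; which router allocates which of the other values is an assumption made here for illustration.

```python
def signal_lsp(explicit_route, allocate_label):
    """Simulate the Resv leg of LSP setup.

    Walks the midpoint routers from tail to head. Each router programs
    {in_label: out_label} and advertises its in_label upstream.
    Returns (label used by the head-end, per-router forwarding tables).
    """
    tables = {}
    out_label = "Pop"  # what the tail-end asks its upstream neighbor to do
    for router in reversed(explicit_route[1:-1]):
        in_label = allocate_label(router)
        tables[router] = {in_label: out_label}
        out_label = in_label
    return out_label, tables


# Assumed per-router allocation (only the head-end value 49 is from the text).
ASSUMED_LABELS = {"R2": 49, "R6": 17, "R7": 32, "R4": 22}

head_label, tables = signal_lsp(
    ["R1", "R2", "R6", "R7", "R4", "R9"], ASSUMED_LABELS.get
)
```

Because every entry maps one incoming label to one outgoing label in the head-to-tail direction, the resulting state is inherently one-way, which is why a second tunnel is needed for return traffic.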
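[Figure: Fast Reroute link protection between R1 and R9, with routers R4, R5, R6, R7, R8 and R9 shown. Label operations shown: Push 37, Swap 17->22, Pop 22, Pop 14.]
Label Stack:
R1: 37
R2: 17 14
R6: 22 14
R7: 14
R4:
R9: None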
The example in Figure 1 illustrates how Fast Reroute link protection is used to protect traffic carried in a TE tunnel between devices R1 and R9 as it traverses the mid-point link between devices R2 and R4. [The TE tunnel from R1 to R9 is considered to be the primary tunnel and is defined by labels 37, 14, and Pop.] To protect the R2-R4 link, you create a backup tunnel that runs from R2 to R4 by way of R6 and R7. This backup tunnel is defined by labels 17, 22, and Pop. When R2 is notified that the link between it and R4 is no longer available, it simply forwards traffic destined for R4 through the backup tunnel. That is accomplished by pushing label 17 onto packets destined to R4 after the normal swap operation (which replaces label 37 with label 14) has been performed. Pushing label 17 onto packets forwards them along the backup tunnel, thereby routing traffic around the failed link. The decision to reroute packets from the primary tunnel to the backup tunnel is made solely by R2 upon detection of the link failure. The Fast Reroute feature has two noticeable benefits: increased reliability, and minimal loss of IP traffic during a link failure.
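The label stack listed above can be reproduced by replaying the per-router label operations during failover. The sketch below is a toy model with hypothetical function names; the label values and the order of operations are taken from the description above.

```python
def apply_ops(stack, ops):
    """Apply push/swap/pop operations to a label stack (top is index 0)."""
    stack = list(stack)
    for op, *arg in ops:
        if op == "push":
            stack.insert(0, arg[0])
        elif op == "swap":
            stack[0] = arg[0]
        elif op == "pop":
            stack.pop(0)
    return stack


# Operations performed while the R2-R4 link is down.
FAILOVER_OPS = [
    ("R1", [("push", 37)]),                # head-end imposes the primary label
    ("R2", [("swap", 14), ("push", 17)]),  # normal swap, then backup label
    ("R6", [("swap", 22)]),                # backup tunnel midpoint
    ("R7", [("pop",)]),                    # end of backup tunnel
    ("R4", [("pop",)]),                    # end of primary tunnel label
]


def trace(hops):
    """Return the label stack on the packet as it leaves each router."""
    stack, seen = [], {}
    for router, ops in hops:
        stack = apply_ops(stack, ops)
        seen[router] = list(stack)
    return seen
```

Note that R4 receives the packet with label 14 on top, exactly as it would have from R2 over the failed link, so R4 needs no knowledge that a reroute took place.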
TE/TE-FRR Design
Deciding on the tunnel topology and tunnel types
How to Route Traffic Into TE Tunnels
However, this method does not scale in larger, more complex topologies and can be prone to routing loops unless it is provisioned carefully. An example syntax is:
ip route H.H.H.H 255.255.255.255 Tunnel1 (where H.H.H.H is the IP destination)
Auto-Route
Cisco IOS MPLS Autoroute Announce installs the routes announced by the tail-end router and its downstream routers into the routing table (forwarding table) of the head-end router as directly reachable through the tunnel. The Constraint-Based Routing algorithm allows MPLS TE to establish a Label Switched Path from the head-end to the tail-end node. By default, those paths are not announced to the IGP routing protocol. Hence, any prefixes/networks announced by the tail-end router and its downstream routers would not be "visible" through those paths. For every MPLS TE tunnel configured with Autoroute Announce, the link-state IGP installs the routes announced by the tail-end router and its downstream routers into the RIB. Therefore, all the traffic directed to prefixes topologically behind the tunnel tail-end is pushed onto the tunnel.
To better understand this feature, consider an example with and without Autoroute Announce enabled. Consider the topology of Figure 43. For the sake of simplicity, assume that Ri's loopback address is i.i.i.i.
Figure 43 - Topology Without Tunnels
The corresponding routing table on Router R1 with normal IGP and no MPLS TE looks like the following.
Figure 44 - R1 Routing Table No MPLS TE
Considering the same topology as in Figure 43, let us now introduce two MPLS Traffic Engineering tunnels, T1 and T2. Tunnel T1 originates in R1 and its tail end is R4. Tunnel T2 originates in R1 and its tail end is R5. MPLS TE Autoroute Announce is enabled on the two tunnels. The resulting R1 routing table entries are given in Figure 46.
The routing tables (Figure 44 and Figure 46) demonstrate that, with MPLS TE Autoroute Announce, R4 and R5 are directly reachable through tunnels T1 and T2 respectively. Similarly, R8 is now reachable through tunnel T1 via R4 instead of the "physical" connection. Without Cisco MPLS TE Autoroute Announce, even though tunnel T1 is up, the route to R8 uses the "physical" connection (as in Figure 44).
Forwarding Adjacency
The MPLS TE Forwarding Adjacency feature allows a network administrator to handle a traffic engineering label-switched path (LSP) tunnel as a link in an Interior Gateway Protocol (IGP) network based on the Shortest Path First (SPF) algorithm. A forwarding adjacency can be created between routers regardless of their location in the network. The routers can be located multiple hops from each other, as shown in Figure 47.
Figure 47 - Forwarding Adjacency Topology
As a result, a TE tunnel is advertised as a link in an IGP network with the link's cost associated with it. Routers outside of the TE domain see the TE tunnel and use it to compute the shortest path for routing traffic throughout the network.
Benefits
TE Tunnel Interfaces Advertised for SPF: TE tunnel interfaces are advertised in the IGP network just like any other links. Routers can then use these advertisements in their IGPs to compute the SPF even if they are not the head end of any TE tunnels.
Restrictions
- Using the MPLS TE Forwarding Adjacency feature increases the size of the IGP database by advertising a TE tunnel as a link.
- The MPLS TE Forwarding Adjacency feature is supported by Intermediate System-to-Intermediate System (IS-IS). Open Shortest Path First (OSPF) support will be available in a future release.
- When the MPLS TE Forwarding Adjacency feature is enabled on a TE tunnel, the link is advertised in the IGP network as a Type Length Value (TLV) 22 without any TE sub-TLV.
- MPLS TE forwarding adjacency tunnels must be configured bidirectionally.
- Do not use the tunnel mpls traffic-eng autoroute announce statement in your configuration when you are using forwarding adjacency.
Restrictions
This feature is limited to FRR functionality and the order of the failed-over routing prefixes. This feature does not add, delete, or modify the routing prefixes in the FRR database; it only re-sorts them.
The following command output shows the FRR database before it is reordered:
Router# show mpls traffic-eng fast-reroute database
Tunnel head fast reroute information:
Prefix          Tunnel  In-label  Out intf/label  FRR intf/label     Status
10.0.6.1/32     Tu3     12307     PO1/0:Pop tag   Tu10:tag-implicit  ready
10.0.7.1/32     Tu3     12306     PO1/0:12305     Tu10:tag-implicit  ready
10.0.8.1/32     Tu3     12304     PO1/0:12304     Tu10:tag-implicit  ready
10.0.0.36/30    Tu3     12314     PO1/0:Pop tag   Tu10:tag-implicit  ready
10.0.0.40/30    Tu3     12312     PO1/0:Pop tag   Tu10:tag-implicit  ready
10.0.0.48/30    Tu3     12316     PO1/0:Pop tag   Tu10:tag-implicit  ready
10.0.0.52/30    Tu3     12317     PO1/0:12307     Tu10:tag-implicit  ready
10.0.0.60/30    Tu3     12315     PO1/0:Pop tag   Tu10:tag-implicit  ready
10.0.0.64/30    Tu3     12318     PO1/0:12308     Tu10:tag-implicit  ready
In the following command output, the last prefix, which is 10.0.0.64/30, is placed first in the FRR database:
In the following command output, the ACL is applied globally:
Router(config)# mpls traffic-eng fast-reroute acl 1
In the following command output, the 10.0.0.64/30 prefix has been reordered and now appears first in the FRR database:
Router# show mpls traffic-eng fast-reroute database
Tunnel head fast reroute information: Acl in use 1
Prefix          Tunnel  In-label  Out intf/label  FRR intf/label     Status
10.0.0.64/30    Tu3     12318     PO1/0:12308     Tu10:tag-implicit  ready
10.0.6.1/32     Tu3     12307     PO1/0:Pop tag   Tu10:tag-implicit  ready
10.0.7.1/32     Tu3     12306     PO1/0:12305     Tu10:tag-implicit  ready
10.0.8.1/32     Tu3     12304     PO1/0:12304     Tu10:tag-implicit  ready
10.0.0.36/30    Tu3     12314     PO1/0:Pop tag   Tu10:tag-implicit  ready
10.0.0.40/30    Tu3     12312     PO1/0:Pop tag   Tu10:tag-implicit  ready
10.0.0.48/30    Tu3     12316     PO1/0:Pop tag   Tu10:tag-implicit  ready
10.0.0.52/30    Tu3     12317     PO1/0:12307     Tu10:tag-implicit  ready
10.0.0.60/30    Tu3     12315     PO1/0:Pop tag   Tu10:tag-implicit  ready
LSP midpoint frr information:
LSP identifier  In-label  Out intf/label  FRR intf/label  Status
3 Implementation Of TE-FRR
3 Network Architecture
Introduction
The core network of "3" is illustrated in Figure 48 below. It consists of 3 major POPs deployed in major cities within the U.K.
Figure 48 - "3" Core Network Architecture
The core network is built entirely out of 124XX routers, with 7200s used as Route Reflectors. The network utilises MPLS-VPN L3 (RFC 2547). Cisco 12416s are used as core switching routers and interface to a Nortel Optera DWDM network for optical transport. OC-192 POS linecards are used to build a 10G network infrastructure, and these nodes are used as P devices in the context of the MPLS-VPN. Cisco 12410s are used as edge routers (PE) and are interconnected via OC-48 POS linecards to the P routers within the POP. VPN interfaces are present on the GigE cards within these routers. Initially Trident (3 x GigE) linecards were used; these were later swapped out for the new Tango (10 x GigE) linecards. The design uses a range of PE-CE connection models for the various VPNs:
- Static
- Connected
- OSPF
TE-FRR Design
In the design it was decided to only protect the core OC-192 POS (Inter-POP) links as these had the greatest chance of failure compared to the Intra-POP links. Obviously TE-FRR provides a very cost effective mechanism of link protection compared to Sonet APS. In the design IP traffic will be protected in the core by Fast Re-Route (FRR) for link protection for sub 50ms performance. Tunnel Engineering aims to optimize network resource usage by directing traffic onto LSP tunnels established according to criteria other than lowest cost or fewest hops, which existing routing protocols use today. For example, to minimize congestion and maximize performance, an ISP might want all traffic destined for a particular network to use the path with maximum bandwidth. Fast restoration is possible within 50 milliseconds. This is because no signaling is required, the backup tunnel is already in place, and the ingress to the back-up tunnel can be co-located on the device that detects the failure. Protection and restoration span is flexible. Backup LSP tunnels can be set up to protect individual links. MPLS-TE FRR will be used to protect all the OC-192 POS links between the 3 x GSRs in the test network. In the event of a link failure, the backup FRR tunnels will provide an immediate local path around the failure until the primary tunnel has re-optimised.
Primary Tunnels
The design has a number of 1-hop primary tunnels running between the POPs. Each of the three POPs has a primary tunnel to each of its two neighbours, which makes a total of 6 primary tunnels. Each primary tunnel is dynamically routed to the TE loopback address of the neighbouring POP. Initially autoroute was used as the mechanism for injecting traffic into the tunnels; however, this was replaced with Forwarding Adjacency during system testing due to unexpected traffic loss (see Sec XXX). It is important to note that, because 1-hop tunnels are used, the tunnel head end is also the point of local repair (PLR), so after an FRR operation the primary tunnel will re-route across the 2-hop path. This will happen after the fast re-write operation.
Backup Tunnels
Each protected link has a 2-hop backup tunnel provisioned as the alternate path when FRR link protection (FRR-LP) takes over. Each backup tunnel is explicitly configured to go via the alternate POP to reach the original destination POP. Figure 10 gives an example of the tunnel provisioning. Explicit configuration is the sensible choice here, because each backup tunnel is meant to cross one specific 2-hop path.
Figure 49 - Illustration of Primary and Backup TE Tunnels
[Figure labels: Manchester GSR2; primary link used by primary tunnel, backed up by FRR; FRR backup tunnel via alternative STM4 interface]
i.p addresses
Table 9

Source Router | Description | Tunnel Number | Explicit/Dynamic | Final Destination
GSR1 | Primary 1-2 | 1 | Dynamic | GSR2
GSR1 | Primary 1-3 | 2 | Dynamic | GSR3
GSR1 | Backup of 1-2 | 11 | Explicit via GSR3 | GSR2
GSR1 | Backup of 1-3 | 12 | Explicit via GSR2 | GSR3
GSR2 | Primary 2-1 | 1 | Dynamic | GSR1
GSR2 | Primary 2-3 | 2 | Dynamic | GSR3
GSR2 | Backup of 2-1 | 11 | Explicit via GSR3 | GSR1
GSR2 | Backup of 2-3 | 12 | Explicit via GSR1 | GSR3
GSR3 | Primary 3-1 | 1 | Dynamic | GSR1
GSR3 | Primary 3-2 | 2 | Dynamic | GSR2
GSR3 | Backup of 3-1 | 11 | Explicit via GSR2 | GSR1
GSR3 | Backup of 3-2 | 12 | Explicit via GSR1 | GSR2
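The provisioning pattern in Table 9 is regular enough to generate. The following Python sketch rebuilds the twelve rows for a three-node triangle, assuming the numbering convention seen in the table (primaries numbered 1 and 2, backups 11 and 12). The function name `tunnel_plan` and the dictionary keys are illustrative only and are not part of the design.

```python
def tunnel_plan(routers=("GSR1", "GSR2", "GSR3")):
    """Return primary and backup tunnel rows for a three-router triangle."""
    if len(routers) != 3:
        raise ValueError("this sketch only covers the three-node triangle")
    rows = []
    for src in routers:
        neighbours = [r for r in routers if r != src]
        # One dynamically routed 1-hop primary tunnel per neighbour.
        for number, dst in enumerate(neighbours, start=1):
            rows.append({
                "source": src,
                "description": f"Primary {src[3:]}-{dst[3:]}",
                "tunnel": number,
                "path": "Dynamic",
                "destination": dst,
            })
        # One explicit 2-hop backup per protected link, via the other neighbour.
        for number, dst in enumerate(neighbours, start=11):
            via = neighbours[0] if dst == neighbours[1] else neighbours[1]
            rows.append({
                "source": src,
                "description": f"Backup of {src[3:]}-{dst[3:]}",
                "tunnel": number,
                "path": f"Explicit via {via}",
                "destination": dst,
            })
    return rows


if __name__ == "__main__":
    for row in tunnel_plan():
        print(row)
```

Generating the rows this way makes it easy to check that every protected link has exactly one backup tunnel and that no backup is routed over the link it protects.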
Tunnel Provisioning

All Primary TE tunnel parameters will be as follows:
- IP unnumbered to Loopback 0
- Path option: dynamic
- Autoroute announce
- Priority 5 5
- Bandwidth 1
- Fast Re-Route enabled
POS interface specifics:
- Enable AIS alarm when the interface is shut down
- IP RSVP bandwidth to match link speed
Sample configurations
Birmingham P Router
interface Tunnel1001
 description from bm0gsr01 tunnel1001 to hh0gsr01 tunnel1002, Primary
 ip unnumbered Loopback0
 no ip directed-broadcast
 mpls label protocol tdp
 tag-switching ip
 tunnel destination 172.31.252.254
 tunnel mode mpls traffic-eng
 tunnel mpls traffic-eng autoroute announce
 tunnel mpls traffic-eng forwarding-adjacency
 tunnel mpls traffic-eng priority 5 5
 tunnel mpls traffic-eng bandwidth 1
 tunnel mpls traffic-eng path-option 1 dynamic
 tunnel mpls traffic-eng record-route
 tunnel mpls traffic-eng fast-reroute
interface Tunnel1002
 description from bmgsr01 tunnel1002 to mr0gsr01 tunnel1002, Primary
 ip unnumbered Loopback0
 no ip directed-broadcast
 mpls label protocol tdp
 tag-switching ip
 tunnel destination 172.31.248.254
 tunnel mode mpls traffic-eng
 tunnel mpls traffic-eng autoroute announce
 tunnel mpls traffic-eng forwarding-adjacency
 tunnel mpls traffic-eng priority 5 5
 tunnel mpls traffic-eng bandwidth 1
 tunnel mpls traffic-eng path-option 1 dynamic
 tunnel mpls traffic-eng record-route
 tunnel mpls traffic-eng fast-reroute
!
interface Tunnel2001
 description from bm0gsr01 tunnel2001 via mr0gsr01 to hh0gsr01 tunnel2002, Backup of pos3/0
 ip unnumbered Loopback0
 no ip directed-broadcast
 tunnel destination 172.31.252.254
 tunnel mode mpls traffic-eng
 tunnel mpls traffic-eng priority 0 0
 tunnel mpls traffic-eng path-option 1 explicit name backup-to-hh01-via-mr01
 tunnel mpls traffic-eng record-route
!
interface Tunnel2002
 description from bm0gsr01 tunnel2002 via hh0gsr01 to mr0gsr01 tunnel2002, Backup of pos12/0
 ip unnumbered Loopback0
 no ip directed-broadcast
 tunnel destination 172.31.248.254
 tunnel mode mpls traffic-eng
 tunnel mpls traffic-eng priority 0 0
 tunnel mpls traffic-eng path-option 1 explicit name backup-to-mr01-via-hh01
 tunnel mpls traffic-eng record-route
interface POS3/0
 description from bm0gsr01 pos 3/0 to hh0gsr01 pos 12/0 STM-64
 ip address 172.31.254.6 255.255.255.252
 no ip directed-broadcast
 no ip proxy-arp
 ip router isis
 encapsulation ppp
 carrier-delay msec 0
 mpls label protocol tdp
 mpls traffic-eng tunnels
 mpls traffic-eng backup-path Tunnel2001
 tag-switching ip
 no peer neighbor-route
 crc 32
 clock source internal
 pos ais-shut
 pos framing sdh
 pos report lrdi
 pos flag s1s0 2
 tx-cos STM64-TX
 no cdp enable
 isis circuit-type level-2-only
 isis metric 100 level-2
 isis password vlPhuj8p5 level-2
 ip rsvp bandwidth 10000000 10000000
interface POS12/0
 description from bm0gsr01 pos 12/0 to mr0gsr01 pos 12/0 STM-64
 ip address 172.31.254.17 255.255.255.252
 no ip directed-broadcast
 no ip proxy-arp
 ip router isis
 encapsulation ppp
 carrier-delay msec 0
 mpls label protocol tdp
 mpls traffic-eng tunnels
 mpls traffic-eng backup-path Tunnel2002
 tag-switching ip
 no peer neighbor-route
 crc 32
 clock source internal
 pos ais-shut
 pos framing sdh
 pos report lrdi
 pos flag s1s0 2
 tx-cos STM64-TX
 no cdp enable
 isis circuit-type level-2-only
 isis metric 100 level-2
 isis password vlPhuj8p5 level-2
 ip rsvp bandwidth 10000000 10000000
ip explicit-path name backup-to-hh01-via-mr01 enable
 next-address 172.31.254.18
 next-address 172.31.254.1
!
ip explicit-path name backup-to-mr01-via-hh01 enable
 next-address 172.31.254.5
 next-address 172.31.254.2
The configurations for Hemel and Manchester are in principle identical, apart from the IP addresses.
Quality of Service
Introduction
To fulfil the ST requirement for four distinct classes of service, each with its own service characteristics, QoS mechanisms are deployed on the access layer and on the backbone links. The following section describes the technical implementation and features that form the basis for a set of new, innovative products.

Scalability and stability are the main criteria for any extension of the network. It is therefore necessary to aggregate IP streams with identical flow characteristics. The term used for these aggregates in this solution is service classes. Dedicated handling of single streams is only meaningful in special cases where high bandwidths are involved, and there are no plans to introduce it in the first instance.

From the technical point of view, the number of service classes should be strictly limited. This does not restrict the construction of various commercial products on top of them. Service level agreements (SLAs) form the definition interface for the service that ST will deliver to the customer. Parameters should describe a probability for a certain service and will be reported on a per-class basis.
For the ST MPLS backbone network, a robust solution aligned with the basic ideas of the IETF's DiffServ approach appears to be practicable at present. With the intended MPLS solution, a maximum of 8 code points per path can be supported. These are distinguished using the three experimental (EXP) bits of the MPLS shim header. Because DiffServ is based on relative priorities, a large share of best-effort background traffic is required to produce efficient, high-quality service classes. The strength of a large IP backbone network lies in the fact that high-priority and low-priority traffic is merged on a single network platform. This results in synergy that permits optimum resource utilisation. The bundling of many different traffic streams (statistical multiplexing) smooths individual bursts.
Figure 50
The three most significant bits of the RFC 791 ToS field, the precedence bits, define the IP packet priority or importance.

XXX00000 Bits 0, 1, 2 = Precedence, where:
111 = Network Control = Precedence 7
110 = Internetwork Control = Precedence 6
101 = CRITIC/ECP = Precedence 5
100 = Flash Override = Precedence 4
011 = Flash = Precedence 3
010 = Immediate = Precedence 2
001 = Priority = Precedence 1
000 = Routine = Precedence 0

The four bits of the RFC 1349 TOS are used in IOS configuration and have the following semantics:

000XXXX0 Bits 3, 4, 5, 6:
1000 = Minimize delay
0100 = Maximize throughput
0010 = Maximize reliability
0001 = Minimize monetary cost
0000 = Normal service

0000000X Bit 7: Reserved for future use

This one-byte ToS field has been almost completely unused since it was proposed almost 20 years ago. Only in the last few years have Cisco and other router companies begun utilising the Precedence bits for making forwarding decisions. The DiffServ standard follows a similar scheme to RFC 791, but utilises more bits for setting priority. The new standard maintains backward compatibility with RFC 791 implementations, but allows more efficient use of bits 3, 4, and 5. (Bits 6 and 7 will still be reserved for future development.) With the additional 3 bits, there are now a total of 64 possible code points instead of the previous 8 precedence values.
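As a rough illustration of the bit layout described above, the sketch below splits a ToS byte into its RFC 791 precedence, its RFC 1349 TOS bits and the 6-bit DiffServ code point. The function name `decode_tos` and the returned dictionary keys are hypothetical.

```python
PRECEDENCE_NAMES = {
    7: "Network Control",
    6: "Internetwork Control",
    5: "CRITIC/ECP",
    4: "Flash Override",
    3: "Flash",
    2: "Immediate",
    1: "Priority",
    0: "Routine",
}


def decode_tos(tos: int) -> dict:
    """Split a one-byte ToS value into precedence, TOS bits and DSCP."""
    if not 0 <= tos <= 0xFF:
        raise ValueError("ToS must fit in one byte")
    precedence = tos >> 5          # bits 0-2 (most significant)
    tos_bits = (tos >> 1) & 0x0F   # bits 3-6
    dscp = tos >> 2                # bits 0-5 as read by DiffServ
    return {
        "precedence": precedence,
        "precedence_name": PRECEDENCE_NAMES[precedence],
        "tos_bits": format(tos_bits, "04b"),
        "dscp": dscp,
    }


if __name__ == "__main__":
    print(decode_tos(0xB8))  # ToS byte commonly seen on voice bearer packets
```

The same byte is read in two ways: an RFC 791 node looks only at the top three bits, while a DiffServ node reads the top six.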
RFC 2475 defines Per Hop Behaviour (PHB) as the externally observable forwarding behaviour applied at a DiffServ-compliant node to a DiffServ Behaviour Aggregate (BA). With the ability of the system to mark packets according to DSCP setting, collections of packets with the same DSCP setting and sent in a particular direction can be grouped into a BA. Packets from multiple sources or applications can belong to the same BA. In other words, a PHB refers to the packet scheduling, queuing, policing, or shaping behaviour of a node on any given packet belonging to a BA, as configured by a service level agreement (SLA) or a policy map. The following sections describe the four available standard PHBs:
- Default PHB (as defined in RFC 2474)
- Class-Selector PHB (as defined in RFC 2474)
- Assured Forwarding (AFxy) PHB (as defined in RFC 2597)
- Expedited Forwarding (EF) PHB (as defined in RFC 2598)
Default PHB
The default PHB essentially specifies that a packet marked with a DSCP value of 000000 (recommended) receives the traditional best-effort service from a DS-compliant node (that is, a network node that complies with all of the core DiffServ requirements). Also, if a packet arrives at a DS-compliant node, and the DSCP value is not mapped to any other PHB, the packet will get mapped to the default PHB. For more information about default PHB, refer to RFC 2474, Definition of the Differentiated Services Field in IPv4 and IPv6 Headers.
Class-Selector PHB:
To preserve backward compatibility with any IP Precedence scheme currently in use on the network, DiffServ has defined DSCP values of the form xxx000, where x is either 0 or 1. These DSCP values are called Class-Selector Code Points. (The DSCP value for a packet with default PHB, 000000, is also a Class-Selector Code Point.) The PHB associated with a Class-Selector Code Point is a Class-Selector PHB. These Class-Selector PHBs retain most of the forwarding behaviour of nodes that implement IP Precedence-based classification and forwarding. For example, packets with a DSCP value of 110000 (the equivalent of the IP Precedence-based value of 110) have preferential forwarding treatment (for scheduling, queuing, and so on) compared to packets with a DSCP value of 100000 (the equivalent of the IP Precedence-based value of 100). These Class-Selector PHBs ensure that DS-compliant nodes can coexist with IP Precedence-based nodes.

The DiffServ standard utilises the same precedence bits (the most significant bits: 0, 1, and 2) for priority setting, but further clarifies their functions and definitions, and offers finer priority granularity through use of the next three bits in the ToS field. DiffServ reorganises (and renames) the precedence levels (still defined by the three most significant bits of the ToS field) into the following categories:
Table 10

Precedence | Class-Selector PHBs
7 | Stays the same (link layer and routing protocol keep alive)
6 | Stays the same (used for IP routing protocols)
5 | Class 5
4 | Class 4
3 | Class 3
2 | Class 2
1 | Class 1
0 | Best effort
For more information about class-selector PHB, refer to RFC 2474, Definition of the Differentiated Services Field in IPv4 and IPv6 Headers.
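The xxx000 rule can be expressed in two lines of arithmetic. The sketch below, with hypothetical helper names, converts an IP Precedence value into its Class-Selector Code Point and tests whether a given DSCP is a Class-Selector Code Point.

```python
def class_selector_dscp(precedence: int) -> int:
    """Return the Class-Selector DSCP (xxx000) for an IP Precedence value."""
    if not 0 <= precedence <= 7:
        raise ValueError("precedence must be in 0..7")
    return precedence << 3  # precedence bits followed by three zero bits


def is_class_selector(dscp: int) -> bool:
    """True when the three low-order DSCP bits are all zero."""
    return 0 <= dscp <= 63 and (dscp & 0b000111) == 0


if __name__ == "__main__":
    for prec in range(8):
        dscp = class_selector_dscp(prec)
        print(prec, format(dscp, "06b"), dscp)
```

For precedence 6 this yields 110000 (decimal 48), and for precedence 4 it yields 100000 (decimal 32), matching the example in the text.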
Further, the AFxy PHB defines four AF classes: AF1, AF2, AF3, and AF4. Each class is assigned a specific amount of buffer space and interface bandwidth, according to the SLA with the service provider or the policy map. Within each AF class, you can specify three drop precedence (dP) values: 1, 2, and 3. An Assured Forwarding PHB is written as AFxy, where x represents the AF class number (1, 2, 3, or 4) and y represents the dP value (1, 2, or 3) within the AFx class. In instances of network traffic congestion, if packets in a particular AF class (for example, AF1) need to be dropped, they will be dropped according to the following guideline:

dP(AFx1) <= dP(AFx2) <= dP(AFx3)

where dP(AFxy) is the probability that packets of the AFxy class will be dropped. In other words, y denotes the dP within an AFx class. The dP method penalises traffic flows within a particular BA that exceed the assigned bandwidth. Packets on these offending flows could be re-marked by a policer to a higher drop precedence.
Bits 3 and 4 of DiffServ field allow further priority granularity through the specification of a packet drop probability for any of the defined classes. Collectively, Classes 1-4 are referred to as Assured Forwarding (AF). The following table illustrates the DSCP coding for specifying the priority level (class) plus the drop percentage. (Bits 0, 1, and 2 define the class; bits 3 and 4 specify the drop percentage; bit 5 is always 0.) Using this system, a device would first prioritise traffic by class, then differentiate and prioritise same-class traffic by considering the drop percentage. It is important to note that this standard has not specified a precise definition of "low," "medium," and "high" drop percentages. Additionally, not all devices will recognise the DiffServ bit 3 and 4 settings. Remember also that even when the settings are recognised, they do not necessarily trigger the same forwarding action to be taken by each type of device on the network--each device will implement its own response in relation to the packet priorities it detects. The DiffServ standard is meant to allow a finer granularity of priority setting for the applications and devices that can make use of it, but it does not specify interpretation (that is, action to be taken).
DSCP coding of the Assured Forwarding classes:

Drop percentage | Class 1 | Class 2 | Class 3 | Class 4
Low | 001 010 (AF11) | 010 010 (AF21) | 011 010 (AF31) | 100 010 (AF41)
Medium | 001 100 (AF12) | 010 100 (AF22) | 011 100 (AF32) | 100 100 (AF42)
High | 001 110 (AF13) | 010 110 (AF23) | 011 110 (AF33) | 100 110 (AF43)
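The AF coding rule stated above (bits 0 to 2 carry the class, bits 3 and 4 the drop precedence, bit 5 is zero) can be checked with a short calculation. The helper name `af_dscp` is an assumption made for this sketch.

```python
def af_dscp(af_class: int, drop_precedence: int) -> int:
    """Return the decimal DSCP for AFxy (x = class, y = drop precedence)."""
    if af_class not in (1, 2, 3, 4):
        raise ValueError("AF class must be 1..4")
    if drop_precedence not in (1, 2, 3):
        raise ValueError("drop precedence must be 1..3")
    # Class occupies the three high-order bits, drop precedence the next two,
    # and the last bit stays 0.
    return (af_class << 3) | (drop_precedence << 1)


if __name__ == "__main__":
    for x in (1, 2, 3, 4):
        for y in (1, 2, 3):
            value = af_dscp(x, y)
            print(f"AF{x}{y} = {format(value, '06b')} = {value}")
```

This reproduces the familiar decimal values, for example AF11 = 10 and AF31 = 26.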
Packet loss causes voice clipping and skips. The industry standard codec algorithms used in Cisco Digital Signal Processor (DSP) can correct for up to 30 ms of lost voice. Cisco Voice over IP (VoIP) technology uses 20-ms samples of voice payload per VoIP packet. Therefore, for the codec correction algorithms to be effective, only a single packet can be lost during any given time. Packet delay can cause either voice quality degradation due to the end-to-end voice latency or packet loss if the delay is variable. If the end-to-end voice latency becomes too long (250 ms, for example), the conversation begins to sound like two parties talking on a CB radio. If the delay is variable, there is a risk of jitter buffer overruns at the receiving end. Eliminating drops and delays is even more imperative when including fax and modem traffic over IP networks. If packets are lost during fax or modem transmissions, the modems are forced to "retrain" to synchronize again. By examining the causes of packet loss and delay, we can gain an understanding of why Quality of Service (QoS) is needed. Network congestion can lead to both packet drops and variable packet delays. Voice packet drops from network congestion are usually caused by full transmit buffers on the egress interfaces somewhere in the network. As links or connections approach 100% utilization, the queues servicing those connections become full. When a queue is full, new packets attempting to enter the queue are discarded. Because network congestion is typically sporadic, delays from congestion tend to be variable in nature. Egress interface queue wait times or large serialization delays cause variable delays of this type. Both of these factors are discussed in the next section, "Delay and Jitter". Delay is the time it takes for a packet to reach the receiving endpoint after being transmitted from the sending endpoint. 
This time is termed the "end-to-end delay" and it consists of two components: fixed network delay and variable network delay. Jitter is the delta, or difference, in the total end-to-end delay values of two voice packets in the voice flow. Fixed network delay should be examined during the initial design of the VoIP network. The International Telecommunication Union (ITU) standard G.114 states that a one-way delay budget of 150 ms is acceptable for high voice quality. Research at Cisco has shown that there is a negligible difference in voice quality scores using networks built with 200-ms delay budgets. Examples of fixed network delay include the propagation delay of signals between the sending and receiving endpoints, voice encoding delay, and the voice packetization time for various VoIP codecs. Propagation delay calculations work out to almost 0.0063 ms/km. The G.729A codec, for example, has a 25 ms encoding delay value (two 10 ms frames + 5 ms look-ahead) and an additional 20 ms of packetization delay.
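To see how much of the 150 ms budget the fixed components consume, the figures quoted above can be added up. The sketch below uses only those figures; the 1000 km distance in the example and the function names are assumptions made for illustration.

```python
PROPAGATION_MS_PER_KM = 0.0063   # propagation delay quoted in the text
G729A_ENCODING_MS = 25.0         # two 10 ms frames + 5 ms look-ahead
PACKETIZATION_MS = 20.0          # additional packetization delay
ITU_G114_BUDGET_MS = 150.0       # one-way delay budget from G.114


def fixed_delay_ms(distance_km: float) -> float:
    """Fixed one-way delay: propagation plus G.729A encoding and packetization."""
    return distance_km * PROPAGATION_MS_PER_KM + G729A_ENCODING_MS + PACKETIZATION_MS


def remaining_budget_ms(distance_km: float) -> float:
    """Budget left for variable delay (queuing, serialization, jitter buffer)."""
    return ITU_G114_BUDGET_MS - fixed_delay_ms(distance_km)


if __name__ == "__main__":
    distance = 1000  # km, hypothetical path length
    print(f"fixed delay: {fixed_delay_ms(distance):.1f} ms")
    print(f"remaining:   {remaining_budget_ms(distance):.1f} ms")
```

Under these assumptions a 1000 km path uses roughly a third of the budget before any queuing or jitter buffering is taken into account.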
Congested egress queues and serialization delays on network interfaces can cause variable packet delays. Without Priority or Low-Latency Queuing (LLQ), queuing delay times equal serialization delay times as link utilization approaches 100%. Serialization delay is a constant function of link speed and packet size. As shown in Table 11, the larger the packet and the slower the link clocking speed, the greater the serialization delay. While this is a known ratio, it can be considered variable because a larger data packet can enter the egress queue before a voice packet at any time. If the voice packet must wait for the data packet to serialize, the delay incurred by the voice packet is its own serialization delay plus the serialization delay of the data packet in front of it. Using Link Fragmentation and Interleave (LFI) techniques, serialization delay can be configured to be a constant delay value.

Table 11 Serialisation delay [ms] as a function of link speed and packet size

Link speed \ packet size | 64 bytes | 128 bytes | 256 bytes | 512 bytes | 1024 bytes | 1500 bytes
56 kbps | 9 | 18 | 36 | 72 | 144 | 214
64 kbps | 8 | 16 | 32 | 64 | 128 | 187
128 kbps | 4 | 8 | 16 | 32 | 64 | 93
256 kbps | 2 | 4 | 8 | 16 | 32 | 46
512 kbps | 1 | 2 | 4 | 8 | 16 | 23
2048 kbps (E1) | 0.25 | 0.5 | 1 | 2 | 4 | 5.8
34 Mbps (E3) | 0.015 | 0.03 | 0.06 | 0.12 | 0.24 | 0.35
155 Mbps (STM-1) | 3.3*10^-3 | 0.006 | 0.013 | 0.026 | 0.052 | 0.077
622 Mbps (STM-4) | 0.82*10^-3 | 1.6*10^-3 | 3.3*10^-3 | 6.6*10^-3 | 0.013 | 0.019
2.5 Gbps (STM-16) | 0.2*10^-3 | 0.4*10^-3 | 0.82*10^-3 | 1.6*10^-3 | 3.3*10^-3 | 4.8*10^-3
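The entries of Table 11 follow from a single division: packet size in bits over link speed. A minimal sketch, with an assumed function name, that recomputes individual cells:

```python
def serialization_delay_ms(packet_bytes: int, link_kbps: float) -> float:
    """Time in ms to clock a packet of the given size onto a link."""
    if packet_bytes <= 0 or link_kbps <= 0:
        raise ValueError("packet size and link speed must be positive")
    # bits divided by kilobits-per-second gives milliseconds directly
    return packet_bytes * 8 / link_kbps


if __name__ == "__main__":
    for kbps in (56, 64, 128, 256, 512, 2048):
        row = [round(serialization_delay_ms(size, kbps), 2)
               for size in (64, 128, 256, 512, 1024, 1500)]
        print(kbps, row)
```

A 1500-byte packet on a 56 kbps link takes about 214 ms to serialize, which is why a voice packet queued behind it misses any reasonable delay target unless the data packet is fragmented.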
Because network congestion can be encountered at any time within a network, buffers can fill instantaneously. This instantaneous buffer utilization can lead to a difference in delay times between packets in the same voice stream. This difference, called jitter, is the variation between when a packet is expected to arrive and when it actually is received. To compensate for these delay variations between voice packets in a conversation, VoIP endpoints use jitter buffers to turn the delay variations into a constant value so that voice can be played out smoothly. Cisco VoIP endpoints use DSP algorithms that have an adaptive jitter buffer between 20 and 50 ms, as illustrated in the following picture. The actual size of the buffer varies between 20 and 50 ms based on the expected voice packet network delay. These algorithms examine the timestamps in the Real-time Transport Protocol (RTP) header of the voice packets, calculate the expected delay, and adjust the jitter buffer size accordingly. When this adaptive jitter buffer is configured, a 10-ms portion of "extra" buffer is configured for variable packet delays. For example, if a stream of packets is entering the jitter buffer with RTP timestamps indicating 23 ms of encountered network jitter, the receiving VoIP jitter buffer is sized at a maximum of 33 ms. If a packet's jitter is greater than 10 ms above the expected 23-ms delay variation (23 + 10 = 33 ms of dynamically allocated adaptive jitter buffer space), the packet is dropped.
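The sizing rule in the example above (expected delay variation plus a 10 ms margin) can be written as a small check. This is a simplified sketch of the arithmetic only, not of the DSP algorithm itself; the function names are hypothetical.

```python
EXTRA_BUFFER_MS = 10  # "extra" portion reserved for variable packet delays


def jitter_buffer_ms(expected_jitter_ms: float) -> float:
    """Buffer depth: expected delay variation plus the fixed extra margin."""
    return expected_jitter_ms + EXTRA_BUFFER_MS


def is_dropped(packet_jitter_ms: float, expected_jitter_ms: float) -> bool:
    """A packet whose jitter exceeds the buffer depth arrives too late to play."""
    return packet_jitter_ms > jitter_buffer_ms(expected_jitter_ms)


if __name__ == "__main__":
    print(jitter_buffer_ms(23))   # depth for the 23 ms example in the text
    print(is_dropped(30, 23))     # within the buffer
    print(is_dropped(40, 23))     # beyond the buffer, so discarded
```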
Figure 52
Voice quality is only as good as the quality of the weakest network link. Packet loss, delay, and delay variation all contribute to degraded voice quality. In addition, because network congestion (or more accurately, instantaneous buffer congestion) can occur at any time in any portion of the network, network quality is an end-to-end design issue.

Call admission control is another important issue that needs to be considered. Call admission control is a mechanism for ensuring that voice flows do not exceed the maximum provisioned bandwidth allocated for voice conversations. After doing the calculations to provision the network with the required bandwidth to support voice, data, and possibly video applications, it is important to ensure that voice does not oversubscribe the portion of the bandwidth allocated to it. While most QoS mechanisms are used to protect voice from data, call admission control is used to protect voice from voice. This is illustrated in the following figure, which shows an environment where the network has been provisioned to support two concurrent voice calls. If a third voice call is allowed to proceed, the quality of all three calls is degraded. Call admission control should be external to the network.

Figure 53 - Call admission control
LFI tools are used to fragment large data frames into regularly sized pieces and to interleave voice frames into the flow so that the end-to-end delay can be predicted accurately. This places bounds on jitter by preventing voice traffic from being delayed behind large data frames, as illustrated in the following figure. The two techniques used for this are FRF.12 for Frame Relay and Multilink Point-to-Point Protocol (MLPPP) for point-to-point serial links.
Figure 54
A 10-ms blocking delay is the recommended target to use for setting fragmentation size. To calculate the recommended fragment size, divide the recommended 10 ms of delay by the time it takes to send one byte of traffic at the provisioned line clocking speed, as follows:

Fragment_Size = (Max_Allowed_Jitter * Link_Speed_in_kbps) / 8

For example:

Fragment_Size = (10 ms * 56) / 8 = 70 bytes

The following table shows the recommended fragment size for various link speeds.

Table 12 Recommended fragment size

Link speed (kbps) | Recommended fragment size (bytes)
56 | 70
64 | 80
128 | 160
256 | 320
512 | 640
768 | 960
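The fragment size formula is easy to apply to any link speed. The sketch below simply evaluates it; the function name and the default of 10 ms are taken from the recommendation above, everything else is illustrative.

```python
def fragment_size_bytes(link_kbps: float, max_jitter_ms: float = 10) -> int:
    """Largest fragment that serializes within max_jitter_ms on the link."""
    if link_kbps <= 0 or max_jitter_ms <= 0:
        raise ValueError("link speed and jitter target must be positive")
    # ms * kbit/s = bits; divide by 8 for bytes
    return int(max_jitter_ms * link_kbps / 8)


if __name__ == "__main__":
    for kbps in (56, 64, 128, 256, 512, 768):
        print(kbps, fragment_size_bytes(kbps))
```

With the 10 ms target, each doubling of link speed doubles the fragment size, so a 256 kbps link works out to 320 bytes.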
The fragmentation size should be set larger than the largest VoIP packet, to ensure that no VoIP packets get fragmented. When using FRF.12 as an LFI mechanism on a Frame Relay access link, traffic shaping (either FRTS or dTS) becomes mandatory. Enabling FRF.12 has an impact on the FRTS / dTS shaping parameters, since it adds 4 bytes of overhead to each fragment (2 bytes of FRF.12 overhead and 2 bytes of Cisco encapsulation overhead). The FRTS implementation takes this additional overhead into account (but still not the FCS and flag overhead), whereas dTS does not take the additional FRF.12 / Cisco encapsulation overhead into account. This is because FRF.12 runs in distributed mode on the VIP (dFRF.12).
Delay Model
The delay model for an IP packet consists of the sum of the individual delays of the nodes and links that are part of the end-to-end connection. The main factors that determine the overall end-to-end delay are typically:
- Serialisation delay of narrow-band links
- Propagation delays of long distance connections
- Queuing delay in case of congestion situations
All times have to be described statistically, and must be seen as averages over a certain time period.

Table 13 The components of the end-to-end delay model

Decision Delay (TDecision): This is the time required in a node to decide which interface a packet should go out of. There can be a dependence on node utilisation, but in general on the high-end platforms TDecision < 1 ms.

Queuing Delay (TQueuing): Queuing delay depends on several variables: queue length, queuing mechanism, line utilisation, platform and CPU utilisation. During times of non-congestion, there is no queuing delay; once congestion occurs, the extra CPU cycles required to manage the scheduling have a small impact on the delay variable in the network.

Serialisation Delay (TSerialisation): This is the time that is necessary to put a packet of a certain size on a line of a certain speed (see Table 11).

Transmit Buffer Delay (TTransmit): On the egress interface a single buffer exists, which additionally has an influence on the transmit delay. This buffer is used to control the various queuing mechanisms (CBWFQ/MDRR) in front of the transmit queue, by using a threshold. The length of this queue can be configured. A suitable set-up has to be decided upon to minimise delay and maximise efficiency.

Propagation Delay (TPropagation): Determined by the speed of light in a fibre, which gives about 6 ms per 1000 km (2/3 c0).

Node Delay (TNode): The node delay summarises all node dependent delays per node.
$T_{Node} = T_{Decision} + T_{Queuing} + T_{Transmit}$

Link Delay (TLink): The link delay summarises all link dependent delays per link.

Core Delay (TCore)

Access Delay (TAccess): The access delay summarises all access dependent delays, which are all node and link delays in the access network. This includes CE routers, PE routers and the links in-between. Summarizing node and link delay for the access network simplifies the delay model.
$T_{Access}(x) = \sum_{Access(x)} T_{Node} + \sum_{Access(x)} T_{Link}$

End-to-End Delay (TEnd-to-End)
Figure 55
[Delay model diagram: path CE - PE - PE - CE, annotated with TDecision, TNode and TLink, and with TAccess, TCore and TAccess along the path]
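The delay components can be combined numerically once per-node and per-link values are known. The sketch below follows the node and access formulas given in the text. Treating the end-to-end delay as ingress access delay plus core delay plus egress access delay is an assumption read off the figure, and all class names, function names and sample values are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Node:
    decision_ms: float
    queuing_ms: float
    transmit_ms: float

    def delay_ms(self) -> float:
        # TNode = TDecision + TQueuing + TTransmit
        return self.decision_ms + self.queuing_ms + self.transmit_ms


def access_delay_ms(nodes, link_delays_ms) -> float:
    """TAccess(x): sum of node delays plus sum of link delays in the access part."""
    return sum(n.delay_ms() for n in nodes) + sum(link_delays_ms)


def end_to_end_delay_ms(ingress_access_ms, core_ms, egress_access_ms) -> float:
    """Assumed composition: TAccess + TCore + TAccess, as suggested by the figure."""
    return ingress_access_ms + core_ms + egress_access_ms


if __name__ == "__main__":
    ce = Node(decision_ms=0.5, queuing_ms=2.0, transmit_ms=1.0)
    pe = Node(decision_ms=0.5, queuing_ms=0.0, transmit_ms=0.5)
    access = access_delay_ms([ce, pe], link_delays_ms=[4.0])
    print(access, end_to_end_delay_ms(access, 6.0, access))
```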
Figure 56
[Figure labels: Unused; MPLS EXP]
The following figure demonstrates the DSCP/EXP location; the MPLS header is prepended to the front of the IP packet. It is also feasible that multiple labels are added to the front of the IP packet instead of the single label shown in the drawing (e.g. MPLS/VPN label, TE label, FRR label). In that case, the QoS features in MPLS core devices shall look only at the EXP bits of the top-most label, as the DSCP and the inner labels in the label stack may carry customer-defined classes of service.

Figure 57 DSCP / MPLS Headers
[Figure labels: IPv4 Domain: IPv4 Packet with DSCP "abcd"; MPLS Domain: IPv4 Packet with DSCP "abcd" behind Label x with EXP "ab"]
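One common way to derive the EXP bits is to copy the three most significant DSCP bits into the label, and the DSCP/EXP pairs used in this design (for example 10/1, 26/3, 46/5 and 48/6) are consistent with that. The sketch below assumes this mapping and models the label stack as a simple list with the top-most label first; the function names and label values are hypothetical.

```python
def dscp_to_exp(dscp: int) -> int:
    """Assumed mapping: EXP is the three most significant bits of the DSCP."""
    if not 0 <= dscp <= 63:
        raise ValueError("DSCP must be in 0..63")
    return dscp >> 3


def core_exp(label_stack) -> int:
    """A core (P) router classifies on the EXP of the top-most label only."""
    if not label_stack:
        raise ValueError("packet carries no label")
    _label, exp = label_stack[0]
    return exp


if __name__ == "__main__":
    for dscp in (0, 10, 26, 46, 48):
        print(dscp, dscp_to_exp(dscp))
    # Top-most label first: (TE label, EXP), then (VPN label, EXP).
    stack = [(1001, dscp_to_exp(46)), (27, 0)]
    print(core_exp(stack))
```

Because only the top-most label is inspected, the inner label and the customer's DSCP can carry different values without affecting core scheduling.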
deployment is a necessity. Therefore, we strongly recommend a tuning phase while beta customers are connected.

Table 14 CoS Mechanisms Overview

Marketing Class | Standard | Business | Streaming | Voice | - | -
Traffic | Best Effort data (e.g. http) | Business data (e.g. SNA) | Multimedia (e.g. Video) | VoIP | Routing updates | Management (e.g. SNMP)
PHB | BE | AF11 | AF31 | EF | CS6 | CS6
DSCP | 0 | 10 | 26 | 46 | 48 | 48
EXP | 0 | 1 | 3 | 5 | 6 | 6
Max. % of link BW | 25% | 25% | 25% | 25% | - | -
Queue Length | long | medium | short | very short | medium | medium
Classification, CE | any non-classified packet | ACL 100 | ACL 101 | ACL 102 | - | ACL 103
Classification, PE | DSCP | DSCP | DSCP | DSCP | - | ACL 103
Classification, P | EXP | EXP | EXP | EXP | - | -
Marking, CE | MQCLI | MQCLI | MQCLI | MQCLI | - | LPR
Marking, PE | - | - | - | - | - | LPR
Marking, P | - | - | - | - | - | -
Policing, CE | - | MQCLI | MQCLI | MQCLI | - | MQCLI
Policing, P/PE | - | - | - | - | - | -
Class Queuing, Access | class-default | business | streaming | voice (LLQ) | mgmt | mgmt
Class Queuing, Core | class-default | business | streaming | voice (LLQ) | business | business
Congestion Avoidance, CE/PE | DSCP WRED | DSCP WRED | DSCP WRED | Tail drop | DSCP WRED | DSCP WRED
Congestion Avoidance, P | EXP WRED | EXP WRED | EXP WRED | Tail drop (minTH=maxTH) | EXP WRED | EXP WRED
The drawing below displays an overview of QoS mechanisms used in the ST network. The following chapters will detail the QoS design on a hop-by-hop basis, following the packet from source (left CE) to its destination (right CE).
Figure 58
[Figure labels: Classification (ACL), Marking (CAR, DSCP), Policing (MQCLI), Queuing (DSCP), Cong. mgmt. (DSCP); LLQ, WRED; 10k, 7206VXR PE: Classification (QoS-group), Marking (DSCP->EXP auto), Queuing (DSCP), Cong. mgmt. (DSCP); GSR P: Classification (EXP), Queuing (EXP MDRR), Cong. mgmt. (EXP) [to-fabric, to-interface]; MDRR, WRED; MPLS; N/A]
The ACL for Management (103) traffic should match SNMP, TFTP, TELNET and any other required traffic to and from the network management systems IP address range.
!
access-list ... any eq bgp
access-list ... any eq rip
access-list ... <NOC_lan> eq telnet
access-list ... <NOC_lan> eq snmp
access-list ... <NOC_lan> eq tftp
!
Voice signalling traffic will need to be classified and marked appropriately. Depending on the customer VoIP implementation, the different possibilities are:
- RTCP: odd RTP port numbers
- H.323 / H.245 standard connect: TCP 11xxx
- H.323 / H.245 fast connect: TCP 1720
- H.323 / H.225 RAS: UDP 1719
- Skinny control traffic: TCP 2000-2002
- ICCP: TCP 8001-8002
- MGCP: UDP 2427, TCP 2428
Depending on the actual signalling method used (packet sizes), the speed of the access links and the number of concurrent voice call set-ups that need to be supported, two design options are possible for the queuing method:
- Queue the voice signalling packets in the same PQ as the actual voice bearer packets. This results in a simpler design but could delay the transmission of some of the voice bearer packets (dependent on voice signalling packet size, access link speed and number of concurrent voice call set-ups). This could then have an impact on the voice delay / jitter.
- Queue the voice signalling packets in another normal class queue. This should ideally be a separate class queue from the ones that are used for regular data traffic, to ensure delivery of the voice signalling packets. This results in a more complicated design where bandwidth needs to be allocated for the voice signalling class. Also, voice signalling packets might be delayed through the network, resulting in a delay in the voice call set-up process. The advantage is that the actual voice quality will not be impacted, as no voice signalling packets will travel in the PQ.
Testing has indicated that, without cRTP (Compressed Real-Time Transport Protocol) enabled, the effect of mapping VoIP signalling packets together with the VoIP bearer packets in the same priority queue is negligible. The signalling packets have little effect on latency, and they do not cause any drops, because of the default burst size of 200 ms that is built into the priority queue. Therefore, the design recommendation is to match the VoIP signalling packets with ACL 102 and queue them together with the VoIP bearer packets in the priority queue. It should however be understood that VoIP signalling implementations differ and that some might have a negative effect on the performance of the priority queue. In that event, the VoIP signalling traffic needs to be mapped to another class queue (Business, for example). The classified traffic will subsequently be mapped to the respective classes using the MQCLI. The Standard traffic will not match any of the classes and will be mapped to the default class (class-default). A maximum of 64 classes can be defined on a single router.
!
class-map match-all business
 match access-group 100
class-map match-all streaming
 match access-group 101
class-map match-all voice
 match access-group 102
class-map match-any management
 match access-group 103
!
Marking
After classification, packets need to be marked with their appropriate IP precedence or DSCP value. The following is the required configuration for Class Based Marking on the CE router. Marking of the Business, Streaming and Voice classes is actually configured through the MQCLI police command, because these classes need to be policed to the SLA limits. The Standard traffic class is not policed, hence all of its traffic can be marked with the MQCLI set ip dscp command.
!
policy-map customer_profile
 class business
  police 128000 8000 16000 conform-action set-dscp-transmit 10 exceed-action drop
 class streaming
  police 64000 2000 2000 conform-action set-dscp-transmit 26 exceed-action drop
 class voice
  police 64000 2000 2000 conform-action set-dscp-transmit 46 exceed-action drop
 class management
  police 24000 8000 16000 conform-action transmit exceed-action drop
 class class-default
  set ip dscp 0
!
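To make the conform / exceed behaviour of the police statements more concrete, here is a simplified single token bucket in Python. It is a sketch of the general policing idea, not of the IOS implementation: it assumes the first number is the rate in bits per second and the second the normal burst in bytes, and it ignores the third (maximum burst) value. The class name `Policer` and its method are hypothetical.

```python
class Policer:
    """Simplified single-rate token bucket (excess burst not modelled)."""

    def __init__(self, rate_bps: int, burst_bytes: int):
        self.rate_bytes_per_s = rate_bps / 8
        self.burst_bytes = burst_bytes
        self.tokens = float(burst_bytes)  # bucket starts full
        self.last_s = 0.0

    def offer(self, now_s: float, size_bytes: int) -> str:
        """Return 'conform' (mark and transmit) or 'exceed' (drop)."""
        elapsed = now_s - self.last_s
        self.last_s = now_s
        # Refill at the committed rate, capped at the burst size.
        self.tokens = min(self.burst_bytes,
                          self.tokens + elapsed * self.rate_bytes_per_s)
        if size_bytes <= self.tokens:
            self.tokens -= size_bytes
            return "conform"
        return "exceed"


if __name__ == "__main__":
    business = Policer(rate_bps=128000, burst_bytes=8000)
    # Six back-to-back 1500-byte packets: the burst allows only five.
    print([business.offer(0.0, 1500) for _ in range(6)])
    # One second later the bucket has refilled.
    print(business.offer(1.0, 1500))
```

In the configuration above, a conforming Business packet would be transmitted with DSCP 10 and an exceeding one dropped.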
The following is the required configuration for LPR marking of the locally generated management traffic. As discussed before, ACL 103 matches all management traffic.
!
ip local policy route-map management
!
route-map management permit 10
 match ip address 103
 ! here we simulate the set ip dscp 48 command
 set ip precedence 6
 set ip tos 0
!
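The comment in the route-map above relies on how the precedence and ToS settings combine into a DSCP value. The following Python sketch illustrates that arithmetic. The function name is illustrative only, and the sketch assumes that set ip precedence writes the top three bits of the ToS byte and that set ip tos writes the four-bit ToS field directly below them.

```python
def dscp_from_precedence_and_tos(precedence: int, tos: int) -> int:
    """Return the DSCP produced by an IP precedence (0-7) and a 4-bit ToS value (0-15).

    Assumption: precedence occupies bits 7-5 of the ToS byte and the
    ToS field occupies bits 4-1; DSCP is the upper six bits of that byte.
    """
    if not 0 <= precedence <= 7:
        raise ValueError("precedence must be in the range 0..7")
    if not 0 <= tos <= 15:
        raise ValueError("tos must be in the range 0..15")
    tos_byte = (precedence << 5) | (tos << 1)
    return tos_byte >> 2


# precedence 6 with tos 0, as in the management route-map
print(dscp_from_precedence_and_tos(6, 0))  # 48
```

Under the same assumption, precedence 1 with tos 4 gives DSCP 10 (AF11) and precedence 5 with tos 12 gives DSCP 46 (EF).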
However, the design option described in this chapter has not been recommended to ST, because it involves fairly complex implementation, provisioning and monitoring. It introduces complexity not only on the access layer; the QoS implementation in the core has to support it as well.
Instead of policing in each of the traffic classes, it is possible to introduce an in / out contract mechanism for the Business and Streaming traffic classes. The main reasons behind this recommendation are twofold.
First, in an MPLS / VPN environment, well-behaving customer sites should not be penalised by ill-behaving customer sites. A well-behaving customer site is a site which sends traffic into the network below the Ingress Committed Rate (ICR), on a per traffic class basis. An ill-behaving site sends traffic into the network above the ICR for a particular traffic class. The problem is that, if a well-behaving site and an ill-behaving site both send traffic to a third site, congestion might occur on the egress PE to that site. If there is no way of differentiating between the well-behaving traffic and the ill-behaving traffic, traffic from the well-behaving site might be dropped instead of traffic from the ill-behaving site. The introduction of an in / out contract traffic marking mechanism at the ingress CE will prevent this.
Second, the introduction of in / out contract traffic profiles will facilitate the capacity planning of the backbone network, which is shared among the different MPLS / VPN customers. The shared backbone network needs to be engineered and capacity planned only for the in-contract part of the customer traffic. When, in a second phase, QoS mechanisms are deployed in the core backbone network due to possible backbone congestion, it will be possible to differentiate the out-contract traffic from the in-contract traffic and, as a result, discard the out-contract traffic earlier.
The following would be the required configuration for police marking of the Business, Streaming and Voice traffic classes in the ST network:
The in-contract Business traffic is marked as AF11 (DSCP 10).
The out-contract Business traffic is marked as AF21 (DSCP 18).
The in-contract Streaming traffic is marked as AF31 (DSCP 26).
The out-contract Streaming traffic is marked as AF41 (DSCP 34).
The Voice traffic is marked as EF (DSCP 46). The notion of out-contract traffic does not apply to the jitter-sensitive Voice class (WRED is not applicable in LLQ).
!
policy-map customer_profile
 class business
  police 128000 8000 16000 conform-action set-dscp-transmit 10 exceed-action set-dscp-transmit 18
 class streaming
  police 64000 2000 2000 conform-action set-dscp-transmit 26 exceed-action set-dscp-transmit 34
 class voice
  police 64000 2000 2000 conform-action set-dscp-transmit 46 exceed-action drop
!
The following figure depicts the in/out-contract marking in the Business and Streaming traffic classes. As previously described, any packets beyond the subscribed bandwidth of the Business class would be re-coloured and subjected to a more aggressive WRED dropping profile.
Figure 59 (diagram label: IP Packet)
The following picture shows another marking/policing alternative with two SLA limits: if the traffic rate exceeds the SLA Limit, traffic is re-coloured as out-contract and sent to the wire. Then, if the traffic rate exceeds the Drop Limit, packets are unconditionally dropped.
This design variation can be implemented through 2 cascaded CAR statements. The first CAR statement will mark the in-contract traffic below the first rate threshold. The second CAR statement will mark the out-contract traffic between the first and second rate thresholds and will also drop the traffic above the second rate threshold. The following is the required configuration for CAR policing (dropping) of the Business and Streaming traffic classes above a second rate threshold. In this particular example, the Business in-contract traffic is limited to 128 Kbps, and the Business out-contract traffic is limited to 256 Kbps. The Streaming in-contract traffic is limited to 64 Kbps, and the Streaming out-contract traffic is limited to 96 Kbps. This can also be implemented using a two-rate policer as described in http://www.cisco.com/univercd/cc/td/doc/product/software/ios122/122newft/122t/122t4/ft2rtplc.htm, but this method is not yet recommended because the 12.2T IOS release is still relatively immature. Please note that Voice traffic is still dropped above the first (SLA-limit) threshold.
!
interface Serial0/1
 bandwidth 512
 rate-limit output access-group 100 128000 8000 16000 conform-action set-dscp-transmit 10 exceed-action continue
 rate-limit output access-group 100 128000 8000 16000 conform-action set-dscp-transmit 18 exceed-action drop
 rate-limit output access-group 101 64000 2000 2000 conform-action set-dscp-transmit 26 exceed-action continue
 rate-limit output access-group 101 32000 2000 2000 conform-action set-dscp-transmit 34 exceed-action drop
 rate-limit output access-group 102 64000 2000 2000 conform-action set-dscp-transmit 46 exceed-action drop
 encapsulation ppp
 clockrate 512000
!
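The effect of the two cascaded rate thresholds can be illustrated with a short Python sketch. It is a simplification under stated assumptions: the offered load is treated as a steady rate, burst sizes are ignored, and the function name is hypothetical rather than part of any Cisco tool.

```python
def split_by_contract(offered_bps: int, in_contract_bps: int, out_contract_bps: int):
    """Split a steady offered rate into (in_contract, out_contract, dropped), all in bps.

    in_contract_bps is the rate of the first CAR statement, out_contract_bps
    the additional rate admitted by the second statement. Bursts are ignored.
    """
    in_contract = min(offered_bps, in_contract_bps)
    excess = offered_bps - in_contract
    out_contract = min(excess, out_contract_bps)
    dropped = excess - out_contract
    return in_contract, out_contract, dropped


# Business class: 128 kbps in-contract plus a further 128 kbps out-contract
print(split_by_contract(300000, 128000, 128000))  # (128000, 128000, 44000)

# Streaming class: 64 kbps in-contract plus a further 32 kbps out-contract
print(split_by_contract(80000, 64000, 32000))     # (64000, 16000, 0)
```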
Figure 60 (diagram labels: Drop Limit, IP Packet)
Policing
Policing in the Voice traffic class is configured to provide rudimentary call admission control, limiting the level of voice traffic sent into the core network. The policing is carried out by the exceed-action option at the end of the police command. Anything over the bandwidth of the expected number of voice calls will not be forwarded. If a customer attempts to exceed this limit, all the calls flowing through that specific CE-PE connection could suffer a degradation in quality. However, this effect is much better than a single customer affecting all the other customers in the ST network that share a specific backbone link. The Business and Streaming traffic classes will also be policed to the subscribed SLA limits using MQCLI police commands. A few important points surrounding the policing implementation should be understood:
Policing propagates bursts to a certain extent. It does not shape the traffic flow and as such does not cause any packet delay.
Police bandwidths need to be configured in 8 Kbps multiples. This needs to be reflected in the ST service offerings.
Compared to CAR, police bandwidths include some layer-2 overhead (please see the Class Queuing chapter for details).
The police configuration requires the setting of the <normal-burst> (NB) and <excess-burst> (EB) parameters. These parameters are used in the token bucket algorithm of the police command. For TCP-oriented classes such as the Business class, the recommended settings for the rate limit normal and excess burst are:
NB = max(8000, RTT x Committed Rate in bytes per second)
EB = 2 x NB
where RTT is approximately 0.05 s.
The calculation result is rounded to the nearest 1000-byte boundary. The following table lists the recommended NB and EB values as a function of the access link speed.
Table 15 NB and EB settings
Link BW [kbps]   NB [byte]   EB [byte]
64               8000        16000
128              8000        16000
256              8000        16000
512              8000        16000
1024             8000        16000
2048             12800       25600
34368            214800      429600
100000           625000      1250000
155520           972000      1944000
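The NB and EB rule for TCP-oriented classes can be checked with a few lines of Python. This is a sketch only: the function name is illustrative, and the rounding to a 1000-byte boundary mentioned in the text is deliberately left out because the tabulated values (for example 12800) are listed unrounded.

```python
RTT_MS = 50  # round-trip time of about 0.05 s, as stated in the text


def burst_sizes(rate_kbps: int):
    """Return (NB, EB) in bytes for a TCP-oriented class policed at rate_kbps."""
    committed_bytes_per_s = rate_kbps * 1000 // 8
    nb = max(8000, committed_bytes_per_s * RTT_MS // 1000)
    eb = 2 * nb
    return nb, eb


for rate in (64, 1024, 2048, 34368, 100000, 155520):
    print(rate, burst_sizes(rate))
```

For 2048 kbps this yields NB = 12800 and EB = 25600, which agrees with the corresponding table row.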
The recommended settings for the rate limit normal and excess burst for VoIP-oriented classes such as the Voice class are:
NB = 2000
EB = NB (CBR-like policer to avoid jitter)
The following is a policing configuration example for a 512 kbps link. Please note that the configured police limits must match the class bandwidths defined for each of the traffic classes.
!
!
policy-map customer_profile
 class business
  police 128000 8000 16000 conform-action set-dscp-transmit 10 exceed-action drop
 class streaming
  police 64000 2000 2000 conform-action set-dscp-transmit 26 exceed-action drop
 class voice
  police 64000 2000 2000 conform-action set-dscp-transmit 46 exceed-action drop
 class management
  police 24000 8000 16000 conform-action transmit exceed-action drop
 class class-default
  ! Standard class is not policed
  set ip dscp 0
!
interface Serial0/1
 description CE-PE link
 bandwidth 512
 encapsulation ppp
 max-reserved-bandwidth 95
 service-policy output customer_profile
 clockrate 512000
!
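To make the role of the normal burst more tangible, the following Python sketch models a policer as a single token bucket. It is a deliberately simplified illustration and not the IOS implementation: the class name is hypothetical, the excess burst is not modelled, and packets are either conforming or exceeding.

```python
class SingleBucketPolicer:
    """Simplified policer: one token bucket, no excess-burst handling."""

    def __init__(self, rate_bps: int, normal_burst_bytes: int):
        self.fill_rate = rate_bps / 8.0           # tokens (bytes) added per second
        self.capacity = float(normal_burst_bytes)
        self.tokens = self.capacity               # the bucket starts full
        self.last_arrival = 0.0

    def offer(self, arrival_time: float, packet_bytes: int) -> str:
        """Return 'conform' or 'exceed' for a packet arriving at arrival_time (seconds)."""
        elapsed = arrival_time - self.last_arrival
        self.last_arrival = arrival_time
        # Refill for the elapsed time, never beyond the burst size
        self.tokens = min(self.capacity, self.tokens + elapsed * self.fill_rate)
        if packet_bytes <= self.tokens:
            self.tokens -= packet_bytes
            return "conform"
        return "exceed"


# Voice-class figures from the template: 64000 bps with a 2000-byte normal burst.
# Twelve 200-byte packets arrive back to back at t = 0.
policer = SingleBucketPolicer(64000, 2000)
results = [policer.offer(0.0, 200) for _ in range(12)]
print(results.count("conform"), results.count("exceed"))  # 10 2
```

The sketch shows why policing does not delay packets: a packet that finds too few tokens is marked as exceeding immediately instead of being queued.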
Class Queuing
Queuing within the classes is implemented through Low Latency Queuing (LLQ). LLQ is in fact the combination of Class Based Weighted Fair Queuing (CBWFQ) and Priority Queuing (PQ). The PQ is used for delay-sensitive traffic such as VoIP. LLQ is configured through the MQCLI. Different traffic classes (a maximum of 64 traffic classes can be defined on a single router) can be combined in a service policy, which acts as a kind of traffic profile. Each of the classes in the service policy will be assigned a minimum bandwidth according to the service contract that has been agreed with the customer. The minimum bandwidth that can be configured is 8 Kbps [25]. Under congestion, each of the traffic classes will have this minimum bandwidth available:
If one class is congested (and so experiences delay), the congestion is isolated from other classes, which still have a guaranteed minimum share of the link bandwidth.
If one class is under-utilised, other classes can use the available bandwidth [26]. All flows and classes get a proportionate share of the spare bandwidth. The proportion is dictated by the configured bandwidth of the classes: the higher the allocated bandwidth, the higher the proportion allocated. For flow-based weighted fair queuing, configurable in the default queue, the proportion of available bandwidth is allocated based on the precedence of the packets: the packets with the highest precedence values get the highest proportion of bandwidth.
This enables worst-case bounds on delay and jitter to be designed independently between the classes, whilst preventing any single class from being starved by over-utilisation of other classes. Other parameters, such as congestion avoidance and control parameters, can also be configured on a per-class basis. This will be discussed further on. The sum of the minimum bandwidths reserved for the customer traffic classes needs to be lower than the total link bandwidth. Some bandwidth needs to be reserved for management traffic and routing traffic. Since ST will offer a managed service, it needs to keep control over the CEs, even under congestion. The routing traffic, which is BGP or RIP in this case, also needs to have some minimum bandwidth available (8 Kbps or 1 %, whichever is larger). It should also be understood that the actual minimum bandwidths configured through MQCLI include the following layer 2 overhead, in contrast with CAR, which only includes pure layer 3 IP bandwidth. Overhead added by the hardware (CRC, flags) is not included in the MQCLI bandwidths [27].
The 8 bytes of SNAP/LLC overhead and 4 bytes of the 8-byte AAL5 trailer for ATM interfaces (the remaining 4 bytes of the AAL5 trailer, the CRC, are not taken into account). AAL5 padding is likewise not taken into account. The ATM cell overhead (5 bytes per cell payload of 48 bytes) is not taken into account.
The 4-byte Frame Relay overhead for Cisco Frame Relay encapsulation (additional overhead due to possible FRF.12 headers is not taken into account). CRC and flags overhead is not taken into account.
The 2 bytes of PPP encapsulation overhead.
Also, all reports will indicate the configured rates, and so include the L2 overhead. It is worth considering for ST to include the L2 overhead in traffic contracts with customers. This would ensure consistency between the contracted bandwidths and the performance reports. After defining the service policy in a policy-map, it needs to be applied on an interface (service-policy).
[25] On 10k series the granularity is 1/255th of link bw.
[26] Except on 10k and 12000 series, where LLQ is policed to configured class-bw.
[27] Except on 10k series, where MQCLI on ATM interfaces includes all layer-2 overhead.
By default, on the non-distributed router platforms (non VIP based), the sum of the minimum bandwidths needs to be lower than 75 % of the configured access bandwidth. Since the actual required sum of minimum bandwidths will probably be larger, this default parameter setting can be changed (max-reserved-bandwidth) up to 100 %. However, it is good design practice not to push the design boundaries to the edge without allowing any margin for error or unexpected traffic patterns. Therefore, it is still recommended to keep the sum of all minimum bandwidths below 100 %. Keeping the sum of all minimum bandwidths around 95 % will allow for unaccounted traffic such as layer 2 overhead, layer 2 keepalives, LMI (in the case of Frame Relay), etc. The following is the sample configuration for LLQ class queuing. Class bandwidths can be configured in [kbps] or in [%] of (max-res-bw - voice-bw). On 10000 series routers, the cumulative bandwidth applied to traffic classes must not exceed 99 % of the link bandwidth. The bandwidth is configurable in steps of 1/255 of the link (or PVC) bandwidth. This rule must be respected when configuring the class bandwidths on the CE router.
!
policy-map customer_profile
 class business
  bandwidth percent 30
 class streaming
  bandwidth percent 20
 class voice
  priority 64
 class management
  bandwidth percent 5
 class class-default
  bandwidth percent 45
!
interface Serial0/1
 bandwidth 512
 encapsulation ppp
 max-reserved-bandwidth 95
 service-policy output customer_profile
 clockrate 512000
!
In the configuration template above, the Voice traffic class has been allocated 64 kbps of link capacity. The priority command guarantees bandwidth to the priority class and restrains the flow of packets from the priority class: when the link is not congested, the priority class traffic is allowed to exceed its allocated bandwidth. When the link is congested, the priority class traffic above the allocated bandwidth is discarded (in any case, it will be policed to the contractual Voice class bandwidth). The Business, Streaming, Management and Standard classes will share the remaining max-reserved-bandwidth as configured. For example, the Streaming traffic class will receive a minimum bandwidth of ((512 x 95%) - 64) x 20% = 84 kbps in congestion periods.
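The minimum bandwidth of each class in the template can be recomputed with a small Python sketch. The function and variable names are illustrative; the percentages, link speed and priority bandwidth are those of the configuration template.

```python
def class_min_bandwidth_kbps(link_kbps, max_reserved_pct, priority_kbps, class_pct):
    """Minimum bandwidth of a 'bandwidth percent' class, in kbps.

    The percentage applies to what remains of the reservable bandwidth
    after the priority (voice) bandwidth has been subtracted.
    """
    reservable = link_kbps * max_reserved_pct / 100
    return (reservable - priority_kbps) * class_pct / 100


# 512 kbps link, max-reserved-bandwidth 95, priority 64
percentages = {"business": 30, "streaming": 20, "management": 5, "class-default": 45}
for name, pct in percentages.items():
    print(name, round(class_min_bandwidth_kbps(512, 95, 64, pct), 1))
```

For the Streaming class this gives roughly 84.5 kbps, in line with the 84 kbps quoted in the text.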
Congestion avoidance
Congestion avoidance techniques monitor network traffic loads in an effort to anticipate and avoid congestion at common network bottlenecks. Congestion avoidance is achieved through packet dropping. Among the more commonly used congestion avoidance mechanisms is Random Early Detection (RED), which is optimum for high-speed transit networks. Cisco IOS QoS includes an implementation of RED that, when configured, controls when the router drops packets. If Weighted Random Early Detection (WRED) is not configured, the router uses the cruder default packet drop mechanism called tail drop. WRED combines the capabilities of the RED algorithm with the IP Precedence feature. Within the section on WRED, the following related features are discussed:
Tail Drop. Tail drop is the default congestion avoidance behaviour when WRED is not configured. Tail drop treats all traffic equally and does not differentiate between classes of service within the same queue. Queues fill during periods of congestion. When the output queue is full and tail drop is in effect, packets are dropped until the congestion is eliminated and the queue is no longer full.
Weighted Random Early Detection. WRED avoids the global synchronisation problems that occur when tail drop is used as the congestion avoidance mechanism on the router. Global synchronisation occurs as waves of congestion crest, only to be followed by troughs during which the transmission link is not fully utilised. Global synchronisation of TCP hosts, for example, can occur because packets are dropped all at once. It manifests itself when multiple TCP hosts reduce their transmission rates in response to packet dropping, then increase their transmission rates once again when the congestion is reduced.
DiffServ compliant WRED
DiffServ Compliant WRED extends WRED to support Differentiated Services (DiffServ) and Assured Forwarding (AF) Per Hop Behavior (PHB). This feature enables customers to implement AF PHB by colouring packets according to differentiated services code point (DSCP) values and then assigning preferential drop probabilities to those packets. The dscp-based argument enables WRED to use the DSCP value of a packet when it calculates the drop probability for the packet. The prec-based argument enables WRED to use the IP Precedence value of a packet when it calculates the drop probability for the packet. After enabling WRED to use the DSCP value, the random-detect dscp command can be used to change the minimum and maximum packet thresholds for that DSCP value.
MPLS compliant WRED
The MPLS Compliant WRED feature enables WRED to use the MPLS EXP value when it calculates the drop probability for a packet. The MPLS EXP value consists of the 3 Experimental bits in the label header. MPLS-based WRED is automatically enabled if the transmitted packet has an MPLS header, and it uses the same values as the precedence configuration.
WRED operation
WRED is a congestion avoidance and control mechanism whereby packets are randomly dropped when the average class queue depth reaches a certain minimum threshold (min-threshold). As congestion increases, packets are randomly dropped with a rising drop probability up to a second threshold (max-threshold), where the drop probability equals 1 / mark-probability-denominator. Above max-threshold, packets are tail-dropped. The following picture depicts the WRED algorithm.
Figure 61
WRED Algorithm (y-axis: drop probability; x-axis marks: 0, minTH, maxTH)
WRED will selectively instruct TCP stacks to back-off by dropping packets. Obviously, WRED has no influence on UDP based applications (besides the fact that their packets will be dropped equally).
The average queue depth is calculated using the following formula:
new_average = old_average x (1 - 2^(-e)) + current_queue_depth x 2^(-e)
Here e is the exponential weighting constant. The larger this constant, the slower the WRED algorithm will react. The smaller this constant, the faster the WRED algorithm will react. The exponential weighting constant can be set on a per-class basis. The min-threshold, max-threshold and mark probability denominator can be set on a per precedence or per DSCP basis. The mark probability denominator should always be set to 1 (100 % drop probability at max-threshold).
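The influence of the exponential weighting constant on the reaction time can be illustrated with the following Python sketch. It applies the averaging formula repeatedly to a queue that jumps from empty to a constant depth; the function names and the scenario are illustrative assumptions.

```python
def update_average(old_average, current_queue_depth, e):
    """One WRED averaging step with exponential weighting constant e."""
    weight = 2.0 ** -e
    return old_average * (1 - weight) + current_queue_depth * weight


def updates_until(target, queue_depth, e):
    """Count averaging steps until the average, starting from 0, reaches target."""
    if target >= queue_depth:
        raise ValueError("target must be below the constant queue depth")
    average, steps = 0.0, 0
    while average < target:
        average = update_average(average, queue_depth, e)
        steps += 1
    return steps


# Number of updates before the average reaches half of a constant depth of 100 packets
for e in (1, 3, 9):
    print(e, updates_until(50, 100, e))
```

With e = 1 a single update is enough, with e = 3 six updates are needed, and larger values take correspondingly longer, which is the slower reaction described in the text.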
The minimum and maximum WRED threshold values are calculated on the basis of the allocated class bandwidth and not on the link bandwidth. This will yield the most realistic results. The following generic formula is used to derive WRED thresholds based on the maximum allowed delay:
The minimum and maximum queue thresholds for each of the service classes will be calculated as follows. In every case, B represents the class bandwidth in MTU-sized packets per second. For the ST MPLS network an MTU size of 1500 bytes is assumed.
Business Class, Medium Queue, max per-hop delay 100 ms:
Min-threshold = 0.03 x B
Max-threshold = 0.1 x B
On the core trunks the management traffic will be carried in the Business class. The management traffic has to be protected from customer traffic flows by a less aggressive packet drop policy. The following are the min and max thresholds for management traffic (DSCP 48) within the Business traffic class:
Min-threshold = 0.1 x B
Max-threshold = 0.2 x B
Streaming Class, Short Queue, max per-hop delay 50 ms:
Min-threshold = 0.015 x B
Max-threshold = 0.05 x B
Standard Class, Long Queue, max per-hop delay 150 ms:
Min-threshold = 0.045 x B
Max-threshold = 0.15 x B
For Voice traffic it is necessary to implement tail drop to minimise and predict delay/jitter under congestion conditions. Therefore, no WRED will be used for the Voice traffic class (except on the GSR). WRED will also not be applied to the management class. The WRED min-threshold and max-threshold settings (calculated on the basis of the class bandwidth) are as detailed in the following tables. They represent the values to be used across all platforms except for the GSR ENG-2 line cards. These will be presented in the GSR QoS design chapter later on. If ST wishes to offer a class-bw which is not included in the following tables, the min/max thresholds can be calculated as per the formulas above.
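The per-class threshold formulas above can be applied to an arbitrary class bandwidth with the following Python sketch. The names are illustrative, the results are left unrounded because the rounding rule is not stated in the text, and B is computed as the ceiling of the rate in 1500-byte packets per second.

```python
MTU_BYTES = 1500

# (min factor, max factor) per class, taken from the formulas above
CLASS_FACTORS = {
    "business": (0.03, 0.1),
    "management": (0.1, 0.2),   # DSCP 48 inside the Business class on core trunks
    "streaming": (0.015, 0.05),
    "standard": (0.045, 0.15),
}


def packets_per_second(rate_kbps: int) -> int:
    """B: class bandwidth expressed in MTU-sized packets per second (rounded up)."""
    bits_per_packet = 8 * MTU_BYTES
    return (rate_kbps * 1000 + bits_per_packet - 1) // bits_per_packet


def wred_thresholds(rate_kbps: int, traffic_class: str):
    """Return (min_threshold, max_threshold) in packets, unrounded."""
    b = packets_per_second(rate_kbps)
    min_factor, max_factor = CLASS_FACTORS[traffic_class]
    return min_factor * b, max_factor * b


print(packets_per_second(2048))             # 171
print(wred_thresholds(240000, "streaming"))
```

For the three customer classes the factors appear to follow one pattern, max-threshold = delay [s] x B and min-threshold = 0.3 x max-threshold. That pattern is inferred from the listed values and is not stated explicitly in the design.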
Table 16
Link Speed in kbps   B
64                   6
128                  11
256                  22
512                  43
1024                 86
2048                 171
10000                834
34684                2891
100000               8334
240000 600
For class bandwidths smaller than E1 (where the class is a percentage of the link), the calculated value will be less than 3 for the min-threshold and less than 9 for the max-threshold. Such small values would defeat the objectives of WRED, since the router would not allow for much burst and would react too aggressively in dropping packets. These values are therefore not considered in the calculations.
Table 17
Link Speed in kbps   B
64                   6
128                  11
256                  22
512                  43
1024                 86
2048                 171
10000                834
34684                2891
100000               8334
124400 156
155500 195
186600 234
10000
Link Speed in kbps   Min-threshold   Max-threshold
240000               300             1000
480000               600             2000
600000               750             2500
720000               900             3000
Table 18
Link Speed in kbps   B
64                   6
128                  11
256                  22
512                  43
1024                 86
2048                 171
10000                834
34684                2891
100000               8334
240000 900
Drop Probability
The drop probability at max-threshold for all classes will initially be configured as mark-probability-denominator = 1. This means that when the average queue length reaches the max-threshold, all packets will be dropped until the average goes below the max-threshold. The formula for this is:
drop probability at max-threshold = 1 / mpd
According to the formula above, setting the mpd to 2, for instance, means that at the max-threshold only half (50 %) of all packets are dropped. It also means that the rate at which the drop probability rises as the average queue length increases is lower than with an mpd of 1, since an mpd of 1 means that 1/1 or 100 % of packets are dropped at max-threshold. Why is it important to set mpd to 1 rather than to another value? The answer is predictability. When calculating the other values for WRED, we know that any packet above max-threshold is tail-dropped. Therefore, by setting the mpd to 1, we ensure a more realistic drop ratio throughout the WRED curve. If the value were set to 2, for instance, WRED would only drop enough packets to reach a 50 % drop ratio by the time the average queue depth reaches the max-threshold; then, all of a sudden, one packet takes the average over the max-threshold and the packet drops jump from 50 % to 100 %.
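The discontinuity described above can be made visible with a short Python sketch of the WRED drop curve. The sketch assumes a linear rise of the drop probability between min-threshold and max-threshold, which is the usual description of WRED but is not spelled out in this chapter; the function name and the threshold values are illustrative.

```python
def wred_drop_probability(avg_depth, min_th, max_th, mpd=1):
    """Drop probability for a given average queue depth.

    Assumption: the probability rises linearly from 0 at min_th to
    1/mpd at max_th; above max_th every packet is tail-dropped.
    """
    if avg_depth < min_th:
        return 0.0
    if avg_depth > max_th:
        return 1.0
    return (avg_depth - min_th) / (max_th - min_th) / mpd


# Illustrative thresholds of 30 and 100 packets
for mpd in (1, 2):
    at_max = wred_drop_probability(100, 30, 100, mpd)
    above_max = wred_drop_probability(101, 30, 100, mpd)
    print(mpd, at_max, above_max)
```

With mpd = 1 the curve reaches 1.0 exactly at max-threshold, so there is no jump; with mpd = 2 it reaches only 0.5 and then steps to 1.0 as soon as the average exceeds max-threshold.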
Here B is the rate of 1500-byte packets, i.e. CEILING(Rate[kbps] * 1000 / 8 / 1500). The configured exponential-weighting-constant (x) is applied to the router configuration as a negative power of 2. The relation between ewc and the configured value is:
ewc = 2^(-x)
1/ewc = 2^x
which can be rewritten as:
x = ln(1/ewc) / ln(2)
and the final formula for configured ewc is:
x = ln(B/10) / ln(2)   if Line Rate (core)/Committed Rate (edge) <= 34Mbps
x = ln(B) / ln(2)      if Line Rate (core)/Committed Rate (edge) > 34Mbps
Note: The exponential-weighting-constant parameter is calculated based on the Class Bandwidth value and NOT on the link rate. For the GSR12000, however, since it is not possible to configure it per class, the exponential-weighting-constant is calculated based on the link rate.
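The logarithmic relations above can be evaluated with the following Python sketch. The function names are illustrative. The divisor argument selects between the ln(B/10) and ln(B) variants; which variant applies depends on the rate condition in the text, so the sketch does not choose one itself, and it returns the unrounded exponent because the rounding rule is not stated.

```python
import math

MTU_BYTES = 1500


def packets_per_second(rate_kbps: int) -> int:
    """B = CEILING(Rate[kbps] * 1000 / 8 / 1500)."""
    bits_per_packet = 8 * MTU_BYTES
    return (rate_kbps * 1000 + bits_per_packet - 1) // bits_per_packet


def exponent_from_ewc(ewc: float) -> float:
    """x such that ewc = 2 ** -x, i.e. x = ln(1/ewc) / ln(2)."""
    return math.log(1 / ewc) / math.log(2)


def exponent_from_packet_rate(b: float, divisor: int = 10) -> float:
    """x = ln(B / divisor) / ln(2); use divisor=10 or divisor=1."""
    return math.log(b / divisor) / math.log(2)


b = packets_per_second(2048)
print(b, exponent_from_packet_rate(b, 10), exponent_from_packet_rate(b, 1))
```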
The ewc for the Standard class (class-default) shall be based on the link rate. If the Class Bandwidth Allocation is configured as a percentage value in MQC, it should be converted to a value in Kbps for calculating the ewc. The following table lists the exponential-weighting-constant as a function of the link speed (GSR) or class speed (10xxx or smaller).
Table 19
Link Speed in kbps   B
32                   3
64                   6
128                  11
256                  22
512                  43
1024                 86
2048                 171
10000                834
34684                2891
10 1000 8 3468.4
10 10000
2500 11 387.5 9
interface Serial<x>
 description E1 link towards PE
 bandwidth 2000
 encapsulation ppp
 max-reserved-bandwidth 95
 service-policy output SAA_profile
 clockrate 2000000
!
! Marking of locally originated SAA probes
!
ip local policy route-map Mark_SAA_probes
!
! Classify the SAA probes based on IP precedence
!
access-list 150 permit ip any any precedence 1
!
access-list 152 permit ip any any precedence 3
!
access-list 154 permit ip any any precedence 5
!
access-list 155 permit ip any any precedence 6
!
access-list 156 permit ip any any precedence 0
!
!
route-map Mark_SAA_probes permit 10
 match ip address 150 152
 set ip tos 4
!
route-map Mark_SAA_probes permit 20
 match ip address 154
 set ip tos 12
!
route-map Mark_SAA_probes permit 30
 match ip address 155 156
 set ip tos 0
!
Classification
The traffic can be classified on PE routers by matching the DSCP values, because all traffic has already been properly marked on the CEs when entering the network. Traffic classification on CE-PE connection is required only for packets received from unmanaged CEs and Internet connections as explained below.
Marking
No customer traffic packet marking would be performed on the PE, since all packets have already been marked appropriately on the ingress CEs. The management traffic generated locally on the PE will be marked through Local Policy Routing (LPR). The configuration template is the same as on the CE router.
Policing
Traffic has already been policed on the CE router, so there is no need to police the traffic coming from managed CE routers on the PE.
The second example shows how the police command can be used to limit the bandwidth on high-speed circuits to subscribed subrate of kbps.
!
policy-map limit_customer_512k
 class class-default
  police 512000 12800 25600 conform-action set-dscp-transmit 0 exceed-action drop
!
interface Serial 2/0/1:0
 bandwidth 2000
 description Link to unmanaged CE with subrate of 512kb
 service-policy input limit_customer_512k
!
Also, traffic received from upstream transit providers, peering partners and Internet customers must be marked with DSCP 0, to prevent precedence-spoofing attacks.
The following configuration template shows how to re-enforce the policing of traffic classes for unmanaged CE routers. The policy-map would have to be replicated and tuned for each customer. On the CE side, the QoS configuration template of managed CE can be reused for unmanaged CE routers.
!
! Customer has already classified and marked the IP packets on unmanaged CE
! The classification class-map is the same as with managed CE routers (the
! same config for all CEs)
!
class-map match-any voice
 match ip dscp 46
class-map match-any management
 match ip dscp 48
 match access-group 103
class-map match-any business
 match ip dscp 10
class-map match-all streaming
 match ip dscp 26
!
! ST must police the traffic classes according to SLA
! of that customer; this is customer-specific configuration and can result
! in a very long router configuration file.
!
policy-map CUSTx_police
 class business
  police <bps> <normal_burst> <ext_burst> conform-action transmit exceed-action
 class streaming
  police <bps> <normal_burst> <ext_burst> conform-action transmit exceed-action
 class voice
  police <bps> <normal_burst> <ext_burst> conform-action transmit exceed-action
 class management
  police <bps> <normal_burst> <ext_burst> conform-action transmit exceed-action
 class class-default
  set ip dscp 0
!
interface Serial 2/0/1:0
 bandwidth <bandwidth>
 description Link to unmanaged CE of customer X
 service-policy input CUSTx_police
SAA Routers
Traffic received from SAA router will be handled in the same way as packets received from managed CEs. This implies that the set_qos_group service policy shall be configured on SAA links in the same way as already explained for managed CE connections.
The following configuration example depicts the EXP-based classification on PE-P uplinks. MPLS frames need to be classified in order to perform queuing and apply the proper WRED drop policy.
!
class-map match-any business_management
 match mpls experimental 1 6
class-map match-any streaming
 match mpls experimental 3
class-map match-any voice
 match mpls experimental 5
!
Marking
IP packets will be encapsulated in MPLS frames when leaving the PE router. The DSCP code point value (i.e. the precedence bits) will be automatically mapped into EXP bits of MPLS label. No further configuration is needed.
Class queuing
The parameter setting for the reservable interface bandwidth has been changed from 75% (default) to 97% on 7206VXR PE routers. This provides enough space for unaccounted traffic such as layer 2 overhead, layer 2 keepalives, LMI (in the case of Frame Relay), etc. In the following configuration templates, all MQCLI class bandwidth calculations are based on this value. On 10000 series routers, the cumulative bandwidth applied to traffic classes must not exceed 99% of the link bandwidth. The bandwidth is configurable in steps of 1/255 of the link (or PVC) bandwidth. Furthermore, it is important to note that on 10000 series POS interfaces the calculation of the minimum class bandwidth is based on the available information bandwidth. For example, the basic rate of STM-1 POS interfaces is 155.520 Mbps. The available information bandwidth is 149.760 Mbps (155.520 Mbps minus SONET overhead). The available information bandwidth can be displayed with the following commands:
10K-PE#sh hardware pxf cpu queue pos 1/0/0
VCCI 2: Class ~ 0 class-default ... ID 291 Length/Max Res 0/1024 3 Dequeues 295173 Drops 0
10K-PE#sh hardware pxf cpu queue 291
ID (queue/packet-queue) ... Bandwidth Index ...
As already mentioned above, on 10000 series routers the LLQ is policed to the configured class-bw. This is done by default when configuring the priority <bw value> command. Nevertheless, a warning message will be displayed after entering this command:
10K-PE(config-cmap)#policy-map PE_P_155M
10K-PE(config-pmap)# class voice
10K-PE(config-pmap-c)# priority 37587
% This command is an unreleased and unsupported feature
For a period of time, the command will still work as it did in the past (even though the warning is displayed), but it is expected to be removed in future releases. We therefore recommend using the police command within the high-priority class on 10000 series routers, as shown below. The following is an example configuration for the class queuing on PE-to-P trunks. The same queuing template must be applied on primary and backup uplinks.
!
policy-map PE_P_155M
 class voice
  priority
  police 36312000 conform-action transmit exceed-action drop violate-action drop
 class business_management
  bandwidth 36317
 class streaming
  bandwidth 36317
 class class-default
  bandwidth 36317
!
Congestion avoidance
WRED is used for graded packet dropping in each traffic class. The DSCP-based WRED is currently supported on MPLS uplinks. The following configuration template will be used for congestion management on PE-to-P links. WRED thresholds and ewc are derived in the same way as for the CE-to-PE links.
!
policy-map PE-P
 class qos_group_business_management
  random-detect dscp-based
  random-detect exponential-weighting-constant 9
  random-detect dscp 10 117 388 1
  random-detect dscp 48 388 775 1
 class qos_group_streaming
  random-detect dscp-based
  random-detect exponential-weighting-constant 8
  random-detect dscp 26 49 162 1
 class class-default
  random-detect dscp-based
  random-detect exponential-weighting-constant 11
  random-detect dscp 0 146 485 1
!
In doing so, one applies the cos-group depending on what the destination slot is, thereby avoiding congestion on the switching fabric.
From-fabric or TX-COS MDRR
The from-fabric MDRR is a lot simpler in terms of configuration. The queueing occurs at the egress to the TX-queue. At this stage, the exit slot and interface speed are known. The cos-group is simply applied to the actual interface, just like a service policy is applied to an interface on a 7XXX platform.
The weights give a relative bandwidth for each queue when the interface is congested. The DRR algorithm de-queues data from each queue in turn, provided the queue has data to send. So if all the regular DRR queues have data in them, they will be serviced in the following order: 0-1-2-3-4-5-6-0-1-2-3-4-5-6...
On each pass through the cycle, a queue is allowed to de-queue a quantum Q of bytes that grows with the configured queue weight W. The packet de-queue quantum Qn of queue n is:
Qn = MTU + (Wn - 1) * 512
A weight of 1 is equivalent to giving the queue a quantum of one interface MTU. For each increment above 1, the quantum of the queue increases by 512 bytes. For example, if the MTU of a particular interface is 4470 and the weight of a queue is configured to be 3, then on each pass through the rotation 4470 + (3-1)*512 = 5494 bytes may be de-queued.
Suppose two normal DRR queues, Queue0 and Queue1, are used, with Queue0 configured with a weight of 1 and Queue1 with a weight of 9. If both queues are congested, on each pass through the rotation Queue0 is allowed to send 4470 bytes and Queue1 is allowed to send 4470 + (9-1)*512 = 8566 bytes. This gives traffic in Queue0 approximately 1/3 of the bandwidth and traffic in Queue1 about 2/3.
The low latency queue can be added to give more priority to certain traffic. The low latency queue can be given two different priorities within the group: strict priority or alternating priority. In strict priority, this queue is serviced whenever it is non-empty. To minimize jitter in the Voice class of the ST network, the LLQ will be configured in strict priority mode.
The following table gives an example of MDRR weights that can be used on the ST network as an initial queuing and class capacity definition. Weights have been calculated following the algorithm above. MTU on POS links is 4470.
Table 20 MDRR weights
Service Class    % of link BW  Queue        STM-1 Class BW [Mbps]  STM-1 Weight  STM-16 Class BW [Mbps]  STM-16 Weight
Voice            20            low latency  31                     10            480                     10
Business, Mgmt   30            2            48                     13            720                     13
Streaming        25            1            38                     10            600                     10
Standard         25            0            38                     10            600                     10
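As a quick check of the Table 20 weights, the quantum formula can be applied to the three regular queues. This is only a sketch: it assumes the stated POS MTU of 4470 bytes, assumes all three regular queues are congested, and ignores the strict-priority voice queue, which is serviced outside the rotation.

```latex
\begin{align*}
Q_{0} = Q_{1} &= 4470 + (10 - 1)\cdot 512 = 9078 \text{ bytes} \\
Q_{2} &= 4470 + (13 - 1)\cdot 512 = 10614 \text{ bytes} \\
\text{share}_{2} &= \frac{10614}{9078 + 9078 + 10614} = \frac{10614}{28770} \approx 0.369 \\
\text{share}_{0} = \text{share}_{1} &= \frac{9078}{28770} \approx 0.316
\end{align*}
```

These shares are close to the intended 30 : 25 : 25 split of the non-voice bandwidth (0.375, 0.3125 and 0.3125).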
Queue 2 will be used for the Business and Management traffic classes. Packets marked MPLS EXP 1 and 6 will be forwarded to this queue. 30% of the available physical bandwidth will be available for Business and Management traffic.
Queue 1 will be the Streaming data queue, for delay-sensitive traffic with variable packet sizes. Packets marked with MPLS EXP 3 will be forwarded to this queue. 25% of the available physical bandwidth will be available for streaming traffic.
Queue 0 will be for default-classified traffic, i.e. the Standard traffic class. MPLS EXP 0 will be forwarded to this queue. 25% of the available physical bandwidth will be available for best-effort traffic.
The following commands are an example configuration in the ST network. The same MDRR TX-COS configuration could be applied to STM-1 and STM-16 links, but the WRED parameters differ, so one cos-queue-group is needed per link capacity. However, the same cos-queue-group can be applied on both the RX and TX side, which reduces the size of the router configuration file. The precedence-based configuration acts on the EXP bits in the case of MPLS packets.
!
cos-queue-group STM<1,16>   ! Duplicated for each rate, same for RX and TX side
 prec 0 queue 0             ! Map the packet with PREC/EXP=0 into queue 0
 prec 1 queue 2
 prec 2 queue 2
 prec 3 queue 1
 prec 4 queue 1
 prec 5 queue low-latency
 prec 6 queue 2
 prec 7 queue 2
 queue 0 10
 queue 1 10
 queue 2 13
 queue low-latency strict-priority 10
!
interface pos 3/1
 description This is STM-1 backbone link
 tx-cos STM1
To-fabric or RX COS
In addition to the transmit COS, a receive COS will also be configured. The queues are identical to the interface transmit queues, but instead of being applied directly to the line interface they are built as a table and applied from the receive buffer to the backbone fabric buffers. With the cards supplied for ST, MDRR is supported in hardware, and each line card has eight COS queues per destination interface. With 16 destination slots and 16 interfaces per slot, the maximum number of COS queues is 16 x 16 x 8 = 2048. All the interfaces on a destination slot have the same COS parameters.
In the example, the slot-table-cos stm-to-fabric command defines the COS policy for destination line cards 2, 3 and 5, 6 based on the STM-1 and STM-16 cos-queue-groups. The rx-cos-slot command applies the stm-to-fabric slot-table-cos configuration to a particular slot (line card). As previously mentioned, the cos-groups will be applied as follows:
- For packets destined to a slot with an STM-1 line card, an STM-1 cos-group will be applied, regardless of what the source line card is.
- For packets destined to a slot with an STM-16 line card, an STM-1 cos-group is applied if the source line card is STM-1, and an STM-16 cos-group if the source is STM-16.
!
rx-cos-slot 2 STM1-TO-FABRIC    ! STM-1 interfaces in this slot
rx-cos-slot 3 STM1-TO-FABRIC    ! STM-1 interfaces in this slot
rx-cos-slot 5 STM16-TO-FABRIC   ! STM-16 interfaces in this slot
rx-cos-slot 6 STM16-TO-FABRIC   ! STM-16 interfaces in this slot
!
slot-table-cos STM1-TO-FABRIC
 destination-slot all STM1
!
slot-table-cos STM16-TO-FABRIC
 destination-slot 2 STM1
 destination-slot 3 STM1
 destination-slot 5 STM16
 destination-slot 6 STM16
!
Congestion management
WRED parameters on GSR routers will follow the guidelines already explained for CE and PE routers. The GSR-specific configuration is depicted in this chapter.
Table 22
Table 23
WRED Configuration
This is an example configuration template for WRED on STM-1 GSR links. In the case of ENG-2 line cards, the thresholds need to be adjusted as described above. Note that a "precedence x random-detect-label y" statement applies to IP packets with precedence x and also to MPLS frames with EXP bits set to x; y refers to the index of the WRED profile.
!
cos-queue-group STM1   ! Duplicated for each STM rate with appropriate WRED thresholds and EWC
 precedence 0 random-detect-label 0
 precedence 1 random-detect-label 1
 precedence 2 random-detect-label 0
 precedence 3 random-detect-label 3
 precedence 4 random-detect-label 0
 precedence 5 random-detect-label 5
 precedence 6 random-detect-label 6
 precedence 7 random-detect-label 6
 random-detect-label 0 146 485 1     ! Standard
 random-detect-label 1 117 388 1     ! Business
 random-detect-label 3 49 162 1      ! Streaming
 random-detect-label 5 180 181 1     ! Voice (3ms tail-drop of 64byt packets)
 random-detect-label 6 388 775 1     ! Routing & Management
 exponential-weighting-constant 10   ! 10 is default
!
Class queuing
The following is the sample configuration for the class queuing on PE-to-CE links. Please note that class bandwidths shall match with those configured on the CE side.
!
policy-map PE-CE
 class business
  bandwidth percent 35
 class streaming
  bandwidth percent 20
 class voice
  priority 64
 class management
  bandwidth percent 5
 class class-default
  bandwidth percent 40
!
interface Serial 2/0/1:1.1
 description PE-CE access layer link
 bandwidth 512
 encapsulation ppp
Congestion avoidance
WRED on the PE-CE link shall be configured with the same parameters as on the CE router. Below is a sample configuration template.
!
policy-map PE-CE
 class voice
 !
 class streaming
  random-detect dscp-based
  random-detect exponential-weighting-constant <x>
  random-detect dscp 26 <minTH> <maxTH> 1
 class business
  random-detect dscp-based
  random-detect exponential-weighting-constant <x>
  random-detect dscp 10 <minTH> <maxTH> 1
 class management
 !
 class class-default
  random-detect dscp-based
  random-detect exponential-weighting-constant <x>
  random-detect dscp 0 <minTH> <maxTH> 1
!
overhead is only partly taken into account in the LLQ bandwidths (the AAL5 trailer, but without the 4 bytes of the CRC and without any padding).
- RFC 1483 LLC encapsulation requires LLC, OUI and Ethertype headers to precede the IP datagram. This overhead amounts to 8 bytes per datagram. With an IP MTU of 576 bytes this adds a 1.37% overhead. This overhead is taken into account in the LLQ bandwidths.
- Operation, Administration and Maintenance (OAM) cell overhead. This overhead is not taken into account in the LLQ bandwidths.
Assuming an IP MTU of 576 bytes (the Internet inter-network default), each layer contributes the following percentage overhead to transmit an IP datagram.
Table 24 ATM Overhead
WAN Link     Protocol Layer     % Overhead
ATM STM-1
ATM E3
The following table summarises which overhead is or is not included in the MQCLI LLQ bandwidth statements.
Table 25 LLQ bandwidths and ATM
Overhead                                                      Length    Included in MQCLI
RFC 1483 LLC / SNAP header                                    8 bytes   Yes
AAL5 trailer                                                  8 bytes   Partially. 4-byte CRC field is not included
AAL5 padding to make last cell an even multiple of 48 bytes   Variable  No
ATM cell header                                               5 bytes   No
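As a worked illustration of the overheads in Table 25, consider a single 576-byte IP datagram carried over AAL5 with LLC/SNAP encapsulation. This is an illustrative estimate only; it assumes 48-byte cell payloads and 53-byte cells, and it ignores OAM cells.

```latex
\begin{align*}
\text{AAL5 PDU before padding} &= 576 + 8 + 8 = 592 \text{ bytes} \\
\text{cells} &= \left\lceil \tfrac{592}{48} \right\rceil = 13, \qquad \text{padding} = 13 \cdot 48 - 592 = 32 \text{ bytes} \\
\text{bytes on the wire} &= 13 \cdot 53 = 689 \\
\text{bytes counted by MQCLI} &= 576 + 8 + 4 = 588 \\
\frac{588}{689} &\approx 0.853, \qquad \frac{689 - 576}{576} \approx 0.196
\end{align*}
```

For this packet size roughly 85 % of the cell-level bandwidth is visible to the MQCLI. Smaller packets fare worse, since padding and cell headers weigh more heavily on them.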
Due to the significant ATM overhead that is not accounted for in the MQCLI bandwidths, it is recommended to allocate no more than 80 % (a conservative figure) of the total available ATM bandwidth to LLQ traffic classes. The ATM PVC bandwidth for a VBR-nrt ATM CoS is defined as the Sustained Cell Rate (SCR). In other words, no more than 80 % of a particular PVC SCR should be allocated in service policies attached to that PVC.
On the PE and CE routers (except on 75xx), the amount of bandwidth that can be allocated in service policies on an interface is controlled through the max-reserved-bandwidth command. The default is 75 %.
The following is the required configuration for applying the service policy to an ATM PVC. ATM traffic shaping needs to be configured on the ATM PVC. ATM traffic shaping is a mechanism that alters the traffic characteristics of a stream of cells on a connection to achieve better network efficiency by ensuring conformance at a policed remote ATM switch interface. Traffic shaping must maintain cell sequence integrity on a connection.
!
interface ATM5/1
 no ip address
 max-reserved-bandwidth 80
!
interface ATM5/1.50 point-to-point
 ip address n.n.n.n n.n.n.n
 pvc 50/105
  vbr-nrt <PCR> <SCR> <MBS>
  service-policy output customer_profile
!
High Availability
This chapter discusses the high availability component as it relates to the proposed architecture. Depending on the size of the content, this chapter and the next may be combined.
Security
This chapter discusses the security component as it relates to the proposed architecture. Depending on the size of the content, this chapter and the next may be combined. Some very general topics are presented here as a sample.
Password Management
Passwords and similar secrets (such as SNMP community strings) are the primary defence against unauthorized access to your router. The best way to handle most passwords is to maintain them on a TACACS+ or RADIUS authentication server. However, almost every router will still have a locally configured password for privileged access, and may also have other password information in its configuration file.
The enable secret command is used to set the password that grants privileged administrative access to the IOS system. An enable secret password should always be set. You should use enable secret, not the older enable password, because the latter uses a weak encryption algorithm. If no enable secret is set, and a password is configured for the console TTY line, the console password may be used to get privileged access, even from a remote VTY session. This is almost certainly not what you want, and is another reason to be certain to configure an enable secret.
The service password-encryption command directs the IOS software to encrypt the passwords, CHAP secrets, and similar data that are saved in its configuration file. This is useful for preventing casual observers from reading passwords, for example, when they happen to look at the screen over an administrator's shoulder. However, the algorithm used by service password-encryption is a simple Vigenere cipher; any competent amateur cryptographer could easily reverse it in at most a few hours. The algorithm was not designed to protect configuration files against serious analysis by even slightly sophisticated attackers, and should not be used for this purpose. Any Cisco configuration file that contains encrypted passwords should be treated with the same care used for a clear text list of those same passwords.
This weak encryption warning does not apply to passwords set with the enable secret command, but it does apply to passwords set with enable password. The enable secret command uses MD5 for password hashing. The algorithm has had considerable public review, and is not reversible as far as anybody at Cisco knows. It is, however, subject to dictionary attacks (a "dictionary attack" is having a computer try every word in a dictionary or other list of candidate passwords). It's therefore wise to keep your configuration file out of the hands of untrusted people, especially if you're not sure your passwords are well chosen.
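The password guidance above could be sketched as the following configuration fragment. The placeholder <strong-password> is hypothetical and must be replaced with a well-chosen secret; this is an illustration rather than a complete hardening template.

```
!
! Obscure line passwords and similar secrets in the stored configuration
service password-encryption
!
! Use the MD5-hashed enable secret and remove any legacy enable password
enable secret <strong-password>
no enable password
!
```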
Console Ports
It is important to remember that the console port of an IOS device has special privileges. In particular, if a BREAK signal is sent to the console port during the first few seconds after a reboot, the password recovery procedure can easily be used to take control of the system. This means that attackers who can interrupt power or induce a system crash, and who have access to the console port via a hardwired terminal, a modem, a terminal server, or some other network device, can take control of the system, even if they do not have physical access to it or the ability to log in to it normally. It follows that any modem or network device that gives access to the Cisco console port must itself be secured to a standard comparable to the security used for privileged access to the router. At a bare minimum, any console modem should be of a type that can require the dialup user to supply a password for access, and the modem password should be carefully managed.
Controlling TTYs
Local asynchronous terminals are less common than they once were, but they still exist in some installations. Unless the terminals are physically secured, and usually even if they are, the router should be configured to require users on local asynchronous terminals to log in before using the system. Most TTY ports in modern routers are either connected to external modems, or are implemented by integrated modems; securing these ports is obviously even more important than securing local terminal ports. By default, a remote user can establish a connection to a TTY line over the network; this is known as "reverse Telnet," and allows the remote user to interact with the terminal or modem connected to the TTY line. It is possible to apply password protection for such connections. Often, it is desirable to allow users to make connections to modem lines, so that they can make outgoing calls. However, this feature may allow a remote user to connect to a local asynchronous terminal port, or even to a dial-in modem port, and simulate the router's login prompt to steal passwords, or to do other things that may trick local users or interfere with their work. To disable this reverse Telnet feature, apply the configuration command transport input none to any asynchronous or modem line that should not be receiving connections from network users. If at all possible, do not use the same modems for both dial-in and dial-out, and do not allow reverse Telnet connections to the lines you use for dial-in.
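A minimal sketch of disabling reverse Telnet is shown below. The line range 1 16 is a hypothetical example of asynchronous lines and will differ by platform; only lines that should not receive connections from network users should be configured this way.

```
!
! Auxiliary port: accept no incoming network connections
line aux 0
 transport input none
!
! Hypothetical asynchronous line range used for dial-in only
line 1 16
 transport input none
!
```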
text Telnet. It's also usually a good idea to use the access-class command to restrict the IP addresses from which the VTY will accept connections.
A Cisco IOS device has a limited number of VTY lines (usually five). No additional remote interactive connections can be established if all of the VTYs are in use. This creates the opportunity for a denial-of-service attack; if an attacker can open remote sessions to all the VTYs on the system, the legitimate administrator may not be able to log in. The attacker does not have to log in to do this; the sessions can simply be left at the login prompt.
One way of reducing this exposure is to configure a more restrictive access-class command on the last VTY in the system than on the other VTYs. The last VTY (usually VTY 4) might be restricted to accept connections only from a single, specific administrative workstation, whereas the other VTYs might accept connections from any address in a corporate network.
Another useful tactic is to configure VTY timeouts using the exec-timeout command. This prevents an idle session from consuming a VTY indefinitely. Although its effectiveness against deliberate attacks is relatively limited, it also provides some protection against sessions accidentally left idle. Similarly, enabling TCP keepalives on incoming connections (with service tcp-keepalives-in) can help to guard against both malicious attacks and "orphaned" sessions caused by remote system crashes.
Disabling all non-IP-based remote access protocols, and using IPSec encryption for all remote interactive connections to the router, can provide complete VTY protection. IPSec is an extra-cost option, and its configuration is beyond the scope of this document.
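The VTY measures described above might be combined as in the following sketch. The addresses use the 192.0.2.0/24 documentation range and the access-list numbers are arbitrary; both are assumptions to be replaced with the real management network and administrative workstation.

```
!
! Detect orphaned incoming sessions
service tcp-keepalives-in
!
! Hypothetical management network and single administrative workstation
access-list 10 permit 192.0.2.0 0.0.0.255
access-list 11 permit host 192.0.2.10
!
line vty 0 3
 access-class 10 in
 exec-timeout 5 0
!
! Last VTY reserved for the administrative workstation only
line vty 4
 access-class 11 in
 exec-timeout 5 0
!
```

Reserving the last VTY in this way keeps one line available to the administrator even if the other four are held open at the login prompt.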
Logging
Cisco routers can record information about a variety of events, many of which have security significance. Logs can be invaluable in characterizing and responding to security incidents. The main types of logging used by Cisco routers are:
- AAA logging, which collects information about user dial-in connections, logins, logouts, HTTP accesses, privilege level changes, commands executed, and similar events. AAA log entries are sent to authentication servers using the TACACS or RADIUS protocols, and are recorded locally by those servers, typically in disk files. If you are using a TACACS or RADIUS server, you may wish to enable AAA logging of various sorts; this is done using AAA configuration commands such as aaa accounting.
- SNMP trap logging, which sends notifications of significant changes in system status to SNMP management stations.
- System logging, which records a large variety of events, depending on the system configuration. System logging events may be reported to a variety of destinations, including the following:
  o System console port (logging console).
  o Servers using the syslog protocol (logging <ip-address>, logging trap).
  o Sessions on VTYs and TTYs (logging monitor, terminal monitor).
  o Local buffer in router RAM (logging buffered).
Console logging shall be disabled during debugging of various router protocols to prevent the router from freezing.
From a security point of view, the most important events usually recorded by system logging are interface status changes, changes to the system configuration, access list matches, and events detected by the optional firewall and intrusion detection features.
Each system-logging event is tagged with an urgency level. The levels range from debugging information (at the lowest urgency) to major system emergencies. Each logging destination may be configured with a threshold urgency, and will receive logging events only at or above that threshold.
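A sketch of per-destination logging thresholds is given below. The syslog server address 192.0.2.20, the buffer size and the chosen severity levels are assumptions for illustration, not values taken from this design.

```
!
! Timestamp every log entry
service timestamps log datetime msec
!
! Keep log output off the console
no logging console
!
! Local RAM buffer and remote syslog server, both at informational and above
logging buffered 16384 informational
logging trap informational
logging 192.0.2.20
!
```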
Anti-spoofing
Many network attacks rely on an attacker falsifying, or spoofing, the source addresses of IP datagrams. Some attacks rely on spoofing to work at all, and other attacks are much harder to trace if the attacker can use somebody else's address. Therefore, it is valuable for network administrators to prevent spoofing wherever feasible.
Anti-spoofing should be done at every point in the network where it is practical, but is usually both easiest and most effective at the borders between large address blocks, or between domains of network administration. It is usually impractical to do anti-spoofing on every router in a network, because of the difficulty of determining which source addresses may legitimately appear on any given interface.
For an Internet service provider, effective anti-spoofing, together with other effective security measures, can cause expensive, annoying problem subscribers to take their business to other providers. ISPs should be especially careful to apply anti-spoofing controls at dialup pools and other end-user connection points (see also RFC 2267).
Administrators of firewalls or perimeter routers sometimes install anti-spoofing measures to prevent hosts on the Internet from assuming the addresses of internal hosts, but do not take steps to prevent internal hosts from assuming the addresses of hosts on the Internet. It's a far better idea to try to prevent spoofing in both directions. There are at least three good reasons for doing anti-spoofing in both directions at an organizational firewall:
- Internal users will be less tempted to try launching network attacks and less likely to succeed if they do try.
- Wrongly configured internal hosts will be less likely to cause trouble for remote sites.
- Outside crackers often break into networks as launching pads for further attacks. These crackers may be less interested in a network with outgoing spoofing protection.
Inbound anti-spoofing filters are implemented on IPv4 (Internet) connections:
- IPv4 CPE-PE interfaces on the PE routers.
- Peering interfaces on iGWs.
- Virtual-template and other dialup interfaces.
- Access interfaces on IPv4 CE routers.
Access list 101 consists of the following major sections:
- Block packets with an invalid or prohibited source IP address from being sent towards or across the ST backbone.
- Improve protection of P and RR routers by only allowing PING and TRACEROUTE traffic to hit the IP address block 213.81.248.0/20 (i.e. the address block reserved for backbone links).
- Allow any other packet that is not destined towards the ST backbone (i.e. transit traffic).
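The three sections of access list 101 could be sketched roughly as follows. This is not the actual ST access list: the set of prohibited source prefixes, the UDP port range assumed for traceroute (33434 to 33534) and the interface placeholder are all assumptions made for illustration.

```
!
! Section 1: drop packets with invalid or prohibited source addresses
access-list 101 deny   ip 10.0.0.0 0.255.255.255 any
access-list 101 deny   ip 172.16.0.0 0.15.255.255 any
access-list 101 deny   ip 192.168.0.0 0.0.255.255 any
access-list 101 deny   ip 127.0.0.0 0.255.255.255 any
!
! Section 2: only ping and traceroute may reach the backbone link block
access-list 101 permit icmp any 213.81.248.0 0.0.15.255 echo
access-list 101 permit udp any 213.81.248.0 0.0.15.255 range 33434 33534
access-list 101 deny   ip any 213.81.248.0 0.0.15.255
!
! Section 3: everything else is transit traffic
access-list 101 permit ip any any
!
interface <CPE-facing interface>
 ip access-group 101 in
!
```

The wildcard mask 0.0.15.255 corresponds to the /20 prefix length of the backbone block.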
IP Source Routing
The IP protocol supports source routing options that allow the sender of an IP datagram to control the route that datagram will take toward its ultimate destination, and generally the route that any reply will take. These options are rarely used for legitimate purposes in real networks. Some older IP implementations do not process source-routed packets properly, and it may be possible to crash machines running these implementations by sending them datagrams with source routing options. A Cisco router with no ip source-route set will never forward an IP packet that carries a source routing option. You should use this command unless you know that your network needs source routing.
ICMP Redirects
An ICMP redirect message instructs an end node to use a specific router as its path to a particular destination. In a properly functioning IP network, a router will send redirects only to hosts on its own local subnets, no end node will ever send a redirect, and no redirect will ever traverse more than one network hop. However, an attacker may violate these rules; some attacks are based on this. It is a good idea to filter out incoming ICMP redirects at the input interfaces of any router that lies at a border between administrative domains, and it is not unreasonable for any access list that is applied on the input side of a Cisco router interface to filter out all ICMP redirects. This will cause no operational impact in a correctly configured network. Note that this filtering prevents only redirect attacks launched by remote attackers. It's still possible for attackers to cause significant trouble using redirects if their host is directly connected to the same segment as a host that's under attack.
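A minimal sketch of an inbound filter that drops ICMP redirects is shown below. The access-list number 110 and the interface placeholder are hypothetical; in practice the deny statement would normally be added near the top of whatever inbound access list is already applied on the border interface.

```
!
! Drop all incoming ICMP redirects, pass everything else
access-list 110 deny   icmp any any redirect
access-list 110 permit ip any any
!
interface <border-facing interface>
 ip access-group 110 in
!
```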
CDP
Cisco Discovery Protocol (CDP) is used for some network management functions, but is dangerous in that it allows any system on a directly connected segment to learn that the router is a Cisco device, and to determine the model number and the Cisco IOS software version being run. This information may in turn be used to design attacks against the router. CDP information is accessible only to directly connected systems. CDP may be disabled globally with the global configuration command no cdp run, or on a particular interface with no cdp enable.
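The two ways of disabling CDP could look as follows. The interface placeholder is hypothetical, and the two fragments are alternatives: the global command turns CDP off everywhere, whereas the interface command leaves it running on other interfaces.

```
!
! Alternative 1: disable CDP on the whole router
no cdp run
!
! Alternative 2: keep CDP globally, disable it on an untrusted interface
interface <customer-facing interface>
 no cdp enable
!
```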
NTP
The Network Time Protocol (NTP) is a protocol used to time-synchronize network devices. NTP runs over UDP and is documented in RFC 1305. An NTP stratum 1 server should get its time from an authoritative time source, such as a GPS system or an atomic clock attached to a timeserver. NTP then distributes this time across the network. NTP is a very sophisticated and efficient protocol, which needs only one packet per minute to synchronize two machines to within a millisecond of one another.
NTP uses the concept of a "stratum" to describe how many NTP "hops" away a machine is from an authoritative time source. A "stratum 1" time source has a reference clock such as a GPS or atomic clock directly attached, a "stratum 2" time source receives its time from a "stratum 1" time source, and so on. This hop count isn't related to the IP hops between two NTP time sources. A device running NTP automatically chooses the lowest stratum timeserver as its time source. It only talks and listens to servers for which it has a configuration entry.
To avoid synchronization problems, NTP has two methods to determine the validity of the time source. NTP will never synchronize to a device that is not itself synchronized. It will also not synchronize to a source whose time is significantly different from all the other time sources.
The NTP configuration is usually static. Every device has a list of IP addresses with which it will exchange NTP messages. These communication agreements are called associations. On LAN segments NTP can use IP broadcast messages as well.
With Cisco, two mechanisms are available to secure the communication: an access list-based restriction scheme and an encrypted authentication mechanism. A limitation of Cisco's implementation is that it doesn't support stratum 1 service, which means a reference clock such as a GPS or atomic clock cannot be connected directly to the Cisco box. NTP is a very valuable tool for reporting and troubleshooting, because cause and effect of problems can be clearly correlated. Care must be taken over where the time information comes from, especially if additional time sources from the Internet are used as a reference. Confusing the time system can render system log files completely useless.
The Network Time Protocol (NTP) will be used to synchronize router clocks. NTP authentication will be used to secure NTP associations. The loopback0 address is used to form NTP associations.
ntp authentication-key 1 md5 *&^^&*_(_ 7
ntp authenticate
ntp trusted-key 1
ntp source Loopback0
ntp update-calendar
ntp server <P1 loopback> key 1
ntp server <P2 loopback> key 1
P1 and P2 shall synchronise with an external timeserver. The P1 and P2 routers also synchronise with each other as NTP peers to increase stability. Any other device in the ST network becomes an NTP client of P1 and P2.
ntp peer <P1 loopback>
ntp server <ST public NTP server>
ntp master 5
ntp update-calendar
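The access list-based restriction scheme mentioned earlier could be sketched on an NTP client router as follows. The access-list number 20 is arbitrary, and the placeholders stand for the P1 and P2 loopback addresses used elsewhere in this chapter; this is an assumption-laden illustration rather than part of the design.

```
!
! Only P1 and P2 may act as time sources for this router
access-list 20 permit host <P1 loopback>
access-list 20 permit host <P2 loopback>
access-list 20 deny   any
!
ntp access-group peer 20
!
```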
Network Management
Depending on the size of the content, there may be a separate LLD on Network Management. In that case, include a reference to that document here. Otherwise, discuss all aspects of Network Management in detail here.
Appendix I
Appendix II