Introduction To Networking Monitoring and Management
Introduction To Networking Monitoring and Management
Introduction To Networking Monitoring and Management
Management
Introduction to Networking
Monitoring and Management
These materials are licensed under the Creative Commons Attribution-Noncommercial 3.0 Unported license
(http://creativecommons.org/licenses/by-nc/3.0/) as part of the ICANN, ISOC and NSRC Registry Operations Curriculum.
Part I: Overview
Core concepts presented:
– What is network monitoring
– What is network management
– Getting started
– Why network management
– The big three
– Attack detection
– Documentation
– Consolidating the data
– The big picture
Network Monitoring
One definition…
Operation: keeping the network (and the services that the network provides) up and running
smoothly. It includes monitoring the network to spot problems as soon as possible, ideally before
users are affected.
Administration: deals with keeping track of resources in the network and how they are assigned.
Maintenance: concerned with performing repairs and upgrades. Maintenance also involves
corrective and preventive measures to make the managed network run "better”.
Provisioning: is concerned with configuring resources in the network to support a given service.
FCAPS
Fault, Configuration, Accounting, Performance and Security
(The ISO Telecommunications Management Network model and framework for network management)
Source: wikipedia
Network Management Details
We Monitor
• System & Services
– Available, reachable
• Resources
– Expansion planning, maintain availability
• Performance
– Round-trip-time, throughput
• Changes and configurations
– Documentation, revision control, logging
Network Management Details
We Keep Track Of
• Statistics
– For purposes of accounting and metering
• Faults (Intrusion Detection)
– Detection of issues,
– Troubleshooting issues and tracking their history
Availability
– Nagios Services, servers, routers,
switches
Reliability
– Smokeping Connection health, rtt, service
response time, latency
Performance
– Cacti Total traffic, port usage, CPU
RAM, Disk, processes
Nice…
Network Documentation
More automation might be needed. An
automated network documentation system
is something to consider.
– You can write local scripts to do this.
– You can consider some automated
documentation systems.
– You’ll probably end up doing both.
Automated Systems
There are quite a few automated network
documentation systems. Each tends to do
something different:
– IPplan:
http://iptrack.sourceforge.net/
– Netdisco:
http://netdisco.org/
– Netdot:
https://netdot.uoregon.edu/
– Rack Tables:
http://www.racktables.org/
Consolidating the data
The Network Operations Center (NOC)
“Where it all happens”
Coordination of tasks
Status of network and services
Fielding of network-related incidents and
complaints
Where the tools reside (”NOC server”)
Documentation including:
Network diagrams
database/flat file of each port on each switch
Network description
Much more as you'll see.
The big picture
- Monitoring Notifications
- Data collection
- Accounting
Ticket
- Change control &
monitoring
- Capacity planning
Ticket
- NOC Tools - Availability (SLAs)
- Ticket system - Trends
- Detect problems
Ticket
Ticket
- Improvements
- Upgrades
Ticket - User complaints
- Requests
- Fix problems
A few Open Source solutions…
Performance Change Mgmt Net Management
Cricket Mercurial Big Brother
IFPFM Rancid* (routers) Big Sister
flowc CVS* Cacti*
mrtg* Subversion Hyperic
NetFlow* git* Munin
NfSen* Security/NIDS Nagios*
ntop Nessus OpenNMS
perfSONAR OSSEC Sysmon
pmacct Prelude Zabbix
rrdtool* Samhain Documentation
SmokePing* SNORT • IPplan
Ticketing Untangle • Netdisco
RT* Logging • Netdot*
Trac*, • swatch* • Rack Table
Redmine • syslog/rsyslog* Protocols/Utilities
• tenshi* • SNMP*, Perl, ping
Questions?
?
Part II: Details
Lots of screenshots:
http://iptrack.sourceforge.net/doku.php?id=screenshots
Netdisco:
• Project launched 2003. Version 1.0
released October 2009.
• Some popular uses of Netdisco:
– Locate a machine on the network by MAC or IP and
show the switch port it lives at.
– Turn Off a switch port while leaving an audit trail.
Admins log why a port was shut down.
– Inventory your network hardware by model, vendor,
switch-card, firmware and operating system.
– Report on IP address and switch port usage: historical
and current.
– Pretty pictures of your network.
Netdot:
Includes functionality of IPplan and Netdisco
and more. Core functionality includes:
– Device discovery via SNMP
– Layer2 topology discovery and graphs, using:
• CDP/LLDP
• Spanning Tree Protocol
• Switch forwarding tables
• Router point-to-point subnets
– IPv4 and IPv6 address space management (IPAM)
• Address space visualization
• DNS/DHCP config management
• IP and MAC address tracking
Continued
Netdot:
Functionality continued:
– Cable plant (sites, fiber, copper, closets, circuits...)
– Contacts (departments, providers, vendors, etc.)
– Export scripts for various tools (Nagios, Sysmon,
RANCID, Cacti, etc)
• I.E., how we could automate node creation in Cacti!
– Multi-level user access: Admin, Operator, User
– It draws pretty pictures of your network
Documentation: Diagrams
Diagramming Software
3. Performance Tools
Key is to look at each router interface (probably
don’t need to look at switch ports).
Two common tools:
- Netflow/NfSen: http://nfsen.sourceforge.net/
- MRTG: http://oss.oetiker.ch/mrtg/
MRTG = “Multi
Router Traffic
Grapher”
Network monitoring systems & tools
Active tools
Ping – test connectivity to a host
Traceroute – show path to a host
MTR – combination of ping + traceroute
SNMP collectors (polling)
Passive tools
log monitoring, SNMP trap receivers, NetFlow
Automated tools
SmokePing – record and graph latency to a set of hosts,
using ICMP (Ping) or other protocols
MRTG/RRD – record and graph bandwidth usage on a
switch port or network link, at regular intervals
Network monitoring systems & tools
A few tools:
SNORT - a commonly used open source tool:
http://www.snort.org/
Prelude – Security Information Management System
https://dev.prelude-technologies.com/
Samhain – Centralized HIDS
http://la-samhna.de/samhain/
Nessus - scan for vulnerabilities:
http://www.nessus.org/download/
Configuration mgmt & monitoring
Record changes to equipment configuration using
revision control (also for configuration files)
Inventory management (equipment, IPs,
interfaces)
Use versioning control
As simple as:
”cp named.conf named.conf.20070827-01”
For plain configuration files:
CVS, Subversion (SVN)
Mercurial
• For routers:
- RANCID
Configuration mgmt & monitoring
Traditionally, used for source code (programs)
Works well for any text-based configuration files
Also for binary files, but less easy to see differences
For network equipment:
RANCID (Automatic Cisco configuration retrieval and
archiving, also for other equipment types)
Built-in to Project Management Software like:
Trac
Redmine
And, many other wiki products. Excellent for
documenting your network.
The big picture revisited
- Monitoring Notifications
- Data collection
- Accounting
Ticket
- Change control &
monitoring
- Capacity planning
Ticket
- NOC Tools - Availability (SLAs)
- Ticket system - Trends
- Detect problems
Ticket
Ticket
- Improvements
- Upgrades
Ticket - User complaints
- Requests
- Fix problems
Questions