Service Guide: RS/6000 7025 F50 Series
Service Guide: RS/6000 7025 F50 Series
Service Guide: RS/6000 7025 F50 Series
Service Guide
SA38-0541-01
The following paragraph does not apply to the United Kingdom or any country where
such provisions are inconsistent with local law: THIS PUBLICATION IS PROVIDED “AS
IS” WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING,
BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS
FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied
warranties in certain transactions, therefore, this statement may not apply to you.
This publication could include technical inaccuracies or typographical errors. Changes are
periodically made to the information herein; these changes will be incorporated in new editions
of the publication. The manufacturer may make improvements and/or changes in the
product(s) and/or the program(s) described in this publication at any time, without notice.
It is possible that this publication may contain reference to, or information about, products
(machines and programs), programming, or services that are not announced in your country.
Such references or information must not be construed to mean that these products,
programming, or services will be announced in your country. Any reference to a specific
licensed program in this publication is not intended to state or imply that you can use only that
licensed program. You can use any functionally equivalent program instead.
Requests for technical information about products should be made to your authorized reseller
or marketing representative.
Safety Notices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xi
Electrical Safety . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xi
Laser Safety Information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii
iv Service Guide
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . X-1
Preface v
Downloaded from www.Manualslib.com manuals search engine
vi Service Guide
Properly shielded and grounded cables and connectors must be used in order to
meet FCC emission limits. Neither the provider nor the manufacturer are responsible
for any radio or television interference caused by using other than recommended
cables and connectors or by unauthorized changes or modifications to this
equipment. Unauthorized changes or modifications could void the user's authority to
operate the equipment.
This device complies with Part 15 of the FCC Rules. Operation is subject to the
following two conditions: (1) this device may not cause harmful interference,and (2)
this device must accept any interference received, including interference that may
cause undesired operation.
This product has been designed and built to comply with IEC Standard 950.
The network adapter interfaces housed within this equipment are approved
separately, each one having its own independent approval number. These interface
adapters, supplied by the manufacturer, do not use or contain excessive voltages. An
excessive voltage is one which exceeds 70.7 V peak ac or 120 V dc. They interface
with this equipment using Safe Extra Low Voltages only. In order to maintain the
separate (independent) approval of the manufacturer's adapters, it is essential that
Preface vii
Downloaded from www.Manualslib.com manuals search engine
other optional cards, not supplied by the manufacturer, do not use main voltages or
any other excessive voltages. Seek advice from a competent engineer before
installing other adapters not supplied by the manufacturer.
Neither the provider nor the manufacturer can accept responsibility for any failure to
satisfy the protection requirements resulting from a non–recommended modification
of the product, including the fitting of option cards not supplied by the manufacturer.
This product has been tested and found to comply with the limits for Class B
Information Technology Equipment according to CISPR 22 / European Standard EN
55022. The limits for Class B equipment were derived for typical residential
environments to provide reasonable protection against interference with licensed
communication devices.
VCCI Statement
The following is a summary of the VCCI Japanese statement in the box above.
When used near a radio or TV receiver, it may become the cause of radio
interference.
Dieses Gerät ist berechtigt in Übereinstimmung mit dem deutschen EMVG vom
9.Nov.92 das EG–Konformitätszeichen zu führen.
Preface ix
Downloaded from www.Manualslib.com manuals search engine
Note: If any of the following Feature Codes are installed:
3615, 4356, 4357, 4358, 4359, 6218, 8130, or 8134,
or if a configuration using more than one SysKonnect FDDI adapter (Feature
Codes 2741, 2742, or 2743) where any one of them is Feature Code 2743
(SysKonnect SK-NET FDDI-UP SAS PCI), the following statement applies:
This product has been tested and found to comply with the limits for Class A
Information Technology Equipment according to CISPR 22 / European Standard EN
55022. The limits for Class A equipment were derived for commercial and industrial
environments to provide reasonable protection against interference with licensed
communication equipment.
VCCI Statement
The following is a summary of the VCCI Japanese statement in the box above.
This is a Class A product based on the standard of the Voluntary Control Council for
Interference by Information Technology Equipment (VCCI). If this equipment is used
in a domestic environment, radio disturbance may arise. When such trouble occurs,
the user may be required to take corrective actions.
x Service Guide
A danger notice indicates the presence of a hazard that has the potential of causing
death or serious personal injury.
A caution notice indicates the presence of a hazard that has the potential of causing
moderate or minor personal injury.
Electrical Safety
Observe the following safety instructions any time you are connecting or
disconnecting devices attached to the workstation.
DANGER
CAUTION:
This product is equipped with a three–wire power cable and plug for the user's
safety. Use this power cable with a properly grounded electrical outlet to avoid
electrical shock.
Preface xi
Downloaded from www.Manualslib.com manuals search engine
DANGER
The optical drive in this system unit is certified in the U.S. to conform to the
requirements of the Department of Health and Human Services 21 Code of Federal
Regulations (DHHS 21 CFR) Subchapter J for Class 1 laser products. Elsewhere,
the drive is certified to conform to the requirements of the International
Electrotechnical Commission (IEC) 825 (1st edition 1984) and CENELEC EN 60
825:1991 for Class 1 laser products.
CAUTION:
A class 3 laser is contained in the device. Do not attempt to operate the drive
while it is disassembled. Do not attempt to open the covers of the drive as it
is not serviceable and is to be replaced as a unit.
Class 1 laser products are not considered to be hazardous. The optical drive
contains internally a Class 3B gallium-arsenide laser that is nominally 30 milliwatts at
830 nanometers. The design incorporates a combination of enclosures, electronics,
and redundant interlocks such that there is no exposure to laser radiation above a
Class 1 level during normal operation, user maintenance, or servicing conditions.
Preface xiii
Downloaded from www.Manualslib.com manuals search engine
xiv Service Guide
This book provides maintenance information that is specific to the system unit,
adapters, and attached devices that do not have their own service information. It also
contains Maintenance Analysis Procedures (MAPs) that are not common to other
systems.
MAPs that are common to all systems are contained in the IBM RS/6000 Diagnostic
Information for Multiple Bus Systems.
This book is used by the service technician to repair system failures. This book
assumes that the service technician has had training on the system unit.
ISO 9000
ISO 9000 registered quality systems were used in the development and
manufacturing of this product.
Related Publications
Preface xv
Downloaded from www.Manualslib.com manuals search engine
xvi Service Guide
Note: The SCSI bus ID's in Bays A1, B1 and B2 are the recommended values.
Features installed at the manufacturing site correspond to these IDs. Field
installations may not comply with these recommendations.
After the A/C power is turned on, the System Support Controller (SSC) startup
begins, and releases reset to the Service Processor. If the SSC cannot
communicate with the Service Processor, the LCD displays 4BA00000. If the
Service Processor is not present, the LCD displays 4BA00001.
1. Service Processor Self Test
Service Processor card performs self test and NVRAM initialization.
LCD Code range is E000 - E07F.
LCD code is "OK" when complete.
2. Service Processor in Standby Mode
You can enter the Service Processor menus whenever the LCD code is
"OK", "STBY", or has an eight digit error code on the LCD display by
pressing the enter key on an ASCII terminal connected to serial port 1.
3. Built-In-Self-Test (BIST)
The Service Processor initiates Built-In-Self-Test (BIST) on the Central
Electronics Complex (CEC) chips when the POWER Button is pressed.
The VPD data is read and the CRC is checked.
The CPU compatibility test is run.
LCD Code range is E0A0 - E0E1.
4. System Initialization
System firmware begins to execute and initializes system registers after LCD
code E0E1.
LCD Code range is E1XX - E2XX.
5. Memory Test
The system firmware tests the system memory and identifies failing memory
cards and memory module locations.
LCD Code range is E3XX.
6. Device Configuration and Test
System firmware checks to see what devices are in the system and performs
a simple test on them.
The system firmware displays the device name or device icon being tested.
After the keyboard name or icon appears, the user can enter the Systems
Management Services menu by hitting the "1" key (if ASCII terminal) or the
"F1" key (if Graphics terminal).
The user can also enter one of the following:
– "5" or "F5" to start the Stand-alone Diagnostics (CD or Tape).
– "6" or "F6" to start the On-line Diagnostics (Hard Disk)
Specifications
Dimensions
Weight
Configuration dependent
Power Requirements
Power Factor
0.8 - 0.98
Operating Voltage
Acoustics
Power cables used in the United States and Canada are listed by Underwriter's
Laboratories (UL) and certified by the Canadian Standards Association (CSA).
These power cords consist of:
Electrical cables, Type SVT or SJT.
Attachment plugs complying with National Electrical Manufacturers Association
(NEMA) 5-15P. That is:
"For 115 V operation, use a UL listed cable set consisting of a minimum 18 AWG,
Type SVT or SJT three-conductor cord a maximum of 15 feet in length and a parallel
blade, grounding type attachment plug rated at 15 A, 125 V."
"For 230 V operation in the United States use a UL listed cable set consisting of a
minimum 18 AWG, Type SVT or SJT three-conductor cable a maximum of 15 feet in
length, and a tandem blade, grounding type attachment plug rated at 15 A, 250 V."
Appliance couplers complying with International Electrotechnical Commission
(IEC) Standard 320, Sheet C13.
Refer to Chapter 7, “ Parts Information” on page 7-1 to find the power cables that
are available.
Entry MAP
Notes:
1. If more than eight digits are displayed in the operator panel, use only the first
eight digits to find the error in the tables. The digits that display beyond the first
eight digits are location codes that can assist you in diagnosing the problem. See
“Location Codes” on page 3-46.
2. Licensed programs frequently rely on network configuration, and system
information stored on the VPD on the operator panel control assembly (connector
U2). If the MAPs indicate that the Operator Panel Control Assembly should be
replaced, swap the VPD from the old operator panel to the new one. If the old
VPD module has to be replaced call technical support for recovery instructions.
If recovery is not possible, notify the system owner that new keys for licensed
programs may be required.
3. If a network adapter, or the I/O planar is replaced, the network administrator
must be notified so that the client IP addresses used by the server can be
changed. In addition, the operating system configuration of the network
controller may need to be changed in order to enable system startup. Also
check to ensure that any client or server that addresses this system is updated.
Symptom Action
Service Actions
You have parts to exchange or a corrective 1. Go to the Removal and Replacement
action to perform. Procedures.
2. Go to "MAP 410: Repair Checkout" in the
IBM RS/6000 Diagnostic Information for
Multiple Bus Systems.
You need to verify that a part exchange or Go to "MAP 410: Repair Checkout" in the IBM
corrective action corrected the problem. RS/6000 Diagnostic Information for Multiple Bus
Systems.
Symptom Analysis
You have OK displayed The Service Processor (SP) is ready. The
system is waiting for power on.
You have STBY displayed The Service Processor (SP) is ready. The
system was shutdown by the operating system
and is still powered on. This condition can be
requested by a privileged system user with no
faults. See SP error log for possible operating
system fault indications.
You do not have a determined symptom. Go to “MAP 1020: Problem Determination” on
page 2-6.
You have an 8-digit error code displayed. Record the error code. Go to Chapter 3, “Error
Code to FRU Index” on page 3-1.
You have an SRN. Go to the Fast Path MAP in the IBM RS/6000
Diagnostic Information for Multiple Bus Systems.
The system POST indicators are displayed on Go to “E1xx Code Boot Problems” on
the system console, the system pauses and page 3-44.
then restarts. The term "POST indicators" refer
to the icons (graphic display) or device
mnemonics (ASCII terminal) that appear during
the power-on self-test (POST).
The system stops and POST indicators are 1. Use MAP 1540 to isolate the problem.
displayed on the system console. The term
"POST indicators" refer to the icons (graphic
display) or device mnemonics (ASCII terminal)
that appear during the power-on self-test
(POST).
The system stops and the message "STARTING Go to “Checkpoints” on page 3-29.
SOFTWARE PLEASE WAIT..." is displayed on
Use this MAP to get an error code if you were not provided one by the customer or
you are unable to load diagnostics. If you are able to load the diagnostics, go to
MAP 0020 in the IBM RS/6000 Diagnostic Information for Multiple Bus Systems.
The Service Processor may have recorded one or more symptoms in its error log. It
is a good idea to examine that error log before proceeding (see Service Processor
System Information Menu).
The Service Processor may have been set by the user to monitor server operations
and to attempt recoveries. You may wish to disable these actions while you diagnose
and service the system. If you disable them, you should make notes of their current
settings for restoration before you leave. Following are the settings of your interest.
Be prepared to record code numbers and use those numbers in the course of
analyzing a problem. Go to “Step 1020-1.”
Step 1020-1
Symptom Action
The diskette LED is blinking rapidly, or EIEA or The flash EPROM data is corrupted. The
EIEB is displayed on the operator panel. recovery procedure for the flash EPROM should
be executed. See “Firmware Recovery” on
page 5-23.
The system stops with a prompt to enter a Enter the password. You are not allowed to
password. continue until a correct password has been
entered. When you have entered a valid
password go to the beginning of this table and
wait for one of the other conditions to occur.
The diagnostics loaded. Go to MAP 0020 in the IBM RS/6000 Diagnostic
Information for Multiple Bus Systems.
The system login prompt is displayed. You may not have pressed the correct key or
you may not have pressed the key soon enough
when you were to indicate a Service Mode IPL
of the diagnostic programs. If this was the case
start over at the beginning of this Step.
Note: Perform the systems shutdown
procedure before turning off the system.
If you are sure you pressed the correct key in a
timely manner, go to “Step 1020-2” on
page 2-10.
The system does not respond when the Go to “Step 1020-2” on page 2-10.
password is entered.
The system stopped and a POST indicator is If the POST indicator represents:
displayed on the system console and an
memory, record error code M0MEM002.
eight-digit error code is not displayed.
keyboard, record error code M0KBD000.
SCSI, record error code M0CON000.
network, record error code M0NET000.
speaker (audio), record error code
M0BT0000.
Go to “Step 1020-3” on page 2-10.
Find the type of keyboard you are using in the following table; then follow the
instructions given in the Action column.
Step 1020-3
DANGER
CAUTION:
This product is equipped with a three–wire power cable and plug for the user's
safety. Use this power cable with a properly grounded electrical outlet to avoid
electrical shock.
DANGER
Step 1520-2
1. Turn the power off.
2. Check that the external power cable to the system unit has continuity.
3. Check that the power outlet has been wired correctly with the correct voltage.
4. Check that the external power cable is plugged into both the system unit and the
power outlet.
To test each FRU, exchange the FRUs that have not already been exchanged in the
following order.
Verify that the fan jumper is connected to connector J55.
Power supply
I/O planar
Service Processor
System card
Front cooling fans (one at a time)
1. Turn the power off.
2. Unplug the system unit power cable from the wall outlet.
3. Exchange one of the FRUs in the list.
4. Connect the system unit power cable to the wall outlet.
5. Turn the power on.
Does the fan in the power supply turn on and the power LED come on and stay
on?
NO Reinstall the original FRU.
Repeat this step until the defective FRU is identified or all the FRUs have
been exchanged.
If the symptom did not change and all the FRUs have been exchanged,
go to “Step 1520-4” on page 2-15.
YES Go to "MAP 410: Repair Checkout" in the IBM RS/6000 Diagnostic
Information for Multiple Bus Systems.
Does the fan in the power supply turn on and the power LED come on and stay
on?
NO Replace the I/O planar. Go to "MAP 410: Repair Checkout" in the IBM
RS/6000 Diagnostic Information for Multiple Bus Systems.
YES Go to “Step 1520-5” on page 2-16.
One of the parts that was removed or unplugged is causing the problem. Install or
connect the parts in the following order.
1. CPU card(s).
2. Memory cards. (One at a time)
3. SCSI devices, lowest bay to highest bay.
4. PCI cards, lowest slot to highest slot.
5. ISA cards, lowest slot to highest slot.
6. Fans.
Turn the power on after each part is installed or connected. If the system does not
power on or the power LED does not stay on, the most recently installed or
connected part is causing the failure.
1. Turn the power off.
2. Unplug the system unit power cable from the wall outlet.
3. Install or connect one of the parts in the list.
4. Connect the system unit power cable to the wall outlet.
5. Turn the power on.
Does the fan in the power supply turn on and the power LED come on and stay
on?
NO Replace the last part installed.
If the memory card was just installed, remove all of the memory DIMMs. If
the system does not come up, replace the memory card.
Re-install the memory DIMMs, one pair at a time, until the problem recurs.
Replace the memory DIMM pair that was just installed.
Note: The memory DIMM pair must be installed in slots that are next to
each other. For example, install the pair in J1 and J2, not J1 and J3.
Repeat these steps until all the parts have been installed.
If the symptom did not change and all the parts have been replaced, call
your service support person for assistance.
If the symptom has changed, check for loose cards, cables, and obvious
problems. If you do not find a problem, return to “Step 1520-1” on
page 2-13 in this MAP and follow the instructions for the new symptom.
YES Go to "MAP 410: Repair Checkout" in the IBM RS/6000 Diagnostic
Information for Multiple Bus Systems.
Notes:
1. Be sure to unplug the power cable before removing or installing Service
Processor, CPU, System Card or I/O planar to avoid damage to them.
2. This MAP assumes that a CD-ROM drive is installed and connected to the
integrated SCSI adapter, and a Diagnostics CD-ROM is available.
3. If a power-on password or privileged-access password is installed, you are
prompted to enter the password before the diagnostic CD-ROM can load.
4. The term "POST indicators" refer to the icons (graphic display) or device
mnemonics (ASCII terminal) that appear during the power-on self-test (POST).
5. The Service Processor may have recorded one or more symptoms in its error
log. It is a good idea to examine that error log before proceeding (see Service
Processor System Information Menu).
6. The Service Processor may have been set by the user to monitor server
operations and to attempt recoveries. You may wish to disable these actions
while you diagnose and service the system. If you disable them, you should
make notes of their current settings for restoration before you leave. Following
are the settings of your interest.
This MAP is used to locate defective FRUs not found by normal diagnostics. For this
procedure, diagnostics are run on a minimally-configured system. If a failure is
detected on the minimally-configured system, the remaining FRUs are exchanged
one at a time until the failing FRU is identified. If a failure is not detected, FRUs are
added back until the failure occurs. The failure is then isolated to the failing FRU.
Does the operator panel stabilize with code E1F2, E1F3, E1F7, or STBY?
NO Go to “Step 1540-3” on page 2-20.
YES Go to “Step 1540-4” on page 2-21.
If the following steps call for a I/O planar to be replaced, see notes on page 2-1.
Turn the power off, remove the power cable, and exchange the following FRUs in
order:
1. Processor cards
2. Memory DIMMs (pair)
3. Memory card
4. I/O planar (see notes on page 2-1.)
5. System Card
6. Service Processor
Does the operator panel stabilize with code E1F2, E1F3, E1F7, or STBY?
NO Reinstall the original FRU.
Repeat the FRU replacement steps until the defective FRU is identified or
all the FRUs have been exchanged.
If the symptom did not change and all the FRUs have been exchanged,
call your service support person for assistance.
If the symptom has changed, check for loose cards, cables, and obvious
problems. If you do not find a problem, return to “Step 1540-1” on
page 2-18 in this MAP and follow the instructions for the new symptom.
YES Go to "MAP 410: Repair Checkout" in the IBM RS/6000 Diagnostic
Information for Multiple Bus Systems.
Does the operator panel stabilize with code E1F2, E1F3, E1F7, or STBY?
NO Go to “Step 1540-5” on page 2-22.
YES Repeat this step until all the memory DIMMs are installed and tested.
After all the memory DIMMs are installed and tested, record the positions
of the memory DIMMs in the second memory card. Remove all of the
memory DIMMs from the second memory card except 1 pair. Repeat this
step with the second memory card (if previously installed).
Go to “Step 1540-7” on page 2-24.
The failure may be caused by the last pair of memory DIMMs installed or the
memory card. To isolate the failing FRU, do the following:
1. Turn the power off and remove the power cable.
2. Exchange the last memory DIMM pair installed.
3. Reinstall the power cable.
4. Turn the power on.
5. Wait for the operator panel to stabilize at a checkpoint.
Note: Checkpoints E1F2, E1F3 and STBY are stable as soon as they appear.
Other checkpoints may take up to 3 minutes to assure stability, depending on
system configuration.
Does the operator panel stabilize with code E1F2, E1F3, E1F7, or STBY?
NO Go to “Step 1540-6” on page 2-23.
YES Go to "MAP 410: Repair Checkout" in the IBM RS/6000 Diagnostic
Information for Multiple Bus Systems.
Does the operator panel stabilize with code E1F2, E1F3, E1F7, or STBY?
NO Reinstall the original FRU.
Repeat this step until the defective FRU is identified or all the FRUs have
been exchanged.
If the symptom did not change and all the FRUs have been exchanged,
call your service support person for assistance.
If the symptom has changed, check for loose cards, cables, and obvious
problems. If you do not find a problem, return to “Step 1540-1” on
page 2-18 in this MAP, and follow the instructions for the new symptom.
YES Go to "MAP 410: Repair Checkout" in the IBM RS/6000 Diagnostic
Information for Multiple Bus Systems.
The system is working correctly with this configuration. One of the FRUs (adapters)
that you removed is probably defective.
1. Make sure the diagnostic CD-ROM is inserted into the CD-ROM drive.
2. Turn the power off and remove the power cable.
3. Plug the diskette drive cable into the diskette drive connector on the I/O planar.
4. Reinstall the power cable.
5. Turn the power on.
6. After the keyboard indicator is displayed, press the F5 key on the
directly-attached keyboard or the number 5 key on an ASCII terminal keyboard.
7. Enter the appropriate password when prompted to do so.
The system is working correctly with this configuration. One of the FRUs (adapters)
that you removed is probably defective,
1. Make sure the diagnostic CD-ROM is inserted into the CD-ROM drive.
2. Turn the power off and remove the power cable.
3. Install the second CPU card if one was removed. If a second CPU card was not
removed, or has already been reinstalled and verified, install an I/O Planar
device (parallel port device, serial port device, keyboard or mouse) that had been
removed and any cables that were attached. If all I/O Planar devices have been
reinstalled and verified, install a FRU (adapter) and connect any cables and
devices that were attached to it.
Note: If the FRU you installed requires supplemental media use the Process
Supplemental Media task to load the supplemental media first.
4. Reinstall the power cable.
5. Turn the power on.
6. If the Console Selection screen displays, choose the system console.
7. After the keyboard indicator displayeds, press the F5 key on the directly-attached
keyboard or the number 5 key on an ASCII terminal keyboard.
8. Enter the appropriate password when prompted to do so.
The last FRU installed or one of its attached devices is probably defective.
1. Make sure the diagnostic CD-ROM is inserted into the CD-ROM drive.
2. Turn the power off and remove the power cable.
3. Starting with the last installed adapter, disconnect one attached device and
cable.
4. Reinstall the power cable.
5. Turn the power on.
6. If the Console Selection screen is displayed, choose the system console.
7. After the keyboard indicator appears, press the F5 key on the directly-attached
keyboard or the number 5 key on an ASCII terminal keyboard.
8. Enter the appropriate password when prompted to do so.
Have you exchanged all the FRUs that correspond to the failing function
codes?
NO Exchange the FRU with the highest failure percentage that has not been
changed.
Repeat this step until all the FRUs associated with the SRN have been
exchanged or diagnostics run with no trouble found. Run diagnostics after
each FRU is exchanged.
If the I/O planar or a network adapter is replaced, see notes on page 2-1.
Go to "MAP 410: Repair Checkout" in the IBM RS/6000 Diagnostic
Information for Multiple Bus Systems.
YES If the symptom did not change and all the FRUs have been exchanged,
call your service support person for assistance.
Does the system have adapters or devices that require supplemental media?
NO Go to “Step 1540-15.”
YES Go to “Step 1540-16.”
Step 1540-15
Consult the ISA and PCI adapter configuration documentation for your operating
system to verify that all installed adapters are configured correctly.
Go to "MAP 410: Repair Checkout" in the IBM RS/6000 Diagnostic Information for
Multiple Bus Systems.
If the symptom did not change and all the FRUs have been exchanged, call your
service support person for assistance.
Step 1540-16
Select Process Supplemental Media and follow the on screen instructions to process
the media.
Did the system return to the TASKS SELECTION SCREEN after the
supplemental media was processed?
NO Go to “Step 1540-17” on page 2-34.
YES Go to “Step 1540-12” on page 2-31 substep 6 on page 2-31.
If the supplemental media is for an adapter replace the FRUs in the following order:
1. Adapter.
2. The planar the adapter is plugged into.
If the supplemental media is for a device replace the FRUs in the following order:
1. Device and any associated cables.
2. The adapter the device is attached to.
Repeat this step until the defective FRU is identified or all the FRUs have been
exchanged.
If the symptom did not change and all the FRUs have been exchanged, call your
service support person for assistance.
If the symptom has changed, check for loose cards, cables, and obvious problems.
If you do not find a problem, return to “Step 1540-1” on page 2-18 on this MAP and
follow the instructions for the new symptom.
Go to "MAP 410: Repair Checkout" in the IBM RS/6000 Diagnostic Information for
Multiple Bus Systems.
Attention: Switching off the power to an SSA Hot-Swap Disk Drive can cause a
system fault unless you first make the SSA Hot-Swap Disk Drive modules
unavailable to the system unit. To do this:
1. Use the system-management task menus to make the disk drive modules
unavailable.
2. If the system unit has more than one SSA adapter, make the SSA Hot-Swap
Disk Drive modules unavailable to the system for each of the paths to the SSA
Hot-Swap Disk Drive.
Attention: Unless the system unit needs to be switched off for some other reason,
do not switch off the power to the system unit when servicing an SSA Hot-Swap Disk
Drive or the external SSA cable.
To isolate the FRUs associated with in the failing SSA Hot-Swap Disk Drive,
perform the actions and answer the questions given in this MAP.
When instructed to exchange two or more FRUs in sequence:
1. Exchange the first FRU in the list for a new one.
2. Verify that the problem is solved. For some problems, verification means
running the diagnostic programs (see the using-system service procedures).
3. If the problem remains:
a. Reinstall the original FRU.
b. Exchange the next FRU in the list for a new one.
4. Repeat steps 2 and 3 until either the problem is solved, or all the related
FRUs have been exchanged.
5. Perform the next action indicated by the MAP.
The following table explains the meaning of the Power, Ready and Check status
LEDs.
This MAP is the entry point to the MAPs for the SSA Hot-Swap Disk Drive. If you
are not familiar with these MAPs, read “How to Use These MAPs” on page 2-35 first.
Step 2010-1
Have you been sent to this MAP from the Configuration Verification service aid?
NO Go to “Step 2010-3” on page 2-39.
YES Go to “Step 2010-2” on page 2-38.
Step 2010-4
If the stand-alone diagnostics are available, but the system diagnostics are not
available, run them, then go to “Step 2010-5.”
If the system diagnostics and stand-alone diagnostics are not available, go to “Step
2010-1” on page 2-37.
Step 2010-5
Did the diagnostics give you an SRN for the SSA Hot-Swap Disk Drive?
NO Go to “Step 2010-6” on page 2-40.
YES Go to “Service Request Numbers (SRNs)” on page A-3.
Step 2010-7
Step 2010-8
Step 2010-9
Step 2010-11
Replace the SSA link cable closest to the flashing LED. Did the LED stop flashing?
NO Go to step “Step 2010-13.”
YES Go to "MAP 410: Repair Checkout" in IBM RS/6000 Diagnostic
Information for Multiple Bus Systems to verify the repair.
Step 2010-13
The Error Code to FRU Index lists error symptoms and possible causes. The most
likely cause is listed first. Use this index to help you decide which FRUs to replace
when servicing the system.
If the codes in the following tables indicate a device which is present more than once
in the system unit, a location code is needed to specify which device generated the
error. Go to “Step 1020-4” on page 2-11 to display the System Management
Services error log and obtain a location code. Location code descriptions can be
found under “Location Codes” on page 3-46.
Notes:
1. If more than eight digits are displayed in the operator panel, use only the first
eight digits to find the error in the tables. The digits that display beyond the first
eight digits are location codes that can assist you in diagnosing the problem. See
“Location Codes” on page 3-46.
2. Licensed programs frequently rely on network configuration, and system
information stored on the VPD on the operator panel. If the MAPs indicate that
the Operator Panel should be replaced, swap the VPD from the old operator
panel to the new one. If the old VPD module has to be replaced call technical
support for recovery instructions. If recovery is not possible, notify the system
owner that new keys for licensed programs may be required.
3. If a network adapter, or the I/O board is replaced, the network administrator must
be notified so that the client IP addresses used by the server can be changed. In
addition, the operating system configuration of the network controller may need
to be changed in order to enable system startup. Also check to ensure that any
client or server that addresses this system is updated.
If you replace FRUs and the problem is still not corrected, go to “MAP 1540:
Minimum Configuration” on page 2-17 unless otherwise indicated in the tables.
If you replace FRUs and the problem is corrected, go to "MAP 410: Repair
Checkout" in the IBM RS/6000 Diagnostic Information for Multiple Bus Systems
If you replace FRUs and the problem is still not corrected, go to “MAP 1540:
Minimum Configuration” on page 2-17
If you replace FRUs and the problem is corrected, go to "MAP 410: Repair
Checkout" in the IBM RS/6000 Diagnostic Information for Multiple Bus Systems
The following table expands the firmware error code 25Cyyxxx on page 3-9, where
yy is the PD values in the table below. Use these values to identify the type of
memory that generated the error.
If you replace FRUs and the problem is still not corrected, go to &hdref refid=1540.
unless otherwise indicated in the tables.
This table is used to locate defective FRUs within the I/O board PCI and ISA buses.
The table indicates which devices should be tested for each SRN. For this
procedure, if possible, diagnostics are run on the I/O board bus devices with all
adapters removed from the failing bus. If a failure is detected on this system with all
adapters removed from the failing bus, the I/O board is the isolated FRU. If a failure
is not detected, the adapters are added back one at a time, to their original slot
location, and the configuration is tested until a failure is detected. The failure is then
isolated to the failing FRU.
If a failure has not been detected and all the FRUs have been tested call your
technical service support person for assistance.
Checkpoints are intended to let users and service personal know what the server is
doing, with some detail, as it initializes. These checkpoints are not intended to be
error indicators, but in some cases a server could hang at one of the checkpoints
without displaying an 8-character error code. It is for these hang conditions, only, that
any action should be taken with respect to checkpoints. The most appropriate action
is included with each checkpoint.
Before taking actions listed with a checkpoint, it is a good idea to look for better
symptoms in the Service Processor error log. See Service Processor System
Information Menu.
SP Checkpoints
Service Processor checkpoints are in the range E010 to E0FF. The message OK
indicates successful service processor testing and initialization. Firmware checkpoints
are listed in “Firmware Checkpoints” on page 3-32.
Note: Go to MAP 1540 for any of the following conditions:
A four-digit code in the range of E001 through EFFF displays on the operator
panal but is not listed in the checkpoint table.
A four-digit code displays in the checkpoint table, but does not contain a repair
action or FRU listing.
All of the FRUs listed in the repair action have been replaced and the problem
has not been corrected.
Firmware uses progress codes (checkpoints) in the range of E1xx to EFFF. These
checkpoints occur during system startup and maybe be useful in diagnosing certain
problems. Service Processor checkpoints are listed in “SP Checkpoints” on
page 3-29.
If you replace FRUs and the problem is still not corrected, go to “MAP 1540:
Minimum Configuration” on page 2-17 unless otherwise indicated in the tables.
It is perfectly acceptable for there to be 2 partially populated Memory Cards, the first
one does not have to be fully populated before memory on the 2nd Memory Card is
usable.
Memory Card memory DIMMs, on the other hand, must be installed in matched (size
and speed) pairs. Refer to the “Removal and Replacement Procedures” for “Memory
Modules” on page 6-26 for labeling of the Memory Card and instructions on module
removal and installation (do not, however, replace the covers as directed while
troubleshooting this problem). A single memory DIMM pair may be installed in
module slots 1 and 2 (not slots 1 and 3). A second memory DIMM pair could be
installed in module slots 5 and 6 (slots 3 and 4 do not have to be populated first).
Along these same lines, there is no requirement that memory DIMM slots 1 and 2 be
populated before another slot pair.
E1xx Code Boot Problems: Depending on the boot device, a checkpoint may
be displayed on the operator panel for an extended period of time while the boot
image is retrieved from the device. This is particularly true for Tape and Network
boot attempts. If the checkpoint/code is displayed for an extended time there may
be a problem loading the boot image from the device. If booting from CD-ROM or
Tape, watch for “activity” on the drive's LED indicator. A blinking LED means that
the loading of either the boot image or additional information required by the
operating system being booted is still in progress.
For network boot attempts, if the system is not connected to an active network or if
the target server is inaccessible (this can also result from incorrect IP parameters
being supplied), the system still attempts to boot and because time-out durations are
necessarily long to accommodate retries, the system may appear to be hung.
1. Restart the system and get to the Firmware SMS utilities. In the utilities check:
Is the intended boot device correctly specified in the boot sequence?
– If the intended boot device is not correctly identified in the boot
sequence, add it to the boot sequence using the SMS menus.
For network boot attempts:
– Are the IP parameters correct?
– Attempt to “Ping” the target server using the SMS “Ping” utility.
2. If the checkpoint E105 or E15B is displayed for an extended time, there may be
a problem with the integrity of the boot image.
Try to boot and run standalone diagnostics against the system, particularly
against the intended boot device. If the diagnostics are successful, it may be
necessary to perform an operating system specific recovery process, or
reinstall the operating system.
3. If attempting to boot from a Hard disk, CD-ROM, or Tape drive:
a. Try a different CD/Tape (unless booting from Hard disk).
b. Verify proper SCSI bus termination.
c. Replace SCSI cable.
d. It is possible that another attached SCSI device is causing the problem.
Disconnect any other SCSI devices attached to the same controller as the
one the boot device is attached to and retry the boot operation. If this is
successful, one of the devices removed is causing the problem, re-attach
devices one by one and retry the boot operation until the problem recurs and
replace the device that caused the problem.
e. Replace SCSI adapter (if drive is attached to a card rather than the I/O
Board).
f. Replace SCSI drive.
3-44 Service Guide
This system unit uses Physical Location Codes in conjunction with AIX Location
Codes to provide mapping of the failing field replaceable units. The location codes
are produced by the system unit's firmware and AIX.
The format for the location code is an alphanumeric string of variable length,
consisting of a series of location identifiers, separated by the standard dash (-) or
slash (/) character. The series is hierarchical; that is, each location identifier in the
string is a physical child of the one preceding it.
The - (dash) separator character represents a normal structural relationship
where the child is a separate physical package and it plugs into (or is connected
to) the parent. For example, P1-C1 is a CPU card (C1) plugged into a planar
(P1), or P1-M1 is a memory card (M1) plugged into a planar (P1).
The / (slash) separator character separates the base location code of a function
from any extended location information. A group of logical devices can have the
same base location code because they are all on the same physical package,
but may require extended location information to describe the connectors they
support. For example, P2/S1 describes the location of the serial port 1 controller
and its connector (S1), which is located on planar P2 (its base location code),
but the / indicates that further devices can be connected to it at the external S1
serial connector. The keyboard controller and its connector likewise have
location code P2/K1, which means they have the same base location code (P2)
as serial port 1, but a different external connector. In contrast, the location code
P2-K1 actually points to the device connected to connector K1; that is, the
keyboard. The location code P2/Z1 indicates an integrated SCSI controller which
drives connector Z1, while location codes of P2-Z1-... point to the actual SCSI
bus and devices.
Each location identifier consists of one alpha prefix character that identifies a location
type, and a decimal integer number (typically one or two digits) that identifies a
specific instance of this location type. Certain location types may also support
secondary sub-locations, which are indicated by appending a period (".") character
and a sub-location instance number.
Where p is a defined alpha location type prefix, n is a location instance number, and
[.n] is a sub-location instance number (where applicable). Sub-location notation is
used only for location types which have clearly defined and limited expansion sites;
for example, memory SIMMs slots on a memory card. Primarily, the [.n] sub-location
notation is intended for use as an abbreviation of the location code in cases where:
1. Based on the device structure, the abbreviated sub-location code conveys the
same information in a more concise form than an additional level of location
identifier -- for example:
P1-M1.4 (pluggable module 4 on Memory Card 1 on Planar 1), rather than
P1-M1-M4
P1-C1.1 (pluggable CPU 1 on CPU Card 1 on Planar 1), rather than
P1-C1-C1
P2-Z1-A3.1 (LUN 1 at SCSI ID 3 on integrated SCSI bus 1 from Planar 2),
rather than P2-Z1-A3-A1
2. The sub-location is either a basic physical extension or sub-enclosure of the
base location, but does not represent additional function or connectivity; for
example, a drawer in a rack (U1.2) or a riser card on an I/O board (P2.1).
Integrated Devices
Pluggable Adapters
Card in PCI Slot 20-58 to 20-5F P2-I1 I/O board Host Bridge
1P Connector J1 ID01, Device 01
Card in PCI Slot 20-60 to 20-67 P2-I2 I/O board Host Bridge
2P Connector J2 ID01, Device 02
Card in PCI Slot 10-68 to 10-6F P2-I3 I/O board Host Bridge
3P Connector J3 ID00, Device 03
Card in PCI Slot 10-70 to 10-77 P2-I4 I/O board Host Bridge
4P Connector J4 ID00, Device 04
Card in PCI Slot 10-78 to 10-7F P2-I5 I/O board Host Bridge
5P Connector J5 ID00, Device 05
Card in PCI Slot 30-60 to 30-67 P2-I6 I/O board Host Bridge
6P Connector J6 ID02, Device 02
Card in PCI Slot 30-68 to 30-6F P2-I7 I/O board Host Bridge
7P Connector J7 ID02, Device 03
Card in PCI Slot 30-70 to 30-77 P2-I8 I/O board Host Bridge
8P Connector J8P ID02, Device 04
Card in PCI Slot 30-78 to 30-7F P2-I9 I/O board Host Bridge
9P Connector J9P ID02, Device 05
Card in ISA Slot 01-01 or 01-02 P2-I8 I/O board
8I Connector J81
Card in ISA Slot 01-01 or 01-02 P2-I9 I/O board
9I Connector J91
SCSI Devices
Fans
Service Processor
Operator Panel
Power Supply
The AB value identifies a bus type or PCI parent bus as assigned by the
firmware.
The CD value identifies adapter number, adapter's devfunc number, or physical
location. The devfunc number is defined as the PCI device number times 8, plus
the function number.
The EF value identifies a connector.
The GH value identifies a port, address, device, or FRU.
Adapters and cards are identified with just AB-CD.
00 Processor bus
01 ISA bus
02 EISA bus
03 MCA bus
04 PCI bus used in the case where the PCI bus cannot be identified
05 PCMCIA buses
xy For PCI adapters where x is equal to or greater than 1. The x and y are characters in the
range of 0-9, A-H, J-N, P-Z (O, I, and lower case are omitted) and are equal to the parent
bus's ibm, aix-loc Open Firmware Property.
For pluggable PCI adapters/cards, CD is the device's devfunc number (PCI device
number times 8, plus the function number). The C and D are characters in the range
of 0-9, and A-F (hex numbers). This allows the location code to uniquely identify
multiple adapters on individual PCI cards.
For pluggable ISA adapters, CD is equal to the order the ISA cards
defined/configured either by SMIT or the ISA Adapter Configuration Service Aid.
For integrated ISA adapters, CD is equal to a unique code identifying the ISA
adapter. In most cases this is equal to the adapter's physical location code. In cases
where a physical location code is not available, CD will be FF.
For integrated adapter, EF-GH is the same as the definition for a pluggable adapter.
For example, the location code for a diskette drive is 01-D1-00-00. A second
diskette drive is 01-D1-00-01.
G defines the control unit address of the device. Values of 0 to 15 are valid.
H defines the logical unit address of the device. Values of 0 to 255 are valid.
P1-C1
P1-M2
U1-P1-M2.12
10-80 Ethernet
10-60 Integrated SCSI Port 1
30-58 Integrated SCSI Port 2
Location codes identify the locations of adapters and devices in the using system
and their attached subsystems and devices. These codes are displayed when the
diagnostic programs isolate a problem. For information about the location codes that
are used by the using system, see the operator guide for the using system.
A B - C D - E F - G H
│ │ │ │ │ │ │ │
│ │ │ │ │ │ │ └──── Always ð
│ │ │ │ │ │ │
│ │ │ │ │ │ └────── Always ð
│ │ │ │ │ │
│ │ │ │ │ └────────── Always ð
│ │ │ │ │
│ │ │ │ └──────────── P = Physical disk drive module
│ │ │ │ L = Logical disk drive
│ │ │ │
│ │ │ └──────────────── Adapter position (number of the slot,
│ │ │ 1 through 8, containing
│ │ │ the SSA adapter)
│ │ │
│ │ └────────────────── System I/O bus identifier
│ │
│ └────────────────────── Always ð
│
└──────────────────────── Always ð
The location code shows only the position of the SSA adapter in the using system
and the type of device that is attached. Use a service aid to find the location of the
device (disk drive module). For more information, see “SSA Service Aids” on
page 3-64.
The disk drive modules of the system unit are connected through two SSA links to
an SSA adapter that is located in the using system. The disk drive modules, SSA
links, and SSA adapter are configured in loops. Each loop provides a data path that
starts at one connector of the SSA adapter and passes through a link (SSA cable) to
the disk drive modules. The path continues through the disk drive modules, then
returns through another link to a second connector on the SSA adapter.
The system unit can be attached to an SSA adapter. See Figure 3-1. The adapter
card has four SSA connectors that are arranged in two pairs. Connectors A1 and A2
are one pair; connectors B1 and B2 are the other pair. Connectors B1 and B2 are
used on 7025 Model F50, since they support the internal and external SSA cable
options.
Each pair of connectors has a green light that indicates the operational status of its
related loop:
In the system unit, the disk drive modules are arranged in a string of 2 to 6 disk drive
modules. This string has its own two SSA connectors. This string is connected
through an internal SSA cable to a pair of connectors on the SSA adapter to make
an SSA loop. (The SSA cables provide the SSA links.)
Figure 3-2 shows the relationships between the SSA connectors and the disk drive
module string in a system unit.
The physical disk drives (pdisks) in an SSA subsystem can be configured as logical
units (LUNs). A LUN is also known as an hdisk, and can consist of one or more
physical disk drives. An hdisk in an SSA subsystem might, therefore, consist of one
pdisk or several pdisks.
The configuration software also allocates an identification (hdisk and pdisk number)
to each disk drive module during the configuration of the SSA link. The disk drive
modules do not have fixed physical addresses.
The configuration software first recognizes the disk drive module by the
machine-readable serial number that is written on the disk. Then, it assigns an
identification that is related to the position of the disk drive-module in the SSA string.
After configuration, the disk drive module always has the same identification. It can
be moved to any position on the SSA link, but can always be found by the software.
Service actions are always related to physical disk drives. For this reason, errors
that occur on SSA disk drives are always logged against the physical disk drive
(pdisk).
The numeric identifier of pdisks, hdisks, and disk drive module slots of the system
unit are not related to each other. For example, pdisk1 is not necessarily installed in
slot 1 of the 6-pack.
The disk drive module serial number, on the label on the front of the carrier, is also
displayed by the service aids. It is the last eight digits of the drives IEEE SSA
unique ID, which is stored on the drives electronics card.
All devices that are attached to an SSA adapter card are connected through SSA
links. See Figure 3-2 on page 3-60. The SSA links are configured as loops. Data
and commands to a particular device pass through all other devices on the link
between the adapter and the target device.
Data can travel in either direction around a loop. The adapter can, therefore, access
the disk drive modules through two data paths. The using system cannot detect
which data path is being used.
If a disk drive module fails, or is turned off, the loop is broken, and one of the data
paths to a particular module is no longer available. The disk drive modules on the
loop continue to work, but an error is reported to the using system.
Figure 3-3 shows that disk drive module in slot 3 has failed. The disk drive modules
in slots 1 and 2 can communicate with the using system only through connector A1
of the SSA adapter. The disk drive modules in slots 4, 5, and 6 can communicate
only through connector A2 of the SSA adapter.
Figure 3-4 shows that disk drive modules in slots 2 and 4 have failed. The disk
drive module in slot 1 can communicate with the using system only through
connector A1 of the SSA adapter. The disk drive module in slots 5 and 6 can
communicate with the using system only through connector A2 of the SSA adapter.
The disk drive module in slot 3 is isolated from the SSA adapter.
SSA service aids help you service the SSA Hot-Swap Disk Drive. This section
describes those service aids, and tells how to use them.
Attention: Do not run the service aids from more than one using system at a time;
otherwise, unexpected results might occur.
Before you use the service aids, ensure that you are familiar with the principles of
SSA links and physical disk drives (pdisks). If you are not familiar with these
principles, first read “SSA Loops and Links” on page 3-59.
The identify function can be accessed from any of the service aids.
This function enables you to determine the location of a particular disk drive module
that you want to identify but do not want to remove from the system unit. The
identify function causes the check light of the disk drive module to flash for
identification (1 second on, 1 second off), but has no effect on the normal operation
of the disk drive module. You can use the identify function on any number of disk
drive modules at the same time.
Instructions displayed by the service aids tell you when you can select the identify
function.
Note: You cannot use the identify function on a device that is in reserved status.
The Set Service Mode service aid enables you to determine the location of a
particular disk drive module and to remove that module from the system unit. It
causes the check light of that disk drive module to come on for identification, and it
stops all SSA link activity through the module. Only one disk drive module at a time
can be in the service mode.
Before using this service aid, you must make the selected disk drive module
unavailable to the using system; otherwise, an error occurs.
SSA disk drive modules can be maintained concurrently; that is, they can be
removed, installed, and tested on an SSA link while the other modules on the link
continue to work normally. If a disk drive module has its check light on, you can
remove that module from the SSA link without taking any special actions.
If a disk drive module does not have its check light on, the SSA link that passes
through it might still be active, even though the disk drive module itself might not be
working. You must put that module into the service mode before you remove it from
the SSA link.
If you leave the Set Service Mode service aid, the service mode is reset.
┌──────────────────────────────────────────────────────────────────────────────────┐
│ │
│ SET SERVICE MODE 8ð2381 │
│ │
│ Move cursor onto selection, then press Enter. │
│ │
│ {TOP} │
│ │
│ pdiskð 11111111 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk1 22222222 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk2 33333333 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk3 44444444 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk4 55555555 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk5 66666666 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk6 77777777 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk7 88888888 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk8 99999999 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ │
│ │
│ {BOTTOM} │
│ │
│ F3=Cancel F1ð=Exit │
│ │
└──────────────────────────────────────────────────────────────────────────────────┘
┌──────────────────────────────────────────────────────────────────────────────────┐
│ SET SERVICE MODE 8ð2381 │
┌──────────────────────────────────────────────────────────────────────────────────┐
│ SET SERVICE MODE 8ð2382 │
│ │
│ pdisk3 44444444 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ │
│ Move cursor onto selection, then press Enter. │
│ │
│ + Set or Reset Identify. │
│ Select this option to set or reset the Identify indicator │
│ on the disk drive. │
│ > Set or Reset Service Mode. │
│ Select this option to set or reset Service Mode │
│ on the disk drive. │
│ │
│ │
│ │
│ │
│ F3=Cancel F1ð=Exit │
│ │
└──────────────────────────────────────────────────────────────────────────────────┘
┌──────────────────────────────────────────────────────────────────────────────────┐
│ SET SERVICE MODE 8ð2381 │
┌──────────────────────────────────────────────────────────────────────────────────┐
│ SET SERVICE MODE 8ð2382 │
┌──────────────────────────────────────────────────────────────────────────────────┐
│ SET SERVICE MODE 8ð2381 │
│ │
│ Move cursor onto selection, then press Enter. │
│ │
│ {TOP} │
│ │
│ pdiskð 11111111 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk1 22222222 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk2 33333333 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ > pdisk3 44444444 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk4 55555555 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk5 66666666 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk6 77777777 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk7 88888888 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk8 99999999 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ │
│ │
│ │
│ {BOTTOM} │
│ │
│ F3=Cancel F1ð=Exit │
│ │
└──────────────────────────────────────────────────────────────────────────────────┘
Notes:
a. You can select only one disk drive module at a time.
b. If you select the service mode function, and the selected disk drive module is
not in a closed loop (see “SSA Loops and Links” on page 3-59), your
selection fails and an error message is displayed. Use the Link Verification
service aid to identify any open-link problems before trying to reselect the
service mode function.
c. If you select the service mode function, and a file system is mounted on the
selected disk drive module, your selection fails. Use the Configuration
Verification service aid to determine which hdisk must have its file system
unmounted before you can select the service mode function.
d. If the check light of the disk drive module that you have put into the service
mode does not come on, and if you are not sure of the location of that
module, use the identify function to help you find the disk drive module (see
“The Identify Function” on page 3-65).
┌──────────────────────────────────────────────────────────────────────────────────┐
│ SET SERVICE MODE 8ð2381 │
┌──────────────────────────────────────────────────────────────────────────────────┐
│ SET SERVICE MODE 8ð2382 │
┌──────────────────────────────────────────────────────────────────────────────────┐
│ SET SERVICE MODE 8ð2381 │
┌──────────────────────────────────────────────────────────────────────────────────┐
│ SET SERVICE MODE 8ð2382 │
│ │
│ > pdisk5 66666666 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ │
│ │
│ Move cursor onto selection, then press Enter. │
│ │
│ + Set or Reset Identify. │
│ Select this option to set or reset the Identify indicator │
│ on the disk drive. │
│ > Set or Reset Service Mode. │
│ Select this option to set or reset Service Mode │
│ on the disk drive. │
│ │
│ │
│ F3=Cancel F1ð=Exit │
│ │
└──────────────────────────────────────────────────────────────────────────────────┘
┌──────────────────────────────────────────────────────────────────────────────────┐
│ SET SERVICE MODE 8ð2381 │
┌──────────────────────────────────────────────────────────────────────────────────┐
│ SET SERVICE MODE 8ð2382 │
┌──────────────────────────────────────────────────────────────────────────────────┐
│ SET SERVICE MODE 8ð2381 │
┌──────────────────────────────────────────────────────────────────────────────────┐
│ SET SERVICE MODE 8ð2382 │
┌──────────────────────────────────────────────────────────────────────────────────┐
│ SET SERVICE MODE 8ð2381 │
│ │
│ Move cursor onto selection, then press Enter. │
│ │
│ {TOP} │
│ │
│ pdiskð 11111111 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk1 22222222 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk2 33333333 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ > pdisk3 44444444 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk4 55555555 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ + pdisk5 66666666 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk6 77777777 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk7 88888888 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk8 99999999 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ │
│ │
│ │
│ │
│ │
│ │
│ {BOTTOM} │
│ │
│ F3=Cancel F1ð=Exit │
│ │
└──────────────────────────────────────────────────────────────────────────────────┘
┌──────────────────────────────────────────────────────────────────────────────────┐
│ LINK VERIFICATION 8ð2385 │
│ │
│ Move cursor onto selection, then press Enter. │
│ │
│ ssað ðð-ð3 SSA Adapter │
│ ssa1 ðð-ð5 SSA Adapter │
│ ssa2 ðð-ð6 SSA Adapter │
│ ssa3 ðð-ð7 SSA Adapter │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ F3=Cancel F1ð=Exit │
│ │
└──────────────────────────────────────────────────────────────────────────────────┘
┌──────────────────────────────────────────────────────────────────────────────────┐
│ LINK VERIFICATION 8ð2385 │
┌──────────────────────────────────────────────────────────────────────────────────┐
│ LINK VERIFICATION 8ð2386 │
│ │
│ SSA Link Verification for: │
│ ssa1 ðð-ð5 SSA Adapter │
│ │
│ To set or reset Identify, move cursor onto selection, then press Enter. │
│ │
│ Physical Serial# Adapter Port │
│ │
│ A1 A2 B1 B2 Status │
│ {TOP} │
│ pdiskð 11111111 ð 7 Good │
│ pdisk1 22222222 1 6 Good │
│ pdisk2 33333333 2 5 Good │
│ pdisk3 44444444 3 4 Good │
│ pdisk4 55555555 4 3 Good │
│ pdisk5 66666666 5 2 Good │
│ pdisk6 77777777 6 1 Good │
│ pdisk7 88888888 7 ð Good │
│ pdisk8 99999999 ð 1ð Good │
│ pdisk9 nnnnnnnn 1 9 Good │
│ {MORE} │
│ │
│ F3=Cancel F1ð=Exit │
│ │
└──────────────────────────────────────────────────────────────────────────────────┘
An SSA link must be configured in a loop around which data can travel in either
direction. The loop is broken if a cable fails or is removed, or if a disk drive
module fails. Because each disk drive module on the loop can be accessed
Note that the missing disk drive module (pdisk2) is represented by a line of
question marks.
The Configuration Verification service aid enables you to determine the relationship
between SSA logical units (hdisks) and SSA physical disk drives (pdisks). It also
displays the connection information and operational status of the disk drives.
Note: User applications communicate with the hdisks; error data is logged against
the pdisks.
┌──────────────────────────────────────────────────────────────────────────────────┐
│ CONFIGURATION VERIFICATION 8ð239ð │
│ │
│ Move cursor onto selection, then press Enter. │
│ │
│ {TOP} │
│ pdiskð 22222222 ðð-ð3-P 2 GB SSA F Physical Disk Drive │
│ pdisk1 33333333 ðð-ð5-P 2 GB SSA F Physical Disk Drive │
│ pdisk2 44444444 ðð-ð7-P 2 GB SSA F Physical Disk Drive │
│ hdisk3 22222222 ðð-ð3-L SSA Logical Disk Drive │
│ hdisk4 33333333 ðð-ð5-L SSA Logical Disk Drive │
│ hdisk5 44444444 ðð-ð7-L SSA Logical Disk Drive │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ {BOTTOM} │
│ │
│ F3=Cancel F1ð=Exit │
│ │
└──────────────────────────────────────────────────────────────────────────────────┘
┌──────────────────────────────────────────────────────────────────────────────────┐
│ CONFIGURATION VERIFICATION 8ð239ð │
┌──────────────────────────────────────────────────────────────────────────────────┐
│ CONFIGURATION VERIFICATION 8ð2391 │
│ │
│ hdisk3 22222222 ðð-ð3-L SSA Logical Disk Drive │
│ │
│ To set or reset Identify, move cursor onto selection, then press enter. │
│ │
│ Physical Serial# Adapter Port SSA_Addr Status │
│ │
│ {TOP} │
│ pdiskð 22222222 ðð-ð2 A1 5 Good │
│ ðð-ð2 A2 5 Good │
│ ðð-ð3 A1 1ð Good │
│ ðð-ð3 A2 ð Good │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ {BOTTOM} │
│ │
│ F3=Cancel F1ð=Exit │
│ │
└──────────────────────────────────────────────────────────────────────────────────┘
┌──────────────────────────────────────────────────────────────────────────────────┐
│ CONFIGURATION VERIFICATION 8ð239ð │
┌──────────────────────────────────────────────────────────────────────────────────┐
│ CONFIGURATION VERIFICATION 8ð2392 │
│ │
│ pdiskð 22222222 ðð-ð3-P 2 GB SSA F Physical Disk Drive │
│ │
│ │
│ Move cursor onto selection, then press Enter. │
│ │
│ │
│ hdisk3 22222222 ðð-ð3-L SSA Logical Disk Drive │
│ │
│ │
│ │
│ {BOTTOM} │
│ │
│ F3=Cancel F1ð=Exit │
│ │
└──────────────────────────────────────────────────────────────────────────────────┘
Note: If you select the hdisk from this screen, the hdisk configuration is
displayed.
The Format Disk service aid formats SSA disk drive modules.
Attention: Formatting a disk drive module destroys all the data on that module.
Use this procedure only when instructed to do so by the service procedures.
┌──────────────────────────────────────────────────────────────────────────────────┐
│ FORMAT DISK 8ð2395 │
│ │
│ Move cursor onto selection, then press Enter. │
│ │
│ {TOP} │
│ pdiskð 11111111 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk1 22222222 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk2 33333333 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk3 44444444 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk4 55555555 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk5 66666666 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk6 77777777 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk7 88888888 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk8 99999999 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ {BOTTOM} │
│ │
│ F3=Cancel F1ð=Exit │
│ │
└──────────────────────────────────────────────────────────────────────────────────┘
┌──────────────────────────────────────────────────────────────────────────────────┐
│ FORMAT DISK 8ð2395 │
┌──────────────────────────────────────────────────────────────────────────────────┐
│ FORMAT DISK 8ð2396 │
│ │
│ pdisk1 22222222 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ │
│ Set or Reset Identify. │
│ Select this option to set or reset the Identify indicator │
│ on the disk drive. │
│ Format. │
│ Select this option only if you are sure that you have selected │
│ the correct disk drive. │
│ FORMATTING DESTROYS ALL DATA ON THE DISK DRIVE. │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ F3=Cancel F1ð=Exit │
│ │
└──────────────────────────────────────────────────────────────────────────────────┘
3. If you are not sure of the identification (pdisk number) of the disk drive module
that you want to format, use the identify function to get a positive physical
identification of the module (see “The Identify Function” on page 3-65). You can
further ensure that you have selected the correct disk drive module by verifying
that the serial number on the front of the disk drive module is the same as the
serial number that is displayed on the screen.
4. When you are sure that you have selected the correct disk drive module, select
Format.
The Certify Disk service aid verifies that all the data on a disk drive module can be
read correctly. Instruction given elsewhere in this book tell you when you need to
run this service aid.
┌──────────────────────────────────────────────────────────────────────────────────┐
│ CERTIFY DISK 8ð24ð4 │
│ │
│ Move cursor onto selection, then press Enter. │
│ │
│ {TOP} │
│ pdiskð 11111111 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk1 22222222 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk2 33333333 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk3 44444444 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk4 55555555 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk5 66666666 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk6 77777777 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk7 88888888 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ pdisk8 99999999 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ {BOTTOM} │
│ │
│ F3=Cancel F1ð=Exit │
│ │
└──────────────────────────────────────────────────────────────────────────────────┘
┌──────────────────────────────────────────────────────────────────────────────────┐
│ CERTIFY DISK 8ð24ð4 │
┌──────────────────────────────────────────────────────────────────────────────────┐
│ CERTIFY DISK 8ð24ð5 │
│ │
│ pdisk1 22222222 ðð-ð4-P 2 GB SSA F Physical Disk Drive │
│ │
│ Move cursor onto selection, then press Enter. │
│ │
│ Set or Reset Identify. │
│ Select this option to set or reset the Identify indicator │
│ on the disk drive. │
│ Certify. │
│ Select this option to start the Certify operation. │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ │
│ F3=Cancel F1ð=Exit │
│ │
└──────────────────────────────────────────────────────────────────────────────────┘
3. If you are not sure of the identification (pdisk number) of the disk drive module
that you want to format, use the identify function to get a positive physical
identification of the module (see “The Identify Function” on page 3-65). You can
further ensure that you have selected the correct disk drive module by verifying
that the serial number on the front of the disk drive module is the same as the
serial number that is displayed on the screen.
4. When you are sure that you have selected the correct disk drive module, select
Certify
If the SSA service aids detect an unrecoverable error and are unable to continue,
one of the following error codes might occur.
SSA01 Not enough using-system memory is available for this service aid to
continue. Take one of the actions described here:
This problem might be caused by a failed application program. Ask
the user to end any failed application program, then try to run the
service aid again.
Run diagnostics in the problem determination mode on the system
unit. If you find any problems, solve them; then try to run the service
aid again.
Close down and reboot the using system; then try to run the service
aid again.
Run diagnostics from diskette or CD-ROM to isolate the problem. If
you do not find a problem, the operating system might have failed.
SSA02 An unknown error has occurred. Take one of the actions described here:
Run diagnostics in the problem determination mode to the system
unit. If you find any problems, solve them; then try to run the service
aid again.
If diagnostics fail or if the same problem occurs when you try the
service aid again, run the diagnostics from diskette or CD-ROM to
isolate the problem. If you do not find a problem, the operating
system might have failed.
SSA03 The service aid was unable to open an hdisk. This problem might have
occurred because a disk drive module has failed or has been removed
from the system. Take the actions described here:
1. Use the Configuration Verification service aid (see “Configuration
Verification Service Aid” on page 3-77) to determine the location code
of the SSA adapter to which the hdisk is attached. (For example, if
the location code of the hdisk is 00-03-L, the location code of the SSA
adapter is 00-03.)
2. Run the Link Verification service aid (see “Link Verification Service
Aid” on page 3-73) to the SSA adapter.
3. If a link failure is indicated by the service aid, go to “SSA
Maintenance Analysis Procedures (MAPs)” on page 2-35.
4. If no link failures are indicated, run the diagnostics in the system
verification mode to each pdisk that is attached to the SSA adapter.
If you have a problem with an SSA link, use the Link Verification service aid (see
“Link Verification Service Aid” on page 3-73). The following examples show various
loops and the associated information that is displayed by the Link Verification service
aid.
Example 1. Normal Loops In Figure 3-5, eight disk drive modules are connected
to connectors A1 and A2 of the SSA adapter. Five disk drive modules are
connected to connectors B1 and B2 of the same SSA adapter.
Note: Scroll the display to see all the connected disk drive modules.
In Figure 3-6, eight disk drive modules should be connected to connectors A1 and
A2 of the SSA adapter. However, the loop is broken because the SSA cable has
been disconnected from connector A2. Four disk drive modules are connected to
connectors B1 and B2 of the same SSA adapter.
Although the broken loop is reported as an error, all the disk drive modules can still
communicate with the using system. The eight disk drive modules can communicate
through connector A1 of the SSA adapter . The five disk drive modules can
communicate through connectors B1 and B2 of the same SSA adapter (normal loop).
Although the missing disk drive module is reported as an error, all the remaining disk
drive modules can still communicate with the using system. Two disk drive modules
can communicate through connector A1 of the SSA adapter. Five disk drive
modules can communicate through connector A2 of the SSA adapter. Five disk
drive modules can communicate through connectors B1 and B2 of the same SSA
adapter (normal loop).
Note that the missing disk drive module (pdisk2) is represented by a line of question
marks.
The physical location of a device (for example, a disk drive module or an SSA
adapter) cannot be reported directly by the using system because of the way in
which the SSA interface works. The address of an SSA device is related to the
position of that device on the SSA loop. The address can, therefore, change if the
configuration is changed.
To help you to find the correct physical disk drive, the SSA service aids include an
identify function. This function, when selected, causes the check light of the selected
disk drive module to flash. For more details, see “SSA Service Aids” on page 3-64.
Some devices (for example, adapters) do not have check lights. To find such a
device, you can either use the identify function to identify devices that are next to the
SSA adapter on the SSA link, or use the procedure described in “Finding the Device
When No Service Aids Are Available.”
When no service aids are available, you must find the device by using the port (P)
and SSA-address (AA) values that are provided by some service request numbers
(SRNs). Examples of these SRNs are 43nnn, 44nnn, and 45nnn, where nnn
represents the PAA values.
The port (P) value can be a number from 0 to 3, which indicates the port connector
on the SSA adapter:
0 = Connector A1
1 = Connector A2
2 = Connector B1
3 = Connector B2
The AA value is the decimal SSA-address value. It indicates the position of the
device that you are trying to find (counted along the SSA link).
Use the port value to locate the relevant connector on the SSA adapter, then follow
the SSA cable to the first disk drive module or adapter in the same SSA loop. Do
not include dummy disk drive modules.
Updates to the microcode are loaded into the using system from diskettes. If the
level of the microcode that is stored in the using system is higher than the level of
the microcode that is installed on the SSA adapter or the disk drive modules, the
higher-level microcode is automatically downloaded to the adapter and disk drive
modules when the using system runs the device configuration method.
For some problems, the service request number (SRN) might ask you to check the
microcode package ID before you exchange any field-replaceable units (FRUs). You
can determine the package ID for the adapter in either of the following two ways:
On the command line, type the following command and press Enter:
lsattr -E -l <adapter> -a ucode
where adapter is the ID of the adapter that you want to check; for example, ssa0.
The response to this command is:
ucode 8F97.ðð.nn Name of adapter download False
where nn is the adapter code package ID.
Use the Display or Change Configuration or Vital Product Data (VPD) service aid
to display the VPD for the adapter. The first two characters of the ROS Level
field contain the adapter code package ID.
To determine the ID of the microcode package that is being used on a disk drive
module, use the Display or Change Configuration or Vital Product Data (VPD)
service aid to display the VPD for the disk drive module. The first two characters of
the ROS Level field contain the ID of the microcode package for the disk drive
module.
Note: During the configuration of the complete system, all the VPD files in the
system are updated before any microcode is downloaded from the using system to
the 6-pack. If the using system later downloads a new level of microcode to the
subsystem, the VPD for the adapter or disk drive module do not show the ID of the
new microcode package until the next time the configuration manager command
(cfgmgr) is run.
The vital product data (VPD) for the system unit and for the SSA adapter can be
displayed by using the using-system service aids. This section shows the types of
information that are contained in the VPD.
SSA Adapter
Part number Adapter card FRU part number
Serial number Adapter card serial number
Engineering change level Adapter card engineering change level
Manufacturing location Manufacturer and plant code
ROS level and ID Version of ROS code loaded on the adapter
Loadable microcode level Version of loadable code needed for the satisfactory
operation of this card
The system diagnostics can be booted either from the default boot list or from the
service mode boot list. (The service mode boot list can be modified using the
system diagnostics "Display or Change Bootlist" service aid. The service mode boot
list cannot be modified under the System Management Services.)
To load Standalone Diagnostics from the default boot list, perform the following
procedure:
1. Verify with the system administrator and users that all programs may be stopped,
then do so.
2. Turn off the system.
3. Wait 30 seconds, and turn on the system.
4. Immediately insert the diagnostic CD-ROM into the CD-ROM drive.
5. When or after the keyboard indicator appears during startup, press the F5 key on
a directly-attached keyboard (or the number 5 key on an ASCII terminal).
6. Enter any requested passwords.
Online Diagnostics can be loaded by booting from the service mode boot list by
pressing the F6 key (on a directly-attached keyboard) or the number 6 key on an
ASCII terminal.
The procedure for booting from the devices listed in the service mode boot list
(Online Diagnostics) is the following:
1. Verify with the system administrator and users that all programs may be stopped,
then do so.
2. Turn off the system.
3. Wait 30 seconds, and turn on the system.
4. When or after the keyboard indicator appears during startup, press the F6 key on
a directly-attached keyboard (or the number 6 key on an ASCII terminal).
5. Enter any requested passwords.
After any requested passwords have been entered, the system attempts to boot from
the first device of each type found on the list. If no bootable image is found on the
first device of each type on the list, the system does not search through the other
devices of that type for a bootable image; instead, it polls the first device of the next
type.
The System Management Services make it possible for you to view information
about your computer and to perform such tasks as setting passwords and changing
device configurations.
If you have chosen a graphical display as your system console, you can use the
graphical System Management Services described below. If you are using an ASCII
display as your system console, see “Text-Based System Management Services” on
page 5-24.
After the logo is displayed, initialization icons appear across the bottom of the
screen.
To enter the Open Firmware command line, you must press the F8 key after the
keyboard icon appears during startup.
If you have pressed the F8 key, the Open Firmware command line (an "OK" prompt)
appears after the initialization and power-on self test (POST) are complete.
The Open Firmware command line is used to set up adapters that are not
configurable with the System Management Services. Your adapter documentation
directs you to use this option if it is needed.
To exit from the Open Firmware command enter reset-all or power the system
down and reboot.
After the System Management Services starts, the following screen appears.
Selecting this icon makes it possible for you to view information about the setup of
your system unit. A list similar to the following appears when you select the Config
icon.
This selection invokes the Mulitboot menu which provides several functions:
To Select a particular operating system to boot.
In the case of AIX this is a supported option, if you receive a an informational
icon after making this selection this would mean that information in non-volatile
storage could have been lost, as would happen if the battery had been removed.
In order to recreate this value, issue the bootlist command under AIX with the
appropriate parameters as to the location of the operating system in a particular
hard disk. Please see the explanation of the bootlist command in your AIX
documentation.
To select a particular operating system as the default operating system
To install from the list of possible devices
To set the sequence in which devices are searched for operating system startup
code
To go to the Open Firmware command prompt
To set the Multiboot menu so that it comes up immediately as the system powers
up
The Select Software option, if supported by the operating system, shows the names
of the operating system installed. This option may not be supported by all operating
systems.
In the case of AIX this is a supported option, if you receive a message saying that:
No Operating System Installed
The Software Default option, if supported by the operating system, lets you select the
default operating system to start the system. This option may not be supported by
all operating systems.
The Install From option produces a list of devices, for example the CD-ROM, where
the operating system is installed from. You select one of the devices and the system
searches the device for an operating system to install and if supported by the
operating system in that device, the name of the operating system displays.
The Select Boot Device option provides a list of devices that can be selected to be
stored on the boot list. Up to 5 devices are supported.
The OK Prompt procvides access to the Open Firmware command prompt. The
Open Firmware command prompt is used for debug purposes, and device driver
development. Information about the commands that are available in the IEEE
Standard 1275.
The Multiboot Start option toggles between OFF and ON and selects if the Multiboot
menu invokes automatically on startup or not.
Boot
This selection enables you to view and change the custom boot list (the sequence in
which devices are searched for operating system startup code).
You can choose 1 to 5 devices for the custom boot list. To change the boot
sequence back to the default values, select Default. (The default sequence is
automatically saved.)
Selecting this icon enables you to perform various tasks and view additional
information about your computer.
Error Log: Enables you to view and clear error log information
for your computer.
When you select the Entry icon, a screen with 8 empty boxes appears. Type your
password in these boxes. You can use any combination of up to eight characters
(A–Z, a–z, and 0–9) for your password. As you type a character, a key appears in
the box.
If you make a mistake, press the Esc key and start again.
After you have entered and verified the password, the power-on password icon
flashes and changes to the locked position to indicate that the power-on password is
set.
If you previously had set a power-on password and want to remove it, select the
Remove icon.
Note: If you forget the power-on password, you have to remove the battery for at
least 30 seconds to disable this password. See “Battery” on page 6-38 for
details.
Attention: After you remove the battery, you must reconfigure your system (bootlist,
IP parameters, console information, support processor, etc).
A password can be set only after the system is turned off and on again.
If you remove the power-on password, the remote mode is automatically reset, and
the icon returns to Remote <Off>.
When you select the Entry icon, a screen with 8 empty boxes appears. Type your
password in these boxes. You can use any combination of up to eight characters
(A–Z, a–z, and 0–9) for your password. As you type a character, a key appears in
the box.
Press Enter when you are finished; you are required to type the password again for
verification.
If you make a mistake, press the Esc key and start again.
Selecting this icon displays the log of errors your computer has encountered during
operations.
Selecting the Remote Initial Program Load (RIPL) icon above gives you access to
the following selections.
Notes:
1. Some applications may require that IP addresses contain leading zeroes for
numbers less than 100. For example, 129.132.4.20 may need to be entered as
123.132.004.020.
If any of the addresses is incomplete or contains a number other than 0 to 255, an
error message is displayed when you select the Save icon. To clear this error,
change the improper address and select Save again.
Note: You should save a copy of your changes in case you need to replace your
battery.
After choosing the Ping option, you may be asked to indicate which communications
adapter (Token Ring or Ethernet) should be used to send the test transmission.
The next screen allows you to select the configuration of the adapter you have just
chosen (or allow the system to select the configuration automatically). As on the
previous screen, use the arrow keys to highlight, press the space bar to select, then
highlight the OK icon and press enter.
This selection allows you to view and change the addresses (IDs) of the SCSI
controllers attached to your system unit. To change an ID, highlight the entry by
moving the arrow keys, then use the space bar to scroll through the available ID's.
After you have entered the new address, use the arrow keys to highlight the Save
icon and press the Enter key.
At any time in this process, you can select the Default icon to change the SCSI IDs
to the default values.
This selection allows you to update the service processor and system firmware in
your system unit from an image on a 3.5 inch diskette. After choosing to update the
firmware and confirming this choice, you must insert the diskette containing the
firmware image.
ATTENTION: While the new firmware image is being copied into your system unit,
you must not turn off the system unit. Turning off the system unit during the
update process may render your system unit inoperable.
If the flash update does not complete successfully or the system unit does not restart
after the flash update, contact your authorized reseller or marketing representative.
The text-based Open Firmware command line and System Management Services are
available if an ASCII terminal is attached to your system unit. The text-based Open
Firmware command line allows you to configure some adapters, and the System
Management Services makes it possible for you to view information about your
system unit and to perform such tasks as setting passwords and changing device
configurations.
To enter the Open Firmware command line, you must press the 8 key after the
keyboard text symbol appears during startup.
If you have pressed the 8 key, the Open Firmware command line (an "OK" prompt)
appears after the initialization and power-on self tests (POST) are complete.
The Open Firmware command line can be used to set up adapters that are not
configurable with the System Management Services. Your adapter documentation
directs you to use this option if it is needed.
To exit from the Open Firmware command enter reset-all or power the system
down and reboot.
To start the text-based System Management Services instead of the Open Firmware
command line, press 1 on the ASCII terminal keyboard when the keyboard text
symbol appears during startup.
à ð
System Management Services
1. Display Configuration
2. Multiboot
3. Utilities
4. Select Language
.------.
|X=Exit|
------'
====>
á ñ
After you have finished using the text-based System Management Services, entering
x (for exit) boots your computer.
This option provides information about the setup of your computer. A screen similar
to the following is displayed.
à ð
Device
PowerPC 6ð4
L2-Cache, ð512K
Memory
slotA=8MB
slotB=8MB
LPT
addr=3BC
COM
addr=3F8
COM
addr=2F8
Audio
Keyboard
Mouse
Diskette
addr=3Fð
Integrated Ethernet
addr=8ððð5AF67BD
SCSI cntlr id=7
PCI Adapters
á ñ
Note that this configuration information does not include ISA adapters installed in the
system unit.
à Multiboot Menu ð
1. Select Software
2. Software Default
3. Install From
4. Select Boot Devices
5. OK Prompt
6. Mulitboot Startup <OFF> (or <ON>)
.------.
|X=Exit|
------'
===>
á ñ
The Select Software option, if supported by the operating system, shows the names
of the operating system installed. This option may not be supported by all operating
systems.
In the case of AIX this is a supported option, if you receive a message saying that:
No Operating System Installed
this would mean that information in non-volatile storage could have been lost, as
would happen if the battery had been removed. In order to recreate this value issue
the bootlist command under AIX with the appropriate parameters as to the location of
the operating system in a particular Hard disk. Please see the explanation of the
bootlist command in your AIX documentation.
The Software Default option, if supported by the operating system, lets you select the
default operating system to start the system. This option may not be supported by
all operating systems.
The Install From option produces a list of devices, for example the CD-ROM, where
the operating system is installed from. You select one of the devices and the system
searches the device for an operating system to install and if supported by the
operating system in that device, the name of the operating system displays.
The Select Boot Device option provides a list of devices that can be selected to be
stored on the boot list. Up to 5 devices are supported.
The Multiboot Start option toggles between OFF and ON and selects if the Multiboot
menu invokes automatically on startup or not.
This selection enables you to view and change the custom boot list, which is the
sequence of devices read at startup time.
.------.
|X=Exit|
------'
====>
á ñ
Selecting the Display Current Settings option lists the current order of devices in the
boot list. The following screen shows an example of this display.
.------.
|X=Exit|
------'
===>
á ñ
à ð
Configure Nth Boot Device
1 1 Diskette
2 2 Ethernet
3 3 SCSI CD-ROM
4 4 SCSI 4.5GB Hard Disk
5 -
===>
á ñ
The Utilities screen enables you to select from the following system management
tools.
à ð
Utilities
.------.
|X=Exit|
------'
===>
á ñ
Set Password and Unattended Start Mode: Entering this selection permits
access to the following options.
à ð
Password Utilities
.------.
|X=Exit|
------'
===>
á ñ
If you previously had set a power-on password and wanted to remove it, select
Remove Power-On Password.
Note: If you forget your password, you have to shut down the computer and
remove the battery for at least 30 seconds to disable the password. See
“Battery” on page 6-38 for details.
A password can be set only after the system is turned off and on again.
If you previously had set a privileged-access password and want to remove it, select
Remove Privileged-Access Password.
Note: If you forget your password, you have to shut down the computer and
remove the battery for at least 30 seconds to disable the password. See
“Battery” on page 6-38 for details.
Audio: Selecting this utility turns on or off the system tones heard at power-on
time.
Remote Initial Program Load Setup: This option allows you to enable and set
up the remote startup capability of your computer. First, you are asked to specify the
network parameters.
à ð
Network Parameters
1. IP Parameters
2. Adapter Parameters
3. Ping
.------.
|X=Exit|
------'
===>
á ñ
Note: Some applications may require that IP addresses contain leading zeroes for
numbers less than 100. For example, 129.132.4.20 may need to be entered as
123.132.004.020.
à IP Parameters ð
1. Client IP Address ððð.ððð.ððð.ððð
2. Server IP Address ððð.ððð.ððð.ððð
3. Gateway IP Address ððð.ððð.ððð.ððð
4. Subnet Mask ððð.ððð.ððð.ððð
.------.
|X=Exit|
------'
===>
á ñ
Selecting the Adapter Parameters option allows you to view an adapter's hardware
address as well as configure network adapters that require setup.
à ð
Adapter Parameters
Device HW Address
1. 3Com,3C9ð5 8ððð5AFC67BD
2. Token-Ring 8ððð32E54A12
.------.
|X=Exit|
------'
á ñ
à ð
3Com Etherlink Fast XL
.------.
|X=Exit|
------'
á ñ
Selecting the Media Type option allows you the change the media employed by the
Ethernet adapter:
à ð
.----------------.
| MEDIA TYPE |
| 1. 1ð BaseT |
| 2. 1ðð Base TX |
| 3. Auto |
----------------'
á ñ
à ð
.----------------.
| Full Duplex |
| 1. Yes |
| 2. No |
| 3. Auto |
----------------'
á ñ
Ping, the last option available from the Network Parameters menu, allows you to test
a connection to a remote system unit. After selecting the Ping option, you must
choose which adapter communicates with the remote system.
à ð
Interface
1. Ethernet (Integrated)
2. Token Ring (Slot=3)
===>
á ñ
à Ping ð
1. Client IP Address 129.132.4.2ð
2. Server IP Address 129.132.4.1ð
3. Gateway IP Address 129.132.4.3ð
4. Subnet Mask 255.255.255.ð
.---------. .------.
|E=Execute| |X=Exit|
---------' ------'
===>
á ñ
Display Error Log: A screen similar to the following is displayed when you select
this option. Here, you can view or clear your computer's error log.
à ð
.----------------------------------------------------------.
| Error Log |
| |
| Date Time ErrorCode Location |
| Entry 1. 96/ð4/ð1 12:13:22 25A8ðð11 ðð-ðð |
| Entry 2. no error logged |
----------------------------------------------------------'
.-----------------. .------.
|C=Clear Error Log| |X=Exit|
-----------------' ------'
á ñ
Change SCSI ID: This option allows you to view and change the addresses of the
SCSI controllers attached to you computer.
à ð
.--------------------------------------------------------------.
| This selection will update your System Firmware. Do you want |
| to continue? Press Y(Yes) N(No). |
--------------------------------------------------------------'
á ñ
Select Console: Selecting this option allows you to define which display is used
by the system for system management.
This option allows you to change the language used by the text-based System
Management Services screens.
à ð
SELECT LANGUAGE
1. English
2. Francais
3. Deutsch
4. Italiano
5. Espanol
6. Svenska
====> .------.
|x=Exit|
------'
á ñ
Note: Your ASCII terminal must support the ISO-8859 character set in order to
properly display languages other than English.
DANGER
CAUTION:
This product is equipped with a 3-wire power cable and plug for the user's
safety. Use this power cable in conjunction with a properly grounded electrical
outlet to avoid electrical shock.
Attention: Adapters, planars, diskette drives, and disk drives are sensitive to static
electricity discharge. These devices are wrapped in antistatic bags, as shown in this
illustration, to prevent this damage.
Removal
1. Open the door.
3. Remove all media (diskettes, CDs, optical discs, or tapes) from drives; then turn
the power off to the system and all attached devices.
4. Shutdown the operating system.
5. Turn off power to the system and all attached options.
4. Reinstall the door at the front of the system by aligning the pins with the hinges,
and sliding the door down.
Do not attempt to open the covers of the power supply. Power supplies
are not serviceable and are to be replaced as a unit.
Removal
1. If you have not already done so, remove the covers as described in “Covers” on
page 6-3.
2. Disconnect the two cables from I/O Planar connectors P1, and P2.
3. Disconnect the four cables from the System Card connectors J1, J2 J3, and J4.
4. Disconnect the cables from the Processor Cards.
8. Remove the power supply from the system. Slide the power supply forward,
toward the front. Lift it up and out of the system.
CAUTION:
Do not open the drive; no user adjustments or serviceable parts are inside.
Removal
1. If you have not already done so, remove the covers as described in “Covers” on
page 6-3.
2. Disconnect the power and signal cables to the drive you are removing.
3. Slide the drive forward to remove.
Replacement
Removal
1. If you have not already done so, remove the covers as described in “Covers” on
page 6-3.
2. Remove the screw from the backplane.
If you installed a SCSI backplane, check the jumpers on the new backplane for
proper setting. The jumpers should be set as shown.
If you installed an SSA backplane, be sure to connect the internal SSA cable
between the backplane and the internal connectors on the external SSA connector.
Removal
1. If you have not already done so, remove the covers as described in “Covers” on
page 6-3.
2. Remove the I/O planar cover as described in on page 6-8.
3. Locate the adapter connectors and determine which adapter you want to remove.
Replacement
1. With one hand, touch any metal surface of the chassis to minimize static
electrical charges, and then pick up the adapter.
2. Locate the adapter connector location on the I/O planar.
Removal
1. If you have not already done so, remove the covers as described in “Covers” on
page 6-3.
2. Remove the I/O planar cover as described on page 6-8; then return here to
determine your next step.
3. Remove the memory adapter card shield.
4. The card is secured in place with card retainer hooks, one on each end of the
card. Open the card retainer hooks and remove the card.
Attention: To prevent damage to the card and the card connectors, open and
close the retainer hooks at the same time.
.
9. Replace any parts previously removed in reverse order.
10. Replace the covers as described in “Covers” on page 6-3.
Removal
1. If you have not already done so, remove the covers as described in “Covers” on
page 6-3.
2. Remove the I/O planar cover as described on page 6-8.
3. Remove the memory card as described in “Memory Cards” on page 6-26.
4. Locate the memory module connectors and determine which module you want to
remove.
Replacement
1. With one hand, touch any metal surface of the chassis to minimize static
electrical charges, and then pick up a memory module.
2. Locate the memory module connector location on the memory card.
3. Install the new memory module.
Replacement
Verify that the security jumpers (J16) are in the correct position (same as the
replaced I/O planar).
Removal
1. If you have not already done so, remove the covers as described in “Covers” on
page 6-3.
2. Remove the I/O planar cover as described on page 6-8.
Attention: Make sure the power cord is unplugged and the power (Power LED)
is OFF before removing the CPU card.
3. Remove the power cable from the CPU card by squeezing the tabs.
4. The card is secured in place with card retainer hooks, one on each end of the
card. Open the card retainer hooks and remove the card.
Removal
1. If you have not already done so, remove the covers as described in “Covers” on
page 6-3.
2. Remove the I/O planar cover as described on page 6-8.
Attention: Make sure the power cord is unplugged and the power LED is OFF
before removing the service processor.
3. Remove the service processor.
Replacement
To replace the service processor, perform the removal steps in the reverse order.
Removal
1. If you have not already done so, remove the covers as described in “Covers” on
page 6-3.
2. Remove the I/O planar cover as described on page 6-8.
3. Locate the battery on the I/O planar.
Replacement
Tilt the battery so that you can insert it into the front of the socket. As you slide the
battery toward the front of the server, push it down into the socket.
Notes:
1. Be sure to tell the customer that the time and date need to be reset.
If running AIX the bootlist command may have to be run from AIX while logged in
as root, this would restore the custom boot list and allow the option of Select
Software under multiboot to start the operating system. Other parameters that
may have to be rebuilt are the IP parameters.
Removal
1. Do the cover removal procedure in “Covers” on page 6-3.
2. If removing one of the two upper fans, remove the adapter cards, remove the
three screws from the adapter card bracket and remove the bracket.
Note: The fans are numbered from bottom to top as fan 1, fan 2, fan 3, and fan
4. So the two upper fans are fans 3 and 4.
Removal
1. If you have not already done so, remove the front covers as described in
“Covers” on page 6-3.
2. Press the snap in detent on the right side of the display card mounting bracket
and remove the display card.
3. Rotate the display forward and out of the mounting bracket.
4. Disconnect the display cable from the connector on the display assembly.
Replacement
Replace in reverse order. Ensure that the grounding clips located inside the
mounting bracket are not bent or damaged and that they are positioned correctly.
Removal
1. Do the cover removal procedure in “Covers” on page 6-3.
2. Remove the assembly by compressing the spring clip at the top and bottom of
the assembly and pulling out toward the rear of the system.
3. Unplug the LCD cable from the assembly.
4. Unplug the control card cable from the I/O planar.
5. Swap the VPD module (U2 on the Operator Panel Control Assembly) from the
old assembly to the new one. In the unlikely event that the old VPD module is
determined to be the problem, return the new one to the system and call
technical support for recovery instructions.
Replacement
Removal
1. If you have not already done so, remove the covers as described in “Covers” on
page 6-3.
2. Remove the I/O planar cover as described on page 6-8.
3. Remove the external serial and parallel connectors from the rear of the system.
4. Remove the cables from the I/O planar connectors J41, J47, and J50.
5. Remove the screws that attach the serial/parallel card to the system.
Replacement
1 2 3 4
5 6 7
8 9 10 11
Attention: Disk drive modules are fragile. Handle them with care. Follow all
ESD-sensitive parts procedures when handling disk drive modules. For ESD
information, see “Handling Static–Sensitive Devices” on page 6-2.
The disk drive module POSTs start each time the module is switched on or when a
Send Diagnostic command is received from the SSA adapter. They check whether
the disk drive module is working correctly. The POSTs also help verify a repair after
a FRU has been exchanged.
POST-1 runs immediately after the ‘power-on reset’ line goes inactive, and before
the disk drive module motor starts. POST-1 includes tests of the:
Microprocessor
ROM
Safety circuits
If POST-1 fails, the disk drive module check light stays on, and the disk drive module
is not configured into the SSA network.
POST-2 runs after the disk drive module motor has started. POST-2 includes tests
of:
Motor control
Servo control
Read and write on the diagnostic cylinder (repeated for all heads)
Error checking and correction (ECC)
If POST-2 completes successfully, the disk drive module is ready for use by the
using system.
Two power-on self-tests (POSTs) are resident in the SSA adapter. The tests are
POST-1 and POST-2.
POST-1 tests all the functions that are necessary to enable the adapter to
communicate with the Micro Channel. POST-1 can fail for either of two reasons:
A hardware error has been detected. In such instances, the POST code enters
a tight loop, and does not put the identification of the SSA adapter into the
programmable option select (POS) registers. If this error occurs, the SSA
adapter must be exchanged for a new one.
The flash erasable programmable read-only memory (EPROM) has a check sum
that is not valid. This error can be caused if the power fails while microcode is
being downloaded. In such instances, the POST checks all the hardware that is
needed to download the microcode. If all the hardware is correct, the POST sets
the ROS level to zero, puts the identification of the SSA adapter into POS 0-1,
and puts error data into the adapter status register.
To recover from this type of error, microcode must be downloaded to the SSA
adapter. If the using system can have an initial program load (IPL) without the
adapter, the configuration code detects the down-level ROS code and downloads
the latest level of code. The configuration code then uses control register bit 7 to
reset the adapter and restart POST-1.
If no error is detected during POST-1, the identification of the SSA adapter is put into
the POS registers, and POST-2 is started.
POST-2 tests the remaining hardware on the SSA adapter card and tests the other
FRUs that are attached to the adapter. If this test fails, an error code is saved and
sent to the using-system error log when the error logger becomes available. An
internal health check continues to send the error code at regular intervals.
Service request numbers (SRNs) are generated for the SSA Hot-Swap Disk Drive by
the system error-log analysis, system configuration code, diagnostics, and customer
problem-determination procedures. SRNs help you to identify the cause of a
problem, the failing field-replaceable units (FRUs), and the service actions that might
be needed to solve the problem.
Table A-1 on page A-6 lists the SRNs and describes the actions you should
perform. The table columns are:
SRN The service reference number.
FRU list The FRU or FRUs that might be causing the problem (see also “FRU
Names Used in the SRN Table” on page A-5).
% How likely it is (by percentage) that the FRU is causing the problem.
Problem A description of the problem and the action you must take.
Some SRNs indicate that a problem might have been caused by a software error or
by a microcode error. If you have one of these SRNs, perform the following actions:
1. Make a note of the contents of the error log for the device that has the problem.
2. Go to the using-system service aids and select Display Vital Product Data to
display the VPD of the failing system. Make a note of the VPD for all the SSA
adapters and disk drive modules.
3. Report the problem to your support center. The center can tell you whether you
have a known problem and can, if necessary, provide you with a correction for
the software or microcode.
SRN 48000 shows that the SSA loop contains more devices or adapters than are
allowed. The following condition exist: the loop contains more than 48 devices.
If the SRN occurred when either you or the customer switched on the using system:
1. Switch off the using system.
2. Review the configuration that you are trying to make, and determine why that
configuration is not valid.
3. Review “Rules for SSA Loops” on page 3-61.
4. Correct your configuration by reconfiguring the SSA cables or by removing the
excess devices or adapters from the loop.
5. Switch on the using system.
If the SRN occurred because additional devices or adapters were added to a working
SSA loop:
1. Remove the additional devices or adapters that are causing the problem, and put
the loop back into its original, working configuration.
Note: It is important that you perform these actions, because they enable the
configuration code to reset itself from the effects of the error.
2. Review “Rules for SSA Loops” on page 3-61.
3. Review the configuration that you are trying to make, and determine why that
configuration is not valid.
4. Correct your system to a valid configuration.
The Service Processor menus enable you to configure Service Processor options
and to enamle and disable functions.
Service Processor menus are available using an ASCII terminal when the server is
powered off and the Service Processor is operating with standby power. Service
Processor menus are also available when server power is on and the Service
Processor has detected a server problem (such as loss of surveillance).
During the first power up (i.e. power cord is plugged into the outlet), Service
Processor menus are not available for 45 seconds while the Service Processor is
running self-tests and initializing the server. If the server powers down, Service
Processor menus become available after 15 seconds.
For a summary of the Service Processor functions and the methods for invoking
them, see the table on page B-2.
The following section describes these two groups of menus, how to access them,
and the functions associated with each option.
When the server is powered down, the Service Processor menus may be accessed
locally or remotely.
Menu Inactivity
To prevent loss of control in unstable power environments, the Service Processor
leaves the menu mode after 5 minutes of inactivity. Menus may be resumed by
pressing any key on the terminal, local or remote.
With a terminal or terminal emulator, call the server's modem. The Service
Processor prompts you for a password (if set), and when verified, displays the
Service Processor menus.
The menu options presented to the General user are a subset of the options
available to the Privileged user. The user must know the General Access Password
in order to access these menus.
à ð
GENERAL USER MENU
1. Power-On System
2. Read VPD
á ñ
Power-On System
Allows the user to power-on the system.
Read VPD
Displays manufacturer vital product data, such as serial numbers, part numbers,
etc.
The following menus are available to privileged users only. The user must know the
Privileged Access Password in order to access these menus.
MAIN MENU
You need the firmware version for reference when you either update or repair the
functions of your service processor.
The System Name, an optional field, is the name your server reports in problem
messages. This name helps your support team, (for example, your system
administrator, network administrator, or service representative) to more quickly
identify the location, configuration, and history of your server. The System Name is
set from the Main Menu using option 6.
à ð
Service Processor Firmware
EPROM: 1997ð915
FLASH: 1997ð512
Copyright 1997, IBM Corporation
SYSTEM NAME
MAIN MENU
á ñ
à ð
Service Processor SETUP MENU
á ñ
Note
Unless otherwise stated in menu responses settings become effective when a
menu is exited using option 98 or 99.
For security purposes, the Service Processor counts the number of attempts to enter
correct passwords. The results of not recognizing a correct password within this
error threshold are different, depending on whether the attempts are being made
locally (at the server) or remotely (via a modem). The error threshold is 3.
If the error threshold is reached by someone entering passwords at the server, the
Service Processor exits the menus. This action is taken based on the assumption
that the server is in an adequately secure location with only authorized users having
access. Such users must still successfully enter a login password to access AIX.
The following illustrates what you can access with the Privileged Access Password
and the General Access Password.
à ð
OS Surveillance Setup Menu
1. Surveillance:
Currently Disabled
3. Surveillance Delay:
Currently 1ð
á ñ
– Surveillance
May be set to Enabled or Disabled.
– Surveillance Time Interval:
May be set to any number from 2 to 255 minutes.
– Surveillance Delay:
May be set to any number from 0 to 255 minutes.
Refer to “Service Processor System Monitoring - Surveillance” on page B-29 for
more information about surveillance.
Reset Service Processor - Allows the user to reinitialize the Service Processor.
Reprogram Service Processor Flash EPROM - This is an automatic process.
An update diskette can be acquired from the RS/6000 Support page on the
Internet or from your service team. The Internet address is:
http://www.rs6ððð.ibm.com/support/micro
The update files must be downloaded to a DOS-formatted diskette. You should
format the diskette just prior to receiving the download, to be sure it is clean.
Reformatting is worth the time and effort. When this process is selected, you are
prompted to place the diskette into the drive and to indicate to the system that
the diskette in available and the automatic process starts. If the system has other
needs, you are prompted. See “Service Processor Firmware Updates” on
page B-33 for additional information and update methods.
à ð
SYSTEM POWER CONTROL MENU
4. Power-On System
5. Power-Off System
á ñ
Enable/Disable Unattended Start Mode
This option may be used to instruct Service Processor to immediately power-on
the server after a power failure, bypassing power-on password verification.
Unattended Start Mode can also be set via SMS Menus. It is intended to be
used on servers that require automatic power-on after a power failure.
Ring Indicator Power-On Menu
Ring Indicate Power-On is an alternate method of dialing in, without establishing
a Service Processor session. If the system is powered off, Ring Indicate
Power-On is enabled, the server is powered on at the predetermined number of
rings, If the server is already on, no action is taken. In either case, the telephone
call is not answered. The caller receives no feedback that the server
powered-on. The Ring Indicator Power-On Menu and defaults are shown
below:
à ð
Ring Indicator Power-On Menu
á ñ
à ð
SYSTEM INFORMATION MENU
1. Read VPD
6. Read NVRAM
á ñ
Read VPD
Displays manufacturer's vital product data, such as serial numbers, part
numbers, etc.
Read VPD Image from Last System Boot
Displays the VPD information that was in effect after the last system boot. This
information will usually be identical with the results from the menu selection
"Read VPD," but in the case of configuration updates or certain faults, this
à ð
LANGUAGE SELECTION MENU
1. English
2. Francais
3. Deutsch
4. Italiano
5. Espanol
6. Svenska
á ñ
Note: Your ASCII terminal must support the ISO-8859 character set in order to
properly display languages other than English.
This menu allows selecting languages into which Service Processor and system
firmware menus and messages are displayed.
à ð
CALL-IN/CALL-OUT SETUP MENU
7. Call-Out Test
á ñ
Modem Configuration Menu, see “Modem Configuration Menu.”
Serial Port Selection Menu, see “Serial Port Selection Menu” on page B-17.
Telephone Number Setup Menu, see “Telephone Number Setup Menu” on
page B-18.
Call-Out Policy Setup Menu, see “Call-Out Policy Setup Menu” on page B-20.
Customer Account Setup Menu, see “Customer Account Setup Menu” on
page B-22.
Ring Indicate Power-On Menu, see on page B-12.
The first two lines of the Modem Configuration Menu are status lines showing the
current selections. Selections are made in the two section labeled Modem Ports and
Modem Configuration File Name. Select the serial port that you that you want to
activate and then select the modem configuration file for the modem on the port. If
you wish to set up both serial ports with modems, make your selections one port at a
time.
à ð
Modem Configuration Menu
To make changes, First select the port and then the configuration file
name
Modem Ports:
1. Serial port 1
2. Serial port 2
á ñ
Serial Port Selection Menu
à ð
Serial Port Selection Menu
á ñ
Appendix B. Service Processor Menus B-17
Downloaded from www.Manualslib.com manuals search engine
You may enable and/or disable the call-in and call-out functions of each serial port in
any combination.
à ð
Serial Port Speed Setup Menu
50 600 4800
75 1200 7200
110 1800 9600
134 2000 19200
150 2400 38000
300 3600 57600
115200
This menu may be used to set or change the telephone numbers for reporting a
system failure. The Service Processor allows setting or changing telephone numbers
for:
Service Center Telephone Number: - The telephone number of the maintenance
provider's computer.
Customer Administration Center Telephone Number: - The telephone number of
the local system support provider's computer.
Digital Pager Telephone Number: The number for a pager carried by someone
who will respond to problem calls from your server.
Customer Voice Telephone Number: - The telephone number service personnel
will use to reach the system user.
Customer System Telephone Number: - The telephone number to which the
server's modem is connected.
á ñ
Service Center Telephone Number is the number of the service center
computer. The service center usually includes a computer that takes calls from
servers with call-out capability. This computer is referred to as "the catcher."
The catcher expects messages in a specific format to which Service Processor
conforms. For more information about the format and catcher computers, refer
to the README file in the AIX /usr/samples/syscatch directory. Contact your
service provider for the correct telephone number to enter here. Until you have
that number, leave this field blank.
Customer Administration Center Telephone Number is the number of the
System Administration Center computer (catcher) that receives problem calls
from servers. Contact your system administrator for the correct telephone
number to enter here. Until you have that number, leave this field blank.
Digital Pager Telephone Number is the number for a pager carried by
someone who will respond to problem calls from your server. Contact your
administration center representative for the correct telephone number to enter
here. For test purposes, enter your telephone number here. You can change it
later when testing is complete.
Note: Some modems, such as IBM 7857-017, are not designed for the paging
function. Although they can be used for paging, they will return an error
message when they do not get the expected response from another
modem. Therefore, even though the paging was successful, the error
message will cause the Service Processor to retry, continuing to place
pager calls for the number of retries specified in the Call-Out policy Setup
Menu. These retries result in redundant pages.
Appendix B. Service Processor Menus B-19
Downloaded from www.Manualslib.com manuals search engine
For digital pagers that require a personal Identification Number (PIN) for access,
include the PIN in this field as in the following example:
1 8ðð 123 4567,,,,87654
where the commas create pauses1 for the voice response system, and the
87654 represents the PIN.
Customer Voice Telephone Number is the telephone number of a phone near
the server or answered by someone responsible for the server. This is the
telephone number left on the pager for callback. For test purposes, enter your
telephone number here. You can change it after testing is completed.
Customer System Telephone Number is the telephone number to which your
server's modem is connected. The service or administration center
representatives need this number to make direct contact with your server for
problem investigation. This is also referred to as the call-in phone number.
à ð
CALL-OUT POLICY SETUP MENU
4. Number of retries:
Currently 2
á ñ
Call Out policy may be set to 'first' or 'all'. If call out policy is set to 'first', the
Service Processor will stop at the first successful call out to one of the following
numbers in the order listed:
1. Service Center
2. Customer Admin Center
3. Pager
1 The length of these pauses is set in modem register S8. The default is usually 1 or 2 seconds each.
à ð
Customer Account Setup Menu
á ñ
Customer Account Number is assigned by your service provider for record
keeping and billing. If you have an account number, enter it here. Otherwise,
leave this field blank.
Customer RETAIN Login UserID and Customer RETAIN Login Password
apply to a service function to which your service provider may or may not have
access. Leave these fields blank if your service provider does not use RETAIN.
Reboot describes bringing the system hardware back up from scratch, for example,
from a system reset or power on. The boot process ends when control passes to the
operating system process.
Restart describes activating the operating system after the system hardware
reinitialized. Restart must follow a successful reboot.
à ð
Reboot/Restart Policy Setup Menu
á ñ
Number of reboot attempts if the server fails to successfully complete the boot
process, it attempts to reboot the number of times specified. Entry values equal
to or greater than 0 are valid. Only successive failed reboots attempts count, not
reboots that occur after a restart attempt. At restart, the counter is set to 0.
Use OS-Defined restart policy lets the Service Processor react or not react the
same as the operating system to major system faults, by reading the setting of
the operating system parameter Automatically Restart/Reboot After a System
Crash. This parameter may, or may not be defined depending on the operating
system or its version/level. If the operating system automatic restart setting is
defined, then it may be set to respond to a major fault by restarting or by not
restarting. See your operating system documentation for details on setting up
operating systems automatic restarts. The default value is YES.
Enable supplemental restart policy - The default setting is NO. If set to YES,
the Service Processor restarts the system when the system loses control as
detected by the Service Processor surveillance, and either:
1. The Use OS-Defined restart policy is set to NO
OR
The following section discusses some of the Service Processor functions and
features more fully.
Reboot describes bringing the system hardware back up from scratch, for example,
from a system reset or power on. The boot process ends when control passes to the
operating system process.
Restart describes activating the operating system after the system hardware
reinitialized. Restart must follow a successful reboot.
Failure During Boot Process: During the boot process, either initially after
system power-on or upon reboot after a system failure, the Service Processor (SP)
monitors the boot progress (via surveillance). If progress stops, the Service
Processor can reinitiate the boot process (reboot) if enabled to do so. The Service
Processor can re-attempt this process according to an entry on the Reboot/Restart
Policy Setup Menu.
Failure During Normal System Operation: When the boot process completes
and control transferers to the operating system (OS), the Service Processor can
monitor operating system activity (see the SERVICE PROCESSOR SETUP MENU
item Set Surveillance Parameters). If OS activity stops, the Service Processor can
initiate a reboot/restart process based on the settings in the Service Processor
Reboot/Restart Policy Setup Menu and the OS automatic restart settings (see OS
documentation).
If the operating system is AIX, the menu item under SMIT for setting the restart
policy is Automatically Reboot After Crash (True/False), and the default is False.
When the setting is True, and if the SP parameter, Use OS-Defined Restart Policy, is
Yes (the default), the Service Processor takes over for AIX to reboot/restart after a
Check Stop or Surveillance failure.
If the operating system has no automatic restart policy, or if it is disabled, then the
Service Processor restart policy can be controlled from the Service Processor Menus
by using the Enable Supplemental Restart Policy selection.
When this setting is NO, or if the OS did not set a policy, the Service Processor
refers to Enable supplemental restart policy for its action.
The following provides a more thorough understanding of the relations among the
OS and Service Processor restart controls:
Surveillance is a function in which the Service Processor monitors the system, and
the system monitors the Service Processor. This monitoring is accomplished by
periodic samplings called heartbeats.
If the Service Processor detects no heartbeats during system IPL (for 7 minutes), it
cycles the system power to attempt a reboot. The maximum number of retries is set
from the Service Processor menus. If the fail condition repeats, the Service
Processor leaves the machine powered on, logs an error and offers menus to the
user. If Call-out is enabled, the Service Processor calls to report the failure and
displays the operating system surveillance failure code on the operator panel.
Operating system surveillance is disabled by default. This is to allow the user to run
operating systems that do not support this Service Processor option.
Surveillance will take effect immediately after setting the parameters from the Service
Processor menus.
If operating system surveillance is enabled (and system firmware has passed control
to the operating system), and the Service Processor does not detect any heartbeats
from the operating system, the Service Processor assumes the system is hung. The
machine is left powered on and the Service Processor enters standby phase,
displaying the operating system surveillance failure code on the operator panel. If
Call-out is enabled, the Service Processor calls to report the failure.
The Service Processor can call out (Call-Home) when it detects one of the following
conditions:
System firmware surveillance failure.
Operating system surveillance failure (if supported by Operating System).
Critical environmental failures.
Restarts
Console mirroring allows a person on a local ASCII terminal to monitor the Service
Processor activities of a remote user. Console mirroring ends when the Service
Processor releases control of the serial ports to the system firmware.
System Configuration:
Service Processor
Modem connected to one serial port and enabled for incoming calls
Local ASCII terminal connected to the other serial port. This local terminal may
be connected directly to your server or connected through another modem.
The Service Processor EPROM may need to be updated for two different reasons:
1. The UPDATE (composite) portion of the EPROM has become corrupted.
2. The Service Processor firmware upgrades, without any corruption present.
The use of a Flash EPROM allows updates to occur without physically replacing the
memory.
The firmware in your server can be updated using one of four available initiation
processes:
1. Service Processor initiation
2. SMS Utilities initiation
3. Service Aids initiation
4. AIX initiation.
Each initiation method is described below. In each case, the process prompts you
for your authority and shows the contents of the update media. Verify the file with
which to perform the update, and follow any other instructions that may appear.
After initiation, the processes are identical and automatic.
There are two areas in each firmware module that may need updating:
1. The gold code or base code or EPROM area
2. The custom or main program or FLASH area
Each update file contains matching gold and custom firmware, so it is not possible to
update to a conflicting set.
Before the update process begins, the versions of each of the two areas of the target
firmware module are compared to the versions on the update diskette. Only the
area(s) that need updating are updated. In most cases, only the custom area is
updated.
An update diskette can be acquired from the RS/6000 Support page on the Internet
or from your service team. The Internet address is:
http://www.rs6ððð.ibm.com/support/micro
Updating Firmware from the Service Processor Menus: The Service Processor Menus
are available while the server is powered off. From the Service Processor Main
Menu, select Service Processor Setup to bring up a menu containing the item,
Reprogram Service Processor Flash EPROM. Selecting that item starts the update
process. The process requests the update diskette(s) as needed. After inserting the
first diskette and informing the system, the process continues automatically. If the
process requires user assistance, it is requested.
Updating Firmware from the SMS Utilities: From a powered down or reset condition,
bring up the SMS Utilities and select the menu item for updating firmware. The
process requests the update diskette(s) as needed. After inserting the first diskette
and informing the system, the process continues automatically. If the process
requires user assistance, it is requested.
Additional update and recovery instructions may be provided; also you need to know
the fully qualified path and file name of the flash update image file. If the flash
update image file is on a diskette, you need the AIX DOS utilities package to process
the diskette. You can use the dosdir command to find out the name of the update
image file. This service aid uses the dosread command to put the file into the /var
file system.
Using this service aid to update the system or service processor flash will not save
the current contents of the flash.
The flash image file will be copied in the /var file system. If there is not enough
space in the /var file system for the flash update image file, an error is reported. If
this occurs, exit the service aid, increase the size of the /var file system and retry the
service aid. After the file is copied, a warning screen will ask for confirmation to
continue the flash update. Continuing the flash update will eventually cause the
system to reboot and return to the AIX login prompt. After the system unit reboots,
the file /var/update_flash_image can be removed to conserve space in the /var file
system.
The flash update image file must have already been placed in the /var file system.
This could have been done with a file transfer from another server or with the
dosread command of the AIX DOS Utilities, for example. With the flash update
image in place, issuing the following AIX command
shutdown -u /var/\filename\.img
initiates the update process. Where *filename* is the name of the flash update
image. During the process, the server powers down and reboots. You know the
process is complete when the login prompt reappears.
Service Processor Error Log: The Service Processor error logs contain error
conditions detected by the Service Processor.
à ð
Error Log
Press "C" to clear error log, any other key to continue. >
á ñ
The time stamp in this error log is Coordinated Universal Time (CUT), a.k.a.
Greenwich Mean Time (GMT). AIX error logs have more information available and
are able to time stamp with local time.
à ð
Read System POST Errors
Version : ð
Severity : ð
Disposition : ð
Initiator : ð
Event being reported : ð
Extended Error Log Data:
ðxC2 ðxðð ðx84 ðxð9 ðxðð ðxðð ðxðð ðxðð
ðx2ð ðxðð ðxðð ðxðð ðxðð ðxðð ðx72 ðx74
ðx63 ðxðð ðxðð ðxðð ðxðð ðxðð ðxðð ðxðð
ðxðð ðxðð ðx28 ðxð3 ðxðð ðxð5 ðxðð ðxð1
ðxðð ðxðð ðxðð ðxðð ðxðð ðxðð ðxðð ðxðð
ðx49 ðx42 ðx4D ðxðð ðx55 ðx3ð ðx2E ðx31
ðx2D ðx5ð ðx31 ðx2D ðx58 ðx31 ðxðð
á ñ
These tests include communicating with the server operating system. Be sure the
necessary serial port(s) is configured. If you need assistance, refer to “Serial Port
Configuration” on page C-3.
The server should be powered off as a result of the “Service Processor Setup
Checklist” on page C-1.
Call-In:
1. Go to your remote terminal and call-in to your server. Your server should
answer and offer you the Service Processor Main Menu after requesting your
privileged access password.
2. Enter item 2, System Power Control Menu.
3. Enter item 4, Power-On System.
Answer "y" when asked if you wish to continue powering on the system.
4. After the system firmware and operating system have initialized the server, the
login prompt should appear at your remote terminal if you set up seamless
modem transfer (refer to page D-6 for more information). This may take several
minutes. When this occurs, you have successfully called into the Service
Processor.
5. Type logout to disconnect from the operating system. The message "No
Carrier" displays on your remote terminal.
6. Call your server again. The operating system should answer and offer you the
login prompt.
If these tests are successful, call-in is working.
7. Login in and command your server to shutdown and power off. (The AIX
command shutdown -F will do this.)
8. The message "No Carrier" displays on your remote terminal.
Call-Out:
During the setup, you entered your phone number for the Pager (on page B-19) and
Customer Voice (on page B-20) phone numbers. These numbers are used for this
test.
1. Your remote terminal should be disconnected as a result of the Call-In test.
You now need to go back through the “Telephone Number Setup Menu” on
page B-18 to enter the actual telephone numbers your server will use for reporting
problems.
To configure the serial port on an AIX system, enter the following commands from an
AIX console:
1. Log in as root.
2. To find if you have any serial ports already configured, enter:
lsdev -Cc tty
If no serial ports are configured, none are listed. If you wish to configure serial
ports that are not listed, continue with the remaining steps.
3. Identify the serial port(s) with the modem(s).
4. Enter
smit tty
5. Select add tty
6. Select RS232
7. Select Baud rate 96ðð or higher.
8. Select login enable and set the flow control to RTS.
9. Commit the selections and set up any other needed serial ports.
10. Exit SMIT.
With nearly 1000 modems to choose from, and various programming standards,
configuring a modem for use with the Service Processor can be challenging. The
Service Processor is designed to place little demand on an attached modem, thereby
increasing the setup and connection success rates. Several sample modem
configurations files are supplied that will either work directly with your modem, or
provide a good starting point for a custom setup, if required. These files are included
on the Sample Modem Configuration Files diskette and in your Service Processor
firmware with the following names:
With the following selection procedures and your modem manual, one of these
configuration files should be suitable for your use.
Examples
You can create your own modem configuration file(s) or modify the samples
provided. After you customize your modem configuration files, you MUST access
them via the Configure Remote Maintenance Policy Service Aid rather than from the
Service Processor menus.
1 Some older modems do not respond to the commands Xð or &R1. You should edit out these
commands from the modem configuration file if yours is such a modem. See your modem manual for
more information.
If you have not already set up your serial ports, line speeds, authorizations and
telephone numbers from the Service Processor menus, you may set them up with
the service aids while you specify your customized modem configuration files.
To disable Auto-Reliable Mode testing of the remote modem, use the sample modem
configuration file /usr/share/modems/modem_f.cfg as a model to modify.
1. Find the necessary command in your modem manual.
2. Copy /usr/share/modems/modem_f.cfg to make a new file with the name (for
example) modem_fx.cfg.
3. In modem_fx.cfg, change the line Send "ATEðT\r" to Send "ATcccEðT\r" where
ccc is the added command found in your modem manual.
Make this change to the third line of each of the following stanzas:
condout
condin
ripo
4. Save the changes.
Use this new modem configuration file as described at the top of this section.
Xon/Xoff Modems
Some modems of the mid-80's vintage assume software flow control (Xon/Xoff)
between the computer and the modem. Modems with this design send extra
characters during and after the transmitted data. The Service Processor cannot
tolerate these extra characters. If your configuration includes such a modem, your
functional results may be unpredictable.
The sample modem configuration files included in this package do not support these
modems, so custom configuration files would be necessary. Anchor Automation
2400E is an example of such a modem.
Most newer modems do not use this design. It is recommended you use modems of
newer vintage if you experience unexplainable performance problems that may be
due to Xon/Xoff characters.
Most modems produce an interrupt request each time they detect a ring signal.
Some modems generate an interrupt only on the first ring signal they receive. AT&T
DataPort 2001 is an example of such a modem.
The Service Processor uses the ring interrupt request to count the number of rings
when Ring Indicate Power-On (RIPO) is enabled. If your modem produces an
interrupt on only the first ring, set Ring Indicate Power-On to start on the first ring.
Otherwise, you may choose to start Ring Indicate Power-On on any ring count that
makes your operation convenient.
Terminal Emulators
The Service Processor is compatible with simple ASCII terminals, and therefore
compatible with most emulators. It is for the cases when a remote session is
handed off from the Service Processor to the operating system that agreeing terminal
emulators becomes important.
The server's operating system will have some built-in terminal emulators. You may
also have a commercially available terminal emulation. It is important that the local
and host computers select the same or compatible terminal emulators so the key
assignments and responses will match. This will assure successful communications
and control.
For best formatting, choose line wrap in your terminal emulator setup.
Recovery Procedures
Line noises, power surges, etc., can sometimes cause your modem to enter an
undefined state. When it is being used for dial in, dial out or Ring Indicate
Power-On, your modem is initialized each time one of these actions is expected. If
one of these environmental conditions occur after your modem has been initialized, it
may be necessary to recover your modem to a known state.
There are about as many modem command variations as there are modems. The
sample modem configuration files have been written to capture the largest number of
workable modem settings.
The modem command &Dn (where 'n' is a number) generally sets the modem
response to the Data Terminal Ready (DTR) signal from the server's serial port. The
desired response is that the modem will hold a connection while DTR is enabled,
and drop the connection when DTR is released. This is the mechanism by which the
server "hangs up" on a connection under normal conditions.
Usually the command &D2 will work, but not always. The sample modem
configuration files2 take this high percentage position. You should consult your
modem's manual for its specific response scheme for the &Dn command.
There are two methods for dealing with the modem's response to DTR:
1. Recovery
2. Prevention
Before proceeding with one of these strategies, you need to determine if your
server's modem is set up properly to respond to DTR.
Recovery Strategy
The recovery strategy consists of making two calls to establish a remote session.
This is the easiest solution to implement, and allows more freedom for configuring
your server's serial ports.
To set up a remote terminal session, dial into the Service Processor and start the
system. After the operating system is loaded and initialized, the connection will be
dropped. At this point, you call the server back and the operating system will answer
and offer you the login prompt.
2 Only the following sample modem configuration files contain the &Dn command (in three places each):
modem_f.cfg
modem_f0.cfg
modem_f1.cfg
If you are using modem_z.cfg or modem_z0.cfg, you cannot control DTR response. If your remote
terminal does not disconnect after logging off, you must command the remote terminal emulator to hang
up. This then breaks the connection.
The disconnect is caused by the operating system when it initializes the primary
console. The tests listed above are conducted with the remote terminal selected as
the primary console to manifest the modem's response to DTR transitions.
If a local console is not a permanent part of your server, you can still assign either
the unused graphics console or the unused serial port as the primary console. This
gives you the desired seamless connection at your remote terminal.
If you choose to use the unused serial port as the primary console, some
initialization traffic will be sent to any serial device attached to that port. As a result,
that serial device's connection and function could be affected. These impacts may
make that port unattractive for devices other than a temporary local ASCII terminal.
ICDelay 1
DefaultTO 1ð
CallDelay 12ð
# AT Attention Code , Inserts delay in dialing commands
# Z Reset to factory defaults Qð Turn on responses
# Eð Turn echo off Q1 Turn off responses
# Vð Use numeric responses Sð=ð Automatic answer inhibit
# +++ Escape to command mode Sð=2 Answer on second ring
# Hð Hang-up T = Tone mode. When used as T\r, it is a
# no op to maintain program synchronization
# when modem may/will echo the commands.
#
# %N Call-Out phone number %P Paging phone number
# %S Modem speed (available to users)
#
# Following are common responses from a wide range of modems:
# 16, 15, 12, 1ð, 5 and 1 are connection responses. Add others as required.
# 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; ð=OK
#
# PROGRAMMING NOTE: No blanks between double quote marks (").
disconnect:
delay 2 # Separate from previous data.
send "+++" # Assure command mode.
delay 2 # Allow mode switching delay.
send "ATHðT\r" # Set modem switch-hook down
# (i.e., hang up).
ignore "ð\r" or "OK\r" timeout 2 # Ignore modem response.
send "ATEðQ1\r" # Initialize modem: Echo OFF,
# Disable responses.
ignore "ð\r" timeout 1
done
ICDelay 1
DefaultTO 1ð
CallDelay 12ð
# AT Attention Code , Inserts delay in dialing commands
# Zð Reset. Restore Profile ð Qð Turn on responses
# Eð Turn echo off Q1 Turn off responses
# Vð Use numeric responses Sð=ð Automatic answer inhibit
# +++ Escape to command mode Sð=2 Answer on second ring
# Hð Hang-up Xð=ð Limit modem response codes
# T = Tone mode. When used as T\r, it is a
# no op to maintain program synchronization
# when modem may/will echo the commands.
#
# %N Call-Out phone number %P Paging phone number
# %S Modem speed (available to users)
#
# Following are common responses from a wide range of modems:
# 16, 15, 12, 1ð, 5 and 1 are connection responses. Add others as required.
# 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; ð=OK
#
# PROGRAMMING NOTE: No blanks between double quote marks (").
disconnect:
delay 2 # Separate from previous data.
send "+++" # Assure command mode.
delay 2 # Allow mode switching delay.
send "ATHðT\r" # Set modem switch-hook down
# (i.e., hang up).
ignore "ð\r" or "OK\r" timeout 2 # Ignore modem response.
send "ATEðQ1\r" # Initialize modem: Echo OFF,
# Disable responses.
ignore "ð\r" timeout 1
done
ICDelay 1
DefaultTO 1ð
CallDelay 12ð
# AT Attention Code , Inserts delay in dialing commands
# &F Reset to default profile Qð Turn on responses
# Eð Turn echo off Q1 Turn off responses
# Vð Use numeric responses Sð=ð Automatic answer inhibit
# +++ Escape to command mode Sð=2 Answer on second ring
# Hð Hang-up Xð=ð Limit modem response codes
# T = Tone mode. When used as T\r, it is a
# no op to maintain program synchronization
# when modem may/will echo the commands.
#
# &C1 Detect CD &D2 Respond to DTR (often the default)
#
# %N Call-Out phone number %P Paging phone number
# %S Modem speed (available to users)
#
# Following are common responses from a wide range of modems:
# 16, 15, 12, 1ð, 5 and 1 are connection responses. Add others as required.
# 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; ð=OK
#
# PROGRAMMING NOTE: No blanks between double quote marks (").
disconnect:
delay 2 # Separate from previous data.
send "+++" # Assure command mode.
delay 2 # Allow mode switching delay.
send "ATHðT\r" # Set modem switch-hook down
# (i.e., hang up).
ignore "ð\r" or "OK\r" timeout 2 # Ignore modem response.
send "ATEðQ1\r" # Initialize modem: Echo OFF,
# Disable responses.
ignore "ð\r" timeout 1
done
ICDelay 1
DefaultTO 1ð
CallDelay 12ð
# AT Attention Code , Inserts delay in dialing commands
# &Fð Reset. Restore profile ð Qð Turn on responses
# Eð Turn echo off Q1 Turn off responses
# Vð Use numeric responses Sð=ð Automatic answer inhibit
# +++ Escape to command mode Sð=2 Answer on second ring
# Hð Hang-up Xð=ð Limit modem response codes
# T = Tone mode. When used as T\r, it is a
# no op to maintain program synchronization
# when modem may/will echo the commands.
#
# &C1 Detect CD &D2 Respond to DTR (often the default)
# &R1 Ignore RTS (CTS)
#
# %N Call-Out phone number %P Paging phone number
# %S Modem speed (available to users)
#
# Following are common responses from a wide range of modems:
# 16, 15, 12, 1ð, 5 and 1 are connection responses. Add others as required.
# 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; ð=OK
#
# PROGRAMMING NOTE: No blanks between double quote marks (").
disconnect:
delay 2 # Separate from previous data.
send "+++" # Assure command mode.
delay 2 # Allow mode switching delay.
send "ATHðT\r" # Set modem switch-hook down
# (i.e., hang up).
ignore "ð\r" or "OK\r" timeout 2 # Ignore modem response.
send "ATEðQ1\r" # Initialize modem: Echo OFF,
# Disable responses.
ignore "ð\r" timeout 1
done
ICDelay 1
DefaultTO 1ð
CallDelay 12ð
# AT Attention Code , Inserts delay in dialing commands
# &F1 Reset. Restore profile 1 Qð Turn on responses
# Eð Turn echo off Q1 Turn off responses
# Vð Use numeric responses Sð=ð Automatic answer inhibit
# +++ Escape to command mode Sð=2 Answer on second ring
# Hð Hang-up Xð=ð Limit modem response codes
# T = Tone mode. When used as T\r, it is a
# no op to maintain program synchronization
# when modem may/will echo the commands.
#
# &C1 Detect CD &D2 Respond to DTR (often the default)
# &R1 Ignore RTS (CTS)
#
# %N Call-Out phone number %P Paging phone number
# %S Modem speed (available to users)
#
# Following are common responses from a wide range of modems:
# 16, 15, 12, 1ð, 5 and 1 are connection responses. Add others as required.
# 7=busy; 6=no dial tone; 4=error; 3=no carrier; 2=ring; ð=OK
#
# PROGRAMMING NOTE: No blanks between double quote marks (").
disconnect:
delay 2 # Separate from previous data.
send "+++" # Assure command mode.
delay 2 # Allow mode switching delay.
send "ATHðT\r" # Set modem switch-hook down
# (i.e., hang up).
ignore "ð\r" or "OK\r" timeout 2 # Ignore modem response.
send "ATEðQ1\r" # Initialize modem: Echo OFF,
# Disable responses.
ignore "ð\r" timeout 1
done
This section provides a high-level flow of the phases of the Service Processor (SP).
SP Power Applied
│
6
┌─────┐ Pre-Standby Phase
│ 6 │
│ 6
│ Standby Phase SP Menus Available
│ │
│ 6
│ Bring-Up Phase SMS Menus Available
│ │
│ 6
│ Runtime Phase Diagnostic Service Aids Available
│ │
└──────────────────┘
Pre-Standby Phase
This phase is entered when the server is connected to a power source. The server
may or may not be fully powered on. This phase is exited when the Power-On Self
Tests (POSTS) and configurations tasks are completed.
In the Standby phase, the SP takes care of some automatic duties and is available
for menus operation. The SP remains in the standby phase until a power-on request
is detected.
Bring-Up Phase
This phase is entered upon power-on, and exited upon loading of the operating
system.
Runtime Phase
This phase includes the tasks that the SP performs during steady-state execution of
the operating system.
Environmental Monitoring
The SP monitors voltages, temperatures and fan speeds (on some servers).
Responding to System Processor Commands
The SP will respond to any command issued by the system processor.
Run-Time Surveillance (Heartbeat Monitoring)
If the device driver is installed and surveillance enabled, the SP will monitor the
system heartbeat. If the heartbeat times out, the SP places an outgoing call.
This is different from the Bringup Phase scenario where two reboot attempts are
made before placing an outgoing call.
Index X-1
Downloaded from www.Manualslib.com manuals search engine
commands diagnostics, online 4-1
cfgmgr 2-38, 3-92 diagnostics, standalone 4-1
diag -a A-12 dial-out telephone numbers,
lsattr -E -l adapter -a ucode 3-92 setting B-18
send diagnostic A-1 disk
configuration certify, service aid 3-81
complete system 3-92 format, service aid 3-79
invalid SSA loop A-15 formatting, incompatible with the
manager command 3-92 system 2-38
Configuration Verification service disk drive module
aid 3-77 1.12 GB, 2.25 GB, and 4.51
connector GB 3-93
port, on SSA adapter 3-90 broken loop 3-86, 3-88
console mirroring finding the physical location 3-90
enable/disable B-9 Format Disk service aid 3-79
quick disconnect B-32 identification 3-61
system configuration B-32 missing 3-75
continuous failure 2-35 POSTs (power-on self-tests) A-1
covers serial number 3-61
removal and replacement 6-3 strings 3-60
customer administration center B-19 unavailable to the using
system 3-67
diskette drive
D removal and replacement 6-19
data flow 1-8
DRAM (dynamic random-access
data path 3-59, 3-62, 3-86
memory) 3-93
data, vital product 3-93
dynamic random-access memory
definition of FRU names used in the
(DRAM) 3-93
SRN table A-5
device configuration method 3-92
device driver level 3-93 E
devices, handling static–sensitive 6-2 entry point, MAPs 2-37
devices, startup 5-8 EPROM
diag -a command A-12 updates B-33
diagnostic aids EPROM (programmable read-only
adapter POSTs (power-on memory) A-2
self-tests) A-2 error
disk drive module POSTs (power-on microcode and software A-5
self-tests) A-1 unrecoverable 3-83
SRNs (service request
numbers) A-3, A-5
F I
failed
I/O planar
application program 3-83
removal and replacement 6-31
status 3-74
I/O planar locations 1-4
failure
ID, adapter code package 3-92
continuous 2-35
identification, pdisks, hdisks, and disk
link 3-83
drive module 3-61
fans
identify function 3-65, 3-70, 3-90
removal and replacement 6-42
inactive
fast-write cache module 3-94
SSA connector 3-60
finding the physical location of a
index, error code to FRU 3-1
device 3-90
information
Format Disk service aid 3-79
SSA network 3-76
format, location code 3-58
invalid SSA loop configurations A-15
fragility of disk drive modules A-1
FRU
names used in the SRN table A-5 L
used in the SRN table A-5 language selection B-15
Function Select menu 3-65 laser safety information xiii
function, identify 3-65, 3-90 level
device driver 3-93
light
G on connectors 3-60
general access password,
link
changing B-9
active 3-60
general user menus B-4
failure 3-83
glossary of FRU names used in the
Link Verification service aid 3-73
SRN table A-5
loading the system diagnostics 4-1
good status 3-74
location
green light
code format 3-58
on connectors 3-59, 3-60
physical, of a device 3-90
location code
format 3-46
Index X-3
Downloaded from www.Manualslib.com manuals search engine
location codes 3-46, 3-48 menu inactivity B-3
AIX 3-54 menus
physical 3-46 Certify Disk service aid 3-81
ssa 3-58 Configuration Verification service
locations aid 3-77
I/O 1-4 Format Disk service aid 3-79
memory 1-6 Function Select 3-65
system card 1-6 general user B-4
system unit 1-1 Link Verification service aid 3-73
logical units (LUN) 3-61 privileged user B-6
loop service processor B-3
broken 3-86 service processor call-in/call-out
configurations, SSA, invalid A-15 setup B-16
data paths, description of 3-62 service processor call-out policy
links 3-59 setup B-20
SSA, rules 3-61 service processor customer account
loops 3-60 setup B-22
lsattr -E -l adapter -a ucode service processor language
command 3-92 selection B-15
LUN (logical units) 3-61 service processor reboot policy
setup B-23
service processor serial port
M selection B-17
maintenance analysis procedures
service processor serial port speed
(MAPs) 2-1, 2-35
setup B-18
minimum configuration Map 2-17
service processor setup B-8
power MAP 2-12
service processor system
problem determination MAP 2-6
information B-13
maintenance, microcode 3-92
service processor system power
manufacturer and plant code 3-93
control B-12
MAPs (maintenance analysis
service processor telephone
procedures)
setup B-18
starting 2-37
Set Service Mode service aid 3-67
using 2-35
SSA Service Aids 3-65
memory bits 3-26
support menus
memory card locations 1-6
service processor menus B-2
memory hang
service processor service
memory module
aids B-2
removal and replacement 6-29
SMS B-2
Index X-5
Downloaded from www.Manualslib.com manuals search engine
port RAM (random-access memory) 3-93
(P) value 3-90 Random-access memory (RAM) 3-93
connector on SSA adapter 3-90 read system, POST errors B-5, B-14
POS (programmable option read-only memory (ROM) 3-93
select) A-2 Read-only storage (ROS) 3-93
POST error codes 3-1 reboot recovery B-23, B-27
POST errors related publications xv
read B-5, B-14 remote latency B-21
POSTs (power-on self-tests) A-1 remote timeout B-21
power removal and replacement
status 3-74 procedures 6-1
power cables 1-12, 7-7 adapters 6-22
power MAP 2-12 backplane 6-20
power supply battery 6-38
removal and replacement 6-15 CD-ROM drive 6-19
power-on methods B-26 covers 6-3
power-on self-tests (POSTs) A-1, A-2 diskette drive 6-19
preface xv fan 6-42
primary console D-8 I/O planar 6-31
privileged access password, memory module 6-29
changing B-9 operator panel display 6-44
privileged user menus B-6 power supply 6-15
problem determination Processor card 6-35
POSTs, adapter A-2 serial cable 6-46
POSTs, disk drive module A-1 service processor 6-37
SRNs (service request tape drive 6-19
numbers) A-3 reserved status 3-65, 3-74
using service aids 3-84 reset Service Processor B-11
problem determination MAP 2-6 restart recovery B-23, B-27
Processor card RETAIN B-22
removal and replacement 6-35 retries B-21
programmable option select ring indicator power-on B-12
(POS) A-2 ROM (read-only memory) 3-93
programmable read-only memory ROS (read-only storage) 3-92, 3-93
(EPROM) A-2 rules for SSA loops 3-61
progress indicators B-5, B-14
S
R safety
electrical xi
Index X-7
Downloaded from www.Manualslib.com manuals search engine
SSA (serial storage architecture) system programs (continued)
(continued) Sys Config 5-5
service aids (continued) Utilities 5-10
finding the physical location of a system unit locations 1-1
device 3-90
Format Disk 3-79
identify function 3-65
T
table
Link Verification 3-73
service request numbers
Set Service Mode 3-67
(SRNs) A-6
starting 3-65
tape drive
SSA Hot-Swap Disk Drive
removal and replacement 6-19
vital product data (VPD) 3-93
test, memory 3-42
SSA-address (AA) value 3-91
testing the setup
standalone diagnostics 4-1
call-in C-2
start talk mode B-10
call-out C-2
Start Up program 5-8
starting
MAPs 2-37 U
service aids 3-65 unattended start mode,
starting system programs 5-1, 5-24 enable/disable B-12
STBY 2-1, 2-19, 2-20, 2-21, 2-22, unavailable service aids 3-90
2-23, 3-31, E-2 Unplug all power cords (cables) 6-5
strings, disk drive module 3-60 unrecoverable error 3-83
surveillance using
failure B-29 MAPs 2-35
operating system B-29 service aids 3-65
set parameters B-11 SRN table A-3
system firmware B-29 Utilities program 5-10
Sys Config program 5-5
system administrator B-19
system card locations 1-6
V
value, port (P) 3-90
system data flow 1-8
verification
system information menu B-13
configuration, service aid 3-77
system phone number B-20
link, service aid 3-73
system POST errors
serial number 3-80, 3-82
read B-5, B-14
vital product data (VPD) 3-93
system power-on methods B-26
voice phone number B-20
system programs 5-1
VPD (vital product data) B-13
Start Up 5-8
starting 5-1, 5-24
Index X-9
Downloaded from www.Manualslib.com manuals search engine
X-10 Service Guide
&title.
Overall how satisfied are you with the information in this book?
Very Very
Satisfied Satisfied Neutral Dissatisfied Dissatisfied
Overall Satisfaction Ø Ø Ø Ø Ø
How satisfied are you that the information in this book is:
Very Very
Satisfied Satisfied Neutral Dissatisfied Dissatisfied
Accurate Ø Ø Ø Ø Ø
Complete Ø Ø Ø Ø Ø
Easy to find Ø Ø Ø Ø Ø
Easy to understand Ø Ø Ø Ø Ø
Well organized Ø Ø Ø Ø Ø
Applicable to your tasks Ø Ø Ø Ø Ø
When you send comments to us, you grant us a nonexclusive right to use or
distribute your comments in any way we believe appropriate without incurring
any obligation to you.
X-11
Downloaded from www.Manualslib.com manuals search engine
X-12 Service Guide
February 1998
ð7L8343
SA38-ð541-ð1