cc/td/doc/product/core/cis12016
hometocprevnextglossaryfeedbacksearchhelp
PDF

Table of Contents

Troubleshooting the Installation

Troubleshooting the Installation

Your Cisco 12016 Gigabit Switch Router (GSR) went through extensive testing and burn-in before leaving the factory. However, if you encounter problems when starting the router for the first time, use the information in this chapter to help isolate the cause of the problems. Problems with the initial startup are most likely caused by the source power or a system card not properly seated in the backplane.

Although an overtemperature condition is unlikely at initial startup, the environmental monitoring functions are included in this chapter, because they also monitor internal voltages.

This chapter contains the following sections:


Note The procedures in this chapter assume that you are troubleshooting the initial startup of the router, as described in the "Starting the Router and Observing Initial Conditions" section in "Observing System Startup and Performing a Basic Configuration," and that the system is in the original factory configuration. If you have altered the original hardware configuration or changed any default configuration settings, the recommendations in this chapter might not apply.

Troubleshooting Overview

This section describes the methods used in troubleshooting the router. To enable efficient problem solving, the troubleshooting methods are organized according to the major subsystems in the router.

Performing a Hardware Check

If your system failed its initial attempt at starting up, as described in the"Starting the Router and Observing Initial Conditions" section in "Observing System Startup and Performing a Basic Configuration," verify the following conditions:


Note A Flash memory card containing a valid Cisco IOS software image is inserted in PCMCIA slot 0 before the router is shipped. Also, by default, the software configuration register is set to 0x0102, causing the router to boot automatically from the Cisco IOS software image stored on the Flash memory card.

As a reminder, you must format new Flash memory cards before you can use them. For information on formatting a Flash memory card, refer to the "Formatting a Flash Memory Card" section in "Observing System Startup and Performing a Basic Configuration."

When all of these conditions are met, the hardware check is complete, and you should proceed to "Observing System Startup and Performing a Basic Configuration," to perform a basic system configuration, and then to the appropriate companion software configuration documentation to configure the Cisco IOS software, protocols, and individual interfaces in your system.

If the startup sequence again fails before the system software boots successfully, use the procedures in this chapter to isolate and, if possible, resolve the problem.

If you cannot solve a problem on your own, contact a Cisco customer service representative for assistance. When you call, please have the following information ready:

Normal Router Status at Startup

Table 5-1 shows the normal contents of the alphanumeric displays on the GRP and the line cards, as well as the normal LED states on the alarm card, the power modules (AC-input or DC-input), and the blower modules at system startup.


Table 5-1: Status of Alphanumeric Displays and LEDs at System Startup
Component LED Display Contents/LED Status and Meaning

GRP

Alphanumeric displays

Top row: MSTR
Bottom row: RP

The GRP is enabled and recognized by the system; a valid Cisco IOS software image is running.

Line card

Alphanumeric displays

Top row: IOS
Bottom row: RUN

The line card is enabled and ready for use.

Alarm card

Detected alarm severity

Alarm card

CSC 0 and 1

SFC 0, 1, and 2

Critical: Off
Major: Off
Minor: Off

Enabled: On
Fail: Off

Enabled: On
Fail: Off

Enabled: On
Fail: Off

Power module

Power status

PWR OK: On
FAULT: Off
TEMP: Off
ILIM: Off (AC-input power supplies only)

The correct power module voltages are present and no faults have been detected.

Blower module

Blower status

OK: On
FAIL: Off

Problem Solving Using a Subsystem Approach

The key to solving system problems is to try to isolate the problem to a specific subsystem. The first step in solving startup problems is to compare what the router is doing with what it should be doing. Because a startup problem is usually attributable to a single component, it is more efficient to examine each subsystem, rather than to try to troubleshoot each router component.

For purposes of the troubleshooting procedures in this chapter, the Cisco 12016 GSR consists of the following subsystems:

Identifying Startup Problems

Startup problems are commonly caused by a problem with the source power or by a card that is not properly seated in the backplane.

Normal Router Startup Sequence

Each card installed in the system incorporates an MBus module and at least one DC-to-DC converter. The MBus module on each card provides an interface to the GRP by means of the router's MBus; the MBus module on each card also controls its own onboard DC-to-DC converters.

In a normal router startup sequence, the following events and conditions occur:

MBus modules on the fabric cards (CSCs and SFCs) receive power from onboard DC-to-DC converters. MBus modules on the alarm cards also receive power from onboard DC-to-DC converters. The alarm cards also put +5 VDC back onto the chassis backplane, where it is picked up by the MBus modules on other cards, in the blower modules, and on the power bus board, which acts as the physical interface to the power shelf.
As part of the boot process, each MBus module processor reads a set of ID pins on the card. These pins, whose meaning is determined at the time the card is manufactured, identify the kind of card on which the MBus module is mounted.
If the onboard MBus module determines that it is mounted on a line card, the MBus module remains idle for the time being, leaving the card powered down until it receives power-up instructions from the master MBus module on the GRP.
The top row of the display is powered by the DC-to-DC converter on the card; the bottom row of the display is powered by the +5 VDC provided through the backplane.

By checking the status LEDs on the power modules and the alphanumeric displays on the GRP and line cards, you can generally determine when and where the router failed in the startup sequence.

The following sections describe what you should expect to see in the power module LEDs on router startup.

AC-Input Power Supply LEDs

Figure 5-1 shows the location of the LEDs on the power supply faceplate.


Figure 5-1:
AC-Input Power Supply LEDs


When a AC-input power supply is seated in its power shelf bay and connected to an AC power source supplying power within the required range, the green LED labeled PWR OK on the power supply faceplate should be on and the power supply fan should also be on. This is normal behavior.

If either the AC input voltage to the power supply or the -48 VDC output voltage to the backplane falls outside the required range, the green LED labeled PWR OK will go off and the yellow LED labeled FAULT will go on. Because of the redundancy scheme used in both the standard and optional AC-input power subsystem, a problem with the DC output voltage from only one power supply to the backplane should not affect router operation.

If an AC-input power supply detects an internal overtemperature condition, the power supply sends an overtemperature warning to the system across the MBus, then shuts down. The yellow LEDs labeled FAULT and TEMP are on.


Note Because the power module LEDs and the monitoring circuits are redundantly powered by the other power modules in the power shelf, the LEDs will be on to indicate a fault condition even when the power module has shut down.

If an AC-input power supply detects a current limit condition, the power supply sends a fault signal to the system across the MBus. The yellow LED labeled ILIM is on.

When the Cisco 12016 GSR is equipped with the standard AC-input power shelf and three power supplies, a problem with the DC output voltage from two power supplies prevents the router from starting up or continuing normal operation.

When the Cisco 12016 GSR is equipped with the optional AC-input power shelf and four power supplies, a problem with the DC output voltage from three power supplies prevents the router from starting up or continuing normal operation.

DC-Input Power Entry Module LEDs

Figure 5-2 shows the location of the LEDs on the power entry module faceplate.


Figure 5-2:
DC-Input Power Entry Module LEDs


For a DC-input power entry module, normal operating behavior occurs when the following conditions are met:

If either the DC input voltage to the power entry module or the -48 VDC output voltage to the power shelf backplane falls outside the required range, or the power entry module circuit breaker is switched off, the green LED labeled PWR OK will go off and the yellow LED labeled FAULT will go on.


Note Because the power module LEDs and the monitoring circuits are redundantly powered by the other power module in the power shelf load zone power entry module pair, the LED labeled FAULT on the power entry module can be on even when the circuit breaker for that power entry module is switched off or there is no DC-input power source for that power entry module. On a newly installed power entry module, when DC-input source power is available to the power entry module and the circuit breaker is switched on, the LED labeled FAULT should go off and the LED labeled PWR OK should go on.

Because of the 2N redundancy scheme, a problem with the DC output voltage from just one power entry module in a backplane load zone to the backplane should not affect router operation. A problem with the DC output voltage from both power entry modules in a load zone prevents the router from starting up or continuing normal operation.

If a DC-input power entry module detects an internal overtemperature condition, the power entry module sends an overtemperature warning to the system across the MBus, then shuts down. The yellow LEDs labeled FAULT and TEMP are on.

Alarm Card LEDs

Figure 5-3 shows the location of the LEDs on the alarm card faceplate.


Figure 5-3: Alarm Card LEDs


If there are no alarm conditions within the router, the critical, major, and minor alarm LEDs on the alarm card faceplate should be off.

If there are no faults on the alarm card itself, the green LED labeled ENABLED on the alarm card faceplate should be on, and the yellow LED labeled FAIL should be off.

If there are no faults on either CSC 0 or CSC 1, the green LED labeled ENABLED for each CSC should be on, and the yellow LED labeled FAIL for each CSC should be off. If the system detects a CSC fault, it turns off the green LED for the faulty card, turns on the yellow LED, logs a warning message on the system console, and continues operating.

If there are no faults on the SFCs (SFC 0, SFC 1, or SFC 2), the green LED for each SFC should be on, and the yellow LED for each SFC should be off. If the system detects an SFC fault, it turns off the green LED for the faulty card, turns on the yellow LED, logs a warning message on the system console, but continues operating.


Note You can use the ACO/LT switch to verify that the alarm card LEDs are capable of lighting. If no audible alarm is active, pressing the ACO/LT switch temporarily illuminates the LEDs on the alarm card faceplate as a visual check that no alarm card LEDs have failed.

GRP Alphanumeric LEDs Displays

Figure 5-4 shows the location of the alphanumeric LEDs on the GRP faceplate.


Figure 5-4: GRP Alphanumeric LED Displays (Partial Faceplate View)


When the router is powered on, the four-character alphanumeric displays on the GRP indicate the following:

Status messages are displayed as the boot process continues. (See Table 5-2.)

Line Card Alphanumeric LED Displays

Figure 5-5 shows the location of the alphanumeric LEDs on the line card faceplate.


Figure 5-5: Line Card Alphanumeric LED Displays (Partial Faceplate View)


When the line card is powered on, the four-character displays indicate the following:

Status messages appear in the alphanumeric LED displays as the boot process continues on the line card. Table 5-3 lists the messages that appear in the line card alphanumeric LED displays. Some messages are displayed only for a few milliseconds; others are displayed for several seconds.

Troubleshooting the Power Subsystem

The section includes the following subsections:

Troubleshooting the AC-Input Power Subsystem

AC-input power supplies are monitored for internal temperature, voltage, and current load by the MBus controller module on the power bus board in the power interface panel in the back of the chassis and by the MBus module on the GRP. If the router detects an extreme condition, it generates an alarm on the alarm card and logs the appropriate warning messages on the console.

To begin checking the AC-input power subsystem, examine the four LEDs on the power supply faceplate.

When a AC-input power supply is seated in its power shelf bay and connected to an AC power source supplying power within the required range, the green LED labeled PWR OK on the power supply faceplate should be on and the power supply fan should also be on. This is normal behavior.

If either the AC input voltage to the power supply or the -48 VDC output voltage to the backplane falls outside the required range, the green LED labeled PWR OK will go off and the yellow LED labeled FAULT will go on. Because of the redundancy scheme used in the standard and optional AC-input power subsystems, a problem with the DC output voltage from just one power supply to the backplane should not affect router operation.

When the Cisco 12016 GSR is equipped with the standard AC-input power shelf and three power supplies, a problem with the DC output voltage from two power supplies prevents the router from starting up or continuing normal operation.

When the Cisco 12016 GSR is equipped with the optional AC-input power shelf and four power supplies, a problem with the DC output voltage from three power supplies prevents the router from starting up or continuing normal operation.

Continue checking the AC-input power subsystem by assessing the following:

If a spare power supply is available, replace the existing power supply with the spare unit. If the PWR OK LED on the spare unit goes on, return the faulty power supply for replacement.

In a GSR equipped for AC-input operation, the power bus board in the power interface panel inside the GSR chassis is equipped with two user-replaceable fuses. One fuse protects the MBus controller module that monitors the operation of the power subsystem; the other fuse protects the MBus controller module against voltage or current monitoring signals (Vmon and Imon) that exceed the specified voltage threshold.

The voltage and current monitoring signals permit the MBus controller module to monitor the voltage and current from the power supplies in the AC-input power shelf.

You can issue the show environment command at the user EXEC mode prompt on the system console to display operational information from the power bus board MBus controller module.

Because the MBus controller module only monitors the operation of the power shelf and does not control its operation, an open MBus controller module fuse renders the power shelf invisible to the system, but the system continues to operate. No power shelf operational information is displayed in response to a console command.

An open fuse on the Vmon and Imon +5.1 VDC bias line causes the Vmon and IMon signals to go to zero. The power shelf and system continue to operate. These signals are monitored by the MBus controller module.

To replace these fuses, refer to the "Replacing a Power Bus Board Fuse" section in "Maintaining Your Cisco 12016 GSR."

If you are unable to resolve the problem, or if you determine that either the power supply or power cord is faulty, contact your Cisco service representative for assistance.

Troubleshooting the DC-Input Power Subsystem

DC-input power entry modules are monitored for internal temperature, voltage, and current load by the MBus module in the power interface panel in the back of the chassis, and by the MBus module on the GRP. If the router detects an extreme condition, it generates an alarm on the alarm card and logs the appropriate warning messages on the console.

To begin checking the DC-input power subsystem, examine the three LEDs on the power entry module faceplate.

For a DC-input power entry module, normal operating behavior occurs when the following conditions are met:

If either the DC input voltage to the power entry module or the -48 VDC output voltage to the power shelf backplane falls outside the required range, or the power entry module circuit breaker is switched off, the green LED labeled PWR OK will go off and the yellow LED labeled FAULT will go on.


Note The power module LEDs and the monitoring circuits are redundantly powered by the other power module in the power shelf load zone power entry module pair. Therefore, the LED labeled FAULT on the power entry module can be on even when the circuit breaker for that power entry module is switched off or there is no DC-input power source for that power entry module. On a newly installed power entry module, when DC-input source power is available to the power entry module and the circuit breaker is switched on, the LED labeled FAULT should go off and the LED labeled PWR OK should go on.

Because of the 2N redundancy scheme, a problem with the DC output voltage from just one power entry module in a backplane load zone to the backplane should not affect router operation. A problem with the DC output voltage from both power entry modules in a load zone prevents the router from starting up or continuing normal operation.

Continue checking the power subsystem by assessing the following:

If the answer to all three questions is yes, source power is present and within tolerance, and the power entry module is functional.
The MBus modules driving the alphanumeric LED displays on the GRP and line cards are powered by +5 VDC from the backplane. The blower modules use -48 VDC from the backplane. If both the GRP and the blower modules are operating, all internal DC voltages are present.
You can issue the show environment command at the user EXEC mode prompt on the system console to display temperature and voltage information for each installed card, blower module, and power entry module.
Try ejecting and reseating the power entry module.
If the PWR OK LED goes on and the FAULT LED goes off, the open circuit breaker might indicate an electrical problem in the chassis or reverse polarity in the source DC cabling.
Caution The DC-input power entry module contains circuitry to trip the breaker on the power entry module if it detects a reverse polarity condition. No damage should occur due to reverse polarity, but you should correct a reverse polarity condition immediately.
Verify that the source DC (-) and the source DC return (+) cable leads match the (-) and (+) labels on the back panel of the DC-input power shelf to ensure that the cables are connected to the terminal studs in the correct polarity.
If you replaced the cable and the PWR OK LED goes on, the original power cable is faulty.
If a spare power entry module is available, replace the existing power entry module with the spare unit. If the PWR OK LED on the spare unit goes on, the original power entry module is faulty and should be returned for replacement.

In a GSR equipped for DC-input operation, the power bus board in the power interface panel inside the GSR chassis is equipped with a fuse to protect the MBus controller module that monitors the operation of the power subsystem.

You can issue the show environment command at the user EXEC mode prompt on the system console to display operational information from the power bus board MBus controller module.

Because the MBus controller module only monitors the operation of the power shelf and does not control its operation, an open MBus controller module fuse renders the power shelf invisible to the system, but the system continues to operate. No power shelf operational information is displayed in response to a console command.

To replace the power bus board MBus controller module fuse, refer to the "Replacing a Power Bus Board Fuse" section in "Maintaining Your Cisco 12016 GSR."

If you are unable to resolve the problem or if you determine that either the power entry module or power cable is faulty, contact your Cisco service representative for assistance.

Troubleshooting the Power Distribution System

The DC voltage from the power shelf is transferred to the two load zones on the chassis backplane by two redundant power bus bars. (See Figure 5-6.) System load zone 1 on the chassis backplane distributes a nominal DC source of -48 VDC to the upper blower module and the upper card cage. System load zone 2 distributes a nominal DC source of -48 VDC to the switch fabric card cage, the lower card cage, and the lower blower module.

The -48 VDC from the backplane feeds DC-to-DC converters on each card. The MBus module on each card controls the DC-to-DC converters. When directed by the GRP or by MBus software, the MBus module turns on the DC-to-DC converters, which convert the -48 VDC into the voltages required by the card.

DC-to-DC converters on the fabric cards take the -48 VDC from the chassis backplane and produce +5 VDC to power the MBus modules on those cards. DC-to-DC converters on the alarm cards also take the -48 VDC from the chassis backplane and produce +5 VDC to power the MBus modules on those cards, but also put +5 VDC back onto the backplane for the MBus modules on the GRP, the line cards, the power bus board, and in the blower modules.

The -48 VDC power for the two blower modules is transferred from the chassis backplane to the blower modules through two shielded wiring harnesses and floating connectors built into the back of the chassis. An internal controller card in the blower module converts -48 VDC into a variable DC voltage that powers the blower module fans. The MBus module in each blower module uses +5 VDC from the chassis backplane to power the MBus interface circuitry and the temperature sensor.


Figure 5-6:
Cisco 12016 GSR Power Distribution System


Begin troubleshooting the power distribution system by assessing the following:

If the answer to all three questions is yes, source power is present and within tolerance, and the power module is functional.
If both blower modules are on, the -48 VDC from the chassis backplane and the cables from the backplane to the blower modules are all good.
If only one of the blower modules is on, there may be a problem with either the blower module or the -48 VDC power for that blower module. If both blower modules are not on, there may be a problem with the -48 VDC from the chassis backplane.

Note The power bus board, power bus bars, chassis backplane, and blower module cables are not user-replaceable parts. If you suspect that there is a fault in one of these router components, you must contact a certified Cisco service representative for assistance.

Troubleshooting the Processor Subsystem

This section contains the following subsections:

The Cisco 12016 GSR processor subsystem consists of the GRP, the line cards, and the alarm cards.

The GRP and the line cards each have two onboard processors. One processor serves as the card's main processor, and the other serves as the MBus module processor. The MBus module processor monitors the card's environment and controls the card's onboard DC-to-DC converter.

As soon as power is supplied to the system, the MBus module begins to operate. The MBus module determines what type of card it is mounted on, and whether it should turn on the card's DC-to-DC converters. The MBus module on the GRP turns on card power after a brief delay, but the MBus modules on the line card delay turning on power until they receive a command to do so from the GRP MBus module.

To operate at all, a minimally configured Cisco 12016 GSR must have a GRP installed in slot 7 of the upper card cage. If the Cisco 12016 GSR is equipped with an optional, redundant GRP, that GRP must be installed in the far left slot in the lower card cage (slot 8).

While the router cannot operate without a properly installed GRP, it can operate without any line cards installed, as long as no line card is in partial contact with the backplane pins.

A line card that is only partially seated in the backplane sends incomplete signals to the GRP, which can cause the router to halt. First, ensure that the GRP is installed properly and that the system software has initialized successfully. Then, if necessary, you can troubleshoot individual line cards.

A power-on self-test (POST) runs immediately at power-on to determine the condition of the GRP memory. The test results appear in the alphanumeric LED display as either a pass or fail message.

Troubleshooting the GRP

To isolate a problem with the GRP, use the alphanumeric LED displays on the GRP.

The GRP alphanumeric LED displays are located at the bottom of the GRP faceplate, near the card ejector lever. (See Figure 5-7.) To clarify the LED display message format, Figure 5-7 shows the LED displays in the head-up orientation in the upper card cage.


Note When the line cards or an optional, redundant GRP are installed in the lower card cage, they are installed in the head-down orientation, so that the alphanumeric LED displays are near the top of the lower card cage. This card orientation is necessary to minimize the signal paths across the backplane between card connectors in the upper, middle, and lower card cages.

Figure 5-7: GRP Alphanumeric LED Displays (Partial View of Faceplate, Head-up Orientation Shown)


To isolate a problem with the GRP, check the GRP alphanumeric displays:

The two alphanumeric LED displays are powered separately. The upper display receives power from the DC-to-DC converters on the GRP. The lower display is powered directly from the MBus through the chassis backplane. Thus, even if the GRP has not powered up, the bottom display could be on.
As soon as the DC-to-DC converters are turned on by the MBus module, the GRP processor begins the boot process. Status messages are displayed as the boot process continues. (See Table 5-2.)
Some messages appear briefly (millisecond duration), while others last several seconds. If one of the messages appears frozen, the boot process could be halted. Write the displayed message on a piece of paper. Turn off power to the router, then turn the power on again to reset the router and start the boot process. If the router halts again, the GRP could be faulty and need replacement.
Caution The soft reset (NMI) switch is not a mechanism for resetting the GRP and reloading the IOS image. It is intended for software development use. To prevent system problems or loss of data, use the soft reset switch only when instructed by a Cisco-certified service representative.


Table 5-2: GRP Alphanumeric LED Display Messages
LED Message Description

LMEM
TEST

Running low memory test

LCAH
TEST

Initializing lower 15K cache

BSS
INIT

Initializing main memory for ROM

NVRAM
INIT

Initializing NVRAM

EXPT
INIT

Initializing interrupt handlers

TLB
INIT

Initializing TLB

CACH
INIT

Initializing CPU data and instruction cache

CACH
PARY

Enabling CPU cache parity

MEM
INIT

Initializing main memory

NVRAM
SIZE

Detecting the NVRAM size

PCMC
INIT

Initializing the PCMCIA

EXIT
INIT

Exiting the initialization sequence

IOS
UP

Running the Cisco IOS software

Troubleshooting the Line Cards

There are eight numbered card slots in the upper card cage and eight numbered card slots in the lower card cage. The master GRP must be installed in slot 7 of the upper card cage. If the router is equipped with an optional, redundant GRP, that GRP must be installed in the far left slot in the lower card cage (slot 8), leaving available slots for up to 14 line cards in the upper and lower card cages. If the router is not equipped with an optional, redundant GRP, up to seven line cards can be installed in slots 0 through 6 in the upper card cage, and up to eight line cards can be installed in slots 8 through 15 in the lower card cage.

As each line card powers up in response to a command issued by the GRP across the MBus, a power-on self-test (POST) is performed on the line card memory. A full set of field diagnostics can also be run on a line card from the system console. The diagnostics provide either a pass or fail message in the line card alphanumeric LED displays, as well as on the system console. (For information on diagnostic testing, refer to "Running Diagnostics on the Cisco 12016 GSR."

The line card alphanumeric LED displays are located at one end of the line card faceplate, near the card ejector lever. Line cards installed in the upper card cage are installed in the head-up orientation, so the alphanumeric LED displays are located at the bottom of the upper card cage. Line cards installed in the lower card cage are installed in the head-down orientation, so the alphanumeric LED displays are located at the top of the lower card cage. (Figure 5-8 shows the LED displays in the head-up orientation to clarify the LED display message format.)


Note When the line cards or an optional, redundant GRP are installed in the lower card cage, they are installed in the head-down orientation, so that the alphanumeric LED displays are near the top of the lower card cage. This card orientation is necessary to minimize the signal paths across the backplane between card connectors in the upper, middle, and lower card cages.

Figure 5-8: Line Card Alphanumeric LED Displays (Partial View of Faceplate, Head-up Orientation Shown)


To isolate a problem with a line card, check the line card alphanumeric displays:

The two line card alphanumeric LED displays are powered separately. The upper display receives power from the DC-to-DC converters on the line card. The lower display is powered directly from the chassis backplane. Thus, even if the line card has not powered up, the lower display could be on.


Table 5-3: Line Card Alphanumeric LED Display Messages
LED Message Description

MEM
TEST

Running POST memory test

LROM
RUN

After POST memory test

BSS
INIT

Initializing main memory for ROM

RST
SAVE

Saving reset reason register

IO
RST

Resetting the I/O system on the card

EXPT
INIT

Initializing interrupt handlers

TLB
INIT

Initializing TLB

CACH
INIT

Initializing CPU data and instruction cache

MEM
INIT

Initializing main memory

LROM
RDY

Ready to access download

ROMI
GET

Getting ROM images

FABL
WAIT

Waiting for fabric downloader to load

FABL
DNLD

Loading fabric downloader

FABL
STRT

Launching fabric downloader

FABL
RUN

Fabric downloader launch complete

IOS
DNLD

Downloading the Cisco IOS software

IOS
STRT

Launching the Cisco IOS software

IOS
UP

Running the Cisco IOS software

IOS
RUN

Line card enabled

Troubleshooting by Using the Alarm Cards

The Cisco 12016 GSR is equipped with two alarm cards. One card occupies the dedicated far left slot of the upper card cage. A second alarm card occupies the dedicated far right slot of the lower card cage. In both card cages, the alarm card slot differs from the rest of the card cage slots in that it is labeled to identify it as an alarm card slot, is physically narrower, and has a different backplane connector.

The Cisco 12016 GSR alarm card (see Figure 5-9) has three primary functions:


Figure 5-9:
Status LEDs on the Alarm Card


Monitoring Critical, Major, and Minor Alarm Status

The alarm card faceplate is equipped with three pairs of LEDs---labeled Critical, Major, and Minor (see Figure 5-9)---that identify system alarm conditions detected through the MBus.


Note The LEDs are paired for redundancy to protect against a single failed LED. The critical and major alarm LED pairs are red; the minor alarm LED pair is yellow.

Because there are two alarm cards in the Cisco 12016 GSR, a system alarm condition detected through the MBus causes the same LEDs to be illuminated on both alarm cards.

The alarms can warn of an overtemperature condition on a component in one of the card cages, a fan failure in a blower module, an overcurrent condition in a power supply, or an out-of-tolerance voltage on one of the cards in one of the card cages. The LEDs are driven by MBus software, which sets the threshold levels for triggering the different stages of alarms.

The GRP continuously polls the system for temperature, voltage, current, and fan speed values. If an over-threshold value is detected, the GRP sets the appropriate alarm severity level on the alarm card, which lights one of the LED pairs and energizes the appropriate alarm card relays, activating any external audible or visual alarms wired to the alarm card. The GRP also logs a message about the threshold violation on the system console.


Note If one or more of the alarm LEDs is on, check the system console for messages describing the alarm.

Monitoring Alarm Card Status

The alarm card faceplate has one pair of LEDs that indicates the operational status of the alarm card.

A green LED labeled ENABLED indicates that the card has been detected by the system and is okay. A yellow LED labeled FAIL indicates that the system has detected a fault in the alarm card.

If no faults have been detected on the alarm card, the green LED labeled ENABLED on the alarm card faceplate should be on, and the yellow LED labeled FAIL should be off.

Monitoring Switch Fabric Status

If there are no alarm conditions within the router, the critical, major, and minor alarm LEDs on the alarm card faceplate should be off.

If there are no faults on the alarm card itself, the green LED labeled ENABLED on the alarm card faceplate should be on, and the yellow LED labeled FAIL should be off.

If there are no faults on either CSC 0 or CSC 1, the green LED labeled ENABLED for each CSC should be on, and the yellow LED labeled FAIL for each CSC should be off. If the system detects a CSC fault, it turns off the green LED for the faulty card, turns on the yellow LED, logs a warning message on the system console, and continues operating.


Note If the yellow LED labeled FAIL for a CSC or SFC is on, check the system console for messages describing the fault.

If there are no faults on the SFCs (SFC 0, SFC 1, or SFC 2), the green LED for each SFC should be on, and the yellow LED for each SFC should be off. If the system detects an SFC fault, it turns off the green LED for the faulty card, turns on the yellow LED, logs a warning message on the system console, but continues operating.


Note You can use the ACO/LT switch to verify that the alarm card LEDs are capable of lighting. If no audible alarm is active, pressing the ACO/LT switch temporarily illuminates the LEDs on the alarm card faceplate as a visual check that no alarm card LEDs have failed.

Troubleshooting the Cooling Subsystem

The cooling subsystem of the Cisco 12016 GSR consists of two blower modules in the chassis and a fan in each of the power modules in the power shelf. The blower modules and the power module fans circulate air to maintain acceptable operating temperatures within the router.

Troubleshooting the Blower Modules

The Cisco 12016 GSR uses two blower modules to maintain acceptable operating temperatures for the internal components by drawing cooling air through a replaceable air filter into the switch fabric card cage, and then through the upper and lower card cages. The top blower occupies a bay above the upper card cage. The bottom blower module occupies a bay below the lower card cage. Each blower module is fastened in place by two captive screws on its faceplate. Electrical connections (power and control signals) between the blower module and the system are made through a cable that connects to the chassis backplane.

The upper and lower blower modules are identical, so they are interchangeable. The blower module is a sheet metal enclosure containing three blowers, a blower controller card, and two faceplate LEDs. (See Figure 1-17.) Both blower modules have snap-on plastic front covers mounted over the blower module faceplates. Two blower module LEDs are visible through the front covers.

When on, the green LED labeled OK indicates that the blower module is functioning normally.

The blower module operates from -48 VDC distributed to it directly from the chassis backplane. A controller card on the blower module converts -48 VDC into a variable DC voltage that powers the blowers, and uses +5 VDC distributed to it directly from the chassis backplane to power the MBus interface circuitry and the temperature sensor.

The three fans in each blower module are variable-speed fans, and all three fans should be operating any time system power is on. The variable-speed feature allows the fans to operate at a lower speed when the internal chassis temperature is within the normal operating range, and at a higher speed if the internal temperature exceeds a specified temperature.

If the air temperature inside the chassis rises, blower speed increases to provide additional cooling air to the internal components. If the internal air temperature continues to rise beyond the specified threshold, the system environmental monitor shuts down all internal power to prevent equipment damage from excessive heat.

If the system detects that one or more of the blowers in the blower module has failed, it displays a warning message on the system console and displays a blower failure message in the GRP alphanumeric LED displays. In addition, the remaining blowers go to full speed to compensate for the loss of the failed blower.

To isolate a problem with the chassis cooling system, assess the following conditions:

If both blower modules come on, the -48 VDC power and the cables from the chassis backplane to the blower modules are good.
Queued messages:
%ENVM-1-SHUTDOWN: Environmental Monitor initiated shutdown
 
The preceding message could also indicate a faulty component or a faulty temperature sensor. Issue the show environment command or the show environment all command. Either command displays information about the internal system environment, including voltage measurements on each card for the voltages from the DC-to-DC converter, the +5 VDC for the MBus module, and the operating voltage for the blower modules. Each command also displays the temperature measurements made by two sensors on each card (one for inlet air temperature and one for the card's hot-spot temperature), as well as a temperature measurement made by a sensor in each power module.
If an environmental shutdown results from an out-of-tolerance power condition, the LED labeled FAULT on the power module will go on before the system shuts down. In this case, refer to the "Troubleshooting the Power Subsystem" section.
Although an overtemperature condition is unlikely at initial system startup, ensure that heated exhaust air from other equipment in the immediate environment is not entering the chassis card cage vents, and that there is sufficient clearance (at least 2.0 inches, or 5.0 cm) on both sides of the chassis to allow cooling air to enter freely and hot air to be expelled from the chassis.

There are no field-replaceable components in the blower module. If the blower module is faulty, you must replace the entire blower module.

If you are still unable to resolve the problem, contact a Cisco customer service representative for assistance.

Troubleshooting a Power Module Fan

Each power module (AC-input power supply or DC-input power entry module) is equipped with a fan that draws cooler air in the front of the power module and forces warmer out the back of the power shelf.

When a power module is seated in its power shelf bay and connected to a power source supplying power within the required range, the power module fan should be on. As long as the power source is within the required range, the fan should remain on.

If the fan in an power module fails, when the power supply detects an internal overtemperature condition, it sends an overtemperature warning to the system across the MBus, then shuts down. The yellow LEDs labeled FAULT and TEMP are on.

You should also look for console messages indicating that the system has detected a power module fan failure or an overtemperature condition inside the power module. Use the show environment command or the show environment all command to display information about the internal system environment, including power supply fan rotation and temperature measurements within the power module.

There are no field-replaceable components in either the AC-input power supply or the DC-input power entry module. If the power module fan fails, you must replace the power module.

If you are still unable to resolve a problem, contact a service representative for assistance.

Additional Reference Information for Troubleshooting

The following additional reference materials are available for troubleshooting your Cisco 12016 GSR installation:

Cisco documentation and additional literature are available in a CD-ROM package, which ships with your product. The Documentation CD-ROM, a member of the Cisco Connection Family, is updated monthly. Therefore, it might be more current than printed documentation. To order additional copies of the Documentation CD-ROM, contact your local sales representative or call customer service. The CD-ROM package is available as a single package or as an annual subscription. You can also access Cisco documentation on the World Wide Web at http://www.cisco.com, http://www-china.cisco.com, or http://www-europe.cisco.com.


hometocprevnextglossaryfeedbacksearchhelp
Posted: Mon Jun 5 13:48:47 PDT 2000
Copyright 1989 - 2000©Cisco Systems Inc.