cc/td/doc/product/access/sc/r2
hometocprevnextglossaryfeedbacksearchhelp
PDF

Table of Contents

Troubleshooting Your System

Troubleshooting Your System

This chapter describes three procedures for troubleshooting your system:

Also this chapter tells you how to contact Cisco's Technical Assistance Center (TAC) and explains what information you will need to gather before contacting them.

6.1 Before You Begin

The telephony controller provides an interface called MML (based on Bellcore TL1) that allows you to retrieve and clear alarms. You should be familiar with MML commands before attempting these procedures. See "MML Commands," for more information about MML commands.

6.1.1 Starting MML

To start an MML session:

Step 1 Log in to the telephony controller and change to any subdirectory under /opt/TransPath (for example, /opt/Transpath/bin).

Step 2 Enter the following command:

mml

If you receive an error message that sessions are already in use, enter the following command:

mml -s session_number

Use any number from 2 to 12 and repeat until you find a vacant session. After successfully entering MML, the prompt changes to:

    mml>
     
    

6.2 Retrieving and Clearing Alarms

The telephony controller generates alarms that let you know if there are problems with your system. You are notified of alarms in one or more ways:

Although most alarms automatically clear after a problem is corrected, you need to retrieve and identify alarms on the telephony controller. You will also need to clear certain alarms.

The telephony controller stores alarm information in several places. Table 6-1 lists the telephony controller software's alarm information and descriptions.


Table 6-1: Telephony Controller Alarm Information
Information Type Location Description

MML alarm messages

Retrieved through MML session.

Retrieve, acknowledge, and clear these alarms in MML. This chapter discusses these alarms.

Alarm log files

  • Current---/opt/TransPath
    /var/log
    directory

  • Historical---/opt/TransPath
    /var/spool
    directory.

Current logs are ones in progress. After they are closed, they are renamed and become historical logs.

These logs contain a record of the same alarm messages retrieved with MML, but in a different format. In addition, alarms for timer expirations are found in these log files, but cannot be accessed with MML. Use these log files for viewing files and manipulating data (such as for creating statistical reports).

See "Retrieving Call Detail Records and Network Measurements," for more information.

alarmCats.dat file

/opt/TransPath/etc

Contains a list of all possible alarms for your version of the telephony controller software. See the "alarmCats.datAlarm Categories" section for more information on this file.


Note In a dual telephony controller configuration, only the active telephony controller interfaces with the ARU. If the ARU fails to receive a message or receives a critical alarm, it causes failover. Alarm severity and thresholds can be customized to match your existing severity level definitions. You can modify these thresholds.

6.2.1 Retrieving All Alarms

To retrieve all alarms, perform the following steps:

Step 1 Log in to the telephony controller and start an MML session. (See the "Starting MML" section for more information.)

Step 2 Enter the following MML command:

RTRV-ALMS

The following example shows there are seven system alarms:

    TransPath: Train 1998-10-19 11:26:39
    M RTRV
    "LS-109:ALM=\"FAIL\",SEV+MN"
    "DC-109-0:ALM=\"SC CONFIG FAIL\",SEV=MJ"
    "SC-109-1:ALM=\"SC CONFIG FAIL\",SEV=MJ"
    "DC-109-23:ALM=\"SC CONFIG FAIL\",SEV=MJ"
    "SC1-NAS1:ALM=\"SC FAIL\",SEV=MJ"
    "PC-229-100-101-2:ALM=\"PC UNAVAIL\",SEV=MJ"
    "SP!-IP:ALM=\"FAIL\",SEV=MN"
     
    

See the "Interpreting Alarm Messages" section for information on how to interpret these alarms.

6.2.1.1 Retrieving Alarms Continuously

If you retrieve alarms as they occur, this can help you determine how often a particular alarm is generated in a specific period of time. To retrieve alarms continuously, log in to the telephony controller and start an MML session. Enter the following command:

RTRV-ALMS::CONT

Alarms and autonomous messages are then continuously reported through the MML interface for the duration of the command.

6.2.2 Interpreting Alarm Messages

Figure 6-1 shows an example of an MML alarm response. The following subsections describe each of the message components.


Figure 6-1: MML Command Line

6.2.2.1 Component ID

This field identifies the system component that generated the alarm. The initial letters refer to the type of component, and the subsequent numbers refer to the timeslot for that element.


Note The components.dat file located in the /opt/TransPath/etc contains the names of all of your system components.

The numbers after the initial letters identify the number of the component or the timeslot of the component. Table 6-2 shows the component descriptions for the alarm messages retrieved in the previous example, on page 6-3.


Table 6-2: Component ID and Description
Component ID Component Description

LS-109:

SS7 link 109

DC-109-0:

Signal channel 109
Time slot 0

SC-109-1:

Telephony controller 109
Time slot 1

DC-109-23:

Signal channel 109
Time slot 23

SC1-NAS1:

Line 1 between the telephony controller and the network access server.

PC-229-100-101-2

Point code with the point code ID of 229-100-101-2


Note Alarms that have time slot information are considered Primary Rate Interface (PRI) channel alarms. In the example above, the second, third, and fourth alarms are PRI alarms.

6.2.2.2 Alarm Category

The ALM identifier means that the messages listed are alarms. The entry after ALM= refers to the category of alarm. From the list retrieved in the previous example, we can identify the following categories of alarms.


Table 6-3: Component ID and Description of Alarm Generated
Component ID Component description

"LS-109:ALM=\"FAIL\",

SS7 link 109 failure

"DC-109-0:ALM=\"SC CONFIG FAIL\",

Signal channel 109
Slot 0
Signal channel configuration failure

"SC-109-1:ALM=\"SC CONFIG FAIL\",

Telephony controller 109
Slot 1
Signal channel configuration failure

"DC-109-23:ALM=\"SC CONFIG FAIL\",

Signal channel 109
Slot 23
Signal channel configuration failure

"SC1-NAS1:ALM=\"SC FAIL\",

Signal channel 1 failure on network access server 1

"PC-229-100-101-2:ALM=\"PC UNAVAIL\",

Point code with ID 229-100-101-2 is unavailable

"SP!-IP:ALM=\"FAIL\",

Interface process failure on the signal processor

6.2.2.3 Severity Level

This field provides the alarm severity level. Table 6-4 shows the four levels of alarm severity and the actions you need to take.


Table 6-4: Alarm Severity and Resolution Actions
Alarm Code Description Required Action

MJ

Major

Major alarms disrupt service and must be cleared immediately.

Locate the alarm within Table 6-5 or Table 6-6 in this chapter.

Each of the following alarm sections contains troubleshooting flowcharts that you can use to clear the alarm.

MN

Minor

Minor alarms should be noted and cleared as soon as possible. Locate the alarm within Table 6-5 or Table 6-6 in this chapter.

Each of the following alarm sections contain troubleshooting flowcharts that you can use to clear the alarm.

You may also want to research how often this alarm is appearing, because it may be an indicator of a bigger problem.

CR

Critical

Critical alarms should be cleared as soon as possible, because they can affect service.

Locate the alarm within Table 6-5 or Table 6-6 in this chapter.

I

Information

Information alarms report activities by network elements that do not currently affect service, because these elements are not currently critical. However, if these network elements become critical, the severity may increase to major.

You may want to track how often you receive certain information alarms. If you receive the same alarm frequently, research the alarm, because it may be an indicator of a bigger problem.

6.2.3 Acknowledging Alarms

If you acknowledge the last alarm of a certain severity (for example, the last Support Fail alarm on channel T-1-16), then the alarm relay associated with the Support Fail alarm relay is turned off. This action also turns off all related alarms in the central office. (In this case, all the Support Fail alarms would be turned off.)

Acknowledging an alarm removes it from an internal processes list maintained by the telephony controller, but does not clear the alarm. It will still be visible if a RTRV-ALM command is used.

To acknowledge an alarm, perform the following steps:

Step 1 Log in to the telephony controller and start an MML session. (See the "Starting MML" section for more information.)

Step 2 Enter the following MML command:

ACK-ALM:componentId:"alarmCategory"

For example, to acknowledge a Support Fail alarm on channel T-1-16, enter the following command:

ACK-ALM:T-1-16:"SUPPORT FAILED"

The system returns a message similar to the following:

    CONFIG01/REL1 97-08-28 18:00:32
    M COMPLD
    ;
     
    

This response displays the software version information (Configuration 01 of Release 1); the date and time (August 28, 1997, 32 seconds after 6:00 p.m.); and that the action was completed (the telephony controller processed the Support Fail alarm acknowledgment for channel T-1-16).

6.2.4 Clearing Alarms

The telephony controller produces platform alarms and signaling alarms. The following subsections describe these types of alarms and how to manage them.

6.2.4.1 Clearing Platform Alarms

Platform alarms are associated with telephony controller performance. This section provides information about the platform alarms with alarm causes and recovery actions.

Platform alarms have three classifications of severity.

Table 6-5 lists the platform alarms, alarm causes, recovery actions, and the page number for a flowchart illustrating how to resolve the alarm.


Note Table 6-5 does not list every system alarm-only the most common. For a complete listing of alarms, see the alarmCats.dat file in the /opt/TransPath/etc directory of your system. For information on interpreting the alarmCats.dat fields, see the "alarmCats.datAlarm Categories" section.

Table 6-5: Platform Alarms
Alarm Category and Description Alarm Severity Level Alarm Cause Recovery Action Page Number

DISKLow Disk Space

Major

The system disk is running out of memory.

The event that triggers this alarm can cause a process to exit. If a process exits, the process manager will sense this, and can post additional error alarms that might help you solve the problem. The process manager keeps a list of those processes that are required for full operation of the telephony controller.

6-9

FAILOVER PEER_OOSFailover Peer Out-of-Service

Minor

The failover standby system is Out-of-Service.

Verify that the failover standby system is up and foverd and frepld are running. Check syslog.Platform files for errors.

6-14

GEN FAIL, General Process Failure

Information

Failures caused by resource exhaustion:

  • Memory exhaustion

  • Queue overflow

  • Message congestion

The event that triggers this alarm can cause a process to exit. If a process exits, the process manager will sense this, and can post additional error alarms that might help you solve the problem. Requires system administrator intervention; can require system reboot.

6-11

Major

Failures caused by configuration problems:

  • IPC file cannot be opened

  • Some timer expirations (for example, IPC message is received after timer expiration).

The event that triggers this alarm can cause a process to exit. If a process exits, the process manager will sense this, and can post additional error alarms that might help you solve the problem.

6-11

I/O CARDI/O Card Failure

Major

The E1 I/O card has failed operations.

Verify that the card is properly installed and that all associated support software is resident and configured correctly.

6-12

LOGLog File Open or Error

Information

A log file is either open or in error.

The event that triggers this alarm can cause a process to exit. If a process exits, the process manager will sense this, and can post additional error alarms that might help you solve the problem.

6-13

OS FAILOperating System Failure

Information

Something in the system OS has stopped functioning.

Requires system administrator intervention. May require a system reboot.

6-14

PC UNAVAILPoint Code Unavailable

Major

The Point Code requested is unavailable.

Check links and configuration of destination point code. If configuration has changed, restart system. Identify and correct any hardware problems.

6-15

RSET CONFIG FAILRoute Set Configuration Failure

Minor

The Route Set Configuration has failed.

If configuration has changed, check for configuration errors and reload the configuration files to the telephony controller.

6-16

SC CONFIG FAILSignal Channel Configuration Failure

Information

Configuration file is corrupted and cannot be read by system.

The event that triggers this alarm can cause a process to exit. If a process exits, the process manager will sense this, and can post additional error alarms that might help you solve the problem. The process manager keeps a list of those processes that are required for full operation of the telephony controller.

6-17

SW FAIL, Software Failure

Information

Failure caused by software logic problems, such as:

  • Unknown message received

  • Process in undesirable state

  • Unexpected logic being executed (for example, conditional code that should never be executed is being executed)

  • Some timer expirations

  • Alarm Report---
    Informational

The event that triggers this alarm can cause a process to exit. If a process exits, the process manager will sense this, and can post additional error alarms that might help you solve the problem.

6-11

XE RSRL FAIL---Execution Environment Failure

6-11


Figure 6-2:
DISK---Low Disk Space



Figure 6-3:
FAILOVER PEER_OOS---Failover Peer Out-of-Service



Figure 6-4:
GEN FAIL, SW FAIL, and XE RSRL FAIL



Figure 6-5:
I/O CARD---I/O Card Failure



Figure 6-6:
LOG---Log File Open or Error



Figure 6-7:
OS FAIL---Operating System Failure



Figure 6-8:
PC UNAVAIL---Point Code Unavailable



Figure 6-9:
RSET CONFIG FAIL---Route Set Configuration Failure



Figure 6-10:
SC CONFIG FAIL---Signal Channel Configuration Failure


6.2.4.2 Clearing Signaling Alarms

The telephony controller software generates signaling alarms if it has problems transporting data on a signal channel. Threshold Crossing Alerts (TCAs) are caused when the number of errors exceeds the number of errors allowed in a specified time period or the threshold value. The threshold value is set in the thresholds.dat file in /opt/TransPath/etc.

Signaling alarms have three classifications of severity:

Table 6-6 lists the platform level alarms, alarm causes, recovery actions, and the page number for a flowchart illustrating how to resolve the alarm.


Table 6-6: Signaling Alarms
Alarm Category and Description Alarm Severity Level Alarm Cause Recovery Action Page Number

C7LNK ALGNMT LOSTLink Alignment Lost

Minor

The C7 link is experiencing a loss of alignment.

Check link and line status; check configuration data. If configuration has changed, use the configuration tool to correct the configuration and then redeploy files to the telephony controller.

6-31

CHAN BAD TOT 15, Signal Channel TCA Total Frame Errors 15 Minutes

Information

The number of bad High Level Data Link Control (HDLC) frames received on this channel (DS-0 time slot) has exceeded the threshold value set in the thresholds.dat table for the last 15-minute period. The channel remains in an In-Service state. This alarm is informational only, and is sent to the alarm manager as an autonomous alarm.

If problem persists in the next consecutive 15-minute intervals, remove the signal channel from service and investigate the conditioning equipment.

6-22

CHAN BAD TOT 60, Channel TCA Total Frame Errors 60 Minutes

Information

The number of bad HDLC frames received on this channel (DS-0 time slot) has exceeded the threshold value set in the thresholds.dat table for the last 60-minute period. The channel remains in an In-Service state. This alarm is informational only, and does not require an immediate response; however, the condition should be monitored closely to ensure that the service level does not degrade further.

If this problem persists in the next consecutive 60-minute interval, remove the signal channel from service and investigate the conditioning equipment.

6-22

CHAN BAD TOT 24, Channel TCA Total Frame Errors 24 Hours

Information

The number of bad HDLC frames received on this channel (DS-0 time slot) has exceeded the threshold value set in the thresholds.dat table for the last 24-hour period. The channel remains in an In-Service state. This alarm is informational only, and does not require an immediate response; however, the condition should be monitored closely to ensure that the service level does not degrade further.

If this problem persists in the next 24-hour interval, remove the signal channel from service and investigate the conditioning equipment.

6-22

DHAN LINK ESTAB 15, Channel TCA Link Established 15 Minutes

Information

The number of resets on this channel (DS-0) has exceeded the threshold value set in the thresholds.dat table for the last 15-minute period. The channel remains in an In-Service state. This alarm is informational only, and does not require an immediate response; however, the condition should be monitored closely to ensure that the service level does not degrade further.

If this problem persists in the next 15-minute interval, remove the signal channel from service and investigate the network support equipment or distant end switching system.

6-22

DHAN LINK ESTAB 60, Channel TCA Link Established 60 Minutes

Information

The number of resets on this channel (DS-0) has exceeded the threshold value set in the thresholds.dat table for the last 60-minute period. The channel remains in an In-Service state. This alarm is informational only, and does not require an immediate response; however, the condition should be monitored closely to ensure that the service level does not degrade further.

If this problem persists in the next 60-minute intervals, remove the signal channel from service and investigate the network support equipment or distant end switching system.

6-22

DHAN LINK ESTAB 24, Channel TCA Link Established 24 Hours

Information

The number of resets on this channel (DS-0) has exceeded the threshold value set in the thresholds.dat table for the last 24-hour period. The channel remains in an In-Service state. This alarm is informational only, and does not require an immediate response; however, the condition should be monitored closely to ensure that service levels do not degrade further.

If this problem persists in the next 24-hour interval, remove the signal channel from service and investigate the network support equipment or distant end switching system.

6-22

LIF FAILLine Interface Failure

Major

The identified T1/E1 is not working.

Check the T1/E1 network facilities for loss of T1/E1 source.

Other alarms may be generated during this outage as part of a T1/E1 failure.

6-23

LIF LOFLine Interface Loss of Frame

Major

A loss of T1/E1 framing has been detected on the LIF.

Take appropriate action to restore the end-to-end carrier facilities to an operational state.

6-24

LIF LOSLine Interface Loss of Signal

Major

The transmitted signal is lost in the T1/E1. The receiving end does not receive the signal.

Take appropriate action to restore end-to-end carrier facilities.

6-25

LIF SESLine Interface Severely Errored Seconds

Major

The line interface is exceeding the established severely errored seconds threshold.

Check end-to-end transmission quality. Stop and restart I/O channel controllers. Set line In-Service and recheck alarms.

6-26

LIF SES 15, Line TCA Session 15 Minutes

Information

The severely errored seconds measurement counter has exceeded the threshold value set in the thresholds.dat table for the last 15-minute period. The channel remains in an In-Service state. This alarm is informational and does not require an immediate response; however, the condition should be monitored closely to ensure that the service level does not degrade further.

If this problem persists in the next consecutive 15-minute intervals, remove the signal channel from service and investigate the conditioning equipment.

6-27

LIF SES 60, Line TCA Session 60 Minutes

Information

The severely errored seconds measurement counter has exceeded the threshold value set in the thresholds.dat table for the last 60-minute period. The channel remains in an In-Service state. This alarm is informational only and does not require an immediate response; however, the condition should be monitored closely to ensure that the service level does not degrade further.

If this problem persists in the next consecutive 15-minute or 60-minute intervals, remove the signal channel from service and investigate the conditioning equipment. After the T1/E1 connectivity has been checked, restore the channel to an In-Service state and monitor the measurement statistics for recurring errors.

6-27

LIF SES 24, Line TCA Session 24 Hours

Information

The severely errored seconds measurement counter has exceeded the threshold value set in the thresholds.dat table for the last 24-hour period. The channel remains in an In-Service state. This alarm is informational only and does not require an immediate response; however, the condition should be monitored closely to ensure that the service level does not degrade further.

If this problem persists in the next 24-hour interval, remove the signal channel from service and investigate the conditioning equipment. After checking the T1/E1 connectivity, restore the channel to an In-Service state and continue to monitor the measurement statistics for recurring errors on the error signal channel.

6-27

LIF YELLOWLine Interface Yellow Condition

Minor

The receiving end is reporting a loss of the transmitted signal.

Advise the transmitting end of the error condition and take appropriate steps to return the transmitting facilities to an operational state.

6-28

SC FAILSignal Channel Failure

Major

The signaling channel is down and unable to process traffic. The channel is failing to negotiate a D channel session. Automatic restarts are not able to recover the session.

Attempt to manually reset the channel, using the set-sc-state MML command to stop and restart the channel. Investigate distant end switching system's signal channel status and coordinate with them to return the channel to In-Service.

6-29

SC M-OOSSignal Channel Manually Out-of-Service

Minor

The signal channel has been manually taken Out-of-Service by an operator on the platform.

None; confirm the disposition with site operations. Use the set-sc-state: is MML command to return the channel back to In-Service.

6-30

SOFTW NONNonrequired Process Failure

Minor

A nonrequired process is experiencing a process failure.

Identify process that is not running. Check platform system logs to determine cause of failure.

6-32

SOFTW REQRequired Process Failure

Major

A required process is exceeding its process restart time. The process cannot be restarted by the process manager for some reason.

Identify process that is not running. Check platform system logs to determine cause of failure.

6-33

SUPPORT FAILSupport Entity Failed

Minor

The signal channel is down due to a support element failure.

Investigate the associated alarms. Check the network transport facilities.

6-34


Figure 6-11:
CHAN BAD TOT, CHAN LINK ESTAB



Figure 6-12:
LIF FAIL---Line Interface Failure



Figure 6-13:
LIF LOF---Line Interface Loss of Frame



Figure 6-14:
LIF LOS---Line Interface Loss of Signal



Figure 6-15:
LIF SES---Line Interface Severely Errored Seconds



Figure 6-16:
LIF SES Errors---Line Interface Severely Errored Seconds



Figure 6-17:
LIF YELLOW---Line Interface Yellow Condition



Figure 6-18:
SC FAIL---Signal Channel Failure



Figure 6-19:
SC M-OOS---Signal Channel Manually Out-of-Service



Figure 6-20:
C7LNK ALGNMT LOST---Link Alignment Lost



Figure 6-21:
SOFTW NON---Nonrequired Process Failure



Figure 6-22:
SOFTW REQ---Required Process Failure



Figure 6-23:
SUPPORT FAIL---Support Entity Failed


6.3 Retrieving System Log Files

The telephony controller software creates system log files as shown in Table 6-7 and stores them in the /opt/TransPath/var/log directory.


Table 6-7: Log File Descriptions
Log File Type Default Log File Name Description

Platform

SyslogPlatform.number

Contains debug logs created by the telephony controller software processes.

Application

Records application startup errors.

Command/Response

Records MML commands that have been run on the system.

Alarm

alm.number

Contains output of alarms.

Measurement

meas.number

Records system measurement data.

Call Detail Record

cdr.number

Stores information on calls processed.

The platform log files are used for debugging or troubleshooting your system.

These logs are stored by the telephony controller software log rotation utility. This utility is automatically loaded during installation of the telephony controller software and runs daily at 4:05 a.m.


Note Automatic execution of the log rotation utility script (log_rotate.sh) can be modified by editing the log_rotate.sh entry in /var/spool/cron/crontabs/root. You must log in as root to modify this file.

The log rotation utility maintains one current copy and seven historical copies of the system logs. The youngest historic file is logName.0 and the oldest is logName.7.

Each execution of the program deletes the seventh historical log and increments all other files by one to make room for the newest file. For example, logName.6 becomes logName.7, then logName.5 becomes logName.6 and so on. Finally, the program changes the current log to logName.1, creates a new logName.0 file, and begins logging in that file.

Table 6-8 shows the log numbering system.


Table 6-8: Rotation Log Numbering System
Day of Capture Name Assigned

Current (Yesterday)

logName.0

Current -1

logName.1

Current -2

logName.2

Current -3

logName.3

Current -4

logName.4

Current -5

logName.5

Current -6

logName.6

Oldest, Current -7

logName.7

6.3.1 Retrieving a Log

To retrieve a log, perform the following steps:

Step 1 Log in to the telephony controller and change to the /opt/TransPath/var/log directory.

Step 2 Enter the ls command to list the available logs.

The system displays a list of the current and the historical logs. (Table 6-8 shows the log naming system.)

Step 3 To view a log, enter the following command:

cat logFileName.logNumber | more

For example, to view the platform log from 5 days before yesterday, enter the following command:

cat SyslogPlatform.5 | more

The system returns a response similar to the following:

    Mar  2 04:05:09 va-blue : | AlarmManager (PID 29974) {Platform} <Debug> wrote sU
    Mar  2 04:05:09 va-blue : | AlarmManager (PID 29974) {Platform} <Info> #0 Operay
    Mar  2 04:05:10 va-blue : | MeasurementManager (PID 29977) {Platform} <Debug> m3
    Mar  2 04:05:10 va-blue : | MeasurementManager (PID 29977) {Platform} <Debug> m3
    Mar  2 04:05:21 va-blue : | ProcessManager (PID 29973) {Platform} <Debug> Creat1
    Mar  2 04:05:21 va-blue : | ProcessManager (PID 29973) {Platform} <Debug> Sendi1
    Mar  2 04:05:21 va-blue : | ProcessManager (PID 29973) {Platform} <Debug> writi5
    Mar  2 04:05:21 va-blue : | ConfigManager (PID 29975) {Platform} <Debug> readin6
    Mar  2 04:05:21 va-blue : | ConfigManager (PID 29975) {Platform} <Debug> read i5
    Mar  2 04:05:21 va-blue : | ConfigManager (PID 29975) {Platform} <Debug> procEv.
    Mar  2 04:05:21 va-blue : | ConfigManager (PID 29975) {Platform} <Debug> writin7
    --More--
    

6.3.2 Changing the Log Level for Processes

The dumper can potentially generate a large number of files to the logSpoolDir area. For example, if the maxTime value were set to 15 minutes, over 200 files would be created in the logSpoolDir daily.

Therefore, you may want to limit the number of logs being created. One method of limiting the logs is to change the logging level or threshold value of the processes you want to limit. The CHG-LOG command sets the logging level for active processes only. This command can be invoked for individual active processes or for all processes. When the ALL parameter is used, a message appears that states that the log level did not change for some processes. This is because the changes are not applied to monitored and passive processes.

See the "CHG-LOGChange Log" section for a more detailed explanation of the elements of this process.

Caution Running in debug mode causes the Syslog.Platform files to become very large. These files are frequently deleted by the automatic system process. Cisco does not recommend logging to another destination (such as Platform_1.log) while running in debug mode, because the telephony controller will not automatically delete these files. The hard disk will become full and cause system crashes.

To change the log level of a single process, perform the following steps:

Step 1 Log in to the telephony controller and change to any subdirectory under /opt/TransPath (for example, /opt/Transpath/bin).

Step 2 Start an MML session as described in the "Starting MML" section.

Step 3 Enter the following MML command:

CHG-LOG:processName:logLevel

For example, to change the log level of the engine, enter the following command:

CHG-LOG:ENG-01:INF

The system returns a message similar to the following:

     CONFIG01/BUILD 13.6 97-05-15 16:35:29
    M COMPLD
     ;
     
    

See the"CHG-LOGChange Log" section for valid log levels and their meanings.

To change the log level of a all processes, perform the following steps:

Step 1 Log in to the telephony controller and start MML.

Step 2 Enter the following MML command:

CHG-LOG:ALL:logLevel

For example, to change the log level of all processes to "warning," enter the following command:

CHG-LOG:ALL:WRN

The system returns a message similar to the following:

     CONFIG01/BUILD 13.6 97-05-15 16:37:43
    M COMPLD
     SNSP
     "DSKM-01:Operation not supported"
     /* Status, Operation Not Supported By Component */
     SNSP
     "IOCC-01:Operation not supported"
     /* Status, Operation Not Supported By Component */
     SNSP
     "IOCC-02:Operation not supported"
     /* Status, Operation Not Supported By Component */
     ;
     
    

This response indicates that the DSKM-01, IOCC_01, and IOCC-02 processes will not allow their log levels to be changed. See the"CHG-LOGChange Log" section for valid log levels and their meanings.

6.4 Performing a Call Trace

A call trace records information in a trace file that shows how the telephony controller processes a call. If you are experiencing problems with call processing and need to contact Cisco for support, you should run a call trace before contacting Cisco's Technical Assistance Center (TAC). You probably won't be able to interpret the trace file, but the Cisco TAC can and this will help them troubleshoot the problem more effectively. For some problems, the TAC will not begin troubleshooting until they receive the trace file, so it is a good practice to create this file before contacting the Cisco TAC.

The following steps present an overview of how to create a trace file:

Step 1 Set the telephony controller software log to debug level.

Step 2 Use MML to run the trace and create a log file.

Step 3 Run the log file through the conversion analyzer and simulator programs to create a SIM file.

The following sections describe how to complete each of the above steps.

6.4.1 Setting the Debug Level

Step 1 Log in to the telephony controller and change to the /opt/TransPath/etc directory.

Step 2 Use a text editor to make the following changes to the XECfgParm.dat file:

Step 3 Confirm that the logFileFmt parameter is set to ../var/platform_%s.log.

Caution Running in debug mode causes the Syslog.Platform files to become very large. These files are frequently deleted by the automatic system process. Cisco does not recommend logging to another destination (such as Platform_1.log) while running in debug mode, because the telephony controller will not automatically delete these files. The hard disk will become full and cause system crashes. If you use a log other than Syslog.Platform, you need to monitor the size of the files and delete them if the disk is becoming full.

6.4.2 Running the Trace

Step 1 Log in to the telephony controller as a user in the transpath group and change to any subdirectory under /opt/TransPath (for example, /opt/TransPath/bin).

Step 2 Start an MML session as described in the "Starting MML" section.

Step 3 Enter the following MML command:

STA-SC-TRC:sc:log="filename", prd=600

The sc parameter identifies the originating signal channel. For example, if this is an SS7 signaling path, enter the point code. If this is an ISDN connection, enter the name of the IP signal path.

The filename parameter specifies the file to which the trace is captured. If you do not specify a filename, the telephony controller software creates a file using the same name as the signal channel with a .log extension. Ensure that this file does not already exist; if it does, the trace will not run correctly and you will receive an error message.

The prd parameter specifies the trace period in seconds. If you do not specify a time limit, the software continues the trace until an 8 KB boundary is reached or until you manually stop the trace. For example, if you entered STA-SC-TRC: sc:log="filename", prd=600, the telephony controller starts a trace on the signal channel specified, saves the trace to the filename specified, and collects data for 600 seconds (10 minutes).

Step 4 Make the call you want to trace.

Step 5 End the call, and enter the following command to end the trace:

STP-SC-TRC:ALL

Step 6 Quit MML.

6.4.3 Creating the SIM file

Step 1 Change to the opt/TransPath/var/trace directory. This directory contains the filename you specified when you ran the trace.

Step 2 Enter the following command:

ca -f "filename"

You may want to add these parameters:

Step 3 Look in the header to locate the call identification number. You need this number to create the SIM file.

Step 4 Change to the/opt/TransPath/lib directory.

Step 5 Enter the following command:

sim -f /opt/TransPath/var/trace/"filename" -y callIdentificationNumber -o "orig.mdo" -t "term.mdo" [-m cc.mdo -l lcm.mdo] -d /opt/TransPath/var/trace/filename.sim

The callIdentificationNumber is the number in the trace header. The first filename parameter specifies the original trace filename; the second filename parameter is the name of the new SIM file you are creating.

The orig.mdo parameter defines the variant for the originating protocol (for example, ANSISS7.mdo). The term.mdo parameter defines the variant for the terminating protocol (for example, Bell_1268_C3.mdo).

Step 6 Compress the SIM file using the UNIX tar command or zip the file and forward to the TAC, who will use it to troubleshoot your system.

6.5 Contacting the TAC

The focal point of all Cisco software and hardware maintenance and support services is the Cisco Technical Assistance Center (TAC). The TAC is staffed by senior customer engineers who have experience with the Cisco product line and all aspects of data communications networking technology. The TAC provides worldwide technical support.

A problem report or user question begins with a call or electronic mail into the TAC where it is logged, given a case number, and assigned to a customer engineer. The customer engineer works with the customer to attempt to answer the question, give advice on system use, help with system configuration, or correct a system malfunction.

This procedure ensures that software updates, modifications, problems, and their solutions are adequately controlled and monitored. The result is a central repository for data related to a problem. This database provides information on tracking, classification, and resolution. Problems are handled in the order they are submitted based on their severity. Problems are reported to the TAC at 800 553-2447 for U.S. customers or 408 526-7209 for International customers. The TAC opens a CASE in the Cisco CARE system. The status of that case is available on the Internet at www.cisco.com. After a case is completed, closed, and resolved, version control and configuration control procedures are followed so that software integrity and documentation remain accurate and current.

6.5.1 Case Classifications


Table 6-9: Case Priority Levels
Priority Level Description

Priority 1 (P1)

Production network is down, causing critical impact to business operations if service is not restored quickly. No workaround is available. Cisco and the customer are willing to commit substantial resources around the clock to resolve the situation.

Priority 2 (P2)

Production network is severely degraded, affecting significant aspects of your business operations. No workaround is available. Cisco and the customer are willing to commit full-time resources during business hours to resolve the situation.

Priority 3 (P3)

Network performance is degraded. Network functionality is noticeably impaired, but most business operations continue.

Priority 4 (P4)

Customer requires information or assistance on Cisco product capabilities, installation, or configuration.


Note Priority 1 problem escalation times are measured in calendar hours, 24 hours per day, 7 days per week. Priority 2, 3, and 4 escalation times correspond with Cisco TAC business hours: 6 a.m. to 6 p.m. Pacific Time, Monday through Friday, excluding Cisco holidays.
TimeSaver Do not rely on electronic mail to submit Priority 1 or Priority 2 problems to Cisco. Telephone your servicing technical support organization.

Table 6-10: Customer Escalation Guideline
Elapsed Time Priority 1 Priority 2 Priority 3 Priority 4

1hour

CE Manager

4 hours

Tech Sup Dir

CE Manager

24 hours

VP of CA

Tech Sup Dir

48 hours

Pres. (CEO)

VP of CA

72 hours

CE Manager

96 hours

Pres. (CEO)

Tech Sup Dir

CE Manager

6.5.2 How to Open a Case

Have your maintenance contract number for the system that needs service and the system's serial number ready. Be prepared to give a brief description of the problem so the dispatcher can write a report and dispatch it to the appropriate CERT (Customer Engineering Response Team). It is also a good idea to tell the dispatcher the priority level of the problem (see the "Case Classifications" section). You will either be connected directly to an available CE or a CE will call you back shortly.

There are three ways to open a case with the TAC: online, by phone, or by e-mail.

    1. Open a case online at www.cisco.com.

    2. Call TAC phone number indicated in Table 6-11.

    3. Open a case by sending e-mail to tac@cisco.com.

6.5.3 Customer Preparation

Step 1 If you suspect a problem, call or e-mail the TAC and a case will be opened.

Step 2 Assign a case classification based on the criteria provided in the "Case Classifications" section.

Step 3 If you classify the case as P1 or P2 and send it via e-mail or the Internet, call the TAC to confirm the problem. The phone number is 800 553-2447 for U.S. customers or 408 526-7209 for international customers.

Step 4 A copy of the case is accessible to the customer at www-tac.cisco.com.

6.5.4 Getting Help with Your System

You have 24-hour support for your SS7 dial access solution via Cisco's Technical Assistance Center (TACs). There are 4 TACs worldwide. To initiate a case, contact the closest TAC and tell them your problem. You will be issued a case number that you can check by phone or on the Web.


Table 6-11: Telephone Numbers for Reporting Problems
Region Telephone

Asia-Pacific

61 2 9935 4107

Australia

800 805 227

China

10810 then 800 501-2306 (Mandarin)

10811 then 800 501-2306 (English)

Europe

32 2 778-4242

France

0590 7594

Hong Kong

800 96 5910

India

000 117 then 888 861-6453

Indonesia

001 800 61 838

Japan

0066 33 800 926

Korea

00798 611 0712

Seoul

00 911 then 888 861-5164

Malaysia

1 800 805880

New Zealand

0800 44 6237

North America

800 553-2447

408 526-7209

Philippines

800 611-0056

Singapore

800 6161 356

Taiwan

0080 61 1206

Thailand

001 800 611 0754

UK

0800 960 547

The Cisco TAC offers support in multiple languages during business hours and in English after business hours. You can send e-mail to the listed address and receive answers in the languages indicated in Table 6-12.


Table 6-12: E-Mail Languages
Language E-mail Address

English/Spanish

tac@cisco.com

Hanzi (Chinese)

chinese-tac@cisco.com

Kanji (Japanese)

japan-tac@cisco.com

Hangul (Korean)

korea-tac@cisco.com

Thai

thai-tac@cisco.com

You can also initiate your case online via the Internet at www.cisco.com. Outside the locations listed in Table 6-11 and Table 6-12, contact the Cisco regional sales office nearest you or contact your local authorized Cisco distributor.

6.5.5 TAC Case Initiation

    1. TAC opens the case based on information received from the customer and assigns a case number.

    2. The case is forwarded to the next support level for action and the case number is communicated to the originator.

    3. TAC support reviews the case and ensures that the classification assigned by the customer is appropriate and that the case contains the data and information required to analyze the problem.

6.5.6 Response Times

Based on the case classification, Cisco Systems is committed to the following initial customer response times and problem resolution times. A workaround that returns the customer to operation is considered acceptable as an interim problem resolution.


Table 6-13: Response Times Based on Case Classifications
Classification Initial Customer Response Problem Resolution

P1

1 hour

COB1---Next business day

P2

2 hours

COB---Third business day

P3

8 hours

Less than 60 days

P4

24 hours

Next minor release

1 COB = close of business

6.5.7 TAC Case Resolution

The following procedure summarizes how the TAC resolves each case.

    1. When a solution has been developed, TAC closes the case and records the elapsed time to measure compliance with the commitments detailed in the "Response Times" section .

    2. TAC contacts the customer and informs them of the solution and arranges the appropriate method of transmittal.

    3. In most cases, the problem solution is transmitted to the customer using an encrypted data link. If the size of the solution prohibits direct transmission, the solution is sent by the next available flight or Federal Express overnight priority service.

    4. Customer confirms with the TAC the receipt and installation of the solution. They also confirm that the solution resolves the problem described on the case.

    5. If the problem is not resolved to the customer's satisfaction, the case is reopened by the TAC. The case is then forwarded to the TAC support and the manager of operations is notified.


hometocprevnextglossaryfeedbacksearchhelp
Posted: Fri Nov 19 13:59:53 PST 1999
Copyright 1989-1999©Cisco Systems Inc.