This chapter covers the following topics:
Troubleshooting Line Card Issues
Troubleshooting Nexus Fabric Extenders (FEX)
NX-OS Virtual Device Contexts (VDC)
Chapter 1, “Introduction to Nexus Operating System (NX-OS),” explored the various Nexus platforms and the line cards supported on them. In addition to understanding the platform and the architecture, it is vital to understand what system components are present and how to troubleshoot various hardware-level components on the Nexus platforms. This chapter focuses on platform-level troubleshooting.
Nexus is a modular platform that comes in either a single-slot or multiple-slot chassis format. In a single-slot chassis, the Nexus switch has a supervisor card with the physical interfaces integrated into it. A multislot chassis supports supervisor engine cards (SUP cards), line cards, and fabric cards. Each type plays an important role in the Nexus forwarding architecture and makes it a highly available and distributed platform. Trouble with any of these cards leads to service degradation or service loss in part of the network, or even within the whole data center. Understanding the platform architecture and isolating the problem within the Nexus device itself is important to minimize the service impact.
Before delving into troubleshooting for Nexus platform hardware, it is important to know which series of Nexus device is being investigated and what kinds of cards are present in the chassis. The first step is to view the information of all the cards present in the chassis. Use the command show module [module-number] to view all the cards present on the Nexus device; here, module-number is optional for viewing the details of a specific line card. Examine the output of the show module command from Nexus 7009 and Nexus 3548P in Example 3-1. The first section of the output is from the Nexus 7000. It shows two SUP cards, one in active and one in standby state, along with three other cards: one is running fine, and the other two are powered down. The command output also shows the software and hardware version for each card and displays the online diagnostic status of those cards. It also shows the reason the powered-down cards are in that state. At the end, the command displays the fabric modules present in the chassis, along with their software and hardware versions and their status.
The second section of the output is from a Nexus 3500 switch that shows only a single SUP card. This is because the Nexus 3548P is a single rack unit (RU) switch. The number of modules present in the chassis depends on the device being used and the kind of cards it supports.
Note
A fabric module is not required for all Nexus 7000 chassis types. The Nexus 7004 chassis has no fabric module, for example. However, chassis types with more slots do require fabric modules for the Nexus 7000 switch to function successfully.
One of the most common issues noticed with Nexus 7000/7700 installations or hardware upgrades involves interoperability. For example, the network operator might try to install a line card in a VDC that does not function well in combination with the existing line cards. M3 cards operate only in combination with M2 or F3 cards in the same VDC. Similarly, Nexus Fabric Extender (FEX) cards are not supported in combination with certain line cards. Refer to the compatibility matrix to avoid possible interoperability issues. The show module command output in Example 3-1 for Nexus 7000 switches highlights a similar problem, with two line cards powered down because of incompatibility.
Note
Nexus I/O module compatibility matrix CCO documentation is available at http://www.cisco.com/c/dam/en/us/td/docs/switches/datacenter/nexus7000/sw/matrix/technical/reference/Module_Comparison_Matrix.pdf.
The referenced CCO documentation also lists the compatibility of the FEX modules with different line cards.
The show hardware command is used to get detailed information about both the software and the hardware on the Nexus device. The command displays the status of the Nexus switch, as well as the uptime, the health of the cards (both line cards and fabric cards), and the power supply and fans present in the chassis.
Similar to Cisco 6500 series switches, Nexus devices support the Generic Online Diagnostics (GOLD) tool, a platform-independent fault-detection framework that helps isolate hardware and resource issues on the system, both during bootup and at runtime. The diagnostic tests can be either disruptive or nondisruptive. Disruptive tests affect the functionality of the system partially or completely; nondisruptive tests do not affect the functionality of the system while running.
Bootup diagnostics detect hardware faults such as soldering errors, loose connections, and faulty modules. These tests run when the system boots up, before the hardware is brought online. Table 3-1 shows some of the bootup diagnostic tests.
Table 3-1 Bootup Diagnostic Tests

| Test Name | Description | Attributes | Hardware |
| --- | --- | --- | --- |
| ASIC Register Test | Tests access to all the registers in the ASICs | Disruptive | SUP and line card |
| ASIC Memory Test | Tests access to all the memory in the ASICs | Disruptive | SUP and line card |
| EOBC Port Loopback | Tests the loopback of the Ethernet out-of-band channel (EOBC) | Disruptive | SUP and line card |
| Port Loopback Test | Tests the port in internal loopback and checks the forwarding path by sending and receiving data on the same port | Disruptive | Line card |
| Boot Read-Only Memory (ROM) Test | Tests the integrity of the primary and secondary boot devices on the SUP card | Nondisruptive | SUP |
| Universal Serial Bus (USB) | Verifies the USB controller initialization on the SUP card | Nondisruptive | SUP |
| Management Port Loopback Test | Tests the loopback of the management port on the SUP card | Disruptive | SUP |
| OBFL | Tests the integrity of the onboard failure logging (OBFL) flash | Nondisruptive | SUP and line card |
| Federal Information Processing Standards (FIPS) | Verifies the security device on the module | Disruptive | Line card |
Note
The FIPS test is not supported on the F1 series modules on Nexus 7000.
Bootup diagnostics are configured to be performed and supported at one of the following levels:
None (Bypass): The module is put online without running any bootup diagnostic tests, for faster card bootup.
Complete: All the bootup diagnostic tests are run for the module. This is the default and the recommended level for bootup diagnostics.
The diagnostic level is configured using the command diagnostic bootup level [bypass | complete] in global configuration mode. The diagnostic level must be configured within individual VDCs, where applicable. The bootup diagnostic level is verified using the command show diagnostic bootup level.
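The bootup diagnostic configuration described above can be sketched as a short configuration and verification sequence; the switch prompt and output line are representative, not verbatim:

```
N7K(config)# diagnostic bootup level complete
N7K(config)# exit
N7K# show diagnostic bootup level
Current bootup diagnostic level: complete
```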
The runtime diagnostics run when the system is in the running state (that is, on a live node). These tests help detect runtime hardware errors such as memory errors, resource exhaustion, and hardware faults/degradation. The runtime diagnostics are further classified into two categories:
Health-monitoring diagnostics
On-demand diagnostics
Health-monitoring (HM) tests are nondisruptive and run in the background on each module. The main aim of these tests is to ensure that the hardware and software components are healthy while the switch is running network traffic. Some specific HM tests, marked as HM-always, start by default when the module goes online. Users can easily enable and disable all HM tests except HM-always tests on any module via the configuration command-line interface (CLI). Additionally, users can change the interval of all HM tests except the fixed-interval tests marked as HM-fixed. Table 3-2 lists the HM tests available across SUP and line card modules.
Table 3-2 Health-Monitoring Diagnostic Tests

| Test Name | Description | Attributes | Hardware |
| --- | --- | --- | --- |
| ASIC Scratch Register Test | Tests access to a scratch pad register of the ASICs | Nondisruptive | SUP and line card (all ASICs that support a scratch pad register) |
| RTC Test | Verifies that the real-time clock (RTC) on the supervisor is ticking | Nondisruptive | SUP |
| Nonvolatile Random Access Memory (NVRAM) Sanity Test | Tests the sanity of NVRAM blocks on the SUP modules | Nondisruptive | SUP |
| Port Loopback Test | Periodically loops back a packet to check the forwarding path without disrupting port traffic | Nondisruptive | Line card (all front-panel ports on the switch) |
| Rewrite Engine Loopback Test | Tests the integrity of loopback for all ports to the Rewrite Engine ASIC on the module | Nondisruptive | Line card |
| Primary Boot ROM Test | Tests the integrity of the primary boot devices on the card | Nondisruptive | SUP and line card |
| Secondary Boot ROM Test | Tests the integrity of the secondary boot devices on the card | Nondisruptive | SUP and line card |
| CompactFlash | Verifies access to the internal CompactFlash on the SUP card | Nondisruptive | SUP |
| External CompactFlash | Verifies access to the external CompactFlash on the SUP card | Nondisruptive | SUP |
| Power Management Bus Test | Tests the standby power management control bus on the SUP card | Nondisruptive | SUP |
| Spine Control Bus Test | Tests and verifies the availability of the standby spine module control bus | Nondisruptive | SUP |
| Standby Fabric Loopback Test | Tests the packet path between the standby SUP and the fabric | Nondisruptive | SUP |
| Status Bus (Two Wire) Test | Checks the two-wire interfaces that connect the various modules (including fabric cards) to the SUP module | Nondisruptive | SUP |
The interval for HM tests is set using the global configuration command diagnostic monitor interval module slot test [name | test-id | all] hour hour min minutes second sec. Note that the name of the test is case sensitive. To enable or disable an HM test, use the global configuration command [no] diagnostic monitor module slot test [name | test-id | all]. Use the command show diagnostic content module [slot | all] to display information about the diagnostics and their attributes on a given line card. Example 3-2 illustrates how to view the diagnostics information on a line card on a Nexus 7000 switch and how to disable an HM test. The module in the output of Example 3-2 is the SUP card, so the test names listed are relevant only for the SUP card, not for line cards. For example, with the ExternalCompactFlash test, notice that the attribute in the first output is set to A, which indicates that the test is Active. When the test is disabled from configuration mode, the output displays the attribute as I, indicating that the test is Inactive.
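As a sketch of these commands, the module numbers below are illustrative; the syntax mirrors what is described above:

```
! Run the PortLoopback HM test on module 3 every 30 minutes
N7K(config)# diagnostic monitor interval module 3 test PortLoopback hour 0 min 30 second 0
! Disable, then re-enable, the ExternalCompactFlash HM test on the SUP in slot 5
N7K(config)# no diagnostic monitor module 5 test ExternalCompactFlash
N7K(config)# diagnostic monitor module 5 test ExternalCompactFlash
```

Remember that test names are case sensitive, so PortLoopback and ExternalCompactFlash must be entered exactly as listed in the show diagnostic content module output.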
The command show diagnostic content module [slot | all] displays not only the HM tests but also the bootup diagnostic tests. In the output of Example 3-2, notice the tests whose attributes begin with C. Those tests are complete bootup-level tests. To view all the test results and statistics, use the command show diagnostic result module [slot | all] [detail]. When verifying the diagnostic results, ensure that no test has a Fail (F) or Error (E) result. Example 3-3 displays the diagnostic test results of the SUP card in both brief and detailed format. The output shows that the bootup diagnostic level is set to complete. The first output lists all the tests the SUP module went through, along with their results, where “.” indicates that the test passed. The detailed version of the output lists more specific details, such as the error code, the previous execution time, the next execution time, and the reason for failure. This detailed information is useful when issues are observed on the module and investigation is required to isolate a transient issue or a hardware issue.
On-demand diagnostics have a different focus. Some tests are not required to run periodically, but they might be run in response to certain events (such as faults) or in anticipation of an event (such as resource exhaustion). Such on-demand tests are useful in localizing faults and applying fault-containment solutions.
Both disruptive and nondisruptive on-demand diagnostic tests are run from the CLI. An on-demand test is executed using the command diagnostic start module slot test [test-id | name | all | non-disruptive] [port port-number | all]. The test-id variable identifies one of the tests supported on a given module. A test can also be run on a per-port basis (depending on the kind of test) by specifying the optional port keyword. The command diagnostic stop module slot test [test-id | name | all] is used to stop an on-demand test. The on-demand tests default to a single execution, but the number of iterations can be increased using the command diagnostic ondemand iteration number, where number specifies the number of iterations. Be careful when running disruptive on-demand diagnostic tests while the device carries production traffic.
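A representative on-demand run, using an illustrative module number and the PortLoopback test, might look like this:

```
! Run three iterations of the test instead of the default single execution
N7K# diagnostic ondemand iteration 3
! Start the PortLoopback test on port 1 of module 3
N7K# diagnostic start module 3 test PortLoopback port 1
! Stop the test before the iterations complete, if needed
N7K# diagnostic stop module 3 test PortLoopback
```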
Example 3-4 demonstrates an on-demand PortLoopback test on a Nexus 7000 switch module.
During troubleshooting, if the number of iterations is set to a higher value and an action needs to be taken if the test fails, use the command diagnostic ondemand action-on-failure [continue failure-count num-fails | stop]. When the continue keyword is used, the failure-count parameter sets the number of failures allowed before stopping the test. This value defaults to 0, which means to never stop the test, even in case of failure. The on-demand diagnostic settings are verified using the command show diagnostic ondemand setting. Example 3-5 illustrates how to set the action upon failure for on-demand diagnostic tests. In this example, the action-on-failure is set to continue until the failure count reaches the value of 2.
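The action-on-failure behavior described above can be sketched as follows; the output lines are representative of the show diagnostic ondemand setting format, not verbatim:

```
N7K# diagnostic ondemand action-on-failure continue failure-count 2
N7K# show diagnostic ondemand setting
Test iterations = 3
Action on test failure = continue until test failure limit reaches 2
```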
Note
Diagnostic tests are also run in offline mode. Use the command hardware module slot offline to put the module in offline mode, and then use the command diagnostic start module slot test [test-id | name | all] offline to execute the diagnostic test with the offline attribute.
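A sketch of the offline sequence, with an illustrative slot number (the exact configuration mode for the hardware module command may vary by platform and release):

```
N7K(config)# hardware module 3 offline
N7K(config)# exit
N7K# diagnostic start module 3 test all offline
```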
The diagnostic tests help identify hardware problems on SUP as well as line cards, but corrective actions also need to be taken whenever those problems are encountered. NX-OS provides such a capability by integrating GOLD tests with the Embedded Event Manager (EEM), which takes corrective actions in case diagnostic tests fail. One of the most common use cases for GOLD tests is conducting burn-in testing or staging new equipment before placing the device into a production environment. Burn-in testing is similar to load testing: The device is typically under some load, with investigation into resource utilization, including memory, CPU, and buffers over time. This helps prevent any major outages that result from hardware issues before the device starts processing production traffic.
NX-OS supports corrective actions for the following HM tests:
RewriteEngineLoopback
StandbyFabricLoopback
Internal PortLoopback
SnakeLoopback
On the Supervisor module, if the StandbyFabricLoopback test fails, the system reloads the standby supervisor card. If the standby supervisor card does not come back up online in three retries, the standby supervisor card is powered off. After the reload of the standby supervisor card, the HM diagnostics start by default. The corrective actions are disabled by default and are enabled by configuring the command diagnostic eem action conservative.
Note
The command diagnostic eem action conservative is not configurable on a per-test basis; it applies to all four of the previously mentioned GOLD tests.
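Enabling the corrective actions is a single global configuration command:

```
N7K(config)# diagnostic eem action conservative
```

Because the command is global, verify that automatic corrective actions (such as reloading a standby SUP) are acceptable in the environment before enabling it.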
In any network environment, the network administrators and operators are required to perform regular device health checks to ensure stability in the network and to capture issues before they cause major network impacts. Health checks are performed either manually or by using automation tools. The command line might vary among Nexus platforms, but a few common points are verified at regular intervals:
Module state and diagnostics
Hardware and process crashes and resets
Packet drops
Interface errors and drops
The previous section covered module state and diagnostics. This section focuses on commands used across different Nexus platforms to perform health checks.
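The four checks listed above map to commands covered in this chapter; a manual health-check pass might be sketched as follows (command availability varies slightly by platform):

```
! Module state and diagnostics
N7K# show module
N7K# show diagnostic result module all
! Hardware and process crashes and resets
N7K# show cores vdc-all
N7K# show system reset-reason
! Packet drops at the platform level
N7K# show hardware internal errors all
! Interface errors and drops
N7K# show interface counters errors
```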
Line card and supervisor card reloads or crashes can cause major outages on a network. The crashes or reloads happen because of either hardware or software issues. Because NX-OS has a distributed architecture, individual processes can also crash. In most hardware or process crashes, a core file is generated after the crash. The Cisco Technical Assistance Center (TAC) can use that core file to identify the root cause of the crash. Core files are found using the command show cores vdc-all. On the Nexus 7000 switch, run the show cores vdc-all command from the default VDC. Example 3-6 displays the cores generated on a Nexus 7000 switch. In this example, the core file is generated for VDC 1 module 6 and for the RPM process.
When the core file is identified, it can be copied to bootflash or to an external location, such as a File Transfer Protocol (FTP) or Trivial FTP (TFTP) server. On Nexus 7000, the core files are located in the core: file system. The relevant core files follow this URL format:
core://<module-number>/<process-id>/<instance-number>
For instance, in Example 3-6, the location for the core files is core://6/4298/1. If the Nexus 7000 switch rebooted or a switchover occurred, the core files would be located in the logflash://[sup-1 | sup-2]/core directory. On other Nexus platforms, such as Nexus 5000, 4000, or 3000, the core files are located in the volatile: file system instead of the logflash: file system; thus, they can be lost if the device reloads. In newer software versions for platforms that store core files in the volatile: file system, the capability was added to write the core files to bootflash: or to a remote location when they occur.
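Using the core: URL format above, a core file can be copied off the switch for TAC analysis; the destination file name and server address below are hypothetical:

```
N7K# copy core://6/4298/1 bootflash:rpm-core-mod6
N7K# copy core://6/4298/1 tftp://192.0.2.10/rpm-core-mod6 vrf management
```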
If a process crashed but no core files were generated for the crash, a stack trace might have been generated for the process. If neither a core file nor a stack trace exists for the crashed service, use the command show processes log vdc-all to identify which processes were impacted. Such crashed processes usually are marked with the N flag. Using a process ID (PID) value from the previous command with the command show processes log pid pid identifies the reason the service went down. The command output displays the reason the process failed in the Death reason field. Example 3-7 displays using the show processes log and show processes log pid commands to identify crashes on the Nexus platform.
For quick verification of the last reset reason, use the show system reset-reason command. Additional commands to capture and identify the reset reason when core files were not generated follow:
show system exception-info
show module internal exceptionlog module slot
show logging onboard [module slot]
show process log details
Packet loss is a complex issue to troubleshoot in any environment. Packet loss happens for multiple reasons:
Bad hardware
Drops on a platform
A routing or switching issue
The packet drops that result from routing and switching issues can be fixed by rectifying the configuration. Bad hardware, on the other hand, impacts all traffic on some of the ports or on the whole line card. Nexus platforms provide various counters that can be viewed to determine the reason for packet loss on the device (see the following sections).
Apart from platform or hardware drops, interface issues can lead to packet loss and service degradation in a data center environment. Issues such as flapping links, links not coming up, interface errors, and input or output discards are just a few of the scenarios that can have a major impact on services. Deciphering faults on a link can be difficult on a switch, but NX-OS provides CLI and internal platform commands that can help.
The show interface interface-number command displays detailed information regarding the interface, such as the interface traffic rate, input and output statistics, and error counters for input/output errors, CRC errors, overrun counters, and more. The NX-OS CLI also provides different command options (including the show interface command) that are useful for verifying interface capabilities, transceiver information, counters, flow control, MAC address information, and switchport and trunk information. Example 3-8 displays the output of the show interface command, with various fields highlighting the information to be verified on an interface. The second part of the output displays information on the various capabilities of the interface.
To view just the various counters on the interfaces, use the command show interface counters errors. The counters errors option is also used with the specific show interface interface-number command. Example 3-9 displays the error counters for the interface. If any counter is increasing, the interface needs further troubleshooting, based on the kind of errors received. The error can point to Layer 1 issues, a bad port issue, or even buffer issues. Some counters indicated in the output are not errors, but instead indicate a different problem: The Giants counter, for instance, indicates that packets are being received with a higher MTU size than the one configured on the interface.
To view the details of the hardware interface resources and utilization, use the command show hardware capacity interface. This command displays not only buffer information but also any drops in both the ingress and egress directions on multiple ports across each line card. The output varies a bit among Nexus platforms, such as between the Nexus 7000 and the Nexus 9000, but this command is useful for identifying interfaces with the highest drops on the switch. Example 3-10 displays the hardware interface resources on the Nexus 7000 switch.
One of the most common problems on an interface is input and output discards. These errors usually take place when congestion occurs on the ports. The previous interface commands and the show hardware internal errors [module slot] command are useful in identifying input or output discards. If input discards are identified, look for congestion on the egress ports. Input discards can also occur when SPAN is configured on the device and the egress ports are oversubscribed. Thus, ensure that SPAN is not configured on the device unless it is required for performing a SPAN capture; in that case, remove the configuration afterward. If the congested egress port is a 1-Gigabit port, the problem could result from a many-to-one unicast traffic flow causing congestion. This issue can be overcome by upgrading to a 10-Gigabit port or by bundling multiple 1-Gigabit ports into a port-channel interface.
Output discards are usually caused by drops in the queueing policy on the interface. This is verified using the command show system internal qos queueing stats interface interface-id. The queueing policy configuration information is viewed using the command show queueing interface interface-id or show policy-map interface interface-id [input | output]. Tuning the QoS policy prevents the output discards or drops. Example 3-11 displays the queueing statistics for interface Ethernet1/5, indicating drops in various queues on the interface.
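A quick sketch of the queueing checks, using the interface from Example 3-11:

```
N7K# show queueing interface ethernet 1/5
N7K# show system internal qos queueing stats interface ethernet 1/5
N7K# show policy-map interface ethernet 1/5 output
```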
Nexus platforms provide in-depth information on various platform-level counters to identify problems with hardware and software components. If packet loss is noticed on a particular interface or line card, the platform-level commands provide information on what is causing the packets to be dropped. For instance, on the Nexus 7000 switch, the command show hardware internal statistics [module slot | module-all] pktflow dropped is used to identify the reason for packet drops. This command provides per-module detail and shows packet drops across all interfaces on the line card. Example 3-12 displays the packet drops across various ports on the line card in slot 3. The command output displays packet drops resulting from bad packet length, error packets from the Media Access Control (MAC) layer, a bad cyclic redundancy check (CRC), and so on. Using the diff keyword along with the command helps identify drops that are increasing on particular interfaces, and for which specific reasons, for further troubleshooting.
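A sketch of the pktflow check, using the slot from Example 3-12 (the exact placement of the diff keyword follows the platform syntax):

```
N7K# show hardware internal statistics module 3 pktflow dropped
! Repeat with the diff keyword to display only counters that incremented since the last run
N7K# show hardware internal statistics module 3 pktflow dropped diff
```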
Communication among the supervisor card, line cards, and fabric cards occurs over the Ethernet out-of-band channel (EOBC). If errors occur on the EOBC, the Nexus switch can experience packet loss and major service loss. EOBC errors are verified using the command show hardware internal cpu-mac eobc stats. The Error Counters section displays a list of errors that occur on the EOBC interface. In most instances, physically reseating the line card fixes the EOBC errors. Example 3-13 displays the EOBC stats for Error Counters on a Nexus 7000 switch. Filter the output to check just the error counters by using the grep keyword (see Example 3-13).
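The filtering described above can be sketched as follows; the grep pattern is illustrative, and patterns are case sensitive:

```
N7K# show hardware internal cpu-mac eobc stats
N7K# show hardware internal cpu-mac eobc stats | grep error
```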
Nexus platforms also provide in-band statistics for packets that the central processing unit (CPU) processes. If the error counters in the in-band stats increase frequently, this could indicate a problem with the supervisor card and might lead to packet loss. To view the CPU in-band statistics, use the command show hardware internal cpu-mac inband stats. This command displays various statistics on packets and length of packets received by or sent from the CPU, interrupt counters, error counters, and present and maximum punt statistics. Example 3-14 displays the output of the in-band stats on the Nexus 7000 switch. This command is also available on the Nexus 9000 switch, as the second output shows.
Note
The output varies among Nexus platforms. For instance, the previous output is brief and comes from the Nexus 9396 PX switch. The same command output on the Nexus 9508 switch is similar to the output displayed for the Nexus 7000 switch. This command is available on all Nexus platforms.
In the previous output, the in-band stats command on Nexus 9396, though brief, displays the time when the traffic hit the peak rate; such information is not available on the command for the Nexus 7000 switch. Nexus 7000 provides the show hardware internal cpu-mac inband events command, which displays the event history of the traffic rate in the ingress (Rx) or egress (Tx) direction of the CPU, including the peak rate. Example 3-15 displays the in-band events history for the traffic rate in the ingress or egress direction of the CPU. The time stamp of the peak traffic rate is useful when investigating high CPU or packet loss on the Nexus 7000 switches.
NX-OS also provides a brief in-band counters CLI that displays the number of in-band packets in both the ingress (Rx) and egress (Tx) directions, errors, dropped counters, overruns, and more. These counters are used to quickly determine whether the in-band traffic is getting dropped. Example 3-16 displays the output of the command show hardware internal cpu-mac inband counters. If nonzero counters appear for errors, drops, or overruns, use the diff keyword to determine whether they are increasing frequently. This command is available on all platforms.
Packet drops on the Nexus switch happen because of various errors in the hardware. The drops happen either at the line card or on the supervisor module itself. To view the various errors and their counters across all the modules on a Nexus switch, use the command show hardware internal errors [all | module slot]. Example 3-17 displays the hardware internal errors on the Nexus 7000 switch. Note that the command is applicable for all Nexus platforms.
Note
Each Nexus platform has different ASICs where errors or drops are observed. However, these are outside the scope of this book. It is recommended to capture show tech-support detail and tac-pac command output during problematic states, to identify the platform-level problems leading to packet loss.
Fabric Extender (FEX) is a 1RU fixed-configuration chassis designed to provide top-of-rack connectivity for servers. As the name suggests, FEX does not function on its own. It is specifically designed to extend the architecture and functionality of the Nexus switches. FEX is connected to Nexus 9000, 7000, 6000, and 5000 series parent switches. The uplink ports connecting the FEX to the parent switch are called the fabric ports or network interface (NIF) ports; the ports on the FEX module that connect the servers (front-panel ports) are called the satellite ports or host interface (HIF) ports. Cisco released FEX models in three categories, according to their capabilities and capacity:
1 GE Fabric Extender
N2224TP, 24 port
N2248TP, 48 port
N2248TP-E, 48 port
10GBASE-T Fabric Extender
N2332TQ, 32 port
N2348TQ, 48 port
N2348TQ-E, 48 port
N2232TM, 32 port
N2232TM-E, 32 port
10G SFP+ Fabric Extender
N2348UPQ, 48 port
N2248PQ, 48 port
N2232PP, 32 port
Note
Compatibility between an FEX and its parent switch is documented in the release notes of the software version running on the parent Nexus switch.
Connectivity between the parent switch and an FEX occurs in three different modes:
Pinning: In pinning mode, a one-to-one mapping takes place between HIF ports and uplink ports. Thus, traffic from a specific HIF port can traverse only a specific uplink. Failures on uplink ports bring down the mapped HIF ports.
Port-channeling: In this mode, the uplink is treated as one logical interface. All the traffic between the parent switch and FEX is hashed across the different links of the port-channel.
Hybrid: This mode is a combination of the pinning and port-channeling modes. The uplink ports are split into two port-channels and the HIF ports are pinned to a specific uplink port-channel.
Note
Chapter 4, “Nexus Switching,” has more details on the FEX supported and nonsupported designs.
To enable FEX, NX-OS first requires installing the feature set using the command install feature-set fex. Then the feature set must be enabled using the command feature-set fex. If FEX is being enabled on the Nexus 7000, the FEX feature set is installed in the default VDC along with the command no hardware ip verify address reserved; feature-set fex is then configured under the relevant VDC. The command no hardware ip verify address reserved is required only when the intrusion detection system (IDS) reserved address check is enabled. This is verified using the command show hardware ip verify. If the check is already disabled, the command no hardware ip verify address reserved does not need to be configured.
When the feature-set fex is enabled, interfaces are configured as FEX fabric interfaces using the command switchport mode fex-fabric. The next step is to assign an ID to the FEX, which is used to identify the FEX on the parent switch. Example 3-18 illustrates the configuration on the Nexus switch for connecting to an FEX.
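A representative FEX bring-up configuration on a Nexus 7000; the VDC split, interface range, port-channel number, and FEX ID 101 are illustrative:

```
! In the default VDC (Nexus 7000 only)
N7K(config)# install feature-set fex
N7K(config)# no hardware ip verify address reserved
! In the VDC where the FEX will be hosted
N7K(config)# feature-set fex
N7K(config)# interface ethernet 4/1-2
N7K(config-if-range)# switchport
N7K(config-if-range)# switchport mode fex-fabric
N7K(config-if-range)# fex associate 101
N7K(config-if-range)# channel-group 101
```

The fex associate command assigns the FEX ID, and bundling the fabric ports with channel-group uses the port-channeling connectivity mode described earlier.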
When FEX configuration is complete, the FEX is accessible on the parent switch and its interfaces are available for further configuration. To verify the status of the FEX, use the command show fex. This command shows the status of the FEX, along with the FEX module number and the ID assigned by the parent switch. To determine which FEX interfaces are accessible on the parent switch, use the command show interface interface-id fex-intf. Note that the interface-id in this command is the NIF port-channel interface. Example 3-19 examines the output of the show fex and the show interface fex-intf commands to verify the FEX status and its interfaces.
Further details on the FEX are viewed using the command show fex fex-number detail. This command displays the status of the FEX and all the FEX interfaces. Additionally, it displays the details of pinning mode and information regarding the FEX fabric ports. Example 3-20 displays the detailed output of the FEX 101.
When the FEX satellite ports are available, they can be configured as either Layer 2 or Layer 3 ports; they can also operate as active-active ports by making them part of a vPC configuration.
If issues arise with the fabric ports or the satellite ports, the state change information is viewed using the command show system internal fex info fport [all | interface-number] or show system internal fex info satport [all | interface-number]. Example 3-21 displays the internal information of both the satellite and fabric ports on the Nexus 7000 switch. In the first section of the output, the command displays a list of events that the system goes through to bring up the FEX. It lists all the finite state machine events, which is useful while troubleshooting in case the FEX does not come up and gets stuck in one of the states. The second section of the output displays information about the satellite ports and their status information.
Note
If any issues arise with the FEX, it is useful to collect show tech-support fex fex-number during the problematic state. The issue might also stem from the Ethpm component on Nexus because the FEX sends state change messages to Ethpm. Thus, capturing the show tech-support ethpm output during the problematic state could also be relevant. Ethpm is discussed later in this chapter.
Virtual Device Contexts (VDC) are logical partitions of a physical device that provide software fault isolation and the capability to manage each partition independently. Each VDC instance runs its own instance of routing protocol services, resulting in better utilization of system resources. Following are the few points to remember before creating VDCs:
Only users with the network-admin role can create a VDC and allocate resources to it.
VDC1 (default VDC) is always active and cannot be deleted.
The name of the VDC is not case sensitive.
VDC is supported only on Nexus 7000 or 7700 series switches.
Supervisor 1 and Supervisor 2 support a maximum of four VDCs; Supervisor 2E supports a maximum of eight VDCs.
Nexus switches running Supervisor 2 or 2E cards and beginning with NX-OS version 6.1(1) support the Admin VDC.
Three primary kinds of VDCs are supported on the Nexus 7000 platform:
Ethernet: Supports traditional L2/L3 protocols.
Storage: Supports Fibre Channel over Ethernet (FCoE)–specific protocols, such as FCoE Initialization Protocol (FIP).
Admin: Provides administrative control to the complete system and helps manage other VDCs configured on the system.
A VDC resource template enables users to assign resources to VDCs with the same resource requirements. Unless a resource template is assigned to a VDC, the template does not take effect. Using resource templates minimizes configuration and, at the same time, eases manageability on a Nexus platform. Each VDC resource template can limit the following resources:
Monitor-session: Number of span sessions
Port-channel: Number of port-channels
U4route-mem: IPv4 route memory limit
U6route-mem: IPv6 route memory limit
M4route-mem: IPv4 multicast memory limit
M6route-mem: IPv6 multicast memory limit
Vlan: Number of VLANs
Vrf: Number of Virtual Routing and Forwarding (VRF) instances
The VDC resource template is configured using the command vdc resource template name. This puts you in resource template configuration mode, where you can limit the resources previously mentioned by using the command limit-resource resource minimum value maximum value, where resource can be any of the resources listed previously. To view the configured resources within a template, use the command show vdc resource template [vdc-default | name], where vdc-default is for the default VDC template. Example 3-22 demonstrates configuration of a VDC template and the show vdc resource template command output displaying the configured resources within the template.
If the network requires all the VDCs on the Nexus to perform different tasks and have different kinds of resources allocated to them, it is better not to configure VDC templates. Instead, limit the VDC resources using the limit-resource command under vdc configuration mode.
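A resource template configuration can be sketched along the following lines; the template name, VDC name, and limit values are hypothetical and shown only to illustrate the command structure:

```
vdc resource template WEB-TEMPLATE
  limit-resource vlan minimum 16 maximum 256
  limit-resource port-channel minimum 0 maximum 64
  limit-resource vrf minimum 2 maximum 32
  limit-resource u4route-mem minimum 8 maximum 80
!
vdc N7k-2
  template WEB-TEMPLATE
```

Remember that the template takes effect only after it is applied to a VDC, as stated previously.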
VDC creation is broken down into four simple steps:
Step 1. Define a VDC. A VDC is defined using the command vdc name [id id] [type Ethernet | storage]. By default, a VDC is created as an Ethernet VDC.
Step 2. Allocate interfaces. Single or multiple interfaces are allocated to a VDC using the command allocate interface interface-id. Note that the allocate interface configuration cannot simply be negated; interfaces can only be moved from one VDC to another and cannot be released back to the default VDC. If the user deletes the VDC, the interfaces are unallocated and become part of VDC ID 0.
For the 10G interface, some modules require all the ports tied to the port-ASIC to be moved together. This is done so as to retain the integrity where each port group can switch between dedicated and shared mode. An error message is displayed if not all members of the same port group are allocated together. Beginning with NX-OS Release 5.2(1), all members of a port group are automatically allocated to the VDC when only a member of the port group is being added to the VDC.
Step 3. Define the HA policy. The high availability (HA) policy is determined based on whether Nexus is running on a single supervisor or a dual supervisor card. The HA policy is configured using the command ha-policy [single-sup | dual-sup] policy under the VDC configuration. Table 3-3 lists the different HA policies based on single or dual supervisor cards.
Single SUP | Dual SUP
Bringdown | Bringdown
Restart (default) | Restart
Reset | Switchover (default)
Step 4. Limit resources. Limiting resources on VDC is done by either applying a VDC resource template or manually assigning the resource using the limit-resource command. Certain resources cannot be assigned as part of the template; thus, the limit-resource command is required. The limit-resource command also enables you to define the type of modules that are supported in the VDC. When the VDC is initialized, its resources are modified only by using the limit-resource command. The template option then becomes invalid.
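The four steps can be combined into a single configuration sketch; the VDC name, interface range, and resource values here are hypothetical:

```
vdc N7k-2 id 2 type ethernet
  allocate interface Ethernet3/1-8
  ha-policy single-sup restart
  ha-policy dual-sup switchover
  limit-resource module-type f3
  limit-resource vlan minimum 16 maximum 256
```

The ha-policy lines reflect the defaults from Table 3-3 and could be omitted; they are shown explicitly for clarity.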
Example 3-23 demonstrates the configuration for creating an Ethernet VDC. Notice that if a particular interface is added to the VDC and the other members of its port group are not part of the list, NX-OS automatically tries to add the remaining ports to the VDC. The VDC defined in Example 3-23 limits the module type to F3 series modules; adding ports from an F2 or M2 series module, for instance, would result in an error.
The VDC must be initialized before VDC-specific configuration is applied. Before VDC initialization, perform a copy run start after the VDC is created so that the newly created VDC is part of the startup configuration. The VDC is initialized using the switchto vdc name command from the default or admin VDC (see Example 3-24). The initialization process of the VDC follows steps similar to bringing up a new Nexus switch: It prompts for the admin password and then the basic configuration dialog. Either perform the basic configuration setup for the VDC through this dialog or configure the VDC manually by replying no to the basic configuration dialog. The command switchback is used to switch back to the default or admin VDC.
In Example 3-24, after the VDC is initialized, the host name of the VDC is seen as N7k-1-N7k-2—that is, the hostnames of both the default VDC and the new VDC are concatenated. To avoid this behavior, configure the command no vdc combined-hostname in default or admin VDC.
The Cisco NX-OS software provides a virtual management interface for out-of-band management for each VDC. Each virtual management interface is configured with a separate IP address that is accessed through the physical mgmt0 interface. Using the virtual management interface enables you to use only one management network, which shares the AAA servers and syslog servers among the VDCs.
VDCs also support in-band management. The VDC is accessed using one of the Ethernet interfaces allocated to it. Using in-band management involves using separate management networks, which ensures separation of the AAA servers and syslog servers among the VDCs.
NX-OS software provides a CLI to easily manage the VDCs when troubleshooting problems. The VDC configuration of all the VDCs is seen from default or admin VDC. Use the command show run vdc to view all the VDC-related configuration. Additionally, when saving the configuration, use the command copy run start vdc-all to copy the configuration done on all VDCs.
NX-OS provides a CLI to view further details of the VDC without looking at the configuration. Use the command show vdc [detail] to view the details of each VDC. The show vdc detail command displays various lists of information for each VDC, such as ID, name, state, HA policy, CPU share, creation time and uptime of the VDC, VDC type, and line cards supported by each VDC (see Example 3-25). On a Nexus 7000 switch, some VDCs might be running critical services. By default, NX-OS allocates an equal CPU share (CPU resources) to all the VDCs. On SUP2 and SUP2E supervisor cards, NX-OS allows users to allocate a specific amount of the switch’s CPU, to prioritize more critical VDCs.
To further view the details of resources allocated to each VDC, use the command show vdc resource [detail]. This command displays the configured minimum and maximum value and the used, unused, and available values for each resource. The output is run for individual VDCs using the command show vdc name resource [detail]. Example 3-26 displays the resource configuration and utilization for each VDC on the Nexus 7000 chassis running two VDCs (for instance, N7k-1 and N7k-2).
Based on the kind of line cards the VDC supports, interfaces are allocated to each VDC. To view the member interfaces of each VDC, use the command show vdc membership. Example 3-27 displays the output of the show vdc membership command. In Example 3-27, notice the various interfaces that are part of VDC 1 (N7k-1) and VDC 2 (N7k-2). If a particular VDC is deleted, the interfaces become unallocated and are thus shown under the VDC ID 0.
NX-OS also provides internal event history logs to view errors or messages related to a VDC. Use the command show vdc internal event-history [errors | msgs | vdc_id id] to view the debugging information related to VDCs. Example 3-28 demonstrates creating a new VDC (N7k-3) and shows relevant event history logs that display events the VDC creation process goes through before the VDC is created and active for use. The events in Example 3-28 show the VDC creation in progress and then show that it becomes active.
Note
If a problem arises with a VDC, collect the show tech-support vdc and show tech-support detail command output during the problematic state to open a TAC case.
Creating VDCs is simple. The challenge arises when interfaces are allocated from different module types present in the chassis. The operating modes of the line cards change with the different combination of line cards present in the chassis. While limiting the module-type resource for the VDC, be careful of the compatibility between M series line cards and F series line cards. Also keep the following guidelines in mind when both F and M series line cards are present in the chassis:
Interfaces from F2E and M3 series line cards cannot coexist.
If M2 module interfaces are operating with M3 module interfaces in a VDC, interfaces from that M2 module cannot be allocated to another VDC.
If interfaces from both M2 and M3 series line cards are present in the VDC, the M2 module must operate in M2-M3 interop mode.
If interfaces from both F2E and M2 series line cards are present in the VDC, the M2 module must operate in M2-F2E mode.
The M2 module must be in M2-F2E mode to operate in the other VDC.
The M2 series line cards support both M2-F2E and M2-M3 interop modes, with the default being M2-F2E mode. M3 series line cards, on the other hand, support M2-M3 interop mode only. To allocate interfaces from both M2 and M3 modules that are part of same VDC, use the command system interop-mode m2-m3 module slot to change the operating mode of M2 line cards to M2-M3. Use the no option to disable M2-M3 mode and fall back to the default M2-F2E mode on the M2 line card.
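As a sketch, the interop mode change described above looks as follows; the slot number (4) is an assumed value for the M2 line card:

```
! Assume the M2 line card sits in slot 4
system interop-mode m2-m3 module 4
!
! Revert to the default M2-F2E mode
no system interop-mode m2-m3 module 4
```

Changing the interop mode affects the whole module, so plan the change for a maintenance window if the M2 card is carrying production traffic.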
To support both M and F2E series modules in the same VDC, F2E series modules operate in proxy mode. In this mode, all Layer 3 traffic is sent to the M series line card in the same VDC.
Table 3-4 reinforces which module type mix is supported on Ethernet VDCs.
Module | M1 | F1 | M1XL | M2 | M3 | F2 | F2e | F3
M1 | Yes | Yes | Yes | Yes | No | No | Yes | No
F1 | Yes | Yes | Yes | Yes | No | No | No | No
M1XL | Yes | Yes | Yes | Yes | No | No | Yes | No
M2 | Yes | Yes | Yes | Yes | Yes | No | Yes | Yes
M3 | No | No | No | Yes | Yes | No | No | Yes
F2 | No | No | No | No | No | Yes | Yes | Yes
F2e | Yes | No | Yes | Yes | No | Yes | Yes | Yes
F3 | No | No | No | Yes | Yes | Yes | Yes | Yes
Note
For more details on supported module combinations and the behavior of modules running in different modes, refer to the CCO documentation listed in the “References” section, at the end of the chapter.
Nexus is a distributed architecture platform, so it runs features that are both platform independent (PI) and platform dependent (PD). In troubleshooting PI features such as the routing protocol control plane, knowing the feature helps in easily isolating the problem; for features in which PD troubleshooting is required, however, understanding the NX-OS system components helps.
Troubleshooting PD issues requires having knowledge about not only various system components but also dependent services or components. For instance, Route Policy Manager (RPM) is a process that is dependent on the Address Resolution Protocol (ARP) and Netstack processes (see Example 3-29). These processes are further dependent on other processes. The hierarchy of dependency is viewed using the command show system internal sysmgr service dependency srvname name.
Of course, knowledge of all components is not possible, but problem isolation becomes easier with knowledge of some primary system components that perform major tasks in the NX-OS platforms. This section focuses on some of these primary components:
Message and Transaction Services (MTS)
Netstack and Packet Manager
ARP and AdjMgr
Forwarding components
Unicast Routing Information Base (URIB), Unicast Forwarding Information Base (UFIB), and Unicast Forwarding Distribution Manager (UFDM)
EthPM and Port-Client
Message and Transaction Service (MTS) is the fundamental communication paradigm that supervisor and line cards use to communicate between processes. In other words, it is an interprocess communications (IPC) broker that handles message routing and queuing between services and hardware within the system. On the other hand, internode communication (for instance, communication between process A on a supervisor and process B on a line card) is handled by Asynchronous Inter-Process Communication (AIPC). AIPC provides features such as reliable transport across Ethernet Out of Band Channel (EOBC), fragmentation, and reassembly of packets.
MTS provides features such as the following:
Messaging and HA infrastructure
High performance and low latency (provides low latency for exchanging messages between interprocess communications)
Buffer management (manages the buffer for respective processes that are queued up to be delivered to other processes)
Message delivery
MTS supports independent process restarts so that a restart does not impact other client or nonclient processes running on the system, and it ensures that messages from other processes are received after a restart.
A physical switch can be partitioned to multiple VDCs for resource partitioning, fault isolation, and administration. One of the main features of the NX-OS infrastructure is to make virtualization transparent to the applications. MTS provides this virtualization transparency using the virtual node (vnode) concept and an architecturally clean communication model. With this concept, an application thinks that it is running on a switch, with no VDC.
MTS works by allocating a predefined chunk of system memory when the system boots up. This memory exists in the kernel address space. When applications start up, the memory is automatically mapped into the application address space. When an application sends data to a queue, MTS makes one copy of the data, copies the payload into a buffer, and posts a reference to that buffer into the receiving application's receive queue. When the receiving application reads its queue, it gets a reference to the payload, which it reads directly because the buffer is already mapped in its address space.
Consider a simple example. OSPF learns a new route from an LSA update from its adjacent neighbor. The OSPF process requires that the route be installed in the routing table. The OSPF process puts the needed information (prefix, next hop, and so on) into an MTS message, which it then sends to URIB. In this example, MTS is taking care of exchanging the information between the OSPF and the URIB components.
MTS facilitates the interprocess communication using Service Access Points (SAP) to allow services to exchange messages. Each card in the switch has at least one instance of MTS running, also known as the MTS domain. The node address is used to identify which MTS domain is involved in processing a message. The MTS domain is kind of a logical node that provides services only to the processes inside that domain. Inside the MTS domain, a SAP represents the address used to reach a service. A process needs to bind to a SAP before it communicates with another SAP. SAPs are divided into three categories:
Static SAPs: Ranges from 1 to 1023
Dynamic SAPs: Ranges from 1024 to 65535
Registry SAP: 0 (reserved)
Note
A client is required to know the server’s SAP (usually a static SAP) to communicate with the server.
An MTS address is divided into two parts: a 4-byte node address and a 2-byte SAP number. Because an MTS domain provides services to the processes associated with that domain, the node address in the MTS address is used to decide the destination MTS domain. Thus, the SAP number resides in the MTS domain identified by the node address. If the Nexus switch has multiple VDCs, each VDC has its own MTS domain; this is reflected as SUP for VDC1, SUP-1 for VDC2, SUP-2 for VDC3, and so on.
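The 6-byte MTS address layout (a 4-byte node address followed by a 2-byte SAP number) can be illustrated with a short Python sketch. The node address and SAP values here are purely hypothetical and serve only to show the packing:

```python
import struct

def pack_mts_address(node_addr: int, sap: int) -> bytes:
    """Pack a 4-byte node address and a 2-byte SAP number into a 6-byte address."""
    return struct.pack("!IH", node_addr, sap)

def unpack_mts_address(addr: bytes):
    """Split a 6-byte address back into (node_addr, sap)."""
    node_addr, sap = struct.unpack("!IH", addr)
    return node_addr, sap

# Hypothetical values: node 0x0101, static SAP 27
addr = pack_mts_address(0x0101, 27)
assert len(addr) == 6                         # 4 bytes node + 2 bytes SAP
assert unpack_mts_address(addr) == (0x0101, 27)
```

The node address selects the destination MTS domain (for instance, a supervisor or a line card), and the SAP then selects the service within that domain.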
MTS also uses various operational codes (opcodes) to identify different kinds of payloads in an MTS message:
sync: Used to synchronize information to the standby.
notification: Used for one-way notification.
request_response: The message carries a token to match a request with its response.
switchover_send: Can be sent during switchover.
switchover_recv: Can be received during switchover.
seqno: The message carries a sequence number.
Various symptoms can indicate problems with MTS, and different symptoms point to different problems. If a feature or process is not performing as expected, high CPU usage is noticed on the Nexus switch, or ports are bouncing on the switch for no apparent reason, MTS messages might be stuck in a queue. The easiest way to verify this is to check the MTS buffer utilization using the command show system internal mts buffer summary. This output needs to be captured several times to see which queues are not clearing. Example 3-30 demonstrates how the MTS buffer summary looks when the queues are not clearing. The process with SAP number 2938 seems to be stuck because messages remain in its receive queue; the process with SAP number 2592 has cleared the messages from its receive queue.
Table 3-5 gives the queue names and their functions.
Abbreviation | Queue Name | Function
recv_q | Receive Queue | Holds incoming messages until the receiving application dequeues them.
pers_q | Persistent Queue | Messages in this queue survive a crash; MTS replays them after the crash.
npers_q | Nonpersistent Queue | Messages do not survive a crash.
log_q | Log Queue | MTS logs the message when an application sends or receives it. The application uses logging for transaction recovery upon restart and retrieves the logged messages explicitly after restart.
Messages stuck in the queue lead to various impacts on the device. For instance, if the device is running BGP, you might randomly see BGP flaps or BGP peering not even coming up, even though the BGP peers might have reachability and correct configuration. Alternatively, the user might not be able to perform a configuration change, such as adding a new neighbor configuration.
After determining that the messages are stuck in one of the queues, identify the process associated with the SAP number. The command show system internal mts sup sap sapno description obtains this information. The same information also can be viewed from the sysmgr output using the command show system internal sysmgr service all. For details about all the queued messages, use the command show system internal mts buffers detail. Example 3-31 displays the description of the SAP 2938, which shows the statsclient process. The statsclient process is used to collect statistics on supervisor or line card modules. The second section of the output displays all the messages present in the queue.
Note
The SAP description information in Example 3-31 is taken from the default VDC. For information on a nondefault VDC, use the command show system internal mts node sup-[vnode-id] sap sapno description.
The first and most important field to check in the previous output is the SAP number and its age. If the duration of the message stuck in the queue is fairly long, those messages need to be investigated; they might be causing services to misbehave on the Nexus platform. The other field to look at is OPC, which refers to the operational code. After the messages in the queue are verified from the buffers detail output, use the command show system internal sup opcodes to determine the operational code associated with the message, to understand the state of the process.
SAP statistics are also viewed to verify different queue limits of various SAPs and to check the maximum queue limit that a process has reached. This is done using the command show system internal mts sup sap sapno stats (see Example 3-32).
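Pulled together, the MTS investigation described in this section forms a short command sequence. SAP 2938 is carried over from the earlier example and is, of course, specific to that scenario:

```
show system internal mts buffer summary            ! repeat several times; note queues that never drain
show system internal mts sup sap 2938 description  ! map the stuck SAP number to a process
show system internal mts buffers detail            ! check the age and OPC of the queued messages
show system internal sup opcodes                   ! decode the OPC values seen above
show system internal mts sup sap 2938 stats        ! verify queue limits and high-water marks
```

The age field in the detail output is usually the quickest indicator: long-lived messages in a receive queue point to the owning process, not MTS itself, as the component to investigate.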
Along with these verification checks, MTS error messages are seen in OBFL logs or syslogs. When the MTS queue is full, the error logs in Example 3-33 appear. Use the command show logging onboard internal kernel to ensure that no error logs are reported as a result of MTS.
The MTS errors are also reported in the MTS event history logs and can be viewed using the command show system internal mts event-history errors.
If the MTS queue is stuck or an MTS buffer leak is observed, performing a supervisor switchover clears the MTS queues and helps recover from service outages caused by a stuck MTS queue.
Note
If SAP number 284 appears in the MTS buffer queue, ignore it: It belongs to the TCPUDP process client and is thus expected.
Netstack is the NX-OS implementation of the user-mode Transmission Control Protocol (TCP)/Internet Protocol (IP) stack, which runs only on the supervisor module. The Netstack components are implemented in user space processes. Each Netstack component runs as a separate process with multiple threads. In-band packets and features specific to NX-OS, such as vPC- and VDC-aware capabilities, must be processed in software. Netstack is the NX-OS component in charge of processing software-switched packets. As stated earlier, the Netstack process has three main roles:
Pass in-band packets to the correct control plane process application
Forward in-band punted packets through software in the desired manner
Maintain in-band network stack configuration data
Netstack is made up of both Kernel Loadable Module (KLM) and user space components. The user space components are VDC local processes containing Packet Manager, which is the Layer 2 processing component; IP Input, the Layer 3 processing component; and TCP/UDP functions, which handle the Layer 4 packets. The Packet Manager (PktMgr) component is mostly isolated from IP Input and TCP/UDP, even though they share the same process space. Figure 3-1 displays the Netstack architecture and the components that are part of the KLM and user space.
Troubleshooting issues with Netstack is easiest after first understanding how Netstack performs packet processing. Packets are hardware switched to the supervisor in-band interface, and the packet KLM processes the frame. The KLM performs minimal processing: It parses the data bus (DBUS) header and performs a source interface index lookup to identify which VDC the packet belongs to. Because the KLM performs only minimal processing of the packet, exposure to kernel-level crashes is limited and no privilege escalation occurs. Most of the packet processing happens in user space, allowing multiple instances of the Netstack process (one per VDC) and restartability in case of a process crash.
Netstack uses multiple software queues to support prioritization of critical functions. In these queues, Bridge Protocol Data Units (BPDU) are treated under a dedicated queue, whereas all other inband traffic is separated into Hi or Low queues in the kernel driver. To view the KLM statistics and see how many packets have been processed by different queues, use the command show system inband queuing statistics (see Example 3-34). Notice that the KLM maps the Address Resolution Protocol (ARP) and BPDU packets separately. If any drops in the BPDU queue or any other queue take place, those drop counters are identified in the Inband Queues section of the output.
The PktMgr is the lower-level component within the Netstack architecture that handles processing of all in-band or management frames received from and sent to the KLM. The PktMgr demultiplexes received packets based on the Layer 2 (L2) and platform header information and passes them to the L2 clients. It also dequeues packets from L2 clients and sends them out through the appropriate driver. All the L2 or non-IP protocols, such as Spanning Tree Protocol (STP), Cisco Discovery Protocol (CDP), Unidirectional Link Detection (UDLD), Cisco Fabric Services (CFS), Link Aggregation Control Protocol (LACP), and ARP, register directly with PktMgr. IP protocols register directly with the IP Input process.
The Netstack process runs on the supervisor, so the following packets are sent to the supervisor for processing:
L2 clients – BPDU addresses: STP, CDP, and so on
EIGRP, OSPF, ICMP, PIM, HSRP, and GLBP protocol packets
Gateway MAC address
Exception packets
Glean adjacency
Supervisor-terminated packets
IPv4/IPv6 packets with IP options
Same interface (IF) check
Reverse Path Forwarding (RPF) check failures
Time to live (TTL) expired packets
The Netstack process is stateful across restarts and switchovers. The Netstack process depends on Unicast Routing Information Base (URIB), IPv6 Unicast Routing Information Base (U6RIB), and the Adjacency Manager (ADJMGR) process for bootup. Netstack uses a CLI server process to restore the configuration and uses persistent storage services (PSS) to restore the state of processes that were restarted. It uses RIB shared memory for performing L3 lookup; it uses an AM shared database (SDB) to perform the L3-to-L2 lookup. For troubleshooting purpose, Netstack provides various internal show commands and debugs that can help determine problems with different processes bound with Netstack:
Packet Manager
IP/IPv6
TCP/UDP
ARP
Adjacency Manager (AM)
To understand the workings of the Packet Manager component, consider an example with ICMPv6. ICMPv6 is a client of PktMgr. When the ICMPv6 process first initializes, it registers with PktMgr and is assigned a client ID, a control (Ctrl) SAP ID, and a data SAP ID. MTS handles communication between PktMgr and ICMPv6. The Rx traffic from PktMgr toward ICMPv6 is handed off to MTS with the destination of the data SAP ID. The Tx traffic from ICMPv6 toward PktMgr is sent to the Ctrl SAP ID. PktMgr receives the frame from ICMPv6, builds the correct header, and sends it to the KLM for transport to the hardware.
To troubleshoot any of the PktMgr clients, figure out the processes that are clients of PktMgr component. This is done by issuing the command show system internal pktmgr client. This command returns the UUIDs and the Ctrl SAP ID for the PktMgr clients. The next step is to view the processes under the Service Manager, to get the information on the respective Universally Unique Identifier (UUID) and SAP ID. Example 3-35 illustrates these steps. When the correct process is identified, use the command show system internal pktmgr client uuid to verify the statistics for the PktMgr client, including drops.
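As a quick reference, the PktMgr client troubleshooting flow described above can be summarized as follows; the UUID placeholder and interface number are illustrative:

```
show system internal pktmgr client                 ! list client UUIDs and Ctrl SAP IDs
show system internal sysmgr service all            ! match the UUID/SAP to a process name
show system internal pktmgr client <uuid>          ! per-client statistics, including drops
show system internal pktmgr interface ethernet1/1  ! per-interface punt statistics
show system internal pktmgr stats brief            ! driver-level accounting toward the KLM
```

Drops at the client level point to the process itself; drops in the driver statistics point to lower-level encapsulation or kernel interaction issues.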
If the packets being sent to the supervisor are from a particular interface, verify the PktMgr statistics for the interface using the command show system internal pktmgr interface interface-id (see Example 3-36). This example explicitly shows how many unicast, multicast, and broadcast packets were sent and received.
PktMgr accounting (statistics) is useful in determining whether any low-level drops are occurring because of bad encapsulation or other kernel interaction issues. This is verified using the command show system internal pktmgr stats [brief] (see Example 3-37). This command shows the PktMgr driver interface to the KLM. The omitted part of the output also shows details about other errors and the management driver.
For IP processing, Netstack queries the URIB—that is, the routing table and all other necessary components, such as the Route Policy Manager (RPM)—to make a forwarding decision for the packet. Netstack performs all the accounting in the show ip traffic command output. The IP traffic statistics are used to track fragmentation, Internet Control Message Protocol (ICMP), TTL, and other exception packets. This command also displays the RFC 4293 traffic statistics. An easy way to figure out whether the IP packets are hitting the NX-OS Netstack component is to observe the statistics for exception punted traffic, such as fragmentation. Example 3-38 illustrates the different sections of the show ip traffic command output.
The TCPUDP process has the following functionalities:
TCP
UDP
Raw packet handling
Socket layer and socket library
The TCP/UDP stack is based on BSD and supports a standards-compliant implementation of TCP and UDP. It supports features such as window scaling, slow start, and delayed acknowledgment. It does not support TCP selective ACK and header compression. The socket library is Portable Operating System Interface (POSIX) compliant and supports all standard socket system calls, as well as the file system-based system calls. The Internet Protocol control block (INPCB) hash table stores the socket connection data. The sockets are preserved upon Netstack restart but not upon supervisor switchover. The process has 16 TCP/UDP worker threads to provide all the functionality.
Consider now how TCP socket creation happens on NX-OS. When it receives the TCP SYN packet, Netstack builds a stub INPCB entry into the hash table. The partial information is then populated into the protocol control block (PCB). When the TCP three-way handshake is completed, all TCP socket information is populated to create a full socket. This process is verified by viewing the output of the debug command debug sockets tcp pcb. Example 3-39 illustrates the socket creation and Netstack interaction with the help of the debug command. From the debug output, notice that when the SYN packet is received, it gets added into the cache; when the three-way handshake completes, a full-blown socket is created.
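The Netstack-specific internals (the stub INPCB entry and its promotion to a full socket) cannot be reproduced off-box, but the general lifecycle they track — a listener, a SYN triggered by connect(), and a fully established socket returned by accept() only after the three-way handshake — can be sketched in standard Python:

```python
import socket

# A listening socket: analogous to the stub PCB awaiting a handshake
server = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
server.bind(("127.0.0.1", 0))      # port 0 = pick an ephemeral port
server.listen(1)
port = server.getsockname()[1]

# connect() triggers the SYN / SYN-ACK / ACK exchange
client = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
client.connect(("127.0.0.1", port))

# accept() returns only once the three-way handshake has completed:
# the stack now holds a fully populated socket entry for the connection
conn, peer = server.accept()
assert peer[0] == "127.0.0.1"

conn.close(); client.close(); server.close()
```

The parallel is loose but useful: where this sketch relies on the kernel's TCP state machine, NX-OS performs the same transitions in the user-space Netstack process, which is what the debug sockets tcp pcb output in Example 3-39 traces.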
Necessary details of the TCP socket connection are verified using the command show sockets connection tcp [detail]. The output with the detail option provides information such as TCP windowing information, the MSS value for the session, and the socket state. The output also provides the MTS SAP ID. If the TCP socket is having a problem, look up the MTS SAP ID in the buffer to see whether it is stuck in a queue. Example 3-40 displays the socket connection details for BGP peering between two routers.
Netstack socket clients are monitored with the command show sockets client detail. This command explains the socket client behavior and shows how many socket library calls the client has made. This command is useful in identifying issues a particular socket client is facing because it also displays the Errors section, where errors are reported for a problematic client. As Example 3-41 illustrates, the output displays two clients, syslogd and bgp. The output shows the associated SAP ID with the client and statistics on how many socket calls the process has made. The Errors section is empty because no errors are seen for the displayed sockets.
Netstack also has an accounting capability that gives statistics on UDP, TCP, raw sockets, and internal tables. The Netstack socket statistics are viewed using the command show sockets statistics all. This command helps view TCP drops, out-of-order packets, or duplicate packets; the statistics are maintained on a per-Netstack instance basis. At the end of the output, statistics and error counters are also viewed for the INPCB and IN6PCB tables. The table statistics provide insight into how many socket connections are being created and deleted in Netstack. The Errors part of the INPCB or IN6PCB table indicates a problem while allocating socket information. Example 3-42 displays the Netstack socket accounting statistics.
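The socket-level checks described above can be combined into a short verification sequence. The annotations paraphrase what to look for; actual output formats vary by release and are not reproduced here.

```
switch# show sockets connection tcp detail
! Check the socket state, the send/receive window sizes, the
! negotiated MSS, and note the MTS SAP ID for the session

switch# show sockets client detail
! Check the Errors section for the suspect client (for example, bgp)

switch# show sockets statistics all
! Check TCP drops, out-of-order and duplicate packets, and the
! INPCB/IN6PCB table error counters
```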
Multiple clients (ARP, STP, BGP, EIGRP, OSPF, and so on) interact with the Netstack component. Thus, while troubleshooting control plane issues, if you are able to see the packet in Ethanalyzer but the packet is not received by the client component itself, the issue might be related to the Netstack or the Packet Manager (Pktmgr). Figure 3-2 illustrates the control plane packet flow and placement of the Netstack and Pktmgr components in the system.
Note
If an issue arises with any Netstack component or Netstack component clients, such as OSPF or TCP failure, collect output from the commands show tech-support netstack and show tech-support pktmgr, along with the relevant client show tech-support outputs, to aid in further investigation by the Cisco TAC.
The ARP component handles ARP functionality for the Nexus switch interfaces. The ARP component registers with PktMgr as a Layer 2 component and provides a few other functionalities:
Manages Layer 3–to–Layer 2 adjacency learning and timers
Manages static ARP entries
Punts the glean adjacency packets to the CPU, which then triggers ARP resolution
Adds ARP entries into the Adjacency Manager (AM) database
Manages virtual addresses registered by first-hop redundancy protocols (FHRP), such as Virtual Router Redundancy Protocol (VRRP), Hot Standby Router Protocol (HSRP), and Gateway Load-Balancing Protocol (GLBP)
Has clients listening for ARP packets such as ARP snooping, HSRP, VRRP, and GLBP
All the messaging and communication with the ARP component happens with the help of MTS. ARP packets are sent to PktMgr via MTS. The ARP component does not support the Reverse ARP (RARP) feature, but it does support features such as proxy ARP, local proxy ARP, and sticky ARP.
Note
If the router receives packets destined to another host in the same subnet and local proxy ARP is enabled on the interface, the router does not send the ICMP redirect messages. Local proxy ARP is disabled by default.
If the Sticky ARP option is set on an interface, any new ARP entries that are learned are marked so that they are not overwritten by a new adjacency (for example, gratuitous ARP). These entries also do not get aged out. This feature helps prevent a malicious user from spoofing an ARP entry.
Glean adjacencies can cause packet loss and also cause excessive packets to get punted to the CPU. Understanding the treatment of packets when a glean adjacency is seen is vital. Assume that a switch receives IP packets whose destination or next hop is on a connected network. If no ARP entry exists, and thus no host route (/32 route) is installed in the FIB or in the AM shared database, the FIB lookup points to the glean adjacency. The glean adjacency packets are rate-limited. If no network match is found in the FIB, packets are silently dropped in hardware (known as a FIB miss).
To protect the CPU from high bandwidth flows with no ARP entries or adjacencies programmed in hardware, NX-OS provides rate-limiters for glean adjacency traffic on Nexus 7000 and 9000 platforms. The configuration for the preset hardware rate-limiters for glean adjacency traffic is viewed using the command show run all | include glean. Example 3-43 displays the hardware rate-limiters for glean traffic.
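As a quick check, the glean rate-limiter configuration can be inspected as follows. The rate value shown is only an illustrative placeholder; the exact default differs by platform and release.

```
switch# show run all | include glean
hardware rate-limiter layer-3 glean 100
! The layer-3 glean rate-limiter caps how many glean-adjacency
! packets per second are punted to the supervisor CPU
```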
The control plane installs a temporary adjacency drop entry in hardware while ARP is being resolved. All subsequent packets are dropped in hardware until ARP is resolved. The temporary adjacency remains until the glean timer expires. When the timer expires, the normal process of punt/drop starts again.
The ARP entries on NX-OS are viewed using the command show ip arp [interface-type interface-num]. The command output shows not only the learned ARP entries but also the glean entries, which are marked as incomplete. Example 3-44 displays the ARP table for the VLAN 10 SVI with both a learned ARP entry and an INCOMPLETE entry.
When an incomplete ARP is seen, the internal trace history is used to determine whether the problem is with the ARP component or something else. When an ARP entry is populated, two operations (Create and Update) occur to populate the information in the FIB. If a problem arises with the ARP component, you might only see the Create operation, not the Update operation. To view the sequence of operations, use the command show forwarding internal trace v4-adj-history [module slot] (see Example 3-45). This example shows that for the next hop of 10.1.12.2, only a Create operation is happening after the Destroy operation (drop adjacency); no Update operation occurs after that, causing the ARP entry to be marked as glean.
To view the forwarding adjacency, use the command show forwarding ipv4 adjacency interface-type interface-num [module slot]. If the adjacency for a particular next hop appears as unresolved, there is no adjacency; FIB then matches the network glean adjacency and performs a punt operation. Example 3-46 illustrates the output of the show forwarding ipv4 adjacency command with an unresolved adjacency entry.
The ARP component also provides an event history to be used to further understand whether any errors could lead to problems with ARP and adjacency. To view the ARP event history, use the command show ip arp internal event-history [events | errors]. Example 3-47 displays the output of the command show ip arp internal event-history events, displaying the ARP resolution for the host 10.1.12.2/24. In the event history, notice that the switch sends out an ARP request; based on the reply, the adjacency is built and further updated into the AM database.
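Putting the preceding checks together, a hypothetical troubleshooting sequence for an INCOMPLETE ARP entry might look like the following; VLAN 10, next hop 10.1.12.2, and module 3 are placeholder values.

```
switch# show ip arp vlan 10
! Identify entries marked INCOMPLETE

switch# show forwarding internal trace v4-adj-history module 3
! For the next hop, expect a Create followed by an Update operation;
! a Create with no subsequent Update points at the ARP component

switch# show forwarding ipv4 adjacency vlan 10 module 3
! An unresolved entry here means FIB falls back to the glean
! adjacency and punts packets to the CPU

switch# show ip arp internal event-history events
! Confirm the ARP request was sent and the reply updated the
! AM database
```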
Note
The ARP packets are also captured using Ethanalyzer in both ingress and egress directions.
The ARP component is closely coupled with the Adjacency Manager (AM) component. The AM takes care of programming the /32 host routes in the hardware. AM provides the following functionalities:
Exports Layer 3 to Layer 2 adjacencies through shared memory
Generates adjacency change notification, including interface deletion notification, and sends updates via MTS
Adds host routes (/32 routes) into URIB/U6RIB for learned adjacencies
Performs IP/IPv6 lookups in the AM database while forwarding packets out of the interface
Handles adjacencies restart by maintaining the adjacency SDB for restoration of the AM state
Provides a single interface for URIB/UFDM to learn routes from multiple sources
When an ARP is learned, the ARP entry is added to the AM SDB. AM then communicates directly with URIB and UFDM to install a /32 adjacency in hardware. The AM database can be queried for the state of active ARP entries. The ARP table is not persistent across a process restart, so ARP must requery the AM SDB. AM registers various clients that can install adjacencies. To view the registered clients, use the command show system internal adjmgr client (see Example 3-48). One of the most common clients of AM is ARP.
Any unresolved adjacency is verified using the command show ip adjacency ip-address detail. If the adjacency is resolved, the output populates the correct MAC address for the specified IP; otherwise, it has 0000.0000.0000 in the MAC address field. Example 3-49 displays the difference between the resolved and unresolved adjacencies.
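The AM state can likewise be checked from the CLI. The address below is a placeholder.

```
switch# show system internal adjmgr client
! Verify that ARP is registered as an AM client

switch# show ip adjacency 10.1.12.2 detail
! A MAC address of 0000.0000.0000 indicates an unresolved adjacency

switch# show system internal adjmgr internal event-history events
! Trace the Add adjacency request and the RIB buffer update
```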
The AM adjacency installation into URIB follows these steps:
Step 1. The AM queues an Add adjacency request.
Step 2. The AM calls URIB to install the route.
Step 3. The AM appends new adjacency to the Add list.
Step 4. URIB adds the route.
Step 5. The AM independently calls the UFDM API to install the adjacency in the hardware.
The series of events within the AM component is viewed using the command show system internal adjmgr internal event-history events. Example 3-50 displays the output of this command, to illustrate the series of events that occur during installation of the adjacency for host 10.1.12.2. Notice that the prefix 10.1.12.2 is being added to the RIB buffer for the IPv4 address family.
Note
If an issue arises with any ARP or AM component, capture the show tech arp and show tech adjmgr outputs during the problematic state.
The IP/IPv6 packet-forwarding decisions on a device are made by the Routing Information Base (RIB) and the Forwarding Information Base (FIB). In NX-OS, the RIB is managed by the Unicast Routing Information Base (URIB), and the FIB is managed by the IP Forwarding Information Base (IPFIB) component. URIB is the software perspective of the routing information on the supervisor, whereas the IPFIB is the software perspective of the routing information on the line card. This section discusses these components that manage the forwarding on NX-OS platforms.
The URIB component in NX-OS is responsible for maintaining the SDB for all Layer 3 unicast routes installed by all the routing protocols. The URIB is a VDC local process—that is, routes cannot be shared across multiple VDCs unless a routing adjacency exists between them. The URIB process serves several clients, which are also viewed using the command show routing clients (see Example 3-51):
Routing protocols—Enhanced Interior Gateway Routing Protocol (EIGRP), Open Shortest Path First (OSPF), Border Gateway Protocol (BGP), and so on
Netstack (updates URIB for static routes)
AM
RPM
Each routing protocol has its own region of shared URIB memory space. When a routing protocol learns routes from its neighbor, it installs those learned routes in its own region of shared URIB memory space. URIB then copies updated routes to its own protected region of shared memory, which is read-only to Netstack and the other components. The routing decisions are made from the entry present in URIB shared memory. It is vital to note that URIB itself does not perform any of the add, modify, or delete operations in the routing table. URIB clients (the routing protocols and Netstack) handle all updates, except when a URIB client process crashes. In such a case, URIB might then delete the abandoned routes.
OSPF CLI provides users with the command show ip ospf internal txlist urib to view the OSPF routes sent to URIB. For all other routing protocols, the information is viewed using event history commands. Example 3-52 displays the output, showing the source SAP ID of OSPF process and the destination SAP ID for MTS messages.
The routes being updated from an OSPF process or any other routing process to URIB are recorded in the event history logs. To view the updates copied by OSPF from OSPF process memory to URIB shared memory, use the command show ip ospf internal event-history rib. Use the command show routing internal event-history msgs to examine URIB updating the globally readable shared memory. Example 3-53 shows the learned OSPF routes being processed and updated to URIB and also the routing event history showing the routes being updated to shared memory.
After the routes are installed in the URIB, they can be viewed using the command show ip route routing-process detail, where routing-process is the NX-OS process for the respective routing protocols, as in Example 3-53 (ospf-100).
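For a route learned through OSPF process 100, the URIB checks described above can be run in sequence; the process tag is a placeholder for the configured routing process.

```
switch# show routing clients
! Confirm the protocol is registered as a URIB client

switch# show ip ospf internal txlist urib
! Verify the routes OSPF has sent to URIB

switch# show ip ospf internal event-history rib
switch# show routing internal event-history msgs
! Correlate OSPF copying routes into its shared memory region
! with URIB updating the globally readable shared memory

switch# show ip route ospf-100 detail
! Confirm the route is installed in URIB
```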
Note
URIB stores all routing information in shared memory. Because the memory space is shared, it can be exhausted by large-scale routing issues or memory leak issues. Use the command show routing memory statistics to view the shared URIB memory space.
After the URIB has been updated with the routes, the FIB must be updated. This is where UFDM comes into the picture. UFDM, a VDC local process, primarily takes care of reliably distributing the routes, adjacency information, and unicast reverse path forwarding (uRPF) information to all the line cards in the Nexus chassis, where the FIB is programmed. UFDM maintains prefix, adjacency, and equal cost multipath (ECMP) databases, which are then used for making forwarding decisions in the hardware. UFDM runs on the supervisor module and communicates with the IPFIB on each line card. The IPFIB process programs the forwarding engine (FE) and hardware adjacency on each line card.
The UFDM has four sets of APIs performing various tasks in the system:
FIB API: URIB and U6RIB modules use this to add, update, and delete routes in the FIB.
AdjMgr notification: The AM interacts directly with the UFDM AM API to install /32 host routes.
uRPF notification: The IP module sends a notification to enable or disable different RPF check modes per interface.
Statistics collection API: This is used to collect adjacency statistics from the platform.
In this list of tasks, the first three functions happen in a top-down manner (from supervisor to line card); the fourth function happens in a bottom-up direction (from line card to supervisor).
Note
NX-OS no longer has Cisco Express Forwarding (CEF). It now relies on hardware FIB, which is based on AVL Trees, a self-balancing binary search tree.
The UFDM component distributes AM, FIB, and RPF updates to IPFIB on each line card in the VDC and then sends an acknowledgment route-ack to URIB. This is verified using the command show system internal ufdm event-history debugs (see Example 3-54).
The platform-dependent FIB manages the hardware-specific structures, such as hardware table indexes and device instances. The NX-OS command show forwarding internal trace v4-pfx-history displays the create and destroy history for FIB route data. Example 3-55 displays the forwarding IPv4 prefix history for prefix 2.2.2.2/32, which is learned through OSPF. The history displays the Create, Destroy, and then another Create operation for the prefix, along with the time stamp, which is useful while troubleshooting forwarding issues that arise from a route not being installed in the hardware FIB.
After the hardware FIB has been programmed, the forwarding information is verified using the command show forwarding route ip-address/len [detail]. The command output displays the information of the next hop to reach the destination prefix and the outgoing interface, as well as the destination MAC information. This information is also verified at the platform level to get more details on it from the hardware/platform perspective using the command show forwarding ipv4 route ip-address/len platform [module slot].
Then the information must be propagated to the relevant line card. This is verified using the command show system internal forwarding route ip-address/len [detail]. This command output also provides interface hardware adjacency information; this is further verified using the command show system internal forwarding adjacency entry adj, where the adj value is the adjacency value received from the previous command.
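A top-down verification might proceed as follows; the prefix, module number, and adjacency index are placeholders.

```
switch# show forwarding route 2.2.2.2/32 detail
! Next hop, outgoing interface, and destination MAC for the prefix

switch# show forwarding ipv4 route 2.2.2.2/32 platform module 3
! The same route from the hardware/platform perspective

switch# show system internal forwarding route 2.2.2.2/32 detail
! Note the hardware adjacency index reported here

switch# show system internal forwarding adjacency entry 0x43
! Inspect the hardware adjacency using the index from the
! previous command
```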
Note
Note that the previous outputs can be collected on the supervisor card as well as at the line card level by logging into the line card console using the command attach module slot and then executing the forwarding commands as already described.
Example 3-56 displays step-by-step verification of the route programmed in the FIB and on the line card level.
Note
In case of any forwarding issues, collect the following show tech outputs during the problematic state:
show tech routing ip unicast
show tech-support forwarding l3 unicast [module slot]
show tech-support detail
NX-OS provides a VDC local process named Ethernet Port Manager (EthPM) to manage all the Ethernet interfaces on the Nexus platforms, including physical as well as logical interfaces (only server interfaces, not SVIs), in-band interfaces, and management interfaces. The EthPM component performs two primary functions:
Abstraction: Provides an abstraction layer for other components that want to interact with the interfaces that EthPM manages
Port Finite State Machine (FSM): Provides an FSM for interfaces that it manages, as well as handling interface creation and removal
The EthPM component interacts with other components, such as the Port-Channel Manager, VxLAN Manager, and STP, to program interface states. The EthPM process is also responsible for managing interface configuration (duplex, speed, MTU, allowed VLANs, and so on).
Port-Client is a line card global process (specific to Nexus 7000 and Nexus 9000 switches) that closely interacts with the EthPM process. It maintains global information received from EthPM across different VDCs. It receives updates from the local hardware port ASIC and updates the EthPM. It has both platform-independent (PI) and platform-dependent (PD) components. The PI component of the Port-Client process interacts with EthPM, which is also a PI component, and the PD component is used for line card-specific hardware programming.
The EthPM component CLI enables you to view platform-level information, such as the EthPM interface index, which it receives from the Interface Manager (IM) component; interface admin state and operational state; interface capabilities; interface VLAN state; and more. All this information is viewed using the command show system internal ethpm info interface interface-type interface-num. Example 3-57 displays the EthPM information for the interface Ethernet 3/1, which is configured as an access port for VLAN 10.
The port-client command show system internal port-client link-event tracks interface link events from the software perspective on the line card. This command is a line card-level command that requires you to get into the line card console. Example 3-58 displays the port-client link events for ports on module 3. In this output, the events at different time stamps are seen for various links going down and coming back up.
For these link events, relevant messages are seen in the port-client event history logs for the specified port using the line card-level command show system internal port-client event-history port port-num.
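A hypothetical sequence for chasing a port that does not come up might look like this; module 3 and port 1 are placeholders.

```
switch# show system internal ethpm info interface ethernet 3/1
! Check the admin state, operational state, and VLAN state that
! EthPM holds for the port

switch# attach module 3
module-3# show system internal port-client link-event
! Review the time-stamped link up/down events seen on the line card

module-3# show system internal port-client event-history port 1
! Drill into the event history for the specific port
```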
Note
If issues arise with ports not coming up on the Nexus chassis, collect the output of the command show tech ethpm during the problematic state.
Denial of service (DoS) attacks take many forms and affect both servers and infrastructure in any network environment, especially in data centers. Attacks targeted at infrastructure devices generate IP traffic streams at very high data rates. These IP data streams contain packets that are destined for processing by the control plane of the route processor (RP). Based on the high rate of rogue packets presented to the RP, the control plane is forced to spend an inordinate amount of time processing this DoS traffic. This scenario usually results in one of the following issues:
Loss of line protocol keepalives, which cause a line to go down and lead to route flaps and major network transitions.
Excessive packet processing because packets are being punted to the CPU.
Loss of routing protocol updates, which leads to route flaps and major network transitions.
Unstable Layer 2 network.
Near 100% CPU utilization that locks up the router and prevents it from completing high-priority processing (resulting in other negative side effects).
RP at near 100% utilization, which slows the response time at the user command line (CLI) or locks out the CLI. This prevents the user from taking corrective action to respond to the attack.
Consumption of resources such as memory, buffers, and data structures, causing negative side effects.
Backup of packet queues, leading to indiscriminate drops of important packets.
Router crashes.
To overcome the challenges of DoS/DDoS attacks and excessive packet processing, NX-OS provides two-stage policing:
Rate-limiting packets in hardware on a per-module basis before sending the packets to the CPU
Policy-based traffic policing using control plane policing (CoPP) for traffic that has passed rate-limiters
The hardware rate-limiters and the CoPP policy together increase device security by protecting the CPU (route processor) from unnecessary traffic or DoS attacks and give priority to relevant traffic destined for the CPU. Note that the hardware rate-limiters are available only on the Nexus 7000 and Nexus 9000 series switches and are not available on other Nexus platforms.
Packets that hit the CPU or reach the control plane are classified into these categories:
Received packets: These packets are destined for the router (such as keepalive messages).
Multicast packets: These packets are further divided into three categories:
Directly connected sources
Multicast control packets
Copy packets: For supporting features such as ACL-log, a copy of the original packet is made and sent to the supervisor. Thus, these are called copy packets.
ACL-log copy
FIB unicast copy
Multicast copy
NetFlow copy
Exception packets: These packets need special handling. The hardware either cannot process them or detects an exception, so they are sent to the supervisor for further processing. The following exceptions fall under this category:
Same interface check
TTL expiry
MTU failure
Dynamic Host Control Protocol (DHCP) ACL redirect
ARP ACL redirect
Source MAC IP check failure
Unsupported rewrite
Stale adjacency error
Glean packets: When an L2 MAC for the destination IP or next hop is not present in the FIB, the packet is sent to the supervisor. The supervisor then takes care of generating an ARP request for the destination host or next hop.
Broadcast, non-IP packets: The following packets fall under this category:
Broadcast MAC + non-IP packet
Broadcast MAC + IP unicast
Multicast MAC + IP unicast
Remember that both the CoPP policy and the rate-limiters are applied on a per-module, per-forwarding engine (FE) basis.
Note
On the Nexus 7000 platform, CoPP policy is supported on all line cards except F1 series cards. F1 series cards exclusively use rate-limiters to protect the CPU. HWRL is supported on Nexus 7000/7700 and Nexus 9000 series platforms.
Example 3-59 displays the output of the command show hardware rate-limiters [module slot] to view the rate-limiter configuration and statistics per each line card module present in the chassis.
The Nexus 7000 series switches also enable you to view the rate-limiters for the SUP-bound traffic and their usage. The mapping of exceptions to rate-limiters differs by module type. These differences are viewed using the command show hardware internal forwarding rate-limiter usage [module slot]. Example 3-60 displays the output of this command, showing not only the different rate-limiters but also which packet streams are handled by CoPP and which by the L2 or L3 rate-limiters.
Information about specific exceptions is seen using the command show hardware internal forwarding l3 asic exceptions exception detail [module slot].
The configuration settings for both l2 and l3 ASIC rate-limiters are viewed using the command show hardware internal forwarding [l2 | l3] asic rate-limiter rl-name detail [module slot], where the rl-name variable is the name of the rate-limiter. Example 3-61 displays the output for L3 ASIC exceptions, as well as the L2 and L3 rate-limiters. The first output shows the configuration and statistics for packets that fail the RPF check. The second and third outputs show the rate-limiter and exception configuration for packets that fail the MTU check.
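The rate-limiter inspection commands can be strung together as follows; module 4 and the exception and rate-limiter names are placeholders and vary by line card.

```
switch# show hardware rate-limiters module 4
! Per-module rate-limiter configuration and drop statistics

switch# show hardware internal forwarding rate-limiter usage module 4
! Which packet streams are handled by CoPP versus the L2/L3
! rate-limiters on this module

switch# show hardware internal forwarding l3 asic exceptions mtu-fail detail module 4
! Detail for a specific L3 ASIC exception

switch# show hardware internal forwarding l3 asic rate-limiter rpf-fail detail module 4
! Configuration and statistics for a specific L3 ASIC rate-limiter
```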
CoPP in Nexus platforms is also implemented in hardware, which helps protect the supervisor from DoS attacks. It controls the rate at which packets are allowed to reach the supervisor CPU. Remember that traffic hitting the CPU on the supervisor module comes in through four paths:
In-band interfaces for traffic sent by the line cards
Management interface
Control and monitoring processor (CMP) interface, which is used for the console
Ethernet Out of Band Channel (EOBC)
Only the traffic sent through the in-band interface is subject to CoPP because this is the only traffic that reaches the supervisor module through the different forwarding engines (FE) on the line cards. CoPP policing is implemented individually on each FE.
When any Nexus platform boots up, the NX-OS installs a default CoPP policy named copp-system-policy. NX-OS also comes with different profile settings for CoPP, to provide different protection levels to the system. These CoPP profiles include the following:
Strict: Defines a BC value of 250 ms for regular classes and 1000 ms for the important class.
Moderate: Defines a BC value of 310 ms for regular classes and 1250 ms for the important class.
Lenient: Defines a BC value of 375 ms for regular classes and 1500 ms for the important class.
Dense: Recommended when the chassis has more F2 line cards than other I/O modules. Introduced in release 6.0(1).
If one of these profiles is not selected during the initial setup, NX-OS attaches the Strict profile to the control plane. You can choose not to use one of these profiles and instead create a custom policy to be used for CoPP. The NX-OS default CoPP policy categorizes traffic into various predefined classes:
Critical: Routing protocol packets with IP precedence value 6
Important: Redundancy protocols such as GLBP, VRRP, and HSRP
Management: All management traffic, such as Telnet, SSH, FTP, NTP, and RADIUS
Monitoring: Ping and traceroute traffic
Exception: ICMP unreachables and IP options
Undesirable: All unwanted traffic
Example 3-62 shows a sample strict CoPP policy when the system comes up for the first time. The CoPP configuration is viewed using the command show run copp all.
To view the differences in the different CoPP profiles, use the command show copp diff profile profile-type profile profile-type. The command displays the policy-map configuration differences of both specified profiles.
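For example, the applied CoPP configuration and a profile comparison can be pulled as follows:

```
switch# show run copp all
! Full CoPP configuration, including the attached profile

switch# show copp diff profile strict profile moderate
! Policy-map configuration differences between the two profiles
```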
Note
Starting with NX-OS Release 6.2(2), the copp-system-p-class-multicast-router, copp-system-p-class-multicast-host, and copp-system-p-class-normal classes were added for multicast traffic. Before Release 6.2(2), this was achieved through custom user configuration.
Both HWRL and CoPP are applied at the forwarding engine (FE) level, so an aggregate amount of traffic from multiple FEs can still overwhelm the CPU; both are therefore best-effort protections. Another important point to keep in mind is that the CoPP policy should not be too aggressive; it should be designed around the network design and configuration. For example, if routing protocol packets hit the CoPP policy at more than the policed rate, even legitimate sessions can be dropped and protocol flaps can be seen. If the predefined CoPP policies must be modified, create a custom CoPP policy by copying a preclassified CoPP policy and then editing the copy; none of the predefined CoPP profiles can be edited. Additionally, the CoPP policies are hidden from the show running-config output; view them with the show running-config all or show running-config copp all commands. Example 3-63 shows how to copy the CoPP policy configuration and create a custom strict policy.
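A minimal sketch of copying a profile into an editable custom policy follows; the CUSTOM prefix, class name, and policer values are placeholders, and the exact command forms should be verified against the target NX-OS release.

```
switch(config)# copp copy profile strict prefix CUSTOM
! Copies the Strict profile into editable class-maps and a
! policy-map carrying the CUSTOM prefix

switch(config)# policy-map type control-plane CUSTOM-copp-policy-strict
switch(config-pmap)# class CUSTOM-copp-class-critical
switch(config-pmap-c)# police cir 40000 kbps bc 250 ms conform transmit violate drop
! Tune the policer for the class as needed (values are placeholders)

switch(config)# control-plane
switch(config-cp)# service-policy input CUSTOM-copp-policy-strict
! Attach the custom policy to the control plane
```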
The command show policy-map interface control-plane displays the counters of the CoPP policy. For an aggregated view, use this command with the include “class|conform|violated” filter to see how many packets have been conformed and how many have been violated and dropped (see Example 3-64).
One problem with the access lists that are part of the CoPP policy is that the statistics per-entry command is not supported for IP and MAC access control lists (ACL); it has no effect when applied under the ACLs. To view the CoPP policy–referenced IP and MAC ACL counters on an input/output (I/O) module, use the command show system internal access-list input entries detail. Example 3-65 displays this output, showing the hits on the MAC ACL for the FabricPath MAC address 0180.c200.0041.
Starting with NX-OS Release 5.1, a threshold value can be configured to generate a syslog message for the drops enforced by the CoPP policy on a particular class. The syslog messages are generated when the drops within a traffic class exceed the user-configured threshold value. The threshold is configured using the logging drop threshold dropped-bytes-count [level logging-level] command. Example 3-66 demonstrates how to configure the logging threshold value to be set for 100 drops and logging at level 7. It also demonstrates the syslog message that is generated when the drop threshold is exceeded.
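The drop-threshold configuration sits under the class within the control-plane policy-map; the policy and class names below are placeholders.

```
switch(config)# policy-map type control-plane CUSTOM-copp-policy-strict
switch(config-pmap)# class CUSTOM-copp-class-critical
switch(config-pmap-c)# logging drop threshold 100 level 7
! Generates a level-7 syslog once drops in this class exceed
! 100 bytes
```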
Scale factor configuration was introduced in NX-OS starting with Version 6.0. The scale factor is used to scale the policer rate of the applied CoPP policy on a per-line card basis without changing the actual CoPP policy configuration. The scale factor configuration ranges from 0.10 to 2.0. To configure the scale factor, use the command scale-factor value [module slot] under the control-plane configuration mode. Example 3-67 illustrates how to configure the scale factor for various line cards present in the Nexus chassis. The scale factor settings are viewed using the command show system internal copp info. This command displays other information as well, including the last operation that was performed and its status, CoPP database information, and CoPP runtime status, which is useful while troubleshooting issues with CoPP policies.
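The scale factor is applied under control-plane configuration mode; module 4 and the 0.5 value are placeholders.

```
switch(config)# control-plane
switch(config-cp)# scale-factor 0.5 module 4
! Halves the policer rate of the applied CoPP policy on module 4
! without modifying the policy itself

switch# show system internal copp info
! Verify the per-module scale factors, the last CoPP operation
! and its status, and the CoPP runtime state
```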
Note
Refer to the CCO documentation for the appropriate scale factor recommendation for the appropriate Nexus 7000 chassis.
A few best practices need to be kept in mind for NX-OS CoPP policy configuration:
Use the strict CoPP profile.
Use the copp profile strict command after each NX-OS upgrade, or at least after each major NX-OS upgrade. If a CoPP policy modification was previously done, it must be reapplied after the upgrade.
The dense CoPP profile is recommended when the chassis is fully loaded with F2 series modules or has more F2 series modules than any other I/O modules.
Disabling CoPP is not recommended. Tune the default CoPP, as needed.
Monitor unintended drops, and add or modify the default CoPP policy in accordance with the expected traffic.
Because traffic patterns constantly change in a data center, customization of CoPP is a constant process.
The MTU settings on a Nexus platform work differently than on other Cisco platforms. Two kinds of MTU settings exist: Layer 2 (L2) MTU and Layer 3 (L3) MTU. The L3 MTU is manually configured under the interface using the mtu value command. On the other hand, the L2 MTU is configured either through the network QoS policy or by setting the MTU on the interface itself on the Nexus switches that support per-port MTU. The L2 MTU settings are defined under the network-qos policy type, which is then applied under the system qos policy configuration. Example 3-68 displays the sample configuration to enable jumbo L2 MTU on the Nexus platforms.
Having the jumbo L2 MTU enabled before applying a jumbo L3 MTU on the interface is recommended.
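A minimal sketch of a jumbo-MTU configuration, assuming a network QoS policy named jumbo-mtu and the common 9216-byte jumbo size (both placeholders):

```
switch(config)# policy-map type network-qos jumbo-mtu
switch(config-pmap-nqos)# class type network-qos class-default
switch(config-pmap-nqos-c)# mtu 9216
switch(config)# system qos
switch(config-sys-qos)# service-policy type network-qos jumbo-mtu
! Applies the jumbo L2 MTU system-wide through the network QoS policy

switch(config)# interface ethernet 3/1
switch(config-if)# mtu 9216
! Per-interface L3 MTU (or per-port L2 MTU, where supported)
```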
Note
Not all platforms support jumbo L2 MTU at the port level. The port-level L2 MTU configuration is supported only on the Nexus 7000, 7700, 9300, and 9500 platforms. All the other platforms (such as Nexus 3048, 3064, 3100, 3500, 5000, 5500, and 6000) support only network QoS policy-based jumbo L2 MTU settings.
The MTU settings on the Nexus 3000, 7000, 7700, and 9000 (platforms that support per-port MTU settings) can be viewed using the command show interface interface-type x/y. On the Nexus 3100, 3500, 5000, 5500, and 6000 (platforms supporting network QoS policy-based MTU settings), these are verified using the command show queuing interface interface-type x/y.
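Depending on the platform family, one of the following commands verifies the effective MTU; the interface shown is hypothetical:

```
! Platforms with per-port MTU settings (Nexus 3000, 7000, 7700, 9000)
switch# show interface ethernet 1/1
! Platforms with network QoS policy-based MTU (Nexus 3100, 3500, 5000, 5500, 6000)
switch# show queuing interface ethernet 1/1
```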
The jumbo MTU on the Nexus 2000 FEXs is configured on the parent switch. If the parent switch supports setting the MTU on a per-port basis, the MTU is configured on the FEX fabric port-channel interface. If the parent switch does not support per-port MTU settings, the configuration is done under the network QoS policy. Example 3-69 demonstrates the FEX MTU configuration both on a Nexus switch that supports per-port MTU settings and on one that supports only network QoS policy-based MTU settings.
Note
Beginning with NX-OS Version 6.2, the per-port MTU configuration on FEX ports is not supported on Nexus 7000 switches. A custom network QoS policy is required to configure these (see Example 3-69).
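Both FEX MTU approaches can be sketched as follows. The FEX number, port-channel interface, and policy name are hypothetical, and the network QoS policy syntax varies by platform:

```
! Parent switch with per-port MTU support:
! set the MTU on the FEX fabric port-channel
interface port-channel101
  switchport mode fex-fabric
  fex associate 101
  mtu 9216
!
! Parent switch requiring network QoS policy-based MTU
policy-map type network-qos jumbo-fex
  class type network-qos class-default
    mtu 9216
system qos
  service-policy type network-qos jumbo-fex
```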
MTU issues commonly arise from misconfiguration or improper network design, with the MTU not set properly on the interface or at the system level. Such misconfigurations are rectified by updating the configuration and reviewing the network design. The real challenge comes when the MTU is configured properly at the interface or system level but the software or hardware is not programmed correctly. In such cases, a few checks can confirm whether the MTU is properly programmed.
The first step for MTU troubleshooting is to verify the MTU settings on the interface using the show interface or the show queuing interface interface-type x/y commands. The devices supporting network QoS policy-based MTU settings use the command show policy-map system type network-qos to verify the MTU settings (see Example 3-70).
In NX-OS, the Ethernet Port Manager (ethpm) process manages the port-level MTU configuration. The MTU information under the ethpm process is verified using the command show system internal ethpm info interface interface-type x/y (see Example 3-71).
The MTU settings also can be verified on the Earl Lif Table Manager (ELTM) process, which maintains Ethernet state information. The ELTM process also takes care of managing the logical interfaces, such as switch virtual interfaces (SVI). To verify the MTU settings under the ELTM process on a particular interface, use the command show system internal eltm info interface interface-type x/y (see Example 3-72).
Note
If MTU issues arise across multiple devices or a software issue is noticed with the ethpm process or MTU settings, capture the show tech-support ethpm and show tech-support eltm [detail] output in a file and open a TAC case for further investigation.
This chapter focused on troubleshooting various hardware- and software-related problems on Nexus platforms. From the hardware troubleshooting perspective, this chapter covered the following topics:
GOLD tests
Line card and process crashes
Packet loss and platform errors
Interface errors and drops
Troubleshooting Fabric Extenders
This chapter detailed how VDCs work and explored how to troubleshoot VDC-related issues, including those that arise from combinations of modules within a VDC. It also demonstrated how to limit the resources assigned to a VDC and covered various NX-OS components in depth, such as Netstack, UFDM and IPFIB, EthPM, and Port-Client. Finally, the chapter addressed CoPP, how to troubleshoot drops in the CoPP policy, and how to fix MTU issues on Ethernet and FEX ports.
Cisco, Cisco Nexus 7000 Series: Configuring Online Diagnostics, http://www.cisco.com.
Cisco, Cisco Nexus Fabric Extenders, http://www.cisco.com.
Cisco, Cisco Nexus 7000 Series: Virtual Device Context Configuration Guide, http://www.cisco.com.