Chapter 5

Maintaining HyperFlex

Cisco HyperFlex Data Platform (HX Data Platform) is a hyperconverged software appliance that transforms Cisco servers into a single pool of compute and storage resources. It eliminates the need for network storage and enables seamless interoperability between computing and storage in virtual environments. Cisco HX Data Platform provides a highly fault-tolerant distributed storage system that preserves data integrity and optimizes performance for virtual machine (VM) storage workloads. In addition, native compression and deduplication reduce the storage space occupied by the VMs and the VM workloads.

Cisco HX Data Platform has many integrated components, including Cisco fabric interconnects (FIs), Cisco UCS Manager, Cisco HX-specific servers, and Cisco compute-only servers; VMware vSphere, ESXi servers, and vCenter; and the Cisco HX Data Platform installer, controller VMs, HX Connect, vSphere HX Data Platform plug-in, and stcli commands.

This chapter provides details on managing a HyperFlex system and how to perform Day 2 operations on a HyperFlex cluster. It covers the following:

  • HyperFlex licensing

  • Virtual machine management

    • Snapshots

    • ReadyClones

    • Datastores

  • Scaling HyperFlex clusters

    • Node expansion (converged node and compute node)

    • Node removal (converged node and compute node)

    • Increasing storage capacity (by adding drives)

  • Hardware (disk) replacement

    • Replacing SSDs

    • Replacing NVMe SSDs

    • Replacing housekeeping SSDs

    • Replacing or adding HDDs

  • HyperFlex software upgrades

    • Pre-upgrade tasks

    • Upgrading UCS Server, ESXi, and HX Data Platform

HyperFlex Licensing

This section describes Smart Licensing in HyperFlex. Cisco Smart Software Licensing (Smart Licensing) is a cloud-based software license management solution that automates time-consuming manual licensing tasks, such as procuring, deploying, and managing licenses across an entire organization. The software allows for easy tracking of license status and software usage trends and simplifies the three core licensing functions: purchasing, management, and reporting. It provides visibility into your license ownership and consumption so you know what you own and how you are using it.

The Smart Licensing feature integrates with Cisco HyperFlex and is automatically enabled as soon as you create an HX storage cluster. For an HX storage cluster to start reporting license consumption, you must register it with Cisco Smart Software Manager (SSM) through your Cisco Smart Account. A Smart Account is a cloud-based repository that provides full visibility and access control to Cisco software licenses and product instances across your company. Registration is valid for one year.

Smart Account registration enables HyperFlex to be identified to a Smart Account and allows license usage to be reported to Cisco Smart Software Manager or a Smart Software Manager satellite. After registration, HyperFlex reports license usage to Cisco Smart Software Manager or a Smart Software Manager satellite with the current license status.

Registering a Cluster with Smart Licensing

Smart Licensing automatically integrates with your HX storage cluster and is enabled by default. Your HX storage cluster is initially unregistered with Smart Licensing and in a 90-day EVAL MODE. Within the 90 days, you need to register your HX storage cluster to use full functionality.

Figure 5-1 shows the Smart Licensing user workflow.

Images

Figure 5-1 Smart Licensing User Workflow

Note

In order to begin using Smart Licensing, you need to have a Cisco Smart Account. You can create (or select) a Smart Account while placing an order, or you can create a Smart Account outside of placing an order and add new or existing licenses over time. To create a Smart Account, go to Cisco Software Central (https://software.cisco.com/) and click Get a Smart Account.

Creating a Registration Token

A registration token is used to register and consume a product for Smart Licensing. You must create a token to register the product and add the product instance to a specified virtual account. Follow these steps:

Step 1. Log in to the software manager at https://software.cisco.com/.

Step 2. In the License section, click Smart Software Licensing, as shown in Figure 5-2.

Images

Figure 5-2 Cisco Software Central: License Section

Step 3. Under Smart Software Licensing, click Inventory.

Step 4. From the virtual account where you want to register your HX storage cluster, click the General tab and then click New Token (see Figure 5-3). The Create Registration Token dialog box appears.

Images

Figure 5-3 Smart Software Licensing: Inventory Page

Step 5. In the Create Registration Token dialog box (see Figure 5-4), do the following:

a. Add a short description for the token.

b. Enter the number of days you want the token to be active and available to use on other products. The maximum is 365 days.

c. Check Allow export-controlled functionality on the products registered with this token.

d. Click Create Token.

Images

Figure 5-4 Creating a Registration Token

Note

In this case, I set the maximum number of uses to 1 so that the token can be used to register only a single product instance.

As shown in Figure 5-5, the new token shows up under the list of tokens with the expiration date, number of uses, and user who created it.

Images

Figure 5-5 New Token

Step 6. Select the token and copy it to the clipboard (see Figure 5-6).

Images

Figure 5-6 Copying a Token to the Clipboard

Registering a Cluster with Smart Software Licensing Through a Controller VM

This section covers an alternative method of registering a cluster with Smart Software Licensing through a controller VM. Follow these steps:

Step 1. Log in to a controller VM.

Step 2. Confirm that your HX storage cluster is in Smart Licensing mode by entering the following command:

# stcli license show status

As shown in Figure 5-7, the output should show Smart Licensing is ENABLED, Status: UNREGISTERED, and the amount of time left in the 90-day evaluation period (in days, hours, minutes, and seconds). The Smart Licensing evaluation period starts when the HX storage cluster begins using the licensing feature and is not renewable. When the evaluation period expires, the Smart Agent sends a notification.

Images

Figure 5-7 License Status Before Registration
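For reference, the unregistered state looks similar to the following abbreviated output. The exact fields and the remaining evaluation time vary by HX Data Platform release, so treat the values shown here as illustrative only:

# stcli license show status

Smart Licensing is ENABLED

Registration:
  Status: UNREGISTERED
  Export-Controlled Functionality: Not Allowed

License Authorization:
  Status: EVAL MODE
  Evaluation Period Remaining: 88 days, 1 hr, 33 min, 41 sec
  Last Communication Attempt: NONE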

Step 3. Register your HX storage cluster by using the command stcli license register --idtoken idtoken-string, where idtoken-string is the new ID token from Cisco Smart Software Manager or a Smart Software Manager satellite (see Figure 5-8). For more information on how to create a token for product instance registration, see the section “Creating a Registration Token,” earlier in this chapter.

Images

Figure 5-8 Getting a Registration License
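For example, the registration command takes the following form; the token string shown here is only a placeholder for the ID token you copied from Cisco Smart Software Manager:

# stcli license register --idtoken <paste-registration-token-string-here>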

Step 4. Confirm that your HX storage cluster is registered by using the stcli license show summary command, as demonstrated in Figure 5-9.

Images

Figure 5-9 Confirming the Registration License
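The summary for a successfully registered cluster resembles the following abbreviated output. The account names are placeholders, and the exact layout varies by HX Data Platform release:

# stcli license show summary

Smart Licensing is ENABLED

Registration:
  Status: REGISTERED
  Smart Account: <your Smart Account>
  Virtual Account: <your virtual account>

License Authorization:
  Status: AUTHORIZED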

Virtual Machine Management

Cisco HyperFlex provides native virtual machine management features such as HX snapshots, ReadyClones, and datastore management. This section discusses the concepts, functionality, best practices, and configuration of these features in detail.

HX Data Platform Native Snapshots Overview

HX Data Platform Native Snapshots is a backup feature that saves versions (states) of working VMs. A native snapshot is a reproduction of a VM that includes the state of the data on all VM disks and the VM power state (on, off, or suspended) at the time the native snapshot is taken. You can take a native snapshot to save the current state of a VM so that you can later revert to the saved state.

You can use the HX Data Platform plug-in to take native snapshots of your VMs. The HX Data Platform Native Snapshot options include creating a native snapshot, reverting to any native snapshot, and deleting a native snapshot. Timing options include hourly, daily, and weekly, all in 15-minute increments.

Benefits of HX Data Platform Native Snapshots

HX Data Platform native snapshots provide the following benefits:

  • Reverting registered VMs: If a VM is registered, whether powered on or powered off, native snapshots, just like VM snapshots, can be used to revert to an earlier point in time (that is, the time when the snapshot was created).

  • High performance: The HX Data Platform native snapshot process is fast because it does not incur I/O overhead.

  • VM performance: HX Data Platform native snapshots do not degrade VM performance.

  • Crash consistent: HX Data Platform native snapshots are crash consistent by default; this means the correct order of write operations is preserved, to enable an application to restart properly from a crash.

  • Application consistent: You can select the quiesce option of the stcli vm snapshot command through the HX Data Platform CLI to enable HX Data Platform native snapshots to be application consistent (a CLI example follows this list). The applications in the guest VM run transparently, exactly as they do in the host VM.

    Quiescing a file system involves bringing the on-disk data of a physical or virtual computer into a state suitable for backups. This process might include operations such as flushing dirty buffers from the operating system’s in-memory cache to disk, as well as other higher-level application-specific tasks.

  • Scheduled snapshots are tolerant to node failures: Scheduled snapshots are tolerant to administrative operations that require a node shutdown, such as HX maintenance mode and HX online upgrades.

  • Unified interface: You can manage native snapshots created through the HX Data Platform plug-in by using the VMware snapshot manager.

  • Individual or grouped: You can take native snapshots on a VM level, VM folder level, or resource pool level.

  • Granular progress and error reporting: Progress and errors are reported at the task level for resource pools, VM folders, and individual VMs.

  • Instantaneous snapshot delete: Deletion of a snapshot and consolidation always occur instantaneously.

  • Parallel batch snapshots: HX supports up to 255 VMs in a resource pool or folder for parallel batched snapshots.

  • VDI deployment support: HX scheduled snapshots are supported for desktop VMs on VDI deployments using VMware native technology.

  • Recoverable VM: The VM is always recoverable when there are snapshot failures.

  • Datastore access: Snapshots work on partially mounted/accessible datastores as long as the VM being snapshotted is on an accessible mountpoint.
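As mentioned in the application-consistency bullet earlier, quiesced (application-consistent) snapshots are taken with the stcli vm snapshot command from the controller VM. The exact flag names vary by HX Data Platform release, so treat the options in the second line below as assumptions and confirm them with the built-in help first:

# stcli vm snapshot -h
# stcli vm snapshot --name <vm-name> --snapshot-name app-consistent-01 --quiesce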

Native Snapshot Considerations

Some snapshot parameters to consider are as follows:

  • Native snapshots: After you create the first native snapshot using the HX Data Platform plug-in, if you create more snapshots in vSphere Web Client, these are considered to be native as well. However, if you create the first snapshot using vSphere Web Client and not the HX Data Platform plug-in, the vSphere Web Client snapshots are considered to be non-native.

  • Maximum number of stored snapshots: Currently VMware has a limitation of 31 snapshots per VM. This maximum total includes VMware-created snapshots, the HX Data Platform SENTINEL snapshot, and HX Data Platform native snapshots.

  • Scheduled snapshots: Do not schedule overlapping snapshots on VMs and their resource pools.

  • Deleted VMs: The life cycle of native snapshots, as with VM snapshots, is tied to the virtual machine. If a VM is deleted, accidentally or intentionally, all associated snapshots are also deleted. Snapshots do not provide a mechanism to recover from a deleted VM. Use a backup solution to protect against VM deletion.

  • HX Data Platform storage controller VMs: You cannot schedule snapshots for storage controller VMs.

  • Non-HX Data Platform VMs: Snapshots fail for any VM that is not on an HX Data Platform datastore. This applies to snapshots on a VM level, VM folder level, or resource pool level. To make a snapshot, the VM must reside on an HX Data Platform datastore in an HX Data Platform storage cluster.

  • Suspended VMs: Creating the first native snapshot, the SENTINEL snapshot, from VMs in suspended state is not supported.

  • VM Size: The maximum size of a VM that a HyperFlex snapshot can capture depends on the maximum size of an individual Virtual Machine Disk (VMDK), the maximum number of attached disks, and the overall size of the VM.

  • VM Name: The VM name must be unique within vCenter in order to take a snapshot.

  • Ready storage cluster: To allow a native snapshot, the storage cluster must be healthy, have sufficient space, and be online. The datastores must be accessible. The VMs must be valid and not in a transient state, such as being in the middle of a vMotion migration.

  • vMotion: vMotion is supported on VMs with native snapshots.

  • Storage vMotion: Storage vMotion is not supported on VMs with native snapshots. If a VM needs to be moved to a different datastore, delete the snapshots before running Storage vMotion.

  • VM datastores: Ensure that all the VM (VMDK) disks are on the same datastore prior to creating native snapshots. This applies to snapshots created with HX Snapshot Now and snapshots created with HX Scheduled Snapshots.

  • Thick disks: If the source disk is thick, then the snapshot of the VM’s disk will also be thick. Increase the datastore size to accommodate the snapshot.

  • Virtual disk types: VMware supports a variety of virtual disk backing types. The most common is the FlatVer2 format. Native snapshots are supported for this format. There are other virtual disk formats, such as Raw Device Mapping (RDM), SeSparse, and VmfsSparse (Redlog format). VMs containing virtual disks of these formats are not supported for native snapshots.

Native Snapshot Best Practices

Always use the HX Data Platform Snapshot feature to create your first snapshot of a VM. This ensures that all subsequent snapshots are in native format. Here are some additional recommended best practices:

  • Do not use the VMware Snapshot feature to create your first snapshot.

    VMware snapshots use redo log technology that results in degraded performance of the original VM. This performance degrades further with each additional snapshot. Native format snapshots do not impact VM performance after the initial native snapshot is created. If you have any redo log snapshots, then on the ESXi hosts where the redo log snapshots reside, edit the /etc/vmware/config file and set snapshot.asyncConsolidate="TRUE".

  • Add all the VMDKs to the VM prior to creating the first snapshot.

    When VMDKs are added to the VM, additional SENTINEL snapshots are taken. Each additional SENTINEL consumes space for additional snapshots. For example, if you have an existing VM and you add two new VMDKs, at the next scheduled snapshot, one new SENTINEL is created. Check the snapshot schedule retention number to be sure you have sufficient snapshot slots available: one for the new SENTINEL and one for the snapshot.

  • When creating large numbers of snapshots, consider the following:

    • Schedule the snapshots at a time when you expect data traffic might be low.

    • Use multiple resource pools or VM folders to group VMs rather than using a single resource pool or VM folder. Then stagger the snapshot schedule by group. For example, for resourcePool1 schedule snapshots at :00, for resourcePool2 schedule snapshots at :15, and for resourcePool3 schedule snapshots at :30.

  • If you have vCenter running on a VM in the storage cluster, do not take a native snapshot of the vCenter VM.

Understanding SENTINEL Snapshots

When you create the first snapshot of a VM, through either Snapshot Now or Scheduled Snapshot, the HX Data Platform plug-in creates a base snapshot called a SENTINEL snapshot. The SENTINEL snapshot ensures that follow-on snapshots are all native snapshots.

SENTINEL snapshots prevent reverted VMs from having VMware redo log-based virtual disks. Redo log-based virtual disks occur when an original snapshot is deleted and the VM is reverted to the second-oldest snapshot.

SENTINEL snapshots are in addition to the revertible native snapshot. The SENTINEL snapshot consumes 1 snapshot of the total 31 available per the VMware limitation.

Keep in mind two important considerations when using SENTINEL snapshots:

  • Do not delete the SENTINEL snapshot.

  • Do not revert your VM to the SENTINEL snapshot.

Native Snapshot Timezones

Three objects display and affect the timestamps and schedule of snapshots:

  • vSphere and vCenter use UTC time.

  • vSphere Web Client uses the browser time zone.

  • HyperFlex Data Platform components such as the HX Data Platform plug-in, storage cluster, and storage controller VM use the same configurable time zone; the default is UTC.

The storage controller VM time is used to set the schedule. The vSphere UTC time is used to create the snapshots. The logs and timestamps vary depending on the method used to view them.

Creating Snapshots

Redo log snapshots are snapshots that are created through the VMware Snapshot feature and not through the HX Data Platform Snapshot feature. If you have any redo log snapshots for VMs in an HX storage cluster, edit the ESXi host configuration where the redo log snapshots reside. If this step is not completed, VMs might be stunned during snapshot consolidation. Follow these steps to edit the ESXi host configuration:

Step 1. Log in to the ESXi host command line.

Step 2. Locate and open the file /etc/vmware/config for editing.

Step 3. Set the snapshot.asyncConsolidate parameter to TRUE (that is, snapshot.asyncConsolidate="TRUE").
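If you prefer to make the change from the ESXi shell rather than with a text editor, a minimal approach is to append the setting and then verify it. This assumes the parameter is not already present in the file; check first with grep:

# grep asyncConsolidate /etc/vmware/config
# echo 'snapshot.asyncConsolidate="TRUE"' >> /etc/vmware/config
# grep asyncConsolidate /etc/vmware/config
snapshot.asyncConsolidate="TRUE"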

Creating Snapshots Workflow

Step 1. From the vSphere Web Client navigator, select the VM level, VM folder level, or resource pool level. For example, select vCenter Inventory Lists > Virtual Machines to display the list of VMs in vCenter.

Step 2. Select a VM and either right-click the VM and click Actions or click the Actions menu in the VM information portlet.

Note

Ensure that there are no non-HX Data Platform datastores on the storage cluster resource pool, or the snapshot will fail.

Step 3. From the Actions menu, select Cisco HX Data Platform > Snapshot Now, as shown in Figure 5-10, to open the Take VM Native Snapshot for Test dialog box.

Images

Figure 5-10 Selecting Snapshot Now

Step 4. In the Take VM Native Snapshot for Test dialog box (see Figure 5-11), enter a name for the snapshot and type a description of the snapshot. Click OK to accept your configuration.

Images

Figure 5-11 Adding a Native Snapshot Name and Description

Scheduling Snapshots

To schedule snapshots, follow these steps:

Step 1. From the vSphere Web Client navigator, select the VM or resource pool list. For example, select vCenter Inventory Lists > Virtual Machines to display the list of VMs in vCenter.

Step 2. Select a VM or resource pool and either right-click the VM or resource pool and click Actions or click the Actions menu in the VM information portlet.

Step 3. From the Actions menu, select Cisco HX Data Platform > Schedule Snapshot (see Figure 5-12) to open the Schedule Snapshot dialog box.

Images

Figure 5-12 Scheduling a Snapshot

Step 4. Complete the Schedule Snapshot dialog box, shown in Figure 5-13, as follows:

a. To select the snapshot frequency, click the boxes for hourly, daily, and/or weekly frequency and set the starting days, times, and duration.

b. Set the number of snapshots to retain. When the maximum number is reached, older snapshots are removed as newer snapshots are created.

c. Unselect existing scheduled items, as needed. If a previous schedule existed, unselecting items deletes those items from the future schedule.

d. Click OK to accept the schedule and close the dialog.

Images

Figure 5-13 Configuring the Schedule

Reverting to a Snapshot

Reverting to a snapshot means returning a VM to a state stored in a snapshot. Reverting to a snapshot is performed on one VM at a time. It is not performed at the resource pool or VM folder level. Reverting to snapshots is performed through the vCenter Snapshot Manager and not through the HX Data Platform plug-in. Follow these steps to revert to a snapshot:

Step 1. From the vSphere Web Client navigator, select the VM level, VM folder level, or resource pool level. For example, select vCenter Inventory Lists > Virtual Machines to display the list of VMs in vCenter.

Step 2. Select a storage cluster VM and either right-click the VM and click Actions or click the Actions menu in the VM information portlet.

Step 3. From the Actions menu, select Snapshots > Manage Snapshots (see Figure 5-14) to open the vSphere Snapshot Manager.

Images

Figure 5-14 Managing Snapshots

Step 4. In the Snapshot Manager, select a snapshot to revert to from the hierarchy of snapshots for the selected VM and then select All Actions > Revert to, as shown in Figure 5-15.

Images

Figure 5-15 Reverting to a Snapshot

Step 5. Click Yes to confirm the reversion (see Figure 5-16).

Images

Figure 5-16 Confirm Revert to Snapshot Dialog Box

The reverted VM is included in the list of VMs and powered off. In selected cases, a VM reverted from a VM snapshot is already powered on. See Table 5-1 for more details.

Table 5-1 VM Power State After Restoring an HX VM Snapshot

  • Snapshot taken while the VM was powered on (includes memory): The VM reverts to the HX VM snapshot and is powered on and running.

  • Snapshot taken while the VM was powered on (does not include memory): The VM reverts to the HX VM snapshot and is powered off.

  • Snapshot taken while the VM was powered off (does not include memory): The VM reverts to the HX VM snapshot and is powered off.

Step 6. If the reverted VM is powered off, select the VM and power it on.

Deleting Snapshots

You delete snapshots through the vSphere interface and not through the HX Data Platform plug-in. Follow these steps:

Step 1. From the vSphere Web Client navigator, select VMs and Templates > vcenter_server > Snapshots > datacenter > VM.

Step 2. Right-click the VM and select Snapshots > Manage Snapshots.

Step 3. Right-click the snapshot you want to delete and click Delete.

Step 4. In the Confirm Delete dialog box that appears, click YES, as shown in Figure 5-17.

Images

Figure 5-17 Deleting a Snapshot

Note

Delete the SENTINEL snapshot by using the Delete All option only. Do not delete the SENTINEL snapshot individually because it is the base snapshot, and all subsequent HX snapshots depend on it.

ReadyClones

HX Data Platform ReadyClones is a pioneering storage technology that enables you to rapidly create and customize multiple cloned VMs from a host VM. It enables you to create multiple copies of VMs that can then be used as standalone VMs.

Clones are useful when you deploy many identical VMs to a group. A ReadyClone, much like a standard clone, is a copy of an existing VM. The existing VM is called the host VM. When the cloning operation is complete, the ReadyClone is a separate guest VM.

Changes made to a ReadyClone do not affect the host VM. A ReadyClone’s MAC address and UUID are different from those of the host VM.

Installing a guest operating system and applications can be time-consuming. With ReadyClones, you can make many copies of a VM from a single installation and configuration process.

Benefits of HX Data Platform ReadyClones

The HX Data Platform ReadyClones feature provides the following benefits:

  • Create multiple clones of a VM at a time: Simply right-click a VM and create multiple clones of the VM by using the ReadyClones feature.

  • Rapid cloning: HX Data Platform ReadyClones is extremely fast and more efficient than legacy cloning operations because it supports VMware vSphere Storage APIs—Array Integration (VAAI) data offloads. VAAI, also called hardware acceleration or hardware offload APIs, is a set of APIs to enable communication between VMware vSphere ESXi hosts and storage devices. Use HX Data Platform ReadyClones to clone VMs in seconds instead of minutes.

  • Batch customization of guest VMs: Use the HX Data Platform Customization Specification to instantly configure parameters such as IP address, hostname, and VM name for multiple guest VMs cloned from a host VM.

  • Automation of several steps to a one-click process: The HX Data Platform ReadyClones feature automates the task of creating guest VMs.

  • VDI deployment support: ReadyClones is supported for desktop VMs on VDI deployments using VMware native technology.

  • Datastore access: ReadyClones works on partially mounted/accessible datastores as long as the VM being cloned is on an accessible mountpoint.

Supported Base VMs

HX Data Platform supports:

  • Base VMs stored on an HX Data Platform datastore

  • Base VMs with HX Data Platform Snapshots

  • A maximum of 2048 ReadyClones from 1 base VM

  • A maximum of 256 ReadyClones created in 1 batch at a time

HX Data Platform does not support:

  • Powered-on base VMs with a Windows Server 2008 or Windows Server 2012 guest OS

  • Powered-on base VMs with more than 30 snapshots

  • Powered-on base VMs with redo log snapshots

ReadyClones Requirements

The requirements for ReadyClones are as follows:

  • VMs must be within the HX Data Platform storage cluster. Non-HX Data Platform VMs are not supported.

  • VMs must reside on an HX Data Platform datastore, VM folder, or resource pool.

  • ReadyClones fail for any VM that is not on an HX Data Platform datastore. This applies to ReadyClones on a VM level, VM folder level, or resource pool level.

  • VMs can have only native snapshots. ReadyClones cannot be created from VMs with snapshots that have redo logs (that is, non-native snapshots).

  • SSH must be enabled in ESXi on all the nodes in the storage cluster.

  • You can use only the single vNIC customization template for ReadyClones.

ReadyClones Best Practices

When working with ReadyClones, keep the following best practices in mind:

  • Use the customization specification as a profile or a template.

  • Ensure that properties that apply to the entire batch are in the customization specification.

  • Obtain user-defined parameters from the HX Data Platform ReadyClones batch cloning workflow.

  • Use patterns to derive per-clone identity settings such as the VM guest name.

  • Ensure that the network administrator assigns static IP addresses for guest names and verify these addresses before cloning.

  • You can create a batch of 1 through 256 at a given time.

  • Do not create multiple batches of clones simultaneously on the same VM (when it is powered on or powered off) because doing so causes failures or displays incorrect information on the master task updates in the HX Data Platform plug-in.

Creating ReadyClones Using HX Connect

Use the HX Data Platform ReadyClones feature to populate a cluster by creating multiple clones of a VM, each with a different static IP address. Follow these steps:

Step 1. Log in to HX Connect as an administrator.

Step 2. From the Virtual Machines page, select a virtual machine and then click ReadyClones, as shown in Figure 5-18.

Images

Figure 5-18 ReadyClones: Selecting a Virtual Machine

Step 3. Complete the ReadyClones dialog box, shown in Figure 5-19, as outlined in Table 5-2.

Images

Figure 5-19 ReadyClones Dialog Box

Table 5-2 ReadyClones Dialog Box Fields

  • Number of clones: Enter the number of ReadyClones that you want to create. You can create a batch of 1 through 256 clones at a given time.

  • Customization Specification: (Optional) Select a customization specification for the clones from the drop-down list. The system filters the customization specifications for the selected host virtual machine. For example, if the selected host virtual machine uses a Windows guest OS, the drop-down list displays Windows customization specifications.

  • Resource Pool: (Optional) If you have resource pools defined in the HX storage cluster, you can select one to store the ReadyClones of the selected virtual machine.

  • VM Name Prefix: Enter a prefix for the guest virtual machine name. This prefix is added to the name of each ReadyClone created.

  • Starting clone number: Enter a clone number for the starting clone. Each ReadyClone must have a unique name, and numbering is used to ensure a unique element in each name.

  • Increment clone numbers by: Enter the value by which the clone number in the guest virtual machine name is increased, or leave the default value of 1. The system appends a number to the name of each ReadyClone (such as clone1, clone2, and clone3). By default, the numbering starts from 1, but you can change this value to any number.

  • Use same name for Guest Name: Select this checkbox to use the vCenter VM inventory name as the guest virtual machine name. If you uncheck this box, a text box is enabled; enter the name you want to use for the guest virtual machine name.

  • Preview: After the required fields are completed, HX Data Platform lists the proposed ReadyClones names. As you change the content in the required fields, the Clone Name and Guest Name fields update.

  • Power on VMs after cloning: Select this checkbox to power on the guest virtual machines after the cloning process completes.

Step 4. Click Clone. HX Data Platform creates the appropriate number of ReadyClones with the naming and location specified.

Creating ReadyClones Using the HX Data Platform Plug-in

If you use the VMware cloning operation, you can create only a single clone from a VM. This operation is manual and slower than batch processing multiple clones from a VM. For example, to create 20 clones of a VM, you must manually perform the clone operation over and over again. Follow these steps to create ReadyClones using the HX Data Platform plug-in:

Step 1. From the vSphere Web Client navigator, select Global Inventory Lists > Virtual Machines to open a list of VMs in vCenter.

Step 2. Select a VM and either right-click the VM and click Actions or click the Actions menu in the VM information portlet.

Step 3. From the Actions menu, select Cisco HX Data Platform > ReadyClones, as shown in Figure 5-20.

Images

Figure 5-20 HX Data Platform ReadyClones Option

The ReadyClones dialog box appears, as shown in Figure 5-21.

Images

Figure 5-21 ReadyClones Configuration

Step 4. Enter any changes you want to make and click OK to apply these configuration changes.

Note

As part of the ReadyClones workflow, a temporary snapshot is listed in vCenter and HX Connect. It is listed as an extra powered-off VM transiently—that is, only while the ReadyClones are being created.

Datastores

Datastores are logical containers that HX Data Platform uses to manage your storage usage and storage resources. Datastores are where the host places virtual disk files and other VM files. Datastores hide the specifics of physical storage devices and provide a uniform model for storing VM files.

You can add datastores, refresh the list, edit the names and sizes of datastores, delete datastores, and mount and unmount datastores from either HX Connect or the HX Data Platform plug-in. You can only rename an unpaired datastore that is unmounted. Do not rename a datastore using the vCenter administrator interface.

Keep in mind these important considerations:

  • Keep the number of datastores to as few as possible to avoid startup delay and to keep clone savings high.

  • Configuring more than 10 datastores could result in excessive startup delay.

Adding Datastores

Datastores are logical containers, similar to file systems, that hide specifics of physical storage and provide a uniform model for storing VM files. You can also use datastores to store ISO images and VM templates. To add a datastore, follow these steps:

Step 1. Choose an interface using either of these methods:

  • From the vSphere Web Client navigator, select vCenter Inventory Lists > Cisco HyperFlex Systems > Cisco HX Data Platform > cluster > Manage > Datastores.

  • From HX Connect, select Datastores.

Step 2. Click Create Datastore.

Step 3. Enter a name for the datastore. vSphere Web Client enforces a 42-character limit for the datastore name, and each datastore name needs to be unique.

Step 4. Specify the datastore size and choose GB or TB from the drop-down list.

Step 5. Specify the data block size. From HX Connect, choose 8K or 4K; the default is 8K. In the HX Data Platform plug-in, the default block size is assumed. For VDI workloads, the default is 4K.

Step 6. Click OK to accept your changes or Cancel to cancel all changes.

Step 7. To verify the addition of the datastore, click the Refresh icon and ensure that the new datastore is listed. From the HX Data Platform plug-in, click Manage > Datastores > Hosts to see the mount status of the new datastore. If you check the datastore through the vSphere Client application, by selecting host > Configuration > Datastores, the drive type is listed as Unknown; this is expected vSphere behavior.
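Datastores can also be created from the controller VM command line with the stcli datastore subcommands. The flags shown below reflect common HX Data Platform releases but should be treated as assumptions and verified with stcli datastore create -h; the datastore name and size are examples only:

# stcli datastore create --name DS-PROD-01 --size 2 --unit tb
# stcli datastore list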

Creating Datastores Using the HX Data Platform Plug-in

The workflow in Figure 5-22 shows how to create datastores using the HX Data Platform plug-in on VMware vCenter.

Images

Figure 5-22 Create a Datastore by Using the HX Data Platform Plug-in

Creating Datastores Using HX Connect

Figure 5-23 shows how to create a datastore using HX Connect.

Images

Figure 5-23 Creating a Datastore Using HX Connect

Scaling HyperFlex Clusters

One of the advantages of the HyperFlex solution is the ease with which you can scale an existing HyperFlex system. This section covers how to perform a node expansion, how to perform a node removal (for both converged and compute-only nodes), and how to increase storage capacity of existing HyperFlex nodes.

Node Expansion

You can add converged or compute-only nodes to expand a HyperFlex cluster. The following is the list of supported mixed-cluster expansion guidelines (for both converged and compute-only nodes) in HyperFlex clusters:

  • Expanding an existing M4 cluster with M5 converged nodes is supported.

  • Expanding an existing M5 cluster with M4 converged nodes is not supported.

  • Expanding an existing mixed M4/M5 cluster with M4 or M5 converged nodes is supported.

  • Adding any supported compute-only nodes is permitted with all M4, M5, and mixed M4/M5 clusters using the HX Data Platform installer.

  • Only the expansion workflow is supported for creating a mixed cluster. Initial cluster creation with mixed M4/M5 servers is not supported.

  • All M5 servers must match the form factor (220/240), type (hybrid/AF), security capability (non-SED only), and disk configuration (quantity, capacity, and non-SED) of the existing M4 servers.

  • HX Edge, SED, LFF, Hyper-V, and stretch clusters do not support mixed M4/M5 clusters.

Note

If you have replication configured, put replication in pause mode before performing an upgrade, an expansion, or cluster maintenance. After the upgrade, expansion, or cluster maintenance is complete, resume replication. Perform the pause and resume on any cluster that has replication configured to or from this local cluster.

ESXi installation is supported on SD cards for M4 converged nodes and M.2 SATA SSD for M5 converged nodes. For compute-only nodes, ESXi installation is supported for SD Cards, SAN boot, or front SSD/HDD. Installing ESXi on USB flash is not supported for compute-only nodes.

Before you start adding a converged or compute node to an existing storage cluster, make sure that the following prerequisites are met:

  • Ensure that the storage cluster state is healthy (a quick CLI check is shown after this list).

  • Ensure that the new node meets the system requirements listed under Installation Prerequisites, including network and disk requirements.

  • Ensure that the new node uses the same configuration as the other nodes in the storage cluster (for example, VLAN IDs, tagging, vSwitch configuration, and so on).

  • To add a node that has a different CPU family from what is already in use in the HyperFlex cluster, enable EVC.

  • Allow ICMP for pings between the HX Data Platform installer and the existing cluster management IP address.
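To verify the first prerequisite in the list above (a healthy cluster), run the following from any controller VM and confirm that the cluster reports a healthy state and is online before you start the expansion:

# stcli cluster info --summary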

The sections that follow describe how to add converged and compute-only nodes to expand a HyperFlex cluster.

Adding a Converged Node

You can add a converged node to a HyperFlex cluster after cluster creation. The storage on a converged node is automatically added to the cluster’s storage capacity.

Follow these steps to add a converged node to an existing standard cluster:

Step 1. Launch the Cisco HX Data Platform installer. (Use the same version of installer as the version of the HX cluster.)

Step 2. On the Workflow page, select Expand Cluster > Standard Cluster, as shown in Figure 5-24.

Images

Figure 5-24 Cluster Expansion: Standard Cluster

Step 3. On the Credentials page that appears, complete all the fields, as shown in Figure 5-25, and click Continue.

Images

Figure 5-25 Expansion Workflow: Credentials Page

Step 4. On the Cluster Expand Configuration page that appears (see Figure 5-26), select the HX cluster that you want to expand and click Continue.

Images

Figure 5-26 Selecting a Cluster to Expand

Step 5. On the Server Selection page, review the list of unassociated HX servers under the Unassociated tab (see Figure 5-27) and the list of discovered servers under the Associated tab. Select the servers under the Unassociated tab to include in the HyperFlex cluster. Click Continue.

Images

Figure 5-27 Selecting a Server

If HX servers do not appear in this list, check Cisco UCS Manager and ensure that they have been discovered.

Step 6. On the UCSM Configuration page that appears, complete the fields for each network and configure the HyperFlex cluster name, as shown in Figure 5-28, and click Continue.

Images

Figure 5-28 UCSM Configuration

Step 7. On the Hypervisor Configuration page that appears (see Figure 5-29), complete all the fields and click Continue.

Images

Figure 5-29 Hypervisor Configuration

Step 8. On the IP Addresses page that appears (see Figure 5-30), add more compute or converged servers, as desired, by clicking Add Compute Server or Add Converged Server. Select Make IP Addresses Sequential if you want the IP addresses to be assigned sequentially. For the IP addresses, specify whether the network should belong to the data network or the management network. For each HX node, complete the appropriate fields for hypervisor management and data IP addresses. When you’re finished with the settings on this page, click Start. A Progress page displays the progress of various configuration tasks.

Images

Figure 5-30 Configuring IP Addresses

Note

If the vCenter cluster has EVC enabled, the deployment process fails with the message “The host needs to be manually added to vCenter.” To successfully perform the deploy action, do the following:

Step 1. Log in to the ESXi host to be added in vSphere Client.

Step 2. Power off the controller VM.

Step 3. Add the host to the vCenter cluster in vSphere Web Client.

Step 4. In the HX Data Platform installer, click Retry Deploy.

Step 9. When cluster expansion is complete, start managing your storage cluster by clicking Launch HyperFlex Connect.

Note

When you add a node to an existing storage cluster, the cluster continues to have the same HA resiliency as the original storage cluster until auto-rebalancing takes place at the scheduled time. Rebalancing typically occurs within a scheduled 24-hour window, 2 hours after a node fails, or when the storage cluster runs out of space.

Adding a Compute Node

You can add a compute-only node to a HyperFlex cluster after cluster creation to provide extra compute resources. The Cisco UCS server does not need any caching or persistent drives because compute-only nodes do not contribute storage capacity to the cluster. The steps for adding a compute node are similar to those for expanding a HyperFlex cluster with a converged node.

The workflow differs from the converged node workflow in the following minor ways:

  • Select Compute Only Server under Server Selection.

  • In the IP Addresses section, compute-only nodes do not need storage controller management or storage controller data IP addresses.

Figure 5-31 shows the configuration option when adding a compute server.

Images

Figure 5-31 Adding a Compute Server

Note

After you add a compute-only node to an existing cluster, you must manually configure the vmk2 interface for vMotion.

Expanding a Stretch Cluster

You can perform cluster expansion on a HyperFlex stretch cluster when an already deployed stretch cluster needs additional storage capacity. After node expansion, the storage on a converged node is automatically added to the cluster’s storage capacity.

Consider the following cluster expansion guidelines:

  • Stretch cluster expansion supports both converged nodes and compute-only nodes.

  • When adding a converged node, ensure that the configuration is symmetric across both sites. For instance, if Site 1 is expanded with two nodes, Site 2 must also be expanded with two converged nodes.

  • When adding compute nodes, ensure that you do not exceed the supported node count.

To expand a stretch cluster, you need to take the steps outlined in the following sections.

Configuring Sites for Expanding a Cluster

Before you can expand a stretch cluster, you must re-create the sites in the installer as they were originally deployed. Follow these steps:

Step 1. Log in to the Cisco HX Data Platform installer.

Step 2. On the Select a Workflow page, select Expand Cluster > Stretch Cluster, as shown in Figure 5-32, and click Continue.

Images

Figure 5-32 Expansion Workflow: Stretch Cluster

Step 3. On the Cluster page that appears, enter the cluster management hostname, as shown in Figure 5-33, and click Continue.

Images

Figure 5-33 Cluster Information: Configuring a Site

Step 4. On the Credentials page that appears, as shown in Figure 5-34, select Configure Site and then enter the UCS Manager and hypervisor credentials. Click Continue.

Images

Figure 5-34 Credentials: Configuring a Site

Step 5. On the Server Selection page that appears, configure the server ports and associate the new HX expansion nodes with the site, as shown in Figure 5-35, and click Continue.

Images

Figure 5-35 Node Selection

Step 6. On the Node Configuration page that appears, configure the subnet mask, gateway, and hypervisor settings as shown in Figure 5-36, and click Start to begin site configuration for the expanded cluster. A progress page displays the progress of various configuration tasks.

Images

Figure 5-36 Node Configuration

Step 7. Repeat steps 1 through 6 for the second site.

Expanding a Cluster

To expand a cluster, follow these steps:

Step 1. On the Cluster page, as shown in Figure 5-37, enter the cluster management hostname and click Continue.

Images

Figure 5-37 Cluster Information

Step 2. On the Credentials page that appears, as shown in Figure 5-38, select Expand Stretch Cluster, enter the credentials information, and click Continue.

Images

Figure 5-38 Expanding a Stretch Cluster

Step 3. Configure the server ports and associate HyperFlex servers.

Step 4. On the IP Addresses page, as shown in Figure 5-39, configure the hypervisor and IP addresses, select the site, and click Start to start the cluster expansion process.

Images

Figure 5-39 Expanding a Stretch Cluster by Adding a Converged Node

Removing Nodes

You can remove converged or compute-only nodes to reduce the size of a HyperFlex cluster. This section provides the guidelines for node removal of both converged and compute-only nodes in HyperFlex clusters.

Removing Converged Nodes

Depending on the node maintenance task, removing a node can occur while the storage cluster is online or offline. Ensure that you have completed the preparation steps before removing a node.

Note

It is highly recommended that you work with your account team when removing a converged node in a storage cluster. Do not reuse the removed converged node or its disks in the original cluster or in another cluster.

The steps to take in removing a node depend on the cluster size. Table 5-3 provides an overview of the steps for removing nodes from clusters of different sizes.

Table 5-3 Steps for Removing Converged Nodes

  • Three-node cluster, removing one or more nodes: Node removal requires Cisco TAC assistance.

  • Four-node cluster, removing one node:

    1. Ensure that the cluster is healthy.

    2. Put the affected node in Cisco HX maintenance mode.

    3. Shut down the cluster (take the cluster offline) by using the stcli cluster shutdown command.

    4. Remove the node by using the stcli node remove command.

    5. Restart the cluster by using the stcli cluster start command.

  • Four-node cluster, removing two or more nodes: Node removal requires Cisco TAC assistance.

  • Five-node cluster, removing one node:

    1. Ensure that the cluster is healthy.

    2. Put the affected node in Cisco HX maintenance mode.

    3. Remove the node by using the stcli node remove command. The cluster remains online throughout.

  • Five-node cluster, removing two nodes:

    1. Ensure that the cluster is healthy.

    2. Put the affected nodes in Cisco HX maintenance mode.

    3. Shut down the cluster (take the cluster offline) by using the stcli cluster shutdown command.

    4. Remove the nodes by using the stcli node remove command, specifying both nodes.

    5. Restart the cluster by using the stcli cluster start command.

  • Five-node cluster, removing three or more nodes: Node removal requires Cisco TAC assistance.

Removing a Node from an Online Storage Cluster

Depending on the node maintenance task, removing a node can occur while the storage cluster is online or offline. Removing a node from a storage cluster while the cluster remains online has slightly different requirements from removing a node while a cluster is offline. Follow these steps to remove a node from an online storage cluster:

Note

It is highly recommended that you work with TAC when removing a converged node in a storage cluster. Do not remove the controller VM or other HX Data Platform components.

Step 1. To prepare to remove a node, do the following:

  • Ensure that the cluster is healthy by entering the stcli cluster info command.

  • Ensure that SSH is enabled in ESXi on all the nodes in the storage cluster.

  • Ensure that DRS is enabled or manually move the VMs from the node.

  • Put the node being removed into HX maintenance mode (a CLI sketch follows this list).

  • Log in to the controller VM of a node that is not being removed.
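HX maintenance mode is normally entered from HX Connect or the vSphere HX plug-in; newer HX Data Platform releases also expose it through stcli. The flags below are based on that CLI but should be verified with stcli node maintenanceMode -h on your release, and the IP address is a placeholder:

# stcli node maintenanceMode --ip <esxi-mgmt-ip-of-node-being-removed> --mode enter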

Step 2. Rebalance the storage cluster to ensure that all datastores associated with the node will be removed. The rebalance command is used to realign the distribution of stored data across changes in available storage and to restore storage cluster health. If you add or remove a node in the storage cluster, you can manually initiate a storage cluster rebalance by using the stcli rebalance command.

Note

Rebalancing might take some time, depending on the disk capacity used on the failed node or disk.

Log in to a controller VM in the storage cluster. From the controller VM command line, run the stcli rebalance start --force command and then wait and confirm that rebalance has completed.
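A typical sequence, run from a controller VM that is staying in the cluster, is shown below. The rebalance status subcommand is a convenient way to confirm completion (verify its availability on your release); the output shown is abbreviated and representative:

# stcli rebalance start --force
# stcli rebalance status
rebalanceStatus:
    rebalanceState: cluster_rebalance_not_running
    percentComplete: 100
rebalanceEnabled: True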

Step 3. Remove the desired node by using the stcli node remove command, which has the following syntax:

stcli node remove [-h] {--id-1 ID1 | --ip-1 NAME1}
[{--id-2 ID2 | --ip-2 NAME2}] [-f]

For example:

# stcli node remove --name-1 esx.SVHOST144A.complab

The response should be something like the following:

Successfully removed node: EntityRef(type=3, id='', name=
'esx.SVHOST144A.complab')

Figure 5-40 shows this command in use.

Images

Figure 5-40 Running the stcli node remove Command

Note

The stcli node remove command unmounts all datastores, removes the node from the cluster ensemble, resets the EAM for this node, stops all services (stores, cluster management IP), and removes all firewall rules. This command does not remove the node from vCenter; the node remains in vCenter. It also does not remove the installed HX Data Platform elements, such as the controller VM.

When the stcli node remove command completes successfully, the system rebalances the storage cluster until the storage cluster state is healthy. Do not perform any failure tests during this time. The storage cluster remains healthy.

As the node is no longer in the storage cluster, you do not need to exit HX maintenance mode.

Note

If you want to reuse a removed node in another storage cluster, contact Cisco TAC. Additional steps are required to prepare the node for another storage cluster.

Step 4. To confirm that the node has been removed from the storage cluster, run this command:

# stcli cluster info

Check the ActiveNodes entry in the response to verify that the cluster has one less node.
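For example, on a cluster that previously had five converged nodes, you can pull out just the relevant line with grep; the node count shown here is illustrative:

# stcli cluster info | grep -i activenodes
activeNodes: 4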

Step 5. Confirm that all the node-associated datastores are removed.

Note

If any node-associated datastores are listed, then manually unmount and delete those datastores.

Step 6. Remove the host from the vCenter Hosts and Cluster view as follows:

a. Log in to vSphere Web Client navigator and navigate to Host in the vSphere Inventory.

b. Right-click the host and select Enter Maintenance Mode. Click Yes.

c. Right-click the host and select All vCenter Actions > Remove from Inventory. Click Yes.

Step 7. Decommission the host from UCS Manager as follows:

a. Log in to UCS Manager.

b. In the navigation pane, select Equipment > Chassis > Chassis Number > Servers.

c. Choose the HX server you want to decommission. In the work pane, click the General tab.

d. In the Actions area, click Server Maintenance.

e. In the Maintenance dialog box, click Decommission and then click OK.

Removing a Node from an Offline Storage Cluster

Removing a node from an offline storage cluster involves graceful shutdown of the HyperFlex cluster. Follow these steps:

Step 1. To prepare for maintenance operations and removing a node, do the following:

  • Ensure that the cluster is healthy.

  • Ensure that DRS is enabled or manually move the VMs from the node.

  • Rebalance the storage cluster.

  • Put the node being removed into HX maintenance mode.

  • Log in to the controller VM of a node that is not being removed.

Step 2. Prepare to shut down and then shut down the storage cluster as follows:

Note

Step 2 is required only if the cluster has fewer than five nodes or if you’re removing two nodes from a five-node cluster.

a. Gracefully shut down all resident VMs on all the HX datastores.

b. Optionally, vMotion the VMs.

c. Gracefully shut down all VMs on non-HX datastores on HX storage cluster nodes and unmount those datastores.

d. Put all storage cluster nodes in HX maintenance mode.

e. From any controller VM command line, issue the stcli cluster shutdown command:

# stcli cluster shutdown

Step 3. Remove the desired node by using the stcli node remove command. You can specify the node to be removed by either IP address or domain name, as in the following examples:

# stcli node remove --ip-1 10.10.2.4 --ip-2 10.10.2.6

or

# stcli node remove --name-1 esx.SVHOST144A.complab --name-2
esx.SVHOST144B.complab.lab

Note

Enter the second IP address if you are removing a second node from a storage cluster that has five or more nodes.

The response to this command should look something like the following:

Successfully removed node: EntityRef(type=3, id='',
name='10.10.2.4' name='10.10.2.6')

Note

The stcli node remove command unmounts all datastores, removes the node from the cluster ensemble, resets the EAM for this node, stops all services (stores, cluster management IP), and removes all firewall rules. This command does not remove the node from vCenter; the node remains in vCenter. It also does not remove the installed HX Data Platform elements, such as the controller VM.

After the stcli node remove command completes successfully, the system rebalances the storage cluster until the storage cluster state is healthy. Do not perform any failure tests during this time. The storage cluster health remains average.

As the node is no longer in the storage cluster, you do not need to exit HX maintenance mode.

Note

If you want to reuse a removed node in another storage cluster, contact Cisco TAC. Additional steps are required to prepare the node for another storage cluster.

Step 4. To confirm that the node has been removed from the storage cluster, run this command:

# stcli cluster info

Check the ActiveNodes entry in the response to verify that the cluster has one less node.

Step 5. Confirm that all the node-associated datastores are removed.

Note

If any node-associated datastores are listed, manually unmount and delete those datastores.

Step 6. Restart the cluster by using the stcli cluster start command.

Removing a Compute Node

Compute-only nodes do not contribute to storage in a HyperFlex cluster. Therefore, when removing compute nodes, there is no limitation on the number of nodes you can remove. The steps to remove a compute node are as follows:

Step 1. Migrate all the VMs from a compute node that needs to be removed.

Step 2. Unmount the datastore from the compute node.

Step 3. Check whether the cluster is in the healthy state by running the following command:

# stcli cluster info --summary

Step 4. Put the ESXi host in HX maintenance mode.

Step 5. Remove the compute node by using the stcli node remove command from CMIP:

# stcli node remove --ip-1

Here, --ip-1 is the IP address of the node to be removed. (Run the command from the CMIP; the Cisco HX Connect IP address is the cluster IP address.)
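For example, assuming the compute node’s management IP address is 10.10.2.8 (a hypothetical value), the command looks like this:

# stcli node remove --ip-1 10.10.2.8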

Step 6. Remove any DVS from the ESXi host in vCenter, if there is a DVS.

Step 7. Remove the ESXi host from vCenter.

Step 8. Check whether the cluster is in the healthy state by running the following command:

# stcli cluster info --summary

Step 9. Clear stale entries in the compute node by logging out of Cisco HX Connect and then logging into Cisco HX Connect.

Step 10. Disable and reenable the High Availability (HA) and Distributed Resource Scheduler (DRS) services to reconfigure the services after node removal.

Increasing Storage Capacity by Adding Drives

You can increase the datastore capacity of a storage cluster by adding drives if there is an available drive slot on the HyperFlex server.

Add the same size and type of solid-state drives (SSDs) or hard disk drives (HDDs) to each converged node in the storage cluster. For hybrid servers, add HDDs. For all-flash servers, add SSDs.

Note

When performing a hot-plug pull-and-replace on multiple drives from different vendors or of different types, pause for at least 30 seconds between actions. That is, pull a drive, wait 30 seconds and replace it, and wait 30 seconds again before moving on to the next drive.

Hardware (Disk) Replacement

Disks, whether SSDs or HDDs, might fail. If a disk failure occurs, you need to remove the failed disk and replace it. Follow the server hardware instructions for removing and replacing the disks in the host. HX Data Platform identifies the SSD or HDD and incorporates it into the storage cluster. The following sections provide details on how to replace the types of drives in HyperFlex.

Replacing SSDs

The procedures for replacing an SSD vary depending on the type of SSD, as described here:

Step 1. Identify the failed SSD:

  • For cache or persistent SSDs, perform a disk beacon check. Set the beacon. Only cache and persistent SSDs respond to the beacon request. NVMe cache SSDs and housekeeping SSDs do not respond to beacon requests.

  • For cache NVMe SSDs, perform a physical check. These drives are in Drive Bay 1 of the HX servers.

  • For housekeeping SSDs on HXAF240c or HX240c servers, perform a physical check at the back of the server.

  • For housekeeping SSDs on HXAF220c or HX220c servers, perform a physical check at Drive Bay 2 of the server.

Step 2. If the failed SSD is a housekeeping SSD, refer to the section “Replacing Housekeeping SSDs,” later in this chapter.

Step 3. If a failed SSD is a cache or persistent SSD, proceed based on the type of disk:

  • For NVMe SSDs, see the next section, “Replacing NVMe SSDs.”

  • For all other SSDs, follow the instructions for removing and replacing a failed SSD in the host, per the server hardware guide.

After the cache or persistent drive is replaced, HX Data Platform identifies the SSD and updates the storage cluster. When disks are added to a node, the disks are immediately available for HX consumption.

Step 4. To enable Cisco UCS Manager to include new disks in the UCS Manager > Equipment > Server > Inventory > Storage tab, re-acknowledge the server node. This applies to cache and persistent disks.

Note

Re-acknowledging a server is disruptive, so place the server into HX maintenance mode before doing so.

Step 5. If you have replaced an SSD and see the message “Disk successfully scheduled for repair,” it means the disk is present but is still not functioning properly. Check that the disk has been added correctly per the server hardware guide procedures.

Replacing NVMe SSDs

The procedures for replacing an SSD vary depending on the type of SSD. This section describes the steps for replacing NVMe cache SSDs.

Note

Mixing storage disk types or sizes on a server or across a storage cluster is not supported. Also, when replacing NVMe disks, always use the same type and size as the original disk.

The requirements for using NVMe in HX are as follows:

  • NVMe SSDs are supported in HX240 and HX220 All Flash servers.

  • Replacing NVMe SSDs with an HGST SN200 disk requires HX Data Platform version 2.5.1a or later.

  • NVMe SSDs are only allowed in slot 1 of the server. Other server slots do not detect NVMe SSDs.

  • NVMe SSDs are only used for cache. Using them for persistent storage is not supported.

  • Using an NVMe SSD as the housekeeping drive is not supported.

  • Using NVMe SSDs for hybrid servers is not supported.

The steps for replacing NVMe SSDs are as follows:

Step 1. Confirm that the failed disk is an NVMe cache SSD by performing a physical check. These drives are in Drive Bay 1 of the HX servers. NVMe cache SSDs and housekeeping SSDs do not respond to beacon requests. If the failed SSD is not an NVMe SSD, see the earlier section “Replacing SSDs.”

Step 2. Put the ESXi host into HX maintenance mode by logging in to HX Connect and selecting System Information > Nodes > node > Enter HX Maintenance Mode.

Step 3. Follow the instructions for removing and replacing a failed SSD in the host, per the server hardware guide.

Note

When you remove an HGST NVMe disk, the controller VM fails until you reinsert a disk of the same type into the same slot or reboot the host.

After the cache or persistent drive is replaced, HX Data Platform identifies the SSD and updates the storage cluster. When disks are added to a node, the disks are immediately available for HX consumption.

Step 4. Reboot the ESXi host. This enables ESXi to discover the NVMe SSD.

Step 5. Exit the ESXi host from HX maintenance mode.

Step 6. To enable the Cisco UCS Manager to include new disks in the UCS Manager > Equipment > Server > Inventory > Storage tab, re-acknowledge the server node. This applies to cache and persistent disks.

Note

Re-acknowledging a server is disruptive, so place the server into HX maintenance mode before doing so.

Step 7. If you replaced an SSD and see the message “Disk successfully scheduled for repair,” it means that the disk is present but is still not functioning properly. Check that the disk has been added correctly per the server hardware guide procedures.

Replacing Housekeeping SSDs

This procedure applies to HXAF220c M4, HX220c M4, HXAF220c M5, HX220c M5, HXAF240c M5, and HX240c M5 servers only. To replace the housekeeping SSD on an HXAF240c M4 or HX240c M4 server, contact Cisco TAC.

To replace a housekeeping SSD, perform the following steps:

Step 1. Identify the failed housekeeping SSD by physically checking the SSD drives, as housekeeping drives are not listed through a beacon check.

Step 2. Remove the SSD and replace it with a new SSD of the same kind and size, following the physical replacement procedure described in the server hardware guide.

Note

Before performing the hardware steps, put the node into HX maintenance mode. After performing the hardware steps, exit the node from HX maintenance mode.

Step 3. Using SSH, log in to the storage controller VM of the affected node and run the following command:

# /usr/share/springpath/storfs-appliance/config-bootdev.sh
-r -y

This command consumes the new disk, adding it into the storage cluster.

A sample response might resemble the following:

Creating partition of size 65536 MB for /var/stv ...
Creating ext4 filesystem on /dev/sdg1 ...
Creating partition of size 24576 MB for /var/zookeeper ...
Creating ext4 filesystem on /dev/sdg2 ...
Model: ATA INTEL SSDSC2BB12 (scsi)
Disk /dev/sdg: 120034MB
Sector size (logical/physical): 512B/4096B
Partition Table: gpt ....
discovered. Rebooting in 60 seconds

Step 4. Wait for the storage controller VM to automatically reboot.

Step 5. When the storage controller VM completes its reboot, verify that partitions are created on the newly added SSD by running the df -ah command.

A sample response might resemble the following:

...........
/dev/sdb1    63G  324M   60G   1% /var/stv
/dev/sdb2    24G  173M   23G   1% /var/zookeeper

Step 6. Identify the HX Data Platform installer package version installed on the existing storage cluster by running the stcli cluster version command. The same version must be installed on all the storage cluster nodes. Run this command on the controller VM of any node in the storage cluster but not the node with the new SSD.
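For reference, the check from a controller VM looks like the following; the output lists the installed version (the version string shown here is illustrative):

# stcli cluster version
Cluster version: 3.5(2h)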

Step 7. Copy the HX Data Platform installer packages into the storage controller VM in the /tmp folder:

# scp <hxdp_installer_vm_ip>:/opt/springpath/packages/storfs-
packages-<hxdp_installer>.tgz /tmp

# cd /tmp

# tar zxvf storfs-packages-<hxdp_installer>.tgz

Step 8. Run the HX Data Platform installer deployment script:

# ./inst-packages.sh

Note

For additional information on installing the HX Data Platform, see the appropriate Cisco HX Data Platform Install Guide.

Step 9. After the package installation, the HX Data Platform starts automatically. Check the status:

# status storfs
storfs running

The node with the new SSD rejoins the existing cluster, and the cluster returns to a healthy state.

Replacing or Adding HDDs

This section covers how to replace or add HDDs in HyperFlex servers/systems.

Note

Mixing storage disk types or sizes on a server or across a storage cluster is not supported.

The requirements for using HDDs in HX are as follows:

  • Use all HDDs or all 3.8 TB SSDs or all 960 GB SSDs.

  • Use the hybrid cache device on hybrid servers and all flash cache devices on all flash servers.

  • When replacing cache or persistent disks, always use the same type and size as the original disk.

To replace or add an HDD, follow these steps:

Step 1. Refer to the hardware guide for your server and follow the directions for adding or replacing disks.

Step 2. Add HDDs of the same size to each node in the storage cluster.

Step 3. Add the HDDs to each node within a reasonable amount of time, because the storage cluster starts consuming the new storage immediately.

After performing the steps, the vCenter event log will display messages reflecting the changes to the nodes.

Note

When disks are added to a node, the disks are immediately available for HX consumption, although they are not seen in the UCSM server node inventory. This includes cache and persistent disks.

Step 4. To enable Cisco UCS Manager to include new disks in the UCS Manager > Equipment > Server > Inventory > Storage tab, re-acknowledge the server node. This applies to cache and persistent disks.

Note

Re-acknowledging a server is disruptive, so place the server into HX maintenance mode before doing so.

Upgrading HyperFlex Software

This section describes how to upgrade an existing installation of Cisco HX Data Platform. Cisco HyperFlex systems have several components that may be upgraded, depending on the environment. The core components in a HyperFlex system are:

  • Cisco UCS server firmware (the UCS C-bundle, which consists of UCS server, BIOS, CIMC, NIC, and so on)

  • Cisco HX Data Platform software

  • VMware ESXi software

The following sections describe the HyperFlex upgrade process.

Upgrading HyperFlex

This section provides an overview of the HyperFlex software upgrade workflow. The complete upgrade process can be divided into two parts (see Figure 5-41):

  • Pre-upgrade steps:

    • Download the relevant UCS Infra, B-Series, C-Series, and HX upgrade storfs bundles.

    • Perform a manual pre-upgrade validation and run the Hypercheck pre-upgrade utility for a pre-upgrade and health check.

    • Upgrade the UCS infrastructure firmware.

  • Upgrade steps: Upgrade HX Data Platform, UCS server firmware, and VMware ESXi vSphere.

Images

Figure 5-41 HyperFlex Upgrade Workflow

The following sections cover the upgrade workflow steps in detail.

Pre-Upgrade Workflow

Ensure you have reviewed the following important guidelines before scheduling a HyperFlex upgrade:

Step 1. Review the resolved caveats, the open caveats, and the new features for the release before upgrading. Refer to the corresponding Cisco HX Data Platform release notes at https://www.cisco.com/c/en/us/support/hyperconverged-systems/HyperFlex-hx-data-platform-software/products-release-notes-list.html.

Step 2. Review the supported versions and system requirements. Refer to the hardware and software interoperability information for Cisco HyperFlex HX-Series. Be sure to verify that you have the latest software bundle versions and review the software versions. Refer to the latest Cisco HX Data Platform release notes.

Note

Hardware and software interoperability information can be found in the release notes for specific HX releases.

Step 3. Back up the configuration to an All Configuration backup file. See the Cisco UCS Manager Backing Up and Restoring the Configuration Guide for the detailed steps, available at https://www.cisco.com/c/en/us/td/docs/unified_computing/ucs/ucs-manager/GUI-User-Guides/Admin-Management/3-1/b_Cisco_UCS_Admin_Mgmt_Guide_3_1/b_Cisco_UCS_Admin_Mgmt_Guide_3_1_chapter_01001.html.

Step 4. Before you perform firmware updates, use the Cisco UCS Manager firmware management interface to download relevant images to the fabric interconnect. Images are stored in bootflash partitions in the fabric interconnect.

Step 5. If needed, perform an ESXi upgrade when upgrading to newer Cisco HyperFlex HX Data Platform versions. See the supported versions for each HX Data Platform version in HyperFlex Software Versions.

Step 6. Keep SSH enabled on all ESXi hosts.

Step 7. Enable vMotion so that VMs can be moved automatically during the upgrade, and verify that MTUs are set as required in the environment.

Step 8. Verify that the HyperFlex cluster is healthy.

Step 9. Verify that the cluster is in lenient mode. If it is not, set the cluster to lenient mode.

Downloading UCS Infra, B-Series, C-Series, and Storfs Bundles

For a successful HyperFlex upgrade, download the Cisco HyperFlex System component bundles from the Cisco HyperFlex Downloads page on Cisco.com. Browse to https://software.cisco.com/download/home and search for each of the following, depending on the version you are upgrading to:

  • HX Data Platform upgrade bundle (.tgz file): As an example, Figure 5-42 shows the HyperFlex Data Platform Upgrade bundle for upgrading HX clusters to version 3.5(2h).

Images

Figure 5-42 HX Data Platform Upgrade Bundle

  • VMware ESXi Offline Zip bundle: As an example, Figure 5-43 shows the offline bundle for upgrading ESXi using Cisco HX Custom Image to version 6.5U3.

Images

Figure 5-43 Offline Bundle for Upgrading ESXi: Cisco HX Custom Image

  • Cisco UCS infrastructure bundle, blade firmware bundle, and rack-mount firmware bundle: As an example, Figure 5-44 shows the UCS Infrastructure software bundle for 6200, 6300, and 6400 fabric interconnects, UCS B-Series blade server software, and UCS C-Series rack-mount servers for firmware version 4.0(4g).

Images

Figure 5-44 UCS Infrastructure, Blade, and Rack-Mount Firmware Bundle

After the Cisco UCS bundles and firmware are downloaded, they need to be copied to Cisco UCS Manager before you start the HyperFlex upgrade process. To upload the firmware files to UCS, log in to UCSM, browse to Equipment > Firmware Management > Download Tasks, and click on Download Firmware. Next, browse to the software package’s location and click OK to upload it to UCSM. As an example, Figure 5-45 shows the UCS B-Series 4.0(4e) package upload on UCSM.

Images

Figure 5-45 Downloading UCS Software Packages on UCSM

Note

Download the UCS B-Series bundle and upload it to UCSM even if you don’t have compute blades. This is a prerequisite for the UCS firmware upgrade.

Verifying the Pre-Upgrade UCS Server Firmware (C-Bundle) Version

Before performing the upgrade, you need to verify the current UCS server firmware by using one of the two methods described in the sections that follow.

Using UCS Manager

To verify the current UCS server firmware by using Cisco UCS Manager, follow these steps:

Step 1. Log in to UCS Manager.

Step 2. Select the Server tab.

Step 3. Select the host firmware package policy by navigating to Policies > Root > Sub-Organizations > <hx-cluster> > Host Firmware Packages > HyperFlex.

Step 4. Under properties, note the current rack package version, which should be listed as X.Y(Z)C. For example, Figure 5-46 shows version 4.0(2d)C.

Images

Figure 5-46 Verifying the Server Firmware Version

Using HX Connect

To verify the current UCS server firmware by using Cisco HyperFlex Connect, follow these steps:

Step 1. Log in to HX Connect.

Step 2. In the navigation pane, select Upgrade.

Step 3. Select the UCS Firmware checkbox and click Discover.

Step 4. Note the current C-bundle version displayed. Figure 5-47 shows an example where the current server firmware is 4.0(2d).

Images

Figure 5-47 Verifying the Server Firmware: HX Connect

Pre-Upgrade Validation

This section lists the checks that are part of the HyperFlex pre-upgrade validations. Perform the following validations (which are all described in more detail in the sections that follow) on each HyperFlex node before moving on to upgrade the next node in the cluster:

  • Verify that the HyperFlex cluster is healthy and online. Verify that all HyperFlex cluster nodes are connected to vCenter and are online.

  • Verify the cluster storage capacity.

  • Verify that DRS is enabled and set to fully automated.

  • Verify the Net.TeamPolicyUpDelay default value.

  • Verify that vSphere services are running and the ESXi Agent Manager (EAM) health is normal.

  • Verify the health of the cluster in Cisco UCS Manager.

  • Verify the vMotion interface.

  • Verify upstream network connectivity.

  • Configure the cluster access policy in lenient mode.

  • Verify that no major alarms are reported for the HyperFlex cluster in HyperFlex Connect.

Viewing the HyperFlex Cluster Health

From HyperFlex Connect, select the System Information > Nodes page and verify that the HyperFlex cluster is healthy and online.

From the vSphere Web Client navigator, select vCenter Global Inventory Lists > Cisco HyperFlex Systems > Cisco HX Data Platform > cluster > Summary. View the cluster widget to verify that the HyperFlex cluster is healthy and online.

Also verify that all HX cluster nodes are connected to vCenter and are online.
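You can also check the cluster health from the CLI by logging in to any storage controller VM over SSH; the output shown here is representative of a healthy cluster:

# stcli cluster info | grep -i health
healthState: healthy
state: healthy
storage cluster is healthy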

Note

The HyperFlex cluster health verification check is automatically performed by the Hypercheck utility.

Checking the Cluster Storage Capacity

It is recommended that you check the cluster storage capacity before starting the upgrade of an existing installation of Cisco HX Data Platform. If the storage cluster capacity is above 70%, it is highly recommended to either reduce the amount of storage capacity used or increase the storage capacity by adding new nodes or disks. This check is important because if a node goes down when the cluster is this full, the cluster cannot rebalance and remains unhealthy (although still online).
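One way to review current capacity utilization is from a storage controller VM; the following is a minimal sketch (detailed output omitted), and the reported used capacity should be compared against the 70% guideline:

# stcli cluster storage-summary --detail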

Note

Cluster storage capacity (enospace) verification check is automatically performed by the Hypercheck utility.

Verifying That DRS Is Enabled

Follow these steps to ensure that DRS is enabled on the node:

Step 1. From the vSphere Web Client navigator, select vCenter Inventory Lists > Clusters > cluster > Configure tab. Verify that DRS is enabled.

Step 2. Click the vSphere DRS tab. Ensure that Migration Automation Level is set to Fully Automated, as shown in Figure 5-48.

Images

Figure 5-48 DRS Settings: Enabled and Fully Automated

Verifying and Configuring the Net.TeamPolicyUpDelay Default Value

To avoid loss of storage access during fabric interconnect reboots for firmware updates, perform this check prior to UCSM infrastructure upgrade.

Upgrades to 3.5(2) require that the default value of the ESXi host Net.TeamPolicyUpDelay be set to 30000. Complete the following steps to verify and, if needed, modify the default value of the ESXi host Net.TeamPolicyUpDelay to 30000 (see Figure 5-49):

Note

The Net.TeamPolicyUpDelay value check is automatically performed by the Hypercheck utility.

Images

Figure 5-49 Advanced Configuration: Net.TeamPolicyUpDelay

Step 1. From the vSphere Web Client navigator, select ESXi Host > Configure > System > Advanced System Settings.

Step 2. In Advanced System Settings, scroll down to Net.TeamPolicyUpDelay.

Step 3. If needed, change the value to 30000. The default value is 100.
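If you prefer to verify or change this setting from the ESXi shell rather than the Web Client, the equivalent esxcli commands would resemble the following sketch (run on each ESXi host and confirm the syntax against your ESXi version):

# esxcli system settings advanced list -o /Net/TeamPolicyUpDelay
# esxcli system settings advanced set -o /Net/TeamPolicyUpDelay -i 30000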

Viewing ESX Agent Manager

From the vSphere Web Client navigator, select Administration > vCenter Server Extensions > vSphere ESX Agent Manager > Configure > ESX Agencies.

Verify that the ESX Agent Manager (EAM) status is normal, as shown in Figure 5-50.

Images

Figure 5-50 EAM Status

Note

Starting with the HX 4.0 release, it is recommended to remove EAM from HyperFlex clusters that are upgraded from older releases. New installations of HX 4.0 do not use EAM.

Verifying the Health of a HyperFlex Cluster in Cisco UCS Manager

Cisco UCS Manager health checks help with verifying the status of the UCS infrastructure and servers before you perform an upgrade. Use the following steps for verification:

Step 1. Verify whether the high availability status of the fabric interconnects shows that both fabric interconnects are up and running. Log in to UCSM and browse to Equipment > Fabric Interconnect > FIA > Expand High Availability Details and verify that the Ready and State fields show Yes and Up, respectively, as shown in Figure 5-51.

Images

Figure 5-51 High Availability Status

Step 2. Verify that the data path is up and running by entering the following commands:

a. To enter NX-OS mode for the fabric interconnect, enter the following command:

UCS-A /fabric-interconnect # connect nxos {a | b }

b. To determine the number of active Ethernet interfaces, enter the following command:

UCS-A(nxos)# show int br | grep -v down | wc -l

Verify that the number returned matches the number of Ethernet interfaces that were up prior to the upgrade.

c. To determine the total number of MAC addresses, enter the following command:

UCS-A(nxos)# show platform fwm info hw-stm | grep '1.' |
wc -l

Verify that this number matches the number of MAC addresses prior to the upgrade.

See the Cisco UCS Manager Firmware Management Guide (https://www.cisco.com/c/en/us/td/docs/unified_computing/ucs/ucs-manager/GUI-User-Guides/Firmware-Mgmt/3-2/b_UCSM_GUI_Firmware_Management_Guide_3_2.html) for more information.

Step 3. Verify that the HyperFlex servers have no faults that might impact an upgrade. For example, Figure 5-52 shows an example of a HyperFlex server on Cisco UCSM that has status OK and doesn’t have any critical, major, or minor alerts.

Images

Figure 5-52 HyperFlex Server Status and Faults

Verifying vMotion Interfaces

Make sure the vMotion VMkernel interfaces necessary for vMotion to work are configured on each ESXi host. You must predefine vMotion networking by creating a vSwitch and defining the vNICs and VLANs in UCS Manager.

Before you perform maintenance operations on a Cisco HyperFlex cluster, you need to verify that all nodes in the HX cluster are configured for vMotion.
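As a quick spot check from an ESXi host, you can list the VMkernel interfaces and then test vMotion connectivity to another host. The following sketch assumes vmk2 is the vMotion interface; substitute the interface and peer address used in your environment:

# esxcli network ip interface list
# vmkping -I vmk2 <vMotion IP address of another host>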

Note

You can create vMotion interfaces by using a post_install script. Refer to Chapter 3, “Installing HyperFlex,” for details. The vMotion verification check is automatically performed by the Hypercheck utility.

Verifying Upstream Network Connectivity

You need to ensure that the hx-storage-data and vMotion upstream switches are configured for jumbo frames. Skipping this step could lead to input/output interruption during Cisco UCS infrastructure upgrade. Follow this process:

Step 1. Put a node in Cisco HX maintenance mode.

Step 2. SSH to the ESXi host you placed in maintenance mode in step 1.

Step 3. Verify that ping is working by pinging the corresponding vmk1 IP interface of another host.

If using jumbo frames, use the following command:

# vmkping -I vmk1 -d -s 8972 <data IP of address of another
host>

If not using jumbo frames, use the following command:

# vmkping -I vmk1 -d -s 1472 <data IP of address of another
host>

Step 4. Swap the active interfaces in vswitch-hx-storage-data to force traffic upstream:

# esxcli network vswitch standard policy failover set -a
vmnic1 -s vmnic5 -v vswitch-hx-storage-data

Step 5. Again verify that the ping is working by pinging the corresponding vmk1 IP interface of another host.

If using jumbo frames, use the following command:

# vmkping -I vmk1 -d -s 8972 <data IP of address of another
host>

If the ping fails, try again with this command:

# vmkping -I vmk1 -d -s 1472 <data IP of address of another
host>

If not using jumbo frames, use the following command:

# vmkping -I vmk1 -d -s 1472 <data IP of address of another
host>

Note

If the ping fails, do not proceed to upgrade the Cisco UCS firmware. Investigate the network configuration, including the upstream switch, to determine the cause of the failure.

Step 6. Return the interface back to the defaults even if the ping fails:

# esxcli network vswitch standard policy failover set -a
vmnic5 -s vmnic1 -v vswitch-hx-storage-data

Note

vmnic5 and vmnic1 are supported for upgrade from 3.5(x) and later releases of HyperFlex.

Step 7. Exit the node from Cisco HX maintenance mode.

Configuring the Cluster Access Policy in Lenient Mode

The cluster access policy works with the data replication factor to set levels of data protection and data loss prevention. There are two cluster access policy options:

  • Strict mode: Applies policies to protect against data loss. If nodes or disks in the storage cluster fail, the cluster’s ability to function is affected. If more than one node fails or one node and disk(s) on a different node fail, this is called a simultaneous failure. The strict setting helps protect the data in the event of simultaneous failures.

  • Lenient mode: Applies policies to support longer storage cluster availability. This is the default.

The mode is not configurable during installation but can be changed after installation and initial storage cluster configuration.

To modify the cluster access policy to lenient, use the following procedure:

Step 1. SSH to any one of the controller VMs and log in as root.

Step 2. Check whether lenient mode is already configured:

#stcli cluster get-cluster-access-policy

Step 3. If the mode is set to strict, change it to lenient:

#stcli cluster set-cluster-access-policy --name lenient

If it is already set to lenient, no further action is required.

Step 4. Confirm the change:

# stcli cluster info | grep -i policy

Verifying That No Major Alarms Are Reported for the HyperFlex Cluster in HyperFlex Connect

Log in to HX Connect, browse to Alarms, and verify that there are no major alarms, as shown in Figure 5-53.

Images

Figure 5-53 HyperFlex Connect: No Major Alarms

Hypercheck Utility

The Hypercheck tool is a utility that performs proactive checks on HyperFlex systems to ensure their stability and resiliency. It provides an automated set of health and pre-upgrade checks on HyperFlex systems to save time during HyperFlex upgrade and maintenance operations.

Cisco recommends running the proactive Hypercheck health check utility on a HyperFlex cluster prior to upgrade. Such checks provide early visibility into any areas that may need attention and will help ensure a seamless upgrade experience.

Note

Most of the health check and pre-validation/pre-upgrade checks on HyperFlex Data Platform are included as a part of the Hypercheck utility and are automatically performed when you run Hypercheck on HyperFlex.

The Hypercheck utility was developed by Cisco CX engineers who troubleshoot HyperFlex every day. It is hosted on the CiscoDevNet GitHub repository and is updated regularly whenever Cisco CX identifies a new check (https://github.com/CiscoDevNet/Hyperflex-Hypercheck).

Storage Controller VM and ESXi Node Checks

Hypercheck performs the following checks on each storage controller VM:

  • Cluster services check: Verifies the status of the storfs, stMgr, and stNodeMgr services.

  • Enospc state check: Checks whether the cluster space usage is above the warning threshold.

  • Zookeeper check: Checks whether Zookeeper is running.

  • Exhibitor check: Verifies the status of the Exhibitor service, which manages Zookeeper.

  • HDD health check: Reports if there is any blacklisted disk in a cluster.

  • DNS check: Checks whether DNS is configured and reachable.

  • vCenter reachability check: Checks whether vCenter is reachable on the required ports.

  • Timestamp check: Checks whether all the controller VMs have exactly the same time.

  • NTP sync check: Checks whether NTP is reachable from the storage controller VMs.

  • Check package and versions: Checks for packages and versions on controller VMs.

  • Check iptables count: Compares iptables counts on all controller VMs.

  • Cluster upgrade check: Checks whether there are any previous stale upgrade entries.

  • Extra pnodes check: Looks for any extra/duplicate pnode entries in the cluster.

  • Disk usage (/var/stv) check: Checks whether the utilization of /var/stv is more than 80%.

  • Disk usage (/var/zookeeper) check: Checks the utilization of /var/zookeeper.

  • Out of memory check: Checks through the log files if the cluster had any out-of-memory events.

  • Supported vSphere check: Lists the vSphere versions supported with the current HX Data Platform version.

  • Network checks: Checks the connectivity between the management and storage networks.

Hypercheck performs the following checks on each ESXi node:

  • HX user account check: Verifies whether hxuser is on all the hosts and has admin rights.

  • vMotion enabled check: Checks whether the vMotion network is configured.

  • vMotion reachability check: Checks the connectivity between vMotion networks.

  • Check for ESXI failback timer: Checks for the ESXi failback timer on ESXi hosts.

  • Network check: Sends a ping to vmk0, eth0, and eth1 (between the management and storage networks).

  • Dumpfile in springpathDS: Checks whether SpringpathDS is configured for a coredump.

  • VMware Tools location check: Checks whether the VMware Tools location was modified from the default.

Installing and Running Hypercheck

To install and run Hypercheck, perform the following steps:

Step 1. Download HyperFlex-Hypercheck.zip from https://github.com/CiscoDevNet/HyperFlex-Hypercheck, as shown in Figure 5-54.

Images

Figure 5-54 Downloading Hypercheck Zip

Note

Use only the script downloaded from the CiscoDevNet GitHub account.

Always download the latest version of the tool before you use it. This tool is enhanced frequently, and using an older version might result in missing important checks.

Step 2. Upload the tool to the Storage Controller VM (SCVM) with the Cluster Management IP (CMIP). Use your preferred method (scp/sftp/ftp/tftp) to copy the HyperFlex-Hypercheck.zip to the /tmp directory.
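For example, from a workstation with network access to the cluster, the copy might look like the following (substitute your cluster management IP):

# scp HyperFlex-Hypercheck.zip root@<cluster-management-ip>:/tmp/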

Step 3. Extract the contents of HyperFlex-Hypercheck.zip:

a. Type cd /tmp to change to the /tmp directory:

root@SpringpathController7PVQWP6WV1:~# cd /tmp/

b. Type unzip HyperFlex-Hypercheck.zip to extract the files:

root@SpringpathController7PVQWP6WV1:/tmp# unzip
HyperFlex-Hypercheck.zip

Archive:  HyperFlex-Hypercheck.zip
b61c59f7962b72902692ce70548ba3d760efdf06
   creating: HyperFlex-Hypercheck/
  inflating: HyperFlex-Hypercheck/HXTool.py
  inflating: HyperFlex-Hypercheck/LICENSE.txt
  inflating: HyperFlex-Hypercheck/ReadMe.txt
  inflating: HyperFlex-Hypercheck/TestInfo.txt
  inflating: HyperFlex-Hypercheck/prettytable.py
  inflating: HyperFlex-Hypercheck/progressbar.py
root@SpringpathController7PVQWP6WV1:/tmp#

Step 4. Execute the HXTool Python script:

a. Type cd HyperFlex-Hypercheck to go to the HyperFlex-Hypercheck directory:

root@SpringpathControllerABCDE01234:/tmp# cd HyperFlex-
Hypercheck

b. Type python HXTool.py to execute the script and provide the root passwords for the controller VM and ESX:

root@SpringpathControllerABCDE01234:/tmp/HyperFlex-
Hypercheck# python HXTool.py
                 HX Health Check 3.8
Please enter below info of HX-Cluster:

Enter the HX-Cluster Root Password:

Enter the ESX Root Password:

The tool then starts the automated checks on HyperFlex and takes around three to five minutes to complete the checks on a four-node cluster (though the duration varies depending on the cluster size).

Note

To stop the script execution, press Ctrl+Z; the script stops immediately.

Example 5-1 shows sample HXTool output from a three-node cluster. It shows the progress bar during the process and gives a binary PASS/FAIL for each of the checks performed. If one or more checks fail, perform the corresponding corrective action or contact Cisco TAC to fix the issue before proceeding with the upgrade process.

Example 5-1 Sample HXTool Output from a Three-Node Cluster

root@SpringpathControllerABCDE01234:/tmp/HyperFlex-Hypercheck# python HXTool.py

                 HX Health Check 3.8

Please enter below info of HX-Cluster:
Enter the HX-Cluster Root Password:
Enter the ESX Root Password:

SSH connection established to HX Node: 10.X.X.79
SSH connection established to HX Node: 10.X.X.80
SSH connection established to HX Node: 10.X.X.81

<Output Truncated ...>

HX Controller: 10.X.X.79
Cluster services check          [########################] PASS
ZooKeeper & Exhibitor check     [########################] PASS
HDD health check                [########################] PASS
Pre-Upgrade Check               [########################] PASS
Network check                   [########################] PASS

<Similar Output Truncated ...>

HX Controller: 10.X.X.79
Test Summary:
+----------------------------------+-----------+----------+
| Name                             | Result    | Comments |
+----------------------------------+-----------+----------+
| Cluster services check           | PASS      |          |
+----------------------------------+-----------+----------+
| Enospc state check               | PASS      |          |
+----------------------------------+-----------+----------+
| Zookeeper check                  | PASS      |          |
+----------------------------------+-----------+----------+
| Exhibitor check                  | PASS      |          |
+----------------------------------+-----------+----------+
| ZooKeeper Disk Usage             | PASS      |          |
+----------------------------------+-----------+----------+
| HDD health check                 | PASS      |          |
+----------------------------------+-----------+----------+
| DNS check                        | PASS      |          |
+----------------------------------+-----------+----------+
| vCenter reachability check       | PASS      |          |
+----------------------------------+-----------+----------+
| Timestamp check                  | PASS      |          |
+----------------------------------+-----------+----------+
| NTP sync check                   | PASS      |          |
+----------------------------------+-----------+----------+
| Check package & versions         | PASS      |          |
+----------------------------------+-----------+----------+
| Check Iptables count             | PASS      |          |
+----------------------------------+-----------+----------+
| Cluster upgrade status           | PASS      |          |
+----------------------------------+-----------+----------+
| Extra pnodes check               | PASS      |          |
+----------------------------------+-----------+----------+
| Memory usage check               | PASS      |          |
+----------------------------------+-----------+----------+
| Incidence of OOM in the log file | PASS      |          |
+----------------------------------+-----------+----------+
| Supported vSphere versions       | 6.0.0-U3  |          |
|                                  | 6.5.0-U1  |          |
|                                  | 6.5.0-U2  |          |
|                                  | 6.5.0-U3  |          |
|                                  | 6.7.0-UGA |          |
|                                  | 6.7.0-U1  |          |
|                                  | 6.7.0-U2  |          |
|                                  | 6.7.0-U3  |          |
+----------------------------------+-----------+----------+
| Check permissions for /tmp       | PASS      |          |
+----------------------------------+-----------+----------+

<Similar Output Truncated ...>

ESX Host: 10.X.X.76
+--------------------------------------+--------+----------+
| Name                                 | Result | Comments |
+--------------------------------------+--------+----------+
| HX User Account check                | PASS   |          |
+--------------------------------------+--------+----------+
| vMotion enabled check                | PASS   |          |
+--------------------------------------+--------+----------+
| vMotion reachability check           | PASS   |          |
+--------------------------------------+--------+----------+
| Check for ESXI Failback timer        | PASS   |          |
+--------------------------------------+--------+----------+
| Check ping to vmk0, eth0, eth1       | PASS   |          |
+--------------------------------------+--------+----------+
| No extra controller vm folders check | PASS   |          |
+--------------------------------------+--------+----------+
| VMware Tools location check          | PASS   |          |
+--------------------------------------+--------+----------+

<Similar Output Truncated ...>

Main Report File: HX_Tool_Main_Report_15-03-2020_05-24-12_.txt
Report tar file: HX_Report_15_03_2020_17_15_17.tar
Report file copied to path: /var/log/springpath

Release Notes:
https://www.cisco.com/c/en/us/support/hyperconverged-systems/HyperFlex-hx-data-
  platform-software/products-release-notes-list.html

Upgrade Guides:
https://www.cisco.com/c/en/us/support/hyperconverged-systems/HyperFlex-hx-data-
  platform-software/products-installation-guides-list.html

Note:
1) Please check the status of Compute nodes manually, script only verifies the
   config on the converged nodes.
2) Hypercheck doesnot perform FAILOVER TEST, so please ensure that the upstream is
   configured for network connectivity for JUMBO or NORMAL MTU size as needed.

Be sure to resolve all the issues identified during the preceding verification process and then proceed to the next step of performing the HyperFlex upgrade procedure.

Upgrading UCS Infrastructure Firmware

This section covers how to upgrade UCS infrastructure firmware. The steps are as follows:

Step 1. Open the UCS Manager GUI.

Step 2. Select Equipment > Firmware Management > Firmware Auto Install.

Step 3. Click Install Infrastructure Firmware (see Figure 5-55). Check the Ignore All box if the warnings are not critical to the user environment and click Next.

Images

Figure 5-55 Firmware Auto Install

Step 4. Select the Infra Pack from the drop-down list and check the Upgrade Now box to immediately begin the firmware upgrade. Click Finish.

The Cisco UCS Manager GUI might disconnect. This is expected during a UCS Manager upgrade as UCS Manager is stopped and then restarted at the new version. Wait until UCS Manager goes back online. Log back in to UCS Manager to complete the next steps.

Step 5. Click Finish, and the UCS Manager software upgrades (see Figure 5-56).

Images

Figure 5-56 Installing the Infrastructure Firmware

Step 6. Log back in to UCS Manager, wait for the subordinate FI to be activated, and then select Equipment > Installed Firmware > Fabric Interconnects.

Step 7. Verify that the subordinate FI has rebooted and joined the UCS cluster:

a. Check the kernel and switch versions of the FI.

b. Ensure that the FI has no fault.

c. Ensure that the FI cluster membership is Subordinate (see Figure 5-57).

Images

Figure 5-57 Checking the Subordinate FI Version

Step 8. Wait until HX traffic is repinned to both FIs. Wait for UCS Manager vNIC faults to be cleared.

Step 9. Verify that the HX cluster is online and healthy before rebooting the primary fabric interconnect. (Refer to the section “Viewing the HyperFlex Cluster Health,” earlier in this chapter, for more on how to check HX cluster health.)

Step 10. In the UCS Manager GUI, on the toolbar, click Pending Activities (see Figure 5-58). Click the Fabric Interconnects tab to see the tasks requiring user acknowledgment before they can be completed.

Images

Figure 5-58 Pending Activities

Step 11. Click Reboot now for each pending activity that you want to deploy immediately (see Figure 5-59). Cisco UCS Manager immediately reboots the primary FI.

Images

Figure 5-59 Fabric Interconnect: Reboot Now

Step 12. Check the status to determine whether the subordinate FI has become the primary FI.

Step 13. Verify that the FI has rebooted and joined the UCS cluster as the subordinate. Wait until HX traffic is repinned to both FIs.

Step 14. In the UCS Manager GUI, wait until all server vNIC faults have been cleared and verify the currently installed infrastructure version (see Figure 5-60).

Images

Figure 5-60 UCS: Installed Firmware

Step 15. Verify that the HX Cluster is online and healthy after rebooting the FI. (For more information on how to check the HX cluster health, see the section “Viewing the HyperFlex Cluster Health,” earlier in this chapter.)

Upgrade Procedure

An upgrade can occur online or offline. Using either of these methods, you can choose to perform any of the following:

  • Combined upgrade: Combined upgrade means upgrading all three components together: HX Data Platform, UCS, and ESXi.

  • Split upgrade: Split upgrade means upgrading HX Data Platform first and then upgrading either UCS and/or ESXi. To perform a split upgrade, you must upgrade HX Data Platform first. After HX Data Platform is upgraded to 3.5(1x), you can perform a split upgrade of UCSM only, ESXi only, or both.

Note

The UCS infrastructure is upgraded first in both the combined upgrade and split upgrade processes.

Recommended Upgrade Method

For both the combined upgrade and the split upgrade, Cisco recommends upgrading the HyperFlex components in the following order for optimizing the upgrade time:

  1. Upgrade the Cisco UCS infrastructure.

  2. Upgrade Cisco HX Data Platform.

  3. Upgrade Cisco customized VMware ESXi.

  4. Upgrade the Cisco UCS firmware.

Online Upgrade Process

When using the online upgrade process workflow, consider the following:

  • First upgrade the Cisco UCS infrastructure to the latest version and then use the automated upgrade workflow for a combined upgrade of Cisco UCS firmware and Cisco HX Data Platform. An online upgrade uses host firmware packages to upgrade all server endpoints.

  • During an online upgrade, as one node is being upgraded (placed into maintenance mode), the number of tolerated node failures is reduced based on the Data Replication Factor and Access Policy settings.

  • If upgrading both HX Data Platform and the UCS firmware, a combined upgrade can be selected through HX Connect, depending on the length of the maintenance window.

Note

Do not use the Firefox browser for an online upgrade. It is not supported because it is bundled with an outdated version of Flash.

Online Upgrade Workflow Steps

To do an online upgrade, follow these steps:

Step 1. Download the Cisco UCS infrastructure A, blade bundle B, and rack bundle C on UCSM, as discussed in the section “Downloading UCS Infra, B-Series, C-Series, and Storfs Bundles” earlier in this chapter.

Note

Download the UCS B-Series bundle and upload it to UCSM even if you don’t have compute blades. This is a prerequisite for the UCS firmware upgrade.

Step 2. Ensure that the hx-storage-data and vMotion upstream switches are configured for full network failover capability. Otherwise, the HyperFlex cluster goes offline, and all datastores unmount from the ESXi hosts. For a reminder of how to perform this check, see the section “Verifying Upstream Network Connectivity,” earlier in this chapter.

Step 3. Upgrade the Cisco UCS infrastructure as required.

Note

It is important that you manually upgrade the UCS infrastructure before initiating the upgrade sequence for the HyperFlex components, as described earlier in this chapter, in the section “Upgrading UCS Infrastructure Firmware.” The upgrade feature of the HX Platform software will not upgrade the UCS infrastructure bundle. This upgrade is a separate process (Auto Install) that can be performed from UCSM.

Step 4. Disable the snapshot schedule by using the command stcli snapshot-schedule --disable.

Step 5. Initiate the upgrade from the HX Connect UI, as discussed in the next section.

Step 6. Confirm that the upgrade task is complete.

Step 7. Enable the snapshot schedule by using the command stcli snapshot-schedule --enable.
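For reference, the snapshot schedule commands in steps 4 and 7 are run from a storage controller VM (typically over SSH to the cluster management IP):

# stcli snapshot-schedule --disable
# stcli snapshot-schedule --enable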

Upgrading a HyperFlex Cluster Using the HX Connect UI

To upgrade a HyperFlex cluster using the HX Connect UI, follow these steps:

Step 1. Use the auto bootstrap process to upgrade the Cisco HX Data Platform management packages. Log in to HX Connect by navigating to https://<storage-cluster-management-ip>, enter the administrative username and password, and click Login.

Step 2. In the navigation pane, select Upgrade. The Select Upgrade Type page appears, as shown in Figure 5-61.

Images

Figure 5-61 Selecting the Upgrade Type in HyperFlex Connect

Step 3. Upload the new Cisco HX Data Platform upgrade package, enter your vCenter credentials, as illustrated in Figure 5-62 and Figure 5-63, and then click Upgrade.

Images

Figure 5-62 Uploading the HX Data Platform Upgrade Package: Validation

Images

Figure 5-63 HX Data Platform Upgrade Package: Upload Success

Step 4. In the Initiating Pre-Upgrade message box that appears (see Figure 5-64), click Confirm. The Pre-Upgrade Process auto bootstrapping and updating management plug-in begins to run on all nodes, as shown in Figure 5-65.

Images

Figure 5-64 Initiating the Pre-Upgrade Process

Images

Figure 5-65 Pre-Upgrade in Progress: Auto Bootstrapping

Step 5. When the Pre-Upgrade process is complete, and the HX Connect UI prompts you with the onscreen message shown in Figure 5-66, click Upgrade again to complete the HX Data Platform part of the upgrade.

Images

Figure 5-66 Pre-Upgrade Success

Note

The cluster upgrade is not complete until the full upgrade is initiated. The upgrade is only partially complete after the pre-upgrade auto bootstrapping steps (steps 1 through 5).

Step 6. Choose the type of upgrade from the Select Upgrade Type page:

  • HX Data Platform and UCS Server Firmware

  • HX Data Platform and ESXi

  • HX Data Platform, UCS Server Firmware, and ESXi

Step 7. Depending on the type of upgrade you want to perform, complete the following fields on the Select Upgrade Type page (see Figure 5-67):

UCS Server Firmware:

  • In the UCS Manager FQDN/IP field, enter the Cisco UCS Manager FQDN or IP address.

  • In the User Name field, enter the Cisco UCS Manager <admin> username.

  • In the Admin Password field, enter the Cisco UCS Manager <admin> password.

  • Click the Discover button to view the current UCS firmware package version (which is listed under M5 Current Version).

Images

Figure 5-67 Uploading Upgrade Bundles

Data Platform:

  • Upload the Cisco HyperFlex Data Platform upgrade bundle for upgrading existing clusters (the one used in Step 3).

  • Notice that the current HyperFlex Data Platform version is displayed.

  • Notice that the HyperFlex Data Platform version of the uploaded bundle is displayed.

ESXi:

  • Upload the latest Cisco HyperFlex custom image offline bundle for upgrading existing ESXi hosts (for example, HX-ESXi-6.7U3-14320388-Cisco-Custom-6.7.3.1-upgrade-bundle.zip).

  • Notice that the current ESXi version is displayed.

  • Notice that the ESXi version of the uploaded bundle is displayed.

When you’re done with the settings in this screen, click Upgrade to begin the cluster upgrade process. The validation screen on the Progress page shows the progress of the checks performed (see Figure 5-68).

Images

Figure 5-68 Validating the Upgrade

When the validations are complete, the upgrade process starts, and the Progress page displays the upgrade progress for all the nodes (see Figure 5-69).

Images

Figure 5-69 Upgrade Progress

When the upgrade completes, the Progress page shows that the HX Data Platform upgrade has completed and asks you to refresh the browser session to see the upgrade changes (see Figure 5-70).

Images

Figure 5-70 Upgrade Complete

Offline Upgrade Process

Before you proceed with either a combined upgrade or a split upgrade, consider the following guidelines:

  • The package name must match the file that you uploaded to the controller VM.

  • You need to enter passwords when prompted to do so.

  • Nodes are upgraded with the new version of the Cisco HX Data Platform software and rebooted one at a time.

  • Offline cluster upgrades with nested vCenter are not supported.

The steps for an offline upgrade are as follows:

Step 1. Download Cisco UCS infrastructure A, blade bundle B, and rack bundle C on UCSM.

Step 2. Ensure that the hx-storage-data and vMotion upstream switches are configured for full network failover capability.

Step 3. Upgrade the Cisco UCS infrastructure as required (manually).

Step 4. Perform a graceful shutdown of an HX cluster (SCVMs must remain powered on):

a. SSH to any controller VM in the cluster.

b. Check the cluster health by using the command stcli cluster info | grep health.

c. If the cluster is healthy, shut down the cluster with the command stcli cluster shutdown.

d. Wait a few minutes for the shutdown to complete and the prompt to return.
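Put together, the shutdown sequence from a controller VM resembles the following (the health output shown is representative of a healthy cluster):

# stcli cluster info | grep health
healthState: healthy
state: healthy
storage cluster is healthy
# stcli cluster shutdown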

Step 5. Disable the snapshot schedule by using the command stcli snapshot-schedule --disable.

Step 6. Do a manual bootstrap upgrade by copying the storfs package to /tmp and confirming the plug-in version on vCenter. Run the cluster bootstrap script from the node that has the cluster management IP. Bootstrapping updates the management packages and the plug-in on vCenter, as shown in this example:

# tar -zxvf storfs-packages-4.0.1a-33028.tgz
# ./cluster-bootstrap.sh

Note

Copy storfs-packages-<>.tgz into /tmp. Do not use any folder other than /tmp, and do not create any subfolders.

Step 7. Do an offline upgrade using CLI. The following is an example of a combined upgrade for M5 servers:

# stcli cluster upgrade --components ucs-fw, hxdp, hypervisor
--location /tmp/storfs-packages-3.5.1a-19712.tgz --ucsm-host
eng-fi16.eng.storvisor.com --ucsm-user admin --ucsfw-version
'3.1(2g)'

Step 8. When the upgrade is complete, start the cluster and power on the VMs by following these steps:

a. Log in to any controller VM through SSH, confirm the upgrade status, and then run the command stcli cluster start, as in this example:

root@SpringpathControllerZRVF040451# stcli cluster
upgrade-status
Cluster upgrade succeeded

root@SpringpathControllerZRVF040451# stcli cluster start
waiting for Cluster to start on nodes:
This will start the cluster and mount the HX datastores.
Wait for cluster to come online.
Cluster is online

b. Wait for the cluster to become healthy before starting the VMs. Then run the command stcli cluster info | grep health, as in this example:

root@SpringpathControllerZRVF040451# stcli cluster info |
grep health
healthState: healthy
state: healthy
 storage cluster is healthy

c. When the cluster is healthy, launch vSphere Web Client or Thick Client, navigate to Hosts and Cluster > Datacenter, right-click Cluster, and select Power > Power On to start the VMs.

Step 9. Enable the snapshot schedule by using the command stcli snapshot-schedule --enable.

Post-Upgrade Check

After a cluster is upgraded, you can do the following to confirm the cluster state after upgrade:

  • Log in to HyperFlex Connect and confirm the cluster health and cluster version.

  • Rerun Hypercheck to perform a health check of the cluster.
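If you prefer to confirm from the CLI, a quick spot check from any storage controller VM might resemble the following (the version string is illustrative):

# stcli cluster version
Cluster version: 3.5(2h)
# stcli cluster info | grep -i health
healthState: healthy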

Summary

This chapter covers important Day 2 operations on HyperFlex, including the following topics:

  • HyperFlex licensing

  • Virtual machine management

  • Scaling HyperFlex clusters

  • Hardware (disk) replacement

  • HyperFlex software upgrades
