OCP Upgrade Preparation

Firmware compatibility

Firmware is NOT part of the OpenShift upgrade and can not be controlled through OpenShift. However, in Telco we run most of our clusters on baremetal which means that we need to be very mindful of the firmware level on all of our hardware.

All hardware vendors will advise that it is always best to be on their latest certified version of firmware for their hardware. In the telco world this comes with a trust but verify approach due to the high throughput nature of telco CNFs. Therefore, it is important to have a regimented group who can test the current and latest firmware from any vendor to make sure that all components will work with both. It is not always recommended to upgrade firmware in conjunction with an OCP upgrade however if it is possible to test the latest release of firmware that will improve the odds that you won’t run into issues down the road.

Upgrading firmware is a very important debate because the process can be very intrusive and has a potential for causing a node to require manual interventions before the node will come back online. On the other hand it may be imperative to upgrade the firmware due to security fixes, new required functionality or compatibility with the new release of OCP components. Therefore, it is up to everyone to verify with their hardware vendors, verify compatibility with OCP components and perform tests in their lab before moving forward.

Layer product compatibility

It is important to make sure all layered products will run on the new version of OCP that you will be moving to. This, very much, includes all Operators.

Verify the current installed list of Operators installed on your cluster. For example:

# oc get csv -A
NAMESPACE                              NAME                                 DISPLAY          VERSION   REPLACES                             PHASE
chapter2                               gitlab-operator-kubernetes.v0.17.2   GitLab           0.17.2    gitlab-operator-kubernetes.v0.17.1   Succeeded
openshift-operator-lifecycle-manager   packageserver                        Package Server   0.19.0                                         Succeeded

OLM Operator compatibility

There is a set of Red Hat Operators that are NOT part of the cluster operators which are otherwise known as the OLM installed operators. To determine the compatibility of these OLM installed operators there is a great web based tool that can be used to determine which versions of OCP are compatible with specific releases of an Operator. This tool is meant to tell you if you need to upgrade an Operator after each Y-Stream upgrade or if you can wait until you have fully upgraded to the next EUS release. In Step 9 under the “Upgrade Process Flow” section you will find additional information regarding what you need to do if an Operator needs to be upgraded after performing the first Y-Stream Control Plane upgrade.

Some Operators are compatible with several releases of OCP. So, you may not need to upgrade until you complete the cluster upgrade. This is shown in Step 13 of the Upgrade Process Flow.

Non-Red Hat OLM Operators

For all OLM installed Operators that are NOT directly supported by Red Hat, please contact the supporting vendor to make sure of release compatibility.

Prepare MCPs

Prepare your Machine Config Pool (MCP) labels by grouping your nodes, depending on the number of nodes in your cluster. MCPs should be split up into 8 to 10 nodes per group. However, there is no hard fast rule as to how many nodes need to be in each MCP. The purpose of these MCPs is to group nodes together so that the upgrade and reboot of a group of nodes can be controlled independently of the rest. Additional information and examples can be found here, under the EUS to EUS upgrade documentation. These MCPs will be used to un-pause a set of nodes during the upgrade process, thus allowing them to be upgraded and rebooted at a determined time instead of at the pleasure of the scheduler. Please review the upgrade process flow section, below, for more details on the pause/un-pause process.

What is the purpose of only upgrading some nodes?

During an upgrade there is always a chance that there will be a problem. Most often the problem is related to hardware failure or needing to be reset. If a problem were to occur, having a set of MCPs in a paused state allows the cluster administrator to make sure there are enough nodes running at all times to keep all applications running. The most important thing is to make sure there are enough resources for all application pods.

How should worker nodes be divided into MCPs?

This can vary depending on how many nodes are in the cluster or how many nodes are in a node role. By default the 2 roles in a cluster are master and worker. However, in Telco clusters we quite often split the worker nodes out into 2 separate roles of control plane and data plane. The most important thing is to add MCPs to split out the nodes in each of these two groups.

For example:
We have 15 worker nodes in a cluster
10 worker nodes are in the control plane role
5 worker nodes are in the data plane role

In this cluster you should split both the CNF control plane and data plane worker node roles into at least 2 MCPs.
The advantage is that having 2 MCPs will allow you to have one set of nodes that are not affected by the upgrade if you need a stopping point.

An example of a larger cluster might have as many as 100 worker nodes in the control plane role. In this case you would want to make sure that you keep each MCP to around 10 nodes. This will allow you to either unpause multiple MCPs at a time to speed up the upgrade or allow you to separate out the upgrade over at least 2 maintenance windows.

Here is a depiction of what this would look like:

Figure 1. Worker node MCPs in a 5 rack cluster

The division and size of these MCPs can vary depending on many factors. In general the standard division in large clusters is between 8 and 10 nodes per MCP to allow the operations team to control how many nodes are taken down at a time.

Here is another example but with a medium cluster.
There are a total of 6 nodes in this role. The MCPs inside of this role are split into 2 node MCPs. The smaller MCPs allow 2 nodes to be upgraded which can then sit through a day to allow for verification of CNF compatibility before completing the upgrade on the other 4 nodes.

Figure 2. Separate MCPs in a medium cluster into 3 MCPs

Figure 3. Small cluster worker MCPs

The process and pace at which you un-pause the MCPs is determined by your CNFs and their configuration. Please review the sections on PDB and anti-affinity for CNFs. If your CNF can properly handle scheduling within an OpenShift cluster you can un-pause several MCPs at a time and set the MaxUnavailable to as high as 50%. This will allow as many as half of the nodes in an MCPs to restart and upgrade. This will reduce the amount of time that is needed for a specific maintenance window and allow your cluster to upgrade quickly.

Review your cluster for available MCPs and nodes

First you can run “oc get mcp” to show your current list of MCPs:

# oc get mcp
NAME     CONFIG  UPDATED   UPDATING   DEGRADED   MACHINECOUNT   READYMACHINECOUNT   UPDATEDMACHINECOUNT   DEGRADEDMACHINECOUNT   AGE
master   rendered-master-b…e83   True      False      False      3              3                   3
0                      25d
worker   rendered-worker-2….c4f   True      False      False      2              2                   2
 0                      25d

Now list out all of the nodes in your cluster:

Review what is listed in the “ROLES” column, this will get updated as we move through this process.

# oc get no
NAME                                           STATUS   ROLES    AGE   VERSION
ctrl-plane-0   Ready    control-plane,master   39d   v1.25.10+28ed2d7
ctrl-plane-1   Ready    control-plane,master   39d   v1.25.10+28ed2d7
ctrl-plane-2   Ready    control-plane,master   39d   v1.25.10+28ed2d7
worker-0       Ready    worker           39d   v1.25.10+28ed2d7
worker-1       Ready    worker           39d   v1.25.10+28ed2d7

Determine, from the above suggestions, how you would like to separate out your worker nodes into machine config pools (MCP).

Create your MCPs

This is a 2 step process:

We add a MCP label to each node
We apply an MCP to the cluster which will organize the nodes based on their labels

In the following example there are only 2 nodes and 2 MCPs. Therefore, each MCP only has 1 node in each.

Labeling nodes

We first need to label the nodes so that they can be put into MCPs. We will do this with the following commands:

oc label node worker-0 node-role.kubernetes.io/mcp-1=
oc label node worker-1 node-role.kubernetes.io/mcp-2=

The labels will show up when you run the “oc get node” command:

# oc get no
NAME                                STATUS   ROLES          AGE   VERSION
ctrl-plane-0   Ready    control-plane,master   39d   v1.25.10+28ed2d7
ctrl-plane-1   Ready    control-plane,master   39d   v1.25.10+28ed2d7
ctrl-plane-2   Ready    control-plane,master   39d   v1.25.10+28ed2d7
worker-0       Ready    mcp-1,worker           39d   v1.25.10+28ed2d7
worker-1       Ready    mcp-2,worker           39d   v1.25.10+28ed2d7

Applying MCPs according to label

Now you need to create yaml files that will apply the labels as MCPs in your cluster. Each MCP will have to have a separate file or a separate section (as it is shown below).

Here is one example:

---
apiVersion: machineconfiguration.openshift.io/v1
kind: MachineConfigPool
metadata:
  name: mcp-2
spec:
  machineConfigSelector:
    matchExpressions:
      - {
         key: machineconfiguration.openshift.io/role,
         operator: In,
         values: [worker,mcp-2]
        }
  nodeSelector:
    matchLabels:
      node-role.kubernetes.io/mcp-2: ""
---
apiVersion: machineconfiguration.openshift.io/v1
kind: MachineConfigPool
metadata:
  name: mcp-1
spec:
  machineConfigSelector:
    matchExpressions:
      - {
         key: machineconfiguration.openshift.io/role,
         operator: In,
         values: [worker,mcp-1]
        }
  nodeSelector:
    matchLabels:
      node-role.kubernetes.io/mcp-1: ""

Apply or create the MCPs through:

# oc apply -f mcps.yaml
machineconfigpool.machineconfiguration.openshift.io/mcp-2 created

Monitor MCP formation

Now you can run “oc get mcp” again and your new MCPs will show. It will take a few minutes for your nodes to move into the new MCPs that they are assigned to. However, the nodes will NOT reboot during this time.

You will still see the original worker and master MCPs that are part of the cluster.

This is what it will look like right after you apply the MCPs:

# oc get mcp
NAME     CONFIG      UPDATED   UPDATING   DEGRADED   MACHINECOUNT   READYMACHINECOUNT   UPDATEDMACHINECOUNT   DEGRADEDMACHINECOUNT   AGE
master   rendered-master-b…e83   True      False      False      3              3                 3                     0                      25d
mcp-1    rendered-mcp-1-2…c4f    False     True       True       1              0                 0                     0                      10s
mcp-2    rendered-mcp-2-2…c4f    False     True       True       1              0                 0                     0                      10s
worker   rendered-worker-2…c4f   False     True       True       0              0                 0                     2                      25d

This is what it will look like after the MCPs have been completely applied:

# oc get mcp
NAME     CONFIG     UPDATED   UPDATING   DEGRADED   MACHINECOUNT   READYMACHINECOUNT   UPDATEDMACHINECOUNT   DEGRADEDMACHINECOUNT   AGE
master   rendered-master-b…e83   True      False      False      3              3                 3                     0                      25d
mcp-1    rendered-mcp-1-2…c4f    True      False      False      1              1                 1                     0                      7m33s
mcp-2    rendered-mcp-2-2…c4f    True      False      False      1              1                 1                     0                      51s
worker   rendered-worker-2…c4f   True      False      False      0              0                 0                     0                      25d

Environment considerations

In Telecommunications environments most of the clusters are kept in an “air gapped” or disconnected network. Therefore, you will need to update your offline image repository. When choosing which images to include, please review the OCP API Compatibility Policy section to make sure the cluster will be able to upgrade to the new version of OCP. Setting up and managing an offline image repository is currently out of scope at this time but will be added at a later date.

Platform preparation

This section should be used as a basic set of checks and verifications to make sure that your cluster is ready for an upgrade.

Basic cluster checks

First you will need to verify that there are no issues with failed pods within the cluster that will stop the upgrade. A very easy first check is to run:

[cnf@utility ~]$ oc get po -A | egrep -vi 'complete|running'
NAMESPACE         NAME       READY   STATUS      RESTARTS        AGE
[cnf@utility ~]$

You may need to run this twice if there are pods that are in pending state as they may just have moved around due to normal operating conditions of the cluster.

If there are problems with pods, please review the troubleshooting documentation to determine what the issue is with the pod(s).

Next verify that all nodes within the cluster are available:

jcl@utility ~]$ oc get no
NAME           STATUS   ROLES                  AGE   VERSION
ctrl-plane-0   Ready    control-plane,master   32d   v1.25.14+a52e8df
ctrl-plane-1   Ready    control-plane,master   32d   v1.25.14+a52e8df
ctrl-plane-2   Ready    control-plane,master   32d   v1.25.14+a52e8df
worker-0       Ready    mcp-1,worker           32d   v1.25.14+a52e8df
worker-1       Ready    mcp-2,worker           32d   v1.25.14+a52e8df

Verify that all baremetal nodes are fully provisioned and ready in the cluster. Here is an example of a cluster that has a baremetal node that had an error while provisioning:

cnf@utility ~]$ oc get bmh -n openshift-machine-api
NAME           STATE       CONSUMER                   ONLINE   ERROR   AGE
ctrl-plane-0   unmanaged   cnf-58879-master-0         true             33d
ctrl-plane-1   unmanaged   cnf-58879-master-1         true             33d
ctrl-plane-2   unmanaged   cnf-58879-master-2         true             33d
worker-0       unmanaged   cnf-58879-worker-0-45879   true             33d
worker-1       unmanaged   cnf-58879-worker-0-dszsh   true             33d

Now verify that all cluster operators are ready:

[cnf@utility ~]$ oc get co
NAME                                       VERSION   AVAILABLE   PROGRESSING   DEGRADED   SINCE   MESSAGE
authentication                             4.12.45   True        False         False      17h
baremetal                                  4.12.45   True        False         False      32d
...
service-ca                                 4.12.45   True        False         False      32d
storage                                    4.12.45   True        False         False      32d