diff --git a/openshift/gpu-operator-with-precompiled-drivers.rst b/openshift/gpu-operator-with-precompiled-drivers.rst
index 8fb2fb65b..a623fb49e 100644
--- a/openshift/gpu-operator-with-precompiled-drivers.rst
+++ b/openshift/gpu-operator-with-precompiled-drivers.rst
@@ -139,7 +139,7 @@ Enabling Precompiled Driver Container Support
 Using the Web Console
 ---------------------

-#. In the OpenShift Container Platform web console, from the side menu, select **Operators** > **Installed Operators**, and click **NVIDIA GPU Operator**.
+#. In the OpenShift Container Platform web console, from the side menu, select **Ecosystem** > **Installed Operators** (for versions before 4.20, look for **Operators** > **Installed Operators**), and click **NVIDIA GPU Operator**.

 #. Select the **ClusterPolicy** tab, then click **Create ClusterPolicy**. The platform assigns the default name *gpu-cluster-policy*.

diff --git a/openshift/graphics/cluster_policy_suceed.png b/openshift/graphics/cluster_policy_suceed.png
index 3671ccb50..86dac59d5 100644
Binary files a/openshift/graphics/cluster_policy_suceed.png and b/openshift/graphics/cluster_policy_suceed.png differ
diff --git a/openshift/install-gpu-ocp.rst b/openshift/install-gpu-ocp.rst
index ea754ebdd..126c5534b 100644
--- a/openshift/install-gpu-ocp.rst
+++ b/openshift/install-gpu-ocp.rst
@@ -11,9 +11,10 @@ Installing the NVIDIA GPU Operator on OpenShift
 Installing the NVIDIA GPU Operator by using the web console
 ***********************************************************

-#. In the OpenShift Container Platform web console, from the side menu, navigate to **Operators** > **OperatorHub** and select **All Projects**.
+#. In the OpenShift Container Platform web console, from the side menu, navigate to **Ecosystem** > **System Catalog** and select **All Projects** from the drop-down menu.
+   In versions before 4.20, navigate to **Operators** > **OperatorHub** and select **All Projects**.

-#. In **Operators** > **OperatorHub**, search for the **NVIDIA GPU Operator**. For additional information, refer to the `Red Hat OpenShift Container Platform documentation `_.
+#. Search for the **NVIDIA GPU Operator**. For additional information, refer to the `Red Hat OpenShift Container Platform documentation `_.

 #. Select the **NVIDIA GPU Operator**, click **Install**. In the following screen, click **Install**.

@@ -132,7 +133,10 @@ As a cluster administrator, you can install the **NVIDIA GPU Operator** using th

       subscription.operators.coreos.com/gpu-operator-certified created

-#. Optional: Log in to the web console and navigate to the **Operators** > **Installed Operators** page. In the ``Project: nvidia-gpu-operator`` the following is displayed:
+#. Optional: Log in to the web console and navigate to **Ecosystem** > **Installed Operators** and select **NVIDIA GPU Operator**.
+   In versions before 4.20, navigate to **Operators** > **Installed Operators** and select **NVIDIA GPU Operator**.
+
+   In the ``Project: nvidia-gpu-operator`` the following is displayed:

    .. image:: graphics/gpu-operator-certified-cli-install.png

@@ -191,11 +195,11 @@ when using **NVIDIA vGPU**. Refer to the appropriate sections below.
 Create the cluster policy using the web console
 -----------------------------------------------

-#. In the OpenShift Container Platform web console, from the side menu, select **Operators** > **Installed Operators**, and click **NVIDIA GPU Operator**.
+#. In the OpenShift Container Platform web console, from the side menu, select **Ecosystem** > **Installed Operators** (for versions before 4.20, look for **Operators** > **Installed Operators**), and click **NVIDIA GPU Operator**.

 #. Select the **ClusterPolicy** tab, then click **Create ClusterPolicy**. The platform assigns the default name *gpu-cluster-policy*.

-   .. note:: You can use this screen to customize the ClusterPolicy. However, the default values are sufficient to get the GPU configured and running in most cases.
+   .. note:: You can use this screen to customize the ClusterPolicy. However, the default values are sufficient to get the GPU configured and running in most cases.

    .. note:: For OpenShift 4.12 with GPU Operator 25.3.1 or later, you must expand the **Driver** section and set the following fields:

@@ -280,21 +284,23 @@ Before creating the ClusterPolicy for NVIDIA vGPU, ensure the following:
 Create the cluster policy using the web console
 -----------------------------------------------

-#. In the OpenShift Container Platform web console, from the side menu, select **Operators** > **Installed Operators**, and click **NVIDIA GPU Operator**.
+#. In the OpenShift Container Platform web console, from the side menu, select **Ecosystem** > **Installed Operators** and select **NVIDIA GPU Operator**.
+   In versions before 4.20, navigate to **Operators** > **Installed Operators** and select **NVIDIA GPU Operator**.

 #. Select the **ClusterPolicy** tab, then click **Create ClusterPolicy**. The platform assigns the default name *gpu-cluster-policy*.

-#. Provide the name of the licensing ``Secret`` under the **Driver** section. This should be created during the prerequisite steps for NVIDIA vGPU (for example, ``licensing-config``). Refer to the following screenshots for examples and modify values accordingly.
+#. Under the **NVIDIA GPU/vGPU Driver config** section, fill in the following information:

+   * Specify the ``secretName`` as the name of the licensing ``Secret`` created during the prerequisite steps for NVIDIA vGPU (for example, ``licensing-config``).

    .. note:: The ``ConfigMap`` option is deprecated. Use a Kubernetes ``Secret`` to store licensing information instead.

-   .. image:: graphics/cluster_policy_vgpu_1.png

-#. Specify the ``repository`` path, ``image`` name, and NVIDIA vGPU driver ``version`` bundled under the **Driver** section. If the registry is not public, specify the ``imagePullSecret`` created during the prerequisite step under the **Driver** advanced configurations section.
+   * Specify the ``repository`` path, ``image`` name, and NVIDIA vGPU driver ``version``.
+     If the registry is not public, specify the ``imagePullSecret`` created during the prerequisite step under the **Driver** advanced configurations section.

-   .. image:: graphics/cluster_policy_vgpu_2.png

 #. Click **Create**.

@@ -305,6 +311,7 @@ Create the cluster policy using the web console

    .. image:: graphics/cluster-policy-state-ready.png

+
 Create the cluster policy using the CLI
 ---------------------------------------

diff --git a/openshift/install-gpu-operator-gov-ready-openshift.rst b/openshift/install-gpu-operator-gov-ready-openshift.rst
index 798ce3e52..dca5efa61 100644
--- a/openshift/install-gpu-operator-gov-ready-openshift.rst
+++ b/openshift/install-gpu-operator-gov-ready-openshift.rst
@@ -94,7 +94,7 @@ Create the ClusterPolicy Instance

 When you install the **NVIDIA GPU Operator** in the OpenShift Container Platform, a custom resource definition for a ClusterPolicy is created. The ClusterPolicy configures the GPU Operator, configuring the image names and repository, pod restrictions and credentials, and more. Use the ClusterPolicy to set the NGC image pull secret and government ready repository.

-#. In the OpenShift Container Platform web console, from the side menu, select **Operators** > **Installed Operators**, and click **NVIDIA GPU Operator**.
+#. In the OpenShift Container Platform web console, from the side menu, select **Ecosystem** > **Installed Operators** (for versions before 4.20, look for **Operators** > **Installed Operators**), and click **NVIDIA GPU Operator**.

 #. Select the **ClusterPolicy** tab, then click **Create ClusterPolicy**. The platform assigns the default name *gpu-cluster-policy*.
diff --git a/openshift/install-nfd.rst b/openshift/install-nfd.rst
index dac69d8cf..5130e1505 100644
--- a/openshift/install-nfd.rst
+++ b/openshift/install-nfd.rst
@@ -11,7 +11,7 @@ Installing the Node Feature Discovery Operator on OpenShift
 Procedure
 *********

-The Node Feature Discovery (NFD) Operator is a prerequisite for the **NVIDIA GPU Operator**. Install the NFD Operator using the Red Hat OperatorHub catalog in the OpenShift Container Platform web console.
+The Node Feature Discovery (NFD) Operator is a prerequisite for the **NVIDIA GPU Operator**. Install the NFD Operator using the Red Hat Software Catalog (Red Hat OperatorHub in versions before 4.20) in the OpenShift Container Platform web console.

 #. Follow the Red Hat documentation guidance in the `Node Feature Discovery Operator guide `_ to install the Node Feature Discovery Operator.

@@ -28,8 +28,8 @@ The Node Feature Discovery (NFD) Operator is a prerequisite for the **NVIDIA GPU

 #. When the Node Feature Discovery is installed, create an instance of Node Feature Discovery using the **NodeFeatureDiscovery** tab:

-   #. Click **Operators** > **Installed Operators** from the side menu.
-
+   #. Click **Ecosystem** > **Installed Operators** from the side menu.
+      In versions before 4.20, click **Operators** > **Installed Operators**.
    #. Find the **Node Feature Discovery** entry.

    #. Click **NodeFeatureDiscovery** under the **Provided APIs** field.
@@ -38,7 +38,7 @@ The Node Feature Discovery (NFD) Operator is a prerequisite for the **NVIDIA GPU

    #. In the following screen, click **Create**. This starts the Node Feature Discovery Operator that proceeds to label the nodes in the cluster that have GPUs.

-   .. note:: The values prepopulated by the OperatorHub are valid for the GPU Operator.
+   .. note:: The values prepopulated by the Software Catalog (Red Hat OperatorHub in versions before 4.20) are valid for the GPU Operator.
 *************************************************************************
 Verify that the Node Feature Discovery Operator is functioning correctly
diff --git a/openshift/mig-ocp.rst b/openshift/mig-ocp.rst
index ef65cb36d..16cd22e5d 100644
--- a/openshift/mig-ocp.rst
+++ b/openshift/mig-ocp.rst
@@ -194,7 +194,7 @@ For example to set the advertisement strategy to ``mixed`` and the MIG partition

    .. note:: This may take a while so be patient and wait at least 10-20 minutes before digging deeper into any form of troubleshooting.

-#. In the OpenShift Container Platform web console, from the side menu, select **Operators** > **Installed Operators**, then click the **NVIDIA GPU Operator**.
+#. In the OpenShift Container Platform web console, from the side menu, select **Ecosystem** > **Installed Operators** (for versions before 4.20, look for **Operators** > **Installed Operators**), then click the **NVIDIA GPU Operator**.

 #. Select the **ClusterPolicy** tab. The status of the newly deployed ClusterPolicy **gpu-cluster-policy** for the **NVIDIA GPU Operator** displays ``State:ready`` once the installation succeeded.

diff --git a/openshift/nvaie-with-ocp.rst b/openshift/nvaie-with-ocp.rst
index a63f5fcd1..e8027e6ac 100644
--- a/openshift/nvaie-with-ocp.rst
+++ b/openshift/nvaie-with-ocp.rst
@@ -203,7 +203,7 @@ Create the Cluster Policy Instance

 Now create the cluster policy, which is responsible for maintaining policy resources to create pods in a cluster.

-#. In the OpenShift Container Platform web console, from the side menu, select **Operators** > **Installed Operators**, and click **NVIDIA GPU Operator**.
+#. In the OpenShift Container Platform web console, from the side menu, select **Ecosystem** > **Installed Operators** (for versions before 4.20, look for **Operators** > **Installed Operators**), and click **NVIDIA GPU Operator**.

 #. Select the **ClusterPolicy** tab, then click **Create ClusterPolicy**.
diff --git a/openshift/openshift-virtualization.rst b/openshift/openshift-virtualization.rst
index 259a9ec31..c3894506c 100644
--- a/openshift/openshift-virtualization.rst
+++ b/openshift/openshift-virtualization.rst
@@ -401,7 +401,7 @@ Creating a ClusterPolicy for the GPU Operator using the OpenShift Container Plat

 As a cluster administrator, you can create a ClusterPolicy using the OpenShift Container Platform web console.

-#. Navigate to **Operators** > **Installed Operators** and find your installed NVIDIA GPU Operator.
+#. Navigate to **Ecosystem** > **Installed Operators** (for versions before 4.20, look for **Operators** > **Installed Operators**) and find your installed NVIDIA GPU Operator.

 #. Under *Provided APIs*, click **ClusterPolicy**.