Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion openshift/gpu-operator-with-precompiled-drivers.rst
Original file line number Diff line number Diff line change
Expand Up @@ -139,7 +139,7 @@ Enabling Precompiled Driver Container Support
Using the Web Console
---------------------

#. In the OpenShift Container Platform web console, from the side menu, select **Operators** > **Installed Operators**, and click **NVIDIA GPU Operator**.
#. In the OpenShift Container Platform web console, from the side menu, select **Ecosystem** > **Installed Operators** (for versions before 4.20, look for **Operators** > **Installed Operators**), and click **NVIDIA GPU Operator**.

#. Select the **ClusterPolicy** tab, then click **Create ClusterPolicy**. The platform assigns the default name *gpu-cluster-policy*.

Expand Down
Binary file modified openshift/graphics/cluster_policy_suceed.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
27 changes: 17 additions & 10 deletions openshift/install-gpu-ocp.rst
Original file line number Diff line number Diff line change
Expand Up @@ -11,9 +11,10 @@ Installing the NVIDIA GPU Operator on OpenShift
Installing the NVIDIA GPU Operator by using the web console
***********************************************************

#. In the OpenShift Container Platform web console, from the side menu, navigate to **Operators** > **OperatorHub** and select **All Projects**.
#. In the OpenShift Container Platform web console, from the side menu, navigate to **Ecosystem** > **System Catalog** and select **All Projects** from the drop down menu.
In versions before 4.20, navigate to **Operators** > **OperatorHub** and select **All Projects**.

#. In **Operators** > **OperatorHub**, search for the **NVIDIA GPU Operator**. For additional information, refer to the `Red Hat OpenShift Container Platform documentation <https://docs.openshift.com/container-platform/latest/operators/admin/olm-adding-operators-to-cluster.html>`_.
#. Search for the **NVIDIA GPU Operator**. For additional information, refer to the `Red Hat OpenShift Container Platform documentation <https://docs.openshift.com/container-platform/latest/operators/admin/olm-adding-operators-to-cluster.html>`_.

#. Select the **NVIDIA GPU Operator**, click **Install**. In the following screen, click **Install**.

Expand Down Expand Up @@ -132,7 +133,10 @@ As a cluster administrator, you can install the **NVIDIA GPU Operator** using th

subscription.operators.coreos.com/gpu-operator-certified created

#. Optional: Log in to the web console and navigate to the **Operators** > **Installed Operators** page. In the ``Project: nvidia-gpu-operator`` the following is displayed:
#. Optional: Log in to the web console and navigate to the **Ecosystem** > **Installed Operators** and select **NVIDIA GPU Operator**.
In versions before 4.20, navigate to **Operators** > **Installed Operators** and select **NVIDIA GPU Operator**.

In the ``Project: nvidia-gpu-operator`` the following is displayed:

.. image:: graphics/gpu-operator-certified-cli-install.png

Expand Down Expand Up @@ -191,11 +195,11 @@ when using **NVIDIA vGPU**. Refer to the appropriate sections below.
Create the cluster policy using the web console
-----------------------------------------------

#. In the OpenShift Container Platform web console, from the side menu, select **Operators** > **Installed Operators**, and click **NVIDIA GPU Operator**.
#. In the OpenShift Container Platform web console, from the side menu, select **Ecosystem** > **Installed Operators** (for versions before 4.20, look for **Operators** > **Installed Operators**), and click **NVIDIA GPU Operator**.

#. Select the **ClusterPolicy** tab, then click **Create ClusterPolicy**. The platform assigns the default name *gpu-cluster-policy*.

.. note:: You can use this screen to customize the ClusterPolicy. However, the default values are sufficient to get the GPU configured and running in most cases.
.. note:: You can use this screen to customize the ClusterPolicy. However, the default values are sufficient to get the GPU configured and running in most cases.

.. note:: For OpenShift 4.12 with GPU Operator 25.3.1 or later, you must expand the **Driver** section and set the following fields:

Expand Down Expand Up @@ -280,21 +284,23 @@ Before creating the ClusterPolicy for NVIDIA vGPU, ensure the following:
Create the cluster policy using the web console
-----------------------------------------------

#. In the OpenShift Container Platform web console, from the side menu, select **Operators** > **Installed Operators**, and click **NVIDIA GPU Operator**.
#. In the OpenShift Container Platform web console, from the side menu, select **Ecosystem** > **Installed Operators** and select **NVIDIA GPU Operator**.
In versions before 4.20, navigate to **Operators** > **Installed Operators** and select **NVIDIA GPU Operator**.

#. Select the **ClusterPolicy** tab, then click **Create ClusterPolicy**. The platform assigns the default name *gpu-cluster-policy*.

#. Provide the name of the licensing ``Secret`` under the **Driver** section. This should be created during the prerequisite steps for NVIDIA vGPU (for example, ``licensing-config``). Refer to the following screenshots for examples and modify values accordingly.
#. Under the **NVIDIA GPU/vGPU Driver config** section fill in the following information

* Specify the ``secretName`` as the name of the licensing ``Secret`` created during the prerequisite steps for NVIDIA vGPU (for example, ``licensing-config``).

.. note::

The ``ConfigMap`` option is deprecated. Use a Kubernetes ``Secret`` to store licensing information instead.

.. image:: graphics/cluster_policy_vgpu_1.png

#. Specify the ``repository`` path, ``image`` name, and NVIDIA vGPU driver ``version`` bundled under the **Driver** section. If the registry is not public, specify the ``imagePullSecret`` created during the prerequisite step under the **Driver** advanced configurations section.
* Specify the ``repository`` path, ``image`` name, and NVIDIA vGPU driver ``version``.
If the registry is not public, specify the ``imagePullSecret`` created during the prerequisite step under the **Driver** advanced configurations section.

.. image:: graphics/cluster_policy_vgpu_2.png

#. Click **Create**.

Expand All @@ -305,6 +311,7 @@ Create the cluster policy using the web console
.. image:: graphics/cluster-policy-state-ready.png



Create the cluster policy using the CLI
---------------------------------------

Expand Down
2 changes: 1 addition & 1 deletion openshift/install-gpu-operator-gov-ready-openshift.rst
Original file line number Diff line number Diff line change
Expand Up @@ -94,7 +94,7 @@ Create the ClusterPolicy Instance
When you install the **NVIDIA GPU Operator** in the OpenShift Container Platform, a custom resource definition for a ClusterPolicy is created. The ClusterPolicy configures the GPU Operator, configuring the image names and repository, pod restrictions and credentials, and more. Use the ClusterPolicy to set the NGC image pull secret and government ready repository.


#. In the OpenShift Container Platform web console, from the side menu, select **Operators** > **Installed Operators**, and click **NVIDIA GPU Operator**.
#. In the OpenShift Container Platform web console, from the side menu, select **Ecosystem** > **Installed Operators** (for versions before 4.20, look for **Operators** > **Installed Operators**), and click **NVIDIA GPU Operator**.

#. Select the **ClusterPolicy** tab, then click **Create ClusterPolicy**. The platform assigns the default name *gpu-cluster-policy*.

Expand Down
8 changes: 4 additions & 4 deletions openshift/install-nfd.rst
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,7 @@ Installing the Node Feature Discovery Operator on OpenShift
Procedure
*********

The Node Feature Discovery (NFD) Operator is a prerequisite for the **NVIDIA GPU Operator**. Install the NFD Operator using the Red Hat OperatorHub catalog in the OpenShift Container Platform web console.
The Node Feature Discovery (NFD) Operator is a prerequisite for the **NVIDIA GPU Operator**. Install the NFD Operator using the Red Hat Software Catalog (Red Hat OperatorHub in versions before 4.20) in the OpenShift Container Platform web console.

#. Follow the Red Hat documentation guidance in the `Node Feature Discovery Operator guide <https://docs.openshift.com/container-platform/latest/hardware_enablement/psap-node-feature-discovery-operator.html>`_ to install the Node Feature Discovery Operator.

Expand All @@ -28,8 +28,8 @@ The Node Feature Discovery (NFD) Operator is a prerequisite for the **NVIDIA GPU

#. When the Node Feature Discovery is installed, create an instance of Node Feature Discovery using the **NodeFeatureDiscovery** tab:

#. Click **Operators** > **Installed Operators** from the side menu.

#. Click **Ecosystem** > **Installed Operators** from the side menu.
In versions before 4.20, click **Operators** > **Installed Operators**.
#. Find the **Node Feature Discovery** entry.

#. Click **NodeFeatureDiscovery** under the **Provided APIs** field.
Expand All @@ -38,7 +38,7 @@ The Node Feature Discovery (NFD) Operator is a prerequisite for the **NVIDIA GPU

#. In the following screen, click **Create**. This starts the Node Feature Discovery Operator that proceeds to label the nodes in the cluster that have GPUs.

.. note:: The values prepopulated by the OperatorHub are valid for the GPU Operator.
.. note:: The values prepopulated by the Software Catalog (Red Hat OperatorHub in versions before 4.20) are valid for the GPU Operator.

*************************************************************************
Verify that the Node Feature Discovery Operator is functioning correctly
Expand Down
2 changes: 1 addition & 1 deletion openshift/mig-ocp.rst
Original file line number Diff line number Diff line change
Expand Up @@ -194,7 +194,7 @@ For example to set the advertisement strategy to ``mixed`` and the MIG partition
.. note:: This may take a while so be patient and wait at least 10-20 minutes before digging deeper into any form of troubleshooting.

#. In the OpenShift Container Platform web console, from the side menu, select **Operators** > **Installed Operators**, then click the **NVIDIA GPU Operator**.
#. In the OpenShift Container Platform web console, from the side menu, select **Ecosystem** > **Installed Operators** (for versions before 4.20, look for **Operators** > **Installed Operators**), then click the **NVIDIA GPU Operator**.

#. Select the **ClusterPolicy** tab. The status of the newly deployed ClusterPolicy **gpu-cluster-policy** for the **NVIDIA GPU Operator** displays ``State:ready`` once the installation succeeded.

Expand Down
2 changes: 1 addition & 1 deletion openshift/nvaie-with-ocp.rst
Original file line number Diff line number Diff line change
Expand Up @@ -203,7 +203,7 @@ Create the Cluster Policy Instance

Now create the cluster policy, which is responsible for maintaining policy resources to create pods in a cluster.

#. In the OpenShift Container Platform web console, from the side menu, select **Operators** > **Installed Operators**, and click **NVIDIA GPU Operator**.
#. In the OpenShift Container Platform web console, from the side menu, select **Ecosystem** > **Installed Operators** (for versions before 4.20, look for **Operators** > **Installed Operators**), and click **NVIDIA GPU Operator**.

#. Select the **ClusterPolicy** tab, then click **Create ClusterPolicy**.

Expand Down
2 changes: 1 addition & 1 deletion openshift/openshift-virtualization.rst
Original file line number Diff line number Diff line change
Expand Up @@ -401,7 +401,7 @@ Creating a ClusterPolicy for the GPU Operator using the OpenShift Container Plat

As a cluster administrator, you can create a ClusterPolicy using the OpenShift Container Platform web console.

#. Navigate to **Operators** > **Installed Operators** and find your installed NVIDIA GPU Operator.
#. Navigate to **Ecosystem** > **Installed Operators** (for versions before 4.20, look for **Operators** > **Installed Operators**) and find your installed NVIDIA GPU Operator.

#. Under *Provided APIs*, click **ClusterPolicy**.

Expand Down