IBM Cloud Docs
Installing the Red Hat OpenShift AI add-on

Installing the Red Hat OpenShift AI add-on

Follow the steps to install the OpenShift AI add-on.

Minimum requirements

To use the OpenShift AI add-on, your cluster must have at least two 8-core, 32GB memory worker nodes and access to the public internet.

Before you begin

  1. Log in to your account. If applicable, target the appropriate resource group. Set the context for your cluster.

  2. If you do not already have one, create a VPC Public Gateway.

  3. If you want to use the OpenShift Pipelines, Node Feature Discovery, or NVIDIA GPU operators with the OpenShift AI add-on, you must disable outbound traffic protection. If you do not want to use those operators, skip this step.

    Disabling outbound traffic protection permits all external network connections. See Managing outbound traffic protection in VPC clusters for more information.

    ibmcloud oc vpc outbound-traffic-protection disable --cluster CLUSTER
    
  4. Enable OperatorHub on your cluster.

    oc patch operatorhub cluster --type json -p '[{"op": "add", "path": "/spec/disableAllDefaultSources", "value": false}]' 
    

Step 1: Choose customization options

You can enhance your Red Hat Openshift AI projects by specifying different options to include with your add-on installation, such as data pipelines for building portable machine learning workflows or tools for managing and scaling your resources. You can also customize upgrade policies and deletion policies.

Run the command to list all options. For descriptions of each available option, see OpenShift AI customization options. If you do not include a specific option when you install the add-on, the default value applies.

ibmcloud oc cluster addon options --addon openshift-ai

Step 2: Review the recommended operators

You can choose to also install additional operators that are recommended for the use of certain OpenShift AI features. If they are not already installed on your cluster, you can choose to include them in the add-on installation. Or, you can install them at anytime by using OperatorHub or by following the operator-specific installation steps. To use these operators, you must disable outbound traffic protection for your cluster.

You are responsible for managing these operators, including but not limited to updating, monitoring, recovery, and re-installation.

The following operators are recommended.

Some of these operators might include additional customizations that you can choose to specify when you install the add-on. Review the list of customizations available for the recommended operators.

Step 3: Install the add-on in the CLI

Run the command to install the Red Hat OpenShift AI add-on. Specify customizations with the format --parameter PARAM=VALUE. For example, to include the Data Science Pipelines option, specify --parameter oaiDataSciencePipelines=Managed.

To include the recommended operators when installing the add-on with the CLI, specify the following options when you run the installation command.

  • OpenShift Pipelines: --parameter pipelineEnabled=true
  • Node Feature Discovery: --parameter nfdEnabled=true
  • NVIDIA GPU Operator: --parameter nvidiaEnabled=true

Installation command.

ibmcloud oc cluster addon enable openshift-ai --cluster CLUSTER [-f] [--param PARAM] [-q] [--version VERSION]

Example command to install the add-on with automatic minor and patch updates, Codeflare, and Kserve enabled.

ibmcloud oc cluster addon enable openshift-ai --cluster CLUSTER --param oaiInstallPlanApproval=Automatic --param oaiCodeflare=Managed --param oaiKserve=Managed

Step 1: Review the recommended operators

You can choose to also install additional operators that are recommended for the use of certain OpenShift AI features. If they are not already installed on your cluster, you can choose to include them in the add-on installation. Or, you can install them at anytime by using OperatorHub or by following the operator-specific installation steps. To use these operators, you must disable outbound traffic protection for your cluster.

You are responsible for managing these operators, including but not limited to updating, monitoring, recovery, and re-installation.

The following operators are recommended.

Some of these operators might include additional customizations that you can choose to specify when you install the add-on. Review the list of customizations available for the recommended operators.

Step 2: Install the add-on in the UI

Install the Red Hat OpenShift AI add-on with the UI.

  1. Navigate to your cluster page and click on the relevant cluster.

  2. On the cluster details page, find the Add-ons section. Find the Red Hat OpenShift AI option and click Install.

  3. In the Capabilities section, review the description of the available add-on customization options and enable the options you want to include with the installation.

  4. In the Additional recommended operators section, click to expand each operator and select the customization options you want to include. These additional operators and customizations are recommended for certain Red Hat OpenShift AI features. You can choose to install these options later by using OperatorHub or by following the operator-specific installation steps.

    You are responsible for managing these operators, including but not limited to updating, monitoring, recovery, and re-installation.

  5. Click Install.

OpenShift AI customization options

Review the customization options available for the OpenShift AI add-on.

To include an option when you install the OpenShift AI add-on with the CLI, include the option with the --parameter PARAM=VALUE format when you run the ibmcloud oc cluster addon enable openshift-ai. For example, to install the add-on with the Data Science Pipelines option, specify --parameter oaiDataSciencePipelines=Managed.

To include an option when you install the OpenShift AI add-on with the UI, click to enable the option when prompted.

OpenShift AI add-on customization options and CLI parameters.
Customization CLI Parameter Description CLI Values Default value
OpenShift AI approval policy oaiInstallPlanApproval Apply minor and patch updates automatically or manually. Automatic or Manual Automatic
OpenShift AI deletion policy oaiDeletePolicy Retain or delete any operators or components installed by the add-on if the add-on is removed. Retain or Delete Retain
Open Data Hub Dashboard oaiDashboard Enable or disable the component. If enabled, it is managed by OpenShift AI platform. Managed to enable
Removed to disable
Managed (enabled)
Kueue oaiKueue Enable or disable the component. If enabled, it is managed by OpenShift AI platform. Managed to enable
Removed to disable
Managed (enabled)
Codeflare oaiCodeflare Enable or disable the component. If enabled, it is managed by OpenShift AI platform. Managed to enable
Removed to disable
Managed (enabled)
ModelMesh Serving oaiModelmeshserving Enable or disable the component. If enabled, it is managed by OpenShift AI platform. Managed to enable
Removed to disable
Managed (enabled)
Workbench oaiWorkbenches Enable or disable the component. If enabled, it is managed by OpenShift AI platform. Managed to enable
Removed to disable
Managed (enabled)
Data Science Pipelines oaiDataSciencePipelines Enable or disable the component. If enabled, it is managed by OpenShift AI platform. Managed to enable
Removed to disable
Managed (enabled)
KServe oaiKserve Enable or disable the component. If enabled, it is managed by OpenShift AI platform. Managed to enable
Removed to disable
Managed (enabled)
Ray oaiRay Enable or disable the component. If enabled, it is managed by OpenShift AI platform. Managed to enable
Removed to disable
Managed (enabled)

Customizations for recommended operators

Review the recommended operators and the optional customizations you can include during installation.

To include a customization for an operator when you install the OpenShift AI add-on with the CLI, include the option with the --parameter PARAM=VALUE format when you run the ibmcloud oc cluster addon enable openshift-ai. For example, to include the NVIDIA GPUDirect Storage customization for the NVIDIA operator, specify --parameter nvidiaGpuDirectStorageEnabled=true.

To include a customization for an operator when you install the OpenShift AI add-on with the UI, click to enable the option when prompted.

OpenShift AI add-on customizations options and CLI parameters for additional operators.
Customization CLI Parameter Description CLI Values Default value
NDF Deletion Policy nfdDeletePolicy Retain or delete the operator if the OpenShift AI add-on is removed. Retain or Delete Retain
NVIDIA Deletion Policy nvidiaDeletePolicy Retain or delete the operator if the OpenShift AI add-on is removed. Retain or Delete Retain
NVIDIA Sandbox Workloads nvidiaSandboxWorkloads Enable management of additional operands required for sandbox workloads. true (enabled)
false (disabled)
true (enabled)
NVIDIA DCGM Hostengine Deployment nvidiaDcgmEnabled Enable deployment of the NVIDIA DCGM hostengine as a separate pod. true (enabled)
false (disabled)
true (enabled)
NVIDIA vGPU Manager nvidiaVgpuManagerEnabled Enable NVIDIA vGPU Manager. true (enabled)
false (disabled)
true (enabled)
NVIDIA VFIO Manager nvidiaVfioManagerEnabled Enable VFIOManager for configuration to deploy VFIO-PCI. true (enabled)
false (disabled)
true (enabled)
NVIDIA Node Status Exporter nvidiaNodeStatusExporterEnabled Enable Node Status Exporter. true (enabled)
false (disabled)
true (enabled)
NVIDIA Sandbox Device Plugin nvidiaSandboxDevicePluginEnabled Enable NVIDIA Sandbox Device Plugin. true (enabled)
false (disabled)
true (enabled)
NVIDA MIG Manager nvidiaMigManagerEnabled Enable NVIDIA MIG Manager. true (enabled)
false (disabled)
true (enabled)
NVIDIA vGPU Device Manager nvidiaVgpuDeviceManagerEnabled Enable NVIDIA vGPU Device Manager. true (enabled)
false (disabled)
true (enabled)
NVIDIA GPUDirect Storage nvidiaGpuDirectStorageEnabled Enable GPUDirect Storage. true (enabled)
false (disabled)
true (enabled)
NVIDIA CUDA Testing nvidiaCudaTest Enable NVIDIA CUDA testing. true (enabled)
false (disabled)
false (disabled)
Pipeline Operator Deletion Policy pipelineDeletePolicy Retain or delete the operator if the OpenShift AI add-on is removed. Retain or Delete

What's next?