Installing the Red Hat OpenShift AI add-on
Follow the steps to install the OpenShift AI add-on.
Minimum requirements
To use the OpenShift AI add-on, your cluster must have at least two 8-core, 32GB memory worker nodes and access to the public internet.
Before you begin
-
If you do not already have one, create a VPC Public Gateway.
-
If you want to use the OpenShift Pipelines, Node Feature Discovery, or NVIDIA GPU operators with the OpenShift AI add-on, you must disable outbound traffic protection. If you do not want to use those operators, skip this step.
Disabling outbound traffic protection permits all external network connections. See Managing outbound traffic protection in VPC clusters for more information.
ibmcloud oc vpc outbound-traffic-protection disable --cluster CLUSTER
-
Enable OperatorHub on your cluster.
oc patch operatorhub cluster --type json -p '[{"op": "add", "path": "/spec/disableAllDefaultSources", "value": false}]'
Step 1: Choose customization options
You can enhance your Red Hat Openshift AI projects by specifying different options to include with your add-on installation, such as data pipelines for building portable machine learning workflows or tools for managing and scaling your resources. You can also customize upgrade policies and deletion policies.
Run the command to list all options. For descriptions of each available option, see OpenShift AI customization options. If you do not include a specific option when you install the add-on, the default value applies.
ibmcloud oc cluster addon options --addon openshift-ai
Step 2: Review the recommended operators
You can choose to also install additional operators that are recommended for the use of certain OpenShift AI features. If they are not already installed on your cluster, you can choose to include them in the add-on installation. Or, you can install them at anytime by using OperatorHub or by following the operator-specific installation steps. To use these operators, you must disable outbound traffic protection for your cluster.
You are responsible for managing these operators, including but not limited to updating, monitoring, recovery, and re-installation.
The following operators are recommended.
Some of these operators might include additional customizations that you can choose to specify when you install the add-on. Review the list of customizations available for the recommended operators.
Step 3: Install the add-on in the CLI
Run the command to install the Red Hat OpenShift AI add-on. Specify customizations with the format --parameter PARAM=VALUE
. For example,
to include the Data Science Pipelines option, specify --parameter oaiDataSciencePipelines=Managed
.
To include the recommended operators when installing the add-on with the CLI, specify the following options when you run the installation command.
- OpenShift Pipelines:
--parameter pipelineEnabled=true
- Node Feature Discovery:
--parameter nfdEnabled=true
- NVIDIA GPU Operator:
--parameter nvidiaEnabled=true
Installation command.
ibmcloud oc cluster addon enable openshift-ai --cluster CLUSTER [-f] [--param PARAM] [-q] [--version VERSION]
Example command to install the add-on with automatic minor and patch updates, Codeflare, and Kserve enabled.
ibmcloud oc cluster addon enable openshift-ai --cluster CLUSTER --param oaiInstallPlanApproval=Automatic --param oaiCodeflare=Managed --param oaiKserve=Managed
Step 1: Review the recommended operators
You can choose to also install additional operators that are recommended for the use of certain OpenShift AI features. If they are not already installed on your cluster, you can choose to include them in the add-on installation. Or, you can install them at anytime by using OperatorHub or by following the operator-specific installation steps. To use these operators, you must disable outbound traffic protection for your cluster.
You are responsible for managing these operators, including but not limited to updating, monitoring, recovery, and re-installation.
The following operators are recommended.
Some of these operators might include additional customizations that you can choose to specify when you install the add-on. Review the list of customizations available for the recommended operators.
Step 2: Install the add-on in the UI
Install the Red Hat OpenShift AI add-on with the UI.
-
Navigate to your cluster page and click on the relevant cluster.
-
On the cluster details page, find the Add-ons section. Find the Red Hat OpenShift AI option and click Install.
-
In the Capabilities section, review the description of the available add-on customization options and enable the options you want to include with the installation.
-
In the Additional recommended operators section, click to expand each operator and select the customization options you want to include. These additional operators and customizations are recommended for certain Red Hat OpenShift AI features. You can choose to install these options later by using OperatorHub or by following the operator-specific installation steps.
You are responsible for managing these operators, including but not limited to updating, monitoring, recovery, and re-installation.
-
Click Install.
OpenShift AI customization options
Review the customization options available for the OpenShift AI add-on.
To include an option when you install the OpenShift AI add-on with the CLI, include the option with the --parameter PARAM=VALUE
format
when you run the ibmcloud oc cluster addon enable openshift-ai
. For example, to install the add-on with the Data Science Pipelines option, specify --parameter oaiDataSciencePipelines=Managed
.
To include an option when you install the OpenShift AI add-on with the UI, click to enable the option when prompted.
Customization | CLI Parameter | Description | CLI Values | Default value |
---|---|---|---|---|
OpenShift AI approval policy | oaiInstallPlanApproval |
Apply minor and patch updates automatically or manually. | Automatic or Manual |
Automatic |
OpenShift AI deletion policy | oaiDeletePolicy |
Retain or delete any operators or components installed by the add-on if the add-on is removed. | Retain or Delete |
Retain |
Open Data Hub Dashboard | oaiDashboard |
Enable or disable the component. If enabled, it is managed by OpenShift AI platform. | Managed to enableRemoved to disable |
Managed (enabled) |
Kueue | oaiKueue |
Enable or disable the component. If enabled, it is managed by OpenShift AI platform. | Managed to enableRemoved to disable |
Managed (enabled) |
Codeflare | oaiCodeflare |
Enable or disable the component. If enabled, it is managed by OpenShift AI platform. | Managed to enableRemoved to disable |
Managed (enabled) |
ModelMesh Serving | oaiModelmeshserving |
Enable or disable the component. If enabled, it is managed by OpenShift AI platform. | Managed to enableRemoved to disable |
Managed (enabled) |
Workbench | oaiWorkbenches |
Enable or disable the component. If enabled, it is managed by OpenShift AI platform. | Managed to enableRemoved to disable |
Managed (enabled) |
Data Science Pipelines | oaiDataSciencePipelines |
Enable or disable the component. If enabled, it is managed by OpenShift AI platform. | Managed to enableRemoved to disable |
Managed (enabled) |
KServe | oaiKserve |
Enable or disable the component. If enabled, it is managed by OpenShift AI platform. | Managed to enableRemoved to disable |
Managed (enabled) |
Ray | oaiRay |
Enable or disable the component. If enabled, it is managed by OpenShift AI platform. | Managed to enableRemoved to disable |
Managed (enabled) |
Customizations for recommended operators
Review the recommended operators and the optional customizations you can include during installation.
To include a customization for an operator when you install the OpenShift AI add-on with the CLI, include the option with the --parameter PARAM=VALUE
format when you run the ibmcloud oc cluster addon enable openshift-ai
. For example, to include the NVIDIA GPUDirect Storage customization for the NVIDIA operator, specify --parameter nvidiaGpuDirectStorageEnabled=true
.
To include a customization for an operator when you install the OpenShift AI add-on with the UI, click to enable the option when prompted.
Customization | CLI Parameter | Description | CLI Values | Default value |
---|---|---|---|---|
NDF Deletion Policy | nfdDeletePolicy |
Retain or delete the operator if the OpenShift AI add-on is removed. | Retain or Delete |
Retain |
NVIDIA Deletion Policy | nvidiaDeletePolicy |
Retain or delete the operator if the OpenShift AI add-on is removed. | Retain or Delete |
Retain |
NVIDIA Sandbox Workloads | nvidiaSandboxWorkloads |
Enable management of additional operands required for sandbox workloads. | true (enabled)false (disabled) |
true (enabled) |
NVIDIA DCGM Hostengine Deployment | nvidiaDcgmEnabled |
Enable deployment of the NVIDIA DCGM hostengine as a separate pod. | true (enabled)false (disabled) |
true (enabled) |
NVIDIA vGPU Manager | nvidiaVgpuManagerEnabled |
Enable NVIDIA vGPU Manager. | true (enabled)false (disabled) |
true (enabled) |
NVIDIA VFIO Manager | nvidiaVfioManagerEnabled |
Enable VFIOManager for configuration to deploy VFIO-PCI. | true (enabled)false (disabled) |
true (enabled) |
NVIDIA Node Status Exporter | nvidiaNodeStatusExporterEnabled |
Enable Node Status Exporter. | true (enabled)false (disabled) |
true (enabled) |
NVIDIA Sandbox Device Plugin | nvidiaSandboxDevicePluginEnabled |
Enable NVIDIA Sandbox Device Plugin. | true (enabled)false (disabled) |
true (enabled) |
NVIDA MIG Manager | nvidiaMigManagerEnabled |
Enable NVIDIA MIG Manager. | true (enabled)false (disabled) |
true (enabled) |
NVIDIA vGPU Device Manager | nvidiaVgpuDeviceManagerEnabled |
Enable NVIDIA vGPU Device Manager. | true (enabled)false (disabled) |
true (enabled) |
NVIDIA GPUDirect Storage | nvidiaGpuDirectStorageEnabled |
Enable GPUDirect Storage. | true (enabled)false (disabled) |
true (enabled) |
NVIDIA CUDA Testing | nvidiaCudaTest |
Enable NVIDIA CUDA testing. | true (enabled)false (disabled) |
false (disabled) |
Pipeline Operator Deletion Policy | pipelineDeletePolicy |
Retain or delete the operator if the OpenShift AI add-on is removed. | Retain or Delete |
What's next?
- See information on managing the OpenShift AI add-on.
- Make sure you understand the update process for the OpenShift AI add-on.