Installing Red Hat OpenShift AI

Red Hat OpenShift AI is an open hybrid AI and machine learning platform for gathering insights from data and building AI-enabled applications. It provides tools to rapidly develop, train, serve, and monitor machine learning models on site, in the public cloud, or at the edge. You can install Red Hat OpenShift AI and other necessary operators onto an existing Red Hat OpenShift cluster that is enabled with GPU-enhanced worker nodes.

The following steps refer to the Red Hat documentation. For more information see, Installing OpenShift AI self-managed.

Before you begin

Request access to GPU worker nodes on VPC clusters, which are currently behind an allowlist.
Create a Red Hat OpenShift VPC cluster that runs on bare metal GPU worker nodes and has a public cloud service endpoint. If you choose to use a private cloud service endpoint, make sure you configure your cluster so that you can access the OpenShift web console.
Make sure you can access the your cluster through the OpenShift web console.

Optional. Install the Red Hat OpenShift Pipelines Operator

OpenShift AI supports data science pipelines. A pipeline is a collection of task resources that are arranged in a specific order of execution. By using Red Hat OpenShift AI with pipelines, you can standardize and automate the build and deployment of your data science models. For installation steps, see Installing OpenShift Pipelines in the Red Hat documentation.

Install the OpenShift AI Operator

You can install the OpenShift AI Operator to your cluster using the web console. See Installing OpenShift AI self-managed for detailed installation steps and requirements.

Install the Node Feature Discovery Operator

The Node Feature Discovery (NFD) Operator is a prerequisite for the NVIDIA GPU Operator, which is required to enable and use GPUs with OpenShift AI. Follow the steps in Installing the Node Feature Discovery (NFD) Operator in the NVIDIA documentation to install the operator using the Red Hat OperatorHub catalog in the OpenShift Container Platform web console. Make sure you follow all the instructions, including the step to create a Node Feature Discovery instance and the steps to verify the installation.

Install the NVIDIA GPU Operator

To ensure that your data scientists can use compute-heavy workloads in their models, you can enable graphics processing units (GPUs) in OpenShift AI. To enable GPUs on OpenShift, you must install the NVIDIA GPU Operator on the cluster. For installation steps, see Installing the NVIDIA GPU Operator in the NVIDIA documentation. Note that the Node Feature Discovery Operator must be installed before you can install the NVIDIA GPU operator.

To check that the GPU operator installation is successful, you can run a sample GPU application.

Check installed operators

After you install the operators, check that they are visible in the OpenShift web console.

In the IBM Cloud console, navigate to your clusters page.
Click on the cluster. Then, click OpenShift web console.
In the side panel, click Operators > Installed operators.
Check that the following operators are installed with a status of Succeeded. The Package Server, which is automatically installed when the cluster is created, is also listed.
- NVIDIA GPU Operator
- Node Feature Discovery Operator
- Red Hat OpenShift Pipelines
- Red Hat OpenShift AI

What's next?

Now that you have OpenShift AI installed on your cluster, try one of the Data Science learning paths, run a sample AI workload on your cluster by following one of the Red Hat OpenShift tutorials, or try this tutorial on using GPUs and a Jupyter notebook for AI or machine learning modeling.