Upgrade Alauda AI

INFO

Upgrade from 1.4 to 1.5

Install Alauda AI Cluster Components

Please visit Alauda AI Cluster for:

WARNING

Please ignore Creating Alauda AI Cluster Instance since we are upgrading Alauda AI from a previously managed version.

  1. Downloading operator bundle packages for Alauda AI Cluster and KServeless.
  2. Uploading operator bundle packages to the destination cluster.
  3. To upgrade, follow the process described below.

Upgrading

The following procedure describes how to upgrade from Alauda AI 1.4 to 1.5.

Upgrade Alauda AI Essentials

Upgrade Alauda AI Essentials in the Functional Components tab of the global cluster.

In the Administrator view, navigate to Clusters / Cluster in the left navigation, find the global cluster in the right panel, and click the Upgrade button on the Functional Components tab page to trigger the upgrade.

INFO

All components that require upgrading, including Alauda AI Essentials, will be upgraded together and cannot be upgraded individually.

After the upgrade is complete, please confirm that the status of Alauda AI Essentials in the list is Running and the version is v1.5.x.

Upgrading Alauda AI Operators

The procedure for upgrading both operators is nearly identical, with only the target component being different.

StepAlauda AI OperatorAlauda AI Model Serving Operator
1. NavigateLog into the Web Console, then go to Marketplace > OperatorHub in the Administrator view.Log into the Web Console, then go to Marketplace > OperatorHub in the Administrator view.
2. SelectSelect your target cluster.Select your target cluster.
3. ClickClick the Alauda AI card.Click the Alauda AI Model Serving card.
4. ConfirmClick Confirm on the upgrade prompt.Click Confirm on the upgrade prompt.
INFO

Once the new version is uploaded and recognized by the platform, an upgrade prompt will appear at the top of the operator's page.

Upgrading Cluster Plugins

INFO

This step is only required if you have deployed any of the following cluster plugins: Alauda AI Workbench, Alauda AI MLflow, or Alauda AI Volcano. If you have not deployed any of these plugins, you can skip this step.

For more information about cluster plugins, refer to Alauda Container Platform - Extend.

The procedure to upgrade cluster plugins involves uploading new version packages and then upgrading them from the Web Console. The following plugins require upgrading:

  • Alauda AI Workbench
  • Alauda AI MLflow
  • Alauda AI Volcano

Uploading Cluster Plugins

Upload the new cluster plugin packages to the global cluster. For detailed instructions on how to upload cluster plugins, refer to Uploading Cluster Plugins.

Repeat the upload process for each plugin that you have deployed.

INFO

After the upload is completed, wait approximately 10–15 minutes for the platform to synchronize the new version information.

Verifying the New Version

Navigate to Administrator > Marketplace > Upload Packages and switch to the Cluster Plugin tab. Locate each uploaded plugin to verify that the new version is displayed.

Upgrading from Web Console

  1. Navigate to Administrator > Clusters > Clusters.
  2. Click on the cluster where the plugins are deployed.
  3. Switch to the Functional Components tab in the cluster details page.
  4. Locate the cluster plugins that have available upgrades—an upgrade icon will be displayed next to them.
  5. Click the Upgrade button for the plugin you want to upgrade.
  6. Review the upgrade information in the popup window, confirm the plugin version to be upgraded, and click the Upgrade button to start the upgrade process.

Repeat this process for each plugin that you have deployed.

INFO

Once the new version is uploaded and recognized by the platform, the Upgrade button will become available for each plugin on the Features tab of the cluster details page.

Enabling Fine-Tuning and Training Features

INFO

This step is required if you want to use the fine-tuning and training features in the upgraded version. If you do not need these features, you can skip this step.

For detailed information about fine-tuning and training features, refer to Enable Fine-Tuning and Training Features.

To enable the fine-tuning and training features, you need to configure the AmlCluster resource by setting the pretrain and tuneModels flags to true.

  1. Navigate to Administrator > Clusters > Resources.

  2. Enter amlcluster in the search box on the left side.

  3. Click the Correlated with Cluster panel to find the AmlCluster resource named default.

  4. Edit the resource and set pretrain and tuneModels to true under spec.values.experimentalFeatures.

    apiVersion: amlclusters.aml.dev/v1alpha2
    kind: AmlCluster
    metadata:
      name: default
    spec:
      values:
        experimentalFeatures:
          pretrain: true
          tuneModels: true
  1. When set to true, the Training item appears in the left navigation menu. 2. When set to true, the Fine-Tuning item appears in the left navigation menu.
NOTE

After enabling these features, ensure that the required cluster plugins are installed:

  • Volcano cluster plugin for training job scheduling
  • MLflow cluster plugin for training experiment monitoring (requires PostgreSQL)

Verification

Alauda AI

Check the status field from the AmlCluster resource which named default:

kubectl get amlcluster default

Should returns Ready:

NAME      READY   REASON
default   True    Succeeded

Alauda AI Model Serving

Check the status field from the KnativeServing resource which named default-knative-serving:

kubectl get KnativeServing.components.aml.dev default-knative-serving

Should returns InstallSuccessful:

NAME                      DEPLOYED   REASON
default-knative-serving   True       UpgradeSuccessful

Alauda AI Cluster Plugins

In the Administrator view, navigate to Marketplace > Cluster Plugins and confirm that the following cluster plugins show Installed status with the new version:

  • Alauda AI Workbench (if deployed)
  • Alauda AI MLflow (if deployed)
  • Alauda AI Volcano (if deployed)