Introduce KubeAI for OPEA#945

Merged
mkbhanda merged 3 commits into opea-project:main from poussa:kubeai-intro
Apr 8, 2025

@poussa poussa commented Apr 3, 2025

Description

KubeAI is an open-source AI inferencing operator. This PR contains documentation, installation instructions and deployment files for running KubeAI with OPEA inference services.

For now, OPEA enables a subset of the KubeAI features. More KubeAI services will be added in the future.

Features

The following features are enabled at the moment.

  • Installing KubeAI using Helm and using OPEA configuration.
  • OPEA configuration includes OPEA images for vLLM CPU and Gaudi, Persistent Volume Claims (PVC) for model caching, and a resource profile for Gaudi.
  • Text generation model (llama-3.1-8b) for vLLM (CPU and Gaudi) using PVC
  • Text embedding model (BAAI/BGE) for vLLM (CPU) using PVC
  • Text generation model (qwen-2.5-0.5b) for Ollama (CPU)
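
As a rough illustration of the model entries listed above, the sketch below builds a KubeAI `Model` resource as a Python dict and prints it as JSON. The field names (`features`, `url`, `engine`, `resourceProfile`, `cacheProfile`) are assumed from the upstream KubeAI documentation and should be checked against the deployment files in this PR. The helper `build_model_manifest`, the model URL, the `cpu:1` profile, and the `standard` cache profile name are hypothetical values chosen for illustration, not taken from this PR.

```python
import json


def build_model_manifest(name, url, engine, resource_profile, features,
                         cache_profile=None):
    """Return a dict shaped like a KubeAI Model resource.

    Field names are assumptions based on upstream KubeAI docs,
    not on the files added by this PR.
    """
    spec = {
        "features": list(features),
        "url": url,
        "engine": engine,
        "resourceProfile": resource_profile,
    }
    # Only add the cache profile when model caching (e.g. on a PVC) is wanted.
    if cache_profile is not None:
        spec["cacheProfile"] = cache_profile
    return {
        "apiVersion": "kubeai.org/v1",
        "kind": "Model",
        "metadata": {"name": name},
        "spec": spec,
    }


# Hypothetical text generation model served by vLLM on CPU with caching.
manifest = build_model_manifest(
    name="llama-3.1-8b-cpu",                      # illustrative name
    url="hf://meta-llama/Llama-3.1-8B-Instruct",  # assumed model URL
    engine="VLLM",
    resource_profile="cpu:1",                     # assumed profile name
    features=["TextGeneration"],
    cache_profile="standard",                     # hypothetical cache profile
)

print(json.dumps(manifest, indent=2))
```

The JSON output can be converted to YAML or fed to a Kubernetes client; a real deployment would use the manifests shipped with this PR instead.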

Issues

OIM Operator RFC

Type of change

List the type of change like below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)

Dependencies

n/a

Tests

Manually tested on CPU and Gaudi clusters.

@poussa poussa requested a review from mkbhanda as a code owner April 3, 2025 11:48
@poussa poussa requested review from lianhao and yongfengdu April 3, 2025 11:51

@eero-t eero-t left a comment

Otherwise looks fine, but I wonder whether this should go under helm-charts/ instead?

poussa commented Apr 4, 2025

> Otherwise looks fine, but I wonder whether this should go under helm-charts/ instead?

I don't think so. This is not a Helm chart but a solution that uses an external Helm chart. Those are two different things, and it would not be good to put them in the same place.

mkbhanda commented Apr 4, 2025

> > Otherwise looks fine, but I wonder whether this should go under helm-charts/ instead?
>
> I don't think so. This is not a Helm chart but a solution that uses an external Helm chart. Those are two different things, and it would not be good to put them in the same place.

I concur with @poussa

poussa commented Apr 4, 2025

KubeAI requires resources.limits so putting it back.

@poussa poussa force-pushed the kubeai-intro branch 5 times, most recently from f89b68b to dcacffa Compare April 7, 2025 06:47

@eero-t eero-t left a comment

Approved.

PS. Before force-pushing updates, please pull the pre-ci commit first. This avoids re-introducing trailing whitespace and table alignment issues that pre-ci then needs to fix again.

eero-t commented Apr 7, 2025

@chensuyue What's this toctree thing that CI complains about / why linking new README into an already existing README is not enough?
/home/runner/work/GenAIInfra/GenAIInfra/docs/_build/rst/GenAIInfra/kubeai/README.md: WARNING: document isn't included in any toctree

poussa commented Apr 7, 2025

> @chensuyue What's this toctree thing that CI complains about / why linking new README into an already existing README is not enough?
> /home/runner/work/GenAIInfra/GenAIInfra/docs/_build/rst/GenAIInfra/kubeai/README.md: WARNING: document isn't included in any toctree

I need to add an entry to the docs index.rst file first to make the infra CI/CD happy. I'll do that next.
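
The warning quoted above comes from Sphinx, which expects every built document to be listed in some `toctree` directive; a link from another README does not count. A minimal sketch of that check, using a simplified parser and a hypothetical `index.rst` (the real file's contents are not shown in this thread, and `toctree_entries` is an illustrative helper, not part of Sphinx):

```python
import re


def toctree_entries(rst_text):
    """Collect document names listed under `.. toctree::` directives.

    Simplified: handles options (":maxdepth: 1"), plain entries, and
    "Title <path>" entries. It is not a full reStructuredText parser.
    """
    entries = []
    in_toctree = False
    for line in rst_text.splitlines():
        if line.strip().startswith(".. toctree::"):
            in_toctree = True
            continue
        if not in_toctree or not line.strip():
            continue  # outside a toctree, or a blank separator line
        if not line.startswith((" ", "\t")):
            in_toctree = False  # an unindented line ends the directive body
            continue
        item = line.strip()
        if item.startswith(":"):
            continue  # directive option, not a document entry
        match = re.match(r".*<(.+)>$", item)
        entries.append(match.group(1) if match else item)
    return entries


# Hypothetical index content; the actual docs index.rst may differ.
sample_index = """\
GenAIInfra
==========

.. toctree::
   :maxdepth: 1

   README
   kubeai/README
"""

print("kubeai/README" in toctree_entries(sample_index))
```

With the `kubeai/README` line removed from the sample, the check prints `False`, which corresponds to the situation the CI warning reports.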

Signed-off-by: Sakari Poussa <sakari.poussa@intel.com>

@mkbhanda mkbhanda left a comment

Minor typo, can approve

Signed-off-by: Sakari Poussa <sakari.poussa@intel.com>
@mkbhanda mkbhanda merged commit 030515a into opea-project:main Apr 8, 2025
7 checks passed

4 participants