[XPU] Update doc and add scripts for downloading dependencies by yulangz · Pull Request #2845 · PaddlePaddle/FastDeploy

yulangz · 2025-07-15T03:36:15Z

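Consolidate the XVLLM download steps into the download_dependencies.sh script to reduce future maintenance cost.
Add the kunlunxin_xpu_deployment.md document, which describes the adapted models and how to deploy them, and move it from the installation docs to the usage directory.
Fix the shape of block_table being set incorrectly in xpu_model_runner, which causes operator errors for some small models when device memory is very plentiful.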
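
To illustrate the first point, here is a minimal sketch of a download script driven by a single channel argument (`stable` or `develop`). It is not the script merged in this PR. The two `develop` URLs are the ones this PR removes from the docs. The `stable` URLs do not appear in this thread, so the sketch does not guess them and instead reads them from the hypothetical variables `XTDK_STABLE_URL` and `XVLLM_STABLE_URL`. The function names `resolve_urls` and `download_all` are also assumptions made for illustration.

```shell
#!/bin/bash
# Hypothetical sketch only; names and layout are assumptions, not the merged script.

# Print the XTDK and XVLLM download URLs (one per line) for the given channel.
resolve_urls() {
    local channel="$1"
    case "$channel" in
        develop)
            # URLs copied from the doc lines that this PR replaces.
            echo "https://klx-sdk-release-public.su.bcebos.com/xtdk_15fusion/dev/latest/xtdk-llvm15-ubuntu2004_x86_64.tar.gz"
            echo "https://klx-sdk-release-public.su.bcebos.com/xinfer/daily/eb/latest/output.tar.gz"
            ;;
        stable)
            # Stable URLs are not shown in this thread, so require explicit
            # overrides (assumed variable names) instead of guessing.
            if [ -z "${XTDK_STABLE_URL:-}" ] || [ -z "${XVLLM_STABLE_URL:-}" ]; then
                echo "set XTDK_STABLE_URL and XVLLM_STABLE_URL for the stable channel" >&2
                return 1
            fi
            echo "$XTDK_STABLE_URL"
            echo "$XVLLM_STABLE_URL"
            ;;
        *)
            echo "usage: <stable|develop>" >&2
            return 2
            ;;
    esac
}

# Download every resolved archive into the destination directory.
download_all() {
    local channel="$1" dest="$2" urls url
    urls="$(resolve_urls "$channel")" || return $?
    mkdir -p "$dest"
    for url in $urls; do
        wget -q -P "$dest" "$url" || return 1
    done
    echo "Downloaded to: $dest"
}
```

With these definitions sourced, `download_all develop ./third_party` would fetch both archives. Keeping URL selection in one function means the docs only need to name the channel, which is the maintenance benefit the description refers to.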

paddle-bot · 2025-07-15T03:36:19Z

Thanks for your contribution!

hong19860320 · 2025-07-15T07:18:58Z

@@ -0,0 +1,54 @@
+#!/bin/bash


Let's rename the script to download_dependencies.sh.

hong19860320 · 2025-07-15T07:30:10Z

+fi
+
+echo "Installation completed in: $THIRDPARTY_DIR"
+echo "You can set environment variables to use XVLLM and XTDK in the following way:"


You can set environment variables as follows to use XVLLM and XTDK:

hong19860320 · 2025-07-15T07:31:26Z

    wget https://klx-sdk-release-public.su.bcebos.com/xre/kl3-release/5.0.21.21/xre-Linux-x86_64-5.0.21.21.tar.gz && \
-    tar -zxf xre-Linux-x86_64-5.0.21.21.tar.gz && mv xre-Linux-x86_64-5.0.21.21 xre
+    tar -zxf xre-Linux-x86_64-5.0.21.21.tar.gz && mv xre-Linux-x86_64-5.0.21.21 xre && \
+    cd /workspace/FastDeploy && bash custom_ops/xpu_ops/src/download_dependency.sh stable


Let's rename the script to download_dependencies.sh.

hong19860320 · 2025-07-15T07:38:30Z


 For detailed OpenAI protocol specifications, see [OpenAI Chat Compeltion API](https://platform.openai.com/docs/api-reference/chat/create). Differences from the standard OpenAI protocol are documented in [OpenAI Protocol-Compatible API Server](../../online_serving/README.md).
+
+## Supported Models


Could this section go before ## Quick start?
Then, in Quick start, this whole passage can be deleted:

"The P800 supports the deployment of the ERNIE-4.5-300B-A47B-Paddle model using the following configurations (Note: Different configurations may result in variations in performance).

32K WINT4 with 8 XPUs (Recommended)

128K WINT4 with 8 XPUs

32K WINT4 with 4 XPUs"

Also, under "#### Start service", let's keep only the one recommended launch method.

hong19860320 · 2025-07-15T07:39:33Z

 ```bash
-XTDK: https://klx-sdk-release-public.su.bcebos.com/xtdk_15fusion/dev/latest/xtdk-llvm15-ubuntu2004_x86_64.tar.gz
-XVLLM: https://klx-sdk-release-public.su.bcebos.com/xinfer/daily/eb/latest/output.tar.gz
+bash custom_ops/xpu_ops/src/download_dependency.sh develop


hong19860320 · 2025-07-15T07:39:43Z


 OpenAI 协议的更多说明可参考文档 [OpenAI Chat Compeltion API](https://platform.openai.com/docs/api-reference/chat/create)，以及与 OpenAI 协议的区别可以参考 [兼容 OpenAI 协议的服务化部署](../../online_serving/README.md)。
+
+## 支持的模型


hong19860320

LGTM

…Paddle#2845) * [XPU] update xvllm download * update supported models * fix xpu model runner in huge memory with small model * update doc

[XPU] update xvllm download

ae70a39

yulangz added 2 commits July 15, 2025 06:03

update

a50f339

update supported models

a9952f5

hong19860320 reviewed Jul 15, 2025

yulangz and others added 6 commits July 15, 2025 08:11

fix xpu model runner in huge memory with small model

ff23732

update doc

5f18e2c

Merge branch 'develop' into update_xvllm_download

ba44ceb

update

45b6f81

Merge branch 'update_xvllm_download' of https://github.com/yulangz/Fa…

38dd13b

…stDeploy into update_xvllm_download

update doc

a9328f8

hong19860320 approved these changes Jul 15, 2025

hong19860320 changed the title ~~[XPU] update xvllm download~~ [XPU] Update doc and add scripts for downloading dependencies Jul 15, 2025

yulangz merged commit 17314ee into PaddlePaddle:develop Jul 16, 2025
3 of 4 checks passed

yulangz merged 9 commits into PaddlePaddle:develop from yulangz:update_xvllm_download
