Skip to content

Commit 3563f5d

Browse files
authored
[ChatQnA]Update manifests (#716)
* update manifests for v0.9
1 parent 8c40204 commit 3563f5d

File tree

4 files changed

+7
-7
lines changed

4 files changed

+7
-7
lines changed

ChatQnA/benchmark/four_gaudi/chatqna_config_map.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -15,7 +15,7 @@ data:
1515
TGI_LLM_ENDPOINT: http://llm-dependency-svc.default.svc.cluster.local:9009
1616
REDIS_URL: redis://vector-db.default.svc.cluster.local:6379
1717
INDEX_NAME: rag-redis
18-
HUGGINGFACEHUB_API_TOKEN: {HF_TOKEN}
18+
HUGGINGFACEHUB_API_TOKEN: ${HF_TOKEN}
1919
EMBEDDING_SERVICE_HOST_IP: embedding-svc
2020
RETRIEVER_SERVICE_HOST_IP: retriever-svc
2121
RERANK_SERVICE_HOST_IP: reranking-svc

ChatQnA/benchmark/four_gaudi/llm-dependency_run.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -35,9 +35,9 @@ spec:
3535
- --model-id
3636
- $(LLM_MODEL_ID)
3737
- --max-input-length
38-
- '2048'
38+
- '1024'
3939
- --max-total-tokens
40-
- '4096'
40+
- '2048'
4141
- --max-batch-total-tokens
4242
- '65536'
4343
- --max-batch-prefill-tokens

ChatQnA/benchmark/single_gaudi/llm-dependency_run.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -35,9 +35,9 @@ spec:
3535
- --model-id
3636
- $(LLM_MODEL_ID)
3737
- --max-input-length
38-
- '2048'
38+
- '1024'
3939
- --max-total-tokens
40-
- '4096'
40+
- '2048'
4141
- --max-batch-total-tokens
4242
- '65536'
4343
- --max-batch-prefill-tokens

ChatQnA/benchmark/two_gaudi/llm-dependency_run.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -35,9 +35,9 @@ spec:
3535
- --model-id
3636
- $(LLM_MODEL_ID)
3737
- --max-input-length
38-
- '2048'
38+
- '1024'
3939
- --max-total-tokens
40-
- '4096'
40+
- '2048'
4141
- --max-batch-total-tokens
4242
- '65536'
4343
- --max-batch-prefill-tokens

0 commit comments

Comments
 (0)