CUDA: mul_mat_q=true as default for llama_context_params by JohannesGaessler · Pull Request #2912 · ggml-org/llama.cpp

JohannesGaessler · 2023-08-30T18:14:26Z

As pointed out by #2683 (comment) , in a previous PR I forgot to change the mul_mat_q default in llama_context_default_params. So this PR sets it to true in line with CLI use.

CUDA: mul_mat_q=true llama_context_params default

be1ddb1

slaren approved these changes Aug 30, 2023

View reviewed changes

JohannesGaessler merged commit 8afe228 into ggml-org:master Aug 30, 2023

JohannesGaessler deleted the cuda-mmq-default-2 branch June 23, 2024 10:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CUDA: mul_mat_q=true as default for llama_context_params#2912

CUDA: mul_mat_q=true as default for llama_context_params#2912
JohannesGaessler merged 1 commit intoggml-org:masterfrom
JohannesGaessler:cuda-mmq-default-2

JohannesGaessler commented Aug 30, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

JohannesGaessler commented Aug 30, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants