Conversation
Signed-off-by: Cheng Penghui <penghui.cheng@intel.com>
⛈️ Required checks status: Has failure 🔴
Groups summary🟢 Format Scan Tests workflow
These checks are required after the changes to 🔴 Optimize Unit Test workflow
These checks are required after the changes to 🟢 NeuralChat Unit Test
These checks are required after the changes to 🟢 Engine Unit Test workflow
These checks are required after the changes to 🟡 Chat Bot Test workflow
These checks are required after the changes to Thank you for your contribution! 💜
|
Signed-off-by: Cheng Penghui <penghui.cheng@intel.com>
Signed-off-by: Meng, Hengyu <hengyu.meng@intel.com>
Signed-off-by: Meng, Hengyu <hengyu.meng@intel.com>
Type of Change
feature
No API changed
Description
Removed fallback of lm_head op for WOQ
Expected Behavior & Potential Risk
Don't fallback lm_head when weight-only quantization.
How has this PR been tested?
Local tested