-
Notifications
You must be signed in to change notification settings - Fork 285
[CK_Tile] Support for various group sizes Preshuffle quant for 2d block scale gemm #3445
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
53 commits
Select commit
Hold shift + click to select a range
527b718
formatted
amd-khushbu c25420e
formatted
amd-khushbu 8999dae
formatting
amd-khushbu 00e61e0
formatting
amd-khushbu e5d6a80
formatting
amd-khushbu a967b49
[CK TILE GEMM] Refactor block_scale_gemm examples
CongMa13 bc26224
[CK TILE GEMM] Refactor block_scale_gemm examples
CongMa13 c553c87
enable prefill shapes
amd-khushbu 8818018
[CK TILE GEMM] Refactor block_scale_gemm examples
CongMa13 9debcc1
[CK TILE GEMM] Refactor block_scale_gemm examples
CongMa13 51ec0c2
merge with Cong's Changes
amd-khushbu 869bc5b
adding preshuffle quant as new parameter and its associated new files
amd-khushbu 075c36b
remove debugging statements
amd-khushbu 0f79fa5
adding test
amd-khushbu b8b5709
enable preshuffle quant with permuteN
amd-khushbu 903800f
rebase with develop
amd-khushbu 48e7559
updating readme and correcponding gemmconfigs
amd-khushbu 36f2f87
updating cmake file
amd-khushbu 07700cc
fixing CI failures for grouped quant gemm
amd-khushbu f5856af
Merge branch 'develop' into lwpck-3984
amd-khushbu 2275548
debugging permuteN
amd-khushbu a974a08
debugging
amd-khushbu cf3f9b5
Merge branch 'develop' into lwpck-3985
amd-khushbu 04aaf97
debugging PermuteN
amd-khushbu 7788979
initial commit
amd-khushbu f290428
working code for preshuffleb
amd-khushbu 3447196
resolving merge conflicts
amd-khushbu 2441260
Merge branch 'develop' into 1dQuantPreshuffleWeight
amd-khushbu 18fe146
Merge branch 'develop' into 2dQuantPreshuffleWeight
ThomasNing 19b78e9
Merge remote-tracking branch 'origin/develop' into 2dQuantPreshuffleW…
amd-khushbu 3021c7a
adding test cases
amd-khushbu 48744f2
initial commit with prints
amd-khushbu 3ea3ca7
debugging
amd-khushbu c28cf0e
fine-grained working
amd-khushbu ec044f5
rebase with develop
amd-khushbu 341d0e3
debugging medium grained
92cbe3c
fixing the tile window
amd-khushbu 995d1a5
resolving merge ocnflicts
amd-khushbu 44aaaac
formatting
amd-khushbu 9ad0687
enabling prefill shapes
amd-khushbu 5a3e7de
Merge branch 'develop' into lwpck-4181
amd-khushbu 373d89d
working prefill shapes
amd-khushbu 05ff943
Merge branch 'develop' into lwpck-4181
amd-khushbu cc994a7
formatted
amd-khushbu 26a1b52
clean up
amd-khushbu 4c382e7
code cleanup
amd-khushbu 7aeff21
resolving merge conflicts
amd-khushbu 27d31ba
bug fix after merging with develop
amd-khushbu 06c4866
Merge branch 'develop' into 2d_preshuffle_quant
ThomasNing 46543df
Merge branch 'develop' into 2d_preshuffle_quant
amd-khushbu 748add0
Merge branch '2d_preshuffle_quant' of https://github.com/ROCm/composa…
amd-khushbu 4d8cba1
clean up after merging with develop
amd-khushbu 6c52e9d
added comments for the tile window and tile distribution encoding
amd-khushbu File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
245 changes: 228 additions & 17 deletions
245
example/ck_tile/38_block_scale_gemm/gemm_bquant_quantgrouped_preshufflequant.cpp
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
645 changes: 44 additions & 601 deletions
645
include/ck_tile/ops/gemm_quant/kernel/gemm_quant_kernel.hpp
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could we also add the partition reasoning of the different condition of tile distribution?