You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Pull Request resolved: #16565
This diff makes cuda backend actually use slimtensor.
It:
1. updates cuda_backends to create slimtensor from given etensor
2. removed duplicate etensor-driven shim layers under cuda_backend
3. update cmake logic in both cuda backend and aoti backend
Perf maintains the same. Shows as before.
{F1984982156}
Worth to notice that currently we keeps two sets of common shims, one is etensor-based and for metal backend, the other is slimtensor-based which used by cuda backend, to not impact metal backend work.
When Metal backend finishs the migration, we should delete the duplicate common shims and only keep slimtensor-based one.
ghstack-source-id: 336538676
@exported-using-ghexport
Differential Revision: [D90606409](https://our.internmc.facebook.com/intern/diff/D90606409/)
0 commit comments