You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Tbh I'm not sure. I was wanting to give this a try at some point. The hierarchy of looping in this repo is largely taken from cutlass docs, which also supports tensor cores, so I assume the overall hierarchy would stay more or less the same.
Sorry, somehow I missed your comment, yes, I looked at the cutlass implementation and it is similar to yours.
I like yours because it teaches beginners like me to learn how to optimize gemms step by step. I will keep following this repo in case you decide to implement the TC version later in the future.
I am also thinking about doing it myself, will comment here if I did.
Great repo! I learned a lot from it and the blog, thank you!
I was wondering if there is an easy way to rewrite the kernels to use tensor cores in volta and above archs.
Thanks.
The text was updated successfully, but these errors were encountered: