This project has been moved to xlite-dev/LeetCUDA. Please check xlite-dev/LeetCUDA for latest updates! 👏👋
forked from xlite-dev/LeetCUDA
📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
DefTruth/CUDA-Learn-Notes
Languages
- Cuda 89.3%
- Python 8.4%
- C++ 2.1%
- Other 0.2%