Public
legacy
6 items
..
Go back to parent directory
__init__.py
215 B
Jan 23, 2026
PY
Last modified by WebDev
a_fused_k_grouped_gemm.py
4.14 KB
Jan 23, 2026
PY
Last modified by WebDev
a_fused_m_grouped_gemm.py
4.35 KB
Jan 23, 2026
PY
Last modified by WebDev
b_fused_k_grouped_gemm.py
4.13 KB
Jan 23, 2026
PY
Last modified by WebDev
m_grouped_gemm.py
3.86 KB
Jan 23, 2026
PY
Last modified by WebDev
tune_options.py
2.17 KB
Jan 23, 2026
PY
Last modified by WebDev
About
DeepGEMM is a low-level, high-performance library specifically designed for matrix multiplication operations on NVIDIA GPUs, with a special focus on optimizing large AI models like DeepSeek's.
101 files
23 folders
799.33 KB total size
0 open issues
0 open pull requests
0 watchers
0 forks
0 stars
132 views
Updated Jan 23, 2026
Languages
C++
64.8%
Python
31.2%
YAML
3.2%
Text
0.3%
Shell
0.3%
LICENSE
0.2%