Public
jit
5 items
cache.hpp           820 B    Jan 23, 2026  HPP  Last modified by WebDev
compiler.hpp        14.6 KB  Jan 23, 2026  HPP  Last modified by WebDev
device_runtime.hpp  2.8 KB   Jan 23, 2026  HPP  Last modified by WebDev
handle.hpp          5.98 KB  Jan 23, 2026  HPP  Last modified by WebDev
kernel_runtime.hpp  4.61 KB  Jan 23, 2026  HPP  Last modified by WebDev
About
DeepGEMM is a low-level, high-performance library for matrix multiplication on NVIDIA GPUs, with a particular focus on optimizing large AI models such as DeepSeek's.
101 files
23 folders
799.33 KB total size
0 open issues
0 open pull requests
0 watchers
0 forks
0 stars
114 views
Updated Jan 23, 2026
Languages
C++      64.8%
Python   31.2%
YAML     3.2%
Text     0.3%
Shell    0.3%
LICENSE  0.2%