Public
kernels
13 items
Name              Size       Modified      Type  Last modified by
api.cuh           11.86 KB   Jan 23, 2026  CUH   WebDev
buffer.cuh        4.7 KB     Jan 23, 2026  CUH   WebDev
CMakeLists.txt    887 B      Jan 23, 2026  TXT   WebDev
configs.cuh       2.18 KB    Jan 23, 2026  CUH   WebDev
exception.cuh     2.78 KB    Jan 23, 2026  CUH   WebDev
ibgda_device.cuh  21.71 KB   Jan 23, 2026  CUH   WebDev
internode.cu      127.56 KB  Jan 23, 2026  CU    WebDev
internode_ll.cu   67.92 KB   Jan 23, 2026  CU    WebDev
intranode.cu      56.02 KB   Jan 23, 2026  CU    WebDev
launch.cuh        7.12 KB    Jan 23, 2026  CUH   WebDev
layout.cu         6.78 KB    Jan 23, 2026  CU    WebDev
runtime.cu        2.89 KB    Jan 23, 2026  CU    WebDev
utils.cuh         24.6 KB    Jan 23, 2026  CUH   WebDev
About
DeepEP is a high-performance communication library developed by DeepSeek-AI. It optimizes data movement between GPUs during training and inference of large Mixture-of-Experts (MoE) models such as DeepSeek-V3.
39 files
8 folders
1.76 MB total size
Updated Jan 23, 2026
Languages
Python   49.0%
C++      46.3%
Shell    2.8%
Text     1.0%
LICENSE  0.4%
TOML     0.2%
YAML     0.2%