Skip to main content
Public
Browse Files
92b30191f76d90a896ed0c5d9672f3a78809b064
Full Commit Hash
Commit Details
124 Added

Initial commit - Upload project 'deepgemm'

WebDev
Authored
January 23, 2026, 5:19 am
Statistics
124
Files Added
0
Files Modified
0
Files Deleted
0
Files Renamed
Changed Files 124 files
third-party
A Added · .
cutlass
A Added · third-party
fmt
A Added · third-party
.github
A Added · .
workflows
A Added · .github
_build.yml 9.88 KB
A Added · .github/workflows
build.yml 1.54 KB
A Added · .github/workflows
publish.yml 3.48 KB
A Added · .github/workflows
.gitignore 303 B
A Added · .
.gitmodules 202 B
A Added · .
build.sh 290 B
A Added · .
CMakeLists.txt 1.46 KB
A Added · .
csrc
A Added · .
apis
A Added · csrc
attention.hpp 13.02 KB
A Added · csrc/apis
einsum.hpp 9.88 KB
A Added · csrc/apis
gemm.hpp 34.84 KB
A Added · csrc/apis
hyperconnection.hpp 2.66 KB
A Added · csrc/apis
layout.hpp 5.91 KB
A Added · csrc/apis
runtime.hpp 926 B
A Added · csrc/apis
indexing
A Added · csrc
main.cu 926 B
A Added · csrc/indexing
jit
A Added · csrc
jit_kernels
A Added · csrc
heuristics
A Added · csrc/jit_kernels
common.hpp 15.05 KB
A Added · csrc/jit_kernels/heuristics
sm100.hpp 7.17 KB
A Added · csrc/jit_kernels/heuristics
sm90.hpp 7.21 KB
A Added · csrc/jit_kernels/heuristics
impls
A Added · csrc/jit_kernels
epilogue.hpp 255 B
A Added · csrc/jit_kernels/impls
runtime_utils.hpp 11.5 KB
A Added · csrc/jit_kernels/impls
sm100_bf16_gemm.hpp 20.69 KB
A Added · csrc/jit_kernels/impls
sm100_bmk_bnk_mn.hpp 5 KB
A Added · csrc/jit_kernels/impls
sm100_fp8_gemm_1d1d.hpp 22.67 KB
A Added · csrc/jit_kernels/impls
sm100_tf32_hc_prenorm_gemm.hpp 6.02 KB
A Added · csrc/jit_kernels/impls
sm90_bf16_gemm.hpp 20.16 KB
A Added · csrc/jit_kernels/impls
sm90_bmk_bnk_mn.hpp 4.5 KB
A Added · csrc/jit_kernels/impls
sm90_fp8_gemm_1d1d.hpp 10.24 KB
A Added · csrc/jit_kernels/impls
sm90_fp8_gemm_1d2d.hpp 17.14 KB
A Added · csrc/jit_kernels/impls
sm90_tf32_hc_prenorm_gemm.hpp 6.01 KB
A Added · csrc/jit_kernels/impls
smxx_clean_logits.hpp 2.53 KB
A Added · csrc/jit_kernels/impls
smxx_cublaslt.hpp 8.71 KB
A Added · csrc/jit_kernels/impls
smxx_fp8_mqa_logits.hpp 6.63 KB
A Added · csrc/jit_kernels/impls
smxx_fp8_paged_mqa_logits.hpp 11.16 KB
A Added · csrc/jit_kernels/impls
smxx_layout.hpp 10.21 KB
A Added · csrc/jit_kernels/impls
cache.hpp 820 B
A Added · csrc/jit
compiler.hpp 14.6 KB
A Added · csrc/jit
device_runtime.hpp 2.8 KB
A Added · csrc/jit
handle.hpp 5.98 KB
A Added · csrc/jit
kernel_runtime.hpp 4.61 KB
A Added · csrc/jit
python_api.cpp 755 B
A Added · csrc
utils
A Added · csrc
compatibility.hpp 719 B
A Added · csrc/utils
exception.hpp 3.3 KB
A Added · csrc/utils
format.hpp 124 B
A Added · csrc/utils
hash.hpp 1.04 KB
A Added · csrc/utils
layout.hpp 4.65 KB
A Added · csrc/utils
lazy_init.hpp 544 B
A Added · csrc/utils
math.hpp 647 B
A Added · csrc/utils
system.hpp 3.08 KB
A Added · csrc/utils
deep_gemm
A Added · .
__init__.py 3.23 KB
A Added · deep_gemm
include
A Added · deep_gemm
deep_gemm
A Added · deep_gemm/include
common
A Added · deep_gemm/include/deep_gemm
cute_tie.cuh 1.74 KB
A Added · deep_gemm/include/deep_gemm/common
epilogue_utils.cuh 828 B
A Added · deep_gemm/include/deep_gemm/common
reduction.cuh 2.44 KB
A Added · deep_gemm/include/deep_gemm/common
scheduler.cuh 13.31 KB
A Added · deep_gemm/include/deep_gemm/common
sm100_utils.cuh 10.73 KB
A Added · deep_gemm/include/deep_gemm/common
sm90_utils.cuh 16.63 KB
A Added · deep_gemm/include/deep_gemm/common
tma_utils.cuh 6.67 KB
A Added · deep_gemm/include/deep_gemm/common
types.hpp 1.01 KB
A Added · deep_gemm/include/deep_gemm/common
utils.cuh 5.27 KB
A Added · deep_gemm/include/deep_gemm/common
impls
A Added · deep_gemm/include/deep_gemm
sm100_bf16_gemm.cuh 27.55 KB
A Added · deep_gemm/include/deep_gemm/impls
sm100_bmk_bnk_mn.cuh 12.81 KB
A Added · deep_gemm/include/deep_gemm/impls
sm100_fp8_gemm_1d1d.cuh 32.52 KB
A Added · deep_gemm/include/deep_gemm/impls
sm100_fp8_mqa_logits.cuh 19.63 KB
A Added · deep_gemm/include/deep_gemm/impls
sm100_fp8_paged_mqa_logits.cuh 18.63 KB
A Added · deep_gemm/include/deep_gemm/impls
sm100_tf32_hc_prenorm_gemm.cuh 17.14 KB
A Added · deep_gemm/include/deep_gemm/impls
sm90_bf16_gemm.cuh 21.42 KB
A Added · deep_gemm/include/deep_gemm/impls
sm90_bmk_bnk_mn.cuh 7.55 KB
A Added · deep_gemm/include/deep_gemm/impls
sm90_fp8_gemm_1d1d.cuh 18.71 KB
A Added · deep_gemm/include/deep_gemm/impls
sm90_fp8_gemm_1d2d.cuh 24.88 KB
A Added · deep_gemm/include/deep_gemm/impls
sm90_fp8_mqa_logits.cuh 16.03 KB
A Added · deep_gemm/include/deep_gemm/impls
sm90_fp8_paged_mqa_logits.cuh 19.49 KB
A Added · deep_gemm/include/deep_gemm/impls
sm90_tf32_hc_prenorm_gemm.cuh 13.54 KB
A Added · deep_gemm/include/deep_gemm/impls
smxx_clean_logits.cuh 3.24 KB
A Added · deep_gemm/include/deep_gemm/impls
smxx_layout.cuh 7.52 KB
A Added · deep_gemm/include/deep_gemm/impls
legacy
A Added · deep_gemm
__init__.py 215 B
A Added · deep_gemm/legacy
a_fused_k_grouped_gemm.py 4.14 KB
A Added · deep_gemm/legacy
a_fused_m_grouped_gemm.py 4.35 KB
A Added · deep_gemm/legacy
b_fused_k_grouped_gemm.py 4.13 KB
A Added · deep_gemm/legacy
m_grouped_gemm.py 3.86 KB
A Added · deep_gemm/legacy
tune_options.py 2.17 KB
A Added · deep_gemm/legacy
testing
A Added · deep_gemm
__init__.py 101 B
A Added · deep_gemm/testing
bench.py 4.64 KB
A Added · deep_gemm/testing
numeric.py 561 B
A Added · deep_gemm/testing
utils.py 1012 B
A Added · deep_gemm/testing
utils
A Added · deep_gemm
__init__.py 69 B
A Added · deep_gemm/utils
layout.py 571 B
A Added · deep_gemm/utils
math.py 4.18 KB
A Added · deep_gemm/utils
develop.sh 725 B
A Added · .
install.sh 331 B
A Added · .
LICENSE 1.04 KB
A Added · .
README.md 9.83 KB
A Added · .
scripts
A Added · .
generate_pyi.py 28.65 KB
A Added · scripts
setup.py 7.84 KB
A Added · .
tests
A Added · .
generators.py 17.33 KB
A Added · tests
test_attention.py 15.31 KB
A Added · tests
test_bf16.py 9.9 KB
A Added · tests
test_einsum.py 8.6 KB
A Added · tests
test_fp8_fp4.py 11.34 KB
A Added · tests
test_hyperconnection.py 2.18 KB
A Added · tests
test_layout.py 4.85 KB
A Added · tests
test_lazy_init.py 306 B
A Added · tests
test_legacy.py 4.45 KB
A Added · tests
test_sanitizer.py 2.54 KB
A Added · tests
Quick Actions
Commit Information
Hash:
92b30191f76d
Commit ID:
77
Created:
2026-01-23 05:19:30
Age:
Jan 23, 2026
Repository:
deepgemm
Total Files:
124
Download Options