About

FlashMLA is a collection of highly optimized attention kernels (core code modules) developed by DeepSeek-AI. It is not a user-facing application but a foundational library that powers DeepSeek's large language models, such as DeepSeek-V3 and DeepSeek-V3.2-Exp.


130 files
53 folders
1.13 MB total size
Updated Jan 23, 2026
Languages
C++ 60.1%
C 20.3%
Python 19.5%
LICENSE 0.2%