起首,颠末号令克隆堆栈及其子模块。而后,创立第三圆库(CUTLASS战CuTe)的标记链交以就开辟。交着,尝试JIT编译功用。最初,尝试统统GEMM完毕。
# Submodule must be clonedgitclone --recursive git@github.com:deepseek-ai/DeepGEMM.git
# Make symbolic links for third-party (CUTLASS and CuTe) include directoriespythonsetup.py develop
# Test JIT compilationpythontests/test_jit.py
# Test all GEMM implements (normal, contiguous-grouped and masked-grouped)pythontests/test_core.py
装置