You can not select more than 25 topics Topics must start with a chinese character,a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
Megvii Engine Team e0d505e6bd fix(mgb/dnn): fix bug that some cutlass file compile very slowly on SM86 2 years ago
..
BUILD feat(dnn/cuda): add implicit bmm kernels for large kernel depthwise convolution backward filter opr 3 years ago
README.md feat(dnn/cuda): generate cutlass kimpls using cmake and bazel 3 years ago
conv2d_operation.py build(mgb/cutlass): merge partial headers 3 years ago
gemm_operation.py build(mgb/cutlass): merge partial headers 3 years ago
gen_list.py fix(mgb/dnn): fix bug that some cutlass file compile very slowly on SM86 2 years ago
generator.py fix(mgb/dnn): fix bug that some cutlass file compile very slowly on SM86 2 years ago
library.py feat(dnn/cuda): add implicit bmm kernels for large kernel depthwise convolution backward filter opr 3 years ago
list.bzl fix(mgb/dnn): fix bug that some cutlass file compile very slowly on SM86 2 years ago
manifest.py feat(dnn/cuda): add implicit bmm large kernel dwconv2d fprop impl 3 years ago