9 Commits (release-1.1)

| Author | SHA1 | Message | Date |
|---|---|---|---|
| Megvii Engine Team | 89ad33aeb3 | feat(dnn/cuda): support weight preprocessing for cutlass algorithms | 4 years ago |
| Megvii Engine Team | 739f927c4c | feat(dnn/cuda): opt dp4a conv for small channel base on cutlass | 4 years ago |
| Megvii Engine Team | 4aa277a203 | refactor(dnn/cuda): misc | 4 years ago |
| Megvii Engine Team | 310c805f20 | fix(dnn/cuda): use kernel parameter instead of user constant memory | 4 years ago |
| Megvii Engine Team | 3a03fa7a50 | fix(dnn/cuda): disable pascal sass conv2d | 4 years ago |
| Megvii Engine Team | 76fa71573b | feat(dnn/cuda): add cutlass nchw4 convolution | 4 years ago |
| Megvii Engine Team | aeffcd5897 | feat(dnn/cuda): integrate cutlass nchw32 tensorcore convolution | 4 years ago |
| Megvii Engine Team | 69fe5ab3b3 | feat(dnn/cuda): add conv2d-sass-kernel | 5 years ago |
| Megvii Engine Team | f91881ffdc | MegEngine: Initial commit of MegEngine. | 5 years ago |