35 Commits (7b17c1180eeab72528e5e09877e2b1d801d038e7)

Author SHA1 Message Date
  Megvii Engine Team 7b17c1180e refactor(dnn): make cudnn_frontend work 3 years ago
  Megvii Engine Team 35e9cc9845 feat(dnn/cuda): add cudnn frontend api 3 years ago
  Megvii Engine Team c2e9860feb chore(license): remove all license in file header 3 years ago
  Megvii Engine Team b6ad457269 feat(cuda): support int1 simplewq conv 3 years ago
  Megvii Engine Team 72403e8929 perf(cuda): speedup chanwise conv with small feature map and large filter size 3 years ago
  Megvii Engine Team 47fe766310 feat(dnn/cuda): add implicit bmm kernels for large kernel depthwise convolution backward filter opr 3 years ago
  Megvii Engine Team 888f4e46ae feat(dnn/cuda): add implicit bmm large kernel dwconv2d dgrad kernels 3 years ago
  Megvii Engine Team 08d8635ff5 feat(dnn/cuda): add implicit bmm large kernel dwconv2d fprop impl 3 years ago
  Megvii Engine Team 369c2ccc5a style(all): reformat c++ code 3 years ago
  Megvii Engine Team 11f022ff7c feat(dnn/cuda): add nhwc int8 imma conv and conv fuse typecvt 3 years ago
  Megvii Engine Team 9b4b910dc1 feat(dnn/cuda): integrate cutlass operation table and replace all cutlass wrappers 3 years ago
  Megvii Engine Team b18feaab33 feat(dnn/cuda): use cutlass remove shared load imma conv kernel 4 years ago
  Megvii Engine Team 3eb0505f9b feat(imperative): add support for quantized conv transpose2d 3 years ago
  Megvii Engine Team 6ce212d2e0 refactor(mgb): refactor group conv 4 years ago
  Megvii Engine Team f41a808694 feat(dnn/cuda): add nhwc int4 conv support 4 years ago
  Megvii Engine Team 7b2a76d1ee refactor(mgb): make conv handle noncontiguous tensors 4 years ago
  Megvii Engine Team 319436dd14 feat(dnn/cuda): add cutlass impls for uint4 x int4 conv bias 4 years ago
  Megvii Engine Team d28eba4ea5 feat(dnn/cuda): add cutlass impls for int4 conv bias 4 years ago
  Megvii Engine Team 938944027d fix(mgb/dnn): fix cudnn8 convbias 4 years ago
  Megvii Engine Team 55974e8cf9 feat(log): opt log 4 years ago
  Megvii Engine Team c82d88751a fix(dnn/cuda): add cuda nchw int8 conv impl with nchw4 to fix cu111 compatibility 4 years ago
  Megvii Engine Team 0e3a6329ff build(cuda): support cu111 build 4 years ago
  Megvii Engine Team e4b71bdf64 refactor(megdnn): remove unnessary 1x1 algo 4 years ago
  Megvii Engine Team 659217acd2 refactor(megdnn): refactor bfloat16 convbias to recursive inteface 4 years ago
  Megvii Engine Team 364afec033 chore(mge): update copyright years 4 years ago
  Megvii Engine Team a3caa5d3b7 fix(mgb(dnn)): fix convbias cudnnConvBiasActivation 4 years ago
  Megvii Engine Team a1877ee0fa refactor(dnn): refactor algo interface, use algoinfo instead of global algorithm 4 years ago
  Megvii Engine Team 739f927c4c feat(dnn/cuda): opt dp4a conv for small channel base on cutlass 4 years ago
  Megvii Engine Team 4aa277a203 refactor(dnn/cuda): misc 4 years ago
  Megvii Engine Team 3a03fa7a50 fix(dnn/cuda): disable pascal sass conv2d 4 years ago
  Megvii Engine Team 76fa71573b feat(dnn/cuda): add cutlass nchw4 convolution 4 years ago
  Megvii Engine Team aeffcd5897 feat(dnn/cuda): integrate cutlass nchw32 tensorcore convolution 4 years ago
  Megvii Engine Team 69fe5ab3b3 feat(dnn/cuda): add conv2d-sass-kernel 5 years ago
  Megvii Engine Team 0293d58ade feat(mge): add bfloat16 support 5 years ago
  Megvii Engine Team f91881ffdc MegEngine: Initial commit of MegEngine. 5 years ago