25 Commits (cbbca5fb101caa9797c05abedb85f427a3e8bc8c)

Author SHA1 Message Date
  Megvii Engine Team 369c2ccc5a style(all): reformat c++ code 3 years ago
  Megvii Engine Team 11f022ff7c feat(dnn/cuda): add nhwc int8 imma conv and conv fuse typecvt 3 years ago
  Megvii Engine Team c25125e3d2 perf(dnn/cuda): sass int8 epilogue remove shared load 3 years ago
  Megvii Engine Team b18feaab33 feat(dnn/cuda): use cutlass remove shared load imma conv kernel 4 years ago
  Megvii Engine Team 3eb0505f9b feat(imperative): add support for quantized conv transpose2d 3 years ago
  Megvii Engine Team 869a03271b perf(mgb): disable FoldingConvBiasDimshufflePass in cuda10 for performance 3 years ago
  Megvii Engine Team 8d248a6a9a fix(dnn/cuda): fix testcase for fallback nchw qs8 conv 4 years ago
  Megvii Engine Team 633016a962 fix(dnn/cuda): fix AlgoFallbackNCHWQS8 to support Float32 dst 4 years ago
  Megvii Engine Team 319436dd14 feat(dnn/cuda): add cutlass impls for uint4 x int4 conv bias 4 years ago
  Megvii Engine Team d28eba4ea5 feat(dnn/cuda): add cutlass impls for int4 conv bias 4 years ago
  Megvii Engine Team e250afb08f feat(dnn/cuda): support conv_bias for nchw64 and qint4 4 years ago
  Megvii Engine Team f14e0c17e7 feat(mgb): add recursive for fastrun and megdnn test 4 years ago
  Megvii Engine Team 364afec033 chore(mge): update copyright years 4 years ago
  Megvii Engine Team c3a4b2225d feat(dnn/cuda): add cutlass impls for fused convolution reformat operation 4 years ago
  Megvii Engine Team 5f44203d7b feat(dnn/cuda): add a cutlass impl for fusing convolution and dimshuffle 4 years ago
  Megvii Engine Team a1877ee0fa refactor(dnn): refactor algo interface, use algoinfo instead of global algorithm 4 years ago
  Megvii Engine Team 89ad33aeb3 feat(dnn/cuda): support weight preprocessing for cutlass algorithms 4 years ago
  Megvii Engine Team 739f927c4c feat(dnn/cuda): opt dp4a conv for small channel base on cutlass 4 years ago
  Megvii Engine Team 4aa277a203 refactor(dnn/cuda): misc 4 years ago
  Megvii Engine Team 310c805f20 fix(dnn/cuda): use kernel parameter instead of user constant memory 4 years ago
  Megvii Engine Team 3a03fa7a50 fix(dnn/cuda): disable pascal sass conv2d 4 years ago
  Megvii Engine Team 76fa71573b feat(dnn/cuda): add cutlass nchw4 convolution 4 years ago
  Megvii Engine Team aeffcd5897 feat(dnn/cuda): integrate cutlass nchw32 tensorcore convolution 4 years ago
  Megvii Engine Team 69fe5ab3b3 feat(dnn/cuda): add conv2d-sass-kernel 5 years ago
  Megvii Engine Team f91881ffdc MegEngine: Initial commit of MegEngine. 5 years ago