69 Commits (revert-411-add-tools)

Author SHA1 Message Date
  Megvii Engine Team accb2d8d47 fix(mgb/serialize): fix flatbuffer compatibility issues 3 years ago
  Megvii Engine Team b3e54eade1 feat(dnn/bn): use new cudnn BN kernel to support NHWC 3 years ago
  Megvii Engine Team 3977b7aa0b feat(mgb/shuffle): add shuffle opr 3 years ago
  Megvii Engine Team 4f28e14684 fix(dnn): fix compatibility broken of convolution format 3 years ago
  Megvii Engine Team fb49a2834f refactor(mgb/dnn): refactor enum used in serializing 3 years ago
  Megvii Engine Team 16678bb998 fix(dnn): fix_short_cutlass_name_gemm 3 years ago
  Megvii Engine Team 4c13bc7e1b feat(dnn/cuda): add nhwc int8 deconv 3 years ago
  Megvii Engine Team 11f022ff7c feat(dnn/cuda): add nhwc int8 imma conv and conv fuse typecvt 3 years ago
  Megvii Engine Team ff0e6be7b9 fix(dnn/cuda): fix cutlass tensorop kernels 3 years ago
  Megvii Engine Team 336761253d feat(dnn/cuda): add tensorcore matmul for fp16 data type 3 years ago
  Megvii Engine Team 2c4ee99227 fix(dnn): short cutlass filename in windows 3 years ago
  Megvii Engine Team 432592374d build(dnn/cuda): fix cmake compile dependency for cutlass kernels 3 years ago
  Megvii Engine Team eab6afab47 feat(mgb): add padding opr for megbrain 4 years ago
  Megvii Engine Team 9b4b910dc1 feat(dnn/cuda): integrate cutlass operation table and replace all cutlass wrappers 3 years ago
  Megvii Engine Team b18feaab33 feat(dnn/cuda): use cutlass remove shared load imma conv kernel 4 years ago
  Megvii Engine Team 1af350c6d2 feat(dnn): add fill kernel 3 years ago
  Megvii Engine Team 287cab49c2 fix(mgb/sereg): fix rng operator compatibility 3 years ago
  Megvii Engine Team f76a2cc2c6 feat(mge/opr): add silu and gelu 3 years ago
  Megvii Engine Team f8b0f2cb91 build(dnn/cutlass): fix build for cutlass 3 years ago
  Megvii Engine Team 4eda338876 feat(dnn/cuda): generate cutlass kimpls using cmake and bazel 4 years ago
  Megvii Engine Team 894a2407c2 feat(dnn/cuda): add relayout format kernel for nchw <-> nhwc 4 years ago
  Megvii Engine Team 5a14a89224 refactor(dnn/cuda): refactor cutlass kernel generator for gemm and gemv 4 years ago
  Megvii Engine Team 4abf7bd36f refactor(dnn/cuda): refactor kernel generator for cutlass convolution kernels 4 years ago
  Megvii Engine Team 66f70578c2 feat(dnn/cuda): add convolution with i8 input and i4 output 4 years ago
  Megvii Engine Team 43098fb8f1 feat(mge): add SlidingWindowTranspose opr 4 years ago
  Megvii Engine Team b078dda90b feat(mge/random): add some random op and remove random/distrbution.py 4 years ago
  Megvii Engine Team f30c0e06a6 feat(mgb/opr): add lsq opr 4 years ago
  Megvii Engine Team 12a0e61542 feat(dnn/cuda): add cuda elemwise int4 4 years ago
  Megvii Engine Team 71c2f61254 feat(dnn/cuda): add relayout format to support layout transform between NCHW and NCHW64 4 years ago
  Megvii Engine Team ed92207585 feat(dnn/cuda): add conv bias impl for int4 data type using sass language 4 years ago
  Megvii Engine Team 1525a02530 feat(mge/module): add python wrapper for unfold 4 years ago
  Megvii Engine Team 1997b1a289 feat(dnn/cuda): add correlation kernel 4 years ago
  Megvii Engine Team 8494a1529e chore(scripts): clarify and fix default value of bit combined enum 4 years ago
  Megvii Engine Team a3ea1f153c feat(mgb/opr): add fast profile and combined Execution strategy 4 years ago
  Megvii Engine Team c82d88751a fix(dnn/cuda): add cuda nchw int8 conv impl with nchw4 to fix cu111 compatibility 4 years ago
  Megvii Engine Team 2de2222e46 feat(dnn/cuda): add cutlass batched gemv kernel for matmul operator 4 years ago
  Megvii Engine Team 973d2a0ac2 feat(dnn/cuda): add cutlass matmul using split k parallel 4 years ago
  Megvii Engine Team 03c921f7c4 feat(dnn/cuda): add cutlass matmul impls 4 years ago
  Megvii Engine Team ad87f78a14 chore(imperative): refine tblgen for generating op name 4 years ago
  Megvii Engine Team 55042195d4 chore(winograd): add Convolutionv2 param 4 years ago
  Megvii Engine Team a85531dd0f feat(mgb/opr): add tqt opr 4 years ago
  Megvii Engine Team 61f917fb8e feat(dnn/cuda): add impl for fusing warp perspective and dimshuffle 4 years ago
  Megvii Engine Team fc0fcd2f7f chore(winograd): remove winograd transform code 4 years ago
  Megvii Engine Team 69e3e32240 feat(imperative): auto generated opdef header and python binding 4 years ago
  Megvii Engine Team 3bf73ff16f feat(dnn): add cuda preprocess fusion 4 years ago
  Megvii Engine Team 6856ce9ce2 feat(dnn): support conv bias activation for nchw4 input tensor format and nchw output tensor format 4 years ago
  Megvii Engine Team c03249c059 feat(dnn/opr): add megdnn fake quant opr 4 years ago
  Megvii Engine Team ba66e1d039 feat(dnn): add nchw_fp32 nchw44_qint8 cuda dct 4 years ago
  Megvii Engine Team a9f98e9c66 refactor(meg/internal): move interal codes back to megbrain 4 years ago
  Megvii Engine Team 5a85c907e0 feat(mgb/opr): add megbrain adaptive pooling opr 4 years ago