50 Commits (957d1d40e680e2cc9c38169fc365d8d7d9a5a49f)

Author SHA1 Message Date
  Megvii Engine Team 957d1d40e6 fix(src/gopt): add replace func of typecvt opr for nhwcd4 pass 4 years ago
  Megvii Engine Team ba2ad46e54 feat(gopt): add deconv nchw4 int8 opt pass, add deconv nchw int8 4 years ago
  Megvii Engine Team a3ea1f153c feat(mgb/opr): add fast profile and combined Execution strategy 4 years ago
  Megvii Engine Team c82d88751a fix(dnn/cuda): add cuda nchw int8 conv impl with nchw4 to fix cu111 compatibility 4 years ago
  Megvii Engine Team 51868533c8 fix(mgb/gopt): fix opt pass elementwise operation shape issue at tranform to NCHW4 4 years ago
  Megvii Engine Team cf27dd642c fix(cuda): use cudnn8.0.4 as cu111 default libs 4 years ago
  Megvii Engine Team 649e4dd750 test(cuda): fix test for cu111 4 years ago
  Megvii Engine Team 2e4b9a42f7 fix(mgb/gopt): fix folding conv dimshuffle opt pass 4 years ago
  Megvii Engine Team 364afec033 chore(mge): update copyright years 4 years ago
  Megvii Engine Team 1b1ad56a82 fix(mgb/gopt): fix warp fusion opt pass 4 years ago
  Megvii Engine Team 4e9be159f7 feat(mgb/gopt): add opt pass for fusing convolution and reformat 4 years ago
  Megvii Engine Team 61f917fb8e feat(dnn/cuda): add impl for fusing warp perspective and dimshuffle 4 years ago
  Megvii Engine Team 3bf73ff16f feat(dnn): add cuda preprocess fusion 4 years ago
  Megvii Engine Team 5f171298aa feat(mgb/gopt): add AxisAddRemove opr support for cd4 opt pass 4 years ago
  Megvii Engine Team 6f5d0febf1 perf(dnn/cuda): enhance performance for pooling forward 4 years ago
  Megvii Engine Team 7cd71c3102 fix(mgb/gopt): fix cd4 elewise transform 4 years ago
  Megvii Engine Team cab46f2a00 fix(dnn): fix relayout format when group conv group=1 4 years ago
  Megvii Engine Team 4afa4b72c3 fix(mgb/oppass): fix fuse conv bias pass when bias is full bias and bias 4 years ago
  Megvii Engine Team 40e778fbd1 fix(mgb): fix-deconv-io16c32-pass-fail 4 years ago
  Megvii Engine Team ee2e2b3c7b fix(mgb/gopt): fix nchwxx optpass of no handle conv_bias opr which with no bias 4 years ago
  Megvii Engine Team 8f44d6ea60 fix(mgb): fix optpass fail at transform NCHW to NCHW4 when input dtype is float 4 years ago
  Megvii Engine Team fade97d4ef fix(mgb/gopt): fix convert batchnorm to elemwise pass issue 4 years ago
  Megvii Engine Team c20d4cc6dc feat(dnn): fix opt pass nchw44 can not dump resnet 4 years ago
  Megvii Engine Team 56381f808b fix(dnn/arm): use vcvtq_f32_s32 for all arm code 4 years ago
  Megvii Engine Team 1173205726 fix(gopt): nchw_nchwxx useable and opt pass use nchw_nchwxx_valid 4 years ago
  Megvii Engine Team eb18eba87d fix(gopt): fix nchw44 nchw44_dot gopt test 4 years ago
  Megvii Engine Team eab7ab0530 fix(gopt): gen nchw_nchw44 when kernel is optimized 4 years ago
  Megvii Engine Team 777f3ea970 refactor(gopt): format code 4 years ago
  Megvii Engine Team 4348960c40 fix(mge/gopt): fix fp16 compute mode 4 years ago
  Megvii Engine Team c1fb3c8c1a fix(mgb/gopt): fix ConvertF32ToF16Pass endpoints 4 years ago
  Megvii Engine Team ab3efab066 fix(mgb): add compute capability check 4 years ago
  Megvii Engine Team 8070f40aa1 fix(mgb/gopt): fix gopt nchwxx convert elemwise and reshape 4 years ago
  Megvii Engine Team 486cbdea8b fix(mgb/opt): nchw to nchw4 pass suppport ic less than 4 5 years ago
  Megvii Engine Team 2e70cf1dd0 feat(mgb/opt): add nchw->nchw4 in tensorcore pass 5 years ago
  Megvii Engine Team 1e8337f14a fix(mgb/gopt): remove redundant reshape in nchw->nchw4 pass 5 years ago
  Megvii Engine Team 0ccb965c8e fix(mgb/gopt): fix convert format nchw->nchw4 pass 5 years ago
  Megvii Engine Team 50db9b84c2 fix(gopt): fix paramfuse if the endpoint is const 5 years ago
  Megvii Engine Team f56f187f6e fix(mbg/gopt): fix nchw44-dot channel wise trans to nchw44 5 years ago
  Megvii Engine Team f1c86606cb fix(dnn/cuda): fix FuseConvBiasWithZ pass for HSwish activation 5 years ago
  Megvii Engine Team 30b3d3aa3e fix(dnn/gopt): add convolution nchw44-dot format gopt 5 years ago
  Megvii Engine Team 45e2beead6 feat(mgb/gopt): add nchw4 optpass 5 years ago
  Megvii Engine Team e080dd3c8c refactor(gopt): rename nchw2xxx to xxx 5 years ago
  Megvii Engine Team a3560fa10e feat(gopt): add tranform to chwn4 to optimize_for_inference 5 years ago
  Megvii Engine Team 1fb7d34f0b refactor(gopt): refactor layout transform for optimize for inference 5 years ago
  Megvii Engine Team e109ae919c fix(mgb/gopt): fix float32 to float16 opt pass 5 years ago
  Megvii Engine Team 1255c9f13d feat(mge/opr): add opr remap in opencl and naive 5 years ago
  Megvii Engine Team 554ce352c8 feat(mgb/gopt): add nchw44 optpass 5 years ago
  Megvii Engine Team 8b0315b3b1 fix(mgb): fix nhwcd4 optpass 5 years ago
  Megvii Engine Team a94fb7b126 fix(mgb/gopt): fix convert format opt pass by ensuring the replacement won't change output var's channel 5 years ago
  Megvii Engine Team f91881ffdc MegEngine: Initial commit of MegEngine. 5 years ago