82 Commits (b82e8f007c06d37c6d7c05c066850652bb9ba20f)

Author SHA1 Message Date
  Megvii Engine Team b82e8f007c refactor(gopt): refact the padding channel opt pass 2 years ago
  Megvii Engine Team fac67e7c2b feat(gopt): support nchw44 global pooling with fuse_grain 2 years ago
  Megvii Engine Team c2e9860feb chore(license): remove all license in file header 3 years ago
  Megvii Engine Team 87de704a46 feat(gopt): fuse conv h_swish 3 years ago
  Megvii Engine Team 3726f5cc92 feat(gopt): merger consecutive relayout and dimshuffle to one relayout to optimize CD4 performarce 3 years ago
  Megvii Engine Team 1fead9b6b0 feat(gopt): merge consecutive dimshuffle and relayout to one relayout to optimize CD4 performace 3 years ago
  Megvii Engine Team 26d1e4f7ed feat(gopt): optimize cd4 pass rule for elemwise and typecvt to let cd4 start as soon as possible 3 years ago
  Megvii Engine Team 5f4501e0f3 fix(gopt): fix conv bias fuse 2 noline 3 years ago
  Megvii Engine Team e715423f20 feat(src/gopt): add optpass on arm for fusing typecvt and elemwise to elemwise multi type 3 years ago
  Megvii Engine Team 34773ba37b fix(mgb/gopt): tensorcore pass replace BatchConvBias inputs to nchw4 3 years ago
  Megvii Engine Team 369c2ccc5a style(all): reformat c++ code 3 years ago
  Megvii Engine Team 88c1eedbd7 feat(mgb/gopt): enable reduce for nchw44 3 years ago
  Megvii Engine Team c0ccd0ea7e feat(mge/bn): add NHWC support for bn 3 years ago
  Megvii Engine Team d7cc4628f6 perf(gopt): opt concat for OpenCL 3 years ago
  Megvii Engine Team a3cd3fc74f test(mgb/gopt): add testcase for global layout transform 3 years ago
  Megvii Engine Team 3eb0505f9b feat(imperative): add support for quantized conv transpose2d 3 years ago
  Megvii Engine Team 869a03271b perf(mgb): disable FoldingConvBiasDimshufflePass in cuda10 for performance 3 years ago
  Megvii Engine Team 239916a997 fix(mgb/gopt): fix testcase for enable nchw64 pass 4 years ago
  Megvii Engine Team 009c90a2fe feat(mgb/gopt): modify padding policy for 4bit conv bias oprs 4 years ago
  Megvii Engine Team b4687ce8da feat(dnn/cuda): add convolution with i8 input and u4 output 4 years ago
  Megvii Engine Team bba04f02e5 feat(mgb/gopt): add fusion support for conv, astype(s4) and reformat 4 years ago
  Megvii Engine Team 7d3df995cb feat(gopt/inference): allow Float32 output dtype in EnableNCHW4Pass 4 years ago
  Megvii Engine Team 47dcdf3e17 fix(mgb/core): fix dtype and resize modifiers for tensor 4 years ago
  Megvii Engine Team 0fb9cc41e4 fix(gopt): fix nchw64 opt pass 4 years ago
  Megvii Engine Team 86b69cacd0 fix(dnn): fixes for int4 4 years ago
  Megvii Engine Team 8da2f698a3 feat(dnn/cuda): support warp perspective/pooling op when channel not aligned to 64 4 years ago
  Megvii Engine Team ae6ff2c5a6 feat(mgb/gopt): add opt pass for nchw64 layout transform 4 years ago
  Megvii Engine Team 63a9bd30a8 feat(mgb/gopt): add an opt pass for padding channels to enable fast int8/int4 support on GPU 4 years ago
  Megvii Engine Team 36b1ba052f fix(mgb/dnn): fix cudnn8.0.4 convbias with z 4 years ago
  Megvii Engine Team 2d18074a70 fix(mgb): fix spell error 4 years ago
  Megvii Engine Team a437ec8e88 fix(src/gopt): add replace func of typecvt opr for nhwcd4 pass 4 years ago
  Megvii Engine Team 04b1a45af4 fix(dnn): fix cudnn crash when finalize called after cudnn dtor 4 years ago
  Megvii Engine Team 14a089c49d fix(dnn): change ci to cudnn804, reopen testcase 4 years ago
  Megvii Engine Team ba2ad46e54 feat(gopt): add deconv nchw4 int8 opt pass, add deconv nchw int8 4 years ago
  Megvii Engine Team a3ea1f153c feat(mgb/opr): add fast profile and combined Execution strategy 4 years ago
  Megvii Engine Team c82d88751a fix(dnn/cuda): add cuda nchw int8 conv impl with nchw4 to fix cu111 compatibility 4 years ago
  Megvii Engine Team 51868533c8 fix(mgb/gopt): fix opt pass elementwise operation shape issue at tranform to NCHW4 4 years ago
  Megvii Engine Team cf27dd642c fix(cuda): use cudnn8.0.4 as cu111 default libs 4 years ago
  Megvii Engine Team 649e4dd750 test(cuda): fix test for cu111 4 years ago
  Megvii Engine Team 2e4b9a42f7 fix(mgb/gopt): fix folding conv dimshuffle opt pass 4 years ago
  Megvii Engine Team 364afec033 chore(mge): update copyright years 4 years ago
  Megvii Engine Team 1b1ad56a82 fix(mgb/gopt): fix warp fusion opt pass 4 years ago
  Megvii Engine Team 4e9be159f7 feat(mgb/gopt): add opt pass for fusing convolution and reformat 4 years ago
  Megvii Engine Team 61f917fb8e feat(dnn/cuda): add impl for fusing warp perspective and dimshuffle 4 years ago
  Megvii Engine Team 3bf73ff16f feat(dnn): add cuda preprocess fusion 4 years ago
  Megvii Engine Team 5f171298aa feat(mgb/gopt): add AxisAddRemove opr support for cd4 opt pass 4 years ago
  Megvii Engine Team 6f5d0febf1 perf(dnn/cuda): enhance performance for pooling forward 4 years ago
  Megvii Engine Team 7cd71c3102 fix(mgb/gopt): fix cd4 elewise transform 4 years ago
  Megvii Engine Team cab46f2a00 fix(dnn): fix relayout format when group conv group=1 4 years ago
  Megvii Engine Team 4afa4b72c3 fix(mgb/oppass): fix fuse conv bias pass when bias is full bias and bias 4 years ago