175 Commits (49e14f87b578b535a1002e4a55da4f9d3c812777)

Author SHA1 Message Date
  Megvii Engine Team 49e14f87b5 feat(mgb): add cumprod opr 3 years ago
  Megvii Engine Team c49d3070ba refactor(imperative/ops): extends DnnOprCaller with template 3 years ago
  Megvii Engine Team 247e2f59a4 feat(mgb/dnn): add modes that the output type is bool in elemwise 3 years ago
  Megvii Engine Team 7b17c1180e refactor(dnn): make cudnn_frontend work 3 years ago
  Megvii Engine Team 35e9cc9845 feat(dnn/cuda): add cudnn frontend api 3 years ago
  Megvii Engine Team 0d7ace15c8 fix(mgb/dnn): suport fp16 for resize nhwc 3 years ago
  Megvii Engine Team b55942a94d feat(dnn/naive/norm,-dnn/cuda/norm,-dnn/test/norm): add norm dnn opr, 3 years ago
  Megvii Engine Team bbafe69974 feat(dnn): add elemwise COND_LT_MOV 3 years ago
  Megvii Engine Team 98b5ee78c1 feat(mge/dnn): add lamb optimizer 3 years ago
  Megvii Engine Team c2e9860feb chore(license): remove all license in file header 3 years ago
  Megvii Engine Team 70209667e8 fix(dnn/test): fix some bug when force_deduce_layout is off 3 years ago
  Megvii Engine Team 7dc347697a feat(dnn/cuda): add typecvt uint16 3 years ago
  Megvii Engine Team 4c0bff1dba refactor(megdnn): refactor TEGRA_X1/X2 macro 3 years ago
  Megvii Engine Team 758549b936 feat(megengine): support tx2 3 years ago
  Megvii Engine Team b6ad457269 feat(cuda): support int1 simplewq conv 3 years ago
  Megvii Engine Team fd6f8e58b0 feat(mgb/dtype): add dtype qint1 3 years ago
  Megvii Engine Team 87de704a46 feat(gopt): fuse conv h_swish 3 years ago
  Megvii Engine Team d7b0994a3e feat(cuda): add fp16 compute 16 kernel 3 years ago
  Megvii Engine Team 8a2e92bd6c refactor(cuda): depthwish large kernel 3 years ago
  Megvii Engine Team 6b8a69d5b6 feat(cuda): float16 depthwise large kernel conv compute fp32 3 years ago
  Megvii Engine Team bc385b5374 feat(cuda): support float16 depthwise large kernel conv 3 years ago
  Megvii Engine Team 7d2063e35a perf(cuda): speedup conv backward data with small feature map and large filter size 3 years ago
  Megvii Engine Team 72403e8929 perf(cuda): speedup chanwise conv with small feature map and large filter size 3 years ago
  Megvii Engine Team ab6d12caff feat(mge): add conv padding mode 3 years ago
  Megvii Engine Team 47fe766310 feat(dnn/cuda): add implicit bmm kernels for large kernel depthwise convolution backward filter opr 3 years ago
  Megvii Engine Team 6cefabe734 fix(dnn/cuda): fix ci 3 years ago
  Megvii Engine Team 888f4e46ae feat(dnn/cuda): add implicit bmm large kernel dwconv2d dgrad kernels 3 years ago
  Megvii Engine Team 08d8635ff5 feat(dnn/cuda): add implicit bmm large kernel dwconv2d fprop impl 3 years ago
  Megvii Engine Team 95ac055538 feat(dnn,mgb,imperative): add diag opr implement 3 years ago
  Megvii Engine Team cbbca5fb10 feat(mge): add softmax op use cudnn api 3 years ago
  Megvii Engine Team 82be0aaced test(dnn): fix compute capability requirement for NCHWX test 3 years ago
  Megvii Engine Team 1999307015 feat(mgb/opr): add dropout kernel 3 years ago
  Megvii Engine Team a93741815b feat(mgb/opr): add layernorm forward and backward kernel 3 years ago
  Megvii Engine Team 2696e4efaa feat(dnn): add float16 for remap backward 3 years ago
  Megvii Engine Team 11d75fecb5 feat(dnn/check_non_finite): add batch check_non_finite 3 years ago
  Megvii Engine Team ba2f0c2e48 fix(dnn/cuda): fix cudnn_conv algo of conv_bias opr for fp16 add z cases 3 years ago
  Megvii Engine Team c85631aa77 feat(dnn): use ref ptr interface for all backends 3 years ago
  Megvii Engine Team 89186edc5d fix(dnn): correct reduce/argmxx/fakequant calculation with nan 3 years ago
  Megvii Engine Team 68cdabd288 feat(opr): indexing_multi_axis_vec support nd index 3 years ago
  Megvii Engine Team 9b4cd92ba3 fix(mgb/dnn): fix cudnnConvBiasActivation crash on nchw32 int8 with oc > 256 3 years ago
  Megvii Engine Team 10af44abba fix(dnn/cuda): fix cudnn conv impl for nchw4_nchw hybrid layout 3 years ago
  Megvii Engine Team 5885b137fa feat(dnn/arm): support layout like NHWC channel like broadcast on arm 3 years ago
  Megvii Engine Team 369c2ccc5a style(all): reformat c++ code 3 years ago
  Megvii Engine Team f5cb21ed3a fix(mgb/opr): add non finite check 3 years ago
  Megvii Engine Team bde5cf3564 feat(dnn): add resize linear for arm 3 years ago
  Megvii Engine Team 3d3666b6e0 test(dnn/bn): add compatible configs for NHWC BN 3 years ago
  Megvii Engine Team 3977b7aa0b feat(mgb/shuffle): add shuffle opr 3 years ago
  Megvii Engine Team 17371e79b9 fix(dnn/reduce): fix reduce_mean o16c32 is incorrect for large tensor 3 years ago
  Megvii Engine Team 8b40f57738 feat(mgb/dnn): add conv1x1 algo for matrix mul 3 years ago
  Megvii Engine Team d69b59035d feat(dnn): add an get_all_algorithms_safe interface 3 years ago