371 Commits (HuaHua404-patch-2)

Author SHA1 Message Date
  Megvii Engine Team 7b17c1180e refactor(dnn): make cudnn_frontend work 3 years ago
  Megvii Engine Team 35e9cc9845 feat(dnn/cuda): add cudnn frontend api 3 years ago
  Megvii Engine Team ab8f6398d9 fix(test): make test install 3 years ago
  Megvii Engine Team 99cfefbfe0 fix(test): fix test copybara 3 years ago
  Megvii Engine Team 0d7ace15c8 fix(mgb/dnn): suport fp16 for resize nhwc 3 years ago
  Megvii Engine Team f12b75c04b perf(dnn/fallback): optimize some corner case in reduce 3 years ago
  Megvii Engine Team b55942a94d feat(dnn/naive/norm,-dnn/cuda/norm,-dnn/test/norm): add norm dnn opr, 3 years ago
  Megvii Engine Team 5e306b756b feat(x86): make conv1x1 and im2col available on with x86-NCHW44 3 years ago
  Megvii Engine Team 5873d5f56f feat(gi): add more gi api 3 years ago
  Megvii Engine Team bbafe69974 feat(dnn): add elemwise COND_LT_MOV 3 years ago
  Megvii Engine Team 0a266d7a1d feat(riscv): speed up bazel build and fix rv64gc without rvv build 3 years ago
  Megvii Engine Team 7d7cc3c8da feat(gi/riscv): add gi support with risc-v 3 years ago
  Megvii Engine Team 4e66e0eb1f feat(megdnn/softmax): add softmax operator in OpenCL 3 years ago
  Megvii Engine Team 96d90be1c6 feat(dnn): fallback support int4 relayout 3 years ago
  Megvii Engine Team 98b5ee78c1 feat(mge/dnn): add lamb optimizer 3 years ago
  Megvii Engine Team 9e0583e13a feat(dnn/arm_common): add arm_common chanwise dot 11x11 3 years ago
  Megvii Engine Team c2500cdb7e chore(license): apply change caused by bot forward rebase 3 years ago
  Megvii Engine Team 5f0e7ffb64 feat(fallback): add FB_GI_F32_4x12 benchmark 3 years ago
  Megvii Engine Team f249d387de feat(fallback): imp gi matmul FB_GI_F32_4x12 algo 3 years ago
  Megvii Engine Team 03f78547f7 feat(dnn/arm_common): add 9x9s1s2 dot chanwise kernel 3 years ago
  Megvii Engine Team c2e9860feb chore(license): remove all license in file header 3 years ago
  Megvii Engine Team e98049d77e feat(fallback): move arm_common resize f32 algo to fallback gi 3 years ago
  Megvii Engine Team 91aaafd587 feat(fallback): move arm_common pooling f32 algo to fallback gi 3 years ago
  Megvii Engine Team af6cdb2004 feat(fallback): fix ci 3 years ago
  Megvii Engine Team e4cc85e52c feat(fallback): move arm_common f32 convbias to fallback gi 3 years ago
  Megvii Engine Team 0f1afb0935 feat(fallback): imp gi matmul AlgoF32GiMK4_4x8 algo, 3 years ago
  Megvii Engine Team 410dcb6c69 feat(fallback): add more gi api for conv, and add gi API test 3 years ago
  Megvii Engine Team 70209667e8 fix(dnn/test): fix some bug when force_deduce_layout is off 3 years ago
  Megvii Engine Team 7dc347697a feat(dnn/cuda): add typecvt uint16 3 years ago
  Megvii Engine Team 115c4592c0 fix(dnn/opencl): fix opencl elemwise tuning issue 3 years ago
  Megvii Engine Team ffbf8fad6c feat(fallback): add general intrinsic to elemwise multitype 3 years ago
  Megvii Engine Team 4c0bff1dba refactor(megdnn): refactor TEGRA_X1/X2 macro 3 years ago
  Megvii Engine Team 758549b936 feat(megengine): support tx2 3 years ago
  Megvii Engine Team b6ad457269 feat(cuda): support int1 simplewq conv 3 years ago
  Megvii Engine Team 331567af5d fix(opencl/ci): misc opt and fix: 3 years ago
  Megvii Engine Team ff6a3bb819 fix(fallback): delete the repeat opcaller in fallback and arm_common 3 years ago
  Megvii Engine Team 547945e854 feat(fallback): support general intrinsic in elemwise in fallback 3 years ago
  Megvii Engine Team fd6f8e58b0 feat(mgb/dtype): add dtype qint1 3 years ago
  Megvii Engine Team 8c415f4ed7 feat(dnn): cuda nhwc nearest resize support not 1 or 3 channel 3 years ago
  Megvii Engine Team 87de704a46 feat(gopt): fuse conv h_swish 3 years ago
  Megvii Engine Team 04193e3bd1 feat(dnn): add nearest mode for remap and resize 3 years ago
  Megvii Engine Team e34a642b31 feat(fallback): reduce support general intrinsic 3 years ago
  Megvii Engine Team d7b0994a3e feat(cuda): add fp16 compute 16 kernel 3 years ago
  Megvii Engine Team 8a2e92bd6c refactor(cuda): depthwish large kernel 3 years ago
  Megvii Engine Team 6b8a69d5b6 feat(cuda): float16 depthwise large kernel conv compute fp32 3 years ago
  Megvii Engine Team bc385b5374 feat(cuda): support float16 depthwise large kernel conv 3 years ago
  Megvii Engine Team 7d2063e35a perf(cuda): speedup conv backward data with small feature map and large filter size 3 years ago
  Megvii Engine Team 72403e8929 perf(cuda): speedup chanwise conv with small feature map and large filter size 3 years ago
  Megvii Engine Team ab6d12caff feat(mge): add conv padding mode 3 years ago
  Megvii Engine Team 47fe766310 feat(dnn/cuda): add implicit bmm kernels for large kernel depthwise convolution backward filter opr 3 years ago