761 Commits (v1.11.1)

Author SHA1 Message Date
  Megvii Engine Team 38bd599911 fix(mgb): make error infomation of invalid MatMul more readable 2 years ago
  Megvii Engine Team e0d505e6bd fix(mgb/dnn): fix bug that some cutlass file compile very slowly on SM86 2 years ago
  Megvii Engine Team 58ba080d5f feat(x86/rvv): make gi conv algo adapt to vv and vf model 2 years ago
  Megvii Engine Team bd50e457ee feat(x86/rvv): make MATRIX_MUL_GI_F32_4x12 and FP32_GEMV_MK4_GI 2 years ago
  Megvii Engine Team 5c3b4e9584 feat(x86/rvv): opt AlgoFP32WinogradF63_4x4_NCHW44 2 years ago
  Megvii Engine Team fa59a7b061 feat(x86/rvv): opt AlgoF32DirectNCHWNCHW44 2 years ago
  Megvii Engine Team 0d82e9b72b feat(x86/rvv): opt FB_GI_F32_MK4_4x8 2 years ago
  Megvii Engine Team a54d9cb9cd feat(x86/rvv): opt FB_GI_F32_MK4_PACK_4x12 algo 2 years ago
  Megvii Engine Team 247e2f59a4 feat(mgb/dnn): add modes that the output type is bool in elemwise 3 years ago
  Megvii Engine Team 16ba05a81b fix(dnn): fix dnn nchwxx elemwise performance 3 years ago
  Megvii Engine Team 7b17c1180e refactor(dnn): make cudnn_frontend work 3 years ago
  Megvii Engine Team 35e9cc9845 feat(dnn/cuda): add cudnn frontend api 3 years ago
  Megvii Engine Team ab8f6398d9 fix(test): make test install 3 years ago
  Megvii Engine Team 99cfefbfe0 fix(test): fix test copybara 3 years ago
  Megvii Engine Team 0d7ace15c8 fix(mgb/dnn): suport fp16 for resize nhwc 3 years ago
  Megvii Engine Team f12b75c04b perf(dnn/fallback): optimize some corner case in reduce 3 years ago
  Megvii Engine Team 7f02407281 perf(dnn): speed up pad kernel 3 years ago
  Megvii Engine Team 2886245bb1 perf(imperative/src): improve pad host performance 3 years ago
  Megvii Engine Team b55942a94d feat(dnn/naive/norm,-dnn/cuda/norm,-dnn/test/norm): add norm dnn opr, 3 years ago
  Megvii Engine Team 4cdb74541d feat(rvv/fallback): make nchw44 happly on rvv 3 years ago
  Megvii Engine Team 5e306b756b feat(x86): make conv1x1 and im2col available on with x86-NCHW44 3 years ago
  Megvii Engine Team 481a6cbb8a feat(x86): make nchw44 happly on x86 3 years ago
  Megvii Engine Team 5873d5f56f feat(gi): add more gi api 3 years ago
  Megvii Engine Team bbafe69974 feat(dnn): add elemwise COND_LT_MOV 3 years ago
  Megvii Engine Team a0a5fcf182 feat(dnn): support tf32 3 years ago
  Megvii Engine Team f7b0395976 perf(mgb/compile): improve compile time according the file map of compile time 3 years ago
  Megvii Engine Team 124f38c44d perf(mgb/compile): improve compile time for megbrain 3 years ago
  Megvii Engine Team 0a266d7a1d feat(riscv): speed up bazel build and fix rv64gc without rvv build 3 years ago
  Megvii Engine Team 36ba1d6d39 fix(riscv): fix ci fp16 build and move test GI_TEST_NAIVE by megdnn_gi_api_test 3 years ago
  Megvii Engine Team 698dcef491 feat(gi/x86): fix _mm_slli_si128 build at clang 3 years ago
  Megvii Engine Team 2d806f9c3c feat(gi): make conv_bias apply gi class type 3 years ago
  Megvii Engine Team 19d36fa03c feat(gi): make pooling apply gi class type 3 years ago
  Megvii Engine Team 8546c15d45 feat(gi): make elemwise apply gi class type 3 years ago
  Megvii Engine Team 74fb63db29 feat(gi): make matrix_mul apply gi class type 3 years ago
  Megvii Engine Team 45b26400e7 feat(gi): make resize apply gi class type 3 years ago
  Megvii Engine Team 7d7cc3c8da feat(gi/riscv): add gi support with risc-v 3 years ago
  Megvii Engine Team a32b727720 fix(build): upgrade bazel riscv toolchains 3 years ago
  Megvii Engine Team 24c5c19bf0 fix(imperative): make functional ops support negative axis 3 years ago
  Megvii Engine Team f96429c031 feat(imperative): support empty tensor in roi_align 3 years ago
  Megvii Engine Team 8f17b84ad8 fix(dnn): fix dnn run cd4 on cpu 3 years ago
  Megvii Engine Team 81065cf00e build(mgb/cutlass): merge partial headers 3 years ago
  Megvii Engine Team c2deef1a97 feat(mge): aad atlas710 support 3 years ago
  Megvii Engine Team 4e66e0eb1f feat(megdnn/softmax): add softmax operator in OpenCL 3 years ago
  Megvii Engine Team 6c9b3a58e3 refactor(dnn): remove algorithm cache queries 3 years ago
  Megvii Engine Team 96d90be1c6 feat(dnn): fallback support int4 relayout 3 years ago
  Megvii Engine Team 711b5bf502 fix(dnn/arm_common): fix some load beyond memory 3 years ago
  Megvii Engine Team 3ebb8db01a feat(third_party/cutlass): update to version 2.8 3 years ago
  Megvii Engine Team da91e650a5 refactor(ops/layer_norm): speed up the host speed of layer_norm 3 years ago
  Megvii Engine Team cd26376549 style(imperative/amp): reformat code 3 years ago
  Megvii Engine Team 6f0b582064 chore(imperative/amp): adapt dev 3 years ago