93 Commits (release-1.11)

Author SHA1 Message Date
  Megvii Engine Team 5f86368219 Revert "feat(dnn): add elemwise modes" 2 years ago
  Megvii Engine Team d2a1905ad5 Revert "feat(mgb): add cumprod opr" 2 years ago
  Megvii Engine Team 49e14f87b5 feat(mgb): add cumprod opr 3 years ago
  Megvii Engine Team 87aedc2991 feat(dnn): add elemwise modes 3 years ago
  Megvii Engine Team e0d505e6bd fix(mgb/dnn): fix bug that some cutlass file compile very slowly on SM86 2 years ago
  Megvii Engine Team 247e2f59a4 feat(mgb/dnn): add modes that the output type is bool in elemwise 3 years ago
  Megvii Engine Team b55942a94d feat(dnn/naive/norm,-dnn/cuda/norm,-dnn/test/norm): add norm dnn opr, 3 years ago
  Megvii Engine Team bbafe69974 feat(dnn): add elemwise COND_LT_MOV 3 years ago
  Megvii Engine Team 81065cf00e build(mgb/cutlass): merge partial headers 3 years ago
  Megvii Engine Team 98b5ee78c1 feat(mge/dnn): add lamb optimizer 3 years ago
  Megvii Engine Team 03f78547f7 feat(dnn/arm_common): add 9x9s1s2 dot chanwise kernel 3 years ago
  Megvii Engine Team 3726f5cc92 feat(gopt): merger consecutive relayout and dimshuffle to one relayout to optimize CD4 performarce 3 years ago
  Megvii Engine Team 47fe766310 feat(dnn/cuda): add implicit bmm kernels for large kernel depthwise convolution backward filter opr 3 years ago
  Megvii Engine Team 888f4e46ae feat(dnn/cuda): add implicit bmm large kernel dwconv2d dgrad kernels 3 years ago
  Megvii Engine Team 08d8635ff5 feat(dnn/cuda): add implicit bmm large kernel dwconv2d fprop impl 3 years ago
  Megvii Engine Team 95ac055538 feat(dnn,mgb,imperative): add diag opr implement 3 years ago
  Megvii Engine Team cbbca5fb10 feat(mge): add softmax op use cudnn api 3 years ago
  Megvii Engine Team 20b42a8c3b fix(dnn): add naive lstm kernel 3 years ago
  Megvii Engine Team 2faa6ea5a9 Merge pull request #213 from kxz18:rnn 3 years ago
  Megvii Engine Team 1999307015 feat(mgb/opr): add dropout kernel 3 years ago
  Megvii Engine Team a93741815b feat(mgb/opr): add layernorm forward and backward kernel 3 years ago
  Megvii Engine Team f6d9909460 feat(dnn): add elemwise multi type support i16xf32 and u8xf32 3 years ago
  kxz@thumt102-1 8f48da7ffe feat(mgb/opr): add cell level rnn/lstm and sequence level rnn/lstm 3 years ago
  Megvii Engine Team 2881934cb8 feat(dnn/check_non_finite): addmul scale to check_non_finite opr 3 years ago
  Megvii Engine Team accb2d8d47 fix(mgb/serialize): fix flatbuffer compatibility issues 3 years ago
  Megvii Engine Team b3e54eade1 feat(dnn/bn): use new cudnn BN kernel to support NHWC 3 years ago
  Megvii Engine Team 3977b7aa0b feat(mgb/shuffle): add shuffle opr 3 years ago
  Megvii Engine Team 4f28e14684 fix(dnn): fix compatibility broken of convolution format 3 years ago
  Megvii Engine Team fb49a2834f refactor(mgb/dnn): refactor enum used in serializing 3 years ago
  Megvii Engine Team 16678bb998 fix(dnn): fix_short_cutlass_name_gemm 3 years ago
  Megvii Engine Team 4c13bc7e1b feat(dnn/cuda): add nhwc int8 deconv 3 years ago
  Megvii Engine Team 11f022ff7c feat(dnn/cuda): add nhwc int8 imma conv and conv fuse typecvt 3 years ago
  Megvii Engine Team ff0e6be7b9 fix(dnn/cuda): fix cutlass tensorop kernels 3 years ago
  Megvii Engine Team 336761253d feat(dnn/cuda): add tensorcore matmul for fp16 data type 3 years ago
  Megvii Engine Team 2c4ee99227 fix(dnn): short cutlass filename in windows 3 years ago
  Megvii Engine Team 432592374d build(dnn/cuda): fix cmake compile dependency for cutlass kernels 3 years ago
  Megvii Engine Team eab6afab47 feat(mgb): add padding opr for megbrain 4 years ago
  Megvii Engine Team 9b4b910dc1 feat(dnn/cuda): integrate cutlass operation table and replace all cutlass wrappers 3 years ago
  Megvii Engine Team b18feaab33 feat(dnn/cuda): use cutlass remove shared load imma conv kernel 4 years ago
  Megvii Engine Team 1af350c6d2 feat(dnn): add fill kernel 3 years ago
  Megvii Engine Team 287cab49c2 fix(mgb/sereg): fix rng operator compatibility 3 years ago
  Megvii Engine Team f76a2cc2c6 feat(mge/opr): add silu and gelu 3 years ago
  Megvii Engine Team f8b0f2cb91 build(dnn/cutlass): fix build for cutlass 3 years ago
  Megvii Engine Team 4eda338876 feat(dnn/cuda): generate cutlass kimpls using cmake and bazel 4 years ago
  Megvii Engine Team 894a2407c2 feat(dnn/cuda): add relayout format kernel for nchw <-> nhwc 4 years ago
  Megvii Engine Team 5a14a89224 refactor(dnn/cuda): refactor cutlass kernel generator for gemm and gemv 4 years ago
  Megvii Engine Team 4abf7bd36f refactor(dnn/cuda): refactor kernel generator for cutlass convolution kernels 4 years ago
  Megvii Engine Team 66f70578c2 feat(dnn/cuda): add convolution with i8 input and i4 output 4 years ago
  Megvii Engine Team 43098fb8f1 feat(mge): add SlidingWindowTranspose opr 4 years ago
  Megvii Engine Team b078dda90b feat(mge/random): add some random op and remove random/distrbution.py 4 years ago